GocrawlPolite, slim and concurrent web crawler.
grobotstxtgrobotstxt is a native Go port of Google's robots.txt parser and matcher library.
nuxt-humans-txt🧑🏻👩🏻 "We are people, not machines" - An initiative to know the creators of a website. Contains the information about humans to the web building - A Nuxt Module to statically integrate and generate a humans.txt author file - Based on the HumansTxt Project.
.NetCorePluginManager.Net Core Plugin Manager, extend web applications using plugin technology enabling true SOLID and DRY principles when developing applications
jsitemapgeneratorJava sitemap generator. This library generates a web sitemap, can ping Google, generate RSS feed, robots.txt and more with friendly, easy to use Java 8 functional style of programming
robots-parserNodeJS robots.txt parser with support for wildcard (*) matching.
robots.txt🤖 robots.txt as a service. Crawls robots.txt files, downloads and parses them to check rules through an API
protegoA pure-Python robots.txt parser with support for modern conventions.