protegoA pure-Python robots.txt parser with support for modern conventions.
Stars: ✭ 36 (+176.92%)
robots-parserNodeJS robots.txt parser with support for wildcard (*) matching.
Stars: ✭ 117 (+800%)
orkid-nodeReliable and modern Redis Streams based task queue for Node.js 🤖
Stars: ✭ 61 (+369.23%)
parceraGrammar-based Clojure(script) parser
Stars: ✭ 100 (+669.23%)
libraJava Predicate, supports SQL-like syntax
Stars: ✭ 30 (+130.77%)
antlr4-toolA useful Antlr4 tool with full TypeScript support
Stars: ✭ 34 (+161.54%)
Free proxy pool对免费代理IP网站进行爬取,收集汇总为自己的代理池。关键是验证代理的有效性、匿名性、去重复
Stars: ✭ 66 (+407.69%)
yahdlA programming language for FPGAs.
Stars: ✭ 20 (+53.85%)
speedy-antlr-toolGenerate an accelerator extension that makes your Antlr parser in Python super-fast!
Stars: ✭ 22 (+69.23%)
java-astJava Parser for JavaScript/TypeScript (based on antlr4ts)
Stars: ✭ 58 (+346.15%)
BaiduSpider项目已经移动至:https://github.com/BaiduSpider/BaiduSpider !! 一个爬取百度搜索结果的爬虫,目前支持百度网页搜索,百度图片搜索,百度知道搜索,百度视频搜索,百度资讯搜索,百度文库搜索,百度经验搜索和百度百科搜索。
Stars: ✭ 29 (+123.08%)
AnimalRecognitionDemoAn example of using Redis Streams, RedisGears and RedisAI for Realtime Video Analytics (i.e. filtering cats)
Stars: ✭ 35 (+169.23%)
GocrawlPolite, slim and concurrent web crawler.
Stars: ✭ 1,962 (+14992.31%)
grobotstxtgrobotstxt is a native Go port of Google's robots.txt parser and matcher library.
Stars: ✭ 83 (+538.46%)
robots.jsParser for robots.txt for node.js
Stars: ✭ 64 (+392.31%)
nuxt-humans-txt🧑🏻👩🏻 "We are people, not machines" - An initiative to know the creators of a website. Contains the information about humans to the web building - A Nuxt Module to statically integrate and generate a humans.txt author file - Based on the HumansTxt Project.
Stars: ✭ 27 (+107.69%)
.NetCorePluginManager.Net Core Plugin Manager, extend web applications using plugin technology enabling true SOLID and DRY principles when developing applications
Stars: ✭ 17 (+30.77%)
jsitemapgeneratorJava sitemap generator. This library generates a web sitemap, can ping Google, generate RSS feed, robots.txt and more with friendly, easy to use Java 8 functional style of programming
Stars: ✭ 38 (+192.31%)
robotify-netcoreProvides robots.txt middleware for .NET core
Stars: ✭ 15 (+15.38%)
Antlr4ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
Stars: ✭ 11,227 (+86261.54%)
snowstarHere lies the code for the Snow* programming language, currently being rewritten.
Stars: ✭ 31 (+138.46%)
kolasuKotlin Language Support – AST Library
Stars: ✭ 45 (+246.15%)
xqdocAn Antlr4 implementation of xqDoc for XQuery
Stars: ✭ 14 (+7.69%)
ProfaneScripting language for derps
Stars: ✭ 18 (+38.46%)
ANTLR4ParseTreeVisualizerVisual Studio debugging visualizer, and .NET visualization controls, for ANTLR4 parse trees
Stars: ✭ 59 (+353.85%)
MPLA language to generate command blocks for Minecraft 1.9 and higher
Stars: ✭ 18 (+38.46%)
bsl-parserКоллекция парсеров языка 1С (BSL) в формате ANTLR4.
Stars: ✭ 23 (+76.92%)
Awesome Python Login Model模拟登陆基本采用的是直接登录或者使用selenium+webdriver的方式,有的网站直接登录难度很大,比如qq空间,bilibili等如果采用selenium就相对轻松一些。
Stars: ✭ 13,953 (+107230.77%)
AbotCross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.
Stars: ✭ 1,961 (+14984.62%)
goSpidersome small project and some articles
Stars: ✭ 56 (+330.77%)