arachnod: High performance crawler for Nodejs
Stars: ✭ 17 (-93.39%)
Pylinkvalidator: pylinkvalidator is a standalone, pure-Python link validator and crawler that traverses a web site and reports errors (e.g., 500 and 404 errors) encountered.
Stars: ✭ 109 (-57.59%)
Vulnx: vulnx 🕷️ is an intelligent bot auto shell injector that detects vulnerabilities in multiple types of CMS (`wordpress, joomla, drupal, prestashop ..`)
Stars: ✭ 1,009 (+292.61%)
Spoon: 🥄 A package for building specific proxy pools for different sites.
Stars: ✭ 173 (-32.68%)
Linkcrawler: Cross-platform persistent and distributed web crawler 🔗
Stars: ✭ 109 (-57.59%)
ComicBookMaker: Script to fetch webcomics and use them to create ebooks.
Stars: ✭ 27 (-89.49%)
Lumberjack: An automated website accessibility scanner and CLI
Stars: ✭ 109 (-57.59%)
Gorecon: Gorecon is an all-in-one reconnaissance tool, a.k.a. a Swiss Army knife for reconnaissance; a tool that every pentester or bug hunter might want to consider adding to their arsenal
Stars: ✭ 208 (-19.07%)
Fawkes: Fawkes is a tool to search for targets vulnerable to SQL injection. It performs the search using the Google search engine.
Stars: ✭ 108 (-57.98%)
Gargantua: The fast website crawler
Stars: ✭ 35 (-86.38%)
Proxy pool: Python crawler proxy IP pool (proxy pool)
Stars: ✭ 13,964 (+5333.46%)
Diskover: File system crawler, disk space usage, file search engine and file system analytics powered by Elasticsearch
Stars: ✭ 977 (+280.16%)
scrapy-fieldstats: A Scrapy extension to log item coverage when the spider shuts down
Stars: ✭ 17 (-93.39%)
News Please: news-please - an integrated web crawler and information extractor for news that just works.
Stars: ✭ 969 (+277.04%)
Fun crawler: Crawl some pictures for fun
Stars: ✭ 169 (-34.24%)
Vw Crawler: 🐞 A simple, lightweight Java crawler framework. With a little knowledge of regular expressions and CSS selectors, you can easily collect data.
Stars: ✭ 32 (-87.55%)
Douyin crawler: Douyin crawler, TikTok crawler, Douyin data collection API, Douyin video watermark removal, 100% success rate, no server required, no proxy IP required.
Stars: ✭ 169 (-34.24%)
Papercrawler: Crawler used to crawl papers
Stars: ✭ 20 (-92.22%)
Onion Crawler: Tor website crawler (specific for Alphabay at the time)
Stars: ✭ 15 (-94.16%)
TumblTwo: TumblTwo, an improved fork of TumblOne, a Tumblr downloader.
Stars: ✭ 57 (-77.82%)
Axegrinder: Crawl websites for accessibility issues from the command line.
Stars: ✭ 12 (-95.33%)
Datmusic Api: Alternative to the VK Audio API
Stars: ✭ 160 (-37.74%)
Ccrawl: Simple corpora list crawler
Stars: ✭ 11 (-95.72%)
auctus: Dataset search engine, discovering data from a variety of sources, profiling it, and allowing advanced queries on the index
Stars: ✭ 34 (-86.77%)
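Yispider: A distributed crawler platform that helps you better manage and develop crawlers. It has a built-in set of crawler definition rules (templates); you can use templates to define crawlers quickly, or use it as a framework to develop crawlers manually. (A hobby project, updated whenever using it gets annoying.)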
Stars: ✭ 158 (-38.52%)
crawler: A simple and flexible web crawler framework for Java.
Stars: ✭ 20 (-92.22%)
Sqliv: Massive SQL injection vulnerability scanner
Stars: ✭ 840 (+226.85%)
Scrapit: Scraping scripts for various websites.
Stars: ✭ 25 (-90.27%)
doc crawler.py: Explore a website recursively and download all the wanted documents (PDF, ODT…)
Stars: ✭ 22 (-91.44%)
Tumblthree: A Tumblr Blog Backup Application
Stars: ✭ 923 (+259.14%)
Crawler: An easy-to-use, powerful crawler implemented in PHP. Can execute JavaScript.
Stars: ✭ 2,055 (+699.61%)
eastmoney: python requests + Django + nodejs koa + mysql to crawl eastmoney fund and stock data, for data analysis and visualization.
Stars: ✭ 56 (-78.21%)
Tumblthree: A Tumblr Backup Application
Stars: ✭ 211 (-17.9%)
Webmagic: A scalable web crawler framework for Java.
Stars: ✭ 10,186 (+3863.42%)
Zhihu Crawler: zhihu-crawler is a Java-based, high-performance, distributed crawler project that supports a free HTTP proxy pool and horizontal scaling
Stars: ✭ 890 (+246.3%)
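Python3 Spider: Python crawlers in practice: simulated login to major websites, including but not limited to slider verification, Pinduoduo, Meituan, Baidu, bilibili, Dianping, and Taobao. If you like it, please star ❤️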
Stars: ✭ 2,129 (+728.4%)
Python: Python scripts. Simulated Zhihu login, crawlers, Excel operations, WeChat official accounts, remote power-on
Stars: ✭ 7,355 (+2761.87%)
socials: 👨👩👦 Social account detection and extraction in Python, e.g. for crawling/scraping.
Stars: ✭ 37 (-85.6%)
Jlitespider: A lite distributed Java spider framework :-)
Stars: ✭ 151 (-41.25%)
crawlkit: A crawler based on Phantom. Allows discovery of dynamic content and supports custom scrapers.
Stars: ✭ 23 (-91.05%)
Goose Parser: Universal scraping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (-17.9%)
Crawler Detect: 🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent
Stars: ✭ 1,549 (+502.72%)
Crawler: Crawler, HTTP proxy, simulated login!
Stars: ✭ 106 (-58.75%)
Algoliasearch Netlify: Official Algolia Plugin for Netlify. Index your website to Algolia when deploying your project to Netlify with the Algolia Crawler
Stars: ✭ 208 (-19.07%)
D4n155: OWASP D4N155 - Intelligent and dynamic wordlist using OSINT
Stars: ✭ 105 (-59.14%)
scrapy-distributed: A series of distributed components for Scrapy, including RabbitMQ-based, Kafka-based, and RedisBloom-based components.
Stars: ✭ 38 (-85.21%)