Awesome Puppeteer
A curated list of awesome Puppeteer resources.
Stars: ✭ 1,728 (+910.53%)
Scrapple
A framework for creating semi-automatic web content extractors
Stars: ✭ 464 (+171.35%)
Proxy pool
Proxy IP pool for Python web crawlers (proxy pool)
Stars: ✭ 13,964 (+8066.08%)
Operative Framework
Operative Framework is an OSINT investigation framework. You can interact with multiple targets, execute multiple modules, create links between targets, export reports to PDF, add notes to targets or results, interact with a RESTful API, and write your own modules.
Stars: ✭ 511 (+198.83%)
Datmusic Api
Alternative for VK Audio API
Stars: ✭ 160 (-6.43%)
Go jobs
A look at the Golang job market
Stars: ✭ 526 (+207.6%)
Haipproxy
💖 Highly available distributed IP proxy pool, powered by Scrapy and Redis
Stars: ✭ 4,993 (+2819.88%)
Jikan
Unofficial MyAnimeList PHP+REST API that provides functions beyond those of the official API
Stars: ✭ 531 (+210.53%)
Body Parser
Node.js body parsing middleware
Stars: ✭ 4,962 (+2801.75%)
Xsrfprobe
The Prime Cross Site Request Forgery (CSRF) Audit and Exploitation Toolkit.
Stars: ✭ 532 (+211.11%)
Douyin
DouYin API for humans, used to crawl popular videos and music
Stars: ✭ 580 (+239.18%)
Netdiscovery
NetDiscovery is a general-purpose crawler framework/middleware built on Vert.x, RxJava 2, and other frameworks.
Stars: ✭ 573 (+235.09%)
Filemasta
A search application to explore, discover and share online files
Stars: ✭ 571 (+233.92%)
Icrawler
A multi-threaded crawler framework with many built-in image crawlers.
Stars: ✭ 629 (+267.84%)
Creeper
🐾 Creeper - The Next Generation Crawler Framework (Go)
Stars: ✭ 762 (+345.61%)
Xxl Crawler
A distributed web crawler framework (XXL-CRAWLER).
Stars: ✭ 561 (+228.07%)
Imagescraper
✂️ High performance, multi-threaded image scraper
Stars: ✭ 630 (+268.42%)
Grab Site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Stars: ✭ 680 (+297.66%)
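Yispider
A distributed crawler platform that helps you manage and develop crawlers more effectively. It ships with a built-in set of crawler definition rules (templates); you can use the templates to define crawlers quickly, or use it as a framework and develop crawlers by hand. (A hobby project, updated whenever something about it gets annoying to use.)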
Stars: ✭ 158 (-7.6%)
Zhihu Crawler
zhihu-crawler is a high-performance, distributed, Java-based crawler project that supports a free HTTP proxy pool and horizontal scaling
Stars: ✭ 890 (+420.47%)
Torbot
Dark Web OSINT Tool
Stars: ✭ 821 (+380.12%)
Pypergrabber
Fetches PubMed article IDs (PMIDs) from email inbox, then crawls PubMed, Google Scholar and Sci-Hub for respective PDF files.
Stars: ✭ 14 (-91.81%)
Pypatent
Search for and retrieve US Patent and Trademark Office Patent Data
Stars: ✭ 31 (-81.87%)
Maman
Rust Web Crawler saving pages on Redis
Stars: ✭ 39 (-77.19%)
News Please
news-please - an integrated web crawler and information extractor for news that just works.
Stars: ✭ 969 (+466.67%)
Lizard
💐 Full Amazon Automatic Download
Stars: ✭ 41 (-76.02%)
Gospider
Gospider - Fast web spider written in Go
Stars: ✭ 785 (+359.06%)
Nodespider
[DEPRECATED] Simple, flexible, delightful web crawler/spider package
Stars: ✭ 33 (-80.7%)
Crawlab
Distributed web crawler admin platform for managing spiders regardless of language or framework.
Stars: ✭ 8,392 (+4807.6%)
Photon
Incredibly fast crawler designed for OSINT.
Stars: ✭ 8,332 (+4772.51%)
Social Scraper
A collection of scripts for crawling data from Vietnamese-language social networks and websites
Stars: ✭ 47 (-72.51%)
Beanbun
Beanbun is a multi-process web crawler framework written in PHP, with good openness and high extensibility, based on Workerman.
Stars: ✭ 1,096 (+540.94%)
Car Prices
Golang crawler that scrapes the used-car product catalog of 汽车之家 (Autohome)
Stars: ✭ 57 (-66.67%)
Gain
Web crawling framework based on asyncio.
Stars: ✭ 2,002 (+1070.76%)
Spider
Python crawler spider
Stars: ✭ 70 (-59.06%)
Jd Autobuy
Python crawler for automatic 京东 (JD.com) login and online flash purchasing of products
Stars: ✭ 1,174 (+586.55%)
Learnpython
Basic Python practice code and assorted crawler code
Stars: ✭ 451 (+163.74%)
Abot
Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.
Stars: ✭ 1,961 (+1046.78%)
Goscraper
Golang pkg to quickly return a preview of a webpage (title/description/images)
Stars: ✭ 72 (-57.89%)
Instagram Scraper
Scrapes media, likes, followers, tags and all metadata. Inspired by instagram-php-scraper, bot
Stars: ✭ 2,209 (+1191.81%)
Secret Agent
The web browser that's built for scraping.
Stars: ✭ 151 (-11.7%)
Grawler
Grawler is a tool written in PHP, with a web interface, that automates the task of using Google dorks, scrapes the results, and stores them in a file.
Stars: ✭ 98 (-42.69%)
Email Extractor
The main functionality is to extract all the emails from one or several URLs.
Stars: ✭ 81 (-52.63%)
Scrapoxy
Scrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!
Stars: ✭ 1,322 (+673.1%)
Fun crawler
Crawl some pictures for fun
Stars: ✭ 169 (-1.17%)
Douyinsdk
抖音 (Douyin) SDK: data collection and crawler scraping are no longer a dream
Stars: ✭ 99 (-42.11%)
Ruia
Async Python 3.6+ web scraping micro-framework based on asyncio
Stars: ✭ 1,366 (+698.83%)
Wombat
Lightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.
Stars: ✭ 1,220 (+613.45%)
Gopa Abandoned
GOPA, a spider written in Go. (NOTE: this project moved to https://github.com/infinitbyte/gopa)
Stars: ✭ 98 (-42.69%)
D4n155
OWASP D4N155 - Intelligent and dynamic wordlist using OSINT
Stars: ✭ 105 (-38.6%)
Webmagic
A scalable web crawler framework for Java.
Stars: ✭ 10,186 (+5856.73%)