Gopa: [WIP] GOPA, a spider written in Golang, for Elasticsearch. Demo: http://index.elasticsearch.cn
Stars: ✭ 277 (+7.78%)
Antch: Antch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (-22.96%)
flink-crawler: Continuous scalable web crawler built on top of Flink and crawler-commons
Stars: ✭ 48 (-81.32%)
bots-zoo: No description or website provided.
Stars: ✭ 59 (-77.04%)
Sasila: A flexible, friendly crawler framework
Stars: ✭ 286 (+11.28%)
img-cli: An interactive command-line interface built in Node.js for downloading one or more images to disk from a URL
Stars: ✭ 15 (-94.16%)
Crawly: Crawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+71.21%)
Awesome Crawler: A collection of awesome web crawlers and spiders in different languages
Stars: ✭ 4,793 (+1764.98%)
CrawlBox: An easy way to brute-force web directories.
Stars: ✭ 118 (-54.09%)
Crawler: Go process used to crawl websites
Stars: ✭ 147 (-42.8%)
Lulu: [Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+207%)
Crawlab Lite: Lite version of Crawlab, the crawler management platform
Stars: ✭ 122 (-52.53%)
Scrapyrt: HTTP API for Scrapy spiders
Stars: ✭ 637 (+147.86%)
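Skycaiji: Skycaiji (蓝天采集器) is a free crawler program for collecting and publishing data, built with PHP + MySQL. It can be deployed on cloud servers, collects almost any type of web page, integrates seamlessly with various CMS site builders, publishes data in real time without requiring login, and runs fully automatically with no manual intervention. Among web big-data collection tools, it is a fully cross-platform cloud crawler system.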
Stars: ✭ 1,514 (+489.11%)
Nutch: Apache Nutch is an extensible and scalable web crawler
Stars: ✭ 2,277 (+785.99%)
Spider Flow: A new-generation crawler platform that defines crawler workflows graphically, so a crawler can be built without writing code.
Stars: ✭ 365 (+42.02%)
Pspider: A simple, easy-to-use Python crawler framework. QQ discussion group: 597510560
Stars: ✭ 1,611 (+526.85%)
Squidwarc: Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
Stars: ✭ 125 (-51.36%)
Maman: Rust Web Crawler saving pages on Redis
Stars: ✭ 39 (-84.82%)
Crawlab: Distributed web crawler admin platform for managing spiders, regardless of language or framework.
Stars: ✭ 8,392 (+3165.37%)
Scrapy: Scrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+16375.88%)
Arachnid: Powerful web scraping framework for Crystal
Stars: ✭ 68 (-73.54%)
Linkedin Profile Scraper: 🕵️♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-33.46%)
N2h4: A tool for collecting Naver news
Stars: ✭ 177 (-31.13%)
Ferret: Declarative web scraping
Stars: ✭ 4,837 (+1782.1%)
Supercrawler: A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.
Stars: ✭ 306 (+19.07%)
Spidr: A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+155.25%)
Webster: A reliable high-level web crawling & scraping framework for Node.js.
Stars: ✭ 364 (+41.63%)
Infinitycrawler: A simple but powerful web crawler library for .NET
Stars: ✭ 97 (-62.26%)
Dotnetcrawler: DotnetCrawler is a straightforward, lightweight web crawling/scraping library for Entity Framework Core output, based on .NET Core. It is designed like other strong crawler libraries such as WebMagic and Scrapy, but can be extended to fit your custom requirements. Medium link: https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (-61.09%)
Abot: Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.
Stars: ✭ 1,961 (+663.04%)
Instagram Bot: An Instagram bot developed using the Selenium Framework
Stars: ✭ 138 (-46.3%)
Strong Web Crawler: An advanced web crawler based on C#.NET, PhantomJS, and Selenium. It can execute JavaScript code, trigger events, and manipulate the page's DOM structure.
Stars: ✭ 238 (-7.39%)
Newspaper: News, full-text, and article metadata extraction in Python 3. Advanced docs:
Stars: ✭ 11,545 (+4392.22%)
Colly: Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+5944.75%)
Mimo-Crawler: A web crawler that uses Firefox and JS injection to interact with webpages and crawl their content, written in Node.js.
Stars: ✭ 22 (-91.44%)
spiderable-middleware: 🤖 Prerendering for JavaScript-powered websites. Great solution for PWAs (Progressive Web Apps), SPAs (Single Page Applications), and other websites built on top of front-end JavaScript frameworks
Stars: ✭ 29 (-88.72%)
html-query: A fluent and functional approach to querying HTML
Stars: ✭ 48 (-81.32%)
domfind: A Python DNS crawler to find identical domain names under different TLDs.
Stars: ✭ 22 (-91.44%)
medium-stat-box: Practical pinned gist that shows your latest Medium status 📌
Stars: ✭ 29 (-88.72%)
php-google: Google search results crawler; get the Google search results that you need (PHP)
Stars: ✭ 23 (-91.05%)
snapcrawl: Crawl a website and take screenshots
Stars: ✭ 37 (-85.6%)
Sharingan: We will try to find as much of your visible basic footprint on social media as possible - 😤 more sites are coming soon
Stars: ✭ 13 (-94.94%)
ComicBookMaker: Script to fetch webcomics and use them to create ebooks.
Stars: ✭ 27 (-89.49%)
arachnod: High performance crawler for Node.js
Stars: ✭ 17 (-93.39%)
TumblTwo: TumblTwo, an Improved Fork of TumblOne, a Tumblr Downloader.
Stars: ✭ 57 (-77.82%)
eastmoney: Python requests + Django + Node.js Koa + MySQL to crawl eastmoney fund and stock data, for data analysis and visualization.
Stars: ✭ 56 (-78.21%)
dijnet-bot: All your bills in one more place :)
Stars: ✭ 17 (-93.39%)
crawler: A simple and flexible web crawler framework for Java.
Stars: ✭ 20 (-92.22%)