A series of distributed components for Scrapy, including RabbitMQ-based, Kafka-based, and RedisBloom-based components.

Stars: ✭ 38 (+31.03%)

Mutual labels: spider, crawling

Linkedin Profile Scraper

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.

Stars: ✭ 171 (+489.66%)

Mutual labels: spider, crawling

wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

Stars: ✭ 52 (+79.31%)

Mutual labels: spider, crawling

Skycaiji

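Skycaiji (蓝天采集器) is a free crawler application for collecting and publishing data, built with PHP and MySQL. It can be deployed on a cloud server, can collect almost any type of web page, integrates seamlessly with various CMS site builders, publishes data in real time without requiring login, and runs fully automatically with no manual intervention. Among web big-data collection tools, it is a fully cross-platform, cloud-based crawler system.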

Stars: ✭ 1,514 (+5120.69%)

Mutual labels: spider, crawling

goSpider

Some small projects and some articles

Stars: ✭ 56 (+93.1%)

Mutual labels: spider, spiders

Crawly

Crawly, a high-level web crawling & scraping framework for Elixir.

Stars: ✭ 440 (+1417.24%)

Mutual labels: spider, crawling

Arachnid

Powerful web scraping framework for Crystal

Stars: ✭ 68 (+134.48%)

Mutual labels: spider, crawling

Abot

Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.

Stars: ✭ 1,961 (+6662.07%)

Mutual labels: spider, spiders

Colly

Elegant Scraper and Crawler Framework for Golang

Stars: ✭ 15,535 (+53468.97%)

Mutual labels: spider, crawling

Laravel Crawler Detect

A Laravel wrapper for CrawlerDetect - the web crawler detection library

Stars: ✭ 227 (+682.76%)

Mutual labels: spider

BaiduSpider

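BaiduSpider is a Python crawler that scrapes Baidu search results. It currently supports Baidu web search, Baidu image search, Baidu Zhidao search, Baidu video search, Baidu news search, Baidu Wenku search, Baidu Jingyan search, and Baidu Baike search. See the documentation for details.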

samzhangjy / BaiduSpider

Projects that are alternatives to or similar to BaiduSpider

!! This project has moved to https://github.com/BaiduSpider/BaiduSpider. This repository will no longer be updated; future updates will be published at BaiduSpider/BaiduSpider. !!

BaiduSpider