All Projects → Spider_job → Similar Projects or Alternatives

1331 Open source projects that are alternatives of or similar to Spider_job

豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章

Stars: ✭ 615 (+162.82%)

Mutual labels: spider, scrapy, mongodb

Scrapy demo

all kinds of scrapy demo

Stars: ✭ 128 (-45.3%)

Mutual labels: spider, scrapy, mongodb

Alltheplaces

A set of spiders and scrapers to extract location information from places that post their location on the internet.

Stars: ✭ 277 (+18.38%)

Mutual labels: spider, scrapy

Jspider

JSpider会每周更新至少一个网站的JS解密方式，欢迎 Star，交流微信：13298307816

Stars: ✭ 914 (+290.6%)

Mutual labels: spider, scrapy

Crawlab

Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台，支持任何语言和框架

Stars: ✭ 8,392 (+3486.32%)

Mutual labels: spider, scrapy

toutiao

今日头条科技新闻接口爬虫

Stars: ✭ 17 (-92.74%)

Mutual labels: spider, scrapy

Dpspider

大众点评爬虫、API，可以进行单独城市、单独地区、单独商铺的爬取、搜索、多类型地区搜索、信息获取、提供MongoDB数据库存储支持，可以进行点评文本解密的爬取、存储

Stars: ✭ 259 (+10.68%)

Mutual labels: spider, mongodb

Webhubbot

Python + Scrapy + MongoDB . 5 million data per day !!!💥 The world's largest website.

Stars: ✭ 5,427 (+2219.23%)

Mutual labels: scrapy, mongodb

Jd spider

两只蠢萌京东的分布式爬虫.

Stars: ✭ 738 (+215.38%)

Mutual labels: scrapy, mongodb

Scrala

Unmaintained 🐳 ☕️ 🕷 Scala crawler(spider) framework, inspired by scrapy, created by @gaocegege

Stars: ✭ 113 (-51.71%)

Mutual labels: spider, scrapy

Feapder

feapder是一款支持分布式、批次采集、任务防丢、报警丰富的python爬虫框架

Stars: ✭ 110 (-52.99%)

Mutual labels: spider, scrapy

Py Elasticsearch Django

基于python语言开发的千万级别搜索引擎

Stars: ✭ 207 (-11.54%)

Mutual labels: spider, scrapy

scrapy facebooker

Collection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.

Stars: ✭ 22 (-90.6%)

Mutual labels: spider, scrapy

scrapy-admin

A django admin site for scrapy

Stars: ✭ 44 (-81.2%)

Mutual labels: spider, scrapy

Tieba spider

百度贴吧爬虫(基于scrapy和mysql)

Stars: ✭ 257 (+9.83%)

Mutual labels: spider, scrapy

PttImageSpider

PTT 圖片下載器 (抓取整個看板的圖片，並用文章標題作為資料夾的名稱 ) (使用Scrapy)

Stars: ✭ 16 (-93.16%)

Mutual labels: spider, scrapy

Fbcrawl

A Facebook crawler

Stars: ✭ 536 (+129.06%)

Mutual labels: spider, scrapy

Istock

👉一个基于spring boot 实现的java股票爬虫(仅支持A股)，如果你❤️请⭐️ . V2升级版正在开发中！

Stars: ✭ 622 (+165.81%)

Mutual labels: spider, mongodb

Mailinglistscraper

A python web scraper for public email lists.

Stars: ✭ 19 (-91.88%)

Mutual labels: spider, scrapy

Elves

🎊 Design and implement of lightweight crawler framework.

Stars: ✭ 315 (+34.62%)

Mutual labels: spider, scrapy

Capturer

capture pictures from website like sina, lofter, huaban and so on

Stars: ✭ 76 (-67.52%)

Mutual labels: spider, scrapy

Distributed Multi User Scrapy System With A Web Ui

Django based application that allows creating, deploying and running Scrapy spiders in a distributed manner

Stars: ✭ 88 (-62.39%)

Mutual labels: scrapy, mongodb

Yspider

yspider -- 轻量级爬虫系统

Stars: ✭ 125 (-46.58%)

Mutual labels: spider, mongodb

Copybook

用爬虫爬取小说网站上所有小说，存储到数据库中，并用爬到的数据构建自己的小说网站

Stars: ✭ 117 (-50%)

Mutual labels: spider, scrapy

Fp Server

Free proxy server, continuously crawling and providing proxies, based on Tornado and Scrapy. 免费代理服务器，基于Tornado和Scrapy，在本地搭建属于自己的代理池

Stars: ✭ 154 (-34.19%)

Mutual labels: spider, scrapy

Scrapingoutsourcing

ScrapingOutsourcing专注分享爬虫代码尽量每周更新一个

Stars: ✭ 164 (-29.91%)

Mutual labels: spider, scrapy

Scrapydweb

Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI. DEMO 👉

Stars: ✭ 2,385 (+919.23%)

Mutual labels: spider, scrapy

V2EX Spider

V2EX爬虫

Stars: ✭ 21 (-91.03%)

Mutual labels: spider, scrapy

Scrapy-Spiders

一个基于Scrapy的数据采集爬虫代码库

Stars: ✭ 34 (-85.47%)

Mutual labels: spider, scrapy

douban-spider

基于Scrapy框架的豆瓣电影爬虫

Stars: ✭ 25 (-89.32%)

Mutual labels: spider, scrapy

python-fxxk-spider

收集各种免费的 Python 爬虫项目

Stars: ✭ 184 (-21.37%)

Mutual labels: spider, scrapy

Douban Crawler

Uno Crawler por https://douban.com

Stars: ✭ 13 (-94.44%)

Mutual labels: spider, scrapy

ip proxy pool

Generating spiders dynamically to crawl and check those free proxy ip on the internet with scrapy.

Stars: ✭ 39 (-83.33%)

Mutual labels: spider, scrapy

Happy Spiders

🔧 🔩 🔨 收集整理了爬虫相关的工具、模拟登陆技术、代理IP、scrapy模板代码等内容。

Stars: ✭ 261 (+11.54%)

Mutual labels: spider, scrapy

python-spider

python爬虫小项目【持续更新】【笔趣阁小说下载、Tweet数据抓取、天气查询、网易云音乐逆向、天天基金网查询、微博数据抓取（生成cookie）、有道翻译逆向、企查查免登陆爬虫、大众点评svg加密破解、B站用户爬虫、拉钩免登录爬虫、自如租房字体加密、知乎问答

Stars: ✭ 45 (-80.77%)

Mutual labels: spider, scrapy

Haipproxy

💖 High available distributed ip proxy pool, powerd by Scrapy and Redis

Stars: ✭ 4,993 (+2033.76%)

Mutual labels: spider, scrapy

Icrawler

A multi-thread crawler framework with many builtin image crawlers provided.

Stars: ✭ 629 (+168.8%)

Mutual labels: spider, scrapy

Bdp Dataplatform

大数据生态解决方案数据平台：基于大数据、数据平台、微服务、机器学习、商城、自动化运维、DevOps、容器部署平台、数据平台采集、数据平台存储、数据平台计算、数据平台开发、数据平台应用搭建的大数据解决方案。

Stars: ✭ 456 (+94.87%)

Mutual labels: spider, mongodb

Seeker

Seeker - another job board aggregator.

Stars: ✭ 16 (-93.16%)

Mutual labels: spider, scrapy

Funpyspidersearchengine

Word2vec 千人千面个性化搜索 + Scrapy2.3.0(爬取数据) + ElasticSearch7.9.1(存储数据并提供对外Restful API) + Django3.1.1 搜索

Stars: ✭ 782 (+234.19%)

Mutual labels: spider, scrapy

App comments spider

爬取百度贴吧、TapTap、appstore、微博官方博主上的游戏评论(基于redis_scrapy)，过滤器采用了bloomfilter。

Stars: ✭ 38 (-83.76%)

Mutual labels: spider, scrapy

scrapy-distributed

A series of distributed components for Scrapy. Including RabbitMQ-based components, Kafka-based components, and RedisBloom-based components for Scrapy.

Stars: ✭ 38 (-83.76%)

Mutual labels: spider, scrapy

Gerapy

Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Django and Vue.js

Stars: ✭ 2,601 (+1011.54%)

Mutual labels: spider, scrapy

Image Downloader

Download images from Google, Bing, Baidu. 谷歌、百度、必应图片下载.

Stars: ✭ 1,173 (+401.28%)

Mutual labels: spider, scrapy

Hive

lots of spider (很多爬虫）

Stars: ✭ 110 (-52.99%)

Mutual labels: spider, scrapy

Alipayspider Scrapy

AlipaySpider on Scrapy(use chrome driver); 支付宝爬虫(基于Scrapy)

Stars: ✭ 70 (-70.09%)

Mutual labels: spider, scrapy

Crawlab Lite

Lite version of Crawlab. 轻量版 Crawlab 爬虫管理平台

Stars: ✭ 122 (-47.86%)

Mutual labels: spider, scrapy

Docs

《数据采集从入门到放弃》源码。内容简介：爬虫介绍、就业情况、爬虫工程师面试题；HTTP协议介绍； Requests使用；解析器Xpath介绍； MongoDB与MySQL；多线程爬虫； Scrapy介绍；Scrapy-redis介绍；使用docker部署；使用nomad管理docker集群；使用EFK查询docker日志

Stars: ✭ 118 (-49.57%)

Mutual labels: scrapy, mongodb

Spiderkeeper

admin ui for scrapy/open source scrapinghub

Stars: ✭ 2,562 (+994.87%)

Mutual labels: spider, scrapy

Reptile

🏀 Python3 网络爬虫实战（部分含详细教程）猫眼腾讯视频豆瓣研招网微博笔趣阁小说百度热点 B站 CSDN 网易云阅读阿里文学百度股票今日头条微信公众号网易云音乐拉勾有道 unsplash 实习僧汽车之家英雄联盟盒子大众点评链家 LPL赛程台风梦幻西游、阴阳师藏宝阁天气牛客网百度文库睡前故事知乎 Wish

Stars: ✭ 1,048 (+347.86%)

Mutual labels: spider, scrapy

Python3 Spider

Python爬虫实战 - 模拟登陆各大网站包含但不限于：滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝，如果喜欢请start ❤️

Stars: ✭ 2,129 (+809.83%)

Mutual labels: spider, scrapy

Awesome Web Scraper

A collection of awesome web scaper, crawler.

Stars: ✭ 147 (-37.18%)

Mutual labels: spider, scrapy

Marmot

💐Marmot | Web Crawler/HTTP protocol Download Package 🐭

Stars: ✭ 186 (-20.51%)

Mutual labels: spider, scrapy

Taobaoscrapy

😩Tool For Taobao/Tmall| 儿时玩具已经过时

Stars: ✭ 146 (-37.61%)

Mutual labels: spider, scrapy

photo-spider-scrapy

10 photo website spiders, 10 个国外图库的 scrapy 爬虫代码

Stars: ✭ 17 (-92.74%)

Mutual labels: spider, scrapy

OpenScraper

An open source webapp for scraping: towards a public service for webscraping

Stars: ✭ 80 (-65.81%)

Mutual labels: spider, scrapy

Django Dynamic Scraper

Creating Scrapy scrapers via the Django admin interface

Stars: ✭ 1,024 (+337.61%)

Mutual labels: spider, scrapy

Zi5book

book.zi5.me全站kindle电子书籍爬取，按照作者书籍名分类，每本书有mobi和equb两种格式，采用分布式进行全站爬取

Stars: ✭ 191 (-18.38%)

Mutual labels: spider, mongodb

Goribot

[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。

Stars: ✭ 190 (-18.8%)

Mutual labels: spider, scrapy

Fooproxy

稳健高效的评分制-针对性- IP代理池 + API服务，可以自己插入采集器进行代理IP的爬取，针对你的爬虫的一个或多个目标网站分别生成有效的IP代理数据库，支持MongoDB 4.0 使用 Python3.7（Scored IP proxy pool ,customise proxy data crawler can be added anytime）

Stars: ✭ 195 (-16.67%)

Mutual labels: spider, mongodb

1-60 of 1331 similar projects

›

next*5