All Projects → weaver → Similar Projects or Alternatives

442 Open source projects that are alternatives of or similar to weaver

稳健高效的评分制-针对性- IP代理池 + API服务，可以自己插入采集器进行代理IP的爬取，针对你的爬虫的一个或多个目标网站分别生成有效的IP代理数据库，支持MongoDB 4.0 使用 Python3.7（Scored IP proxy pool ,customise proxy data crawler can be added anytime）

Stars: ✭ 195 (+170.83%)

Mutual labels: spider

Taobaoscrapy

😩Tool For Taobao/Tmall| 儿时玩具已经过时

Stars: ✭ 146 (+102.78%)

Mutual labels: spider

Jd mask robot

京东口罩库存监控爬虫(非selenium)，扫码登录、查价、加购、下单、秒杀

Stars: ✭ 216 (+200%)

Mutual labels: spider

Venom

All Terrain Autonomous Quadruped

Stars: ✭ 145 (+101.39%)

Mutual labels: spider

Goribot

[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。

Stars: ✭ 190 (+163.89%)

Mutual labels: spider

Qiandao

🌟⏳🌟 各种网站的签到（停止维护）

Stars: ✭ 141 (+95.83%)

Mutual labels: spider

Ppspider

web spider built by puppeteer, support task-queue and task-scheduling by decorators，support nedb / mongodb, support data visualization; 基于puppeteer的web爬虫框架，提供灵活的任务队列管理调度方案，提供便捷的数据保存方案（nedb/mongodb），提供数据可视化和用户交互的实现方案

Stars: ✭ 237 (+229.17%)

Mutual labels: spider

Go spider

[爬虫框架 (golang)] An awesome Go concurrent Crawler(spider) framework. The crawler is flexible and modular. It can be expanded to an Individualized crawler easily or you can use the default crawl components only.

Stars: ✭ 1,745 (+2323.61%)

Mutual labels: spider

Videospider

抓取豆瓣，bilibili等中的电视剧、电影、动漫演员等信息

Stars: ✭ 186 (+158.33%)

Mutual labels: spider

Bilibili User Information Spider

B站3亿用户信息爬虫（mid号，昵称，性别，关注，粉丝，等级）

Stars: ✭ 136 (+88.89%)

Mutual labels: spider

Lspider

LSpider 一个为被动扫描器定制的前端爬虫

Stars: ✭ 214 (+197.22%)

Mutual labels: spider

Digger

Digger is a powerful and flexible web crawler implemented by pure golang

Stars: ✭ 130 (+80.56%)

Mutual labels: spider

Lianjia Beike Spider

链家网和贝壳网房价爬虫，采集北京上海广州深圳等21个中国主要城市的房价数据（小区，二手房，出租房，新房），稳定可靠快速！支持csv,MySQL, MongoDB,Excel, json存储，支持Python2和3，图表展示数据，注释丰富，点星支持，仅供学习参考，请勿用于商业用途，后果自负。

Stars: ✭ 2,257 (+3034.72%)

Mutual labels: spider

Weibo Topic Spider

微博超级话题爬虫，微博词频统计+情感分析+简单分类，新增肺炎超话爬取数据

Stars: ✭ 128 (+77.78%)

Mutual labels: spider

Awesome Spider

爬虫集合

Stars: ✭ 16,623 (+22987.5%)

Mutual labels: spider

Feapder

feapder是一款支持分布式、批次采集、任务防丢、报警丰富的python爬虫框架

Stars: ✭ 110 (+52.78%)

Mutual labels: spider

Zhihu Crawler People

A simple distributed crawler for zhihu && data analysis

Stars: ✭ 182 (+152.78%)

Mutual labels: spider

Douban crawler

备份豆瓣计划

Stars: ✭ 124 (+72.22%)

Mutual labels: spider

Dht

BitTorrent DHT Protocol && DHT Spider.

Stars: ✭ 2,459 (+3315.28%)

Mutual labels: spider

Crawlab Lite

Lite version of Crawlab. 轻量版 Crawlab 爬虫管理平台

Stars: ✭ 122 (+69.44%)

Mutual labels: spider

Crack Js Spider

破解JS反爬虫加密参数，已破解中国裁判文书网（2020-06-30更新），淘宝密码，天安保险登录，b站登录，房天下登录，WPS登录，微博登录，有道翻译，网易登录，微信公众号登录，空中网登录，今目标登录，学生信息管理系统登录，共赢金融登录，重庆科技资源共享平台登录，网易云音乐下载，一键解析视频链接，财联社登录。

Stars: ✭ 175 (+143.06%)

Mutual labels: spider

Pddspider

拼多多爬虫，爬取所有商品、评论等信息

Stars: ✭ 121 (+68.06%)

Mutual labels: spider

Article spider

微信公众号爬虫

Stars: ✭ 235 (+226.39%)

Mutual labels: spider

Free proxy website

获取免费socks/https/http代理的网站集合

Stars: ✭ 119 (+65.28%)

Mutual labels: spider

Stackoverflow Spider

📖 爬取 Stackoverflow 100万条问答并简单分析

Stars: ✭ 174 (+141.67%)

Mutual labels: spider

Copybook

用爬虫爬取小说网站上所有小说，存储到数据库中，并用爬到的数据构建自己的小说网站

Stars: ✭ 117 (+62.5%)

Mutual labels: spider

Py Elasticsearch Django

基于python语言开发的千万级别搜索引擎

Stars: ✭ 207 (+187.5%)

Mutual labels: spider

House Price Prediction

房价预测完整项目：1.爬取链家网数据 2.处理后，用sklearn中几个逻辑回归机器学习模型和keras神经网络搭建模型预测房价最终结果神经网络效果更好，R^2值0.75左右

Stars: ✭ 116 (+61.11%)

Mutual labels: spider

Spoon

🥄 A package for building specific Proxy Pool for different Sites.

Stars: ✭ 173 (+140.28%)

Mutual labels: spider

Bilibili member crawler

B站用户爬虫好耶~是爬虫

Stars: ✭ 115 (+59.72%)

Mutual labels: spider

dht-spider

一个简单的基于DHT协议的BT磁力链接爬虫

Stars: ✭ 16 (-77.78%)

Mutual labels: spider

Geetest

滑动验证码，希望对你们有所帮助❤️

Stars: ✭ 114 (+58.33%)

Mutual labels: spider

Linkedin Profile Scraper

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.

Stars: ✭ 171 (+137.5%)

Mutual labels: spider

Scrala

Unmaintained 🐳 ☕️ 🕷 Scala crawler(spider) framework, inspired by scrapy, created by @gaocegege

Stars: ✭ 113 (+56.94%)

Mutual labels: spider

Wereader

一个功能全面的微信读书爬虫 wereader

Stars: ✭ 207 (+187.5%)

Mutual labels: spider

Cockroach

又一个 java 内容（pa）获取（chong）工具

Stars: ✭ 112 (+55.56%)

Mutual labels: spider

Gain

Web crawling framework based on asyncio.

Stars: ✭ 2,002 (+2680.56%)

Mutual labels: spider

Jobs Search

🕷招聘网站爬虫合集，不定期更新分支

Stars: ✭ 111 (+54.17%)

Mutual labels: spider

Spiderkeeper

admin ui for scrapy/open source scrapinghub

Stars: ✭ 2,562 (+3458.33%)

Mutual labels: spider

Not Your Average Web Crawler

A web crawler (for bug hunting) that gathers more than you can imagine.

Stars: ✭ 107 (+48.61%)

Mutual labels: spider

Jandan spider

使用Python3爬取煎蛋妹纸图片

Stars: ✭ 168 (+133.33%)

Mutual labels: spider

Daily scripts

日常小脚本，懒人欢乐多。

Stars: ✭ 105 (+45.83%)

Mutual labels: spider

Jssoup

JavaScript + BeautifulSoup = JSSoup

Stars: ✭ 203 (+181.94%)

Mutual labels: spider

Animesearcher

整合第三方网站的视频和弹幕资源, 为白嫖党提供最佳看番追剧体验

Stars: ✭ 101 (+40.28%)

Mutual labels: spider

Scrapingoutsourcing

ScrapingOutsourcing专注分享爬虫代码尽量每周更新一个

Stars: ✭ 164 (+127.78%)

Mutual labels: spider

Ruia

Async Python 3.6+ web scraping micro-framework based on asyncio

Stars: ✭ 1,366 (+1797.22%)

Mutual labels: spider

Fast Lianjia Crawler

直接通过链家 API 抓取数据的极速爬虫，宇宙最快~~ 🚀

Stars: ✭ 247 (+243.06%)

Mutual labels: spider

Douyinsdk

抖音 SDK，数据采集，爬虫抓取不是梦

Stars: ✭ 99 (+37.5%)

Mutual labels: spider

Yispider

一款分布式爬虫平台，帮助你更好的管理和开发爬虫。内置一套爬虫定义规则（模版），可使用模版快速定义爬虫，也可当作框架手动开发爬虫。(兴趣使然的项目，用的不爽了就更新)

Stars: ✭ 158 (+119.44%)

Mutual labels: spider

Economic audit knowledge graph

经济责任审计知识图谱：网络爬虫、关系抽取、领域词汇判定

Stars: ✭ 98 (+36.11%)

Mutual labels: spider

Zhihuspider

多线程知乎用户爬虫，基于python3

Stars: ✭ 201 (+179.17%)

Mutual labels: spider

Zhihuspider

知乎用户公开个人信息爬虫, 能够爬取用户关注关系，基于Python、使用代理、多线程

Stars: ✭ 92 (+27.78%)

Mutual labels: spider

Abot

Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.

Stars: ✭ 1,961 (+2623.61%)

Mutual labels: spider

Csdn Spider

爬取CSDN上的博客文章

Stars: ✭ 89 (+23.61%)

Mutual labels: spider

Chromium for spider

dynamic crawler for web vulnerability scanner

Stars: ✭ 220 (+205.56%)

Mutual labels: spider

Zhihu Spider

知乎爬虫程序，定时跟踪问题数据，定时推送热门话题

Stars: ✭ 87 (+20.83%)

Mutual labels: spider

Portia Dashboard

portia-dashboard is a visual web crawler based on scrapinghub/portia

Stars: ✭ 199 (+176.39%)

Mutual labels: spider

Python3 Spider

Python爬虫实战 - 模拟登陆各大网站包含但不限于：滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝，如果喜欢请start ❤️

Stars: ✭ 2,129 (+2856.94%)

Mutual labels: spider

Fp Server

Free proxy server, continuously crawling and providing proxies, based on Tornado and Scrapy. 免费代理服务器，基于Tornado和Scrapy，在本地搭建属于自己的代理池

Stars: ✭ 154 (+113.89%)

Mutual labels: spider

python-spider

零基础学习python爬虫

Stars: ✭ 31 (-56.94%)

Mutual labels: spider

61-120 of 442 similar projects

‹

›

next*5