All Projects → Ppspider → Similar Projects or Alternatives

2403 Open source projects that are alternatives of or similar to Ppspider

Learnpython
Python的基础练习代码与各种爬虫代码
Stars: ✭ 451 (+90.3%)
Mutual labels:  crawler, spider
Douyin
API of DouYin for Humans used to Crawl Popular Videos and Musics
Stars: ✭ 580 (+144.73%)
Mutual labels:  crawler, spider
Puppeteer Api Zh cn
📖 Puppeteer中文文档(官方指定的中文文档)
Stars: ✭ 697 (+194.09%)
Mutual labels:  puppeteer, headless
Headless Chrome Crawler
Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (+2064.14%)
Mutual labels:  crawler, puppeteer
Gospider
Gospider - Fast web spider written in Go
Stars: ✭ 785 (+231.22%)
Mutual labels:  crawler, spider
Lianjia Beike Spider
链家网和贝壳网房价爬虫,采集北京上海广州深圳等21个中国主要城市的房价数据(小区,二手房,出租房,新房),稳定可靠快速!支持csv,MySQL, MongoDB,Excel, json存储,支持Python2和3,图表展示数据,注释丰富 ,点星支持,仅供学习参考,请勿用于商业用途,后果自负。
Stars: ✭ 2,257 (+852.32%)
Mutual labels:  crawler, spider
Thal
Getting started with Puppeteer and Chrome Headless for Web Scraping
Stars: ✭ 2,345 (+889.45%)
Mutual labels:  mongodb, puppeteer
Maman
Rust Web Crawler saving pages on Redis
Stars: ✭ 39 (-83.54%)
Mutual labels:  crawler, spider
Jssoup
JavaScript + BeautifulSoup = JSSoup
Stars: ✭ 203 (-14.35%)
Mutual labels:  crawler, spider
Serverless Puppeteer Layers
Serverless Framework + AWS Lambda Layers + Puppeteer = ❤️
Stars: ✭ 247 (+4.22%)
Mutual labels:  puppeteer, headless
Magic google
Google search results crawler, get google search results that you need
Stars: ✭ 247 (+4.22%)
Mutual labels:  crawler, spider
Axegrinder
Crawl websites for accessibility issues from the command line.
Stars: ✭ 12 (-94.94%)
Mutual labels:  crawler, headless
ZSpider
基于Electron爬虫程序
Stars: ✭ 37 (-84.39%)
Mutual labels:  spider, puppeteer
puppeteer-lambda
Module for using Headless-Chrome by Puppeteer on AWS Lambda.
Stars: ✭ 117 (-50.63%)
Mutual labels:  headless, puppeteer
crawler
A simple and flexible web crawler framework for java.
Stars: ✭ 20 (-91.56%)
Mutual labels:  crawler, spider
Bdp Dataplatform
大数据生态解决方案数据平台:基于大数据、数据平台、微服务、机器学习、商城、自动化运维、DevOps、容器部署平台、数据平台采集、数据平台存储、数据平台计算、数据平台开发、数据平台应用搭建的大数据解决方案。
Stars: ✭ 456 (+92.41%)
Mutual labels:  spider, mongodb
bots-zoo
No description or website provided.
Stars: ✭ 59 (-75.11%)
Mutual labels:  crawler, puppeteer
WebCrawler
一个轻量级、快速、多线程、多管道、灵活配置的网络爬虫。
Stars: ✭ 39 (-83.54%)
Mutual labels:  crawler, spider
galer
A fast tool to fetch URLs from HTML attributes by crawl-in.
Stars: ✭ 138 (-41.77%)
Mutual labels:  crawler, spider
slime
🍰 一个可视化的爬虫平台
Stars: ✭ 27 (-88.61%)
Mutual labels:  crawler, spider
Hacker News Digest
📰 A responsive interface of Hacker News with summaries and thumbnails.
Stars: ✭ 278 (+17.3%)
Mutual labels:  crawler, spider
Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+16.88%)
Mutual labels:  crawler, spider
Gospider
golang实现的爬虫框架,使用者只需关心页面规则,提供web管理界面。基于colly开发。
Stars: ✭ 285 (+20.25%)
Mutual labels:  crawler, spider
arachnod
High performance crawler for Nodejs
Stars: ✭ 17 (-92.83%)
Mutual labels:  crawler, spider
91porn Api
🌭💦 91porn爬虫在线无限制API接口(永久有效,口令每日更新) 及 在线web预览
Stars: ✭ 341 (+43.88%)
Mutual labels:  crawler, spider
Zhihu Login
知乎模拟登录,支持提取验证码和保存 Cookies
Stars: ✭ 340 (+43.46%)
Mutual labels:  crawler, spider
Freshonions Torscraper
Fresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion
Stars: ✭ 348 (+46.84%)
Mutual labels:  crawler, spider
Toapi
Every web site provides APIs.
Stars: ✭ 3,209 (+1254.01%)
Mutual labels:  crawler, spider
Signature algorithm
各种App、小程序、网站的请求签名或加密算法。 现已有:自如、小红书、蛋壳公寓、luckin coffee(瑞幸咖啡)、bangkokair(曼谷航空)
Stars: ✭ 380 (+60.34%)
Mutual labels:  crawler, spider
Spider Flow
新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
Stars: ✭ 365 (+54.01%)
Mutual labels:  crawler, spider
Go jobs
带你了解一下Golang的市场行情
Stars: ✭ 526 (+121.94%)
Mutual labels:  crawler, spider
Zi5book
book.zi5.me全站kindle电子书籍爬取,按照作者书籍名分类,每本书有mobi和equb两种格式,采用分布式进行全站爬取
Stars: ✭ 191 (-19.41%)
Mutual labels:  spider, mongodb
Awesome Crawler
A collection of awesome web crawler,spider in different languages
Stars: ✭ 4,793 (+1922.36%)
Mutual labels:  crawler, spider
Spider
python crawler spider
Stars: ✭ 70 (-70.46%)
Mutual labels:  crawler, spider
Digger
Digger is a powerful and flexible web crawler implemented by pure golang
Stars: ✭ 130 (-45.15%)
Mutual labels:  crawler, spider
Fun crawler
Crawl some picture for fun
Stars: ✭ 169 (-28.69%)
Mutual labels:  crawler, spider
Zhihu Crawler People
A simple distributed crawler for zhihu && data analysis
Stars: ✭ 182 (-23.21%)
Mutual labels:  crawler, spider
Goribot
[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。
Stars: ✭ 190 (-19.83%)
Mutual labels:  crawler, spider
Spider job
招聘网数据爬虫
Stars: ✭ 234 (-1.27%)
Mutual labels:  spider, mongodb
Smartproxy
HTTP(S) Rotating Residential proxies - Code examples & General information
Stars: ✭ 205 (-13.5%)
Mutual labels:  proxy
Clojurenews
Clojure News Web Application - (Hacker News Clone)
Stars: ✭ 217 (-8.44%)
Mutual labels:  mongodb
Mongoke
Instant Graphql for MongoDb (active branch is golang, rewrite in process)
Stars: ✭ 203 (-14.35%)
Mutual labels:  mongodb
Root Cause
🔍 Root Cause is a tool for troubleshooting Puppeteer and Playwright tests. 🔎
Stars: ✭ 205 (-13.5%)
Mutual labels:  puppeteer
Doxycannon
A poorman's proxycannon and botnet, using docker, ovpn files, and a dante socks5 proxy
Stars: ✭ 216 (-8.86%)
Mutual labels:  proxy
Woid
Simple news aggregator displaying top stories in real time
Stars: ✭ 204 (-13.92%)
Mutual labels:  crawler
Koolreport
This is an Open Source PHP Reporting Framework which you can use to write perfect data reports or to construct awesome dashboards using PHP
Stars: ✭ 204 (-13.92%)
Mutual labels:  mongodb
Jsonbox
HTTP-based JSON storage.
Stars: ✭ 2,440 (+929.54%)
Mutual labels:  mongodb
Hivemq Mqtt Tensorflow Kafka Realtime Iot Machine Learning Training Inference
Real Time Big Data / IoT Machine Learning (Model Training and Inference) with HiveMQ (MQTT), TensorFlow IO and Apache Kafka - no additional data store like S3, HDFS or Spark required
Stars: ✭ 204 (-13.92%)
Mutual labels:  mongodb
Store
A beautifully-simple framework-agnostic modern state management library.
Stars: ✭ 204 (-13.92%)
Mutual labels:  proxy
Chameleon
Customizable honeypots for monitoring network traffic, bots activities and username\password credentials (DNS, HTTP Proxy, HTTP, HTTPS, SSH, POP3, IMAP, STMP, RDP, VNC, SMB, SOCKS5, Redis, TELNET, Postgres and MySQL)
Stars: ✭ 230 (-2.95%)
Mutual labels:  proxy
Pending Xhr Puppeteer
Small tool to wait that all xhr are finished in puppeteer
Stars: ✭ 227 (-4.22%)
Mutual labels:  puppeteer
Artipub
Article publishing platform that automatically distributes your articles to various media channels
Stars: ✭ 2,685 (+1032.91%)
Mutual labels:  mongodb
Mongo Perl Driver
Perl driver for the MongoDB
Stars: ✭ 203 (-14.35%)
Mutual labels:  mongodb
Sanity
The Sanity Studio – Collaborate in real-time on structured content
Stars: ✭ 3,007 (+1168.78%)
Mutual labels:  headless
Socks
socks -- a proxy server.
Stars: ✭ 202 (-14.77%)
Mutual labels:  proxy
Nodejs
Node.js基础与应用教程,适合初学者入门,以及有一定经验的开发者提高。Node.js全栈交流QQ群:423652352,node.js或者全栈开发培训QQ群:579500717
Stars: ✭ 202 (-14.77%)
Mutual labels:  mongodb
Page Skeleton Webpack Plugin
Webpack plugin to generate the skeleton page automatically
Stars: ✭ 2,632 (+1010.55%)
Mutual labels:  puppeteer
Skipper
An HTTP router and reverse proxy for service composition, including use cases like Kubernetes Ingress
Stars: ✭ 2,606 (+999.58%)
Mutual labels:  proxy
Net Shield
An Easy and Simple Anti-DDoS solution for VPS,Dedicated Servers and IoT devices - Beta
Stars: ✭ 202 (-14.77%)
Mutual labels:  proxy
Tiktok Signature
Generate tiktok signature token using node
Stars: ✭ 202 (-14.77%)
Mutual labels:  puppeteer
121-180 of 2403 similar projects