All Projects → Pyspider → Similar Projects or Alternatives

398 Open source projects that are alternatives of or similar to Pyspider

Ruia

Async Python 3.6+ web scraping micro-framework based on asyncio

Stars: ✭ 1,366 (-91.04%)

Mutual labels: crawler

Mmjpg

👩 美女写真套图爬虫（一）

Stars: ✭ 398 (-97.39%)

Mutual labels: crawler

Fooproxy

稳健高效的评分制-针对性- IP代理池 + API服务，可以自己插入采集器进行代理IP的爬取，针对你的爬虫的一个或多个目标网站分别生成有效的IP代理数据库，支持MongoDB 4.0 使用 Python3.7（Scored IP proxy pool ,customise proxy data crawler can be added anytime）

Stars: ✭ 195 (-98.72%)

Mutual labels: crawler

Signature algorithm

各种App、小程序、网站的请求签名或加密算法。现已有：自如、小红书、蛋壳公寓、luckin coffee(瑞幸咖啡)、bangkokair(曼谷航空)

Stars: ✭ 380 (-97.51%)

Mutual labels: crawler

Antispider

Stars: ✭ 99 (-99.35%)

Mutual labels: crawler

Netease Music Cracker

🎵 将可下载的网易云音乐的缓存文件转换为 MP3 文件

Stars: ✭ 373 (-97.55%)

Mutual labels: crawler

Ngmeta

Dynamic meta tags in your AngularJS single page application

Stars: ✭ 152 (-99%)

Mutual labels: crawler

Jivesearch

A search engine that doesn't track you.

Stars: ✭ 364 (-97.61%)

Mutual labels: crawler

Gopa Abandoned

GOPA, a spider written in Go.（NOTE: this project moved to https://github.com/infinitbyte/gopa ）

Stars: ✭ 98 (-99.36%)

Mutual labels: crawler

Fictiondown

Stars: ✭ 362 (-97.62%)

Mutual labels: crawler

Fast Lianjia Crawler

直接通过链家 API 抓取数据的极速爬虫，宇宙最快~~ 🚀

Stars: ✭ 247 (-98.38%)

Mutual labels: crawler

Instagramcrawler

A non API python program to crawl public photos, posts or followers

Stars: ✭ 349 (-97.71%)

Mutual labels: crawler

Amazonrobot

Amazon商品引流的 python 爬虫

Stars: ✭ 97 (-99.36%)

Mutual labels: crawler

Scavenger

Crawler (Bot) searching for credential leaks on different paste sites.

Stars: ✭ 347 (-97.72%)

Mutual labels: crawler

Ptt Alertor

📢 Ptt 文章通知機器人！Notify Ptt Article in Realtime

Stars: ✭ 150 (-99.02%)

Mutual labels: crawler

Pornhub Downloader

Download videos from pornhub.

Stars: ✭ 346 (-97.73%)

Mutual labels: crawler

Scaleable Crawler With Docker Cluster

a scaleable and efficient crawelr with docker cluster , crawl million pages in 2 hours with a single machine

Stars: ✭ 96 (-99.37%)

Mutual labels: crawler

Ttbot

今日头条机器人，支持用户登陆、关注、取消关注、获取关注粉丝、发文、发悟空问答、点赞、评论、采集各种类型新闻讯息等，使用今日头条网页版API实现

Stars: ✭ 338 (-97.78%)

Mutual labels: crawler

Google Group Crawler

Get (almost) original messages from google group archives. Your data is yours.

Stars: ✭ 190 (-98.75%)

Mutual labels: crawler

Zhihu Login

知乎模拟登录，支持提取验证码和保存 Cookies

Stars: ✭ 340 (-97.77%)

Mutual labels: crawler

Gf Secrets

Secret and/ credential patterns used for gf.

Stars: ✭ 96 (-99.37%)

Mutual labels: crawler

91porn Crawler

🌭💦 91porn爬虫在线API接口（永久有效）及在线web预览

Stars: ✭ 329 (-97.84%)

Mutual labels: crawler

Cocrawler

CoCrawler is a versatile web crawler built using modern tools and concurrency.

Stars: ✭ 148 (-99.03%)

Mutual labels: crawler

Dom Crawler

The DomCrawler component eases DOM navigation for HTML and XML documents.

Stars: ✭ 3,499 (-77.04%)

Mutual labels: crawler

Hotnewsanalysis

利用文本挖掘技术进行新闻热点关注问题分析

Stars: ✭ 93 (-99.39%)

Mutual labels: crawler

Scylla

Intelligent proxy pool for Humans™ (Maintainer needed)

Stars: ✭ 3,409 (-77.63%)

Mutual labels: crawler

Jd mask robot

京东口罩库存监控爬虫(非selenium)，扫码登录、查价、加购、下单、秒杀

Stars: ✭ 216 (-98.58%)

Mutual labels: crawler

Toapi

Every web site provides APIs.

Stars: ✭ 3,209 (-78.94%)

Mutual labels: crawler

Proxy Pool

爬虫代理IP池服务，可供其他爬虫程序通过restapi获取

Stars: ✭ 91 (-99.4%)

Mutual labels: crawler

Hquery.php

An extremely fast web scraper that parses megabytes of invalid HTML in a blink of an eye. PHP5.3+, no dependencies.

Stars: ✭ 295 (-98.06%)

Mutual labels: crawler

Pachong

一些爬虫的代码

Stars: ✭ 147 (-99.04%)

Mutual labels: crawler

Python Automation Scripts

Simple yet powerful automation stuffs.

Stars: ✭ 292 (-98.08%)

Mutual labels: crawler

Geziyor

Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.

Stars: ✭ 1,246 (-91.82%)

Mutual labels: crawler

Sasila

一个灵活、友好的爬虫框架

Stars: ✭ 286 (-98.12%)

Mutual labels: crawler

Gecco

Easy to use lightweight web crawler（易用的轻量化网络爬虫）

Stars: ✭ 2,310 (-84.84%)

Mutual labels: crawler

Crawlertutorial

爬蟲極簡教學（fetch, parse, search, multiprocessing, API）- PTT 為例

Stars: ✭ 282 (-98.15%)

Mutual labels: crawler

Taiwan News Crawlers

Scrapy-based Crawlers for news of Taiwan

Stars: ✭ 83 (-99.46%)

Mutual labels: crawler

Dotnetspider

DotnetSpider, a .NET standard web crawling library. It is lightweight, efficient and fast high-level web crawling & scraping framework

Stars: ✭ 3,233 (-78.79%)

Mutual labels: crawler

Th Music Video Generator

Touhou Project random music video generator/player, crawling image and video from websites to generate MV.

Stars: ✭ 146 (-99.04%)

Mutual labels: crawler

Gopa

[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn

Stars: ✭ 277 (-98.18%)

Mutual labels: crawler

Is Google

Verify that a request is from Google crawlers using Google's DNS verification steps

Stars: ✭ 82 (-99.46%)

Mutual labels: crawler

Rcrawler

An R web crawler and scraper

Stars: ✭ 274 (-98.2%)

Mutual labels: crawler

Awesome Java Crawler

本仓库收集整理爬虫相关资源，开发语言以Java为主

Stars: ✭ 228 (-98.5%)

Mutual labels: crawler

Line Bot Tutorial

line-bot-tutorial use python flask

Stars: ✭ 267 (-98.25%)

Mutual labels: crawler

Wombat

Lightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.

Stars: ✭ 1,220 (-92%)

Mutual labels: crawler

Weibo terminator workflow

Update Version of weibo_terminator, This is Workflow Version aim at Get Job Done!

Stars: ✭ 259 (-98.3%)

Mutual labels: crawler

Crawler

Go process used to crawl websites

Stars: ✭ 147 (-99.04%)

Mutual labels: crawler

Spidy

The simple, easy to use command line web crawler.

Stars: ✭ 257 (-98.31%)

Mutual labels: crawler

Puppeteer Walker

a puppeteer walker 🕷 🕸

Stars: ✭ 78 (-99.49%)

Mutual labels: crawler

galer

A fast tool to fetch URLs from HTML attributes by crawl-in.

Stars: ✭ 138 (-99.09%)

Mutual labels: crawler

Marmot

💐Marmot | Web Crawler/HTTP protocol Download Package 🐭

Stars: ✭ 186 (-98.78%)

Mutual labels: crawler

Webb

Python: An all-in-one Web Crawler, Web Parser and Web Scrapping library!

Stars: ✭ 77 (-99.49%)

Mutual labels: crawler

Polite

Be nice on the web

Stars: ✭ 253 (-98.34%)

Mutual labels: crawler

Weibopicdownloader

免登录下载微博图片爬虫 Download Weibo Images without Logging-in

Stars: ✭ 247 (-98.38%)

Mutual labels: crawler

Skrape.it

A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.

Stars: ✭ 231 (-98.48%)

Mutual labels: crawler

Proxybroker

Proxy [Finder | Checker | Server]. HTTP(S) & SOCKS 🎭

Stars: ✭ 2,767 (-81.85%)

Mutual labels: crawler

Googlescraper

A Python module to scrape several search engines (like Google, Yandex, Bing, Duckduckgo, ...). Including asynchronous networking support.

Stars: ✭ 2,363 (-84.5%)

Mutual labels: crawler

Fun crawler

Crawl some picture for fun