1134 open source projects that are alternatives to or similar to Ruia

Fooproxy
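A robust, efficient, score-based, targeted IP proxy pool with an API service. You can plug in your own collectors to crawl proxy IPs, and it builds a separate database of valid proxy IPs for each of your crawler's target websites. Supports MongoDB 4.0 and uses Python 3.7. (Scored IP proxy pool; custom proxy data crawlers can be added at any time.)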
Stars: ✭ 195 (-85.72%)
Mutual labels:  asyncio, crawler, spider, aiohttp
Gain
Web crawling framework based on asyncio.
Stars: ✭ 2,002 (+46.56%)
Mutual labels:  asyncio, crawler, spider, aiohttp
Ok ip proxy pool
🍿 Crawler proxy IP pool (proxy pool) in Python 🍟 A reasonably OK IP proxy pool
Stars: ✭ 196 (-85.65%)
Mutual labels:  crawler, spider, aiohttp
Owllook
owllook - a novel search engine
Stars: ✭ 2,163 (+58.35%)
Mutual labels:  asyncio, spider, aiohttp
yutto
🧊 A cute and willful Bilibili video downloader (bilili V2)
Stars: ✭ 383 (-71.96%)
Mutual labels:  spider, aiohttp, asyncio
Learnpython
Basic Python practice code and assorted crawler code
Stars: ✭ 451 (-66.98%)
Mutual labels:  crawler, spider
Go jobs
A guided look at the market situation for Golang
Stars: ✭ 526 (-61.49%)
Mutual labels:  crawler, spider
Douyinsdk
Douyin SDK: data collection and crawler scraping are no longer a dream
Stars: ✭ 99 (-92.75%)
Mutual labels:  crawler, spider
Baiduimagespider
A super lightweight Baidu image crawler
Stars: ✭ 591 (-56.73%)
Mutual labels:  crawler, spider
Icrawler
A multi-thread crawler framework with many builtin image crawlers provided.
Stars: ✭ 629 (-53.95%)
Mutual labels:  crawler, spider
Grab Site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Stars: ✭ 680 (-50.22%)
Mutual labels:  crawler, spider
Python Dependency Injector
Dependency injection framework for Python
Stars: ✭ 1,203 (-11.93%)
Mutual labels:  asyncio, aiohttp
Geziyor
Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (-8.78%)
Mutual labels:  crawler, spider
Html2article
Main-content extraction from HTML web pages
Stars: ✭ 441 (-67.72%)
Mutual labels:  crawler, spider
Haipproxy
💖 Highly available distributed IP proxy pool, powered by Scrapy and Redis
Stars: ✭ 4,993 (+265.52%)
Mutual labels:  crawler, spider
Aiojobs
Job scheduler for managing background tasks (asyncio)
Stars: ✭ 492 (-63.98%)
Mutual labels:  asyncio, aiohttp
Raven Aiohttp
An aiohttp transport for raven-python
Stars: ✭ 92 (-93.27%)
Mutual labels:  asyncio, aiohttp
Easy Scraping Tutorial
Simple but useful Python web scraping tutorial code.
Stars: ✭ 583 (-57.32%)
Mutual labels:  asyncio, crawler
Spidr
A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (-51.98%)
Mutual labels:  crawler, spider
Fbcrawl
A Facebook crawler
Stars: ✭ 536 (-60.76%)
Mutual labels:  crawler, spider
Ant nest
Simple, clear and fast web crawler framework built on Python 3.6+, powered by asyncio.
Stars: ✭ 90 (-93.41%)
Mutual labels:  asyncio, spider
Aiomixcloud
Mixcloud API wrapper for Python and Async IO
Stars: ✭ 23 (-98.32%)
Mutual labels:  asyncio, aiohttp
Scrapit
Scraping scripts for various websites.
Stars: ✭ 25 (-98.17%)
Mutual labels:  crawler, spider
Nodespider
[DEPRECATED] Simple, flexible, delightful web crawler/spider package
Stars: ✭ 33 (-97.58%)
Mutual labels:  crawler, spider
V3n0m Scanner
Popular Pentesting scanner in Python3.6 for SQLi/XSS/LFI/RFI and other Vulns
Stars: ✭ 847 (-37.99%)
Mutual labels:  asyncio, aiohttp
Rocketgram
Modern and powerful asynchronous telegram bot framework.
Stars: ✭ 37 (-97.29%)
Mutual labels:  asyncio, aiohttp
Lizard
💐 Full Amazon Automatic Download
Stars: ✭ 41 (-97%)
Mutual labels:  crawler, spider
Gosint
OSINT Swiss Army Knife
Stars: ✭ 401 (-70.64%)
Mutual labels:  crawler, spider
Bilili
🍻 bilibili video (including bangumi) and danmaku downloader
Stars: ✭ 379 (-72.25%)
Mutual labels:  crawler, spider
Crawly
Crawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (-67.79%)
Mutual labels:  crawler, spider
Signature algorithm
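Request-signing and encryption algorithms for various apps, mini programs, and websites. Currently covers: Ziroom, Xiaohongshu, Danke Apartment, Luckin Coffee, and Bangkok Airways (bangkokair)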
Stars: ✭ 380 (-72.18%)
Mutual labels:  crawler, spider
Aiohttp Demos
Demos for aiohttp project
Stars: ✭ 517 (-62.15%)
Mutual labels:  asyncio, aiohttp
Awesome Crawler
A collection of awesome web crawlers and spiders in different languages
Stars: ✭ 4,793 (+250.88%)
Mutual labels:  crawler, spider
Xsrfprobe
The Prime Cross Site Request Forgery (CSRF) Audit and Exploitation Toolkit.
Stars: ✭ 532 (-61.05%)
Mutual labels:  crawler, spider
Spider Flow
A next-generation crawler platform: define crawl flows graphically and build crawlers without writing code.
Stars: ✭ 365 (-73.28%)
Mutual labels:  crawler, spider
Douyin
DouYin API for humans, used to crawl popular videos and music
Stars: ✭ 580 (-57.54%)
Mutual labels:  crawler, spider
Netdiscovery
NetDiscovery is a general-purpose crawler framework/middleware built on Vert.x, RxJava 2, and other frameworks.
Stars: ✭ 573 (-58.05%)
Mutual labels:  crawler, spider
Newcrawler
Free Web Scraping Tool with Java
Stars: ✭ 589 (-56.88%)
Mutual labels:  crawler, spider
Xxl Crawler
A distributed web crawler framework (XXL-CRAWLER).
Stars: ✭ 561 (-58.93%)
Mutual labels:  crawler, spider
Puppeteer Walker
a puppeteer walker 🕷 🕸
Stars: ✭ 78 (-94.29%)
Mutual labels:  crawler, spider
Aiobotocore
asyncio support for botocore library using aiohttp
Stars: ✭ 630 (-53.88%)
Mutual labels:  asyncio, aiohttp
Gopa Abandoned
GOPA, a spider written in Go. (NOTE: this project moved to https://github.com/infinitbyte/gopa)
Stars: ✭ 98 (-92.83%)
Mutual labels:  crawler, spider
Webster
a reliable high-level web crawling & scraping framework for Node.js.
Stars: ✭ 364 (-73.35%)
Mutual labels:  crawler, spider
Zhihu Crawler
zhihu-crawler is a high-performance, horizontally scalable, distributed crawler project based on Java, with support for a free HTTP proxy pool
Stars: ✭ 890 (-34.85%)
Mutual labels:  crawler, spider
Torbot
Dark Web OSINT Tool
Stars: ✭ 821 (-39.9%)
Mutual labels:  crawler, spider
Aioslacker
slacker wrapper for asyncio
Stars: ✭ 23 (-98.32%)
Mutual labels:  asyncio, aiohttp
Gospider
Gospider - Fast web spider written in Go
Stars: ✭ 785 (-42.53%)
Mutual labels:  crawler, spider
Crawler examples
Some classic web crawler projects.
Stars: ✭ 74 (-94.58%)
Mutual labels:  crawler, spider
Heroku Aiohttp Web
A project starter template for deploying an aiohttp app to Heroku
Stars: ✭ 14 (-98.98%)
Mutual labels:  asyncio, aiohttp
Maman
Rust Web Crawler saving pages on Redis
Stars: ✭ 39 (-97.14%)
Mutual labels:  crawler, spider
Crawler
A high performance web crawler in Elixir.
Stars: ✭ 781 (-42.83%)
Mutual labels:  crawler, spider
Awesome Python Primer
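An index of high-quality Chinese-language resources for self-taught Python beginners, including books, docs, and videos, covering crawling, web, data analysis, and machine learning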
Stars: ✭ 57 (-95.83%)
Mutual labels:  crawler, spider
Photon
Incredibly fast crawler designed for OSINT.
Stars: ✭ 8,332 (+509.96%)
Mutual labels:  crawler, spider
Car Prices
Golang crawler that scrapes Autohome's used-car product catalog
Stars: ✭ 57 (-95.83%)
Mutual labels:  crawler, spider
Hproxy
hproxy - Asynchronous IP proxy pool for crawlers that aims to make getting proxies as convenient as possible.
Stars: ✭ 62 (-95.46%)
Mutual labels:  asyncio, crawler
Avbook
AV film management system; crawlers for avmoo, javbus, and javlibrary; online AV film library; AV magnet link database (Japanese Adult Video Library, Adult Video Magnet Links - Japanese Adult Video Database)
Stars: ✭ 8,133 (+495.39%)
Mutual labels:  crawler, spider
Beanbun
Beanbun is a multi-process web crawler framework written in PHP, with good openness and high extensibility, based on Workerman.
Stars: ✭ 1,096 (-19.77%)
Mutual labels:  crawler, spider
Rororo
Implement aiohttp.web OpenAPI 3 server applications with schema first approach.
Stars: ✭ 95 (-93.05%)
Mutual labels:  asyncio, aiohttp
Pyfailsafe
Simple failure handling. Failsafe implementation in Python
Stars: ✭ 70 (-94.88%)
Mutual labels:  asyncio, aiohttp
Freshonions Torscraper
Fresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion
Stars: ✭ 348 (-74.52%)
Mutual labels:  crawler, spider