All Projects → Gain → Similar Projects or Alternatives

1134 Open source projects that are alternatives of or similar to Gain

golang实现的爬虫框架，使用者只需关心页面规则，提供web管理界面。基于colly开发。

Stars: ✭ 285 (-85.76%)

Mutual labels: crawler, spider

爬取 www.mzitu.com 全站图片，截至目前共5162个图集，16.5万多张美女图片，使用 asyncio 和 aiohttp 实现的异步版本只需要不到2小时就能爬取完成。按日期创建图集目录，保存更合理。控制台只显示下载的进度条，详细信息保存在日志文件中。支持异常处理，不会终止爬虫程序。失败的请求，下次再执行爬虫程序时会自动下载

Stars: ✭ 275 (-86.26%)

Mutual labels: asyncio, aiohttp

Go spider

[爬虫框架 (golang)] An awesome Go concurrent Crawler(spider) framework. The crawler is flexible and modular. It can be expanded to an Individualized crawler easily or you can use the default crawl components only.

Stars: ✭ 1,745 (-12.84%)

Mutual labels: crawler, spider

Python Slack Sdk

Slack Developer Kit for Python

Stars: ✭ 3,307 (+65.18%)

Mutual labels: asyncio, aiohttp

Diy Async Web Framework

Learn how modern async web frameworks work, by writing simple clone from scratch

Stars: ✭ 309 (-84.57%)

Mutual labels: asyncio, aiohttp

Bt Btt

磁力網站U3C3介紹以及域名更新

Stars: ✭ 261 (-86.96%)

Mutual labels: crawler, spider

Ttbot

今日头条机器人，支持用户登陆、关注、取消关注、获取关注粉丝、发文、发悟空问答、点赞、评论、采集各种类型新闻讯息等，使用今日头条网页版API实现

Stars: ✭ 338 (-83.12%)

Mutual labels: crawler, spider

91porn Api

🌭💦 91porn爬虫在线无限制API接口（永久有效，口令每日更新）及在线web预览

Stars: ✭ 341 (-82.97%)

Mutual labels: crawler, spider

Fictiondown

Stars: ✭ 362 (-81.92%)

Mutual labels: crawler, spider

Zhihu Login

知乎模拟登录，支持提取验证码和保存 Cookies

Stars: ✭ 340 (-83.02%)

Mutual labels: crawler, spider

Signature algorithm

各种App、小程序、网站的请求签名或加密算法。现已有：自如、小红书、蛋壳公寓、luckin coffee(瑞幸咖啡)、bangkokair(曼谷航空)

Stars: ✭ 380 (-81.02%)

Mutual labels: crawler, spider

Spider Flow

新一代爬虫平台，以图形化方式定义爬虫流程，不写代码即可完成爬虫。

Stars: ✭ 365 (-81.77%)

Mutual labels: crawler, spider

Pymxget

mxget的Python实现

Stars: ✭ 136 (-93.21%)

Mutual labels: asyncio, aiohttp

Web Main

🎉 Ultimate Emoji Generator

Stars: ✭ 261 (-86.96%)

Mutual labels: asyncio, aiohttp

Aiohttp

Asynchronous HTTP client/server framework for asyncio and Python

Stars: ✭ 11,972 (+498%)

Mutual labels: asyncio, aiohttp

Learnpython

Python的基础练习代码与各种爬虫代码

Stars: ✭ 451 (-77.47%)

Mutual labels: crawler, spider

Aiohttp Demos

Demos for aiohttp project

Stars: ✭ 517 (-74.18%)

Mutual labels: asyncio, aiohttp

Crawly

Crawly, a high-level web crawling & scraping framework for Elixir.

Stars: ✭ 440 (-78.02%)

Mutual labels: crawler, spider

Mm131

MM131网站图片爬取 🚨

Stars: ✭ 129 (-93.56%)

Mutual labels: crawler, spider

Xsrfprobe

The Prime Cross Site Request Forgery (CSRF) Audit and Exploitation Toolkit.

Stars: ✭ 532 (-73.43%)

Mutual labels: crawler, spider

Digger

Digger is a powerful and flexible web crawler implemented by pure golang

Stars: ✭ 130 (-93.51%)

Mutual labels: crawler, spider

Html2article

Html网页正文提取

Stars: ✭ 441 (-77.97%)

Mutual labels: crawler, spider

Weibo Topic Spider

微博超级话题爬虫，微博词频统计+情感分析+简单分类，新增肺炎超话爬取数据

Stars: ✭ 128 (-93.61%)

Mutual labels: crawler, spider

Easy Scraping Tutorial

Simple but useful Python web scraping tutorial code.

Stars: ✭ 583 (-70.88%)

Mutual labels: asyncio, crawler

Icrawler

A multi-thread crawler framework with many builtin image crawlers provided.

Stars: ✭ 629 (-68.58%)

Mutual labels: crawler, spider

Douyin

API of DouYin for Humans used to Crawl Popular Videos and Musics

Stars: ✭ 580 (-71.03%)

Mutual labels: crawler, spider

Grab Site

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

Stars: ✭ 680 (-66.03%)

Mutual labels: crawler, spider

Spidr

A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (-67.23%)

Mutual labels: crawler, spider

Crawler

A high performance web crawler in Elixir.

Stars: ✭ 781 (-60.99%)

Mutual labels: crawler, spider

galer

A fast tool to fetch URLs from HTML attributes by crawl-in.

Stars: ✭ 138 (-93.11%)

Mutual labels: crawler, spider

Scrapit

Scraping scripts for various websites.

Stars: ✭ 25 (-98.75%)

Mutual labels: crawler, spider

Aioslacker

slacker wrapper for asyncio

Stars: ✭ 23 (-98.85%)

Mutual labels: asyncio, aiohttp

Nodespider

[DEPRECATED] Simple, flexible, delightful web crawler/spider package

Stars: ✭ 33 (-98.35%)

Mutual labels: crawler, spider

Aiomixcloud

Mixcloud API wrapper for Python and Async IO

Stars: ✭ 23 (-98.85%)

Mutual labels: asyncio, aiohttp

Crawlab

Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台，支持任何语言和框架

Stars: ✭ 8,392 (+319.18%)

Mutual labels: crawler, spider

Lizard

💐 Full Amazon Automatic Download

Stars: ✭ 41 (-97.95%)

Mutual labels: crawler, spider

Awesome Python Primer

自学入门 Python 优质中文资源索引，包含书籍 / 文档 / 视频，适用于爬虫 / Web / 数据分析 / 机器学习方向

Stars: ✭ 57 (-97.15%)

Mutual labels: crawler, spider

Zhihu Crawler

zhihu-crawler是一个基于Java的高性能、支持免费http代理池、支持横向扩展、分布式爬虫项目

Stars: ✭ 890 (-55.54%)

Mutual labels: crawler, spider

Pyfailsafe

Simple failure handling. Failsafe implementation in Python

Stars: ✭ 70 (-96.5%)

Mutual labels: asyncio, aiohttp

Arachnid

Powerful web scraping framework for Crystal

Stars: ✭ 68 (-96.6%)

Mutual labels: crawler, spider

Crawlab Lite

Lite version of Crawlab. 轻量版 Crawlab 爬虫管理平台

Stars: ✭ 122 (-93.91%)

Mutual labels: crawler, spider

Hproxy

hproxy - Asynchronous IP proxy pool, aims to make getting proxy as convenient as possible.(异步爬虫代理池)

Stars: ✭ 62 (-96.9%)

Mutual labels: asyncio, crawler

Ant nest

Simple, clear and fast Web Crawler framework build on python3.6+, powered by asyncio.

Stars: ✭ 90 (-95.5%)

Mutual labels: asyncio, spider

Geziyor

Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.

Stars: ✭ 1,246 (-37.76%)

Mutual labels: crawler, spider

Python Simple Rest Client

Simple REST client for python 3.6+

Stars: ✭ 143 (-92.86%)

Mutual labels: asyncio, aiohttp

Torbot

Dark Web OSINT Tool

Stars: ✭ 821 (-58.99%)

Mutual labels: crawler, spider

Aioauth

Asynchronous OAuth 2.0 framework and provider for Python 3

Stars: ✭ 102 (-94.91%)

Mutual labels: asyncio, aiohttp

Skycaiji

蓝天采集器是一款免费的数据采集发布爬虫软件，采用php+mysql开发，可部署在云服务器，几乎能采集所有类型的网页，无缝对接各类CMS建站程序，免登录实时发布数据，全自动无需人工干预！是网页大数据采集软件中完全跨平台的云端爬虫系统

Stars: ✭ 1,514 (-24.38%)

Mutual labels: crawler, spider

Douyinsdk

抖音 SDK，数据采集，爬虫抓取不是梦

Stars: ✭ 99 (-95.05%)

Mutual labels: crawler, spider

Pkulaw spider

爬取北大法宝网http://www.pkulaw.cn/Case/

Stars: ✭ 113 (-94.36%)

Mutual labels: crawler, spider

Baiduspider

BaiduSpider，一个爬取百度搜索结果的爬虫，目前支持百度网页搜索，百度图片搜索，百度知道搜索，百度视频搜索，百度资讯搜索，百度文库搜索，百度经验搜索和百度百科搜索。

Stars: ✭ 105 (-94.76%)

Mutual labels: crawler, spider

Free proxy website

获取免费socks/https/http代理的网站集合

Stars: ✭ 119 (-94.06%)

Mutual labels: crawler, spider

Gopa Abandoned

GOPA, a spider written in Go.（NOTE: this project moved to https://github.com/infinitbyte/gopa ）

Stars: ✭ 98 (-95.1%)

Mutual labels: crawler, spider

Decryptlogin

APIs for loginning some websites by using requests.

Stars: ✭ 1,861 (-7.04%)

Mutual labels: crawler, spider

Examples Of Web Crawlers

一些非常有趣的python爬虫例子,对新手比较友好,主要爬取淘宝、天猫、微信、豆瓣、QQ等网站。(Some interesting examples of python crawlers that are friendly to beginners. )

Stars: ✭ 10,724 (+435.66%)

Mutual labels: crawler, spider

Fun crawler

Crawl some picture for fun

Stars: ✭ 169 (-91.56%)

Mutual labels: crawler, spider

Python3 Spider

Python爬虫实战 - 模拟登陆各大网站包含但不限于：滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝，如果喜欢请start ❤️

Stars: ✭ 2,129 (+6.34%)

Mutual labels: crawler, spider

ZhengFang System Spider

🐛一只登录正方教务管理系统，爬取数据的小爬虫

Stars: ✭ 21 (-98.95%)

Mutual labels: crawler, spider

binance-chain-python

Binance chain SDK in Python

Stars: ✭ 22 (-98.9%)

Mutual labels: aiohttp, asyncio

Gospider

Gospider - Fast web spider written in Go

Stars: ✭ 785 (-60.79%)

Mutual labels: crawler, spider

61-120 of 1134 similar projects

‹

›

next*5