All Projects → Scrapingoutsourcing → Similar Projects or Alternatives

1080 Open source projects that are alternatives of or similar to Scrapingoutsourcing

APIs for loginning some websites by using requests.

Stars: ✭ 1,861 (+1034.76%)

Mutual labels: crawler, spider, requests

🏀 Python3 网络爬虫实战（部分含详细教程）猫眼腾讯视频豆瓣研招网微博笔趣阁小说百度热点 B站 CSDN 网易云阅读阿里文学百度股票今日头条微信公众号网易云音乐拉勾有道 unsplash 实习僧汽车之家英雄联盟盒子大众点评链家 LPL赛程台风梦幻西游、阴阳师藏宝阁天气牛客网百度文库睡前故事知乎 Wish

Stars: ✭ 1,048 (+539.02%)

Mutual labels: spider, scrapy, requests

Icrawler

A multi-thread crawler framework with many builtin image crawlers provided.

Stars: ✭ 629 (+283.54%)

Mutual labels: crawler, spider, scrapy

Python3 Spider

Python爬虫实战 - 模拟登陆各大网站包含但不限于：滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝，如果喜欢请start ❤️

Stars: ✭ 2,129 (+1198.17%)

Mutual labels: crawler, spider, scrapy

Crawlab Lite

Lite version of Crawlab. 轻量版 Crawlab 爬虫管理平台

Stars: ✭ 122 (-25.61%)

Mutual labels: crawler, spider, scrapy

Goribot

[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。

Stars: ✭ 190 (+15.85%)

Mutual labels: crawler, spider, scrapy

Bilili

🍻 bilibili video (including bangumi) and danmaku downloader | B站视频（含番剧）、弹幕下载器

Stars: ✭ 379 (+131.1%)

Mutual labels: crawler, spider, requests

python-fxxk-spider

收集各种免费的 Python 爬虫项目

Stars: ✭ 184 (+12.2%)

Mutual labels: spider, requests, scrapy

Docs

《数据采集从入门到放弃》源码。内容简介：爬虫介绍、就业情况、爬虫工程师面试题；HTTP协议介绍； Requests使用；解析器Xpath介绍； MongoDB与MySQL；多线程爬虫； Scrapy介绍；Scrapy-redis介绍；使用docker部署；使用nomad管理docker集群；使用EFK查询docker日志

Stars: ✭ 118 (-28.05%)

Mutual labels: crawler, scrapy, requests

Haipproxy

💖 High available distributed ip proxy pool, powerd by Scrapy and Redis

Stars: ✭ 4,993 (+2944.51%)

Mutual labels: crawler, spider, scrapy

Bilibili member crawler

B站用户爬虫好耶~是爬虫

Stars: ✭ 115 (-29.88%)

Mutual labels: crawler, spider, requests

Fbcrawl

A Facebook crawler

Stars: ✭ 536 (+226.83%)

Mutual labels: crawler, spider, scrapy

Scrapy-Spiders

一个基于Scrapy的数据采集爬虫代码库

Stars: ✭ 34 (-79.27%)

Mutual labels: spider, scrapy, appium

Marmot

💐Marmot | Web Crawler/HTTP protocol Download Package 🐭

Stars: ✭ 186 (+13.41%)

Mutual labels: crawler, spider, scrapy

Easy Scraping Tutorial

Simple but useful Python web scraping tutorial code.

Stars: ✭ 583 (+255.49%)

Mutual labels: crawler, scrapy, requests

Crawlab

Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台，支持任何语言和框架

Stars: ✭ 8,392 (+5017.07%)

Mutual labels: crawler, spider, scrapy

Arachnid

Powerful web scraping framework for Crystal

Stars: ✭ 68 (-58.54%)

Mutual labels: crawler, spider

Alipayspider Scrapy

AlipaySpider on Scrapy(use chrome driver); 支付宝爬虫(基于Scrapy)

Stars: ✭ 70 (-57.32%)

Mutual labels: spider, scrapy

Awesome Web Scraper

A collection of awesome web scaper, crawler.

Stars: ✭ 147 (-10.37%)

Mutual labels: spider, scrapy

Capturer

capture pictures from website like sina, lofter, huaban and so on

Stars: ✭ 76 (-53.66%)

Mutual labels: spider, scrapy

Image Downloader

Download images from Google, Bing, Baidu. 谷歌、百度、必应图片下载.

Stars: ✭ 1,173 (+615.24%)

Mutual labels: spider, scrapy

Puppeteer Walker

a puppeteer walker 🕷 🕸

Stars: ✭ 78 (-52.44%)

Mutual labels: crawler, spider

Geziyor

Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.

Stars: ✭ 1,246 (+659.76%)

Mutual labels: crawler, spider

Fp Server

Free proxy server, continuously crawling and providing proxies, based on Tornado and Scrapy. 免费代理服务器，基于Tornado和Scrapy，在本地搭建属于自己的代理池

Stars: ✭ 154 (-6.1%)

Mutual labels: spider, scrapy

Ruia

Async Python 3.6+ web scraping micro-framework based on asyncio

Stars: ✭ 1,366 (+732.93%)

Mutual labels: crawler, spider

Dotnetcrawler

DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c

Stars: ✭ 100 (-39.02%)

Mutual labels: crawler, scrapy

Terpene Profile Parser For Cannabis Strains

Parser and database to index the terpene profile of different strains of Cannabis from online databases

Stars: ✭ 63 (-61.59%)

Mutual labels: crawler, scrapy

Beanbun

Beanbun 是用 PHP 编写的多进程网络爬虫框架，具有良好的开放性、高可扩展性，基于 Workerman。

Stars: ✭ 1,096 (+568.29%)

Mutual labels: crawler, spider

Spider

python crawler spider

Stars: ✭ 70 (-57.32%)

Mutual labels: crawler, spider

Car Prices

Golang爬虫爬取汽车之家二手车产品库

Stars: ✭ 57 (-65.24%)

Mutual labels: crawler, spider

Crawler examples

Some classic web crawler projects.一些经典的爬虫

Stars: ✭ 74 (-54.88%)

Mutual labels: crawler, spider

Scrapy Examples

Some scrapy and web.py exmaples

Stars: ✭ 71 (-56.71%)

Mutual labels: crawler, scrapy

Taiwan News Crawlers

Scrapy-based Crawlers for news of Taiwan

Stars: ✭ 83 (-49.39%)

Mutual labels: crawler, scrapy

Awesome Python Primer

自学入门 Python 优质中文资源索引，包含书籍 / 文档 / 视频，适用于爬虫 / Web / 数据分析 / 机器学习方向

Stars: ✭ 57 (-65.24%)

Mutual labels: crawler, spider

Douyinsdk

抖音 SDK，数据采集，爬虫抓取不是梦

Stars: ✭ 99 (-39.63%)

Mutual labels: crawler, spider

Gopa Abandoned

GOPA, a spider written in Go.（NOTE: this project moved to https://github.com/infinitbyte/gopa ）

Stars: ✭ 98 (-40.24%)

Mutual labels: crawler, spider

Taobaoscrapy

😩Tool For Taobao/Tmall| 儿时玩具已经过时

Stars: ✭ 146 (-10.98%)

Mutual labels: spider, scrapy

Scrapoxy

Scrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!

Stars: ✭ 1,322 (+706.1%)

Mutual labels: crawler, scrapy

Crawler

爬虫, http代理, 模拟登陆!

Stars: ✭ 106 (-35.37%)

Mutual labels: crawler, scrapy

Crawler Detect

🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent

Stars: ✭ 1,549 (+844.51%)

Mutual labels: crawler, spider

Not Your Average Web Crawler

A web crawler (for bug hunting) that gathers more than you can imagine.