All Projects → Github Spider → Similar Projects or Alternatives

594 Open source projects that are alternatives of or similar to Github Spider

Infinitycrawler

A simple but powerful web crawler library for .NET

Stars: ✭ 97 (-48.95%)

Mutual labels: crawler

Netflix like full-stack application with SPA client and backend implemented in service oriented architecture

Stars: ✭ 156 (-17.89%)

Mutual labels: scrapy

抖音爬虫，tiktok crawler，抖音数据采集接口，抖音视频去水印，百分百成功，不需要服务器，不需要代理 IP。

Stars: ✭ 169 (-11.05%)

Mutual labels: crawler

😩Tool For Taobao/Tmall| 儿时玩具已经过时

Stars: ✭ 146 (-23.16%)

Mutual labels: scrapy

爬取北大法宝网http://www.pkulaw.cn/Case/

Stars: ✭ 113 (-40.53%)

Mutual labels: crawler

Analysis of the characteristics of different countries

Stars: ✭ 30 (-84.21%)

Mutual labels: scrapy

Headless Chrome Crawler

Distributed crawler powered by Headless Chrome

Stars: ✭ 5,129 (+2599.47%)

Mutual labels: crawler

Crawl a website and run it through Google lighthouse

Stars: ✭ 1,339 (+604.74%)

Mutual labels: crawler

一个获取知乎用户主页信息的多线程Python爬虫程序。

Stars: ✭ 137 (-27.89%)

Mutual labels: crawler

Some research experiments

Stars: ✭ 95 (-50%)

Mutual labels: scrapy

支援 PTT 還有 PTT2 的 PTT API

Stars: ✭ 527 (+177.37%)

Mutual labels: crawler

直播网站数据采集

Stars: ✭ 188 (-1.05%)

Mutual labels: scrapy

JSpider会每周更新至少一个网站的JS解密方式，欢迎 Star，交流微信：13298307816

Stars: ✭ 914 (+381.05%)

Mutual labels: scrapy

Doujinshi downloader 绅士漫画下载

Stars: ✭ 504 (+165.26%)

Mutual labels: crawler

4chan Downloader

Python3 script to continuously download all images/webms of multiple 4chan thread simultaneously - without installation

Stars: ✭ 136 (-28.42%)

Mutual labels: crawler

Awesome Crawler

A collection of awesome web crawler,spider in different languages

Stars: ✭ 4,793 (+2422.63%)

Mutual labels: crawler

Ktspeechcrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

Stars: ✭ 92 (-51.58%)

Mutual labels: crawler

Scrapy Rotating Proxies

use multiple proxies with Scrapy

Stars: ✭ 488 (+156.84%)

Mutual labels: scrapy

Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.

Stars: ✭ 1,961 (+932.11%)

Mutual labels: crawler

最新动态在这里【我的程序员日志】

Stars: ✭ 112 (-41.05%)

Mutual labels: scrapy

Crawler used to crawl papers

Stars: ✭ 20 (-89.47%)

Mutual labels: crawler

Distributed Multi User Scrapy System With A Web Ui

Django based application that allows creating, deploying and running Scrapy spiders in a distributed manner

Stars: ✭ 88 (-53.68%)

Mutual labels: scrapy

LinkedIn Scraper (currently working 2020)

Stars: ✭ 453 (+138.42%)

Mutual labels: crawler

News, full-text, and article metadata extraction in Python 3. Advanced docs:

Stars: ✭ 11,545 (+5976.32%)

Mutual labels: crawler

Crawl BookCorpus

Stars: ✭ 443 (+133.16%)

Mutual labels: crawler

Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.

Stars: ✭ 1,246 (+555.79%)

Mutual labels: crawler

Html网页正文提取

Stars: ✭ 441 (+132.11%)

Mutual labels: crawler

Sensitivefilescan

Stars: ✭ 174 (-8.42%)

Mutual labels: crawler

爬取菜鸟教程网站并转PDF__python_crawer_by_chrome

Stars: ✭ 430 (+126.32%)

Mutual labels: crawler

Iclr2020 Openreviewdata

Script that crawls meta data from ICLR OpenReview webpage. Tutorials on installing and using Selenium and ChromeDriver on Ubuntu.

Stars: ✭ 426 (+124.21%)

Mutual labels: crawler

MM131网站图片爬取 🚨

Stars: ✭ 129 (-32.11%)

Mutual labels: crawler

Opensearchserver

Open-source Enterprise Grade Search Engine Software

Stars: ✭ 408 (+114.74%)

Mutual labels: crawler

Verify that a request is from Google crawlers using Google's DNS verification steps

Stars: ✭ 82 (-56.84%)

Mutual labels: crawler

豆瓣电影/豆瓣读书 Scarpy 爬虫

Stars: ✭ 400 (+110.53%)

Mutual labels: scrapy

An easy to use, powerful crawler implemented in PHP. Can execute Javascript.

Stars: ✭ 2,055 (+981.58%)

Mutual labels: crawler

👩 美女写真套图爬虫（一）

Stars: ✭ 398 (+109.47%)

Mutual labels: crawler

Download comics novels 小说漫画下载工具小説漫画のダウンローダ小說漫畫下載:腾讯漫画大角虫漫画有妖气知音漫客咪咕 SF漫画哦漫画看漫画漫画柜汗汗酷漫動漫伊甸園快看漫画微博动漫 733动漫网大古漫画网漫画DB 無限動漫動漫狂卡推漫画动漫之家动漫屋古风漫画网 36漫画网亲亲漫画网乙女漫画 comico webtoons 咚漫ニコニコ静画 ComicWalker ヤングエースUP モアイ pixivコミックサイコミ;アルファポリスカクヨムハーメルン小説家になろう起点中文网八一中文网顶点小说落霞小说网努努书坊笔趣阁→epub.

Stars: ✭ 1,224 (+544.21%)

Mutual labels: crawler

Docs and files for ScrapydWeb, Scrapyd, Scrapy, and other projects

Stars: ✭ 390 (+105.26%)

Mutual labels: scrapy

Weibo Topic Spider

微博超级话题爬虫，微博词频统计+情感分析+简单分类，新增肺炎超话爬取数据

Stars: ✭ 128 (-32.63%)

Mutual labels: crawler

🍻 bilibili video (including bangumi) and danmaku downloader | B站视频（含番剧）、弹幕下载器

Stars: ✭ 379 (+99.47%)

Mutual labels: crawler

Swiftlinkpreview

It makes a preview from an URL, grabbing all the information such as title, relevant texts and images.

Stars: ✭ 1,216 (+540%)

Mutual labels: crawler

E Commerce Crawlers

🚀电商网站爬虫合集，淘宝京东亚马逊等

Stars: ✭ 377 (+98.42%)

Mutual labels: scrapy

This is a sina weibo spider built by scrapy [微博爬虫/持续维护]

Stars: ✭ 2,408 (+1167.37%)

Mutual labels: scrapy

Netease Music Cracker

🎵 将可下载的网易云音乐的缓存文件转换为 MP3 文件

Stars: ✭ 373 (+96.32%)

Mutual labels: crawler

POOPAK - TOR Hidden Service Crawler

Stars: ✭ 78 (-58.95%)

Mutual labels: crawler

Post Tuto Deployment

Build and deploy a machine learning app from scratch 🚀

Stars: ✭ 368 (+93.68%)

Mutual labels: scrapy

Kuaishou Crawler

As you can see, a kuaishou crawler

Stars: ✭ 126 (-33.68%)

Mutual labels: crawler

a reliable high-level web crawling & scraping framework for Node.js.

Stars: ✭ 364 (+91.58%)

Mutual labels: crawler

Anticrawlersolution

It covers the blockade principle of most anti-climbing strategies and corresponding solutions.👽👽👽👽（涵盖了大部分的反爬策略的封锁原理以及对应的解决方案。）

Stars: ✭ 77 (-59.47%)

Mutual labels: crawler

台灣股票即時爬蟲。Taiwan Stock Exchange Real Time Crawler

Stars: ✭ 359 (+88.95%)

Mutual labels: crawler

Free proxy server, continuously crawling and providing proxies, based on Tornado and Scrapy. 免费代理服务器，基于Tornado和Scrapy，在本地搭建属于自己的代理池

Stars: ✭ 154 (-18.95%)

Mutual labels: scrapy

Python Dcdownloader

由Python编写的全异步实现的动漫之家(dmzj)漫画批量下载器（爬虫）

Stars: ✭ 146 (-23.16%)

Mutual labels: crawler

一只优雅的正方教务系统爬虫。

Stars: ✭ 112 (-41.05%)

Mutual labels: crawler

Tor website crawler (specific for Alphabay at the time)

Stars: ✭ 15 (-92.11%)

Mutual labels: crawler

Fetches PubMed article IDs (PMIDs) from email inbox, then crawls PubMed, Google Scholar and Sci-Hub for respective PDF files.

Stars: ✭ 14 (-92.63%)

Mutual labels: crawler

Code for the second edition Web Scraping with Python book by Packt Publications

Stars: ✭ 112 (-41.05%)

Mutual labels: scrapy

Crawl websites for accessibility issues from the command line.

Stars: ✭ 12 (-93.68%)

Mutual labels: crawler

capture pictures from website like sina, lofter, huaban and so on

Stars: ✭ 76 (-60%)

Mutual labels: scrapy

新闻抓取（微信、微博、头条...）

Stars: ✭ 190 (+0%)

Mutual labels: scrapy

Sina Stock Crawler

Sina stock options crawler with CSV output 新浪上证ETF期权数据爬虫

Stars: ✭ 12 (-93.68%)

Mutual labels: crawler

301-360 of 594 similar projects