453 open source projects that are alternatives to or similar to Crawler

Headless Chrome Crawler
Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (+3389.12%)
Mutual labels:  crawler, crawling
Instagram Bot
An Instagram bot developed using the Selenium Framework
Stars: ✭ 138 (-6.12%)
Mutual labels:  crawler, crawling
flink-crawler
Continuous scalable web crawler built on top of Flink and crawler-commons
Stars: ✭ 48 (-67.35%)
Mutual labels:  crawler, crawling
Skycaiji
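Skycaiji (蓝天采集器) is a free crawler for collecting and publishing data, built with PHP and MySQL. It can be deployed on cloud servers, can scrape almost any type of web page, integrates seamlessly with various CMS site builders, publishes data in real time without login, and runs fully automatically with no manual intervention. It is a fully cross-platform, cloud-based crawler system for large-scale web data collection.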
Stars: ✭ 1,514 (+929.93%)
Mutual labels:  crawler, crawling
Webster
a reliable high-level web crawling & scraping framework for Node.js.
Stars: ✭ 364 (+147.62%)
Mutual labels:  crawler, crawling
Easy Scraping Tutorial
Simple but useful Python web scraping tutorial code.
Stars: ✭ 583 (+296.6%)
Mutual labels:  crawler, crawling
Ferret
Declarative web scraping
Stars: ✭ 4,837 (+3190.48%)
Mutual labels:  crawler, crawling
Sasila
A flexible, friendly crawler framework
Stars: ✭ 286 (+94.56%)
Mutual labels:  crawler, crawling
Colly
Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+10468.03%)
Mutual labels:  crawler, crawling
Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+88.44%)
Mutual labels:  crawler, crawling
Scrapyrt
HTTP API for Scrapy spiders
Stars: ✭ 637 (+333.33%)
Mutual labels:  crawler, crawling
Arachnid
Powerful web scraping framework for Crystal
Stars: ✭ 68 (-53.74%)
Mutual labels:  crawler, crawling
Squidwarc
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
Stars: ✭ 125 (-14.97%)
Mutual labels:  crawler, crawling
Newspaper
News, full-text, and article metadata extraction in Python 3. Advanced docs:
Stars: ✭ 11,545 (+7753.74%)
Mutual labels:  crawler, crawling
img-cli
An interactive command-line interface built in Node.js for downloading one or more images to disk from a URL
Stars: ✭ 15 (-89.8%)
Mutual labels:  crawler, crawling
Antch
Antch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (+34.69%)
Mutual labels:  crawler, crawling
Lulu
[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+436.73%)
Mutual labels:  crawler, crawling
bots-zoo
No description or website provided.
Stars: ✭ 59 (-59.86%)
Mutual labels:  crawler, crawling
Spidy
The simple, easy to use command line web crawler.
Stars: ✭ 257 (+74.83%)
Mutual labels:  crawler, crawling
Linkedin Profile Scraper
🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (+16.33%)
Mutual labels:  crawler, crawling
N2h4
A tool for collecting Naver News
Stars: ✭ 177 (+20.41%)
Mutual labels:  crawler, crawling
Crawly
Crawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+199.32%)
Mutual labels:  crawler, crawling
Dotnetcrawler
DotnetCrawler is a straightforward, lightweight web crawling/scraping library with Entity Framework Core output, based on .NET Core. The library is designed like other strong crawler libraries such as WebMagic and Scrapy, but is extensible for your custom requirements. Medium link: https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (-31.97%)
Mutual labels:  crawler, crawling
Scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+28704.76%)
Mutual labels:  crawler, crawling
Qqmusicspider
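A Scrapy-based QQ Music spider that crawls song information, lyrics, featured comments, and more; it also shares a music corpus of 490,000+ entries from the top 6,400 mainland China, Hong Kong, and Taiwan artists on QQ Music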
Stars: ✭ 120 (-18.37%)
Mutual labels:  crawler
4chan Downloader
Python 3 script to continuously download all images/webms of multiple 4chan threads simultaneously, without installation
Stars: ✭ 136 (-7.48%)
Mutual labels:  crawler
Awesome Puppeteer
A curated list of awesome puppeteer resources.
Stars: ✭ 1,728 (+1075.51%)
Mutual labels:  crawling
Skill Share Crawler Dl
Download Skillshare videos by ID or by class
Stars: ✭ 122 (-17.01%)
Mutual labels:  crawler
Onegram
This repository is no longer maintained.
Stars: ✭ 137 (-6.8%)
Mutual labels:  crawler
Pspider
A simple, easy-to-use Python crawler framework. QQ discussion group: 597510560
Stars: ✭ 1,611 (+995.92%)
Mutual labels:  crawler
Massivedl
Download a large list of files concurrently
Stars: ✭ 141 (-4.08%)
Mutual labels:  crawling
Tiebamanager
(Abandoned) A moderation tool for Baidu Tieba that automatically scans posts and handles rule-violating ones
Stars: ✭ 119 (-19.05%)
Mutual labels:  crawler
Goclone
Website Cloner - Utilizes powerful Go routines to clone websites to your computer within seconds.
Stars: ✭ 134 (-8.84%)
Mutual labels:  crawler
Php Crawler
A PHP crawler that finds emails on the internet
Stars: ✭ 119 (-19.05%)
Mutual labels:  crawler
Youtube Projects
This repository contains all the code I use in my YouTube tutorials.
Stars: ✭ 144 (-2.04%)
Mutual labels:  crawler
Oddish
Crawls all CS:GO skins from a website.
Stars: ✭ 139 (-5.44%)
Mutual labels:  crawler
Free proxy website
A collection of websites for obtaining free SOCKS/HTTPS/HTTP proxies
Stars: ✭ 119 (-19.05%)
Mutual labels:  crawler
Sentinel Crawler
Xenomorph Crawler, a concise, declarative, and observable distributed crawler (Node / Go / Java / Rust) for Web, RDB, and OS; it can also act as a monitor (with Prometheus) or ETL for infrastructure 💫 Multi-language executor, distributed crawler
Stars: ✭ 118 (-19.73%)
Mutual labels:  crawler
Docs
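Source code for "Data Collection: From Getting Started to Giving Up" (《数据采集从入门到放弃》). Contents: introduction to crawlers, the job market, and crawler engineer interview questions; introduction to the HTTP protocol; using Requests; introduction to the XPath parser; MongoDB and MySQL; multithreaded crawlers; introduction to Scrapy; introduction to Scrapy-redis; deploying with Docker; managing a Docker cluster with Nomad; querying Docker logs with EFK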
Stars: ✭ 118 (-19.73%)
Mutual labels:  crawler
Red hawk
All-in-one tool for information gathering, vulnerability scanning, and crawling. A must-have tool for all penetration testers
Stars: ✭ 1,898 (+1191.16%)
Mutual labels:  crawler
Moodle Downloader 2
A Moodle downloader that downloads course content fast from Moodle (e.g. lecture PDFs)
Stars: ✭ 118 (-19.73%)
Mutual labels:  crawler
Decryptlogin
APIs for logging in to some websites using requests.
Stars: ✭ 1,861 (+1165.99%)
Mutual labels:  crawler
Amazonbigspider
😱 Fully automatic Amazon distributed spider | Distributed product collection and selection across four international Amazon sites | Account: admin, password: adminadmin
Stars: ✭ 140 (-4.76%)
Mutual labels:  crawler
Mm131
Image crawler for the MM131 website 🚨
Stars: ✭ 129 (-12.24%)
Mutual labels:  crawler
Examples Of Web Crawlers
Some interesting examples of Python crawlers that are friendly to beginners, mainly crawling Taobao, Tmall, WeChat, Douban, QQ, and other sites.
Stars: ✭ 10,724 (+7195.24%)
Mutual labels:  crawler
Baiducrawler
Sample of using proxies to crawl Baidu search results.
Stars: ✭ 116 (-21.09%)
Mutual labels:  crawler
Digger
Digger is a powerful and flexible web crawler implemented in pure Go
Stars: ✭ 130 (-11.56%)
Mutual labels:  crawler
Prerender Java
Java framework for Prerender
Stars: ✭ 115 (-21.77%)
Mutual labels:  crawler
Indonesian Nlp Resources
Data resources for Indonesian-language NLP
Stars: ✭ 143 (-2.72%)
Mutual labels:  crawler
Crawler China Mainland Universities
Crawler for the list of universities in mainland China
Stars: ✭ 143 (-2.72%)
Mutual labels:  crawler
Memex Explorer
Viewers for statistics and dashboarding of Domain Search Engine data
Stars: ✭ 115 (-21.77%)
Mutual labels:  crawler
Weibo Topic Spider
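Weibo super-topic crawler with word-frequency statistics, sentiment analysis, and simple classification; newly added data crawled from the pneumonia super topic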
Stars: ✭ 128 (-12.93%)
Mutual labels:  crawler
Bilibili member crawler
Bilibili user crawler. Yay, it's a crawler!
Stars: ✭ 115 (-21.77%)
Mutual labels:  crawler
Jianso movie
🎬 Movie resource crawler and movie image scraping script, Flask|Nginx|wsgi
Stars: ✭ 114 (-22.45%)
Mutual labels:  crawler
Bhban rpa
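Example code for the book "Work Automation: Finishing Six Months of Work in One Day" (생능출판사, 2020). The examples are intended for people who have never learned Python and cover a range of work-automation areas, from Excel to design, macros, and crawling.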
Stars: ✭ 124 (-15.65%)
Mutual labels:  crawling
Patentcrawler
Scrapy patent crawler (no longer maintained)
Stars: ✭ 114 (-22.45%)
Mutual labels:  crawler
Douban Movie
Golang crawler that scrapes the Douban Movie Top 250
Stars: ✭ 114 (-22.45%)
Mutual labels:  crawler
Search
An Open Source Search Engine
Stars: ✭ 139 (-5.44%)
Mutual labels:  crawler
Kuaishou Crawler
As you can see, a kuaishou crawler
Stars: ✭ 126 (-14.29%)
Mutual labels:  crawler
Pkulaw spider
Crawls the PKU Law (北大法宝) site http://www.pkulaw.cn/Case/
Stars: ✭ 113 (-23.13%)
Mutual labels:  crawler
1-60 of 453 similar projects