All Projects → Spidy → Similar Projects or Alternatives

487 Open source projects that are alternatives of or similar to Spidy

Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+7.78%)
Mutual labels:  crawler, crawling, web-crawler
Antch
Antch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (-22.96%)
Mutual labels:  crawler, crawling, web-crawler
flink-crawler
Continuous scalable web crawler built on top of Flink and crawler-commons
Stars: ✭ 48 (-81.32%)
Mutual labels:  crawler, web-crawler, crawling
bots-zoo
No description or website provided.
Stars: ✭ 59 (-77.04%)
Mutual labels:  crawler, crawling
Terpene Profile Parser For Cannabis Strains
Parser and database to index the terpene profile of different strains of Cannabis from online databases
Stars: ✭ 63 (-75.49%)
Mutual labels:  crawler, web-crawler
Easy Scraping Tutorial
Simple but useful Python web scraping tutorial code.
Stars: ✭ 583 (+126.85%)
Mutual labels:  crawler, crawling
Sasila
一个灵活、友好的爬虫框架
Stars: ✭ 286 (+11.28%)
Mutual labels:  crawler, crawling
img-cli
An interactive Command-Line Interface Build in NodeJS for downloading a single or multiple images to disk from URL
Stars: ✭ 15 (-94.16%)
Mutual labels:  crawler, crawling
Crawly
Crawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+71.21%)
Mutual labels:  crawler, crawling
Awesome Crawler
A collection of awesome web crawler,spider in different languages
Stars: ✭ 4,793 (+1764.98%)
Mutual labels:  crawler, web-crawler
CrawlBox
Easy way to brute-force web directory.
Stars: ✭ 118 (-54.09%)
Mutual labels:  crawler, web-crawler
Crawler
Go process used to crawl websites
Stars: ✭ 147 (-42.8%)
Mutual labels:  crawler, crawling
Lulu
[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+207%)
Mutual labels:  crawler, crawling
Crawlab Lite
Lite version of Crawlab. 轻量版 Crawlab 爬虫管理平台
Stars: ✭ 122 (-52.53%)
Mutual labels:  crawler, web-crawler
Scrapyrt
HTTP API for Scrapy spiders
Stars: ✭ 637 (+147.86%)
Mutual labels:  crawler, crawling
Skycaiji
蓝天采集器是一款免费的数据采集发布爬虫软件,采用php+mysql开发,可部署在云服务器,几乎能采集所有类型的网页,无缝对接各类CMS建站程序,免登录实时发布数据,全自动无需人工干预!是网页大数据采集软件中完全跨平台的云端爬虫系统
Stars: ✭ 1,514 (+489.11%)
Mutual labels:  crawler, crawling
Nutch
Apache Nutch is an extensible and scalable web crawler
Stars: ✭ 2,277 (+785.99%)
Mutual labels:  crawling, web-crawler
Spider Flow
新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。
Stars: ✭ 365 (+42.02%)
Mutual labels:  crawler, web-crawler
Pspider
简单易用的Python爬虫框架,QQ交流群:597510560
Stars: ✭ 1,611 (+526.85%)
Mutual labels:  crawler, web-crawler
Squidwarc
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
Stars: ✭ 125 (-51.36%)
Mutual labels:  crawler, crawling
Maman
Rust Web Crawler saving pages on Redis
Stars: ✭ 39 (-84.82%)
Mutual labels:  crawler, web-crawler
Crawlab
Distributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
Stars: ✭ 8,392 (+3165.37%)
Mutual labels:  crawler, web-crawler
Scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+16375.88%)
Mutual labels:  crawler, crawling
Arachnid
Powerful web scraping framework for Crystal
Stars: ✭ 68 (-73.54%)
Mutual labels:  crawler, crawling
Linkedin Profile Scraper
🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-33.46%)
Mutual labels:  crawler, crawling
N2h4
네이버 뉴스 수집을 위한 도구
Stars: ✭ 177 (-31.13%)
Mutual labels:  crawler, crawling
Zhihu Crawler People
A simple distributed crawler for zhihu && data analysis
Stars: ✭ 182 (-29.18%)
Mutual labels:  crawler, web-crawler
Ferret
Declarative web scraping
Stars: ✭ 4,837 (+1782.1%)
Mutual labels:  crawler, crawling
Supercrawler
A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.
Stars: ✭ 306 (+19.07%)
Mutual labels:  crawler, web-crawler
Spidr
A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+155.25%)
Mutual labels:  crawler, web-crawler
Webster
a reliable high-level web crawling & scraping framework for Node.js.
Stars: ✭ 364 (+41.63%)
Mutual labels:  crawler, crawling
Infinitycrawler
A simple but powerful web crawler library for .NET
Stars: ✭ 97 (-62.26%)
Mutual labels:  crawler, web-crawler
Dotnetcrawler
DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (-61.09%)
Mutual labels:  crawler, crawling
Abot
Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.
Stars: ✭ 1,961 (+663.04%)
Mutual labels:  crawler, web-crawler
Instagram Bot
An Instagram bot developed using the Selenium Framework
Stars: ✭ 138 (-46.3%)
Mutual labels:  crawler, crawling
Strong Web Crawler
基于C#.NET+PhantomJS+Sellenium的高级网络爬虫程序。可执行Javascript代码、触发各类事件、操纵页面Dom结构。
Stars: ✭ 238 (-7.39%)
Mutual labels:  crawler, web-crawler
Headless Chrome Crawler
Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (+1895.72%)
Mutual labels:  crawler, crawling
Newspaper
News, full-text, and article metadata extraction in Python 3. Advanced docs:
Stars: ✭ 11,545 (+4392.22%)
Mutual labels:  crawler, crawling
Colly
Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+5944.75%)
Mutual labels:  crawler, crawling
Mimo-Crawler
A web crawler that uses Firefox and js injection to interact with webpages and crawl their content, written in nodejs.
Stars: ✭ 22 (-91.44%)
Mutual labels:  web-crawler, crawling
spiderable-middleware
🤖 Prerendering for JavaScript powered websites. Great solution for PWAs (Progressive Web Apps), SPAs (Single Page Applications), and other websites based on top of front-end JavaScript frameworks
Stars: ✭ 29 (-88.72%)
Mutual labels:  crawler
html-query
A fluent and functional approach to querying HTML
Stars: ✭ 48 (-81.32%)
Mutual labels:  crawler
ptt-web-crawler
PTT 網路版爬蟲
Stars: ✭ 20 (-92.22%)
Mutual labels:  crawler
domfind
A Python DNS crawler to find identical domain names under different TLDs.
Stars: ✭ 22 (-91.44%)
Mutual labels:  crawler
MyCrawler
我的爬虫合集
Stars: ✭ 55 (-78.6%)
Mutual labels:  crawler
ZhengFang System Spider
🐛一只登录正方教务管理系统,爬取数据的小爬虫
Stars: ✭ 21 (-91.83%)
Mutual labels:  crawler
medium-stat-box
Practical pinned gist which show your latest medium status 📌
Stars: ✭ 29 (-88.72%)
Mutual labels:  crawler
php-google
Google search results crawler, get google search results that you need - php
Stars: ✭ 23 (-91.05%)
Mutual labels:  crawler
snapcrawl
Crawl a website and take screenshots
Stars: ✭ 37 (-85.6%)
Mutual labels:  crawler
Sharingan
We will try to find your visible basic footprint from social media as much as possible - 😤 more sites is comming soon
Stars: ✭ 13 (-94.94%)
Mutual labels:  crawler
ComicBookMaker
Script to fetch webcomics and use them to create ebooks.
Stars: ✭ 27 (-89.49%)
Mutual labels:  web-crawler
weibo-scraper
Simple Weibo Scraper
Stars: ✭ 50 (-80.54%)
Mutual labels:  crawler
arachnod
High performance crawler for Nodejs
Stars: ✭ 17 (-93.39%)
Mutual labels:  crawler
sse-option-crawler
SSE 50 index options crawler 上证50期权数据爬虫
Stars: ✭ 17 (-93.39%)
Mutual labels:  crawler
TumblTwo
TumblTwo, an Improved Fork of TumblOne, a Tumblr Downloader.
Stars: ✭ 57 (-77.82%)
Mutual labels:  crawler
auto crawler ptt beauty image
Auto Crawler Ptt Beauty Image Use Python Schedule
Stars: ✭ 35 (-86.38%)
Mutual labels:  crawler
eastmoney
python requests + Django+ nodejs koa+ mysql to crawl eastmoney fund and stock data,for data analysis and visualiaztion .
Stars: ✭ 56 (-78.21%)
Mutual labels:  crawler
dijnet-bot
Az összes számlád még egy helyen :)
Stars: ✭ 17 (-93.39%)
Mutual labels:  crawler
crawler
A simple and flexible web crawler framework for java.
Stars: ✭ 20 (-92.22%)
Mutual labels:  crawler
TaobaoAnalysis
练习NLP,分析淘宝评论的项目
Stars: ✭ 28 (-89.11%)
Mutual labels:  crawler
1-60 of 487 similar projects