1790 Open source projects that are alternatives of or similar to Jlitespider

crawler
A simple and flexible web crawler framework for Java.
Stars: ✭ 20 (-86.75%)
Mutual labels:  crawler, spider
scrapy-distributed
A series of distributed components for Scrapy, including RabbitMQ-based, Kafka-based, and RedisBloom-based components.
Stars: ✭ 38 (-74.83%)
Mutual labels:  spider, rabbitmq
slime
🍰 A visual crawler platform
Stars: ✭ 27 (-82.12%)
Mutual labels:  crawler, spider
core
Microservice abstract class
Stars: ✭ 37 (-75.5%)
Mutual labels:  distributed-systems, rabbitmq
humainary-signals-services-java
Observability Signaling for Distributed Computation
Stars: ✭ 23 (-84.77%)
Mutual labels:  distributed-systems, distributed
NScrapy
NScrapy is a cross-platform distributed spider framework for .NET Core that provides an easy way to write your own spider
Stars: ✭ 88 (-41.72%)
Mutual labels:  spider, distributed
Oklog
A distributed and coördination-free log management system
Stars: ✭ 2,937 (+1845.03%)
Mutual labels:  distributed-systems, distributed
Hacker News Digest
📰 A responsive interface of Hacker News with summaries and thumbnails.
Stars: ✭ 278 (+84.11%)
Mutual labels:  crawler, spider
Gospider
A crawler framework implemented in Golang; users only need to care about page rules. Provides a web management interface. Built on colly.
Stars: ✭ 285 (+88.74%)
Mutual labels:  crawler, spider
Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+83.44%)
Mutual labels:  crawler, spider
Dis Seckill
👊 A distributed, high-concurrency flash-sale (seckill) system built with SpringBoot+Zookeeper+Dubbo
Stars: ✭ 315 (+108.61%)
Mutual labels:  rabbitmq, distributed-systems
Toapi
Every web site provides APIs.
Stars: ✭ 3,209 (+2025.17%)
Mutual labels:  crawler, spider
Zhihu Login
Simulated Zhihu login, with support for extracting CAPTCHAs and saving cookies
Stars: ✭ 340 (+125.17%)
Mutual labels:  crawler, spider
leek
Celery Tasks Monitoring Tool
Stars: ✭ 77 (-49.01%)
Mutual labels:  rabbitmq, distributed
Fictiondown
Novel download | novel crawling | Qidian | Biquge | export to Markdown | export to txt | convert to epub | ad filtering | automatic proofreading
Stars: ✭ 362 (+139.74%)
Mutual labels:  crawler, spider
Diplomat
An HTTP Ruby API for Consul
Stars: ✭ 358 (+137.09%)
Mutual labels:  distributed-systems, distributed
Signature algorithm
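Request signing or encryption algorithms for various apps, mini programs, and websites. Currently covers: Ziroom, Xiaohongshu, Danke Apartment, luckin coffee, and bangkokair (Bangkok Airways)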
Stars: ✭ 380 (+151.66%)
Mutual labels:  crawler, spider
Freshonions Torscraper
Fresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion
Stars: ✭ 348 (+130.46%)
Mutual labels:  crawler, spider
Gosint
OSINT Swiss Army Knife
Stars: ✭ 401 (+165.56%)
Mutual labels:  crawler, spider
Libvineyard
libvineyard: an in-memory immutable data manager.
Stars: ✭ 392 (+159.6%)
Mutual labels:  distributed-systems, distributed
Mm131
Image crawler for the MM131 website 🚨
Stars: ✭ 129 (-14.57%)
Mutual labels:  crawler, spider
Xcrawler
A fast, concise, and powerful PHP crawler framework
Stars: ✭ 344 (+127.81%)
Mutual labels:  crawler, spider
Awesome Crawler
A collection of awesome web crawlers and spiders in different languages
Stars: ✭ 4,793 (+3074.17%)
Mutual labels:  crawler, spider
Hazelcast
Open-source distributed computation and storage platform
Stars: ✭ 4,662 (+2987.42%)
Mutual labels:  distributed, distributed-systems
Learnpython
Basic Python practice code and various crawler code
Stars: ✭ 451 (+198.68%)
Mutual labels:  crawler, spider
Fbcrawl
A Facebook crawler
Stars: ✭ 536 (+254.97%)
Mutual labels:  crawler, spider
Scrapy Redis
Redis-based components for Scrapy.
Stars: ✭ 4,998 (+3209.93%)
Mutual labels:  crawler, distributed
moqui-hazelcast
Moqui Framework tool component for Hazelcast, used for distributed async services, entity distributed cache invalidation, web session replication, and distributed cache (javax.cache)
Stars: ✭ 12 (-92.05%)
Mutual labels:  distributed-systems, distributed
Examples Of Web Crawlers
Some interesting examples of Python crawlers that are friendly to beginners, mainly crawling sites such as Taobao, Tmall, WeChat, Douban, and QQ.
Stars: ✭ 10,724 (+7001.99%)
Mutual labels:  crawler, spider
Grab Site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Stars: ✭ 680 (+350.33%)
Mutual labels:  crawler, spider
Spidr
A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+334.44%)
Mutual labels:  crawler, spider
Crawler
A high performance web crawler in Elixir.
Stars: ✭ 781 (+417.22%)
Mutual labels:  crawler, spider
Icrawler
A multi-thread crawler framework with many builtin image crawlers provided.
Stars: ✭ 629 (+316.56%)
Mutual labels:  crawler, spider
Zhihu Crawler
zhihu-crawler is a high-performance, Java-based distributed crawler project that supports a free HTTP proxy pool and horizontal scaling
Stars: ✭ 890 (+489.4%)
Mutual labels:  crawler, spider
Torbot
Dark Web OSINT Tool
Stars: ✭ 821 (+443.71%)
Mutual labels:  crawler, spider
Disec
Distributed Image Search Engine Crawler
Stars: ✭ 11 (-92.72%)
Mutual labels:  crawler, distributed
Baiduimagespider
A super lightweight Baidu image crawler
Stars: ✭ 591 (+291.39%)
Mutual labels:  crawler, spider
Bilibili member crawler
Bilibili user crawler. Yay~ it's a crawler
Stars: ✭ 115 (-23.84%)
Mutual labels:  crawler, spider
Decryptlogin
APIs for logging in to some websites using requests.
Stars: ✭ 1,861 (+1132.45%)
Mutual labels:  crawler, spider
Maman
Rust Web Crawler saving pages on Redis
Stars: ✭ 39 (-74.17%)
Mutual labels:  crawler, spider
Photon
Incredibly fast crawler designed for OSINT.
Stars: ✭ 8,332 (+5417.88%)
Mutual labels:  crawler, spider
Nodespider
[DEPRECATED] Simple, flexible, delightful web crawler/spider package
Stars: ✭ 33 (-78.15%)
Mutual labels:  crawler, spider
Beanbun
Beanbun is a multi-process web crawler framework written in PHP, with good openness and high extensibility, based on Workerman.
Stars: ✭ 1,096 (+625.83%)
Mutual labels:  crawler, spider
Car Prices
Golang crawler that scrapes the Autohome used-car product catalog
Stars: ✭ 57 (-62.25%)
Mutual labels:  crawler, spider
Crawler examples
Some classic web crawler projects.
Stars: ✭ 74 (-50.99%)
Mutual labels:  crawler, spider
Newcrawler
Free Web Scraping Tool with Java
Stars: ✭ 589 (+290.07%)
Mutual labels:  crawler, spider
Gopa Abandoned
GOPA, a spider written in Go. (NOTE: this project moved to https://github.com/infinitbyte/gopa)
Stars: ✭ 98 (-35.1%)
Mutual labels:  crawler, spider
Foundatio
Pluggable foundation blocks for building distributed apps.
Stars: ✭ 1,365 (+803.97%)
Mutual labels:  distributed-systems, distributed
Storj
Ongoing Storj v3 development. Decentralized cloud object storage that is affordable, easy to use, private, and secure.
Stars: ✭ 1,278 (+746.36%)
Mutual labels:  distributed-systems, distributed
Not Your Average Web Crawler
A web crawler (for bug hunting) that gathers more than you can imagine.
Stars: ✭ 107 (-29.14%)
Mutual labels:  crawler, spider
Crawler Detect
🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent
Stars: ✭ 1,549 (+925.83%)
Mutual labels:  crawler, spider
Baiduspider
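BaiduSpider, a crawler for Baidu search results. Currently supports Baidu web search, Baidu image search, Baidu Zhidao search, Baidu video search, Baidu news search, Baidu Wenku search, Baidu Jingyan search, and Baidu Baike search.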
Stars: ✭ 105 (-30.46%)
Mutual labels:  crawler, spider
Geziyor
Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (+725.17%)
Mutual labels:  crawler, spider
Sandglass
Sandglass is a distributed, horizontally scalable, persistent, time sorted message queue.
Stars: ✭ 1,531 (+913.91%)
Mutual labels:  distributed-systems, distributed
Magic google
Google search results crawler, get google search results that you need
Stars: ✭ 247 (+63.58%)
Mutual labels:  crawler, spider
nebula
A distributed, fast open-source graph database featuring horizontal scalability and high availability
Stars: ✭ 8,196 (+5327.81%)
Mutual labels:  distributed-systems, distributed
Douyin
API of DouYin for Humans used to Crawl Popular Videos and Musics
Stars: ✭ 580 (+284.11%)
Mutual labels:  crawler, spider
Puppeteer Walker
a puppeteer walker 🕷 🕸
Stars: ✭ 78 (-48.34%)
Mutual labels:  crawler, spider
Douban Movie
Golang crawler that scrapes the Douban Movie Top 250
Stars: ✭ 114 (-24.5%)
Mutual labels:  crawler, spider
Free proxy website
A collection of websites for obtaining free socks/https/http proxies
Stars: ✭ 119 (-21.19%)
Mutual labels:  crawler, spider