1095 open source projects that are alternatives to or similar to Awesome Web Scraper

Taiwan News Crawlers
Scrapy-based Crawlers for news of Taiwan
Stars: ✭ 83 (-43.54%)
Mutual labels:  scrapy
Dexie.js
A Minimalistic Wrapper for IndexedDB
Stars: ✭ 7,337 (+4891.16%)
Mutual labels:  storage
Node Disk Manager
Kubernetes Storage Device Management
Stars: ✭ 128 (-12.93%)
Mutual labels:  storage
Api
SODA API is an open source implementation of SODA API Standards for Data and Storage Management.
Stars: ✭ 795 (+440.82%)
Mutual labels:  storage
Echo360
Command-line tool for automated downloads of echo360 videos hosted by universities
Stars: ✭ 81 (-44.9%)
Mutual labels:  phantomjs
Awesome Python Primer
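An index of quality Chinese-language resources for self-taught Python beginners, covering books, docs, and videos, suited to web crawling, web development, data analysis, and machine learning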
Stars: ✭ 57 (-61.22%)
Mutual labels:  spider
Bilibili member crawler
Bilibili user crawler. Yay, it's a crawler!
Stars: ✭ 115 (-21.77%)
Mutual labels:  spider
Libaums
Open source library to access USB Mass Storage devices on Android without rooting your device
Stars: ✭ 769 (+423.13%)
Mutual labels:  storage
Email Extractor
The main functionality is to extract all the email addresses from one or several URLs
Stars: ✭ 81 (-44.9%)
Mutual labels:  scrapy
Qqmusicspider
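A Scrapy-based QQ Music crawler (QQ Music Spider) that scrapes song information, lyrics, featured comments, and more, and shares a music corpus of 490,000+ items from the top 6,400 mainland Chinese, Hong Kong, and Taiwanese artists on QQ Music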
Stars: ✭ 120 (-18.37%)
Mutual labels:  scrapy
React Native Shared Preferences
Android's Native key value storage system in React Native
Stars: ✭ 101 (-31.29%)
Mutual labels:  storage
Arc
📎 Flexible file upload and attachment library for Elixir
Stars: ✭ 1,087 (+639.46%)
Mutual labels:  storage
House Renting
Possibly the best practice of Scrapy 🕷 and renting a house 🏡
Stars: ✭ 741 (+404.08%)
Mutual labels:  scrapy
React Native Persistent Job
Run async tasks that retry after a crash, connection loss or exception
Stars: ✭ 80 (-45.58%)
Mutual labels:  storage
Defaults
Swifty and modern UserDefaults
Stars: ✭ 734 (+399.32%)
Mutual labels:  storage
Douban Movie
Golang crawler that scrapes the Douban Movie Top 250
Stars: ✭ 114 (-22.45%)
Mutual labels:  spider
Bilibili Api
API-calling module for Bilibili
Stars: ✭ 704 (+378.91%)
Mutual labels:  spider
Ultimate Dork
Web Crawler
Stars: ✭ 79 (-46.26%)
Mutual labels:  web-crawler
Minio
High Performance, Kubernetes Native Object Storage
Stars: ✭ 30,698 (+20782.99%)
Mutual labels:  storage
Dialogue.moe
Stars: ✭ 127 (-13.61%)
Mutual labels:  scrapy
Dnsfs
Store your data in other people's DNS resolver caches
Stars: ✭ 696 (+373.47%)
Mutual labels:  storage
Detect Cms
PHP Library for detecting CMS
Stars: ✭ 78 (-46.94%)
Mutual labels:  web-scraper
Redux Storage
Persistence layer for redux with flexible backends
Stars: ✭ 681 (+363.27%)
Mutual labels:  storage
Weibo hot search
Weibo crawler: scrapes the Weibo hot search list on a daily schedule, preserving the memories of internet users.
Stars: ✭ 113 (-23.13%)
Mutual labels:  scrapy
Oneblog
👽 OneBlog, a clean, attractive, powerful, and responsive Java blog
Stars: ✭ 678 (+361.22%)
Mutual labels:  spider
Olxscraper
OLX Scraper in Python Scrapy
Stars: ✭ 76 (-48.3%)
Mutual labels:  scrapy
Wechatbot4xianyu
🤖 WeChat subscription bot | 🐟 WeChat subscription bot for monitoring second-hand goods on Xianyu
Stars: ✭ 56 (-61.9%)
Mutual labels:  spider
Rook
Storage Orchestration for Kubernetes
Stars: ✭ 9,369 (+6273.47%)
Mutual labels:  storage
Bojack
🐴 The unreliable key-value store
Stars: ✭ 101 (-31.29%)
Mutual labels:  storage
Hexa
Hexa: The ultimate companion for Azure. Setup and deploy in seconds
Stars: ✭ 56 (-61.9%)
Mutual labels:  storage
Ffdl
Fabric for Deep Learning (FfDL, pronounced fiddle) is a Deep Learning Platform offering TensorFlow, Caffe, PyTorch etc. as a Service on Kubernetes
Stars: ✭ 640 (+335.37%)
Mutual labels:  storage
Phantomjs Installer
A Composer Package which installs the PhantomJS binary (Linux, Windows, Mac) into /bin of your project.
Stars: ✭ 145 (-1.36%)
Mutual labels:  phantomjs
Rafter
Kubernetes-native S3-like files/assets store based on CRDs and powered by MinIO
Stars: ✭ 145 (-1.36%)
Mutual labels:  storage
Collector Http
Norconex HTTP Collector is a flexible web crawler for collecting, parsing, and manipulating data from the Internet (or Intranet) to various data repositories such as search engines.
Stars: ✭ 130 (-11.56%)
Mutual labels:  web-crawler
Pulsar
Turn large websites into tables and charts using simple SQL queries.
Stars: ✭ 100 (-31.97%)
Mutual labels:  web-crawler
Bleeper
Library to manage your firmware configurations written in C++
Stars: ✭ 54 (-63.27%)
Mutual labels:  storage
Scrapyrt
HTTP API for Scrapy spiders
Stars: ✭ 637 (+333.33%)
Mutual labels:  scrapy
Crawler examples
Some classic web crawler projects.
Stars: ✭ 74 (-49.66%)
Mutual labels:  spider
Store
A better way to use localStorage and sessionStorage
Stars: ✭ 1,646 (+1019.73%)
Mutual labels:  storage
Zbox
Zero-details, privacy-focused in-app file system.
Stars: ✭ 1,185 (+706.12%)
Mutual labels:  storage
Infospider
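INFO-SPIDER is a crawler toolbox 🧰 that brings many data sources together, aiming to help users retrieve their own data safely and quickly. The tool's code is open source and the process is transparent. Supported data sources include GitHub, QQ Mail, NetEase Mail, Alibaba Mail, Sina Mail, Hotmail, Outlook, JD.com, Taobao, Alipay, China Mobile, China Unicom, China Telecom, Zhihu, Bilibili, NetEase Cloud Music, QQ friends, QQ groups, WeChat Moments album generation, browser history, 12306, Cnblogs, CSDN blogs, OSChina blogs, and Jianshu.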
Stars: ✭ 5,984 (+3970.75%)
Mutual labels:  spider
Programer log
Latest updates here: [My Programmer Log]
Stars: ✭ 112 (-23.81%)
Mutual labels:  scrapy
Quick Media
Web service supporting media (audio/image/QR code/Markdown/HTML/SVG): multimedia editing, stylish QR codes, audio, images, and SVG, Markdown, and HTML rendering
Stars: ✭ 612 (+316.33%)
Mutual labels:  phantomjs
Scrapy Examples
Some Scrapy and web.py examples
Stars: ✭ 71 (-51.7%)
Mutual labels:  scrapy
Papa
A browser-side data crawler that aims to be everyone's data assistant
Stars: ✭ 145 (-1.36%)
Mutual labels:  spider
Domain hunter
A Burp Suite extension that automatically collects all subdomains, similar domains, and related domains of a company or organization, based on traffic
Stars: ✭ 594 (+304.08%)
Mutual labels:  spider
Spider
python crawler spider
Stars: ✭ 70 (-52.38%)
Mutual labels:  spider
Baiduimagespider
A super lightweight Baidu image crawler
Stars: ✭ 591 (+302.04%)
Mutual labels:  spider
Cockroach
Yet another Java content-fetching (that is, crawler) tool
Stars: ✭ 112 (-23.81%)
Mutual labels:  spider
Filegator
Powerful Multi-User File Manager
Stars: ✭ 587 (+299.32%)
Mutual labels:  storage
Nodestream
Storage-agnostic streaming library for binary data transfers
Stars: ✭ 70 (-52.38%)
Mutual labels:  storage
Douyin
API of DouYin for Humans, used to crawl popular videos and music
Stars: ✭ 580 (+294.56%)
Mutual labels:  spider
Soup
Web Scraper in Go, similar to BeautifulSoup
Stars: ✭ 1,685 (+1046.26%)
Mutual labels:  web-scraper
Btlet
Some toolkits that implement part of the BT protocol, such as a DHT spider.
Stars: ✭ 54 (-63.27%)
Mutual labels:  spider
Dotnetcrawler
DotnetCrawler is a straightforward, lightweight web crawling/scraping library with Entity Framework Core output, based on .NET Core. It is designed like other strong crawler libraries such as WebMagic and Scrapy, but is extensible to fit your custom requirements. Medium link: https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (-31.97%)
Mutual labels:  scrapy
Gotools
Some tools created with Go.
Stars: ✭ 54 (-63.27%)
Mutual labels:  spider
Last Statement Of Death Row
Last-Statement-of-Death-Row: when a person is about to die, their words are kind
Stars: ✭ 53 (-63.95%)
Mutual labels:  spider
Diskover Web
Web file manager, disk space usage, storage search engine and file system analytics for diskover
Stars: ✭ 121 (-17.69%)
Mutual labels:  storage
Laravel Storage
A simple filesystem abstraction package for Laravel 4.
Stars: ✭ 100 (-31.97%)
Mutual labels:  storage
Project Tauro
A Router WiFi key recovery/cracking tool with a twist.
Stars: ✭ 52 (-64.63%)
Mutual labels:  web-scraper