1095 open source projects that are alternatives to or similar to Awesome Web Scraper

91porn php
The simplest 91porn crawler, PHP version
Stars: ✭ 557 (+278.91%)
Mutual labels:  spider
Blivet
A python module for configuration of block devices
Stars: ✭ 68 (-53.74%)
Mutual labels:  storage
Phantomjs Installer
A Composer Package which installs the PhantomJS binary (Linux, Windows, Mac) into /bin of your project.
Stars: ✭ 145 (-1.36%)
Mutual labels:  phantomjs
Rafter
Kubernetes-native S3-like files/assets store based on CRDs and powered by MinIO
Stars: ✭ 145 (-1.36%)
Mutual labels:  storage
Collector Http
Norconex HTTP Collector is a flexible web crawler for collecting, parsing, and manipulating data from the Internet (or Intranet) to various data repositories such as search engines.
Stars: ✭ 130 (-11.56%)
Mutual labels:  web-crawler
Pulsar
Turn large websites into tables and charts using simple SQL.
Stars: ✭ 100 (-31.97%)
Mutual labels:  web-crawler
Bleeper
Library to manage your firmware configurations written in C++
Stars: ✭ 54 (-63.27%)
Mutual labels:  storage
Gulp Mocha Phantomjs
run client-side Mocha tests with PhantomJS
Stars: ✭ 67 (-54.42%)
Mutual labels:  phantomjs
Xsrfprobe
The Prime Cross Site Request Forgery (CSRF) Audit and Exploitation Toolkit.
Stars: ✭ 532 (+261.9%)
Mutual labels:  spider
Yspider
yspider -- a lightweight crawler system
Stars: ✭ 125 (-14.97%)
Mutual labels:  spider
Antcolony
A magnet-link crawler implemented in Node.js http://findit.keenwon.com (formerly http://findit.so )
Stars: ✭ 1,151 (+682.99%)
Mutual labels:  spider
Scrapy Fake Useragent
Random User-Agent middleware based on fake-useragent
Stars: ✭ 520 (+253.74%)
Mutual labels:  scrapy
Electron Storage
Simply save/load json files to/from file system in electron applications
Stars: ✭ 109 (-25.85%)
Mutual labels:  storage
Coursera Dl
Script for downloading Coursera.org videos and naming them.
Stars: ✭ 8,609 (+5756.46%)
Mutual labels:  storage
Anti Webspider
Anti-scraping techniques for the web front end
Stars: ✭ 486 (+230.61%)
Mutual labels:  spider
Netease Music Spider
netease-music-spider is a spider with which you can find a beautiful girlfriend or handsome boyfriend.
Stars: ✭ 147 (+0%)
Mutual labels:  spider
Binaryprefs
Fast and lightweight re-implementation of SharedPreferences which stores each preference in a separate file, performs disk operations via NIO with memory-mapped byte buffers, and works across processes (IPC). Written from scratch.
Stars: ✭ 484 (+229.25%)
Mutual labels:  storage
Terraform Aws S3 Log Storage
This module creates an S3 bucket suitable for receiving logs from other AWS services such as S3, CloudFront, and CloudTrail
Stars: ✭ 65 (-55.78%)
Mutual labels:  storage
Offline Plugin
Offline plugin (ServiceWorker, AppCache) for webpack (https://webpack.js.org/)
Stars: ✭ 4,444 (+2923.13%)
Mutual labels:  storage
Not Your Average Web Crawler
A web crawler (for bug hunting) that gathers more than you can imagine.
Stars: ✭ 107 (-27.21%)
Mutual labels:  spider
Movieheavens
🎬 A simple movie search tool based on PyQt5
Stars: ✭ 465 (+216.33%)
Mutual labels:  spider
Cvpr2019
Displays all the 2019 CVPR Accepted Papers in a way that they are easy to parse.
Stars: ✭ 65 (-55.78%)
Mutual labels:  web-crawler
Proxy
A simple tool for fetching usable proxies from several websites.
Stars: ✭ 124 (-15.65%)
Mutual labels:  web-crawler
Qzoneexport
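Qzone export assistant for backing up Qzone posts, blogs, private diaries, albums, videos, message boards, QQ friends, favorites, shares, and recent visitors to files, for easy migration and preservation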
Stars: ✭ 456 (+210.2%)
Mutual labels:  spider
Btlet
Toolkits that implement parts of the BT protocol, such as a DHT spider.
Stars: ✭ 54 (-63.27%)
Mutual labels:  spider
Bdp Dataplatform
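A data platform for big data ecosystem solutions: a big data solution built on big data, data platforms, microservices, machine learning, e-commerce, automated operations, DevOps, container deployment platforms, and data platform collection, storage, computation, development, and application building.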
Stars: ✭ 456 (+210.2%)
Mutual labels:  spider
Crawler Detect
🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent
Stars: ✭ 1,549 (+953.74%)
Mutual labels:  spider
Csi Digitalocean
A Container Storage Interface (CSI) Driver for DigitalOcean Block Storage
Stars: ✭ 452 (+207.48%)
Mutual labels:  storage
Taobao duoshou
Collects Taobao data with Scrapy and displays it with Flask
Stars: ✭ 63 (-57.14%)
Mutual labels:  scrapy
Spring Backend Boilerplate
The modularized backend boilerplate based on Spring Boot Framework, easy to get started and add your business part.
Stars: ✭ 134 (-8.84%)
Mutual labels:  storage
Dotnetcrawler
DotnetCrawler is a straightforward, lightweight web crawling/scraping library based on .NET Core with Entity Framework Core output. The library is designed like other strong crawler libraries such as WebMagic and Scrapy, but is extensible for your custom requirements. Medium link: https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (-31.97%)
Mutual labels:  scrapy
Gotools
Some tools written in Go.
Stars: ✭ 54 (-63.27%)
Mutual labels:  spider
Qqzonemood
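QQZone mood spider and analysis. A multithreaded Qzone crawler with data mining. Provides an online service: scan the QR code to log in and your data is crawled and analyzed automatically, with data presented in the style of the NetEase Cloud Music annual report. Packaged with docker-compose for easy deployment. Also provides a Qzone lottery mini program.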
Stars: ✭ 439 (+198.64%)
Mutual labels:  spider
Phantomjs Maven Plugin
A maven plugin for installing the phantomjs binary on your system automatically.
Stars: ✭ 62 (-57.82%)
Mutual labels:  phantomjs
Jsstore
A complete IndexedDB wrapper with SQL like syntax.
Stars: ✭ 430 (+192.52%)
Mutual labels:  storage
Crawler
Crawlers, HTTP proxies, simulated login!
Stars: ✭ 106 (-27.89%)
Mutual labels:  scrapy
Kinto
A generic JSON document store with sharing and synchronisation capabilities.
Stars: ✭ 4,150 (+2723.13%)
Mutual labels:  storage
T66y spider
Multithreaded Python downloader for images in posts from two boards of the t66y.com site, 【新時代的我們】 and 【達蓋爾的旗幟】
Stars: ✭ 62 (-57.82%)
Mutual labels:  spider
Toplist
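Today's Hot List, an aggregator site that collects trending headlines from major popular websites. Written in Go, it fetches information quickly and asynchronously using multiple goroutines. Preview: https://mo.fish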
Stars: ✭ 4,331 (+2846.26%)
Mutual labels:  spider
Apiproject
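[https://www.sofineday.com], a Go project scaffold integrating best practices (gin + gorm + go-redis + mongo + cors + jwt + zap JSON logging library (supports log collection to kafka or mongo) + kafka message queue + gopay for WeChat and Alipay payments + API encryption + API reverse proxy + go modules dependency management + chromedp headless crawler + makefile + binary compression + livereload hot reloading)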
Stars: ✭ 124 (-15.65%)
Mutual labels:  spider
Openstorage
A multi-host clustered implementation of the open storage specification
Stars: ✭ 407 (+176.87%)
Mutual labels:  storage
Boj Autocommit
When you solve a problem on Baekjoon Online Judge, it automatically commits and pushes to the remote repository.
Stars: ✭ 60 (-59.18%)
Mutual labels:  phantomjs
Ramcloud
**No Longer Maintained** Official RAMCloud repo
Stars: ✭ 405 (+175.51%)
Mutual labels:  storage
Scrapyd Cluster On Heroku
Set up a free and scalable Scrapyd cluster for distributed web crawling with just a few clicks.
Stars: ✭ 106 (-27.89%)
Mutual labels:  scrapy
Scrapydouban
Scrapy crawler for Douban Movies / Douban Books
Stars: ✭ 400 (+172.11%)
Mutual labels:  scrapy
Test demo
A demo of test scripts written in Python.
Stars: ✭ 60 (-59.18%)
Mutual labels:  spider
Venom
All Terrain Autonomous Quadruped
Stars: ✭ 145 (-1.36%)
Mutual labels:  spider
Spiderman
A general-purpose distributed crawler framework based on scrapy-redis
Stars: ✭ 392 (+166.67%)
Mutual labels:  scrapy
Heketi
RESTful based volume management framework for GlusterFS
Stars: ✭ 1,106 (+652.38%)
Mutual labels:  storage
Files
Docs and files for ScrapydWeb, Scrapyd, Scrapy, and other projects
Stars: ✭ 390 (+165.31%)
Mutual labels:  scrapy
Splash
Splash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-28.57%)
Mutual labels:  storage
Last Statement Of Death Row
Last-Statement-of-Death-Row: when a person is about to die, their words are kind
Stars: ✭ 53 (-63.95%)
Mutual labels:  spider
Diskover Web
Web file manager, disk space usage, storage search engine and file system analytics for diskover
Stars: ✭ 121 (-17.69%)
Mutual labels:  storage
Laravel Storage
A simple filesystem abstraction package for Laravel 4.
Stars: ✭ 100 (-31.97%)
Mutual labels:  storage
Project Tauro
A Router WiFi key recovery/cracking tool with a twist.
Stars: ✭ 52 (-64.63%)
Mutual labels:  web-scraper
Serverless Html Pdf
Convert HTML to PDF thru a lambda function using PhantomJS.
Stars: ✭ 51 (-65.31%)
Mutual labels:  phantomjs
Ruia
Async Python 3.6+ web scraping micro-framework based on asyncio
Stars: ✭ 1,366 (+829.25%)
Mutual labels:  spider
Lmlcspider production
🐞 Sales statistics for 立马理财 (Lmlc) (crawler + page display)
Stars: ✭ 51 (-65.31%)
Mutual labels:  spider
Cloudmusic
NetEase Cloud Music crawler solution
Stars: ✭ 51 (-65.31%)
Mutual labels:  spider
Mm131
Image crawler for the MM131 website 🚨
Stars: ✭ 129 (-12.24%)
Mutual labels:  spider
361-420 of 1095 similar projects