ScrapoxyScrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!
Stars: ✭ 1,322 (-88.55%)
RcrawlerAn R web crawler and scraper
Stars: ✭ 274 (-97.63%)
OnegramThis repository is no longer maintained.
Stars: ✭ 137 (-98.81%)
Skycaiji蓝天采集器是一款免费的数据采集发布爬虫软件,采用php+mysql开发,可部署在云服务器,几乎能采集所有类型的网页,无缝对接各类CMS建站程序,免登录实时发布数据,全自动无需人工干预!是网页大数据采集软件中完全跨平台的云端爬虫系统
Stars: ✭ 1,514 (-86.89%)
Freshonions TorscraperFresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion
Stars: ✭ 348 (-96.99%)
DataflowkitExtract structured data from web sites. Web sites scraping.
Stars: ✭ 456 (-96.05%)
ScrapitScraping scripts for various websites.
Stars: ✭ 25 (-99.78%)
PypergrabberFetches PubMed article IDs (PMIDs) from email inbox, then crawls PubMed, Google Scholar and Sci-Hub for respective PDF files.
Stars: ✭ 14 (-99.88%)
CrawlerA high performance web crawler in Elixir.
Stars: ✭ 781 (-93.24%)
civic-scraperTools for downloading agendas, minutes and other documents produced by local government
Stars: ✭ 21 (-99.82%)
Social ScraperTổng hợp script crawl dữ liệu từ các mạng xã hội & website tiếng Việt
Stars: ✭ 47 (-99.59%)
CrawlerGo process used to crawl websites
Stars: ✭ 147 (-98.73%)
Youtube ProjectsThis repository contains all the code I use in my YouTube tutorials.
Stars: ✭ 144 (-98.75%)
AvbookAV 电影管理系统, avmoo , javbus , javlibrary 爬虫,线上 AV 影片图书馆,AV 磁力链接数据库,Japanese Adult Video Library,Adult Video Magnet Links - Japanese Adult Video Database
Stars: ✭ 8,133 (-29.55%)
ArachnidPowerful web scraping framework for Crystal
Stars: ✭ 68 (-99.41%)
Google Play ScraperGoogle play scraper for Python inspired by <facundoolano/google-play-scraper>
Stars: ✭ 143 (-98.76%)
AntchAntch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (-98.28%)
JvppeteerHeadless Chrome For Java (Java 爬虫)
Stars: ✭ 193 (-98.33%)
Instagram BotAn Instagram bot developed using the Selenium Framework
Stars: ✭ 138 (-98.8%)
Skrape.itA Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.
Stars: ✭ 231 (-98%)
Annie👾 Fast and simple video download library and CLI tool written in Go
Stars: ✭ 16,369 (+41.78%)
Ruiji.netcrawler framework, distributed crawler extractor
Stars: ✭ 220 (-98.09%)
GoscraperGolang pkg to quickly return a preview of a webpage (title/description/images)
Stars: ✭ 72 (-99.38%)
MalScraperScrape everything you can from MyAnimeList.net
Stars: ✭ 132 (-98.86%)
wget-luaWget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-99.55%)
ScrapedinLinkedIn Scraper (currently working 2020)
Stars: ✭ 453 (-96.08%)
News Pleasenews-please - an integrated web crawler and information extractor for news that just works.
Stars: ✭ 969 (-91.61%)
Jd AutobuyPython爬虫,京东自动登录,在线抢购商品
Stars: ✭ 1,174 (-89.83%)
WombatLightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.
Stars: ✭ 1,220 (-89.43%)
FawkesFawkes is a tool to search for targets vulnerable to SQL Injection. Performs the search using Google search engine.
Stars: ✭ 108 (-99.06%)
WebmagicA scalable web crawler framework for Java.
Stars: ✭ 10,186 (-11.77%)
Sentinel CrawlerXenomorph Crawler, a Concise, Declarative and Observable Distributed Crawler(Node / Go / Java / Rust) For Web, RDB, OS, also can act as a Monitor(with Prometheus) or ETL for Infrastructure 💫 多语言执行器,分布式爬虫
Stars: ✭ 118 (-98.98%)
Crawler Detect🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent
Stars: ✭ 1,549 (-86.58%)
Haxe.ioThe home of the Haxe Roundup's (Work in Progress)
Stars: ✭ 106 (-99.08%)
Mm131MM131网站图片爬取 🚨
Stars: ✭ 129 (-98.88%)
Docs《数据采集从入门到放弃》源码。内容简介:爬虫介绍、就业情况、爬虫工程师面试题 ;HTTP协议介绍; Requests使用 ;解析器Xpath介绍; MongoDB与MySQL; 多线程爬虫; Scrapy介绍 ;Scrapy-redis介绍; 使用docker部署; 使用nomad管理docker集群; 使用EFK查询docker日志
Stars: ✭ 118 (-98.98%)
Crawler爬虫, http代理, 模拟登陆!
Stars: ✭ 106 (-99.08%)
Moodle Downloader 2A Moodle downloader that downloads course content fast from Moodle (eg. lecture pdfs)
Stars: ✭ 118 (-98.98%)
D4n155OWASP D4N155 - Intelligent and dynamic wordlist using OSINT
Stars: ✭ 105 (-99.09%)
RodA Devtools driver for web automation and scraping
Stars: ✭ 1,392 (-87.94%)
GdeltpyrPython based framework to retreive Global Database of Events, Language, and Tone (GDELT) version 1.0 and version 2.0 data.
Stars: ✭ 124 (-98.93%)
Swift ZhiiOS ZhiHuDaily client, implemented in Swift
Stars: ✭ 103 (-99.11%)
SeleniumcrawlerAn example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site
Stars: ✭ 117 (-98.99%)
Learning Social Media Analytics With RThis repository contains code and bonus content which will be added from time to time for the book "Learning Social Media Analytics with R" by Packt
Stars: ✭ 102 (-99.12%)
DiggerDigger is a powerful and flexible web crawler implemented by pure golang
Stars: ✭ 130 (-98.87%)
Black WidowGUI based offensive penetration testing tool (Open Source)
Stars: ✭ 124 (-98.93%)
Cumcomic updater, mangafied
Stars: ✭ 117 (-98.99%)
YiifeedPre-moderated news aggregator
Stars: ✭ 100 (-99.13%)
RuiaAsync Python 3.6+ web scraping micro-framework based on asyncio
Stars: ✭ 1,366 (-88.17%)
DecryptloginAPIs for loginning some websites by using requests.
Stars: ✭ 1,861 (-83.88%)
SillyniumAutomate the creation of Python Selenium Scripts by drawing coloured boxes on webpage elements
Stars: ✭ 100 (-99.13%)