Polite: Be nice on the web
Stars: ✭ 253 (+289.23%)
newspaperjs: News extraction and scraping. Article Parsing
Stars: ✭ 59 (-9.23%)
getCRUCLdata: CRU CL v. 2.0 Climatology Client for R
Stars: ✭ 17 (-73.85%)
Spydan: A web spider for shodan.io without using the Developer API.
Stars: ✭ 30 (-53.85%)
Autoscraper: A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+6172.31%)
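Java Spider: A hands-on Java crawler framework built on top of the webmagic framework. It can already crawl news content from Tencent, Sohu, Toutiao (a separately integrated feature), and other sources; combined with elasticsearch, it provides automated crawling and is already in production use.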
Stars: ✭ 276 (+324.62%)
Gosint: OSINT Swiss Army Knife
Stars: ✭ 401 (+516.92%)
Awesome Crawler: A collection of awesome web crawlers and spiders in different languages
Stars: ✭ 4,793 (+7273.85%)
Geziyor: Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (+1816.92%)
riem: ✈️ ☀️ R package for accessing ASOS data via the Iowa Environment Mesonet ☁️ ✈️
Stars: ✭ 38 (-41.54%)
cyphr: Humane encryption
Stars: ✭ 91 (+40%)
aliexscrape: Get Aliexpress product details in JSON
Stars: ✭ 80 (+23.08%)
roadoi: Use Unpaywall with R
Stars: ✭ 60 (-7.69%)
arachnod: High performance crawler for Nodejs
Stars: ✭ 17 (-73.85%)
Instagram-Scraper-2021: Scrape Instagram content and stories anonymously, using a new technique based on the har file (No Token + No public API).
Stars: ✭ 57 (-12.31%)
Freshonions Torscraper: Fresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion
Stars: ✭ 348 (+435.38%)
newsemble: API for fetching data from news websites.
Stars: ✭ 42 (-35.38%)
Huginn: Create agents that monitor and act on your behalf. Your agents are standing by!
Stars: ✭ 33,694 (+51736.92%)
Avbook: AV movie management system; crawlers for avmoo, javbus, and javlibrary; online AV film library; AV magnet link database. Japanese Adult Video Library, Adult Video Magnet Links - Japanese Adult Video Database
Stars: ✭ 8,133 (+12412.31%)
Goribot: [Crawler/Scraper for Golang] 🕷 A lightweight, distributed-friendly Golang crawler framework.
Stars: ✭ 190 (+192.31%)
Querylist: 🕷️ The progressive PHP crawler framework! An elegant, progressive PHP scraping framework.
Stars: ✭ 2,392 (+3580%)
Colly: Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+23800%)
rdflib: 📦 High level wrapper around the redland package for common rdf applications
Stars: ✭ 47 (-27.69%)
geoparser: ⛔ ARCHIVED ⛔ R package for the Geoparser.io API
Stars: ✭ 38 (-41.54%)
bittrex: An R Client for the Bittrex Crypto-Currency Exchange
Stars: ✭ 26 (-60%)
NLMR: 📦 R package to simulate neutral landscape models 🏔
Stars: ✭ 57 (-12.31%)
rrlite: R interface to rlite https://github.com/seppo0010/rlite
Stars: ✭ 16 (-75.38%)
OpenScraper: An open source webapp for scraping: towards a public service for webscraping
Stars: ✭ 80 (+23.08%)
wget-lua: Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-20%)
Mimo-Crawler: A web crawler that uses Firefox and js injection to interact with webpages and crawl their content, written in nodejs.
Stars: ✭ 22 (-66.15%)
ropenaq: ⛔ ARCHIVED ⛔ Accesses Air Quality Data from the Open Data Platform OpenAQ
Stars: ✭ 69 (+6.15%)
bing-ip2hosts: bingip2hosts is a Bing.com web scraper that discovers websites by IP address
Stars: ✭ 99 (+52.31%)
scrapy facebooker: Collection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.
Stars: ✭ 22 (-66.15%)
Rcrawler: An R web crawler and scraper
Stars: ✭ 274 (+321.54%)
metacritic api: PHP Metacritic API - Mirrored by my GitLab
Stars: ✭ 31 (-52.31%)
Xcrawler: A fast, concise, and powerful PHP crawler framework
Stars: ✭ 344 (+429.23%)
Xidel: Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Stars: ✭ 335 (+415.38%)
Crawly: Crawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+576.92%)
opencage: 🌐 R package for the OpenCage API -- both forward and reverse geocoding 🌐
Stars: ✭ 82 (+26.15%)
Scrapit: Scraping scripts for various websites.
Stars: ✭ 25 (-61.54%)
ant: A web crawler for Go
Stars: ✭ 264 (+306.15%)
Crawler: A high performance web crawler in Elixir.
Stars: ✭ 781 (+1101.54%)
Linkedin Profile Scraper: 🕵️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (+163.08%)
Youtube Projects: This repository contains all the code I use in my YouTube tutorials.
Stars: ✭ 144 (+121.54%)
Spidr: A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+909.23%)
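scraper: Image crawling and download tool that rapidly crawls and downloads the images, photos, and illustrations uploaded by designers and users on ZCOOL (https://www.zcool.com.cn/) and CNU (http://www.cnu.cc/).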
Stars: ✭ 64 (-1.54%)
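TikTokDownloader PyWebIO: 🚀 "Douyin_TikTok_Download_API" is an out-of-the-box, high-performance asynchronous Douyin | TikTok data scraping tool that supports API calls and online batch parsing and downloading.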
Stars: ✭ 919 (+1313.85%)
nlrx: nlrx NetLogo R
Stars: ✭ 66 (+1.54%)
rdefra: Interact with the UK AIR Pollution Database from DEFRA
Stars: ✭ 14 (-78.46%)
suppdata: Grabbing SUPPlementary DATA in R
Stars: ✭ 31 (-52.31%)
PostcodesioR: API wrapper around postcodes.io - free UK postcode lookup and geocoder
Stars: ✭ 36 (-44.62%)
weathercan: R package for downloading weather data from Environment and Climate Change Canada
Stars: ✭ 83 (+27.69%)
Fbcrawl: A Facebook crawler
Stars: ✭ 536 (+724.62%)
medrxivr: Access and search medRxiv and bioRxiv preprint data
Stars: ✭ 34 (-47.69%)