Thepiratebay💀 The Pirate Bay node.js client
Stars: ✭ 191 (+730.43%)
ReadablewebproxyRewriting web proxy and archival tool. At this point, it just tries to download all the things.
Stars: ✭ 172 (+647.83%)
Scrape Linkedin Selenium`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
Stars: ✭ 239 (+939.13%)
Jsonframe Cheeriosimple multi-level scraper json input/output for Cheerio
Stars: ✭ 196 (+752.17%)
DemeterDemeter is a tool for scraping the calibre web ui
Stars: ✭ 155 (+573.91%)
TwitterScraperScrape a User's Twitter data! Bypass the 3,200 tweet API limit for a User!
Stars: ✭ 80 (+247.83%)
RedditExtractorA minimalistic R wrapper for the Reddit API
Stars: ✭ 58 (+152.17%)
Datmusic ApiAlternative for VK Audio API
Stars: ✭ 160 (+595.65%)
Scrapysharpreborn of https://bitbucket.org/rflechner/scrapysharp
Stars: ✭ 226 (+882.61%)
Weibo terminaterFinal Weibo Crawler Scrap Anything From Weibo, comments, weibo contents, followers, anything. The Terminator
Stars: ✭ 2,295 (+9878.26%)
Scraperwiki PythonScraperWiki Python library for scraping and saving data
Stars: ✭ 146 (+534.78%)
Node Ytdl CoreYouTube video downloader in javascript.
Stars: ✭ 3,004 (+12960.87%)
jd-autobuyPython爬虫,京东自动登录,在线抢购商品
Stars: ✭ 1,262 (+5386.96%)
Goribot[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。
Stars: ✭ 190 (+726.09%)
Heroku ebooksA script to generate Markov chains and to post to an _ebooks account on Twitter using Heroku
Stars: ✭ 251 (+991.3%)
Instagram CrawlerCrawl instagram photos, posts and videos for download.
Stars: ✭ 178 (+673.91%)
latent space adventuresBuckle up, adventure in the styleGAN2-ada-pytorch network latent space awaits
Stars: ✭ 59 (+156.52%)
Scrape Twitter🐦 Access Twitter data without an API key. [DEPRECATED]
Stars: ✭ 166 (+621.74%)
Skrape.itA Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.
Stars: ✭ 231 (+904.35%)
Covid19 mobilityCOVID-19 Mobility Data Aggregator. Scraper of Google, Apple, Waze and TomTom COVID-19 Mobility Reports🚶🚘🚉
Stars: ✭ 156 (+578.26%)
google-scraperThis class can retrieve search results from Google.
Stars: ✭ 33 (+43.48%)
Goose ParserUniversal scrapping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (+817.39%)
CollyElegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+67443.48%)
Google2csvGoogle2Csv a simple google scraper that saves the results on a csv/xlsx/jsonl file
Stars: ✭ 145 (+530.43%)
MangDLThe most inefficient Manga downloader for PC
Stars: ✭ 40 (+73.91%)
Querylist🕷️ The progressive PHP crawler framework! 优雅的渐进式PHP采集框架。
Stars: ✭ 2,392 (+10300%)
JvppeteerHeadless Chrome For Java (Java 爬虫)
Stars: ✭ 193 (+739.13%)
UnfurlScraper for oEmbed, Twitter Cards and Open Graph metadata - fast and Promise-based ⚡️
Stars: ✭ 193 (+739.13%)
lopezCrawling and scraping the Web for fun and profit
Stars: ✭ 20 (-13.04%)
Anime DlAnime-dl is a command-line program to download anime from CrunchyRoll and Funimation.
Stars: ✭ 190 (+726.09%)
PoliteBe nice on the web
Stars: ✭ 253 (+1000%)
GmdbGMDB is the ultra-simple, cross-platform Movie Library with Features (Search, Take Note, Watch Later, Like, Import, Learn, Instantly Torrent Magnet Watch)
Stars: ✭ 189 (+721.74%)
proxy-scraper⭐️ A proxy scraper made using Protractor | Proxy list Updates every three hour 🔥
Stars: ✭ 201 (+773.91%)
Unhtml.rsA magic html parser
Stars: ✭ 180 (+682.61%)
Instagram Proxy ApiCORS compliant API to access Instagram's public data
Stars: ✭ 245 (+965.22%)
Linkedin Profile Scraper🕵️♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (+643.48%)
nyt-first-saidTweets when words are published for the first time in the NYT
Stars: ✭ 222 (+865.22%)
Novel基于 Laravel 5.2 的小说网站
Stars: ✭ 172 (+647.83%)
GetsyA simple browser/client-side web scraper.
Stars: ✭ 238 (+934.78%)
Scrapelib⛏ a library for scraping things
Stars: ✭ 164 (+613.04%)
scrapetubeGet all videos from a youtube channel, get all videos from a playlist, get all videos that match a search
Stars: ✭ 120 (+421.74%)
OpensanctionsAn open database of international sanctions data, persons of interest and politically exposed persons
Stars: ✭ 157 (+582.61%)
Annie👾 Fast and simple video download library and CLI tool written in Go
Stars: ✭ 16,369 (+71069.57%)
Instagram Scraperscrapes medias, likes, followers, tags and all metadata. Inspired by instagram-php-scraper,bot
Stars: ✭ 2,209 (+9504.35%)
file-extensionsJSON collection of scraped file extensions, along with their description and type, from FileInfo.com
Stars: ✭ 15 (-34.78%)
SerpscrapSEO python scraper to extract data from major searchengine result pages. Extract data like url, title, snippet, richsnippet and the type from searchresults for given keywords. Detect Ads or make automated screenshots. You can also fetch text content of urls provided in searchresults or by your own. It's usefull for SEO and business related research tasks.
Stars: ✭ 153 (+565.22%)
Ruiji.netcrawler framework, distributed crawler extractor
Stars: ✭ 220 (+856.52%)
PhpscraperPHP Scraper - an highly opinionated web-interface for PHP
Stars: ✭ 148 (+543.48%)
yellowpages-scraperYellowpages.com Web Scraper written in Python and LXML to extract business details available based on a particular category and location.
Stars: ✭ 56 (+143.48%)
Media ScraperScrapes all photos and videos in a web page / Instagram / Twitter / Tumblr / Reddit / pixiv / TikTok
Stars: ✭ 206 (+795.65%)
tinyPornManagerMade for pornhub. Fork from tinyMediaManager v3
Stars: ✭ 57 (+147.83%)
Pahe.ph-ScraperPahe.ph [Pahe.in] Movies Website Scraper
Stars: ✭ 57 (+147.83%)
TradeTheEventImplementation of "Trade the Event: Corporate Events Detection for News-Based Event-Driven Trading." In Findings of ACL2021
Stars: ✭ 64 (+178.26%)
Tianyanchapip安装的天眼查爬虫API,指定的单个/多个企业工商信息一键保存为Excel/JSON格式。A Battery-included Scraper API of Tianyancha, the best Chinese business data and investigation platform.
Stars: ✭ 206 (+795.65%)