scrapman: Retrieve the real HTML (with JavaScript executed) from a URL; ultra fast, with support for loading multiple pages in parallel
Stars: ✭ 21 (+10.53%)
reason-rust-scraper: 🦀 Scraping & crawling websites using Rust and ReasonML
Stars: ✭ 21 (+10.53%)
big-data-upf: RECSM-UPF Summer School: Social Media and Big Data Research
Stars: ✭ 21 (+10.53%)
gochanges: **[ARCHIVED]** Website changes tracker 🔍
Stars: ✭ 12 (-36.84%)
readability-cli: A CLI for Mozilla Readability. Get clean, uncluttered, ready-to-read HTML from any webpage!
Stars: ✭ 41 (+115.79%)
pupflare: A webpage proxy that sends requests through Chromium (Puppeteer); can be used to bypass Cloudflare anti-bot / anti-DDoS protection from any application (such as curl)
Stars: ✭ 183 (+863.16%)
TradeTheEvent: Implementation of "Trade the Event: Corporate Events Detection for News-Based Event-Driven Trading," in Findings of ACL 2021
Stars: ✭ 64 (+236.84%)
ryuanime: Free anime streaming using jkanime content, obtained by scraping the jkanime website.
Stars: ✭ 20 (+5.26%)
Cloudflare Scrape: A Python module to bypass Cloudflare's anti-bot page.
Stars: ✭ 2,606 (+13615.79%)
Ferret: Declarative web scraping
Stars: ✭ 4,837 (+25357.89%)
Text-Analysis: Explaining textual analysis tools in Python, including preprocessing, skip-gram (word2vec), and topic modelling.
Stars: ✭ 48 (+152.63%)
thal: Translation: Puppeteer and Chrome Headless, from getting started to web crawling
Stars: ✭ 651 (+3326.32%)
metafetch: Node.js package that fetches a given URL's title, description, images, links, etc.
Stars: ✭ 21 (+10.53%)
imdb-scraper: 🎬 An attempt at the most complete IMDb API
Stars: ✭ 24 (+26.32%)
scrapism: A work-in-progress guide to web scraping as an artistic and critical practice
Stars: ✭ 43 (+126.32%)
document-dl: Command line program to download documents from web portals.
Stars: ✭ 14 (-26.32%)
OLX Scraper: 📻 An OLX scraper using Scrapy + MongoDB. It scrapes recent ads posted for a requested product and dumps them to MongoDB (NoSQL).
Stars: ✭ 15 (-21.05%)
torchestrator: Spin up Tor containers and then proxy HTTP requests via these Tor instances
Stars: ✭ 32 (+68.42%)
scavenger: Scrape and take screenshots of dynamic and static webpages
Stars: ✭ 14 (-26.32%)
proxycrawl-python: ProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (+168.42%)
ebayMarketAnalyzer: Scrape all eBay sold listings to determine average/median pricing, plot listings over time with trend lines, and export to Excel
Stars: ✭ 116 (+510.53%)
Instagram-to-discord: Monitor an Instagram user account and automatically post new images to a Discord channel via a webhook. Working 2022!
Stars: ✭ 113 (+494.74%)
youtube-audio: Extract YouTube videos in audio format using web scraping techniques 🎶
Stars: ✭ 68 (+257.89%)
LeetCode: Currently contains scraped data from around 1500 problems on the site. More to follow...
Stars: ✭ 45 (+136.84%)