ReaderExtract clean(er), readable text from web pages via Mercury Web Parser.
Stars: ✭ 75 (+82.93%)
gochanges**[ARCHIVED]** website changes tracker 🔍
Stars: ✭ 12 (-70.73%)
pupflareA webpage proxy that request through Chromium (puppeteer) - can be used to bypass Cloudflare anti bot / anti ddos on any application (like curl)
Stars: ✭ 183 (+346.34%)
proxycrawl-pythonProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (+24.39%)
AutoscraperA Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+9843.9%)
easy reader⏮ ⏯ ⏭ A Rust library for easily navigating forward, backward or randomly through the lines of huge files.
Stars: ✭ 83 (+102.44%)
ha-multiscrapeHome Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.
Stars: ✭ 103 (+151.22%)
document-dlCommand line program to download documents from web portals.
Stars: ✭ 14 (-65.85%)
trafilaturaPython & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (+1634.15%)
web-clipperEasily download the main content of a web page in html, markdown, and/or epub format from command line.
Stars: ✭ 15 (-63.41%)
reason-rust-scraper🦀 Scraping & crawling websites using Rust, and ReasonML
Stars: ✭ 21 (-48.78%)
torchestratorSpin up Tor containers and then proxy HTTP requests via these Tor instances
Stars: ✭ 32 (-21.95%)
scavengerScrape and take screenshots of dynamic and static webpages
Stars: ✭ 14 (-65.85%)
diffbot-php-client[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (+29.27%)
ReadabilityReadability is Elixir library for extracting and curating articles.
Stars: ✭ 188 (+358.54%)
Just ReadA customizable read mode web extension.
Stars: ✭ 874 (+2031.71%)
PoReader本地小说阅读器,支持深色模式,Wifi传书,代码简洁有注释(local text reader, support dark modal, upload text by wifi)
Stars: ✭ 41 (+0%)
Instagram-to-discordMonitor instagram user account and automatically post new images to discord channel via a webhook. Working 2022!
Stars: ✭ 113 (+175.61%)
scrapersscrapers for building your own image databases
Stars: ✭ 46 (+12.2%)
Cloudflare ScrapeA Python module to bypass Cloudflare's anti-bot page.
Stars: ✭ 2,606 (+6256.1%)
Simpread简悦 ( SimpRead ) - 让你瞬间进入沉浸式阅读的扩展
Stars: ✭ 5,352 (+12953.66%)
scrapmanRetrieve real (with Javascript executed) HTML code from an URL, ultra fast and supports multiple parallel loading of webs
Stars: ✭ 21 (-48.78%)
Elixir ScrapeScrape any website, article or RSS/Atom Feed with ease!
Stars: ✭ 306 (+646.34%)
FerretDeclarative web scraping
Stars: ✭ 4,837 (+11697.56%)
Whatsapp-NetGenerate a network graph of connections from your WhatsApp groups data
Stars: ✭ 75 (+82.93%)
Nim websitecreatorNim fullstack website framework - deploy a website within minutes
Stars: ✭ 124 (+202.44%)
PywebcopyPython library to mirror webpage and websites.
Stars: ✭ 156 (+280.49%)
tvseriesTV Series is a tool that scrapes Episode Synopsis' of popular TV Series' from websites like Wikipedia / IMDb and show in one place with a user-friendly navigation UI.
Stars: ✭ 37 (-9.76%)
DocumentReader-iOSiOS Framework for reading and validation of identification documents
Stars: ✭ 54 (+31.71%)
Flutterweb PizzaSimple Pizza webpage made with Flutter Web
Stars: ✭ 93 (+126.83%)
Brew.sh🔖 The Homebrew homepage
Stars: ✭ 91 (+121.95%)
osmosfeedTurn GitHub into an RSS reader
Stars: ✭ 839 (+1946.34%)
MachineLearningMachine learning for beginner(Data Science enthusiast)
Stars: ✭ 104 (+153.66%)
GoscraperGolang pkg to quickly return a preview of a webpage (title/description/images)
Stars: ✭ 72 (+75.61%)
SinglefilezWeb Extension for Firefox/Chrome/MS Edge and CLI tool to save a faithful copy of an entire web page in a self-extracting HTML/ZIP polyglot file
Stars: ✭ 882 (+2051.22%)
1click Webpage ScreenshotEntire page Screenshot extension for Google Chrome. I'm developing open source extension for Google Chrome. All extension are free for use. Let's make Chrome great again!
Stars: ✭ 406 (+890.24%)
LRReaderA feature-complete reader and client for LANraragi
Stars: ✭ 62 (+51.22%)
Webpage2htmlsave/convert web pages to a standalone editable html file for offline archive/view/edit/play/whatever
Stars: ✭ 323 (+687.8%)
Balena DashBuild a Raspberry Pi based desktop dashboard for stats, photos, videos and more!
Stars: ✭ 292 (+612.2%)
attributesPHP Attributes Reader. Subtree split of the Spiral Attributes component (see spiral/framework)
Stars: ✭ 22 (-46.34%)
google-scraperThis class can retrieve search results from Google.
Stars: ✭ 33 (-19.51%)
spoti-voteWeb application to vote the next Song in Spotify Queue
Stars: ✭ 14 (-65.85%)
urlbox-screenshots-nodeCapture website thumbnails using the urlbox.io screenshot as a service API in node
Stars: ✭ 14 (-65.85%)
TradeTheEventImplementation of "Trade the Event: Corporate Events Detection for News-Based Event-Driven Trading." In Findings of ACL2021
Stars: ✭ 64 (+56.1%)
img-cliAn interactive Command-Line Interface Build in NodeJS for downloading a single or multiple images to disk from URL
Stars: ✭ 15 (-63.41%)
AG NTRIP ESPAG Rooftop controller with NTRIP client and IMU (ESP32 Controller)
Stars: ✭ 25 (-39.02%)
scotch-scraping-nodeSimple app for scraping author profiles and tutorials from Scotch.io - https://scotch.io.
Stars: ✭ 15 (-63.41%)
Pahe.ph-ScraperPahe.ph [Pahe.in] Movies Website Scraper
Stars: ✭ 57 (+39.02%)
MangDLThe most inefficient Manga downloader for PC
Stars: ✭ 40 (-2.44%)
ulboracmsUlbora CMS is a self-contained CMS (no database needed) written in Golang. It uses a JSON datastore with content saved in both json files and in memory. You can download and upload a single binary backup file containing content, images, and templates as needed. It also has a built-in mail sender.
Stars: ✭ 42 (+2.44%)
DomModern DOM API.
Stars: ✭ 88 (+114.63%)
ChampA Telegram bot combined with python to serve some basic functions like weather, music charts, cricket score and much more.
Stars: ✭ 22 (-46.34%)
jekyll-dataA plugin to read '_config.yml' and data files within Jekyll theme gems
Stars: ✭ 40 (-2.44%)
ryuanimeA free anime streaming , using the jkanime content by scraping the jkanime website.
Stars: ✭ 20 (-51.22%)
pageshotPageshot as a service.
Stars: ✭ 45 (+9.76%)