turtleInstagram Photo Downloader
Stars: ✭ 15 (-37.5%)
node-red-contrib-nbrowserProvides a virtual web browser (a.k.a. "headless browser") appearing as a node.
Stars: ✭ 31 (+29.17%)
socials👨👩👦 Social account detection and extraction in Python, e.g. for crawling/scraping.
Stars: ✭ 37 (+54.17%)
Pahe.ph-ScraperPahe.ph [Pahe.in] Movies Website Scraper
Stars: ✭ 57 (+137.5%)
reason-rust-scraper🦀 Scraping & crawling websites using Rust, and ReasonML
Stars: ✭ 21 (-12.5%)
ksoupKotlin Wrapper for Jsoup
Stars: ✭ 59 (+145.83%)
Euro2016 TerminalApp⚽ Instantly find 🏆EURO 2016 live-streams & highlights, now a Web App!
Stars: ✭ 54 (+125%)
shorter.recipesA website dedicated to making recipes from any website easy to read.
Stars: ✭ 27 (+12.5%)
Whatsapp-NetGenerate a network graph of connections from your WhatsApp groups data
Stars: ✭ 75 (+212.5%)
selectorlibA library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them
Stars: ✭ 53 (+120.83%)
scrapersscrapers for building your own image databases
Stars: ✭ 46 (+91.67%)
browser-automation-apiBrowser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other things like capture a screenshot, generate pdf, extract content or execute custom Puppeteer, Playwright functions.
Stars: ✭ 24 (+0%)
double-agentA test suite of common scraper detection techniques. See how detectable your scraper stack is.
Stars: ✭ 123 (+412.5%)
docker-selenium-lambdaThe simplest demo of chrome automation by python and selenium in AWS Lambda
Stars: ✭ 172 (+616.67%)
info-bot🤖 A Versatile Telegram Bot
Stars: ✭ 37 (+54.17%)
proxycrawl-pythonProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (+112.5%)
readability-cliA CLI for Mozilla Readability. Get clean, uncluttered, ready-to-read HTML from any webpage!
Stars: ✭ 41 (+70.83%)
asyncio-hnPython (asyncio) wrapper for hackernews api
Stars: ✭ 27 (+12.5%)
PythonScrapyBasicSetupBasic setup with random user agents and IP addresses for Python Scrapy Framework.
Stars: ✭ 57 (+137.5%)
github-languagesTiny little ruby on rails website that crawls though your public github repos to find out what your favourite languages are.
Stars: ✭ 23 (-4.17%)
diffbot-php-client[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (+120.83%)
ScrapeBotA Selenium-driven tool for automated website interaction and scraping.
Stars: ✭ 16 (-33.33%)
google-scraperThis class can retrieve search results from Google.
Stars: ✭ 33 (+37.5%)
ArchiteuthisMITM HTTP(S) proxy with integrated load-balancing, rate-limiting and error handling. Built for automated web scraping.
Stars: ✭ 35 (+45.83%)
ha-multiscrapeHome Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.
Stars: ✭ 103 (+329.17%)
trafilaturaPython & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (+2862.5%)
crawling-frameworkEasily crawl news portals or blog sites using Storm Crawler.
Stars: ✭ 22 (-8.33%)
etf4u📊 Python tool to scrape real-time information about ETFs from the web and mixing them together by proportionally distributing their assets allocation
Stars: ✭ 29 (+20.83%)
scrapScrapping Facebook with JavaScript.
Stars: ✭ 25 (+4.17%)
gochanges**[ARCHIVED]** website changes tracker 🔍
Stars: ✭ 12 (-50%)
zcrawlAn open source web crawling platform
Stars: ✭ 21 (-12.5%)
covid19br-pubProjeto de monitoramento de publicações oficiais relacionadas a COVID-19 no Brasil.
Stars: ✭ 12 (-50%)
GoiratePillaging the seven seas for torrents, pieces of eight and other bounty.
Stars: ✭ 20 (-16.67%)
copycatA PHP Scraping Class
Stars: ✭ 70 (+191.67%)
chopperChopper is a tool to extract elements from HTML by preserving ancestors and CSS rules
Stars: ✭ 22 (-8.33%)
oversmashOverwatch API library for player details and career stats
Stars: ✭ 42 (+75%)
scrapmanRetrieve real (with Javascript executed) HTML code from an URL, ultra fast and supports multiple parallel loading of webs
Stars: ✭ 21 (-12.5%)
scotch-scraping-nodeSimple app for scraping author profiles and tutorials from Scotch.io - https://scotch.io.
Stars: ✭ 15 (-37.5%)
iowebWeb Scraping Framework
Stars: ✭ 31 (+29.17%)
MachineLearningMachine learning for beginner(Data Science enthusiast)
Stars: ✭ 104 (+333.33%)
Instagram-to-discordMonitor instagram user account and automatically post new images to discord channel via a webhook. Working 2022!
Stars: ✭ 113 (+370.83%)
pickall.NET agile and extensible web searching API
Stars: ✭ 25 (+4.17%)
scrapy-fieldstatsA Scrapy extension to log items coverage when the spider shuts down
Stars: ✭ 17 (-29.17%)
tvseriesTV Series is a tool that scrapes Episode Synopsis' of popular TV Series' from websites like Wikipedia / IMDb and show in one place with a user-friendly navigation UI.
Stars: ✭ 37 (+54.17%)
puppeteer-botcheck🕵♂ Bot detection tests for Puppeteer. Hide and seek!
Stars: ✭ 42 (+75%)
RARBG-scraperWith Selenium headless browsing and CAPTCHA solving
Stars: ✭ 38 (+58.33%)
anime-scraper[partially working] Scrape and add anime episode stream URLs to uGet (Linux) or IDM (Windows) ~ Python3
Stars: ✭ 21 (-12.5%)
InstaBotSimple and friendly Bot for Instagram, using Selenium and Scrapy with Python.
Stars: ✭ 32 (+33.33%)
htmltabCommand-line utility to convert HTML tables into CSV files
Stars: ✭ 13 (-45.83%)
yttrexyoutube & tiktok analysis + youchoose recommendation custmizer. backend, extensions, and tooling
Stars: ✭ 31 (+29.17%)
4catThe 4CAT Capture and Analysis Toolkit provides modular data capture & analysis for a variety of social media platforms.
Stars: ✭ 144 (+500%)