Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other things like capture a screenshot, generate pdf, extract content or execute custom Puppeteer, Playwright functions.

Stars: ✭ 24 (+0%)

Mutual labels: scraping

double-agent

A test suite of common scraper detection techniques. See how detectable your scraper stack is.

Stars: ✭ 123 (+412.5%)

Mutual labels: scraping

docker-selenium-lambda

The simplest demo of chrome automation by python and selenium in AWS Lambda

Stars: ✭ 172 (+616.67%)

Mutual labels: scraping

info-bot

🤖 A Versatile Telegram Bot

Stars: ✭ 37 (+54.17%)

Mutual labels: scraping

proxycrawl-python

ProxyCrawl Python library for scraping and crawling

Stars: ✭ 51 (+112.5%)

Mutual labels: scraping

readability-cli

A CLI for Mozilla Readability. Get clean, uncluttered, ready-to-read HTML from any webpage!

Stars: ✭ 41 (+70.83%)

Mutual labels: scraping

asyncio-hn

Python (asyncio) wrapper for hackernews api

Stars: ✭ 27 (+12.5%)

Mutual labels: scraping

PythonScrapyBasicSetup

Basic setup with random user agents and IP addresses for Python Scrapy Framework.

Stars: ✭ 57 (+137.5%)

Mutual labels: scraping

html-table-extractor

extract data from html table

Stars: ✭ 74 (+208.33%)

Mutual labels: scraping

github-languages

Tiny little ruby on rails website that crawls though your public github repos to find out what your favourite languages are.

Stars: ✭ 23 (-4.17%)

Mutual labels: scraping

diffbot-php-client

[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library

Stars: ✭ 53 (+120.83%)

Mutual labels: scraping

ScrapeBot

A Selenium-driven tool for automated website interaction and scraping.

Stars: ✭ 16 (-33.33%)

Mutual labels: scraping

google-scraper

This class can retrieve search results from Google.

Stars: ✭ 33 (+37.5%)

Mutual labels: scraping

LInkedIn-Reverese-Lookup

🔎Search LinkedIn profile by email address📧

Stars: ✭ 20 (-16.67%)

Mutual labels: scraping

Architeuthis

MITM HTTP(S) proxy with integrated load-balancing, rate-limiting and error handling. Built for automated web scraping.

Stars: ✭ 35 (+45.83%)

Mutual labels: scraping

ha-multiscrape

Home Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.

Stars: ✭ 103 (+329.17%)

Mutual labels: scraping

trafilatura

Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments

Stars: ✭ 711 (+2862.5%)

Mutual labels: scraping

crawling-framework

Easily crawl news portals or blog sites using Storm Crawler.

Stars: ✭ 22 (-8.33%)

Mutual labels: scraping

etf4u

📊 Python tool to scrape real-time information about ETFs from the web and mixing them together by proportionally distributing their assets allocation

Stars: ✭ 29 (+20.83%)

Mutual labels: scraping

scrap

Scrapping Facebook with JavaScript.

Stars: ✭ 25 (+4.17%)

Mutual labels: scraping

NBA-Fantasy-Optimizer

NBA Daily Fantasy Lineup Optimizer for FanDuel Using Python

Stars: ✭ 21 (-12.5%)

Mutual labels: scraping

PrawWallpaperDownloader

Download images from reddit

Stars: ✭ 18 (-25%)

Mutual labels: scraping

gochanges

**[ARCHIVED]** website changes tracker 🔍

Stars: ✭ 12 (-50%)

Mutual labels: scraping

zcrawl

An open source web crawling platform

Stars: ✭ 21 (-12.5%)

Mutual labels: scraping

scrape-github-trending

Tutorial for web scraping / crawling with Node.js.

Stars: ✭ 42 (+75%)

Mutual labels: scraping

covid19br-pub

Projeto de monitoramento de publicações oficiais relacionadas a COVID-19 no Brasil.

Stars: ✭ 12 (-50%)

Mutual labels: scraping

Goirate

Pillaging the seven seas for torrents, pieces of eight and other bounty.

Stars: ✭ 20 (-16.67%)

Mutual labels: scraping

copycat

A PHP Scraping Class

Stars: ✭ 70 (+191.67%)

Mutual labels: scraping

chopper

Chopper is a tool to extract elements from HTML by preserving ancestors and CSS rules

Stars: ✭ 22 (-8.33%)

Mutual labels: scraping

oversmash

Overwatch API library for player details and career stats

Stars: ✭ 42 (+75%)

Mutual labels: scraping

pythonista-chromeless

Serverless selenium which dynamically execute any given code.

Stars: ✭ 31 (+29.17%)

Mutual labels: scraping

scrapman

Retrieve real (with Javascript executed) HTML code from an URL, ultra fast and supports multiple parallel loading of webs

Stars: ✭ 21 (-12.5%)

Mutual labels: scraping

scotch-scraping-node

Simple app for scraping author profiles and tutorials from Scotch.io - https://scotch.io.

Stars: ✭ 15 (-37.5%)

Mutual labels: scraping

ioweb

Web Scraping Framework

Stars: ✭ 31 (+29.17%)

Mutual labels: scraping

MachineLearning

Machine learning for beginner(Data Science enthusiast)

Stars: ✭ 104 (+333.33%)

Mutual labels: scraping

Instagram-to-discord

Monitor instagram user account and automatically post new images to discord channel via a webhook. Working 2022!

Stars: ✭ 113 (+370.83%)

Mutual labels: scraping

pickall

.NET agile and extensible web searching API

Stars: ✭ 25 (+4.17%)

Mutual labels: scraping

scrapy-fieldstats

A Scrapy extension to log items coverage when the spider shuts down

Stars: ✭ 17 (-29.17%)

Mutual labels: scraping

tvseries

TV Series is a tool that scrapes Episode Synopsis' of popular TV Series' from websites like Wikipedia / IMDb and show in one place with a user-friendly navigation UI.

Stars: ✭ 37 (+54.17%)

Mutual labels: scraping

puppeteer-botcheck

🕵‍♂ Bot detection tests for Puppeteer. Hide and seek!

Stars: ✭ 42 (+75%)

Mutual labels: scraping

RARBG-scraper

With Selenium headless browsing and CAPTCHA solving

Stars: ✭ 38 (+58.33%)

Mutual labels: scraping

anime-scraper

[partially working] Scrape and add anime episode stream URLs to uGet (Linux) or IDM (Windows) ~ Python3