All Categories → No Category → scrape

Top 21 scrape open source projects

Cloudflare Scrape

A Python module to bypass Cloudflare's anti-bot page.

✭ 2,606

python Makefile cloudflare anti-bot-page protected-page scrape scraping-websites

Instagram Scraper

scrapes medias, likes, followers, tags and all metadata. Inspired by instagram-php-scraper,bot

✭ 2,209

python bot crawler instagram scraper scrape ig igramscraper

Twint

An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.

✭ 12,102

python Dockerfile elasticsearch twitter osint kibana tweets scrape twint tweep scrape-followers scrape-likes scrape-following

Autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

✭ 4,077

python machine-learning automation artificial-intelligence ai crawler scraper scraping web-scraping webscraping scrape webautomation

imgur-scraper

Retrieve years of imgur.com's data without any authentication.

✭ 26

python machine-learning data-mining imgur pypi scrape imgur-api command-line-tool no-authentication imgur-scraper hacktoberfest2021

InstagramLocationScraper

No description or website provided.

✭ 13

python shell instagram crawler scraper location selenium instagram-scraper scrape

PastaBean

Python Script to Scrape Pastebin with Regex

✭ 0

python api json regex pastebin bean requests pip scrape pasta 2017 pastabean tu5k4rr

cero

Scrape domain names from SSL certificates of arbitrary hosts

✭ 316

go Makefile tls ssl scrape recon websecurity domain-names

Crawler pubg.op.gg

This is a web crawler for pubg.op.gg, written by Ruichong Liu. 绝地求生游戏数据抓取

✭ 15

python crawler selenium scrape pubg beautifulsoup4

Spider

Spider项目将会不断更新本人学习使用过的爬虫方法！！！

✭ 16

python spider selenium scrape

ha-multiscrape

Home Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.

✭ 103

python scraper rest sensor scraping scrape home-assistant hacs home-assistant-custom

GChan

Scrape boards and threads from 4chan (8kun WIP). Downloads images, videos and HTML if desired.

✭ 31

C#scraper dotnet daemon winforms scrape 4chan 4chan-downloader gchan 4chan-scraper

Scrape-Finance-Data-v2

A standalone package to scrape financial data from listed Vietnamese companies via Vietstock

✭ 45

python shell Dockerfile docker redis finance data scrape

diffbot-php-client

[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library