All Projects → chirps → Similar Projects or Alternatives

622 Open source projects that are alternatives of or similar to chirps

schedule-tweet

Schedules tweets using TweetDeck

Stars: ✭ 14 (-60%)

Mutual labels: twitter, scraping

Socialreaper

Social media scraping / data collection library for Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs

Stars: ✭ 338 (+865.71%)

Mutual labels: twitter, scraping

Social Media Profiles Regexs

📇 Extract social media profiles and more with regular expressions

Stars: ✭ 324 (+825.71%)

Mutual labels: twitter, scraping

Reaper

Social media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs

Stars: ✭ 240 (+585.71%)

Mutual labels: twitter, scraping

rubium

Rubium is a lightweight alternative to Selenium/Capybara/Watir if you need to perform some operations (like web scraping) using Headless Chromium and Ruby

Stars: ✭ 65 (+85.71%)

Mutual labels: scraping

ha-multiscrape

Home Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.

Stars: ✭ 103 (+194.29%)

Mutual labels: scraping

scrapman

Retrieve real (with Javascript executed) HTML code from an URL, ultra fast and supports multiple parallel loading of webs

Stars: ✭ 21 (-40%)

Mutual labels: scraping

LInkedIn-Reverese-Lookup

🔎Search LinkedIn profile by email address📧

Stars: ✭ 20 (-42.86%)

Mutual labels: scraping

ferenda

Transform unstructured document collections to structured Linked Data

Stars: ✭ 22 (-37.14%)

Mutual labels: scraping

angel.co-companies-list-scraping

No description or website provided.

Stars: ✭ 54 (+54.29%)

Mutual labels: scraping

PrawWallpaperDownloader

Download images from reddit

Stars: ✭ 18 (-48.57%)

Mutual labels: scraping

Instagram-to-discord

Monitor instagram user account and automatically post new images to discord channel via a webhook. Working 2022!

Stars: ✭ 113 (+222.86%)

Mutual labels: scraping

chesf

CHeSF is the Chrome Headless Scraping Framework, a very very alpha code to scrape javascript intensive web pages

Stars: ✭ 18 (-48.57%)

Mutual labels: scraping

zcrawl

An open source web crawling platform

Stars: ✭ 21 (-40%)

Mutual labels: scraping

feedsearch-crawler

Crawl sites for RSS, Atom, and JSON feeds.

Stars: ✭ 23 (-34.29%)

Mutual labels: scraping

puppeteer-botcheck

🕵‍♂ Bot detection tests for Puppeteer. Hide and seek!

Stars: ✭ 42 (+20%)

Mutual labels: scraping

ogpParser

Open Graph Protocol Parser for Node.js

Stars: ✭ 43 (+22.86%)

Mutual labels: scraping

crawling-framework

Easily crawl news portals or blog sites using Storm Crawler.

Stars: ✭ 22 (-37.14%)

Mutual labels: scraping

kuwala

Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…

Stars: ✭ 474 (+1254.29%)

Mutual labels: scraping

browser-pool

A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.

Stars: ✭ 71 (+102.86%)

Mutual labels: scraping

covid19br-pub

Projeto de monitoramento de publicações oficiais relacionadas a COVID-19 no Brasil.

Stars: ✭ 12 (-65.71%)

Mutual labels: scraping

oversmash

Overwatch API library for player details and career stats

Stars: ✭ 42 (+20%)

Mutual labels: scraping

ioweb

Web Scraping Framework

Stars: ✭ 31 (-11.43%)

Mutual labels: scraping

internet-affordability

🌍 Dataset that shows the Internet affordability by country (a shocking reality!)

Stars: ✭ 13 (-62.86%)

Mutual labels: scraping

Scrapping

Mastering the art of scrapping 🎓

Stars: ✭ 24 (-31.43%)

Mutual labels: scraping

scrapy-fieldstats

A Scrapy extension to log items coverage when the spider shuts down

Stars: ✭ 17 (-51.43%)

Mutual labels: scraping

proxycrawl-python

ProxyCrawl Python library for scraping and crawling

Stars: ✭ 51 (+45.71%)

Mutual labels: scraping

document-dl

Command line program to download documents from web portals.

Stars: ✭ 14 (-60%)

Mutual labels: scraping

htmltab

Command-line utility to convert HTML tables into CSV files

Stars: ✭ 13 (-62.86%)

Mutual labels: scraping

subscene scraper

Library to download subtitles from subscene.com

Stars: ✭ 14 (-60%)

Mutual labels: scraping

browser-automation-api

Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other things like capture a screenshot, generate pdf, extract content or execute custom Puppeteer, Playwright functions.

Stars: ✭ 24 (-31.43%)

Mutual labels: scraping

go-scrapy

Web crawling and scraping framework for Golang

Stars: ✭ 17 (-51.43%)

Mutual labels: scraping

ksoup

Kotlin Wrapper for Jsoup

Stars: ✭ 59 (+68.57%)

Mutual labels: scraping

web-clipper

Easily download the main content of a web page in html, markdown, and/or epub format from command line.

Stars: ✭ 15 (-57.14%)

Mutual labels: scraping

html-table-extractor

extract data from html table

Stars: ✭ 74 (+111.43%)

Mutual labels: scraping

wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

Stars: ✭ 52 (+48.57%)

Mutual labels: scraping

yttrex

youtube & tiktok analysis + youchoose recommendation custmizer. backend, extensions, and tooling

Stars: ✭ 31 (-11.43%)

Mutual labels: scraping

Captcha-Tools

All-in-one Python (And now Go!) module to help solve captchas with Capmonster, 2captcha and Anticaptcha API's!

Stars: ✭ 23 (-34.29%)

Mutual labels: scraping

selectorlib

A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them

Stars: ✭ 53 (+51.43%)

Mutual labels: scraping

sg-food-ml

This script is used to scrap images from the Internet to classify 5 common noodle "mee" dishes in Singapore. Wanton Mee, Bak Chor Mee, Lor Mee, Prawn Mee and Mee Siam.

Stars: ✭ 18 (-48.57%)

Mutual labels: scraping

reason-rust-scraper

🦀 Scraping & crawling websites using Rust, and ReasonML

Stars: ✭ 21 (-40%)

Mutual labels: scraping

Scraper-Projects

🕸 List of mini projects that involve web scraping 🕸

Stars: ✭ 25 (-28.57%)

Mutual labels: scraping

docker-selenium-lambda

The simplest demo of chrome automation by python and selenium in AWS Lambda

Stars: ✭ 172 (+391.43%)

Mutual labels: scraping

torchestrator

Spin up Tor containers and then proxy HTTP requests via these Tor instances

Stars: ✭ 32 (-8.57%)

Mutual labels: scraping

node-red-contrib-nbrowser

Provides a virtual web browser (a.k.a. "headless browser") appearing as a node.

Stars: ✭ 31 (-11.43%)

Mutual labels: scraping

proxi

Proxy pool. Finds and checks proxies with rest api for querying results. Can find over 25k proxies in under 5 minutes.

Stars: ✭ 32 (-8.57%)

Mutual labels: scraping

asyncio-hn

Python (asyncio) wrapper for hackernews api

Stars: ✭ 27 (-22.86%)

Mutual labels: scraping

scavenger

Scrape and take screenshots of dynamic and static webpages

Stars: ✭ 14 (-60%)

Mutual labels: scraping

shorter.recipes

A website dedicated to making recipes from any website easy to read.

Stars: ✭ 27 (-22.86%)

Mutual labels: scraping

AngleParse

HTML parsing and processing tool for PowerShell.

Stars: ✭ 35 (+0%)

Mutual labels: scraping

anime-scraper

[partially working] Scrape and add anime episode stream URLs to uGet (Linux) or IDM (Windows) ~ Python3

Stars: ✭ 21 (-40%)

Mutual labels: scraping

diffbot-php-client

[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library

Stars: ✭ 53 (+51.43%)

Mutual labels: scraping

RARBG-scraper

With Selenium headless browsing and CAPTCHA solving

Stars: ✭ 38 (+8.57%)

Mutual labels: scraping

4cat

The 4CAT Capture and Analysis Toolkit provides modular data capture & analysis for a variety of social media platforms.

Stars: ✭ 144 (+311.43%)

Mutual labels: scraping

scrapy-distributed

A series of distributed components for Scrapy. Including RabbitMQ-based components, Kafka-based components, and RedisBloom-based components for Scrapy.

Stars: ✭ 38 (+8.57%)

Mutual labels: scraping

copycat

A PHP Scraping Class

Stars: ✭ 70 (+100%)

Mutual labels: scraping

ScrapeBot

A Selenium-driven tool for automated website interaction and scraping.

Stars: ✭ 16 (-54.29%)

Mutual labels: scraping

html-table-to-json

Generate JSON representations of HTML tables