Instagram ScraperScrape the Instagram frontend. Inspired from twitter-scraper by @kennethreitz.
Stars: ✭ 903 (+5211.76%)
Instagram-to-discordMonitor instagram user account and automatically post new images to discord channel via a webhook. Working 2022!
Stars: ✭ 113 (+564.71%)
Instagram-Comments-ScraperInstagram comment scraper using python and selenium. Save the comments into excel.
Stars: ✭ 73 (+329.41%)
dustArchive web pages with all relevant assets or save as a single file HTML
Stars: ✭ 19 (+11.76%)
gunaydinYour good mornings ☀️
Stars: ✭ 16 (-5.88%)
AngleParseHTML parsing and processing tool for PowerShell.
Stars: ✭ 35 (+105.88%)
proxiProxy pool. Finds and checks proxies with rest api for querying results. Can find over 25k proxies in under 5 minutes.
Stars: ✭ 32 (+88.24%)
go-scrapyWeb crawling and scraping framework for Golang
Stars: ✭ 17 (+0%)
igFame📷 igFame - Tool for automated Instagram interactions [PHP]
Stars: ✭ 16 (-5.88%)
ogpParserOpen Graph Protocol Parser for Node.js
Stars: ✭ 43 (+152.94%)
web-clipperEasily download the main content of a web page in html, markdown, and/or epub format from command line.
Stars: ✭ 15 (-11.76%)
TorScrapperA Scraper made 100% in Python using BeautifulSoup and Tor. It can be used to scrape both normal and onion links. Happy Scraping :)
Stars: ✭ 24 (+41.18%)
subscene scraperLibrary to download subtitles from subscene.com
Stars: ✭ 14 (-17.65%)
raspagem-de-dados-fatec📓 Minicurso de raspagem de dados web com Python ministrado na Semana de Tecnologia da FATEC Jundiaí
Stars: ✭ 22 (+29.41%)
Captcha-ToolsAll-in-one Python (And now Go!) module to help solve captchas with Capmonster, 2captcha and Anticaptcha API's!
Stars: ✭ 23 (+35.29%)
humanparserParse a human name string into salutation, first name, middle name, last name, suffix.
Stars: ✭ 78 (+358.82%)
scrapy-distributedA series of distributed components for Scrapy. Including RabbitMQ-based components, Kafka-based components, and RedisBloom-based components for Scrapy.
Stars: ✭ 38 (+123.53%)
ARGUSARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9
Stars: ✭ 68 (+300%)
chesfCHeSF is the Chrome Headless Scraping Framework, a very very alpha code to scrape javascript intensive web pages
Stars: ✭ 18 (+5.88%)
scrapy facebookerCollection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.
Stars: ✭ 22 (+29.41%)
rubiumRubium is a lightweight alternative to Selenium/Capybara/Watir if you need to perform some operations (like web scraping) using Headless Chromium and Ruby
Stars: ✭ 65 (+282.35%)
chirpsTwitter bot powering @arichduvet
Stars: ✭ 35 (+105.88%)
browser-poolA Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.
Stars: ✭ 71 (+317.65%)
naos📉 Uptime and error monitoring CLI
Stars: ✭ 30 (+76.47%)
ZeiverA Scraper, Downloader, & Recorder for static open directories.
Stars: ✭ 14 (-17.65%)
kuwalaKuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
Stars: ✭ 474 (+2688.24%)
scraperNodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Supported databases: SQLite, MySQL, PostgreSQL. Supported headless clients: Puppeteer, Playwright, Cheerio, JSdom.
Stars: ✭ 37 (+117.65%)
top-github-scraperScape top GitHub repositories and users based on keywords
Stars: ✭ 40 (+135.29%)
papercutPapercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-11.76%)
nanogram.js📷 An easy-to-use and simple Instagram package that allows you to fetch media content without API and access token.
Stars: ✭ 62 (+264.71%)
jazzThe Scripting Engine that Combines Speed, Safety, and Simplicity
Stars: ✭ 132 (+676.47%)
InstantInstaAndroid Application To Download and Manage Instagram Images And Videos
Stars: ✭ 47 (+176.47%)
ferendaTransform unstructured document collections to structured Linked Data
Stars: ✭ 22 (+29.41%)
memes-apiAPI for scrapping common meme sites
Stars: ✭ 17 (+0%)
internet-affordability🌍 Dataset that shows the Internet affordability by country (a shocking reality!)
Stars: ✭ 13 (-23.53%)
scrapy-zyte-smartproxyZyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy
Stars: ✭ 317 (+1764.71%)
instastory.jsThis is a jQuery plugin to make it easy to get a feed from instagram. No need of access tokens and other stuff, Only thing needed is jQuery.
Stars: ✭ 36 (+111.76%)
auto-Instagram-posting-botA bot that downloads 9gag and Instagram posts, and re-uploads it to your Instagram account
Stars: ✭ 87 (+411.76%)
document-dlCommand line program to download documents from web portals.
Stars: ✭ 14 (-17.65%)
pompScreen scraping and web crawling framework
Stars: ✭ 61 (+258.82%)
Instagram-Giveaways-WinnerInstagram Bot which when given a post url will spam mentions to increase the chances of winning. Win Instagram Giveaways!
Stars: ✭ 95 (+458.82%)
webdextIntelligent Web Data Extractor
Stars: ✭ 75 (+341.18%)
dmi-instascraperA GUI for Instaloader to scrape users and hashtags with on Instagram
Stars: ✭ 21 (+23.53%)
wget-luaWget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (+205.88%)
bots-zooNo description or website provided.
Stars: ✭ 59 (+247.06%)
sg-food-mlThis script is used to scrap images from the Internet to classify 5 common noodle "mee" dishes in Singapore. Wanton Mee, Bak Chor Mee, Lor Mee, Prawn Mee and Mee Siam.
Stars: ✭ 18 (+5.88%)
shupA POSIX shell script to parse HTML
Stars: ✭ 28 (+64.71%)
torchestratorSpin up Tor containers and then proxy HTTP requests via these Tor instances
Stars: ✭ 32 (+88.24%)
PyLexPerform lexical analysis on words, one word at a time.
Stars: ✭ 60 (+252.94%)
scavengerScrape and take screenshots of dynamic and static webpages
Stars: ✭ 14 (-17.65%)
image-collectorDownload images from Google Image Search
Stars: ✭ 38 (+123.53%)
Instagram-Scraper-2021Scrape Instagram content and stories anonymously, using a new technique based on the har file (No Token + No public API).
Stars: ✭ 57 (+235.29%)
facebook-discussion-tkA collection of tools to (semi-)automatically collect and analyze data from online discussions on Facebook groups and pages.
Stars: ✭ 33 (+94.12%)
policy-data-analyzerBuilding a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.
Stars: ✭ 22 (+29.41%)