Top 111 webscraping open source projects

hk0weather
Web scraper project to collect useful Hong Kong weather data from the HKO website
koishi
Python wrapper for the unofficial scraped API of the satori testing system.
TrackPurchase
Scrape your payment history from various shopping platforms with just a few lines of code!
metacritic api
PHP Metacritic API - Mirrored by my GitLab
scrapism
a work-in-progress guide to web scraping as an artistic and critical practice
image-crawler
An image scraper that scrapes images from unsplash.com
repository.colossus
Colossus Repository for Kodi Addons - Kodi is a registered trademark of the XBMC Foundation. We are not connected to or in any other way affiliated with Kodi - DMCA: [email protected]
Mimo-Crawler
A web crawler that uses Firefox and JS injection to interact with webpages and crawl their content, written in Node.js.
Android-Web-Scraper
Android Web Scraper is a simple library for Android web automation. You can perform web tasks in the background to fetch website data programmatically.
ir
Project to automatically calculate income tax (Imposto de Renda) on Bovespa trades. Tags: canal eletronico do investidor, CEI, selenium, bovespa, IRPF, IR, imposto de renda, finance, yahoo finance, acao, fii, etf, python, crawler, webscraping, calculadora ir
chesf
CHeSF is the Chrome Headless Scraping Framework, very alpha code for scraping JavaScript-intensive web pages
supervised-machine-learning
This repo contains regression and classification projects. Examples: development of predictive models for comments on social media websites; building classifiers to predict outcomes in sports competitions; churn analysis; prediction of clicks on online ads; analysis of the opioids crisis and an analysis of retail store expansion strategies using…
VideoRecognition-realtime-autotrainer-alerts
State-of-the-art real-time object detection using the YOLOv3 algorithm, augmented with a process that allows easy training of the classifier as a plug & play solution. Provides an alert if an item in an alert list is detected.
web-scraping-101
An Introduction to Web Scraping
requestsR
R interface to Python requests module
Email-Crawler-Lead-Generator
This email crawler visits all pages of a provided website, parses them, and saves the emails it finds to a CSV file.
Catalyst
A VS Code extension to accelerate the process of solving problems on Codeforces.
ebayMarketAnalyzer
Scrape all eBay sold listings to determine average/median pricing, plot listings over time with trend lines, and export to Excel
gotor
This program provides efficient web scraping services for Tor and non-Tor sites. The program has both a CLI and REST API.
youtube-audio
Extract videos from YouTube in audio format using web scraping techniques 🎶
CourseDownloader
GUI app for downloading whole online courses, with folder structure, from a single URL
browser-automation-api
Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other things, such as capture a screenshot, generate a PDF, extract content, or execute custom Puppeteer or Playwright functions.
robotstxt
robots.txt file parsing and checking for R
non-api-fb-scraper
Scrape public Facebook posts from any group or user into a .csv file without needing to register for any API access
Youtube-Scraping-Selenium
Automatically creates a YouTube channel dashboard
Sneakers Project
Using Selenium, Neha scraped data about the 35 top-selling Nike and Adidas sneakers from stockx.com. She used this data to draw insights about sneaker resales.
fBrowser
Helpful Selenium functions to make web-scraping easier and faster
android-web-scraping-app-jsoup
Sometimes we need to scrape web data from an Android app. The jsoup library is a good option for this. I wrote a blog post on this topic on my personal blog. If you read Bengali, you can visit this link.
animeflv
Animeflv is a custom API that has the entire catalog of the animeflv.net website. You can enjoy all the content with subtitles in Spanish and the latest in the world of anime for free.
medium-scrapper
Scrape Medium articles by tag.
BookingScraper
🌎 🏨 Scrape Booking.com 🏨 🌎
toronto-apartment-finder
[really old and probably doesn't work] Slack bot to post relevant Toronto apartment listings from Kijiji & Craigslist
google scraper live view
Application for extracting large amounts of data from the Google search results page
super-anime-downloader
A program which takes an Anime name or URL and downloads the specified range of episodes.
google-search-results-nodejs
SerpApi client library for Node.js. Previously: Google Search Results Node.js.
mirror-mirror
A library to get images from social media
GoodReadsScraper
📚 A GoodReads.com scraper script to get book reviews, including text and rating.
NordVPN-switcher
Rotate between different NordVPN servers with ease. Works both on Linux and Windows without any required changes to your code!
aws-pdf-textract-pipeline
🔍 Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS Textract. Built with AWS CDK + TypeScript