syncabook📖🎧 A tool for creating ebooks with synchronized text and audio (EPUB3 with Media Overlays)
Stars: ✭ 70 (+400%)
WaWebSessionHandler(DISCONTINUED) Save WhatsApp Web Sessions as files and open them everywhere!
Stars: ✭ 27 (+92.86%)
faexportThe API for Furaffinity you wish existed
Stars: ✭ 61 (+335.71%)
actor-content-checkerYou can use this act to monitor any page's content and get a notification when content changes.
Stars: ✭ 16 (+14.29%)
kobo-book-downloaderA tool to download and remove DRM from your purchased Kobo.com ebooks and audiobooks.
Stars: ✭ 171 (+1121.43%)
OLX Scraper📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.
Stars: ✭ 15 (+7.14%)
iowebWeb Scraping Framework
Stars: ✭ 31 (+121.43%)
tableau-scrapingTableau scraper python library. R and Python scripts to scrape data from Tableau viz
Stars: ✭ 91 (+550%)
extractnetA Dragnet that also extract author, headline, date, keywords from context
Stars: ✭ 52 (+271.43%)
Neural-Scam-ArtistWeb Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.
Stars: ✭ 18 (+28.57%)
crawlzoneCrawlzone is a fast asynchronous internet crawling framework for PHP.
Stars: ✭ 70 (+400%)
grailerweb scraping tool for grailed.com
Stars: ✭ 30 (+114.29%)
IMDB-ScraperScrapy project for scraping data from IMDB with Movie Dataset including 58,623 movies' data.
Stars: ✭ 37 (+164.29%)
rymscraperPython API to extract data from rateyourmusic.com.
Stars: ✭ 63 (+350%)
heroshiHeroshi – open source web crawler.
Stars: ✭ 51 (+264.29%)
TikTokDownloader PyWebIO🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音|TikTok数据爬取工具,支持API调用,在线批量解析及下载。
Stars: ✭ 919 (+6464.29%)
Node-js-functionalitiesThis repository contains very useful restful API's and functionalities in node-js containing many important tutorial code for mastering node-js, all tutorials have been published on medium.com, tutorials link is given below
Stars: ✭ 69 (+392.86%)
investigation-amazon-brandsMaterials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Takes the Buy Box, it Doesn’t Give it up"
Stars: ✭ 56 (+300%)
web-poetWeb scraping Page Objects core library
Stars: ✭ 67 (+378.57%)
iwwAI based web-wrapper for web-content-extraction
Stars: ✭ 61 (+335.71%)
2017-summer-workshopExercises, data, and more for our 2017 summer workshop (funded by the Estes Fund and in partnership with Project Jupyter and Berkeley's D-Lab)
Stars: ✭ 33 (+135.71%)
PythonScrapyBasicSetupBasic setup with random user agents and IP addresses for Python Scrapy Framework.
Stars: ✭ 57 (+307.14%)
htmlunit🕸🧰☕️Tools to Scrape Dynamic Web Content via the 'HtmlUnit' Java Library
Stars: ✭ 39 (+178.57%)
Data-Wrangling-with-PythonSimplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Stars: ✭ 90 (+542.86%)
restaurant-finder-featureReviewsBuild a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).
Stars: ✭ 21 (+50%)
cl-torrentsSearching torrents on popular trackers - CLI, readline, GUI, web client. Tutorial and binaries (issue tracker on https://gitlab.com/vindarel/cl-torrents/)
Stars: ✭ 83 (+492.86%)
audiobookshelfSelf-hosted audiobook and podcast server
Stars: ✭ 1,316 (+9300%)
selectorlibA library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them
Stars: ✭ 53 (+278.57%)
codechef-rank-comparatorWeb application hosted on Heroku cloud platform based on web scraping in python using lxml library (XML Path Language).
Stars: ✭ 23 (+64.29%)
Pythoncovers python basic to advance topics, practice questions, logical problems in python, web development using html, css, bootstrap, jquery, DOM, Django 🚀🚀. 💥 🌈
Stars: ✭ 29 (+107.14%)
automation-scriptsSimple scripts that I'm using to automate the boring things.
Stars: ✭ 14 (+0%)
reapr🕸→ℹ️ Reap Information from Websites
Stars: ✭ 14 (+0%)
top-github-scraperScape top GitHub repositories and users based on keywords
Stars: ✭ 40 (+185.71%)
savedditBulk Downloader for Reddit
Stars: ✭ 130 (+828.57%)
leetcode-compensationCompensation analysis on the posts scraped from leetcode.com/discuss/compensation. At present, the reports have been generated only for Indian cities.
Stars: ✭ 83 (+492.86%)
Stock-Market-PredictorStock Market Predictor with LSTM network. Web scraping and analyzing tools (ohlc, mean)
Stars: ✭ 28 (+100%)
sp-subway-scraper🚆This web scraper builds a dataset for São Paulo subway operation status
Stars: ✭ 24 (+71.43%)
scrapy-wayback-machineA Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.
Stars: ✭ 92 (+557.14%)
rreddit𝐫⟋ Get Reddit data
Stars: ✭ 49 (+250%)
trafilaturaPython & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (+4978.57%)
codepen-puppeteerUse Puppeteer to download pens from Codepen.io as single html pages
Stars: ✭ 22 (+57.14%)
browser-poolA Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.
Stars: ✭ 71 (+407.14%)
coreThe complete web scraping toolkit for PHP.
Stars: ✭ 1,110 (+7828.57%)
GSoC-Data-AnalyserSimple search for organisations participating/participated in the GSoC
Stars: ✭ 29 (+107.14%)
reading-listMy reading list since January 1996. Commits include comments on what I read beginning in June 2015
Stars: ✭ 34 (+142.86%)
HiA Programming language for Web Scraping
Stars: ✭ 14 (+0%)
lopezCrawling and scraping the Web for fun and profit
Stars: ✭ 20 (+42.86%)
scraping-ebayScraping Ebay's products using Scrapy Web Crawling Framework
Stars: ✭ 79 (+464.29%)
IdealMediaAwesome app to listen music and audiobooks on the device and online at vk.com. Search, download, set as ringtone, sort by albums, authors, folder. Powerful equalizer.
Stars: ✭ 28 (+100%)
linkextractorA Docker tutorial using a link extraction application example
Stars: ✭ 41 (+192.86%)
halfstaff🇺🇸 Is the US flag at half-staff?
Stars: ✭ 22 (+57.14%)
actor-scraperHouse of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.
Stars: ✭ 83 (+492.86%)
librivox-catalogLibriVox catalog and reader workflow application
Stars: ✭ 20 (+42.86%)
Linkedin-ClientWeb scraper for grabing data from Linkedin profiles or company pages (personal project)
Stars: ✭ 42 (+200%)