Heroku ebooksA script to generate Markov chains and to post to an _ebooks account on Twitter using Heroku
Scrape Linkedin Selenium`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
GetsyA simple browser/client-side web scraper.
Skrape.itA Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.
Annie👾 Fast and simple video download library and CLI tool written in Go
Scrapysharpreborn of https://bitbucket.org/rflechner/scrapysharp
Ruiji.netcrawler framework, distributed crawler extractor
Goose ParserUniversal scrapping tool, which allows you to extract data using multiple environments
Media ScraperScrapes all photos and videos in a web page / Instagram / Twitter / Tumblr / Reddit / pixiv / TikTok
Tianyanchapip安装的天眼查爬虫API,指定的单个/多个企业工商信息一键保存为Excel/JSON格式。A Battery-included Scraper API of Tianyancha, the best Chinese business data and investigation platform.
CollyElegant Scraper and Crawler Framework for Golang
Weibo terminaterFinal Weibo Crawler Scrap Anything From Weibo, comments, weibo contents, followers, anything. The Terminator
Querylist🕷️ The progressive PHP crawler framework! 优雅的渐进式PHP采集框架。
UnfurlScraper for oEmbed, Twitter Cards and Open Graph metadata - fast and Promise-based ⚡️
Anime DlAnime-dl is a command-line program to download anime from CrunchyRoll and Funimation.
Goribot[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。
GmdbGMDB is the ultra-simple, cross-platform Movie Library with Features (Search, Take Note, Watch Later, Like, Import, Learn, Instantly Torrent Magnet Watch)
ReadablewebproxyRewriting web proxy and archival tool. At this point, it just tries to download all the things.
Novel基于 Laravel 5.2 的小说网站
OpensanctionsAn open database of international sanctions data, persons of interest and politically exposed persons
Covid19 mobilityCOVID-19 Mobility Data Aggregator. Scraper of Google, Apple, Waze and TomTom COVID-19 Mobility Reports🚶🚘🚉
Instagram Scraperscrapes medias, likes, followers, tags and all metadata. Inspired by instagram-php-scraper,bot
DemeterDemeter is a tool for scraping the calibre web ui
SerpscrapSEO python scraper to extract data from major searchengine result pages. Extract data like url, title, snippet, richsnippet and the type from searchresults for given keywords. Detect Ads or make automated screenshots. You can also fetch text content of urls provided in searchresults or by your own. It's usefull for SEO and business related research tasks.
PhpscraperPHP Scraper - an highly opinionated web-interface for PHP
Google2csvGoogle2Csv a simple google scraper that saves the results on a csv/xlsx/jsonl file
Youtube ProjectsThis repository contains all the code I use in my YouTube tutorials.
Google Play ScraperGoogle play scraper for Python inspired by <facundoolano/google-play-scraper>
ZillowZillow Scraper for Python using Selenium
OnegramThis repository is no longer maintained.
UdemycoursegrabberYour will to enroll in Udemy course is here, but the money isn't? Search no more! This python program searches for your desired course in more than [insert big number here] websites, compares the last updated date, and gives you the download link of the latest one back, but you also have the choice to see the other ones as well!
NewspaperNews, full-text, and article metadata extraction in Python 3. Advanced docs:
ProxyscrapePython library for retrieving free proxies (HTTP, HTTPS, SOCKS4, SOCKS5).
ScraperA scraper that switches between normal mode and gentleman mode, built on Eletron, React
MwofflinerScrape any online Mediawiki motorised wiki (like Wikipedia) to your local filesystem
ArxivscraperA python module to scrape arxiv.org for specific date range and categories
Youtube Comment SuiteDownload YouTube comments from numerous videos, playlists, and channels for archiving, general search, and showing activity.
SeleniumcrawlerAn example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site
Cumcomic updater, mangafied
Ridereceipts🚕 Simple automation desktop app to download and organize your receipts from Uber/Lyft. Try out our new Ride Receipts PRO !
Instagram Python ScraperA instagram scraper wrote in python. Similar to instagram-php-scraper.Usages are in example.py. Enjoy it!
HeadlesschromeA Go package for working with headless Chrome. Run interactive JavaScript commands on web pages with Go and Chrome.
JobfunnelScrape job websites into a single spreadsheet with no duplicates.