CrawlyCrawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+1813.04%)
wget-luaWget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (+126.09%)
Lulu[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+3330.43%)
DataflowkitExtract structured data from web sites. Web sites scraping.
Stars: ✭ 456 (+1882.61%)
proxycrawl-pythonProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (+121.74%)
web-crawlerPython Web Crawler with Selenium and PhantomJS
Stars: ✭ 19 (-17.39%)
Linkedin Profile Scraper🕵️♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (+643.48%)
bots-zooNo description or website provided.
Stars: ✭ 59 (+156.52%)
CollyElegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+67443.48%)
NewspaperNews, full-text, and article metadata extraction in Python 3. Advanced docs:
Stars: ✭ 11,545 (+50095.65%)
Mimo-CrawlerA web crawler that uses Firefox and js injection to interact with webpages and crawl their content, written in nodejs.
Stars: ✭ 22 (-4.35%)
img-cliAn interactive Command-Line Interface Build in NodeJS for downloading a single or multiple images to disk from URL
Stars: ✭ 15 (-34.78%)
ScrapyrtHTTP API for Scrapy spiders
Stars: ✭ 637 (+2669.57%)
FerretDeclarative web scraping
Stars: ✭ 4,837 (+20930.43%)
Goose ParserUniversal scrapping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (+817.39%)
diffbot-php-client[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (+130.43%)
copycatA PHP Scraping Class
Stars: ✭ 70 (+204.35%)
site-audit-seoWeb service and CLI tool for SEO site audit: crawl site, lighthouse all pages, view public reports in browser. Also output to console, json, csv, xlsx, Google Drive.
Stars: ✭ 91 (+295.65%)
go-jd京东App自动登录,在线商品自动下单
Stars: ✭ 158 (+586.96%)
custom-crawler🌌 High productivity semi-automatic crawler generator 🛠️🧰
Stars: ✭ 33 (+43.48%)
PDAP-ScrapersCode relating to scraping public police data.
Stars: ✭ 72 (+213.04%)
freeDictionaryAPIThere was no free Dictionary API on the web when I wanted one for my friend, so I created one.
Stars: ✭ 1,352 (+5778.26%)
ogePage metadata as a service
Stars: ✭ 22 (-4.35%)
pydermanInstall Selenium-compatible Chrome/Firefox/Opera/PhantomJS/Edge webdrivers automatically.
Stars: ✭ 24 (+4.35%)
scraperA web scraper starter project
Stars: ✭ 18 (-21.74%)
youtubeCreate a ZIM file from a Youtube channel/username/playlist
Stars: ✭ 25 (+8.7%)
google-this🔎 A simple yet powerful module to retrieve organic search results and much more from Google.
Stars: ✭ 88 (+282.61%)
premeStockMonitors for restocks
Stars: ✭ 53 (+130.43%)
aliexscrapeGet Aliexpress product details in JSON
Stars: ✭ 80 (+247.83%)
scraperA simple web scraper built around the JavaFX WebEngine
Stars: ✭ 13 (-43.48%)
Linkedin-ClientWeb scraper for grabing data from Linkedin profiles or company pages (personal project)
Stars: ✭ 42 (+82.61%)
youtube-unofficialAccess parts of your account unavailable through normal YouTube API access.
Stars: ✭ 33 (+43.48%)
gutenbergScraper for downloading the entire ebooks repository of project Gutenberg
Stars: ✭ 100 (+334.78%)
scraperNode.js based scraper using headless chrome
Stars: ✭ 45 (+95.65%)
Instagram-to-discordMonitor instagram user account and automatically post new images to discord channel via a webhook. Working 2022!
Stars: ✭ 113 (+391.3%)
TelegramScraperUsing this tool you can easily add so many members from any group to your group. Less than 2 minutes. Super easy. Time saver. But this tool is only for educational purpose. You could be banned from Telegram. So be careful. Recommanded to use this tool only on Termux.
Stars: ✭ 234 (+917.39%)
TrollHunterTwitter Troll & Fake News Hunter - Crawls news websites and twitter to identify fake news
Stars: ✭ 38 (+65.22%)
OpenScraperAn open source webapp for scraping: towards a public service for webscraping
Stars: ✭ 80 (+247.83%)
go-scrapyWeb crawling and scraping framework for Golang
Stars: ✭ 17 (-26.09%)
ScrapeMA monadic web scraping library
Stars: ✭ 17 (-26.09%)
LeetCodeAt present contains scraped data from around 1500 problems present on the site. More to follow....
Stars: ✭ 45 (+95.65%)
VK-ScraperScrapes VK user's photos
Stars: ✭ 42 (+82.61%)
youtube-playlist❄️ Extract links, ids, and names from a youtube playlist
Stars: ✭ 73 (+217.39%)
ha-multiscrapeHome Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.
Stars: ✭ 103 (+347.83%)
siteshooter📷 Automate full website screenshots and PDF generation with multiple viewport support.
Stars: ✭ 63 (+173.91%)
scraped-tvtime-apiA free TVTime API based on scraping TVTime website. No API key required
Stars: ✭ 23 (+0%)
civic-scraperTools for downloading agendas, minutes and other documents produced by local government
Stars: ✭ 21 (-8.7%)
CourseCakeBy serving course 📚 data that is more "edible" 🍰 for developers, we hope CourseCake offers a smooth approach to build useful tools for students.
Stars: ✭ 21 (-8.7%)
phantom-lordHandy API for Headless Chromium
Stars: ✭ 24 (+4.35%)
unfurlExtract rich metadata from URLs
Stars: ✭ 41 (+78.26%)
TikTokDownload public videos on TikTok using Python with Selenium
Stars: ✭ 37 (+60.87%)
document-dlCommand line program to download documents from web portals.
Stars: ✭ 14 (-39.13%)