PhpscraperPHP Scraper - an highly opinionated web-interface for PHP
Stars: ✭ 148 (+1133.33%)
SpidrA versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+5366.67%)
ScrapersA list of scrapers from around the web.
Stars: ✭ 366 (+2950%)
OLX Scraper📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.
Stars: ✭ 15 (+25%)
Linkedin-ClientWeb scraper for grabing data from Linkedin profiles or company pages (personal project)
Stars: ✭ 42 (+250%)
Scrape Linkedin Selenium`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
Stars: ✭ 239 (+1891.67%)
Awesome CrawlerA collection of awesome web crawler,spider in different languages
Stars: ✭ 4,793 (+39841.67%)
GetsyA simple browser/client-side web scraper.
Stars: ✭ 238 (+1883.33%)
yellowpages-scraperYellowpages.com Web Scraper written in Python and LXML to extract business details available based on a particular category and location.
Stars: ✭ 56 (+366.67%)
VK-ScraperScrapes VK user's photos
Stars: ✭ 42 (+250%)
yt-videos-listCreate and **automatically** update a list of all videos on a YouTube channel (in txt/csv/md form) via YouTube bot with end-to-end web scraping - no API tokens required. Multi-threaded support for YouTube videos list updates.
Stars: ✭ 64 (+433.33%)
wget-luaWget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (+333.33%)
OpenScraperAn open source webapp for scraping: towards a public service for webscraping
Stars: ✭ 80 (+566.67%)
diostsA Go scraper that validates security.txt files and outputs them in the disclose.io JSON format.
Stars: ✭ 18 (+50%)
Mimo-CrawlerA web crawler that uses Firefox and js injection to interact with webpages and crawl their content, written in nodejs.
Stars: ✭ 22 (+83.33%)
kenpompyA simple yet comprehensive web scraper for kenpom.com.
Stars: ✭ 41 (+241.67%)
youtube-unofficialAccess parts of your account unavailable through normal YouTube API access.
Stars: ✭ 33 (+175%)
go-jd京东App自动登录,在线商品自动下单
Stars: ✭ 158 (+1216.67%)
Captcha-ToolsAll-in-one Python (And now Go!) module to help solve captchas with Capmonster, 2captcha and Anticaptcha API's!
Stars: ✭ 23 (+91.67%)
SpydanA web spider for shodan.io without using the Developer API.
Stars: ✭ 30 (+150%)
impartus-downloaderDownload Impartus lectures, convert to mkv for offline viewing.
Stars: ✭ 19 (+58.33%)
freeDictionaryAPIThere was no free Dictionary API on the web when I wanted one for my friend, so I created one.
Stars: ✭ 1,352 (+11166.67%)
stock-market-scraperScraps historical stock market data from Yahoo Finance (https://finance.yahoo.com/)
Stars: ✭ 110 (+816.67%)
proxycrawl-pythonProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (+325%)
aliexscrapeGet Aliexpress product details in JSON
Stars: ✭ 80 (+566.67%)
ScrapeMA monadic web scraping library
Stars: ✭ 17 (+41.67%)
newspaperjsNews extraction and scraping. Article Parsing
Stars: ✭ 59 (+391.67%)
scraped-tvtime-apiA free TVTime API based on scraping TVTime website. No API key required
Stars: ✭ 23 (+91.67%)
document-dlCommand line program to download documents from web portals.
Stars: ✭ 14 (+16.67%)
esajScrapers for many e-SAJ systems
Stars: ✭ 35 (+191.67%)
opensea-scraperScrapes nft floor prices and additional information from opensea. Used for https://nftfloorprice.info
Stars: ✭ 129 (+975%)
ogePage metadata as a service
Stars: ✭ 22 (+83.33%)
scraperA web scraper starter project
Stars: ✭ 18 (+50%)
copycatA PHP Scraping Class
Stars: ✭ 70 (+483.33%)
get-sauceA command line program to download hentai videos and images from multiple websites
Stars: ✭ 40 (+233.33%)
youtubeCreate a ZIM file from a Youtube channel/username/playlist
Stars: ✭ 25 (+108.33%)
Instagram-Giveaways-WinnerInstagram Bot which when given a post url will spam mentions to increase the chances of winning. Win Instagram Giveaways!
Stars: ✭ 95 (+691.67%)
premeStockMonitors for restocks
Stars: ✭ 53 (+341.67%)
awesome-interfaceAngularJS SPA interface for awesome lists. Awesome lists parsed using python.
Stars: ✭ 25 (+108.33%)
Instagram-to-discordMonitor instagram user account and automatically post new images to discord channel via a webhook. Working 2022!
Stars: ✭ 113 (+841.67%)
gutenbergScraper for downloading the entire ebooks repository of project Gutenberg
Stars: ✭ 100 (+733.33%)
web-scraping-engineA simple web scraping engine supporting concurrent and anonymous scraping
Stars: ✭ 27 (+125%)
scraperNode.js based scraper using headless chrome
Stars: ✭ 45 (+275%)
VideoRecognition-realtime-autotrainer-alertsState of the art object detection in real-time using YOLOV3 algorithm. Augmented with a process that allows easy training of the classifier as a plug & play solution . Provides alert if an item in an alert list is detected.
Stars: ✭ 36 (+200%)
arxiv leaksWhisper of the arxiv: read comments in tex of papers
Stars: ✭ 22 (+83.33%)
TelegramScraperUsing this tool you can easily add so many members from any group to your group. Less than 2 minutes. Super easy. Time saver. But this tool is only for educational purpose. You could be banned from Telegram. So be careful. Recommanded to use this tool only on Termux.
Stars: ✭ 234 (+1850%)
crawlkitA crawler based on Phantom. Allows discovery of dynamic content and supports custom scrapers.
Stars: ✭ 23 (+91.67%)
azurapi-jsOpen Source Unofficial json based api that returns Azur Lane data
Stars: ✭ 27 (+125%)
LeetCodeAt present contains scraped data from around 1500 problems present on the site. More to follow....
Stars: ✭ 45 (+275%)
site-audit-seoWeb service and CLI tool for SEO site audit: crawl site, lighthouse all pages, view public reports in browser. Also output to console, json, csv, xlsx, Google Drive.
Stars: ✭ 91 (+658.33%)
youtube-playlist❄️ Extract links, ids, and names from a youtube playlist
Stars: ✭ 73 (+508.33%)
ha-multiscrapeHome Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.
Stars: ✭ 103 (+758.33%)
TrollHunterTwitter Troll & Fake News Hunter - Crawls news websites and twitter to identify fake news
Stars: ✭ 38 (+216.67%)
google-this🔎 A simple yet powerful module to retrieve organic search results and much more from Google.
Stars: ✭ 88 (+633.33%)
civic-scraperTools for downloading agendas, minutes and other documents produced by local government
Stars: ✭ 21 (+75%)