All Projects → matthewnau → mirror-mirror

matthewnau / mirror-mirror

Licence: MIT license
A library to get images from social media

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to mirror-mirror

Soup
Web Scraper in Go, similar to BeautifulSoup
Stars: ✭ 1,685 (+11133.33%)
Mutual labels:  webscraping
Hq bot
📲 Bot to help solve HQ trivia
Stars: ✭ 167 (+1013.33%)
Mutual labels:  webscraping
Nytimes App
🗽 A Simple Demonstration of the New York Times App 📱 using Jsoup web crawler with MVVM Architecture 🔥
Stars: ✭ 246 (+1540%)
Mutual labels:  webscraping
Operating Systems Three Easy Pieces
operating systems three easy pieces by Rezmi
Stars: ✭ 128 (+753.33%)
Mutual labels:  webscraping
Youtube Projects
This repository contains all the code I use in my YouTube tutorials.
Stars: ✭ 144 (+860%)
Mutual labels:  webscraping
Instascrape
Powerful and flexible Instagram scraping library for Python, providing easy-to-use and expressive tools for accessing data programmatically
Stars: ✭ 202 (+1246.67%)
Mutual labels:  webscraping
Php Crawler
A php crawler that finds emails on the internets
Stars: ✭ 119 (+693.33%)
Mutual labels:  webscraping
NordVPN-switcher
Rotate between different NordVPN servers with ease. Works both on Linux and Windows without any required changes to your code!
Stars: ✭ 143 (+853.33%)
Mutual labels:  webscraping
Decryptr
An extensible API for breaking captchas
Stars: ✭ 154 (+926.67%)
Mutual labels:  webscraping
Crosslinked
LinkedIn enumeration tool to extract valid employee names from an organization through search engine scraping
Stars: ✭ 223 (+1386.67%)
Mutual labels:  webscraping
Ralger
ralger makes it easy to scrape a website. Built on the shoulders of titans: rvest, xml2.
Stars: ✭ 130 (+766.67%)
Mutual labels:  webscraping
Stardox
Github stargazers information gathering tool
Stars: ✭ 130 (+766.67%)
Mutual labels:  webscraping
Tradingview Data Scraper
Extract price and indicator data from TradingView charts to create ML datasets
Stars: ✭ 203 (+1253.33%)
Mutual labels:  webscraping
Anirip
🎬 A Crunchyroll show/season ripper
Stars: ✭ 127 (+746.67%)
Mutual labels:  webscraping
Polite
Be nice on the web
Stars: ✭ 253 (+1586.67%)
Mutual labels:  webscraping
Geeksforgeeksscrapper
Scrapes g4g and creates PDF
Stars: ✭ 124 (+726.67%)
Mutual labels:  webscraping
Falkor
Open Source web scraping API. Falkor turns web pages into queryable JSON
Stars: ✭ 184 (+1126.67%)
Mutual labels:  webscraping
GoodReadsScraper
📚 A GoodReads.com Scraper script to get books reviews including text and rating.
Stars: ✭ 36 (+140%)
Mutual labels:  webscraping
aws-pdf-textract-pipeline
🔍 Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS textract. Built with AWS CDK + TypeScript
Stars: ✭ 141 (+840%)
Mutual labels:  webscraping
R Web Scraping Cheat Sheet
Guide, reference and cheatsheet on web scraping using rvest, httr and Rselenium.
Stars: ✭ 207 (+1280%)
Mutual labels:  webscraping

Mirror Mirror

Contributions welcome Packagist

Mirror Mirror is a library for scraping publicly available images from social websites. It's purpose is to help show user's how much of their own data is truly available for anyone to see. Mirror Mirror scrapes every image url off of the profile page for a specified user, and then returns the uncompressed links.


Walkthrough

Here is a walkthrough on how to begin collecting high resolution images using Mirror Mirror! All you need to do is download the script for the particular site that you want to scrape, or download mirror-mirror.py, which contains all methods in one convenient file.

To use Mirror Mirror, simply call the file through the command line. Then, fill in the service, and username arguments.

The complete syntax should look similar to the following, where service is the platform, and username is the username of the intended profile(s) to be scraped. More than one user may be scraped from the same service at a time.

python3 mirror-mirror.py <service> <username...>

That's all there is to it! This command line tool will return a list containing all scraped image links. If the profile does not exist, or the user account has no photos associated with it, the list will return empty.


Supported services

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].