Instagram-to-discordMonitor instagram user account and automatically post new images to discord channel via a webhook. Working 2022!
Stars: ✭ 113 (-26.14%)
scraperNodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Supported databases: SQLite, MySQL, PostgreSQL. Supported headless clients: Puppeteer, Playwright, Cheerio, JSdom.
Stars: ✭ 37 (-75.82%)
TorScrapperA Scraper made 100% in Python using BeautifulSoup and Tor. It can be used to scrape both normal and onion links. Happy Scraping :)
Stars: ✭ 24 (-84.31%)
PhpscraperPHP Scraper - an highly opinionated web-interface for PHP
Stars: ✭ 148 (-3.27%)
Osint collectionMaintained collection of OSINT related resources. (All Free & Actionable)
Stars: ✭ 809 (+428.76%)
browser-automation-apiBrowser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other things like capture a screenshot, generate pdf, extract content or execute custom Puppeteer, Playwright functions.
Stars: ✭ 24 (-84.31%)
Captcha-ToolsAll-in-one Python (And now Go!) module to help solve captchas with Capmonster, 2captcha and Anticaptcha API's!
Stars: ✭ 23 (-84.97%)
Scraper-Projects🕸 List of mini projects that involve web scraping 🕸
Stars: ✭ 25 (-83.66%)
KatanaA Python Tool For google Hacking
Stars: ✭ 355 (+132.03%)
Imagescraper✂️ High performance, multi-threaded image scraper
Stars: ✭ 630 (+311.76%)
Pahe.ph-ScraperPahe.ph [Pahe.in] Movies Website Scraper
Stars: ✭ 57 (-62.75%)
JobfunnelScrape job websites into a single spreadsheet with no duplicates.
Stars: ✭ 1,528 (+898.69%)
lopezCrawling and scraping the Web for fun and profit
Stars: ✭ 20 (-86.93%)
scrapmanRetrieve real (with Javascript executed) HTML code from an URL, ultra fast and supports multiple parallel loading of webs
Stars: ✭ 21 (-86.27%)
gochanges**[ARCHIVED]** website changes tracker 🔍
Stars: ✭ 12 (-92.16%)
site-audit-seoWeb service and CLI tool for SEO site audit: crawl site, lighthouse all pages, view public reports in browser. Also output to console, json, csv, xlsx, Google Drive.
Stars: ✭ 91 (-40.52%)
siteshooter📷 Automate full website screenshots and PDF generation with multiple viewport support.
Stars: ✭ 63 (-58.82%)
papercutPapercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-90.2%)
proxycrawl-pythonProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (-66.67%)
Linux CommandLinux命令大全搜索工具,内容包含Linux命令手册、详解、学习、搜集。https://git.io/linux
Stars: ✭ 17,481 (+11325.49%)
LinkedinLinkedin Scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy
Stars: ✭ 309 (+101.96%)
CrawlyCrawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+187.58%)
Spam Bot 3000Social media research and promotion, semi-autonomous CLI bot
Stars: ✭ 79 (-48.37%)
Email ExtractorThe main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url
Stars: ✭ 81 (-47.06%)
google-scraperThis class can retrieve search results from Google.
Stars: ✭ 33 (-78.43%)
Search Engine Optimization🔍 A helpful checklist/collection of Search Engine Optimization (SEO) tips and techniques.
Stars: ✭ 1,798 (+1075.16%)
Search Engine ParserLightweight package to query popular search engines and scrape for result titles, links and descriptions
Stars: ✭ 216 (+41.18%)
Scrape Linkedin Selenium`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
Stars: ✭ 239 (+56.21%)
diffbot-php-client[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (-65.36%)
scrapersscrapers for building your own image databases
Stars: ✭ 46 (-69.93%)
ha-multiscrapeHome Assistant custom component for scraping (html, xml or json) multiple values (from a single HTTP request) with a separate sensor/attribute for each value. Support for (login) form-submit functionality.
Stars: ✭ 103 (-32.68%)
Scrapysharpreborn of https://bitbucket.org/rflechner/scrapysharp
Stars: ✭ 226 (+47.71%)
wget-luaWget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-66.01%)
document-dlCommand line program to download documents from web portals.
Stars: ✭ 14 (-90.85%)
copycatA PHP Scraping Class
Stars: ✭ 70 (-54.25%)
scrapy facebookerCollection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.
Stars: ✭ 22 (-85.62%)
ZeiverA Scraper, Downloader, & Recorder for static open directories.
Stars: ✭ 14 (-90.85%)
Goose ParserUniversal scrapping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (+37.91%)
fb-scraperScrape a Facebook profile and turn it into a JSON file
Stars: ✭ 18 (-88.24%)
SearchScraperAPIAiohttp web server API, which scrapes Google and returns scrape results as response. Supports proxies, multiple geos and number of results.
Stars: ✭ 31 (-79.74%)
AutoscraperA Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+2564.71%)
facebook-discussion-tkA collection of tools to (semi-)automatically collect and analyze data from online discussions on Facebook groups and pages.
Stars: ✭ 33 (-78.43%)
FerretDeclarative web scraping
Stars: ✭ 4,837 (+3061.44%)
DataflowkitExtract structured data from web sites. Web sites scraping.
Stars: ✭ 456 (+198.04%)
Lulu[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+415.69%)
bots-zooNo description or website provided.
Stars: ✭ 59 (-61.44%)
SerpGoogle Search SERP Scraper
Stars: ✭ 40 (-73.86%)
GeziyorGeziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (+714.38%)
PypatentSearch for and retrieve US Patent and Trademark Office Patent Data
Stars: ✭ 31 (-79.74%)
Jsonframe Cheeriosimple multi-level scraper json input/output for Cheerio
Stars: ✭ 196 (+28.1%)
CollyElegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+10053.59%)
uspto-opendata-pythonA client library for accessing the USPTO Open Data APIs, written in Python.
Stars: ✭ 51 (-66.67%)
DuckduckgoAn unofficial DuckDuckGo search API.
Stars: ✭ 6 (-96.08%)
SeleniumcrawlerAn example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site
Stars: ✭ 117 (-23.53%)
UdemycoursegrabberYour will to enroll in Udemy course is here, but the money isn't? Search no more! This python program searches for your desired course in more than [insert big number here] websites, compares the last updated date, and gives you the download link of the latest one back, but you also have the choice to see the other ones as well!
Stars: ✭ 137 (-10.46%)