Top 392 scraper open source projects

Google Play Scraper
Node.js scraper to get data from Google Play
Not Your Average Web Crawler
A web crawler (for bug hunting) that gathers more than you can imagine.
Reactriot2017 Dotamania
🌐 Web scraping made easy with the visual 🗺 mind map editor to JSON
Laravel Scavenger
The most integrated web scraper package for Laravel.
Scrapoxy
Scrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!
Awesome Dl
This is a list of repositories and libraries that allow for scripted downloading of online content.
Lambda Phantom Scraper
PhantomJS/Node.js web scraper for AWS Lambda
Hockey Scraper
Python Package for scraping NHL Play-by-Play and Shift data
Googlemaps Scraper
Google Maps reviews scraping
Image search
Python Library to download images and metadata from popular search engines.
Instaloctrack
An Instagram OSINT tool to collect all the geotagged locations available on an Instagram profile in order to plot them on a map, and dump them in a JSON.
Geziyor
Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Hooman
HTTP interceptor to hoomanize Cloudflare requests
Email Extractor
The main functionality is to extract all the emails from one or several URLs
Wombat
Lightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.
Kikoeru Express
kikoeru backend; no longer maintained. See https://github.com/umonaca/kikoeru-express for updates.
Proxy Scraper
Library for scraping free proxies lists
Instascrape
🚀 A fast and lightweight utility and Python library for downloading posts, stories, and highlights from Instagram.
Pittapi
An API to easily get data from the University of Pittsburgh
Pymarketcap
Python3 API wrapper and web scraper for https://coinmarketcap.com
Goscraper
Golang pkg to quickly return a preview of a webpage (title/description/images)
Jd Autobuy
Python crawler for JD.com (Jingdong): automatic login and online flash-sale purchasing
Skraper
Kotlin/Java library and cli tool for scraping posts and media from various sources with neither authorization nor full page rendering (Facebook, Instagram, Twitter, Youtube, Tiktok, Telegram, Twitch, Reddit, 9GAG, Pinterest, Flickr, Tumblr, IFunny, VK, Pikabu)
Goscrape
Web scraper that can create an offline readable version of a website
Pitchfork
🎶 Unofficial python API for pitchfork.com reviews.
Pastebin Scraper
Live-scraping pastebin to fight boredom.
Scrape
Distributed Scraper
Bad Robo
🐙 Get 400-500 real followers daily 👽 [BadRobo] is the best Instagram bot available now, with all features! Our bot does not violate any of Instagram's rules, so you don't have to worry about getting an ACTION BLOCK!
Warta Scrap
Indonesian news index crawler covering 10 online media outlets
Tangerine
Tangerine Bank scraper
Anutimetable
Intuitive timetable builder for the Australian National University.
Pitchfork Npm
An Unofficial Pitchfork Music API client for Node.js
Social Scraper
A collection of scripts for crawling data from social networks and Vietnamese-language websites
Repository.kodibae
Kodi Bae Repository - Kodi is a registered trademark of the XBMC Foundation. We are not connected to or in any other way affiliated with Kodi.
Django Dynamic Scraper
Creating Scrapy scrapers via the Django admin interface
Public Instagram
Tool to fetch Instagram's public content.
Shopifyscraper
Shopify Scraper (not monitor)
Avbook
AV movie management system; crawlers for avmoo, javbus, and javlibrary; online AV film library; AV magnet link database (Japanese Adult Video Library, Adult Video Magnet Links - Japanese Adult Video Database)
Serp
Google Search SERP Scraper
Botvid 19
Messenger Bot that scrapes for COVID-19 data and periodically updates subscribers via Facebook Messages. Created using Python/Flask, MYSQL, HTML, Heroku
Real Estate Scraper
Web scraper that makes it easier to find real estate in Slovenia.
Pypatent
Search for and retrieve US Patent and Trademark Office Patent Data
Anitop
Anitop is a simple unofficial API for the https://anitrendz.net/ site
Ratatouille
A Node.js wrapper for scraping allrecipes.com
Node Website Scraper
Download website to local directory (including all css, images, js, etc.)
Pypergrabber
Fetches PubMed article IDs (PMIDs) from email inbox, then crawls PubMed, Google Scholar and Sci-Hub for respective PDF files.
Emby.plugins.javscraper
A Japanese movie scraper plugin for Emby/Jellyfin that can fetch film information from certain websites.
Voyages Sncf Api
A Scrapy spider that scrapes times and prices from Voyages Sncf. It uses scrapyrt to provide an API interface.
Twitter Get Old Tweets Scraper
A data scraper for retrieving old tweets from Twitter using Python3.
Scrapit
Scraping scripts for various websites.
Gisaid Scrapper
Scraping tool for GISAID data regarding SARS-CoV-2