Scrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!

Stars: ✭ 1,322 (+2394.34%)

Mutual labels: scraper

ogcheckr-api

An api to check social media username availability on a variety of services

Stars: ✭ 18 (-66.04%)

Mutual labels: scraper

scrapeer

Essential PHP library that scrapes HTTP(S) and UDP trackers for torrent information.

Stars: ✭ 81 (+52.83%)

Mutual labels: scraper

pysoundcloud

Scraping the Un–scrapable™

Stars: ✭ 63 (+18.87%)

Mutual labels: scraper

Lambda Phantom Scraper

PhantomJS/Node.js web scraper for AWS Lambda

Stars: ✭ 93 (+75.47%)

Mutual labels: scraper

scoopi-scraper

Scoopi Web Scraper is a heavy duty tool to extract data from HTML pages.

Stars: ✭ 18 (-66.04%)

Mutual labels: scraper

tvseries

TV Series is a tool that scrapes Episode Synopsis' of popular TV Series' from websites like Wikipedia / IMDb and show in one place with a user-friendly navigation UI.

Stars: ✭ 37 (-30.19%)

Mutual labels: scraping

fb-page-chat-download

Python script to download messages from a Facebook page to a CSV file

Stars: ✭ 51 (-3.77%)

Mutual labels: scraper

Googlemaps Scraper

Google Maps reviews scraping

Stars: ✭ 87 (+64.15%)

Mutual labels: scraper

Spidr

A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (+1137.74%)

Mutual labels: scraper

NBA-Fantasy-Optimizer

NBA Daily Fantasy Lineup Optimizer for FanDuel Using Python

Stars: ✭ 21 (-60.38%)

Mutual labels: scraping

evine

Interactive CLI Web Crawler

Stars: ✭ 140 (+164.15%)

Mutual labels: scraper

Instaloctrack

An Instagram OSINT tool to collect all the geotagged locations available on an Instagram profile in order to plot them on a map, and dump them in a JSON.

Stars: ✭ 85 (+60.38%)

Mutual labels: scraper

BaiduSpider

项目已经移动至：https://github.com/BaiduSpider/BaiduSpider ！！一个爬取百度搜索结果的爬虫，目前支持百度网页搜索，百度图片搜索，百度知道搜索，百度视频搜索，百度资讯搜索，百度文库搜索，百度经验搜索和百度百科搜索。

Stars: ✭ 29 (-45.28%)

Mutual labels: crawling

Gmdb

GMDB is the ultra-simple, cross-platform Movie Library with Features (Search, Take Note, Watch Later, Like, Import, Learn, Instantly Torrent Magnet Watch)

Stars: ✭ 189 (+256.6%)

Mutual labels: scraper

Surgeon

Declarative DOM extraction expression evaluator. 👨‍⚕️

Stars: ✭ 653 (+1132.08%)

Mutual labels: scraper

Hooman

http interceptor to hoomanize cloudflare requests

Stars: ✭ 82 (+54.72%)

Mutual labels: scraper

INMET-API-temperature

Crawler dos dados metereológicos de estações convencionais do INMET (BDMEP)

Stars: ✭ 32 (-39.62%)

Mutual labels: scraper

nyt-first-said

Tweets when words are published for the first time in the NYT

Stars: ✭ 222 (+318.87%)

Mutual labels: scraper

Docsearch Scraper

DocSearch - Scraper

Stars: ✭ 188 (+254.72%)

Mutual labels: scraper

Instagram Crawler

Get Instagram posts/profile/hashtag data without using Instagram API

Stars: ✭ 643 (+1113.21%)

Mutual labels: scraper

wishlist

Read an Amazon wishlist programmatically with Python

Stars: ✭ 44 (-16.98%)

Mutual labels: scraper

Wombat

Lightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.

Stars: ✭ 1,220 (+2201.89%)

Mutual labels: scraper

ammobin-client

client for https://ammobin.ca

Stars: ✭ 18 (-66.04%)

Mutual labels: scraper

AzurLaneWikiScrapers

A console application that can scrape the Azur Lane wiki and export the data to Json files

Stars: ✭ 12 (-77.36%)

Mutual labels: scraper

Kikoeru Express

kikoeru 后端，不再维护，请到https://github.com/umonaca/kikoeru-express 获取更新

Stars: ✭ 79 (+49.06%)

Mutual labels: scraper

awesome-interface

AngularJS SPA interface for awesome lists. Awesome lists parsed using python.

Stars: ✭ 25 (-52.83%)

Mutual labels: scraper

scrapy-LBC

Araignée LeBonCoin avec Scrapy et ElasticSearch

Stars: ✭ 14 (-73.58%)

Mutual labels: scraper

Instascrape

🚀 A fast and lightweight utility and Python library for downloading posts, stories, and highlights from Instagram.

Stars: ✭ 76 (+43.4%)

Mutual labels: scraper

arxiv leaks

Whisper of the arxiv: read comments in tex of papers

Stars: ✭ 22 (-58.49%)

Mutual labels: scraper

TradeTheEvent

Implementation of "Trade the Event: Corporate Events Detection for News-Based Event-Driven Trading." In Findings of ACL2021

Stars: ✭ 64 (+20.75%)

Mutual labels: scraper

Pymarketcap

Python3 API wrapper and web scraper for https://coinmarketcap.com

Stars: ✭ 73 (+37.74%)

Mutual labels: scraper

Unhtml.rs

A magic html parser

Stars: ✭ 180 (+239.62%)

Mutual labels: scraper

Scala Scraper

A Scala library for scraping content from HTML pages

Stars: ✭ 631 (+1090.57%)

Mutual labels: scraper

gHarvester

Proof of concept for a security issue (in my opinion) that I found in accounts.google.com

Stars: ✭ 20 (-62.26%)

Mutual labels: scraper

linkedin-scraper

Tool to scrape linkedin

Stars: ✭ 74 (+39.62%)

Mutual labels: scraping

Instagram Crawler

Crawl instagram photos, posts and videos for download.

Stars: ✭ 178 (+235.85%)

Mutual labels: scraper

Instagram4j

📷 Instagram private API in Java

Stars: ✭ 629 (+1086.79%)

Mutual labels: scraper

Jd Autobuy

Python爬虫，京东自动登录，在线抢购商品

Stars: ✭ 1,174 (+2115.09%)

Mutual labels: scraper

tieba-zhuaqu

百度贴吧分布式爬虫，用于贴吧数据挖掘。从贴吧维度和用户维度进行数据分析

Stars: ✭ 56 (+5.66%)

Mutual labels: scraper

fiveN1-rent-scraper

🏠 a.k.a 591 rent scraper（591 租屋網爬蟲）

Stars: ✭ 51 (-3.77%)

Mutual labels: scraper

Goscrape

Web scraper that can create an offline readable version of a website

Stars: ✭ 69 (+30.19%)

Mutual labels: scraper

yt-videos-list

Create and **automatically** update a list of all videos on a YouTube channel (in txt/csv/md form) via YouTube bot with end-to-end web scraping - no API tokens required. Multi-threaded support for YouTube videos list updates.

Stars: ✭ 64 (+20.75%)

Mutual labels: scraper

pyscrapers

Scrapers for vk, facebook, instagram and more

Stars: ✭ 18 (-66.04%)

Mutual labels: scrape

Cheerio

Fast, flexible, and lean implementation of core jQuery designed specifically for the server.

Stars: ✭ 24,616 (+46345.28%)

Mutual labels: scraper

html-table-to-json

Generate JSON representations of HTML tables