A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (+1071.43%)

Mutual labels: scraper, web-scraper

Anime Dl

Anime-dl is a command-line program to download anime from CrunchyRoll and Funimation.

Stars: ✭ 190 (+239.29%)

Mutual labels: scraper

Goribot

[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。

Stars: ✭ 190 (+239.29%)

Mutual labels: scraper

Gmdb

GMDB is the ultra-simple, cross-platform Movie Library with Features (Search, Take Note, Watch Later, Like, Import, Learn, Instantly Torrent Magnet Watch)

Stars: ✭ 189 (+237.5%)

Mutual labels: scraper

Unhtml.rs

A magic html parser

Stars: ✭ 180 (+221.43%)

Mutual labels: scraper

DotGrok

Parse text with pattern. Inspired by grok filter.

Stars: ✭ 26 (-53.57%)

Mutual labels: parsing

Annie

👾 Fast and simple video download library and CLI tool written in Go

Stars: ✭ 16,369 (+29130.36%)

Mutual labels: scraper

Linkedin Profile Scraper

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.

Stars: ✭ 171 (+205.36%)

Mutual labels: scraper

Thepiratebay

💀 The Pirate Bay node.js client

Stars: ✭ 191 (+241.07%)

Mutual labels: scraper

Blacksmith

Blacksmith is a tool for viewing, extracting, and converting textures, 3D models, and sounds from Assassin's Creed: Odyssey/Origins/Valhalla and Steep.

Stars: ✭ 104 (+85.71%)

Mutual labels: extract

Novel

基于 Laravel 5.2 的小说网站

Stars: ✭ 172 (+207.14%)

Mutual labels: scraper

Docsearch Scraper

DocSearch - Scraper

Stars: ✭ 188 (+235.71%)

Mutual labels: scraper

Skrape.it

A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.

Stars: ✭ 231 (+312.5%)

Mutual labels: scraper

Instagram Crawler

Crawl instagram photos, posts and videos for download.

Stars: ✭ 178 (+217.86%)

Mutual labels: scraper

MangDL

The most inefficient Manga downloader for PC

Stars: ✭ 40 (-28.57%)

Mutual labels: scraper

Readablewebproxy

Rewriting web proxy and archival tool. At this point, it just tries to download all the things.

Stars: ✭ 172 (+207.14%)

Mutual labels: scraper

Ruiji.net

crawler framework, distributed crawler extractor

Stars: ✭ 220 (+292.86%)

Mutual labels: scraper

Scrapelib

⛏ a library for scraping things

Stars: ✭ 164 (+192.86%)

Mutual labels: scraper

Scrape Twitter

🐦 Access Twitter data without an API key. [DEPRECATED]

Stars: ✭ 166 (+196.43%)

Mutual labels: scraper

autumn

A Java parser combinator library written with an unmatched feature set.

Stars: ✭ 112 (+100%)

Mutual labels: parsing

Datmusic Api

Alternative for VK Audio API

Stars: ✭ 160 (+185.71%)

Mutual labels: scraper

Opensanctions

An open database of international sanctions data, persons of interest and politically exposed persons

Stars: ✭ 157 (+180.36%)

Mutual labels: scraper

Covid19 mobility

COVID-19 Mobility Data Aggregator. Scraper of Google, Apple, Waze and TomTom COVID-19 Mobility Reports🚶🚘🚉

Stars: ✭ 156 (+178.57%)

Mutual labels: scraper

postcss-jsx

PostCSS syntax for parsing CSS in JS literals

Stars: ✭ 73 (+30.36%)

Mutual labels: parsing

CaptCC

A tiny C compiler written purely in JavaScript.

Stars: ✭ 175 (+212.5%)

Mutual labels: parsing

TwitterScraper

Scrape a User's Twitter data! Bypass the 3,200 tweet API limit for a User!

Stars: ✭ 80 (+42.86%)

Mutual labels: scraper

Media Scraper

Scrapes all photos and videos in a web page / Instagram / Twitter / Tumblr / Reddit / pixiv / TikTok

Stars: ✭ 206 (+267.86%)

Mutual labels: scraper

Instagram Scraper

scrapes medias, likes, followers, tags and all metadata. Inspired by instagram-php-scraper,bot

Stars: ✭ 2,209 (+3844.64%)

Mutual labels: scraper

Demeter

Demeter is a tool for scraping the calibre web ui

Stars: ✭ 155 (+176.79%)

Mutual labels: scraper

Tianyancha

pip安装的天眼查爬虫API，指定的单个/多个企业工商信息一键保存为Excel/JSON格式。A Battery-included Scraper API of Tianyancha, the best Chinese business data and investigation platform.

Stars: ✭ 206 (+267.86%)

Mutual labels: scraper

Serpscrap

SEO python scraper to extract data from major searchengine result pages. Extract data like url, title, snippet, richsnippet and the type from searchresults for given keywords. Detect Ads or make automated screenshots. You can also fetch text content of urls provided in searchresults or by your own. It's usefull for SEO and business related research tasks.

Stars: ✭ 153 (+173.21%)

Mutual labels: scraper

serlist

Search engine results page scraper

Stars: ✭ 12 (-78.57%)

Mutual labels: lxml

Colly

Elegant Scraper and Crawler Framework for Golang

Stars: ✭ 15,535 (+27641.07%)

Mutual labels: scraper

Nooverviewavailable.com

A survey of Apple developer documentation.