A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.

Stars: ✭ 231 (+904.35%)

Mutual labels: scraper

Covid19 mobility

COVID-19 Mobility Data Aggregator. Scraper of Google, Apple, Waze and TomTom COVID-19 Mobility Reports🚶🚘🚉

Stars: ✭ 156 (+578.26%)

Mutual labels: scraper

google-scraper

This class can retrieve search results from Google.

Stars: ✭ 33 (+43.48%)

Mutual labels: scraper

Nooverviewavailable.com

A survey of Apple developer documentation.

Stars: ✭ 152 (+560.87%)

Mutual labels: scraper

Goose Parser

Universal scrapping tool, which allows you to extract data using multiple environments

Stars: ✭ 211 (+817.39%)

Mutual labels: scraper

Colly

Elegant Scraper and Crawler Framework for Golang

Stars: ✭ 15,535 (+67443.48%)

Mutual labels: scraper

Google2csv

Google2Csv a simple google scraper that saves the results on a csv/xlsx/jsonl file

Stars: ✭ 145 (+530.43%)

Mutual labels: scraper

MangDL

The most inefficient Manga downloader for PC

Stars: ✭ 40 (+73.91%)

Mutual labels: scraper

Querylist

🕷️ The progressive PHP crawler framework! 优雅的渐进式PHP采集框架。

Stars: ✭ 2,392 (+10300%)

Mutual labels: scraper

lezhin-comics-downloader

📥 Downloader for lezhin comics

Stars: ✭ 30 (+30.43%)

Mutual labels: scraper

Jvppeteer

Headless Chrome For Java （Java 爬虫）

Stars: ✭ 193 (+739.13%)

Mutual labels: scraper

tv grab fr telerama

XMLTV Grabber using telerama api data

Stars: ✭ 36 (+56.52%)

Mutual labels: scraper

Unfurl

Scraper for oEmbed, Twitter Cards and Open Graph metadata - fast and Promise-based ⚡️

Stars: ✭ 193 (+739.13%)

Mutual labels: scraper

lopez

Crawling and scraping the Web for fun and profit

Stars: ✭ 20 (-13.04%)

Mutual labels: scraper

Anime Dl

Anime-dl is a command-line program to download anime from CrunchyRoll and Funimation.

Stars: ✭ 190 (+726.09%)

Mutual labels: scraper

Polite

Be nice on the web

Stars: ✭ 253 (+1000%)

Mutual labels: scraper

Gmdb

GMDB is the ultra-simple, cross-platform Movie Library with Features (Search, Take Note, Watch Later, Like, Import, Learn, Instantly Torrent Magnet Watch)

Stars: ✭ 189 (+721.74%)

Mutual labels: scraper

proxy-scraper

⭐️ A proxy scraper made using Protractor | Proxy list Updates every three hour 🔥

Stars: ✭ 201 (+773.91%)

Mutual labels: scraper

Unhtml.rs

A magic html parser

Stars: ✭ 180 (+682.61%)

Mutual labels: scraper

Instagram Proxy Api

CORS compliant API to access Instagram's public data

Stars: ✭ 245 (+965.22%)

Mutual labels: scraper

Linkedin Profile Scraper

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.

Stars: ✭ 171 (+643.48%)

Mutual labels: scraper

nyt-first-said

Tweets when words are published for the first time in the NYT

Stars: ✭ 222 (+865.22%)

Mutual labels: scraper

Novel

基于 Laravel 5.2 的小说网站

Stars: ✭ 172 (+647.83%)

Mutual labels: scraper

Getsy

A simple browser/client-side web scraper.

Stars: ✭ 238 (+934.78%)

Mutual labels: scraper

Scrapelib

⛏ a library for scraping things

Stars: ✭ 164 (+613.04%)

Mutual labels: scraper

scrapetube

Get all videos from a youtube channel, get all videos from a playlist, get all videos that match a search

Stars: ✭ 120 (+421.74%)

Mutual labels: scraper

Opensanctions

An open database of international sanctions data, persons of interest and politically exposed persons

Stars: ✭ 157 (+582.61%)

Mutual labels: scraper

Annie

👾 Fast and simple video download library and CLI tool written in Go

Stars: ✭ 16,369 (+71069.57%)

Mutual labels: scraper

Instagram Scraper

scrapes medias, likes, followers, tags and all metadata. Inspired by instagram-php-scraper,bot

Stars: ✭ 2,209 (+9504.35%)

Mutual labels: scraper

file-extensions

JSON collection of scraped file extensions, along with their description and type, from FileInfo.com

Stars: ✭ 15 (-34.78%)

Mutual labels: scraper

Serpscrap

SEO python scraper to extract data from major searchengine result pages. Extract data like url, title, snippet, richsnippet and the type from searchresults for given keywords. Detect Ads or make automated screenshots. You can also fetch text content of urls provided in searchresults or by your own. It's usefull for SEO and business related research tasks.

Stars: ✭ 153 (+565.22%)

Mutual labels: scraper

Ruiji.net

crawler framework, distributed crawler extractor

Stars: ✭ 220 (+856.52%)

Mutual labels: scraper

Phpscraper

PHP Scraper - an highly opinionated web-interface for PHP

Stars: ✭ 148 (+543.48%)

Mutual labels: scraper

yellowpages-scraper

Yellowpages.com Web Scraper written in Python and LXML to extract business details available based on a particular category and location.

Stars: ✭ 56 (+143.48%)

Mutual labels: scraper

Media Scraper

Scrapes all photos and videos in a web page / Instagram / Twitter / Tumblr / Reddit / pixiv / TikTok

Stars: ✭ 206 (+795.65%)

Mutual labels: scraper

tinyPornManager

Made for pornhub. Fork from tinyMediaManager v3

Stars: ✭ 57 (+147.83%)

Mutual labels: scraper

wikipedia-reference-scraper

Wikipedia API wrapper for references

Stars: ✭ 34 (+47.83%)

Mutual labels: scraper

Pahe.ph-Scraper

Pahe.ph [Pahe.in] Movies Website Scraper

Stars: ✭ 57 (+147.83%)

Mutual labels: scraper

TradeTheEvent

Implementation of "Trade the Event: Corporate Events Detection for News-Based Event-Driven Trading." In Findings of ACL2021

Stars: ✭ 64 (+178.26%)

Mutual labels: scraper

Tianyancha

pip安装的天眼查爬虫API，指定的单个/多个企业工商信息一键保存为Excel/JSON格式。A Battery-included Scraper API of Tianyancha, the best Chinese business data and investigation platform.

Stars: ✭ 206 (+795.65%)

Mutual labels: scraper

1-60 of 400 similar projects

›

next*5