get-set-fetch / extension

Licence: MIT license

web scraping extension

Programming Languages

typescript

32286 projects

HTML

75241 projects

javascript

184084 projects - #8 most used programming language

Projects that are alternatives of or similar to extension

acebase

A fast, low memory, transactional, index & query enabled NoSQL database engine and server for node.js and browser with realtime data change notifications

Stars: ✭ 288 (+928.57%)

Mutual labels: indexeddb

scrapeer

Essential PHP library that scrapes HTTP(S) and UDP trackers for torrent information.

Stars: ✭ 81 (+189.29%)

Mutual labels: scraper

avbot-charts

Aviation charts

Stars: ✭ 20 (-28.57%)

Mutual labels: scraper

Federal-Parliament-Scraper

A scraper for obtaining information on the workings of the Belgian Federal Parliament.

Stars: ✭ 18 (-35.71%)

Mutual labels: scraper

rose

Analyse all kinds of data for a TV series

Stars: ✭ 34 (+21.43%)

Mutual labels: scraper

diffbot-php-client

[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library

Stars: ✭ 53 (+89.29%)

Mutual labels: scraper

kvs

Lightweight key-value storage library for Browser, Node.js, and In-Memory.

Stars: ✭ 126 (+350%)

Mutual labels: indexeddb

scraper

图片爬取下载工具，极速爬取下载站酷https://www.zcool.com.cn/, CNU 视觉 http://www.cnu.cc/ 设计师/用户上传的图片/照片/插画。

Stars: ✭ 64 (+128.57%)

Mutual labels: scraper

pyitau

Unofficial client to access your Itaú bank data

Stars: ✭ 28 (+0%)

Mutual labels: scraper

GChan

Scrape boards and threads from 4chan (8kun WIP). Downloads images, videos and HTML if desired.

Stars: ✭ 31 (+10.71%)

Mutual labels: scraper

pinterest-web-scraper

Scraping Visually Similar Images from Pinterest

Stars: ✭ 26 (-7.14%)

Mutual labels: scraper

twpy

Twitter High level scraper for humans.

Stars: ✭ 58 (+107.14%)

Mutual labels: scraper

extract-emails

Extract emails from a given website

Stars: ✭ 58 (+107.14%)

Mutual labels: scraper

trawler

scraper for facebook, gab, google and tiktok

Stars: ✭ 20 (-28.57%)

Mutual labels: scraper

saveddit

Bulk Downloader for Reddit

Stars: ✭ 130 (+364.29%)

Mutual labels: scraper

python web scraping

Web scraping using python, requests and selenium

Stars: ✭ 40 (+42.86%)

Mutual labels: scraper

ant

A web crawler for Go

Stars: ✭ 264 (+842.86%)

Mutual labels: scraper

vsco-scraper

Easily allows for scraping a VSCO

Stars: ✭ 106 (+278.57%)

Mutual labels: scraper

website-to-json

Converts website to json using jQuery selectors

Stars: ✭ 37 (+32.14%)

Mutual labels: scraper

blinkist-m4a-downloader

Grabs all of the audio files from all of the Blinkist books

Stars: ✭ 100 (+257.14%)

Mutual labels: scraper

View All Similar Projects ➔

Browser Extension

get-set, Fetch! is a browser extension for scraping sites through out a series of parametrizable scraping scenarios.

Currently supported browsers: Chrome, Firefox, Edge.

The most common use cases are handled by builtin scenarios:

Scrape Static Content
- Extracts text and binary content from static html pages based on CSS selectors.
Scrape Dynamic Content
- Extracts text and binary content from dynamic (javascript) pages based on CSS selectors.

You can also install community based scenarios:

Extract Html Headings - v0.2.0
- "Hello World" example of writing a scrape scenario.
Extract Article Content - v0.2.0
- Extract article content using Mozilla Readability library.

If you wrote a scraping scenario and want to share it, please update the above list and make a pull request.

The extension is structured as a monorepo with the following sub-packages:

commons: mostly typescript definitions
background: parses pages and stores relevant data in the builtin browser database (IndexedDB)
popup: toolbar appearance
admin: front-end for the background capabilities
scrape-static-content: builtin scenario
scrape-dynamic-content: builtin scenario
extension: builds the extension files and runs a comprehensive suite of integration tests

You can find technical tidbits in each sub-package readme file.

A detailed documentation with a series of examples is available at getsetfetch.org.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

get-set-fetch / extension

Programming Languages

Labels

Projects that are alternatives of or similar to extension

Browser Extension