All Projects → get-set-fetch → extension

get-set-fetch / extension

Licence: MIT license
web scraping extension

Programming Languages

typescript
32286 projects
HTML
75241 projects
javascript
184084 projects - #8 most used programming language

Projects that are alternatives of or similar to extension

acebase
A fast, low memory, transactional, index & query enabled NoSQL database engine and server for node.js and browser with realtime data change notifications
Stars: ✭ 288 (+928.57%)
Mutual labels:  indexeddb
scrapeer
Essential PHP library that scrapes HTTP(S) and UDP trackers for torrent information.
Stars: ✭ 81 (+189.29%)
Mutual labels:  scraper
avbot-charts
Aviation charts
Stars: ✭ 20 (-28.57%)
Mutual labels:  scraper
Federal-Parliament-Scraper
A scraper for obtaining information on the workings of the Belgian Federal Parliament.
Stars: ✭ 18 (-35.71%)
Mutual labels:  scraper
rose
Analyse all kinds of data for a TV series
Stars: ✭ 34 (+21.43%)
Mutual labels:  scraper
diffbot-php-client
[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (+89.29%)
Mutual labels:  scraper
kvs
Lightweight key-value storage library for Browser, Node.js, and In-Memory.
Stars: ✭ 126 (+350%)
Mutual labels:  indexeddb
scraper
图片爬取下载工具,极速爬取下载 站酷https://www.zcool.com.cn/, CNU 视觉 http://www.cnu.cc/ 设计师/用户 上传的 图片/照片/插画。
Stars: ✭ 64 (+128.57%)
Mutual labels:  scraper
pyitau
Unofficial client to access your Itaú bank data
Stars: ✭ 28 (+0%)
Mutual labels:  scraper
GChan
Scrape boards and threads from 4chan (8kun WIP). Downloads images, videos and HTML if desired.
Stars: ✭ 31 (+10.71%)
Mutual labels:  scraper
pinterest-web-scraper
Scraping Visually Similar Images from Pinterest
Stars: ✭ 26 (-7.14%)
Mutual labels:  scraper
twpy
Twitter High level scraper for humans.
Stars: ✭ 58 (+107.14%)
Mutual labels:  scraper
extract-emails
Extract emails from a given website
Stars: ✭ 58 (+107.14%)
Mutual labels:  scraper
trawler
scraper for facebook, gab, google and tiktok
Stars: ✭ 20 (-28.57%)
Mutual labels:  scraper
saveddit
Bulk Downloader for Reddit
Stars: ✭ 130 (+364.29%)
Mutual labels:  scraper
python web scraping
Web scraping using python, requests and selenium
Stars: ✭ 40 (+42.86%)
Mutual labels:  scraper
ant
A web crawler for Go
Stars: ✭ 264 (+842.86%)
Mutual labels:  scraper
vsco-scraper
Easily allows for scraping a VSCO
Stars: ✭ 106 (+278.57%)
Mutual labels:  scraper
website-to-json
Converts website to json using jQuery selectors
Stars: ✭ 37 (+32.14%)
Mutual labels:  scraper
blinkist-m4a-downloader
Grabs all of the audio files from all of the Blinkist books
Stars: ✭ 100 (+257.14%)
Mutual labels:  scraper

FOSSA Status dependencies Status Known Vulnerabilities Build Status Coverage Status

Browser Extension

get-set, Fetch! is a browser extension for scraping sites through out a series of parametrizable scraping scenarios.

Currently supported browsers: Chrome, Firefox, Edge.

The most common use cases are handled by builtin scenarios:

  • Scrape Static Content
    • Extracts text and binary content from static html pages based on CSS selectors.
  • Scrape Dynamic Content
    • Extracts text and binary content from dynamic (javascript) pages based on CSS selectors.

You can also install community based scenarios:

If you wrote a scraping scenario and want to share it, please update the above list and make a pull request.

The extension is structured as a monorepo with the following sub-packages:

  • commons: mostly typescript definitions
  • background: parses pages and stores relevant data in the builtin browser database (IndexedDB)
  • popup: toolbar appearance
  • admin: front-end for the background capabilities
  • scrape-static-content: builtin scenario
  • scrape-dynamic-content: builtin scenario
  • extension: builds the extension files and runs a comprehensive suite of integration tests

You can find technical tidbits in each sub-package readme file.

A detailed documentation with a series of examples is available at getsetfetch.org.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].