All Projects → node-warc → Similar Projects or Alternatives

34 Open source projects that are alternatives of or similar to node-warc

warc
📇 Tools to Work with the Web Archive Ecosystem in R
Stars: ✭ 21 (-69.57%)
Mutual labels:  warc, warc-files
mixnode-warcreader-php
Read Web ARChive (WARC) files in PHP.
Stars: ✭ 20 (-71.01%)
Mutual labels:  warc, webarchive
wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-24.64%)
Mutual labels:  warc, webarchiving
wail
🐋 One-Click User Instigated Preservation
Stars: ✭ 107 (+55.07%)
Mutual labels:  warc, web-archiving
chatnoir-resiliparse
A robust web archive analytics toolkit
Stars: ✭ 26 (-62.32%)
Mutual labels:  warc, webarchive
Archivebox
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Stars: ✭ 12,383 (+17846.38%)
Mutual labels:  warc, web-archiving
awesome-memento
A list of things related to software, literature, and other content for 🕣 Memento
Stars: ✭ 62 (-10.14%)
Mutual labels:  webarchiving
munin-indexer
A social media open post web archiving tool
Stars: ✭ 16 (-76.81%)
Mutual labels:  webarchiving
Collect
A server to collect & archive websites that also supports video downloads
Stars: ✭ 62 (-10.14%)
Mutual labels:  web-archiving
MemGator
A Memento Aggregator CLI and Server in Go
Stars: ✭ 42 (-39.13%)
Mutual labels:  web-archiving
CommonCrawlDocumentDownload
A small tool which uses the CommonCrawl URL Index to download documents with certain file types or mime-types. This is used for mass-testing of frameworks like Apache POI and Apache Tika
Stars: ✭ 43 (-37.68%)
Mutual labels:  warc
domcurl
cUrl-like utility for fetching a resource (in this case we will run JS and return after network is idle) - great for JS heavy apps
Stars: ✭ 84 (+21.74%)
Mutual labels:  pupeteer
warrick
Recover lost websites from the Web Infrastructure
Stars: ✭ 76 (+10.14%)
Mutual labels:  web-archiving
warcworker
A dockerized, queued high fidelity web archiver based on Squidwarc
Stars: ✭ 48 (-30.43%)
Mutual labels:  webarchiving
vandal
Navigator for Web Archive
Stars: ✭ 146 (+111.59%)
Mutual labels:  webarchive
MementoEmbed
A service that provides archive-aware oEmbed-compatible embeddable surrogates (social cards, thumbnails, etc.) for archived web pages (mementos).
Stars: ✭ 13 (-81.16%)
Mutual labels:  web-archives
Heritrix3
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
Stars: ✭ 2,104 (+2949.28%)
Mutual labels:  warc
warc
⚙️ A Rust library for reading and writing WARC files
Stars: ✭ 26 (-62.32%)
Mutual labels:  warc
Archivenow
A Tool To Push Web Resources Into Web Archives
Stars: ✭ 253 (+266.67%)
Mutual labels:  web-archiving
Archiveror
Archiveror will help you preserve the webpages you love. 💾
Stars: ✭ 246 (+256.52%)
Mutual labels:  web-archiving
Wail
🐋 Web Archiving Integration Layer: One-Click User Instigated Preservation
Stars: ✭ 232 (+236.23%)
Mutual labels:  web-archiving
Warcio
Streaming WARC/ARC library for fast web archive IO
Stars: ✭ 195 (+182.61%)
Mutual labels:  web-archiving
Warcreate
Chrome extension to "Create WARC files from any webpage"
Stars: ✭ 143 (+107.25%)
Mutual labels:  web-archiving
Sfm Ui
Social Feed Manager user interface application.
Stars: ✭ 129 (+86.96%)
Mutual labels:  web-archiving
Archivespark
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Stars: ✭ 111 (+60.87%)
Mutual labels:  web-archiving
Replayweb.page
Serverless Web Archive Replay directly in the browser
Stars: ✭ 84 (+21.74%)
Mutual labels:  web-archiving
Conifer
Collect and revisit web pages.
Stars: ✭ 1,259 (+1724.64%)
Mutual labels:  web-archiving
Archiveweb.page
A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!
Stars: ✭ 69 (+0%)
Mutual labels:  web-archiving
Pywb
Core Python Web Archiving Toolkit for replay and recording of web archives
Stars: ✭ 798 (+1056.52%)
Mutual labels:  web-archiving
Webrecorder Player
Webrecorder Player for Desktop (OSX/Windows/Linux). (Built with Electron + Webrecorder)
Stars: ✭ 368 (+433.33%)
Mutual labels:  web-archiving
Ipwb
InterPlanetary Wayback: A distributed and persistent archive replay system using IPFS
Stars: ✭ 350 (+407.25%)
Mutual labels:  web-archiving
Perma
Indelible links
Stars: ✭ 272 (+294.2%)
Mutual labels:  web-archiving
dappeteer
🏌🏼‍E2E testing for dApps using Puppeteer + MetaMask
Stars: ✭ 138 (+100%)
Mutual labels:  pupeteer
jseval
Evaluate JavaScript on a URL through headless Chrome browser.
Stars: ✭ 19 (-72.46%)
Mutual labels:  pupeteer
1-34 of 34 similar projects