GitPlanet
Projects
Users
Categories
Languages
About
All Categories
→
No Category
→ webarchiving
Top 5 webarchiving open source projects
node-warc
Parse And Create Web ARChive (WARC) files with node.js
✭ 69
javascript
warc
web-archiving
webarchive
web-archives
webarchiving
warc-files
chrome-remote-interface
pupeteer
awesome-memento
A list of things related to software, literature, and other content for 🕣 Memento
✭ 62
awesome
memento
awesome-list
webarchiving
memento-rfc
munin-indexer
A social media open post web archiving tool
✭ 16
javascript
HTML
CSS
python
archiving
preservation
webarchiving
high-fidelity-preservation
wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
✭ 52
c
python
perl
shell
Module Management System
M4
crawler
scraper
downloader
spider
ftp
scraping
crawling
archiving
wget
crawl
zstd
crawlers
warc
webarchiving
archiveteam
wget-lua
warcworker
A dockerized, queued high fidelity web archiver based on Squidwarc
✭ 48
python
Dockerfile
HTML
javascript
CSS
archiving
preservation
webarchiving
webarchives
high-fidelity-preservation
1-5
of
5
webarchiving projects