All Categories → Data Storage → archiving

Top 46 archiving open source projects

Unifiedarchive
UnifiedArchive - an archive manager with a unified way for different formats. Supports all basic (listing, reading, extracting and creation) and specific features (compression level, password-protection). Bundled with console program for working with archives.
Reprozip
ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in computational science.
Archivebot
ArchiveBot, an IRC bot for archiving websites
Jarchivelib
A simple archiving and compression library for Java
Wikipedia Mirror
🌐 Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kimix + ZIM dump, and MediaWiki/XOWA + XML dump
Archiveis
A simple Python wrapper for the archive.is capturing service
Wal G
Archival and Restoration for Postgres
I7j Pdfhtml
pdfHTML is an iText 7 add-on for Java that allows you to easily convert HTML and CSS into standards compliant PDFs that are accessible, searchable and usable for indexing.
Cli
A tiny CLI for HedgeDoc
Mkstage4
Bash Utility for Creating Stage 4 Tarballs
Paperless
Scan, index, and archive all of your paper documents
Static Filez
Build compressed archives for static files and serve them over HTTP
Crocoite
Web archiving using Google Chrome
Warc
Golang WARC (Web ARChive) Library
Itext7
iText 7 for Java represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enhance PDF documents, iText 7 can be a boon to nearly every workflow.
Django Urlarchivefield
A custom Django model field that automatically archives a URL
Itext7 Dotnet
iText 7 for .NET is the .NET version of the iText 7 library, formerly known as iTextSharp, which it replaces. iText 7 represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enhance PDF documents, iText 7 can be a boon to nearly every workflow.
Grab Site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Linkace
Your self-hosted bookmark archive. Free and open source.
Pg probackup
Backup and recovery manager for PostgreSQL
Nb
CLI and local web plain text note‑taking, bookmarking, and archiving with linking, tagging, filtering, search, Git versioning & syncing, Pandoc conversion, + more, in a single portable script.
compose-dump
Dump and restore Docker Compose-projects
PharTools
A powerful PHP-CLI tool to manage phar (PHP-Archive) files
storytracker
Tools for tracking stories on news homepages
earkweb
E-ARK Web is a software for the creation and management of archival information packages, and it supports full-text search for individual files contained in them.
archivers-harvesting-tools
ARCHIVED--Collection of scripts and code snippets for data harvesting after generating the zip starter
paperless-ng
A supercharged version of paperless: scan, index and archive all your physical documents
i7n-pdfhtml
pdfHTML is an iText 7 add-on for C# (.NET) that allows you to easily convert HTML and CSS into standards compliant PDFs that are accessible, searchable and usable for indexing.
archiveis
A simple Python wrapper for the archive.is capturing service
wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
irc-docs
Collected IRC protocol documentation
jupyter-archive
A Jupyter/Jupyterlab extension to make, download and extract archive files.
savepagenow
A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service
anchorage
Save your bookmark collection in the Internet Archive, or locally.
1-46 of 46 archiving projects