munin-indexer: A social media open post web archiving tool
Stars: ✭ 16 (-66.67%)
wget-lua: Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (+8.33%)
chronicle-etl: 📜 A CLI toolkit for extracting and working with your digital history
Stars: ✭ 78 (+62.5%)
anchorage: Save your bookmark collection in the Internet Archive, or locally.
Stars: ✭ 19 (-60.42%)
Unifiedarchive: UnifiedArchive - an archive manager with a unified way for different formats. Supports all basic (listing, reading, extracting and creation) and specific features (compression level, password-protection). Bundled with console program for working with archives.
Stars: ✭ 246 (+412.5%)
Archiveror: Archiveror will help you preserve the webpages you love. 💾
Stars: ✭ 246 (+412.5%)
Reprozip: ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in computational science.
Stars: ✭ 231 (+381.25%)
Archivebot: ArchiveBot, an IRC bot for archiving websites
Stars: ✭ 218 (+354.17%)
Pdf Archiver: A tool for tagging files and archiving tasks.
Stars: ✭ 182 (+279.17%)
Jarchivelib: A simple archiving and compression library for Java
Stars: ✭ 162 (+237.5%)
Wikipedia Mirror: 🌐 Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kiwix + ZIM dump, and MediaWiki/XOWA + XML dump
Stars: ✭ 160 (+233.33%)
Archiveis: A simple Python wrapper for the archive.is capturing service
Stars: ✭ 140 (+191.67%)
Wal G: Archival and Restoration for Postgres
Stars: ✭ 1,974 (+4012.5%)
Libarchive: Multi-format archive and compression library
Stars: ✭ 1,625 (+3285.42%)
I7j Pdfhtml: pdfHTML is an iText 7 add-on for Java that allows you to easily convert HTML and CSS into standards compliant PDFs that are accessible, searchable and usable for indexing.
Stars: ✭ 104 (+116.67%)
Cli: A tiny CLI for HedgeDoc
Stars: ✭ 94 (+95.83%)
Mkstage4: Bash Utility for Creating Stage 4 Tarballs
Stars: ✭ 55 (+14.58%)
Paperless: Scan, index, and archive all of your paper documents
Stars: ✭ 7,662 (+15862.5%)
Static Filez: Build compressed archives for static files and serve them over HTTP
Stars: ✭ 33 (-31.25%)
Crocoite: Web archiving using Google Chrome
Stars: ✭ 30 (-37.5%)
Warc: Golang WARC (Web ARChive) Library
Stars: ✭ 25 (-47.92%)
Itext7: iText 7 for Java represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enhance PDF documents, iText 7 can be a boon to nearly every workflow.
Stars: ✭ 913 (+1802.08%)
Pgbackrest: Reliable PostgreSQL Backup & Restore
Stars: ✭ 766 (+1495.83%)
Itext7 Dotnet: iText 7 for .NET is the .NET version of the iText 7 library, formerly known as iTextSharp, which it replaces. iText 7 represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enhance PDF documents, iText 7 can be a boon to nearly every workflow.
Stars: ✭ 698 (+1354.17%)
Grab Site: The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Stars: ✭ 680 (+1316.67%)
Linkace: Your self-hosted bookmark archive. Free and open source.
Stars: ✭ 657 (+1268.75%)
Bareos: Main repository with the code for the libraries and daemons
Stars: ✭ 651 (+1256.25%)
Pg probackup: Backup and recovery manager for PostgreSQL
Stars: ✭ 383 (+697.92%)
Wal E: Continuous Archiving for Postgres
Stars: ✭ 3,313 (+6802.08%)
Nb: CLI and local web plain text note‑taking, bookmarking, and archiving with linking, tagging, filtering, search, Git versioning & syncing, Pandoc conversion, + more, in a single portable script.
Stars: ✭ 3,846 (+7912.5%)
compose-dump: Dump and restore Docker Compose-projects
Stars: ✭ 14 (-70.83%)
PharTools: A powerful PHP-CLI tool to manage phar (PHP-Archive) files
Stars: ✭ 27 (-43.75%)
storytracker: Tools for tracking stories on news homepages
Stars: ✭ 47 (-2.08%)
earkweb: E-ARK Web is software for the creation and management of archival information packages, and it supports full-text search for individual files contained in them.
Stars: ✭ 18 (-62.5%)
archivers-harvesting-tools: ARCHIVED--Collection of scripts and code snippets for data harvesting after generating the zip starter
Stars: ✭ 31 (-35.42%)
fimfarchive: Preserves stories from Fimfiction
Stars: ✭ 15 (-68.75%)
paperless-ng: A supercharged version of paperless: scan, index and archive all your physical documents
Stars: ✭ 4,840 (+9983.33%)
i7n-pdfhtml: pdfHTML is an iText 7 add-on for C# (.NET) that allows you to easily convert HTML and CSS into standards compliant PDFs that are accessible, searchable and usable for indexing.
Stars: ✭ 111 (+131.25%)
archiveis: A simple Python wrapper for the archive.is capturing service
Stars: ✭ 152 (+216.67%)
irc-docs: Collected IRC protocol documentation
Stars: ✭ 47 (-2.08%)
deptoolkit: The Toolkit API, app, and browser extension. Start preserving now.
Stars: ✭ 40 (-16.67%)
jupyter-archive: A Jupyter/Jupyterlab extension to make, download and extract archive files.
Stars: ✭ 57 (+18.75%)
savepagenow: A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service
Stars: ✭ 140 (+191.67%)
roda: RODA - Repository of Authentic Digital Objects
Stars: ✭ 54 (+12.5%)
d2dx: D2DX is a complete solution to make Diablo II run well on modern PCs, with high fps and better resolutions.
Stars: ✭ 214 (+345.83%)
mailbag: A tool for creating and managing Mailbags, a package for preserving email using multiple preservation formats
Stars: ✭ 29 (-39.58%)
dbptk-ui: DBPTK base UI for both Desktop and Enterprise
Stars: ✭ 20 (-58.33%)
rscplus: RuneScape Classic client mod & preservation platform
Stars: ✭ 29 (-39.58%)
checkit tiff: "checkit_tiff" is an incredibly fast conformance checker for baseline TIFFs (with various extensions)
Stars: ✭ 14 (-70.83%)
node-warc: Parse And Create Web ARChive (WARC) files with node.js
Stars: ✭ 69 (+43.75%)
awesome-memento: A list of things related to software, literature, and other content for 🕣 Memento
Stars: ✭ 62 (+29.17%)
aut: The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (+131.25%)
robustlinks: Links on the web break all the time, robustify them!
Stars: ✭ 40 (-16.67%)
wail: 🐋 One-Click User Instigated Preservation
Stars: ✭ 107 (+122.92%)