57 open source projects that are alternatives to or similar to warcworker

munin-indexer
A social media open post web archiving tool
Stars: ✭ 16 (-66.67%)
wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (+8.33%)
Mutual labels:  archiving, webarchiving
chronicle-etl
📜 A CLI toolkit for extracting and working with your digital history
Stars: ✭ 78 (+62.5%)
Mutual labels:  archiving
anchorage
Save your bookmark collection in the Internet Archive, or locally.
Stars: ✭ 19 (-60.42%)
Mutual labels:  archiving
Unifiedarchive
UnifiedArchive - an archive manager with a unified way for different formats. Supports all basic (listing, reading, extracting and creation) and specific features (compression level, password-protection). Bundled with console program for working with archives.
Stars: ✭ 246 (+412.5%)
Mutual labels:  archiving
Archiveror
Archiveror will help you preserve the webpages you love. 💾
Stars: ✭ 246 (+412.5%)
Mutual labels:  archiving
Reprozip
ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in computational science.
Stars: ✭ 231 (+381.25%)
Mutual labels:  archiving
Archivebot
ArchiveBot, an IRC bot for archiving websites
Stars: ✭ 218 (+354.17%)
Mutual labels:  archiving
Pdf Archiver
A tool for tagging files and archiving tasks.
Stars: ✭ 182 (+279.17%)
Mutual labels:  archiving
Jarchivelib
A simple archiving and compression library for Java
Stars: ✭ 162 (+237.5%)
Mutual labels:  archiving
Wikipedia Mirror
🌐 Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kiwix + ZIM dump, and MediaWiki/XOWA + XML dump
Stars: ✭ 160 (+233.33%)
Mutual labels:  archiving
Archiveis
A simple Python wrapper for the archive.is capturing service
Stars: ✭ 140 (+191.67%)
Mutual labels:  archiving
Wal G
Archival and Restoration for Postgres
Stars: ✭ 1,974 (+4012.5%)
Mutual labels:  archiving
Libarchive
Multi-format archive and compression library
Stars: ✭ 1,625 (+3285.42%)
Mutual labels:  archiving
I7j Pdfhtml
pdfHTML is an iText 7 add-on for Java that allows you to easily convert HTML and CSS into standards compliant PDFs that are accessible, searchable and usable for indexing.
Stars: ✭ 104 (+116.67%)
Mutual labels:  archiving
Cli
A tiny CLI for HedgeDoc
Stars: ✭ 94 (+95.83%)
Mutual labels:  archiving
Mkstage4
Bash Utility for Creating Stage 4 Tarballs
Stars: ✭ 55 (+14.58%)
Mutual labels:  archiving
Paperless
Scan, index, and archive all of your paper documents
Stars: ✭ 7,662 (+15862.5%)
Mutual labels:  archiving
Static Filez
Build compressed archives for static files and serve them over HTTP
Stars: ✭ 33 (-31.25%)
Mutual labels:  archiving
Crocoite
Web archiving using Google Chrome
Stars: ✭ 30 (-37.5%)
Mutual labels:  archiving
Warc
Golang WARC (Web ARChive) Library
Stars: ✭ 25 (-47.92%)
Mutual labels:  archiving
Itext7
iText 7 for Java represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enhance PDF documents, iText 7 can be a boon to nearly every workflow.
Stars: ✭ 913 (+1802.08%)
Mutual labels:  archiving
Django Urlarchivefield
A custom Django model field that automatically archives a URL
Stars: ✭ 5 (-89.58%)
Mutual labels:  archiving
Pgbackrest
Reliable PostgreSQL Backup & Restore
Stars: ✭ 766 (+1495.83%)
Mutual labels:  archiving
Itext7 Dotnet
iText 7 for .NET is the .NET version of the iText 7 library, formerly known as iTextSharp, which it replaces. iText 7 represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enhance PDF documents, iText 7 can be a boon to nearly every workflow.
Stars: ✭ 698 (+1354.17%)
Mutual labels:  archiving
Grab Site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Stars: ✭ 680 (+1316.67%)
Mutual labels:  archiving
Linkace
Your self-hosted bookmark archive. Free and open source.
Stars: ✭ 657 (+1268.75%)
Mutual labels:  archiving
Bareos
Main repository with the code for the libraries and daemons
Stars: ✭ 651 (+1256.25%)
Mutual labels:  archiving
Pg probackup
Backup and recovery manager for PostgreSQL
Stars: ✭ 383 (+697.92%)
Mutual labels:  archiving
Wal E
Continuous Archiving for Postgres
Stars: ✭ 3,313 (+6802.08%)
Mutual labels:  archiving
Nb
CLI and local web plain text note‑taking, bookmarking, and archiving with linking, tagging, filtering, search, Git versioning & syncing, Pandoc conversion, + more, in a single portable script.
Stars: ✭ 3,846 (+7912.5%)
Mutual labels:  archiving
compose-dump
Dump and restore Docker Compose-projects
Stars: ✭ 14 (-70.83%)
Mutual labels:  archiving
pastpages.org
The news homepage archive
Stars: ✭ 81 (+68.75%)
Mutual labels:  archiving
PharTools
A powerful PHP-CLI tool to manage phar (PHP-Archive) files
Stars: ✭ 27 (-43.75%)
Mutual labels:  archiving
storytracker
Tools for tracking stories on news homepages
Stars: ✭ 47 (-2.08%)
Mutual labels:  archiving
earkweb
E-ARK Web is software for creating and managing archival information packages, with full-text search across the individual files they contain.
Stars: ✭ 18 (-62.5%)
Mutual labels:  archiving
archivers-harvesting-tools
ARCHIVED--Collection of scripts and code snippets for data harvesting after generating the zip starter
Stars: ✭ 31 (-35.42%)
Mutual labels:  archiving
fimfarchive
Preserves stories from Fimfiction
Stars: ✭ 15 (-68.75%)
Mutual labels:  archiving
paperless-ng
A supercharged version of paperless: scan, index and archive all your physical documents
Stars: ✭ 4,840 (+9983.33%)
Mutual labels:  archiving
i7n-pdfhtml
pdfHTML is an iText 7 add-on for C# (.NET) that allows you to easily convert HTML and CSS into standards compliant PDFs that are accessible, searchable and usable for indexing.
Stars: ✭ 111 (+131.25%)
Mutual labels:  archiving
archiveis
A simple Python wrapper for the archive.is capturing service
Stars: ✭ 152 (+216.67%)
Mutual labels:  archiving
irc-docs
Collected IRC protocol documentation
Stars: ✭ 47 (-2.08%)
Mutual labels:  archiving
deptoolkit
The Toolkit API, app, and browser extension. Start preserving now.
Stars: ✭ 40 (-16.67%)
Mutual labels:  archiving
jupyter-archive
A Jupyter/Jupyterlab extension to make, download and extract archive files.
Stars: ✭ 57 (+18.75%)
Mutual labels:  archiving
savepagenow
A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service
Stars: ✭ 140 (+191.67%)
Mutual labels:  archiving
roda
RODA - Repository of Authentic Digital Objects
Stars: ✭ 54 (+12.5%)
Mutual labels:  preservation
d2dx
D2DX is a complete solution to make Diablo II run well on modern PCs, with high fps and better resolutions.
Stars: ✭ 214 (+345.83%)
Mutual labels:  preservation
mailbag
A tool for creating and managing Mailbags, a package for preserving email using multiple preservation formats
Stars: ✭ 29 (-39.58%)
Mutual labels:  preservation
dbptk-ui
DBPTK base UI for both Desktop and Enterprise
Stars: ✭ 20 (-58.33%)
Mutual labels:  preservation
islandora vagrant
Islandora testing and development environment
Stars: ✭ 36 (-25%)
Mutual labels:  preservation
rscplus
RuneScape Classic client mod & preservation platform
Stars: ✭ 29 (-39.58%)
Mutual labels:  preservation
checkit tiff
"checkit_tiff" is an incredibly fast conformance checker for baseline TIFFs (with various extensions)
Stars: ✭ 14 (-70.83%)
Mutual labels:  preservation
node-warc
Parse And Create Web ARChive (WARC) files with node.js
Stars: ✭ 69 (+43.75%)
Mutual labels:  webarchiving
awesome-memento
A list of things related to software, literature, and other content for 🕣 Memento
Stars: ✭ 62 (+29.17%)
Mutual labels:  webarchiving
aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (+131.25%)
Mutual labels:  webarchives
robustlinks
Links on the web break all the time, robustify them!
Stars: ✭ 40 (-16.67%)
Mutual labels:  webarchives
wail
🐋 One-Click User Instigated Preservation
Stars: ✭ 107 (+122.92%)