All Projects → N0taN3rd → wail

N0taN3rd / wail

Licence: GPL-3.0 license
🐋 One-Click User Instigated Preservation

Projects that are alternatives of or similar to wail

Archivebox
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
Stars: ✭ 12,383 (+11472.9%)
Mutual labels:  warc, web-archiving
node-warc
Parse And Create Web ARChive (WARC) files with node.js
Stars: ✭ 69 (-35.51%)
Mutual labels:  warc, web-archiving
Archivespark
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Stars: ✭ 111 (+3.74%)
Mutual labels:  web-archiving
chatnoir-resiliparse
A robust web archive analytics toolkit
Stars: ✭ 26 (-75.7%)
Mutual labels:  warc
mixnode-warcreader-php
Read Web ARChive (WARC) files in PHP.
Stars: ✭ 20 (-81.31%)
Mutual labels:  warc
Warcreate
Chrome extension to "Create WARC files from any webpage"
Stars: ✭ 143 (+33.64%)
Mutual labels:  web-archiving
Sfm Ui
Social Feed Manager user interface application.
Stars: ✭ 129 (+20.56%)
Mutual labels:  web-archiving
Conifer
Collect and revisit web pages.
Stars: ✭ 1,259 (+1076.64%)
Mutual labels:  web-archiving
CommonCrawlDocumentDownload
A small tool which uses the CommonCrawl URL Index to download documents with certain file types or mime-types. This is used for mass-testing of frameworks like Apache POI and Apache Tika
Stars: ✭ 43 (-59.81%)
Mutual labels:  warc
wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-51.4%)
Mutual labels:  warc
Heritrix3
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
Stars: ✭ 2,104 (+1866.36%)
Mutual labels:  warc
Archivenow
A Tool To Push Web Resources Into Web Archives
Stars: ✭ 253 (+136.45%)
Mutual labels:  web-archiving
Warcio
Streaming WARC/ARC library for fast web archive IO
Stars: ✭ 195 (+82.24%)
Mutual labels:  web-archiving
warc
⚙️ A Rust library for reading and writing WARC files
Stars: ✭ 26 (-75.7%)
Mutual labels:  warc
warcworker
A dockerized, queued high fidelity web archiver based on Squidwarc
Stars: ✭ 48 (-55.14%)
Mutual labels:  high-fidelity-preservation
Replayweb.page
Serverless Web Archive Replay directly in the browser
Stars: ✭ 84 (-21.5%)
Mutual labels:  web-archiving
warc
📇 Tools to Work with the Web Archive Ecosystem in R
Stars: ✭ 21 (-80.37%)
Mutual labels:  warc
MemGator
A Memento Aggregator CLI and Server in Go
Stars: ✭ 42 (-60.75%)
Mutual labels:  web-archiving
warrick
Recover lost websites from the Web Infrastructure
Stars: ✭ 76 (-28.97%)
Mutual labels:  web-archiving
Archiveror
Archiveror will help you preserve the webpages you love. 💾
Stars: ✭ 246 (+129.91%)
Mutual labels:  web-archiving

WAIL logo
 Web Archiving Integration Layer (WAIL)

"One-Click User Instigated Preservation"

Web Archiving Integration Layer (WAIL)

"One-Click User Instigated Preservation"

Web Archiving Integration Layer (WAIL) is a graphical user interface (GUI) atop multiple web archiving tools intended to be used as an easy way for anyone to preserve and replay web pages. Tools included and accessible through the GUI are Heritrix 3.2.0 and PyWb 0.33.0.

More information about the motivations behind WAIL see the Motivations section in the projects wiki.

This work is supported by the National Endowment for the Humanities (NEH), through Digital Humanities grants HD-51670-13 and HK-50181-14

WAIL Electron

js-standard-style

WAIL Home Screen

Usage

You can download the latest release here.

For information on using WAIL please consult the wiki.

To get up and running from source consult the Development section in this projects wiki.

Slides from Archives Unleased 2.0

Are Wails Electric?

Problems? Questions?

Please see the Frequently Asked Questions page.

Contact

WAIL is a project of the Web Science and Digital Libraries (WS-DL) research group at Old Dominion University (ODU), created by Mat Kelly.

For support e-mail [email protected] or tweet to us at @johnaberlin and/or @WebSciDL.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].