GitPlanet
Projects
Users
Categories
Languages
About
All Categories
→
No Category
→ webarchives
Top 3 webarchives open source projects
aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
✭ 111
scala
java
python
big-data
spark
apache-spark
hadoop
analysis
pyspark
digital-humanities
dataframe
big-data-analytics
webarchives
robustlinks
Links on the web break all the time, robustify them!
✭ 40
javascript
CSS
html
links
webarchives
robust-links
warcworker
A dockerized, queued high fidelity web archiver based on Squidwarc
✭ 48
python
Dockerfile
HTML
javascript
CSS
archiving
preservation
webarchiving
webarchives
high-fidelity-preservation
1-3
of
3
webarchives projects