All Projects → datatogether → Warc

datatogether / Warc

Licence: agpl-3.0
Golang WARC (Web ARChive) Library

Programming Languages

go
31211 projects - #10 most used programming language
golang
3204 projects

Projects that are alternatives of or similar to Warc

Sublimetext Codeformatter
Code Formatter plugin for ST2/ST3
Stars: ✭ 765 (+2960%)
Mutual labels:  package
Nsga Ii
an implementation of NSGA-II in java
Stars: ✭ 18 (-28%)
Mutual labels:  package
Snm
🤏 Smol Node Manager written in Rust
Stars: ✭ 24 (-4%)
Mutual labels:  package
Ts3server Package
A script that generates a debian package for a TeamSpeak 3 Server
Stars: ✭ 5 (-80%)
Mutual labels:  package
Laravel Bootstrap Table List
Bootstrap table list generator for Laravel.
Stars: ✭ 16 (-36%)
Mutual labels:  package
Compinstall
Delphi utility app to auto-install component packages into IDE
Stars: ✭ 19 (-24%)
Mutual labels:  package
Lambda Packages
Various popular python libraries, pre-compiled to be compatible with AWS Lambda
Stars: ✭ 713 (+2752%)
Mutual labels:  package
Unitycopylightingsettings
Unity editor extension to copy&paste lighting settings from one scene to another.
Stars: ✭ 25 (+0%)
Mutual labels:  package
Js Package Template
Boilerplate for creating a new javascript package
Stars: ✭ 16 (-36%)
Mutual labels:  package
Pagarme Laravel
Pagar.me SDK for Laravel applications.
Stars: ✭ 23 (-8%)
Mutual labels:  package
Django Urlarchivefield
A custom Django model field that automatically archives a URL
Stars: ✭ 5 (-80%)
Mutual labels:  archiving
Aetherupload Laravel
A Laravel package to upload large files 上传大文件的Laravel扩展包
Stars: ✭ 835 (+3240%)
Mutual labels:  package
Itext7
iText 7 for Java represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enhance PDF documents, iText 7 can be a boon to nearly every workflow.
Stars: ✭ 913 (+3552%)
Mutual labels:  archiving
Pgbackrest
Reliable PostgreSQL Backup & Restore
Stars: ✭ 766 (+2964%)
Mutual labels:  archiving
Laravel Aws Sns
Laravel package for the AWS SNS Events
Stars: ✭ 24 (-4%)
Mutual labels:  package
Laravel Caffeine
Keeping Your Laravel Forms Awake.
Stars: ✭ 723 (+2792%)
Mutual labels:  package
Umbraco Nexu
Umbraco package that let's you track internal links in Umbraco
Stars: ✭ 18 (-28%)
Mutual labels:  package
Tidymv
Tidy Model Visualisation for Generalised Additive Models
Stars: ✭ 25 (+0%)
Mutual labels:  package
Atom Korean Spell Daum
Daum 맞춤법 검사기로 텍스트를 보내 손쉽게 교정 가능한 Atom package
Stars: ✭ 24 (-4%)
Mutual labels:  package
Libra Sdk Go
Go SDK for the Libra cryptocurrency
Stars: ✭ 23 (-8%)
Mutual labels:  package

warc

GitHub Slack GoDoc License

warc is an implementation of ISO28500 1.0, the WebARCive specfication. it provides readers, writers, and structs for working with warc records.

from the spec:

The WARC (Web ARChive) file format offers a convention for concatenating multiple resource records (data objects), each consisting of a set of simple text headers and an arbitrary data block into one long file. The WARC format is an extension of the ARC File Format [ARC] that has traditionally been used to store "web crawls" as sequences of content blocks harvested from the World Wide Web. Each capture in an ARC file is preceded by a one-line header that very briefly describes the harvested content and its length. This is directly followed by the retrieval protocol response messages and content. The original ARC format file is used by the Internet Archive (IA) since 1996 for managing billions of objects, and by several national libraries. package warc

License & Copyright

Affero General Public License v3

Getting Involved

We would love involvement from more people! If you notice any errors or would like to submit changes, please see our Contributing Guidelines.

We use GitHub issues for tracking bugs and feature requests and Pull Requests (PRs) for submitting changes

Usage

import "github.com/datatogether/warc"

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].