All Projects → petdance → htmlparsing

petdance / htmlparsing

Licence: other
htmlparsing.com, a website devoted to helping people parse HTML correctly

Programming Languages

CSS
56736 projects
perl
6916 projects
Makefile
30231 projects

Projects that are alternatives of or similar to htmlparsing

pysub-parser
Library for extracting text and timestamps from multiple subtitle files (.ass, .ssa, .srt, .sub, .txt).
Stars: ✭ 40 (+37.93%)
Mutual labels:  parsing
FullFIX
A library for parsing FIX (Financial Information eXchange) protocol messages.
Stars: ✭ 60 (+106.9%)
Mutual labels:  parsing
puppeteer-autoscroll-down
Handle infinite scroll on websites by puppeteer
Stars: ✭ 40 (+37.93%)
Mutual labels:  parsing
Ramble
A R parser based on combinatory parsers.
Stars: ✭ 19 (-34.48%)
Mutual labels:  parsing
arborist
Arborist is a PEG parser that supports left-associative left recursion
Stars: ✭ 17 (-41.38%)
Mutual labels:  parsing
Whatsapp-Chat-Exporter
A customizable Android and iPhone WhatsApp database parser that will give you the history of your WhatsApp conversations in HTML and JSON. Android Backup Crypt12, Crypt14 and Crypt15 supported.
Stars: ✭ 150 (+417.24%)
Mutual labels:  parsing
wasmbin
A self-generating WebAssembly parser & serializer in Rust.
Stars: ✭ 40 (+37.93%)
Mutual labels:  parsing
wallhaven4j
Wallhaven API for Java
Stars: ✭ 17 (-41.38%)
Mutual labels:  parsing
php-fast-xml-parser
Fast SAX XML parser for PHP.
Stars: ✭ 25 (-13.79%)
Mutual labels:  parsing
attoparser
A tiny but fast java event-style markup parser.
Stars: ✭ 46 (+58.62%)
Mutual labels:  parsing
extract-emails
Extract emails from a given website
Stars: ✭ 58 (+100%)
Mutual labels:  parsing
pypact
A Python package for parsing FISPACT-II output
Stars: ✭ 19 (-34.48%)
Mutual labels:  parsing
logstash-config
logstash-config provides a parser and abstract syntax tree (AST) for the Logstash config format, written in Go
Stars: ✭ 26 (-10.34%)
Mutual labels:  parsing
twitter-to-rss
Simple python script to parse twitter feed to generate a rss feed.
Stars: ✭ 15 (-48.28%)
Mutual labels:  parsing
YAPDFKit
Yet another PDF Kit for parsing and modifying PDF's. For OS X and iOS.
Stars: ✭ 27 (-6.9%)
Mutual labels:  parsing
ParsecSharp
The faster monadic parser combinator library for C#
Stars: ✭ 23 (-20.69%)
Mutual labels:  parsing
biaffine-ner
Named Entity Recognition as Dependency Parsing
Stars: ✭ 293 (+910.34%)
Mutual labels:  parsing
CVparser
CVparser is software for parsing or extracting data out of CV/resumes.
Stars: ✭ 28 (-3.45%)
Mutual labels:  parsing
domainatrex
😈 A library for parsing TLDs from urls in Elixir
Stars: ✭ 29 (+0%)
Mutual labels:  parsing
sb-dynlex
Configurable lexer for PHP featuring a fluid API.
Stars: ✭ 27 (-6.9%)
Mutual labels:  parsing

This project is the source code for http://htmlparsing.com, plus the Perl code that converts it from Markdown format into HTML and uploads it to the server.

Repository layout

s               page bodies in Markdown format
tt              templates in Template::Toolkit format
static          images and styles
t               tests
build           output

Requirements

Perl and additional CPAN modules.

For building:

File::Slurp
Template
Text::Markdown

For testing:

Test::HTML::Tidy5

Contributing page content

  1. Modify templates or page bodies. New pages have to be registered in the file crank.
  2. Run make to build the site and inspect the result in the build directory.
  3. Run make test to check for HTML errors.
  4. Commit/publish changes, see s/index.md.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].