Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Docker-powered html convert to pdf(html2pdf), html to image(html2image like jpeg,png),which using chrome(golang) kernel, add watermarks to pdf, convert pdf to images etc.

Stars: ✭ 141 (-2.76%)

Mutual labels: pdf

Pyecharts Snapshot

renders the output of pyecharts as png, jpeg, gif, svg, eps, pdf and raw base64

Stars: ✭ 142 (-2.07%)

Mutual labels: pdf

Svglib

Read SVG files and convert them to other formats.

Stars: ✭ 139 (-4.14%)

Mutual labels: pdf

View All Similar Projects ➔

pdf-toolbox

A collection of tools for processing PDF files

Stable and HEAD

See "stable" branch for Hackage version. The current "master" branch is in a middle of API rewrite, see here for details.

Features

Written in Haskell
Parsing on demand. You don't need to parse or load into memory the entire PDF file just to extract one image
Different levels of abstraction. You can inspect high level (catalog, page tree, pages) or low level (xref, trailer, object) structure of PDF file. You can even switch between levels of details on the fly.
Extremely fast and memory efficient when you need to inspect only part of the document
Resonably fast and memory efficient in general case
Text extraction with exact glyph positions (mostly works, but in progress yet). It can be used e.g. to implement text selection and copying in pdf viewer
Full support of xref streams and object streams
Supports editing of PDF files (incremental updates)
Basic support for PDF file generating
Encrypted PDF documents are partially supported

Still in TODO list

Linearized PDF files
Content stream tools: extract text, images, etc (basic implementation is already included)
Higher level API for incremental updates and PDF generating

Examples

(Also see examples and viewer directories)

Inspect high level structure:

import Pdf.Document

main =
  withPdfFile "input.pdf" $ \pdf ->
    encrypted <- isEncrypted pdf
    when encrypted $ do
      ok <- setUserPassword pdf defaultUserPassword
      unless ok $
        fail "need password"
    doc <- document pdf
    catalog <- documentCatalog doc
    rootNode <- catalogPageNode catalog
    count <- pageNodeNKids rootNode
    print count
    -- the first page of the document
    page <- pageNodePageByNum rootNode 0
    -- extract text
    txt <- pageExtractText page
    print txt
    ...

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 145

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (12) 🔗