bronson / pdfdir

Licence: other
Utilities to operate on lots of PDF files

Programming Languages

shell

Projects that are alternatives of or similar to pdfdir

Pdf
Rust library to read, manipulate and write PDF files.
Stars: ✭ 265 (+1104.55%)
Mutual labels:  pdf-files
Pybooks
python books
Stars: ✭ 87 (+295.45%)
Mutual labels:  pdf-files
pdfbox
📄◻️ Create, Manipulate and Extract Data from PDF Files (R Apache PDFBox wrapper)
Stars: ✭ 46 (+109.09%)
Mutual labels:  pdf-files
Pdftools
Text Extraction, Rendering and Converting of PDF Documents
Stars: ✭ 349 (+1486.36%)
Mutual labels:  pdf-files
Unidoc
This repository has moved! https://github.com/unidoc/unipdf
Stars: ✭ 694 (+3054.55%)
Mutual labels:  pdf-files
Docnet
DocNET is a fast PDF editing and reading library for modern .NET applications
Stars: ✭ 128 (+481.82%)
Mutual labels:  pdf-files
PDF-Bot
A bot for PDF for doing Many Things....
Stars: ✭ 38 (+72.73%)
Mutual labels:  pdf-files
pdftricks
A simple, efficient application for small manipulations in PDF files using Ghostscript.
Stars: ✭ 69 (+213.64%)
Mutual labels:  pdf-files
Pdfio.jl
PDF Reader Library for Native Julia.
Stars: ✭ 56 (+154.55%)
Mutual labels:  pdf-files
Traprange
(Java)A Method to Extract Tabular Content from PDF Files
Stars: ✭ 236 (+972.73%)
Mutual labels:  pdf-files
Pdfpig
Read and extract text and other content from PDFs in C# (port of PdfBox)
Stars: ✭ 391 (+1677.27%)
Mutual labels:  pdf-files
Images To Pdf
An app to convert images to PDF file!
Stars: ✭ 602 (+2636.36%)
Mutual labels:  pdf-files
Pdfcompare
A simple Java library to compare two PDF files
Stars: ✭ 128 (+481.82%)
Mutual labels:  pdf-files
Hummusrecipe
A powerful PDF tool for NodeJS based on HummusJS.
Stars: ✭ 274 (+1145.45%)
Mutual labels:  pdf-files
ByteScout-SDK-SourceCode
ALL source code samples for ByteScout SDKs and Web API products.
Stars: ✭ 24 (+9.09%)
Mutual labels:  pdf-files
Boxable
Boxable is a library that can be used to easily create tables in pdf documents.
Stars: ✭ 253 (+1050%)
Mutual labels:  pdf-files
Htmldoc
HTML Conversion Software
Stars: ✭ 99 (+350%)
Mutual labels:  pdf-files
zowie
Adds Zotero "select" links to attachment files in a Zotero database on macOS, so that outside of Zotero, you can find the bibliographic entry to which a file belongs. (Only works for local storage, not linked attachments.)
Stars: ✭ 71 (+222.73%)
Mutual labels:  pdf-files
Android-KeepLearning
Just for learning android well.
Stars: ✭ 23 (+4.55%)
Mutual labels:  pdf-files
Pdfcpu
A PDF processor written in Go.
Stars: ✭ 2,852 (+12863.64%)
Mutual labels:  pdf-files

pdfdir

Turns a directory tree of PDFs into a single bookmarked PDF. Automatically handles the table of contents.

Tested on Linux and Mac.

Usage

If you arrange your PDF files in folders like this:

book/01-Table of Contents.pdf
book/02-First Generation/01-Mary Cunningham.pdf
book/02-First Generation/02-Peter Cunningham.pdf
book/02-First Generation/02-:more-notes.pdf
book/03-Second Generation/01-John Mendell Cunningham.pdf
book/99-Index.pdf

and run:

$ pdfdir-join book

you will find the result in "book.pdf".

The PDF's table of contents will be automatically generated from the filenames:

Table of Contents
First Generation
  Mary Cunningham
  Peter Cunningham
Second Generation
  John Mendell Cunningham
Index

The 01-, 02- prefixes determine the order of the chapters in the final book and don't appear in the bookmarks.

If you don't want a file to be added to the TOC, adding a : to the beginning of its filename will suppress it (02-:more-notes.pdf above).
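The naming rules above can be sketched in a few lines of shell. This is only an illustration, not code taken from pdfdir: the function name toc_title is hypothetical, and it assumes the ordering prefix is always one or more digits followed by a dash.

```shell
# Hypothetical helper (not part of pdfdir): derive the bookmark title for
# one file or folder name, following the naming rules described above.
# Prints the title, or prints nothing and returns 1 if the entry is
# suppressed with a leading ":".
toc_title() {
  # Drop the directory part and a trailing .pdf extension, if any.
  name=$(basename "$1" .pdf)
  # Drop the ordering prefix, assumed to be digits followed by "-".
  name=$(printf '%s\n' "$name" | sed 's/^[0-9][0-9]*-//')
  # A leading ":" keeps the entry out of the table of contents.
  case $name in
    :*) return 1 ;;
  esac
  printf '%s\n' "$name"
}

# Examples:
#   toc_title "book/02-First Generation/01-Mary Cunningham.pdf"  # Mary Cunningham
#   toc_title "book/02-First Generation"                         # First Generation
#   toc_title "book/02-First Generation/02-:more-notes.pdf"      # (nothing, status 1)
```

Returning a non-zero status for suppressed entries lets a caller skip them with a plain `toc_title "$f" || continue`.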

Prerequisites

macOS:

$ brew install ghostscript

Linux:

$ apt-get install ghostscript

Ruby is also required. Hopefully this is temporary.

Verify PDFs

This package also includes some tools to help assemble the input files. This will find corrupt PDFs:

$ pdfdir-verify book

It uses Ghostscript to carefully process every page of every PDF file. This is awfully slow. You can specify --quick for a 10X speedup at the risk of missing some obscure corruptions.
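If you want to see roughly what such a check involves, here is a minimal sketch. It is not how pdfdir-verify is implemented: the function names and the CHECK_CMD variable are hypothetical, and it assumes that a non-zero exit status from Ghostscript means the file is damaged. Ghostscript may only print warnings for some kinds of corruption and still exit with 0, so treat this as a coarse filter.

```shell
# Assumption: a PDF is "ok" if Ghostscript can interpret every page and
# send it to the nullpage device (which discards output) without failing.
check_pdf() {
  gs -q -dNOPAUSE -dBATCH -sDEVICE=nullpage "$1" >/dev/null 2>&1
}

# Print the path of every PDF under directory "$1" that fails the check.
# CHECK_CMD (hypothetical) lets you swap in another checker; it defaults
# to check_pdf above.
find_corrupt_pdfs() {
  find "$1" -type f -name '*.pdf' | sort | while IFS= read -r pdf; do
    # Redirect stdin so the checker cannot swallow the list of paths.
    "${CHECK_CMD:-check_pdf}" "$pdf" </dev/null || printf '%s\n' "$pdf"
  done
}
```

Calling `find_corrupt_pdfs book` would then list the files worth a closer look.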

Re-encode PDFs

If you're having trouble with encrypted or corrupt PDFs, try using pdfdir-copy to duplicate your entire directory structure. It takes a while but, because it re-encodes each PDF, the result is sure to be valid.

$ pdfdir-copy book /tmp/book-fixed
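
For the curious, the general idea of re-encoding a tree can be sketched as follows. This is an assumption about the approach, not the actual pdfdir-copy source: the function names and the REENCODE_CMD variable are hypothetical, and the sketch simply passes each file through Ghostscript's pdfwrite device while mirroring the folder layout.

```shell
# Re-encode one PDF by running it through Ghostscript's pdfwrite device.
# Usage: reencode_pdf INPUT OUTPUT
reencode_pdf() {
  gs -q -dNOPAUSE -dBATCH -sDEVICE=pdfwrite -sOutputFile="$2" "$1"
}

# Mirror every PDF under SRC into DEST, keeping the relative paths.
# REENCODE_CMD (hypothetical) lets you substitute another converter;
# it defaults to reencode_pdf above.
# Usage: copy_tree SRC DEST
copy_tree() {
  src=${1%/}
  dest=${2%/}
  find "$src" -type f -name '*.pdf' | while IFS= read -r pdf; do
    rel=${pdf#"$src"/}                    # path relative to SRC
    mkdir -p "$dest/$(dirname "$rel")"    # recreate the folder structure
    # Redirect stdin so the converter cannot swallow the list of paths.
    "${REENCODE_CMD:-reencode_pdf}" "$pdf" "$dest/$rel" </dev/null
  done
}
```

With these definitions, `copy_tree book /tmp/book-fixed` would write a re-encoded copy of each file under /tmp/book-fixed.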