dimi2 / DyAnnotationExtractor

License: Apache-2.0

Programming Languages

java

DyAnnotationExtractor

DyAnnotationExtractor is software for extracting annotations (highlighted text and comments) from e-documents such as PDF files. The extracted parts can be used to build a summary of the document.

Usage

Imagine you have an e-book (PDF) that is 100 pages long. While reading it, you highlight the important parts in your favorite reader.

Then use the DyAnnotationExtractor tool to get just the highlighted parts.

On the command line, execute the following command.
For Windows:

DyAnnotationExtractor -input "Getting Started with Ubuntu 16.04.pdf"

For Linux:

./DyAnnotationExtractor.sh -input "Getting Started with Ubuntu 16.04.pdf"

This creates a file with the same name in the same directory, with an added '.md' suffix.
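
The naming rule above can be sketched in a few lines of Java. This is only an illustration, not code from the project: the class and method names are hypothetical, and it assumes the '.md' suffix is appended to the complete input file name (so the '.pdf' part is kept).

```java
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper, not part of the DyAnnotationExtractor sources.
final class OutputPathSketch {

    // Assumption: the '.md' suffix is appended to the full input file name,
    // so "book.pdf" maps to "book.pdf.md" in the same directory.
    static Path markdownPathFor(Path input) {
        String fileName = input.getFileName().toString();
        return input.resolveSibling(fileName + ".md");
    }

    public static void main(String[] args) {
        Path input = Paths.get("Getting Started with Ubuntu 16.04.pdf");
        // Prints the path where the extracted annotations would be expected.
        System.out.println(markdownPathFor(input));
    }
}
```

If the tool replaces the extension instead of appending to it, only the string concatenation in `markdownPathFor` would need to change.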

Now you have an extract of the book that is 5-6 pages long instead of 100, so you can skim just the exported text instead of re-reading the entire book.

Supported Input Formats

  • PDF (Portable Document Format)

Supported Output Formats

  • MD (Markdown)

Requirements

  • Java 8+.
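
To confirm that the installed runtime meets this requirement, a small check along the following lines may help. It is a sketch only: the class and method names are hypothetical and are not part of DyAnnotationExtractor. It relies on the standard `java.specification.version` system property, which reports "1.8" on Java 8 and a plain number such as "11" or "17" on later releases.

```java
// Hypothetical pre-flight check, not taken from the DyAnnotationExtractor sources.
final class JavaVersionCheckSketch {

    // Converts a "java.specification.version" value to a major version number.
    // Java 8 and older report "1.x" (for example "1.8"); Java 9 and newer report "9", "11", ...
    static int majorVersion(String specVersion) {
        String[] parts = specVersion.split("\\.");
        if (parts[0].equals("1") && parts.length > 1) {
            return Integer.parseInt(parts[1]);
        }
        return Integer.parseInt(parts[0]);
    }

    // True when the given specification version is Java 8 or newer.
    static boolean meetsRequirement(String specVersion) {
        return majorVersion(specVersion) >= 8;
    }

    public static void main(String[] args) {
        String running = System.getProperty("java.specification.version");
        System.out.println("Java " + running + " supported: " + meetsRequirement(running));
    }
}
```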

Download

Get the latest release.

The release provides separate files for the distribution, the binary, and the sources.
End users need to download only the distribution.

Installation

Extract the downloaded archive into a local directory.
Then run the provided 'DyAnnotationExtractor' script to perform the extraction.

Build

To build the project from sources, you will need the Gradle build tool. Go to the project home directory (PROJ_HOME) and execute the command:

gradle

The result will appear in the directory PROJ_HOME/build/distributions.

Dependencies

  • iTextPdf 7.1.2+ (PDF handling library)