
lizadaly / Blackout

Licence: unlicense
NaNoGenMo 2016 entry #2



█ Blackout █

For NaNoGenMo 2016.

Blackout generates pages of text from book or newspaper scans in the style of Newspaper Blackout Poetry, popularized by Austin Kleon (related to work like A Humument by Tom Phillips).

Blackout does the following:

  1. Take as input an image of text from a newspaper or book.
  2. Run OCR against the image, identifying the words and their bounding boxes.
  3. Feed the extracted text into a natural language parser, categorizing each part of speech.
  4. Given one of many randomly selected Tracery grammars, select words from the current page that match the parts of speech of that grammar.
  5. Draw around those words and "scribble" out all other text on the page image.
  6. Output the final page as a new image.
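
Steps 4 and 5 can be illustrated with a minimal sketch. It assumes the OCR and part-of-speech steps have already produced a list of tagged words with bounding boxes, and it replaces the Tracery grammar with a fixed list of part-of-speech tags. The `Word` type, the tag names, the greedy first-match strategy, and both function names are hypothetical and are not taken from the Blackout source.

```python
from typing import List, NamedTuple, Optional, Tuple


class Word(NamedTuple):
    text: str
    pos: str                        # part-of-speech tag, e.g. "NOUN" (assumed tag set)
    box: Tuple[int, int, int, int]  # (left, top, right, bottom) in pixels


def select_indices(words: List[Word], pattern: List[str]) -> Optional[List[int]]:
    """Greedily pick, in reading order, one word index per tag in `pattern`.

    Returns None when the page cannot satisfy the whole pattern.
    """
    chosen: List[int] = []
    start = 0
    for tag in pattern:
        for i in range(start, len(words)):
            if words[i].pos == tag:
                chosen.append(i)
                start = i + 1  # later tags must match later words
                break
        else:
            return None  # no remaining word carries this tag
    return chosen


def boxes_to_scribble(words: List[Word], kept: List[int]) -> List[Tuple[int, int, int, int]]:
    """Return the bounding boxes of every word that was not selected."""
    kept_set = set(kept)
    return [w.box for i, w in enumerate(words) if i not in kept_set]


# Hypothetical page: six words on one line, already OCR'd and tagged.
page = [
    Word("The", "DET", (0, 0, 30, 10)),
    Word("days", "NOUN", (35, 0, 70, 10)),
    Word("left", "VERB", (75, 0, 105, 10)),
    Word("long", "ADJ", (110, 0, 140, 10)),
    Word("and", "CONJ", (145, 0, 170, 10)),
    Word("water", "NOUN", (175, 0, 215, 10)),
]

kept = select_indices(page, ["DET", "NOUN", "VERB", "NOUN"])
poem = " ".join(page[i].text for i in kept)
hidden = boxes_to_scribble(page, kept)
```

Here the kept words read "The days left water", and the two remaining boxes ("long" and "and") are the ones a drawing step would scribble over. A page that cannot satisfy the pattern returns `None`, so a caller could try a different pattern.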

Pen width, line length, line direction, number of strokes, and stroke opacity are all randomly fuzzed. The pen color is almost always black; in rare cases it is blood red.
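
As a rough illustration of this kind of fuzzing, the sketch below draws random stroke parameters for one scribbled-out word. Every numeric range, the `red_chance` probability, the RGB value used for blood red, and the `Stroke` and `random_strokes` names are assumptions made for the example; the README does not state the values Blackout uses.

```python
import random
from dataclasses import dataclass
from typing import List, Tuple

BLACK = (0, 0, 0)
BLOOD_RED = (138, 3, 3)  # assumed RGB value for "blood red"


@dataclass
class Stroke:
    width: int                   # pen width in pixels
    length: float                # stroke length as a fraction of the word's box width
    direction: int               # +1 for left-to-right, -1 for right-to-left
    opacity: int                 # alpha value, 0 (transparent) to 255 (opaque)
    color: Tuple[int, int, int]  # RGB pen color


def random_strokes(rng: random.Random, red_chance: float = 0.02) -> List[Stroke]:
    """Return a randomly sized list of randomly fuzzed strokes for one word."""
    # One pen color per word: black unless the rare red case is drawn.
    color = BLOOD_RED if rng.random() < red_chance else BLACK
    count = rng.randint(2, 6)  # number of strokes (assumed range)
    return [
        Stroke(
            width=rng.randint(2, 8),
            length=rng.uniform(0.9, 1.2),
            direction=rng.choice((-1, 1)),
            opacity=rng.randint(160, 255),
            color=color,
        )
        for _ in range(count)
    ]


strokes = random_strokes(random.Random(2016))
```

Passing a seeded `random.Random` keeps a page reproducible, which can help when regenerating a page that was hand-picked from many random candidates.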

Blackout has produced one creative work:

"The Days Left Forebodings and Water"

The source material is A Vindication of the Rights of Woman by Mary Wollstonecraft (1792).

Read The Days Left Forebodings and Water. It is 45 pages long and consists of entries that were generated randomly, then hand-picked and ordered on November 9, 2016.

(The full NaNoGenMo entry of ~50,000 words was a 9.3 GB PDF of nearly 10,000 pages. If for some reason you want it, just ask.)
