All Projects → rocketraman → Sane Scan Pdf

rocketraman / Sane Scan Pdf

Licence: mit
Sane command-line scan-to-pdf script on Linux with OCR and deskew support

Programming Languages

shell
77523 projects

Projects that are alternatives of or similar to Sane Scan Pdf

scanbot-sdk-example-ios
No description or website provided.
Stars: ✭ 17 (-70.69%)
Mutual labels:  ocr, scanner
Document-Scanner-and-OCR
A simple document scanner with OCR implemented using Python and OpenCV
Stars: ✭ 31 (-46.55%)
Mutual labels:  ocr, scanner
Inventory Kamera
Scans Genshin Impact characters, artifacts, and weapons from the game window into a JSON file.
Stars: ✭ 348 (+500%)
Mutual labels:  ocr, scanner
Open Paperless
Scan, index, and archive all of your paper documents (acquired by Mayan EDMS)
Stars: ✭ 2,538 (+4275.86%)
Mutual labels:  ocr, scanner
Passportscanner
Scan the MRZ code of a passport and extract the firstname, lastname, passport number, nationality, date of birth, expiration date and personal numer.
Stars: ✭ 417 (+618.97%)
Mutual labels:  ocr, scanner
Opencv
📷 Computer-Vision Demos
Stars: ✭ 244 (+320.69%)
Mutual labels:  ocr, scanning
docus
Android application for scanning and managing documents.
Stars: ✭ 39 (-32.76%)
Mutual labels:  scanner, scanning
Phonia
Phonia Toolkit is one of the most advanced toolkits to scan phone numbers using only free resources. The goal is to first gather standard information such as country, area, carrier and line type on any international phone numbers with a very good accuracy.
Stars: ✭ 221 (+281.03%)
Mutual labels:  scanner, scanning
Konan
Konan - Advanced Web Application Dir Scanner
Stars: ✭ 412 (+610.34%)
Mutual labels:  scanner, scanning
scanbot-sdk-example-ionic
Scanbot scanner SDK example app for Ionic with Cordova.
Stars: ✭ 24 (-58.62%)
Mutual labels:  ocr, scanner
Paperwork
Personal document manager (Linux/Windows) -- Moved to Gnome's Gitlab
Stars: ✭ 2,392 (+4024.14%)
Mutual labels:  ocr, scanner
Gimagereader
A Gtk/Qt front-end to tesseract-ocr.
Stars: ✭ 786 (+1255.17%)
Mutual labels:  ocr, scanner
Scanbot Sdk Example Android
Document scanning SDK example apps for the Scanbot SDK for Android.
Stars: ✭ 67 (+15.52%)
Mutual labels:  ocr, scanner
CleanSCAN
A simple, smart and efficient document scanner for Android
Stars: ✭ 151 (+160.34%)
Mutual labels:  ocr, scanner
Rengine
reNgine is an automated reconnaissance framework for web applications with a focus on highly configurable streamlined recon process via Engines, recon data correlation and organization, continuous monitoring, backed by a database, and simple yet intuitive User Interface. reNgine makes it easy for penetration testers to gather reconnaissance with…
Stars: ✭ 3,439 (+5829.31%)
Mutual labels:  scanner, scanning
Firebase-HMS-ML-Kit-Scanner-Demo
Various scanner use cases using Firebase or HMS ML-Kit
Stars: ✭ 22 (-62.07%)
Mutual labels:  ocr, scanner
Unimap
Scan only once by IP address and reduce scan times with Nmap for large amounts of data.
Stars: ✭ 141 (+143.1%)
Mutual labels:  scanner, scanning
Php Antimalware Scanner
AMWScan (PHP Antimalware Scanner) is a free tool to scan php files and analyze your project to find any malicious code inside it.
Stars: ✭ 181 (+212.07%)
Mutual labels:  scanner, scanning
core
An advanced and highly optimized Java library to build frameworks: it's useful for scanning class paths, generating classes at runtime, facilitating the use of reflection, scanning the filesystem, executing stringified source code and much more...
Stars: ✭ 100 (+72.41%)
Mutual labels:  scanner, scanning
Phoneinfoga
PhoneInfoga is one of the most advanced tools to scan international phone numbers using only free resources. It allows you to first gather standard information such as country, area, carrier and line type on any international phone number. Then search for footprints on search engines to try to find the VoIP provider or identify the owner.
Stars: ✭ 5,927 (+10118.97%)
Mutual labels:  scanner, scanning

SANE Command-Line Scan to PDF

Sane command-line scanning bash shell script on Linux with OCR and deskew support. The script automates common scan-to-pdf operations for scanners with an automatic document feeder, such as the awesome Fujitsu ScanSnap S1500, with output to PDF files.

Tested and run regularly on Fedora, but should work on other distributions with the requirements below.

Features

  • Join scanned pages into a single output file, or specify a name for each page
  • Deskew (if supported by scanner driver, or software-based via unpaper)
  • Crop (if supported by scanner driver)
  • Creates searchable PDFs (with tesseract)
  • Duplex (if scanner supports it)
  • Specify resolution
  • Truncate n pages explicitly from end of scan e.g. duplex scanning with last page truncated
  • Skip white-only pages automatically (with ImageMagick)
  • Specify page width and height for odd size pages, or common sizes (Letter, Legal, A4)
  • Performance: scanner run in parallel with page post-processing
  • Limit parallel processing for very fast scanners or constrained environments (if sem installed)

Requirements

The following dependencies are requirements of the script. See also Dependencies Installation.

  • bash
  • pnmtops (netpbm-progs)
  • ps2pdf (ghostscript)
  • pdfunite (poppler-utils)
  • units (units)
  • ImageMagick (if --skip-empty-pages or --ocr is used)

Optional

  • unpaper (for software deskew)
  • flock (usually provided by util-linux) (for properly ordered verbose logs)
  • tesseract (to make searchable PDFs)
  • sem (via gnu-parallels, to constrain resource usage during page processing)
  • bc (for whitepage detection percentage calculations)

Getting Started

# scan --help
scan [OPTIONS]... [OUTPUT]

OPTIONS
 -v, --verbose
   Verbose output (this will slow down the scan due to the need to prevent interleaved output)
 -d, --duplex
   Duplex scanning
 -m, --mode
   Mode e.g. Lineart (default), Halftone, Gray, Color, etc.
 -r, --resolution
   Resolution e.g 300 (default)
 -a, --append
   Append output to existing scan
 -e, --max <pages>
   Max number of pages e.g. 2 (default is all pages)
 -t, --truncate <pages>
   Truncate number of pages from end e.g. 1 (default is none)
 -s, --size
   Page Size as type e.g. Letter (default), Legal, A4, no effect if --crop is specified
 -ph, --page-height
   Custom Page Height in mm
 -pw, --page-width
   Custom Page Width in mm
 -x, --device
   Override scanner device name, defaulting to `fujitsu`
 --crop
   Crop to contents (driver must support this)
 --deskew
   Run driver deskew (driver must support this)
 --unpaper
   Run post-processing deskew and black edge detection (requires unpaper)
 --ocr
   Run OCR to make the PDF searchable (requires tesseract and ImageMagick)
 --skip-empty-pages
   remove empty pages from resulting PDF document (e.g. one sided doc in duplex mode)

OUTPUT
 -o, --output <outputfile>
   Output to named file default=scan.pdf
 -l, --outputlist <outputfile-1...outputfile-n> Output to named files for each scanned page, can be used with append

Author

Contributors

Other Useful Software

  • OCRmyPDF - forgot to use the --ocr option at scanning time? use this
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].