All Projects → devforth → imagetotext.app

devforth / imagetotext.app

Licence: other
Copy text from the raster images online

Programming Languages

HTML
75241 projects
python
139335 projects - #7 most used programming language
shell
77523 projects
Dockerfile
14818 projects

Projects that are alternatives of or similar to imagetotext.app

Sod
An Embedded Computer Vision & Machine Learning Library (CPU Optimized & IoT Capable)
Stars: ✭ 1,460 (+3073.91%)
Mutual labels:  image-recognition, image-analysis
ocr
Simple app to extract text from pictures using Tesseract
Stars: ✭ 98 (+113.04%)
Mutual labels:  tesseract, image-recognition
image space
Interactive Image similarity and Visual Search and Retrieval application
Stars: ✭ 91 (+97.83%)
Mutual labels:  image-recognition, image-analysis
Image Similarity
计算图片之间的相似度
Stars: ✭ 292 (+534.78%)
Mutual labels:  image-recognition, image-analysis
Recogcaptcha
Técnica de reconhecimento de captcha utilizando o framework AForge.NET e o Tesseract
Stars: ✭ 29 (-36.96%)
Mutual labels:  tesseract, image-recognition
Python Computer Vision from Scratch
This repository explores the variety of techniques commonly used to analyze and interpret images. It also describes challenging real-world applications where vision is being successfully used, both for specialized applications such as medical imaging, and for fun, consumer-level tasks such as image editing and stitching, which students can apply…
Stars: ✭ 219 (+376.09%)
Mutual labels:  image-recognition, image-analysis
deforestation
A machine learning exercise, using KNN to classify deforested areas
Stars: ✭ 26 (-43.48%)
Mutual labels:  image-recognition, image-analysis
phoenix
phoenix is a small command line image forensics tool
Stars: ✭ 39 (-15.22%)
Mutual labels:  image-analysis
Iron-OCR-Image-to-Text-in-CSharp
Image to Text Tutorial in C# - See https://ironsoftware.com/csharp/ocr/tutorials/how-to-read-text-from-an-image-in-csharp-net/
Stars: ✭ 65 (+41.3%)
Mutual labels:  imagetotext
Symbol
Symbol .net library
Stars: ✭ 14 (-69.57%)
Mutual labels:  image-recognition
pyslide
Digital Pathology Whole Slide Image Analysis Toolbox
Stars: ✭ 38 (-17.39%)
Mutual labels:  image-analysis
Reproducibilty-Challenge-ECANET
Unofficial Implementation of ECANets (CVPR 2020) for the Reproducibility Challenge 2020.
Stars: ✭ 27 (-41.3%)
Mutual labels:  image-recognition
hierse
Zero-shot image tagging by hierarchical semantic embedding
Stars: ✭ 77 (+67.39%)
Mutual labels:  image-recognition
face-attendence
Face Attendance (AWS rekognition)
Stars: ✭ 39 (-15.22%)
Mutual labels:  image-recognition
image-checker
Provides image optimisation information within the browser
Stars: ✭ 14 (-69.57%)
Mutual labels:  image-analysis
PlayerDetection
Player detection and ball detection in football matches using image processing(opencv).
Stars: ✭ 50 (+8.7%)
Mutual labels:  image-recognition
BoneJ2
Plugins for bone image analysis
Stars: ✭ 17 (-63.04%)
Mutual labels:  image-analysis
matImage
Image Processing library for Matlab
Stars: ✭ 75 (+63.04%)
Mutual labels:  image-analysis
object-flaw-detector-python
Detect various irregularities of a product as it moves along a conveyor belt.
Stars: ✭ 17 (-63.04%)
Mutual labels:  image-recognition
amazon-rekognition-engagement-meter
The Engagement Meter calculates and shows engagement levels of an audience participating in a meeting
Stars: ✭ 49 (+6.52%)
Mutual labels:  image-analysis

Deployed application: imagetotext.app/

Text extraction is based on hint: How to extract text from image in Python If you have any questions, please ask in comments under hint.

How to self-host it

This repo holds a Dockerfile with all needed (Correct version of Python and all dependencies which are managed via Pipenv).

Example of instantiation, e.g. Docker Compose:


services:
  imagetotext:
    network_mode: host
    build: <path_to_this_repo>
    command: /bin/bash -c "cd /code/ && pipenv run uvicorn main:app --reload --host 0.0.0.0 --port 8314 --workers 6"
    restart: always

Then you can use http://<ip_of_server>:8314 or use any proxy server to route requests to port 8314, e.g. Nginx:

# imagetotext.app
server {
    server_name imagetotext.app;
    listen 80;
    
    listen 443 ssl;
    ssl_certificate /etc/ssl/cert/$CERT_NAME.crt;
    ssl_certificate_key /etc/ssl/cert/$CERT_NAME.key;

    charset utf-8;
    client_max_body_size 75M;

    location / {
        proxy_set_header Authorization "";
        proxy_connect_timeout 400s;
        proxy_read_timeout 400s;
        proxy_set_header X-Forwarded-For $$proxy_add_x_forwarded_for;
        proxy_set_header Host $$http_host;
        proxy_redirect off;
        
        proxy_buffers 128 8k; 
        proxy_buffer_size 16k;

        proxy_pass http://127.0.0.1:8314;

        add_header Last-Modified $$date_gmt;
        etag off;
    }
}

Run dev autorelaod server locally

Tested on Ubuntu 20.04 (Native and WSL 2):

1. Install deps

apt install tesseract-ocr libtesseract-dev libleptonica-dev pkg-config

Do this before pipenv sync

Tesseract should have version 4.1.1+.

2. Then enter a repo dir and do:

pipenv sync

3. Run app:

pipenv run uvicorn main:app --reload

Testing and troubleshooting

👷‍♂️ To fix any exception produced by python code use fixexception.com – works very well 💪

curl -d '{"base64":"baeldung"}' -H 'Content-Type: application/json' http://127.0.0.1:8000/upload/

Or paste same JSON in testreq.json file and run:

curl -d @testreq.json -H 'Content-Type: application/json' http://127.0.0.1:8000/upload/

To test only tesseract on your computer:

tesseract `absolute path to any image with text (/home/ykorolikhin/Pictures/test_text.png)` stdout 
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].