All Projects → see-- → kuzushiji-recognition

see-- / kuzushiji-recognition

Licence: MIT license
5th place solution for the Kaggle Kuzushiji Recognition Challenge

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to kuzushiji-recognition

kuzushiji-recognition
Kuzushiji Recognition Kaggle 2019. Build a DL model to transcribe ancient Kuzushiji into contemporary Japanese characters. Opening the door to a thousand years of Japanese culture.
Stars: ✭ 16 (-60.98%)
Mutual labels:  ocr, centernet
MouseTooltipTranslator
chrome extension - When mouse hover on text, it shows translated tooltip using google translate
Stars: ✭ 93 (+126.83%)
Mutual labels:  ocr
CleanSCAN
A simple, smart and efficient document scanner for Android
Stars: ✭ 151 (+268.29%)
Mutual labels:  ocr
BankCard-Recognizer
Identifying numbers from bankcard, based on Deep Learning with Keras [China Software Cup 2019]
Stars: ✭ 74 (+80.49%)
Mutual labels:  ocr
deep-learning-for-document-dewarping
An application of high resolution GANs to dewarp images of perturbed documents
Stars: ✭ 100 (+143.9%)
Mutual labels:  ocr
ocr space
Free Online OCR for Ruby - Convert images to text
Stars: ✭ 62 (+51.22%)
Mutual labels:  ocr
ocreval
Update of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support
Stars: ✭ 48 (+17.07%)
Mutual labels:  ocr
pytorch.ctpn
pytorch, ctpn ,text detection ,ocr,文本检测
Stars: ✭ 123 (+200%)
Mutual labels:  ocr
ReadToMe
No description or website provided.
Stars: ✭ 51 (+24.39%)
Mutual labels:  ocr
Game2Text
Complete toolbox for gamifying language learning
Stars: ✭ 86 (+109.76%)
Mutual labels:  ocr
papermerge-core
Papermerge RESTful backend structured as reusable Django app
Stars: ✭ 103 (+151.22%)
Mutual labels:  ocr
ScribeBot
A highly scriptable automation system full of cool features. Automate everything with a little bit of Lua.
Stars: ✭ 72 (+75.61%)
Mutual labels:  ocr
ID-Card-Passport-Recognition-SDK-Android
On-Device ID Card & Passport & Driver License Recognition SDK for Android
Stars: ✭ 223 (+443.9%)
Mutual labels:  ocr
word2vec-movies
Bag of words meets bags of popcorn in Python 3 中文教程
Stars: ✭ 54 (+31.71%)
Mutual labels:  kaggle-competition
ocr-machine-learning
OCR Machine Learning in python
Stars: ✭ 42 (+2.44%)
Mutual labels:  ocr
awesome-document-understanding
A curated list of resources for Document Understanding (DU) topic
Stars: ✭ 620 (+1412.2%)
Mutual labels:  ocr
AgentOCR
一个多语言支持、易使用的 OCR 项目。An easy-to-use OCR project with multilingual support.
Stars: ✭ 98 (+139.02%)
Mutual labels:  ocr
pmOCR
A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR conversion on file activity
Stars: ✭ 53 (+29.27%)
Mutual labels:  ocr
Quora QuestionPairs DL
Kaggle Competition: Using deep learning to solve quora's question pairs problem
Stars: ✭ 54 (+31.71%)
Mutual labels:  kaggle-competition
deep-license-plate-recognition
Automatic License Plate Recognition (ALPR) or Automatic Number Plate Recognition (ANPR) software that works with any camera.
Stars: ✭ 309 (+653.66%)
Mutual labels:  ocr

Overview

Code for the Kaggle Kuzushiji Recognition Challenge. My team finished as 5th with a F1-score of 0.94. The challenge was to develop better algorithms for Kuzushiji recognition.

Setup

The training data can be downloaded from here: https://www.kaggle.com/c/kuzushiji-recognition/data

or using the Kaggle API:

kaggle competitions download -c kuzushiji-recognition

Requirements:

pip install --upgrade tqdm opencv-python==3.4.5.20 timm --user
pip install --upgrade git+https://github.com/albu/albumentations --user

Install apex for your system setup as explained here: https://github.com/NVIDIA/apex This is optional if you just want to run inference.

$ ls | grep images
>>> test_images
>>> train_images

Training

$ python train.py

This will produce the weights Logdir_038_f00/f00-ep-0125-val_hm_acc-0.9944-val_classes_acc-0.4986.pth. Training takes ~8 hours on a single V100 GPU with mixed precision training. Alternatively, the weights can be downloaded from here. Just put them into Logdir_038_f00.

Submission

$ python submit.py

This will produce the submission file Logdir_038_f00/f00-TTA-V7-PREDS-p0.40-.csv.

Visualization

You can visualize the predictions via: python vis_submission.py --fn ../Kuzushiji/Logdir_038_f00/f00-TTA-V7-PREDS-p0.40-.csv.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].