samSAM: Software Automatic Mouth (Ported from https://github.com/vidarh/SAM)
Stars: ✭ 33 (+6.45%)
Caffe OneclickUse caffe to train your own data in just one click
Stars: ✭ 187 (+503.23%)
TalkifyJavascript Text to speech library
Stars: ✭ 132 (+325.81%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (+67.74%)
SwiftytesseractA Swift wrapper around Tesseract for use in iOS, macOS, and Linux applications
Stars: ✭ 170 (+448.39%)
Nlp Pretrained ModelA collection of Natural language processing pre-trained models.
Stars: ✭ 122 (+293.55%)
Ocr TableExtract tables from scanned image PDFs using Optical Character Recognition.
Stars: ✭ 165 (+432.26%)
erpnext ocr🐍 ⚗️ Optical Character Recognition using tesseract within Frappe.
Stars: ✭ 58 (+87.1%)
awesome-vacuumA curated list of free and open source software and hardware to build and control a robot vacuum.
Stars: ✭ 187 (+503.23%)
persian-tts🔊 A simple human-based text-to-speach synthesiser and ReactNative app for Persian language.
Stars: ✭ 18 (-41.94%)
Articulate.jsA jQuery plugin that lets the browser speak to you.
Stars: ✭ 116 (+274.19%)
PdftabextractA set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
Stars: ✭ 1,969 (+6251.61%)
Lambda Text ExtractorAWS Lambda functions to extract text from various binary formats.
Stars: ✭ 159 (+412.9%)
CrystalCrystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
Stars: ✭ 108 (+248.39%)
Tools Ocr树洞 OCR 文字识别(一款跨平台的 OCR 小工具)
Stars: ✭ 2,303 (+7329.03%)
OcrtableRecognize tables and text from scanned images that contain tables. 从包含表格的扫描图片中识别表格和文字
Stars: ✭ 155 (+400%)
WavernnWaveRNN Vocoder + TTS
Stars: ✭ 1,636 (+5177.42%)
Tesseract MacosObjective C wrapper for the open source OCR Engine Tesseract (macOS)
Stars: ✭ 154 (+396.77%)
CleanSCANA simple, smart and efficient document scanner for Android
Stars: ✭ 151 (+387.1%)
CrnnConvolutional Recurrent Neural Network (CRNN) for image-based sequence recognition.
Stars: ✭ 1,901 (+6032.26%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (+232.26%)
FFTNetFFTNet: a Real-Time Speaker-Dependent Neural Vocoder
Stars: ✭ 63 (+103.23%)
Stb TesterAutomated Testing for Set-Top Boxes and Smart TVs
Stars: ✭ 148 (+377.42%)
Openseq2seqToolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+4345.16%)
talkieText-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.
Stars: ✭ 43 (+38.71%)
ocrevalUpdate of the ISRI Analytic Tools for OCR Evaluation with UTF-8 support
Stars: ✭ 48 (+54.84%)
sawyer robotSawyer-specific components for the Sawyer robot for use with the intera_sdk.
Stars: ✭ 39 (+25.81%)
Expressive-FastSpeech2PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Stars: ✭ 139 (+348.39%)
pmOCRA wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR conversion on file activity
Stars: ✭ 53 (+70.97%)
IMS-ToucanText-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+851.61%)
JoytanCreative Audio/Textbook Maker 🎵 📖 See our YouTube channel
Stars: ✭ 91 (+193.55%)
Craft kerasKeras implementation of Character Region Awareness for Text Detection (CRAFT)
Stars: ✭ 143 (+361.29%)
Ambar🔍 Ambar: Document Search Engine
Stars: ✭ 1,829 (+5800%)
LprAndroid 车牌识别--OCR
Stars: ✭ 139 (+348.39%)
uniquifyUniquify is a Telegram bot interface used to remove duplicate media files from a chat
Stars: ✭ 45 (+45.16%)
EasyocrReady-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Stars: ✭ 13,379 (+43058.06%)
Bvae TtsOfficial implementation of BVAE-TTS
Stars: ✭ 85 (+174.19%)
dynamixel-workbenchROS packages for Dynamixel controllers, msgs, single_manager, toolbox, tutorials
Stars: ✭ 91 (+193.55%)
WatbotAn Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (+106.45%)
RobinRObust document image BINarization
Stars: ✭ 131 (+322.58%)
ni-translateA translator for Linux, running at the background which wakes up with the translation of the last selected text on command.
Stars: ✭ 82 (+164.52%)
VoicenetSpeech synthesis platform based on tensorflow and sonnet
Stars: ✭ 60 (+93.55%)
wb-toolboxSimulink toolbox to rapidly prototype robot controllers
Stars: ✭ 20 (-35.48%)
ttslearnttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (+409.68%)
ldraw2stlConvert LEGO LDraw files into STL
Stars: ✭ 76 (+145.16%)
AthenaA free and open source replacement for Google Assistant on Android devices, meant to integrate with the Sapphire Framework. It contains both speech-to-text and text-to-speech services. It does not require Google services or network connectivity
Stars: ✭ 73 (+135.48%)
LVCNetLVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
Stars: ✭ 67 (+116.13%)
mlp-singerOfficial implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis (IEEE MLSP 2021)
Stars: ✭ 103 (+232.26%)
tts📝 🔉 A simple text-to-speech tool. Converts your text to speech with any of Streamlab's voices. Frontend built with GatsbyJS, backend is serverless Node.js
Stars: ✭ 133 (+329.03%)