Pykaldi: A Python wrapper for Kaldi
Stars: ✭ 756 (-9.24%)
Rhino: On-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (-51.26%)
Cheetah: On-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (-54.02%)
Voice Overlay Ios: 🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (-47.18%)
Wenet: Production First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (-25.93%)
Ctcwordbeamsearch: Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Stars: ✭ 398 (-52.22%)
Wav2letter: Facebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 5,907 (+609.12%)
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (-37.33%)
Espnet: End-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+444.18%)
Deltapy: DeltaPy - Tabular Data Augmentation (by @firmai)
Stars: ✭ 344 (-58.7%)
Libreasr: 💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (-24.01%)
Asrt speechrecognition: A Deep-Learning-Based Chinese Speech Recognition System
Stars: ✭ 4,943 (+493.4%)
Annyang: 💬 Speech recognition for your site
Stars: ✭ 6,216 (+646.22%)
Hctsa: Highly comparative time-series analysis
Stars: ✭ 406 (-51.26%)
Athena: An open-source implementation of a sequence-to-sequence based speech processing engine
Stars: ✭ 542 (-34.93%)
Awesome Kaldi: This is a list of features, scripts, blogs and resources for better using Kaldi (http://kaldi-asr.org/)
Stars: ✭ 393 (-52.82%)
Stephanie Va: Stephanie is an open-source platform built specifically for voice-controlled applications and for automating daily tasks, imitating much of a virtual assistant's work.
Stars: ✭ 772 (-7.32%)
Subsync: Subtitle Speech Synchronizer
Stars: ✭ 379 (-54.5%)
Ctcdecoder: Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Stars: ✭ 529 (-36.49%)
Libfaceid: libfaceid is a research framework for prototyping face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models with speech synthesis and speech recognition.
Stars: ✭ 354 (-57.5%)
Efficientnet Pytorch: A PyTorch implementation of EfficientNet and EfficientNetV2 (coming soon!)
Stars: ✭ 6,685 (+702.52%)
Java Speech Api: The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-41.18%)
Deepspeech: DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+2142.5%)
Medpy: Medical image processing in Python
Stars: ✭ 321 (-61.46%)
Rhasspy: Offline private voice assistant for many human languages
Stars: ✭ 458 (-45.02%)
Speech Emotion Analyzer: The neural network model is capable of detecting five different male/female emotions from speech audio. (Deep Learning, NLP, Python)
Stars: ✭ 633 (-24.01%)
Uspeech: Speech recognition toolkit for the Arduino
Stars: ✭ 448 (-46.22%)
Eesen: The official repository of the Eesen project
Stars: ✭ 738 (-11.4%)
Awesome Feature Engineering: A curated list of resources dedicated to Feature Engineering Techniques for Machine Learning
Stars: ✭ 433 (-48.02%)
Vad: Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (-25.33%)
Specaugment: An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain
Stars: ✭ 408 (-51.02%)
Meyda: Audio feature extraction for JavaScript.
Stars: ✭ 792 (-4.92%)
Neural sp: End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (-51.02%)
Pyradiomics: Open-source Python package for the extraction of Radiomics features from 2D and 3D images and binary masks. Support: https://discourse.slicer.org/c/community/radiomics
Stars: ✭ 563 (-32.41%)
Tensorflowasr: ⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in TensorFlow 2. Supports languages that can use characters or subwords
Stars: ✭ 400 (-51.98%)
Adapt: Adapt Intent Parser
Stars: ✭ 690 (-17.17%)
Feature Selection: Feature selector based on a self-selected algorithm, loss function and validation method
Stars: ✭ 534 (-35.89%)
Nmtpytorch: Sequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (-52.94%)
Kur: Descriptive Deep Learning
Stars: ✭ 811 (-2.64%)
Zamia Speech: Open tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (-55.1%)
Sonus: 💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (-36.13%)
Alan Sdk Web: Alan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.
Stars: ✭ 368 (-55.82%)
Awesome Diarization: A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (-19.21%)
Brevitas: quantization-aware training in PyTorch
Stars: ✭ 343 (-58.82%)
Sincnet: SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (-8.28%)
Fcgf: Fully Convolutional Geometric Features: Fast and accurate 3D features for registration and correspondence.
Stars: ✭ 328 (-60.62%)
Machinelearnjs: Machine Learning library for the web and Node.
Stars: ✭ 498 (-40.22%)
J.a.r.v.i.s: Python-powered intelligent system
Stars: ✭ 325 (-60.98%)
Speech recognition: Speech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+620.17%)
Tfidf: Simple TF IDF Library
Stars: ✭ 6 (-99.28%)
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (-3%)
Mycroft Precise: A lightweight, simple-to-use, RNN wake word listener
Stars: ✭ 481 (-42.26%)
Alan Sdk Ios: Alan AI iOS SDK adds a voice assistant or chatbot to your app. Supports Swift, Objective-C.
Stars: ✭ 318 (-61.82%)
Tsfresh: Automatic extraction of relevant features from time series
Stars: ✭ 6,077 (+629.53%)