pyjsgfJSpeech Grammar Format (JSGF) compiler, matcher and parser package for Python.
Stars: ✭ 40 (-96.72%)
NmtpytorchSequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (-67.84%)
mixupspeechpro.com/
Stars: ✭ 23 (-98.11%)
Assistant ClientИнструмент для тестирования и отладки СanvasApps c семейством Виртуальных Ассистентов "Салют"
Stars: ✭ 26 (-97.87%)
KARENKAREN: Unifying Hatespeech Detection and Benchmarking
Stars: ✭ 18 (-98.52%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (-97.05%)
Vosk Android DemoOffline speech recognition for Android with Vosk library.
Stars: ✭ 271 (-77.77%)
quran-alignWord-accurate timestamps for Qur'anic audio.
Stars: ✭ 139 (-88.6%)
PocketsphinxPocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop
Stars: ✭ 2,934 (+140.69%)
Formant AnalyzeriOS application for finding formants in spoken sounds
Stars: ✭ 43 (-96.47%)
Voice SynthesisThis repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generalize to new voices.
Stars: ✭ 51 (-95.82%)
Wavenet SttAn end-to-end speech recognition system with Wavenet. Built using C++ and python.
Stars: ✭ 18 (-98.52%)
Zamia SpeechOpen tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (-69.32%)
opensnipsOpen source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (-95.9%)
rustfstRust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (-91.47%)
Speech Alignerspeech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
Stars: ✭ 259 (-78.75%)
PhomemeSimple sentence mixing tool (work in progress)
Stars: ✭ 18 (-98.52%)
Libreasr💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (-48.07%)
SubsyncSubtitle Speech Synchronizer
Stars: ✭ 379 (-68.91%)
pocketsphinxUpdated ROS bindings to pocketsphinx
Stars: ✭ 36 (-97.05%)
Noise2Noise-audio denoising without clean training dataSource code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the problem of the heavy dependence of clean speech data required by deep learning based audio denoising methods by showing that it is possible to train deep speech denoisi…
Stars: ✭ 49 (-95.98%)
nlp-classA Natural Language Processing course taught by Professor Ghassemi
Stars: ✭ 95 (-92.21%)
Speechpy💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
Stars: ✭ 833 (-31.67%)
Alan Sdk WebAlan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.
Stars: ✭ 368 (-69.81%)
TASNETTime-domain Audio Separation Network (IN PYTORCH)
Stars: ✭ 18 (-98.52%)
Tts🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Stars: ✭ 305 (-74.98%)
Voice2MeshCVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
Stars: ✭ 67 (-94.5%)
Android-TTS-STTOne line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem
Stars: ✭ 77 (-93.68%)
UnityASRAutomatic Speech Recognition in Unity.
Stars: ✭ 14 (-98.85%)
Nlp Paper自然语言处理领域下的对话语音领域,整理相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)
Stars: ✭ 67 (-94.5%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+271.86%)
Speech-RecognitionEnd-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-98.28%)
Android Speech RecognitionContinuous speech recognition library for Android with options to use GoogleVoiceIme dialog and offline mode.
Stars: ✭ 72 (-94.09%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (-55.54%)
Voice BuilderAn opensource text-to-speech (TTS) voice building tool
Stars: ✭ 362 (-70.3%)
NLP ToolkitLibrary of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (-92.45%)
audio noise clusteringhttps://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.
Stars: ✭ 24 (-98.03%)
Recording-BotA bot built to record and transcribe audio fragments from Discord.
Stars: ✭ 22 (-98.2%)
benchmarksttOpen Source AI Benchmarking toolkit for benchmarking speech to text services
Stars: ✭ 43 (-96.47%)
Dialectid e2eEnd to End Dialect Identification using Convolutional Neural Network
Stars: ✭ 40 (-96.72%)
EspressoEspresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (-33.72%)
web-speech-demoLearn how to build a simple text-to-speech voice app for the web using the Web Speech API.
Stars: ✭ 19 (-98.44%)
awesome-keyword-spottingThis repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (-87.69%)
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (-70.96%)
soxanWav2Vec for speech recognition, classification, and audio classification
Stars: ✭ 113 (-90.73%)
Parrots Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese.
Stars: ✭ 48 (-96.06%)
Tensorflow-Keyword-SpottingKeyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.
Stars: ✭ 27 (-97.79%)
InaspeechsegmenterCNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.
Stars: ✭ 352 (-71.12%)
scriptionAn editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech
Stars: ✭ 46 (-96.23%)
gtranscribeSoftware for interview transcription
Stars: ✭ 12 (-99.02%)