ventib📈 Ventib records your voice, transcribes it in realtime, and performs speech pattern analysis to give you objective statistics about how you speak.
Stars: ✭ 43 (-68.15%)
Alan Sdk CordovaAlan AI Cordova SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 269 (+99.26%)
Cortex M KwsCortex M KWS example with Tengine Lite.
Stars: ✭ 45 (-66.67%)
txt2speechConvert text to speech using Google Translate API
Stars: ✭ 38 (-71.85%)
good-speech-web-clientPractice your speech level in any language using speech recognition
Stars: ✭ 26 (-80.74%)
WikipronMassively multilingual pronunciation mining
Stars: ✭ 99 (-26.67%)
Amazing Python Scripts🚀 Curated collection of Amazing Python scripts from Basics to Advance with automation task scripts.
Stars: ✭ 229 (+69.63%)
KARENKAREN: Unifying Hatespeech Detection and Benchmarking
Stars: ✭ 18 (-86.67%)
SeganSpeech Enhancement Generative Adversarial Network in TensorFlow
Stars: ✭ 661 (+389.63%)
Formant AnalyzeriOS application for finding formants in spoken sounds
Stars: ✭ 43 (-68.15%)
Multimodal-Gesture-Recognition-with-LSTMs-and-CTCAn end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.
Stars: ✭ 25 (-81.48%)
Listen-Attend-Spell-v2PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
Stars: ✭ 29 (-78.52%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (-73.33%)
UHV-OTS-SpeechA data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (-30.37%)
hifigan-denoiserHiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-34.81%)
Transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Stars: ✭ 55,742 (+41190.37%)
Masr中文语音识别; Mandarin Automatic Speech Recognition;
Stars: ✭ 1,246 (+822.96%)
Speech recognitionSpeech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+4343.7%)
Dc ttsA TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: ✭ 1,017 (+653.33%)
browser-apis🦄 Cool & Fun Browser Web APIs 🥳
Stars: ✭ 21 (-84.44%)
minutes🔭 Speaker diarization via transfer learning
Stars: ✭ 25 (-81.48%)
Cn2an📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
Stars: ✭ 249 (+84.44%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (-83.7%)
Dialectid e2eEnd to End Dialect Identification using Convolutional Neural Network
Stars: ✭ 40 (-70.37%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+2629.63%)
flite-goGo bindings for Flite (festival-lite)
Stars: ✭ 14 (-89.63%)
Kaldi Active GrammarPython Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (+45.19%)
quran-alignWord-accurate timestamps for Qur'anic audio.
Stars: ✭ 139 (+2.96%)
opensnipsOpen source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (-62.96%)
Voice🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)
Stars: ✭ 993 (+635.56%)
StageMateStageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.
Stars: ✭ 60 (-55.56%)
B.e.n.j.i.B.E.N.J.I.- The Impossible Missions Force's digital assistant
Stars: ✭ 83 (-38.52%)
Libreasr💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (+368.89%)
pocketsphinxUpdated ROS bindings to pocketsphinx
Stars: ✭ 36 (-73.33%)
KtspeechcrawlerAutomatically constructing corpus for automatic speech recognition from YouTube videos
Stars: ✭ 92 (-31.85%)
Voice Overlay Android🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+40%)
tt-vae-ganTimbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.
Stars: ✭ 37 (-72.59%)
WsayWindows "say"
Stars: ✭ 36 (-73.33%)
edittsOfficial implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: ✭ 74 (-45.19%)
Lip Reading Deeplearning🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
Stars: ✭ 1,641 (+1115.56%)
Kaldi OnnxKaldi model converter to ONNX
Stars: ✭ 174 (+28.89%)
download audioset📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (-60.74%)
LightspeechLightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-77.04%)
nlp-classA Natural Language Processing course taught by Professor Ghassemi
Stars: ✭ 95 (-29.63%)
TASNETTime-domain Audio Separation Network (IN PYTORCH)
Stars: ✭ 18 (-86.67%)
Avpian open source voice command macro software
Stars: ✭ 130 (-3.7%)
Alan Sdk PcfAlan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.
Stars: ✭ 128 (-5.19%)
vosk-asteriskSpeech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-61.48%)
DeepspeechrecognitionA Chinese Deep Speech Recognition System 包括基于深度学习的声学模型和基于深度学习的语言模型
Stars: ✭ 1,421 (+952.59%)
Laibot Client开源人工智能,基于开源软硬件构建语音对话机器人、智能音箱……人机对话、自然交互,来宝拥有无限可能。特别说明,来宝运行于Python 3!
Stars: ✭ 81 (-40%)
WenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (+357.04%)