kosrKorean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (-99.45%)
Vosk ApiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (-70.06%)
KaldiioA pure python module for reading and writing kaldi ark files
Stars: ✭ 160 (-96.47%)
Openseq2seqToolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (-69.6%)
kospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (-89.94%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-99.23%)
Kaldi Active GrammarPython Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (-95.68%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (-95.06%)
NLP ToolkitLibrary of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (-97.97%)
Speech-RecognitionEnd-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-99.54%)
TinyCogSmall Robot, Toy Robot platform
Stars: ✭ 29 (-99.36%)
Speech aiSimple speech linguistic AI with Python
Stars: ✭ 66 (-98.54%)
Vosk Android DemoOffline speech recognition for Android with Vosk library.
Stars: ✭ 271 (-94.02%)
Tacotron PytorchA Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
Stars: ✭ 104 (-97.71%)
Zamia SpeechOpen tools and data for cloudless automatic speech recognition
Stars: ✭ 374 (-91.75%)
Wav2letterFacebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 5,907 (+30.31%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (-88.04%)
Deep Learning DrizzleDrench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Stars: ✭ 9,717 (+114.36%)
E2e AsrPyTorch Implementations for End-to-End Automatic Speech Recognition
Stars: ✭ 106 (-97.66%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-99.6%)
ppg-vcPPG-Based Voice Conversion
Stars: ✭ 154 (-96.6%)
speech separationConstrained Permutation Invariant Training, Speech Separation
Stars: ✭ 27 (-99.4%)
Wire Ios📱 Wire for iOS (iPhone and iPad)
Stars: ✭ 3,079 (-32.08%)
React Transcript EditorA React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
Stars: ✭ 285 (-93.71%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-97.29%)
Voice-Denoising-ANA Conditional Generative Adverserial Network (cGAN) was adapted for the task of source de-noising of noisy voice auditory images. The base architecture is adapted from Pix2Pix.
Stars: ✭ 42 (-99.07%)
Alan Sdk AndroidAlan AI Android SDK adds a voice assistant or chatbot to your app. Supports Java, Kotlin.
Stars: ✭ 278 (-93.87%)
Ajax-ChatAjax Chat is a complete web chat in javascript, ajax, php and mysql compatible with Phonegap
Stars: ✭ 19 (-99.58%)
EspeakeSpeak NG is an open source speech synthesizer that supports 101 languages and accents.
Stars: ✭ 339 (-92.52%)
Gp GanOfficial Chainer implementation of GP-GAN: Towards Realistic High-Resolution Image Blending (ACMMM 2019, oral)
Stars: ✭ 317 (-93.01%)
nepali-translatorNeural Machine Translation on the Nepali-English language pair
Stars: ✭ 29 (-99.36%)
Recording-BotA bot built to record and transcribe audio fragments from Discord.
Stars: ✭ 22 (-99.51%)
StageMateStageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.
Stars: ✭ 60 (-98.68%)
ParakeetPAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)
Stars: ✭ 279 (-93.85%)
Alan Sdk FlutterAlan AI Flutter SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 309 (-93.18%)
Neuraldialog CvaeTensorflow Implementation of Knowledge-Guided CVAE for dialog generation ACL 2017. It is released by Tiancheng Zhao (Tony) from Dialog Research Center, LTI, CMU
Stars: ✭ 279 (-93.85%)
dropclass speakerDropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Stars: ✭ 20 (-99.56%)
edittsOfficial implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: ✭ 74 (-98.37%)
Multi-Hotword SpottingWon't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?
Stars: ✭ 31 (-99.32%)
download audioset📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (-98.83%)
Multilingual text to speechAn implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Stars: ✭ 324 (-92.85%)
Cognitive Speech TtsMicrosoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
Stars: ✭ 312 (-93.12%)
TransformerA Pytorch Implementation of "Attention is All You Need" and "Weighted Transformer Network for Machine Translation"
Stars: ✭ 271 (-94.02%)
musicologistMusic advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-99.58%)
ocaml-otrOff-the-record (OTR) messaging protocol, purely in OCaml
Stars: ✭ 39 (-99.14%)
ZhihuThis repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolution GAN and other actual combat code.
Stars: ✭ 3,307 (-27.05%)
Alan Sdk CordovaAlan AI Cordova SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 269 (-94.07%)