
202 Open source projects that are alternatives of or similar to gtranscribe

AdaSpeech
AdaSpeech: Adaptive Text to Speech for Custom Voice
Stars: ✭ 108 (+800%)
Mutual labels:  speech
Speech Denoiser
A speech denoising LV2 plugin based on the RNNoise library
Stars: ✭ 220 (+1733.33%)
Mutual labels:  speech
FAST-RIR
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
Stars: ✭ 90 (+650%)
Mutual labels:  speech
Tts Cube
End-2-end speech synthesis with recurrent neural networks
Stars: ✭ 213 (+1675%)
Mutual labels:  speech
deepspeech.mxnet
An MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+583.33%)
Mutual labels:  speech
Edgedict
Working online speech recognition based on RNN Transducer. (Trained model available in the releases.)
Stars: ✭ 205 (+1608.33%)
Mutual labels:  speech
eidos-audition
Collection of auditory models.
Stars: ✭ 25 (+108.33%)
Mutual labels:  speech
Esp8266sam
Speech synthesis for ESP8266 using S.A.M. port
Stars: ✭ 199 (+1558.33%)
Mutual labels:  speech
MajorDomo-Scenarios
Scenarios for the Majordomo home automation system
Stars: ✭ 12 (+0%)
Mutual labels:  speech
Speechtotext Websockets Javascript
SDK & Sample to do speech recognition using websockets in Javascript
Stars: ✭ 191 (+1491.67%)
Mutual labels:  speech
cape
Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch
Stars: ✭ 29 (+141.67%)
Mutual labels:  speech
Depression Detect
Predicting depression from acoustic features of speech using a Convolutional Neural Network.
Stars: ✭ 187 (+1458.33%)
Mutual labels:  speech
DeepSegmentor
Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)
Stars: ✭ 17 (+41.67%)
Mutual labels:  speech
React Native Dialogflow
A React-Native Bridge for the Google Dialogflow (API.AI) SDK
Stars: ✭ 182 (+1416.67%)
Mutual labels:  speech
icassp2019-latex-template
ICASSP 2019 official LaTeX template
Stars: ✭ 21 (+75%)
Mutual labels:  speech
End2end Asr Pytorch
End-to-End Automatic Speech Recognition on PyTorch
Stars: ✭ 175 (+1358.33%)
Mutual labels:  speech
glaemscribe
Glaemscribe, the tolkienian languages/writings transcription engine.
Stars: ✭ 29 (+141.67%)
Mutual labels:  transcription
Chatbot Watson Android
An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.
Stars: ✭ 169 (+1308.33%)
Mutual labels:  speech
asr24
24-hour Automatic Speech Recognition
Stars: ✭ 27 (+125%)
Mutual labels:  transcription
Tacotron asr
Speech Recognition Using Tacotron
Stars: ✭ 165 (+1275%)
Mutual labels:  speech
Tacotron pytorch
PyTorch implementation of Tacotron speech synthesis model.
Stars: ✭ 242 (+1916.67%)
Mutual labels:  speech
Aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Stars: ✭ 1,942 (+16083.33%)
Mutual labels:  speech
pytorch-pcen
PyTorch reimplementation of per-channel energy normalization for audio.
Stars: ✭ 80 (+566.67%)
Mutual labels:  speech
Tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Stars: ✭ 1,756 (+14533.33%)
Mutual labels:  speech
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (+16.67%)
Mutual labels:  speech
Diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (+1058.33%)
Mutual labels:  speech
txt2speech
Convert text to speech using Google Translate API
Stars: ✭ 38 (+216.67%)
Mutual labels:  speech
Voice activity detection
Voice Activity Detection based on Deep Learning & TensorFlow
Stars: ✭ 132 (+1000%)
Mutual labels:  speech
linear16
Converts an audio file to LINEAR16 Google-speech compatible file.
Stars: ✭ 14 (+16.67%)
Mutual labels:  speech
Voc
A physical model of the human vocal tract using literate programming, based on Pink Trombone.
Stars: ✭ 129 (+975%)
Mutual labels:  speech
anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (+341.67%)
Mutual labels:  speech
Reconstructing faces from voices
An example of the paper "reconstructing faces from voices"
Stars: ✭ 127 (+958.33%)
Mutual labels:  speech
StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
Stars: ✭ 161 (+1241.67%)
Mutual labels:  speech
Pytorch Asr
ASR with PyTorch
Stars: ✭ 124 (+933.33%)
Mutual labels:  speech
Multimodal-Gesture-Recognition-with-LSTMs-and-CTC
An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from a continuous stream.
Stars: ✭ 25 (+108.33%)
Mutual labels:  speech
Tts
Text-to-Speech for Arduino
Stars: ✭ 118 (+883.33%)
Mutual labels:  speech
speech-to-text
Python helper for Google and IBM Watson speech-to-text cloud APIs.
Stars: ✭ 14 (+16.67%)
Mutual labels:  transcription
Tfg Voice Conversion
Deep Learning-based Voice Conversion system
Stars: ✭ 115 (+858.33%)
Mutual labels:  speech
react-native-speech-bubble
💬 A speech bubble dialog component for React Native.
Stars: ✭ 50 (+316.67%)
Mutual labels:  speech
Durian
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (+825%)
Mutual labels:  speech
audio noise clustering
An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording. Results: https://dodiku.github.io/audio_noise_clustering/results/
Stars: ✭ 24 (+100%)
Mutual labels:  speech
Delta
DELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+12225%)
Mutual labels:  speech
idear
🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (+600%)
Mutual labels:  speech
Wikipron
Massively multilingual pronunciation mining
Stars: ✭ 99 (+725%)
Mutual labels:  speech
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+115483.33%)
Mutual labels:  speech
Wavenet Enhancement
Speech Enhancement using Bayesian WaveNet
Stars: ✭ 86 (+616.67%)
Mutual labels:  speech
browser-apis
🦄 Cool & Fun Browser Web APIs 🥳
Stars: ✭ 21 (+75%)
Mutual labels:  speech
Julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Stars: ✭ 1,258 (+10383.33%)
Mutual labels:  speech
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+641.67%)
Mutual labels:  speech
Deepspeech
A PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+10058.33%)
Mutual labels:  speech
Voice Gender
Gender recognition by voice and speech analysis
Stars: ✭ 248 (+1966.67%)
Mutual labels:  speech
Nlp Paper
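Dialogue and speech within natural language processing: curated papers (with reading notes), model reproductions, data processing, and more (code provided in both TensorFlow and PyTorch versions)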
Stars: ✭ 67 (+458.33%)
Mutual labels:  speech
data-at-hand-mobile
Mobile application for exploring fitness data using both speech and touch interaction.
Stars: ✭ 50 (+316.67%)
Mutual labels:  speech
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+1916.67%)
Mutual labels:  speech
Speech Feature Extraction
Feature extraction of speech signal is the initial stage of any speech recognition system.
Stars: ✭ 78 (+550%)
Mutual labels:  speech
speech-transformer
Transformer implementation specialized in speech recognition tasks using PyTorch.
Stars: ✭ 40 (+233.33%)
Mutual labels:  speech
JD-NMF
Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)
Stars: ✭ 20 (+66.67%)
Mutual labels:  speech
voice-based-email-for-blind
Emailing System for visually impaired persons
Stars: ✭ 35 (+191.67%)
Mutual labels:  speech
room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
Stars: ✭ 143 (+1091.67%)
Mutual labels:  speech
Lhotse
Stars: ✭ 236 (+1866.67%)
Mutual labels:  speech