All Projects → gtranscribe → Similar Projects or Alternatives

202 Open source projects that are alternatives of or similar to gtranscribe

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Stars: ✭ 108 (+800%)

Mutual labels: speech

Speech Denoiser

A speech denoise lv2 plugin based on RNNoise library

Stars: ✭ 220 (+1733.33%)

Mutual labels: speech

FAST-RIR

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

Stars: ✭ 90 (+650%)

Mutual labels: speech

Tts Cube

End-2-end speech synthesis with recurrent neural networks

Stars: ✭ 213 (+1675%)

Mutual labels: speech

deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (+583.33%)

Mutual labels: speech

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (+1608.33%)

Mutual labels: speech

eidos-audition

Collection of auditory models.

Stars: ✭ 25 (+108.33%)

Mutual labels: speech

Esp8266sam

Speech synthesis for ESP8266 using S.A.M. port

Stars: ✭ 199 (+1558.33%)

Mutual labels: speech

MajorDomo-Scenarios

Сценарии для системы домашней автоматизации Majordomo

Stars: ✭ 12 (+0%)

Mutual labels: speech

Speechtotext Websockets Javascript

SDK & Sample to do speech recognition using websockets in Javascript

Stars: ✭ 191 (+1491.67%)

Mutual labels: speech

cape

Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch

Stars: ✭ 29 (+141.67%)

Mutual labels: speech

Depression Detect

Predicting depression from acoustic features of speech using a Convolutional Neural Network.

Stars: ✭ 187 (+1458.33%)

Mutual labels: speech

DeepSegmentor

Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)

Stars: ✭ 17 (+41.67%)

Mutual labels: speech

React Native Dialogflow

A React-Native Bridge for the Google Dialogflow (API.AI) SDK

Stars: ✭ 182 (+1416.67%)

Mutual labels: speech

icassp2019-latex-template

ICASSP 2019 official Latex template

Stars: ✭ 21 (+75%)

Mutual labels: speech

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (+1358.33%)

Mutual labels: speech

glaemscribe

Glaemscribe, the tolkienian languages/writings transcription engine.

Stars: ✭ 29 (+141.67%)

Mutual labels: transcription

Chatbot Watson Android

An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.

Stars: ✭ 169 (+1308.33%)

Mutual labels: speech

asr24

24-hour Automatic Speech Recognition

Stars: ✭ 27 (+125%)

Mutual labels: transcription

Tacotron asr

Speech Recognition Using Tacotron

Stars: ✭ 165 (+1275%)

Mutual labels: speech

Tacotron pytorch

PyTorch implementation of Tacotron speech synthesis model.

Stars: ✭ 242 (+1916.67%)

Mutual labels: speech

Aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Stars: ✭ 1,942 (+16083.33%)

Mutual labels: speech

pytorch-pcen

PyTorch reimplementation of per-channel energy normalization for audio.

Stars: ✭ 80 (+566.67%)

Mutual labels: speech

Tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Stars: ✭ 1,756 (+14533.33%)

Mutual labels: speech

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (+16.67%)

Mutual labels: speech

Diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Stars: ✭ 139 (+1058.33%)

Mutual labels: speech

txt2speech

Convert text to speech using Google Translate API

Stars: ✭ 38 (+216.67%)

Mutual labels: speech

Voice activity detection

Voice Activity Detection based on Deep Learning & TensorFlow

Stars: ✭ 132 (+1000%)

Mutual labels: speech

linear16

Converts an audio file to LINEAR16 Google-speech compatible file.

Stars: ✭ 14 (+16.67%)

Mutual labels: speech

Voc

A physical model of the human vocal tract using literate programming, based on Pink Trombone.

Stars: ✭ 129 (+975%)

Mutual labels: speech

anycontrol

Voice control for your websites and applications

Stars: ✭ 53 (+341.67%)

Mutual labels: speech

Reconstructing faces from voices

An example of the paper "reconstructing faces from voices"

Stars: ✭ 127 (+958.33%)

Mutual labels: speech

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (+1241.67%)

Mutual labels: speech

Pytorch Asr

ASR with PyTorch

Stars: ✭ 124 (+933.33%)

Mutual labels: speech

Multimodal-Gesture-Recognition-with-LSTMs-and-CTC

An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.

Stars: ✭ 25 (+108.33%)

Mutual labels: speech

Tts

Text-to-Speech for Arduino

Stars: ✭ 118 (+883.33%)

Mutual labels: speech

speech-to-text

Python helper for Google and IBM Watson speech-to-text cloud APIs.

Stars: ✭ 14 (+16.67%)

Mutual labels: transcription

Tfg Voice Conversion

Deep Learning-based Voice Conversion system

Stars: ✭ 115 (+858.33%)

Mutual labels: speech

react-native-speech-bubble

💬 A speech bubble dialog component for React Native.

Stars: ✭ 50 (+316.67%)

Mutual labels: speech

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (+825%)

Mutual labels: speech

audio noise clustering

https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.

Stars: ✭ 24 (+100%)

Mutual labels: speech

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+12225%)

Mutual labels: speech

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (+600%)

Mutual labels: speech

Wikipron

Massively multilingual pronunciation mining

Stars: ✭ 99 (+725%)

Mutual labels: speech

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Stars: ✭ 13,870 (+115483.33%)

Mutual labels: speech

Wavenet Enhancement

Speech Enhancement using Bayesian WaveNet

Stars: ✭ 86 (+616.67%)

Mutual labels: speech

browser-apis

🦄 Cool & Fun Browser Web APIs 🥳

Stars: ✭ 21 (+75%)

Mutual labels: speech

Julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

Stars: ✭ 1,258 (+10383.33%)

Mutual labels: speech

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (+641.67%)

Mutual labels: speech

Deepspeech

A PaddlePaddle implementation of ASR.

Stars: ✭ 1,219 (+10058.33%)

Mutual labels: speech

Voice Gender

Gender recognition by voice and speech analysis

Stars: ✭ 248 (+1966.67%)

Mutual labels: speech

Nlp Paper

自然语言处理领域下的对话语音领域，整理相关论文（附阅读笔记），复现模型以及数据处理等（代码含TensorFlow和PyTorch两版本）

Stars: ✭ 67 (+458.33%)

Mutual labels: speech

data-at-hand-mobile

Mobile application for exploring fitness data using both speech and touch interaction.

Stars: ✭ 50 (+316.67%)

Mutual labels: speech

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+1916.67%)

Mutual labels: speech

Speech Feature Extraction

Feature extraction of speech signal is the initial stage of any speech recognition system.

Stars: ✭ 78 (+550%)

Mutual labels: speech

speech-transformer

Transformer implementation speciaized in speech recognition tasks using Pytorch.

Stars: ✭ 40 (+233.33%)

Mutual labels: speech

JD-NMF

Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)