AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (+284.62%)
TacotronA TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Stars: ✭ 1,756 (+13407.69%)
K6neleAn Android app that offers speech-to-text services and user interfaces to other apps
Stars: ✭ 196 (+1407.69%)
hf-experimentsExperiments with Hugging Face 🔬 🤗
Stars: ✭ 37 (+184.62%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+28246.15%)
StlThe ITU-T Software Tool Library (G.191)
Stars: ✭ 44 (+238.46%)
obviA Polymer 3+ webcomponent / button for doing speech recognition
Stars: ✭ 54 (+315.38%)
leon🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+65746.15%)
Dialectid e2eEnd to End Dialect Identification using Convolutional Neural Network
Stars: ✭ 40 (+207.69%)
Dictate.jsA small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Stars: ✭ 195 (+1400%)
WavegradA fast, high-quality neural vocoder.
Stars: ✭ 138 (+961.54%)
WsayWindows "say"
Stars: ✭ 36 (+176.92%)
TF-Speech-Recognition-Challenge-SolutionSource code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.
Stars: ✭ 58 (+346.15%)
AthenaA free and open source replacement for Google Assistant on Android devices, meant to integrate with the Sapphire Framework. It contains both speech-to-text and text-to-speech services. It does not require Google services or network connectivity
Stars: ✭ 73 (+461.54%)
DiffwaveDiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (+969.23%)
Voice Overlay Android🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+1353.85%)
PraatPraat: Doing Phonetics By Computer
Stars: ✭ 675 (+5092.31%)
IMS-ToucanText-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+2169.23%)
dataflow-contact-center-speech-analysisSpeech Analysis Framework, a collection of components and code from Google Cloud that you can use to transcribe audio files to create analytics.
Stars: ✭ 46 (+253.85%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+4769.23%)
frogFrog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Stars: ✭ 70 (+438.46%)
ArabicProcessingCogA Python package that do stemming, tokenization, sentence breaking, segmentation, normalization, POS tagging for Arabic language.
Stars: ✭ 19 (+46.15%)
Tensorflow Speech Recognition🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Stars: ✭ 2,118 (+16192.31%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (+61.54%)
AllosaurusAllosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (+938.46%)
Speaker adapted ttsMaking a TTS model with 1 minute of speech samples within 10 minutes
Stars: ✭ 183 (+1307.69%)
sembei🍘 単語分割を経由しない単語埋め込み 🍘
Stars: ✭ 14 (+7.69%)
SentimentAnalysisSentiment Analysis: Deep Bi-LSTM+attention model
Stars: ✭ 32 (+146.15%)
audio noise clusteringhttps://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.
Stars: ✭ 24 (+84.62%)
datastories-semeval2017-task6Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".
Stars: ✭ 20 (+53.85%)
idear🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (+546.15%)
Kaldi OnnxKaldi model converter to ONNX
Stars: ✭ 174 (+1238.46%)
CboardAAC communication system with text-to-speech for the browser
Stars: ✭ 437 (+3261.54%)
KaldiioA pure python module for reading and writing kaldi ark files
Stars: ✭ 160 (+1130.77%)
datalinguistStanford CoreNLP in idiomatic Clojure.
Stars: ✭ 93 (+615.38%)
Py Kaldi AsrSome simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Stars: ✭ 156 (+1100%)
Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+3038.46%)
browser-apis🦄 Cool & Fun Browser Web APIs 🥳
Stars: ✭ 21 (+61.54%)
VoskVOSK Speech Recognition Toolkit
Stars: ✭ 182 (+1300%)
Elpis🙊 WIP software for creating speech recognition models.
Stars: ✭ 101 (+676.92%)
Factorized TdnnPyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Stars: ✭ 98 (+653.85%)
ShifterPitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (+69.23%)
Voice BuilderAn opensource text-to-speech (TTS) voice building tool
Stars: ✭ 362 (+2684.62%)
torchainWIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
Stars: ✭ 20 (+53.85%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+6369.23%)
CVCCVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)
Stars: ✭ 45 (+246.15%)
2018-dlslUPC Deep Learning for Speech and Language 2018
Stars: ✭ 18 (+38.46%)
NBSSThe official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".
Stars: ✭ 77 (+492.31%)
Deepspeech ServerA testing server for a speech to text service based on mozilla deepspeech
Stars: ✭ 176 (+1253.85%)