All Projects → kaldi_helpers → Similar Projects or Alternatives

388 Open source projects that are alternatives of or similar to kaldi_helpers

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (+284.62%)

Mutual labels: speech-to-text

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (+307.69%)

Mutual labels: speech-to-text

Tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Stars: ✭ 1,756 (+13407.69%)

Mutual labels: speech

K6nele

An Android app that offers speech-to-text services and user interfaces to other apps

Stars: ✭ 196 (+1407.69%)

Mutual labels: speech-to-text

hf-experiments

Experiments with Hugging Face 🔬 🤗

Stars: ✭ 37 (+184.62%)

Mutual labels: automatic-speech-recognition

Nemo

NeMo: a toolkit for conversational AI

Stars: ✭ 3,685 (+28246.15%)

Mutual labels: speech-to-text

Stl

The ITU-T Software Tool Library (G.191)

Stars: ✭ 44 (+238.46%)

Mutual labels: speech

obvi

A Polymer 3+ webcomponent / button for doing speech recognition

Stars: ✭ 54 (+315.38%)

Mutual labels: automatic-speech-recognition

leon

🧠 Leon is your open-source personal assistant.

Stars: ✭ 8,560 (+65746.15%)

Mutual labels: speech-to-text

Dialectid e2e

End to End Dialect Identification using Convolutional Neural Network

Stars: ✭ 40 (+207.69%)

Mutual labels: speech

Dictate.js

A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

Stars: ✭ 195 (+1400%)

Mutual labels: speech-to-text

Wavegrad

A fast, high-quality neural vocoder.

Stars: ✭ 138 (+961.54%)

Mutual labels: speech

Expressive tacotron

Tensorflow Implementation of Expressive Tacotron

Stars: ✭ 192 (+1376.92%)

Mutual labels: speech-to-text

Wsay

Windows "say"

Stars: ✭ 36 (+176.92%)

Mutual labels: speech

TF-Speech-Recognition-Challenge-Solution

Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.

Stars: ✭ 58 (+346.15%)

Mutual labels: speech

Athena

A free and open source replacement for Google Assistant on Android devices, meant to integrate with the Sapphire Framework. It contains both speech-to-text and text-to-speech services. It does not require Google services or network connectivity

Stars: ✭ 73 (+461.54%)

Mutual labels: speech-to-text

Automatic Speech Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

Stars: ✭ 192 (+1376.92%)

Mutual labels: speech-to-text

eidos-audition

Collection of auditory models.

Stars: ✭ 25 (+92.31%)

Mutual labels: speech

Diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Stars: ✭ 139 (+969.23%)

Mutual labels: speech

Voice Overlay Android

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 189 (+1353.85%)

Mutual labels: speech-to-text

Praat

Praat: Doing Phonetics By Computer

Stars: ✭ 675 (+5092.31%)

Mutual labels: speech

digital-paper-edit-client

Work in progress - BBC News Labs digital paper edit project - React Client

Stars: ✭ 36 (+176.92%)

Mutual labels: speech-to-text

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+2169.23%)

Mutual labels: speech

dataflow-contact-center-speech-analysis

Speech Analysis Framework, a collection of components and code from Google Cloud that you can use to transcribe audio files to create analytics.

Stars: ✭ 46 (+253.85%)

Mutual labels: speech-to-text

Speech Emotion Analyzer

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

Stars: ✭ 633 (+4769.23%)

Mutual labels: speech

revai-node-sdk

Node.js SDK for the Rev AI API

Stars: ✭ 21 (+61.54%)

Mutual labels: speech-to-text

frog

Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.

Stars: ✭ 70 (+438.46%)

Mutual labels: computational-linguistics

ArabicProcessingCog

A Python package that do stemming, tokenization, sentence breaking, segmentation, normalization, POS tagging for Arabic language.

Stars: ✭ 19 (+46.15%)

Mutual labels: computational-linguistics

Tensorflow Speech Recognition

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

Stars: ✭ 2,118 (+16192.31%)

Mutual labels: speech-to-text

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (+61.54%)

Mutual labels: speech-to-text

Allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Stars: ✭ 135 (+938.46%)

Mutual labels: speech

Speaker adapted tts

Making a TTS model with 1 minute of speech samples within 10 minutes

Stars: ✭ 183 (+1307.69%)

Mutual labels: speech-to-text

sembei

🍘 単語分割を経由しない単語埋め込み 🍘

Stars: ✭ 14 (+7.69%)

Mutual labels: computational-linguistics

Speech Denoising Wavenet

A neural network for end-to-end speech denoising

Stars: ✭ 516 (+3869.23%)

Mutual labels: speech

SentimentAnalysis

Sentiment Analysis: Deep Bi-LSTM+attention model

Stars: ✭ 32 (+146.15%)

Mutual labels: computational-linguistics

audio noise clustering

https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.

Stars: ✭ 24 (+84.62%)

Mutual labels: speech

sentiment-analysis-of-tweets-in-russian

Sentiment analysis of tweets in Russian using Convolutional Neural Networks (CNN) with Word2Vec embeddings.

Stars: ✭ 51 (+292.31%)

Mutual labels: computational-linguistics

datastories-semeval2017-task6

Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".

Stars: ✭ 20 (+53.85%)

Mutual labels: computational-linguistics

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (+546.15%)

Mutual labels: speech

Kaldi Onnx

Kaldi model converter to ONNX

Stars: ✭ 174 (+1238.46%)

Mutual labels: kaldi

Cboard

AAC communication system with text-to-speech for the browser

Stars: ✭ 437 (+3261.54%)

Mutual labels: speech

Kaldiio

A pure python module for reading and writing kaldi ark files

Stars: ✭ 160 (+1130.77%)

Mutual labels: kaldi

datalinguist

Stanford CoreNLP in idiomatic Clojure.

Stars: ✭ 93 (+615.38%)

Mutual labels: computational-linguistics

Py Kaldi Asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Stars: ✭ 156 (+1100%)

Mutual labels: kaldi

Neural sp

End-to-end ASR/LM implementation with PyTorch

Stars: ✭ 408 (+3038.46%)

Mutual labels: speech

browser-apis

🦄 Cool & Fun Browser Web APIs 🥳

Stars: ✭ 21 (+61.54%)

Mutual labels: speech

Vosk

VOSK Speech Recognition Toolkit

Stars: ✭ 182 (+1300%)

Mutual labels: speech-to-text

Elpis

🙊 WIP software for creating speech recognition models.

Stars: ✭ 101 (+676.92%)

Mutual labels: kaldi

Voice Converter Cyclegan

Voice Converter Using CycleGAN and Non-Parallel Data

Stars: ✭ 384 (+2853.85%)

Mutual labels: speech

Factorized Tdnn

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Stars: ✭ 98 (+653.85%)

Mutual labels: kaldi

Shifter

Pitch shifter using WSOLA and resampling implemented by Python3

Stars: ✭ 22 (+69.23%)

Mutual labels: speech

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+2684.62%)

Mutual labels: speech

srvk-eesen-offline-transcriber

Top level code to transcribe English audio/video files into text/subtitles

Stars: ✭ 22 (+69.23%)

Mutual labels: kaldi

torchain

WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)

Stars: ✭ 20 (+53.85%)

Mutual labels: kaldi

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+6369.23%)

Mutual labels: speech-to-text

CVC

CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)

Stars: ✭ 45 (+246.15%)

Mutual labels: speech

2018-dlsl

UPC Deep Learning for Speech and Language 2018

Stars: ✭ 18 (+38.46%)

Mutual labels: automatic-speech-recognition

NBSS

The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".

Stars: ✭ 77 (+492.31%)

Mutual labels: speech

Voice activity detection

Voice Activity Detection based on Deep Learning & TensorFlow

Stars: ✭ 132 (+915.38%)

Mutual labels: speech

Deepspeech Server

A testing server for a speech to text service based on mozilla deepspeech

Stars: ✭ 176 (+1253.85%)

Mutual labels: speech-to-text

301-360 of 388 similar projects

first

‹

›