All Projects → UniSpeech → Similar Projects or Alternatives

528 Open source projects that are alternatives of or similar to UniSpeech

Speechtotext Websockets Javascript
SDK & Sample to do speech recognition using websockets in Javascript
Stars: ✭ 191 (-14.73%)
Mutual labels:  speech, speech-recognition
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (-8.48%)
Mutual labels:  speech, speech-recognition
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (-60.27%)
Mutual labels:  speech, speech-recognition
Annyang
💬 Speech recognition for your site
Stars: ✭ 6,216 (+2675%)
Mutual labels:  speech, speech-recognition
Pncc
A implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-82.14%)
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+118.75%)
Mutual labels:  speech, speech-recognition
Julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Stars: ✭ 1,258 (+461.61%)
Mutual labels:  speech, speech-recognition
bob
Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob
Stars: ✭ 38 (-83.04%)
ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (-20.09%)
Mutual labels:  speech, speech-recognition
Pytorch Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+836.16%)
Mutual labels:  speech, speech-recognition
Allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (-39.73%)
Mutual labels:  speech, speech-recognition
Edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (-8.48%)
Mutual labels:  speech, speech-recognition
QuantumSpeech-QCNN
IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (-68.3%)
TF-Speech-Recognition-Challenge-Solution
Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.
Stars: ✭ 58 (-74.11%)
Mutual labels:  speech, speech-recognition
UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (-58.04%)
Awesome Speech Recognition Speech Synthesis Papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Stars: ✭ 2,085 (+830.8%)
Huawei-Challenge-Speaker-Identification
Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.
Stars: ✭ 34 (-84.82%)
Neural sp
End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+82.14%)
Mutual labels:  speech, speech-recognition
React Native Dialogflow
A React-Native Bridge for the Google Dialogflow (API.AI) SDK
Stars: ✭ 182 (-18.75%)
Mutual labels:  speech, speech-processing
anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (-76.34%)
Mutual labels:  speech, speech-recognition
torchsubband
Pytorch implementation of subband decomposition
Stars: ✭ 63 (-71.87%)
awesome-speech-enhancement
A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
Stars: ✭ 48 (-78.57%)
Phomeme
Simple sentence mixing tool (work in progress)
Stars: ✭ 18 (-91.96%)
Mutual labels:  speech
react-client
An React client library for Speechly API
Stars: ✭ 71 (-68.3%)
Mutual labels:  speech-recognition
VoiceBridge
VoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit
Stars: ✭ 17 (-92.41%)
Mutual labels:  speech-recognition
VAD-LTSD
Efficient voice activity detection algorithm using long-term speech information
Stars: ✭ 37 (-83.48%)
Mutual labels:  speech
srvk-eesen-offline-transcriber
Top level code to transcribe English audio/video files into text/subtitles
Stars: ✭ 22 (-90.18%)
Mutual labels:  speech-recognition
PCPM
Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-90.62%)
Mutual labels:  speech-recognition
Inimesed
An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (-70.98%)
Mutual labels:  speech-recognition
kaldi-long-audio-alignment
Long audio alignment using Kaldi
Stars: ✭ 21 (-90.62%)
Mutual labels:  speech-recognition
ml-with-audio
HF's ML for Audio study group
Stars: ✭ 104 (-53.57%)
Mutual labels:  speech-recognition
DeepSpeech-API
The code enables users to use Mozilla's Deep Speech model over the Web Browser.
Stars: ✭ 31 (-86.16%)
Mutual labels:  speech-recognition
gtranscribe
Software for interview transcription
Stars: ✭ 12 (-94.64%)
Mutual labels:  speech
speaker-recognition-papers
Share some recent speaker recognition papers and their implementations.
Stars: ✭ 92 (-58.93%)
Mutual labels:  speaker-verification
speaker extraction
target speaker extraction and verification for multi-talker speech
Stars: ✭ 85 (-62.05%)
Mutual labels:  speaker-verification
End-to-End-Mandarin-ASR
End-to-end speech recognition on AISHELL dataset.
Stars: ✭ 20 (-91.07%)
Mutual labels:  speech-recognition
AmazonSpeechTranslator
End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-77.68%)
Mutual labels:  speech-recognition
speechless
Speech-to-text based on wav2letter built for transfer learning
Stars: ✭ 92 (-58.93%)
Mutual labels:  speech-recognition
api
Speechly public API definitions and generated code
Stars: ✭ 15 (-93.3%)
Mutual labels:  speech-recognition
2018-dlsl
UPC Deep Learning for Speech and Language 2018
Stars: ✭ 18 (-91.96%)
Mutual labels:  speech-recognition
A chronology of deep learning
Tracing back and exposing in chronological order the main ideas in the field of deep learning, to help everyone better understand the current intense research in AI.
Stars: ✭ 47 (-79.02%)
Mutual labels:  speech-recognition
Unity live caption
Use Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Stars: ✭ 26 (-88.39%)
Mutual labels:  speech-recognition
rnnt decoder cuda
An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (-73.21%)
Mutual labels:  speech-recognition
mongolian-nlp
Useful resources for Mongolian NLP
Stars: ✭ 119 (-46.87%)
Mutual labels:  speech-recognition
salutejs
SmartApp Framework для создания навыков семейства Виртуальных Ассистентов "Салют" на языке JavaScript
Stars: ✭ 35 (-84.37%)
Mutual labels:  speech-recognition
Android-TTS-STT
One line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem
Stars: ✭ 77 (-65.62%)
Mutual labels:  speech-recognition
formulas-python
Ritchie CLI formulas in Python 🐍
Stars: ✭ 17 (-92.41%)
Mutual labels:  speech-recognition
Speech Feature Extraction
Feature extraction of speech signal is the initial stage of any speech recognition system.
Stars: ✭ 78 (-65.18%)
Mutual labels:  speech
BookLibrary
Book Library of P&W Studio
Stars: ✭ 13 (-94.2%)
Mutual labels:  speech-processing
melgan
MelGAN implementation with Multi-Band and Full Band supports...
Stars: ✭ 54 (-75.89%)
Mutual labels:  speech
houndify-sdk-go
The official Houndify SDK for Go
Stars: ✭ 23 (-89.73%)
Mutual labels:  speech-recognition
Zero-Shot-TTS
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Stars: ✭ 33 (-85.27%)
Mutual labels:  speech
CNN-VAD
A Convolutional Neural Network based Voice Activity Detector for Smartphones
Stars: ✭ 60 (-73.21%)
Mutual labels:  speech-processing
telltime
iOS application to tell the time in the British way 🇬🇧⏰
Stars: ✭ 49 (-78.12%)
Mutual labels:  speech-recognition
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+6091.96%)
Mutual labels:  speech
StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
Stars: ✭ 161 (-28.12%)
Mutual labels:  speech
hf-experiments
Experiments with Hugging Face 🔬 🤗
Stars: ✭ 37 (-83.48%)
Mutual labels:  speech-recognition
Khronos
The open source intelligent personal assistant
Stars: ✭ 25 (-88.84%)
Mutual labels:  speech-recognition
web-speech-cognitive-services
Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-84.37%)
Mutual labels:  speech-recognition
ctc-asr
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (-50%)
Mutual labels:  speech-recognition
61-120 of 528 similar projects