All Projects → UHV-OTS-Speech → Similar Projects or Alternatives

405 Open source projects that are alternatives of or similar to UHV-OTS-Speech

Pytorch implementation of Generalized End-to-End Loss for speaker verification

Stars: ✭ 72 (-23.4%)

Mutual labels: speaker-diarization, speaker-identification

[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.

Stars: ✭ 17 (-81.91%)

Mutual labels: speech-recognition, speech-processing

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stars: ✭ 205 (+118.09%)

Mutual labels: speech-recognition, speech-processing

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (+138.3%)

Mutual labels: speech-recognition, speech-processing

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+157.45%)

Mutual labels: speech-recognition, speech-processing

SincNet is a neural architecture for efficiently processing raw audio samples.

Stars: ✭ 764 (+712.77%)

Mutual labels: speech-recognition, speech-processing

Zzz Retired openstt

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

Stars: ✭ 146 (+55.32%)

Mutual labels: speech-recognition, speech-processing

UPC Deep Learning for Speech and Language 2018

Stars: ✭ 18 (-80.85%)

Mutual labels: speech-recognition, speaker-identification

awesome-keyword-spotting

This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).

Stars: ✭ 150 (+59.57%)

Mutual labels: speech-recognition, speech-processing

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+794.68%)

Mutual labels: speech-recognition, speech-processing

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Stars: ✭ 47 (-50%)

Mutual labels: speech-recognition, speech-processing

A implementation of Power Normalized Cepstral Coefficients: PNCC

Stars: ✭ 40 (-57.45%)

Mutual labels: speech-recognition, speech-processing

Awesome Diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Stars: ✭ 673 (+615.96%)

Mutual labels: speech-recognition, speech-processing

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-71.28%)

Mutual labels: speech-recognition, speech-processing

Speech recognition toolkit for the arduino

Stars: ✭ 448 (+376.6%)

Mutual labels: speech-recognition, speech-processing

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-43.62%)

Mutual labels: speech-recognition, speech-processing

Huawei-Challenge-Speaker-Identification

Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.

Stars: ✭ 34 (-63.83%)

Mutual labels: speech-processing, speaker-identification

a simple speech recognition app using the Web Speech API Interfaces

Stars: ✭ 18 (-80.85%)

Mutual labels: speech-recognition, speech-processing

Experiments with Hugging Face 🔬 🤗

Stars: ✭ 37 (-60.64%)

Mutual labels: speech-recognition, topic-detection

Scripts for LIUM SpkDiarization tools

Stars: ✭ 28 (-70.21%)

Mutual labels: speech-processing, speaker-diarization

Pytorch implementation of subband decomposition

Stars: ✭ 63 (-32.98%)

Mutual labels: speech-recognition, speech-processing

QuantumSpeech-QCNN

IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition

Stars: ✭ 71 (-24.47%)

Mutual labels: speech-recognition, speech-processing

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Stars: ✭ 21 (-77.66%)

Mutual labels: speech-recognition, speech-transcription

Formant Analyzer

iOS application for finding formants in spoken sounds

Stars: ✭ 43 (-54.26%)

Mutual labels: speech-recognition, speech-processing

Nonautoreggenprogress

Tracking the progress in non-autoregressive generation (translation, transcription, etc.)

Stars: ✭ 118 (+25.53%)

Mutual labels: speech-recognition, speech-processing

Speech Recognition Using Tacotron

Stars: ✭ 165 (+75.53%)

Mutual labels: speech-recognition

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+2130.85%)

Mutual labels: speech-recognition

Automatic Speech Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

Stars: ✭ 192 (+104.26%)

Mutual labels: speech-recognition

NeMo: a toolkit for conversational AI

Stars: ✭ 3,685 (+3820.21%)

Mutual labels: speech-recognition

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Stars: ✭ 161 (+71.28%)

Mutual labels: speech-recognition

Speechtotext Websockets Javascript

SDK & Sample to do speech recognition using websockets in Javascript

Stars: ✭ 191 (+103.19%)

Mutual labels: speech-recognition

A pure python module for reading and writing kaldi ark files

Stars: ✭ 160 (+70.21%)

Mutual labels: speech-recognition

Interspeech2019 Tutorial

INTERSPEECH 2019 Tutorial Materials

Stars: ✭ 160 (+70.21%)

Mutual labels: speech-recognition

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.

Stars: ✭ 190 (+102.13%)

Mutual labels: speech-recognition

Rnnt Speech Recognition

End-to-end speech recognition using RNN Transducers in Tensorflow 2.0

Stars: ✭ 158 (+68.09%)

Mutual labels: speech-recognition

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Stars: ✭ 156 (+65.96%)

Mutual labels: speech-recognition

📦 快速转化「中文数字」和「阿拉伯数字」～ (最新特性：分数，日期、温度等转化）

Stars: ✭ 249 (+164.89%)

Mutual labels: speech-recognition

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Stars: ✭ 220 (+134.04%)

Mutual labels: speech-recognition

Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).

Stars: ✭ 190 (+102.13%)

Mutual labels: speech-recognition

ClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)

Stars: ✭ 151 (+60.64%)

Mutual labels: speech-recognition

Speech To Text Russian

Проект для распознавания речи на русском языке на основе pykaldi.

Stars: ✭ 151 (+60.64%)

Mutual labels: speech-recognition

Voice Overlay Android

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 189 (+101.06%)

Mutual labels: speech-recognition

A speech recognition framework designed for SwiftUI.

Stars: ✭ 149 (+58.51%)

Mutual labels: speech-recognition

Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx

Stars: ✭ 209 (+122.34%)

Mutual labels: speech-recognition

Tensorflow Speech Recognition

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

Stars: ✭ 2,118 (+2153.19%)

Mutual labels: speech-recognition

Speech Recognition Neural Network

This is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.

Stars: ✭ 148 (+57.45%)

Mutual labels: speech-recognition

Speechrecognizerbutton

UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.

Stars: ✭ 144 (+53.19%)

Mutual labels: speech-recognition

Kaldi Offline Transcriber

Offline transcription system for Estonian using Kaldi

Stars: ✭ 182 (+93.62%)

Mutual labels: speech-recognition

Deep learning for audio processing

Stars: ✭ 142 (+51.06%)

Mutual labels: speech-recognition

Aimybox Android Assistant

Embeddable custom voice assistant for Android applications

Stars: ✭ 139 (+47.87%)

Mutual labels: speech-recognition

Automatic speech recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Stars: ✭ 2,751 (+2826.6%)

Mutual labels: speech-recognition

Kaldi-based Korean ASR (한국어 음성인식) open-source project

Stars: ✭ 248 (+163.83%)

Mutual labels: speech-recognition

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (+118.09%)

Mutual labels: speech-recognition

VOSK Speech Recognition Toolkit

Stars: ✭ 182 (+93.62%)

Mutual labels: speech-recognition

Go Astideepspeech

Golang bindings for Mozilla's DeepSpeech speech-to-text library

Stars: ✭ 137 (+45.74%)

Mutual labels: speech-recognition

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Stars: ✭ 135 (+43.62%)

Mutual labels: speech-recognition

Deepspeech German

Automatic Speech Recognition (ASR) - German

Stars: ✭ 179 (+90.43%)

Mutual labels: speech-recognition

Awesome Ai Services

An overview of the AI-as-a-service landscape

Stars: ✭ 133 (+41.49%)

Mutual labels: speech-recognition

Voice activity detection

Voice Activity Detection based on Deep Learning & TensorFlow

Stars: ✭ 132 (+40.43%)

Mutual labels: speech-recognition

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Stars: ✭ 196 (+108.51%)

Mutual labels: speech-recognition

1-60 of 405 similar projects