All Projects → Awesome Diarization → Similar Projects or Alternatives

377 Open source projects that are alternatives of or similar to Awesome Diarization

speechrec
a simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-97.33%)
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+24.96%)
Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+13.52%)
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (-92.12%)
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (-64.04%)
Keras Sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-93.02%)
Uspeech
Speech recognition toolkit for the arduino
Stars: ✭ 448 (-33.43%)
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (-66.72%)
torchsubband
Pytorch implementation of subband decomposition
Stars: ✭ 63 (-90.64%)
Nonautoreggenprogress
Tracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (-82.47%)
awesome-keyword-spotting
This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (-77.71%)
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (-69.54%)
QuantumSpeech-QCNN
IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (-89.45%)
Formant Analyzer
iOS application for finding formants in spoken sounds
Stars: ✭ 43 (-93.61%)
Zzz Retired openstt
RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (-78.31%)
Pncc
A implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-94.06%)
UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (-86.03%)
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-95.99%)
scim
[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (-97.47%)
Deepspeech Examples
Examples of how to use or integrate DeepSpeech
Stars: ✭ 356 (-47.1%)
Mutual labels:  speech-recognition
Libfaceid
libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (-47.4%)
Mutual labels:  speech-recognition
Pase
Problem Agnostic Speech Encoder
Stars: ✭ 348 (-48.29%)
Mutual labels:  speech-processing
Deepspeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+2675.63%)
Mutual labels:  speech-recognition
Ctcdecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Stars: ✭ 529 (-21.4%)
Mutual labels:  speech-recognition
Asrt speechrecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (+634.47%)
Mutual labels:  speech-recognition
Alan Sdk Ios
Alan AI iOS SDK adds a voice assistant or chatbot to your app. Supports Swift, Objective-C.
Stars: ✭ 318 (-52.75%)
Mutual labels:  speech-recognition
Espnet
End-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+573.55%)
Mutual labels:  speech-recognition
Speech Demo
语音api示例
Stars: ✭ 454 (-32.54%)
Mutual labels:  speech-recognition
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (-20.95%)
Mutual labels:  speech-recognition
Alan Sdk Flutter
Alan AI Flutter SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 309 (-54.09%)
Mutual labels:  speech-recognition
Brevitas
Brevitas: quantization-aware training in PyTorch
Stars: ✭ 343 (-49.03%)
Mutual labels:  speech-recognition
Voice Overlay Ios
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (-34.62%)
Mutual labels:  speech-recognition
J.a.r.v.i.s
python powered Intelligent System
Stars: ✭ 325 (-51.71%)
Mutual labels:  speech-recognition
Vad
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (-7.58%)
Mutual labels:  speech-recognition
Surfboard
Novoic's audio feature extraction library
Stars: ✭ 318 (-52.75%)
Mutual labels:  speech-processing
Specaugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Stars: ✭ 408 (-39.38%)
Mutual labels:  speech-recognition
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (-22.44%)
Mutual labels:  speech-recognition
Nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
Stars: ✭ 308 (-54.23%)
Mutual labels:  speech-processing
Rhino
On-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (-39.67%)
Mutual labels:  speech-recognition
Tensorflow end2end speech recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Stars: ✭ 305 (-54.68%)
Mutual labels:  speech-recognition
Speech recognition
A Flutter plugin to use speech recognition on iOS & Android (Swift/Java)
Stars: ✭ 302 (-55.13%)
Mutual labels:  speech-recognition
Neural sp
End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (-39.38%)
Mutual labels:  speech-recognition
Pocketsphinx Python
Python interface to CMU Sphinxbase and Pocketsphinx libraries
Stars: ✭ 298 (-55.72%)
Mutual labels:  speech-recognition
Pysptk
A python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (-55.87%)
Mutual labels:  speech-processing
Speech Emotion Analyzer
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (-5.94%)
Mutual labels:  speech-recognition
Wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (-8.32%)
Mutual labels:  speech-recognition
Speech Denoising Wavenet
A neural network for end-to-end speech denoising
Stars: ✭ 516 (-23.33%)
Mutual labels:  speech-processing
Tensorflowasr
⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (-40.56%)
Mutual labels:  speech-recognition
Alan Sdk Ionic
Alan AI Ionic SDK adds a voice assistant or chatbot to your app. Supports React, Angular.
Stars: ✭ 287 (-57.36%)
Mutual labels:  speech-recognition
Alan Sdk Android
Alan AI Android SDK adds a voice assistant or chatbot to your app. Supports Java, Kotlin.
Stars: ✭ 278 (-58.69%)
Mutual labels:  speech-recognition
Ctcwordbeamsearch
Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Stars: ✭ 398 (-40.86%)
Mutual labels:  speech-recognition
Vosk Server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Stars: ✭ 277 (-58.84%)
Mutual labels:  speech-recognition
Phonetisaurus
Phonetisaurus G2P
Stars: ✭ 277 (-58.84%)
Mutual labels:  speech-recognition
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-27.19%)
Mutual labels:  speech-recognition
Free Spoken Digit Dataset
A free audio dataset of spoken digits. Think MNIST for audio.
Stars: ✭ 396 (-41.16%)
Mutual labels:  speech-recognition
Vosk Android Demo
Offline speech recognition for Android with Vosk library.
Stars: ✭ 271 (-59.73%)
Mutual labels:  speech-recognition
Alan Sdk Cordova
Alan AI Cordova SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 269 (-60.03%)
Mutual labels:  speech-recognition
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-41.6%)
Mutual labels:  speech-recognition
Neural Voice Cloning With Few Samples
This repository has implementation for "Neural Voice Cloning With Few Samples"
Stars: ✭ 262 (-61.07%)
Mutual labels:  speech-processing
Pocketsphinx
PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop
Stars: ✭ 2,934 (+335.96%)
Mutual labels:  speech-recognition
1-60 of 377 similar projects