All Projects → Awesome Diarization → Similar Projects or Alternatives

377 Open source projects that are alternatives of or similar to Awesome Diarization

a simple speech recognition app using the Web Speech API Interfaces

Stars: ✭ 18 (-97.33%)

Mutual labels: speech-recognition, speech-processing

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+24.96%)

Mutual labels: speech-recognition, speech-processing

SincNet is a neural architecture for efficiently processing raw audio samples.

Stars: ✭ 764 (+13.52%)

Mutual labels: speech-recognition, speech-processing

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-92.12%)

Mutual labels: speech-recognition, speech-processing

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (-64.04%)

Mutual labels: speech-recognition, speech-processing

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Stars: ✭ 47 (-93.02%)

Mutual labels: speech-recognition, speech-processing

Speech recognition toolkit for the arduino

Stars: ✭ 448 (-33.43%)

Mutual labels: speech-recognition, speech-processing

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (-66.72%)

Mutual labels: speech-recognition, speech-processing

Pytorch implementation of subband decomposition

Stars: ✭ 63 (-90.64%)

Mutual labels: speech-recognition, speech-processing

Nonautoreggenprogress

Tracking the progress in non-autoregressive generation (translation, transcription, etc.)

Stars: ✭ 118 (-82.47%)

Mutual labels: speech-recognition, speech-processing

awesome-keyword-spotting

This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).

Stars: ✭ 150 (-77.71%)

Mutual labels: speech-recognition, speech-processing

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stars: ✭ 205 (-69.54%)

Mutual labels: speech-recognition, speech-processing

QuantumSpeech-QCNN

IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition

Stars: ✭ 71 (-89.45%)

Mutual labels: speech-recognition, speech-processing

Formant Analyzer

iOS application for finding formants in spoken sounds

Stars: ✭ 43 (-93.61%)

Mutual labels: speech-recognition, speech-processing

Zzz Retired openstt

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

Stars: ✭ 146 (-78.31%)

Mutual labels: speech-recognition, speech-processing

A implementation of Power Normalized Cepstral Coefficients: PNCC

Stars: ✭ 40 (-94.06%)

Mutual labels: speech-recognition, speech-processing

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Stars: ✭ 94 (-86.03%)

Mutual labels: speech-recognition, speech-processing

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-95.99%)

Mutual labels: speech-recognition, speech-processing

[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.

Stars: ✭ 17 (-97.47%)

Mutual labels: speech-recognition, speech-processing

Deepspeech Examples

Examples of how to use or integrate DeepSpeech

Stars: ✭ 356 (-47.1%)

Mutual labels: speech-recognition

libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.

Stars: ✭ 354 (-47.4%)

Mutual labels: speech-recognition

Problem Agnostic Speech Encoder

Stars: ✭ 348 (-48.29%)

Mutual labels: speech-processing

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Stars: ✭ 18,680 (+2675.63%)

Mutual labels: speech-recognition

Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.

Stars: ✭ 529 (-21.4%)

Mutual labels: speech-recognition

Asrt speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Stars: ✭ 4,943 (+634.47%)

Mutual labels: speech-recognition

Alan AI iOS SDK adds a voice assistant or chatbot to your app. Supports Swift, Objective-C.

Stars: ✭ 318 (-52.75%)

Mutual labels: speech-recognition

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (+573.55%)

Mutual labels: speech-recognition

语音api示例

Stars: ✭ 454 (-32.54%)

Mutual labels: speech-recognition

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (-20.95%)

Mutual labels: speech-recognition

Alan Sdk Flutter

Alan AI Flutter SDK adds a voice assistant or chatbot to your app.

Stars: ✭ 309 (-54.09%)

Mutual labels: speech-recognition

Brevitas: quantization-aware training in PyTorch

Stars: ✭ 343 (-49.03%)

Mutual labels: speech-recognition

Voice Overlay Ios

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 440 (-34.62%)

Mutual labels: speech-recognition

python powered Intelligent System

Stars: ✭ 325 (-51.71%)

Mutual labels: speech-recognition

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stars: ✭ 622 (-7.58%)

Mutual labels: speech-recognition

Novoic's audio feature extraction library

Stars: ✭ 318 (-52.75%)

Mutual labels: speech-processing

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Stars: ✭ 408 (-39.38%)

Mutual labels: speech-recognition

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (-22.44%)

Mutual labels: speech-recognition

Library to build speech synthesis systems designed for easy and fast prototyping.

Stars: ✭ 308 (-54.23%)

Mutual labels: speech-processing

On-device speech-to-intent engine powered by deep learning

Stars: ✭ 406 (-39.67%)

Mutual labels: speech-recognition

Tensorflow end2end speech recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

Stars: ✭ 305 (-54.68%)

Mutual labels: speech-recognition

Speech recognition

A Flutter plugin to use speech recognition on iOS & Android (Swift/Java)

Stars: ✭ 302 (-55.13%)

Mutual labels: speech-recognition

End-to-end ASR/LM implementation with PyTorch

Stars: ✭ 408 (-39.38%)

Mutual labels: speech-recognition

Pocketsphinx Python

Python interface to CMU Sphinxbase and Pocketsphinx libraries

Stars: ✭ 298 (-55.72%)

Mutual labels: speech-recognition

A python wrapper for Speech Signal Processing Toolkit (SPTK).

Stars: ✭ 297 (-55.87%)

Mutual labels: speech-processing

Speech Emotion Analyzer

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

Stars: ✭ 633 (-5.94%)

Mutual labels: speech-recognition

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 617 (-8.32%)

Mutual labels: speech-recognition

Speech Denoising Wavenet

A neural network for end-to-end speech denoising

Stars: ✭ 516 (-23.33%)

Mutual labels: speech-processing

⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

Stars: ✭ 400 (-40.56%)

Mutual labels: speech-recognition

Alan AI Ionic SDK adds a voice assistant or chatbot to your app. Supports React, Angular.

Stars: ✭ 287 (-57.36%)

Mutual labels: speech-recognition

Alan Sdk Android

Alan AI Android SDK adds a voice assistant or chatbot to your app. Supports Java, Kotlin.

Stars: ✭ 278 (-58.69%)

Mutual labels: speech-recognition

Ctcwordbeamsearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.

Stars: ✭ 398 (-40.86%)

Mutual labels: speech-recognition

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

Stars: ✭ 277 (-58.84%)

Mutual labels: speech-recognition

Phonetisaurus G2P

Stars: ✭ 277 (-58.84%)

Mutual labels: speech-recognition

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (-27.19%)

Mutual labels: speech-recognition

Free Spoken Digit Dataset

A free audio dataset of spoken digits. Think MNIST for audio.

Stars: ✭ 396 (-41.16%)

Mutual labels: speech-recognition

Vosk Android Demo

Offline speech recognition for Android with Vosk library.

Stars: ✭ 271 (-59.73%)

Mutual labels: speech-recognition

Alan Sdk Cordova

Alan AI Cordova SDK adds a voice assistant or chatbot to your app.

Stars: ✭ 269 (-60.03%)

Mutual labels: speech-recognition

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Stars: ✭ 393 (-41.6%)

Mutual labels: speech-recognition

Neural Voice Cloning With Few Samples

This repository has implementation for "Neural Voice Cloning With Few Samples"

Stars: ✭ 262 (-61.07%)

Mutual labels: speech-processing

PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop

Stars: ✭ 2,934 (+335.96%)

Mutual labels: speech-recognition

1-60 of 377 similar projects