All Projects → Awesome Diarization → Similar Projects or Alternatives

377 Open source projects that are alternatives of or similar to Awesome Diarization

语音api示例

Stars: ✭ 454 (-32.54%)

StageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.

Stars: ✭ 60 (-91.08%)

Mutual labels: speech-recognition

Libfaceid

libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.

Stars: ✭ 354 (-47.4%)

Mutual labels: speech-recognition

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (-20.95%)

Mutual labels: speech-recognition

Speech-Recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Stars: ✭ 21 (-96.88%)

Mutual labels: speech-recognition

download audioset

📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).

Stars: ✭ 53 (-92.12%)

Mutual labels: speech-recognition

Brevitas

Brevitas: quantization-aware training in PyTorch

Stars: ✭ 343 (-49.03%)

Mutual labels: speech-recognition

leon

🧠 Leon is your open-source personal assistant.

Stars: ✭ 8,560 (+1171.92%)

Mutual labels: speech-recognition

Voice Overlay Ios

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 440 (-34.62%)

Mutual labels: speech-recognition

sepia-stt-server

SEPIA server to support open-source speech recognition via WebSocket connection.

Stars: ✭ 45 (-93.31%)

Mutual labels: speech-recognition

J.a.r.v.i.s

python powered Intelligent System

Stars: ✭ 325 (-51.71%)

Mutual labels: speech-recognition

speech-to-text

mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras

Stars: ✭ 61 (-90.94%)

Mutual labels: speech-recognition

Vad

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stars: ✭ 622 (-7.58%)

Mutual labels: speech-recognition

Neural Voice Cloning With Few Samples

This repository has implementation for "Neural Voice Cloning With Few Samples"

Stars: ✭ 262 (-61.07%)

Mutual labels: speech-processing

speech-to-text-code-pattern

React app using the Watson Speech to Text service to transform voice audio into written text.

Stars: ✭ 37 (-94.5%)

Mutual labels: speech-recognition

Surfboard

Novoic's audio feature extraction library

Stars: ✭ 318 (-52.75%)

Mutual labels: speech-processing

kosr

Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)

Stars: ✭ 25 (-96.29%)

Mutual labels: speech-recognition

Specaugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Stars: ✭ 408 (-39.38%)

Mutual labels: speech-recognition

speech-recognition

SDKs and docs for Skit's speech to text service

Stars: ✭ 20 (-97.03%)

Mutual labels: speech-recognition

Nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

Stars: ✭ 308 (-54.23%)

Mutual labels: speech-processing

YouTube-Tutorials--Italian

📂 Source Code for (some of) the Programming Tutorials from my Italian YouTube Channel and website ProgrammareInPython.it. This is just a small portion of the content: please visit the website for more.

Stars: ✭ 28 (-95.84%)

Mutual labels: speech-recognition

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (-22.44%)

Mutual labels: speech-recognition

rosecho

Tianbot Rosecho (Tianecho)，中文语音人机交互模块，支持ROS即插即用

Stars: ✭ 28 (-95.84%)

Mutual labels: speech-recognition

Speech recognition

A Flutter plugin to use speech recognition on iOS & Android (Swift/Java)

Stars: ✭ 302 (-55.13%)

Mutual labels: speech-recognition

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

Stars: ✭ 3,125 (+364.34%)

Mutual labels: speech-processing

Neural sp

End-to-end ASR/LM implementation with PyTorch

Stars: ✭ 408 (-39.38%)

Mutual labels: speech-recognition

sepia-docs

Documentation and Wiki for SEPIA. Please post your questions and bug-reports here in the issues section! Thank you :-)

Stars: ✭ 160 (-76.23%)

Mutual labels: speech-recognition

Pysptk

A python wrapper for Speech Signal Processing Toolkit (SPTK).

Stars: ✭ 297 (-55.87%)

Mutual labels: speech-processing

deepspeech

A PyTorch implementation of DeepSpeech and DeepSpeech2.

Stars: ✭ 45 (-93.31%)

Mutual labels: speech-recognition

Speech Emotion Analyzer

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

Stars: ✭ 633 (-5.94%)

Mutual labels: speech-recognition

mixup

speechpro.com/

Stars: ✭ 23 (-96.58%)

Mutual labels: speech-recognition

Alan Sdk Android

Alan AI Android SDK adds a voice assistant or chatbot to your app. Supports Java, Kotlin.

Stars: ✭ 278 (-58.69%)

Mutual labels: speech-recognition

lightning-asr

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

Stars: ✭ 36 (-94.65%)

Mutual labels: speech-recognition

Ctcwordbeamsearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.

Stars: ✭ 398 (-40.86%)

Mutual labels: speech-recognition

quran-align

Word-accurate timestamps for Qur'anic audio.

Stars: ✭ 139 (-79.35%)

Mutual labels: speech-recognition

Phonetisaurus

Phonetisaurus G2P

Stars: ✭ 277 (-58.84%)

Mutual labels: speech-recognition

pocketsphinx

Updated ROS bindings to pocketsphinx

Stars: ✭ 36 (-94.65%)

Mutual labels: speech-recognition

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (-27.19%)

Mutual labels: speech-recognition

Alan Sdk Cordova

Alan AI Cordova SDK adds a voice assistant or chatbot to your app.

Stars: ✭ 269 (-60.03%)

Mutual labels: speech-recognition

NLP Toolkit

Library of state-of-the-art models (PyTorch) for NLP tasks

Stars: ✭ 92 (-86.33%)

Mutual labels: speech-recognition

Awesome Kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Stars: ✭ 393 (-41.6%)

Mutual labels: speech-recognition

Pocketsphinx

PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop

Stars: ✭ 2,934 (+335.96%)

Mutual labels: speech-recognition

formulas-python

Ritchie CLI formulas in Python 🐍

Stars: ✭ 17 (-97.47%)

Mutual labels: speech-recognition

SpeechEnhancement

Combining Weighted Multi-resolution STFT Loss and Distance Fusion to Optimize Speech Enhancement Generative Adversarial Networks

Stars: ✭ 49 (-92.72%)

Mutual labels: speech-processing

soxan

Wav2Vec for speech recognition, classification, and audio classification

Stars: ✭ 113 (-83.21%)

Mutual labels: speech-recognition

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

Stars: ✭ 542 (-19.47%)

Mutual labels: speech-recognition

Mycroft Precise

A lightweight, simple-to-use, RNN wake word listener

Stars: ✭ 481 (-28.53%)

Mutual labels: speech-recognition

Nmtpytorch

Sequence-to-Sequence Framework in PyTorch

Stars: ✭ 392 (-41.75%)

Mutual labels: speech-recognition

Awesome Speech Enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

Stars: ✭ 257 (-61.81%)

Mutual labels: speech-processing

Tensorflow-Keyword-Spotting

Keyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.

Stars: ✭ 27 (-95.99%)

Mutual labels: speech-recognition

pyssp

python speech signal processing library

Stars: ✭ 18 (-97.33%)

Mutual labels: speech-processing

SpeechTransProgress

Tracking the progress in end-to-end speech translation

Stars: ✭ 139 (-79.35%)

Mutual labels: speech-processing

A chronology of deep learning

Tracing back and exposing in chronological order the main ideas in the field of deep learning, to help everyone better understand the current intense research in AI.

Stars: ✭ 47 (-93.02%)

Mutual labels: speech-recognition

BookLibrary

Book Library of P&W Studio

Stars: ✭ 13 (-98.07%)

Mutual labels: speech-processing

Cheetah

On-device streaming speech-to-text engine powered by deep learning

Stars: ✭ 383 (-43.09%)

Mutual labels: speech-recognition

HotVoice

Adds Speech Recognition support to AutoHotkey, via a C# DLL