
529 open source projects that are alternatives to or similar to wav2vec2-live

Windows "say"

Stars: ✭ 36 (-82.44%)

Mutual labels: speech

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

Stars: ✭ 90 (-56.1%)

Mutual labels: speech

This application was developed to help speechless people interact with others with ease. It detects voice and converts the input speech into a sign-language-based video.

Stars: ✭ 21 (-89.76%)

Mutual labels: speech

Siricontrol System

Control anything with Siri voice commands.

Stars: ✭ 180 (-12.2%)

Mutual labels: speech

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

Stars: ✭ 31 (-84.88%)

Mutual labels: speech

Collection of auditory models.

Stars: ✭ 25 (-87.8%)

Mutual labels: speech

Software for interview transcription

Stars: ✭ 12 (-94.15%)

Mutual labels: speech

Data manipulation and transformation for audio signal processing, powered by PyTorch

Stars: ✭ 1,262 (+515.61%)

Mutual labels: speech

Baidu Cloud streaming speech recognition client SDK

Stars: ✭ 62 (-69.76%)

Mutual labels: asr

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+43.9%)

Mutual labels: speech

Gender recognition by voice and speech analysis

Stars: ✭ 248 (+20.98%)

Mutual labels: speech

The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".

Stars: ✭ 77 (-62.44%)

Mutual labels: speech

Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch

Stars: ✭ 29 (-85.85%)

Mutual labels: speech

icassp2019-latex-template

Official ICASSP 2019 LaTeX template

Stars: ✭ 21 (-89.76%)

Mutual labels: speech

Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)

Stars: ✭ 17 (-91.71%)

Mutual labels: speech

Tools to convert text to speech 📚💬

Stars: ✭ 84 (-59.02%)

Mutual labels: speech

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Stars: ✭ 13,870 (+6665.85%)

Mutual labels: speech

📈 Ventib records your voice, transcribes it in realtime, and performs speech pattern analysis to give you objective statistics about how you speak.

Stars: ✭ 43 (-79.02%)

Mutual labels: speech

Praat: Doing Phonetics By Computer

Stars: ✭ 675 (+229.27%)

Mutual labels: speech

PyTorch reimplementation of per-channel energy normalization for audio.

Stars: ✭ 80 (-60.98%)

Mutual labels: speech

Deep speaker Speaker recognition system

Keras implementation of "Deep Speaker: an End-to-End Neural Speaker Embedding System" (speaker recognition)

Stars: ✭ 174 (-15.12%)

Mutual labels: speech

Speech Enhancement Generative Adversarial Network in TensorFlow

Stars: ✭ 661 (+222.44%)

Mutual labels: speech

Automatic speech recognition

End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow

Stars: ✭ 2,751 (+1241.95%)

Mutual labels: speech-recognition

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (+19.51%)

Mutual labels: speech

Chatbot Watson Android

An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.

Stars: ✭ 169 (-17.56%)

Mutual labels: speech

Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx

Stars: ✭ 209 (+1.95%)

Mutual labels: speech-recognition

react-native-speech-bubble

💬 A speech bubble dialog component for React Native.

Stars: ✭ 50 (-75.61%)

Mutual labels: speech

Speech Denoising Wavenet

A neural network for end-to-end speech denoising

Stars: ✭ 516 (+151.71%)

Mutual labels: speech

Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.

Stars: ✭ 493 (+140.49%)

Mutual labels: speech

Subtitle Speech Synchronizer

Stars: ✭ 379 (+84.88%)

Mutual labels: speech-recognition

data-at-hand-mobile

Mobile application for exploring fitness data using both speech and touch interaction.

Stars: ✭ 50 (-75.61%)

Mutual labels: speech

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (+2111.22%)

Mutual labels: speech-recognition

libfaceid is a research framework for prototyping face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models with speech synthesis and speech recognition.

Stars: ✭ 354 (+72.68%)

Mutual labels: speech-recognition

CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)

Stars: ✭ 45 (-78.05%)

Mutual labels: speech

Emotion Classification From Audio Files

Understanding emotions from audio files using neural networks and multiple datasets.

Stars: ✭ 189 (-7.8%)

Mutual labels: speech

Alan AI iOS SDK adds a voice assistant or chatbot to your app. Supports Swift, Objective-C.

Stars: ✭ 318 (+55.12%)

Mutual labels: speech-recognition

aframe-speech-controls-component

Alternative form of input for in-VR interaction with the content of a scene

Stars: ✭ 13 (-93.66%)

Mutual labels: speech

🐸 collection of TTS papers

Stars: ✭ 160 (-21.95%)

Mutual labels: speech

🎧 🎼 Advanced JavaFX Media Player

Stars: ✭ 472 (+130.24%)

Mutual labels: speech

Kaldi Offline Transcriber

Offline transcription system for Estonian using Kaldi

Stars: ✭ 182 (-11.22%)

Mutual labels: speech-recognition

Multimodal-Gesture-Recognition-with-LSTMs-and-CTC

An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from a continuous stream.

Stars: ✭ 25 (-87.8%)

Mutual labels: speech

Deepspeech German

Automatic Speech Recognition (ASR) - German

Stars: ✭ 179 (-12.68%)

Mutual labels: speech-recognition

AAC communication system with text-to-speech for the browser

Stars: ✭ 437 (+113.17%)

Mutual labels: speech

The CIDLib general purpose C++ development environment

Stars: ✭ 179 (-12.68%)

Mutual labels: speech-recognition

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Stars: ✭ 1,942 (+847.32%)

Mutual labels: speech

Kaldi model converter to ONNX

Stars: ✭ 174 (-15.12%)

Mutual labels: speech-recognition

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (-21.46%)

Mutual labels: speech

PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop

Stars: ✭ 2,934 (+1331.22%)

Mutual labels: speech-recognition

The ITU-T Software Tool Library (G.191)

Stars: ✭ 44 (-78.54%)

Mutual labels: speech

Awesome Speech Recognition Speech Synthesis Papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+917.07%)

Mutual labels: speech-recognition

Pitch shifter using WSOLA and resampling, implemented in Python 3

Stars: ✭ 22 (-89.27%)

Mutual labels: speech

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+839.51%)

Mutual labels: speech

Cordova Plugin Speechrecognition

🎤 Cordova Plugin for Speech Recognition

Stars: ✭ 174 (-15.12%)

Mutual labels: speech-recognition

Gst Kaldi Nnet2 Online

GStreamer plugin around Kaldi's online neural network decoder

Stars: ✭ 171 (-16.59%)

Mutual labels: speech-recognition

A pure python module for reading and writing kaldi ark files

Stars: ✭ 160 (-21.95%)

Mutual labels: speech-recognition

Voice Converter Cyclegan

Voice Converter Using CycleGAN and Non-Parallel Data

Stars: ✭ 384 (+87.32%)

Mutual labels: speech

Interspeech2019 Tutorial

INTERSPEECH 2019 Tutorial Materials

Stars: ✭ 160 (-21.95%)

Mutual labels: speech-recognition

A Polymer 3+ webcomponent / button for doing speech recognition

Stars: ✭ 54 (-73.66%)

Mutual labels: speech-recognition

Tacotron pytorch

PyTorch implementation of Tacotron speech synthesis model.

Stars: ✭ 242 (+18.05%)

Mutual labels: speech

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Stars: ✭ 1,756 (+756.59%)

Mutual labels: speech
