opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-77.42%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+482.8%)
Wukong Robot🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,还可能是首个支持脑机交互的开源智能音箱项目。
Stars: ✭ 3,110 (+3244.09%)
leopard-chat-ui-teneoLeopard Chat UI - A Teneo Chat Client based on Vue and Vuetify
Stars: ✭ 65 (-30.11%)
klaamArabic speech recognition, classification and text-to-speech.
Stars: ✭ 151 (+62.37%)
spokestack-tray-androidA UI component that makes it easy to add voice interaction to your app.
Stars: ✭ 13 (-86.02%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-44.09%)
Mrcp Plugin With Freeswitch使用FreeSWITCH接受用户手机呼叫,通过UniMRCP Server集成讯飞开放平台(xfyun)插件将用户语音进行语音识别(ASR),并根据自定义业务逻辑调用语音合成(TTS),构建简单的端到端语音呼叫中心。
Stars: ✭ 168 (+80.65%)
LingvoLingvo
Stars: ✭ 2,361 (+2438.71%)
DSP-TestbenchA DSP Testbench for users of the JUCE framework
Stars: ✭ 40 (-56.99%)
dspDSP and filtering library
Stars: ✭ 36 (-61.29%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+92.47%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+120.43%)
AnotherBadBeatSaberCloneThis is a discontinued but perhaps helpful VR project created during my Master's degree at FH Wedel.
Stars: ✭ 22 (-76.34%)
DtBlkFxFast-Fourier-Transform (FFT) based VST plug-in
Stars: ✭ 99 (+6.45%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+280.65%)
myG2PMyanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS).
Stars: ✭ 43 (-53.76%)
gensoundPythonic audio processing and generation framework
Stars: ✭ 69 (-25.81%)
IMS-ToucanText-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+217.2%)
matchering-web🎚️ Self-Hosted LANDR / eMastered Alternative
Stars: ✭ 25 (-73.12%)
fmcw-RADAR[mmWave based fmcw radar design files] based on AWR1843 chip operating at 76-GHz to 81-GHz.
Stars: ✭ 41 (-55.91%)
spafe🔉 spafe: Simplified Python Audio Features Extraction
Stars: ✭ 310 (+233.33%)
hawkingThe retro text-to-speech bot for Discord
Stars: ✭ 24 (-74.19%)
lessamplerlessampler is a Singing Voice Synthesizer
Stars: ✭ 59 (-36.56%)
Pitch-TrackingPitch tracking in real-time with the Kalman filter
Stars: ✭ 78 (-16.13%)
audio noise clusteringhttps://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.
Stars: ✭ 24 (-74.19%)
asr2424-hour Automatic Speech Recognition
Stars: ✭ 27 (-70.97%)
TFGANTFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Stars: ✭ 65 (-30.11%)
pie百度云流式语音识别客户端 SDK
Stars: ✭ 62 (-33.33%)
say-itTTS in command line -- Pronounce the Chinese and English words you typed in.
Stars: ✭ 19 (-79.57%)
avsr-tf1Audio-Visual Speech Recognition using Sequence to Sequence Models
Stars: ✭ 76 (-18.28%)
ApolloApollo is a Open-Source music player for playback and organization of audio files on Microsoft Windows, built using Python.
Stars: ✭ 13 (-86.02%)
simple-obs-sttSpeech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (-4.3%)
oouraJavascript port of Ooura FFT implementation
Stars: ✭ 23 (-75.27%)
brasilttsBrasil TTS é um conjunto de sintetizadores de voz, em português do Brasil, que lê telas para portadores de deficiência visual. Transforma texto em áudio, permitindo que pessoas cegas ou com baixa visão tenham acesso ao conteúdo exibido na tela. Embora o principal público-alvo de sistemas de conversão texto-fala – como o Brasil TTS – seja formado…
Stars: ✭ 34 (-63.44%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-77.42%)
dsp-collection-javaA collection of Java classes for Digital Signal Processing
Stars: ✭ 41 (-55.91%)
ppt presenterConvert ppt to video with audio track, using text to speech synthesis
Stars: ✭ 38 (-59.14%)
NohmadNohmad modules for VCV Rack
Stars: ✭ 25 (-73.12%)
FergunAn utility Discord bot written in C# using Discord.Net
Stars: ✭ 26 (-72.04%)
FiltersAn Arduino finite impulse response and infinite impulse response filter library.
Stars: ✭ 36 (-61.29%)
lsp-dsp-libDSP library for signal processing
Stars: ✭ 37 (-60.22%)
concatenative granulationLive concatenative granular processing for click-free looping, complex wavetable oscillation, and non-overlapping granulation with rectangular windowing.
Stars: ✭ 35 (-62.37%)
voderAn emulation of the Voder Speech Synthesizer.
Stars: ✭ 19 (-79.57%)
edgeofchaosThis repository is not maintained anymore. If I have any significant contributions, I usually do a PR for the Faust libraries. This repository contains the Faust libraries for sound and information processing that I use to implement my music complex adaptive systems.
Stars: ✭ 51 (-45.16%)
rasrThe RWTH ASR Toolkit.
Stars: ✭ 43 (-53.76%)
oddvoicesAn indie singing synthesizer
Stars: ✭ 4 (-95.7%)
kospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (+390.32%)
uosUnited Open-libraries of Sound. United procedures for open-source audio libraries. For FPC/Lazarus/fpGUI/MSEgui.
Stars: ✭ 112 (+20.43%)
SpleeterRTReal time monaural source separation base on fully convolutional neural network operates on Time-frequency domain.
Stars: ✭ 111 (+19.35%)
GnuradioGNU Radio – the Free and Open Software Radio Ecosystem
Stars: ✭ 3,297 (+3445.16%)
Dsp.jlFilter design, periodograms, window functions, and other digital signal processing functionality
Stars: ✭ 226 (+143.01%)
FFTVisualizerThis project demonstrates DSP capabilities of Terasic DE2-115
Stars: ✭ 17 (-81.72%)
RendermanCommand line C++ and Python VSTi Host library with MFCC, FFT, RMS and audio extraction and .wav writing.
Stars: ✭ 225 (+141.94%)