VAD-LTSD: Efficient voice activity detection algorithm using long-term speech information
Stars: ✭ 37 (-87.58%)
Asr audio data links: A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-57.05%)
Java Speech Api: The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+64.43%)
Edgedict: Working online speech recognition based on RNN Transducer. (A trained model is available in the releases.)
Stars: ✭ 205 (-31.21%)
Phomeme: Simple sentence mixing tool (work in progress)
Stars: ✭ 18 (-93.96%)
deepspeech.mxnet: An MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-72.48%)
Alan Sdk Ionic: Alan AI Ionic SDK adds a voice assistant or chatbot to your app. Supports React, Angular.
Stars: ✭ 287 (-3.69%)
saltychat-fivem: FiveM implementation of Salty Chat (TeamSpeak 3 based Voice Plugin)
Stars: ✭ 64 (-78.52%)
Speech Aligner: speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription
Stars: ✭ 259 (-13.09%)
gm 8bit: Audio interception utils for Garry's Mod
Stars: ✭ 28 (-90.6%)
vasisualy: Vasisualy is a simple Russian voice assistant written in Python for GNU/Linux, Windows and Android.
Stars: ✭ 33 (-88.93%)
ReminderPro: ReminderPro (location, note, voice recording)
Stars: ✭ 27 (-90.94%)
Voice-Denoising-AN: A Conditional Generative Adversarial Network (cGAN) was adapted for the task of source de-noising of noisy voice auditory images. The base architecture is adapted from Pix2Pix.
Stars: ✭ 42 (-85.91%)
twilio-voice-notification-app: Reference app built in ReactJS that demonstrates how to leverage Twilio Programmable Voice and Twilio SDKs to create a voice notification system.
Stars: ✭ 21 (-92.95%)
Disgord: Go module for interacting with Discord's documented bot interface: Gateway, REST requests and voice
Stars: ✭ 277 (-7.05%)
Vmsg: 🎵 Library for creating voice messages
Stars: ✭ 257 (-13.76%)
Recording-Bot: A bot built to record and transcribe audio fragments from Discord.
Stars: ✭ 22 (-92.62%)
talkie: Text-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.
Stars: ✭ 43 (-85.57%)
univoice: Voice chat/VoIP solution for Unity. P2P implementation included.
Stars: ✭ 192 (-35.57%)
StageMate: StageMate is the smart assistant for your presentation. It will cover all aspects of your pitch, from skipping slides to reminding you if you miss a major point.
Stars: ✭ 60 (-79.87%)
LIUM: Scripts for LIUM SpkDiarization tools
Stars: ✭ 28 (-90.6%)
YouTube-Tutorials--Italian: 📂 Source Code for (some of) the Programming Tutorials from my Italian YouTube Channel and website ProgrammareInPython.it. This is just a small portion of the content: please visit the website for more.
Stars: ✭ 28 (-90.6%)
Amazing Python Scripts: 🚀 Curated collection of amazing Python scripts, from basic to advanced, including automation task scripts.
Stars: ✭ 229 (-23.15%)
vortex: Revolt voice server
Stars: ✭ 61 (-79.53%)
speech recognition ctc: Chinese speech recognition using CTC with Keras
Stars: ✭ 40 (-86.58%)
tt-vae-gan: Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.
Stars: ✭ 37 (-87.58%)
CCAligner: 🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.
Stars: ✭ 131 (-56.04%)
Sednn: Deep learning based speech enhancement using Keras or PyTorch, made easy to use
Stars: ✭ 288 (-3.36%)
Discord Rs: Rust library for the Discord chat client API
Stars: ✭ 272 (-8.72%)
assister: Private Open General Assistant Platform
Stars: ✭ 42 (-85.91%)
Speech256: An FPGA implementation of a classic '80s speech synthesizer. Done for the Retro Challenge 2017/10.
Stars: ✭ 51 (-82.89%)
rosecho: Tianbot Rosecho (Tianecho), a Chinese voice human-machine interaction module with plug-and-play ROS support
Stars: ✭ 28 (-90.6%)
sampvoice: Software Development Kit for implementing Pawn voice systems for SA:MP servers.
Stars: ✭ 79 (-73.49%)
editts: Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: ✭ 74 (-75.17%)
wav2letter: Facebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 6,026 (+1922.15%)
jackpair: P2P speech encrypting device with analog audio interface suitable for GSM phones
Stars: ✭ 26 (-91.28%)
HotVoice: Adds Speech Recognition support to AutoHotkey, via a C# DLL
Stars: ✭ 41 (-86.24%)
Multi-Hotword Spotting: Wouldn't it be cool to build a speech assistant like Alexa or Siri yourself, without a voice API or network connection?
Stars: ✭ 31 (-89.6%)
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (-46.98%)
sepia-docs: Documentation and Wiki for SEPIA. Please post your questions and bug reports here in the issues section! Thank you :-)
Stars: ✭ 160 (-46.31%)
fade: A Simulation Framework for Auditory Discrimination Experiments
Stars: ✭ 12 (-95.97%)
voce-browser: Voice Controlled Chromium Web Browser
Stars: ✭ 34 (-88.59%)
Vosk Android Demo: Offline speech recognition for Android with Vosk library.
Stars: ✭ 271 (-9.06%)
Listen-Attend-Spell-v2: PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
Stars: ✭ 29 (-90.27%)
musicologist: Music advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-93.62%)
MelNet-SpeechGeneration: Implementation of MelNet in PyTorch to generate high-fidelity audio samples
Stars: ✭ 19 (-93.62%)
nabaztag-php: A simple PHP implementation of a Nabaztag server
Stars: ✭ 14 (-95.3%)
leon: 🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+2772.48%)
deepspeech: A PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (-84.9%)
HTK: The Hidden Markov Model Toolkit (HTK) from the University of Cambridge, with fixed issues.
Stars: ✭ 23 (-92.28%)
Noise2Noise-audio denoising without clean training data: Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEECH 2021 conference. This paper tackles the heavy dependence of deep learning based audio denoising methods on clean speech data by showing that it is possible to train deep speech denoisi…
Stars: ✭ 49 (-83.56%)
ser-with-w2v2: Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'
Stars: ✭ 40 (-86.58%)
pyjsgf: JSpeech Grammar Format (JSGF) compiler, matcher and parser package for Python.
Stars: ✭ 40 (-86.58%)
mixup: speechpro.com/
Stars: ✭ 23 (-92.28%)