Pocketsphinx PythonPython interface to CMU Sphinxbase and Pocketsphinx libraries
Stars: ✭ 298 (+129.23%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-83.85%)
Android SpeechAndroid speech recognition and text to speech made easy
Stars: ✭ 310 (+138.46%)
anycontrolVoice control for your websites and applications
Stars: ✭ 53 (-59.23%)
PhomemeSimple sentence mixing tool (work in progress)
Stars: ✭ 18 (-86.15%)
VoicerAGI-server voice recognizer for #Asterisk
Stars: ✭ 73 (-43.85%)
VAD-LTSDEfficient voice activity detection algorithm using long-term speech information
Stars: ✭ 37 (-71.54%)
SignDetectThis application is developed to help speechless people interact with others with ease. It detects voice and converts the input speech into a sign language based video.
Stars: ✭ 21 (-83.85%)
web-speech-demoLearn how to build a simple text-to-speech voice app for the web using the Web Speech API.
Stars: ✭ 19 (-85.38%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+276.92%)
fadeA Simulation Framework for Auditory Discrimination Experiments
Stars: ✭ 12 (-90.77%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-60%)
Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+4681.54%)
JuliusOpen-Source Large Vocabulary Continuous Speech Recognition Engine
Stars: ✭ 1,258 (+867.69%)
Voice GenderGender recognition by voice and speech analysis
Stars: ✭ 248 (+90.77%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+386.92%)
Dialectid e2eEnd to End Dialect Identification using Convolutional Neural Network
Stars: ✭ 40 (-69.23%)
AudioData manipulation and transformation for audio signal processing, powered by PyTorch
Stars: ✭ 1,262 (+870.77%)
DurianImplementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (-14.62%)
TtsTools to convert text to speech 📚💬
Stars: ✭ 84 (-35.38%)
SeganA PyTorch implementation of SEGAN based on INTERSPEECH 2017 paper "SEGAN: Speech Enhancement Generative Adversarial Network"
Stars: ✭ 82 (-36.92%)
LabelboxLabelbox is the fastest way to annotate data to build and ship computer vision applications.
Stars: ✭ 1,588 (+1121.54%)
DeltaDELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+1037.69%)
FigaroReal-time voice-changer for voice-chat, etc. Will support many different voice-filters and features in the future. 🎵
Stars: ✭ 80 (-38.46%)
HolobotHoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.
Stars: ✭ 114 (-12.31%)
Code Switching PapersA curated list of research papers and resources on code-switching
Stars: ✭ 122 (-6.15%)
Ccpd[ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition
Stars: ✭ 1,252 (+863.08%)
TeaspeakThe TeaSpeak server issue tracker
Stars: ✭ 81 (-37.69%)
Alan Sdk PcfAlan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.
Stars: ✭ 128 (-1.54%)
Midi2voiceSinging synthesis from MIDI file
Stars: ✭ 102 (-21.54%)
DeepspeechA PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+837.69%)
VokaturiandroidEmotion recognition by speech in android.
Stars: ✭ 79 (-39.23%)
PhormaticsUsing A.I. and computer vision to build a virtual personal fitness trainer. (Most Startup-Viable Hack - HackNYU2018)
Stars: ✭ 79 (-39.23%)
TtsText-to-Speech for Arduino
Stars: ✭ 118 (-9.23%)
Vonage Java SdkVonage Server SDK for Java. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.
Stars: ✭ 75 (-42.31%)
Vonage Dotnet SdkNexmo REST API client for .NET, ASP.NET, ASP.NET MVC written in C#. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.
Stars: ✭ 76 (-41.54%)
AssistantjsTypeScript framework to build cross-platform voice applications (alexa, google home, ...).
Stars: ✭ 100 (-23.08%)
Android Kotlin Chat AppOpen-source Voice & Video Calling and Text Chat App for Kotlin (Android)
Stars: ✭ 76 (-41.54%)
Face recognitionFace recognition docker image to provide a web service which is able to register and recognize faces
Stars: ✭ 74 (-43.08%)
Asr audio data linksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-1.54%)
3d Densenet3D Dense Connected Convolutional Network (3D-DenseNet for action recognition)
Stars: ✭ 118 (-9.23%)
Online place recognitionGraph-based image sequences matching for the visual place recognition in changing environments.
Stars: ✭ 100 (-23.08%)
Unityrtc基于webrtc的unity多人游戏实时语音(A Unity Demo for Impl Real-time Game Voice Among Mutiplayers Based On WEBRTC)
Stars: ✭ 74 (-43.08%)
AudiomatePython library for handling audio datasets.
Stars: ✭ 99 (-23.85%)
OpenasrA pytorch based end2end speech recognition system.
Stars: ✭ 69 (-46.92%)
Mad TwinnetThe code for the MaD TwinNet. Demo page:
Stars: ✭ 99 (-23.85%)
AudioswitchAn Android audio management library for real-time communication apps.
Stars: ✭ 69 (-46.92%)
WikipronMassively multilingual pronunciation mining
Stars: ✭ 99 (-23.85%)
Nlp Paper自然语言处理领域下的对话语音领域,整理相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)
Stars: ✭ 67 (-48.46%)
Mtcnnface detection and alignment with mtcnn
Stars: ✭ 66 (-49.23%)