ml-with-audioHF's ML for Audio study group
Stars: ✭ 104 (+57.58%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+5483.33%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+1174.24%)
porfirГолосовой ассистент Порфирьевич
Stars: ✭ 23 (-65.15%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-46.97%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-59.09%)
idear🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (+27.27%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-24.24%)
KhronosThe open source intelligent personal assistant
Stars: ✭ 25 (-62.12%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+721.21%)
Openseq2seqToolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+1987.88%)
KalliopeKalliope is a framework that will help you to create your own personal assistant.
Stars: ✭ 1,509 (+2186.36%)
Artyom.jsA voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Stars: ✭ 1,011 (+1431.82%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (+56.06%)
TinyCogSmall Robot, Toy Robot platform
Stars: ✭ 29 (-56.06%)
Cross vcCross-lingual Voice Conversion
Stars: ✭ 91 (+37.88%)
NaomiThe Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (+159.09%)
LingvoLingvo
Stars: ✭ 2,361 (+3477.27%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-72.73%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-21.21%)
leon🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+12869.7%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+6768.18%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+210.61%)
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (+436.36%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+642.42%)
Wavenet SttAn end-to-end speech recognition system with Wavenet. Built using C++ and python.
Stars: ✭ 18 (-72.73%)
Cortex M KwsCortex M KWS example with Tengine Lite.
Stars: ✭ 45 (-31.82%)
Speechpy💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
Stars: ✭ 833 (+1162.12%)
PororoPORORO: Platform Of neuRal mOdels for natuRal language prOcessing
Stars: ✭ 812 (+1130.3%)
BiglittlenetOfficial repository for Big-Little Net
Stars: ✭ 57 (-13.64%)
Formant AnalyzeriOS application for finding formants in spoken sounds
Stars: ✭ 43 (-34.85%)
KurDescriptive Deep Learning
Stars: ✭ 811 (+1128.79%)
EspressoEspresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (+1124.24%)
Avsr Deep SpeechGoogle Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab
Stars: ✭ 43 (-34.85%)
Espeak NgeSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Stars: ✭ 799 (+1110.61%)
Stephanie VaStephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Stars: ✭ 772 (+1069.7%)
Pink TromboneA programmable version of Neil Thapen's Pink Trombone
Stars: ✭ 54 (-18.18%)
Tacotron2A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
Stars: ✭ 43 (-34.85%)
WorldA high-quality speech analysis, manipulation and synthesis system
Stars: ✭ 769 (+1065.15%)
SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+1057.58%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+1045.45%)
Rhvoicea free and open source speech synthesizer for Russian and other languages
Stars: ✭ 750 (+1036.36%)
Iflytek awaken asruse iflytek's technology to realize awaken and order recognition
Stars: ✭ 53 (-19.7%)
PnccA implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-39.39%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (+1018.18%)
Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+9318.18%)
Voice🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)
Stars: ✭ 993 (+1404.55%)
AdaptAdapt Intent Parser
Stars: ✭ 690 (+945.45%)
ParallelwaveganUnofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch
Stars: ✭ 682 (+933.33%)
Dragonfirethe open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (+1596.97%)
Cs224n Gpu That TalksAttention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
Stars: ✭ 52 (-21.21%)
DiscordspeechbotA speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-46.97%)
Wav2letterFacebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 5,907 (+8850%)
Awesome DiarizationA curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (+919.7%)