Speech And TextSpeech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)
Stars: ✭ 102 (-54.05%)
AthenaA free and open source replacement for Google Assistant on Android devices, meant to integrate with the Sapphire Framework. It contains both speech-to-text and text-to-speech services. It does not require Google services or network connectivity
Stars: ✭ 73 (-67.12%)
Openseq2seqToolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+520.72%)
Speech recognitionSpeech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+2602.25%)
Aeneasaeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Stars: ✭ 1,942 (+774.77%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+1559.91%)
Google Speech V2💬 Reverse Engineering Google's Speech To Text API (v2)
Stars: ✭ 435 (+95.95%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+278.83%)
musicologistMusic advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-91.44%)
WatbotAn Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-71.17%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-53.6%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-87.84%)
Dragonfirethe open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (+404.5%)
leon🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+3755.86%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-77.48%)
SoloudFree, easy, portable audio engine for games
Stars: ✭ 1,048 (+372.07%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-84.23%)
NaomiThe Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (-22.97%)
WavefileA Ruby gem for reading and writing sound files in Wave format (*.wav)
Stars: ✭ 193 (-13.06%)
Optivideoeditor For AndroidNative Video editor : Video trim, Audio, Video merge, Slow and fast motion, Text and image, etc...
Stars: ✭ 209 (-5.86%)
PlayxSearch and play any song from terminal
Stars: ✭ 194 (-12.61%)
Jcplayer🎵 A simple audio player for Android applications.
Stars: ✭ 209 (-5.86%)
Waveform analysisFunctions and scripts for analyzing waveforms, primarily audio. This is currently somewhat disorganized and unfinished.
Stars: ✭ 193 (-13.06%)
PyacoustidPython bindings for Chromaprint acoustic fingerprinting and the Acoustid Web service
Stars: ✭ 214 (-3.6%)
WaveglowA PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis
Stars: ✭ 205 (-7.66%)
JavascriptmusicLive coding music and synthesis in Javascript / AssemblyScript (WebAssembly)
Stars: ✭ 193 (-13.06%)
SymphoniaPure Rust multimedia format demuxing, tag reading, and audio decoding library
Stars: ✭ 191 (-13.96%)
Rnn ctcRecurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (-0.9%)
Mimiummimium (MInimal Musical medIUM) a programming language as an infrastructure for sound and music.
Stars: ✭ 212 (-4.5%)
EdgedictWorking online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (-7.66%)
YoutagiOS music player app that downloads music from the internet, even YouTube
Stars: ✭ 193 (-13.06%)
BtrackA Real-Time Beat Tracker
Stars: ✭ 204 (-8.11%)
MwengineAudio engine and DSP for Android, written in C++ providing low latency performance in a musical context, supporting both OpenSL and AAudio.
Stars: ✭ 190 (-14.41%)
Rf24audioArduino library for streaming data/audio from analog inputs via NRF24L01 modules
Stars: ✭ 190 (-14.41%)
Recorderhtml5 js 录音 mp3 wav ogg webm amr 格式,支持pc和Android、ios部分浏览器、和Hybrid App(提供Android IOS App源码),微信也是支持的,提供H5版语音通话聊天示例 和DTMF编解码
Stars: ✭ 2,891 (+1202.25%)
DiscorddjDiscord DJ Bot. Play music in your server. Inspired by PlugDJ
Stars: ✭ 204 (-8.11%)
Libvlc GoGo bindings for libVLC and high-level media player interface
Stars: ✭ 188 (-15.32%)
Vonage Ruby SdkVonage REST API client for Ruby. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.
Stars: ✭ 203 (-8.56%)
JpsxdecjPSXdec: cross-platform PlayStation 1 audio and video converter
Stars: ✭ 219 (-1.35%)
Speech DenoiserA speech denoise lv2 plugin based on RNNoise library
Stars: ✭ 220 (-0.9%)
CsfmlOfficial binding of SFML for C
Stars: ✭ 211 (-4.95%)
OttoSampler, Sequencer, Multi-engine synth and effects - in a box! [WIP]
Stars: ✭ 2,390 (+976.58%)
Voice Overlay Android🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (-14.86%)
Openl3OpenL3: Open-source deep audio and image embeddings
Stars: ✭ 200 (-9.91%)
GeonkickA free software percussion synthesizer for GNU/Linux
Stars: ✭ 187 (-15.77%)
SupysonicSupysonic is a Python implementation of the Subsonic server API.
Stars: ✭ 187 (-15.77%)
Tts CubeEnd-2-end speech synthesis with recurrent neural networks
Stars: ✭ 213 (-4.05%)
Kaldi Active GrammarPython Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (-11.71%)