Soloud: Free, easy, portable audio engine for games
Stars: ✭ 1,048 (+376.36%)
Audiomate: Python library for handling audio datasets.
Stars: ✭ 99 (-55%)
Audio: Data manipulation and transformation for audio signal processing, powered by PyTorch
Stars: ✭ 1,262 (+473.64%)
Tacotron: Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Stars: ✭ 493 (+124.09%)
Source separation: Deep-learning-based speech source separation using PyTorch
Stars: ✭ 226 (+2.73%)
Aeneas: aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Stars: ✭ 1,942 (+782.73%)
Rnnoise: Recurrent neural network for audio noise reduction
Stars: ✭ 2,266 (+930%)
Audioowl: Fast and simple music and audio analysis using RNN in Python 🕵️♀️ 🥁
Stars: ✭ 151 (-31.36%)
Pytorch Kaldi: pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.
Stars: ✭ 2,097 (+853.18%)
Playx: Search and play any song from the terminal
Stars: ✭ 194 (-11.82%)
Optivideoeditor For Android: Native video editor with video trim, audio, video merge, slow and fast motion, text and image, etc.
Stars: ✭ 209 (-5%)
Waveform analysis: Functions and scripts for analyzing waveforms, primarily audio. This is currently somewhat disorganized and unfinished.
Stars: ✭ 193 (-12.27%)
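Recorder: HTML5 JS audio recording in mp3, wav, ogg, webm, and amr formats. Supports PC and some Android and iOS browsers, as well as Hybrid Apps (Android and iOS app source code provided). WeChat is also supported. Includes an H5 voice call/chat example and DTMF encoding/decoding.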
Stars: ✭ 2,891 (+1214.09%)
Edgedict: Working online speech recognition based on RNN Transducer. (Trained model available in the releases.)
Stars: ✭ 205 (-6.82%)
Symphonia: Pure Rust multimedia format demuxing, tag reading, and audio decoding library
Stars: ✭ 191 (-13.18%)
Char Rnn Chinese: Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch. Based on the code of https://github.com/karpathy/char-rnn. Supports Chinese and other things.
Stars: ✭ 192 (-12.73%)
Discorddj: Discord DJ Bot. Play music in your server. Inspired by PlugDJ
Stars: ✭ 204 (-7.27%)
Rf24audio: Arduino library for streaming data/audio from analog inputs via NRF24L01 modules
Stars: ✭ 190 (-13.64%)
Pyacoustid: Python bindings for Chromaprint acoustic fingerprinting and the Acoustid Web service
Stars: ✭ 214 (-2.73%)
Tts Cube: End-2-end speech synthesis with recurrent neural networks
Stars: ✭ 213 (-3.18%)
Chameleon recsys: Source code of CHAMELEON - A Deep Learning Meta-Architecture for News Recommender Systems
Stars: ✭ 202 (-8.18%)
Libvlc Go: Go bindings for libVLC and high-level media player interface
Stars: ✭ 188 (-14.55%)
Vudio.js: Audio visualization display module
Stars: ✭ 194 (-11.82%)
Shakkala: Deep learning for Arabic text vocalization (automatic diacritization of Arabic texts)
Stars: ✭ 208 (-5.45%)
Wavefile: A Ruby gem for reading and writing sound files in Wave format (*.wav)
Stars: ✭ 193 (-12.27%)
Mimium: mimium (MInimal Musical medIUM), a programming language as an infrastructure for sound and music.
Stars: ✭ 212 (-3.64%)
Doc Han Att: Hierarchical Attention Networks for Chinese Sentiment Classification
Stars: ✭ 206 (-6.36%)
Javascriptmusic: Live coding music and synthesis in JavaScript / AssemblyScript (WebAssembly)
Stars: ✭ 193 (-12.27%)
Youtag: iOS music player app that downloads music from the internet, even YouTube
Stars: ✭ 193 (-12.27%)
Btrack: A Real-Time Beat Tracker
Stars: ✭ 204 (-7.27%)
Csfml: Official binding of SFML for C
Stars: ✭ 211 (-4.09%)
Mwengine: Audio engine and DSP for Android, written in C++, providing low-latency performance in a musical context and supporting both OpenSL and AAudio.
Stars: ✭ 190 (-13.64%)
Timit: The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
Stars: ✭ 202 (-8.18%)
Shikwasa: An audio player born for podcasts
Stars: ✭ 216 (-1.82%)
Iseebetter: iSeeBetter, Spatio-Temporal Video Super Resolution using Recurrent-Generative Back-Projection Networks | Python3 | PyTorch | GANs | CNNs | ResNets | RNNs | Published in Springer Journal of Computational Visual Media, September 2020, Tsinghua University Press
Stars: ✭ 202 (-8.18%)
Depression Detect: Predicting depression from acoustic features of speech using a Convolutional Neural Network.
Stars: ✭ 187 (-15%)
Picard: MusicBrainz Picard audio file tagger
Stars: ✭ 2,605 (+1084.09%)
Otto: Sampler, Sequencer, Multi-engine synth and effects - in a box! [WIP]
Stars: ✭ 2,390 (+986.36%)
Geonkick: A free software percussion synthesizer for GNU/Linux
Stars: ✭ 187 (-15%)
Supysonic: Supysonic is a Python implementation of the Subsonic server API.
Stars: ✭ 187 (-15%)
Esp8266sam: Speech synthesis for ESP8266 using S.A.M. port
Stars: ✭ 199 (-9.55%)
Stylenet: A cute multi-layer LSTM that can perform like a human 🎶
Stars: ✭ 187 (-15%)
Vq Vae Speech: PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Stars: ✭ 187 (-15%)
Openl3: OpenL3, open-source deep audio and image embeddings
Stars: ✭ 200 (-9.09%)
Jinabox.js: A lightweight, customizable omnibox in JavaScript, for use with a Jina backend.
Stars: ✭ 186 (-15.45%)