pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+853.18%)

Mutual labels: rnn, speech

Playx

Search and play any song from terminal

Stars: ✭ 194 (-11.82%)

Mutual labels: audio

Optivideoeditor For Android

Native Video editor : Video trim, Audio, Video merge, Slow and fast motion, Text and image, etc...

Stars: ✭ 209 (-5%)

Mutual labels: audio

Waveform analysis

Functions and scripts for analyzing waveforms, primarily audio. This is currently somewhat disorganized and unfinished.

Stars: ✭ 193 (-12.27%)

Mutual labels: audio

Nn compression

Stars: ✭ 193 (-12.27%)

Mutual labels: rnn

Recorder

html5 js 录音 mp3 wav ogg webm amr 格式，支持pc和Android、ios部分浏览器、和Hybrid App（提供Android IOS App源码），微信也是支持的，提供H5版语音通话聊天示例和DTMF编解码

Stars: ✭ 2,891 (+1214.09%)

Mutual labels: audio

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (-6.82%)

Mutual labels: speech

Symphonia

Pure Rust multimedia format demuxing, tag reading, and audio decoding library

Stars: ✭ 191 (-13.18%)

Mutual labels: audio

Char Rnn Chinese

Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch. Based on code of https://github.com/karpathy/char-rnn. Support Chinese and other things.

Stars: ✭ 192 (-12.73%)

Mutual labels: rnn

Discorddj

Discord DJ Bot. Play music in your server. Inspired by PlugDJ

Stars: ✭ 204 (-7.27%)

Mutual labels: audio

Speechtotext Websockets Javascript

SDK & Sample to do speech recognition using websockets in Javascript

Stars: ✭ 191 (-13.18%)

Mutual labels: speech

Rf24audio

Arduino library for streaming data/audio from analog inputs via NRF24L01 modules

Stars: ✭ 190 (-13.64%)

Mutual labels: audio

Pyacoustid

Python bindings for Chromaprint acoustic fingerprinting and the Acoustid Web service

Stars: ✭ 214 (-2.73%)

Mutual labels: audio

Tts Cube

End-2-end speech synthesis with recurrent neural networks

Stars: ✭ 213 (-3.18%)

Mutual labels: speech

Chameleon recsys

Source code of CHAMELEON - A Deep Learning Meta-Architecture for News Recommender Systems

Stars: ✭ 202 (-8.18%)

Mutual labels: rnn

Libvlc Go

Go bindings for libVLC and high-level media player interface

Stars: ✭ 188 (-14.55%)

Mutual labels: audio

Vudio.js

音频可视化展示模块

Stars: ✭ 194 (-11.82%)

Mutual labels: audio

Shakkala

Deep learning for Arabic text Vocalization - التشكيل الالي للنصوص العربية

Stars: ✭ 208 (-5.45%)

Mutual labels: rnn

Wavefile

A Ruby gem for reading and writing sound files in Wave format (*.wav)

Stars: ✭ 193 (-12.27%)

Mutual labels: audio

Mimium

mimium (MInimal Musical medIUM) a programming language as an infrastructure for sound and music.

Stars: ✭ 212 (-3.64%)

Mutual labels: audio

Examples Electron

Examples for Electron applications.

Stars: ✭ 193 (-12.27%)

Mutual labels: audio

Doc Han Att

Hierarchical Attention Networks for Chinese Sentiment Classification

Stars: ✭ 206 (-6.36%)

Mutual labels: rnn

Javascriptmusic

Live coding music and synthesis in Javascript / AssemblyScript (WebAssembly)

Stars: ✭ 193 (-12.27%)

Mutual labels: audio

Sign Language Gesture Recognition

Sign Language Gesture Recognition From Video Sequences Using RNN And CNN

Stars: ✭ 214 (-2.73%)

Mutual labels: rnn

Youtag

iOS music player app that downloads music from the internet, even YouTube

Stars: ✭ 193 (-12.27%)

Mutual labels: audio

Btrack

A Real-Time Beat Tracker

Stars: ✭ 204 (-7.27%)

Mutual labels: audio

Gru4rec tensorflow

TensorFlow implemenation of GRu4Rec model

Stars: ✭ 192 (-12.73%)

Mutual labels: rnn

Csfml

Official binding of SFML for C

Stars: ✭ 211 (-4.09%)

Mutual labels: audio

Mwengine

Audio engine and DSP for Android, written in C++ providing low latency performance in a musical context, supporting both OpenSL and AAudio.

Stars: ✭ 190 (-13.64%)

Mutual labels: audio

Timit

The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.

Stars: ✭ 202 (-8.18%)

Mutual labels: speech

Shikwasa

An audio player born for podcast

Stars: ✭ 216 (-1.82%)

Mutual labels: audio

Multi Room Audio Centralized Audio For Home

🎵 This Github Repository provides details on setting up a centralized audio system for your home using nothing but Raspberry Pi's and Old Speakers.

Stars: ✭ 189 (-14.09%)

Mutual labels: audio

Iseebetter

Stars: ✭ 202 (-8.18%)

Mutual labels: rnn

Devicehive Audio Analysis

Stars: ✭ 189 (-14.09%)

Mutual labels: audio

Depression Detect

Predicting depression from acoustic features of speech using a Convolutional Neural Network.

Stars: ✭ 187 (-15%)

Mutual labels: speech

Picard

MusicBrainz Picard audio file tagger

Stars: ✭ 2,605 (+1084.09%)

Mutual labels: audio

Otto

Sampler, Sequencer, Multi-engine synth and effects - in a box! [WIP]

Stars: ✭ 2,390 (+986.36%)

Mutual labels: audio

Geonkick

A free software percussion synthesizer for GNU/Linux

Stars: ✭ 187 (-15%)

Mutual labels: audio

Supysonic

Supysonic is a Python implementation of the Subsonic server API.

Stars: ✭ 187 (-15%)

Mutual labels: audio

Esp8266sam

Speech synthesis for ESP8266 using S.A.M. port

Stars: ✭ 199 (-9.55%)

Mutual labels: speech

Stylenet

A cute multi-layer LSTM that can perform like a human 🎶

Stars: ✭ 187 (-15%)

Mutual labels: rnn

Vq Vae Speech

PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]

Stars: ✭ 187 (-15%)

Mutual labels: speech

Speech Enhancement

Deep learning for audio denoising

Stars: ✭ 207 (-5.91%)

Mutual labels: speech

Neural Voice Cloning With Few Samples

Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu

Stars: ✭ 211 (-4.09%)

Mutual labels: speech

Openl3

OpenL3: Open-source deep audio and image embeddings

Stars: ✭ 200 (-9.09%)

Mutual labels: audio

Jinabox.js

A lightweight, customizable omnibox in Javascript, for use with a Jina backend.

Stars: ✭ 186 (-15.45%)

Mutual labels: audio

Opentok Ios Sdk Samples

Example applications that use the OpenTok iOS SDK

Stars: ✭ 186 (-15.45%)

Mutual labels: audio

1-60 of 1072 similar projects

›

next*5