Hey JetsonDeep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Stars: ✭ 161 (+62.63%)
LearnopencvLearn OpenCV : C++ and Python Examples
Stars: ✭ 15,385 (+15440.4%)
Halite IiSeason 2 of @twosigma's artificial intelligence programming challenge
Stars: ✭ 201 (+103.03%)
Vue Howler[UNMAINTAINED] A Howler.js mixin for Vue 2 that makes it easy to create custom audio player components
Stars: ✭ 103 (+4.04%)
Best ai paper 2020A curated list of the latest breakthroughs in AI by release date with a clear video explanation, link to a more in-depth article, and code
Stars: ✭ 2,140 (+2061.62%)
WavBattle tested Wav decoder/encoder
Stars: ✭ 139 (+40.4%)
MediafileA unified reader of metadata from audio & video files.
Stars: ✭ 138 (+39.39%)
Har Keras CnnHuman Activity Recognition (HAR) with 1D Convolutional Neural Network in Python and Keras
Stars: ✭ 97 (-2.02%)
SymphoniaPure Rust multimedia format demuxing, tag reading, and audio decoding library
Stars: ✭ 191 (+92.93%)
AtldotnetFully managed, portable and easy-to-use C# library to read and edit audio data and metadata (tags) from various audio formats, playlists and CUE sheets
Stars: ✭ 180 (+81.82%)
FlaconAudio File Encoder. Extracts audio tracks from an audio CD image to separate tracks.
Stars: ✭ 252 (+154.55%)
Zi2ziLearning Chinese Character style with conditional GAN
Stars: ✭ 1,988 (+1908.08%)
Music-Style-TransferSource code for "Transferring the Style of Homophonic Music Using Recurrent Neural Networks and Autoregressive Model"
Stars: ✭ 16 (-83.84%)
audio degraderAudio degradation toolbox in python, with a command-line tool. It is useful to apply controlled degradations to audio: e.g. data augmentation, evaluation in noisy conditions, etc.
Stars: ✭ 40 (-59.6%)
voxpopuliPython wrapper for Espeak and Mbrola, for simple local TTS
Stars: ✭ 21 (-78.79%)
recurrent-defocus-deblurring-synth-dual-pixelReference github repository for the paper "Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data". We propose a procedure to generate realistic DP data synthetically. Our synthesis approach mimics the optical image formation found on DP sensors and can be applied to virtual scenes rendered with standard computer software. Lev…
Stars: ✭ 30 (-69.7%)
DTMF-DecoderA Java program to implement a DMTF Decoder.
Stars: ✭ 28 (-71.72%)
AurioAudio Fingerprinting & Retrieval for .NET
Stars: ✭ 84 (-15.15%)
spafe🔉 spafe: Simplified Python Audio Features Extraction
Stars: ✭ 310 (+213.13%)
CurlCURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
Stars: ✭ 346 (+249.49%)
RmdlRMDL: Random Multimodel Deep Learning for Classification
Stars: ✭ 375 (+278.79%)
SurfboardNovoic's audio feature extraction library
Stars: ✭ 318 (+221.21%)
Auto EditorAuto-Editor: Effort free video editing!
Stars: ✭ 382 (+285.86%)
MusigA shazam like tool to store songs fingerprints and retrieve them
Stars: ✭ 388 (+291.92%)
Recorderhtml5 js 浏览器 web端录音
Stars: ✭ 429 (+333.33%)
MarianaThe Cutest Deep Learning Framework which is also a wonderful Declarative Language
Stars: ✭ 151 (+52.53%)
ChromaprintC library for generating audio fingerprints used by AcoustID
Stars: ✭ 553 (+458.59%)
Trending Deep LearningTop 100 trending deep learning repositories sorted by the number of stars gained on a specific day.
Stars: ✭ 543 (+448.48%)
Ner LstmNamed Entity Recognition using multilayered bidirectional LSTM
Stars: ✭ 532 (+437.37%)
FfdlFabric for Deep Learning (FfDL, pronounced fiddle) is a Deep Learning Platform offering TensorFlow, Caffe, PyTorch etc. as a Service on Kubernetes
Stars: ✭ 640 (+546.46%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+539.39%)
Keras AttentionVisualizing RNNs using the attention mechanism
Stars: ✭ 697 (+604.04%)
QC++ Library for Audio Digital Signal Processing
Stars: ✭ 481 (+385.86%)
Quickdraw Implementation of Quickdraw - an online game developed by Google
Stars: ✭ 805 (+713.13%)
Deep Learning Time SeriesList of papers, code and experiments using deep learning for time series forecasting
Stars: ✭ 796 (+704.04%)
Concise Ipython Notebooks For Deep LearningIpython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.
Stars: ✭ 23 (-76.77%)
SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+671.72%)
GuitardNode based multi effects audio processor
Stars: ✭ 31 (-68.69%)
Theano Kaldi RnnTHEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.
Stars: ✭ 31 (-68.69%)
Flops Counter.pytorchFlops counter for convolutional networks in pytorch framework
Stars: ✭ 1,223 (+1135.35%)
Music MetadataStream and file based music metadata parser for node. Supporting a wide range of audio and tag formats.
Stars: ✭ 455 (+359.6%)
Chime🎵 Python sound notifications made easy
Stars: ✭ 56 (-43.43%)
DeepseqslamThe Official Deep Learning Framework for Route-based Place Recognition
Stars: ✭ 49 (-50.51%)
Keras SincnetKeras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-52.53%)
FigaroReal-time voice-changer for voice-chat, etc. Will support many different voice-filters and features in the future. 🎵
Stars: ✭ 80 (-19.19%)
AudioswitchAn Android audio management library for real-time communication apps.
Stars: ✭ 69 (-30.3%)
SangitaA Natural Language Toolkit for Indian Languages
Stars: ✭ 43 (-56.57%)
Arc PytorchThe first public PyTorch implementation of Attentive Recurrent Comparators
Stars: ✭ 147 (+48.48%)
AndroidtensorflowmnistexampleAndroid TensorFlow MachineLearning MNIST Example (Building Model with TensorFlow for Android)
Stars: ✭ 449 (+353.54%)
LudwigData-centric declarative deep learning framework
Stars: ✭ 8,018 (+7998.99%)
BeepA little package that brings sound to any Go application. Suitable for playback and audio-processing.
Stars: ✭ 1,168 (+1079.8%)
Xamarin PluginsCross-platform Plugins for Xamarin, Xamarin.Forms and Windows
Stars: ✭ 97 (-2.02%)
360sd NetPytorch implementation of ICRA 2020 paper "360° Stereo Depth Estimation with Learnable Cost Volume"
Stars: ✭ 94 (-5.05%)