All Projects → balavenkatesh3322 → Audio Pretrained Model

balavenkatesh3322 / Audio Pretrained Model

Licence: mit
A collection of Audio and Speech pre-trained models.

Programming Languages

python3
1442 projects

Projects that are alternatives of or similar to Audio Pretrained Model

web-voice-processor
A library for real-time voice processing in web browsers
Stars: ✭ 69 (+13.11%)
Mutual labels:  speech-recognition, speech-to-text, audio-processing
Automatic Speech Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
Stars: ✭ 192 (+214.75%)
Mutual labels:  speech-recognition, speech-to-text, tensorflow-models
Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+1152.46%)
Mutual labels:  speech-recognition, audio, audio-processing
Pytorch2keras
PyTorch to Keras model convertor
Stars: ✭ 676 (+1008.2%)
Mutual labels:  keras-tensorflow, tensorflow-models, keras-models
pytorch2keras
PyTorch to Keras model convertor
Stars: ✭ 788 (+1191.8%)
Mutual labels:  keras-models, tensorflow-models, keras-tensorflow
deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+34.43%)
Mutual labels:  mxnet, speech-recognition, speech-to-text
Dkeras
Distributed Keras Engine, Make Keras faster with only one line of code.
Stars: ✭ 181 (+196.72%)
Mutual labels:  keras-tensorflow, tensorflow-models, keras-models
TF-Model-Deploy-Tutorial
A tutorial exploring multiple approaches to deploy a trained TensorFlow (or Keras) model or multiple models for prediction.
Stars: ✭ 51 (-16.39%)
Mutual labels:  keras-models, tensorflow-models, keras-tensorflow
Speech recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+9734.43%)
Mutual labels:  speech-recognition, speech-to-text, audio
Keras Sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-22.95%)
Mutual labels:  speech-recognition, audio, audio-processing
Face Mask Detection
Face Mask Detection system based on computer vision and deep learning using OpenCV and Tensorflow/Keras
Stars: ✭ 774 (+1168.85%)
Mutual labels:  caffe, keras-tensorflow
Stephanie Va
Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Stars: ✭ 772 (+1165.57%)
Mutual labels:  speech-recognition, speech-to-text
Kur
Descriptive Deep Learning
Stars: ✭ 811 (+1229.51%)
Mutual labels:  speech-recognition, speech-to-text
Giada
Your Hardcore Loop Machine.
Stars: ✭ 903 (+1380.33%)
Mutual labels:  audio, audio-processing
Mlt
MLT Multimedia Framework
Stars: ✭ 836 (+1270.49%)
Mutual labels:  audio, audio-processing
Machine Learning Curriculum
💻 Make machines learn so that you don't have to struggle to program them; The ultimate list
Stars: ✭ 761 (+1147.54%)
Mutual labels:  caffe, mxnet
Steppy Toolkit
Curated set of transformers that make your work with steppy faster and more effective 🔭
Stars: ✭ 21 (-65.57%)
Mutual labels:  tensorflow-models, keras-models
Guitard
Node based multi effects audio processor
Stars: ✭ 31 (-49.18%)
Mutual labels:  audio, audio-processing
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (+1109.84%)
Mutual labels:  speech-recognition, speech-to-text
Mxnet2caffe
convert model from mxnet to caffe without lossing precision
Stars: ✭ 20 (-67.21%)
Mutual labels:  caffe, mxnet

Maintenance GitHub GitHub GitHub

Audio and Speech Pre-trained Models

NLP logo

What is pre-trained Model?

A pre-trained model is a model created by some one else to solve a similar problem. Instead of building a model from scratch to solve a similar problem, we can use the model trained on other problem as a starting point. A pre-trained model may not be 100% accurate in your application.

Other Pre-trained Models

Framework

Model visualization

You can see visualizations of each model's network architecture by using Netron.

NLP logo

Tensorflow

Model Name Description Framework
Wavenet This is a TensorFlow implementation of the WaveNet generative neural network architecture for audio generation. Tensorflow
Lip Reading Cross Audio-Visual Recognition using 3D Architectures in TensorFlow Tensorflow
MusicGenreClassification Academic research in the field of Deep Learning (Deep Neural Networks) and Sound Processing, Tel Aviv University. Tensorflow
Audioset Models and supporting code for use with AudioSet. Tensorflow
DeepSpeech Automatic speech recognition. Tensorflow

Keras

Model Name Description Framework
Ultrasound nerve segmentation This tutorial shows how to use Keras library to build deep neural network for ultrasound image nerve segmentation. Keras

PyTorch

Model Name Description Framework
espnet End-to-End Speech Processing Toolkit espnet.github.io/espnet PyTorch
TTS Deep learning for Text2Speech PyTorch
Neural Sequence labeling model Sequence labeling models are quite popular in many NLP tasks, such as Named Entity Recognition (NER), part-of-speech (POS) tagging and word segmentation. PyTorch
waveglow A Flow-based Generative Network for Speech Synthesis. PyTorch
deepvoice3_pytorch PyTorch implementation of convolutional networks-based text-to-speech synthesis models. PyTorch
deepspeech2 Implementation of DeepSpeech2 using Baidu Warp-CTC. Creates a network based on the DeepSpeech2 architecture, trained with the CTC activation function. PyTorch
loop A method to generate speech across multiple speakers. PyTorch
audio Simple audio I/O for pytorch. PyTorch
speech PyTorch ASR Implementation. PyTorch
samplernn-pytorch PyTorch implementation of SampleRNN: An Unconditional End-to-End Neural Audio Generation Model. PyTorch
torch_waveglow A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis. PyTorch

MXNet

Model Name Description Framework
deepspeech This example based on DeepSpeech2 of Baidu helps you to build Speech-To-Text (STT) models at scale using MXNet
mxnet-audio Implementation of music genre classification, audio-to-vec, song recommender, and music search in mxnet. MXNet

Caffe

Model Name Description Framework
Speech Recognition Speech Recognition with the caffe deep learning framework. Caffe

Contributions

Your contributions are always welcome!! Please have a look at contributing.md

License

MIT License

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].