balavenkatesh3322 / Audio Pretrained Model
Licence: mit
A collection of Audio and Speech pre-trained models.
Stars: ✭ 61
Programming Languages
python3
1442 projects
Labels
Projects that are alternatives of or similar to Audio Pretrained Model
web-voice-processor
A library for real-time voice processing in web browsers
Stars: ✭ 69 (+13.11%)
Mutual labels: speech-recognition, speech-to-text, audio-processing
Automatic Speech Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
Stars: ✭ 192 (+214.75%)
Mutual labels: speech-recognition, speech-to-text, tensorflow-models
Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+1152.46%)
Mutual labels: speech-recognition, audio, audio-processing
Pytorch2keras
PyTorch to Keras model convertor
Stars: ✭ 676 (+1008.2%)
Mutual labels: keras-tensorflow, tensorflow-models, keras-models
pytorch2keras
PyTorch to Keras model convertor
Stars: ✭ 788 (+1191.8%)
Mutual labels: keras-models, tensorflow-models, keras-tensorflow
deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+34.43%)
Mutual labels: mxnet, speech-recognition, speech-to-text
Dkeras
Distributed Keras Engine, Make Keras faster with only one line of code.
Stars: ✭ 181 (+196.72%)
Mutual labels: keras-tensorflow, tensorflow-models, keras-models
TF-Model-Deploy-Tutorial
A tutorial exploring multiple approaches to deploy a trained TensorFlow (or Keras) model or multiple models for prediction.
Stars: ✭ 51 (-16.39%)
Mutual labels: keras-models, tensorflow-models, keras-tensorflow
Speech recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+9734.43%)
Mutual labels: speech-recognition, speech-to-text, audio
Keras Sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-22.95%)
Mutual labels: speech-recognition, audio, audio-processing
Face Mask Detection
Face Mask Detection system based on computer vision and deep learning using OpenCV and Tensorflow/Keras
Stars: ✭ 774 (+1168.85%)
Mutual labels: caffe, keras-tensorflow
Stephanie Va
Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Stars: ✭ 772 (+1165.57%)
Mutual labels: speech-recognition, speech-to-text
Kur
Descriptive Deep Learning
Stars: ✭ 811 (+1229.51%)
Mutual labels: speech-recognition, speech-to-text
Machine Learning Curriculum
💻 Make machines learn so that you don't have to struggle to program them; The ultimate list
Stars: ✭ 761 (+1147.54%)
Mutual labels: caffe, mxnet
Steppy Toolkit
Curated set of transformers that make your work with steppy faster and more effective 🔭
Stars: ✭ 21 (-65.57%)
Mutual labels: tensorflow-models, keras-models
Guitard
Node based multi effects audio processor
Stars: ✭ 31 (-49.18%)
Mutual labels: audio, audio-processing
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (+1109.84%)
Mutual labels: speech-recognition, speech-to-text
Mxnet2caffe
convert model from mxnet to caffe without lossing precision
Stars: ✭ 20 (-67.21%)
Mutual labels: caffe, mxnet
Audio and Speech Pre-trained Models
What is pre-trained Model?
A pre-trained model is a model created by some one else to solve a similar problem. Instead of building a model from scratch to solve a similar problem, we can use the model trained on other problem as a starting point. A pre-trained model may not be 100% accurate in your application.
Other Pre-trained Models
Framework
Model visualization
You can see visualizations of each model's network architecture by using Netron.
Tensorflow
Model Name | Description | Framework |
---|---|---|
Wavenet | This is a TensorFlow implementation of the WaveNet generative neural network architecture for audio generation. | Tensorflow |
Lip Reading | Cross Audio-Visual Recognition using 3D Architectures in TensorFlow | Tensorflow |
MusicGenreClassification | Academic research in the field of Deep Learning (Deep Neural Networks) and Sound Processing, Tel Aviv University. | Tensorflow |
Audioset | Models and supporting code for use with AudioSet. | Tensorflow |
DeepSpeech | Automatic speech recognition. | Tensorflow |
Keras
Model Name | Description | Framework |
---|---|---|
Ultrasound nerve segmentation | This tutorial shows how to use Keras library to build deep neural network for ultrasound image nerve segmentation. | Keras |
PyTorch
Model Name | Description | Framework |
---|---|---|
espnet | End-to-End Speech Processing Toolkit espnet.github.io/espnet | PyTorch |
TTS | Deep learning for Text2Speech | PyTorch |
Neural Sequence labeling model | Sequence labeling models are quite popular in many NLP tasks, such as Named Entity Recognition (NER), part-of-speech (POS) tagging and word segmentation. | PyTorch |
waveglow | A Flow-based Generative Network for Speech Synthesis. | PyTorch |
deepvoice3_pytorch | PyTorch implementation of convolutional networks-based text-to-speech synthesis models. | PyTorch |
deepspeech2 | Implementation of DeepSpeech2 using Baidu Warp-CTC. Creates a network based on the DeepSpeech2 architecture, trained with the CTC activation function. | PyTorch |
loop | A method to generate speech across multiple speakers. | PyTorch |
audio | Simple audio I/O for pytorch. | PyTorch |
speech | PyTorch ASR Implementation. | PyTorch |
samplernn-pytorch | PyTorch implementation of SampleRNN: An Unconditional End-to-End Neural Audio Generation Model. | PyTorch |
torch_waveglow | A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis. | PyTorch |
MXNet
Model Name | Description | Framework |
---|---|---|
deepspeech | This example based on DeepSpeech2 of Baidu helps you to build Speech-To-Text (STT) models at scale using | MXNet |
mxnet-audio | Implementation of music genre classification, audio-to-vec, song recommender, and music search in mxnet. | MXNet |
Caffe
Model Name | Description | Framework |
---|---|---|
Speech Recognition | Speech Recognition with the caffe deep learning framework. | Caffe |
Contributions
Your contributions are always welcome!! Please have a look at contributing.md
License
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].