Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.

Stars: ✭ 772 (+1165.57%)

Mutual labels: speech-recognition, speech-to-text

Kur

Descriptive Deep Learning

Stars: ✭ 811 (+1229.51%)

Mutual labels: speech-recognition, speech-to-text

Giada

Your Hardcore Loop Machine.

Stars: ✭ 903 (+1380.33%)

Mutual labels: audio, audio-processing

Mlt

MLT Multimedia Framework

Stars: ✭ 836 (+1270.49%)

Mutual labels: audio, audio-processing

Machine Learning Curriculum

💻 Make machines learn so that you don't have to struggle to program them; The ultimate list

Stars: ✭ 761 (+1147.54%)

Mutual labels: caffe, mxnet

Steppy Toolkit

Curated set of transformers that make your work with steppy faster and more effective 🔭

Stars: ✭ 21 (-65.57%)

Mutual labels: tensorflow-models, keras-models

Guitard

Node based multi effects audio processor

Stars: ✭ 31 (-49.18%)

Mutual labels: audio, audio-processing

Eesen

The official repository of the Eesen project

Stars: ✭ 738 (+1109.84%)

Mutual labels: speech-recognition, speech-to-text

Mxnet2caffe

convert model from mxnet to caffe without lossing precision

Stars: ✭ 20 (-67.21%)

Mutual labels: caffe, mxnet

View All Similar Projects ➔

Audio and Speech Pre-trained Models

What is pre-trained Model?

A pre-trained model is a model created by some one else to solve a similar problem. Instead of building a model from scratch to solve a similar problem, we can use the model trained on other problem as a starting point. A pre-trained model may not be 100% accurate in your application.

Other Pre-trained Models

Model visualization

You can see visualizations of each model's network architecture by using Netron.

Tensorflow

Model Name	Description	Framework
Wavenet	This is a TensorFlow implementation of the WaveNet generative neural network architecture for audio generation.	`Tensorflow`
Lip Reading	Cross Audio-Visual Recognition using 3D Architectures in TensorFlow	`Tensorflow`
MusicGenreClassification	Academic research in the field of Deep Learning (Deep Neural Networks) and Sound Processing, Tel Aviv University.	`Tensorflow`
Audioset	Models and supporting code for use with AudioSet.	`Tensorflow`
DeepSpeech	Automatic speech recognition.	`Tensorflow`

↥ Back To Top

Keras

Model Name	Description	Framework
Ultrasound nerve segmentation	This tutorial shows how to use Keras library to build deep neural network for ultrasound image nerve segmentation.	`Keras`

↥ Back To Top

PyTorch

Model Name	Description	Framework
espnet	End-to-End Speech Processing Toolkit espnet.github.io/espnet	`PyTorch`
TTS	Deep learning for Text2Speech	`PyTorch`
Neural Sequence labeling model	Sequence labeling models are quite popular in many NLP tasks, such as Named Entity Recognition (NER), part-of-speech (POS) tagging and word segmentation.	`PyTorch`
waveglow	A Flow-based Generative Network for Speech Synthesis.	`PyTorch`
deepvoice3_pytorch	PyTorch implementation of convolutional networks-based text-to-speech synthesis models.	`PyTorch`
deepspeech2	Implementation of DeepSpeech2 using Baidu Warp-CTC. Creates a network based on the DeepSpeech2 architecture, trained with the CTC activation function.	`PyTorch`
loop	A method to generate speech across multiple speakers.	`PyTorch`
audio	Simple audio I/O for pytorch.	`PyTorch`
speech	PyTorch ASR Implementation.	`PyTorch`
samplernn-pytorch	PyTorch implementation of SampleRNN: An Unconditional End-to-End Neural Audio Generation Model.	`PyTorch`
torch_waveglow	A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis.	`PyTorch`

↥ Back To Top

MXNet

Model Name	Description	Framework
deepspeech	This example based on DeepSpeech2 of Baidu helps you to build Speech-To-Text (STT) models at scale using	`MXNet`
mxnet-audio	Implementation of music genre classification, audio-to-vec, song recommender, and music search in mxnet.	`MXNet`

↥ Back To Top

Caffe

Model Name	Description	Framework
Speech Recognition	Speech Recognition with the caffe deep learning framework.	`Caffe`

↥ Back To Top

Contributions

Your contributions are always welcome!! Please have a look at contributing.md

License

MIT License

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 61

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗