Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → pannous → Caffe Speech Recognition

pannous / Caffe Speech Recognition

Speech Recognition with the Caffe deep learning framework, migrating to

Labels

jupyter-notebook

Projects that are alternatives of or similar to Caffe Speech Recognition

Standalone Deeplearning

2019 KAIST 딥러닝 홀로서기 세미나용 저장소입니다.

Stars: ✭ 318 (-1.55%)

Mutual labels: jupyter-notebook

Carnd Traffic Sign Classifier Project

Classify Traffic Signs.

Stars: ✭ 323 (+0%)

Mutual labels: jupyter-notebook

Machine Learning For Trading

Code for Machine Learning for Algorithmic Trading, 2nd edition.

Stars: ✭ 4,979 (+1441.49%)

Mutual labels: jupyter-notebook

Repository for the tutorial on Sequence-Aware Recommender Systems held at TheWebConf 2019 and ACM RecSys 2018

Stars: ✭ 320 (-0.93%)

Mutual labels: jupyter-notebook

Machine Learning interview prep guide

Stars: ✭ 298 (-7.74%)

Mutual labels: jupyter-notebook

Cs329s Ml Deployment Tutorial

Code and files to go along with CS329s machine learning model deployment tutorial.

Stars: ✭ 314 (-2.79%)

Mutual labels: jupyter-notebook

Notes on Scientific Computing for Biomechanics and Motor Control

Stars: ✭ 319 (-1.24%)

Mutual labels: jupyter-notebook

Stars: ✭ 323 (+0%)

Mutual labels: jupyter-notebook

Joyplots in Python with matplotlib & pandas 📈

Stars: ✭ 322 (-0.31%)

Mutual labels: jupyter-notebook

Automatic headphone equalization from frequency responses

Stars: ✭ 5,989 (+1754.18%)

Mutual labels: jupyter-notebook

Bokeh Python Visualization

A Bokeh project developed for learning and teaching Bokeh interactive plotting!

Stars: ✭ 321 (-0.62%)

Mutual labels: jupyter-notebook

Notebooks Contrib

RAPIDS Community Notebooks

Stars: ✭ 321 (-0.62%)

Mutual labels: jupyter-notebook

Probabilistic reasoning and statistical analysis in TensorFlow

Stars: ✭ 3,550 (+999.07%)

Mutual labels: jupyter-notebook

Recurrent Neural Networks

Learning about and doing projects with recurrent neural networks

Stars: ✭ 320 (-0.93%)

Mutual labels: jupyter-notebook

Bdci2019 Sentiment Classification

CCF BDCI 2019 互联网新闻情感分析复赛top1解决方案

Stars: ✭ 317 (-1.86%)

Mutual labels: jupyter-notebook

En formato estándar

Stars: ✭ 316 (-2.17%)

Mutual labels: jupyter-notebook

Apply custom CSS styling to your jupyter notebooks

Stars: ✭ 322 (-0.31%)

Mutual labels: jupyter-notebook

😌 Automatically detects and crops faces from batches of pictures.

Stars: ✭ 320 (-0.93%)

Mutual labels: jupyter-notebook

Your First Machine Learning Project End To End In Python

这是一个完整的，端到端的机器学习项目，非常适合有一定基础后拿来练习，以提高对完整机器学习项目的认识

Stars: ✭ 323 (+0%)

Mutual labels: jupyter-notebook

Fast and flexible AutoML with learning guarantees.

Stars: ✭ 3,340 (+934.06%)

Mutual labels: jupyter-notebook

View All Similar Projects ➔

Speech Recognition with BVLC caffe

Speech Recognition with the caffe deep learning framework

UPDATE: We are migrating to tensorflow

This project is quite fresh and only the first of three milestones is accomplished: Even now it might be useful if you just want to train a handful of commands/options (1,2,3..yes/no/cancel/...)

training spoken numbers:

get spectogram training images from http://pannous.net/spoken_numbers.tar (470 MB)
start ./train.sh
test with ipython notebook test-speech-recognition.ipynb or caffe test ... or <caffe-root>/python/classify.py
99% accuracy, nice!
online recognition and learning with ./recognition-server.py and ./record.py scripts

Sample spectrogram, Karen uttering 'zero' with 160 words per minute.

training words:

4GB of training data
net topology: work in progress ...
todo: use upcoming new caffe LSTM layers etc
UPDATE LSTMs get rolling, still not merged
UPDATE since the caffe project leaders have a hindering merging policy and this pull request was shifted many times without ever being merged, we are migrating to tensorflow
todo: add extra categories for a) silence b) common noises like typing, achoo c) ALL other noises

training speech:

todo!
100GB of training data here: http://www.openslr.org/12/
TIMIT dataset $27,000.00 membership fee or $250 for non-members+$2400 under research-only license?
combine with google n-grams

Theoretical background: papers

A. Graves and N. Jaitly. Towards end-to-end speech recognition with recurrent neural networks. In ICML, 2014

O. Vinyals, S. V. Ravuri, and D. Povey. Revisiting recurrent neural networks for robust ASR. In ICASSP, 2012

Andrew Ng et al / Baidu

Hinton et al / Toronto

good old Hinton

Schmidhuber et al using new 'ClockWork-RNNs'

The book: Automatic Speech Recognition: A Deep Learning Approach (Signals and Communication Technology) Hardcover – November 11, 2014 by Dong Yu (Author) and Li Deng (Author)

Related work

Also see the Kaldi project, which seems a bit messy but already uses deep learning with LSTM Another experimental LSTM network, which works out-of-the-box: Currennt

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 323

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (5) 🔗