All Projects → pannous → Caffe Speech Recognition

pannous / Caffe Speech Recognition

Speech Recognition with the Caffe deep learning framework, migrating to

Projects that are alternatives of or similar to Caffe Speech Recognition

Standalone Deeplearning
2019 KAIST 딥러닝 홀로서기 세미나용 저장소입니다.
Stars: ✭ 318 (-1.55%)
Mutual labels:  jupyter-notebook
Carnd Traffic Sign Classifier Project
Classify Traffic Signs.
Stars: ✭ 323 (+0%)
Mutual labels:  jupyter-notebook
Machine Learning For Trading
Code for Machine Learning for Algorithmic Trading, 2nd edition.
Stars: ✭ 4,979 (+1441.49%)
Mutual labels:  jupyter-notebook
Sars tutorial
Repository for the tutorial on Sequence-Aware Recommender Systems held at TheWebConf 2019 and ACM RecSys 2018
Stars: ✭ 320 (-0.93%)
Mutual labels:  jupyter-notebook
Ml prep
Machine Learning interview prep guide
Stars: ✭ 298 (-7.74%)
Mutual labels:  jupyter-notebook
Cs329s Ml Deployment Tutorial
Code and files to go along with CS329s machine learning model deployment tutorial.
Stars: ✭ 314 (-2.79%)
Mutual labels:  jupyter-notebook
Bmc
Notes on Scientific Computing for Biomechanics and Motor Control
Stars: ✭ 319 (-1.24%)
Mutual labels:  jupyter-notebook
Textspotter
Stars: ✭ 323 (+0%)
Mutual labels:  jupyter-notebook
Joypy
Joyplots in Python with matplotlib & pandas 📈
Stars: ✭ 322 (-0.31%)
Mutual labels:  jupyter-notebook
Autoeq
Automatic headphone equalization from frequency responses
Stars: ✭ 5,989 (+1754.18%)
Mutual labels:  jupyter-notebook
Bokeh Python Visualization
A Bokeh project developed for learning and teaching Bokeh interactive plotting!
Stars: ✭ 321 (-0.62%)
Mutual labels:  jupyter-notebook
Notebooks Contrib
RAPIDS Community Notebooks
Stars: ✭ 321 (-0.62%)
Mutual labels:  jupyter-notebook
Probability
Probabilistic reasoning and statistical analysis in TensorFlow
Stars: ✭ 3,550 (+999.07%)
Mutual labels:  jupyter-notebook
Recurrent Neural Networks
Learning about and doing projects with recurrent neural networks
Stars: ✭ 320 (-0.93%)
Mutual labels:  jupyter-notebook
Bdci2019 Sentiment Classification
CCF BDCI 2019 互联网新闻情感分析 复赛top1解决方案
Stars: ✭ 317 (-1.86%)
Mutual labels:  jupyter-notebook
Datos Covid19
En formato estándar
Stars: ✭ 316 (-2.17%)
Mutual labels:  jupyter-notebook
Jupyter Themer
Apply custom CSS styling to your jupyter notebooks
Stars: ✭ 322 (-0.31%)
Mutual labels:  jupyter-notebook
Autocrop
😌 Automatically detects and crops faces from batches of pictures.
Stars: ✭ 320 (-0.93%)
Mutual labels:  jupyter-notebook
Your First Machine Learning Project End To End In Python
这是一个完整的,端到端的机器学习项目,非常适合有一定基础后拿来练习,以提高对完整机器学习项目的认识
Stars: ✭ 323 (+0%)
Mutual labels:  jupyter-notebook
Adanet
Fast and flexible AutoML with learning guarantees.
Stars: ✭ 3,340 (+934.06%)
Mutual labels:  jupyter-notebook

Speech Recognition with BVLC caffe

Speech Recognition with the caffe deep learning framework

UPDATE: We are migrating to tensorflow

This project is quite fresh and only the first of three milestones is accomplished: Even now it might be useful if you just want to train a handful of commands/options (1,2,3..yes/no/cancel/...)

  1. training spoken numbers:
  • get spectogram training images from http://pannous.net/spoken_numbers.tar (470 MB)
  • start ./train.sh
  • test with ipython notebook test-speech-recognition.ipynb or caffe test ... or <caffe-root>/python/classify.py
  • 99% accuracy, nice!
  • online recognition and learning with ./recognition-server.py and ./record.py scripts

Sample spectrogram, That's what she said, too laid?

Sample spectrogram, Karen uttering 'zero' with 160 words per minute.

  1. training words:
  • 4GB of training data
  • net topology: work in progress ...
  • todo: use upcoming new caffe LSTM layers etc
  • UPDATE LSTMs get rolling, still not merged
  • UPDATE since the caffe project leaders have a hindering merging policy and this pull request was shifted many times without ever being merged, we are migrating to tensorflow
  • todo: add extra categories for a) silence b) common noises like typing, achoo c) ALL other noises
  1. training speech:

Theoretical background: papers

A. Graves and N. Jaitly. Towards end-to-end speech recognition with recurrent neural networks. In ICML, 2014

O. Vinyals, S. V. Ravuri, and D. Povey. Revisiting recurrent neural networks for robust ASR. In ICASSP, 2012

Andrew Ng et al / Baidu

Hinton et al / Toronto

good old Hinton

Schmidhuber et al using new 'ClockWork-RNNs'

The book: Automatic Speech Recognition: A Deep Learning Approach (Signals and Communication Technology) Hardcover – November 11, 2014 by Dong Yu (Author) and Li Deng (Author)

Related work

Also see the Kaldi project, which seems a bit messy but already uses deep learning with LSTM Another experimental LSTM network, which works out-of-the-box: Currennt

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].