a2tm7a / Papers

A list of papers, books, and sites on machine learning and deep learning topics, along with the fields in which they are applied.

Basics of machine learning and deep learning and their applications in various fields.

  1. True understanding is making all parts of a concept so clear and vivid as if they are objects in front of you that you can manipulate as you wish.
  2. This repository contains a list of various tasks that can be achieved with Machine Learning/Deep Learning.
  3. It doesn't contain any benchmark papers because I love doing new things rather than improving on previous research.

Pull requests that add new blog, paper, or site links are welcome.

Theory:

Scikit Learn is a wonderful library for implementing basic machine learning tasks.

Naive Bayes:
  • Basics of naive Bayes with term frequency and inverse document frequency intuition [site]
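
For intuition, here is a minimal sketch of naive Bayes over TF-IDF weights in plain Python. It is not taken from the linked site. The names `tf_idf` and `NaiveBayes`, the smoothed IDF variant, the Laplace smoothing constant `alpha`, and the toy spam/ham documents are all assumptions chosen for illustration.

```python
import math
from collections import Counter, defaultdict


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            # Term frequency times a smoothed inverse document frequency
            # (one common variant, assumed here).
            term: (count / len(doc))
            * (math.log((1 + n_docs) / (1 + df[term])) + 1)
            for term, count in counts.items()
        })
    return weights


class NaiveBayes:
    """Multinomial naive Bayes trained on TF-IDF weights (illustrative)."""

    def fit(self, docs, labels, alpha=1.0):
        self.vocab = {term for doc in docs for term in doc}
        # Sum the TF-IDF weight of every term within each class.
        totals = defaultdict(lambda: defaultdict(float))
        for weights, label in zip(tf_idf(docs), labels):
            for term, value in weights.items():
                totals[label][term] += value
        self.log_prior = {}
        self.log_likelihood = {}
        for label, count in Counter(labels).items():
            self.log_prior[label] = math.log(count / len(labels))
            # Laplace smoothing keeps unseen terms from zeroing a class.
            denom = sum(totals[label].values()) + alpha * len(self.vocab)
            self.log_likelihood[label] = {
                term: math.log((totals[label][term] + alpha) / denom)
                for term in self.vocab
            }
        return self

    def predict(self, doc):
        # Terms never seen during training are ignored.
        counts = Counter(term for term in doc if term in self.vocab)
        scores = {
            label: prior + sum(
                n * self.log_likelihood[label][term]
                for term, n in counts.items()
            )
            for label, prior in self.log_prior.items()
        }
        return max(scores, key=scores.get)


# Toy data, invented for this example.
train_docs = [
    ["cheap", "pills", "buy", "now"],
    ["buy", "cheap", "watches"],
    ["meeting", "agenda", "tomorrow"],
    ["project", "meeting", "notes"],
]
train_labels = ["spam", "spam", "ham", "ham"]

model = NaiveBayes().fit(train_docs, train_labels)
print(model.predict(["cheap", "pills"]))
print(model.predict(["meeting", "notes"]))
```

In practice, Scikit Learn provides tested implementations of both steps, so a sketch like this is mainly useful for seeing how term weights feed into the per-class probabilities.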

Deep Learning:

Text Embedding:

Applications:

Keyword Spotting system / Wake-word detection

  • Small-footprint keyword spotting using deep neural networks [paper]
  • Convolutional Neural Networks for Small-footprint Keyword Spotting [paper]
  • A hidden Markov model based keyword recognition system [paper]

Voice activity detection

  • A simple but effective real-time voice activity detection algorithm [paper]
  • Recurrent neural networks for voice activity detection [paper]

Speech recognition and acoustic modeling

  • Training LVCSR systems on thousands of hours of data [paper]
  • Applying CNN on hybrid NN-HMM model for speech recognition [paper]
  • Convolutional Neural Networks for Speech Recognition [paper]
  • Long Short-Term Memory Recurrent Neural Network Architectures for Large Scale Acoustic Modeling [paper]
  • Understanding how deep belief networks perform acoustic modelling [paper]

Activity Detection

  • Identifying Types of Physical Activity With a Single Accelerometer: Evaluating Laboratory-trained Algorithms in Daily Life [paper]
  • A Study on Human Activity Recognition Using Accelerometer Data from Smartphones [paper]
  • A Practical Approach to Recognizing Physical Activities [paper]

Face-recognition using FPGA

  • A self-configurable systolic architecture for face recognition system based on principal component neural network [paper]

Text Classification

  • Support Vector Machines and Word2vec for Text Classification with Semantic Features [paper]

Time Series Analysis

  • An Introductory Study on Time Series Modeling and Forecasting [paper]

Retinal vessel segmentation

  • Retinal Vessel Segmentation Using Deep Neural Networks [paper]
  • Segmenting Retinal Blood Vessels With Deep Neural Networks [paper] [implementation]
  • Retinal Vessel Segmentation Using the 2-D Gabor Wavelet and Supervised Classification [paper]

Text to Image Generation

  • Text to image Synthesis using GANs [paper] [implementation]
  • StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks [paper] [implementation]
  • StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks [paper] [implementation]
  • AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks [paper]

Object Classification and Recognition


Neural Talk 2


Neural VQA


Genre Classification
