Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → MITESHPUTHRANNEU → Speech Emotion Analyzer

MITESHPUTHRANNEU / Speech Emotion Analyzer

Licence: mit

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

Programming Languages

python3

1442 projects

Labels

jupyter-notebook deep-learning data-science keras neural-network natural-language-processing deep-neural-networks speech-recognition speech voice natural-language-understanding emotion

Projects that are alternatives of or similar to Speech Emotion Analyzer

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-91.79%)

Mutual labels: voice, speech, speech-recognition, natural-language-understanding

anycontrol

Voice control for your websites and applications

Stars: ✭ 53 (-91.63%)

Mutual labels: voice, speech, speech-recognition

Pytorch Geometric Yoochoose

This is a tutorial for PyTorch Geometric on the YooChoose dataset

Stars: ✭ 198 (-68.72%)

Mutual labels: jupyter-notebook, data-science, deep-neural-networks

opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in the voice stack

Stars: ✭ 21 (-96.68%)

Mutual labels: voice, speech, speech-recognition

Web Database Analytics

Web scrapping and related analytics using Python tools

Stars: ✭ 175 (-72.35%)

Mutual labels: jupyter-notebook, data-science, natural-language-processing

Andrew Ng Notes

This is Andrew NG Coursera Handwritten Notes.

Stars: ✭ 180 (-71.56%)

Mutual labels: jupyter-notebook, data-science, deep-neural-networks

Cardio

CardIO is a library for data science research of heart signals

Stars: ✭ 218 (-65.56%)

Mutual labels: jupyter-notebook, data-science, deep-neural-networks

Data Science

Collection of useful data science topics along with code and articles

Stars: ✭ 315 (-50.24%)

Mutual labels: jupyter-notebook, data-science, natural-language-processing

Pocketsphinx Python

Python interface to CMU Sphinxbase and Pocketsphinx libraries

Stars: ✭ 298 (-52.92%)

Mutual labels: speech-recognition, speech, voice

Oie Resources

A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.

Stars: ✭ 283 (-55.29%)

Mutual labels: data-science, natural-language-processing, natural-language-understanding

Deep Math Machine Learning.ai

A blog which talks about machine learning, deep learning algorithms and the Math. and Machine learning algorithms written from scratch.

Stars: ✭ 173 (-72.67%)

Mutual labels: jupyter-notebook, natural-language-processing, deep-neural-networks

Practical Nlp

Official Repository for 'Practical Natural Language Processing' by O'Reilly Media

Stars: ✭ 452 (-28.59%)

Mutual labels: jupyter-notebook, natural-language-processing, natural-language-understanding

Fixy

Amacımız Türkçe NLP literatüründeki birçok farklı sorunu bir arada çözebilen, eşsiz yaklaşımlar öne süren ve literatürdeki çalışmaların eksiklerini gideren open source bir yazım destekleyicisi/denetleyicisi oluşturmak. Kullanıcıların yazdıkları metinlerdeki yazım yanlışlarını derin öğrenme yaklaşımıyla çözüp aynı zamanda metinlerde anlamsal analizi de gerçekleştirerek bu bağlamda ortaya çıkan yanlışları da fark edip düzeltebilmek.

Stars: ✭ 165 (-73.93%)

Mutual labels: jupyter-notebook, data-science, natural-language-processing

Germanwordembeddings

Toolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets

Stars: ✭ 189 (-70.14%)

Mutual labels: jupyter-notebook, natural-language-processing, deep-neural-networks

Hey Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Stars: ✭ 161 (-74.57%)

Mutual labels: jupyter-notebook, deep-neural-networks, speech-recognition

Natural Language Processing Specialization

This repo contains my coursework, assignments, and Slides for Natural Language Processing Specialization by deeplearning.ai on Coursera

Stars: ✭ 151 (-76.15%)

Mutual labels: jupyter-notebook, natural-language-processing, natural-language-understanding

Nlpaug

Data augmentation for NLP

Stars: ✭ 2,761 (+336.18%)

Mutual labels: jupyter-notebook, data-science, natural-language-processing

Multihead Siamese Nets

Implementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task.

Stars: ✭ 144 (-77.25%)

Mutual labels: jupyter-notebook, natural-language-processing, deep-neural-networks

Awesome Distributed Deep Learning

A curated list of awesome Distributed Deep Learning resources.

Stars: ✭ 277 (-56.24%)

Mutual labels: data-science, natural-language-processing, deep-neural-networks

Code search

Code For Medium Article: "How To Create Natural Language Semantic Search for Arbitrary Objects With Deep Learning"

Stars: ✭ 436 (-31.12%)

Mutual labels: jupyter-notebook, data-science, natural-language-processing

View All Similar Projects ➔

Speech Emotion Analyzer

The idea behind creating this project was to build a machine learning model that could detect emotions from the speech we have with each other all the time. Nowadays personalization is something that is needed in all the things we experience everyday.
So why not have a emotion detector that will guage your emotions and in the future recommend you different things based on your mood. This can be used by multiple industries to offer different services like marketing company suggesting you to buy products based on your emotions, automotive industry can detect the persons emotions and adjust the speed of autonomous cars as required to avoid any collisions etc.

Analyzing audio signals

Datasets:

Made use of two different datasets:

RAVDESS. This dataset includes around 1500 audio file input from 24 different actors. 12 male and 12 female where these actors record short audios in 8 different emotions i.e 1 = neutral, 2 = calm, 3 = happy, 4 = sad, 5 = angry, 6 = fearful, 7 = disgust, 8 = surprised.
Each audio file is named in such a way that the 7th character is consistent with the different emotions that they represent.
SAVEE. This dataset contains around 500 audio files recorded by 4 different male actors. The first two characters of the file name correspond to the different emotions that the potray.

Audio files:

Tested out the audio files by plotting out the waveform and a spectrogram to see the sample audio files.
Waveform

Spectrogram

Feature Extraction

The next step involves extracting the features from the audio files which will help our model learn between these audio files. For feature extraction we make use of the LibROSA library in python which is one of the libraries used for audio analysis.

Here there are some things to note. While extracting the features, all the audio files have been timed for 3 seconds to get equal number of features.
The sampling rate of each file is doubled keeping sampling frequency constant to get more features which will help classify the audio file when the size of dataset is small.

The extracted features looks as follows

These are array of values with lables appended to them.

Building Models

Since the project is a classification problem, Convolution Neural Network seems the obivious choice. We also built Multilayer perceptrons and Long Short Term Memory models but they under-performed with very low accuracies which couldn't pass the test while predicting the right emotions.

Building and tuning a model is a very time consuming process. The idea is to always start small without adding too many layers just for the sake of making it complex. After testing out with layers, the model which gave the max validation accuracy against test data was little more than 70%

Predictions

After tuning the model, tested it out by predicting the emotions for the test data. For a model with the given accuracy these are a sample of the actual vs predicted values.

Testing out with live voices.

In order to test out our model on voices that were completely different than what we have in our training and test data, we recorded our own voices with dfferent emotions and predicted the outcomes. You can see the results below: The audio contained a male voice which said "This coffee sucks" in a angry tone.

As you can see that the model has predicted the male voice and emotion very accurately in the image above.

NOTE: If you are using the model directly and want to decode the output ranging from 0 to 9 then the following list will help you.

0 - female_angry
1 - female_calm
2 - female_fearful
3 - female_happy
4 - female_sad
5 - male_angry
6 - male_calm
7 - male_fearful
8 - male_happy
9 - male_sad

Conclusion

Building the model was a challenging task as it involved lot of trail and error methods, tuning etc. The model is very well trained to distinguish between male and female voices and it distinguishes with 100% accuracy. The model was tuned to detect emotions with more than 70% accuracy. Accuracy can be increased by including more audio files for training.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 633

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (12) 🔗