All Projects → primaryobjects → Voice Gender

primaryobjects / Voice Gender

Gender recognition by voice and speech analysis

Programming Languages

r
7636 projects

Projects that are alternatives of or similar to Voice Gender

Speech Emotion Analyzer
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+155.24%)
Mutual labels:  data-science, speech, voice
Ml Auto Baseball Pitching Overlay
⚾🤖⚾ Automatic baseball pitching overlay in realtime
Stars: ✭ 200 (-19.35%)
Mutual labels:  artificial-intelligence, ai, data-science
Hyperparameter hunter
Easy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries
Stars: ✭ 648 (+161.29%)
Mutual labels:  artificial-intelligence, ai, data-science
Spacy
💫 Industrial-strength Natural Language Processing (NLP) in Python
Stars: ✭ 21,978 (+8762.1%)
Mutual labels:  artificial-intelligence, ai, data-science
Modelchimp
Experiment tracking for machine and deep learning projects
Stars: ✭ 121 (-51.21%)
Mutual labels:  artificial-intelligence, ai, data-science
Caer
High-performance Vision library in Python. Scale your research, not boilerplate.
Stars: ✭ 452 (+82.26%)
Mutual labels:  artificial-intelligence, ai, data-science
Machine Learning Open Source
Monthly Series - Machine Learning Top 10 Open Source Projects
Stars: ✭ 943 (+280.24%)
Mutual labels:  artificial-intelligence, ai, data-science
Polyaxon
Machine Learning Platform for Kubernetes (MLOps tools for experimentation and automation)
Stars: ✭ 2,966 (+1095.97%)
Mutual labels:  artificial-intelligence, ai, data-science
Blurr
Data transformations for the ML era
Stars: ✭ 96 (-61.29%)
Mutual labels:  artificial-intelligence, ai, data-science
Ml
A high-level machine learning and deep learning library for the PHP language.
Stars: ✭ 1,270 (+412.1%)
Mutual labels:  artificial-intelligence, ai, data-science
Free Ai Resources
🚀 FREE AI Resources - 🎓 Courses, 👷 Jobs, 📝 Blogs, 🔬 AI Research, and many more - for everyone!
Stars: ✭ 192 (-22.58%)
Mutual labels:  artificial-intelligence, ai, data-science
Fixy
Amacımız Türkçe NLP literatüründeki birçok farklı sorunu bir arada çözebilen, eşsiz yaklaşımlar öne süren ve literatürdeki çalışmaların eksiklerini gideren open source bir yazım destekleyicisi/denetleyicisi oluşturmak. Kullanıcıların yazdıkları metinlerdeki yazım yanlışlarını derin öğrenme yaklaşımıyla çözüp aynı zamanda metinlerde anlamsal analizi de gerçekleştirerek bu bağlamda ortaya çıkan yanlışları da fark edip düzeltebilmek.
Stars: ✭ 165 (-33.47%)
Mutual labels:  artificial-intelligence, ai, data-science
Csinva.github.io
Slides, paper notes, class notes, blog posts, and research on ML 📉, statistics 📊, and AI 🤖.
Stars: ✭ 342 (+37.9%)
Mutual labels:  artificial-intelligence, ai, data-science
Imodels
Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).
Stars: ✭ 194 (-21.77%)
Mutual labels:  artificial-intelligence, ai, data-science
Artificio
Deep Learning Computer Vision Algorithms for Real-World Use
Stars: ✭ 326 (+31.45%)
Mutual labels:  artificial-intelligence, ai, data-science
Autodl
Automated Deep Learning without ANY human intervention. 1'st Solution for AutoDL [email protected]
Stars: ✭ 854 (+244.35%)
Mutual labels:  artificial-intelligence, ai, data-science
Atlas
An Open Source, Self-Hosted Platform For Applied Deep Learning Development
Stars: ✭ 259 (+4.44%)
Mutual labels:  artificial-intelligence, ai, data-science
Pycm
Multi-class confusion matrix library in Python
Stars: ✭ 1,076 (+333.87%)
Mutual labels:  artificial-intelligence, ai, data-science
Nlpaug
Data augmentation for NLP
Stars: ✭ 2,761 (+1013.31%)
Mutual labels:  artificial-intelligence, ai, data-science
Pytorch Lightning
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
Stars: ✭ 16,641 (+6610.08%)
Mutual labels:  artificial-intelligence, ai, data-science

Voice Gender

Gender Recognition by Voice and Speech Analysis

Read the full article.

This project trains a computer program to identify a voice as male or female, based upon acoustic properties of the voice and speech. The model is trained on a dataset consisting of 3,168 recorded voice samples, collected from male and female speakers. The voice samples are pre-processed by acoustic analysis in R and then processed with artificial intelligence/machine learning algorithms to learn gender-specific traits for classifying the voice as male or female.

The best model achieves an accuracy of 100% on the training set and 89% on the test set.

Update: By narrowing the frequency range analyzed to 0hz-280hz (human vocal range), the best accuracy is boosted to 100%/99%.

The Dataset

Download the pre-processed dataset as a CSV file.

The CSV file contains the following fields:

"meanfreq","sd","median","Q25","Q75","IQR","skew","kurt","sp.ent","sfm","mode","centroid","meanfun","minfun","maxfun","meandom","mindom","maxdom","dfrange","modindx","label"

"label" corresponds to the gender classification of the sample. The remaining fields are acoustic properties, detailed below.

In R, you can load the dataset file data.bin directly as a data.frame with the command load('data.bin').

In addition to the pre-processed dataset, the raw voice samples used for training are included as .WAV files in a separate repository. The .WAV files are pre-processed in R to produce the above dataset.

Accuracy

The trained models have achieved the following accuracies (train/test):

Baseline Algorithm (always male)

50%/50%

Baseline Algorithm (simple frequency threshold)

61%/59%

Logistic Regression

72%/71%

Classification and Regression Tree (CART)

81%/78%

Random Forest

100%/87%

Generalized Boosted Tree Regression

91%/84%

XGBoost

100%/87%

XGBoost (Updated with frequency range 0hz-280hz)

100%/99%

Acoustic Properties Measured

The following acoustic properties of each voice are measured:

  • duration: length of signal
  • meanfreq: mean frequency (in kHz)
  • sd: standard deviation of frequency
  • median: median frequency (in kHz)
  • Q25: first quantile (in kHz)
  • Q75: third quantile (in kHz)
  • IQR: interquantile range (in kHz)
  • skew: skewness (see note in specprop description)
  • kurt: kurtosis (see note in specprop description)
  • sp.ent: spectral entropy
  • sfm: spectral flatness
  • mode: mode frequency
  • centroid: frequency centroid (see specprop)
  • peakf: peak frequency (frequency with highest energy)
  • meanfun: average of fundamental frequency measured across acoustic signal
  • minfun: minimum fundamental frequency measured across acoustic signal
  • maxfun: maximum fundamental frequency measured across acoustic signal
  • meandom: average of dominant frequency measured across acoustic signal
  • mindom: minimum of dominant frequency measured across acoustic signal
  • maxdom: maximum of dominant frequency measured across acoustic signal
  • dfrange: range of dominant frequency measured across acoustic signal
  • modindx: modulation index. Calculated as the accumulated absolute difference between adjacent measurements of fundamental frequencies divided by the frequency range

Classification and Regression Decision Tree

The following decision tree, produced by the CART model, provides a high-level overview of important properties of the voice samples that may determine a specific gender classification of male versus female.

Screenshot 1

After narrowing the frequency range to 0hz-280hz with a sound threshold of 15%, the accuracy is boosted to near perfect, and the following CART model is described. Mean fundamental frequency serves as a powerful indicator of voice gender, with a threshold of 140hz separating male from female classifications.

Screenshot 2

References

The Harvard-Haskins Database of Regularly-Timed Speech

Telecommunications & Signal Processing Laboratory (TSP) Speech Database at McGill University, Home

VoxForge Speech Corpus, Home

Festvox CMU_ARCTIC Speech Database at Carnegie Mellon University

Copyright

Copyright (c) 2017 Kory Becker http://primaryobjects.com/kory-becker

Author

Kory Becker http://www.primaryobjects.com

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].