anujdutt9 / Audio-Scene-Classification

Licence: other
Scene Classification using Audio in the nearby Environment.

Audio Scene Classification

This repository contains the code for the "Audio Scene Classification" project, which classifies what is present in a scene from the audio of the nearby environment alone, without any visual input.

PROJECT STATUS: Ongoing

To convert WAVE audio files from 44.1 kHz or 48 kHz to 16 kHz PCM WAVE files, run the following command from the folder that contains the audio files:

for f in *.wav; do
  ffmpeg -i "$f" -ar 16000 "path_to_destination_folder/${f}"
done
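The same conversion can also be driven from Python. The following is a minimal sketch, not part of the original project: the helper names `build_ffmpeg_command` and `resample_folder` are hypothetical, and it assumes `ffmpeg` is available on the PATH. It uses only the `-i` and `-ar` options shown in the shell loop above.

```python
import subprocess
from pathlib import Path


def build_ffmpeg_command(src, dst_dir, sample_rate=16000):
    """Return the ffmpeg argument list that resamples one WAV file."""
    src = Path(src)
    dst = Path(dst_dir) / src.name  # keep the original file name
    # -i: input file, -ar: output audio sample rate in Hz
    return ["ffmpeg", "-i", str(src), "-ar", str(sample_rate), str(dst)]


def resample_folder(src_dir, dst_dir, sample_rate=16000):
    """Resample every .wav file in src_dir into dst_dir (needs ffmpeg on PATH)."""
    Path(dst_dir).mkdir(parents=True, exist_ok=True)
    for wav in sorted(Path(src_dir).glob("*.wav")):
        # Passing a list (no shell) keeps file names with spaces intact.
        subprocess.run(build_ffmpeg_command(wav, dst_dir, sample_rate), check=True)
```

Calling `resample_folder(".", "path_to_destination_folder")` would then mirror the shell loop; `check=True` makes the script stop at the first file ffmpeg fails on.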

Requirements

1. Python 3.6

2. Librosa 0.6 [Audio Processing Library]

pip3 install librosa --upgrade

3. Matplotlib

pip3 install matplotlib --upgrade

4. Keras

pip3 install keras --upgrade

5. TensorFlow

pip3 install tensorflow --upgrade

or

pip3 install tensorflow-gpu --upgrade

NOTE: The TensorFlow GPU package requires CUDA and cuDNN to be installed.

6. Pickle

Pickle is part of the Python standard library, so no installation is needed.

7. TQDM [for Progressbar]

pip3 install tqdm --upgrade
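To confirm that the environment is ready, a small check along these lines can help. This is a sketch rather than part of the original project: `REQUIRED_MODULES` and `missing_modules` are hypothetical names, and the check only reports whether each module can be found, not which version is installed.

```python
import importlib.util
import sys

# Import names for the packages listed above. The pip package
# "tensorflow-gpu" is also imported as "tensorflow".
REQUIRED_MODULES = ["librosa", "matplotlib", "keras", "tensorflow", "pickle", "tqdm"]


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    print("Python", sys.version.split()[0])
    missing = missing_modules(REQUIRED_MODULES)
    print("Missing:", ", ".join(missing) if missing else "none")
```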

Dataset

The dataset I am using for this project is the "UrbanSound dataset".

Download the dataset from the link below and place it inside the dataset folder.

https://serv.cusp.nyu.edu/projects/urbansounddataset/
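Once the files are in place, it can be useful to check what was actually unpacked. The sketch below is an illustration only: `index_wav_files` is a hypothetical helper, and it assumes the audio files sit in sub-folders of the dataset folder. The exact layout depends on the dataset release, so treat the grouping by parent folder as an assumption.

```python
from collections import defaultdict
from pathlib import Path


def index_wav_files(dataset_dir="dataset"):
    """Map each sub-folder name to the sorted list of .wav files inside it."""
    index = defaultdict(list)
    for wav in sorted(Path(dataset_dir).rglob("*.wav")):
        # The immediate parent folder is used as the group key (assumption).
        index[wav.parent.name].append(wav)
    return dict(index)


if __name__ == "__main__":
    for folder, files in sorted(index_wav_files().items()):
        print(folder, len(files))
```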

Extracted Audio Features

The main extracted features from the audio are:

a). Mel Spectrogram: mel-scaled power spectrogram

b). MFCC: Mel-frequency cepstral coefficients

c). Chroma STFT: chromagram computed from a waveform or power spectrogram

d). Spectral Contrast: level difference between spectral peaks and valleys in each frequency sub-band

e). Tonnetz: tonal centroid features

The extracted features for some sample audio files are shown below:

1. Air Conditioner Audio Features

2. Car Horn Audio Features

3. Children Playing Audio Features

4. Dog Barking Audio Features

5. Idle Engine Audio Features
