jonnor / Machinehearing
Machine Learning applied to sound
Stars: ✭ 83
Programming Languages
javascript
184084 projects - #8 most used programming language
Projects that are alternatives of or similar to Machinehearing
Audio-Classification-using-CNN-MLP
Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to identify sound of a bee, cricket or noise.
Stars: ✭ 36 (-56.63%)
Mutual labels: audio-analysis, audio-processing
Chromaprint.scala
Chromaprint/AcoustID audio fingerprinting for the JVM
Stars: ✭ 81 (-2.41%)
Mutual labels: audio-processing, audio-analysis
Aukit
audio toolkit. 好用的语音处理工具箱,包含语音降噪、音频格式转换、特征频谱生成等模块。
Stars: ✭ 105 (+26.51%)
Mutual labels: audio-processing, audio-analysis
MusicVisualizer
A music visualizer based on the ATMEGA328P-AU
Stars: ✭ 30 (-63.86%)
Mutual labels: audio-analysis, audio-processing
MixingBear
Package for automatic beat-mixing of music files in Python 🐻🎚
Stars: ✭ 73 (-12.05%)
Mutual labels: audio-analysis, audio-processing
audio noise clustering
https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.
Stars: ✭ 24 (-71.08%)
Mutual labels: audio-analysis, audio-processing
ACA-Slides
Slides and Code for "An Introduction to Audio Content Analysis," also taught at Georgia Tech as MUSI-6201. This introductory course on Music Information Retrieval is based on the text book "An Introduction to Audio Content Analysis", Wiley 2012/2022
Stars: ✭ 84 (+1.2%)
Mutual labels: audio-analysis, audio-processing
tsunami
A simple but powerful audio editor
Stars: ✭ 41 (-50.6%)
Mutual labels: audio-analysis, audio-processing
Chromaprint
C library for generating audio fingerprints used by AcoustID
Stars: ✭ 553 (+566.27%)
Mutual labels: audio-processing, audio-analysis
Awesome Web Audio
A list of resources and projects to help learn about audio
Stars: ✭ 73 (-12.05%)
Mutual labels: audio-processing
Notes
A zero dependency shell script that makes it really simple to manage your text notes.
Stars: ✭ 62 (-25.3%)
Mutual labels: notes
Lepton
💻 Democratizing Snippet Management (macOS/Win/Linux)
Stars: ✭ 9,067 (+10824.1%)
Mutual labels: notes
Enex Dump
Dump the content of .enex files, preserving attachements, some metadata and optionally converting notes to Markdown.
Stars: ✭ 73 (-12.05%)
Mutual labels: notes
Audio Pretrained Model
A collection of Audio and Speech pre-trained models.
Stars: ✭ 61 (-26.51%)
Mutual labels: audio-processing
Real Time Rendering 3rd Cn Summary Ebook
📘 电子书 -《Real-Time Rendering 3rd》提炼总结 | 全书共9万7千余字。你可以把它看做中文通俗版的《Real-Time Rendering 3rd》,也可以把它看做《Real-Time Rendering 3rd》的解读版与配套学习伴侣,或者《Real-Time Rendering 4th》的前置阅读材料。
Stars: ✭ 1,159 (+1296.39%)
Mutual labels: notes
Machine Hearing
Classifying sound using Machine Learning
At KnowIt Oslo, 2020. Video recording, slides, notes
Environmental Sound Classification on Microcontrollers using Convolutional Neural Networks
Master thesis. Report and code available in the Github repository.
EuroPython2019: Audio Classification using Machine Learning
Presentation at EuroPython2019. Video recording, notes
PyCode2019: Recognizing sounds with Machine Learning and Python
Presentation at PyCode Conference 2019 in Gdansk. Slides, notes
Video recording. Coming, maybe in November.
SenseCamp2019: Classification of Environmental Sound using IoT sensors
Presentation at SenseCamp 2019 hosted by FORCE Technology Senselab. Slides: web, .PDF
NMBU lecture on Audio Classification
Report and lecture at NMBU Data Science.
Stack Overflow answers
With example code in Python
- Loading Youtube audio data with youtube-dl and librosa
- Extracting fixed-size analysis windows from audio
- Classifying an audio clip of many analysis windows using Keras Timedistributed and GlobalAveragePooling
- Classifying an audio clip by voting over analysis windows. Mean/majority voting.
- Annotating/labeling audio data using Audacity
- Preprocessing audio into mel-spectrograms
- Multi-core preprocessing of audio files using joblib
- Compute MFCC or mel-spectrogram from existing STFT spectrograms
- Converting mel-spectrograms into PNG images
- Converting mel-spectrogram or MFCC back to audio waveform using librosa
Notes
Rough notes on various topics.
- Applications. Practical applications of Machine Hearing
- Tasks. Established problem formulations
- Audio Quality. Metrics for measuring audio quality
- Explainable models for Audio.
- Features. Feature representations
- Preprocessing. Preprocessing techniques
- DCASE2018. Notes from DCASE2018 challenge and conference
- Commercial solutions. Companies and products in Machine Hearing
- Speech. Speech-specific techniques and applications
- Music. Music-specific techniques and applications
- Compressive Sensing.
Resources
Useful resources to learn more.
Presentations
- Audio Event Detection w/Deep Learning. By Robert Coop, Ph.D, Head of AI and ML @ Stanley B&D. From Data Science Connect, 2028.
Books
- Computational Analysis of Sound Scenes and Events. Tuomas Virtanen, Mark D. Plumbley, Dan Ellis. 2018.
- Human and Machine Hearing - Extracting Meaning from Sound. Richard F. Lyon. 2017, revised 2018.
- An Introduction to Audio Content Analysis - Applications in Signal Processing and Music Informatics. Alexander Lerch. 2012. Companion website: https://www.audiocontentanalysis.org/
- Machine Learning for Audio, Image and Video Analysis: Theory and Applications (Advanced Information and Knowledge Processing). Francesco Camastra, 3 sections. From Perception to Computation, Machine Learning, Applications.
Online courses
- CSC 83060: Speech and Audio Understanding. http://mr-pc.org/t/csc83060/ Brooklyn College (CUNY).
Software
Feature extraction
- librosa. The go-to Python module.
- essentia. C++ library, with Python bindings. Lots of Music Analysis extractors. Used by FreeSound and Acousticbrainz.
- kapre. On-demand GPU computation of melspectrograms, for Keras
- torchaudio.
Data Augmentation
Lecture notes
- Audio Classification. http://www.cs.tut.fi/~sgn24006/PDF/L04-audio-classification.pdf Covers low-level features, MFCC. Classification by distance metrics. GMM. HMM.
- Speech Signal Analysis, Lecture 2. January 2017, Hiroshi Shimodaira and Steve Renals. ! great diagrams of audio discretization, mel filters, wide versus narrow-band spectrograms.
Competions
- Kaggle Whale detection
- Kaggle FreeSound tagging 2018
- Kaggle FreeSound
- DCASE2014
- DCASE2018
- DCASE2019
Datasets
Online Communities
- https://mircommunity.slack.com/ - Music Information Retrieval
Lists
- Awesome Deep Learning Music
- Fast.ai forums: Deep Learning with Audio. Large lists of resources, both in first post and "popular links". Feb 2019, 315 replies over 4 months.
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].