AudioSegment

Wrapper for pydub AudioSegment objects. An audiosegment.AudioSegment object wraps a pydub.AudioSegment object. Any methods or properties it has, this also has.
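The wrapping described above can be pictured with a small, self-contained sketch. This is not the library's actual implementation; `Inner` and `Wrapper` are hypothetical stand-ins that only illustrate how a wrapper can expose everything the wrapped object has while adding methods of its own.

```python
class Inner:
    """Hypothetical stand-in for a wrapped object such as pydub.AudioSegment."""

    def __init__(self, frame_rate):
        self.frame_rate = frame_rate

    def describe(self):
        return f"{self.frame_rate} Hz"


class Wrapper:
    """Hypothetical wrapper: forwards unknown attributes to the wrapped object."""

    def __init__(self, inner):
        self._inner = inner

    def __getattr__(self, name):
        # __getattr__ runs only when normal lookup fails, so anything
        # defined on Wrapper itself takes precedence.
        if name == "_inner":
            # Avoid infinite recursion if _inner has not been set yet.
            raise AttributeError(name)
        return getattr(self._inner, name)

    def extra(self):
        # A method that exists only on the wrapper.
        return "only on the wrapper"


wrapped = Wrapper(Inner(frame_rate=16000))
print(wrapped.frame_rate)   # attribute forwarded to Inner
print(wrapped.describe())   # method forwarded to Inner
print(wrapped.extra())      # method defined on Wrapper
```

In practice this means code written against a pydub.AudioSegment should keep working when handed an audiosegment.AudioSegment.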

Docs are hosted by GitHub Pages, but are currently hideous. I've got to do something about them as soon as I find some time. You can also try Read The Docs, though the docs there don't seem to be building for some reason, which is also something I need to look into. Up-to-date docs are also built and pushed to the docs folder of this repository.

Notes

There is a hidden dependency on the command line program 'sox'. Pip will not install it for you. You will have to install sox yourself:

Debian/Ubuntu: sudo apt-get install sox
Mac OS X: brew install sox
Windows: choco install sox

Also, I use librosa and scipy for some of the functionality. These dependencies are hefty, so I have made them optional. If you do not install them, you may get warnings when using audiosegment.

So, a full installation on Debian/Ubuntu would look like this:

sudo apt-get install sox
pip3 install --user audiosegment

# To get scipy, you will need some lapack/blas resources:
sudo apt-get install libatlas-base-dev gfortran
pip3 install --user scipy

# To get librosa, you will need numba, which requires LLVMlite, which requires LLVM.
sudo apt-get install llvm
pip3 install --user librosa

Make suitable adjustments to fit your own OS's package management system.
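After installing, it can be handy to confirm what is actually available. The following is a minimal sketch that uses only the Python standard library; `check_environment` is a hypothetical helper name and is not part of audiosegment. It assumes sox is expected on the PATH and that the optional packages are importable as `scipy` and `librosa`.

```python
import importlib.util
import shutil


def check_environment(optional_modules=("scipy", "librosa")):
    """Report whether sox is on PATH and which optional modules can be imported."""
    # shutil.which returns None when the executable is not found on PATH.
    report = {"sox": shutil.which("sox") is not None}
    for name in optional_modules:
        # find_spec returns None for a top-level module that is not installed.
        report[name] = importlib.util.find_spec(name) is not None
    return report


if __name__ == "__main__":
    for name, found in check_environment().items():
        print(f"{name}: {'found' if found else 'missing'}")
```

A missing entry for scipy or librosa is not fatal, since those dependencies are optional, but a missing sox should be fixed before using the library.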

TODO

The following is the list of items I plan on implementing.

Finish implementing auditory scene analysis (a.k.a. blind source separation)
Add voice-pass filtering and make voice activity detection better
Add language classification for English and Chinese (and show how to do it for other languages)
Add more examples to README (especially filterbank)
Finish removing the SOX dependency

I am open to other suggestions. Open an issue if you have requests, or better yet, if you can do it yourself, open a pull request and I'll take a look and merge it in if I think it makes sense.

Example Usage

Basic information

import audiosegment

print("Reading in the wave file...")
seg = audiosegment.from_file("whatever.wav")

print("Information:")
print("Channels:", seg.channels)
print("Bits per sample:", seg.sample_width * 8)
print("Sampling frequency:", seg.frame_rate)
print("Length:", seg.duration_seconds, "seconds")

Voice Detection

# ...
print("Detecting voice...")
seg = seg.resample(sample_rate_Hz=32000, sample_width=2, channels=1)
results = seg.detect_voice()
voiced = [tup[1] for tup in results if tup[0] == 'v']
unvoiced = [tup[1] for tup in results if tup[0] == 'u']

print("Reducing voiced segments to a single wav file 'voiced.wav'")
voiced_segment = voiced[0].reduce(voiced[1:])
voiced_segment.export("voiced.wav", format="WAV")

print("Reducing unvoiced segments to a single wav file 'unvoiced.wav'")
unvoiced_segment = unvoiced[0].reduce(unvoiced[1:])
unvoiced_segment.export("unvoiced.wav", format="WAV")

Silence Removal

import matplotlib.pyplot as plt

# ...
print("Plotting before silence...")
plt.subplot(211)
plt.title("Before Silence Removal")
plt.plot(seg.get_array_of_samples())

seg = seg.filter_silence(duration_s=0.2, threshold_percentage=5.0)
outname_silence = "nosilence.wav"
seg.export(outname_silence, format="wav")

print("Plotting after silence...")
plt.subplot(212)
plt.title("After Silence Removal")
plt.plot(seg.get_array_of_samples())

# Adjust the layout once both subplots have been drawn
plt.tight_layout()
plt.show()

FFT

import matplotlib.pyplot as plt
import numpy as np

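# ...
# Do it just for the first 3 seconds of audio (slice indices are in milliseconds)
hist_bins, hist_vals = seg[:3000].fft()
# Normalize the FFT magnitudes by the number of bins (a linear scale, not dB)
hist_vals_real_normed = np.abs(hist_vals) / len(hist_vals)
plt.plot(hist_bins / 1000, hist_vals_real_normed)
plt.xlabel("kHz")
plt.ylabel("Normalized magnitude")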
plt.show()

Spectrogram

import matplotlib.pyplot as plt
import numpy as np

#...
freqs, times, amplitudes = seg.spectrogram(window_length_s=0.03, overlap=0.5)
amplitudes = 10 * np.log10(amplitudes + 1e-9)

# Plot
plt.pcolormesh(times, freqs, amplitudes)
plt.xlabel("Time in Seconds")
plt.ylabel("Frequency in Hz")
plt.show()
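The spectrogram example converts its values to decibels with `10 * log10(x + 1e-9)`. The sketch below works through that arithmetic on single numbers using only the standard library; `power_to_db` is a hypothetical helper name, not an audiosegment function. The small constant keeps the logarithm defined when a value is exactly zero, which also puts a floor of about -90 dB on the result.

```python
import math


def power_to_db(value, floor=1e-9):
    """Convert a non-negative power-like value to decibels.

    The floor is added before taking the logarithm so that a value of
    zero maps to 10 * log10(floor) instead of raising an error.
    """
    return 10 * math.log10(value + floor)


for value in (0.0, 1.0, 10.0, 100.0):
    print(value, round(power_to_db(value), 3))
```

Each tenfold increase in the input adds roughly 10 dB, and silence lands at the floor rather than at negative infinity.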
