HidekiKawahara / CAPRICEP

License: Apache-2.0
An extended TSP (Time Stretched Pulse). CAPRICEP substantially replaces FVN. CAPRICEP enables interactive and real-time measurement of linear time-invariant, non-linear time-invariant, and random and time-varying responses simultaneously.

CAPRICEP

An extended TSP (Time Stretched Pulse) that enables interactive and real-time measurement of linear time-invariant, non-linear time-invariant, and random and time-varying responses simultaneously. This is a substantial revision of FVN and related tools. I will stop updating FVN.

All components are uploaded. Type "realTimeTester" to start the application. Supporting materials will be ready soon.

Reference

Kawahara, H. and Yatabe, K.: Cascaded all-pass filters with randomized center frequencies and phase polarity for acoustic and speech measurement and data augmentation, arXiv:2010.13185 (2020) (Accepted: ICASSP2021). (Link to PDF)

Abstract

We introduce a new member of TSP (Time Stretched Pulse) for acoustic and speech measurement infrastructure, based on a simple all-pass filter and systematic randomization. This new infrastructure fundamentally upgrades our previous measurement procedure and enables simultaneous measurement of multiple attributes, including non-linear ones, without requiring extra filtering or post-processing. Our new proposal establishes a theoretically solid, flexible, and extensible foundation in acoustic measurement. Moreover, it is general enough to provide versatile research tools for other fields, such as biological signal analysis. We illustrate using acoustic measurements and data augmentation as representative examples among various prospective applications. We open-sourced a MATLAB implementation. It consists of an interactive and real-time acoustic tool, MATLAB functions, and supporting materials. Recently we introduced an interactive and real-time tool for measuring the voice fundamental frequency response to an auditory test signal with frequency modulation.
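The principle behind TSP-type signals can be sketched in a few lines of base MATLAB. The example below is not the CAPRICEP design: it is a generic illustration, assuming a signal with a unit-magnitude spectrum and random phase, that shows why such a signal can be compressed back to a single pulse by the inverse (conjugate-phase) filter. The length N, the random seed, and all variable names are assumptions made for this sketch.

```matlab
% Generic sketch: unit-magnitude, random-phase signal and its pulse compression.
% This is NOT the CAPRICEP construction; names and sizes are illustrative.
N = 1024;                          % FFT length (assumed)
rng(0);                            % fixed seed for reproducibility
phi = zeros(N, 1);                 % phase of each FFT bin
halfPhase = 2*pi*rand(N/2 - 1, 1); % random phase for the positive-frequency bins
phi(2:N/2) = halfPhase;            % bins 1 .. N/2-1
phi(N/2+2:N) = -flipud(halfPhase); % mirrored bins: conjugate symmetry gives a real signal
X = exp(1i*phi);                   % all-pass spectrum: |X| = 1 at every bin
x = real(ifft(X));                 % time-stretched (noise-like) signal
% Inverse filtering with the conjugate spectrum restores a unit impulse
recovered = real(ifft(fft(x) .* conj(X)));
leakage = max(abs(recovered(2:end)));
fprintf('Pulse height: %.6f, max leakage: %.2e\n', recovered(1), leakage);
```

Because the magnitude is one at every frequency, multiplying by the conjugate spectrum cancels the phase exactly, and the energy spread over the whole signal collapses into the first sample.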

Measurement of pitch extractors' response to FM test signal of simulated vowel /a/

The folder "fmResponseTester" contains the tester. The following command tests MATLAB's pitch extractor function "pitch.m" using a test signal with 240 Hz pitch and 1 cent modulation depth. The output reports (graphics and MATLAB data) are stored in the current working directory. The third argument "@pitchNCF" is a function handle to the interface program (which the user has to write) for the target pitch extractor (here, the MATLAB function "pitch.m").

output = fmTransfTestPeriodicRevSNR(targetFo, fMcent, @pitchNCF, 'pink', snr)

You can also try @pitchYANG, which tests the pitch extractor of yang_STRAIGHT. It provides a better pitch trajectory than using NCF in MATLAB.
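To make "240 Hz pitch and 1 cent modulation depth" concrete, here is a minimal sketch of a sinusoidally frequency-modulated tone in base MATLAB. It is not the test signal generated by fmTransfTestPeriodicRevSNR (which uses a simulated vowel); it only illustrates how small a 1 cent deviation is, assuming the depth denotes the peak deviation. The sampling frequency, the modulation rate fmRate, and the duration are assumptions for this sketch.

```matlab
% Illustrative FM tone; NOT the toolbox's test signal.
fs = 44100;        % sampling frequency (Hz), assumed
targetFo = 240;    % nominal fundamental frequency (Hz)
fMcent = 1;        % modulation depth (cent); 1200 cents = 1 octave
fmRate = 8;        % modulation frequency (Hz), assumed for this sketch
duration = 1;      % signal length (s), assumed
t = (0:round(duration*fs) - 1)' / fs;
% Instantaneous frequency swings by +/- fMcent cents around targetFo
foInst = targetFo * 2.^(fMcent/1200 * sin(2*pi*fmRate*t));
% Integrate the instantaneous frequency to get phase, then synthesize
phase = 2*pi*cumsum(foInst) / fs;
x = sin(phase);
maxDeviationHz = max(foInst) - targetFo;
fprintf('Peak frequency deviation: %.4f Hz\n', maxDeviationHz);
```

At 240 Hz, a 1 cent peak deviation is well below 1 Hz, which is why a dedicated measurement procedure is needed to observe an extractor's response to it.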

Reference

Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Tatsuya Kitamura, Hideki Banno, Masanori Morise: Measuring pitch extractors' response to frequency-modulated multi-component signals, arXiv:2204.00911 (2022). (Link to arXiv)

Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Tatsuya Kitamura, Hideki Banno, Masanori Morise: An objective test tool for pitch extractors' response attributes, arXiv:2204.00902 (2022). (Link to arXiv)

Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Tatsuya Kitamura, Hideki Banno, Masanori Morise: Objective measurement of pitch extractors' responses to frequency modulated sounds and two reference pitch extraction methods for analyzing voice pitch responses to auditory stimulation, arXiv:2111.03629 (2021). (Link to PDF)

Note that fmTransfTestGaussN replaces fmTransfTestGauss.

Interactive and real-time tool for measuring voice fo response to FM test signals

Download the directory "auditoryResponseTester." Then, type "auditoryResponseTester" to invoke the tool. Watch the video

Getting started: instruction manual

English and Japanese versions are in the doc folder.

Reference

Hideki Kawahara, Toshie Matsui, Kohei Yatabe, Ken-Ichi Sakakibara, Minoru Tsuzaki, Masanori Morise, Toshio Irino: Mixture of orthogonal sequences made from extended time-stretched pulses enables measurement of involuntary voice fundamental frequency response to pitch perturbation, arXiv:2104.01444 (2021). (Link to PDF)

The following method is adopted for CAPRICEP and built into this tool.

Hideki Kawahara; Ken-Ichi Sakakibara; Mitsunori Mizumachi; Masanori Morise; Hideki Banno: Simultaneous measurement of time-invariant linear and nonlinear, and random and extra responses using frequency domain variant of velvet noise, Proc. APSIPA ASC, pp.174-183 (2020). (Link to PDF)

Interactive and real-time acoustic measurement tool (Compatible with 32-bit floating point)

Try "realTimeTester" to invoke the tool. Watch the video

Use "capResultReporter" to visualize and report the measurement results.

Off-line acoustic measurement (Compatible with 32-bit floating point)

Try:

% Select an audio device for simultaneous playback and recording
app = struct;
app.CommonSignal = struct;
devices = getAudioDevices(audioPlayerRecorder);
[deviceID,tf] = listdlg('ListString',devices, 'SelectionMode','single');
if tf   % proceed only when the user selected a device
    app.CommonSignal.DeviceName = devices{deviceID};
    app.DeviceLabel.Text = devices{deviceID};
    fs = 44100;   % sampling frequency (Hz)
    app.CommonSignal.Reader = audioPlayerRecorder(fs,"RecorderChannelMapping", 1, ...
        "BitDepth", "24-bit integer", "Device", devices{deviceID});
    app.DriverLabel.Text = 'simultaneousIO'; %get(app.CommonSignal.Reader,"Driver");
    % Measurement settings
    tResponse = 400;
    nRepetition = 30;
    outChannel = 'L-ch';
    option.calibrationConst = 200;
    inChannel = 1;
    option.DeviceName = devices{deviceID};
    % Run the off-line measurement
    analysisStr = capricepResponseTest(fs, tResponse, nRepetition, ...
        outChannel, inChannel, 'acoustic_system', option);
    % Copy the fields needed by the reporter into a compact structure
    analysisStrCore = struct;
    analysisStrCore.yRecorded = analysisStr.yRecorded;
    analysisStrCore.pinkLPC = analysisStr.pinkLPC;
    analysisStrCore.fs = fs;
    analysisStrCore.tResponse = analysisStr.tResponse;
    analysisStrCore.nRepetition = analysisStr.nRepetition;
    analysisStrCore.outChannel = analysisStr.outChannel;
    analysisStrCore.numChannels = analysisStr.numChannels;
    analysisStrCore.selectedChannels = analysisStr.selectedChannels;
    analysisStrCore.lAeq = analysisStr.lAeq;
    analysisStrCore.calibrationConst = analysisStr.calibrationConst;
    % Save the result to the current directory and generate the report
    app.CommonSignal.outPath = ".";
    fname = "test";
    save(app.CommonSignal.outPath + "/" + fname, "analysisStrCore");
    tmp = capResultReporter([char(app.CommonSignal.outPath)   '/'  char(fname)], 'NONE');
end

For a typical room, tResponse = 400 provides better results at the low-frequency end. tResponse = 800 is better for a classroom.
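The field-by-field copy into analysisStrCore in the script above can also be written as a loop over field names. The following is a minimal sketch of that pattern, using a dummy structure in place of the real output of capricepResponseTest so that it runs without audio hardware. The dummy values, the field "extraField", and the file name "capricepSketch.mat" are hypothetical.

```matlab
% Stand-in for the structure returned by capricepResponseTest (dummy values)
analysisStr = struct('yRecorded', zeros(10, 1), 'pinkLPC', 1, ...
    'tResponse', 400, 'nRepetition', 30, 'extraField', 'not saved');
% Fields to keep in the compact structure (a subset, for illustration)
keepFields = {'yRecorded', 'pinkLPC', 'tResponse', 'nRepetition'};
analysisStrCore = struct;
for k = 1:numel(keepFields)
    % Dynamic field names copy each selected field
    analysisStrCore.(keepFields{k}) = analysisStr.(keepFields{k});
end
% Save to a temporary file and load it back to check the contents
outFile = fullfile(tempdir, 'capricepSketch.mat');
save(outFile, 'analysisStrCore');
loaded = load(outFile);
disp(fieldnames(loaded.analysisStrCore));
```

Keeping the list of saved fields in one cell array makes it easier to see which parts of the measurement are stored in the report file.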

Sample files

The following files sound perceptually indistinguishable (at least to me).

Please refer to Fig.5 of the reference. Please download and check.

The worst SNR (-1dB) file filtered by CAPRICEP

capricepWorstSNRminus1dB.wav

The best SNR (6dB) file filtered by CAPRICEP

capricepBestSNR6dB.wav

The original file (level adjusted to CAPRICEP samples)

originalLevelAdjusted.wav

Links to installers of the compiled stand-alone application

These installers are still experimental (17 Jan. 2021).

Installer for macOS

Link to installer. Please download this folder as a whole. Then, execute the installer application.

Installer for Windows 10 (64-bit)

Link to installer. Please click this link. Then execute the downloaded installer application.

Memo (working)

Auditory to response tester bug fix and update (24 Nov. 2021)

Auditory to response tester bug fix and update (03 Sept. 2021)

Urgent bug fix release for OTOGAKU symposium (18 June 2021)

Auditory to voice fo response tester is added (6 April 2021)

Driver assignment is added to off-line measurement function (13 March 2021)

Calibration error and driver compatibility issue fixed (08 March 2021)

Voice memo fix for ASIO compatibility (25 Jan. 2021)

Tuning for Windows (24 Jan. 2021)

Minor fix of GUI behavior (23 Jun. 2021)

192kHz sampling measurement is now functional on Windows (23 Jan. 2021)

Binary installers are updated (17 Jan. 2021)

192kHz sampling rate for off-line measurement is added (16 Jan. 2021)

Bug fix for report: the vertical axis now shows sound pressure level (16 Jan. 2021)

Bug fix for off-line measurement (22 Dec. 2020)

Added 176400 Hz and 192000 Hz sampling frequency for off-line measurement functions

Revised event detection (05 Dec. 2020)

Revised for resilience (02 Dec. 2020)

Multi channel input (30 Nov. 2020)

Generalized for multi input channels and added test code (27 Nov. 2020). Note: only for off-line measurement

Data inspection function added (24 Nov. 2020)

Event detection refinement

Bug fix in reverberation time and made robust (23 Nov. 2020)

Added test folder

Visualization and report function is added (20 Nov. 2020).

Detailed report added (13 Nov. 2020).

Figure appearance compatibility in report is now better (12 Nov. 2020).

All components are uploaded. Type "realTimeTester" to start the application

Added off-line analysis function

Uploaded unit-CAPRICEPs for 100, 200, 400, 800, and 1600 ms