All Projects → r9y9 → Sptk

r9y9 / Sptk

Licence: other
A modified version of Speech Signal Processing Toolkit (SPTK)

Programming Languages

c
50402 projects - #5 most used programming language

Projects that are alternatives of or similar to Sptk

Awesome Speech Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Stars: ✭ 257 (+261.97%)
Mutual labels:  speech-processing
Awesome Diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (+847.89%)
Mutual labels:  speech-processing
Formant Analyzer
iOS application for finding formants in spoken sounds
Stars: ✭ 43 (-39.44%)
Mutual labels:  speech-processing
Pysptk
A python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (+318.31%)
Mutual labels:  speech-processing
Uspeech
Speech recognition toolkit for the arduino
Stars: ✭ 448 (+530.99%)
Mutual labels:  speech-processing
Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+976.06%)
Mutual labels:  speech-processing
hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (+23.94%)
Mutual labels:  speech-processing
Dnc
Discriminative Neural Clustering for Speaker Diarisation
Stars: ✭ 60 (-15.49%)
Mutual labels:  speech-processing
Speech Denoising Wavenet
A neural network for end-to-end speech denoising
Stars: ✭ 516 (+626.76%)
Mutual labels:  speech-processing
Pncc
A implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-43.66%)
Mutual labels:  speech-processing
Nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
Stars: ✭ 308 (+333.8%)
Mutual labels:  speech-processing
Pase
Problem Agnostic Speech Encoder
Stars: ✭ 348 (+390.14%)
Mutual labels:  speech-processing
Rte Speech Generator
Natural Language Processing to generate new speeches for the President of Turkey.
Stars: ✭ 22 (-69.01%)
Mutual labels:  speech-processing
Neural Voice Cloning With Few Samples
This repository has implementation for "Neural Voice Cloning With Few Samples"
Stars: ✭ 262 (+269.01%)
Mutual labels:  speech-processing
Keras Sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-33.8%)
Mutual labels:  speech-processing
SpeechTransProgress
Tracking the progress in end-to-end speech translation
Stars: ✭ 139 (+95.77%)
Mutual labels:  speech-processing
Audino
Open source audio annotation tool for humans™
Stars: ✭ 740 (+942.25%)
Mutual labels:  speech-processing
Gcommandspytorch
ConvNets for Audio Recognition using Google Commands Dataset
Stars: ✭ 65 (-8.45%)
Mutual labels:  speech-processing
Fullsubnet
PyTorch implementation of "A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Stars: ✭ 51 (-28.17%)
Mutual labels:  speech-processing
Pyannote Audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Stars: ✭ 978 (+1277.46%)
Mutual labels:  speech-processing

SPTK

Build Status Build status

This repository hosts a modified version of Speech Signal Processing Toolkit (SPTK) to provide a SPTK library rather than CLI tools so that it can easily be used from FFI interfaces (e.g., cython or cffi in Python).

This version of SPTK is used by the following bindings:

Note that if you want the command line programs of SPTK, please check the original one instead.

Supported Platforms

  • Linux
  • Mac OS X
  • Windows

Changes

The important changes from the original SPTK are summerized as follows:

  • Integrate waf build tool
  • Add swipe inteface to SPTK.h SPTK.h#L382
  • Windows 32/64 bit support. DLLs are avilable on the release page (https://github.com/r9y9/SPTK/releases)
  • Avoid exit inside a function, as possible.
  • Remove wavsplit and wavjoin from target sources to compile for cross-platform compilation ability, since original wavsplit and wavjoin use direct.h that only exists in POSIX enviroments. (#8)
  • Add DLLEXPORT macro to expose API functions explicitly in MSVC environments (NOTE: this is not fully used for now) (#8)
  • Ensure c89 compatibility
  • Export excite function (#11)
  • Cleanup and export RAPT function (#12)

Installation

Type the following commands from terminal:

./waf configure
./waf
sudo ./waf install

License

Modified BSD

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].