https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.

Stars: ✭ 24 (-74.19%)

Mutual labels: dsp

dsp

DSP and filtering library

Stars: ✭ 36 (-61.29%)

Mutual labels: dsp

TFGAN

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Stars: ✭ 65 (-30.11%)

Mutual labels: tts

tai5-uan5 gian5-gi2 kang1-ku7

臺灣言語工具

Stars: ✭ 79 (-15.05%)

Mutual labels: tts

AnotherBadBeatSaberClone

This is a discontinued but perhaps helpful VR project created during my Master's degree at FH Wedel.

Stars: ✭ 22 (-76.34%)

Mutual labels: dsp

DtBlkFx

Fast-Fourier-Transform (FFT) based VST plug-in

Stars: ✭ 99 (+6.45%)

Mutual labels: dsp

avsr-tf1

Audio-Visual Speech Recognition using Sequence to Sequence Models

Stars: ✭ 76 (-18.28%)

Mutual labels: asr

View All Similar Projects ➔

YSDA Speech Processing Course

Materials for each week are in ./week* folders

Course program

Week 1: Introduction to Speech
- Lecture: In this lecture we introduce the area of speech processing, discuss historical background and current trends. In the second half of the lecture we introduce the concept fo speech as a separate modality from text or images and foreshadow concepts from later lectures.
Week 2: Digital Signal Processing
- Lecture: In this lecture we discuss how to transform an audio signal into a form which is convenient for use in Speech Recognition and Synthesis. We discuss: how an audio wave is sampled and digitized; The Fourier Transform and the Discrete Fourier Transform and how they can be used to obtain the frequency spectrum of the signal; How to use the Short-Time-Fourier-Transform to represent sound as a Spectrogram; finally, we discuss the Mel-Scale and how to obtain a Mel-Spectrogram.
- Seminar: In part 1 we will implement the Short-Time-Fourier-Transform and obtain a Mel-Spectrogram. In part 2 we will: recover a Spectrogram from a Mel-Spectrogram. Reconstruct the original audio signal via the Griffin-Lim algorithm and do some simple voice warping.
- Homework: Audio-MNIST: Implement a Neural Network model to do simple digit classification based on a mel-spectrogram.

Contributors & course staff

Andrey Malinin - Course admin, lectures, seminars, homeworks
Vladimir Kirichenko - lectures, seminars, homeworks
Segey Dukanov - lecures, seminars, homeworks

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

yandexdataschool / speech_course

Programming Languages

Labels

Projects that are alternatives of or similar to speech course

YSDA Speech Processing Course

Course program

Contributors & course staff