All Projects → fulldecent → Formant Analyzer

fulldecent / Formant Analyzer

Licence: mit
iOS application for finding formants in spoken sounds

Programming Languages

swift
15916 projects
language
365 projects

Projects that are alternatives of or similar to Formant Analyzer

Pncc
A implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-6.98%)
Mutual labels:  speech-recognition, speech-processing
Webpack Pwa Manifest
Progressive Web App Manifest Generator for Webpack, with auto icon resizing and fingerprinting support.
Stars: ✭ 447 (+939.53%)
Mutual labels:  application, app
Appify
Create a macOS Application from an executable (like a Go binary)
Stars: ✭ 372 (+765.12%)
Mutual labels:  application, app
nimblenote
Simple keyboard-driven note taking application for macOS, Linux and Windows.
Stars: ✭ 31 (-27.91%)
Mutual labels:  application, app
Awesome Diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (+1465.12%)
Mutual labels:  speech-recognition, speech-processing
Livebot
An app that allows you to be inside a bot!
Stars: ✭ 271 (+530.23%)
Mutual labels:  application, app
Spectro
🎶 Real-time audio spectrogram generator for the web
Stars: ✭ 383 (+790.7%)
Mutual labels:  application, app
Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+376.74%)
Mutual labels:  speech-recognition, speech-processing
Obfuscapk
An automatic obfuscation tool for Android apps that works in a black-box fashion, supports advanced obfuscation features and has a modular architecture easily extensible with new techniques
Stars: ✭ 456 (+960.47%)
Mutual labels:  application, app
Gatsby Mail
A Gatsby email *application*
Stars: ✭ 450 (+946.51%)
Mutual labels:  application, app
scim
[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (-60.47%)
Mutual labels:  speech-recognition, speech-processing
Diycode
[暂停维护]diycode android app
Stars: ✭ 931 (+2065.12%)
Mutual labels:  application, app
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+420.93%)
Mutual labels:  speech-recognition, speech-processing
Marky
A markdown editor built with Electron and React
Stars: ✭ 355 (+725.58%)
Mutual labels:  application, app
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-37.21%)
Mutual labels:  speech-recognition, speech-processing
Nineanimator
An elegant way of discovering anime on iOS.
Stars: ✭ 369 (+758.14%)
Mutual labels:  application, app
QuantumSpeech-QCNN
IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (+65.12%)
Mutual labels:  speech-recognition, speech-processing
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+1855.81%)
Mutual labels:  speech-recognition, speech-processing
Uspeech
Speech recognition toolkit for the arduino
Stars: ✭ 448 (+941.86%)
Mutual labels:  speech-recognition, speech-processing
Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+1676.74%)
Mutual labels:  speech-recognition, speech-processing

FORMANT PLOTTER

Buy on the App Store: https://itunes.apple.com/us/app/formant-analyzer/id799183655?mt=8&uo=4&at=11l6hc&ct=fnd

This is an iOS project to analyze formants. The user speaks and the formant is plotted on the screen immediately. It is designed for speaking a single vowel syllable. It will try to isolate the vowel sound from any surrounding consonants if it can.

Formant Research

Other related tools and formant information

Vowel formant chart:

vowel		F1	F2	F3
ee	male	270	2290	3010
	female	310	2790	3310
	child	370	3200	3730
e	male	530	1840	2480
	female	610	2330	2990
	child	690	2610	3570
ae	male	660	1720	2410
	female	850	2050	2850
	child	1030	2320	3320
ah	male	730	1090	2440
	female	590	1220	2810
	child	680	1370	3170
oo	male	300	870	2240
	female	370	950	2670
	child	430	1170	3260

The Formant Plotter

The program starts in green state. When the user starts talking (i.e. RMS goes above 0dBm for at least 0.1 seconds), the program goes into listening state and records the sound. When the user stops talking (i.e. RMS goes below 0dBm for at least 0.1 seconds), the program returns to ready state and draws graphs.

Graph drawing is done as follows: The recorded sound is truncated to remove the first and last 10% of the data. Then perform a Fast Fourier Transform (FFT) with autocorrelation. The result is plotted linear from 0 - 4000 Hz on the X axis and from -60 to 0 dB log scale on the Y axis.

The second graph is drawn as follows: An image is placed on the background for the chart (you create an image to start with) and two dots are plotted on the chart, representing the highest and lowest sample value from the recording. That's it.

The correct algorithm which takes the FFT results which were plotted above and creates the vowel plot is discussed in Formant Research above.

Some potential next steps include:

  • Use autocorrelation to increase trimming accuracy
  • Windowing on the truncated sound buffer so that edge samples have an attenuated effect
  • Root polishing. The code has been written but commented out (please see PlotView.m). If we can test and refine this part, we will have better estimates of roots of LPC polynomials, and formant frequencies. We may not want VERY accurate estimates of formant frequencies and may not need root polishing.
  • Elimination of weak roots (far away from unit circle). They do not produce a peak in H(w) and should be ignored. I hope that if we reduce order of LPC, we may not see such weak roots. This should be investigated after reduction of LPC filter order.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].