All Projects → jpinedaa → Voice-ML

jpinedaa / Voice-ML

Licence: other
MobileNet trained with VoxCeleb dataset and used for voice verification

Programming Languages

python
139335 projects - #7 most used programming language
tcl
693 projects
c
50402 projects - #5 most used programming language
cython
566 projects
C++
36643 projects - #6 most used programming language
TeX
3793 projects

Projects that are alternatives of or similar to Voice-ML

Speaker-Recognition
This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1
Stars: ✭ 94 (+526.67%)
Mutual labels:  speaker-verification, speaker-identification
Huawei-Challenge-Speaker-Identification
Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.
Stars: ✭ 34 (+126.67%)
Mutual labels:  speaker-verification, speaker-identification
GE2E-Loss
Pytorch implementation of Generalized End-to-End Loss for speaker verification
Stars: ✭ 72 (+380%)
Mutual labels:  speaker-verification, speaker-identification
KaldiBasedSpeakerVerification
Kaldi based speaker verification
Stars: ✭ 43 (+186.67%)
Mutual labels:  speaker-verification, speaker-identification
wavenet-classifier
Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
Stars: ✭ 54 (+260%)
Mutual labels:  speaker-verification, speaker-identification
dropclass speaker
DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Stars: ✭ 20 (+33.33%)
Mutual labels:  speaker-verification, speaker-identification
bnn-icestick
Binary Neural Network on IceStick FPGA.
Stars: ✭ 45 (+200%)
Mutual labels:  hardware-acceleration
mpp
Rockchip MPP(Media Process Platfrom)
Stars: ✭ 86 (+473.33%)
Mutual labels:  hardware-acceleration
Awesome Speech Recognition Speech Synthesis Papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Stars: ✭ 2,085 (+13800%)
Mutual labels:  speaker-verification
Delta
DELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+9760%)
Mutual labels:  speaker-verification
MobileNetV3-SSD-Compact-Version
MobileNetV3 SSD的简洁版本
Stars: ✭ 65 (+333.33%)
Mutual labels:  mobilenet
Olive
Free open-source non-linear video editor
Stars: ✭ 5,682 (+37780%)
Mutual labels:  hardware-acceleration
KRS
The Kria Robotics Stack (KRS) is a ROS 2 superset for industry, an integrated set of robot libraries and utilities to accelerate the development, maintenance and commercialization of industrial-grade robotic solutions while using adaptive computing.
Stars: ✭ 26 (+73.33%)
Mutual labels:  hardware-acceleration
tapasco
The Task Parallel System Composer (TaPaSCo)
Stars: ✭ 66 (+340%)
Mutual labels:  hardware-acceleration
react-canvaskit
Experiment in creating a custom react renderer using an offscreen webgl canvas on top of Skia CanvasKit
Stars: ✭ 57 (+280%)
Mutual labels:  hardware-acceleration
optiga-trust-m
OPTIGA™ Trust M Software Framework
Stars: ✭ 86 (+473.33%)
Mutual labels:  hardware-acceleration
UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (+526.67%)
Mutual labels:  speaker-identification
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+74240%)
Mutual labels:  speaker-verification
stonne
STONNE: A Simulation Tool for Neural Networks Engines
Stars: ✭ 57 (+280%)
Mutual labels:  hardware-acceleration
Onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Stars: ✭ 5,910 (+39300%)
Mutual labels:  hardware-acceleration

Voice-ML

IoT devices are becoming every day a more intrinsic part of our daily lives, one of the most natural and common ways to interface with these devices is through voice commands. With Voice ID we can take advantage of voice interfacing and add a variety of features for both security and convenience.

Voice ID uses an AI model to recognize a user by using his voice independently of what he is saying. This way a user can save his voice ID which can then be used by devices for multiple applications such as parental control, device locking or delivering specific content to a user.

Voice ID runs on the mobile device itself so it doesn’t require internet access, it can connect directly with other devices using a wifi or Bluetooth connection. Since Voice ID will be optimized for hardware acceleration it could run on more specific devices other than a smartphone, allowing users to have more flexibility with their Voice ID’s

Model Training folder contains code for processing and training of tensorflow model

App folder contains android app that uses converted tensorflow model for verification

Results

Model EER
GMM -UBM 17.1
I-Vectors 12.8
I-Vectors + PLDA 11.5
CNN-20148 11.3
CNN-256 + Pair Selections 10.5
Mobilenet+ Siamese (this repository) 11.4

The Method used is Mobilenet+ Siamese, the other metrics are existing methods using the same dataset added for comparison, taken from Hossein Salehgaffaripaper “Speaker Verification using Convolutional NeuralNetworks”

Performance

Hardware Acceleration Enrollment(ms) Verification(ms)
No 642 634
Yes 171 191

Tested on : Pixel 3 with Android 9

Using NNAPI for hardware acceleration reduces inference time by around 70%

More Info

Report: https://1drv.ms/b/s!AvrId-VjUEW7g_RhsiMxPiswnVlRPQ?e=iopMA7

Report Chinese Version: https://1drv.ms/p/s!AvrId-VjUEW7hOR1v4vlDUSaknKN3Q?e=qy2Ey6

App demo (Chinese): https://1drv.ms/v/s!AvrId-VjUEW7hOR2T1cYDO3X4g6Oow?e=3hMmjX

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].