Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+13800%)

Mutual labels: speaker-verification

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+9760%)

Mutual labels: speaker-verification

MobileNetV3-SSD-Compact-Version

A compact version of MobileNetV3 SSD

Stars: ✭ 65 (+333.33%)

Mutual labels: mobilenet

Olive

Free open-source non-linear video editor

Stars: ✭ 5,682 (+37780%)

Mutual labels: hardware-acceleration

KRS

The Kria Robotics Stack (KRS) is a ROS 2 superset for industry, an integrated set of robot libraries and utilities to accelerate the development, maintenance and commercialization of industrial-grade robotic solutions while using adaptive computing.

Stars: ✭ 26 (+73.33%)

Mutual labels: hardware-acceleration

tapasco

The Task Parallel System Composer (TaPaSCo)

Stars: ✭ 66 (+340%)

Mutual labels: hardware-acceleration

react-canvaskit

Experiment in creating a custom react renderer using an offscreen webgl canvas on top of Skia CanvasKit

Stars: ✭ 57 (+280%)

Mutual labels: hardware-acceleration

optiga-trust-m

OPTIGA™ Trust M Software Framework

Stars: ✭ 86 (+473.33%)

Mutual labels: hardware-acceleration

UHV-OTS-Speech

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Stars: ✭ 94 (+526.67%)

Mutual labels: speaker-identification

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+74240%)

Mutual labels: speaker-verification

stonne

STONNE: A Simulation Tool for Neural Networks Engines

Stars: ✭ 57 (+280%)

Mutual labels: hardware-acceleration

Onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

Stars: ✭ 5,910 (+39300%)

Mutual labels: hardware-acceleration


Voice-ML

IoT devices are becoming a more intrinsic part of our daily lives every day, and one of the most natural and common ways to interface with them is through voice commands. With Voice ID we can take advantage of voice interfacing and add a variety of features for both security and convenience.

Voice ID uses an AI model to recognize users by their voice, independently of what they are saying. A user can save a voice ID, which devices can then use for applications such as parental control, device locking, or delivering user-specific content.

Voice ID runs on the mobile device itself, so it doesn’t require internet access; it can connect directly to other devices over a Wi-Fi or Bluetooth connection. Because Voice ID will be optimized for hardware acceleration, it could also run on more specialized devices than a smartphone, giving users more flexibility with their Voice IDs.
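The save-then-recognize flow described above can be sketched in a few lines. This is a minimal illustration, not the repository's implementation: it assumes a speaker model has already turned each utterance into a fixed-length embedding vector, and the names `enroll` and `verify`, the cosine-similarity score, and the `0.7` threshold are all hypothetical choices made for the example.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def enroll(embeddings: Sequence[Sequence[float]]) -> List[float]:
    """Average several utterance embeddings into one stored voice ID (hypothetical helper)."""
    if not embeddings:
        raise ValueError("at least one enrollment embedding is required")
    count = len(embeddings)
    return [sum(values) / count for values in zip(*embeddings)]


def verify(voice_id: Sequence[float], embedding: Sequence[float],
           threshold: float = 0.7) -> bool:
    """Accept the speaker if the new embedding is close enough to the voice ID.

    The threshold is an assumed placeholder; in practice it would be tuned
    on held-out data.
    """
    return cosine_similarity(voice_id, embedding) >= threshold


# Toy 3-dimensional embeddings standing in for real model outputs.
voice_id = enroll([[1.0, 0.0, 0.0], [0.8, 0.2, 0.0]])
same_speaker = verify(voice_id, [0.9, 0.1, 0.1])    # close to the voice ID
other_speaker = verify(voice_id, [0.0, 1.0, 0.0])   # far from the voice ID
```

Because the comparison only looks at the embedding, not at the words spoken, the same check works regardless of what the user says, provided the embedding model itself is text-independent.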

The Model Training folder contains code for data processing and training of the TensorFlow model.

The App folder contains the Android app that uses the converted TensorFlow model for verification.

Results

Model	EER (%)
GMM-UBM	17.1
I-Vectors	12.8
I-Vectors + PLDA	11.5
CNN-20148	11.3
CNN-256 + Pair Selections	10.5
MobileNet + Siamese (this repository)	11.4

The method used in this repository is MobileNet + Siamese. The other rows are existing methods evaluated on the same dataset, added for comparison and taken from Hossein Salehghaffari's paper “Speaker Verification using Convolutional Neural Networks”.
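For readers unfamiliar with the metric in the results table: the equal error rate (EER) is the operating point at which the false acceptance rate (impostors accepted) equals the false rejection rate (genuine speakers rejected), so lower is better. The sketch below approximates it by sweeping a threshold over a set of similarity scores. It is an illustration only, not the evaluation code behind the numbers above; the function name and the toy scores are made up for the example.

```python
from typing import Sequence


def equal_error_rate(genuine_scores: Sequence[float],
                     impostor_scores: Sequence[float]) -> float:
    """Approximate the EER by trying every observed score as a threshold.

    A trial is accepted when its score is >= the threshold. The function
    returns the mean of FAR and FRR at the threshold where they are closest.
    """
    if not genuine_scores or not impostor_scores:
        raise ValueError("both score lists must be non-empty")
    best_gap = float("inf")
    eer = 1.0
    for threshold in sorted(set(genuine_scores) | set(impostor_scores)):
        # False acceptance rate: impostor trials that pass the threshold.
        far = sum(s >= threshold for s in impostor_scores) / len(impostor_scores)
        # False rejection rate: genuine trials that fail the threshold.
        frr = sum(s < threshold for s in genuine_scores) / len(genuine_scores)
        gap = abs(far - frr)
        if gap < best_gap:
            best_gap = gap
            eer = (far + frr) / 2.0
    return eer


# Toy scores: one genuine trial scores below one impostor trial.
genuine = [0.9, 0.8, 0.7, 0.4]
impostor = [0.6, 0.3, 0.2, 0.1]
print(equal_error_rate(genuine, impostor))  # 0.25, i.e. an EER of 25%
```

With real evaluation data the score lists would contain thousands of trials, and the result multiplied by 100 would be comparable to the percentages in the table.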

Performance

Hardware Acceleration	Enrollment (ms)	Verification (ms)
No	642	634
Yes	171	191

Tested on: Pixel 3 with Android 9

Using NNAPI for hardware acceleration reduces inference time by around 70% (about 73% for enrollment and about 70% for verification).
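The reduction figure follows directly from the timings in the performance table. A short check of the arithmetic, using only the four measured values reported above (the helper name is illustrative):

```python
def percent_reduction(baseline_ms: float, accelerated_ms: float) -> float:
    """Percentage of time saved relative to the unaccelerated baseline."""
    return 100.0 * (baseline_ms - accelerated_ms) / baseline_ms


# Timings from the performance table (Pixel 3, Android 9).
enrollment_saving = percent_reduction(642, 171)     # roughly 73.4
verification_saving = percent_reduction(634, 191)   # roughly 69.9

print(f"Enrollment:   {enrollment_saving:.1f}% faster")
print(f"Verification: {verification_saving:.1f}% faster")
```

Expressed as a speedup factor instead, the same numbers correspond to roughly 3.8x for enrollment and 3.3x for verification.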

More Info

Report: https://1drv.ms/b/s!AvrId-VjUEW7g_RhsiMxPiswnVlRPQ?e=iopMA7

Report (Chinese version): https://1drv.ms/p/s!AvrId-VjUEW7hOR1v4vlDUSaknKN3Q?e=qy2Ey6

App demo (Chinese): https://1drv.ms/v/s!AvrId-VjUEW7hOR2T1cYDO3X4g6Oow?e=3hMmjX


jpinedaa / Voice-ML
