crouchred / Speaker Recognition Py3

Licence: apache-2.0
Based on MFCC and GMM (voice recognition based on MFCC and Gaussian mixture models)

About

This project is a simple Python 3 version of speaker-recognition, with a few small changes that make it more convenient to use from the command line.

Differences from the Python 2 speaker-recognition

  • Uses python_speech_features for MFCC extraction, rather than the bob implementation or a custom one.
  • Removes the GUI; the model can only be trained and used for prediction from the command line.
  • Replaces the sklearn functions and classes that are scheduled for removal in later versions.
  • Uses a softmax function to output probabilities.
  • Converts the audio to mono if the original is stereo.
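
As a rough illustration of the last two points, the sketch below shows one way to turn per-speaker scores into probabilities with a softmax and to average a multi-channel recording down to mono. The names `softmax` and `to_mono` are hypothetical and are not taken from this project's code. The sketch uses only the standard library, whereas real code would more likely operate on NumPy arrays.

```python
import math


def softmax(scores):
    """Turn per-speaker scores (e.g. GMM log-likelihoods) into probabilities."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def to_mono(frames):
    """Average the channels of each frame, e.g. (left, right) -> one sample."""
    return [sum(frame) / len(frame) for frame in frames]


if __name__ == "__main__":
    # Hypothetical log-likelihood scores for two enrolled speakers.
    print(softmax([-1200.0, -1210.0]))
    # Two stereo frames averaged down to mono samples.
    print(to_mono([(0, 2), (4, 4)]))
```

Subtracting the maximum score before exponentiating matters here because log-likelihoods are often large negative numbers, and exponentiating them directly would underflow to zero.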

Usage

usage: speaker-recognition.py [-h] -t TASK -i INPUT -m MODEL

Speaker Recognition Command Line Tool

optional arguments:
  -h, --help            show this help message and exit
  -t TASK, --task TASK  Task to do. Either "enroll" or "predict"
  -i INPUT, --input INPUT
                        Input Files(to predict) or Directories(to enroll)
  -m MODEL, --model MODEL
                        Model file to save(in enroll) or use(in predict)

Wav files in each input directory will be labeled as the basename of the directory.
Note that wildcard inputs should be *quoted*, and they will be sent to glob module.

Examples:
    Train:
    ./speaker-recognition.py -t enroll -i "/tmp/person* ./mary" -m model.out

    Predict:
    ./speaker-recognition.py -t predict -i "./*.wav" -m model.out
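
The labelling and wildcard rules described above can be sketched as follows. This is a minimal illustration, not the project's actual code: the function name `expand_input_dirs` is hypothetical, and it assumes the quoted input string is split on whitespace before each pattern is passed to `glob`.

```python
import glob
import os


def expand_input_dirs(input_arg):
    """Expand a quoted, space-separated list of directory patterns.

    Returns a dict mapping each label (the directory's basename) to the
    sorted list of .wav files found directly inside that directory.
    """
    labelled = {}
    for pattern in input_arg.strip().split():
        for path in sorted(glob.glob(os.path.expanduser(pattern))):
            if not os.path.isdir(path):
                continue  # only directories are enrolled
            # normpath drops any trailing separator so basename is not empty
            label = os.path.basename(os.path.normpath(path))
            wavs = sorted(glob.glob(os.path.join(path, "*.wav")))
            if wavs:
                labelled.setdefault(label, []).extend(wavs)
    return labelled


if __name__ == "__main__":
    # Mirrors the "Train" example: the shell does not expand the wildcard
    # because the argument is quoted, so the script expands it itself.
    for label, files in expand_input_dirs("/tmp/person* ./mary").items():
        print(label, len(files))
```

Quoting the argument is what keeps the shell from expanding `*` before the script sees it; an unquoted wildcard would arrive as several separate arguments instead of one string.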