Alias is a teachable “parasite” designed to give users more control over their smart assistants, in both customisation and privacy. Through a simple app, the user can train Alias to react to a custom wake word or sound; once trained, Alias can take control of the home assistant by activating it for the user.

Stars: ✭ 1,577 (+822.22%)

Mutual labels: speech-recognition

Ai Study

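A comprehensive collection of AI study materials, covering machine learning fundamentals (ML), deep learning fundamentals (DL), computer vision (CV), natural language processing (NLP), recommender systems, speech recognition, graph neural networks, and interview questions for algorithm engineers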

Stars: ✭ 93 (-45.61%)

Mutual labels: speech-recognition

Aimybox Android Assistant

Embeddable custom voice assistant for Android applications

Stars: ✭ 139 (-18.71%)

Mutual labels: speech-recognition

Rnn Transducer

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks

Stars: ✭ 114 (-33.33%)

Mutual labels: speech-recognition

Clovacall

ClovaCall dataset and PyTorch LAS baseline code (Interspeech 2020)

Stars: ✭ 151 (-11.7%)

Mutual labels: speech-recognition

Transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Stars: ✭ 55,742 (+32497.66%)

Mutual labels: speech-recognition

Voice activity detection

Voice Activity Detection based on Deep Learning & TensorFlow

Stars: ✭ 132 (-22.81%)

Mutual labels: speech-recognition

Pansori

Tools for ASR Corpus Generation from Online Video

Stars: ✭ 106 (-38.01%)

Mutual labels: speech-recognition

Kaldiio

A pure python module for reading and writing kaldi ark files

Stars: ✭ 160 (-6.43%)

Mutual labels: speech-recognition

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+764.91%)

Mutual labels: speech-recognition

Pytorch Speech Commands

Speech commands recognition with PyTorch

Stars: ✭ 128 (-25.15%)

Mutual labels: speech-recognition

Speech And Text

Speech to text (PocketSphinx, iFlytek API, Baidu API) and text to speech (pyttsx3)

Stars: ✭ 102 (-40.35%)

Mutual labels: speech-recognition

Speech Recognition Neural Network

This is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.

Stars: ✭ 148 (-13.45%)

Mutual labels: speech-recognition

Factorized Tdnn

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Stars: ✭ 98 (-42.69%)

Mutual labels: speech-recognition

Keras Kaldi

Keras Interface for Kaldi ASR

Stars: ✭ 124 (-27.49%)

Mutual labels: speech-recognition

Wer are we

Attempt at tracking the state of the art and recent results (bibliography) on speech recognition.

Stars: ✭ 1,684 (+884.8%)

Mutual labels: speech-recognition

Ktspeechcrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

Stars: ✭ 92 (-46.2%)

Mutual labels: speech-recognition

Dla

Deep learning for audio processing

Stars: ✭ 142 (-16.96%)

Mutual labels: speech-recognition

Sounder

An intent recognizing algorithm to predict the intent of a given text.

Stars: ✭ 118 (-30.99%)

Mutual labels: speech-recognition

Py Kaldi Asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Stars: ✭ 156 (-8.77%)

Mutual labels: speech-recognition

Holobot

HoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.

Stars: ✭ 114 (-33.33%)

Mutual labels: speech-recognition

Go Astideepspeech

Golang bindings for Mozilla's DeepSpeech speech-to-text library

Stars: ✭ 137 (-19.88%)

Mutual labels: speech-recognition

Kontinuousspeechrecognizer

A Kotlin Speech Recognizer that runs continuously and is triggered with an activation keyword

Stars: ✭ 113 (-33.92%)

Mutual labels: speech-recognition

Hey Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Stars: ✭ 161 (-5.85%)

Mutual labels: speech-recognition

Kalliope

Kalliope is a framework that will help you to create your own personal assistant.

Stars: ✭ 1,509 (+782.46%)

Mutual labels: speech-recognition

Awesome Ai Services

An overview of the AI-as-a-service landscape

Stars: ✭ 133 (-22.22%)

Mutual labels: speech-recognition

Deepspeechrecognition

A Chinese deep speech recognition system, including a deep-learning-based acoustic model and a deep-learning-based language model

Stars: ✭ 1,421 (+730.99%)

Mutual labels: speech-recognition

Speech To Text Russian

A project for Russian speech recognition based on pykaldi.

Stars: ✭ 151 (-11.7%)

Mutual labels: speech-recognition

E2e Asr

PyTorch Implementations for End-to-End Automatic Speech Recognition

Stars: ✭ 106 (-38.01%)

Mutual labels: speech-recognition

Persephone

A tool for automatic phoneme transcription

Stars: ✭ 130 (-23.98%)

Mutual labels: speech-recognition

Bigcidian

Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.

Stars: ✭ 99 (-42.11%)

Mutual labels: speech-recognition

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+1126.32%)

Mutual labels: speech-recognition

Ios ml

List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.

Stars: ✭ 1,409 (+723.98%)

Mutual labels: speech-recognition

Alan Sdk Pcf

Alan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.

Stars: ✭ 128 (-25.15%)

Mutual labels: speech-recognition

Kaldi Gop

Computes the GMM-based Goodness of Pronunciation (GOP). Based on Kaldi.

Stars: ✭ 104 (-39.18%)

Mutual labels: speech-recognition

Awesome Speech Recognition Speech Synthesis Papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+1119.3%)

Mutual labels: speech-recognition

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (-39.77%)

Mutual labels: speech-recognition

Tensorflow Ctc Speech Recognition

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).

Stars: ✭ 127 (-25.73%)

Mutual labels: speech-recognition

Openseq2seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP