MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.

Stars: ✭ 100 (-42.53%)

Mutual labels: onnx

EfficientIR

人工智障本地图片检索工具 | An EfficientNet based image retrieval tool

Stars: ✭ 64 (-63.22%)

Mutual labels: onnx

Multi Model Server

Multi Model Server is a tool for serving neural net models for inference

Stars: ✭ 770 (+342.53%)

Mutual labels: onnx

learning invariances in speech recognition

In this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset…

Stars: ✭ 15 (-91.38%)

Mutual labels: speech-recognition

KaldiBasedSpeakerVerification

Kaldi based speaker verification

Stars: ✭ 43 (-75.29%)

Mutual labels: kaldi

Biglittlenet

Official repository for Big-Little Net

Stars: ✭ 57 (-67.24%)

Mutual labels: speech-recognition

sparsify

Easy-to-use UI for automatically sparsifying neural networks and creating sparsification recipes for better inference performance and a smaller footprint

Stars: ✭ 138 (-20.69%)

Mutual labels: onnx

Audiomate

Python library for handling audio datasets.

Stars: ✭ 99 (-43.1%)

Mutual labels: speech-recognition

yolov5 tensorrt int8 tools

tensorrt int8 量化yolov5 onnx模型

Stars: ✭ 105 (-39.66%)

Mutual labels: onnx

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+3472.41%)

Mutual labels: speech-recognition

Keras Kaldi

Keras Interface for Kaldi ASR

Stars: ✭ 124 (-28.74%)

Mutual labels: speech-recognition

kim-voice-assistant

Kim，你的私人语音助理。

Stars: ✭ 70 (-59.77%)

Mutual labels: speech-recognition

deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (-52.87%)

Mutual labels: speech-recognition

Adapt

Adapt Intent Parser

Stars: ✭ 690 (+296.55%)

Mutual labels: speech-recognition

deep avsr

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

Stars: ✭ 104 (-40.23%)

Mutual labels: speech-recognition

Mongolian Speech Recognition

Mongolian speech recognition with PyTorch

Stars: ✭ 97 (-44.25%)

Mutual labels: speech-recognition

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-84.48%)

Mutual labels: speech-recognition

Wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Stars: ✭ 5,907 (+3294.83%)

Mutual labels: speech-recognition

cobra

On-device voice activity detection (VAD) powered by deep learning.

Stars: ✭ 76 (-56.32%)

Mutual labels: speech-recognition

Nngen

NNgen: A Fully-Customizable Hardware Synthesis Compiler for Deep Neural Network

Stars: ✭ 149 (-14.37%)

Mutual labels: onnx

Python Speech recognition

A simple example for use speech recognition baidu api with python.

Stars: ✭ 106 (-39.08%)

Mutual labels: speech-recognition

Iflytek awaken asr

use iflytek's technology to realize awaken and order recognition

Stars: ✭ 53 (-69.54%)

Mutual labels: speech-recognition

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (-29.31%)

Mutual labels: speech-recognition

torchain

WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)

Stars: ✭ 20 (-88.51%)

Mutual labels: kaldi

Ai Study

人工智能学习资料超全整理，包含机器学习基础ML、深度学习基础DL、计算机视觉CV、自然语言处理NLP、推荐系统、语音识别、图神经网路、算法工程师面试题

Stars: ✭ 93 (-46.55%)

Mutual labels: speech-recognition

Unity live caption

Use Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!

Stars: ✭ 26 (-85.06%)

Mutual labels: speech-recognition

Speech recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Stars: ✭ 5,999 (+3347.7%)

Mutual labels: speech-recognition

Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Stars: ✭ 205 (+17.82%)

Mutual labels: speech-recognition

syn-speech-samples

An application that demostrate the usage of Syn.Speech library for Speech Recognition

Stars: ✭ 24 (-86.21%)

Mutual labels: speech-recognition

Speech Emotion Analyzer

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

Stars: ✭ 633 (+263.79%)

Mutual labels: speech-recognition

tractjs

Run ONNX and TensorFlow inference in the browser.

Stars: ✭ 67 (-61.49%)

Mutual labels: onnx

Ktspeechcrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

Stars: ✭ 92 (-47.13%)

Mutual labels: speech-recognition

pytorch audio

audio processing module for pytorch:stft, istft

Stars: ✭ 33 (-81.03%)

Mutual labels: speech-recognition

Vad

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stars: ✭ 622 (+257.47%)

Mutual labels: speech-recognition

Chinese-automatic-speech-recognition

Chinese speech recognition

Stars: ✭ 147 (-15.52%)

Mutual labels: speech-recognition

Interspeech2019 Tutorial

INTERSPEECH 2019 Tutorial Materials

Stars: ✭ 160 (-8.05%)

Mutual labels: speech-recognition

VoiceDictation

迅飞语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息，让机器能够“听懂”人类语言，相当于给机器安装上“耳朵”，使其具备“能听”的功能。

Stars: ✭ 36 (-79.31%)

Mutual labels: speech-recognition

Mmdnn

MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, PyTorch Onnx and CoreML.

Stars: ✭ 5,472 (+3044.83%)

Mutual labels: onnx

Transformer-Transducer

PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)

Stars: ✭ 61 (-64.94%)

Mutual labels: speech-recognition

Deep Learning Drizzle

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

Stars: ✭ 9,717 (+5484.48%)

Mutual labels: speech-recognition

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+211.49%)

Mutual labels: speech-recognition

Recording-Bot

A bot built to record and transcribe audio fragments from Discord.

Stars: ✭ 22 (-87.36%)

Mutual labels: speech-recognition

mtomo

Multiple types of NN model optimization environments. It is possible to directly access the host PC GUI and the camera to verify the operation. Intel iHD GPU (iGPU) support. NVIDIA GPU (dGPU) support.

Stars: ✭ 24 (-86.21%)

Mutual labels: onnx

Pytorch onnx tensorrt

A tutorial about how to build a TensorRT Engine from a PyTorch Model with the help of ONNX

Stars: ✭ 122 (-29.89%)

Mutual labels: onnx

Onnx tflite yolov3

A Conversion tool to convert YOLO v3 Darknet weights to TF Lite model (YOLO v3 PyTorch > ONNX > TensorFlow > TF Lite), and to TensorRT (YOLO v3 Pytorch > ONNX > TensorRT).

Stars: ✭ 52 (-70.11%)

Mutual labels: onnx

StageMate

StageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.

Stars: ✭ 60 (-65.52%)

Mutual labels: speech-recognition

dropclass speaker

DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020

Stars: ✭ 20 (-88.51%)

Mutual labels: kaldi

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (+205.75%)

Mutual labels: speech-recognition

Cordova Plugin Speechrecognition

🎤 Cordova Plugin for Speech Recognition