demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (-87.36%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+28.74%)
Speechpy💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
Stars: ✭ 833 (+378.74%)
NLP ToolkitLibrary of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (-47.13%)
speech-to-text-code-patternReact app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-78.74%)
popartPoplar Advanced Runtime for the IPU
Stars: ✭ 62 (-64.37%)
MivisionxMIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.
Stars: ✭ 100 (-42.53%)
EfficientIR人工智障本地图片检索工具 | An EfficientNet based image retrieval tool
Stars: ✭ 64 (-63.22%)
Multi Model ServerMulti Model Server is a tool for serving neural net models for inference
Stars: ✭ 770 (+342.53%)
learning invariances in speech recognitionIn this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset…
Stars: ✭ 15 (-91.38%)
BiglittlenetOfficial repository for Big-Little Net
Stars: ✭ 57 (-67.24%)
sparsifyEasy-to-use UI for automatically sparsifying neural networks and creating sparsification recipes for better inference performance and a smaller footprint
Stars: ✭ 138 (-20.69%)
AudiomatePython library for handling audio datasets.
Stars: ✭ 99 (-43.1%)
Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+3472.41%)
Keras KaldiKeras Interface for Kaldi ASR
Stars: ✭ 124 (-28.74%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-52.87%)
AdaptAdapt Intent Parser
Stars: ✭ 690 (+296.55%)
deep avsrA PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (-40.23%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-84.48%)
Wav2letterFacebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 5,907 (+3294.83%)
cobraOn-device voice activity detection (VAD) powered by deep learning.
Stars: ✭ 76 (-56.32%)
NngenNNgen: A Fully-Customizable Hardware Synthesis Compiler for Deep Neural Network
Stars: ✭ 149 (-14.37%)
Iflytek awaken asruse iflytek's technology to realize awaken and order recognition
Stars: ✭ 53 (-69.54%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-29.31%)
torchainWIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
Stars: ✭ 20 (-88.51%)
Ai Study人工智能学习资料超全整理,包含机器学习基础ML、深度学习基础DL、计算机视觉CV、自然语言处理NLP、推荐系统、语音识别、图神经网路、算法工程师面试题
Stars: ✭ 93 (-46.55%)
Unity live captionUse Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Stars: ✭ 26 (-85.06%)
Speech recognitionSpeech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+3347.7%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+17.82%)
syn-speech-samplesAn application that demostrate the usage of Syn.Speech library for Speech Recognition
Stars: ✭ 24 (-86.21%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+263.79%)
tractjsRun ONNX and TensorFlow inference in the browser.
Stars: ✭ 67 (-61.49%)
KtspeechcrawlerAutomatically constructing corpus for automatic speech recognition from YouTube videos
Stars: ✭ 92 (-47.13%)
pytorch audioaudio processing module for pytorch:stft, istft
Stars: ✭ 33 (-81.03%)
VadVoice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (+257.47%)
VoiceDictation迅飞 语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息,让机器能够“听懂”人类语言,相当于给机器安装上“耳朵”,使其具备“能听”的功能。
Stars: ✭ 36 (-79.31%)
MmdnnMMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, PyTorch Onnx and CoreML.
Stars: ✭ 5,472 (+3044.83%)
Transformer-TransducerPyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
Stars: ✭ 61 (-64.94%)
Deep Learning DrizzleDrench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Stars: ✭ 9,717 (+5484.48%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+211.49%)
Recording-BotA bot built to record and transcribe audio fragments from Discord.
Stars: ✭ 22 (-87.36%)
mtomoMultiple types of NN model optimization environments. It is possible to directly access the host PC GUI and the camera to verify the operation. Intel iHD GPU (iGPU) support. NVIDIA GPU (dGPU) support.
Stars: ✭ 24 (-86.21%)
Pytorch onnx tensorrtA tutorial about how to build a TensorRT Engine from a PyTorch Model with the help of ONNX
Stars: ✭ 122 (-29.89%)
Onnx tflite yolov3A Conversion tool to convert YOLO v3 Darknet weights to TF Lite model (YOLO v3 PyTorch > ONNX > TensorFlow > TF Lite), and to TensorRT (YOLO v3 Pytorch > ONNX > TensorRT).
Stars: ✭ 52 (-70.11%)
StageMateStageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.
Stars: ✭ 60 (-65.52%)
dropclass speakerDropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Stars: ✭ 20 (-88.51%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+205.75%)
NaomiThe Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (-1.72%)
Ctc pytorchCTC end -to-end ASR for timit and 863 corpus.
Stars: ✭ 161 (-7.47%)
ClovacallClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)
Stars: ✭ 151 (-13.22%)