A repository that shares tuning results of trained models generated by TensorFlow / Keras. Post-training quantization (Weight Quantization, Integer Quantization, Full Integer Quantization, Float16 Quantization), Quantization-aware training. TensorFlow Lite. OpenVINO. CoreML. TensorFlow.js. TF-TRT. MediaPipe. ONNX. [.tflite,.h5,.pb,saved_model,tfjs,tftrt,mlmodel,.xml/.bin, .onnx]

Stars: ✭ 634 (+939.34%)

Mutual labels: caffe, tensorflow-models

Dali

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

Stars: ✭ 3,624 (+5840.98%)

Mutual labels: mxnet, audio-processing

Segmentation models

Segmentation models with pretrained backbones. Keras and TensorFlow Keras.

Stars: ✭ 3,575 (+5760.66%)

Mutual labels: keras-tensorflow, keras-models

Deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Stars: ✭ 18,680 (+30522.95%)

Mutual labels: speech-recognition, speech-to-text

Phonetisaurus

Phonetisaurus G2P

Stars: ✭ 277 (+354.1%)

Mutual labels: speech-recognition, speech-to-text

Musig

A shazam like tool to store songs fingerprints and retrieve them

Stars: ✭ 388 (+536.07%)

Mutual labels: audio, audio-processing

Cheetah

On-device streaming speech-to-text engine powered by deep learning

Stars: ✭ 383 (+527.87%)

Mutual labels: speech-recognition, speech-to-text

Auto Editor

Auto-Editor: Effort free video editing!

Stars: ✭ 382 (+526.23%)

Mutual labels: audio, audio-processing

Deep Learning Model Convertor

The convertor/conversion of deep learning models for different deep learning frameworks/softwares.

Stars: ✭ 3,044 (+4890.16%)

Mutual labels: caffe, mxnet

Speech Demo

语音api示例

Stars: ✭ 454 (+644.26%)

Mutual labels: speech-recognition, speech-to-text

Voice Overlay Ios

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 440 (+621.31%)

Mutual labels: speech-recognition, speech-to-text

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+703.28%)

Mutual labels: speech-recognition, speech-to-text

Asrt speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Stars: ✭ 4,943 (+8003.28%)

Mutual labels: speech-recognition, speech-to-text

Chromaprint

C library for generating audio fingerprints used by AcoustID

Stars: ✭ 553 (+806.56%)

Mutual labels: audio, audio-processing

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (+772.13%)

Mutual labels: speech-recognition, speech-to-text

Audio Visualizer Android

🎵 [Android Library] A light-weight and easy-to-use Audio Visualizer for Android.

Stars: ✭ 581 (+852.46%)

Mutual labels: audio, audio-processing

Resnetcam Keras

Keras implementation of a ResNet-CAM model

Stars: ✭ 269 (+340.98%)

Mutual labels: keras-tensorflow, keras-models

Adapt

Adapt Intent Parser

Stars: ✭ 690 (+1031.15%)

Mutual labels: speech-recognition, speech-to-text

Ffmediaelement

FFME: The Advanced WPF MediaElement (based on FFmpeg)

Stars: ✭ 733 (+1101.64%)

Mutual labels: audio, audio-processing

Deepo

Setup and customize deep learning environment in seconds.

Stars: ✭ 6,145 (+9973.77%)

Mutual labels: caffe, mxnet

Mmdnn

MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g. model conversion and visualization. Convert models between Caffe, Keras, MXNet, Tensorflow, CNTK, PyTorch Onnx and CoreML.

Stars: ✭ 5,472 (+8870.49%)

Mutual labels: caffe, mxnet

Eesen

The official repository of the Eesen project

Stars: ✭ 738 (+1109.84%)

Mutual labels: speech-recognition, speech-to-text

Machine Learning Curriculum

💻 Make machines learn so that you don't have to struggle to program them; The ultimate list

Stars: ✭ 761 (+1147.54%)

Mutual labels: caffe, mxnet

Beethoven

🎸 A maestro of pitch detection.

Stars: ✭ 601 (+885.25%)

Mutual labels: audio, audio-processing

Face Mask Detection

Face Mask Detection system based on computer vision and deep learning using OpenCV and Tensorflow/Keras

Stars: ✭ 774 (+1168.85%)

Mutual labels: caffe, keras-tensorflow

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+10090.16%)

Mutual labels: speech-recognition, speech-to-text

Stephanie Va

Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.

Stars: ✭ 772 (+1165.57%)

Mutual labels: speech-recognition, speech-to-text

Kur

Descriptive Deep Learning

Stars: ✭ 811 (+1229.51%)

Mutual labels: speech-recognition, speech-to-text

Steppy Toolkit

Curated set of transformers that make your work with steppy faster and more effective 🔭

Stars: ✭ 21 (-65.57%)

Mutual labels: tensorflow-models, keras-models

Mxnet2caffe

convert model from mxnet to caffe without lossing precision

Stars: ✭ 20 (-67.21%)

Mutual labels: caffe, mxnet

Guitard

Node based multi effects audio processor

Stars: ✭ 31 (-49.18%)

Mutual labels: audio, audio-processing

Soloud

Free, easy, portable audio engine for games

Stars: ✭ 1,048 (+1618.03%)

Mutual labels: speech-to-text, audio

Giada

Your Hardcore Loop Machine.

Stars: ✭ 903 (+1380.33%)

Mutual labels: audio, audio-processing

Kfr

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

Stars: ✭ 985 (+1514.75%)

Mutual labels: audio, audio-processing

Discordspeechbot

A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.

Stars: ✭ 35 (-42.62%)

Mutual labels: speech-recognition, speech-to-text

1-60 of 1764 similar projects

›

next*5