SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+1525.53%)
SurfboardNovoic's audio feature extraction library
Stars: ✭ 318 (+576.6%)
IresnetImproved Residual Networks (https://arxiv.org/pdf/2004.04989.pdf)
Stars: ✭ 163 (+246.81%)
NonautoreggenprogressTracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (+151.06%)
DtlnTensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
Stars: ✭ 147 (+212.77%)
PyconvPyramidal Convolution: Rethinking Convolutional Neural Networks for Visual Recognition (https://arxiv.org/pdf/2006.11538.pdf)
Stars: ✭ 231 (+391.49%)
Transfer Learning SuiteTransfer Learning Suite in Keras. Perform transfer learning using any built-in Keras image classification model easily!
Stars: ✭ 212 (+351.06%)
Image classifierCNN image classifier implemented in Keras Notebook 🖼️.
Stars: ✭ 139 (+195.74%)
Wav2letterSpeech Recognition model based off of FAIR research paper built using Pytorch.
Stars: ✭ 78 (+65.96%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-42.55%)
NmtpytorchSequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (+734.04%)
DeepfaceDeep Learning Models for Face Detection/Recognition/Alignments, implemented in Tensorflow
Stars: ✭ 409 (+770.21%)
Multi Class Text Classification CnnClassify Kaggle Consumer Finance Complaints into 11 classes. Build the model with CNN (Convolutional Neural Network) and Word Embeddings on Tensorflow.
Stars: ✭ 410 (+772.34%)
Fast SrganA Fast Deep Learning Model to Upsample Low Resolution Videos to High Resolution at 30fps
Stars: ✭ 417 (+787.23%)
PbaEfficient Learning of Augmentation Policy Schedules
Stars: ✭ 461 (+880.85%)
NumpycnnBuilding Convolutional Neural Networks From Scratch using NumPy
Stars: ✭ 436 (+827.66%)
QC++ Library for Audio Digital Signal Processing
Stars: ✭ 481 (+923.4%)
Cortex M KwsCortex M KWS example with Tengine Lite.
Stars: ✭ 45 (-4.26%)
Trending Deep LearningTop 100 trending deep learning repositories sorted by the number of stars gained on a specific day.
Stars: ✭ 543 (+1055.32%)
ChromaprintC library for generating audio fingerprints used by AcoustID
Stars: ✭ 553 (+1076.6%)
Wavesurfer.jsNavigable waveform built on Web Audio and Canvas
Stars: ✭ 5,905 (+12463.83%)
Avsr Deep SpeechGoogle Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab
Stars: ✭ 43 (-8.51%)
Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+768.09%)
Asrt speechrecognitionA Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (+10417.02%)
Tf Pose EstimationDeep Pose Estimation implemented using Tensorflow with Custom Architectures for fast inference.
Stars: ✭ 3,856 (+8104.26%)
Food Recipe Cnnfood image to recipe with deep convolutional neural networks.
Stars: ✭ 448 (+853.19%)
UspeechSpeech recognition toolkit for the arduino
Stars: ✭ 448 (+853.19%)
Regl CnnDigit recognition with Convolutional Neural Networks in WebGL
Stars: ✭ 490 (+942.55%)
Auto EditorAuto-Editor: Effort free video editing!
Stars: ✭ 382 (+712.77%)
Computervision RecipesBest Practices, code samples, and documentation for Computer Vision.
Stars: ✭ 8,214 (+17376.6%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+1053.19%)
SoundfingerprintingOpen source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.
Stars: ✭ 554 (+1078.72%)
Music recommenderMusic recommender using deep learning with Keras and TensorFlow
Stars: ✭ 528 (+1023.4%)
Beethoven🎸 A maestro of pitch detection.
Stars: ✭ 601 (+1178.72%)
WenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (+1212.77%)
YannThis toolbox is support material for the book on CNN (http://www.convolution.network).
Stars: ✭ 41 (-12.77%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+1010.64%)
Audio Visualizer Android🎵 [Android Library] A light-weight and easy-to-use Audio Visualizer for Android.
Stars: ✭ 581 (+1136.17%)
Deeplearning.aideeplearning.ai , By Andrew Ng, All video link
Stars: ✭ 625 (+1229.79%)
Libreasr💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (+1246.81%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (+1470.21%)
FfmediaelementFFME: The Advanced WPF MediaElement (based on FFmpeg)
Stars: ✭ 733 (+1459.57%)
Hardhat DetectorA convolutional neural network implementation of a script that detects whether an individual is wearing a hardhat or not.
Stars: ✭ 41 (-12.77%)
TorchioMedical image preprocessing and augmentation toolkit for deep learning
Stars: ✭ 708 (+1406.38%)
AudinoOpen source audio annotation tool for humans™
Stars: ✭ 740 (+1474.47%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+1508.51%)
Svhn CnnGoogle Street View House Number(SVHN) Dataset, and classifying them through CNN
Stars: ✭ 44 (-6.38%)
Tf cnnvisCNN visualization tool in TensorFlow
Stars: ✭ 769 (+1536.17%)
PnccA implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-14.89%)
Awesome DiarizationA curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (+1331.91%)
MltMLT Multimedia Framework
Stars: ✭ 836 (+1678.72%)
EspressoEspresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (+1619.15%)