Vq Vae SpeechPyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Stars: ✭ 187 (+62.61%)
SeganSpeech Enhancement Generative Adversarial Network in TensorFlow
Stars: ✭ 661 (+474.78%)
ttslearnttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (+37.39%)
LIUMScripts for LIUM SpkDiarization tools
Stars: ✭ 28 (-75.65%)
PysptkA python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (+158.26%)
IMS-ToucanText-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+156.52%)
Awesome Speech EnhancementA tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Stars: ✭ 257 (+123.48%)
Andrew Ng NotesThis is Andrew NG Coursera Handwritten Notes.
Stars: ✭ 180 (+56.52%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+94.78%)
Gcc NmfReal-time GCC-NMF Blind Speech Separation and Enhancement
Stars: ✭ 231 (+100.87%)
Yolo Tf2yolo(all versions) implementation in keras and tensorflow 2.4
Stars: ✭ 695 (+504.35%)
Speechbrain.github.ioThe SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+110.43%)
ShifterPitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (-80.87%)
Pytorch Kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+1723.48%)
Sednndeep learning based speech enhancement using keras or pytorch, make it easy to use
Stars: ✭ 288 (+150.43%)
GcommandspytorchConvNets for Audio Recognition using Google Commands Dataset
Stars: ✭ 65 (-43.48%)
Tutorial separationThis repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
Stars: ✭ 151 (+31.3%)
datasets🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+11960.87%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+450.43%)
hifigan-denoiserHiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-23.48%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+557.39%)
DareblopyData Reading Blocks for Python
Stars: ✭ 82 (-28.7%)
Numpy CnNumPy官方中文文档(完整版)
Stars: ✭ 1,570 (+1265.22%)
GpndGenerative Probabilistic Novelty Detection with Adversarial Autoencoders
Stars: ✭ 112 (-2.61%)
OpentpodOpen Toolkit for Painless Object Detection
Stars: ✭ 106 (-7.83%)
Slick DnnTiny and elegant deep learning library
Stars: ✭ 114 (-0.87%)
DurianImplementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (-3.48%)
JlmA fast LSTM Language Model for large vocabulary language like Japanese and Chinese
Stars: ✭ 105 (-8.7%)
Planematch[ECCV'18 Oral] PlaneMatch: Patch Coplanarity Prediction for Robust RGB-D Reconstruction
Stars: ✭ 105 (-8.7%)
NumbaggFast N-dimensional aggregation functions with Numba
Stars: ✭ 104 (-9.57%)
HolobotHoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.
Stars: ✭ 114 (-0.87%)
Deep architectA general, modular, and programmable architecture search framework
Stars: ✭ 110 (-4.35%)
DeltaDELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+1186.09%)
Torch DreamsMaking neural networks more interpretable, for research and art 🔎 💻 :brain: 🎨
Stars: ✭ 102 (-11.3%)
BoxdetectionA Box detection algorithm for any image containing boxes.
Stars: ✭ 104 (-9.57%)
DeepcpgDeep neural networks for predicting CpG methylation
Stars: ✭ 113 (-1.74%)
Video To Retail PlatformAn intelligent multimodal-learning based system for video, product and ads analysis. Based on the system, people can build a lot of downstream applications such as product recommendation, video retrieval, etc.
Stars: ✭ 108 (-6.09%)
PytorchnlpbookCode and data accompanying Natural Language Processing with PyTorch published by O'Reilly Media https://nlproc.info
Stars: ✭ 1,390 (+1108.7%)
100 Pandas Puzzles100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
Stars: ✭ 1,382 (+1101.74%)
PytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Stars: ✭ 52,811 (+45822.61%)
PythonstudyPython related technologies used in work: crawler, data analysis, timing tasks, RPC, page parsing, decorator, built-in functions, Python objects, multi-threading, multi-process, asynchronous, redis, mongodb, mysql, openstack, etc.
Stars: ✭ 103 (-10.43%)
Crfasrnn pytorchCRF-RNN PyTorch version http://crfasrnn.torr.vision
Stars: ✭ 102 (-11.3%)
BitwrkBitcoin-fueled Peer-to-Peer Blender Rendering (and more)
Stars: ✭ 114 (-0.87%)
Adversarialdnn PlaygroundVizSec17: Web-based visualization tool for adversarial machine learning / LiveDemo
Stars: ✭ 113 (-1.74%)
FaceswapDeepfakes Software For All
Stars: ✭ 39,911 (+34605.22%)
ModelsDLTK Model Zoo
Stars: ✭ 101 (-12.17%)