deadsfuDead-simple WebRTC broadcasting. From the browser, or your application. Cloud-native and scalable.
Stars: ✭ 23 (-90.04%)
Vq Vae SpeechPyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Stars: ✭ 187 (-19.05%)
PysptkA python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (+28.57%)
ttslearnttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (-31.6%)
LinuxXanMod: Linux kernel source code tree
Stars: ✭ 310 (+34.2%)
ShifterPitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (-90.48%)
IMS-ToucanText-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+27.71%)
WerkHigh-throughput / low-latency C++ application framework
Stars: ✭ 30 (-87.01%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (-3.03%)
XpediteA non-sampling profiler purpose built to measure and optimize performance of ultra low latency/real time systems
Stars: ✭ 89 (-61.47%)
rippleSimple shared surface streaming application
Stars: ✭ 17 (-92.64%)
LIUMScripts for LIUM SpkDiarization tools
Stars: ✭ 28 (-87.88%)
hifigan-denoiserHiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-61.9%)
Speechbrain.github.ioThe SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+4.76%)
python-rtmixer🎤 Reliable low-latency audio playback and recording with Python 🐍
Stars: ✭ 44 (-80.95%)
Esp8266samSpeech synthesis for ESP8266 using S.A.M. port
Stars: ✭ 199 (-13.85%)
Fixed pointC++ Binary Fixed-Point Arithmetic
Stars: ✭ 199 (-13.85%)
Python SocketioPython Socket.IO server and client
Stars: ✭ 2,655 (+1049.35%)
Voronoi image manipulationA system independent tool for interactive image manipulation with Voronoi and Delaunay data structures.
Stars: ✭ 196 (-15.15%)
Autobahn PythonWebSocket and WAMP in Python for Twisted and asyncio
Stars: ✭ 2,305 (+897.84%)
YaveYet Another Vulkan Engine
Stars: ✭ 211 (-8.66%)
OmniscidbOmniSciDB (formerly MapD Core)
Stars: ✭ 2,601 (+1025.97%)
Caffe2 IosCaffe2 on iOS Real-time Demo. Test with Your Own Model and Photos.
Stars: ✭ 221 (-4.33%)
MocapnetWe present MocapNET2, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocular color images. Our contributions include: (a) A novel and compact 2D pose NSRM representation. (b) A human body orientation classifier and an ensemble of orientation-tuned neural networks that regress the 3D human pose by also allowing for the decomposition of the body to an upper and lower kinematic hierarchy. This permits the recovery of the human pose even in the case of significant occlusions. (c) An efficient Inverse Kinematics solver that refines the neural-network-based solution providing 3D human pose estimations that are consistent with the limb sizes of a target person (if known). All the above yield a 33% accuracy improvement on the Human 3.6 Million (H3.6M) dataset compared to the baseline method (MocapNET) while maintaining real-time performance (70 fps in CPU-only execution).
Stars: ✭ 194 (-16.02%)
Rtm3dUnofficial PyTorch implementation of "RTM3D: Real-time Monocular 3D Detection from Object Keypoints for Autonomous Driving" (ECCV 2020)
Stars: ✭ 211 (-8.66%)
LingvoLingvo
Stars: ✭ 2,361 (+922.08%)
Source separationDeep learning based speech source separation using Pytorch
Stars: ✭ 226 (-2.16%)
Cmake ScriptsA selection of useful scripts for use in CMake projects, include code coverage, sanitizers, and dependency graph generation.
Stars: ✭ 202 (-12.55%)
MwengineAudio engine and DSP for Android, written in C++ providing low latency performance in a musical context, supporting both OpenSL and AAudio.
Stars: ✭ 190 (-17.75%)
Fairmot[IJCV-2021] FairMOT: On the Fairness of Detection and Re-Identification in Multi-Object Tracking
Stars: ✭ 3,194 (+1282.68%)
SwellrtSwellRT main project. Server, JavaScript and Java clients
Stars: ✭ 205 (-11.26%)
A2jCode for paper "A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation from a Single Depth Image". ICCV2019
Stars: ✭ 190 (-17.75%)
Chatify DemoChatify Laravel Package Demo application
Stars: ✭ 189 (-18.18%)
FeathersA framework for real-time applications and REST APIs with JavaScript and TypeScript
Stars: ✭ 13,761 (+5857.14%)
Depression DetectPredicting depression from acoustic features of speech using a Convolutional Neural Network.
Stars: ✭ 187 (-19.05%)
TosdatabridgeA collection of resources for pulling real-time streaming data off of TDAmeritrade's ThinkOrSwim(TOS) platform; providing C, C++, Java and Python interfaces.
Stars: ✭ 229 (-0.87%)
DeepprunerDeepPruner: Learning Efficient Stereo Matching via Differentiable PatchMatch (ICCV 2019)
Stars: ✭ 226 (-2.16%)
Speech DenoiserA speech denoise lv2 plugin based on RNNoise library
Stars: ✭ 220 (-4.76%)
Centerface.pytorchunofficial version of centerface, which achieves the best balance between speed and accuracy at face detection
Stars: ✭ 187 (-19.05%)
EdgedictWorking online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (-11.26%)
GoaccessGoAccess is a real-time web log analyzer and interactive viewer that runs in a terminal in *nix systems or through your browser.
Stars: ✭ 14,096 (+6002.16%)
HopeSource code of CVPR 2020 paper, "HOPE-Net: A Graph-based Model for Hand-Object Pose Estimation"
Stars: ✭ 184 (-20.35%)
SanityThe Sanity Studio – Collaborate in real-time on structured content
Stars: ✭ 3,007 (+1201.73%)
BtrackA Real-Time Beat Tracker
Stars: ✭ 204 (-11.69%)
McmotReal time one-stage multi-class & multi-object tracking based on anchor-free detection and re-id
Stars: ✭ 181 (-21.65%)
TimitThe DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
Stars: ✭ 202 (-12.55%)