Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+27.71%)

Mutual labels: speech, speech-processing

Werk

High-throughput / low-latency C++ application framework

Stars: ✭ 30 (-87.01%)

Mutual labels: real-time, low-latency

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (-3.03%)

Mutual labels: speech, speech-processing

Xpedite

A non-sampling profiler purpose built to measure and optimize performance of ultra low latency/real time systems

Stars: ✭ 89 (-61.47%)

Mutual labels: real-time, low-latency

ripple

Simple shared surface streaming application

Stars: ✭ 17 (-92.64%)

Mutual labels: real-time, low-latency

A Convolutional Recurrent Neural Network For Real Time Speech Enhancement

A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorch

Stars: ✭ 123 (-46.75%)

Mutual labels: speech-processing, real-time

LIUM

Scripts for LIUM SpkDiarization tools

Stars: ✭ 28 (-87.88%)

Mutual labels: speech, speech-processing

Restoring-Extremely-Dark-Images-In-Real-Time

The project is the official implementation of our CVPR 2021 paper, "Restoring Extremely Dark Images in Real Time"

Stars: ✭ 79 (-65.8%)

Mutual labels: real-time, low-latency

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Stars: ✭ 88 (-61.9%)

Mutual labels: speech, speech-processing

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+4.76%)

Mutual labels: speech, speech-processing

Speech Denoising Wavenet

A neural network for end-to-end speech denoising

Stars: ✭ 516 (+123.38%)

Mutual labels: speech, speech-processing

python-rtmixer

🎤 Reliable low-latency audio playback and recording with Python 🐍

Stars: ✭ 44 (-80.95%)

Mutual labels: real-time, low-latency

Tfg Voice Conversion

Deep Learning-based Voice Conversion system

Stars: ✭ 115 (-50.22%)

Mutual labels: speech, speech-processing

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+733.77%)

Mutual labels: speech, speech-processing

Esp8266sam

Speech synthesis for ESP8266 using S.A.M. port

Stars: ✭ 199 (-13.85%)

Mutual labels: speech

Fixed point

C++ Binary Fixed-Point Arithmetic

Stars: ✭ 199 (-13.85%)

Mutual labels: gcc

Python Socketio

Python Socket.IO server and client

Stars: ✭ 2,655 (+1049.35%)

Mutual labels: low-latency

Voronoi image manipulation

A system independent tool for interactive image manipulation with Voronoi and Delaunay data structures.

Stars: ✭ 196 (-15.15%)

Mutual labels: real-time

Autobahn Python

WebSocket and WAMP in Python for Twisted and asyncio

Stars: ✭ 2,305 (+897.84%)

Mutual labels: real-time

Yave

Yet Another Vulkan Engine

Stars: ✭ 211 (-8.66%)

Mutual labels: real-time

Omniscidb

OmniSciDB (formerly MapD Core)

Stars: ✭ 2,601 (+1025.97%)

Mutual labels: real-time

Caffe2 Ios

Caffe2 on iOS Real-time Demo. Test with Your Own Model and Photos.

Stars: ✭ 221 (-4.33%)

Mutual labels: real-time

Laravel Echo Server

Socket.io server for Laravel Echo

Stars: ✭ 2,487 (+976.62%)

Mutual labels: real-time

Nginx Http Echo Module

A simple Nginx echo module

Stars: ✭ 192 (-16.88%)

Mutual labels: gcc

Mocapnet

We present MocapNET2, a real-time method that estimates the 3D human pose directly in the popular Bio Vision Hierarchy (BVH) format, given estimations of the 2D body joints originating from monocular color images. Our contributions include: (a) A novel and compact 2D pose NSRM representation. (b) A human body orientation classifier and an ensemble of orientation-tuned neural networks that regress the 3D human pose by also allowing for the decomposition of the body to an upper and lower kinematic hierarchy. This permits the recovery of the human pose even in the case of significant occlusions. (c) An efficient Inverse Kinematics solver that refines the neural-network-based solution providing 3D human pose estimations that are consistent with the limb sizes of a target person (if known). All the above yield a 33% accuracy improvement on the Human 3.6 Million (H3.6M) dataset compared to the baseline method (MocapNET) while maintaining real-time performance (70 fps in CPU-only execution).

Stars: ✭ 194 (-16.02%)

Mutual labels: real-time

Rtm3d

Unofficial PyTorch implementation of "RTM3D: Real-time Monocular 3D Detection from Object Keypoints for Autonomous Driving" (ECCV 2020)

Stars: ✭ 211 (-8.66%)

Mutual labels: real-time

Lingvo

Stars: ✭ 2,361 (+922.08%)

Mutual labels: speech

Source separation

Deep learning based speech source separation using Pytorch

Stars: ✭ 226 (-2.16%)

Mutual labels: speech

Speechtotext Websockets Javascript

SDK & Sample to do speech recognition using websockets in Javascript

Stars: ✭ 191 (-17.32%)

Mutual labels: speech

Cmake Scripts

A selection of useful scripts for use in CMake projects, include code coverage, sanitizers, and dependency graph generation.

Stars: ✭ 202 (-12.55%)

Mutual labels: gcc

Mwengine

Audio engine and DSP for Android, written in C++ providing low latency performance in a musical context, supporting both OpenSL and AAudio.

Stars: ✭ 190 (-17.75%)

Mutual labels: low-latency

Emotion Classification From Audio Files

Understanding emotions from audio files using neural networks and multiple datasets.

Stars: ✭ 189 (-18.18%)

Mutual labels: speech

Fairmot

[IJCV-2021] FairMOT: On the Fairness of Detection and Re-Identification in Multi-Object Tracking

Stars: ✭ 3,194 (+1282.68%)

Mutual labels: real-time

Swellrt

SwellRT main project. Server, JavaScript and Java clients

Stars: ✭ 205 (-11.26%)

Mutual labels: real-time

A2j

Code for paper "A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation from a Single Depth Image". ICCV2019

Stars: ✭ 190 (-17.75%)

Mutual labels: real-time

Chatify Demo

Chatify Laravel Package Demo application

Stars: ✭ 189 (-18.18%)

Mutual labels: real-time

Feathers

A framework for real-time applications and REST APIs with JavaScript and TypeScript

Stars: ✭ 13,761 (+5857.14%)

Mutual labels: real-time

Depression Detect

Predicting depression from acoustic features of speech using a Convolutional Neural Network.

Stars: ✭ 187 (-19.05%)

Mutual labels: speech

Tosdatabridge

A collection of resources for pulling real-time streaming data off of TDAmeritrade's ThinkOrSwim(TOS) platform; providing C, C++, Java and Python interfaces.

Stars: ✭ 229 (-0.87%)

Mutual labels: real-time

Deeppruner

DeepPruner: Learning Efficient Stereo Matching via Differentiable PatchMatch (ICCV 2019)

Stars: ✭ 226 (-2.16%)

Mutual labels: real-time

Speech Denoiser

A speech denoise lv2 plugin based on RNNoise library

Stars: ✭ 220 (-4.76%)

Mutual labels: speech

Pytorch realtime multi Person pose estimation

Pytorch version of Realtime Multi-Person Pose Estimation project