https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.

Stars: ✭ 24 (-46.67%)

Mutual labels: speech

CycleGAN-Models

Models generated by CycleGAN

Stars: ✭ 42 (-6.67%)

Mutual labels: cyclegan

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (+297.78%)

Mutual labels: speech

Generative-Model

Repository for implementation of generative models with Tensorflow 1.x

Stars: ✭ 66 (+46.67%)

Mutual labels: cyclegan

TFGAN

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Stars: ✭ 65 (+44.44%)

Mutual labels: speech

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (+355.56%)

Mutual labels: speech

CycleGAN-gluon-mxnet

this repo attemps to reproduce Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks(CycleGAN) use gluon reimplementation

Stars: ✭ 31 (-31.11%)

Mutual labels: cyclegan

DisCont

Code for the paper "DisCont: Self-Supervised Visual Attribute Disentanglement using Context Vectors".

Stars: ✭ 13 (-71.11%)

Mutual labels: contrastive-learning

eidos-audition

Collection of auditory models.

Stars: ✭ 25 (-44.44%)

Mutual labels: speech

TCE

This repository contains the code implementation used in the paper Temporally Coherent Embeddings for Self-Supervised Video Representation Learning (TCE).

Stars: ✭ 51 (+13.33%)

Mutual labels: contrastive-learning

icassp2019-latex-template

ICASSP 2019 official Latex template

Stars: ✭ 21 (-53.33%)

Mutual labels: speech

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (+97.78%)

Mutual labels: speech

pytorch-pcen

PyTorch reimplementation of per-channel energy normalization for audio.

Stars: ✭ 80 (+77.78%)

Mutual labels: speech

info-nce-pytorch

PyTorch implementation of the InfoNCE loss for self-supervised learning.

Stars: ✭ 160 (+255.56%)

Mutual labels: contrastive-learning

cycleGAN-PyTorch

A clean and lucid implementation of cycleGAN using PyTorch

Stars: ✭ 107 (+137.78%)

Mutual labels: cyclegan

room-impulse-responses

A list of publicly available room impulse response datasets and scripts to download them.

Stars: ✭ 143 (+217.78%)

Mutual labels: speech

RSC-Net

Implementation for "3D human pose, shape and texture from low-resolution images and videos", TPAMI 2021

Stars: ✭ 43 (-4.44%)

Mutual labels: contrastive-learning

anycontrol

Voice control for your websites and applications

Stars: ✭ 53 (+17.78%)

Mutual labels: speech

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (-26.67%)

Mutual labels: speech

opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in the voice stack

Stars: ✭ 21 (-53.33%)

Mutual labels: speech

pix2pix

This project uses a conditional generative adversarial network (cGAN) named Pix2Pix for the Image to image translation task.

Stars: ✭ 28 (-37.78%)

Mutual labels: cyclegan

pytorch-gans

PyTorch implementation of GANs (Generative Adversarial Networks). DCGAN, Pix2Pix, CycleGAN, SRGAN

Stars: ✭ 21 (-53.33%)

Mutual labels: cyclegan

CLMR

Official PyTorch implementation of Contrastive Learning of Musical Representations

Stars: ✭ 216 (+380%)

Mutual labels: contrastive-learning

CCL

PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning

Stars: ✭ 76 (+68.89%)

Mutual labels: contrastive-learning

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-68.89%)

Mutual labels: speech

awesome-efficient-gnn

Code and resources on scalable and efficient Graph Neural Networks

Stars: ✭ 498 (+1006.67%)

Mutual labels: contrastive-learning

ViCC

[WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https://arxiv.org/abs/2106.10137.

Stars: ✭ 33 (-26.67%)

Mutual labels: contrastive-learning

NBSS

The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".

Stars: ✭ 77 (+71.11%)

Mutual labels: speech

awesome-graph-self-supervised-learning-based-recommendation

A curated list of awesome graph & self-supervised-learning-based recommendation.

Stars: ✭ 37 (-17.78%)

Mutual labels: contrastive-learning

cape

Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch

Stars: ✭ 29 (-35.56%)

Mutual labels: speech

Revisiting-Contrastive-SSL

Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021]

Stars: ✭ 81 (+80%)

Mutual labels: contrastive-learning

day2night

Image2Image Translation Research

Stars: ✭ 46 (+2.22%)

Mutual labels: cyclegan

ventib

📈 Ventib records your voice, transcribes it in realtime, and performs speech pattern analysis to give you objective statistics about how you speak.

Stars: ✭ 43 (-4.44%)

Mutual labels: speech

Parametric-Contrastive-Learning

Parametric Contrastive Learning (ICCV2021)

Stars: ✭ 155 (+244.44%)

Mutual labels: contrastive-learning

object-aware-contrastive

Object-aware Contrastive Learning for Debiased Scene Representation (NeurIPS 2021)

Stars: ✭ 44 (-2.22%)

Mutual labels: contrastive-learning

UPIT

A fastai/PyTorch package for unpaired image-to-image translation.

Stars: ✭ 94 (+108.89%)

Mutual labels: cyclegan

txt2speech

Convert text to speech using Google Translate API

Stars: ✭ 38 (-15.56%)

Mutual labels: speech

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-53.33%)

Mutual labels: speech

CLSA

official implemntation for "Contrastive Learning with Stronger Augmentations"

Stars: ✭ 48 (+6.67%)

Mutual labels: contrastive-learning

MediumVC

Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features

Stars: ✭ 46 (+2.22%)

Mutual labels: voice-conversion

Supervised-Contrastive-Learning-in-TensorFlow-2

Implements the ideas presented in https://arxiv.org/pdf/2004.11362v1.pdf by Khosla et al.

Stars: ✭ 117 (+160%)

Mutual labels: contrastive-learning

gans-2.0

Generative Adversarial Networks in TensorFlow 2.0

Stars: ✭ 76 (+68.89%)

Mutual labels: cyclegan

TF-Speech-Recognition-Challenge-Solution

Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.

Stars: ✭ 58 (+28.89%)

Mutual labels: speech

GeDML

Generalized Deep Metric Learning.

Stars: ✭ 30 (-33.33%)

Mutual labels: contrastive-learning

lidbox

End-to-end spoken language identification out of the box.

Stars: ✭ 39 (-13.33%)

Mutual labels: speech

Multimodal-Gesture-Recognition-with-LSTMs-and-CTC

An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.

Stars: ✭ 25 (-44.44%)

Mutual labels: speech

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+555.56%)

Mutual labels: speech

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (+257.78%)

Mutual labels: speech

FAST-RIR

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

Stars: ✭ 90 (+100%)

Mutual labels: speech

react-native-speech-bubble

💬 A speech bubble dialog component for React Native.

Stars: ✭ 50 (+11.11%)

Mutual labels: speech

MajorDomo-Scenarios

Сценарии для системы домашней автоматизации Majordomo

Stars: ✭ 12 (-73.33%)

Mutual labels: speech

aframe-speech-controls-component

alternative form of inputs for in-VR interaction with the content of a scene