292 open source projects that are alternatives to or similar to CVC

Voice Converter Cyclegan
Voice Converter Using CycleGAN and Non-Parallel Data
Stars: ✭ 384 (+753.33%)
Mutual labels:  speech, cyclegan
JD-NMF
Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)
Stars: ✭ 20 (-55.56%)
Mutual labels:  speech, voice-conversion
VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
Stars: ✭ 278 (+517.78%)
Mutual labels:  speech, voice-conversion
Shifter
Pitch shifter using WSOLA and resampling, implemented in Python 3
Stars: ✭ 22 (-51.11%)
Mutual labels:  speech, voice-conversion
Phomeme
Simple sentence mixing tool (work in progress)
Stars: ✭ 18 (-60%)
Mutual labels:  speech, voice-conversion
S2-BNN
S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)
Stars: ✭ 53 (+17.78%)
Mutual labels:  contrastive-learning
audio noise clustering
An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording. Results: https://dodiku.github.io/audio_noise_clustering/results/
Stars: ✭ 24 (-46.67%)
Mutual labels:  speech
CycleGAN-Models
Models generated by CycleGAN
Stars: ✭ 42 (-6.67%)
Mutual labels:  cyclegan
ASR-Audio-Data-Links
A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+297.78%)
Mutual labels:  speech
Generative-Model
Repository for implementation of generative models with Tensorflow 1.x
Stars: ✭ 66 (+46.67%)
Mutual labels:  cyclegan
TFGAN
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Stars: ✭ 65 (+44.44%)
Mutual labels:  speech
wav2vec2-live
Live speech recognition using Facebook's wav2vec 2.0 model.
Stars: ✭ 205 (+355.56%)
Mutual labels:  speech
CycleGAN-gluon-mxnet
This repo attempts to reproduce Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks (CycleGAN) with a Gluon reimplementation
Stars: ✭ 31 (-31.11%)
Mutual labels:  cyclegan
DisCont
Code for the paper "DisCont: Self-Supervised Visual Attribute Disentanglement using Context Vectors".
Stars: ✭ 13 (-71.11%)
Mutual labels:  contrastive-learning
eidos-audition
Collection of auditory models.
Stars: ✭ 25 (-44.44%)
Mutual labels:  speech
TCE
This repository contains the code implementation used in the paper Temporally Coherent Embeddings for Self-Supervised Video Representation Learning (TCE).
Stars: ✭ 51 (+13.33%)
Mutual labels:  contrastive-learning
icassp2019-latex-template
ICASSP 2019 official Latex template
Stars: ✭ 21 (-53.33%)
Mutual labels:  speech
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+97.78%)
Mutual labels:  speech
pytorch-pcen
PyTorch reimplementation of per-channel energy normalization for audio.
Stars: ✭ 80 (+77.78%)
Mutual labels:  speech
info-nce-pytorch
PyTorch implementation of the InfoNCE loss for self-supervised learning.
Stars: ✭ 160 (+255.56%)
Mutual labels:  contrastive-learning
cycleGAN-PyTorch
A clean and lucid implementation of cycleGAN using PyTorch
Stars: ✭ 107 (+137.78%)
Mutual labels:  cyclegan
room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
Stars: ✭ 143 (+217.78%)
Mutual labels:  speech
RSC-Net
Implementation for "3D human pose, shape and texture from low-resolution images and videos", TPAMI 2021
Stars: ✭ 43 (-4.44%)
Mutual labels:  contrastive-learning
anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (+17.78%)
Mutual labels:  speech
Zero-Shot-TTS
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Stars: ✭ 33 (-26.67%)
Mutual labels:  speech
opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-53.33%)
Mutual labels:  speech
pix2pix
This project uses a conditional generative adversarial network (cGAN) named Pix2Pix for the image-to-image translation task.
Stars: ✭ 28 (-37.78%)
Mutual labels:  cyclegan
pytorch-gans
PyTorch implementation of GANs (Generative Adversarial Networks). DCGAN, Pix2Pix, CycleGAN, SRGAN
Stars: ✭ 21 (-53.33%)
Mutual labels:  cyclegan
CLMR
Official PyTorch implementation of Contrastive Learning of Musical Representations
Stars: ✭ 216 (+380%)
Mutual labels:  contrastive-learning
CCL
PyTorch implementation of the paper [CVPR 2021] Distilling Audio-Visual Knowledge by Compositional Contrastive Learning
Stars: ✭ 76 (+68.89%)
Mutual labels:  contrastive-learning
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-68.89%)
Mutual labels:  speech
awesome-efficient-gnn
Code and resources on scalable and efficient Graph Neural Networks
Stars: ✭ 498 (+1006.67%)
Mutual labels:  contrastive-learning
ViCC
[WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https://arxiv.org/abs/2106.10137.
Stars: ✭ 33 (-26.67%)
Mutual labels:  contrastive-learning
NBSS
The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".
Stars: ✭ 77 (+71.11%)
Mutual labels:  speech
awesome-graph-self-supervised-learning-based-recommendation
A curated list of awesome graph & self-supervised-learning-based recommendation.
Stars: ✭ 37 (-17.78%)
Mutual labels:  contrastive-learning
cape
Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch
Stars: ✭ 29 (-35.56%)
Mutual labels:  speech
Revisiting-Contrastive-SSL
Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021]
Stars: ✭ 81 (+80%)
Mutual labels:  contrastive-learning
day2night
Image2Image Translation Research
Stars: ✭ 46 (+2.22%)
Mutual labels:  cyclegan
ventib
📈 Ventib records your voice, transcribes it in realtime, and performs speech pattern analysis to give you objective statistics about how you speak.
Stars: ✭ 43 (-4.44%)
Mutual labels:  speech
Parametric-Contrastive-Learning
Parametric Contrastive Learning (ICCV2021)
Stars: ✭ 155 (+244.44%)
Mutual labels:  contrastive-learning
object-aware-contrastive
Object-aware Contrastive Learning for Debiased Scene Representation (NeurIPS 2021)
Stars: ✭ 44 (-2.22%)
Mutual labels:  contrastive-learning
UPIT
A fastai/PyTorch package for unpaired image-to-image translation.
Stars: ✭ 94 (+108.89%)
Mutual labels:  cyclegan
txt2speech
Convert text to speech using Google Translate API
Stars: ✭ 38 (-15.56%)
Mutual labels:  speech
KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-53.33%)
Mutual labels:  speech
CLSA
Official implementation of "Contrastive Learning with Stronger Augmentations"
Stars: ✭ 48 (+6.67%)
Mutual labels:  contrastive-learning
MediumVC
Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features
Stars: ✭ 46 (+2.22%)
Mutual labels:  voice-conversion
Supervised-Contrastive-Learning-in-TensorFlow-2
Implements the ideas presented in https://arxiv.org/pdf/2004.11362v1.pdf by Khosla et al.
Stars: ✭ 117 (+160%)
Mutual labels:  contrastive-learning
gans-2.0
Generative Adversarial Networks in TensorFlow 2.0
Stars: ✭ 76 (+68.89%)
Mutual labels:  cyclegan
TF-Speech-Recognition-Challenge-Solution
Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.
Stars: ✭ 58 (+28.89%)
Mutual labels:  speech
GeDML
Generalized Deep Metric Learning.
Stars: ✭ 30 (-33.33%)
Mutual labels:  contrastive-learning
lidbox
End-to-end spoken language identification out of the box.
Stars: ✭ 39 (-13.33%)
Mutual labels:  speech
Multimodal-Gesture-Recognition-with-LSTMs-and-CTC
An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from a continuous stream.
Stars: ✭ 25 (-44.44%)
Mutual labels:  speech
IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+555.56%)
Mutual labels:  speech
StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
Stars: ✭ 161 (+257.78%)
Mutual labels:  speech
FAST-RIR
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
Stars: ✭ 90 (+100%)
Mutual labels:  speech
react-native-speech-bubble
💬 A speech bubble dialog component for React Native.
Stars: ✭ 50 (+11.11%)
Mutual labels:  speech
MajorDomo-Scenarios
Scenarios for the Majordomo home automation system
Stars: ✭ 12 (-73.33%)
Mutual labels:  speech
aframe-speech-controls-component
alternative form of inputs for in-VR interaction with the content of a scene
Stars: ✭ 13 (-71.11%)
Mutual labels:  speech
publications-arruda-ijcnn-2019
Cross-Domain Car Detection Using Unsupervised Image-to-Image Translation: From Day to Night
Stars: ✭ 59 (+31.11%)
Mutual labels:  cyclegan
idear
🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (+86.67%)
Mutual labels:  speech