AESRC2020a deep accent recognition network
Stars: ✭ 35 (-38.6%)
InsightfaceState-of-the-art 2D and 3D Face Analysis Project
Stars: ✭ 10,886 (+18998.25%)
PLSCPaddle Large Scale Classification Tools,supports ArcFace, CosFace, PartialFC, Data Parallel + Model Parallel. Model includes ResNet, ViT, DeiT, FaceViT.
Stars: ✭ 113 (+98.25%)
meta-embeddingsMeta-embeddings are a probabilistic generalization of embeddings in machine learning.
Stars: ✭ 22 (-61.4%)
Paddle-SEQ低代码序列数据处理框架,最短两行即可完成训练任务!
Stars: ✭ 13 (-77.19%)
voice gender detection♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
Stars: ✭ 51 (-10.53%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-12.28%)
Smart container🍰🍎ColugoMum--Intelligent Retail Settlement Platform can accurately locate and identify each commodity, and can return a complete shopping list and the actual total price of commodities that customers should pay.
Stars: ✭ 141 (+147.37%)
FSL-MateFSL-Mate: A collection of resources for few-shot learning (FSL).
Stars: ✭ 1,346 (+2261.4%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+1375.44%)
wavenet-classifierKeras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
Stars: ✭ 54 (-5.26%)
kaldi-timit-sre-ivectorDevelop speaker recognition model based on i-vector using TIMIT database
Stars: ✭ 17 (-70.18%)
InsightFace-RESTInsightFace REST API for easy deployment of face recognition services with TensorRT in Docker.
Stars: ✭ 308 (+440.35%)
Paddle-CLIPA PaddlePaddle version implementation of CLIP of OpenAI.
Stars: ✭ 51 (-10.53%)
VoiceNET.Library.NET library to easily create Voice Command Control feature.
Stars: ✭ 14 (-75.44%)
myprosodyA Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.
Stars: ✭ 162 (+184.21%)
FreeSRA Free Library for Speaker Recognition (Verification),implemented by ncnn.
Stars: ✭ 21 (-63.16%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-52.63%)
FaceRecognitionCppLarge input size REAL-TIME Face Detector on Cpp. It can also support face verification using MobileFaceNet+Arcface with real-time inference. 480P Over 30FPS on CPU
Stars: ✭ 40 (-29.82%)
insight-face-paddleEnd-to-end face detection and recognition system using PaddlePaddle.
Stars: ✭ 52 (-8.77%)
meta-SRPytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)
Stars: ✭ 58 (+1.75%)
Speaker-IdentificationA program for automatic speaker identification using deep learning techniques.
Stars: ✭ 84 (+47.37%)
InterpretDLInterpretDL: Interpretation of Deep Learning Models,基于『飞桨』的模型可解释性算法库。
Stars: ✭ 121 (+112.28%)
QuietVRA Quiet Place in VR: Generate any 3D object with your voice. It's magic!
Stars: ✭ 17 (-70.18%)
Paddle-Adversarial-ToolboxPaddle-Adversarial-Toolbox (PAT) is a Python library for Deep Learning Security based on PaddlePaddle.
Stars: ✭ 16 (-71.93%)
Paddle-RLBooksPaddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
Stars: ✭ 113 (+98.25%)
octopusOn-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-47.37%)
SmartMirrorMy MagicMirror running on a Raspberry Pi
Stars: ✭ 110 (+92.98%)
PaddleTokenizer使用 PaddlePaddle 实现基于深度神经网络的中文分词引擎 | A DNN Chinese Tokenizer by Using PaddlePaddle
Stars: ✭ 14 (-75.44%)
D-TDNNPyTorch implementation of Densely Connected Time Delay Neural Network
Stars: ✭ 60 (+5.26%)
brasilttsBrasil TTS é um conjunto de sintetizadores de voz, em português do Brasil, que lê telas para portadores de deficiência visual. Transforma texto em áudio, permitindo que pessoas cegas ou com baixa visão tenham acesso ao conteúdo exibido na tela. Embora o principal público-alvo de sistemas de conversão texto-fala – como o Brasil TTS – seja formado…
Stars: ✭ 34 (-40.35%)
esp32 CloudSpeechTranscribe your voice by Google's Cloud Speech-to-Text API with esp32
Stars: ✭ 72 (+26.32%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-63.16%)
BookSource《深度学习应用实战之PaddlePaddle》的源码
Stars: ✭ 17 (-70.18%)
picovoiceThe end-to-end platform for building voice products at scale
Stars: ✭ 316 (+454.39%)
MiniVoxCode for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
Stars: ✭ 15 (-73.68%)
Paddle-DALL-EA PaddlePaddle version implementation of DALL-E of OpenAI.
Stars: ✭ 38 (-33.33%)
voce-browserVoice Controlled Chromium Web Browser
Stars: ✭ 34 (-40.35%)
PaTTAA test times augmentation toolkit based on paddle2.0.
Stars: ✭ 106 (+85.96%)