All Projects → oscarknagg → Voicemap

oscarknagg / Voicemap

Identifying people from small audio fragments

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Voicemap

Imagenet
Pytorch Imagenet Models Example + Transfer Learning (and fine-tuning)
Stars: ✭ 134 (-6.94%)
Mutual labels:  convolutional-neural-networks
Image classifier
CNN image classifier implemented in Keras Notebook 🖼️.
Stars: ✭ 139 (-3.47%)
Mutual labels:  convolutional-neural-networks
Image Caption Generator
[DEPRECATED] A Neural Network based generative model for captioning images using Tensorflow
Stars: ✭ 141 (-2.08%)
Mutual labels:  convolutional-neural-networks
Mrcnn Scene Recognition
MR-CNNs for Large-Scale Scene Recognition
Stars: ✭ 135 (-6.25%)
Mutual labels:  convolutional-neural-networks
Lung Segmentation 2d
Lung fields segmentation on CXR images using convolutional neural networks.
Stars: ✭ 138 (-4.17%)
Mutual labels:  convolutional-neural-networks
Abnormal event detection
Abnormal Event Detection in Videos using SpatioTemporal AutoEncoder
Stars: ✭ 139 (-3.47%)
Mutual labels:  convolutional-neural-networks
Deep Learning With Pytorch Tutorials
深度学习与PyTorch入门实战视频教程 配套源代码和PPT
Stars: ✭ 1,986 (+1279.17%)
Mutual labels:  convolutional-neural-networks
Shainet
SHAInet - a pure Crystal machine learning library
Stars: ✭ 143 (-0.69%)
Mutual labels:  convolutional-neural-networks
Wsddn
Weakly Supervised Deep Detection Networks (CVPR 2016)
Stars: ✭ 138 (-4.17%)
Mutual labels:  convolutional-neural-networks
Bender
Easily craft fast Neural Networks on iOS! Use TensorFlow models. Metal under the hood.
Stars: ✭ 1,728 (+1100%)
Mutual labels:  convolutional-neural-networks
Reproduce Chexnet
Reproduce CheXNet
Stars: ✭ 136 (-5.56%)
Mutual labels:  convolutional-neural-networks
Easycnn
easy convolution neural network
Stars: ✭ 136 (-5.56%)
Mutual labels:  convolutional-neural-networks
Hey Waldo
Labeled images of the Where's Waldo puzzle for use in classification and image recognition problems.
Stars: ✭ 138 (-4.17%)
Mutual labels:  convolutional-neural-networks
Chainer Cifar10
Various CNN models for CIFAR10 with Chainer
Stars: ✭ 134 (-6.94%)
Mutual labels:  convolutional-neural-networks
Cnn Quantization
Quantization of Convolutional Neural networks.
Stars: ✭ 141 (-2.08%)
Mutual labels:  convolutional-neural-networks
Intelegent lock
lock mechanism with face recognition and liveness detection
Stars: ✭ 134 (-6.94%)
Mutual labels:  convolutional-neural-networks
Deeplearning.ai
Stars: ✭ 139 (-3.47%)
Mutual labels:  convolutional-neural-networks
Livianet
This repository contains the code of LiviaNET, a 3D fully convolutional neural network that was employed in our work: "3D fully convolutional networks for subcortical segmentation in MRI: A large-scale study"
Stars: ✭ 143 (-0.69%)
Mutual labels:  convolutional-neural-networks
Attribute Aware Attention
[ACM MM 2018] Attribute-Aware Attention Model for Fine-grained Representation Learning
Stars: ✭ 143 (-0.69%)
Mutual labels:  convolutional-neural-networks
Practical Machine Learning With Python
Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Stars: ✭ 1,868 (+1197.22%)
Mutual labels:  convolutional-neural-networks

voicemap

This repository contains code to build deep learning models to identify different speakers based on audio samples containg their voice.

The eventual aim is for this repository to become a pip-installable python package for quickly and easily performing speaker identification related tasks.

This tensorflow/Keras/python2.7 branch is discontinued. Work is continuing on the pytorch-python-3.6 branch which will become the master branch.

Instructions

Requirements

Make a new virtualenv and install requirements from requirements.txt with the following command.

pip install -r requirements.txt

This project was written in Python 2.7.12 so I cannot guarantee it works on any other version.

Data

Get training data here: http://www.openslr.org/12

  • train-clean-100.tar.gz
  • train-clean-360.tar.gz
  • dev-clean.tar.gz

Place the unzipped training data into the data/ folder so the file structure is as follows:

data/
    LibriSpeech/
        dev-clean/
        train-clean-100/
        train-clean-360/
        SPEAKERS.TXT

Please use the SPEAKERS.TXT supplied in the repo as I've made a few corrections to the one found at openslr.org.

Run tests

This requires the LibriSpeech data.

python -m unittest tests.tests

Contents

voicemap

This package contains re-usable code for defining network architectures, interacting with datasets and many utility functions.

experiments

This package contains experiments in the form of python scripts.

notebooks

This folder contains Jupyter notebooks used for interactive visualisation and analysis.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].