All Projects → pyannote → Pyannote Audio

pyannote / Pyannote Audio

Licence: mit
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Pyannote Audio

Breast cancer classifier
Deep Neural Networks Improve Radiologists' Performance in Breast Cancer Screening
Stars: ✭ 614 (-37.22%)
Mutual labels:  pretrained-models
Pytorchinsight
a pytorch lib with state-of-the-art architectures, pretrained models and real-time updated results
Stars: ✭ 713 (-27.1%)
Mutual labels:  pretrained-models
Prosr
Repository containing an independent implementation of the paper: "A Fully Progressive Approach to Single-Image Super-Resolution"
Stars: ✭ 923 (-5.62%)
Mutual labels:  pretrained-models
Pinto model zoo
A repository that shares tuning results of trained models generated by TensorFlow / Keras. Post-training quantization (Weight Quantization, Integer Quantization, Full Integer Quantization, Float16 Quantization), Quantization-aware training. TensorFlow Lite. OpenVINO. CoreML. TensorFlow.js. TF-TRT. MediaPipe. ONNX. [.tflite,.h5,.pb,saved_model,tfjs,tftrt,mlmodel,.xml/.bin, .onnx]
Stars: ✭ 634 (-35.17%)
Mutual labels:  pretrained-models
Efficientnet Pytorch
A PyTorch implementation of EfficientNet and EfficientNetV2 (coming soon!)
Stars: ✭ 6,685 (+583.54%)
Mutual labels:  pretrained-models
Srgan Tensorflow
Tensorflow implementation of the SRGAN algorithm for single image super-resolution
Stars: ✭ 754 (-22.9%)
Mutual labels:  pretrained-models
Self Driving Car In Video Games
A deep neural network that learns to drive in video games
Stars: ✭ 559 (-42.84%)
Mutual labels:  pretrained-models
Asteroid
The PyTorch-based audio source separation toolkit for researchers
Stars: ✭ 862 (-11.86%)
Mutual labels:  pretrained-models
Awesome Diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (-31.19%)
Mutual labels:  speech-processing
Bert Ner
Pytorch-Named-Entity-Recognition-with-BERT
Stars: ✭ 829 (-15.24%)
Mutual labels:  pretrained-models
Nlp Recipes
Natural Language Processing Best Practices & Examples
Stars: ✭ 5,783 (+491.31%)
Mutual labels:  pretrained-models
Pytorch Cnn Finetune
Fine-tune pretrained Convolutional Neural Networks with PyTorch
Stars: ✭ 653 (-33.23%)
Mutual labels:  pretrained-models
Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (-21.88%)
Mutual labels:  speech-processing
Gensim Data
Data repository for pretrained NLP models and NLP corpora.
Stars: ✭ 622 (-36.4%)
Mutual labels:  pretrained-models
Musical Onset Efficient
Supplementary information and code for the paper: An efficient deep learning model for musical onset detection
Stars: ✭ 26 (-97.34%)
Mutual labels:  pretrained-models
Functional Zoo
PyTorch and Tensorflow functional model definitions
Stars: ✭ 577 (-41%)
Mutual labels:  pretrained-models
Audino
Open source audio annotation tool for humans™
Stars: ✭ 740 (-24.34%)
Mutual labels:  speech-processing
Rte Speech Generator
Natural Language Processing to generate new speeches for the President of Turkey.
Stars: ✭ 22 (-97.75%)
Mutual labels:  speech-processing
Classification models
Classification models trained on ImageNet. Keras.
Stars: ✭ 938 (-4.09%)
Mutual labels:  pretrained-models
Bert Keras
Keras implementation of BERT with pre-trained weights
Stars: ✭ 820 (-16.16%)
Mutual labels:  pretrained-models

⚠️ Checkout develop branch to see what is coming in pyannote.audio 2.0:

Neural speaker diarization with pyannote-audio

pyannote.audio is an open-source toolkit written in Python for speaker diarization. Based on PyTorch machine learning framework, it provides a set of trainable end-to-end neural building blocks that can be combined and jointly optimized to build speaker diarization pipelines:

pyannote.audio also comes with pretrained models covering a wide range of domains for voice activity detection, speaker change detection, overlapped speech detection, and speaker embedding:

segmentation

Open In Colab

Installation

pyannote.audio only supports Python 3.7 (or later) on Linux and macOS. It might work on Windows but there is no garantee that it does, nor any plan to add official support for Windows.

The instructions below assume that pytorch has been installed using the instructions from https://pytorch.org.

$ pip install pyannote.audio==1.1.1

Documentation and tutorials

Until a proper documentation is released, note that part of the API is described in this tutorial.

Citation

If you use pyannote.audio please use the following citation

@inproceedings{Bredin2020,
  Title = {{pyannote.audio: neural building blocks for speaker diarization}},
  Author = {{Bredin}, Herv{\'e} and {Yin}, Ruiqing and {Coria}, Juan Manuel and {Gelly}, Gregory and {Korshunov}, Pavel and {Lavechin}, Marvin and {Fustes}, Diego and {Titeux}, Hadrien and {Bouaziz}, Wassim and {Gill}, Marie-Philippe},
  Booktitle = {ICASSP 2020, IEEE International Conference on Acoustics, Speech, and Signal Processing},
  Address = {Barcelona, Spain},
  Month = {May},
  Year = {2020},
}
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].