Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → pyannote → Pyannote Audio

pyannote / Pyannote Audio

Licence: mit

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Programming Languages

python

139335 projects - #7 most used programming language

Labels

pytorch pretrained-models speech-processing

Projects that are alternatives of or similar to Pyannote Audio

Breast cancer classifier

Deep Neural Networks Improve Radiologists' Performance in Breast Cancer Screening

Stars: ✭ 614 (-37.22%)

Mutual labels: pretrained-models

Pytorchinsight

a pytorch lib with state-of-the-art architectures, pretrained models and real-time updated results

Stars: ✭ 713 (-27.1%)

Mutual labels: pretrained-models

Prosr

Repository containing an independent implementation of the paper: "A Fully Progressive Approach to Single-Image Super-Resolution"

Stars: ✭ 923 (-5.62%)

Mutual labels: pretrained-models

Pinto model zoo

A repository that shares tuning results of trained models generated by TensorFlow / Keras. Post-training quantization (Weight Quantization, Integer Quantization, Full Integer Quantization, Float16 Quantization), Quantization-aware training. TensorFlow Lite. OpenVINO. CoreML. TensorFlow.js. TF-TRT. MediaPipe. ONNX. [.tflite,.h5,.pb,saved_model,tfjs,tftrt,mlmodel,.xml/.bin, .onnx]

Stars: ✭ 634 (-35.17%)

Mutual labels: pretrained-models

Efficientnet Pytorch

A PyTorch implementation of EfficientNet and EfficientNetV2 (coming soon!)

Stars: ✭ 6,685 (+583.54%)

Mutual labels: pretrained-models

Srgan Tensorflow

Tensorflow implementation of the SRGAN algorithm for single image super-resolution

Stars: ✭ 754 (-22.9%)

Mutual labels: pretrained-models

Self Driving Car In Video Games

A deep neural network that learns to drive in video games

Stars: ✭ 559 (-42.84%)

Mutual labels: pretrained-models

Asteroid

The PyTorch-based audio source separation toolkit for researchers

Stars: ✭ 862 (-11.86%)

Mutual labels: pretrained-models

Awesome Diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Stars: ✭ 673 (-31.19%)

Mutual labels: speech-processing

Bert Ner

Pytorch-Named-Entity-Recognition-with-BERT

Stars: ✭ 829 (-15.24%)

Mutual labels: pretrained-models

Nlp Recipes

Natural Language Processing Best Practices & Examples

Stars: ✭ 5,783 (+491.31%)

Mutual labels: pretrained-models

Pytorch Cnn Finetune

Fine-tune pretrained Convolutional Neural Networks with PyTorch

Stars: ✭ 653 (-33.23%)

Mutual labels: pretrained-models

Sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

Stars: ✭ 764 (-21.88%)

Mutual labels: speech-processing

Gensim Data

Data repository for pretrained NLP models and NLP corpora.

Stars: ✭ 622 (-36.4%)

Mutual labels: pretrained-models

Musical Onset Efficient

Supplementary information and code for the paper: An efficient deep learning model for musical onset detection

Stars: ✭ 26 (-97.34%)

Mutual labels: pretrained-models

Functional Zoo

PyTorch and Tensorflow functional model definitions

Stars: ✭ 577 (-41%)

Mutual labels: pretrained-models

Audino

Open source audio annotation tool for humans™

Stars: ✭ 740 (-24.34%)

Mutual labels: speech-processing

Rte Speech Generator

Natural Language Processing to generate new speeches for the President of Turkey.

Stars: ✭ 22 (-97.75%)

Mutual labels: speech-processing

Classification models

Classification models trained on ImageNet. Keras.

Stars: ✭ 938 (-4.09%)

Mutual labels: pretrained-models

Bert Keras

Keras implementation of BERT with pre-trained weights

Stars: ✭ 820 (-16.16%)

Mutual labels: pretrained-models

View All Similar Projects ➔

⚠️ Checkout develop branch to see what is coming in pyannote.audio 2.0:

a much smaller and cleaner codebase
Python-first API (the good old pyannote-audio CLI will still be available, though)
multi-GPU and TPU training thanks to pytorch-lightning
data augmentation with torch-audiomentations
huggingface model hosting
prodigy recipes for audio annotations
online demo based on streamlit

Neural speaker diarization with `pyannote-audio`

pyannote.audio is an open-source toolkit written in Python for speaker diarization. Based on PyTorch machine learning framework, it provides a set of trainable end-to-end neural building blocks that can be combined and jointly optimized to build speaker diarization pipelines:

pyannote.audio also comes with pretrained models covering a wide range of domains for voice activity detection, speaker change detection, overlapped speech detection, and speaker embedding:

Installation

pyannote.audio only supports Python 3.7 (or later) on Linux and macOS. It might work on Windows but there is no garantee that it does, nor any plan to add official support for Windows.

The instructions below assume that pytorch has been installed using the instructions from https://pytorch.org.

$ pip install pyannote.audio==1.1.1

Documentation and tutorials

Use pretrained models and pipelines
- Apply pretrained pipelines on your own data
- Apply pretrained models on your own data
Prepare your own data
- Annotate your own data semi-automatically with Prodigy
- Prepare your own dataset for training
Train models on your own data
Tune pipelines on your own data
- Speech activity detection pipeline
- Speaker diarization pipeline

Until a proper documentation is released, note that part of the API is described in this tutorial.

Citation

If you use pyannote.audio please use the following citation

@inproceedings{Bredin2020,
  Title = {{pyannote.audio: neural building blocks for speaker diarization}},
  Author = {{Bredin}, Herv{\'e} and {Yin}, Ruiqing and {Coria}, Juan Manuel and {Gelly}, Gregory and {Korshunov}, Pavel and {Lavechin}, Marvin and {Fustes}, Diego and {Titeux}, Hadrien and {Bouaziz}, Wassim and {Gill}, Marie-Philippe},
  Booktitle = {ICASSP 2020, IEEE International Conference on Acoustics, Speech, and Signal Processing},
  Address = {Barcelona, Spain},
  Month = {May},
  Year = {2020},
}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 978

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (86) 🔗

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

pyannote / Pyannote Audio

Programming Languages

Labels

Projects that are alternatives of or similar to Pyannote Audio

Neural speaker diarization with pyannote-audio

Installation

Documentation and tutorials

Citation

Neural speaker diarization with `pyannote-audio`