adiyoss / DeepSegmentor

Licence: MIT license

Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)

Programming Languages

python

139335 projects - #7 most used programming language

lua

6591 projects

Projects that are alternatives of or similar to DeepSegmentor

Inaspeechsegmenter

CNN-based audio segmentation toolkit. Allows to detect speech, music and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

Stars: ✭ 352 (+1970.59%)

Mutual labels: speech, segmentation

Tfvos

Semi-Supervised Video Object Segmentation (VOS) with Tensorflow. Includes implementation of *MaskRNN: Instance Level Video Object Segmentation (NIPS 2017)* as part of the NIPS Paper Implementation Challenge.

Stars: ✭ 151 (+788.24%)

Mutual labels: recurrent-neural-networks, segmentation

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Stars: ✭ 2,097 (+12235.29%)

Mutual labels: speech, recurrent-neural-networks

Awesome Tensorlayer

A curated list of dedicated resources and applications

Stars: ✭ 248 (+1358.82%)

Mutual labels: recurrent-neural-networks, segmentation

TF-Speech-Recognition-Challenge-Solution

Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.

Stars: ✭ 58 (+241.18%)

Mutual labels: speech, recurrent-neural-networks

D-TDNN

PyTorch implementation of Densely Connected Time Delay Neural Network

Stars: ✭ 60 (+252.94%)

Mutual labels: speech

lung-image-analysis

A basic framework for pulmonary nodule detection and characterization in CT

Stars: ✭ 26 (+52.94%)

Mutual labels: segmentation

datastories-semeval2017-task6

Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".

Stars: ✭ 20 (+17.65%)

Mutual labels: recurrent-neural-networks

kaldi helpers

🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.

Stars: ✭ 13 (-23.53%)

Mutual labels: speech

entity-network

Tensorflow implementation of "Tracking the World State with Recurrent Entity Networks" [https://arxiv.org/abs/1612.03969] by Henaff, Weston, Szlam, Bordes, and LeCun.

Stars: ✭ 58 (+241.18%)

Mutual labels: recurrent-neural-networks

VAD-LTSD

Efficient voice activity detection algorithm using long-term speech information

Stars: ✭ 37 (+117.65%)

Mutual labels: speech

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Stars: ✭ 13,870 (+81488.24%)

Mutual labels: speech

deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (+382.35%)

Mutual labels: speech

Polygonization-by-Frame-Field-Learning

This repository contains the code for our fast polygonal building extraction from overhead images pipeline.

Stars: ✭ 161 (+847.06%)

Mutual labels: segmentation

deep-learning

Assignmends done for Udacity's Deep Learning MOOC with Vincent Vanhoucke

Stars: ✭ 94 (+452.94%)

Mutual labels: recurrent-neural-networks

nicMSlesions

Easy multiple sclerosis white matter lesion segmentation using convolutional deep neural networks.

Stars: ✭ 33 (+94.12%)

Mutual labels: segmentation

wasr network

WaSR Segmentation Network for Unmanned Surface Vehicles v0.5

Stars: ✭ 32 (+88.24%)

Mutual labels: segmentation

JD-NMF

Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)

Stars: ✭ 20 (+17.65%)

Mutual labels: speech

SpeakerDiarization RNN CNN LSTM

Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should state when speaker starts and ends. In this project, we analyze given audio file with 2 channels and 2 speakers (on separate channels).

Stars: ✭ 56 (+229.41%)

Mutual labels: recurrent-neural-networks

AttentionGatedVNet3D

Attention Gated VNet3D Model for KiTS19——2019 Kidney Tumor Segmentation Challenge

Stars: ✭ 35 (+105.88%)

Mutual labels: segmentation

View All Similar Projects ➔

Deep Segmentor - Sequence Segmentation using Joint RNN and Structured Prediction Models

We describe and analyze a simple and effective algorithm for sequence segmentation applied to speech processing tasks. We propose a neural architecture that is composed of two modules trained jointly: a recurrent neural network (RNN) module and a structured prediction model. The RNN outputs are considered as feature functions to the structured model. The overall model is trained with a structured loss function which can be designed to the given segmentation task. We demonstrate the effectiveness of our method by applying it to two simple tasks commonly used in phonetic studies: word segmentation and voice onset time segmentation.

If you find our work useful please cite: [Sequence Segmentation using Joint RNN and Structured Prediction Models] (https://arxiv.org/pdf/1610.07918v1.pdf)

@article{adi2016sequence,
  title={Sequence Segmentation Using Joint RNN and Structured Prediction Models},
  author={Adi, Yossi and Keshet, Joseph and Cibelli, Emily and Goldrick, Matthew},
  journal={arXiv preprint arXiv:1610.07918},
  year={2016}
}

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

adiyoss / DeepSegmentor

Programming Languages

Labels

Projects that are alternatives of or similar to DeepSegmentor

Deep Segmentor - Sequence Segmentation using Joint RNN and Structured Prediction Models