Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Ipython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.

Stars: ✭ 23 (-90.38%)

Mutual labels: deep-neural-networks, deeplearning, text-classification

Servenet

Service Classification based on Service Description

Stars: ✭ 21 (-91.21%)

Mutual labels: deep-neural-networks, cnn, deeplearning

Nlp Pretrained Model

A collection of Natural language processing pre-trained models.

Stars: ✭ 122 (-48.95%)

Mutual labels: deep-neural-networks, text-classification, nlp-machine-learning

Bilinear Cnn Tensorflow

This is an implementation of Bilinear CNN for fine grained visual recognition using TensorFlow.

Stars: ✭ 187 (-21.76%)

Mutual labels: cnn, deeplearning

Hdltex

HDLTex: Hierarchical Deep Learning for Text Classification

Stars: ✭ 191 (-20.08%)

Mutual labels: deep-neural-networks, text-classification

Cnn Text Classification Tf Chinese

CNN for Chinese Text Classification in Tensorflow

Stars: ✭ 237 (-0.84%)

Mutual labels: cnn, text-classification

Sparse Structured Attention

Sparse and structured neural attention mechanisms

Stars: ✭ 198 (-17.15%)

Mutual labels: deep-neural-networks, deeplearning

Datastories Semeval2017 Task4

Deep-learning model presented in "DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis".

Stars: ✭ 184 (-23.01%)

Mutual labels: deeplearning, nlp-machine-learning

Awesome Deep Learning Music

List of articles related to deep learning applied to music

Stars: ✭ 2,195 (+818.41%)

Mutual labels: deep-neural-networks, deeplearning

Pyconv

Pyramidal Convolution: Rethinking Convolutional Neural Networks for Visual Recognition (https://arxiv.org/pdf/2006.11538.pdf)

Stars: ✭ 231 (-3.35%)

Mutual labels: deep-neural-networks, cnn

Speech Emotion Recognition

Speaker independent emotion recognition

Stars: ✭ 169 (-29.29%)

Mutual labels: deep-neural-networks, deeplearning

Keras Unet

Helper package with multiple U-Net implementations in Keras as well as useful utility tools helpful when working with image semantic segmentation tasks. This library and underlying tools come from multiple projects I performed working on semantic segmentation tasks

Stars: ✭ 196 (-17.99%)

Mutual labels: deep-neural-networks, deeplearning

Iresnet

Improved Residual Networks (https://arxiv.org/pdf/2004.04989.pdf)

Stars: ✭ 163 (-31.8%)

Mutual labels: deep-neural-networks, cnn

Liteflownet2

A Lightweight Optical Flow CNN - Revisiting Data Fidelity and Regularization, TPAMI 2020

Stars: ✭ 195 (-18.41%)

Mutual labels: cnn, deeplearning

Halite Ii

Season 2 of @twosigma's artificial intelligence programming challenge

Stars: ✭ 201 (-15.9%)

Mutual labels: deep-neural-networks, deeplearning

Trixi

Manage your machine learning experiments with trixi - modular, reproducible, high fashion. An experiment infrastructure optimized for PyTorch, but flexible enough to work for your framework and your tastes.

Stars: ✭ 211 (-11.72%)

Mutual labels: deep-neural-networks, deeplearning

View All Similar Projects ➔

[PYTORCH] Hierarchical Attention Networks for Document Classification

Introduction

Here is my pytorch implementation of the model described in the paper Hierarchical Attention Networks for Document Classification paper.

An example of app demo for my model's output for Dbpedia dataset.

An example of my model's performance for Dbpedia dataset.

How to use my code

With my code, you can:

Train your model with any dataset
Given either my trained model or yours, you could evaluate any test dataset whose have the same set of classes
Run a simple web app for testing purpose

Requirements:

python 3.6
pytorch 0.4
tensorboard
tensorboardX (This library could be skipped if you do not use SummaryWriter)
numpy

Datasets:

Statistics of datasets I used for experiments. These datasets could be download from link

Dataset	Classes	Train samples	Test samples
AG’s News	4	120 000	7 600
Sogou News	5	450 000	60 000
DBPedia	14	560 000	70 000
Yelp Review Polarity	2	560 000	38 000
Yelp Review Full	5	650 000	50 000
Yahoo! Answers	10	1 400 000	60 000
Amazon Review Full	5	3 000 000	650 000
Amazon Review Polarity	2	3 600 000	400 000

Additionally, I also use word2vec pre-trained models, taken from GLOVE, which you could download from link. I run experiments with all 4 word2vec files (50d, 100d, 200d and 300d). You could easily switch to other common word2vec models, like the one provided in FastText link In the paper, it is said that a pre-trained word2vec is used. However, to the best of my knowledge, at least in pytorch, there is no implementation on github using it. In all HAN github repositories I have seen so far, a default embedding layer was used, without loading pre-trained word2vec model. I admit that we could still train HAN model without any pre-trained word2vec model. However, to serve the purpose of re-implementing origin model, in all experiments, as mentioned above, I used 1 out of 4 pre-trained word2vec models as initilization for embedding layer.

Setting:

During my experiments, I found out that given different datasets and different embedding layer's dimension, some combinations of batch size and learning rate yield better performance (faster convergence and higher accuracy) than others. Particularly in some cases, if you set wrong values for these 2 very important parameters, your model will never converge. Detail setting for each experiments will be shown in Experiments part. I have not set a fixed number of epoches for each experiment. Instead, I apply early stopping technique, to stop training phase after test loss has not been improved for n epoches.

Training

If you want to train a model with default parameters, you could run:

python train.py

If you want to train a model with your preference parameters, like optimizer and learning rate, you could run:

python train.py --batch_size batch_size --lr learning_rate: For example, python train.py --batch_size 512 --lr 0.01

If you want to train a model with your preference word2vec model, you could run:

python train.py --word2vec_path path/to/your/word2vec

Test

For testing a trained model with your test file, please run the following command:

python test.py --word2vec_path path/to/your/word2vec, with the word2vec file is the same as the one you use in training phase.

You could find some trained models I have trained in link

Experiments:

Results for test set are presented as follows: A(B/C):

A is accuracy.
B is learning rate used.
C is batch size.

Each experiment is run over 10 epochs.

GLOVE word2vec	50	100	200	300
ag_news	updated soon	updated soon	updated soon	updated soon
sogu_news	updated soon	updated soon	updated soon	updated soon
db_pedia	updated soon	updated soon	updated soon	updated soon
yelp_polarity	updated soon	updated soon	updated soon	updated soon
yelp_review	updated soon	updated soon	updated soon	updated soon
yahoo_answer	updated soon	updated soon	updated soon	updated soon
amazon_review	updated soon	updated soon	updated soon	updated soon
amazon_polarity	updated soon	updated soon	updated soon	updated soon

The training/test loss/accuracy curves for each dataset's experiments (with the order from left to right, top to bottom is 50d, 100d, 200d and 300d word2vec) are shown below:

ag_news

db_pedia

yelp_polarity

yelp_review

Yahoo! Answers

amazon_review

amazon_polarity

There are some experiments I have not had time to train. For such experiments, statistics as well as loss/accuracy visualization are empty. Additionally, there are some other experiments, I can not wait until they are finished, hence I stopped training phase before it should be . You could see whether a model was stopped by early stopping technique or by me by looking at the test loss curve, if the loss is not improved for 5 consecutive epoches, it is the former case. Othewise, if the loss is still going down, it is the latter case. When I have time, I will complete the incomplete experiments, and update results here.

After completing training phase, you could see model's parameters you have set, accuracy, loss and confusion matrix for test set at the end of each epoch at root_folder/trained_models/logs.txt. One example is shown below:

An example of logs.txt for Dbpedia dataset.

Demo:

I wrote a simple web which is suitable for quick test with any input text. In order to use the app, you could follow the following steps:

Step 1: Run the script app.py

Step 2: Web interface

Step 3: Select trained model

Step 4: Select word2vec model

Step 5 (Optional): Select file containing classes (one class per line)

Step 6: After all necessary files are selected, press submit button

Step 7: You could paste any text to the textbox

Step 8: A sample text

Step 9: After submit button pressed, predicted category and probability are shown

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 239

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (10) 🔗