shashiongithub / sidenet

Licence: BSD-3-Clause License
SideNet: Neural Extractive Summarization with Side Information

Neural Extractive Summarization with Side Information

This repository releases the code for SideNet (Neural Extractive Summarization with Side Information). The code targets TensorFlow 0.10; to run it on a newer TensorFlow release, use the upgrade scripts provided by TensorFlow to convert it.

Please contact me at [email protected] with any questions.

Please cite this paper if you use any of these:

Neural Extractive Summarization with Side Information, Shashi Narayan, Nikos Papasarantopoulos, Shay B. Cohen, Mirella Lapata, ILCC, School of Informatics, University of Edinburgh, arXiv:1704.04530 (preprint)

Most extractive summarization methods focus on the main body of the document from which sentences need to be extracted. The gist of the document often lies in the side information of the document, such as title and image captions. These types of side information are often available for newswire articles. We propose to explore side information in the context of single-document extractive summarization. We develop a framework for single-document summarization composed of a hierarchical document encoder and an attention-based extractor with attention over side information. We evaluate our models on a large-scale news dataset. We show that extractive summarization with side information consistently outperforms its counterpart (that does not use any side information), in terms of both informativeness and fluency.
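To make "attention over side information" concrete, here is a minimal pure-Python sketch of the general idea. It is not the SideNet model itself. The dot-product scoring, the function names, and the toy two-dimensional vectors are all assumptions made for illustration; the abstract does not specify the scoring function, and the released code uses TensorFlow.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(query, side_vectors):
    """Dot-product attention of one sentence state over side-information vectors.

    Returns (weights, context): the attention weights and the weighted
    sum of the side vectors. The scoring function is an assumption.
    """
    weights = softmax([dot(query, s) for s in side_vectors])
    context = [
        sum(w * s[i] for w, s in zip(weights, side_vectors))
        for i in range(len(query))
    ]
    return weights, context


# Hypothetical toy embeddings: one sentence state and two pieces of
# side information (say, a title and an image caption).
sentence_state = [1.0, 0.0]
side_info = [[1.0, 0.0], [0.0, 1.0]]
weights, context = attend(sentence_state, side_info)
```

In this toy case the first side vector matches the sentence state, so it receives the larger weight. An extractor could then combine `context` with the sentence state when deciding whether to select the sentence.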

The CNN and DailyMail datasets (Hermann et al., 2015) with Side Information

Dataset with sideinfo: http://kinloch.inf.ed.ac.uk/public/direct/cnn-dm-sideinfo-data.zip

Dataset with oracle labels: http://kinloch.inf.ed.ac.uk/public/direct/cnn-dm-sidenet-oracle.zip

Preprocessed CNN dataset used for training and testing

Preprocessed CNN dataset: http://kinloch.inf.ed.ac.uk/public/direct/sidenet-cnn-inputs.tar.gz

CNN Original Sentences (test and validation sets): http://kinloch.inf.ed.ac.uk/public/direct/cnn-original-sents.zip

CNN Gold Highlights (test and validation sets): http://kinloch.inf.ed.ac.uk/public/direct/cnn-gold-highlights.zip
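
The archives listed above can be fetched and unpacked with the Python standard library. The following is a rough sketch, not part of the repository: the helper names (`archive_name`, `fetch`, `unpack`) and the local `data` directory are hypothetical, and the URL is copied from the list above. The layout inside each archive is not documented here, so inspect the extracted files before pointing the training scripts at them.

```python
import shutil
import urllib.request
from pathlib import Path, PurePosixPath
from urllib.parse import urlparse

# URL copied from the list above.
PREPROCESSED_CNN_URL = (
    "http://kinloch.inf.ed.ac.uk/public/direct/sidenet-cnn-inputs.tar.gz"
)


def archive_name(url):
    """Local file name for a download URL (its last path component)."""
    return PurePosixPath(urlparse(url).path).name


def fetch(url, dest_dir):
    """Download url into dest_dir unless the file is already there."""
    dest_dir = Path(dest_dir)
    dest_dir.mkdir(parents=True, exist_ok=True)
    target = dest_dir / archive_name(url)
    if not target.exists():
        urllib.request.urlretrieve(url, str(target))
    return target


def unpack(archive_path, out_dir):
    """Extract a .zip or .tar.gz archive; the format is inferred from the name."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    shutil.unpack_archive(str(archive_path), str(out_dir))
    return out_dir


# Example usage (performs a network download, so it is left commented out):
# archive = fetch(PREPROCESSED_CNN_URL, "data")
# unpack(archive, "data/sidenet-cnn-inputs")
```

The same two calls work for the `.zip` links, since `shutil.unpack_archive` handles both formats.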

Demonstration

Live Demo: http://kinloch.inf.ed.ac.uk/sidenet.html
