guokr / Brief

Licence: GPL-3.0
In a nutshell, this is a Text Summarizer

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Brief

TextSummarizer
TextRank implementation for C#
Stars: ✭ 29 (+0%)
Mutual labels:  text-summarization, text-summarizer
PlanSum
[AAAI2021] Unsupervised Opinion Summarization with Content Planning
Stars: ✭ 25 (-13.79%)
Mutual labels:  text-summarization
TextSumma
reimplementing Neural Summarization by Extracting Sentences and Words
Stars: ✭ 16 (-44.83%)
Mutual labels:  text-summarization
Text-Summarization
Abstractive and Extractive Text summarization using Transformers.
Stars: ✭ 38 (+31.03%)
Mutual labels:  text-summarization
Scripts-for-extractive-summarization
Scripts for an upcoming blog "Extractive vs. Abstractive Summarization" for RaRe Technologies.
Stars: ✭ 12 (-58.62%)
Mutual labels:  text-summarization
nlp-akash
Natural Language Processing notes and implementations.
Stars: ✭ 66 (+127.59%)
Mutual labels:  text-summarization
Text Summarizer Pytorch
Pytorch implementation of "A Deep Reinforced Model for Abstractive Summarization" paper and pointer generator network
Stars: ✭ 203 (+600%)
Mutual labels:  text-summarization
NLP Toolkit
Library of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (+217.24%)
Mutual labels:  text-summarization
Text-Summarization-Repo
A repository that collects the main research topics in text summarization, must-read papers, and available models and data, together with recommended resources.
Stars: ✭ 213 (+634.48%)
Mutual labels:  text-summarization
pytorch-translm
An implementation of transformer-based language model for sentence rewriting tasks such as summarization, simplification, and grammatical error correction.
Stars: ✭ 22 (-24.14%)
Mutual labels:  text-summarization
Bidirectiona-LSTM-for-text-summarization-
A bidirectional encoder-decoder LSTM neural network is trained for text summarization on the cnn/dailymail dataset. (MIT808 project)
Stars: ✭ 73 (+151.72%)
Mutual labels:  text-summarization
NLP-Extractive-NEWS-summarization-using-MMR
A simple python implementation of the Maximal Marginal Relevance (MMR) baseline system for text summarization.
Stars: ✭ 59 (+103.45%)
Mutual labels:  text-summarization
DocSum
A tool to automatically summarize documents abstractively using the BART or PreSumm Machine Learning Model.
Stars: ✭ 58 (+100%)
Mutual labels:  text-summarization
email-summarization
A module for E-mail Summarization which uses clustering of skip-thought sentence embeddings.
Stars: ✭ 81 (+179.31%)
Mutual labels:  text-summarization
xl-sum
This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.
Stars: ✭ 160 (+451.72%)
Mutual labels:  text-summarization
Text summarization with tensorflow
Implementation of a seq2seq model for summarization of textual data. Demonstrated on amazon reviews, github issues and news articles.
Stars: ✭ 226 (+679.31%)
Mutual labels:  text-summarization
Persian-Summarization
Statistical and Semantical Text Summarizer in Persian Language
Stars: ✭ 38 (+31.03%)
Mutual labels:  text-summarization
allsummarizer
Multilingual automatic text summarizer using statistical approach and extraction
Stars: ✭ 28 (-3.45%)
Mutual labels:  text-summarization
gazeta
Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке
Stars: ✭ 25 (-13.79%)
Mutual labels:  text-summarization
Entity2Topic
[NAACL2018] Entity Commonsense Representation for Neural Abstractive Summarization
Stars: ✭ 20 (-31.03%)
Mutual labels:  text-summarization

Brief

Brief is a text summarizer based on the sequence-to-sequence framework, implemented in Python with Facebook's PyTorch.


Demo | Requirements | Install | Pre-trained models | Train | Examples | Document

Quick demo

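from brief import BriefModel

# Load the model from the ./checkpoint_transformer path.
model = BriefModel("./checkpoint_transformer")

# Input is character-level text: characters separated by single spaces.
# Adjacent string literals are concatenated, so each piece ends with a space.
long_text = ("为 期 三 个 月 的 全 国 公 路 执 法 专 项 整 改 工 作 刚 结 束 , "
             "整 治 重 点 包 括 对 非 法 超 限 运 输 车 辆 只 收 费 不 卸 载 、 "
             "伙 同 社 会 闲 散 人 员 擅 自 放 行 等 。 然 而 , 在 重 要 省 道 滨 唐 公 路 津 冀 交 界 处 , "
             "执 法 治 超 沦 为 摆 设 , 大 肆 收 费 后 擅 自 放 行 , 超 载 问 题 严 重 失 控 。")

model.summarize([long_text])
# Output: 津 冀 交 界 公 路 治 超 载 乱 象 严 重 , 官 卡 执 法 沦 为 摆 设 。
# (English: Overloading is rampant on the highway at the Tianjin-Hebei border;
#  checkpoint enforcement has become a mere formality.)

long_text = ("眼 下 , 白 酒 业 “ 塑 化 剂 门 ” 继 续 发 酵 , "
             "业 内 业 外 各 有 说 法 。 酒 鬼 酒 公 司 股 票 继 续 停 牌 。 "
             "记 者 走 访 郑 州 市 场 发 现 , 商 家 并 未 将 酒 鬼 酒 和 其 他 白 酒 下 架 , "
             "白 酒 销 量 暂 时 稳 定 , 未 受 影 响 。 专 家 提 醒 : 塑 化 剂 溶 于 酒 精 , "
             "应 避 免 用 塑 料 制 品 盛 装 白 酒 。")

model.summarize([long_text])
# Output: 河 南 白 酒 未 受 “ 塑 化 剂 门 ” 冲 击 , 喝 酒 别 用 塑 料 杯 。
# (English: Henan baijiu sales are unaffected by the "plasticizer scandal";
#  do not drink liquor from plastic cups.)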
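The demo inputs and outputs are written with one space between every character, which suggests the char-level model expects pre-split text. The helpers below are a minimal sketch of how raw Chinese text could be converted to and from that form. The names `to_char_tokens` and `from_char_tokens` are hypothetical and not part of Brief, and the assumption that single-space separation is all the model needs is ours.

```python
def to_char_tokens(text: str) -> str:
    """Put a single space between every non-whitespace character.

    Assumption: the char-level model takes space-separated characters,
    as the demo strings suggest. Existing whitespace is dropped.
    """
    return " ".join(ch for ch in text if not ch.isspace())


def from_char_tokens(tokens: str) -> str:
    """Remove the separating spaces to get readable text back.

    Only suitable for languages written without spaces, such as Chinese.
    """
    return tokens.replace(" ", "")


if __name__ == "__main__":
    raw = "超载问题严重失控。"
    spaced = to_char_tokens(raw)
    print(spaced)
    print(from_char_tokens(spaced))
```

With helpers like these, raw text would be passed through `to_char_tokens` before calling `model.summarize`, and the returned summary through `from_char_tokens` for display.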

Requirements

  • PyTorch
  • tqdm
  • torchtext
  • dill
  • numpy
  • Python3
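Before installing, it can help to see which of the listed packages are already present. The following is a small sketch using only the standard library. The import names (`torch` for PyTorch, and the others under their package names) are the usual ones, and the helper `missing_packages` is a hypothetical name introduced here for illustration.

```python
import importlib.util

# Import names for the requirements listed above (PyTorch is imported as "torch").
REQUIRED = ["torch", "tqdm", "torchtext", "dill", "numpy"]


def missing_packages(names):
    """Return the names that cannot be found in the current environment.

    find_spec locates a top-level module without importing it.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All listed requirements are importable.")
```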

Install

$ pip install brief --user

Do you have pre-trained models?

Yes. We will release two models pre-trained on the LCSTS dataset: one word-level and one char-level.

How to train on your own dataset

python3 train.py --train_filename train_full.tsv \
                 --valid_filename valid_full.tsv \
                 --checkpoint_dir /data_hdd/brief_dev/checkpoint_exp \
                 --epoch 10 \
                 --batch_size 128
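The training command reads `.tsv` files, but the README does not document their column layout. The sketch below shows one way to produce such a file, assuming one example per line with the source text first and the summary second, separated by a tab. That layout is an assumption, so check `train.py` before relying on it. The helper names `format_tsv_line` and `write_pairs_tsv` are hypothetical.

```python
def format_tsv_line(source: str, summary: str) -> str:
    """Build one TSV line: source text, a tab, then the summary.

    Assumed layout (not confirmed by the README). Tabs and newlines inside
    a field are collapsed to single spaces so every example stays on one line.
    """
    def clean(field: str) -> str:
        return " ".join(field.split())

    return f"{clean(source)}\t{clean(summary)}\n"


def write_pairs_tsv(pairs, path: str) -> int:
    """Write (source, summary) pairs to path as UTF-8 TSV; return the line count."""
    count = 0
    with open(path, "w", encoding="utf-8") as f:
        for source, summary in pairs:
            f.write(format_tsv_line(source, summary))
            count += 1
    return count


if __name__ == "__main__":
    # Placeholder data for illustration only.
    print(format_tsv_line("first source text", "first summary"), end="")
```

Calling `write_pairs_tsv(pairs, "train_full.tsv")` with your own list of pairs would then produce a file with the name used in the command above.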

More examples

Updating...