theeluwin / Lexrankr

License: MIT
LexRank for Korean.

Programming Languages

python

Projects that are alternatives of or similar to Lexrankr

video-summarizer
Summarizes videos into much shorter videos. Ideal for long lecture videos.
Stars: ✭ 92 (+84%)
Mutual labels:  summarization
Copycat-abstractive-opinion-summarizer
ACL 2020 Unsupervised Opinion Summarization as Copycat-Review Generation
Stars: ✭ 76 (+52%)
Mutual labels:  summarization
Abstractive Summarization With Transfer Learning
Abstractive summarisation using Bert as encoder and Transformer Decoder
Stars: ✭ 358 (+616%)
Mutual labels:  summarization
query-focused-sum
Official code repository for "Exploring Neural Models for Query-Focused Summarization".
Stars: ✭ 17 (-66%)
Mutual labels:  summarization
sidenet
SideNet: Neural Extractive Summarization with Side Information
Stars: ✭ 52 (+4%)
Mutual labels:  summarization
summary-explorer
Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.
Stars: ✭ 34 (-32%)
Mutual labels:  summarization
PyRouge
A python library to compute rouge score for summarization
Stars: ✭ 54 (+8%)
Mutual labels:  summarization
Summary loop
Codebase for the Summary Loop paper at ACL2020
Stars: ✭ 26 (-48%)
Mutual labels:  summarization
FYP-AutoTextSum
Automatic Text Summarization with Machine Learning
Stars: ✭ 16 (-68%)
Mutual labels:  summarization
Statsbase.jl
Basic statistics for Julia
Stars: ✭ 326 (+552%)
Mutual labels:  summarization
text2text
Text2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (+276%)
Mutual labels:  summarization
article-summary-deep-learning
📖 Using deep learning and scraping to analyze/summarize articles! Just drop in any URL!
Stars: ✭ 18 (-64%)
Mutual labels:  summarization
TextRank-node
No description or website provided.
Stars: ✭ 21 (-58%)
Mutual labels:  summarization
textdigester
TextDigester: document summarization java library
Stars: ✭ 23 (-54%)
Mutual labels:  summarization
Headlines
Automatically generate headlines to short articles
Stars: ✭ 516 (+932%)
Mutual labels:  summarization
2021-dialogue-summary-competition
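[2021 Hunminjeongeum Korean Speech and Natural Language AI Competition] A repository for sharing the dialogue summarization training and inference code of team 알라꿍달라꿍 in the dialogue summarization track.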
Stars: ✭ 86 (+72%)
Mutual labels:  summarization
summarize-radiology-findings
Code and pretrained model for paper "Learning to Summarize Radiology Findings"
Stars: ✭ 63 (+26%)
Mutual labels:  summarization
Textrank
TextRank implementation for Python 3.
Stars: ✭ 1,008 (+1916%)
Mutual labels:  summarization
Pointer summarizer
pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"
Stars: ✭ 629 (+1158%)
Mutual labels:  summarization
Seq2seq Summarizer
Pointer-generator reinforced seq2seq summarization in PyTorch
Stars: ✭ 306 (+512%)
Mutual labels:  summarization

lexrankr

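Clustering-based, multi-document, selective text summarization using the LexRank algorithm.

This repository contains the source code for the paper: 설진석, 이상구. "lexrankr: LexRank 기반 한국어 다중 문서 요약" (lexrankr: LexRank-based Korean multi-document summarization). 한국정보과학회 학술발표논문집 (Proceedings of the Korean Institute of Information Scientists and Engineers conference), 2016: 458-460.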

  • Designed mainly for Korean, but not limited to it.
  • See the KoNLPy installation guide for how to install KoNLPy properly.
  • Check out textrankr, a simpler summarizer that uses TextRank.
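
For readers unfamiliar with LexRank, the general idea can be sketched in a few lines of plain Python. This is a simplified illustration, not the code in this package: it assumes plain bag-of-words cosine similarity (no IDF weighting), no self-links, a PageRank-style damping convention, and no clustering step. The names `cosine`, `lexrank_scores`, `threshold`, `damping`, and `iterations` are hypothetical.

```python
import math
from collections import Counter
from typing import List


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def lexrank_scores(
    sentences: List[List[str]],
    threshold: float = 0.1,   # hypothetical default, chosen for illustration
    damping: float = 0.85,    # hypothetical default, chosen for illustration
    iterations: int = 50,
) -> List[float]:
    """Score tokenized sentences by their centrality in a similarity graph."""
    n = len(sentences)
    if n == 0:
        return []
    bags = [Counter(tokens) for tokens in sentences]
    # Link two distinct sentences when their similarity exceeds the threshold.
    adj = [
        [1.0 if i != j and cosine(bags[i], bags[j]) > threshold else 0.0
         for j in range(n)]
        for i in range(n)
    ]
    degree = [sum(row) for row in adj]
    scores = [1.0 / n] * n
    # Power iteration: each sentence receives score from its neighbours.
    for _ in range(iterations):
        scores = [
            (1.0 - damping) / n
            + damping * sum(
                adj[j][i] * scores[j] / degree[j]
                for j in range(n) if degree[j] > 0
            )
            for i in range(n)
        ]
    return scores


if __name__ == "__main__":
    docs = [s.split() for s in [
        "the cat sat on the mat",
        "the cat lay on the mat",
        "stock prices fell sharply",
    ]]
    for tokens, score in zip(docs, lexrank_scores(docs)):
        print(round(score, 4), " ".join(tokens))
```

Sentences that resemble many other sentences end up with higher scores, and the highest-scoring ones are the candidates for the summary.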

Installation

pip install lexrankr

Tokenizers

Tokenizers are not included; you have to implement one yourself. A tokenizer is any callable that takes a string and returns a list of string tokens.

Example:

from typing import List

class MyTokenizer:
    def __call__(self, text: str) -> List[str]:
        tokens: List[str] = text.split()
        return tokens

For Korean, one option is to use KoNLPy.

from typing import List
from konlpy.tag import Okt

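class OktTokenizer:
    # Shared Okt instance, created once at class definition time.
    okt: Okt = Okt()

    def __call__(self, text: str) -> List[str]:
        # norm/stem normalize and stem the tokens; join=True yields "token/POS" strings.
        tokens: List[str] = self.okt.pos(text, norm=True, stem=True, join=True)
        return tokens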
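
When no morphological analyzer is available, a regex-based tokenizer with the same callable shape is another option. The sketch below is only an illustration: `RegexTokenizer` and its `stopwords` parameter are hypothetical names that are not part of lexrankr, and splitting on word characters is a rough approximation for Korean text.

```python
import re
from typing import FrozenSet, List


class RegexTokenizer:
    """Hypothetical tokenizer: lowercases, keeps word-character runs, drops stopwords."""

    def __init__(self, stopwords: FrozenSet[str] = frozenset()) -> None:
        self.stopwords = stopwords
        # \w+ matches runs of Unicode word characters, so Hangul is kept too.
        self.pattern = re.compile(r"\w+")

    def __call__(self, text: str) -> List[str]:
        tokens: List[str] = self.pattern.findall(text.lower())
        return [token for token in tokens if token not in self.stopwords]


if __name__ == "__main__":
    tokenizer = RegexTokenizer(stopwords=frozenset({"the", "a"}))
    print(tokenizer("The cat sat on a mat."))
```

An instance of this class can be used anywhere the `MyTokenizer` example above is used, since both are callables from `str` to `List[str]`.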

Usage

from typing import List
from lexrankr import LexRank

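# 1. Initialize LexRank with a tokenizer (MyTokenizer is defined above).
mytokenizer: MyTokenizer = MyTokenizer()
lexrank: LexRank = LexRank(mytokenizer)

# 2. Summarize: the pre-computation step.
#    your_text_here is the str holding the text to summarize.
lexrank.summarize(your_text_here)

# 3. Probe: the query-time step that returns the summary sentences.
summaries: List[str] = lexrank.probe()
for summary in summaries:
    print(summary)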

Test

Run the tests with Docker.

docker build -t lexrankr -f Dockerfile .
docker run --rm -it lexrankr