theeluwin / Lexrankr
License: MIT
LexRank for Korean.
Stars: ✭ 50
Programming Languages
python
Projects that are alternatives of or similar to Lexrankr
video-summarizer
Summarizes videos into much shorter videos. Ideal for long lecture videos.
Stars: ✭ 92 (+84%)
Mutual labels: summarization
Copycat-abstractive-opinion-summarizer
ACL 2020 Unsupervised Opinion Summarization as Copycat-Review Generation
Stars: ✭ 76 (+52%)
Mutual labels: summarization
Abstractive Summarization With Transfer Learning
Abstractive summarisation using Bert as encoder and Transformer Decoder
Stars: ✭ 358 (+616%)
Mutual labels: summarization
query-focused-sum
Official code repository for "Exploring Neural Models for Query-Focused Summarization".
Stars: ✭ 17 (-66%)
Mutual labels: summarization
sidenet
SideNet: Neural Extractive Summarization with Side Information
Stars: ✭ 52 (+4%)
Mutual labels: summarization
summary-explorer
Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.
Stars: ✭ 34 (-32%)
Mutual labels: summarization
PyRouge
A python library to compute rouge score for summarization
Stars: ✭ 54 (+8%)
Mutual labels: summarization
Summary loop
Codebase for the Summary Loop paper at ACL2020
Stars: ✭ 26 (-48%)
Mutual labels: summarization
FYP-AutoTextSum
Automatic Text Summarization with Machine Learning
Stars: ✭ 16 (-68%)
Mutual labels: summarization
text2text
Text2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (+276%)
Mutual labels: summarization
article-summary-deep-learning
📖 Using deep learning and scraping to analyze/summarize articles! Just drop in any URL!
Stars: ✭ 18 (-64%)
Mutual labels: summarization
textdigester
TextDigester: document summarization java library
Stars: ✭ 23 (-54%)
Mutual labels: summarization
Headlines
Automatically generate headlines to short articles
Stars: ✭ 516 (+932%)
Mutual labels: summarization
2021-dialogue-summary-competition
[2021 Hunminjeongeum Korean Speech & Natural Language AI Competition] A repo sharing the dialogue-summarization training and inference code of team 알라꿍달라꿍.
Stars: ✭ 86 (+72%)
Mutual labels: summarization
summarize-radiology-findings
Code and pretrained model for paper "Learning to Summarize Radiology Findings"
Stars: ✭ 63 (+26%)
Mutual labels: summarization
Pointer summarizer
pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"
Stars: ✭ 629 (+1158%)
Mutual labels: summarization
Seq2seq Summarizer
Pointer-generator reinforced seq2seq summarization in PyTorch
Stars: ✭ 306 (+512%)
Mutual labels: summarization
lexrankr
Clustering-based multi-document selective text summarization using the LexRank algorithm.
This repository contains the source code for the paper: 설진석, 이상구. "lexrankr: LexRank 기반 한국어 다중 문서 요약." 한국정보과학회 학술발표논문집 (2016): 458-460.
- Mostly designed for Korean, but not limited to it.
- See the KoNLPy documentation for how to install it properly.
- Check out textrankr, a simpler summarizer based on TextRank.
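As a rough sketch of the underlying idea (not this library's implementation), LexRank builds a sentence-similarity graph and scores sentence centrality with PageRank-style power iteration:

```python
from math import sqrt
from typing import Dict, List

def cosine(a: Dict[str, int], b: Dict[str, int]) -> float:
    # Cosine similarity between two bag-of-words term-frequency vectors.
    dot = sum(v * b.get(t, 0) for t, v in a.items())
    na = sqrt(sum(v * v for v in a.values()))
    nb = sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def lexrank_scores(sentences: List[List[str]],
                   damping: float = 0.85, iters: int = 50) -> List[float]:
    # Bag-of-words vector per (already tokenized) sentence.
    bows = [{t: sent.count(t) for t in sent} for sent in sentences]
    n = len(bows)
    # Row-normalized similarity matrix = transition matrix of a random walk.
    sim = [[cosine(bows[i], bows[j]) for j in range(n)] for i in range(n)]
    for i in range(n):
        total = sum(sim[i])
        sim[i] = [v / total if total else 1.0 / n for v in sim[i]]
    # PageRank-style power iteration; a higher score means a more central sentence.
    scores = [1.0 / n] * n
    for _ in range(iters):
        scores = [(1.0 - damping) / n
                  + damping * sum(scores[j] * sim[j][i] for j in range(n))
                  for i in range(n)]
    return scores
```

The top-scoring sentences are then taken as the extractive summary; lexrankr additionally clusters sentences to reduce redundancy, which this sketch omits.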
Installation
pip install lexrankr
Tokenizers
A tokenizer is not included; you have to provide one yourself.
Example:
from typing import List

class MyTokenizer:
    def __call__(self, text: str) -> List[str]:
        tokens: List[str] = text.split()
        return tokens
For Korean, one option is to use KoNLPy:
from typing import List
from konlpy.tag import Okt

class OktTokenizer:
    okt: Okt = Okt()

    def __call__(self, text: str) -> List[str]:
        tokens: List[str] = self.okt.pos(text, norm=True, stem=True, join=True)
        return tokens
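Any callable that maps a string to a list of string tokens satisfies the interface, so tokenizers are interchangeable. As an illustrative sketch (hypothetical, not part of lexrankr), even a dictionary-free character-bigram tokenizer fits:

```python
from typing import List

class BigramTokenizer:
    # Character bigrams: a simple, language-agnostic fallback tokenizer.
    # Strips spaces, then emits every overlapping pair of characters.
    def __call__(self, text: str) -> List[str]:
        chars = text.replace(" ", "")
        return [chars[i:i + 2] for i in range(len(chars) - 1)]

tokenizer = BigramTokenizer()
print(tokenizer("한국어 요약"))  # ['한국', '국어', '어요', '요약']
```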
Usage
from typing import List
from lexrankr import LexRank

# 1. init
mytokenizer: MyTokenizer = MyTokenizer()
lexrank: LexRank = LexRank(mytokenizer)

# 2. summarize (pre-computation)
lexrank.summarize(your_text_here)

# 3. probe (query time)
summaries: List[str] = lexrank.probe()
for summary in summaries:
    print(summary)
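The summarize/probe split separates heavy pre-computation from cheap query-time reads, so one summarize() call can serve repeated probes. A toy sketch of that pattern (not lexrankr's internals; the class name, the k parameter, and the longest-first ranking are all placeholders):

```python
from typing import List, Optional

class ToySummarizer:
    # Mimics the two-phase API: summarize() does the heavy work once,
    # probe() reads the cached ranking cheaply and repeatedly.
    def __init__(self) -> None:
        self._ranked: Optional[List[str]] = None

    def summarize(self, text: str) -> None:
        # Pre-computation: split into sentences and rank them
        # (longest-first here, as a stand-in for LexRank scoring).
        sentences = [s.strip() for s in text.split(".") if s.strip()]
        self._ranked = sorted(sentences, key=len, reverse=True)

    def probe(self, k: int = 2) -> List[str]:
        if self._ranked is None:
            raise RuntimeError("call summarize() first")
        return self._ranked[:k]
```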
Test
Use Docker:
docker build -t lexrankr -f Dockerfile .
docker run --rm -it lexrankr