saidziani / Sumrized

Licence: GPL-3.0 license

Automatic Text Summarization (English/Arabic).

Programming Languages

Jupyter Notebook

11667 projects

python

139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Sumrized

pytorch-translm

An implementation of transformer-based language model for sentence rewriting tasks such as summarization, simplification, and grammatical error correction.

Stars: ✭ 22 (-40.54%)

Mutual labels: text-summarization, nlp-machine-learning

arabic-tagger

AQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training

Stars: ✭ 38 (+2.7%)

Mutual labels: nlp-machine-learning, arabic-nlp

Text summarization with tensorflow

Implementation of a seq2seq model for summarization of textual data. Demonstrated on amazon reviews, github issues and news articles.

Stars: ✭ 226 (+510.81%)

Mutual labels: text-summarization

NLP-Flask-Website

A simple Flask website for all NLP tasks which includes Text Preprocessing, Keyword Extraction, Text Summarization etc. Created Date: 30 Jan 2019

Stars: ✭ 43 (+16.22%)

Mutual labels: nlp-machine-learning

query-wellformedness

25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with human ratings of whether they are well-formed natural language questions.

Stars: ✭ 80 (+116.22%)

Mutual labels: nlp-machine-learning

tkseem

Arabic Tokenization Library. It provides many tokenization algorithms.

Stars: ✭ 45 (+21.62%)

Mutual labels: arabic-nlp

awesome-yoruba-nlp

📖 A curated list of resources dedicated to Natural Language Processing (NLP) in the Yoruba Language.

Stars: ✭ 21 (-43.24%)

Mutual labels: nlp-machine-learning

Kr Wordrank

비지도학습 방법으로 한국어 텍스트에서 단어/키워드를 자동으로 추출하는 라이브러리입니다

Stars: ✭ 182 (+391.89%)

Mutual labels: text-summarization

NLP-Extractive-NEWS-summarization-using-MMR

A simple python implementation of the Maximal Marginal Relevance (MMR) baseline system for text summarization.

Stars: ✭ 59 (+59.46%)

Mutual labels: text-summarization

TextSumma

reimplementing Neural Summarization by Extracting Sentences and Words

Stars: ✭ 16 (-56.76%)

Mutual labels: text-summarization

SENet-for-Weakly-Supervised-Relation-Extraction

No description or website provided.

Stars: ✭ 39 (+5.41%)

Mutual labels: nlp-machine-learning

merkalysis

A marketing tool that helps you to market your products using organic marketing. This tool can potentially save you 1000s of dollars every year. The tool predicts the reach of your posts on social media and also suggests you hashtags for captions in such a way that it increases your reach.

Stars: ✭ 28 (-24.32%)

Mutual labels: nlp-machine-learning

presidential-rnn

Project 4 for Metis bootcamp. Objective was generation of character-level RNN trained on Donald Trump's statements using Keras. Also generated Markov chains, and quick pyTorch RNN as baseline. Attempted semi-supervised GAN, but was unable to test in time.

Stars: ✭ 26 (-29.73%)

Mutual labels: nlp-machine-learning

email-summarization

A module for E-mail Summarization which uses clustering of skip-thought sentence embeddings.

Stars: ✭ 81 (+118.92%)

Mutual labels: text-summarization

Paribhasha

paribhasha.herokuapp.com/

Stars: ✭ 21 (-43.24%)

Mutual labels: nlp-machine-learning

fake-news

This is a further development of the kdnuggets article on fake news classification by George McIntyre

Stars: ✭ 15 (-59.46%)

Mutual labels: nlp-machine-learning

Text Summarizer Pytorch

Pytorch implementation of "A Deep Reinforced Model for Abstractive Summarization" paper and pointer generator network

Stars: ✭ 203 (+448.65%)

Mutual labels: text-summarization

schrutepy

The Entire Transcript from the Office in Tidy Format

Stars: ✭ 22 (-40.54%)

Mutual labels: nlp-machine-learning

nmatheg

A simple strategy for training and finetuning NLP models for Arabic. Specify the parameters and just wait for the results. A simple design that makes use of the different tools in our NLP pipeline.

Stars: ✭ 19 (-48.65%)

Mutual labels: arabic-nlp

scicle-stopclickbait

Userscript that changes Clickbait headlines by headlines more honest to the news it links to.

Stars: ✭ 16 (-56.76%)

Mutual labels: nlp-machine-learning

View All Similar Projects ➔

Arabic Text Summarization

Starting Project

Make sure you have installed pip (Python 3)
```
sudo apt-get install python3-pip
```
Make sure you have NLTK (Natural Language Tool Kit) installed (Python 3)
```
sudo pip install -U nltk
```

Project content

.  
├── docs                    <- All documentation about project
│   ├── reports             <- Reports for current project advancement 
│   ├── references          <- All references papers, links related to this project goes here   
│   └── sphinx              <- Automaticaly genereated API documentation form stringdocs in code
│
├── lib                     <- All project's source code goes here
│   ├── data-generation     <- Code for data generation if needed 
│   └── preprocessing       <- Code for data preprocessing 
│ 
├── models                  <- Contains code to train, test and run models 
│   ├── dumps               <- trained models file
│   └── scripts             <- script to run models 
│ 
├── Readme.md               <- Contains current project info
├── requirements.txt        <- Packages and modules needed for the current project to run
└── tests                   <- Unit test for the code in lib/ 
    └── lib  
        ├── analysis  
        ├── data-generation   
        └── preprocessing

Guide lines

Data

Symlink to your Raw data

    user@host:/my/awesome/project$ ln -s /path/to/your/raw/data .

Data location must follow this structure:

data  
├── raw             <- Raw data
├── temp            <- transformed data stored temporarily if needed 
└── preprocessed    <- preprocessed data to run in a model

Data is immutable

Treat the data/raw (and its format) as immutable. Don't ever edit your raw data, especially not manually, and especially not in Excel.
Don't overwrite your raw data. Don't save multiple versions of the raw data.
The code you write should move the raw data through a pipeline to your final analysis.
You shouldn't have to run all of the steps every time you want to make a new figure, but anyone should be able to reproduce the final products with only the code in lib/ and the data in data/raw.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

saidziani / Sumrized

Programming Languages

Labels

Projects that are alternatives of or similar to Sumrized

Arabic Text Summarization

Starting Project

Project content

Guide lines

Data

Data is immutable