All Projects → saidziani → Sumrized

saidziani / Sumrized

Licence: GPL-3.0 license
Automatic Text Summarization (English/Arabic).

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Sumrized

pytorch-translm
An implementation of transformer-based language model for sentence rewriting tasks such as summarization, simplification, and grammatical error correction.
Stars: ✭ 22 (-40.54%)
Mutual labels:  text-summarization, nlp-machine-learning
arabic-tagger
AQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training
Stars: ✭ 38 (+2.7%)
Mutual labels:  nlp-machine-learning, arabic-nlp
Text summarization with tensorflow
Implementation of a seq2seq model for summarization of textual data. Demonstrated on amazon reviews, github issues and news articles.
Stars: ✭ 226 (+510.81%)
Mutual labels:  text-summarization
NLP-Flask-Website
A simple Flask website for all NLP tasks which includes Text Preprocessing, Keyword Extraction, Text Summarization etc. Created Date: 30 Jan 2019
Stars: ✭ 43 (+16.22%)
Mutual labels:  nlp-machine-learning
query-wellformedness
25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with human ratings of whether they are well-formed natural language questions.
Stars: ✭ 80 (+116.22%)
Mutual labels:  nlp-machine-learning
tkseem
Arabic Tokenization Library. It provides many tokenization algorithms.
Stars: ✭ 45 (+21.62%)
Mutual labels:  arabic-nlp
awesome-yoruba-nlp
📖 A curated list of resources dedicated to Natural Language Processing (NLP) in the Yoruba Language.
Stars: ✭ 21 (-43.24%)
Mutual labels:  nlp-machine-learning
Kr Wordrank
비지도학습 방법으로 한국어 텍스트에서 단어/키워드를 자동으로 추출하는 라이브러리입니다
Stars: ✭ 182 (+391.89%)
Mutual labels:  text-summarization
NLP-Extractive-NEWS-summarization-using-MMR
A simple python implementation of the Maximal Marginal Relevance (MMR) baseline system for text summarization.
Stars: ✭ 59 (+59.46%)
Mutual labels:  text-summarization
TextSumma
reimplementing Neural Summarization by Extracting Sentences and Words
Stars: ✭ 16 (-56.76%)
Mutual labels:  text-summarization
SENet-for-Weakly-Supervised-Relation-Extraction
No description or website provided.
Stars: ✭ 39 (+5.41%)
Mutual labels:  nlp-machine-learning
merkalysis
A marketing tool that helps you to market your products using organic marketing. This tool can potentially save you 1000s of dollars every year. The tool predicts the reach of your posts on social media and also suggests you hashtags for captions in such a way that it increases your reach.
Stars: ✭ 28 (-24.32%)
Mutual labels:  nlp-machine-learning
presidential-rnn
Project 4 for Metis bootcamp. Objective was generation of character-level RNN trained on Donald Trump's statements using Keras. Also generated Markov chains, and quick pyTorch RNN as baseline. Attempted semi-supervised GAN, but was unable to test in time.
Stars: ✭ 26 (-29.73%)
Mutual labels:  nlp-machine-learning
email-summarization
A module for E-mail Summarization which uses clustering of skip-thought sentence embeddings.
Stars: ✭ 81 (+118.92%)
Mutual labels:  text-summarization
Paribhasha
paribhasha.herokuapp.com/
Stars: ✭ 21 (-43.24%)
Mutual labels:  nlp-machine-learning
fake-news
This is a further development of the kdnuggets article on fake news classification by George McIntyre
Stars: ✭ 15 (-59.46%)
Mutual labels:  nlp-machine-learning
Text Summarizer Pytorch
Pytorch implementation of "A Deep Reinforced Model for Abstractive Summarization" paper and pointer generator network
Stars: ✭ 203 (+448.65%)
Mutual labels:  text-summarization
schrutepy
The Entire Transcript from the Office in Tidy Format
Stars: ✭ 22 (-40.54%)
Mutual labels:  nlp-machine-learning
nmatheg
A simple strategy for training and finetuning NLP models for Arabic. Specify the parameters and just wait for the results. A simple design that makes use of the different tools in our NLP pipeline.
Stars: ✭ 19 (-48.65%)
Mutual labels:  arabic-nlp
scicle-stopclickbait
Userscript that changes Clickbait headlines by headlines more honest to the news it links to.
Stars: ✭ 16 (-56.76%)
Mutual labels:  nlp-machine-learning

Sumrized

Arabic Text Summarization

Starting Project

  • Make sure you have installed pip (Python 3)

    sudo apt-get install python3-pip
    
  • Make sure you have NLTK (Natural Language Tool Kit) installed (Python 3)

    sudo pip install -U nltk
    

Project content

.  
├── docs                    <- All documentation about project
│   ├── reports             <- Reports for current project advancement 
│   ├── references          <- All references papers, links related to this project goes here   
│   └── sphinx              <- Automaticaly genereated API documentation form stringdocs in code
│
├── lib                     <- All project's source code goes here
│   ├── data-generation     <- Code for data generation if needed 
│   └── preprocessing       <- Code for data preprocessing 
│ 
├── models                  <- Contains code to train, test and run models 
│   ├── dumps               <- trained models file
│   └── scripts             <- script to run models 
│ 
├── Readme.md               <- Contains current project info
├── requirements.txt        <- Packages and modules needed for the current project to run
└── tests                   <- Unit test for the code in lib/ 
    └── lib  
        ├── analysis  
        ├── data-generation   
        └── preprocessing
   

Guide lines

Data

  • Symlink to your Raw data

        user@host:/my/awesome/project$ ln -s /path/to/your/raw/data . 
    
  • Data location must follow this structure:

data  
├── raw             <- Raw data
├── temp            <- transformed data stored temporarily if needed 
└── preprocessed    <- preprocessed data to run in a model

Data is immutable

  • Treat the data/raw (and its format) as immutable. Don't ever edit your raw data, especially not manually, and especially not in Excel.
  • Don't overwrite your raw data. Don't save multiple versions of the raw data.
  • The code you write should move the raw data through a pipeline to your final analysis.
  • You shouldn't have to run all of the steps every time you want to make a new figure, but anyone should be able to reproduce the final products with only the code in lib/ and the data in data/raw.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].