All Projects → trinker → readability

trinker / readability

Licence: other
Fast readability scores for text data

Programming Languages

r
7636 projects

Projects that are alternatives of or similar to readability

trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (+3131.82%)
Mutual labels:  text-mining, readability
Gwu data mining
Materials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (+886.36%)
Mutual labels:  text-mining
Awesome Text Classification
Awesome-Text-Classification Projects,Papers,Tutorial .
Stars: ✭ 158 (+618.18%)
Mutual labels:  text-mining
Hdltex
HDLTex: Hierarchical Deep Learning for Text Classification
Stars: ✭ 191 (+768.18%)
Mutual labels:  text-mining
Lazynlp
Library to scrape and clean web pages to create massive datasets.
Stars: ✭ 1,985 (+8922.73%)
Mutual labels:  text-mining
Fake news detection
Fake News Detection in Python
Stars: ✭ 194 (+781.82%)
Mutual labels:  text-mining
Chemdataextractor
Automatically extract chemical information from scientific documents
Stars: ✭ 152 (+590.91%)
Mutual labels:  text-mining
text-analysis
Weaving analytical stories from text data
Stars: ✭ 12 (-45.45%)
Mutual labels:  text-mining
Cnn Text Classification Keras
Text Classification by Convolutional Neural Network in Keras
Stars: ✭ 213 (+868.18%)
Mutual labels:  text-mining
Breadability
Reworked https://www.readability.com/ parsing library (now https://mercury.postlight.com/ is living alternative)
Stars: ✭ 186 (+745.45%)
Mutual labels:  text-mining
Texthero
Text preprocessing, representation and visualization from zero to hero.
Stars: ✭ 2,407 (+10840.91%)
Mutual labels:  text-mining
Tokenizers
Fast, Consistent Tokenization of Natural Language Text
Stars: ✭ 161 (+631.82%)
Mutual labels:  text-mining
Shallowlearn
An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+790.91%)
Mutual labels:  text-mining
Udpipe
R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit
Stars: ✭ 160 (+627.27%)
Mutual labels:  text-mining
Aravec
AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.
Stars: ✭ 239 (+986.36%)
Mutual labels:  text-mining
Awesome Nlp
📖 A curated list of resources dedicated to Natural Language Processing (NLP)
Stars: ✭ 12,626 (+57290.91%)
Mutual labels:  text-mining
Nlp profiler
A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Stars: ✭ 181 (+722.73%)
Mutual labels:  text-mining
Pyss3
A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (+768.18%)
Mutual labels:  text-mining
koshort
(deprecated) 🐱 koshort is a Python package for Korean internet spoken language crawling and processing... or maybe Korean domestic cat.
Stars: ✭ 62 (+181.82%)
Mutual labels:  text-mining
clustext
Easy, fast clustering of texts
Stars: ✭ 18 (-18.18%)
Mutual labels:  text-mining

readability Follow

Project Status: Inactive – The project has reached a stable, usable state but is no longer being actively developed; support/maintenance will be provided as time allows. Build Status Coverage Status DOI Version

readability utilizes the syllable package for fast calculation of readability scores by grouping variables.

Table of Contents

Installation

To download the development version of readability:

Download the zip ball or tar ball, decompress and run R CMD INSTALL on it, or use the pacman package to install the development version:

if (!require("pacman")) install.packages("pacman")
pacman::p_load_gh(c(
    'trinker/lexicon',
    'trinker/textclean',
    'trinker/textshape',
    "trinker/syllable", 
    "trinker/readability"
))

Contact

You are welcome to:

Examples

if (!require("pacman")) install.packages("pacman")
pacman::p_load(syllable, readability)

(x <- with(presidential_debates_2012, readability(dialogue, list(person, time))))

##        person   time Flesch_Kincaid Gunning_Fog_Index Coleman_Liau SMOG
##  1:  QUESTION time 2            8.6              12.5         10.1 12.0
##  2:     OBAMA time 1            9.1              12.2          8.9 11.4
##  3:     OBAMA time 3            8.5              12.0          8.8 11.5
##  4:    ROMNEY time 1            6.8              10.2          8.1 10.3
##  5:     OBAMA time 2            6.8              10.2          7.8 10.3
##  6:    ROMNEY time 3            6.7              10.1          7.5 10.2
##  7:    ROMNEY time 2            6.1               9.3          7.4  9.7
##  8: SCHIEFFER time 3            5.1               8.7          6.8  9.3
##  9:    LEHRER time 1            4.3               8.3          5.9  9.0
## 10:   CROWLEY time 2            4.3               7.6          5.6  8.5
##     Automated_Readability_Index Average_Grade_Level
##  1:                         8.0                10.2
##  2:                         9.0                10.1
##  3:                         8.0                 9.7
##  4:                         6.0                 8.3
##  5:                         6.0                 8.2
##  6:                         6.0                 8.1
##  7:                         5.0                 7.5
##  8:                         4.0                 6.8
##  9:                         3.0                 6.1
## 10:                         3.0                 5.8

plot(x)

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].