All Projects → waseem18 → Node Rake

waseem18 / Node Rake

Licence: mit
A NodeJS implementation of the Rapid Automatic Keyword Extraction algorithm.

Programming Languages

javascript
184084 projects - #8 most used programming language

Projects that are alternatives of or similar to Node Rake

Diff Match Patch
Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.
Stars: ✭ 4,910 (+5676.47%)
Mutual labels:  text-processing
Qp Trie Rs
An idiomatic and fast QP-trie implementation in pure Rust.
Stars: ✭ 47 (-44.71%)
Mutual labels:  text-processing
Applied Text Mining In Python
Repo for Applied Text Mining in Python (coursera) by University of Michigan
Stars: ✭ 59 (-30.59%)
Mutual labels:  text-processing
Whatlanggo
Natural language detection library for Go
Stars: ✭ 479 (+463.53%)
Mutual labels:  text-processing
Concise Ipython Notebooks For Deep Learning
Ipython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.
Stars: ✭ 23 (-72.94%)
Mutual labels:  text-processing
Lingua Franca
Mycroft's multilingual text parsing and formatting library
Stars: ✭ 51 (-40%)
Mutual labels:  text-processing
Ekphrasis
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Stars: ✭ 433 (+409.41%)
Mutual labels:  text-processing
Unix Text Commands
Unix Text Processing Command Reference
Stars: ✭ 78 (-8.24%)
Mutual labels:  text-processing
Fxt
A large scale feature extraction tool for text-based machine learning
Stars: ✭ 25 (-70.59%)
Mutual labels:  text-processing
Javascript Text Expander
Expands texts as you type, naturally
Stars: ✭ 58 (-31.76%)
Mutual labels:  text-processing
Gohn
Hatena Notation (はてな記法) Parser written in Go
Stars: ✭ 17 (-80%)
Mutual labels:  text-processing
Text Mining
Text Mining in Python
Stars: ✭ 18 (-78.82%)
Mutual labels:  text-processing
Pipeit
PipeIt is a text transformation, conversion, cleansing and extraction tool.
Stars: ✭ 57 (-32.94%)
Mutual labels:  text-processing
Python Nameparser
A simple Python module for parsing human names into their individual components
Stars: ✭ 462 (+443.53%)
Mutual labels:  text-processing
Ter
Text Expression Runner – Readable and easy to use text expressions
Stars: ✭ 67 (-21.18%)
Mutual labels:  text-processing
Open Korean Text
Open Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+415.29%)
Mutual labels:  text-processing
Pyparsing
Python library for creating PEG parsers
Stars: ✭ 1,052 (+1137.65%)
Mutual labels:  text-processing
Kefirbb
A flexible Java text processor. BB, BBCode, BB-code, HTML, Textile, Markdown, parser, translator, converter.
Stars: ✭ 83 (-2.35%)
Mutual labels:  text-processing
Virastar
Cleaning-up Persian Texts!
Stars: ✭ 77 (-9.41%)
Mutual labels:  text-processing
Go Search Replace
🚀 Search & replace URLs in WordPress SQL files.
Stars: ✭ 57 (-32.94%)
Mutual labels:  text-processing

node-rake

Build Status npm npm

A NodeJS implementation of the Rapid Automatic Keyword Extraction algorithm.

Installation

npm install node-rake

Usage

rake.generate(text, opts);

The opts param is an object that allows to pass custom params to generate method. Options:

  • stopwords: Optional. An array containing a custom stopwords list. By default, the method uses a stopwords list which comes along (take a look at Stopwords source).

Example of usage:

const rake = require('node-rake')
const keywords = rake.generate("LDA stands for Latent Dirichlet Allocation")
// it'll output: [ 'Latent Dirichlet Allocation', 'LDA stands' ]

//or

const myStopwords = ['for', 'the', 'a', 'stands', 'test', 'man', 'woman'];
const opts = {stopwords: myStopwords};

const keywords = rake.generate("LDA stands for Latent Dirichlet Allocation", opts);
// it'll output: [ 'Latent Dirichlet Allocation', 'LDA' ]

Algorithm sources:

1.https://www.researchgate.net/publication/227988510_Automatic_Keyword_Extraction_from_Individual_Documents 2.https://www.ijarcsse.com/docs/papers/Volume_6/5_May2016/V6I5-0392.pdf

Stopwords source:

  1. http://dev.mysql.com/doc/refman/5.7/en/fulltext-stopwords.html

Example :

  1. https://runkit.com/waseem18/59fe9c7d20e52f0012606fbe
  2. https://runkit.com/waseem18/59fe9d1b5fbc910012f99891

npm

Note

Looking for a maintainer for this project.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].