All Projects → eellak → nlpbuddy

eellak / nlpbuddy

Licence: AGPL-3.0 license
A text analysis application for performing common NLP tasks through a web dashboard interface and an API

Programming Languages

HTML
75241 projects
javascript
184084 projects - #8 most used programming language
CSS
56736 projects
python
139335 projects - #7 most used programming language
Makefile
30231 projects

Projects that are alternatives of or similar to nlpbuddy

Shallowlearn
An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+70.43%)
Mutual labels:  text-classification, gensim, fasttext
Text Analytics With Python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Stars: ✭ 1,132 (+884.35%)
Mutual labels:  text-classification, spacy, gensim
Fastrtext
R wrapper for fastText
Stars: ✭ 103 (-10.43%)
Mutual labels:  text-classification, fasttext
Ml Projects
ML based projects such as Spam Classification, Time Series Analysis, Text Classification using Random Forest, Deep Learning, Bayesian, Xgboost in Python
Stars: ✭ 127 (+10.43%)
Mutual labels:  text-classification, gensim
Awesome Text Classification
Awesome-Text-Classification Projects,Papers,Tutorial .
Stars: ✭ 158 (+37.39%)
Mutual labels:  text-classification, text-analysis
Fasttext.py
A Python interface for Facebook fastText
Stars: ✭ 1,091 (+848.7%)
Mutual labels:  text-classification, fasttext
Doc2vec
📓 Long(er) text representation and classification using Doc2Vec embeddings
Stars: ✭ 92 (-20%)
Mutual labels:  text-classification, gensim
Text Classification Demos
Neural models for Text Classification in Tensorflow, such as cnn, dpcnn, fasttext, bert ...
Stars: ✭ 144 (+25.22%)
Mutual labels:  text-classification, fasttext
Nlp In Practice
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (+586.96%)
Mutual labels:  text-classification, gensim
Fake news detection
Fake News Detection in Python
Stars: ✭ 194 (+68.7%)
Mutual labels:  text-classification, text-analysis
Ai law
all kinds of baseline models for long text classificaiton( text categorization)
Stars: ✭ 243 (+111.3%)
Mutual labels:  text-classification, fasttext
Keras Textclassification
中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastText,TextCNN,CharCNN,TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN
Stars: ✭ 914 (+694.78%)
Mutual labels:  text-classification, fasttext
Text classification
all kinds of text classification models and more with deep learning
Stars: ✭ 7,179 (+6142.61%)
Mutual labels:  text-classification, fasttext
Bert language understanding
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
Stars: ✭ 933 (+711.3%)
Mutual labels:  text-classification, fasttext
nlp workshop odsc europe20
Extensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage machine learning, deep learning and deep transfer learning to learn and solve popular tasks using NLP including NER, Classification, Recommendation \ Information Retrieval, Summarization, Classification, Language Translation, Q&A and T…
Stars: ✭ 127 (+10.43%)
Mutual labels:  spacy, gensim
Fasttext.js
FastText for Node.js
Stars: ✭ 127 (+10.43%)
Mutual labels:  text-classification, fasttext
Meta
A Modern C++ Data Sciences Toolkit
Stars: ✭ 600 (+421.74%)
Mutual labels:  text-classification, text-analysis
Textclassification Keras
Text classification models implemented in Keras, including: FastText, TextCNN, TextRNN, TextBiRNN, TextAttBiRNN, HAN, RCNN, RCNNVariant, etc.
Stars: ✭ 621 (+440%)
Mutual labels:  text-classification, fasttext
Textvec
Text vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (+45.22%)
Mutual labels:  text-classification, text-analysis
NLP Quickbook
NLP in Python with Deep Learning
Stars: ✭ 516 (+348.7%)
Mutual labels:  text-classification, spacy

NLPBuddy - Open Source Text Analysis Tool

About the project

NLPBuddy is a text analysis application for performing common NLP tasks through a web dashboard interface and an API.

It leverages Spacy for the NLP tasks plus Gensim's implementation of the TextRank algorithm for text summarization.

It supports texts in the following languages: Greek, English, German, Spanish, Portoguese, French, Italian and Dutch. Language identification is performed automatically through langid

Tasks include:

  1. Text tokenization
  2. Sentence splitting (lemmatized sentences too)
  3. Part of Speech tags identification (verbs, nouns etc)
  4. Named Entity Recognition (Location, Person, Organisation etc)
  5. Text summarization (using TextRank algorithm, implemented by Gensim)
  6. Keywords extraction
  7. Language identification
  8. For the Greek language, Categorization of text

Text can either be provided or imported after specifying a url - we use library python readability for this plus BeautifulSoup4

The Greek classifier is built with FastText and is trained in 20.000 articles labeled in these categories.

Demo

A working demo can be found on http://www.nlpbuddy.io/

Usage

Enter text and hit 'Analyze it',

alt text

API Usage

https://github.com/eellak/text-analysis/wiki/API-usage

Installation

Find development and deployment instructions here: https://github.com/eellak/text-analysis/wiki/Install

License

The code is provided under the GNU AGPL v3.0 License.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].