the-new-sky / Kadot

License: MIT
Kadot, the unsupervised natural language processing library.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Kadot

Concise Ipython Notebooks For Deep Learning
Ipython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.
Stars: ✭ 23 (-78.7%)
Mutual labels:  text-classification, word-embeddings, text-generation
Textblob Ar
Arabic support for textblob
Stars: ✭ 60 (-44.44%)
Mutual labels:  natural-language-processing, text-classification, word-embeddings
Fastrtext
R wrapper for fastText
Stars: ✭ 103 (-4.63%)
Mutual labels:  text-classification, word-embeddings
Market Reporter
Automatic Generation of Brief Summaries of Time-Series Data
Stars: ✭ 54 (-50%)
Mutual labels:  natural-language-processing, text-generation
Delta
DELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+1269.44%)
Mutual labels:  text-classification, text-generation
Greynir
The greynir.is natural language processing website for Icelandic
Stars: ✭ 47 (-56.48%)
Mutual labels:  tokenizer, natural-language-processing
Thot
Thot toolkit for statistical machine translation
Stars: ✭ 53 (-50.93%)
Mutual labels:  tokenizer, natural-language-processing
Scdv
Text classification with Sparse Composite Document Vectors.
Stars: ✭ 54 (-50%)
Mutual labels:  natural-language-processing, text-classification
Easy Deep Learning With Allennlp
🔮Deep Learning for text made easy with AllenNLP
Stars: ✭ 32 (-70.37%)
Mutual labels:  natural-language-processing, text-classification
Monkeylearn Ruby
Official Ruby client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Ruby apps.
Stars: ✭ 76 (-29.63%)
Mutual labels:  natural-language-processing, text-classification
Nlp Tutorial
A list of NLP(Natural Language Processing) tutorials
Stars: ✭ 1,188 (+1000%)
Mutual labels:  natural-language-processing, text-classification
Bible text gcn
Pytorch implementation of "Graph Convolutional Networks for Text Classification"
Stars: ✭ 90 (-16.67%)
Mutual labels:  natural-language-processing, text-classification
Py Nltools
A collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-57.41%)
Mutual labels:  tokenizer, natural-language-processing
Coursera Natural Language Processing Specialization
Programming assignments from all courses in the Coursera Natural Language Processing Specialization offered by deeplearning.ai.
Stars: ✭ 39 (-63.89%)
Mutual labels:  natural-language-processing, word-embeddings
Easy Bert
A Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
Stars: ✭ 106 (-1.85%)
Mutual labels:  natural-language-processing, word-embeddings
Ml Classify Text Js
Machine learning based text classification in JavaScript using n-grams and cosine similarity
Stars: ✭ 38 (-64.81%)
Mutual labels:  natural-language-processing, text-classification
Magnitude
A fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+1190.74%)
Mutual labels:  natural-language-processing, word-embeddings
Syntree2vec
An algorithm to augment syntactic hierarchy into word embeddings
Stars: ✭ 9 (-91.67%)
Mutual labels:  natural-language-processing, word-embeddings
Omnicat Bayes
Naive Bayes text classification implementation as an OmniCat classifier strategy. (#ruby #naivebayes)
Stars: ✭ 30 (-72.22%)
Mutual labels:  tokenizer, text-classification
Text Analytics With Python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Stars: ✭ 1,132 (+948.15%)
Mutual labels:  natural-language-processing, text-classification

Kadot

Unsupervised natural language processing library.

Kadot just lets you process a text easily.

>>> from kadot import Text
>>> hello_world = Text("Kadot just lets you process a text easily.")
>>> hello_world.ngrams(n=2)

[('Kadot', 'just'), ('just', 'lets'), ('lets', 'you'), ('you', 'process'), ('process', 'a'), ('a', 'text'), ('text', 'easily')]

🔋 What's included?

Kadot includes tokenizers, text generators, classifiers, word-level and document-level vectorizers, as well as a spell checker, a fuzzy string matching utility and a stopword detector.

The philosophy of Kadot is "never hardcode the language rules": use unsupervised solutions to support most languages. So it will never include Treebank-based algorithms (like a POS tagger); use TextBlob for that.
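
To give a feel for what "unsupervised" means here, below is a minimal sketch of a stopword detector that uses no word list at all: it flags words that show up in a large share of the sentences. This is not Kadot's implementation. The function name `guess_stopwords`, the `min_share` parameter and the whitespace tokenization are assumptions made for the illustration.

```python
from collections import Counter

def guess_stopwords(sentences, min_share=0.5):
    """Return the words that occur in at least min_share of the sentences."""
    doc_freq = Counter()
    for sentence in sentences:
        # Naive lower-cased whitespace split (an assumption of this sketch);
        # set() makes each word count once per sentence.
        doc_freq.update(set(sentence.lower().split()))
    threshold = min_share * len(sentences)
    return {word for word, count in doc_freq.items() if count >= threshold}

corpus = [
    "the cat sat on the mat",
    "the dog ate the bone",
    "a bird sang in the tree",
    "the king and the queen",
]
stopwords = guess_stopwords(corpus)
print(stopwords)
```

Because the rule only looks at frequencies, the same function can be pointed at a corpus in another language without any change, which is the kind of behaviour the philosophy above is after.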

🤔 How to use it?

You can play with the TextBlob-like syntax:

>>> from kadot import Text
>>> example_text = Text("This is a text sample !")
>>> example_text.words

['This', 'is', 'a', 'text', 'sample']

>>> example_text.ngrams(n=2)

[('This', 'is'), ('is', 'a'), ('a', 'text'), ('text', 'sample')]
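
If you are curious about what an n-gram extraction does under the hood, here is a small sketch in plain Python. It is not taken from Kadot's source: the standalone `ngrams` function below is a hypothetical helper that only mirrors the behaviour shown above.

```python
def ngrams(tokens, n=2):
    """Return every run of n consecutive tokens as a tuple."""
    # Zip n staggered views of the list: tokens[0:], tokens[1:], ...
    # zip stops at the shortest view, so no partial n-gram is produced.
    return list(zip(*(tokens[i:] for i in range(n))))

words = ['This', 'is', 'a', 'text', 'sample']
bigrams = ngrams(words, n=2)
trigrams = ngrams(words, n=3)
print(bigrams)
print(trigrams)
```

A list of five words gives four bigrams and three trigrams, since each n-gram needs `n` consecutive words.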

Or you can use the word vectorizer to find relations between words:

>>> history_book = text_from_file('history_book.txt')
>>> vectors = history_book.vectorize(window=20, reduce_rate=300)
>>> vectors.apply_translation(vectors['man'], vectors['woman'], vectors['king'], best=1)

# 'man' is to 'woman' what 'king' is to...
[('queen', 0.98872148869)]
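
The idea behind such a "translation" can be sketched with the usual vector-offset trick: add the difference `woman - man` to `king`, then rank the other words by cosine similarity to the result. The snippet below assumes that this is roughly what `apply_translation` does, which the text above does not spell out. The `analogy` and `cosine` helpers and the hand-made 2-D vectors are made up for the illustration and are not part of Kadot.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def analogy(vectors, a, b, c, best=1):
    """Rank words by similarity to vectors[c] + (vectors[b] - vectors[a])."""
    target = [vc + vb - va
              for va, vb, vc in zip(vectors[a], vectors[b], vectors[c])]
    # The three query words are left out of the candidates.
    candidates = [(word, cosine(vec, target))
                  for word, vec in vectors.items() if word not in (a, b, c)]
    return sorted(candidates, key=lambda pair: pair[1], reverse=True)[:best]

# Toy vectors, hand-made: axis 0 stands for "royalty", axis 1 for "gender".
toy_vectors = {
    'man':   [0.0, 1.0],
    'woman': [0.0, -1.0],
    'king':  [1.0, 1.0],
    'queen': [1.0, -1.0],
    'apple': [-1.0, 0.0],
}
result = analogy(toy_vectors, 'man', 'woman', 'king')
print(result)
```

With real vectors learned from a corpus the best match is rarely a perfect one, which is why the score in the example above is close to 1 rather than exactly 1.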

For more usage, check the examples. More detailed documentation is coming.

🔨 Installation

Use the pip command that refers to your Python 3.x interpreter. In my case:

$ pip3 install kadot

It currently requires the Python standard library, NumPy, SciPy and scikit-learn.

⚖️ License

Kadot is released under the MIT license.

🚀 Contribute

Issues and pull requests are gratefully welcomed. Come help us!