cadmiumcr / Cadmium

Licence: MIT
Natural Language Processing (NLP) library for Crystal

Projects that are alternatives to, or similar to, Cadmium

lorca
Natural Language Processing for Spanish in Node.js. Stemmer, sentiment analysis, readability, tf-idf with batteries, concordance and more!
Stars: ✭ 95 (-44.77%)
Mutual labels:  sentiment-analysis, readability, tf-idf, stemmer
soan
Social Analysis based on Whatsapp data
Stars: ✭ 106 (-38.37%)
Mutual labels:  sentiment-analysis, tf-idf
Py Readability Metrics
📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more
Stars: ✭ 132 (-23.26%)
Mutual labels:  readability
Textvec
Text vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (-2.91%)
Mutual labels:  tf-idf
Amazon Product Recommender System
Sentiment analysis on Amazon Review Dataset available at http://snap.stanford.edu/data/web-Amazon.html
Stars: ✭ 158 (-8.14%)
Mutual labels:  sentiment-analysis
Stemmer
An English (Porter2) stemming implementation in Elixir.
Stars: ✭ 134 (-22.09%)
Mutual labels:  stemmer
Readability
visualise readability
Stars: ✭ 160 (-6.98%)
Mutual labels:  readability
Stock market sentiment analysis
Stock market sentiment analysis
Stars: ✭ 130 (-24.42%)
Mutual labels:  sentiment-analysis
Vntk
Vietnamese NLP Toolkit for Node
Stars: ✭ 170 (-1.16%)
Mutual labels:  tf-idf
Char Cnn Text Classification Pytorch
Character-level Convolutional Neural Networks for text classification in PyTorch
Stars: ✭ 147 (-14.53%)
Mutual labels:  sentiment-analysis
Reading List Mover
A Python utility for moving bookmarks/reading lists between services
Stars: ✭ 166 (-3.49%)
Mutual labels:  readability
Indonesian Nlp Resources
Data resources for Indonesian NLP
Stars: ✭ 143 (-16.86%)
Mutual labels:  sentiment-analysis
Onnxt5
Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.
Stars: ✭ 143 (-16.86%)
Mutual labels:  sentiment-analysis
Hey Jetson
Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Stars: ✭ 161 (-6.4%)
Mutual labels:  sentiment-analysis
Awesome Ai Services
An overview of the AI-as-a-service landscape
Stars: ✭ 133 (-22.67%)
Mutual labels:  sentiment-analysis
Newspaper
Read webpages in readability mode, inside your terminal.
Stars: ✭ 168 (-2.33%)
Mutual labels:  readability
Twitter Sentiment Visualisation
🌍 The R&D of a sentiment analysis module, and the implementation of it on real-time social media data, to generate a series of live visual representations of sentiment towards a specific topic or by location in order to find trends.
Stars: ✭ 132 (-23.26%)
Mutual labels:  sentiment-analysis
Googlelanguager
R client for the Google Translation API, Google Cloud Natural Language API and Google Cloud Speech API
Stars: ✭ 145 (-15.7%)
Mutual labels:  sentiment-analysis
Nlp bahasa resources
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (-8.14%)
Mutual labels:  sentiment-analysis
Nlp Tensorflow
NLP Tensorflow Tutorials
Stars: ✭ 171 (-0.58%)
Mutual labels:  sentiment-analysis

Cadmium is a Natural Language Processing (NLP) library for Crystal.

For full API documentation, check out the docs.

For more complete and up-to-date information about a specific part of Cadmium, check out the relevant shard's repository.

| Shard name | Description |
| --- | --- |
| cadmium_tokenizer | Contains several types of string tokenizers |
| cadmium_stemmer | Contains a Porter stemmer, useful for getting the stems of English words |
| cadmium_ngrams | Contains methods to obtain unigrams, bigrams, trigrams or n-grams from strings |
| cadmium_classifier | Contains two probabilistic classifiers used in NLP operations such as language detection or POS tagging |
| cadmium_readability | Analyzes blocks of text and determines their readability using various algorithms |
| cadmium_tfidf | Calculates the Term Frequency–Inverse Document Frequency of a corpus |
| cadmium_glove | Pure Crystal implementation of Global Vectors for Word Representation (GloVe) |
| cadmium_pos_tagger | Tags each token of a text with its part-of-speech category |
| cadmium_lemmatizer | Returns the lemma of each given string token |
| cadmium_summarizer | Extracts the most meaningful sentences of a text to create a summary |
| cadmium_sentiment | Evaluates the sentiment of a text |
| cadmium_distance | Provides two string distance algorithms |
| cadmium_transliterator | Transliterates UTF-8 strings into pure ASCII so that they can be safely used in URL slugs or file names |
| cadmium_phonetics | Matches a string with its sound representation |
| cadmium_inflector | Inflects English words (nouns, verbs and numbers) |
| cadmium_graph | Provides EdgeWeightedDigraph, which represents a digraph: you can add an edge, get the number of vertices and edges, get all edges, and use toString to print the digraph |
| cadmium_trie | A trie is a data structure for efficiently storing and retrieving strings with identical prefixes, like "meet" and "meek" |
| cadmium_wordnet | Pure Crystal implementation of Princeton's WordNet |
| cadmium_util | A collection of useful utilities used internally in Cadmium |
| cadmium_language_detector | Returns the most probable language code of the analysed text |
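
To give a feel for what one of these shards computes, here is a minimal sketch of n-gram extraction written against the Crystal standard library only. It does not use the cadmium_ngrams API, and the helper name `ngrams` is hypothetical, chosen purely for illustration. Refer to the shard's own repository for the real interface.

```crystal
# Illustrative only: plain Crystal standard library, not the cadmium_ngrams API.
# `ngrams` is a hypothetical helper name chosen for this sketch.
def ngrams(tokens : Array(String), n : Int32) : Array(Array(String))
  # No n-grams exist for a non-positive n or for inputs shorter than n.
  return [] of Array(String) if n <= 0 || tokens.size < n

  # Slide a window of width n across the token list.
  (0..tokens.size - n).map { |i| tokens[i, n] }
end

# Whitespace tokenization, for brevity; a real tokenizer handles punctuation too.
tokens = "the quick brown fox".split

p ngrams(tokens, 2) # => [["the", "quick"], ["quick", "brown"], ["brown", "fox"]]
p ngrams(tokens, 3) # => [["the", "quick", "brown"], ["quick", "brown", "fox"]]
```

Bigrams and trigrams are simply the `n = 2` and `n = 3` cases of the same sliding window, which is why a single method covers all of them.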

Installation

Your project should only include the Cadmium shard(s) you need.

If you want to try out all of Cadmium at once, however, you can install every module of the project in a few lines.

Add this to your application's shard.yml:

dependencies:
  cadmium:
    github: cadmiumcr/cadmium
    branch: master
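
To follow the earlier advice and depend on a single shard rather than the whole collection, the entry would look roughly like the sketch below. The shard name `cadmium_tokenizer` comes from the shard table, but the repository path `cadmiumcr/tokenizer` is an assumption, so confirm it in that shard's own repository before relying on it.

```yaml
dependencies:
  # Assumed repository path; check the shard's README for the real one.
  cadmium_tokenizer:
    github: cadmiumcr/tokenizer
```

After editing shard.yml, run `shards install` to fetch the dependency.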

Contributing

  1. Fork it ( https://github.com/cadmiumcr/cadmium/fork )
  2. Create your feature branch (git checkout -b my-new-feature)
  3. Commit your changes (git commit -am 'Add some feature')
  4. Push to the branch (git push origin my-new-feature)
  5. Create a new Pull Request

Contributors

This project exists thanks to all the people who contribute.
