cadmiumcr / Cadmium
Licence: mit
Natural Language Processing (NLP) library for Crystal
Stars: ✭ 172
Programming Languages
crystal
512 projects
Projects that are alternatives of or similar to Cadmium
lorca
Natural Language Processing for Spanish in Node.js. Stemmer, sentiment analysis, readability, tf-idf with batteries, concordance and more!
Stars: ✭ 95 (-44.77%)
Mutual labels: sentiment-analysis, readability, tf-idf, stemmer
soan
Social Analysis based on Whatsapp data
Stars: ✭ 106 (-38.37%)
Mutual labels: sentiment-analysis, tf-idf
Py Readability Metrics
📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more
Stars: ✭ 132 (-23.26%)
Mutual labels: readability
Textvec
Text vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (-2.91%)
Mutual labels: tf-idf
Amazon Product Recommender System
Sentiment analysis on Amazon Review Dataset available at http://snap.stanford.edu/data/web-Amazon.html
Stars: ✭ 158 (-8.14%)
Mutual labels: sentiment-analysis
Stemmer
An English (Porter2) stemming implementation in Elixir.
Stars: ✭ 134 (-22.09%)
Mutual labels: stemmer
Char Cnn Text Classification Pytorch
Character-level Convolutional Neural Networks for text classification in PyTorch
Stars: ✭ 147 (-14.53%)
Mutual labels: sentiment-analysis
Reading List Mover
A Python utility for moving bookmarks/reading lists between services
Stars: ✭ 166 (-3.49%)
Mutual labels: readability
Indonesian Nlp Resources
data resource untuk NLP bahasa indonesia
Stars: ✭ 143 (-16.86%)
Mutual labels: sentiment-analysis
Onnxt5
Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.
Stars: ✭ 143 (-16.86%)
Mutual labels: sentiment-analysis
Hey Jetson
Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Stars: ✭ 161 (-6.4%)
Mutual labels: sentiment-analysis
Awesome Ai Services
An overview of the AI-as-a-service landscape
Stars: ✭ 133 (-22.67%)
Mutual labels: sentiment-analysis
Newspaper
Read webpages in readability mode, inside your terminal.
Stars: ✭ 168 (-2.33%)
Mutual labels: readability
Twitter Sentiment Visualisation
🌍 The R&D of a sentiment analysis module, and the implementation of it on real-time social media data, to generate a series of live visual representations of sentiment towards a specific topic or by location in order to find trends.
Stars: ✭ 132 (-23.26%)
Mutual labels: sentiment-analysis
Googlelanguager
R client for the Google Translation API, Google Cloud Natural Language API and Google Cloud Speech API
Stars: ✭ 145 (-15.7%)
Mutual labels: sentiment-analysis
Nlp bahasa resources
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
Stars: ✭ 158 (-8.14%)
Mutual labels: sentiment-analysis
Cadmium is a Natural Language Processing (NLP) library for Crystal.
For full API documentation check out the docs.
For more complete and up to date information about specific parts of Cadmium, check out each relevant shard repository.
Shard name | Description |
---|---|
cadmium_tokenizer | Contains several types of string tokenizers |
cadmium_stemmer | Contains a Porter stemmer, useful to get the stems of english words |
cadmium_ngrams | Contains methods to obtain unigram, bigrams, trigrams or ngrams from strings |
cadmium_classifier | Contains two probabilistic classifiers used in NLP operations like language detection or POS tagging for example |
cadmium_readability | Analyzes blocks of text and determine, using various algorithms, the readability of the text. |
cadmium_tfidf | Calculates the Term Frequency–Inverse Document Frequency of a corpus |
cadmium_glove | Pure Crystal implementation of Global Vectors for Word Representations |
cadmium_pos_tagger | Tags each token of a text with its Part Of Speech category |
cadmium_lemmatizer | Returns the lemma of each given string token |
cadmium_summarizer | Extracts the most meaningful sentences of a text to create a summary |
cadmium_sentiment | Evaluates the sentiment of a text |
cadmium_distance | Provides two string distance algorithms |
cadmium_transliterator | Provides the ability to transliterate UTF-8 strings into pure ASCII so that they can be safely displayed in URL slugs or file names. |
cadmium_phonetics | Allows to match a string with its sound representation |
cadmium_inflector | Allows to inflect english words (nouns, verbs and numbers) |
cadmium_graph | EdgeWeightedDigraph represents a digraph, you can add an edge, get the number vertexes, edges, get all edges and use toString to print the Digraph. |
cadmium_trie | A trie is a data structure for efficiently storing and retrieving strings with identical prefixes, like "meet" and "meek". |
cadmium_wordnet | Pure crystal implementation of Stanford NLPs WordNet |
cadmium_util | A collection of useful utilities used internally in Cadmium. |
cadmium_language_detector | Returns the most probable language code of the analysed text. |
Installation
Your project should only include the Cadmium shard(s) you need.
However, in case you want to test out all of Cadmium in a simple way, you can install all modules of the project in a few lines.
Add this to your application's shard.yml
:
dependencies:
cadmium:
github: cadmiumcr/cadmium
branch: master
Contributing
- Fork it ( https://github.com/cadmiumcr/cadmium/fork )
- Create your feature branch (git checkout -b my-new-feature)
- Commit your changes (git commit -am 'Add some feature')
- Push to the branch (git push origin my-new-feature)
- Create a new Pull Request
Contributors
This project exists thanks to all the people who contribute.
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].