All Projects → james-bowman → Nlp

james-bowman / Nlp

Licence: mit
Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang

Programming Languages

go
31211 projects - #10 most used programming language
golang
3204 projects

Projects that are alternatives of or similar to Nlp

nlp-lt
Natural Language Processing for Lithuanian language
Stars: ✭ 17 (-94.41%)
Mutual labels:  lda, svd
Greynir
The greynir.is natural language processing website for Icelandic
Stars: ✭ 47 (-84.54%)
Mutual labels:  natural-language-processing, tf-idf
Nlp In Practice
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (+159.87%)
Mutual labels:  natural-language-processing, tf-idf
Multilingual Latent Dirichlet Allocation Lda
A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.
Stars: ✭ 64 (-78.95%)
Mutual labels:  natural-language-processing, lda
Textvec
Text vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (-45.07%)
Mutual labels:  natural-language-processing, tf-idf
Lda Topic Modeling
A PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (-70.07%)
Mutual labels:  natural-language-processing, lda
How To Mine Newsfeed Data And Extract Interactive Insights In Python
A practical guide to topic mining and interactive visualizations
Stars: ✭ 61 (-79.93%)
Mutual labels:  natural-language-processing, tf-idf
Vntk
Vietnamese NLP Toolkit for Node
Stars: ✭ 170 (-44.08%)
Mutual labels:  natural-language-processing, tf-idf
Lda
LDA topic modeling for node.js
Stars: ✭ 262 (-13.82%)
Mutual labels:  natural-language-processing, lda
Languagecrunch
LanguageCrunch NLP server docker image
Stars: ✭ 281 (-7.57%)
Mutual labels:  natural-language-processing
Ner
Named Entity Recognition
Stars: ✭ 288 (-5.26%)
Mutual labels:  natural-language-processing
Machine Learning With Python
Python code for common Machine Learning Algorithms
Stars: ✭ 3,334 (+996.71%)
Mutual labels:  lda
Oie Resources
A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (-6.91%)
Mutual labels:  natural-language-processing
Deep Learning Nlp Rl Papers
Recent Deep Learning papers in NLU and RL
Stars: ✭ 288 (-5.26%)
Mutual labels:  natural-language-processing
2018 Machinelearning Lectures Esa
Machine Learning Lectures at the European Space Agency (ESA) in 2018
Stars: ✭ 280 (-7.89%)
Mutual labels:  tf-idf
Libpostal
A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
Stars: ✭ 3,312 (+989.47%)
Mutual labels:  natural-language-processing
Swem
The Tensorflow code for this ACL 2018 paper: "Baseline Needs More Love: On Simple Word-Embedding-Based Models and Associated Pooling Mechanisms"
Stars: ✭ 279 (-8.22%)
Mutual labels:  natural-language-processing
Awesome Distributed Deep Learning
A curated list of awesome Distributed Deep Learning resources.
Stars: ✭ 277 (-8.88%)
Mutual labels:  natural-language-processing
Pyresparser
A simple resume parser used for extracting information from resumes
Stars: ✭ 297 (-2.3%)
Mutual labels:  natural-language-processing
Autogluon
AutoGluon: AutoML for Text, Image, and Tabular Data
Stars: ✭ 3,920 (+1189.47%)
Mutual labels:  natural-language-processing

Natural Language Processing

License: MIT GoDoc Build Status Go Report Card codecov Mentioned in Awesome Go Sourcegraph

nlp

Implementations of selected machine learning algorithms for natural language processing in golang. The primary focus for the package is the statistical semantics of plain-text documents supporting semantic analysis and retrieval of semantically similar documents.

Built upon the Gonum package for linear algebra and scientific computing with some inspiration taken from Python's scikit-learn and Gensim.

Check out the companion blog post or the Go documentation page for full usage and examples.


Features

Planned

  • Expanded persistence support
  • Stemming to treat words with common root as the same e.g. "go" and "going"
  • Clustering algorithms e.g. Heirachical, K-means, etc.
  • Classification algorithms e.g. SVM, KNN, random forest, etc.

References

  1. Rosario, Barbara. Latent Semantic Indexing: An overview. INFOSYS 240 Spring 2000
  2. Latent Semantic Analysis, a scholarpedia article on LSA written by Tom Landauer, one of the creators of LSA.
  3. Thomo, Alex. Latent Semantic Analysis (Tutorial).
  4. Latent Semantic Indexing. Standford NLP Course
  5. Charikar, Moses S. "Similarity Estimation Techniques from Rounding Algorithms" in Proceedings of the thiry-fourth annual ACM symposium on Theory of computing - STOC ’02, 2002, p. 380.
  6. M. Bawa, T. Condie, and P. Ganesan, “LSH forest: self-tuning indexes for similarity search,” Proc. 14th Int. Conf. World Wide Web - WWW ’05, p. 651, 2005.
  7. A. Gionis, P. Indyk, and R. Motwani, “Similarity Search in High Dimensions via Hashing,” VLDB ’99 Proc. 25th Int. Conf. Very Large Data Bases, vol. 99, no. 1, pp. 518–529, 1999.
  8. Kanerva, Pentti, Kristoferson, Jan and Holst, Anders (2000). Random Indexing of Text Samples for Latent Semantic Analysis
  9. Rangan, Venkat. Discovery of Related Terms in a corpus using Reflective Random Indexing
  10. Vasuki, Vidya and Cohen, Trevor. Reflective random indexing for semi-automatic indexing of the biomedical literature
  11. QasemiZadeh, Behrang and Handschuh, Siegfried. Random Indexing Explained with High Probability
  12. Foulds, James; Boyles, Levi; Dubois, Christopher; Smyth, Padhraic; Welling, Max (2013). Stochastic Collapsed Variational Bayesian Inference for Latent Dirichlet Allocation
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].