MagnitudeA fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (-89.08%)
TadwAn implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (-99.66%)
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (-98.46%)
ScattertextBeautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (-86.51%)
Text2vecFast vectorization, topic modeling, distances and GloVe word embeddings in R.
Stars: ✭ 715 (-94.4%)
Vec4irWord Embeddings for Information Retrieval
Stars: ✭ 188 (-98.53%)
GermanwordembeddingsToolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
Stars: ✭ 189 (-98.52%)
Lda Topic ModelingA PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (-99.29%)
Nlp JourneyDocuments, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classificatin, Text Generation, Text Similarity, Machine Translation),etc. All codes are implemented intensorflow 2.0.
Stars: ✭ 1,290 (-89.89%)
word embeddingSample code for training Word2Vec and FastText using wiki corpus and their pretrained word embedding..
Stars: ✭ 21 (-99.84%)
BiolitmapCode for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Stars: ✭ 18 (-99.86%)
Lmdb EmbeddingsFast word vectors with little memory usage in Python
Stars: ✭ 404 (-96.83%)
Data Science ToolkitCollection of stats, modeling, and data science tools in Python and R.
Stars: ✭ 169 (-98.68%)
Product-Categorization-NLPMulti-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Stars: ✭ 30 (-99.76%)
SWDMSIGIR 2017: Embedding-based query expansion for weighted sequential dependence retrieval model
Stars: ✭ 35 (-99.73%)
Nlp In PracticeStarter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (-93.81%)
Pytorch Sentiment AnalysisTutorials on getting started with PyTorch and TorchText for sentiment analysis.
Stars: ✭ 3,209 (-74.86%)
Text mining resourcesResources for learning about Text Mining and Natural Language Processing
Stars: ✭ 358 (-97.2%)
lda2vecMixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (-99.79%)
BiosentvecBioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
Stars: ✭ 308 (-97.59%)
Sense2vec🦆 Contextually-keyed word vectors
Stars: ✭ 1,184 (-90.72%)
DexDex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (-90.3%)
Ja.text8Japanese text8 corpus for word embedding.
Stars: ✭ 79 (-99.38%)
Glove As A Tensorflow Embedding LayerTaking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.
Stars: ✭ 85 (-99.33%)
MlA high-level machine learning and deep learning library for the PHP language.
Stars: ✭ 1,270 (-90.05%)
Tsv UtilseBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
Stars: ✭ 1,215 (-90.48%)
Topic Modeling ToolA point-and-click tool for creating and analyzing topic models produced by MALLET.
Stars: ✭ 85 (-99.33%)
ForteForte is a flexible and powerful NLP builder FOR TExt. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 89 (-99.3%)
Applied Ml📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Stars: ✭ 17,824 (+39.65%)
Tsrepr TSrepr: R package for time series representations
Stars: ✭ 75 (-99.41%)
Tageditor🏖TagEditor - Annotation tool for spaCy
Stars: ✭ 92 (-99.28%)
Dict2vecDict2vec is a framework to learn word embeddings using lexical dictionaries.
Stars: ✭ 91 (-99.29%)
Postgres Word2vecutils to use word embedding like word2vec vectors in a postgres database
Stars: ✭ 96 (-99.25%)
D2l EnInteractive deep learning book with multi-framework code, math, and discussions. Adopted at 300 universities from 55 countries including Stanford, MIT, Harvard, and Cambridge.
Stars: ✭ 11,837 (-7.26%)
Papers Literature Ml Dl Rl AiHighly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
Stars: ✭ 1,341 (-89.49%)
VizukaExplore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (-99.22%)
FastrtextR wrapper for fastText
Stars: ✭ 103 (-99.19%)
Jupyterlab Prodigy🧬 A JupyterLab extension for annotating data with Prodigy
Stars: ✭ 97 (-99.24%)
CodesearchnetDatasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (-89.2%)
Repo 2016R, Python and Mathematica Codes in Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation
Stars: ✭ 103 (-99.19%)
Numpy MlMachine learning, in numpy
Stars: ✭ 11,100 (-13.03%)
Easy BertA Dead Simple BERT API for Python and Java (https://github.com/google-research/bert)
Stars: ✭ 106 (-99.17%)
WebvectorsWeb-ify your word2vec: framework to serve distributional semantic models online
Stars: ✭ 154 (-98.79%)
MusaeThe reference implementation of "Multi-scale Attributed Node Embedding".
Stars: ✭ 75 (-99.41%)
Text SummarizerPython Framework for Extractive Text Summarization
Stars: ✭ 96 (-99.25%)
KadotKadot, the unsupervised natural language processing library.
Stars: ✭ 108 (-99.15%)
AllennlpAn open-source NLP research library, built on PyTorch.
Stars: ✭ 10,699 (-16.17%)
Awesome Embedding ModelsA curated list of awesome embedding models tutorials, projects and communities.
Stars: ✭ 1,486 (-88.36%)
FlairA very simple framework for state-of-the-art Natural Language Processing (NLP)
Stars: ✭ 11,065 (-13.3%)
Dat8General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (-88.12%)
Dna2vecdna2vec: Consistent vector representations of variable-length k-mers
Stars: ✭ 117 (-99.08%)
Cogcomp NlpyCogComp's light-weight Python NLP annotators
Stars: ✭ 115 (-99.1%)
Embedding As ServiceOne-Stop Solution to encode sentence to fixed length vectors from various embedding techniques
Stars: ✭ 151 (-98.82%)