Deep learning nlp: Keras, PyTorch, and NumPy implementations of deep learning architectures for NLP
Stars: ✭ 407 (+273.39%)
lda2vec: Mixing Dirichlet topic models and word embeddings to make lda2vec, from the paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (-75.23%)
codenames: Codenames AI using word vectors
Stars: ✭ 41 (-62.39%)
Debiaswe: Remove problematic gender bias from word embeddings.
Stars: ✭ 175 (+60.55%)
word2vec-on-wikipedia: A pipeline for training word embeddings with word2vec on a Wikipedia corpus.
Stars: ✭ 68 (-37.61%)
Glove As A Tensorflow Embedding Layer: Takes a pretrained GloVe model and uses it as a TensorFlow embedding weight layer inside the GPU, so only word indices need to cross the GPU data-transfer bus, reducing transfer overhead.
Stars: ✭ 85 (-22.02%)
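The trick this repo describes, loading pretrained weights once on-device and then looking rows up by integer index, can be sketched framework-agnostically in NumPy (the small random matrix below is a hypothetical stand-in for real GloVe weights):

```python
import numpy as np

# Hypothetical stand-in for a pretrained GloVe matrix: 5-word vocab, 4 dims.
rng = np.random.default_rng(0)
glove_weights = rng.normal(size=(5, 4))

def embed(indices, weights):
    # Row indexing is mathematically one_hot(indices) @ weights, but only
    # the integer indices have to cross the host-to-device transfer bus.
    return weights[indices]

indices = np.array([2, 0, 3])
looked_up = embed(indices, glove_weights)

# Equivalent but far more data-heavy one-hot formulation:
one_hot = np.eye(5)[indices]
assert np.allclose(looked_up, one_hot @ glove_weights)
```

In TensorFlow the same idea amounts to an embedding layer whose weights are initialized from the GloVe matrix and kept on the GPU.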
Gensim: Topic Modelling for Humans
Stars: ✭ 12,763 (+11609.17%)
Chameleon recsys: Source code of CHAMELEON, a deep learning meta-architecture for news recommender systems
Stars: ✭ 202 (+85.32%)
word-benchmarks: Benchmarks for intrinsic word embedding evaluation.
Stars: ✭ 45 (-58.72%)
Dna2vec: Consistent vector representations of variable-length k-mers
Stars: ✭ 117 (+7.34%)
Wego: Word embeddings (e.g. Word2Vec) in Go!
Stars: ✭ 336 (+208.26%)
Koan: A word2vec negative-sampling implementation with a correct CBOW update.
Stars: ✭ 232 (+112.84%)
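Koan's selling point, scaling the CBOW gradient to match the averaged context, can be illustrated with a minimal NumPy sketch of one negative-sampling step (toy random vectors, not Koan's actual code):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cbow_neg_sampling_step(W_in, W_out, context_ids, target_id, negative_ids, lr=0.05):
    """One CBOW training step with negative sampling (illustrative sketch).

    The hidden vector is the *average* of the context input vectors, so the
    gradient flowing back to each context vector must be scaled by
    1/len(context_ids) to match that averaging.
    """
    h = W_in[context_ids].mean(axis=0)           # averaged context representation
    grad_h = np.zeros_like(h)
    for wid, label in [(target_id, 1.0)] + [(n, 0.0) for n in negative_ids]:
        score = sigmoid(W_out[wid] @ h)
        g = score - label                        # gradient of the logistic loss
        grad_h += g * W_out[wid]
        W_out[wid] -= lr * g * h
    # The "correct CBOW update": divide by the number of context words.
    W_in[context_ids] -= lr * grad_h / len(context_ids)
    return W_in, W_out

# Toy usage: repeated steps should raise the target word's score.
rng = np.random.default_rng(0)
W_in = rng.normal(scale=0.1, size=(10, 8))
W_out = rng.normal(scale=0.1, size=(10, 8))
context, target, negatives = [1, 2, 3], 4, [5, 6]
score_before = sigmoid(W_out[target] @ W_in[context].mean(axis=0))
for _ in range(20):
    W_in, W_out = cbow_neg_sampling_step(W_in, W_out, context, target, negatives)
score_after = sigmoid(W_out[target] @ W_in[context].mean(axis=0))
```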
Postgres Word2vec: Utilities for using word embeddings such as word2vec vectors in a Postgres database
Stars: ✭ 96 (-11.93%)
word embedding: Sample code for training Word2Vec and FastText on a Wikipedia corpus, plus their pretrained word embeddings.
Stars: ✭ 21 (-80.73%)
word2vec-tsne: Google News and Leo Tolstoy: visualizing Word2Vec word embeddings using t-SNE.
Stars: ✭ 59 (-45.87%)
Text-Analysis: Explaining textual analysis tools in Python, including preprocessing, skip-gram (word2vec), and topic modelling.
Stars: ✭ 48 (-55.96%)
NTUA-slp-nlp: 💻 Speech and Natural Language Processing (SLP & NLP) lab assignments for ECE NTUA
Stars: ✭ 19 (-82.57%)
Text Summarizer: Python framework for extractive text summarization
Stars: ✭ 96 (-11.93%)
Magnitude: A fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+1178.9%)
Dict2vec: A framework to learn word embeddings using lexical dictionaries.
Stars: ✭ 91 (-16.51%)
Germanwordembeddings: Toolkit to obtain and preprocess German corpora, train models with word2vec (gensim), and evaluate them on generated test sets
Stars: ✭ 189 (+73.39%)
Shallowlearn: An experiment in re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText), with some additional exclusive features and a nice API. Written in Python and fully compatible with scikit-learn.
Stars: ✭ 196 (+79.82%)
Text2vec: Fast vectorization, topic modeling, distances, and GloVe word embeddings in R.
Stars: ✭ 715 (+555.96%)
SWDM: SIGIR 2017: embedding-based query expansion for a weighted sequential dependence retrieval model
Stars: ✭ 35 (-67.89%)
Scattertext: Beautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+1479.82%)
Word2vec: Training Chinese word vectors with Word2vec. Word2vec was created by a team of researchers led by Tomas Mikolov at Google.
Stars: ✭ 48 (-55.96%)
two-stream-cnn: A two-stream convolutional neural network for learning arbitrary similarity functions over two sets of training data
Stars: ✭ 24 (-77.98%)
word2vec: Use word2vec to improve search results
Stars: ✭ 63 (-42.2%)
DeepLearning-Lab: Code lab for deep learning, including RNNs, seq2seq, word2vec, cross-entropy, bidirectional RNNs, convolution and pooling operations, InceptionV3, and transfer learning.
Stars: ✭ 83 (-23.85%)
Word2VecJava: Word2Vec in Java (based on Google's 2013 open-source word2vec)
Stars: ✭ 13 (-88.07%)
entity-fishing: A machine learning tool for fishing entities
Stars: ✭ 176 (+61.47%)
SentimentAnalysis: (BOW, TF-IDF, Word2Vec, BERT) word embeddings + (SVM, Naive Bayes, Decision Tree, Random Forest) base classifiers + pretrained BERT from TensorFlow Hub + 1-D CNN and bidirectional LSTM on the IMDB movie review dataset
Stars: ✭ 40 (-63.3%)
SIFRank: Code for the paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model"
Stars: ✭ 96 (-11.93%)
NLP PEMDC: NLP Pretrained Embeddings, Models and Datasets Collection (NLP_PEMDC). The collection is continually updated.
Stars: ✭ 58 (-46.79%)
wmd4j: A Java library for calculating Word Mover's Distance (WMD)
Stars: ✭ 31 (-71.56%)
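As a rough illustration of what WMD measures (wmd4j itself solves the full optimal-transport problem), the commonly used relaxed lower bound can be written in a few lines of NumPy:

```python
import numpy as np

def relaxed_wmd(doc1_vecs, doc2_vecs):
    """Relaxed Word Mover's Distance lower bound (illustrative sketch).

    True WMD solves an optimal-transport problem over word vectors; the
    relaxed variant lets every word travel entirely to its nearest word in
    the other document, and takes the max of the two directed costs.
    """
    # Pairwise Euclidean distances between the two documents' word vectors.
    dists = np.linalg.norm(doc1_vecs[:, None, :] - doc2_vecs[None, :, :], axis=-1)
    d12 = dists.min(axis=1).mean()   # each word in doc1 -> nearest in doc2
    d21 = dists.min(axis=0).mean()   # each word in doc2 -> nearest in doc1
    return max(d12, d21)

# Toy word vectors: identical documents are at distance zero.
a = np.array([[0.0, 0.0], [1.0, 0.0]])
b = np.array([[3.0, 4.0]])
```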
doubanIMDb: IMDb + Rotten Tomatoes + Wikipedia on Douban Movie
Stars: ✭ 93 (-14.68%)
QuestionClustering: A question classifier written in Python 3, implemented in the following video: https://youtu.be/qnlW1m6lPoY
Stars: ✭ 15 (-86.24%)
RolX: An alternative implementation of Recursive Feature and Role Extraction (KDD11 & KDD12)
Stars: ✭ 52 (-52.29%)
dnn-lstm-word-segment: Chinese word segmentation based on deep learning and an LSTM neural network
Stars: ✭ 24 (-77.98%)
test word2vec uyghur: An example that tries out the word2vec algorithm from Python's gensim library on Uyghur script.
Stars: ✭ 15 (-86.24%)
word2vec: Rust interface to word2vec.
Stars: ✭ 22 (-79.82%)
wikiapi: JavaScript MediaWiki API for Node.js
Stars: ✭ 28 (-74.31%)
compress-fasttext: Tools for shrinking fastText models (in the gensim format)
Stars: ✭ 124 (+13.76%)
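compress-fasttext combines several tricks (quantization, vocabulary pruning, and so on); purely as an illustration of the general idea of shrinking an embedding matrix, here is a NumPy sketch using truncated SVD and a narrower dtype:

```python
import numpy as np

def shrink_embeddings(emb, out_dim):
    """Reduce an embedding matrix with truncated SVD (generic sketch, not
    compress-fasttext's actual method): keep the top singular directions,
    then store the result in float16 to halve the bytes per value."""
    u, s, _ = np.linalg.svd(emb, full_matrices=False)
    return (u[:, :out_dim] * s[:out_dim]).astype(np.float16)

rng = np.random.default_rng(1)
emb = rng.normal(size=(1000, 100)).astype(np.float32)
small = shrink_embeddings(emb, 25)
# 1000x100 float32 -> 1000x25 float16: an 8x reduction in storage.
```

Note that SVD only approximately preserves pairwise similarities; real compression pipelines evaluate the shrunken vectors on downstream tasks before committing to a target dimension.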
walklets: A lightweight implementation of Walklets from "Don't Walk, Skip! Online Learning of Multi-scale Network Embeddings" (ASONAM 2017).
Stars: ✭ 94 (-13.76%)
transparencia-dados-abertos-brasil: A survey of Brazilian states' and municipalities' transparency and open data portals, as well as institutional websites, obtained from several public data sources.
Stars: ✭ 46 (-57.8%)
ordia: Wikidata lexeme presentations
Stars: ✭ 21 (-80.73%)
wdumper: Tool for generating filtered Wikidata RDF exports
Stars: ✭ 25 (-77.06%)
revery: A personal semantic search engine capable of surfacing relevant bookmarks, journal entries, notes, blogs, contacts, and more, built on an efficient document-embedding algorithm and Monocle's personal search index.
Stars: ✭ 200 (+83.49%)
word2vec pipeline: NLP pipeline using word2vec (preprocessing/embedding/prediction/clustering)
Stars: ✭ 108 (-0.92%)
biovec: ProtVec can be used for protein interaction prediction, structure prediction, and protein data visualization.
Stars: ✭ 23 (-78.9%)