MagnitudeA fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+263.02%)
Lmdb EmbeddingsFast word vectors with little memory usage in Python
Stars: ✭ 404 (+5.21%)
EmbeddingsvizVisualize word embeddings of a vocabulary in TensorBoard, including the neighbors
Stars: ✭ 40 (-89.58%)
Nlp JourneyDocuments, papers and codes related to Natural Language Processing, including Topic Model, Word Embedding, Named Entity Recognition, Text Classificatin, Text Generation, Text Similarity, Machine Translation),etc. All codes are implemented intensorflow 2.0.
Stars: ✭ 1,290 (+235.94%)
Keras Textclassification中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastText,TextCNN,CharCNN,TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN
Stars: ✭ 914 (+138.02%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+3223.7%)
Diff2vecReference implementation of Diffusion2Vec (Complenet 2018) built on Gensim and NetworkX.
Stars: ✭ 108 (-71.87%)
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (-48.96%)
FastrtextR wrapper for fastText
Stars: ✭ 103 (-73.18%)
Embedding As ServiceOne-Stop Solution to encode sentence to fixed length vectors from various embedding techniques
Stars: ✭ 151 (-60.68%)
word-embeddings-from-scratchCreating word embeddings from scratch and visualize them on TensorBoard. Using trained embeddings in Keras.
Stars: ✭ 22 (-94.27%)
Cw2veccw2vec: Learning Chinese Word Embeddings with Stroke n-gram Information
Stars: ✭ 224 (-41.67%)
nlpbuddyA text analysis application for performing common NLP tasks through a web dashboard interface and an API
Stars: ✭ 115 (-70.05%)
cadeCompass-aligned Distributional Embeddings. Align embeddings from different corpora
Stars: ✭ 29 (-92.45%)
relation-networkTensorflow Implementation of Relation Networks for the bAbI QA Task, detailed in "A Simple Neural Network Module for Relational Reasoning," [https://arxiv.org/abs/1706.01427] by Santoro et. al.
Stars: ✭ 45 (-88.28%)
word embeddingSample code for training Word2Vec and FastText using wiki corpus and their pretrained word embedding..
Stars: ✭ 21 (-94.53%)
PolyfuzzFuzzy string matching, grouping, and evaluation.
Stars: ✭ 292 (-23.96%)
node-fasttextNodejs binding for fasttext representation and classification.
Stars: ✭ 39 (-89.84%)
Product-Categorization-NLPMulti-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Stars: ✭ 30 (-92.19%)
watset-javaAn implementation of the Watset clustering algorithm in Java.
Stars: ✭ 24 (-93.75%)
SentimentAnalysisSentiment Analysis: Deep Bi-LSTM+attention model
Stars: ✭ 32 (-91.67%)
go2vecRead and use word2vec vectors in Go
Stars: ✭ 44 (-88.54%)
CODERCODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]
Stars: ✭ 24 (-93.75%)
IR2VecImplementation of IR2Vec, published in ACM TACO
Stars: ✭ 28 (-92.71%)
game2vecTensorFlow implementation of word2vec applied on https://www.kaggle.com/tamber/steam-video-games dataset, using both CBOW and Skip-gram.
Stars: ✭ 62 (-83.85%)
extremeTextLibrary for fast text representation and extreme classification.
Stars: ✭ 141 (-63.28%)
Contextualized Topic ModelsA python package to run contextualized topic modeling. CTMs combine BERT with topic models to get coherent topics. Also supports multilingual tasks. Cross-lingual Zero-shot model published at EACL 2021.
Stars: ✭ 318 (-17.19%)
EmbeddingEmbedding模型代码和学习笔记总结
Stars: ✭ 25 (-93.49%)
wordfish-pythonextract relationships from standardized terms from corpus of interest with deep learning 🐟
Stars: ✭ 19 (-95.05%)
Tensorflow fasttextSimple embedding based text classifier inspired by fastText, implemented in tensorflow
Stars: ✭ 290 (-24.48%)
hcnHybrid Code Networks https://arxiv.org/abs/1702.03274
Stars: ✭ 81 (-78.91%)
word2vec-pt-brImplementação e modelo gerado com o treinamento (trigram) da wikipedia em pt-br
Stars: ✭ 34 (-91.15%)
whatliesToolkit to help understand "what lies" in word embeddings. Also benchmarking!
Stars: ✭ 351 (-8.59%)
Nlp CubeNatural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing
Stars: ✭ 353 (-8.07%)
VectorhubVector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)
Stars: ✭ 317 (-17.45%)
DecagonGraph convolutional neural network for multirelational link prediction
Stars: ✭ 268 (-30.21%)
pydataberlin-2017Repo for my talk at the PyData Berlin 2017 conference
Stars: ✭ 63 (-83.59%)
code-compassa contextual search engine for software packages built on import2vec embeddings (https://www.code-compass.com)
Stars: ✭ 33 (-91.41%)
Keras-Application-ZooReference implementations of popular DL models missing from keras-applications & keras-contrib
Stars: ✭ 31 (-91.93%)
fastText1607Unofficial Implementation of "Bag of Tricks for Efficient Text Classification", 2016, Armand Joulin et al. (https://arxiv.org/pdf/1607.01759.pdf)
Stars: ✭ 20 (-94.79%)
KGE-LDAKnowledge Graph Embedding LDA. AAAI 2017
Stars: ✭ 35 (-90.89%)
HubA library for transfer learning by reusing parts of TensorFlow models.
Stars: ✭ 3,007 (+683.07%)
lda2vecMixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (-92.97%)
nccNeural Code Comprehension: A Learnable Representation of Code Semantics
Stars: ✭ 162 (-57.81%)
BiosentvecBioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
Stars: ✭ 308 (-19.79%)
resume tailorAn unsupervised analysis combining topic modeling and clustering to preserve an individuals work history and credentials while tailoring their resume towards a new career field
Stars: ✭ 15 (-96.09%)
entity-embedPyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
Stars: ✭ 96 (-75%)