NLP-StuffPrograms with word vectors, RNN, NLP stuff, etc
Text-AnalysisExplaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.
SPINECode for SPINE - Sparse Interpretable Neural Embeddings. Jhamtani H.*, Pruthi D.*, Subramanian A.*, Berg-Kirkpatrick T., Hovy E. AAAI 2018
SWDMSIGIR 2017: Embedding-based query expansion for weighted sequential dependence retrieval model
textlyticsText processing library for sentiment analysis and related tasks
word embeddingSample code for training Word2Vec and FastText using wiki corpus and their pretrained word embedding..
conecContext Encoders (ConEc) as a simple but powerful extension of the word2vec model for learning word embeddings
Naive-Resume-MatchingText Similarity Applied to resume, to compare Resumes with Job Descriptions and create a score to rank them. Similar to an ATS.
Word-recognition-EmbedNet-CABCode implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"
NTUA-slp-nlp💻Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA
context2vecPyTorch implementation of context2vec from Melamud et al., CoNLL 2016
lda2vecMixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
QuestionClusteringClasificador de preguntas escrito en python 3 que fue implementado en el siguiente vídeo: https://youtu.be/qnlW1m6lPoY
word2vec-tsneGoogle News and Leo Tolstoy: Visualizing Word2Vec Word Embeddings using t-SNE.
SiameseCBOWImplementation of Siamese CBOW using keras whose backend is tensorflow.
SIFRankThe code of our paper "SIFRank: A New Baseline for Unsupervised Keyphrase Extraction Based on Pre-trained Language Model"
datastories-semeval2017-task6Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".
JoSH[KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding
Active-Explainable-ClassificationA set of tools for leveraging pre-trained embeddings, active learning and model explainability for effecient document classification
pair2vecpair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference
contextualLSTMContextual LSTM for NLP tasks like word prediction and word embedding creation for Deep Learning
dasemDanish Semantic analysis
S-WMDCode for Supervised Word Mover's Distance (SWMD)
PersianNERNamed-Entity Recognition in Persian Language
fuzzymaxCode for the paper: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR 2019.
wefeWEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings models. Please feel welcome to open an issue in case you have any questions or a pull request if you want to contribute to the project!
sisterSImple SenTence EmbeddeR
Word2VecfJavaWord2VecfJava: Java implementation of Dependency-Based Word Embeddings and extensions
two-stream-cnnA two-stream convolutional neural network for learning abitrary similarity functions over two sets of training data
HiCECode for ACL'19 "Few-Shot Representation Learning for Out-Of-Vocabulary Words"