GermanwordembeddingsToolkit to obtain and preprocess german corpora, train models using word2vec (gensim) and evaluate them with generated testsets
Stars: ✭ 189 (+320%)
SWDMSIGIR 2017: Embedding-based query expansion for weighted sequential dependence retrieval model
Stars: ✭ 35 (-22.22%)
Superpixel BenchmarkAn extensive evaluation and comparison of 28 state-of-the-art superpixel algorithms on 5 datasets.
Stars: ✭ 275 (+511.11%)
WegoWord Embeddings (e.g. Word2Vec) in Go!
Stars: ✭ 336 (+646.67%)
MagnitudeA fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+2997.78%)
wikidata-corpusTrain Wikidata with word2vec for word embedding tasks
Stars: ✭ 109 (+142.22%)
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+335.56%)
Dna2vecdna2vec: Consistent vector representations of variable-length k-mers
Stars: ✭ 117 (+160%)
Chameleon recsysSource code of CHAMELEON - A Deep Learning Meta-Architecture for News Recommender Systems
Stars: ✭ 202 (+348.89%)
Nas Benchmark"NAS evaluation is frustratingly hard", ICLR2020
Stars: ✭ 126 (+180%)
Dict2vecDict2vec is a framework to learn word embeddings using lexical dictionaries.
Stars: ✭ 91 (+102.22%)
Text2vecFast vectorization, topic modeling, distances and GloVe word embeddings in R.
Stars: ✭ 715 (+1488.89%)
DiscEvalDiscourse Based Evaluation of Language Understanding
Stars: ✭ 18 (-60%)
word2vec-tsneGoogle News and Leo Tolstoy: Visualizing Word2Vec Word Embeddings using t-SNE.
Stars: ✭ 59 (+31.11%)
word embeddingSample code for training Word2Vec and FastText using wiki corpus and their pretrained word embedding..
Stars: ✭ 21 (-53.33%)
lda2vecMixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (-40%)
Postgres Word2vecutils to use word embedding like word2vec vectors in a postgres database
Stars: ✭ 96 (+113.33%)
Text SummarizerPython Framework for Extractive Text Summarization
Stars: ✭ 96 (+113.33%)
DebiasweRemove problematic gender bias from word embeddings.
Stars: ✭ 175 (+288.89%)
ScattertextBeautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+3726.67%)
EvoPython package for the evaluation of odometry and SLAM
Stars: ✭ 1,373 (+2951.11%)
EvalneSource code for EvalNE, a Python library for evaluating Network Embedding methods.
Stars: ✭ 67 (+48.89%)
word2vec-on-wikipediaA pipeline for training word embeddings using word2vec on wikipedia corpus.
Stars: ✭ 68 (+51.11%)
Text-AnalysisExplaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.
Stars: ✭ 48 (+6.67%)
NTUA-slp-nlp💻Speech and Natural Language Processing (SLP & NLP) Lab Assignments for ECE NTUA
Stars: ✭ 19 (-57.78%)
Glove As A Tensorflow Embedding LayerTaking a pretrained GloVe model, and using it as a TensorFlow embedding weight layer **inside the GPU**. Therefore, you only need to send the index of the words through the GPU data transfer bus, reducing data transfer overhead.
Stars: ✭ 85 (+88.89%)
codenamesCodenames AI using Word Vectors
Stars: ✭ 41 (-8.89%)
Fasttext.jsFastText for Node.js
Stars: ✭ 127 (+182.22%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+28262.22%)
CBLUE中文医疗信息处理基准CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark
Stars: ✭ 379 (+742.22%)
Vec4irWord Embeddings for Information Retrieval
Stars: ✭ 188 (+317.78%)
Deep learning nlpKeras, PyTorch, and NumPy Implementations of Deep Learning Architectures for NLP
Stars: ✭ 407 (+804.44%)
KoanA word2vec negative sampling implementation with correct CBOW update.
Stars: ✭ 232 (+415.56%)
Hpatches BenchmarkPython & Matlab code for local feature descriptor evaluation with the HPatches dataset.
Stars: ✭ 129 (+186.67%)
two-stream-cnnA two-stream convolutional neural network for learning abitrary similarity functions over two sets of training data
Stars: ✭ 24 (-46.67%)
dnstraceCommand-line DNS benchmark
Stars: ✭ 68 (+51.11%)
word2vec-pytorchExtremely simple and fast word2vec implementation with Negative Sampling + Sub-sampling
Stars: ✭ 145 (+222.22%)
utils⚡ A collection of common functions but with better performance, less allocations and less dependencies created for Fiber.
Stars: ✭ 21 (-53.33%)
PersianNERNamed-Entity Recognition in Persian Language
Stars: ✭ 48 (+6.67%)
textaugmentTextAugment: Text Augmentation Library
Stars: ✭ 280 (+522.22%)
wordmapVisualize large text collections with WebGL
Stars: ✭ 23 (-48.89%)
streamalgExtensible stream pipelines with object algebras.
Stars: ✭ 26 (-42.22%)
receiptdIDReceipt.ID is a multi-label, multi-class, hierarchical classification system implemented in a two layer feed forward network.
Stars: ✭ 22 (-51.11%)
WebAudioEvaluationToolA tool based on the HTML5 Web Audio API to perform perceptual audio evaluation tests locally or on remote machines over the web.
Stars: ✭ 101 (+124.44%)
rust-web-benchmarksBenchmarking web frameworks written in rust with rewrk tool.
Stars: ✭ 97 (+115.56%)
perforatorRecord "perf" performance metrics for individual functions/regions of an ELF binary.
Stars: ✭ 33 (-26.67%)
S-WMDCode for Supervised Word Mover's Distance (SWMD)
Stars: ✭ 90 (+100%)
naacl2018-feverFact Extraction and VERification baseline published in NAACL2018
Stars: ✭ 109 (+142.22%)
acl2017 document clusteringcode for "Determining Gains Acquired from Word Embedding Quantitatively Using Discrete Distribution Clustering" ACL 2017
Stars: ✭ 21 (-53.33%)
StreamBenchMeasuring the performance of popular streaming engines with Yahoo's Streaming Benchmark
Stars: ✭ 52 (+15.56%)