ir datasetsProvides a common interface to many IR ranking datasets.
Stars: ✭ 190 (+442.86%)
ProQAProgressively Pretrained Dense Corpus Index for Open-Domain QA and Information Retrieval
Stars: ✭ 44 (+25.71%)
patzillaPatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources.
Stars: ✭ 71 (+102.86%)
Transorthogonal LinguisticsUses a distributed word representation to finds words along the hyperchord of two input words.
Stars: ✭ 93 (+165.71%)
Information-RetrievalInformation Retrieval algorithms developed in python. To follow the blog posts, click on the link:
Stars: ✭ 103 (+194.29%)
pqlite⚡ A fast embedded library for approximate nearest neighbor search
Stars: ✭ 141 (+302.86%)
allsummarizerMultilingual automatic text summarizer using statistical approach and extraction
Stars: ✭ 28 (-20%)
Cw2vec基于字符训练词向量
Stars: ✭ 80 (+128.57%)
FinBERT-QAFinancial Domain Question Answering with pre-trained BERT Language Model
Stars: ✭ 70 (+100%)
Russian news corpusRussian mass media stemmed texts corpus / Корпус лемматизированных (морфологически нормализованных) текстов российских СМИ
Stars: ✭ 76 (+117.14%)
AsneA sparsity aware and memory efficient implementation of "Attributed Social Network Embedding" (TKDE 2018).
Stars: ✭ 73 (+108.57%)
IR-exercisesSolutions of the various test exams of the Information Retrieval course
Stars: ✭ 28 (-20%)
IP-TrackerTrack any ip address with IP-Tracker. IP-Tracker is developed for Linux and Termux. you can retrieve any ip address information using IP-Tracker.
Stars: ✭ 53 (+51.43%)
text2textText2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (+437.14%)
Repo 2017Python codes in Machine Learning, NLP, Deep Learning and Reinforcement Learning with Keras and Theano
Stars: ✭ 1,123 (+3108.57%)
sigir19-neural-irSource code for: On the Effect of Low-Frequency Terms on Neural-IR Models, SIGIR'19
Stars: ✭ 44 (+25.71%)
Word2vec訓練中文詞向量 Word2vec, Word2vec was created by a team of researchers led by Tomas Mikolov at Google.
Stars: ✭ 48 (+37.14%)
RolXAn alternative implementation of Recursive Feature and Role Extraction (KDD11 & KDD12)
Stars: ✭ 52 (+48.57%)
CukatifyCukatify is a music social media project
Stars: ✭ 21 (-40%)
rust-stemmersA rust implementation of some popular snowball stemming algorithms
Stars: ✭ 85 (+142.86%)
autocompleteEfficient and effective query auto-completion in C++.
Stars: ✭ 28 (-20%)
Word2vec Russian NovelsInspired by word2vec-pride-vis the replacement of words of Russian most valuable novels text with closest word2vec model words. By Boris Orekhov
Stars: ✭ 39 (+11.43%)
cs6101The Web IR / NLP Group (WING)'s public reading group at the National University of Singapore.
Stars: ✭ 17 (-51.43%)
ConceptualsearchTrain a Word2Vec model or LSA model, and Implement Conceptual Search\Semantic Search in Solr\Lucene - Simon Hughes Dice.com, Dice Tech Jobs
Stars: ✭ 245 (+600%)
Philo2vecAn implementation of word2vec applied to [stanford philosophy encyclopedia](http://plato.stanford.edu/)
Stars: ✭ 33 (-5.71%)
ServenetService Classification based on Service Description
Stars: ✭ 21 (-40%)
CatalystAccelerated deep learning R&D
Stars: ✭ 2,804 (+7911.43%)
BagofconceptsPython implementation of bag-of-concepts
Stars: ✭ 18 (-48.57%)
Lightnlp基于Pytorch和torchtext的自然语言处理深度学习框架。
Stars: ✭ 739 (+2011.43%)
RanknetMy (slightly modified) Keras implementation of RankNet and PyTorch implementation of LambdaRank.
Stars: ✭ 211 (+502.86%)
awesome-semantic-searchA curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.
Stars: ✭ 161 (+360%)
biovecProtVec can be used in protein interaction predictions, structure prediction, and protein data visualization.
Stars: ✭ 23 (-34.29%)
Cs224nCS224n: Natural Language Processing with Deep Learning Assignments Winter, 2017
Stars: ✭ 656 (+1774.29%)
HdltexHDLTex: Hierarchical Deep Learning for Text Classification
Stars: ✭ 191 (+445.71%)
Graph2vecA parallel implementation of "graph2vec: Learning Distributed Representations of Graphs" (MLGWorkshop 2017).
Stars: ✭ 605 (+1628.57%)
DRhardSIGIR'21: Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track.
Stars: ✭ 93 (+165.71%)
OpenmatchAn Open-Source Package for Information Retrieval.
Stars: ✭ 186 (+431.43%)
Active-Explainable-ClassificationA set of tools for leveraging pre-trained embeddings, active learning and model explainability for effecient document classification
Stars: ✭ 28 (-20%)
fuzzymaxCode for the paper: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR 2019.
Stars: ✭ 43 (+22.86%)
cherche📑 Neural Search
Stars: ✭ 196 (+460%)
Naive-Resume-MatchingText Similarity Applied to resume, to compare Resumes with Job Descriptions and create a score to rank them. Similar to an ATS.
Stars: ✭ 27 (-22.86%)
NeuralqaNeuralQA: A Usable Library for Question Answering on Large Datasets with BERT
Stars: ✭ 185 (+428.57%)
BM25Transformer(Python) transform a document-term matrix to an Okapi/BM25 representation
Stars: ✭ 50 (+42.86%)
Text Cnn嵌入Word2vec词向量的CNN中文文本分类
Stars: ✭ 298 (+751.43%)
RankingLearning to Rank in TensorFlow
Stars: ✭ 2,362 (+6648.57%)
LanguagecrunchLanguageCrunch NLP server docker image
Stars: ✭ 281 (+702.86%)
SiameseCBOWImplementation of Siamese CBOW using keras whose backend is tensorflow.
Stars: ✭ 14 (-60%)
Movietaster OpenA practical movie recommend project based on Item2vec.
Stars: ✭ 253 (+622.86%)
Bm25A Python implementation of the BM25 ranking function.
Stars: ✭ 159 (+354.29%)
textlyticsText processing library for sentiment analysis and related tasks
Stars: ✭ 25 (-28.57%)
wsdm-digg-2020No description or website provided.
Stars: ✭ 15 (-57.14%)
COVID19-IRQANo description or website provided.
Stars: ✭ 32 (-8.57%)
AravecAraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.
Stars: ✭ 239 (+582.86%)
word2vec.r📐Julia's implementation of word2vec in R
Stars: ✭ 23 (-34.29%)
AI booklet CE-AUTBooklet and exam of Artificial Intelligence Master Degree at Amirkabir University of technology.
Stars: ✭ 14 (-60%)
netizenshipa commandline #OSINT tool to find the online presence of a username in popular social media websites like Facebook, Instagram, Twitter, etc.
Stars: ✭ 33 (-5.71%)