solrApache Solr open-source search software
Stars: ✭ 651 (+1487.8%)
pqlite⚡ A fast embedded library for approximate nearest neighbor search
Stars: ✭ 141 (+243.9%)
netizenshipa commandline #OSINT tool to find the online presence of a username in popular social media websites like Facebook, Instagram, Twitter, etc.
Stars: ✭ 33 (-19.51%)
AquiladbDrop in solution for Decentralized Neural Information Retrieval. Index latent vectors along with JSON metadata and do efficient k-NN search.
Stars: ✭ 222 (+441.46%)
tutorialsA tutorial series by Preferred.AI
Stars: ✭ 136 (+231.71%)
ImageRetrievalContent Based Image Retrieval Techniques (e.g. knn, svm using MatLab GUI)
Stars: ✭ 51 (+24.39%)
ComposeAEOfficial code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval
Stars: ✭ 49 (+19.51%)
3d model retrieverExperimenting with a newly published deep learning paper and how it can be used for content-based 3D model retrieval. (info retrieval for CAD)
Stars: ✭ 45 (+9.76%)
rust-stemmersA rust implementation of some popular snowball stemming algorithms
Stars: ✭ 85 (+107.32%)
Rank bm25A Collection of BM25 Algorithms in Python
Stars: ✭ 187 (+356.1%)
DRhardSIGIR'21: Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track.
Stars: ✭ 93 (+126.83%)
beirA Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Stars: ✭ 738 (+1700%)
FieldedSDMFielded Sequential Dependence Model (code and runs)
Stars: ✭ 32 (-21.95%)
ConvDRCode repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"
Stars: ✭ 36 (-12.2%)
ml4irMachine Learning for Information Retrieval
Stars: ✭ 75 (+82.93%)
gplPowerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
Stars: ✭ 216 (+426.83%)
ir datasetsProvides a common interface to many IR ranking datasets.
Stars: ✭ 190 (+363.41%)
IR-exercisesSolutions of the various test exams of the Information Retrieval course
Stars: ✭ 28 (-31.71%)
HARCode for WWW2019 paper "A Hierarchical Attention Retrieval Model for Healthcare Question Answering"
Stars: ✭ 22 (-46.34%)
TrinityTrinity IR Infrastructure
Stars: ✭ 227 (+453.66%)
IP-TrackerTrack any ip address with IP-Tracker. IP-Tracker is developed for Linux and Termux. you can retrieve any ip address information using IP-Tracker.
Stars: ✭ 53 (+29.27%)
PwnbackBurp Extender plugin that generates a sitemap of a website using Wayback Machine
Stars: ✭ 203 (+395.12%)
EMNLP2020This is official Pytorch code and datasets of the paper "Where Are the Facts? Searching for Fact-checked Information to Alleviate the Spread of Fake News", EMNLP 2020.
Stars: ✭ 55 (+34.15%)
ProQAProgressively Pretrained Dense Corpus Index for Open-Domain QA and Information Retrieval
Stars: ✭ 44 (+7.32%)
OpenmatchAn Open-Source Package for Information Retrieval.
Stars: ✭ 186 (+353.66%)
bookworm📚 social networks from novels
Stars: ✭ 72 (+75.61%)
COVID19-IRQANo description or website provided.
Stars: ✭ 32 (-21.95%)
BM25Transformer(Python) transform a document-term matrix to an Okapi/BM25 representation
Stars: ✭ 50 (+21.95%)
LuceneTutorialA simple tutorial of Lucene for LIS 501 Introduction to Text Mining students at the University of Wisconsin-Madison (Fall 2021).
Stars: ✭ 62 (+51.22%)
MimirOSINT Threat Intel Interface - CLI for HoneyDB
Stars: ✭ 104 (+153.66%)
srctools for fast reading of docs
Stars: ✭ 40 (-2.44%)
perkeA keyphrase extractor for Persian
Stars: ✭ 60 (+46.34%)
memex-gateGeneral Architecture for Text Engineering
Stars: ✭ 47 (+14.63%)
query-wellformedness25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with human ratings of whether they are well-formed natural language questions.
Stars: ✭ 80 (+95.12%)
GNN-Recommender-SystemsAn index of recommendation algorithms that are based on Graph Neural Networks.
Stars: ✭ 505 (+1131.71%)
patzillaPatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources.
Stars: ✭ 71 (+73.17%)
Intention-Mining-Intention Mining in Social Networking. It Mines Emotions and polarity for the given keyword . For the keyword it searchers the twitter for the comments and analyzes the results for various events such as Election results, Sports prediction Movie ratings, Breaking news events such as demonetisation and many more. Bayes , Maximum Entropy and Hidde…
Stars: ✭ 19 (-53.66%)
FinBERT-QAFinancial Domain Question Answering with pre-trained BERT Language Model
Stars: ✭ 70 (+70.73%)
sigir19-neural-irSource code for: On the Effect of Low-Frequency Terms on Neural-IR Models, SIGIR'19
Stars: ✭ 44 (+7.32%)
allsummarizerMultilingual automatic text summarizer using statistical approach and extraction
Stars: ✭ 28 (-31.71%)
ConceptualsearchTrain a Word2Vec model or LSA model, and Implement Conceptual Search\Semantic Search in Solr\Lucene - Simon Hughes Dice.com, Dice Tech Jobs
Stars: ✭ 245 (+497.56%)
kexKex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public datasets.
Stars: ✭ 46 (+12.2%)
CatalystAccelerated deep learning R&D
Stars: ✭ 2,804 (+6739.02%)
AI booklet CE-AUTBooklet and exam of Artificial Intelligence Master Degree at Amirkabir University of technology.
Stars: ✭ 14 (-65.85%)
RanknetMy (slightly modified) Keras implementation of RankNet and PyTorch implementation of LambdaRank.
Stars: ✭ 211 (+414.63%)
MixGCFMixGCF: An Improved Training Method for Graph Neural Network-based Recommender Systems, KDD2021
Stars: ✭ 73 (+78.05%)
HdltexHDLTex: Hierarchical Deep Learning for Text Classification
Stars: ✭ 191 (+365.85%)
nalcosSearch Git commits in natural language
Stars: ✭ 50 (+21.95%)
BERT-QECode and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".
Stars: ✭ 43 (+4.88%)
intergoA package for interleaving / multileaving ranking generation in go
Stars: ✭ 30 (-26.83%)
lldaLabeled LDA in Python
Stars: ✭ 19 (-53.66%)
ake-datasetsLarge, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.
Stars: ✭ 125 (+204.88%)
cs6101The Web IR / NLP Group (WING)'s public reading group at the National University of Singapore.
Stars: ✭ 17 (-58.54%)
naacl2018-feverFact Extraction and VERification baseline published in NAACL2018
Stars: ✭ 109 (+165.85%)