
Albert-Ma / awesome-pretrained-models-for-information-retrieval

Licence: other
A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a., pretraining for IR).

Projects that are alternatives to, or similar to, awesome-pretrained-models-for-information-retrieval

JPQ
CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.
Stars: ✭ 39 (-85.97%)
Mutual labels:  information-retrieval, web-search, dense-retrieval
DRhard
SIGIR'21: Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track.
Stars: ✭ 93 (-66.55%)
Mutual labels:  information-retrieval, web-search
Catalyst
Accelerated deep learning R&D
Stars: ✭ 2,804 (+908.63%)
Mutual labels:  information-retrieval
ConvDR
Code repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"
Stars: ✭ 36 (-87.05%)
Mutual labels:  information-retrieval
patzilla
PatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources.
Stars: ✭ 71 (-74.46%)
Mutual labels:  information-retrieval
Conceptualsearch
Train a Word2Vec model or LSA model, and Implement Conceptual Search\Semantic Search in Solr\Lucene - Simon Hughes Dice.com, Dice Tech Jobs
Stars: ✭ 245 (-11.87%)
Mutual labels:  information-retrieval
query-wellformedness
25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with human ratings of whether they are well-formed natural language questions.
Stars: ✭ 80 (-71.22%)
Mutual labels:  information-retrieval
Ranknet
My (slightly modified) Keras implementation of RankNet and PyTorch implementation of LambdaRank.
Stars: ✭ 211 (-24.1%)
Mutual labels:  information-retrieval
LuceneTutorial
A simple tutorial of Lucene for LIS 501 Introduction to Text Mining students at the University of Wisconsin-Madison (Fall 2021).
Stars: ✭ 62 (-77.7%)
Mutual labels:  information-retrieval
pqlite
⚡ A fast embedded library for approximate nearest neighbor search
Stars: ✭ 141 (-49.28%)
Mutual labels:  information-retrieval
perke
A keyphrase extractor for Persian
Stars: ✭ 60 (-78.42%)
Mutual labels:  information-retrieval
FinBERT-QA
Financial Domain Question Answering with pre-trained BERT Language Model
Stars: ✭ 70 (-74.82%)
Mutual labels:  information-retrieval
ComposeAE
Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval
Stars: ✭ 49 (-82.37%)
Mutual labels:  information-retrieval
ImageRetrieval
Content Based Image Retrieval Techniques (e.g. knn, svm using MatLab GUI)
Stars: ✭ 51 (-81.65%)
Mutual labels:  information-retrieval
Trinity
Trinity IR Infrastructure
Stars: ✭ 227 (-18.35%)
Mutual labels:  information-retrieval
SENet-for-Weakly-Supervised-Relation-Extraction
No description or website provided.
Stars: ✭ 39 (-85.97%)
Mutual labels:  information-retrieval
Aquiladb
Drop in solution for Decentralized Neural Information Retrieval. Index latent vectors along with JSON metadata and do efficient k-NN search.
Stars: ✭ 222 (-20.14%)
Mutual labels:  information-retrieval
IR-exercises
Solutions of the various test exams of the Information Retrieval course
Stars: ✭ 28 (-89.93%)
Mutual labels:  information-retrieval
gpl
Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
Stars: ✭ 216 (-22.3%)
Mutual labels:  information-retrieval
beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Stars: ✭ 738 (+165.47%)
Mutual labels:  information-retrieval



awesome-pretrained-models-for-information-retrieval

A curated list of awesome papers related to pre-trained models for information retrieval (a.k.a. pre-training for IR). If I have missed any papers, feel free to open a PR to include them. Feedback and contributions are welcome!

Pre-training for IR

Survey Papers

First Stage Retrieval

Sparse Retrieval

Neural term re-weighting

Query or document expansion

Sparse representation learning

Dense Retrieval

Hard negative sampling

Late interaction and multi-vector representation

Knowledge distillation

Pre-training tailored for dense retrieval

Jointly learning retrieval and indexing

Domain adaptation

Query reformulation

Bias

Hybrid Retrieval

Re-ranking Stage

Basic Usage

Discriminative ranking models

Representation-focused
Interaction-focused

Generative ranking models

Hybrid ranking models

Long Document Processing Techniques

Passage score aggregation

Passage representation aggregation

Designing new architectures

Improving Efficiency

Decoupling the interaction

Knowledge distillation

Partial Fine-tuning

Early exit

Other Topics

Query Expansion

Re-weighting Training Samples

Pre-training Tailored for Re-ranking

Adversarial Attack and Defence

Cross-lingual Retrieval

Jointly Learning Retrieval and Re-ranking

Model-based IR System

Multimodal Retrieval

Unified Single-stream Architecture

Multi-stream Architecture Applied on Input

Other Resources

Some Retrieval Toolkits

Other Resources About Pre-trained Models in NLP

Surveys About Efficient Transformers
