tutorialsA tutorial series by Preferred.AI
Stars: ✭ 136 (+615.79%)
Tutorial Utilizing KgResources for Tutorial on "Utilizing Knowledge Graphs in Text-centric Information Retrieval"
Stars: ✭ 148 (+678.95%)
beirA Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Stars: ✭ 738 (+3784.21%)
Rated Ranking EvaluatorSearch Quality Evaluation Tool for Apache Solr & Elasticsearch search-based infrastructures
Stars: ✭ 134 (+605.26%)
solrApache Solr open-source search software
Stars: ✭ 651 (+3326.32%)
Dan Jurafsky Chris Manning NlpMy solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Stars: ✭ 124 (+552.63%)
ml4irMachine Learning for Information Retrieval
Stars: ✭ 75 (+294.74%)
Haystack🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.
Stars: ✭ 3,409 (+17842.11%)
ConvDRCode repo for SIGIR 2021 paper "Few-Shot Conversational Dense Retrieval"
Stars: ✭ 36 (+89.47%)
Pytrec evalpytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.
Stars: ✭ 114 (+500%)
ake-datasetsLarge, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.
Stars: ✭ 125 (+557.89%)
Ds2iA library of inverted index data structures
Stars: ✭ 104 (+447.37%)
ImageRetrievalContent Based Image Retrieval Techniques (e.g. knn, svm using MatLab GUI)
Stars: ✭ 51 (+168.42%)
FlexneuartFlexible classic and NeurAl Retrieval Toolkit
Stars: ✭ 99 (+421.05%)
ForteForte is a flexible and powerful NLP builder FOR TExt. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 89 (+368.42%)
gplPowerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
Stars: ✭ 216 (+1036.84%)
Pyndripyndri is a Python interface to the Indri search engine.
Stars: ✭ 85 (+347.37%)
IP-TrackerTrack any ip address with IP-Tracker. IP-Tracker is developed for Linux and Termux. you can retrieve any ip address information using IP-Tracker.
Stars: ✭ 53 (+178.95%)
VectorsinsearchDice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015
Stars: ✭ 71 (+273.68%)
pqlite⚡ A fast embedded library for approximate nearest neighbor search
Stars: ✭ 141 (+642.11%)
GaanaapiUnofficial Gaana API
Stars: ✭ 59 (+210.53%)
HARCode for WWW2019 paper "A Hierarchical Attention Retrieval Model for Healthcare Question Answering"
Stars: ✭ 22 (+15.79%)
IR-exercisesSolutions of the various test exams of the Information Retrieval course
Stars: ✭ 28 (+47.37%)
Domain discovery toolThis repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better understand a domain (or topic) as it is represented on the Web.
Stars: ✭ 33 (+73.68%)
ir datasetsProvides a common interface to many IR ranking datasets.
Stars: ✭ 190 (+900%)
PkePython Keyphrase Extraction module
Stars: ✭ 855 (+4400%)
ComposeAEOfficial code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval
Stars: ✭ 49 (+157.89%)
Date InfoAPI to let user fetch the events that happen(ed) on a specific date
Stars: ✭ 7 (-63.16%)
3d model retrieverExperimenting with a newly published deep learning paper and how it can be used for content-based 3D model retrieval. (info retrieval for CAD)
Stars: ✭ 45 (+136.84%)
FxtA large scale feature extraction tool for text-based machine learning
Stars: ✭ 25 (+31.58%)
TrinityTrinity IR Infrastructure
Stars: ✭ 227 (+1094.74%)
cs6101The Web IR / NLP Group (WING)'s public reading group at the National University of Singapore.
Stars: ✭ 17 (-10.53%)
AnseriniA Lucene toolkit for replicable information retrieval research
Stars: ✭ 573 (+2915.79%)
AquiladbDrop in solution for Decentralized Neural Information Retrieval. Index latent vectors along with JSON metadata and do efficient k-NN search.
Stars: ✭ 222 (+1068.42%)
Deep Semantic Similarity ModelMy Keras implementation of the Deep Semantic Similarity Model (DSSM)/Convolutional Latent Semantic Model (CLSM) described here: http://research.microsoft.com/pubs/226585/cikm2014_cdssm_final.pdf.
Stars: ✭ 509 (+2578.95%)
EMNLP2020This is official Pytorch code and datasets of the paper "Where Are the Facts? Searching for Fact-checked Information to Alleviate the Spread of Fake News", EMNLP 2020.
Stars: ✭ 55 (+189.47%)
PisaPISA: Performant Indexes and Search for Academia
Stars: ✭ 489 (+2473.68%)
PwnbackBurp Extender plugin that generates a sitemap of a website using Wayback Machine
Stars: ✭ 203 (+968.42%)
Telegram Scrapertelegram group scraper tool. fetch all information about group members
Stars: ✭ 450 (+2268.42%)
FieldedSDMFielded Sequential Dependence Model (code and runs)
Stars: ✭ 32 (+68.42%)
Sequence Semantic EmbeddingTools and recipes to train deep learning models and build services for NLP tasks such as text classification, semantic search ranking and recall fetching, cross-lingual information retrieval, and question answering etc.
Stars: ✭ 435 (+2189.47%)
Rank bm25A Collection of BM25 Algorithms in Python
Stars: ✭ 187 (+884.21%)
Osi.igInformation Gathering Instagram.
Stars: ✭ 377 (+1884.21%)
naacl2018-feverFact Extraction and VERification baseline published in NAACL2018
Stars: ✭ 109 (+473.68%)
SparklerSpark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (+1805.26%)
Vec4irWord Embeddings for Information Retrieval
Stars: ✭ 188 (+889.47%)
GetaltnameExtract subdomains from SSL certificates in HTTPS sites.
Stars: ✭ 320 (+1584.21%)
DRhardSIGIR'21: Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track.
Stars: ✭ 93 (+389.47%)
K NrmK-NRM: End-to-End Neural Ad-hoc Ranking with Kernel Pooling
Stars: ✭ 183 (+863.16%)
MimirOSINT Threat Intel Interface - CLI for HoneyDB
Stars: ✭ 104 (+447.37%)
AI booklet CE-AUTBooklet and exam of Artificial Intelligence Master Degree at Amirkabir University of technology.
Stars: ✭ 14 (-26.32%)
memex-gateGeneral Architecture for Text Engineering
Stars: ✭ 47 (+147.37%)
BM25Transformer(Python) transform a document-term matrix to an Okapi/BM25 representation
Stars: ✭ 50 (+163.16%)
COVID19-IRQANo description or website provided.
Stars: ✭ 32 (+68.42%)
Bm25A Python implementation of the BM25 ranking function.
Stars: ✭ 159 (+736.84%)