MilvusAn open-source vector database for embedding similarity search and AI applications.
Stars: ✭ 9,015 (+6293.62%)
product-quantization🙃Implementation of vector quantization algorithms, codes for Norm-Explicit Quantization: Improving Vector Quantization for Maximum Inner Product Search.
Stars: ✭ 40 (-71.63%)
cherche📑 Neural Search
Stars: ✭ 196 (+39.01%)
Haystack🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.
Stars: ✭ 3,409 (+2317.73%)
JinaCloud-native neural search framework for 𝙖𝙣𝙮 kind of data
Stars: ✭ 12,618 (+8848.94%)
MoTISMobile(iOS) Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP). Accepted at NAACL 2022.
Stars: ✭ 60 (-57.45%)
instant-distanceFast approximate nearest neighbor searching in Rust, based on HNSW index
Stars: ✭ 140 (-0.71%)
gplPowerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
Stars: ✭ 216 (+53.19%)
JPQCIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.
Stars: ✭ 39 (-72.34%)
Scilla🏴☠️ Information Gathering tool 🏴☠️ DNS / Subdomains / Ports / Directories enumeration
Stars: ✭ 116 (-17.73%)
NeuralqaNeuralQA: A Usable Library for Question Answering on Large Datasets with BERT
Stars: ✭ 185 (+31.21%)
CatalystAccelerated deep learning R&D
Stars: ✭ 2,804 (+1888.65%)
VtextSimple NLP in Rust with Python bindings
Stars: ✭ 108 (-23.4%)
RankingLearning to Rank in TensorFlow
Stars: ✭ 2,362 (+1575.18%)
SertSemantic Entity Retrieval Toolkit
Stars: ✭ 100 (-29.08%)
Bm25A Python implementation of the BM25 ranking function.
Stars: ✭ 159 (+12.77%)
SolrpluginsDice Solr Plugins from Simon Hughes Dice.com
Stars: ✭ 86 (-39.01%)
Textrank Keyword ExtractionKeyword extraction using TextRank algorithm after pre-processing the text with lemmatization, filtering unwanted parts-of-speech and other techniques.
Stars: ✭ 79 (-43.97%)
sigir19-neural-irSource code for: On the Effect of Low-Frequency Terms on Neural-IR Models, SIGIR'19
Stars: ✭ 44 (-68.79%)
RanknetMy (slightly modified) Keras implementation of RankNet and PyTorch implementation of LambdaRank.
Stars: ✭ 211 (+49.65%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+8951.77%)
Wordtokenizers.jlHigh performance tokenizers for natural language processing and other related tasks
Stars: ✭ 63 (-55.32%)
FreediscoveryWeb Service for E-Discovery Analytics
Stars: ✭ 59 (-58.16%)
Dan Jurafsky Chris Manning NlpMy solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Stars: ✭ 124 (-12.06%)
Vec4irWord Embeddings for Information Retrieval
Stars: ✭ 188 (+33.33%)
PyseriniPython interface to the Anserini IR toolkit built on Lucene
Stars: ✭ 148 (+4.96%)
ScdvText classification with Sparse Composite Document Vectors.
Stars: ✭ 54 (-61.7%)
TrinityTrinity IR Infrastructure
Stars: ✭ 227 (+60.99%)
Pytrec evalpytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.
Stars: ✭ 114 (-19.15%)
K NrmK-NRM: End-to-End Neural Ad-hoc Ranking with Kernel Pooling
Stars: ✭ 183 (+29.79%)
Ds2iA library of inverted index data structures
Stars: ✭ 104 (-26.24%)
FergunAn utility Discord bot written in C# using Discord.Net
Stars: ✭ 26 (-81.56%)
FlexneuartFlexible classic and NeurAl Retrieval Toolkit
Stars: ✭ 99 (-29.79%)
BooksBooks worth spreading
Stars: ✭ 161 (+14.18%)
ForteForte is a flexible and powerful NLP builder FOR TExt. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 89 (-36.88%)
AquiladbDrop in solution for Decentralized Neural Information Retrieval. Index latent vectors along with JSON metadata and do efficient k-NN search.
Stars: ✭ 222 (+57.45%)
Pyndripyndri is a Python interface to the Indri search engine.
Stars: ✭ 85 (-39.72%)
Sf1r LiteSearch Formula-1——A distributed high performance massive data engine for enterprise/vertical search
Stars: ✭ 158 (+12.06%)
VectorsinsearchDice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015
Stars: ✭ 71 (-49.65%)
IR-exercisesSolutions of the various test exams of the Information Retrieval course
Stars: ✭ 28 (-80.14%)
GaanaapiUnofficial Gaana API
Stars: ✭ 59 (-58.16%)
PwnbackBurp Extender plugin that generates a sitemap of a website using Wayback Machine
Stars: ✭ 203 (+43.97%)
Domain discovery toolThis repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better understand a domain (or topic) as it is represented on the Web.
Stars: ✭ 33 (-76.6%)
Tutorial Utilizing KgResources for Tutorial on "Utilizing Knowledge Graphs in Text-centric Information Retrieval"
Stars: ✭ 148 (+4.96%)
NprfNPRF: A Neural Pseudo Relevance Feedback Framework for Ad-hoc Information Retrieval
Stars: ✭ 31 (-78.01%)
PkePython Keyphrase Extraction module
Stars: ✭ 855 (+506.38%)
ComposeAEOfficial code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval
Stars: ✭ 49 (-65.25%)
HdltexHDLTex: Hierarchical Deep Learning for Text Classification
Stars: ✭ 191 (+35.46%)
InvoicenetDeep neural network to extract intelligent information from invoice documents.
Stars: ✭ 1,886 (+1237.59%)
Knowledge GraphsA collection of research on knowledge graphs
Stars: ✭ 845 (+499.29%)
Date InfoAPI to let user fetch the events that happen(ed) on a specific date
Stars: ✭ 7 (-95.04%)
Drl4nlp.scratchpadNotes on Deep Reinforcement Learning for Natural Language Processing papers
Stars: ✭ 26 (-81.56%)
FxtA large scale feature extraction tool for text-based machine learning
Stars: ✭ 25 (-82.27%)
Rank bm25A Collection of BM25 Algorithms in Python
Stars: ✭ 187 (+32.62%)