Datasketch: MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble
Dolphinn: High Dimensional Approximate Near(est) Neighbor
H2 ALSH: Accurate and Fast ALSH for Maximum Inner Product Search (KDD 2018)
product-quantization: Implementation of vector quantization algorithms, codes for Norm-Explicit Quantization: Improving Vector Quantization for Maximum Inner Product Search.
image-ndd-lsh: Near-duplicate image detection using Locality Sensitive Hashing
MoTIS: Mobile (iOS) Text-to-Image search powered by multimodal semantic representation models (e.g., OpenAI's CLIP). Accepted at NAACL 2022.
lshensemble: LSH index for approximate set containment search
lsh-rs: Locality Sensitive Hashing in Rust with Python bindings
Neural-Scam-Artist: Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.
lsh: Locality Sensitive Hashing for Go (Multi-probe LSH, LSH Forest, basic LSH)
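
Several of the projects listed here build on MinHash signatures combined with banded LSH. As a rough illustration of that idea, here is a minimal standard-library Python sketch. It is not the API of any library in this list: the function names (minhash_signature, estimate_jaccard, lsh_band_keys), the use of salted BLAKE2b as the hash family, and the default sizes are all assumptions made for the example.

```python
import hashlib


def minhash_signature(tokens, num_perm=128):
    """Build a MinHash signature for a non-empty collection of string tokens.

    Each of the num_perm slots holds the minimum hash value over the set,
    computed with a differently salted hash function (an assumed hash family).
    """
    encoded = [t.encode("utf-8") for t in set(tokens)]
    if not encoded:
        raise ValueError("tokens must be non-empty")
    signature = []
    for seed in range(num_perm):
        salt = seed.to_bytes(8, "little")  # one salt per simulated permutation
        signature.append(min(
            int.from_bytes(
                hashlib.blake2b(tok, digest_size=8, salt=salt).digest(), "big")
            for tok in encoded))
    return signature


def estimate_jaccard(sig_a, sig_b):
    """Estimate Jaccard similarity as the fraction of matching signature slots."""
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    matches = sum(a == b for a, b in zip(sig_a, sig_b))
    return matches / len(sig_a)


def lsh_band_keys(signature, bands=16):
    """Split a signature into bands; two sets sharing any key are candidates."""
    if len(signature) % bands != 0:
        raise ValueError("signature length must be divisible by bands")
    rows = len(signature) // bands
    return [(i, tuple(signature[i * rows:(i + 1) * rows])) for i in range(bands)]


if __name__ == "__main__":
    doc_a = "locality sensitive hashing for near duplicate detection".split()
    doc_b = "locality sensitive hashing for approximate duplicate search".split()
    sig_a, sig_b = minhash_signature(doc_a), minhash_signature(doc_b)
    print("estimated Jaccard:", estimate_jaccard(sig_a, sig_b))
    shared = set(lsh_band_keys(sig_a)) & set(lsh_band_keys(sig_b))
    print("shared LSH bands:", len(shared))
```

More bands with fewer rows each make candidate matches more likely at low similarity, while fewer, wider bands make the filter stricter. Production libraries typically use cheaper hash families and tuned band parameters, so treat this only as a conceptual outline.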