Neural-Scam-ArtistWeb Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.
Stars: ✭ 18 (-71.87%)
cisip-FIReFast Image Retrieval (FIRe) is an open source project to promote image retrieval research. It implements most of the major binary hashing methods to date, together with different popular backbone networks and public datasets.
Stars: ✭ 40 (-37.5%)
Content-based-Recommender-SystemIt is a content based recommender system that uses tf-idf and cosine similarity for N Most SImilar Items from a dataset
Stars: ✭ 64 (+0%)
bns-short-text-similarity📖 Use Bi-normal Separation to find document vectors which is used to compute similarity for shorter sentences.
Stars: ✭ 24 (-62.5%)
lshLocality Sensitive Hashing for Go (Multi-probe LSH, LSH Forest, basic LSH)
Stars: ✭ 92 (+43.75%)
koolslaFood recommendation tool with Machine learning.
Stars: ✭ 21 (-67.19%)
DatasketchMinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble
Stars: ✭ 1,635 (+2454.69%)
DolphinnHigh Dimensional Approximate Near(est) Neighbor
Stars: ✭ 32 (-50%)
H2 ALSHAccurate and Fast ALSH for Maximum Inner Product Search (KDD 2018)
Stars: ✭ 18 (-71.87%)
product-quantization🙃Implementation of vector quantization algorithms, codes for Norm-Explicit Quantization: Improving Vector Quantization for Maximum Inner Product Search.
Stars: ✭ 40 (-37.5%)
image-ndd-lshNear-duplicate image detection using Locality Sensitive Hashing
Stars: ✭ 42 (-34.37%)
MoTISMobile(iOS) Text-to-Image search powered by multimodal semantic representation models(e.g., OpenAI's CLIP). Accepted at NAACL 2022.
Stars: ✭ 60 (-6.25%)
lshensembleLSH index for approximate set containment search
Stars: ✭ 48 (-25%)
Java String SimilarityImplementation of various string similarity and distance algorithms: Levenshtein, Jaro-winkler, n-Gram, Q-Gram, Jaccard index, Longest Common Subsequence edit distance, cosine similarity ...
Stars: ✭ 2,403 (+3654.69%)
keras-knnCode for the blog post Nearest Neighbors with Keras and CoreML
Stars: ✭ 25 (-60.94%)
Simple-Plagiarism-CheckerWeb Application for checking the similarity between query and document using the concept of Cosine Similarity.
Stars: ✭ 47 (-26.56%)
AI-for-Trading📈This repo contains detailed notes and multiple projects implemented in Python related to AI and Finance. Follow the blog here: https://purvasingh.medium.com
Stars: ✭ 59 (-7.81%)
set-sketch-paperSetSketch: Filling the Gap between MinHash and HyperLogLog
Stars: ✭ 23 (-64.06%)
live-cctvTo detect any reasonable change in a live cctv to avoid large storage of data. Once, we notice a change, our goal would be track that object or person causing it. We would be using Computer vision concepts. Our major focus will be on Deep Learning and will try to add as many features in the process.
Stars: ✭ 23 (-64.06%)
Naive-Resume-MatchingText Similarity Applied to resume, to compare Resumes with Job Descriptions and create a score to rank them. Similar to an ATS.
Stars: ✭ 27 (-57.81%)
tika-similarityTika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.
Stars: ✭ 92 (+43.75%)
Information-RetrievalInformation Retrieval algorithms developed in python. To follow the blog posts, click on the link:
Stars: ✭ 103 (+60.94%)
Plagiarism-checker-PythonA python project for checking plagiarism of documents based on cosine similarity
Stars: ✭ 114 (+78.13%)
Img2VecCosSim-Django-PytorchExtract a feature vector for any image and find the cosine similarity for comparison using Pytorch. I have used ResNet-18 to extract the feature vector of images. Finally a Django app is developed to input two images and to find the cosine similarity.
Stars: ✭ 20 (-68.75%)
stringdistanceA fuzzy matching string distance library for Scala and Java that includes Levenshtein distance, Jaro distance, Jaro-Winkler distance, Dice coefficient, N-Gram similarity, Cosine similarity, Jaccard similarity, Longest common subsequence, Hamming distance, and more..
Stars: ✭ 60 (-6.25%)
solr-vector-scoringVector Plugin for Solr: calculate dot product / cosine similarity on documents
Stars: ✭ 28 (-56.25%)