328 open source projects that are alternatives to or similar to entity-embed

disent
🧶 Modular VAE disentanglement framework for Python built with PyTorch Lightning ▸ Including metrics and datasets ▸ With strongly supervised, weakly supervised and unsupervised methods ▸ Easily configured and run with Hydra config ▸ Inspired by disentanglement_lib
Stars: ✭ 41 (-57.29%)
Mutual labels:  representation-learning
RG-Flow
This is the project page for the paper "RG-Flow: a hierarchical and explainable flow model based on renormalization group and sparse prior". Paper link: https://arxiv.org/abs/2010.00029
Stars: ✭ 58 (-39.58%)
Mutual labels:  representation-learning
self-supervised
Whitening for Self-Supervised Representation Learning | Official repository
Stars: ✭ 83 (-13.54%)
Mutual labels:  representation-learning
ladder-vae-pytorch
Ladder Variational Autoencoders (LVAE) in PyTorch
Stars: ✭ 59 (-38.54%)
Mutual labels:  representation-learning
Data Matching Software
A list of free data matching and record linkage software.
Stars: ✭ 206 (+114.58%)
Mutual labels:  deduplication
Lsh
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
Stars: ✭ 182 (+89.58%)
Mutual labels:  deduplication
Restic
Fast, secure, efficient backup program
Stars: ✭ 15,105 (+15634.38%)
Mutual labels:  deduplication
Kvdo
A pair of kernel modules which provide pools of deduplicated and/or compressed block storage.
Stars: ✭ 168 (+75%)
Mutual labels:  deduplication
Dupeguru
Find duplicate files
Stars: ✭ 2,385 (+2384.38%)
Mutual labels:  deduplication
Dejavu
Quickly detect already witnessed data.
Stars: ✭ 151 (+57.29%)
Mutual labels:  deduplication
Vdo
Userspace tools for managing VDO volumes.
Stars: ✭ 138 (+43.75%)
Mutual labels:  deduplication
Spark Lucenerdd
Spark RDD with Lucene's query and entity linkage capabilities
Stars: ✭ 114 (+18.75%)
Mutual labels:  deduplication
Fingerprints
Make it easier to compare and cross-reference the names of companies and people by applying strong normalisation.
Stars: ✭ 91 (-5.21%)
Mutual labels:  deduplication
Rltk
Record Linkage ToolKit (Find and link entities)
Stars: ✭ 71 (-26.04%)
Mutual labels:  deduplication
Rmlint
Extremely fast tool to remove duplicates and other lint from your filesystem
Stars: ✭ 996 (+937.5%)
Mutual labels:  deduplication
Fastcdc Rs
FastCDC implementation in Rust
Stars: ✭ 31 (-67.71%)
Mutual labels:  deduplication
Dupandas
📊 Python package for performing deduplication using flexible text matching and cleaning in pandas DataFrames
Stars: ✭ 20 (-79.17%)
Mutual labels:  deduplication
Borgmatic
Simple, configuration-driven backup software for servers and workstations
Stars: ✭ 902 (+839.58%)
Mutual labels:  deduplication
Jdupes
A powerful duplicate file finder and an enhanced fork of 'fdupes'.
Stars: ✭ 790 (+722.92%)
Mutual labels:  deduplication
Rdedup
Data deduplication engine, supporting optional compression and public key encryption.
Stars: ✭ 690 (+618.75%)
Mutual labels:  deduplication
Talisman
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Stars: ✭ 584 (+508.33%)
Mutual labels:  deduplication
Recordlinkage
A toolkit for record linkage and duplicate detection in Python
Stars: ✭ 532 (+454.17%)
Mutual labels:  deduplication
Kopia
Cross-platform backup tool for Windows, macOS & Linux with fast, incremental backups, client-side end-to-end encryption, compression and data deduplication. CLI and GUI included.
Stars: ✭ 507 (+428.13%)
Mutual labels:  deduplication
Alertmanager
Prometheus Alertmanager
Stars: ✭ 4,574 (+4664.58%)
Mutual labels:  deduplication
lieu
Dedupe/batch geocode addresses and venues around the world with libpostal
Stars: ✭ 73 (-23.96%)
Mutual labels:  deduplication
UMICollapse
Accelerating the deduplication and collapsing process for reads with Unique Molecular Identifiers (UMI). Heavily optimized for scalability and orders of magnitude faster than a previous tool.
Stars: ✭ 31 (-67.71%)
Mutual labels:  deduplication
RocketMQDedupListener
Idempotent, deduplicating message consumer for RocketMQ. Supports MySQL or Redis as the idempotency table and works out of the box.
Stars: ✭ 132 (+37.5%)
Mutual labels:  deduplication
gencore
Generate duplex/single consensus reads to reduce sequencing noise and remove duplicates
Stars: ✭ 91 (-5.21%)
Mutual labels:  deduplication