kdeldycke / mail-deduplicate

License: GPL-2.0
📧 CLI to deduplicate mails from mail boxes.

Programming Languages

python

Projects that are alternatives of or similar to mail-deduplicate

Neomutt
✉️ Teaching an Old Dog New Tricks -- IRC: #neomutt on irc.libera.chat
Stars: ✭ 2,343 (+1648.51%)
Mutual labels:  mail, maildir, mbox
maildir2gmail
Maildir 2 Gmail
Stars: ✭ 14 (-89.55%)
Mutual labels:  mailbox, maildir
yadf
Yet Another Dupes Finder
Stars: ✭ 32 (-76.12%)
Mutual labels:  dedupe, deduplication
zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Stars: ✭ 655 (+388.81%)
Mutual labels:  dedupe, deduplication
dduper
Fast block-level out-of-band BTRFS deduplication tool.
Stars: ✭ 108 (-19.4%)
Mutual labels:  dedupe, deduplication
Restic
Fast, secure, efficient backup program
Stars: ✭ 15,105 (+11172.39%)
Mutual labels:  dedupe, deduplication
mailing
📬 Sending emails with pleasure and prepared templates.
Stars: ✭ 15 (-88.81%)
Mutual labels:  mail, mailbox
is-biz-mail-php
isBizMail tells you whether a given email address belongs to a free email account provider (gmail.com, yahoo.es, yandex.ru etc) or not.
Stars: ✭ 19 (-85.82%)
Mutual labels:  mail, mailbox
node-mbox
mbox file parser for Node.js
Stars: ✭ 64 (-52.24%)
Mutual labels:  mail, mbox
yii2-symfonymailer
Yii 2 Symfony mailer extension.
Stars: ✭ 29 (-78.36%)
Mutual labels:  mail
laravel-mail-export
A simple mailable trait and interface to export mails to a storage disk once being sent.
Stars: ✭ 82 (-38.81%)
Mutual labels:  mail
enough mail
IMAP, POP3 and SMTP clients for Dart developers. Contains both low level as well as a high level API.
Stars: ✭ 78 (-41.79%)
Mutual labels:  mail
dedupsqlfs
Deduplicating filesystem via Python3, FUSE and SQLite
Stars: ✭ 24 (-82.09%)
Mutual labels:  deduplication
imail
small mail server
Stars: ✭ 88 (-34.33%)
Mutual labels:  mail
SplitShare
Shamir's Secret Sharing Algorithm implementation in golang combined with PGP and a mail delivery system
Stars: ✭ 31 (-76.87%)
Mutual labels:  mail
deduplication
Fast multi-threaded content-dependent chunking deduplication for Buffers in C++ with a reference implementation in Javascript. Ships with extensive tests, a fuzz test and a benchmark.
Stars: ✭ 59 (-55.97%)
Mutual labels:  deduplication
MailDemon
Smtp server for mass emailing, managing email lists and more. Built on .NET Core. Linux, MAC and Windows compatible.
Stars: ✭ 113 (-15.67%)
Mutual labels:  mail
cleanup
This tool was built for me and you to help us `cleanup` our folders with just a single command from the terminal.
Stars: ✭ 29 (-78.36%)
Mutual labels:  cleanup
scripts
A collection of random scripts I coded up
Stars: ✭ 17 (-87.31%)
Mutual labels:  mail
django-yubin
Send e-mails asynchronously using cron
Stars: ✭ 44 (-67.16%)
Mutual labels:  mail

Mail Deduplicate

What is Mail Deduplicate?

Provides the mdedup CLI, a utility to deduplicate mails from a set of mailboxes.

Features

  • Duplicate detection based on cherry-picked and normalized mail headers.
  • Fetch mails from multiple sources.
  • Reads and writes to mbox, maildir, babyl, mh and mmdf formats.
  • Deduplication strategies based on size, content, timestamp, file path or random choice.
  • Copy, move or delete the resulting set of duplicates.
  • Dry-run mode.
  • Protection against false-positives with safety checks on size and content differences.
  • Supports macOS, Linux and Windows.
  • Shell auto-completion for Bash, Zsh and Fish.
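
The header-based duplicate detection listed above can be illustrated with a minimal sketch. This is not mdedup's actual implementation: the header subset in HEADERS, the normalization rules, and the function names (normalize, fingerprint, group_duplicates) are all assumptions chosen for illustration, using only the Python standard library.

```python
import hashlib
from email.message import Message

# Assumption: an illustrative header subset, not the list mdedup really uses.
HEADERS = ("Date", "From", "To", "Subject", "Message-ID")


def normalize(value: str) -> str:
    """Collapse whitespace runs (including folded header lines) and lowercase."""
    return " ".join(value.split()).lower()


def fingerprint(msg: Message) -> str:
    """Hash the normalized values of the selected headers."""
    parts = [f"{name}:{normalize(str(msg.get(name, '')))}" for name in HEADERS]
    return hashlib.sha256("\n".join(parts).encode("utf-8")).hexdigest()


def group_duplicates(messages):
    """Group (key, message) pairs by fingerprint; keep groups of two or more."""
    groups = {}
    for key, msg in messages:
        groups.setdefault(fingerprint(msg), []).append(key)
    return {digest: keys for digest, keys in groups.items() if len(keys) > 1}
```

With the standard library's mailbox module, a call such as group_duplicates(mailbox.mbox(path).iteritems()) would return the keys of mails that share a fingerprint. Because only digests and keys are retained, the grouping step itself stays small; choosing which copy to keep (by size, timestamp, and so on) would be a separate step.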

⚠️ Warning: Performance

The mdedup implementation is quite naive at the moment: everything resides in memory.

This is good enough for a couple of gigabytes of mail, but the more emails mdedup tries to parse, the closer you get to the memory limits of your machine. Once those limits are reached, mdedup exits abruptly, killed by your OS's OOM killer. Your mileage may vary depending on your hardware.

You can influence the implementation of this feature through pull requests, or by purchasing business support 🤝 and sponsorship 🫶.

Example

Quickstart

The easiest way is to install mdedup with pipx:

$ pipx install mail-deduplicate

Other installation methods are available in the documentation.
