All Projects → microsoft → Presidio

microsoft / Presidio

Licence: mit
Context aware, pluggable and customizable data protection and anonymization SDK for text and images

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Presidio

Gdpr Checklist
The GDPR Checklist
Stars: ✭ 655 (-50.38%)
Mutual labels:  gdpr, privacy
Duckduckgo Privacy Extension
DuckDuckGo Privacy Essentials browser extension for Firefox, Chrome.
Stars: ✭ 692 (-47.58%)
Mutual labels:  privacy, privacy-protection
Social Amnesia
Forget the past. Social Amnesia makes sure your social media accounts only show your posts from recent history, not from "that phase" 5 years ago.
Stars: ✭ 656 (-50.3%)
Mutual labels:  privacy, privacy-protection
Gitmask
Contribute Code Anonymously
Stars: ✭ 355 (-73.11%)
Mutual labels:  privacy, privacy-protection
Vpn At Home
1-click, self-hosted deployment of OpenVPN with DNS ad blocking sinkhole
Stars: ✭ 1,106 (-16.21%)
Mutual labels:  privacy, privacy-protection
Hosts
Hostfile blocklist for ads and tracking, updated regularly
Stars: ✭ 398 (-69.85%)
Mutual labels:  privacy, privacy-protection
Datadefender
Sensitive Data Management: Data Discovery and Anonymization toolkit
Stars: ✭ 79 (-94.02%)
Mutual labels:  gdpr, privacy
fidesops
Privacy as Code for DSAR Orchestration: Privacy Request automation to fulfill GDPR, CCPA, and LGPD data subject requests.
Stars: ✭ 32 (-97.58%)
Mutual labels:  privacy, gdpr
Privatezilla
👀👮🐢🔥Performs a privacy & security check of Windows 10
Stars: ✭ 1,045 (-20.83%)
Mutual labels:  privacy, privacy-protection
Meething Ml Camera
Machine-Learning powered Virtual Camera with SVG Animation (alpha)
Stars: ✭ 36 (-97.27%)
Mutual labels:  privacy, privacy-protection
Thgtoa
The Hitchhiker’s Guide to Online Anonymity
Stars: ✭ 326 (-75.3%)
Mutual labels:  privacy, privacy-protection
Securityprivacyreferencearchitecture
Open Repository for the Open Security and Privacy Reference Architecture
Stars: ✭ 73 (-94.47%)
Mutual labels:  gdpr, privacy
Opendsr
A common framework enabling companies to work together to protect consumers' privacy and data rights.
Stars: ✭ 295 (-77.65%)
Mutual labels:  gdpr, privacy
Yett
🔐A small webpage library to control the execution of (third party) scripts
Stars: ✭ 615 (-53.41%)
Mutual labels:  gdpr, privacy
data
The data behind the Datenanfragen.de project. This contains a directory of contact information and privacy-related data on companies under the scope of the EU GDPR, a directory of supervisory authorities for privacy concerns, a collection of templates for GDPR requests and a list of suggested companies to send access requests to.
Stars: ✭ 61 (-95.38%)
Mutual labels:  privacy, gdpr
Shreddit
Remove your comment history on Reddit as deleting an account does not do so.
Stars: ✭ 669 (-49.32%)
Mutual labels:  privacy, privacy-protection
Hemmelig.app
Keep your sensitive information out of chat logs, emails, and more with encrypted secrets.
Stars: ✭ 183 (-86.14%)
Mutual labels:  gdpr, privacy-protection
kodex
A privacy and security engineering toolkit: Discover, understand, pseudonymize, anonymize, encrypt and securely share sensitive and personal data: Privacy and security as code.
Stars: ✭ 70 (-94.7%)
Mutual labels:  gdpr, privacy-protection
Mailtrackerblocker
Email tracker, read receipt and spy pixel blocker plugin for macOS Apple Mail
Stars: ✭ 821 (-37.8%)
Mutual labels:  privacy, privacy-protection
Conceal Core
Conceal Core - Daemon & Wallets (CLI)
Stars: ✭ 72 (-94.55%)
Mutual labels:  privacy, privacy-protection

Presidio - Data Protection and Anonymization API

Context aware, pluggable and customizable PII anonymization service for text and images.


Build Status MIT license Release Pypi Downloads PyPI package version

What is Presidio

Presidio (Origin from Latin praesidium ‘protection, garrison’) helps to ensure sensitive data is properly managed and governed. It provides fast identification and anonymization modules for private entities in text such as credit card numbers, names, locations, social security numbers, bitcoin wallets, US phone numbers, financial data and more.

Presidio demo gif

Goals

  • Allow organizations to preserve privacy in a simpler way by democratizing de-identification technologies and introducing transparency in decisions.
  • Embrace extensibility and customizability to a specific business need.
  • Facilitate both fully automated and semi-automated PII de-identification flows on multiple platforms.

Main features

  1. Predefined or custom PII recognizers leveraging Named Entity Recognition, regular expressions, rule based logic and checksum with relevant context in multiple languages.
  2. Options for connecting to external PII detection models.
  3. Multiple usage options, from Python or PySpark workloads through Docker to Kubernetes.
  4. Customizability in PII identification and anonymization.
  5. Module for redacting PII text in images.

⚠️ Presidio can help identify sensitive/PII data in un/structured text. However, because Presidio is using trained ML models, there is no guarantee that Presidio will find all sensitive information. Consequently, additional systems and protections should be employed.

📔 Full documentation

💭 Try Presidio with your own data

Installing Presidio

  1. Using pip
  2. Using Docker
  3. From source
  4. Supported Python Versions
  5. Migrating from V1 to V2

Running Presidio

  1. Getting started
  2. Setting up a development environment
  3. PII anonymization in text
  4. PII anonymization in images
  5. Usage samples and example deployments

Support

Contributing

For details on contributing to this repository, see the contributing guide.

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.microsoft.com.

When you submit a pull request, a CLA-bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., label, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].