The Personal Identifiable Information (PII) operator aims to detect and redact
personally identifiable information in datasets by combining pattern match and machine learning
solution.
PII refers to any information that can identify an individual, encompassing financial,
medical, educational, and employment records. Failure to protect PII can lead to identity
theft, financial loss, and affected reputations of individuals and businesses, highlighting
the importance of taking appropriate measures to safeguard sensitive information. The
Operators framework is OCI's most extensible,
low-code, managed ecosystem for detecting and redacting personally identifiable information in
dataset.
Use ads opctl for detecting and redacting personally identifiable
information tasks. This module is engineered with the principles of low-code development in
mind, making it accessible to users with varying degrees of technical expertise. It operates
on managed infrastructure, ensuring reliability and scalability, while its being configurable
through YAML lets you customize redactions to your specific needs.
For more information, see the PII section of the ADS documentation.
Automated Detection and Classification
By using pattern matching and AI-powered solutions, the PII Operator identifies sensitive
data on free-form texts.
Intelligent Co-reference Resolution 🔗
A feature of the PII Operator is its ability to maintain co-reference entity relationships
even after anonymization. Not only does this anonymize the data, but preserves the statistical
properties of the data.