Personal Identifiable Information
The Personal Identifiable Information (PII) operator aims to detect and redact personally identifiable information in datasets by combining pattern match and machine learning solution.
PII refers to any information that can identify an individual, encompassing financial, medical, educational, and employment records. Failure to protect PII can lead to identity theft, financial loss, and affected reputations of individuals and businesses, highlighting the importance of taking appropriate measures to safeguard sensitive information. The Operators framework is OCI's most extensible, low-code, managed ecosystem for detecting and redacting personally identifiable information in dataset.
Use ads opctl
for detecting and redacting personally identifiable
information tasks. This module is engineered with the principles of low-code development in
mind, making it accessible to users with varying degrees of technical expertise. It operates
on managed infrastructure, ensuring reliability and scalability, while its being configurable
through YAML lets you customize redactions to your specific needs.
For more information, see the PII section of the ADS documentation.
Automated Detection and Classification
By using pattern matching and AI-powered solutions, the PII Operator identifies sensitive data on free-form texts.
Intelligent Co-reference Resolution
A feature of the PII Operator is its ability to maintain co-reference entity relationships even after anonymization. Not only does this anonymize the data, but preserves the statistical properties of the data.