PII Tagging Suggestions

Catalog automatically detects Personally Identifiable Information (PII). It analyzes metadata to identify potential PII, then categorizes and tags what it finds.

Auto-Detection Process

Metadata Scanning

Catalog automatically scans metadata to identify potential PII. Because the scan relies on metadata alone, it can flag sensitive information without accessing the actual content of the data.

Detection Patterns

The auto-detection model uses predefined detection patterns to identify PII. These patterns include rules and regular expressions tailored to recognize common forms of PII.
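As an illustration of pattern-based detection on metadata, the following is a minimal sketch that matches regular expressions against column names. The pattern set, the category names, and the `detect_pii` function are hypothetical; Catalog's actual rules are not documented here and may differ.

```python
import re

# Hypothetical detection patterns, keyed by PII category.
# These are illustrative only and are not Catalog's real rule set.
PII_PATTERNS = {
    "email": re.compile(r"e[-_]?mail", re.IGNORECASE),
    "phone": re.compile(r"phone|mobile", re.IGNORECASE),
    "name": re.compile(r"(^|_)(first|last|full)?_?name($|_)", re.IGNORECASE),
}


def detect_pii(column_name):
    """Return the sorted PII categories whose pattern matches the column name.

    Only the column name (metadata) is inspected; no row data is read.
    """
    return sorted(
        category
        for category, pattern in PII_PATTERNS.items()
        if pattern.search(column_name)
    )


for column in ["email", "first_name", "created_at"]:
    print(column, detect_pii(column))
```

Name-based rules like these are cheap to run, but they can miss unusually named columns or flag harmless ones, which is one reason to treat the results as suggestions to review.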

Categorization of PII

Identification and Tagging

Once Catalog detects potential PII, it categorizes the information and tags it accordingly. These tags make sensitive information easier to classify and manage, so that it can be handled appropriately.
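The tagging step can be pictured with a short sketch. The `Column` class, the `pii:<category>` tag format, and the `apply_pii_tags` function below are assumptions made for illustration and do not reflect Catalog's internal data model or API.

```python
from dataclasses import dataclass, field


@dataclass
class Column:
    """Hypothetical stand-in for a catalog column and its tags."""
    name: str
    tags: list = field(default_factory=list)


def apply_pii_tags(columns, detections):
    """Attach a 'pii:<category>' tag to each column that has a detection.

    `detections` maps a column name to its detected PII category.
    Existing tags are kept and duplicate tags are not added.
    """
    for column in columns:
        category = detections.get(column.name)
        if category is None:
            continue
        tag = f"pii:{category}"
        if tag not in column.tags:
            column.tags.append(tag)
    return columns


columns = [Column("name"), Column("email")]
apply_pii_tags(columns, {"email": "email"})
for column in columns:
    print(column.name, column.tags)
```

In this sketch only the `email` column receives a tag, and applying the same detections twice leaves the tags unchanged.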

Image: CastorDoc interface displaying a table with column names, attributes, descriptions, tags, and details for a dataset in BigQuery.
Image: Table with two rows, labeled name and email, indicating fields for user information. The email field is marked as sensitive PII.