pgEdge Anonymizer
pgEdge Anonymizer is a command-line tool for anonymizing personally identifiable information (PII) in a PostgreSQL database. Anonymizer replaces sensitive data with realistic but fake values while maintaining data consistency and referential integrity, so your development team has a working copy for experimentation and testing.
Warning
pgEdge Anonymizer is a development tool intended to obscure PII; it should not be run against data in a production environment.
pgEdge Anonymizer features:
- Pattern-based anonymization: 100+ built-in patterns for common PII types
- Consistent replacement: Same input values produce the same anonymized output within a run
- Foreign key awareness: Automatically handles CASCADE relationships
- Large database support: Efficient batch processing with server-side cursors
- Format preservation: Maintains original data formatting where possible
- Single transaction: All changes committed atomically or rolled back
- Extensible: Define custom patterns for your specific needs
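To make the "consistent replacement" and "format preservation" features concrete, here is a minimal Python sketch of the general idea. It is not pgEdge Anonymizer's implementation: the `ConsistentReplacer` class and its methods are hypothetical names invented for illustration, and the character-swapping strategy is an assumption that stands in for the tool's built-in patterns.

```python
import random


class ConsistentReplacer:
    """Hypothetical helper: maps each distinct input to one fake value per run."""

    def __init__(self, seed=None):
        self._rng = random.Random(seed)
        # original value -> replacement; the mapping only lives for this run
        self._seen = {}

    def _fake_like(self, value):
        # Format preservation (assumed strategy): swap digits for digits and
        # letters for letters, leaving punctuation and whitespace in place.
        out = []
        for ch in value:
            if ch.isdigit():
                out.append(self._rng.choice("0123456789"))
            elif ch.isalpha():
                pool = (
                    "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
                    if ch.isupper()
                    else "abcdefghijklmnopqrstuvwxyz"
                )
                out.append(self._rng.choice(pool))
            else:
                out.append(ch)
        return "".join(out)

    def replace(self, value):
        # Consistent replacement: reuse the stored fake value for repeat inputs.
        if value not in self._seen:
            self._seen[value] = self._fake_like(value)
        return self._seen[value]


replacer = ConsistentReplacer()
first = replacer.replace("555-123-4567")
second = replacer.replace("555-123-4567")
print(first == second)  # the same input yields the same output within a run
```

Because the mapping is held in memory for a single run, the same phone number appearing in several rows or tables receives the same replacement, which is what keeps joins on that value working after anonymization. A random character swap like this one does not produce realistic values, and it does not guarantee that two different inputs receive different outputs.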
pgEdge Anonymizer automatically analyzes foreign key relationships:
- CASCADE updates: If a column has referencing foreign keys with ON UPDATE CASCADE, the tool updates the source column and PostgreSQL propagates changes automatically.
- Processing order: Columns are processed in dependency order to maintain referential integrity.
- Skip targets: Columns that are CASCADE targets of other configured columns are automatically skipped to avoid duplicate processing.
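The rules above can be sketched as a small planning step. The Python below is illustrative only and does not reflect the tool's internals: the `plan_columns` function, the `(child, parent, cascades)` tuple layout, and the table and column names are all hypothetical. The ordering rule it applies (a referenced column is processed before the columns that reference it) is an assumption about what "dependency order" means here.

```python
from graphlib import TopologicalSorter


def plan_columns(configured, foreign_keys):
    """Return (processing_order, skipped) for the configured columns.

    configured:   list of "table.column" strings to anonymize.
    foreign_keys: list of (child, parent, cascades) tuples, where `cascades`
                  is True when the constraint is declared ON UPDATE CASCADE.
    """
    configured_set = set(configured)

    # Skip targets: a configured column that is the CASCADE target of another
    # configured column is updated by PostgreSQL when its parent changes.
    skipped = {
        child
        for child, parent, cascades in foreign_keys
        if cascades and child in configured_set and parent in configured_set
    }

    # Processing order (assumed rule): referenced columns come first.
    graph = {col: set() for col in configured if col not in skipped}
    for child, parent, _ in foreign_keys:
        if child in graph and parent in graph:
            graph[child].add(parent)

    order = list(TopologicalSorter(graph).static_order())
    return order, skipped


order, skipped = plan_columns(
    configured=[
        "customers.email",
        "orders.customer_email",
        "audit.customer_email",
    ],
    foreign_keys=[
        ("orders.customer_email", "customers.email", True),
        ("audit.customer_email", "customers.email", False),
    ],
)
print(order)
print(skipped)
```

In this example `orders.customer_email` is skipped because the cascade from `customers.email` already rewrites it, while `audit.customer_email` stays in the plan and is ordered after the column it references. `TopologicalSorter` raises `graphlib.CycleError` if the foreign keys form a cycle, so a real planner would need a policy for that case.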
For help with pgEdge Anonymizer issues, visit: