Data masking capabilities
Random data masking
Random masking consists of masking an input value with a randomly generated value.
When there are multiple occurrences of the same value in the input dataset, it can be masked with different values.
Different values from the input dataset can be masked with the same value.
Random data masking examples
Input values | Extra Parameter | Examples of masked values |
---|---|---|
newuser@domain.com | "4" | ohsbser@domain.com |
admin@company.com | "4" | lneen@company.com |
newuser@domain.com | "4" | qzmaser@domain.com |
Input values | Extra Parameter | Examples of masked values |
---|---|---|
newuser@domain.com | "aaaaaa" | rxvsas |
admin@company.com | "aaaaaa" | bbwpba |
newuser@domain.com | "a9aaa9" | r8daw1 |
Input values | Examples of masked values |
---|---|
190049418437621 | 2590459222147 22 |
271083561478941 | 1900846274448 17 |
190049418437621 | 2730364078284 70 |
117029 | 1750694861914 69 |
Consistent data masking
When the same value appears twice in the input data, consistent masking functions output the same masked value in the same Job execution.
However, two different input values can be masked with the same value in the output.
Consistent data masking examples
Input values | Extra Parameter | Examples of masked values |
---|---|---|
newuser@domain.com | "talend,value,newcompany" | newuser@newcompany.com |
admin@company.com | "talend,value,newcompany" | admin@value.com |
newuser@domain.com | "talend,value,newcompany" | newuser@newcompany.com |
user@company.com | "talend,value,newcompany" | user@value.com |
user@domain.com | "talend,value,newcompany" | user@newcompany.com |
Bijective data masking
- They are consistent masking functions.
- They are injective, meaning that they output two different masked values for two different input values.
- They check that the input data is in a valid format. If the input value is valid, bijective masking functions output a valid value. If the input value is not valid, they output an invalid value or replace values with null, depending of the masking function used.
Bijective data masking examples
Input values | Example of masked values |
---|---|
190049418437621 | 289052428331901 |
271083561478941 | 234112758889352 |
190049418437621 | 289052428331901 |
117029 | null |
Repeatable data masking
To produce repeatable masked values between Job executions, define a seed or a password in the Advanced settings of the component.
For a given combination of input and seed values, the same masked value is produced.
When using Format-Preserving Encryption methods, the same masked value is produced for a given combination of an input value and a password.