In this example, you are working in a e-commerce company. You noticed some orders have
not been shipped yet and you want to know why.
The purpose is to create two rules that will prevent any shipping delay. To do that, you
need to check that the country is correct and the Tax Identification Number (TIN) is
filled in.
Two data quality rules are used in this example:
One validates that: If the order status is In process,
then the country is not empty and is spelled correctly according to the country
semantic type.
The other one validates: If the customer is identified as a company, then the TIN
is filled in.
Here is a sample of the dataset:
Procedure
Log in as a rule manager.
In the left panel, click Data quality rules > Add rule.
Enter the name: Country value check.
Enter a description.
The description is optional. It helps you find a
rule when the rule names are similar.
In the If part, click
Add a row:
Select Variable and enter the name
order_status.
The supported characters are [a-z], [A-Z], [0-9] and special
characters: _.@$#.
Information noteNote: Data quality rules are templates. You will associate the variables with fields
when applying the rule to a dataset.
Select the operator is.
For more information on the operators, see the The operators.
Select Value and enter In
Process.
In the Then part, add two rows:
Select the logical operator And.
For the first row, select Variable and enter
country.
Select the operator is not empty.
For the second row, select Variable and enter
country.
Select the operator is of type and select the semantic
type Country.
The rule is defined as follows:
Click Save.
The first rule is created.
Following the previous steps, create the second rule named Customer Tax
ID check.