Talend Cloud Data Preparation Examples
The use cases described in this document show several way of using Talend Cloud Data Preparation to perform formatting or cleansing operations to your data.
- The Customers dataset is used as source for the Standardize preparation. It shows how you can standardize state names, delete invalid records, consolidate phone numbers, change their format as well as date formats, or use a masking function to protect data. Finally, the Magic Fill function is used to convert dates to the corresponding day of the week.
- The Build_email preparation uses the Marketing Leads and the Emails Reference datasets and joins them in order to recreate emails addresses from customers names, company names, and their corresponding domain addresses.
In addition to these preparations that are available directly in the application, you can download additional datasets and use them to complete the following examples:
- Based on the
CRM_export.csv
dataset, build a preparation to consolidate in a new column all the mobile
phones or landline phone numbers of your customers to make sure you have at least a
working number for each.
See Consolidating a list of phone numbers coming from a CRM solution.
- The
marketing_leads.zip
and
emails_reference.zip
datasets can be used together in the same preparation via a lookup operation to
join the information about companies domain addresses and recreate entire email
addresses.
See Recreating email addresses before uploading them to a marketing solution.
- With the
HRMS_export.xlsx
dataset, learn how to change date formats and extract specific strings of
characters from any column.
See Cleansing data coming from a human resource management system.
- The
customer_contact_data.csv
dataset can be improved by cleaning the empty and invalid datasets, and removing
unnecessary blank spaces in text records.
See Preparing client data to upload it to a marketing solution.
- With the
video_customers.xlsx
dataset, follow a scenario to combine filters and create "if" conditions to
isolate the data on a specific customers group.
See Using filters to create "if" conditions on customer data.
- The
car_dealership.xlsx
dataset is used as base for a preparation illustrating how to use regular
expression to match data that cannot be selected with usual filters.
See Using regular expressions and filters to create "or" conditions on customer data.