Skip to main content Skip to complementary content

tAddressRowCloud

Verifies and formats international addresses in the Cloud by using online services.

tAddressRowCloud enables you to parse address data and get formatted addresses quickly, accurately and without installing any software.

Address data is corrected against the latest online reference data from several providers including Loqate, MelissaData, Google or QAS. tAddressRowCloud proposes alternatives for missing address data such as country or postal code, and addresses are enriched with other elements such as latitude longitude.

For further information about the terms of services of Google Places API, see Terms of Service.

This component is not shipped with your Talend Studio by default. You need to install it using the Feature Manager. For more information, see Installing features using the Feature Manager.

tAddressRowCloud Standard properties

These properties are used to configure tAddressRowCloud running in the Standard Job framework.

The Standard tAddressRowCloud component belongs to the Data Quality family.

This component is available in Talend Data Management Platform, Talend Big Data Platform, Talend Real-Time Big Data Platform, Talend Data Services Platform and in Talend Data Fabric.

Basic settings

Properties Description

Schema

  • A schema is a row description. It defines the number of fields (columns) to be processed and passed on to the next component. When you create a Spark Job, avoid the reserved word line when naming the fields.

  • Built-In: You create and store the schema locally for this component only.

  • Repository: You have already created the schema and stored it in the Repository. You can reuse it in various projects and Job designs.

Edit Schema

Click the [...] button and define the input and output schema of the address data.

The output schema of tAddressRowCloud proposes several read-only address columns including a VerificationLevel column which provides you with a verification status of the processed address. The verification levels in this column are defined by Talend . For further information, see Address verification levels in tAddressRowCloud.

As the output schema depends on the address provider, visit its website for further information.

Also some of the output columns could be empty depending on what address provider you select in the component basic settings when executing the Job.

Address Provider

Select from the list the provider of the reference data against which you want to validate and format input addresses.

The list of address providers includes Google, Loqate, QAS and MelissaData.

License/API key

Enter the license or the API key provided by the address provider you select from the list. You must visit the provider website, register and get the license/API key.

To enter a key, click the […] button next to the field.

When you select Google as a provider, the component uses the Google Places API. You must generate the key from the Google Developer Console at https://developers.google.com/console/help/new/ and set the key in this field.

Processing Mode

This option is applied only to the Loqate provider.

Select from the list the mode of address validation you want to have:
  • Verify and Geocode (selected by default): with this mode, the component standardizes and corrects addresses and enriches them with latitude and longitude information.

    You must define the address and country columns in the input schema and add them to the Mapping table.

  • Verify only: with this mode, the component standardizes and corrects addresses without enriching them with latitude and longitude information.

    You must define the address column in the input schema and add it to the Mapping table.

Information noteNote: These processing modes cost extra credits. For further information, see the Loqate website.

Country

This option is applied only to the QAS provider.

Select from the list the country corresponding to your input addresses.

When you select QAS as a provider, the component uses the QAS Pro OnDemand service. For further information about Experian address verification, see the product sheet at https://www.edq.com/globalassets/product-sheets/address-verification.pdf.

QAS OnDemand username

This option is applied only to the QAS provider.

Enter the username you can find in the license provided by QAS.

You can check your username from the QAS OnDemand portal.

Password

This option is applied only to the QAS provider.

Enter the password you can find in the license provided by QAS.

You can check your password from the QAS OnDemand portal.

Use security mode to connect

Select this check box to connect to the Cloud in a secure mode. This may have a slight impact on performance.

This check box is not available with all address providers.

Mapping

Address field: add lines to the table and select from a predefined address list the fields that will hold input addresses.

The address list includes the following columns for all address providers: Address, PostalCode, Locality, AdministrativeArea and Country.

Input Column: add lines to the table and select from the list the columns that hold addresses from input schema. The input schema can have one or multiple columns and can have columns that do not represent address data.

Use Additional Output

This option is not available for the QAS provider.

Select this check box and use the Output Mapping table to add more address columns to the output schema:

Address field: add lines to the table and select from a predefined address list the fields of the extra information you want to output.

These predefined address fields vary according to the provider you select from the Address Provider list. For further information about the additional address fields, check the provider website.

Output Column: select from the list the columns that will hold the additional addresses information. You must first add these additional columns to the tAddressRowCloud output schema through the Edit Schema button.

tAddressRowCloud maps the values of the address fields to the output columns in the Output Column.

If you select to have an output column in the Output Address table that has the exact name of an input column, the input column value will be overwritten by the value given by the component.

Die on error

Select the check box to stop the execution of the Job when an error occurs.

Clear the check box to skip any rows on error and complete the process for error-free rows. When errors are skipped, you can collect the rows on error using a Row > Reject link.

Advanced settings

Properties Description

tStat Catcher Statistics

Select this check box to collect log data at the component level.

Global Variables

Variables Description

Global Variables

ERROR_MESSAGE: the error message generated by the component when an error occurs. This is an After variable and it returns a string. This variable functions only if the Die on error check box is cleared, if the component has this check box.

A Flow variable functions during the execution of a component while an After variable functions after the execution of the component.

To fill up a field or expression with a variable, press Ctrl+Space to access the variable list and choose the variable to use from it.

For more information about variables, see Using contexts and variables.

Usage

Usage guidance Description

Usage rule

This component is usually used as an intermediate component, and it requires an input component and an output component.

This component enables you to create a data flow, using a Row > Main link, and to create a reject flow with a Row > Reject link filtering the data in error.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – please let us know!