Skip to main content Skip to complementary content

Finding customer companies based on leads

A pipeline with a source, a Field selector processor, two Split processors, and a destination.

Before you begin

  • You have previously added the dataset holding your source data.

    Download and extract the file: split-leads.zip. It contains a dataset with a list of customer leads including first names, last names, emails, addresses, etc.

  • You also have created the connection and the related dataset that will hold the processed data.

    Here, a file stored in a Test Connection.

Procedure

  1. Click Add pipeline on the Pipelines page. Your new pipeline opens.
  2. Click ADD SOURCE to open the panel allowing you to select your source data, here a list of customer leads entered manually as a test dataset.
  3. Select your dataset and click Select in order to add it to the pipeline.
    Rename it if needed.
  4. Click Plus and add a Field Selector processor to the pipeline. The configuration panel opens.
  5. Give a meaningful name to the processor.

    Example

    select main info
  6. In the Selectors area:
    1. Select .first_name in the Input list and enter firstname in the Output list, as you want to select and rename the first_name field.
    2. Click the + sign to add a new element and select .last_name in the Input list and enter lastname in the Output list, as you want to select and rename the last_name field.
    3. Click the + sign to add a new element and select .email in the Input list and enter email in the Output list, as you want to select the email field.
  7. Click Save to save your configuration.
    (Optional) Look at the preview of the processor to compare your data before and after the restructuring operation.
    Preview of the Field selector processor after restructuring customer records.
  8. Click Plus and add a Split processor to the pipeline. The configuration panel opens.
  9. Give a meaningful name to the processor.

    Example

    split emails
  10. Configure the processor:
    1. Select Extract email parts in the Function name list, as you want to split the local and the domain parts of the customers emails.
    2. Select .email in the Fields to process field.
    3. Click Save to save your configuration.
  11. Click Plus and add another Split processor to the pipeline. The configuration panel opens.
  12. Give a meaningful name to the processor.

    Example

    validate companies
  13. Configure the processor:
    1. Select Extract values by semantic type in the Function name list, as you want to validate the domain part of the customers emails against company semantic types.
    2. Select .email_domain in the Fields to process field.
    3. Select Company in the Semantic type list.
    4. Click Save to save your configuration.
  14. (Optional) Look at the preview of the Split processor to see your data after the extract operation.

    Example

    Preview of the Split processor after extracting email domains from email records.
  15. Click ADD DESTINATION and select the dataset that will hold your reorganized data.
    Rename it if needed.
  16. On the top toolbar of Talend Cloud Pipeline Designer, click the Run button to open the panel allowing you to select your run profile.
  17. Select your run profile in the list (for more information, see Run profiles), then click Run to run your pipeline.

Results

Your pipeline is being executed, the leads data has been processed, customer companies have been validated against company semantic types and the output flow is sent to the target system you have indicated.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – please let us know!