Creating and configuring the experiment
The first step is to create and configure the experiment. You will use the training dataset you uploaded earlier to train the model until it is ready to be deployed for making predictions.
Creating a new experiment
Do the following:
- In the Qlik Cloud Analytics hub, click Add new, and then select New ML experiment.
- Enter a name for your experiment, for example, Customer churn tutorial.
- Optionally, add a description and tags.
- Choose a space for your experiment. It can be your personal space or a shared space.
- Click Create.
- Select the Customer churn data - training.csv file.
Reviewing the data
Before you start configuring your experiment, let's have a look at the dataset.
We start out in the schema view. Here we can see a table where each row represents a column in your dataset. Statistics and insights have been generated during automatic data preparation. You might have to scroll to the right-hand side of the schema to see the Insights.
We can see that AccountID has been excluded due to high cardinality. This means that the column contains too many unique values. The feature Country has been excluded for the opposite reason: the value is the same for all rows. These two features would not provide any value to the machine learning models.
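To make the two exclusion reasons concrete, here is a minimal Python sketch of how such columns could be detected. It is not Qlik's implementation: the function name flag_uninformative_columns, the 0.9 ratio threshold, and the sample rows are assumptions made up for illustration. Only the column names come from the tutorial dataset.

```python
def flag_uninformative_columns(rows, high_cardinality_ratio=0.9):
    """Return {column: reason} for columns that carry little signal.

    rows is a list of dicts, one per record. The threshold is a
    hypothetical value, not the one used by automatic data preparation.
    """
    flagged = {}
    if not rows:
        return flagged
    for column in rows[0]:
        unique_values = {row[column] for row in rows}
        if len(unique_values) == 1:
            # Same value on every row: nothing to learn from.
            flagged[column] = "constant"
        elif len(unique_values) / len(rows) >= high_cardinality_ratio:
            # Nearly every row has its own value, like an ID.
            flagged[column] = "high cardinality"
    return flagged


# Made-up sample rows that mimic the shape of the tutorial dataset.
sample_rows = [
    {"AccountID": "A001", "Country": "USA", "Territory": "East", "Churned": "no"},
    {"AccountID": "A002", "Country": "USA", "Territory": "West", "Churned": "yes"},
    {"AccountID": "A003", "Country": "USA", "Territory": "East", "Churned": "no"},
    {"AccountID": "A004", "Country": "USA", "Territory": "West", "Churned": "no"},
]

flagged = flag_uninformative_columns(sample_rows)
print(flagged)
```

In this sample, AccountID is flagged because every row has a different value, and Country is flagged because every row has the same value. Territory and Churned pass both checks.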
We can also see that the categorical feature Territory has been impact encoded. Hover over the warning and information icons for more information.
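Impact encoding, also known as target encoding, replaces each category with a number derived from the target. The sketch below shows the simplest form, the mean target value per category. It is an illustration under assumptions: the function name impact_encode and the sample values are invented, the target is assumed to be coded as 1 (churned) or 0 (not churned), and the exact method used by automatic data preparation may differ, for example by adding smoothing.

```python
from collections import defaultdict


def impact_encode(categories, targets):
    """Replace each category with the mean target value for that category.

    Returns the encoded column and the category-to-mean mapping.
    This is a simplified sketch without smoothing or cross-validation.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for category, target in zip(categories, targets):
        totals[category] += target
        counts[category] += 1
    means = {category: totals[category] / counts[category] for category in totals}
    return [means[category] for category in categories], means


# Made-up values: Territory per customer and whether the customer churned.
territories = ["East", "West", "East", "North", "West", "East"]
churned = [1, 0, 0, 1, 1, 0]

encoded, mapping = impact_encode(territories, churned)
print(mapping)
print(encoded)
```

Each Territory value becomes the share of churned customers seen for that territory, which gives the model a numeric feature instead of a text label.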
Click the data view icon to change to the data view. Here we can see more information about each column, including sample data.
Selecting a target
We want our machine learning model to predict customer churn, so we select Churned, the final column in the dataset, as our target.
Do the following:
- Click the schema view icon to switch back to schema view.
- Hover over Churned and click the target icon that appears.
On the Experiment configuration panel, we can now see that Churned has been selected. We can also see which features are automatically selected and excluded. Since Churned is the target, it will not be used as a feature. We can also see that this experiment will be treated as a binary classification problem.
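The problem type follows from the target column: a target with exactly two distinct values, such as Churned, leads to binary classification. The following is a rough sketch of that kind of rule. The function infer_problem_type, its handling of the other cases, and the sample values are hypothetical simplifications, not the product's actual logic.

```python
def infer_problem_type(target_values):
    """Guess the problem type from the distinct values of the target.

    Hypothetical rule set for illustration only:
    - exactly two distinct values -> binary classification
    - more than two numeric values -> regression
    - anything else -> multiclass classification
    """
    distinct = set(target_values)
    if len(distinct) < 2:
        raise ValueError("The target needs at least two distinct values.")
    if len(distinct) == 2:
        return "binary classification"
    if all(isinstance(v, (int, float)) and not isinstance(v, bool) for v in distinct):
        return "regression"
    return "multiclass classification"


# Made-up target values in the style of the Churned column.
problem_type = infer_problem_type(["yes", "no", "no", "yes"])
print(problem_type)
```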
Selecting features
For this first run of our experiment, we will include all features and algorithms that have been selected by default. However, if your business knowledge tells you that certain features have no influence on the target, you could deselect them at this point to exclude them from the training.
Training the experiment
The configuration is done and we are ready to start the training.
Do the following:
- In the bottom right corner of the experiment window, click Run experiment.
When the experiment has finished running, we can move on to the next step, which is to review the resulting model metrics.