Creating prediction data
Once you have deployed and approved your model, you can create predictions on new data.
So far, we have used existing customer data to create a machine learning model. The goal with this process has been to recognize patterns through machine learning, with the goal of applying them to new data.
There is also a use case for only analyzing the SHAP values in the training data. This information is also highly valuable in its own right.
For more information about working with predictions in Qlik AutoML, see Creating predictions on datasets.
The first step in the process is to choose a dataset to which you will be applying your model to generate predictions. After this, you configure your desired properties for the output and generate the files.
Do the following:
In the catalog, open your new ML deployment.
-
In the bottom right corner of the ML deployment interface, click Create prediction.
-
Click to open the Prediction configuration pane, if not already open.
-
Next, you need to select your apply dataset. Under Apply data, click Select apply dataset.
-
Select the apply dataset file. This will be either of the following, depending on whether you are working with CSV or QVD:
-
AutoML Tutorial - Churn data - apply.csv
-
AutoML Tutorial - Churn data - apply.qvd
-
-
Compare the schemas of the training and apply datasets. Because you deselected features during training, your apply dataset will have more fields than the training dataset. This is not any cause for concern. When generating predictions, AutoML simply ignores fields from the apply dataset which do not exist in the training dataset.
-
In the Prediction configuration pane, click Name prediction dataset. Type a name for the dataset (or set it to the default name) and select a format. The default is Parquet, but for this tutorial, we use CSV.
-
Select a space and click Confirm.
-
Under Prediction options, you configure the datasets you want to generate with this prediction. In this case, confirm that Coordinate SHAP is the only selected option. Allow AccountID to be used as the index column.
Click Save and predict now. The process can be monitored in Dataset predictions.
You can review the output in the catalog by clicking Open in the popup that appears.
When the prediction has finished running, you can start building an app using the new datasets. Move to the next topic.