Skip to main content Skip to complementary content

Visualizing the prediction data in a Qlik Sense app

The final step in the process is to create a Qlik Sense analytics app to visualize the predictions data you have generated.

Alternative workflow: Upload pre-configured app

Rather than following the remaining steps in this topic, you can upload the Customer churn predictions.qvf app, which you downloaded at the start of this tutorial, into your hub.

The remaining sections on this page guide you through the creation of a similar app. There will be some differences between the prediction data in the app, and your own prediction data.

The downloadable app contains an additional sheet titled Individual Customer Analysis, which uses conditional expressions to encourage granular analysis of a single customer account at a time.

Getting started

From the hub, click Add new > New analytics app. Select a space for the app, and name it Customer churn predictions.

Click Create.

Information noteThe way you interact with your app is different if using a touch-screen device, or if touch support is turned on. For more information, see Long-touch menu on touch devices.

Creating the data model

Selecting your data

You need to add the necessary data into your app. This consists of the apply dataset (uploaded at the start of the tutorial) and the two datasets generated from the prediction.

  1. In the new app, click to add data from Data catalog.

  2. In Data catalog, click the checkboxes next to the following three datasets:

    • Apply datasetCustomer churn data - apply.csv

    • Prediction dataset: Customer churn data - apply_Prediction.csv

    • Coordinate SHAP dataset: Customer churn data - apply_Prediction_Coordinate_SHAP.csv

  3. Click Next.

  4. A summary panel appears. Click the Customer churn data - apply.csv dataset to expand the available fields to load.

  5. Click the Cross mark icon next to any features that were deselected during model training. This should be DaysSinceLastService, as well as any features you deselected due to low influence on the model.

  6. Click Load into app.

Associating the data

At this point, you have added the data to the app, but a connected data model is not constructed until you associate the datasets together. In this case, the AccountID field will be the index linking the datasets together.

In your app, Data manager is now opened. You will see the three datasets, but they are shown as separate entities that are not connected.

  1. In your app, Data manager is now opened. You will see the three datasets, but they are shown as separate entities that are not connected.

  2. In the right-hand pane of Data manager, find the AccountID association suggestion.

  3. Click Apply. The tables should now be connected together.

  4. Click Load data.

Checking your associations in Data model viewer

Before building any visualizations in the app, you should verify that the data has been associated properly.

  1. In the navigation bar at the top of the interface, click the dropdown for the Prepare tab and select Data model viewer.

  2. Check that the three datasets have been connected properly. The AccountID field should be acting as a key across all three datasets. The app's data model should look like the following image.

    Data model viewer showing the data model of the predictions app

    Checking that the data model of our newly created predictions app has been configured correctly. In this case, the three datasets should be connected using the AccountID field.

You are now ready to start building analytics content. Click Sheet within the Analyze tab in the navigation bar.

First sheet: Aggregated predictions dashboard

The first sheet will focus on analysis of the predictions in aggregated form.

  1. You should now be in sheet edit mode. Click Advanced options.

  2. Right-click any existing visualization in the sheet, and delete it. This is a chart suggestion feature that we will not be using for this tutorial.

  3. Give the sheet a title, for example, Aggregated Predictions. To do this, click anywhere in the blank sheet to open Sheet properties on the right-hand side of the interface. Under Title, type a name for the sheet.

Pie chart

A pie chart lets you easily visualize the proportional breakdown of your data. In this case, it makes sense to break down the data by the predicted outcomes (yes or no).

  1. Drag a pie chart to the sheet.

  2. Add Churned_predicted as a dimension.

  3. Add Count(AccountID) as a measure.

  4. If desired, turn off Dimension label under AppearancePresentation in the properties panel.

  5. Expand the dimension under  Data in the properties panel, and modify the Label for the dimension to Predicted to Churn.

    The label updates in the legend.

Filter pane

This dashboard will be for broad analysis of the entire data model. However, it will almost certainly be important to be able to quickly filter the data across various dimensions to analyze specific cohorts.

Create a filter pane with multiple listboxes. Each listbox allows selections of values within a single dimension.

  1. Drag a filter pane to the sheet.

  2. Add fields to the filter pane. These could be dimensions you expect to be significant to the predicted outcomes.

  3. Modify each listbox as need by clicking the field under Data in the properties panel. This opens the Listbox properties.

    For example, you might want to change the field label, and set Collapse listbox to Always to save sheet space.

Resize the filter pane so that it appears as a panel on one side of the sheet.

KPI objects

KPIs are a great way to display a single value for a particularly important analysis of your data. Here, we will create three KPI charts.

  1. Create a blank KPI chart. Add the following measure:

    =Count(Churned_predicted)

    This will calculate the total number of customers for whom we have generated churn predictions.

  2. Label the measure Total # of Customers.

  3. Set Number Formatting to Number and present the values in the 1,000 format (without decimals or percentage values).

  4. Create another blank KPI chart. Add the following measure:

    =Count({<Churned_predicted={"yes"}>}(Churned_predicted))

    This is a set expression calculating the number of times the value of yes was displayed in the Churned_predicted field.

  5. Label the measure Predicted to Churn.

  6. Configure the number formatting of the chart with the same settings as the first KPI.

  7. Create a third blank KPI chart. Add the following measure:

    =Count({<Churned_predicted={"no"}>}(Churned_predicted))

    This calculates the number of times the value of no was displayed in the Churned_predicted field.

  8. Label the measure Predicted to Stay.

Histogram

You can visualize the distribution of prediction probability values with a histogram.

Drag a histogram to the sheet and give it a title. Add Churned_yes as the dimension.

Stacked bar charts

To analyze predictions by a categorical field, such as plan type, you can create a bar chart with individual set analyses stacked on top of one another.

  1. Drag a bar chart onto the sheet and title it Churn Predictions by Plan Type.

  2. Add PlanType as a dimension. Label it Plan Type.

  3. Add the following measure:

    =Count({<Churned_predicted={"no"}>}(Churned_predicted))

    Label the measure Predicted to Stay.

  4. Add the following measure:

    =Count({<Churned_predicted={"yes"}>}(Churned_predicted))

    Label the measure Predicted to Churn.

  5. Under AppearancePresentation, change the layout from Grouped to Stacked.

  6. Remove the scrollbar and set Value labels to On.

  7. Configure Segment labels and Total labels to Auto.

When done, you can create a similar bar chart with the same two measures, but with a different categorical dimension, such as NumberOfPenalties.

Distribution plot to analyze individual numeric field values

A distribution plot can be used to present predictions data by a measure with diverse numeric values. In this section, you will create a chart to display churn predictions by base fee, with distinct groupings for the customer's service rating.

  1. Drag a distribution plot onto the sheet and title it Churn Predictions by Base Fee and Service Rating.

  2. Add AccountID as the dimension for the points.

  3. Add the following measure on the x-axis:

    =Avg(BaseFee)

  4. Add the following calculated dimension on the y-axis:

    =Round(ServiceRating,2)

    This creates groupings along the y-axis for approximate ServiceRating values. The higher the value, the higher the customer has rated the quality of their service.

  5. Under AppearancePresentation, reduce the Bubble size to 20 and turn on Jitter points.

  6. In Colors and legend, set Colors to Custom.

  7. Select the option to color the data by the following dimension:

    =Churned_predicted

    This will assign one color for each of the values in the Churned_predicted field. In this case, there will be two colors displayed.

  8. Add labels to the dimension and the measure, and add the following label to the custom color setting: Predicted to Churn.

Similar charts can be created for analyses of different numeric fields by substituting other field aggregations in the place of the BaseFee measure.

Configure the visualizations on the sheet to look similar to the image below.

Predictions analysis sheet

Predictions dashboard with chart objects for analyzing predictions data.

Second sheet: Aggregated SHAP dashboard

Next, create a sheet dedicated to analyzing the aggregated SHAP values.

  1. Create a new sheet and open advanced edit mode.

  2. Give the sheet a title (for example, Aggregated SHAP).

SHAP importance ranking

This type of chart is similar to the autogenerated one you might have seen when configuring your experiment versions. Here, we will make one for the predicted dataset.

The purpose of this type of chart is to show the features that contribute most to the outcome seen in the Churned column. We need to be aware that since we will use absolute values, the SHAP results could be either positive (value of yes) or negative (value of no). If you prefer, you can also use an expression that does not calculate the absolute value.

  1. Drag a bar chart onto the empty sheet.

  2. Title the chart SHAP Importance by Feature.

  3. Add automl_feature as a dimension, and label it Feature.

    This allows you to present the aggregated data across all included features.

  4. Add the following calculated measure:

    =fabs(Avg(SHAP_value))

  5. Under Label, type Average Absolute SHAP.

  6. In the properties panel, under Appearance > Presentation, remove the scrollbar if present, and set the chart orientation from Vertical to Horizontal.

  7. Under Sorting, drag the measure above the dimension if not done already. Leave the sorting of each field to Auto.

    This sorts the chart by measure values in a descending pattern.

Trellis container

You can make a Trellis container to filter data for specific field values within a dimension. In this case, we can break down the SHAP values for each of the four plan types offered to customers.

To create a Trellis object, you need to specify a dimension and a master visualization. We will convert the SHAP importance ranking that we just created into a master visualization.

  1. Right-click the Average Absolute SHAP chart in the sheet, and then select Linked Object Add to master items.
  2. Keep the default name and click Add.
  3. Under Custom objects > Qlik Visualization Bundle in the assets panel, drag a Trellis container onto the sheet.

  4. Add PlanType as a dimension.

  5. Click Add master visualization inside the chart, and then add the Average Absolute SHAP chart.

  6. Under AppearanceGeneral in the properties panel, turn on Show titles and title the chart SHAP Importance by Plan Type.

  7. Resize the Trellis container so that it is wide enough to display the names of features in the charts. If all labels do not display, adjust the zoom in the browser window.

Alternatively, you could display this data by creating individual charts with set expressions for each dimension value you are looking to analyze. In this case, you could create a separate chart for each of the four plan types.

Reusing the filter pane

The filter pane you created on the first sheet can be re-used on this sheet. In advanced edit mode, right-click the filter pane and copy it. Paste it into the Aggregated SHAP sheet.

Treemap

You can also visualize the breakdown of the SHAP values with a treemap. Drag a treemap onto the sheet and add the same dimension and measure that you used in the SHAP importance ranking.

Configure the visualizations on the sheet to look similar to the image below.

Aggregated SHAP sheet in analyze mode

Full sheet in consumption mode showing various SHAP-based visualizations.

Using the app

After you have finished building the app, it is ready to be used for data analysis.

Click Edit sheet to switch to analysis mode. In this view, you can make selections (in the filter panes or elsewhere in the sheet) to filter the data for specific analysis of subsets of the data. For example, you might want to analyze a specific customer plan type or region, and compare the findings with other subsets of data.

Thank you!

You have reached the end of this tutorial. We hope that you have learned a few things and realized that automated machine learning is a fast and easy way to generate predictive models. Qlik Sense is a powerful tool that makes it easy to visualize your predictions data and give meaningful insights into your data.

Did this page help you?

If you find any issues with this page or its content – a typo, a missing step, or a technical error – let us know how we can improve!