Introducing automated machine learning
Automate machine learning for your analytics team with Qlik AutoML. With the simple code-free interface, you can easily create machine learning experiments to generate models and make predictions.
What is machine learning
Machine learning is a branch of artificial intelligence and data science focused on recognizing patterns in historical data to predict future outcomes. Algorithms are trained on the data to build a predictive model without being explicitly programmed to do so. A machine learning model can help you answer questions like:
-
Will a gym customer cancel their membership?
-
What is the expected lifetime value for a given customer?
Read more about the basic concepts in Understanding machine learning. You will also learn about the structured framework for defining machine learning questions and preparing datasets.
What can you do with automated machine learning
Create automated machine learning experiments in Qlik Sense. In the Analytics activity center, you can collaborate on the experiment with other users and easily load data from Catalog to train the model.
Integrate your predictive analytics with Qlik Sense apps to share your findings. Explore further with visualizations and interactive what-if scenarios to understand how changing different parameters might affect your desired outcome.
You can make predictions for datasets stored in Catalog or make operational predictions in real-time using Qlik Sense APIs.
How do experiments work
An experiment trains machine learning algorithms on a particular dataset with a particular target. The training generates machine learning models, which you can use to make predictions.
Most of the process is automated in automated machine learning. When you create an experiment and load a dataset, the dataset is automatically analyzed and the data is preprocessed to make it ready for machine learning. Statistics and other information about each column are displayed to help you select a target. When you start the training, several algorithms start searching for patterns in the data. For more information about creating and training experiments, see Working with experiments.
When the training is finished, scores and ranks let you evaluate the generated machine learning models. By changing parameters and iterating the training, you can generate a number of versions. Choose the best-performing model for your dataset and deploy it to start making predictions. For more information, see Working with ML deployments.
As the following image shows, an experiment can have multiple versions, each using one or more algorithms. The model with the best performing algorithm can be deployed to make predictions. This means that one experiment can result in several ML deployments.
User access to AutoML
For full details about how users can access Qlik AutoML, see Who can work with Qlik AutoML.
AutoML limitations and license-governed capacities
AutoML is an additional paid-for capability. Some limited functionality is included in applicable subscriptions. The specific capabilities and capacity are dependent on your subscription tier.
Limitations
-
Qlik AutoML has an API rate limit of 300 requests per minute.
-
Maximum number of columns in the dataset: 500
This applies to the datasets used for training experiments and generating predictions. The limit is the number of columns used as features in an experiment version. More columns can be in the dataset and limits are calculated when columns are included in the dataset.
License-governed capacities
Your customer license determines the capacity of various metrics governing how you can use Qlik AutoML. Your usage metrics are measured as a combination of your use of the AutoML services through the AutoML user interface, key driver analysis in a Qlik Sense app, and through the public APIs.
There are multiple tiers of AutoML available, depending on your business needs. There are the following two types of tiers:
-
Included tier: This is the basic AutoML functionality with limited capabilities. It is included with a subscription to Qlik Sense Enterprise SaaS, Qlik Cloud Analytics Standard, Qlik Cloud Analytics Premium, Qlik Cloud Enterprise, or Qlik Talend Cloud(Standard, Premium or Enterprise). The included tier is suitable for trial purposes, and for evaluating how Qlik AutoML can help meet your business needs. It is not suitable for production use cases. For more comprehensive capabilities, consider a paid tier of Qlik AutoML.
-
Paid tiers: There are a number of packages providing the comprehensive AutoML capacities needed for production use cases. They are available as additional paid add-ons to a Qlik Cloud subscription.
The following features are only available in the paid tiers of Qlik AutoML:
-
Hyperparameter optimization
-
Scheduled predictions
-
Real-time predictions
-
Qlik AutoML connector
For more information about what is included in each type of AutoML tier, see the table below.
Metric | Description | Available in included tier | Available in paid tiers |
---|---|---|---|
Deployed models | The tier included in a Qlik Cloud subscription defines a maximum number of deployed models that can be created across all tenants created within the license. This consumption limit is defined per model, meaning that multiple ML deployments created from a single model count as a single deployed model. | Yes | Yes |
Concurrent training | This is the number of models a tenant can train in parallel. In the included tier of Qlik AutoML, each model will be run one after another. Paid tiers include capacity allowing your tenant to train multiple models simultaneously. | No | Yes |
Increased dataset size | Paid tiers provide for increased dataset sizes for training models. | No | Yes |
Manual batch predictions | Predict all rows in a dataset manually. | Yes | Yes |
Scheduled batch predictions | Configure your predictions to run on a schedule, rather than initiating them manually. Scheduled predictions are only available with paid tiers of Qlik AutoML. For more information, see Scheduling predictions. | No | Yes |
Real-time predictions | Employ this API to use your ML deployments to run predictions in real time. For more information, see Creating real-time predictions. | No | Yes |
Qlik AutoML connector in Qlik Cloud Analytics | With this analytics connector, you can load data from the integrated Qlik AutoML platform into Qlik Cloud. For more information, see Qlik AutoML analytics source. | No | Yes |
Hyperparameter optimization | Hyperparameter optimization allows you to fine-tune your AutoML models for increased control over the learning process. For more information, see Hyperparameter optimization. | No | Yes |
Deployed model monitoring | Use built-in monitoring tools to assess the models deployed into ML deployments. You can monitor feature drift over time, as well as details about the usage of the model for predictions. For more information, see Monitoring performance and usage of deployed models. | No | Yes |
For detailed information about your license metrics, see the Qlik Cloud® Subscriptions product description. Administrators can view license information and monitor the number of deployed models in the Administration activity center. For more information, see Monitoring resource consumption.