Box plot
The box plot is suitable for comparing range and distribution for groups of numerical data, illustrated by a box with whiskers, and a center line in the middle. The whiskers represent high and low reference values for excluding outlier values.
You can define the box start and end points, and whiskers ranges with a few different presets, or define your own settings using expressions.
- First whisker
- Box start
- Center line
- Box end
- Last whisker
When to use it
The box plot is suitable for comparing range and distribution for groups of numerical data.
Advantages: The box plot organizes large amounts of data, and visualizes outlier values.
Disadvantages: The box plot is not relevant for detailed analysis of the data as it deals with a summary of the data distribution.
By default, the Standard (Tukey) preset is used.
Changing the definition of the box plot
You can use one of the three presets, found under Box plot elements in the properties panel, to define your box plot.
-
Standard (Tukey)
This preset is based on the original box plot definition by J. Tukey. The center line represents the median (second quartile), and the box start and end points represent the first and third quartiles. Whisker length can be set to 1, 1.5 or 2 inter-quartile ranges. An inter-quartile range represents the difference between the first and third quartiles.
-
Percentile-based
This preset is also defined with the box start and end points representing the first and third quartiles, and the center line representing the median, but the whisker length is adjusted by setting a percentile based whisker position.
-
Standard deviation
This preset is based on standard deviations, with the center line representing the average value, and the box start and end points representing one standard deviation variance. You can set the whisker length to a multiple of standard deviations.
You can also define a custom box plot where you set the value of each box plot element using an expression.