Box plot

The box plot (a.k.a. box-and-whisker diagram) is a standardized way of displaying the distribution of data based on the five-number summary: minimum, first quartile, median, third quartile, and maximum.
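
A minimal sketch of the five-number summary in plain JavaScript. It assumes linearly interpolated quartiles; the quartile method the viewer uses is not stated on this page, and `quantile` and `fiveNumberSummary` are illustrative names rather than part of the Datagrok API.

```javascript
// Linearly interpolated quantile of an ascending-sorted array (0 <= p <= 1).
// Assumption: this interpolation rule is one common convention among several.
function quantile(sorted, p) {
  const pos = (sorted.length - 1) * p;
  const lo = Math.floor(pos);
  const hi = Math.ceil(pos);
  return sorted[lo] + (sorted[hi] - sorted[lo]) * (pos - lo);
}

// Returns the five values a box plot is built from.
function fiveNumberSummary(values) {
  if (values.length === 0) throw new Error('values must not be empty');
  const sorted = [...values].sort((a, b) => a - b); // numeric sort on a copy
  return {
    min: sorted[0],
    q1: quantile(sorted, 0.25),
    median: quantile(sorted, 0.5),
    q3: quantile(sorted, 0.75),
    max: sorted[sorted.length - 1],
  };
}

console.log(fiveNumberSummary([7, 1, 3, 9, 5]));
// { min: 1, q1: 3, median: 5, q3: 7, max: 9 }
```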

Developers: To add the viewer from the console, use: grok.shell.tv.addViewer('Box plot');

General:

Right click: Context menu
Alt+F: Show in full screen

T-test

When the "category" column contains exactly two categories, press "T" to perform Welch's t-test. The resulting p-value will then be shown on top of the box plot.
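
For reference, Welch's t-test can be sketched in plain JavaScript as follows. This illustrates the statistic, not the viewer's own code: the function names are hypothetical, and the p-value is obtained by numerical integration (Simpson's rule) rather than a library routine.

```javascript
// Arithmetic mean of a non-empty array.
function mean(xs) {
  return xs.reduce((s, x) => s + x, 0) / xs.length;
}

// Unbiased sample variance (divides by n - 1).
function sampleVariance(xs) {
  const m = mean(xs);
  return xs.reduce((s, x) => s + (x - m) ** 2, 0) / (xs.length - 1);
}

// Composite Simpson's rule on [a, b]; n must be even.
function simpson(f, a, b, n = 2000) {
  const h = (b - a) / n;
  let sum = f(a) + f(b);
  for (let i = 1; i < n; i++) sum += f(a + i * h) * (i % 2 === 1 ? 4 : 2);
  return (sum * h) / 3;
}

// Two-sided p-value of Student's t distribution with df >= 1.
// Substituting t = sqrt(df) * tan(theta) turns the density into
// cos(theta)^(df - 1), so P(|T| <= |t|) is a ratio of two finite integrals.
function twoSidedPValue(t, df) {
  const f = (theta) => Math.pow(Math.max(0, Math.cos(theta)), df - 1);
  const upper = Math.atan(Math.abs(t) / Math.sqrt(df));
  return 1 - simpson(f, 0, upper) / simpson(f, 0, Math.PI / 2);
}

// Welch's t-test for two samples with possibly unequal variances.
// Note: the result is NaN if both samples have zero variance.
function welchTTest(a, b) {
  if (a.length < 2 || b.length < 2) {
    throw new Error('each sample needs at least two values');
  }
  const va = sampleVariance(a) / a.length;
  const vb = sampleVariance(b) / b.length;
  const t = (mean(a) - mean(b)) / Math.sqrt(va + vb);
  // Welch-Satterthwaite approximation of the degrees of freedom.
  const df = (va + vb) ** 2 /
    (va ** 2 / (a.length - 1) + vb ** 2 / (b.length - 1));
  return { t, df, pValue: twoSidedPValue(t, df) };
}

console.log(welchTTest([1, 2, 3, 4, 5], [2, 4, 6, 8, 10]));
```

The two arrays stand for the value column split by the two categories. Unlike Student's t-test, Welch's variant does not assume equal variances in the two groups, which is why the degrees of freedom are generally not an integer.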

Inverted whiskers

On certain datasets, whiskers extend into the box. This is not a rendering bug. The bottom of the box is at the first quartile (Q1), and the top is at the third quartile (Q3). The whiskers are the lines that extend from the top and bottom of the box to the adjacent values. The adjacent values are the lowest and highest observations that still lie inside the region defined by the following limits:

Lower Limit: Q1 - 1.5 * (Q3 - Q1)
Upper Limit: Q3 + 1.5 * (Q3 - Q1)

However, the upper adjacent value can be less than Q3, which forces the whisker to be drawn from Q3 into the box. The lower adjacent value can also be greater than Q1, which forces the whisker to be drawn from Q1 into the box.

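For instance, the following data produces an inverted whisker: [0, 41, 42, 70]. With linearly interpolated quartiles, Q1 = 30.75 and Q3 = 49, so the limits are 3.375 and 76.375. The value 0 falls below the lower limit, which makes 41 the lower adjacent value, and 41 is greater than Q1.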
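
The limits and adjacent values described above can be sketched in plain JavaScript. This is an illustration, not the viewer's implementation: it assumes linearly interpolated quartiles, and `quantile` and `whiskers` are hypothetical names. Under that assumption the sample data gives Q1 = 30.75 and Q3 = 49, the value 0 falls below the lower limit, and the lower adjacent value (41) lands above Q1. A different quartile convention can change which whisker, if any, is inverted.

```javascript
// Linearly interpolated quantile of an ascending-sorted array (0 <= p <= 1).
// Assumption: the viewer's actual quartile convention may differ.
function quantile(sorted, p) {
  const pos = (sorted.length - 1) * p;
  const lo = Math.floor(pos);
  const hi = Math.ceil(pos);
  return sorted[lo] + (sorted[hi] - sorted[lo]) * (pos - lo);
}

// Computes whisker limits, adjacent values, and whether a whisker is inverted.
function whiskers(values) {
  const sorted = [...values].sort((a, b) => a - b);
  const q1 = quantile(sorted, 0.25);
  const q3 = quantile(sorted, 0.75);
  const iqr = q3 - q1;
  const lowerLimit = q1 - 1.5 * iqr;
  const upperLimit = q3 + 1.5 * iqr;
  // Adjacent values: the extreme observations still inside the limits.
  const inside = sorted.filter((v) => v >= lowerLimit && v <= upperLimit);
  const lowerAdjacent = inside[0];
  const upperAdjacent = inside[inside.length - 1];
  return {
    q1, q3, lowerLimit, upperLimit, lowerAdjacent, upperAdjacent,
    lowerInverted: lowerAdjacent > q1, // whisker drawn from Q1 into the box
    upperInverted: upperAdjacent < q3, // whisker drawn from Q3 into the box
    outliers: sorted.filter((v) => v < lowerLimit || v > upperLimit),
  };
}

console.log(whiskers([0, 41, 42, 70]));
```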

Videos

Box Plot

Properties

| Property | Type | Description |
|---|---|---|
| **Data** | | |
| Category1 Column Name | string | If Category 2 is not defined, sets the marker color to match the currently selected column. |
| Category1 Map | string | Time unit map function for the Category 1 column (applicable to dates only). |
| Category2 Column Name | string | If defined, sets the marker color to match the currently selected column. |
| Category2 Map | string | Time unit map function for the Category 2 column (applicable to dates only). |
| Filter | string | Formula that determines which rows are shown. Examples: ${AGE} > 20 or ${WEIGHT} / 2 > 100, ${SEVERITY} == 'Medium', ${RACE}.endsWith('sian') |
| Table | string | |
| **Statistics** | | |
| Show Statistics | boolean | |
| Show Total Count | boolean | |
| Show Inliers Count | boolean | Shows the number of values inside the lower and upper bounds, where IQR = Q3 - Q1, Lower Bound = Q1 - (1.5 * IQR), Upper Bound = Q3 + (1.5 * IQR). |
| Show Outliers Count | boolean | Shows the number of values outside the lower and upper bounds, where IQR = Q3 - Q1, Lower Bound = Q1 - (1.5 * IQR), Upper Bound = Q3 + (1.5 * IQR). |
| Show Min | boolean | |
| Show Max | boolean | |
| Show Avg | boolean | |
| Show Med | boolean | |
| Show Stdev | boolean | |
| Show Q1 | boolean | |
| Show Q3 | boolean | |
| **Category** | | |
| Show Category Axis | boolean | |
| Show Category Selector | boolean | |
| Label Orientation | textorientation | |
| Show Minor Categories | boolean | Display subcategories (category combinations) in the x-axis table. |
| **Value** | | |
| Value Column Name | string | |
| Axis Type | string | |
| Value Min | number | |
| Value Max | number | |
| Invert Y Axis | boolean | |
| Show Value Axis | boolean | |
| Show Value Selector | boolean | |
| Show Empty Categories | boolean | Include plots that are empty or have null values. |
| **Color** | | |
| Bin Color Column Name | string | Column to color-code boxes (the Q1-Q3 region) or inner violin shapes. See also Bin Color Aggr Type. |
| Bin Color Aggr Type | string | Aggregation function for color-coding. See also Bin Color Column Name. |
| Show Color Selector | boolean | |
| Marker Color Column Name | string | Column to color-code markers. Changing Category 1 or Category 2 sets the color scheme to categorical (same as the selected category column). |
| Marker Color Map | string | Categorical coloring time unit map function (applicable to dates only). |
| Color Axis Type | string | |
| Invert Color Scheme | boolean | |
| Color Min | number | |
| Color Max | number | |
| **Marker** | | |
| Markers Column Name | string | |
| Markers Map | string | Marker category time unit map function (applicable to dates only). |
| Marker Min Size | number | |
| Marker Max Size | number | |
| Marker Type | string | |
| Marker Size | number | |
| Marker Opacity | number | |
| **Size** | | |
| Show Size Selector | boolean | |
| Marker Size Column Name | string | |
| **General** | | |
| Show Mean Cross | boolean | |
| Show Lower Dash | boolean | |
| Show Upper Dash | boolean | |
| Show Median Dash | boolean | |
| Show Values Limit | number | Points are not shown if the number of rows is greater than Show Values Limit. |
| Show Inside Values | boolean | Show points inside the interquartile range (between Q1 and Q3). |
| Show Outside Values | boolean | Show points outside the interquartile range (below Q1 or above Q3). |
| Show P Value | boolean | Show the p-value. Press T to toggle. Currently works only when there are two categories. Welch's t-test is used to calculate the p-value. |
| Show Mouse Over Point | boolean | |
| Show Mouse Over Row Group | boolean | |
| Statistics | list | |
| Whisker Color | number | |
| Violin Whisker Color | number | |
| Back Color | number | |
| Filtered Rows Color | number | |
| Filtered Out Rows Color | number | |
| Selected Rows Color | number | |
| Missing Value Color | number | |
| Default Box Color | number | |
| Legend Visibility | visibilitymode | |
| Legend Position | flexautoposition | |
| Row Source | string | Determines the rows shown on the plot. |
| Allow Dynamic Menus | boolean | |
| Show Context Menu | boolean | |
| Title | string | |
| Description | string | Viewer description that is shown at the Description Position. Markup is supported. |
| Help | string | Help shown when the user clicks the '?' icon at the top. Can be either markdown or a URL (starting with '/' or 'http'). |
| Description Position | flexposition | |
| Description Visibility Mode | visibilitymode | |
| **Style** | | |
| Auto Layout | boolean | |
| Show All Categories | boolean | When checked, categories from both the Category 1 and Category 2 columns are shown, ignoring axis size adjustments. |
| Plot Style | string | |
| Axis Font | string | |
| Category Font | string | |
| Statistics Font | string | |
| Whisker Line Width | number | |
| Interquartile Line Width | number | |
| Whisker Width Ratio | number | |
| Axis Use Column Format | boolean | |
| Bins | number | Number of KDE bins used to draw a violin plot. |
| Linear Color Scheme | list | |
| Categorical Color Scheme | list | |
| Controls Font | string | Font of the viewer's control elements. |
| **Tooltip** | | |
| Show Tooltip | string | Controls box plot tooltip visibility. |
| Show Labels | visibilitymode | |
| Row Tooltip | string | Newline-separated list of column names to be used in a tooltip. Requires showTooltip to be enabled. |
| **Description** | | |
| Show Title | boolean | |
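
The Row Tooltip description refers to the Show Tooltip property as showTooltip, which suggests that the properties listed above map to camelCase option keys. The sketch below assumes that convention holds for every property; `toOptionKey` and `buildOptions` are hypothetical helpers, not part of the Datagrok API, and the column names are taken from the Filter examples.

```javascript
// Converts a display name such as 'Value Column Name' to 'valueColumnName'.
// Assumption: option keys are the camelCase form of the display names.
function toOptionKey(displayName) {
  return displayName
    .trim()
    .split(/\s+/)
    .map((word, i) =>
      (i === 0 ? word.charAt(0).toLowerCase() : word.charAt(0).toUpperCase()) +
      word.slice(1))
    .join('');
}

// Builds an options object from { 'Display Name': value } pairs.
function buildOptions(settings) {
  const options = {};
  for (const [name, value] of Object.entries(settings)) {
    options[toOptionKey(name)] = value;
  }
  return options;
}

const options = buildOptions({
  'Value Column Name': 'AGE',
  'Category1 Column Name': 'RACE',
  'Show Statistics': true,
});
console.log(options);
// { valueColumnName: 'AGE', category1ColumnName: 'RACE', showStatistics: true }

// Assumption (not verified here): in the Datagrok console, an options object
// like this could be passed as a second argument when adding the viewer:
//   grok.shell.tv.addViewer('Box plot', options);
```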
