Descriptive Statistics

Help center Get started Knowledge center Product documentation API documentation

List of Sections

Overview

The Descriptive Statistics node summarizes the columns of a table with descriptive statistics. For each selected variable, one or more statistics can be computed, and the result is presented as a table with one column per variable-and-statistic combination.

The node sits under the Statistics tab and, unlike the statistical-plot nodes, produces a tabular summary rather than a chart. Configuration has three parts: a Target Variable, the variables to summarize, and a per-variable statistic selection.

Note: “Workflow” is the in-UI term for what some Weave documentation calls a pipeline. This reference uses “Workflow.”

When to use it

Profiling columns with summary statistics before deeper analysis.
Checking data quality through counts of nulls, negatives, and outliers.
Comparing the distribution shape of columns via percentiles.
Producing a compact statistics table for selected variables.

Configuration components

Component	Description
Select Target Variable	Dropdown that selects the target variable of the dataset.
Descriptive Statistics (Select)	Opens the popup listing the columns to summarize.
Variable checkboxes	Choose the variables to compute statistics for.
Select Aggregation Function	Per-variable dropdown to choose one or more statistics; selections appear as removable tags.
Apply	Commits the variable and statistic selections.

Available statistics

Statistic	Description
Mean	The average of the values.
Count	The number of values.
Standard Deviation	The standard deviation of the values.
Kurtosis	The kurtosis of the distribution.
Skewness	The skew (asymmetry) of the distribution.
25th Percentile	The first-quartile value.
50th Percentile	The median value.
75th Percentile	The third-quartile value.
Minimum	The smallest value.
Maximum	The largest value.
Data Type	The data type of the column.
Negative Value count	The number of negative values.
Null Value Count	The number of null values.
Outlier Count	The number of outlier values.

Output

Output element	Description
Result columns	One column per variable-and-statistic pair, named variable_statistic (e.g. SEPALWIDTHCM_Mean).
Result row	The computed values for the selected statistics.

Key behaviors

Tabular output. Unlike the statistical-plot nodes under the Statistics tab, Descriptive Statistics returns a summary table rather than a chart.

Per-variable statistics. Each selected variable carries its own set of statistics, so different columns can be summarized differently in one node.

Column naming. Each result column is named by combining the variable and the statistic, so the table can hold many statistics across many variables without ambiguity.