GroupByBuilder
Fluid API for building an aggregation query against a DataFrame. Build a query by calling the following methods: key, pivot, count, uniqueCount, missingValueCount, valueCount, min, max, sum, avg, stdev, variance, q1, q2, q3.
When the query is constructed, execute it by calling aggregate, which will produce a DataFrame.
See samples: https://public.datagrok.ai/js/samples/data-frame/aggregation
Example
let avgAgesByRaceAndSex = demographicsTable
.groupBy(['race', 'sex'])
.avg('age')
.aggregate();
Constructors
new GroupByBuilder()
new GroupByBuilder(
dart):GroupByBuilder
Parameters
| Parameter | Type |
|---|---|
dart | any |
Returns
Source
Properties
| Property | Modifier | Type |
|---|---|---|
dart | private | any |
Methods
add()
add(
agg,colName?,resultColName?):GroupByBuilder
Performs the aggregation
Parameters
| Parameter | Type | Description |
|---|---|---|
agg | "values" | "max" | "key" | "min" | "sum" | "pivot" | "first" | "count" | "unique" | "nulls" | "med" | "avg" | "stdev" | "variance" | "skew" | "kurt" | "q1" | "q2" | "q3" | "#selected" | Aggregation type. |
colName? | null | string | Column name. |
resultColName? | null | string | Name of the resulting column. Default value is agg(colName). |
Returns
Source
aggregate()
aggregate(
options?):DataFrame
Performs the aggregation
Parameters
| Parameter | Type |
|---|---|
options? | object |
options.autoName? | boolean |
Returns
Source
avg()
avg(
srcColName,resultColName?):GroupByBuilder
Adds an aggregation that calculates average value for the specified column. Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
srcColName | string | undefined | column name in the source table |
resultColName? | null | string | null | column name in the resulting DataFrame |
Returns
Source
count()
count(
resultColName?):GroupByBuilder
Adds an aggregation that counts rows, including these will null values. See also count, valueCount, uniqueCount, missingValueCount Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
resultColName? | string | 'count' | column name in the resulting DataFrame |
Returns
Source
first()
first(
srcColName,resultColName?):GroupByBuilder
Adds an aggregation that takes first value for the specified column. Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
srcColName | string | undefined | column name in the source table |
resultColName? | null | string | null | column name in the resulting DataFrame |
Returns
Source
getGroups()
getGroups():
Map<string,DataFrame>
Gets groups of DataFrames
Returns
Map<string, DataFrame>
- where keys are stings in format 'columnName=value' and values are DataFrames
Source
key()
key(
srcColName,resultColName?):GroupByBuilder
Adds a key column to group values on. Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
srcColName | string | undefined | column name in the source table |
resultColName? | null | string | null | column name in the resulting DataFrame |
Returns
Source
max()
max(
srcColName,resultColName?):GroupByBuilder
Adds an aggregation that calculates maximum value for the specified column. Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
srcColName | string | undefined | column name in the source table |
resultColName? | null | string | null | column name in the resulting DataFrame |
Returns
Source
med()
med(
srcColName,resultColName?):GroupByBuilder
Adds an aggregation that calculates median value for the specified column. Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
srcColName | string | undefined | column name in the source table |
resultColName? | null | string | null | column name in the resulting DataFrame |
Returns
Source
min()
min(
srcColName,resultColName?):GroupByBuilder
Adds an aggregation that calculates minimum value for the specified column. Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
srcColName | string | undefined | column name in the source table |
resultColName? | null | string | null | column name in the resulting DataFrame |
Returns
Source
missingValueCount()
missingValueCount(
srcColName,resultColName?):GroupByBuilder
Adds an aggregation that counts number of missing values in the specified column. See also count, valueCount, uniqueCount, missingValueCount Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
srcColName | string | undefined | column name in the source table |
resultColName? | null | string | null | column name in the resulting DataFrame |
Returns
Source
pivot()
pivot(
srcColName,resultColName?):GroupByBuilder
Adds a column to pivot values on. Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
srcColName | string | undefined | column name in the source table |
resultColName? | null | string | null | column name in the resulting DataFrame |
Returns
Source
q1()
q1(
srcColName,resultColName?):GroupByBuilder
Adds an aggregation that calculates first quartile for the specified column. Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
srcColName | string | undefined | column name in the source table |
resultColName? | null | string | null | column name in the resulting DataFrame |
Returns
Source
q2()
q2(
srcColName,resultColName?):GroupByBuilder
Adds an aggregation that calculates second quartile for the specified column. Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
srcColName | string | undefined | column name in the source table |
resultColName? | null | string | null | column name in the resulting DataFrame |
Returns
Source
q3()
q3(
srcColName,resultColName?):GroupByBuilder
Adds an aggregation that calculates third quartile for the specified column. Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
srcColName | string | undefined | column name in the source table |
resultColName? | null | string | null | column name in the resulting DataFrame |
Returns
Source
stdev()
stdev(
srcColName,resultColName?):GroupByBuilder
Adds an aggregation that calculates standard deviation for the specified column. Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
srcColName | string | undefined | column name in the source table |
resultColName? | null | string | null | column name in the resulting DataFrame |
Returns
Source
sum()
sum(
srcColName,resultColName?):GroupByBuilder
Adds an aggregation that calculates sum of the values for the specified column. Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
srcColName | string | undefined | column name in the source table |
resultColName? | null | string | null | column name in the resulting DataFrame |
Returns
Source
toString()
toString():
string
Returns
string
Source
uniqueCount()
uniqueCount(
srcColName,resultColName?):GroupByBuilder
Adds an aggregation that counts number of unique values in the specified column. See also count, valueCount, missingValueCount Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
srcColName | string | undefined | column name in the source table |
resultColName? | null | string | null | column name in the resulting DataFrame |
Returns
Source
valueCount()
valueCount(
srcColName,resultColName?):GroupByBuilder
Adds an aggregation that counts rows, including these will null values. See also count, valueCount, uniqueCount, missingValueCount Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
srcColName | string | undefined | column name in the source table |
resultColName? | null | string | null | column name in the resulting DataFrame |
Returns
Source
variance()
variance(
srcColName,resultColName?):GroupByBuilder
Adds an aggregation that calculates varians for the specified column. Call aggregate when the query is constructed.
Parameters
| Parameter | Type | Default value | Description |
|---|---|---|---|
srcColName | string | undefined | column name in the source table |
resultColName? | null | string | null | column name in the resulting DataFrame |
Returns
Source
where()
where(
pattern):GroupByBuilder
Specifies the filter for the source rows.
Parameters
| Parameter | Type |
|---|---|
pattern | string | object |
Returns
Input
pattern
Source
whereRowMask()
whereRowMask(
bitset):GroupByBuilder
Parameters
| Parameter | Type | Description |
|---|---|---|
bitset | BitSet |