# Types of summarizations

The Tabulation Function menu in the Tabulation panel lists the available calculations, called summarizations.

You can choose from the following types of summarizations when performing a quick summary, tabulation, or cross tabulation.

Note: In the list below, [COLUMN] indicates either the column name or the column label of the selected column. By default, the column label is displayed. However, when the Show with column names option is selected in the Tabulation panel, the column name is displayed instead.
- **# rows**: The number of rows in each group. The number of rows includes those with N/A values.
- **# unique [COLUMN]**: The number of unique values in the Column for each group. If a group contains N/A values, those rows are ignored (i.e., the result is the number of unique non-N/A values). If a group contains only N/A values, the result for the group is 0.
- **Approx. # unique [COLUMN]**: The approximate number of unique values in the Column for each group. This can be much faster than # unique [COLUMN] for very large cardinality columns, but provides an approximate number that can be off by a few percent. Uses the HyperLogLog algorithm. If a group contains N/A values, those rows are ignored (i.e., the result is the number of unique non-N/A values). If a group contains only N/A values, the result for the group is 0. Note: While this function can be used for small tables, it is generally used to compute the approximate number of distinct values in a column with a large number of unique values, or across a table with a very large number of rows.
- **# valid [COLUMN]**: The number of non-N/A values in the Column for each group.
- **# invalid [COLUMN]**: The number of N/A values in the Column for each group.
- **# valid pairs [COLUMN] and**: The number of rows where both the Column and the Reference Column are not N/A, for each group.
- **# invalid pairs [COLUMN] and**: The number of rows where either the Column or the Reference Column is N/A, for each group.
- **Sum of [COLUMN]** (numeric columns only): The subtotal of the Column for each group. If a group contains N/A values, those rows are ignored (i.e., the result is the sum of the non-N/A values). If a group contains only N/A values, the result for the group is 0.
- **Weighted sum of [COLUMN] and** (numeric columns only): The dot product of the Column and the Reference Column, for each group. If, for a particular group, either the Column or the Reference Column contains N/A values, those rows are ignored (i.e., the result for the group is the dot product for those rows where neither the Column nor the Reference Column is N/A). If there are no such rows, the result for the group is 0.
- **Mean of [COLUMN]** (numeric columns only): The average of the Column for each group. If a group contains N/A values, those rows are ignored (i.e., the result is the average of the non-N/A values). If a group contains only N/A values, the result for the group is N/A.
- **Weighted mean of [COLUMN] and** (numeric columns only): The average of the Column, weighted by the Reference Column, for each group. If, for a particular group, either the Column or the Reference Column contains N/A values, those rows are ignored (i.e., the result for the group is the weighted average for those rows where neither the Column nor the Reference Column is N/A). If there are no such rows, the result for the group is N/A.
- **Std. Dev. of [COLUMN]** (numeric columns only): The standard deviation of the Column for each group. If a group contains N/A values, those rows are ignored (i.e., the result is the standard deviation of the non-N/A values). If a group contains only N/A values, the result for the group is N/A.
- **Weighted SD of [COLUMN] and** (numeric columns only): The standard deviation of the Column, weighted by the Reference Column, for each group. If, for a particular group, either the Column or the Reference Column contains N/A values, those rows are ignored (i.e., the result for the group is the weighted standard deviation for those rows where neither the Column nor the Reference Column is N/A). If there are no such rows, the result for the group is N/A.
- **Variance of [COLUMN]** (numeric columns only): The variance of the Column for each group. If a group contains N/A values, those rows are ignored (i.e., the result is the variance of the non-N/A values). If a group contains only N/A values, the result for the group is N/A.
- **Weighted var. of [COLUMN] and** (numeric columns only): The variance of the Column, weighted by the Reference Column, for each group. If, for a particular group, either the Column or the Reference Column contains N/A values, those rows are ignored (i.e., the result for the group is the weighted variance for those rows where neither the Column nor the Reference Column is N/A). If there are no such rows, the result for the group is N/A.
- **Covariance of [COLUMN] and** (numeric columns only): The covariance of the Column and the Reference Column, for each group. If, for a particular group, either the Column or the Reference Column contains N/A values, those rows are ignored (i.e., the result for the group is the covariance for those rows where neither the Column nor the Reference Column is N/A). If there are no such rows, the result for the group is N/A.
- **Correlation of [COLUMN] and** (numeric columns only): The correlation of the Column and the Reference Column, for each group. If, for a particular group, either the Column or the Reference Column contains N/A values, those rows are ignored (i.e., the result for the group is the correlation for those rows where neither the Column nor the Reference Column is N/A). If there aren't at least two such rows in the group, the result is N/A.
- **Sum of squares [COLUMN]** (numeric columns only): The sum of squares of the Column for each group. If a group contains N/A values, those rows are ignored (i.e., the result is the sum of squares of the non-N/A values). If a group contains only N/A values, the result for the group is 0.
- **Highest value of [COLUMN]** (numeric columns only): The highest number in the Column for each group. If a group contains N/A values, those rows are ignored (i.e., the result is the highest non-N/A value). If a group contains only N/A values, the result for the group is N/A.
- **Lowest value of [COLUMN]** (numeric columns only): The lowest number in the Column for each group. If a group contains N/A values, those rows are ignored (i.e., the result is the lowest non-N/A value). If a group contains only N/A values, the result for the group is N/A.
- **First value of [COLUMN]** (alphanumeric and numeric columns): The first value in the Column for each group. This is the first value based on the current sort order of the table (i.e., the way the data is shown when the table is viewed multiple rows at a time). If a group contains N/A values, those rows are ignored (i.e., the result is the first of the non-N/A values). If a group contains only N/A values, the result for the group is N/A.
- **Last value of [COLUMN]** (alphanumeric and numeric columns): The last value in the Column for each group. This is the last value based on the current sort order of the table (i.e., the way the data is shown when the table is viewed multiple rows at a time). If a group contains N/A values, those rows are ignored (i.e., the result is the last of the non-N/A values). If a group contains only N/A values, the result for the group is N/A.
- **Median of [COLUMN]** (numeric columns only): The median of the Column for each group. If a group contains N/A values, those rows are ignored (i.e., the result is the median of the non-N/A values). If there is an even number of values, the median is the average of the two middle values. If a group contains only N/A values, the result for the group is N/A.
- **Mode of [COLUMN]** (numeric columns only): The mode of the Column for each group. If a group contains N/A values, those rows are ignored (i.e., the result is the mode of the non-N/A values). If, for a given group, there is no unique mode, the result is arbitrary. If a group contains only N/A values, the result for the group is N/A.
- **F(mode) of [COLUMN]** (numeric columns only): The frequency of the mode of the Column for each group. If a group contains N/A values, those rows are ignored (i.e., the result is the frequency of the mode of the non-N/A values). If a group contains only N/A values, the result for the group is 0.
- **# modes of [COLUMN]** (numeric columns only): The number of modes in the Column for each group. If a group contains N/A values, those rows are ignored (i.e., the result is the number of modes within the non-N/A values). If a group contains only N/A values, the result for the group is 0.
- **Rank among all values of**: The rank of each group, in terms of its number of records, with respect to all other groups. The rank includes rows with N/A values.
- **Rank among column values of**: The rank of each group, in terms of its number of records, with respect to the other groups in the same column of the cross tabulation result. The rank includes rows with N/A values.
- **Rank among row values of**: The rank of each group, in terms of its number of records, with respect to the other groups on the same row of the cross tabulation result. The rank includes rows with N/A values.
- **Percent of grand total of** (numeric columns only): The subtotal of the Column for each group as a percentage of the grand total. If a group contains N/A values, those rows are ignored (i.e., the result is the sum of the non-N/A values in the group as a percentage of the sum of the non-N/A values in the entire column). If a group contains only N/A values, the result for the group is 0. Note: The result is a whole number, not a fractional value. For instance, a result of 20% would be represented as 20, not 0.20.
- **Percent of column total of** (numeric columns only): The subtotal of the Column for each group as a percentage of the total for that column of the cross tabulation result. If a group contains N/A values, those rows are ignored (i.e., the result is the sum of the non-N/A values in the group as a percentage of the sum of the non-N/A values in that column of the cross tabulation result). If a group contains only N/A values, the result for the group is 0. Note: The result is a whole number, not a fractional value. For instance, a result of 20% would be represented as 20, not 0.20.
- **Percent of row total of** (numeric columns only): The subtotal of the Column for each group as a percentage of the total for that row of the cross tabulation result. If a group contains N/A values, those rows are ignored (i.e., the result is the sum of the non-N/A values in the group as a percentage of the sum of the non-N/A values in that row of the cross tabulation result). If a group contains only N/A values, the result for the group is 0. Note: The result is a whole number, not a fractional value. For instance, a result of 20% would be represented as 20, not 0.20.
- **Fraction of grand total of** (numeric columns only): The subtotal of the Column for each group as a fraction of the grand total. If a group contains N/A values, those rows are ignored (i.e., the result is the sum of the non-N/A values in the group as a fraction of the sum of the non-N/A values in the entire column). If a group contains only N/A values, the result for the group is 0. Note: The result is a fractional value. For instance, a result of 20% would be represented as 0.20.
- **Fraction of column total of** (numeric columns only): The subtotal of the Column for each group as a fraction of the total for that column of the cross tabulation result. If a group contains N/A values, those rows are ignored (i.e., the result is the sum of the non-N/A values in the group as a fraction of the sum of the non-N/A values in that column of the cross tabulation result). If a group contains only N/A values, the result for the group is 0. Note: The result is a fractional value. For instance, a result of 20% would be represented as 0.20.
- **Fraction of row total of** (numeric columns only): The subtotal of the Column for each group as a fraction of the total for that row of the cross tabulation result. If a group contains N/A values, those rows are ignored (i.e., the result is the sum of the non-N/A values in the group as a fraction of the sum of the non-N/A values in that row of the cross tabulation result). If a group contains only N/A values, the result for the group is 0. Note: The result is a fractional value. For instance, a result of 20% would be represented as 0.20.
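
The N/A rules above differ by summarization: sums and counts fall back to 0 for a group with only N/A values, while averages and medians fall back to N/A. The following Python sketch illustrates those rules for a handful of summarizations. It is not the product's implementation. The `NA` sentinel, all function names, and the sample data are hypothetical, and the handling of a zero total weight or a zero grand total is an assumption, since the descriptions above do not cover those cases.

```python
from collections import Counter
from statistics import median

NA = None  # hypothetical stand-in for an N/A value


def valid(values):
    """Return the non-N/A values of one group."""
    return [v for v in values if v is not NA]


def valid_pairs(values, refs):
    """Return rows where neither the column nor the reference column is N/A."""
    return [(v, r) for v, r in zip(values, refs) if v is not NA and r is not NA]


def sum_of(values):
    # N/A rows are ignored; a group with only N/A values gives 0.
    return sum(valid(values))


def mean_of(values):
    # N/A rows are ignored; a group with only N/A values gives N/A.
    vals = valid(values)
    return sum(vals) / len(vals) if vals else NA


def weighted_sum_of(values, refs):
    # Dot product over the valid pairs; 0 when there are none.
    return sum(v * r for v, r in valid_pairs(values, refs))


def weighted_mean_of(values, refs):
    # Weighted average over the valid pairs; N/A when there are none.
    pairs = valid_pairs(values, refs)
    total_weight = sum(r for _, r in pairs)
    if not pairs or total_weight == 0:  # zero total weight: assumption of this sketch
        return NA
    return sum(v * r for v, r in pairs) / total_weight


def median_of(values):
    # With an even count, statistics.median averages the two middle values.
    vals = valid(values)
    return median(vals) if vals else NA


def mode_stats(values):
    """Return (F(mode), # modes) for one group; (0, 0) if it holds only N/A."""
    counts = Counter(valid(values))
    if not counts:
        return 0, 0
    top = max(counts.values())
    return top, sum(1 for c in counts.values() if c == top)


def share_of_grand_total(groups, scale=1):
    """Fraction (scale=1) or percent (scale=100) of the grand total per group."""
    totals = {name: sum_of(vals) for name, vals in groups.items()}
    grand = sum(totals.values())
    # A zero grand total is not covered by the text; returning N/A is an assumption.
    return {name: (scale * t / grand if grand else NA) for name, t in totals.items()}


groups = {"east": [10, NA, 30], "west": [NA, NA], "north": [60]}
print(sum_of(groups["west"]))             # 0
print(mean_of(groups["west"]))            # None (N/A)
print(share_of_grand_total(groups))       # {'east': 0.4, 'west': 0.0, 'north': 0.6}
print(share_of_grand_total(groups, 100))  # {'east': 40.0, 'west': 0.0, 'north': 60.0}
```

The last two lines show the only difference between the Percent of and Fraction of summarizations: the same share is reported as 40 in one case and 0.4 in the other.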