The 1010data Insights Platform offers a rich set of functions that may be used in value expressions when creating computed columns and in selection expressions when performing row selections.
Group functions, or g_functions, are used to perform operations, such as summarizations, on rows that have the same values in a set of given columns.
1010data provides a wide range of functions that return information about your data and act on it.
The time/date functions are vector functions that operate on columns containing time-related data and return specific information about the date/time for each row.
Provides an overview of the basic functionality of g_functions in 1010data.
All g_functions accept an optional S parameter, a selection column, which tells the g_function whether to include or exclude each row in its calculations.
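The role of the selection column can be sketched outside of 1010data. The Python below is only an illustration of the idea, not 1010data code: the helper name `group_avg` and the sample columns are hypothetical, and it assumes that a row with a zero selection value is left out of the calculation but still receives its group's result. Check the g_function reference for the platform's actual behavior.

```python
from collections import defaultdict

def group_avg(groups, select, values):
    """Per-row group average that ignores rows whose selection flag is falsy.

    Hypothetical helper: mirrors the G (group), S (selection), and X (value)
    roles described above, not the real g_avg implementation.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for g, s, x in zip(groups, select, values):
        if s:  # only selected rows contribute to the group's statistics
            totals[g] += x
            counts[g] += 1
    # Assumption: every row gets its group's result, or None if the group
    # has no selected rows.
    return [totals[g] / counts[g] if counts[g] else None for g in groups]

stores = ["A", "A", "A", "B", "B"]      # G: grouping column
include = [1, 1, 0, 1, 1]               # S: selection column
sales = [10.0, 20.0, 90.0, 5.0, 15.0]   # X: values to average

result = group_avg(stores, include, sales)
print(result)
```

Here the third row (90.0) is excluded by its selection value, so the average for store A is taken over 10.0 and 20.0 only.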
aucnt_estimate(M)
Returns the approximate number of unique values from an aucnt object. (Available as of version 10.30)
g_aucnt(G;S;X;Z)
Returns an aucnt object that can be used to compute the approximate unique count of values in a given column. (Available as of version 10.30)
g_aucnt_merge(G;S;M;N)
Returns an aucnt object that contains a signature of two or more merged aucnt objects. (Available as of version 10.30)
g_auroc(G;S;X;Y)
Returns a model object with four different representations of the Area Under the Receiver Operating Characteristic (AUROC) as well as the perfect model value. (Available as of version 10.42)
g_avg(G;S;X)
Returns the average of values within a given group.
g_cdf(G;S;O;X)
Returns the cumulative frequency distribution of values within a given group.
g_cor(G;S;X;Y)
Returns the correlation coefficient of two sets of values within a given group.
g_cov(G;S;X;Y)
Returns the covariance of two sets of values within a given group.
g_cumfreq(G;S;O;X)
Returns the cumulative frequency of values within a given group.
g_median(G;S;X)
Returns the median of values within a given group.
g_mode(G;S;X)
Returns the mode of values within a given group.
g_ntile(G;S;O;X;N)
Returns an integer representing the quantile interval (or "bucket") for each row that is a member of a given group based on the values in a particular column. Resultant buckets are of roughly equal size.
g_outlier(G;S;X;Z)
Returns a numeric value indicating outliers or extreme values. (Available as of version 10.44)
g_quantile(G;S;X;P)
Returns the quantile for values within a given group.
g_rank(G;S;O;X)
Returns the rank of every value within a given group.
g_rankavg(G;S;X)
Returns the rank of unique values within a given group, averaging the rank for repeated values.
g_rankskip(G;S;X)
Returns the rank of unique values within a given group, skipping the rank after repeated values.
g_rankuniq(G;S;X)
Returns the dense rank of unique values within a given group, giving repeated values the same rank.
g_std(G;S;X)
Returns the population standard deviation of values within a given group.
g_std_s(G;S;X)
Returns the sample standard deviation of values within a given group.
g_sumsqr(G;S;X)
Returns the sum of squares of the values within a given group.
g_var(G;S;X)
Returns the population variance of values within a given group.
g_var_s(G;S;X)
Returns the sample variance of values within a given group.
g_wstd(G;S;X;Y)
Returns the weighted standard deviation of values within a given group.
g_wvar(G;S;X;Y)
Returns the weighted variance of values within a given group.
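Several of the descriptions above differ only in how ties are ranked or in which divisor a dispersion measure uses. The Python sketch below illustrates those distinctions using the textbook definitions. It is not 1010data code: the helper names are hypothetical, ascending order is assumed, grouping and the S and O parameters are left out, and the exact tie and ordering behavior of g_rankskip, g_rankuniq, g_rankavg, g_var, g_var_s, g_std, and g_std_s should be confirmed in their reference pages.

```python
import statistics

def rank_skip(xs):
    """Competition rank: ties share a rank, and the following ranks are skipped."""
    return [1 + sum(y < x for y in xs) for x in xs]

def rank_dense(xs):
    """Dense rank: ties share a rank, and no ranks are skipped."""
    position = {v: i + 1 for i, v in enumerate(sorted(set(xs)))}
    return [position[x] for x in xs]

def rank_avg(xs):
    """Average rank: ties get the mean of the ranks they would occupy."""
    return [sum(y < x for y in xs) + (sum(y == x for y in xs) + 1) / 2 for x in xs]

scores = [10, 20, 20, 40]
skip = rank_skip(scores)    # ranks skip 3 after the tie at 2
dense = rank_dense(scores)  # ranks continue with 3 after the tie at 2
avg = rank_avg(scores)      # the tied values share the mean of ranks 2 and 3

sample = [2, 4, 4, 4, 5, 5, 7, 9]
pop_var = statistics.pvariance(sample)  # population variance: divide by n
samp_var = statistics.variance(sample)  # sample variance: divide by n - 1
pop_std = statistics.pstdev(sample)     # square root of the population variance
samp_std = statistics.stdev(sample)     # square root of the sample variance

print(skip, dense, avg)
print(pop_var, samp_var, pop_std, samp_std)
```

For the same data, the sample statistics are always at least as large as the population statistics because they divide by n - 1 instead of n.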
Math functions are vector functions that perform mathematical operations on one or more columns and return a column of results based on the operation.
Categorization functions are vector functions that organize data and determine logical groupings. These functions can be used for conditionalizing results and/or bucketing value ranges.
String functions are vector functions that manipulate vectors of strings to provide information about a string or substring, concatenate and split strings, or transform strings based on specified criteria.
List functions are scalar functions that return lists, subsets of lists, and combinations of lists, among other functionality. Lists, along with packages, are compound scalar data types that facilitate programmatic interaction with scalar data values and variables.
Data-handling functions are vector functions that return a hash value based on an input column or columns.
Row functions are vector functions that compute their results across the columns of each row rather than down a single column. Row inputs are defined as a space- or comma-separated list of column names.
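The idea of a row input can be sketched in Python as follows. This is an illustration only, not 1010data code: the helper `row_max`, the dictionary-of-lists table, and the column names are all hypothetical. It shows a space- or comma-separated list of column names being split and then used to compute one result per row across those columns.

```python
import re

def row_max(table, column_names):
    """Return the largest value in each row across the named columns.

    Hypothetical helper: `table` maps column names to equal-length lists,
    and `column_names` is a space- or comma-separated string.
    """
    names = [n for n in re.split(r"[,\s]+", column_names.strip()) if n]
    columns = [table[n] for n in names]
    # zip(*columns) walks the table one row at a time
    return [max(row) for row in zip(*columns)]

quarters = {"q1": [1, 5], "q2": [4, 2], "q3": [3, 9]}
result = row_max(quarters, "q1 q2,q3")
print(result)
```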
SQL compatibility functions are functions that treat null values as SQL nulls.
System functions are special functions in 1010data that return information about users, tables, and other system objects.
The object functions can be used to check the existence, type, and accessibility of objects such as folders, tables, and queries on the 1010data Insights Platform.
This category contains special functions that don't fit neatly into the other categories.