Class UnivariateStats

  • All Implemented Interfaces:
    org.pentaho.di.core.ExtensionDataInterface, HasLogChannelInterface, org.pentaho.di.core.logging.LoggingObjectInterface, org.pentaho.di.core.logging.LoggingObjectLifecycleInterface, org.pentaho.di.core.variables.VariableSpace, StepInterface

    public class UnivariateStats
    extends BaseStep
    implements StepInterface
    Calculate univariate statistics based on one column of the input data.

    Calculates N, mean, standard deviation, minimum, maximum, median and arbitrary percentiles. Percentiles can be calculated using interpolation or a simple method. See The Engineering Statistics Handbook for details.

    Version:
    1.0
    Author:
    Mark Hall (mhall{[at]}pentaho.org)
    • Constructor Detail

      • UnivariateStats

        public UnivariateStats​(StepMeta stepMeta,
                               StepDataInterface stepDataInterface,
                               int copyNr,
                               TransMeta transMeta,
                               Trans trans)
        Creates a new UnivariateStats instance.
        Parameters:
        stepMeta - holds the step's meta data
        stepDataInterface - holds the step's temporary data
        copyNr - the number assigned to the step
        transMeta - meta data for the transformation
        trans - a Trans value
    • Method Detail

      • processRow

        public boolean processRow​(StepMetaInterface smi,
                                  StepDataInterface sdi)
                           throws org.pentaho.di.core.exception.KettleException
        Process an incoming row of data.
        Specified by:
        processRow in interface StepInterface
        Overrides:
        processRow in class BaseStep
        Parameters:
        smi - a StepMetaInterface value
        sdi - a StepDataInterface value
        Returns:
        a boolean value
        Throws:
        org.pentaho.di.core.exception.KettleException - if an error occurs