Class RssInput

  • All Implemented Interfaces:
    org.pentaho.di.core.ExtensionDataInterface, HasLogChannelInterface, org.pentaho.di.core.logging.LoggingObjectInterface, org.pentaho.di.core.logging.LoggingObjectLifecycleInterface, org.pentaho.di.core.variables.VariableSpace, StepInterface

    public class RssInput
    extends BaseStep
    implements StepInterface
    Read data from RSS and writes these to one or more output streams.

    When error handling is turned on:
    • The input row will be passed through if it is present
    • The "Nr of errors" field will always be 1 (we do not presently check for multiple errors)
    • The "Error descriptions" field contains a .toString of the caught exception (Usually contains useful info, such as the HTTP return code)
    • The "Error fields" contains the URL that caused the failure
    • The "Error code" field contains one of the following Strings:
      • UnknownError - Unexpected; Check the "Error description" field
      • XMLError - Typically the file is not XML; Could be non-xml HTML
      • FileNotFound - Can be caused by a HTTP/404
      • UnknownHost - Domain name cannot be resolved; May be caused by network outage
      • TransferError - Can be caused by any Server error code (401, 403, 500, 502, etc...)
      • BadURL - Url cannot be understood; May lack protocol (e.g.- http://) or use an unrecognized protocol
      • BadRSSFormat - Typically the file is valid XML, but is not RSS
    Notes: Turn on debug logging to see the full stack trace from a handled error.
    Since:
    13-10-2007
    Author:
    Samatar
    • Method Detail

      • processRow

        public boolean processRow​(StepMetaInterface smi,
                                  StepDataInterface sdi)
                           throws org.pentaho.di.core.exception.KettleException
        Description copied from interface: StepInterface
        Perform the equivalent of processing one row. Typically this means reading a row from input (getRow()) and passing a row to output (putRow)).
        Specified by:
        processRow in interface StepInterface
        Overrides:
        processRow in class BaseStep
        Parameters:
        smi - The steps metadata to work with
        sdi - The steps temporary working data to work with (database connections, result sets, caches, temporary variables, etc.)
        Returns:
        false if no more rows can be processed or an error occurred.
        Throws:
        org.pentaho.di.core.exception.KettleException