Class TextFileInputMeta
java.lang.Object
org.pentaho.di.trans.step.BaseStepMeta
org.pentaho.di.trans.steps.file.BaseFileMeta
org.pentaho.di.trans.steps.file.BaseFileInputMeta<BaseFileInputAdditionalField,BaseFileInputFiles,BaseFileField>
org.pentaho.di.trans.steps.fileinput.text.TextFileInputMeta
- All Implemented Interfaces:
Cloneable,StepAttributesInterface,StepMetaInterface,CsvInputAwareMeta
public class TextFileInputMeta
extends BaseFileInputMeta<BaseFileInputAdditionalField,BaseFileInputFiles,BaseFileField>
implements StepMetaInterface, CsvInputAwareMeta
-
Nested Class Summary
Nested Classes -
Field Summary
FieldsModifier and TypeFieldDescriptionThe name of the field that will contain the number of errors in the rowThe name of the field that will contain the names of the fields that generated errors, separated by ,booleanIf error line are skipped, you can replay without introducing doubles.The name of the field that will contain the error texts, separated by CRstatic final intstatic final intstatic final intstatic final intstatic final intFields inherited from class org.pentaho.di.trans.steps.file.BaseFileInputMeta
additionalOutputFields, errorHandling, inputFields, inputFiles, NO, RequiredFilesCode, RequiredFilesDesc, YESFields inherited from class org.pentaho.di.trans.steps.file.BaseFileMeta
servletOutputFields inherited from class org.pentaho.di.trans.step.BaseStepMeta
attributes, databases, log, loggingObject, parentStepMeta, repository, STEP_ATTRIBUTES_FILE -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionvoidallocate(int nrfiles, int nrfields, int nrfilters) voidallocateFiles(int nrFiles) voidcheck(List<org.pentaho.di.core.CheckResultInterface> remarks, TransMeta transMeta, StepMeta stepMeta, org.pentaho.di.core.row.RowMetaInterface prev, String[] input, String[] output, org.pentaho.di.core.row.RowMetaInterface info, org.pentaho.di.core.variables.VariableSpace space, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore) Checks the settings of this step and puts the findings in a remarks List.clone()Make an exact copy of this step, make sure to explicitly copy Collections etc.exportResources(org.pentaho.di.core.variables.VariableSpace space, Map<String, ResourceDefinition> definitions, ResourceNamingInterface resourceNamingInterface, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore) Since the exported transformation that runs this will reside in a ZIP file, we can't reference files relatively.Required for the Data Lineage.Required for the Data Lineage.Required for the Data Lineage.voidgetFields(org.pentaho.di.core.row.RowMetaInterface inputRowMeta, String name, org.pentaho.di.core.row.RowMetaInterface[] info, StepMeta nextStep, org.pentaho.di.core.variables.VariableSpace space) Deprecated.voidgetFields(org.pentaho.di.core.row.RowMetaInterface row, String name, org.pentaho.di.core.row.RowMetaInterface[] info, StepMeta nextStep, org.pentaho.di.core.variables.VariableSpace space, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore) Gets the fields.intString[]String[]getFilePaths(org.pentaho.di.core.bowl.Bowl bowl, org.pentaho.di.core.variables.VariableSpace space) intorg.apache.commons.vfs2.FileObjectgetHeaderFileObject(TransMeta transMeta) Returns aFileObjectthat corresponds to the first encountered input file.String[]getStep(StepMeta stepMeta, StepDataInterface stepDataInterface, int cnr, TransMeta transMeta, Trans trans) Get the executing step, needed by Trans to launch a step.Get a new instance of the appropriate data class.org.pentaho.di.core.fileinput.FileInputListgetTextFileList(org.pentaho.di.core.bowl.Bowl bowl, org.pentaho.di.core.variables.VariableSpace space) getXML()Produces the XML string that describes this step's information.booleanbooleanRequired for the Data Lineage.booleanprotected StringloadSource(Node filenode, Node filenamenode, int i, org.pentaho.metastore.api.IMetaStore metaStore) protected StringloadSourceRep(Repository rep, org.pentaho.di.repository.ObjectId id_step, int i, org.pentaho.metastore.api.IMetaStore metaStore) voidloadXML(Node stepnode, List<org.pentaho.di.core.database.DatabaseMeta> databases, org.pentaho.metastore.api.IMetaStore metaStore) Load the values for this step from an XML NodevoidreadRep(Repository rep, org.pentaho.metastore.api.IMetaStore metaStore, org.pentaho.di.repository.ObjectId id_step, List<org.pentaho.di.core.database.DatabaseMeta> databases) Read the steps information from a Kettle repositoryvoidsaveRep(Repository rep, org.pentaho.metastore.api.IMetaStore metaStore, org.pentaho.di.repository.ObjectId id_transformation, org.pentaho.di.repository.ObjectId id_step) Save the steps data into a Kettle repositoryprotected voidsaveSource(StringBuilder retVal, String source) protected voidsaveSourceRep(Repository rep, org.pentaho.di.repository.ObjectId id_transformation, org.pentaho.di.repository.ObjectId id_step, int i, String fileName) voidsearchInfoAndTargetSteps(List<StepMeta> steps) Change step names into step objects to allow them to be name-changed etc.voidsetAcceptingStep(StepMeta acceptingStep) voidSet default valuesvoidsetErrorCountField(String errorCountField) voidsetErrorFieldsField(String errorFieldsField) voidsetErrorLineSkipped(boolean errorLineSkipped) voidsetErrorTextField(String errorTextField) voidsetFileName(String[] fileName) voidsetFileNameForTest(String[] fileName) voidsetFilter(TextFileFilter[] filter) voidbooleanMethods inherited from class org.pentaho.di.trans.steps.file.BaseFileInputMeta
getFileInputList, getFileInputList, getFilePaths, getInputFields, getRequiredFilesCode, getResourceDependencies, inputFiles_fileRequired, inputFiles_includeSubFolders, inputFiles_includeSubFoldersMethods inherited from class org.pentaho.di.trans.steps.file.BaseFileMeta
passDataToServletOutput, writesToFileMethods inherited from class org.pentaho.di.trans.step.BaseStepMeta
analyseImpact, analyseImpact, cancelQueries, check, check, createEntry, excludeFromCopyDistributeVerification, excludeFromRowLayoutVerification, exportResources, extractStepMetadataEntries, findAttribute, findParent, findParentEntry, getActiveReferencedObjectDescription, getDescription, getDialogClassName, getLog, getLogChannelId, getName, getObjectCopy, getObjectId, getObjectRevision, getObjectType, getOptionalStreams, getParent, getParentStepMeta, getReferencedObjectDescriptions, getRepCode, getRepositoryDirectory, getRequiredFields, getRequiredFields, getSQLStatements, getSQLStatements, getStepInjectionMetadataEntries, getStepIOMeta, getStepIOMeta, getStepMetaInjectionInterface, getSupportedTransformationTypes, getTableFields, getTooltip, getUsedArguments, getUsedDatabaseConnections, getUsedLibraries, getXmlCode, handleStreamSelection, hasChanged, hasRepositoryReferences, isBasic, isDebug, isDetailed, isReferencedObjectEnabled, isRowLevel, loadReferencedObject, loadReferencedObject, loadStepAttributes, loadXML, loadXML, logBasic, logBasic, logDebug, logDebug, logDetailed, logDetailed, logError, logError, logError, logMinimal, logMinimal, logRowlevel, logRowlevel, lookupRepositoryReferences, readRep, readRep, resetStepIoMeta, saveRep, setChanged, setChanged, setParentStepMeta, setStepIOMetaMethods inherited from class java.lang.Object
equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitMethods inherited from interface org.pentaho.di.trans.step.StepMetaInterface
analyseImpact, analyseImpact, cancelQueries, check, cleanAfterHopFromRemove, cleanAfterHopFromRemove, cleanAfterHopToRemove, excludeFromCopyDistributeVerification, excludeFromRowLayoutVerification, exportResources, extractStepMetadataEntries, fetchTransMeta, getActiveReferencedObjectDescription, getDialogClassName, getOptionalStreams, getParentStepMeta, getReferencedObjectDescriptions, getRequiredFields, getResourceDependencies, getSQLStatements, getSQLStatements, getStepIOMeta, getStepMetaInjectionInterface, getSupportedTransformationTypes, getTableFields, getUsedArguments, getUsedDatabaseConnections, getUsedLibraries, handleStreamSelection, hasChanged, hasRepositoryReferences, isReferencedObjectEnabled, loadReferencedObject, loadXML, lookupRepositoryReferences, passDataToServletOutput, readRep, resetStepIoMeta, saveRep, setChanged, setParentStepMeta
-
Field Details
-
FILE_FORMAT_DOS
public static final int FILE_FORMAT_DOS- See Also:
-
FILE_FORMAT_UNIX
public static final int FILE_FORMAT_UNIX- See Also:
-
FILE_FORMAT_MIXED
public static final int FILE_FORMAT_MIXED- See Also:
-
FILE_TYPE_CSV
public static final int FILE_TYPE_CSV- See Also:
-
FILE_TYPE_FIXED
public static final int FILE_TYPE_FIXED- See Also:
-
-
errorCountField
The name of the field that will contain the number of errors in the row -
errorFieldsField
The name of the field that will contain the names of the fields that generated errors, separated by , -
errorTextField
The name of the field that will contain the error texts, separated by CR -
errorLineSkipped
public boolean errorLineSkippedIf error line are skipped, you can replay without introducing doubles.
-
-
Constructor Details
-
TextFileInputMeta
public TextFileInputMeta()
-
-
Method Details
-
getFileName
- Returns:
- Returns the fileName.
-
setFileName
- Parameters:
fileName- The fileName to set.
-
getFilter
- Returns:
- The array of filters for the metadata of this text file input step.
-
setFilter
- Parameters:
filter- The array of filters to use
-
loadXML
public void loadXML(Node stepnode, List<org.pentaho.di.core.database.DatabaseMeta> databases, org.pentaho.metastore.api.IMetaStore metaStore) throws org.pentaho.di.core.exception.KettleXMLException Description copied from interface:StepMetaInterfaceLoad the values for this step from an XML Node- Specified by:
loadXMLin interfaceStepMetaInterface- Overrides:
loadXMLin classBaseStepMeta- Parameters:
stepnode- the Node to get the info fromdatabases- The available list of databases to reference tometaStore- the metastore to optionally load external reference metadata from- Throws:
org.pentaho.di.core.exception.KettleXMLException- When an unexpected XML error occurred. (malformed etc.)
-
clone
Description copied from interface:StepMetaInterfaceMake an exact copy of this step, make sure to explicitly copy Collections etc.- Specified by:
clonein interfaceStepMetaInterface- Overrides:
clonein classBaseFileInputMeta<BaseFileInputAdditionalField,BaseFileInputFiles, BaseFileField> - Returns:
- an exact copy of this step
-
allocate
public void allocate(int nrfiles, int nrfields, int nrfilters) -
allocateFiles
public void allocateFiles(int nrFiles) -
setDefault
public void setDefault()Description copied from interface:StepMetaInterfaceSet default values- Specified by:
setDefaultin interfaceStepMetaInterface
-
getFields
public void getFields(org.pentaho.di.core.row.RowMetaInterface row, String name, org.pentaho.di.core.row.RowMetaInterface[] info, StepMeta nextStep, org.pentaho.di.core.variables.VariableSpace space, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore) throws org.pentaho.di.core.exception.KettleStepException Description copied from class:BaseStepMetaGets the fields.- Specified by:
getFieldsin interfaceStepMetaInterface- Overrides:
getFieldsin classBaseStepMeta- Parameters:
row- the input row meta that is modified in this method to reflect the output row metadata of the stepname- Name of the step to use as input for the origin field in the valuesinfo- Fields used as extra lookup informationnextStep- the next step that is targetedspace- the space The variable space to use to replace variablesrepository- the repository to use to load Kettle metadata objects impacting the output fieldsmetaStore- the MetaStore to use to load additional external data or metadata impacting the output fields- Throws:
org.pentaho.di.core.exception.KettleStepException- the kettle step exception
-
getFields
@Deprecated public void getFields(org.pentaho.di.core.row.RowMetaInterface inputRowMeta, String name, org.pentaho.di.core.row.RowMetaInterface[] info, StepMeta nextStep, org.pentaho.di.core.variables.VariableSpace space) throws org.pentaho.di.core.exception.KettleStepException Deprecated.Description copied from class:BaseStepMetaGets the fields.- Specified by:
getFieldsin interfaceStepMetaInterface- Overrides:
getFieldsin classBaseStepMeta- Parameters:
inputRowMeta- the input row meta that is modified in this method to reflect the output row metadata of the stepname- Name of the step to use as input for the origin field in the valuesinfo- Fields used as extra lookup informationnextStep- the next step that is targetedspace- the space The variable space to use to replace variables- Throws:
org.pentaho.di.core.exception.KettleStepException- the kettle step exception
-
getXML
Description copied from class:BaseStepMetaProduces the XML string that describes this step's information.- Specified by:
getXMLin interfaceStepMetaInterface- Overrides:
getXMLin classBaseStepMeta- Returns:
- String containing the XML describing this step.
-
getLookupStepname
-
searchInfoAndTargetSteps
Description copied from class:BaseStepMetaChange step names into step objects to allow them to be name-changed etc.- Specified by:
searchInfoAndTargetStepsin interfaceStepMetaInterface- Overrides:
searchInfoAndTargetStepsin classBaseStepMeta- Parameters:
steps- optionally search the info step in a list of steps
-
getInfoSteps
-
readRep
public void readRep(Repository rep, org.pentaho.metastore.api.IMetaStore metaStore, org.pentaho.di.repository.ObjectId id_step, List<org.pentaho.di.core.database.DatabaseMeta> databases) throws org.pentaho.di.core.exception.KettleException Description copied from interface:StepMetaInterfaceRead the steps information from a Kettle repository- Specified by:
readRepin interfaceStepMetaInterface- Overrides:
readRepin classBaseStepMeta- Parameters:
rep- The repository to read frommetaStore- The MetaStore to read external information fromid_step- The step IDdatabases- The databases to reference- Throws:
org.pentaho.di.core.exception.KettleException- When an unexpected error occurred (database, network, etc)
-
saveRep
public void saveRep(Repository rep, org.pentaho.metastore.api.IMetaStore metaStore, org.pentaho.di.repository.ObjectId id_transformation, org.pentaho.di.repository.ObjectId id_step) throws org.pentaho.di.core.exception.KettleException Description copied from interface:StepMetaInterfaceSave the steps data into a Kettle repository- Specified by:
saveRepin interfaceStepMetaInterface- Overrides:
saveRepin classBaseStepMeta- Parameters:
rep- The Kettle repository to save tometaStore- the metaStore to optionally write toid_transformation- The transformation IDid_step- The step ID- Throws:
org.pentaho.di.core.exception.KettleException- When an unexpected error occurred (database, network, etc)
-
check
public void check(List<org.pentaho.di.core.CheckResultInterface> remarks, TransMeta transMeta, StepMeta stepMeta, org.pentaho.di.core.row.RowMetaInterface prev, String[] input, String[] output, org.pentaho.di.core.row.RowMetaInterface info, org.pentaho.di.core.variables.VariableSpace space, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore) Description copied from interface:StepMetaInterfaceChecks the settings of this step and puts the findings in a remarks List.- Specified by:
checkin interfaceStepMetaInterface- Overrides:
checkin classBaseStepMeta- Parameters:
remarks- The list to put the remarks in @see org.pentaho.di.core.CheckResultstepMeta- The stepMeta to help checkingprev- The fields coming from the previous stepinput- The input step namesoutput- The output step namesinfo- The fields that are used as information by the stepspace- the variable space to resolve variable expressions withrepository- the repository to use to load Kettle metadata objects impacting the output fieldsmetaStore- the MetaStore to use to load additional external data or metadata impacting the output fields
-
getStep
public StepInterface getStep(StepMeta stepMeta, StepDataInterface stepDataInterface, int cnr, TransMeta transMeta, Trans trans) Description copied from interface:StepMetaInterfaceGet the executing step, needed by Trans to launch a step.- Specified by:
getStepin interfaceStepMetaInterface- Parameters:
stepMeta- The step infostepDataInterface- the step data interface linked to this step. Here the step can store temporary data, database connections, etc.cnr- The copy nr to gettransMeta- The transformation infotrans- The launching transformation
-
getStepData
Description copied from interface:StepMetaInterfaceGet a new instance of the appropriate data class. This data class implements the StepDataInterface. It basically contains the persisting data that needs to live on, even if a worker thread is terminated.- Specified by:
getStepDatain interfaceStepMetaInterface- Returns:
- The appropriate StepDataInterface class.
-
getErrorCountField
-
setErrorCountField
-
getErrorFieldsField
-
setErrorFieldsField
-
getErrorTextField
-
setErrorTextField
-
getRequiredFilesDesc
-
isErrorLineSkipped
public boolean isErrorLineSkipped() -
setErrorLineSkipped
public void setErrorLineSkipped(boolean errorLineSkipped) -
setAcceptingStep
- Parameters:
acceptingStep- The acceptingStep to set.
-
getFileFormatTypeNr
public int getFileFormatTypeNr()- Specified by:
getFileFormatTypeNrin interfaceCsvInputAwareMeta
-
getFileTypeNr
public int getFileTypeNr() -
exportResources
public String exportResources(org.pentaho.di.core.variables.VariableSpace space, Map<String, ResourceDefinition> definitions, ResourceNamingInterface resourceNamingInterface, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore) throws org.pentaho.di.core.exception.KettleExceptionSince the exported transformation that runs this will reside in a ZIP file, we can't reference files relatively. So what this does is turn the name of files into absolute paths OR it simply includes the resource in the ZIP file. For now, we'll simply turn it into an absolute path and pray that the file is on a shared drive or something like that.- Specified by:
exportResourcesin interfaceStepMetaInterface- Overrides:
exportResourcesin classBaseStepMeta- Parameters:
space- the variable space to usedefinitions-resourceNamingInterface-repository- The repository to optionally load other resources from (to be converted to XML)metaStore- the metaStore in which non-kettle metadata could reside.- Returns:
- the filename of the exported resource
- Throws:
org.pentaho.di.core.exception.KettleException
-
supportsErrorHandling
public boolean supportsErrorHandling()- Specified by:
supportsErrorHandlingin interfaceStepMetaInterface- Overrides:
supportsErrorHandlingin classBaseStepMeta- Returns:
- true if this step supports error "reporting" on rows: the ability to send rows to a certain target step.
-
setFileNameForTest
-
loadSource
-
saveSource
-
loadSourceRep
protected String loadSourceRep(Repository rep, org.pentaho.di.repository.ObjectId id_step, int i, org.pentaho.metastore.api.IMetaStore metaStore) throws org.pentaho.di.core.exception.KettleException - Throws:
org.pentaho.di.core.exception.KettleException
-
saveSourceRep
protected void saveSourceRep(Repository rep, org.pentaho.di.repository.ObjectId id_transformation, org.pentaho.di.repository.ObjectId id_step, int i, String fileName) throws org.pentaho.di.core.exception.KettleException - Throws:
org.pentaho.di.core.exception.KettleException
-
getEncoding
- Specified by:
getEncodingin interfaceCsvInputAwareMeta- Specified by:
getEncodingin classBaseFileInputMeta<BaseFileInputAdditionalField,BaseFileInputFiles, BaseFileField>
-
getLength
- Returns:
- the length
-
setLength
- Parameters:
length- the length to set
-
isAcceptingFilenames
public boolean isAcceptingFilenames()Required for the Data Lineage.- Overrides:
isAcceptingFilenamesin classBaseFileInputMeta<BaseFileInputAdditionalField,BaseFileInputFiles, BaseFileField>
-
getAcceptingStepName
Required for the Data Lineage.- Overrides:
getAcceptingStepNamein classBaseFileInputMeta<BaseFileInputAdditionalField,BaseFileInputFiles, BaseFileField>
-
getAcceptingStep
Required for the Data Lineage. -
getAcceptingField
Required for the Data Lineage.- Overrides:
getAcceptingFieldin classBaseFileInputMeta<BaseFileInputAdditionalField,BaseFileInputFiles, BaseFileField>
-
getFilePaths
public String[] getFilePaths(org.pentaho.di.core.bowl.Bowl bowl, org.pentaho.di.core.variables.VariableSpace space) -
getTextFileList
public org.pentaho.di.core.fileinput.FileInputList getTextFileList(org.pentaho.di.core.bowl.Bowl bowl, org.pentaho.di.core.variables.VariableSpace space) -
hasHeader
public boolean hasHeader()- Specified by:
hasHeaderin interfaceCsvInputAwareMeta
-
getEscapeCharacter
- Specified by:
getEscapeCharacterin interfaceCsvInputAwareMeta
-
getDelimiter
- Specified by:
getDelimiterin interfaceCsvInputAwareMeta
-
getEnclosure
- Specified by:
getEnclosurein interfaceCsvInputAwareMeta
-
getHeaderFileObject
Description copied from interface:CsvInputAwareMetaReturns aFileObjectthat corresponds to the first encountered input file. This object is used to read the file headers for the purpose of field parsing.- Specified by:
getHeaderFileObjectin interfaceCsvInputAwareMeta- Parameters:
transMeta- theTransMeta- Returns:
- null if the
FileObjectcannot be created.
-