Class TextFileInputMeta
java.lang.Object
org.pentaho.di.trans.step.BaseStepMeta
org.pentaho.di.trans.steps.file.BaseFileMeta
org.pentaho.di.trans.steps.file.BaseFileInputMeta<BaseFileInputAdditionalField,BaseFileInputFiles,BaseFileField>
org.pentaho.di.trans.steps.fileinput.text.TextFileInputMeta
- All Implemented Interfaces:
Cloneable
,StepAttributesInterface
,StepMetaInterface
,CsvInputAwareMeta
public class TextFileInputMeta
extends BaseFileInputMeta<BaseFileInputAdditionalField,BaseFileInputFiles,BaseFileField>
implements StepMetaInterface, CsvInputAwareMeta
-
Nested Class Summary
Nested Classes -
Field Summary
FieldsModifier and TypeFieldDescriptionThe name of the field that will contain the number of errors in the rowThe name of the field that will contain the names of the fields that generated errors, separated by ,boolean
If error line are skipped, you can replay without introducing doubles.The name of the field that will contain the error texts, separated by CRstatic final int
static final int
static final int
static final int
static final int
Fields inherited from class org.pentaho.di.trans.steps.file.BaseFileInputMeta
additionalOutputFields, errorHandling, inputFields, inputFiles, NO, RequiredFilesCode, RequiredFilesDesc, YES
Fields inherited from class org.pentaho.di.trans.steps.file.BaseFileMeta
servletOutput
Fields inherited from class org.pentaho.di.trans.step.BaseStepMeta
attributes, databases, log, loggingObject, parentStepMeta, repository, STEP_ATTRIBUTES_FILE
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionvoid
allocate
(int nrfiles, int nrfields, int nrfilters) void
allocateFiles
(int nrFiles) void
check
(List<org.pentaho.di.core.CheckResultInterface> remarks, TransMeta transMeta, StepMeta stepMeta, org.pentaho.di.core.row.RowMetaInterface prev, String[] input, String[] output, org.pentaho.di.core.row.RowMetaInterface info, org.pentaho.di.core.variables.VariableSpace space, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore) Checks the settings of this step and puts the findings in a remarks List.clone()
Make an exact copy of this step, make sure to explicitly copy Collections etc.exportResources
(org.pentaho.di.core.variables.VariableSpace space, Map<String, ResourceDefinition> definitions, ResourceNamingInterface resourceNamingInterface, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore) Since the exported transformation that runs this will reside in a ZIP file, we can't reference files relatively.Required for the Data Lineage.Required for the Data Lineage.Required for the Data Lineage.void
getFields
(org.pentaho.di.core.row.RowMetaInterface inputRowMeta, String name, org.pentaho.di.core.row.RowMetaInterface[] info, StepMeta nextStep, org.pentaho.di.core.variables.VariableSpace space) Deprecated.void
getFields
(org.pentaho.di.core.row.RowMetaInterface row, String name, org.pentaho.di.core.row.RowMetaInterface[] info, StepMeta nextStep, org.pentaho.di.core.variables.VariableSpace space, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore) Gets the fields.int
String[]
String[]
getFilePaths
(org.pentaho.di.core.bowl.Bowl bowl, org.pentaho.di.core.variables.VariableSpace space) int
org.apache.commons.vfs2.FileObject
getHeaderFileObject
(TransMeta transMeta) Returns aFileObject
that corresponds to the first encountered input file.String[]
getStep
(StepMeta stepMeta, StepDataInterface stepDataInterface, int cnr, TransMeta transMeta, Trans trans) Get the executing step, needed by Trans to launch a step.Get a new instance of the appropriate data class.org.pentaho.di.core.fileinput.FileInputList
getTextFileList
(org.pentaho.di.core.bowl.Bowl bowl, org.pentaho.di.core.variables.VariableSpace space) getXML()
Produces the XML string that describes this step's information.boolean
boolean
Required for the Data Lineage.boolean
protected String
loadSource
(Node filenode, Node filenamenode, int i, org.pentaho.metastore.api.IMetaStore metaStore) protected String
loadSourceRep
(Repository rep, org.pentaho.di.repository.ObjectId id_step, int i, org.pentaho.metastore.api.IMetaStore metaStore) void
loadXML
(Node stepnode, List<org.pentaho.di.core.database.DatabaseMeta> databases, org.pentaho.metastore.api.IMetaStore metaStore) Load the values for this step from an XML Nodevoid
readRep
(Repository rep, org.pentaho.metastore.api.IMetaStore metaStore, org.pentaho.di.repository.ObjectId id_step, List<org.pentaho.di.core.database.DatabaseMeta> databases) Read the steps information from a Kettle repositoryvoid
saveRep
(Repository rep, org.pentaho.metastore.api.IMetaStore metaStore, org.pentaho.di.repository.ObjectId id_transformation, org.pentaho.di.repository.ObjectId id_step) Save the steps data into a Kettle repositoryprotected void
saveSource
(StringBuilder retVal, String source) protected void
saveSourceRep
(Repository rep, org.pentaho.di.repository.ObjectId id_transformation, org.pentaho.di.repository.ObjectId id_step, int i, String fileName) void
searchInfoAndTargetSteps
(List<StepMeta> steps) Change step names into step objects to allow them to be name-changed etc.void
setAcceptingStep
(StepMeta acceptingStep) void
Set default valuesvoid
setErrorCountField
(String errorCountField) void
setErrorFieldsField
(String errorFieldsField) void
setErrorLineSkipped
(boolean errorLineSkipped) void
setErrorTextField
(String errorTextField) void
setFileName
(String[] fileName) void
setFileNameForTest
(String[] fileName) void
setFilter
(TextFileFilter[] filter) void
boolean
Methods inherited from class org.pentaho.di.trans.steps.file.BaseFileInputMeta
getFileInputList, getFileInputList, getFilePaths, getInputFields, getRequiredFilesCode, getResourceDependencies, inputFiles_fileRequired, inputFiles_includeSubFolders, inputFiles_includeSubFolders
Methods inherited from class org.pentaho.di.trans.steps.file.BaseFileMeta
passDataToServletOutput, writesToFile
Methods inherited from class org.pentaho.di.trans.step.BaseStepMeta
analyseImpact, analyseImpact, cancelQueries, check, check, createEntry, excludeFromCopyDistributeVerification, excludeFromRowLayoutVerification, exportResources, extractStepMetadataEntries, findAttribute, findParent, findParentEntry, getActiveReferencedObjectDescription, getDescription, getDialogClassName, getLog, getLogChannelId, getName, getObjectCopy, getObjectId, getObjectRevision, getObjectType, getOptionalStreams, getParent, getParentStepMeta, getReferencedObjectDescriptions, getRepCode, getRepositoryDirectory, getRequiredFields, getRequiredFields, getSQLStatements, getSQLStatements, getStepInjectionMetadataEntries, getStepIOMeta, getStepIOMeta, getStepMetaInjectionInterface, getSupportedTransformationTypes, getTableFields, getTooltip, getUsedArguments, getUsedDatabaseConnections, getUsedLibraries, getXmlCode, handleStreamSelection, hasChanged, hasRepositoryReferences, isBasic, isDebug, isDetailed, isReferencedObjectEnabled, isRowLevel, loadReferencedObject, loadReferencedObject, loadStepAttributes, loadXML, loadXML, logBasic, logBasic, logDebug, logDebug, logDetailed, logDetailed, logError, logError, logError, logMinimal, logMinimal, logRowlevel, logRowlevel, lookupRepositoryReferences, readRep, readRep, resetStepIoMeta, saveRep, setChanged, setChanged, setParentStepMeta, setStepIOMeta
Methods inherited from class java.lang.Object
equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface org.pentaho.di.trans.step.StepMetaInterface
analyseImpact, analyseImpact, cancelQueries, check, cleanAfterHopFromRemove, cleanAfterHopFromRemove, cleanAfterHopToRemove, excludeFromCopyDistributeVerification, excludeFromRowLayoutVerification, exportResources, extractStepMetadataEntries, fetchTransMeta, getActiveReferencedObjectDescription, getDialogClassName, getOptionalStreams, getParentStepMeta, getReferencedObjectDescriptions, getRequiredFields, getResourceDependencies, getSQLStatements, getSQLStatements, getStepIOMeta, getStepMetaInjectionInterface, getSupportedTransformationTypes, getTableFields, getUsedArguments, getUsedDatabaseConnections, getUsedLibraries, handleStreamSelection, hasChanged, hasRepositoryReferences, isReferencedObjectEnabled, loadReferencedObject, loadXML, lookupRepositoryReferences, passDataToServletOutput, readRep, resetStepIoMeta, saveRep, setChanged, setParentStepMeta
-
Field Details
-
FILE_FORMAT_DOS
public static final int FILE_FORMAT_DOS- See Also:
-
FILE_FORMAT_UNIX
public static final int FILE_FORMAT_UNIX- See Also:
-
FILE_FORMAT_MIXED
public static final int FILE_FORMAT_MIXED- See Also:
-
FILE_TYPE_CSV
public static final int FILE_TYPE_CSV- See Also:
-
FILE_TYPE_FIXED
public static final int FILE_TYPE_FIXED- See Also:
-
-
errorCountField
The name of the field that will contain the number of errors in the row -
errorFieldsField
The name of the field that will contain the names of the fields that generated errors, separated by , -
errorTextField
The name of the field that will contain the error texts, separated by CR -
errorLineSkipped
public boolean errorLineSkippedIf error line are skipped, you can replay without introducing doubles.
-
-
Constructor Details
-
TextFileInputMeta
public TextFileInputMeta()
-
-
Method Details
-
getFileName
- Returns:
- Returns the fileName.
-
setFileName
- Parameters:
fileName
- The fileName to set.
-
getFilter
- Returns:
- The array of filters for the metadata of this text file input step.
-
setFilter
- Parameters:
filter
- The array of filters to use
-
loadXML
public void loadXML(Node stepnode, List<org.pentaho.di.core.database.DatabaseMeta> databases, org.pentaho.metastore.api.IMetaStore metaStore) throws org.pentaho.di.core.exception.KettleXMLException Description copied from interface:StepMetaInterface
Load the values for this step from an XML Node- Specified by:
loadXML
in interfaceStepMetaInterface
- Overrides:
loadXML
in classBaseStepMeta
- Parameters:
stepnode
- the Node to get the info fromdatabases
- The available list of databases to reference tometaStore
- the metastore to optionally load external reference metadata from- Throws:
org.pentaho.di.core.exception.KettleXMLException
- When an unexpected XML error occurred. (malformed etc.)
-
clone
Description copied from interface:StepMetaInterface
Make an exact copy of this step, make sure to explicitly copy Collections etc.- Specified by:
clone
in interfaceStepMetaInterface
- Overrides:
clone
in classBaseFileInputMeta<BaseFileInputAdditionalField,
BaseFileInputFiles, BaseFileField> - Returns:
- an exact copy of this step
-
allocate
public void allocate(int nrfiles, int nrfields, int nrfilters) -
allocateFiles
public void allocateFiles(int nrFiles) -
setDefault
public void setDefault()Description copied from interface:StepMetaInterface
Set default values- Specified by:
setDefault
in interfaceStepMetaInterface
-
getFields
public void getFields(org.pentaho.di.core.row.RowMetaInterface row, String name, org.pentaho.di.core.row.RowMetaInterface[] info, StepMeta nextStep, org.pentaho.di.core.variables.VariableSpace space, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore) throws org.pentaho.di.core.exception.KettleStepException Description copied from class:BaseStepMeta
Gets the fields.- Specified by:
getFields
in interfaceStepMetaInterface
- Overrides:
getFields
in classBaseStepMeta
- Parameters:
row
- the input row meta that is modified in this method to reflect the output row metadata of the stepname
- Name of the step to use as input for the origin field in the valuesinfo
- Fields used as extra lookup informationnextStep
- the next step that is targetedspace
- the space The variable space to use to replace variablesrepository
- the repository to use to load Kettle metadata objects impacting the output fieldsmetaStore
- the MetaStore to use to load additional external data or metadata impacting the output fields- Throws:
org.pentaho.di.core.exception.KettleStepException
- the kettle step exception
-
getFields
@Deprecated public void getFields(org.pentaho.di.core.row.RowMetaInterface inputRowMeta, String name, org.pentaho.di.core.row.RowMetaInterface[] info, StepMeta nextStep, org.pentaho.di.core.variables.VariableSpace space) throws org.pentaho.di.core.exception.KettleStepException Deprecated.Description copied from class:BaseStepMeta
Gets the fields.- Specified by:
getFields
in interfaceStepMetaInterface
- Overrides:
getFields
in classBaseStepMeta
- Parameters:
inputRowMeta
- the input row meta that is modified in this method to reflect the output row metadata of the stepname
- Name of the step to use as input for the origin field in the valuesinfo
- Fields used as extra lookup informationnextStep
- the next step that is targetedspace
- the space The variable space to use to replace variables- Throws:
org.pentaho.di.core.exception.KettleStepException
- the kettle step exception
-
getXML
Description copied from class:BaseStepMeta
Produces the XML string that describes this step's information.- Specified by:
getXML
in interfaceStepMetaInterface
- Overrides:
getXML
in classBaseStepMeta
- Returns:
- String containing the XML describing this step.
-
getLookupStepname
-
searchInfoAndTargetSteps
Description copied from class:BaseStepMeta
Change step names into step objects to allow them to be name-changed etc.- Specified by:
searchInfoAndTargetSteps
in interfaceStepMetaInterface
- Overrides:
searchInfoAndTargetSteps
in classBaseStepMeta
- Parameters:
steps
- optionally search the info step in a list of steps
-
getInfoSteps
-
readRep
public void readRep(Repository rep, org.pentaho.metastore.api.IMetaStore metaStore, org.pentaho.di.repository.ObjectId id_step, List<org.pentaho.di.core.database.DatabaseMeta> databases) throws org.pentaho.di.core.exception.KettleException Description copied from interface:StepMetaInterface
Read the steps information from a Kettle repository- Specified by:
readRep
in interfaceStepMetaInterface
- Overrides:
readRep
in classBaseStepMeta
- Parameters:
rep
- The repository to read frommetaStore
- The MetaStore to read external information fromid_step
- The step IDdatabases
- The databases to reference- Throws:
org.pentaho.di.core.exception.KettleException
- When an unexpected error occurred (database, network, etc)
-
saveRep
public void saveRep(Repository rep, org.pentaho.metastore.api.IMetaStore metaStore, org.pentaho.di.repository.ObjectId id_transformation, org.pentaho.di.repository.ObjectId id_step) throws org.pentaho.di.core.exception.KettleException Description copied from interface:StepMetaInterface
Save the steps data into a Kettle repository- Specified by:
saveRep
in interfaceStepMetaInterface
- Overrides:
saveRep
in classBaseStepMeta
- Parameters:
rep
- The Kettle repository to save tometaStore
- the metaStore to optionally write toid_transformation
- The transformation IDid_step
- The step ID- Throws:
org.pentaho.di.core.exception.KettleException
- When an unexpected error occurred (database, network, etc)
-
check
public void check(List<org.pentaho.di.core.CheckResultInterface> remarks, TransMeta transMeta, StepMeta stepMeta, org.pentaho.di.core.row.RowMetaInterface prev, String[] input, String[] output, org.pentaho.di.core.row.RowMetaInterface info, org.pentaho.di.core.variables.VariableSpace space, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore) Description copied from interface:StepMetaInterface
Checks the settings of this step and puts the findings in a remarks List.- Specified by:
check
in interfaceStepMetaInterface
- Overrides:
check
in classBaseStepMeta
- Parameters:
remarks
- The list to put the remarks in @see org.pentaho.di.core.CheckResultstepMeta
- The stepMeta to help checkingprev
- The fields coming from the previous stepinput
- The input step namesoutput
- The output step namesinfo
- The fields that are used as information by the stepspace
- the variable space to resolve variable expressions withrepository
- the repository to use to load Kettle metadata objects impacting the output fieldsmetaStore
- the MetaStore to use to load additional external data or metadata impacting the output fields
-
getStep
public StepInterface getStep(StepMeta stepMeta, StepDataInterface stepDataInterface, int cnr, TransMeta transMeta, Trans trans) Description copied from interface:StepMetaInterface
Get the executing step, needed by Trans to launch a step.- Specified by:
getStep
in interfaceStepMetaInterface
- Parameters:
stepMeta
- The step infostepDataInterface
- the step data interface linked to this step. Here the step can store temporary data, database connections, etc.cnr
- The copy nr to gettransMeta
- The transformation infotrans
- The launching transformation
-
getStepData
Description copied from interface:StepMetaInterface
Get a new instance of the appropriate data class. This data class implements the StepDataInterface. It basically contains the persisting data that needs to live on, even if a worker thread is terminated.- Specified by:
getStepData
in interfaceStepMetaInterface
- Returns:
- The appropriate StepDataInterface class.
-
getErrorCountField
-
setErrorCountField
-
getErrorFieldsField
-
setErrorFieldsField
-
getErrorTextField
-
setErrorTextField
-
getRequiredFilesDesc
-
isErrorLineSkipped
public boolean isErrorLineSkipped() -
setErrorLineSkipped
public void setErrorLineSkipped(boolean errorLineSkipped) -
setAcceptingStep
- Parameters:
acceptingStep
- The acceptingStep to set.
-
getFileFormatTypeNr
public int getFileFormatTypeNr()- Specified by:
getFileFormatTypeNr
in interfaceCsvInputAwareMeta
-
getFileTypeNr
public int getFileTypeNr() -
exportResources
public String exportResources(org.pentaho.di.core.variables.VariableSpace space, Map<String, ResourceDefinition> definitions, ResourceNamingInterface resourceNamingInterface, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore) throws org.pentaho.di.core.exception.KettleExceptionSince the exported transformation that runs this will reside in a ZIP file, we can't reference files relatively. So what this does is turn the name of files into absolute paths OR it simply includes the resource in the ZIP file. For now, we'll simply turn it into an absolute path and pray that the file is on a shared drive or something like that.- Specified by:
exportResources
in interfaceStepMetaInterface
- Overrides:
exportResources
in classBaseStepMeta
- Parameters:
space
- the variable space to usedefinitions
-resourceNamingInterface
-repository
- The repository to optionally load other resources from (to be converted to XML)metaStore
- the metaStore in which non-kettle metadata could reside.- Returns:
- the filename of the exported resource
- Throws:
org.pentaho.di.core.exception.KettleException
-
supportsErrorHandling
public boolean supportsErrorHandling()- Specified by:
supportsErrorHandling
in interfaceStepMetaInterface
- Overrides:
supportsErrorHandling
in classBaseStepMeta
- Returns:
- true if this step supports error "reporting" on rows: the ability to send rows to a certain target step.
-
setFileNameForTest
-
loadSource
-
saveSource
-
loadSourceRep
protected String loadSourceRep(Repository rep, org.pentaho.di.repository.ObjectId id_step, int i, org.pentaho.metastore.api.IMetaStore metaStore) throws org.pentaho.di.core.exception.KettleException - Throws:
org.pentaho.di.core.exception.KettleException
-
saveSourceRep
protected void saveSourceRep(Repository rep, org.pentaho.di.repository.ObjectId id_transformation, org.pentaho.di.repository.ObjectId id_step, int i, String fileName) throws org.pentaho.di.core.exception.KettleException - Throws:
org.pentaho.di.core.exception.KettleException
-
getEncoding
- Specified by:
getEncoding
in interfaceCsvInputAwareMeta
- Specified by:
getEncoding
in classBaseFileInputMeta<BaseFileInputAdditionalField,
BaseFileInputFiles, BaseFileField>
-
getLength
- Returns:
- the length
-
setLength
- Parameters:
length
- the length to set
-
isAcceptingFilenames
public boolean isAcceptingFilenames()Required for the Data Lineage.- Overrides:
isAcceptingFilenames
in classBaseFileInputMeta<BaseFileInputAdditionalField,
BaseFileInputFiles, BaseFileField>
-
getAcceptingStepName
Required for the Data Lineage.- Overrides:
getAcceptingStepName
in classBaseFileInputMeta<BaseFileInputAdditionalField,
BaseFileInputFiles, BaseFileField>
-
getAcceptingStep
Required for the Data Lineage. -
getAcceptingField
Required for the Data Lineage.- Overrides:
getAcceptingField
in classBaseFileInputMeta<BaseFileInputAdditionalField,
BaseFileInputFiles, BaseFileField>
-
getFilePaths
public String[] getFilePaths(org.pentaho.di.core.bowl.Bowl bowl, org.pentaho.di.core.variables.VariableSpace space) -
getTextFileList
public org.pentaho.di.core.fileinput.FileInputList getTextFileList(org.pentaho.di.core.bowl.Bowl bowl, org.pentaho.di.core.variables.VariableSpace space) -
hasHeader
public boolean hasHeader()- Specified by:
hasHeader
in interfaceCsvInputAwareMeta
-
getEscapeCharacter
- Specified by:
getEscapeCharacter
in interfaceCsvInputAwareMeta
-
getDelimiter
- Specified by:
getDelimiter
in interfaceCsvInputAwareMeta
-
getEnclosure
- Specified by:
getEnclosure
in interfaceCsvInputAwareMeta
-
getHeaderFileObject
Description copied from interface:CsvInputAwareMeta
Returns aFileObject
that corresponds to the first encountered input file. This object is used to read the file headers for the purpose of field parsing.- Specified by:
getHeaderFileObject
in interfaceCsvInputAwareMeta
- Parameters:
transMeta
- theTransMeta
- Returns:
- null if the
FileObject
cannot be created.
-