Class MySQLBulkLoaderMeta
- java.lang.Object
-
- org.pentaho.di.trans.step.BaseStepMeta
-
- org.pentaho.di.trans.steps.mysqlbulkloader.MySQLBulkLoaderMeta
-
- All Implemented Interfaces:
Cloneable
,ProvidesDatabaseConnectionInformation
,StepAttributesInterface
,StepMetaInterface
public class MySQLBulkLoaderMeta extends BaseStepMeta implements StepMetaInterface, ProvidesDatabaseConnectionInformation
Here are the steps that we need to take to make streaming loading possible for MySQL:
The following steps are carried out by the step at runtime:
- create a unique FIFO file (using mkfifo, LINUX ONLY FOLKS!)
- Create a target table using standard Kettle SQL generation
- Execute the LOAD DATA SQL Command to bulk load in a separate SQL thread in the background:
- Write to the FIFO file
- At the end, close the output stream to the FIFO file
* At the end, remove the FIFO file
Created on 24-oct-2007- Author:
- Matt Casters
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description static class
MySQLBulkLoaderMeta.FieldFormatTypeConverter
-
Field Summary
Fields Modifier and Type Field Description static int
FIELD_FORMAT_TYPE_DATE
static int
FIELD_FORMAT_TYPE_NUMBER
static int
FIELD_FORMAT_TYPE_OK
static int
FIELD_FORMAT_TYPE_STRING_ESCAPE
static int
FIELD_FORMAT_TYPE_TIMESTAMP
-
Fields inherited from class org.pentaho.di.trans.step.BaseStepMeta
attributes, databases, log, loggingObject, parentStepMeta, repository, STEP_ATTRIBUTES_FILE
-
-
Constructor Summary
Constructors Constructor Description MySQLBulkLoaderMeta()
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description void
afterInjectionSynchronization()
If we use injection we can have different arrays lengths.void
allocate(int nrvalues)
void
analyseImpact(List<DatabaseImpact> impact, TransMeta transMeta, StepMeta stepMeta, org.pentaho.di.core.row.RowMetaInterface prev, String[] input, String[] output, org.pentaho.di.core.row.RowMetaInterface info, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore)
Each step must be able to report on the impact it has on a database, table field, etc.void
check(List<org.pentaho.di.core.CheckResultInterface> remarks, TransMeta transMeta, StepMeta stepMeta, org.pentaho.di.core.row.RowMetaInterface prev, String[] input, String[] output, org.pentaho.di.core.row.RowMetaInterface info, org.pentaho.di.core.variables.VariableSpace space, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore)
Checks the settings of this step and puts the findings in a remarks List.Object
clone()
Make an exact copy of this step, make sure to explicitly copy Collections etc.String
getBulkSize()
org.pentaho.di.core.database.DatabaseMeta
getDatabaseMeta()
Returns the database meta for this stepString
getDelimiter()
String
getEnclosure()
String
getEncoding()
String
getEscapeChar()
int[]
getFieldFormatType()
static int
getFieldFormatType(String codeOrDescription)
static String
getFieldFormatTypeCode(int type)
static String[]
getFieldFormatTypeCodes()
static String
getFieldFormatTypeDescription(int type)
static String[]
getFieldFormatTypeDescriptions()
void
getFields(org.pentaho.di.core.row.RowMetaInterface rowMeta, String origin, org.pentaho.di.core.row.RowMetaInterface[] info, StepMeta nextStep, org.pentaho.di.core.variables.VariableSpace space, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore)
Gets the fields.String[]
getFieldStream()
String[]
getFieldTable()
String
getFifoFileName()
String
getMissingDatabaseConnectionInformationMessage()
Provides a way for this object to return a custom message when database connection information is incomplete or missing.org.pentaho.di.core.row.RowMetaInterface
getRequiredFields(org.pentaho.di.core.variables.VariableSpace space)
The natural way of data flow in a transformation is source-to-target.String
getSchemaName()
Returns the schema name for this step.org.pentaho.di.core.SQLStatement
getSQLStatements(TransMeta transMeta, StepMeta stepMeta, org.pentaho.di.core.row.RowMetaInterface prev, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore)
Standard method to return an SQLStatement object with SQL statements that the step needs in order to work correctly.StepInterface
getStep(StepMeta stepMeta, StepDataInterface stepDataInterface, int cnr, TransMeta transMeta, Trans trans)
Get the executing step, needed by Trans to launch a step.StepDataInterface
getStepData()
Get a new instance of the appropriate data class.String
getTableName()
Returns the table name for this steporg.pentaho.di.core.database.DatabaseMeta[]
getUsedDatabaseConnections()
This method returns all the database connections that are used by the step.String
getXML()
Produces the XML string that describes this step's information.boolean
isIgnoringErrors()
boolean
isLocalFile()
boolean
isReplacingData()
void
loadXML(Node stepnode, List<org.pentaho.di.core.database.DatabaseMeta> databases, org.pentaho.metastore.api.IMetaStore metaStore)
Load the values for this step from an XML Nodevoid
readRep(Repository rep, org.pentaho.metastore.api.IMetaStore metaStore, org.pentaho.di.repository.ObjectId id_step, List<org.pentaho.di.core.database.DatabaseMeta> databases)
Read the steps information from a Kettle repositoryvoid
saveRep(Repository rep, org.pentaho.metastore.api.IMetaStore metaStore, org.pentaho.di.repository.ObjectId id_transformation, org.pentaho.di.repository.ObjectId id_step)
Save the steps data into a Kettle repositoryvoid
setBulkSize(String bulkSize)
void
setDatabaseMeta(org.pentaho.di.core.database.DatabaseMeta database)
void
setDefault()
Set default valuesvoid
setDelimiter(String delimiter)
void
setEnclosure(String enclosure)
void
setEncoding(String encoding)
void
setEscapeChar(String escapeChar)
void
setFieldFormatType(int[] fieldFormatType)
void
setFieldStream(String[] fieldStream)
void
setFieldTable(String[] fieldTable)
void
setFifoFileName(String fifoFileName)
void
setIgnoringErrors(boolean ignoringErrors)
void
setLocalFile(boolean localFile)
void
setReplacingData(boolean replacingData)
void
setSchemaName(String schemaName)
void
setTableName(String tableName)
-
Methods inherited from class org.pentaho.di.trans.step.BaseStepMeta
analyseImpact, cancelQueries, check, check, createEntry, excludeFromCopyDistributeVerification, excludeFromRowLayoutVerification, exportResources, exportResources, extractStepMetadataEntries, findAttribute, findParent, findParentEntry, getActiveReferencedObjectDescription, getDescription, getDialogClassName, getFields, getLog, getLogChannelId, getName, getObjectCopy, getObjectId, getObjectRevision, getObjectType, getOptionalStreams, getParent, getParentStepMeta, getReferencedObjectDescriptions, getRepCode, getRepositoryDirectory, getRequiredFields, getResourceDependencies, getSQLStatements, getStepInjectionMetadataEntries, getStepIOMeta, getStepIOMeta, getStepMetaInjectionInterface, getSupportedTransformationTypes, getTableFields, getTooltip, getUsedArguments, getUsedLibraries, getXmlCode, handleStreamSelection, hasChanged, hasRepositoryReferences, isBasic, isDebug, isDetailed, isReferencedObjectEnabled, isRowLevel, loadReferencedObject, loadReferencedObject, loadStepAttributes, loadXML, loadXML, logBasic, logBasic, logDebug, logDebug, logDetailed, logDetailed, logError, logError, logError, logMinimal, logMinimal, logRowlevel, logRowlevel, lookupRepositoryReferences, readRep, readRep, resetStepIoMeta, saveRep, searchInfoAndTargetSteps, setChanged, setChanged, setParentStepMeta, setStepIOMeta, supportsErrorHandling
-
Methods inherited from class java.lang.Object
equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Methods inherited from interface org.pentaho.di.trans.step.StepMetaInterface
analyseImpact, cancelQueries, check, cleanAfterHopFromRemove, cleanAfterHopFromRemove, cleanAfterHopToRemove, excludeFromCopyDistributeVerification, excludeFromRowLayoutVerification, exportResources, exportResources, extractStepMetadataEntries, fetchTransMeta, getActiveReferencedObjectDescription, getDialogClassName, getFields, getOptionalStreams, getParentStepMeta, getReferencedObjectDescriptions, getResourceDependencies, getSQLStatements, getStepIOMeta, getStepMetaInjectionInterface, getSupportedTransformationTypes, getTableFields, getUsedArguments, getUsedLibraries, handleStreamSelection, hasChanged, hasRepositoryReferences, isReferencedObjectEnabled, loadReferencedObject, loadXML, lookupRepositoryReferences, passDataToServletOutput, readRep, resetStepIoMeta, saveRep, searchInfoAndTargetSteps, setChanged, setParentStepMeta, supportsErrorHandling
-
-
-
-
Field Detail
-
FIELD_FORMAT_TYPE_OK
public static final int FIELD_FORMAT_TYPE_OK
- See Also:
- Constant Field Values
-
FIELD_FORMAT_TYPE_DATE
public static final int FIELD_FORMAT_TYPE_DATE
- See Also:
- Constant Field Values
-
FIELD_FORMAT_TYPE_TIMESTAMP
public static final int FIELD_FORMAT_TYPE_TIMESTAMP
- See Also:
- Constant Field Values
-
FIELD_FORMAT_TYPE_NUMBER
public static final int FIELD_FORMAT_TYPE_NUMBER
- See Also:
- Constant Field Values
-
FIELD_FORMAT_TYPE_STRING_ESCAPE
public static final int FIELD_FORMAT_TYPE_STRING_ESCAPE
- See Also:
- Constant Field Values
-
-
Method Detail
-
getDatabaseMeta
public org.pentaho.di.core.database.DatabaseMeta getDatabaseMeta()
Description copied from interface:ProvidesDatabaseConnectionInformation
Returns the database meta for this step- Specified by:
getDatabaseMeta
in interfaceProvidesDatabaseConnectionInformation
- Returns:
- Returns the database.
-
setDatabaseMeta
public void setDatabaseMeta(org.pentaho.di.core.database.DatabaseMeta database)
- Parameters:
database
- The database to set.
-
getTableName
public String getTableName()
Description copied from interface:ProvidesDatabaseConnectionInformation
Returns the table name for this step- Specified by:
getTableName
in interfaceProvidesDatabaseConnectionInformation
- Returns:
- Returns the tableName.
-
setTableName
public void setTableName(String tableName)
- Parameters:
tableName
- The tableName to set.
-
getFieldTable
public String[] getFieldTable()
- Returns:
- Returns the fieldTable.
-
setFieldTable
public void setFieldTable(String[] fieldTable)
- Parameters:
fieldTable
- The fieldTable to set.
-
getFieldStream
public String[] getFieldStream()
- Returns:
- Returns the fieldStream.
-
setFieldStream
public void setFieldStream(String[] fieldStream)
- Parameters:
fieldStream
- The fieldStream to set.
-
loadXML
public void loadXML(Node stepnode, List<org.pentaho.di.core.database.DatabaseMeta> databases, org.pentaho.metastore.api.IMetaStore metaStore) throws org.pentaho.di.core.exception.KettleXMLException
Description copied from interface:StepMetaInterface
Load the values for this step from an XML Node- Specified by:
loadXML
in interfaceStepMetaInterface
- Overrides:
loadXML
in classBaseStepMeta
- Parameters:
stepnode
- the Node to get the info fromdatabases
- The available list of databases to reference tometaStore
- the metastore to optionally load external reference metadata from- Throws:
org.pentaho.di.core.exception.KettleXMLException
- When an unexpected XML error occurred. (malformed etc.)
-
allocate
public void allocate(int nrvalues)
-
clone
public Object clone()
Description copied from interface:StepMetaInterface
Make an exact copy of this step, make sure to explicitly copy Collections etc.- Specified by:
clone
in interfaceStepMetaInterface
- Overrides:
clone
in classBaseStepMeta
- Returns:
- an exact copy of this step
-
setDefault
public void setDefault()
Description copied from interface:StepMetaInterface
Set default values- Specified by:
setDefault
in interfaceStepMetaInterface
-
getXML
public String getXML()
Description copied from class:BaseStepMeta
Produces the XML string that describes this step's information.- Specified by:
getXML
in interfaceStepMetaInterface
- Overrides:
getXML
in classBaseStepMeta
- Returns:
- String containing the XML describing this step.
-
readRep
public void readRep(Repository rep, org.pentaho.metastore.api.IMetaStore metaStore, org.pentaho.di.repository.ObjectId id_step, List<org.pentaho.di.core.database.DatabaseMeta> databases) throws org.pentaho.di.core.exception.KettleException
Description copied from interface:StepMetaInterface
Read the steps information from a Kettle repository- Specified by:
readRep
in interfaceStepMetaInterface
- Overrides:
readRep
in classBaseStepMeta
- Parameters:
rep
- The repository to read frommetaStore
- The MetaStore to read external information fromid_step
- The step IDdatabases
- The databases to reference- Throws:
org.pentaho.di.core.exception.KettleException
- When an unexpected error occurred (database, network, etc)
-
saveRep
public void saveRep(Repository rep, org.pentaho.metastore.api.IMetaStore metaStore, org.pentaho.di.repository.ObjectId id_transformation, org.pentaho.di.repository.ObjectId id_step) throws org.pentaho.di.core.exception.KettleException
Description copied from interface:StepMetaInterface
Save the steps data into a Kettle repository- Specified by:
saveRep
in interfaceStepMetaInterface
- Overrides:
saveRep
in classBaseStepMeta
- Parameters:
rep
- The Kettle repository to save tometaStore
- the metaStore to optionally write toid_transformation
- The transformation IDid_step
- The step ID- Throws:
org.pentaho.di.core.exception.KettleException
- When an unexpected error occurred (database, network, etc)
-
getFields
public void getFields(org.pentaho.di.core.row.RowMetaInterface rowMeta, String origin, org.pentaho.di.core.row.RowMetaInterface[] info, StepMeta nextStep, org.pentaho.di.core.variables.VariableSpace space, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore) throws org.pentaho.di.core.exception.KettleStepException
Description copied from class:BaseStepMeta
Gets the fields.- Specified by:
getFields
in interfaceStepMetaInterface
- Overrides:
getFields
in classBaseStepMeta
- Parameters:
rowMeta
- the input row meta that is modified in this method to reflect the output row metadata of the steporigin
- Name of the step to use as input for the origin field in the valuesinfo
- Fields used as extra lookup informationnextStep
- the next step that is targetedspace
- the space The variable space to use to replace variablesrepository
- the repository to use to load Kettle metadata objects impacting the output fieldsmetaStore
- the MetaStore to use to load additional external data or metadata impacting the output fields- Throws:
org.pentaho.di.core.exception.KettleStepException
- the kettle step exception
-
check
public void check(List<org.pentaho.di.core.CheckResultInterface> remarks, TransMeta transMeta, StepMeta stepMeta, org.pentaho.di.core.row.RowMetaInterface prev, String[] input, String[] output, org.pentaho.di.core.row.RowMetaInterface info, org.pentaho.di.core.variables.VariableSpace space, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore)
Description copied from interface:StepMetaInterface
Checks the settings of this step and puts the findings in a remarks List.- Specified by:
check
in interfaceStepMetaInterface
- Overrides:
check
in classBaseStepMeta
- Parameters:
remarks
- The list to put the remarks in @see org.pentaho.di.core.CheckResultstepMeta
- The stepMeta to help checkingprev
- The fields coming from the previous stepinput
- The input step namesoutput
- The output step namesinfo
- The fields that are used as information by the stepspace
- the variable space to resolve variable expressions withrepository
- the repository to use to load Kettle metadata objects impacting the output fieldsmetaStore
- the MetaStore to use to load additional external data or metadata impacting the output fields
-
getSQLStatements
public org.pentaho.di.core.SQLStatement getSQLStatements(TransMeta transMeta, StepMeta stepMeta, org.pentaho.di.core.row.RowMetaInterface prev, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore) throws org.pentaho.di.core.exception.KettleStepException
Description copied from class:BaseStepMeta
Standard method to return an SQLStatement object with SQL statements that the step needs in order to work correctly. This can mean "create table", "create index" statements but also "alter table ... add/drop/modify" statements.- Specified by:
getSQLStatements
in interfaceStepMetaInterface
- Overrides:
getSQLStatements
in classBaseStepMeta
- Parameters:
transMeta
- TransInfo object containing the complete transformationstepMeta
- StepMeta object containing the complete stepprev
- Row containing meta-data for the input fields (no data)repository
- the repository to use to load Kettle metadata objects impacting the output fieldsmetaStore
- the MetaStore to use to load additional external data or metadata impacting the output fields- Returns:
- The SQL Statements for this step. If nothing has to be done, the SQLStatement.getSQL() == null. @see SQLStatement
- Throws:
org.pentaho.di.core.exception.KettleStepException
-
analyseImpact
public void analyseImpact(List<DatabaseImpact> impact, TransMeta transMeta, StepMeta stepMeta, org.pentaho.di.core.row.RowMetaInterface prev, String[] input, String[] output, org.pentaho.di.core.row.RowMetaInterface info, Repository repository, org.pentaho.metastore.api.IMetaStore metaStore) throws org.pentaho.di.core.exception.KettleStepException
Description copied from class:BaseStepMeta
Each step must be able to report on the impact it has on a database, table field, etc.- Specified by:
analyseImpact
in interfaceStepMetaInterface
- Overrides:
analyseImpact
in classBaseStepMeta
- Parameters:
impact
- The list of impacts @see org.pentaho.di.transMeta.DatabaseImpacttransMeta
- The transformation informationstepMeta
- The step informationprev
- The fields entering this stepinput
- The previous step namesoutput
- The output step namesinfo
- The fields used as information by this steprepository
- the repository to use to load Kettle metadata objects impacting the output fieldsmetaStore
- the MetaStore to use to load additional external data or metadata impacting the output fields- Throws:
org.pentaho.di.core.exception.KettleStepException
-
getStep
public StepInterface getStep(StepMeta stepMeta, StepDataInterface stepDataInterface, int cnr, TransMeta transMeta, Trans trans)
Description copied from interface:StepMetaInterface
Get the executing step, needed by Trans to launch a step.- Specified by:
getStep
in interfaceStepMetaInterface
- Parameters:
stepMeta
- The step infostepDataInterface
- the step data interface linked to this step. Here the step can store temporary data, database connections, etc.cnr
- The copy nr to gettransMeta
- The transformation infotrans
- The launching transformation
-
getStepData
public StepDataInterface getStepData()
Description copied from interface:StepMetaInterface
Get a new instance of the appropriate data class. This data class implements the StepDataInterface. It basically contains the persisting data that needs to live on, even if a worker thread is terminated.- Specified by:
getStepData
in interfaceStepMetaInterface
- Returns:
- The appropriate StepDataInterface class.
-
getUsedDatabaseConnections
public org.pentaho.di.core.database.DatabaseMeta[] getUsedDatabaseConnections()
Description copied from class:BaseStepMeta
This method returns all the database connections that are used by the step.- Specified by:
getUsedDatabaseConnections
in interfaceStepMetaInterface
- Overrides:
getUsedDatabaseConnections
in classBaseStepMeta
- Returns:
- an array of database connections meta-data. Return an empty array if no connections are used.
-
getRequiredFields
public org.pentaho.di.core.row.RowMetaInterface getRequiredFields(org.pentaho.di.core.variables.VariableSpace space) throws org.pentaho.di.core.exception.KettleException
Description copied from class:BaseStepMeta
The natural way of data flow in a transformation is source-to-target. However, this makes mapping to target tables difficult to do. To help out here, we supply information to the transformation meta-data model about which fields are required for a step. This allows us to automate certain tasks like the mapping to pre-defined tables. The Table Output step in this case will output the fields in the target table using this method.This default implementation returns an empty row meaning that no fields are required for this step to operate.
- Specified by:
getRequiredFields
in interfaceStepMetaInterface
- Overrides:
getRequiredFields
in classBaseStepMeta
- Parameters:
space
- the variable space to use to do variable substitution.- Returns:
- the required fields for this steps meta data.
- Throws:
org.pentaho.di.core.exception.KettleException
- in case the required fields can't be determined
-
getSchemaName
public String getSchemaName()
Description copied from interface:ProvidesDatabaseConnectionInformation
Returns the schema name for this step.- Specified by:
getSchemaName
in interfaceProvidesDatabaseConnectionInformation
- Returns:
- the schemaName
-
setSchemaName
public void setSchemaName(String schemaName)
- Parameters:
schemaName
- the schemaName to set
-
getEncoding
public String getEncoding()
-
setEncoding
public void setEncoding(String encoding)
-
getDelimiter
public String getDelimiter()
-
setDelimiter
public void setDelimiter(String delimiter)
-
getEnclosure
public String getEnclosure()
-
setEnclosure
public void setEnclosure(String enclosure)
-
getFifoFileName
public String getFifoFileName()
- Returns:
- the fifoFileName
-
setFifoFileName
public void setFifoFileName(String fifoFileName)
- Parameters:
fifoFileName
- the fifoFileName to set
-
isReplacingData
public boolean isReplacingData()
- Returns:
- the replacingData
-
setReplacingData
public void setReplacingData(boolean replacingData)
- Parameters:
replacingData
- the replacingData to set
-
getFieldFormatType
public int[] getFieldFormatType()
-
setFieldFormatType
public void setFieldFormatType(int[] fieldFormatType)
-
getFieldFormatTypeCodes
public static String[] getFieldFormatTypeCodes()
-
getFieldFormatTypeDescriptions
public static String[] getFieldFormatTypeDescriptions()
-
getFieldFormatTypeCode
public static String getFieldFormatTypeCode(int type)
-
getFieldFormatTypeDescription
public static String getFieldFormatTypeDescription(int type)
-
getFieldFormatType
public static int getFieldFormatType(String codeOrDescription)
-
getEscapeChar
public String getEscapeChar()
- Returns:
- the escapeChar
-
setEscapeChar
public void setEscapeChar(String escapeChar)
- Parameters:
escapeChar
- the escapeChar to set
-
isIgnoringErrors
public boolean isIgnoringErrors()
- Returns:
- the ignoringErrors
-
setIgnoringErrors
public void setIgnoringErrors(boolean ignoringErrors)
- Parameters:
ignoringErrors
- the ignoringErrors to set
-
getBulkSize
public String getBulkSize()
- Returns:
- the bulkSize
-
setBulkSize
public void setBulkSize(String bulkSize)
- Parameters:
bulkSize
- the bulkSize to set
-
isLocalFile
public boolean isLocalFile()
- Returns:
- the localFile
-
setLocalFile
public void setLocalFile(boolean localFile)
- Parameters:
localFile
- the localFile to set
-
getMissingDatabaseConnectionInformationMessage
public String getMissingDatabaseConnectionInformationMessage()
Description copied from interface:ProvidesDatabaseConnectionInformation
Provides a way for this object to return a custom message when database connection information is incomplete or missing. If this returnsnull
a default message will be displayed for missing information.- Specified by:
getMissingDatabaseConnectionInformationMessage
in interfaceProvidesDatabaseConnectionInformation
- Returns:
- A friendly message that describes that database connection information is missing and, potentially, why.
-
afterInjectionSynchronization
public void afterInjectionSynchronization()
If we use injection we can have different arrays lengths. We need synchronize them for consistency behavior with UI
-
-