Handle reading of the input (object) from the kettle repository by getting the job-xml from the repository step attribute string and then re-hydrate the
job entry (object) with our already existing read method.
Handle reading of the input (object) from the kettle repository by getting the step-xml from the repository step attribute string and then re-hydrate the
step (object) with our already existing read method.
This interface allows you to pass a simple interface to an object to allow it
to store or load itself from or to any type of repository in a generic fashion.
The RepositoryElementMetaInterface is used to provide metadata about repository elements
without requiring loading the entire element from the repository.
The RepositoryObjectInterface represents all objects that can come out of a repository, including
directories (RepositoryDirectoryInterface), elements such as TransMeta and JobMeta
(RepositoryElementMetaInterface), and metadata about elements (RepositoryElementMetaInterface).
Slaves don't need ALL the partitions, they just need a few.
So we should only retain those partitions that are of interest to the slave server.
Divide the number of partitions (6) through the number of slaves (2)
That gives you 0, 1, 2, 3, 4, 5
Slave 0 : 0, 2, 4
Slave 1 : 1, 3, 5
--> slaveNumber == partitionNr % slaveCount
rounds double f to any number of places after decimal point
Does arithmetic using BigDecimal class to avoid integer overflow while rounding
TODO: make the rounding itself optional in the Props for performance reasons.
This test generates rows on the master, generates random values clustered and brings them back the master.
See also: PDI-6324 : Generate Rows to a clustered step ceases to work
This test reads a CSV file in parallel on the master in 1 copy.
It then passes the data over to a dummy step on the slaves.
We want to make sure that only 1 copy is considered.
This test reads a CSV file in parallel on all 3 slaves, each with 4 partitions.
This is a variation on the test right above, with 2 steps in sequence in clustering & partitioning.
It then passes the data over to a dummy step on the slaves.