Class MaintenanceTaskBuilder<T extends MaintenanceTaskBuilder<?>>
java.lang.Object
org.apache.iceberg.flink.maintenance.api.MaintenanceTaskBuilder<T>
- Direct Known Subclasses:
DeleteOrphanFiles.Builder,ExpireSnapshots.Builder,RewriteDataFiles.Builder
@Experimental
public abstract class MaintenanceTaskBuilder<T extends MaintenanceTaskBuilder<?>>
extends Object
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionprotected intindex()protected StringoperatorName(String operatorNameBase) protected Integerparallelism(int newParallelism) Sets the parallelism for the stream.scheduleOnCommitCount(int commitCount) After a given number of Iceberg table commits since the last run, starts the downstream job.scheduleOnDataFileCount(int dataFileCount) After a given number of new data files since the last run, starts the downstream job.scheduleOnDataFileSize(long dataFileSizeInBytes) After a given aggregated data file size since the last run, starts the downstream job.scheduleOnEqDeleteFileCount(int eqDeleteFileCount) After a given number of new equality delete files since the last run, starts the downstream job.scheduleOnEqDeleteRecordCount(long eqDeleteRecordCount) After a given number of new equality delete records since the last run, starts the downstream job.scheduleOnInterval(Duration interval) After a given time since the last run, starts the downstream job.scheduleOnPosDeleteFileCount(int posDeleteFileCount) After a given number of new positional delete files since the last run, starts the downstream job.scheduleOnPosDeleteRecordCount(long posDeleteRecordCount) After a given number of new positional delete records since the last run, starts the downstream job.protected StringslotSharingGroup(String newSlotSharingGroup) TheSingleOutputStreamOperator.slotSharingGroup(String)for all the operators of the generated stream.protected TableLoaderprotected Stringprotected StringtaskName()protected StringThe suffix used for the generatedTransformation's uid.
-
Constructor Details
-
MaintenanceTaskBuilder
public MaintenanceTaskBuilder()
-
-
Method Details
-
scheduleOnCommitCount
After a given number of Iceberg table commits since the last run, starts the downstream job.- Parameters:
commitCount- after the downstream job should be started
-
scheduleOnDataFileCount
After a given number of new data files since the last run, starts the downstream job.- Parameters:
dataFileCount- after the downstream job should be started
-
scheduleOnDataFileSize
After a given aggregated data file size since the last run, starts the downstream job.- Parameters:
dataFileSizeInBytes- after the downstream job should be started
-
scheduleOnPosDeleteFileCount
After a given number of new positional delete files since the last run, starts the downstream job.- Parameters:
posDeleteFileCount- after the downstream job should be started
-
scheduleOnPosDeleteRecordCount
After a given number of new positional delete records since the last run, starts the downstream job.- Parameters:
posDeleteRecordCount- after the downstream job should be started
-
scheduleOnEqDeleteFileCount
After a given number of new equality delete files since the last run, starts the downstream job.- Parameters:
eqDeleteFileCount- after the downstream job should be started
-
scheduleOnEqDeleteRecordCount
After a given number of new equality delete records since the last run, starts the downstream job.- Parameters:
eqDeleteRecordCount- after the downstream job should be started
-
scheduleOnInterval
After a given time since the last run, starts the downstream job.- Parameters:
interval- after the downstream job should be started
-
uidSuffix
The suffix used for the generatedTransformation's uid.- Parameters:
newUidSuffix- for the transformations
-
slotSharingGroup
TheSingleOutputStreamOperator.slotSharingGroup(String)for all the operators of the generated stream. Could be used to separate the resources used by this task.- Parameters:
newSlotSharingGroup- to be used for the operators
-
parallelism
Sets the parallelism for the stream.- Parameters:
newParallelism- the required parallelism
-
index
protected int index() -
taskName
-
tableName
-
tableLoader
-
uidSuffix
-
slotSharingGroup
-
parallelism
-
operatorName
-