Class MaintenanceTaskBuilder<T extends MaintenanceTaskBuilder<?>>
java.lang.Object
org.apache.iceberg.flink.maintenance.api.MaintenanceTaskBuilder<T>
- Direct Known Subclasses:
ExpireSnapshots.Builder
@Experimental
public abstract class MaintenanceTaskBuilder<T extends MaintenanceTaskBuilder<?>>
extends Object
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionprotected int
index()
protected String
operatorName
(String operatorNameBase) protected Integer
parallelism
(int newParallelism) Sets the parallelism for the stream.scheduleOnCommitCount
(int commitCount) After a given number of Iceberg table commits since the last run, starts the downstream job.scheduleOnDataFileCount
(int dataFileCount) After a given number of new data files since the last run, starts the downstream job.scheduleOnDataFileSize
(long dataFileSizeInBytes) After a given aggregated data file size since the last run, starts the downstream job.scheduleOnEqDeleteFileCount
(int eqDeleteFileCount) After a given number of new equality delete files since the last run, starts the downstream job.scheduleOnEqDeleteRecordCount
(long eqDeleteRecordCount) After a given number of new equality delete records since the last run, starts the downstream job.scheduleOnInterval
(Duration interval) After a given time since the last run, starts the downstream job.scheduleOnPosDeleteFileCount
(int posDeleteFileCount) After a given number of new positional delete files since the last run, starts the downstream job.scheduleOnPosDeleteRecordCount
(long posDeleteRecordCount) After a given number of new positional delete records since the last run, starts the downstream job.protected String
slotSharingGroup
(String newSlotSharingGroup) TheSingleOutputStreamOperator.slotSharingGroup(String)
for all the operators of the generated stream.protected TableLoader
protected String
protected String
taskName()
protected String
The suffix used for the generatedTransformation
's uid.
-
Constructor Details
-
MaintenanceTaskBuilder
public MaintenanceTaskBuilder()
-
-
Method Details
-
scheduleOnCommitCount
After a given number of Iceberg table commits since the last run, starts the downstream job.- Parameters:
commitCount
- after the downstream job should be started
-
scheduleOnDataFileCount
After a given number of new data files since the last run, starts the downstream job.- Parameters:
dataFileCount
- after the downstream job should be started
-
scheduleOnDataFileSize
After a given aggregated data file size since the last run, starts the downstream job.- Parameters:
dataFileSizeInBytes
- after the downstream job should be started
-
scheduleOnPosDeleteFileCount
After a given number of new positional delete files since the last run, starts the downstream job.- Parameters:
posDeleteFileCount
- after the downstream job should be started
-
scheduleOnPosDeleteRecordCount
After a given number of new positional delete records since the last run, starts the downstream job.- Parameters:
posDeleteRecordCount
- after the downstream job should be started
-
scheduleOnEqDeleteFileCount
After a given number of new equality delete files since the last run, starts the downstream job.- Parameters:
eqDeleteFileCount
- after the downstream job should be started
-
scheduleOnEqDeleteRecordCount
After a given number of new equality delete records since the last run, starts the downstream job.- Parameters:
eqDeleteRecordCount
- after the downstream job should be started
-
scheduleOnInterval
After a given time since the last run, starts the downstream job.- Parameters:
interval
- after the downstream job should be started
-
uidSuffix
The suffix used for the generatedTransformation
's uid.- Parameters:
newUidSuffix
- for the transformations
-
slotSharingGroup
TheSingleOutputStreamOperator.slotSharingGroup(String)
for all the operators of the generated stream. Could be used to separate the resources used by this task.- Parameters:
newSlotSharingGroup
- to be used for the operators
-
parallelism
Sets the parallelism for the stream.- Parameters:
newParallelism
- the required parallelism
-
index
protected int index() -
taskName
-
tableName
-
tableLoader
-
uidSuffix
-
slotSharingGroup
-
parallelism
-
operatorName
-