Class MaintenanceTaskBuilder<T extends MaintenanceTaskBuilder<?>>

java.lang.Object
org.apache.iceberg.flink.maintenance.api.MaintenanceTaskBuilder<T>
Direct Known Subclasses:
ExpireSnapshots.Builder

@Experimental public abstract class MaintenanceTaskBuilder<T extends MaintenanceTaskBuilder<?>> extends Object
  • Constructor Details

    • MaintenanceTaskBuilder

      public MaintenanceTaskBuilder()
  • Method Details

    • scheduleOnCommitCount

      public T scheduleOnCommitCount(int commitCount)
      After a given number of Iceberg table commits since the last run, starts the downstream job.
      Parameters:
      commitCount - after the downstream job should be started
    • scheduleOnDataFileCount

      public T scheduleOnDataFileCount(int dataFileCount)
      After a given number of new data files since the last run, starts the downstream job.
      Parameters:
      dataFileCount - after the downstream job should be started
    • scheduleOnDataFileSize

      public T scheduleOnDataFileSize(long dataFileSizeInBytes)
      After a given aggregated data file size since the last run, starts the downstream job.
      Parameters:
      dataFileSizeInBytes - after the downstream job should be started
    • scheduleOnPosDeleteFileCount

      public T scheduleOnPosDeleteFileCount(int posDeleteFileCount)
      After a given number of new positional delete files since the last run, starts the downstream job.
      Parameters:
      posDeleteFileCount - after the downstream job should be started
    • scheduleOnPosDeleteRecordCount

      public T scheduleOnPosDeleteRecordCount(long posDeleteRecordCount)
      After a given number of new positional delete records since the last run, starts the downstream job.
      Parameters:
      posDeleteRecordCount - after the downstream job should be started
    • scheduleOnEqDeleteFileCount

      public T scheduleOnEqDeleteFileCount(int eqDeleteFileCount)
      After a given number of new equality delete files since the last run, starts the downstream job.
      Parameters:
      eqDeleteFileCount - after the downstream job should be started
    • scheduleOnEqDeleteRecordCount

      public T scheduleOnEqDeleteRecordCount(long eqDeleteRecordCount)
      After a given number of new equality delete records since the last run, starts the downstream job.
      Parameters:
      eqDeleteRecordCount - after the downstream job should be started
    • scheduleOnInterval

      public T scheduleOnInterval(Duration interval)
      After a given time since the last run, starts the downstream job.
      Parameters:
      interval - after the downstream job should be started
    • uidSuffix

      public T uidSuffix(String newUidSuffix)
      The suffix used for the generated Transformation's uid.
      Parameters:
      newUidSuffix - for the transformations
    • slotSharingGroup

      public T slotSharingGroup(String newSlotSharingGroup)
      The SingleOutputStreamOperator.slotSharingGroup(String) for all the operators of the generated stream. Could be used to separate the resources used by this task.
      Parameters:
      newSlotSharingGroup - to be used for the operators
    • parallelism

      public T parallelism(int newParallelism)
      Sets the parallelism for the stream.
      Parameters:
      newParallelism - the required parallelism
    • index

      protected int index()
    • taskName

      protected String taskName()
    • tableName

      protected String tableName()
    • tableLoader

      protected TableLoader tableLoader()
    • uidSuffix

      protected String uidSuffix()
    • slotSharingGroup

      protected String slotSharingGroup()
    • parallelism

      protected Integer parallelism()
    • operatorName

      protected String operatorName(String operatorNameBase)