Class DynamicIcebergSink.Builder<T>
- Enclosing class:
- DynamicIcebergSink
- 
Method Summary
Modifier and Type, Method, Description:
- org.apache.flink.streaming.api.datastream.DataStreamSink<org.apache.iceberg.flink.sink.dynamic.DynamicRecordInternal> append(): Append the Iceberg sink operators to write records to an Iceberg table.
- cacheMaxSize(int maxSize): Maximum size of the caches used in Dynamic Sink for table data and serializers.
- cacheRefreshMs(long refreshMs): Maximum interval for cache item renewals.
- catalogLoader(CatalogLoader newCatalogLoader): The catalog loader is used to load tables lazily in DynamicCommitter. It is needed because Table is not serializable, so the table loaded via Builder#table cannot simply be used in the remote task manager.
- flinkConf(org.apache.flink.configuration.ReadableConfig config)
- generator(DynamicRecordGenerator<T> inputGenerator)
- immediateTableUpdate(boolean newImmediateUpdate)
- inputSchemasPerTableCacheMaxSize(int inputSchemasPerTableCacheMaxSize): Maximum number of input Schema objects to cache per Iceberg table.
- overwrite(boolean newOverwrite)
- set: Set the write properties for IcebergSink.
- setAll: Set the write properties for IcebergSink.
- setSnapshotProperty(String property, String value)
- snapshotProperties(Map<String, String> properties)
- uidPrefix: Set the uid prefix for IcebergSink operators.
- writeParallelism(int newWriteParallelism): Configures the write parallelism for the Iceberg stream writer.
- 
Method Details
- 
forInput
public DynamicIcebergSink.Builder<T> forInput(org.apache.flink.streaming.api.datastream.DataStream<T> inputStream)
- 
generator
- 
catalogLoader
The catalog loader is used to load tables lazily in DynamicCommitter. It is needed because Table is not serializable, so the table loaded via Builder#table cannot simply be used in the remote task manager.
- Parameters:
- newCatalogLoader - to load the Iceberg table inside tasks.
- Returns:
- DynamicIcebergSink.Builder to connect the Iceberg table.
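The reasoning behind a serializable loader can be illustrated with a minimal, self-contained sketch. It does not use the Iceberg or Flink classes: FakeTable and LazyTableLoader are hypothetical stand-ins, and plain Java serialization is assumed as the shipping mechanism. The loader carries only the information needed to find the table, and the non-serializable handle is rebuilt on first use after the loader arrives at the remote side.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;

/** Hypothetical stand-in for a table handle that is not serializable. */
final class FakeTable {
  final String name;

  FakeTable(String name) {
    this.name = name;
  }
}

/** Hypothetical loader: only the table name is shipped, the table is loaded lazily. */
final class LazyTableLoader implements Serializable {
  private static final long serialVersionUID = 1L;

  private final String tableName;
  // transient: skipped by serialization, so it is null again on the remote side
  private transient FakeTable table;

  LazyTableLoader(String tableName) {
    this.tableName = tableName;
  }

  /** Loads the table on first use; a real loader would contact a catalog here. */
  FakeTable load() {
    if (table == null) {
      table = new FakeTable(tableName);
    }
    return table;
  }

  boolean isLoaded() {
    return table != null;
  }

  /** Simulates shipping the loader to a remote task by serializing and deserializing it. */
  static LazyTableLoader roundTrip(LazyTableLoader loader) {
    try {
      ByteArrayOutputStream bytes = new ByteArrayOutputStream();
      try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
        out.writeObject(loader);
      }
      try (ObjectInputStream in =
          new ObjectInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
        return (LazyTableLoader) in.readObject();
      }
    } catch (IOException | ClassNotFoundException e) {
      throw new IllegalStateException("Serialization round trip failed", e);
    }
  }
}
```

In this sketch the loader survives the round trip even after the table was loaded locally, because the loaded handle is never part of the serialized form.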
 
- 
set
Set the write properties for IcebergSink. View the supported properties in FlinkWriteOptions.
- 
setAll
Set the write properties for IcebergSink. View the supported properties in FlinkWriteOptions.
- 
overwrite
- 
flinkConf
public DynamicIcebergSink.Builder<T> flinkConf(org.apache.flink.configuration.ReadableConfig config)
- 
writeParallelism
Configures the write parallelism for the Iceberg stream writer.
- Parameters:
- newWriteParallelism - the number of parallel Iceberg stream writers.
- Returns:
- DynamicIcebergSink.Builder to connect the Iceberg table.
 
- 
uidPrefix
Set the uid prefix for IcebergSink operators. Note that IcebergSink internally consists of multiple operators (like writer, committer, aggregator). The actual operator uid is the prefix with a suffix appended, like "uidPrefix-writer".

If provided, this prefix is also applied to operator names.

Flink auto-generates operator uids if they are not set explicitly. It is a recommended best practice to set a uid for all operators before deploying to production. Flink has an option, pipeline.auto-generate-uid=false, to disable auto-generation and force explicit setting of all operator uids.

Be careful when setting this for an existing job, because it changes the operator uid from an auto-generated one to this new value. When deploying the change with a checkpoint, Flink won't be able to restore the previous IcebergSink operator state (more specifically the committer operator state). You need to use --allowNonRestoredState to ignore the previous sink state. During restore, IcebergSink state is used to check whether the last commit was actually successful. --allowNonRestoredState can lead to data loss if the Iceberg commit failed in the last completed checkpoint.
- Parameters:
- newPrefix - prefix for Flink sink operator uid and name
- Returns:
- DynamicIcebergSink.Builder to connect the Iceberg table.
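To make the restore caveat concrete, here is a small sketch of how state matching by operator uid behaves when the uids change. It is an illustration only and does not use Flink: the class UidPrefixSketch, the suffix list beyond "writer", and the hash-like uid strings are all assumptions made for the example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch: derives operator uids from a prefix and matches them against stored state. */
final class UidPrefixSketch {
  // "writer" follows the "uidPrefix-writer" example in the text;
  // "committer" and "aggregator" are assumed suffix names.
  static final List<String> SUFFIXES = List.of("writer", "committer", "aggregator");

  /** Builds one uid per internal operator, in the form prefix-suffix. */
  static List<String> operatorUids(String uidPrefix) {
    List<String> uids = new ArrayList<>();
    for (String suffix : SUFFIXES) {
      uids.add(uidPrefix + "-" + suffix);
    }
    return uids;
  }

  /** Counts how many operators of the new job find state stored under the same uid. */
  static long restorable(Map<String, String> checkpointState, List<String> newUids) {
    return newUids.stream().filter(checkpointState::containsKey).count();
  }
}
```

If the checkpoint was taken with auto-generated uids (modeled here as arbitrary strings), none of the prefixed uids match, which corresponds to the situation where --allowNonRestoredState becomes necessary. Keeping the same prefix across deployments keeps every uid stable.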
 
- 
snapshotProperties
- 
setSnapshotProperty
- 
toBranch
- 
immediateTableUpdate
- 
cacheMaxSize
Maximum size of the caches used in Dynamic Sink for table data and serializers.
- 
cacheRefreshMs
Maximum interval for cache item renewals.
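How a size limit and a refresh interval can interact in a cache is sketched below. This is not the Dynamic Sink's cache: the class RefreshingCacheSketch, the least-recently-used eviction policy, and the rule that an entry is reloaded once it is at least refreshMs old are all assumptions chosen for illustration. The current time is passed in explicitly so the behavior is deterministic.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.Function;

/** Hypothetical cache bounded by entry count, reloading entries older than refreshMs. */
final class RefreshingCacheSketch<K, V> {
  /** A cached value together with the time it was loaded. */
  private static final class Timed<V> {
    final V value;
    final long loadedAtMs;

    Timed(V value, long loadedAtMs) {
      this.value = value;
      this.loadedAtMs = loadedAtMs;
    }
  }

  private final long refreshMs;
  private final Map<K, Timed<V>> entries;

  RefreshingCacheSketch(int maxSize, long refreshMs) {
    this.refreshMs = refreshMs;
    // Access-ordered map: once size exceeds maxSize, the least recently used entry is dropped.
    this.entries =
        new LinkedHashMap<K, Timed<V>>(16, 0.75f, true) {
          @Override
          protected boolean removeEldestEntry(Map.Entry<K, Timed<V>> eldest) {
            return size() > maxSize;
          }
        };
  }

  /** Returns the cached value, calling the loader if the key is missing or the entry is stale. */
  V get(K key, long nowMs, Function<K, V> loader) {
    Timed<V> entry = entries.get(key);
    if (entry == null || nowMs - entry.loadedAtMs >= refreshMs) {
      entry = new Timed<>(loader.apply(key), nowMs);
      entries.put(key, entry);
    }
    return entry.value;
  }

  int size() {
    return entries.size();
  }
}
```

Under these assumptions, a larger maxSize trades memory for fewer reloads of evicted entries, and a smaller refreshMs trades more loader calls for fresher entries.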
- 
inputSchemasPerTableCacheMaxSize
public DynamicIcebergSink.Builder<T> inputSchemasPerTableCacheMaxSize(int inputSchemasPerTableCacheMaxSize)
- 
append
public org.apache.flink.streaming.api.datastream.DataStreamSink<org.apache.iceberg.flink.sink.dynamic.DynamicRecordInternal> append()
Append the Iceberg sink operators to write records to an Iceberg table.
- Returns:
- DataStreamSink for the sink.
 
 
-