public interface DeletedRowsScanTask extends ChangelogScanTask, ContentScanTask<DataFile>
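Suppose snapshot S1 contains data files F1, F2, and F3. Snapshot S2 then adds a position delete file, D1, that deletes records from F2, and snapshot S3 adds an equality delete file, D2, that removes records from F1, F2, and F3. A scan for changes from S2 to S3 (inclusive) should include the following tasks:
- DeletedRowsScanTask(file=F2, added-deletes=[D1], existing-deletes=[]) for S2
- DeletedRowsScanTask(file=F1, added-deletes=[D2], existing-deletes=[]) for S3
- DeletedRowsScanTask(file=F2, added-deletes=[D2], existing-deletes=[D1]) for S3
- DeletedRowsScanTask(file=F3, added-deletes=[D2], existing-deletes=[]) for S3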
Readers consuming these tasks should produce deleted records with metadata like change ordinal and commit snapshot ID.
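The S1 to S3 scenario above can be replayed in a small, self-contained model. This is only a sketch under stated assumptions: `ChangelogPlanSketch`, `DeleteCommit`, and `plan` are hypothetical names invented for illustration, each commit is assumed to add exactly one delete file, and nothing here reflects how the library actually plans its tasks. The sketch shows how a delete file added in an earlier snapshot shows up as an existing delete for later snapshots.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Toy model only: these classes are hypothetical stand-ins, not Iceberg types.
class ChangelogPlanSketch {

  // A snapshot that adds one delete file affecting a set of data files.
  static final class DeleteCommit {
    final String snapshot;
    final String deleteFile;
    final List<String> affectedDataFiles;

    DeleteCommit(String snapshot, String deleteFile, List<String> affectedDataFiles) {
      this.snapshot = snapshot;
      this.deleteFile = deleteFile;
      this.affectedDataFiles = affectedDataFiles;
    }
  }

  // Walks the commits in order and describes one task per (snapshot, data file) pair.
  static List<String> plan(List<DeleteCommit> commits) {
    // Delete files already seen for each data file, in commit order.
    Map<String, List<String>> seenDeletes = new LinkedHashMap<>();
    List<String> tasks = new ArrayList<>();
    for (DeleteCommit commit : commits) {
      for (String dataFile : commit.affectedDataFiles) {
        List<String> existing = seenDeletes.computeIfAbsent(dataFile, k -> new ArrayList<>());
        tasks.add(String.format(
            "DeletedRowsScanTask(file=%s, added-deletes=[%s], existing-deletes=%s) for %s",
            dataFile, commit.deleteFile, existing, commit.snapshot));
        // Record the delete file only after describing the task, so that
        // later snapshots see it as an existing delete.
        existing.add(commit.deleteFile);
      }
    }
    return tasks;
  }

  public static void main(String[] args) {
    List<DeleteCommit> commits = List.of(
        new DeleteCommit("S2", "D1", List.of("F2")),
        new DeleteCommit("S3", "D2", List.of("F1", "F2", "F3")));
    plan(commits).forEach(System.out::println);
  }
}
```

In this model F2 is the only data file whose S3 task carries a non-empty list of existing deletes, because D1 was already applied to it in S2.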
Modifier and Type | Method and Description |
---|---|
java.util.List<DeleteFile> | addedDeletes(): A list of added delete files that apply to the task's data file. |
java.util.List<DeleteFile> | existingDeletes(): A list of delete files that existed before and must be applied prior to determining which records are deleted by delete files in addedDeletes(). |
default int | filesCount(): The number of files that will be opened by this scan task. |
default ChangelogOperation | operation(): Returns the type of changes produced by this task. |
default long | sizeBytes(): The number of bytes that should be read by this scan task. |
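Methods inherited from interface ChangelogScanTask: changeOrdinal, commitSnapshotId
Methods inherited from interface ContentScanTask: estimatedRowsCount, file, length, partition, residual, start
Methods inherited from interface PartitionScanTask: spec
Methods inherited from interface ScanTask: asCombinedScanTask, asDataTask, asFileScanTask, isDataTask, isFileScanTask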
java.util.List<DeleteFile> addedDeletes()
A list of added delete files that apply to the task's data file. Records removed by these delete files should appear as deletes in the changelog.
java.util.List<DeleteFile> existingDeletes()
A list of delete files that existed before and must be applied prior to determining which records are deleted by delete files in addedDeletes(). Records removed by these delete files should not appear in the changelog.
default ChangelogOperation operation()
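Description copied from interface: ChangelogScanTask
Returns the type of changes produced by this task.
Specified by: operation in interface ChangelogScanTask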
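The contract of addedDeletes() and existingDeletes() can be illustrated with a toy reader. This is a minimal sketch, not the library's reader: `DeletedRowsReaderSketch` and `deletedRows` are hypothetical names, and sets of row positions stand in for real delete files. It assumes a reader first drops rows already removed by the existing deletes and only then reports rows matched by the added deletes.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Toy model only: row positions stand in for delete files; no Iceberg types are used.
class DeletedRowsReaderSketch {

  // Returns the rows a changelog reader would emit as deletes for one task.
  static List<String> deletedRows(
      List<String> dataFileRows, Set<Integer> existingDeletes, Set<Integer> addedDeletes) {
    List<String> changelog = new ArrayList<>();
    for (int pos = 0; pos < dataFileRows.size(); pos++) {
      if (existingDeletes.contains(pos)) {
        continue; // already removed before this snapshot, so not a new change
      }
      if (addedDeletes.contains(pos)) {
        changelog.add(dataFileRows.get(pos)); // newly removed: emit as a delete
      }
    }
    return changelog;
  }

  public static void main(String[] args) {
    List<String> f2 = List.of("a", "b", "c", "d");
    // The existing delete removed position 1; the added delete matches positions 1 and 3.
    System.out.println(deletedRows(f2, Set.of(1), Set.of(1, 3))); // prints [d]
  }
}
```

Position 1 is matched by both sets but is not reported, since that row was no longer live when the added delete file was committed.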
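default long sizeBytes()
Description copied from interface: ScanTask
The number of bytes that should be read by this scan task.
Specified by: sizeBytes in interface ContentScanTask<DataFile>
Specified by: sizeBytes in interface ScanTask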
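default int filesCount()
Description copied from interface: ScanTask
The number of files that will be opened by this scan task.
Specified by: filesCount in interface ScanTask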
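The descriptions of sizeBytes() and filesCount() do not say how the default values are derived. One plausible reading, offered here as an assumption and not as the library's implementation, is that a task opens its data file plus every added and existing delete file, and reads its slice of the data file plus the full size of each delete file. `ScanCostSketch`, `FileInfo`, and both formulas below are hypothetical.

```java
import java.util.List;

// Assumption-based sketch: FileInfo and the formulas below are hypothetical.
class ScanCostSketch {

  // Minimal stand-in for a delete file: a path and its size on disk.
  static final class FileInfo {
    final String path;
    final long sizeInBytes;

    FileInfo(String path, long sizeInBytes) {
      this.path = path;
      this.sizeInBytes = sizeInBytes;
    }
  }

  // One data file plus every delete file that has to be opened.
  static int filesCount(List<FileInfo> addedDeletes, List<FileInfo> existingDeletes) {
    return 1 + addedDeletes.size() + existingDeletes.size();
  }

  // Bytes of the data file slice plus the full size of each delete file.
  static long sizeBytes(
      long dataLength, List<FileInfo> addedDeletes, List<FileInfo> existingDeletes) {
    long total = dataLength;
    for (FileInfo file : addedDeletes) {
      total += file.sizeInBytes;
    }
    for (FileInfo file : existingDeletes) {
      total += file.sizeInBytes;
    }
    return total;
  }

  public static void main(String[] args) {
    List<FileInfo> added = List.of(new FileInfo("D2", 200L));
    List<FileInfo> existing = List.of(new FileInfo("D1", 100L));
    System.out.println(filesCount(added, existing));       // prints 3
    System.out.println(sizeBytes(1024L, added, existing)); // prints 1324
  }
}
```

Under this reading, a task with many existing deletes is more expensive to run even though those deletes contribute no records to the changelog.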