Package org.apache.iceberg.parquet
Interface VectorizedReader<T>
All Known Implementing Classes:
BaseBatchReader, ColumnarBatchReader, VectorizedArrowReader, VectorizedArrowReader.ConstantVectorReader, VectorizedArrowReader.DeletedVectorReader
public interface VectorizedReader<T>
Interface for vectorized Iceberg readers.
-
Method Summary
Modifier and Type | Method | Description
void | close() | Release any resources allocated.
T | read(T reuse, int numRows) | Reads a batch of type T and of size numRows.
void | setBatchSize(int batchSize) |
default void | setRowGroupInfo(org.apache.parquet.column.page.PageReadStore pages, Map<org.apache.parquet.hadoop.metadata.ColumnPath, org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metadata) | Sets the row group information to be used with this reader.
void | setRowGroupInfo(org.apache.parquet.column.page.PageReadStore pages, Map<org.apache.parquet.hadoop.metadata.ColumnPath, org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metadata, long rowPosition) | Deprecated.
-
Method Details
-
read
T read(T reuse, int numRows)
Reads a batch of type T and of size numRows.
Parameters:
reuse - container for the last batch, to be reused for the next batch
numRows - number of rows to read
Returns:
batch of records of type T
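The reuse contract of read can be sketched as a batch loop. This is a minimal illustration only: BatchReader is a hypothetical stand-in that mirrors the documented method shapes without the Parquet types, and IntArrayReader and sumAll are invented for the example. None of them are part of Iceberg.

```java
class ReadLoopSketch {

  // Hypothetical stand-in mirroring read/setBatchSize/close as documented above.
  // It is NOT org.apache.iceberg.parquet.VectorizedReader.
  interface BatchReader<T> {
    T read(T reuse, int numRows);

    void setBatchSize(int batchSize);

    void close();
  }

  // Toy reader (assumption for illustration) that serves values from an in-memory array.
  static final class IntArrayReader implements BatchReader<int[]> {
    private final int[] source;
    private int position = 0;
    private int batchSize = 0;
    private boolean closed = false;

    IntArrayReader(int[] source) {
      this.source = source;
    }

    @Override
    public void setBatchSize(int batchSize) {
      this.batchSize = batchSize;
    }

    @Override
    public int[] read(int[] reuse, int numRows) {
      // Reuse the caller's container when it is large enough; otherwise allocate a new one.
      int[] batch = (reuse != null && reuse.length >= batchSize) ? reuse : new int[batchSize];
      System.arraycopy(source, position, batch, 0, numRows);
      position += numRows;
      return batch;
    }

    @Override
    public void close() {
      closed = true;
    }

    boolean isClosed() {
      return closed;
    }
  }

  // Drains the reader batch by batch and returns the sum of every value read.
  static long sumAll(IntArrayReader reader, int totalRows, int batchSize) {
    long sum = 0;
    try {
      reader.setBatchSize(batchSize);
      int[] batch = null; // no container to reuse on the first call
      int remaining = totalRows;
      while (remaining > 0) {
        int numRows = Math.min(batchSize, remaining);
        batch = reader.read(batch, numRows); // hand the previous batch back for reuse
        for (int i = 0; i < numRows; i++) {
          sum += batch[i];
        }
        remaining -= numRows;
      }
    } finally {
      reader.close(); // release resources even if a read fails
    }
    return sum;
  }

  public static void main(String[] args) {
    IntArrayReader reader = new IntArrayReader(new int[] {1, 2, 3, 4, 5, 6, 7});
    System.out.println(sumAll(reader, 7, 3));
  }
}
```

Passing the previously returned batch back as reuse lets a reader avoid allocating a new container on every call. Only the first numRows entries of the returned batch are meaningful in this sketch, since the final batch may be shorter than the batch size.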
-
setBatchSize
void setBatchSize(int batchSize)
-
setRowGroupInfo
@Deprecated void setRowGroupInfo(org.apache.parquet.column.page.PageReadStore pages, Map<org.apache.parquet.hadoop.metadata.ColumnPath, org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metadata, long rowPosition)
Deprecated. Since 1.8.0, will be removed in 1.9.0; use setRowGroupInfo(PageReadStore, Map) instead.
Sets the row group information to be used with this reader.
Parameters:
pages - row group information for all the columns
metadata - map of ColumnPath -> ColumnChunkMetaData for the row group
rowPosition - the row group's row offset in the Parquet file
-
setRowGroupInfo
default void setRowGroupInfo(org.apache.parquet.column.page.PageReadStore pages, Map<org.apache.parquet.hadoop.metadata.ColumnPath, org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metadata)
Sets the row group information to be used with this reader.
Parameters:
pages - row group information for all the columns
metadata - map of ColumnPath -> ColumnChunkMetaData for the row group
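One way an implementer might move from the deprecated three-argument overload to the two-argument one is sketched below. Everything here is hypothetical: Pages and the Map<String, String> metadata are placeholders for PageReadStore and the ColumnPath to ColumnChunkMetaData map, RowGroupAware only mirrors the two documented signatures, and the body of its default method is an assumption, since this page does not state what the real default does.

```java
import java.util.HashMap;
import java.util.Map;

class RowGroupInfoSketch {

  // Hypothetical placeholder for PageReadStore; it carries only a row count.
  static final class Pages {
    final long rowCount;

    Pages(long rowCount) {
      this.rowCount = rowCount;
    }
  }

  // Hypothetical stand-in mirroring the two documented setRowGroupInfo overloads.
  interface RowGroupAware {
    @Deprecated
    void setRowGroupInfo(Pages pages, Map<String, String> metadata, long rowPosition);

    // Assumption: this default body is illustrative, not the library's behavior.
    default void setRowGroupInfo(Pages pages, Map<String, String> metadata) {
      throw new UnsupportedOperationException("two-argument setRowGroupInfo is not implemented");
    }
  }

  // Reader that keeps its logic in the two-argument overload.
  static final class MigratedReader implements RowGroupAware {
    private long rowsInGroup = -1;
    private int columnCount = -1;

    @Override
    public void setRowGroupInfo(Pages pages, Map<String, String> metadata) {
      this.rowsInGroup = pages.rowCount;
      this.columnCount = metadata.size();
    }

    @Deprecated
    @Override
    public void setRowGroupInfo(Pages pages, Map<String, String> metadata, long rowPosition) {
      // This toy reader has no use for rowPosition, so it forwards to the replacement overload.
      setRowGroupInfo(pages, metadata);
    }

    long rowsInGroup() {
      return rowsInGroup;
    }

    int columnCount() {
      return columnCount;
    }
  }

  public static void main(String[] args) {
    Map<String, String> metadata = new HashMap<>();
    metadata.put("id", "chunk metadata for id");
    metadata.put("name", "chunk metadata for name");

    MigratedReader reader = new MigratedReader();
    reader.setRowGroupInfo(new Pages(100L), metadata);
    System.out.println(reader.rowsInGroup() + " rows, " + reader.columnCount() + " columns");
  }
}
```

Keeping the deprecated overload as a thin forwarder means existing callers continue to work while new callers use the two-argument form, and the forwarder can be deleted once the deprecated method is removed from the interface.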
-
close
void close()
Release any resources allocated.