Class BaseBatchReader<T>

java.lang.Object
org.apache.iceberg.arrow.vectorized.BaseBatchReader<T>
All Implemented Interfaces:
VectorizedReader<T>
Direct Known Subclasses:
ColumnarBatchReader

public abstract class BaseBatchReader<T> extends Object implements VectorizedReader<T>
A base BatchReader class that contains functionality common to batch readers such as ColumnarBatchReader.
  • Field Summary

    Fields
    Modifier and Type
    Field
    Description
    protected final VectorizedArrowReader[]
     
    protected final VectorHolder[]
     
  • Constructor Summary

    Constructors
    Modifier
    Constructor
    Description
    protected
     
  • Method Summary

    Modifier and Type
    Method
    Description
    void
    close()
    Release any resources allocated.
    protected void
    closeVectors()
     
    void
    setBatchSize(int batchSize)
     
    void
    setRowGroupInfo(org.apache.parquet.column.page.PageReadStore pageStore, Map<org.apache.parquet.hadoop.metadata.ColumnPath,org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metaData)
    Sets the row group information to be used with this reader
    void
    setRowGroupInfo(org.apache.parquet.column.page.PageReadStore pageStore, Map<org.apache.parquet.hadoop.metadata.ColumnPath,org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metaData, long rowPosition)
    Sets the row group information to be used with this reader

    Methods inherited from class java.lang.Object

    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

    Methods inherited from interface org.apache.iceberg.parquet.VectorizedReader

    read
  • Field Details

  • Constructor Details

  • Method Details

    • setRowGroupInfo

      public void setRowGroupInfo(org.apache.parquet.column.page.PageReadStore pageStore, Map<org.apache.parquet.hadoop.metadata.ColumnPath,org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metaData, long rowPosition)
      Description copied from interface: VectorizedReader
      Sets the row group information to be used with this reader
      Specified by:
      setRowGroupInfo in interface VectorizedReader<T>
      Parameters:
      pageStore - row group information for all the columns
      metaData - map of ColumnPath -> ColumnChunkMetaData for the row group
      rowPosition - the row group's row offset in the Parquet file
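The rowPosition parameter can be read as the number of rows that precede the row group in the file. The following is a minimal sketch of that arithmetic, not Iceberg code: the class RowPositionSketch and the method rowOffsets are hypothetical names introduced here, and the row counts are made-up sample values.

```java
import java.util.Arrays;

// Hypothetical helper, not part of Iceberg: illustrates what a row offset is.
class RowPositionSketch {
  // Returns the starting row offset of each row group, given per-group row counts.
  // The offset of a group is the sum of the row counts of all groups before it.
  static long[] rowOffsets(long[] rowCounts) {
    long[] offsets = new long[rowCounts.length];
    long next = 0L;
    for (int i = 0; i < rowCounts.length; i++) {
      offsets[i] = next;
      next += rowCounts[i];
    }
    return offsets;
  }

  public static void main(String[] args) {
    // Three row groups holding 100, 50, and 75 rows (sample values).
    long[] offsets = rowOffsets(new long[] {100L, 50L, 75L});
    System.out.println(Arrays.toString(offsets)); // prints [0, 100, 150]
  }
}
```

Under this reading, the second row group in the sample would be passed a rowPosition of 100.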
    • setRowGroupInfo

      public void setRowGroupInfo(org.apache.parquet.column.page.PageReadStore pageStore, Map<org.apache.parquet.hadoop.metadata.ColumnPath,org.apache.parquet.hadoop.metadata.ColumnChunkMetaData> metaData)
      Description copied from interface: VectorizedReader
      Sets the row group information to be used with this reader
      Specified by:
      setRowGroupInfo in interface VectorizedReader<T>
      Parameters:
      pageStore - row group information for all the columns
      metaData - map of ColumnPath -> ColumnChunkMetaData for the row group
    • closeVectors

      protected void closeVectors()
    • close

      public void close()
      Description copied from interface: VectorizedReader
      Release any resources allocated.
      Specified by:
      close in interface VectorizedReader<T>
    • setBatchSize

      public void setBatchSize(int batchSize)
      Specified by:
      setBatchSize in interface VectorizedReader<T>
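The methods on this page suggest a call order: set the batch size, supply row group information, read batches, then close. The sketch below models that order with stand-in types only. ReaderSketch, RowIdReaderSketch, and LifecycleSketch are hypothetical names, and the simplified setRowGroupInfo(long, long) and read(reuse, numRows) signatures are assumptions made for illustration; the real methods take a PageReadStore and a column metadata map.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in that mirrors the method names on this page; not the Iceberg interface.
interface ReaderSketch<T> extends AutoCloseable {
  void setBatchSize(int batchSize);
  void setRowGroupInfo(long rowCount, long rowPosition); // simplified signature (assumption)
  T read(T reuse, int numRows);
  @Override
  void close();
}

// Toy reader whose "batch" is the list of absolute row positions it covers.
class RowIdReaderSketch implements ReaderSketch<List<Long>> {
  private int batchSize;
  private long nextRow;
  private boolean closed;

  @Override
  public void setBatchSize(int batchSize) {
    if (batchSize <= 0) {
      throw new IllegalArgumentException("batchSize must be positive");
    }
    this.batchSize = batchSize;
  }

  @Override
  public void setRowGroupInfo(long rowCount, long rowPosition) {
    // Start numbering rows at the row group's offset within the file.
    this.nextRow = rowPosition;
  }

  @Override
  public List<Long> read(List<Long> reuse, int numRows) {
    if (closed) {
      throw new IllegalStateException("reader is closed");
    }
    List<Long> batch = reuse != null ? reuse : new ArrayList<>();
    batch.clear();
    for (int i = 0; i < numRows; i++) {
      batch.add(nextRow++);
    }
    return batch;
  }

  @Override
  public void close() {
    closed = true; // a real reader would release its vectors here
  }
}

class LifecycleSketch {
  // Drives one row group through the reader in batches of at most batchSize rows.
  static List<List<Long>> readRowGroup(
      RowIdReaderSketch reader, int batchSize, long rowCount, long rowPosition) {
    List<List<Long>> batches = new ArrayList<>();
    reader.setBatchSize(batchSize);
    reader.setRowGroupInfo(rowCount, rowPosition);
    long remaining = rowCount;
    while (remaining > 0) {
      int n = (int) Math.min(batchSize, remaining);
      batches.add(new ArrayList<>(reader.read(null, n)));
      remaining -= n;
    }
    return batches;
  }

  public static void main(String[] args) {
    // try-with-resources calls close() once reading is done.
    try (RowIdReaderSketch reader = new RowIdReaderSketch()) {
      System.out.println(readRowGroup(reader, 2, 5L, 100L));
      // prints [[100, 101], [102, 103], [104]]
    }
  }
}
```

The stand-in extends AutoCloseable only so the example can use try-with-resources; whether that fits a given reader is a design choice of this sketch.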