Class BaseVectorizedParquetValuesReader
java.lang.Object
org.apache.parquet.column.values.ValuesReader
org.apache.iceberg.arrow.vectorized.parquet.BaseVectorizedParquetValuesReader
- Direct Known Subclasses:
VectorizedDictionaryEncodedParquetValuesReader,VectorizedParquetDefinitionLevelReader
public class BaseVectorizedParquetValuesReader
extends org.apache.parquet.column.values.ValuesReader
A values reader for Parquet's run-length encoded data that reads column data in batches instead
of one value at a time. This is based off of the VectorizedRleValuesReader class in Apache Spark
with these changes:
Writes batches of values retrieved to Arrow vectors. If all pages of a column within the row group are not dictionary encoded, then dictionary ids are eagerly decoded into actual values before writing them to the Arrow vectors
-
Constructor Summary
ConstructorsConstructorDescriptionBaseVectorizedParquetValuesReader(int maxDefLevel, boolean setValidityVector) BaseVectorizedParquetValuesReader(int bitWidth, int maxDefLevel, boolean setValidityVector) BaseVectorizedParquetValuesReader(int bitWidth, int maxDefLevel, boolean readLength, boolean setValidityVector) -
Method Summary
Modifier and TypeMethodDescriptionvoidinitFromPage(int valueCount, org.apache.parquet.bytes.ByteBufferInputStream in) booleanintintvoidskip()Methods inherited from class org.apache.parquet.column.values.ValuesReader
getNextOffset, initFromPage, initFromPage, readBytes, readDouble, readFloat, readLong, skip, updateNextOffset
-
Constructor Details
-
BaseVectorizedParquetValuesReader
public BaseVectorizedParquetValuesReader(int maxDefLevel, boolean setValidityVector) -
BaseVectorizedParquetValuesReader
public BaseVectorizedParquetValuesReader(int bitWidth, int maxDefLevel, boolean setValidityVector) -
BaseVectorizedParquetValuesReader
public BaseVectorizedParquetValuesReader(int bitWidth, int maxDefLevel, boolean readLength, boolean setValidityVector)
-
-
Method Details
-
initFromPage
public void initFromPage(int valueCount, org.apache.parquet.bytes.ByteBufferInputStream in) throws IOException - Overrides:
initFromPagein classorg.apache.parquet.column.values.ValuesReader- Throws:
IOException
-
readBoolean
public boolean readBoolean()- Overrides:
readBooleanin classorg.apache.parquet.column.values.ValuesReader
-
skip
public void skip()- Specified by:
skipin classorg.apache.parquet.column.values.ValuesReader
-
readValueDictionaryId
public int readValueDictionaryId()- Overrides:
readValueDictionaryIdin classorg.apache.parquet.column.values.ValuesReader
-
readInteger
public int readInteger()- Overrides:
readIntegerin classorg.apache.parquet.column.values.ValuesReader
-