Package org.apache.iceberg.spark.data
Class SparkOrcReader
java.lang.Object
org.apache.iceberg.spark.data.SparkOrcReader
- All Implemented Interfaces:
- OrcRowReader<org.apache.spark.sql.catalyst.InternalRow>
public class SparkOrcReader
extends Object
implements OrcRowReader<org.apache.spark.sql.catalyst.InternalRow>
Converts the OrcIterator, which returns ORC's VectorizedRowBatch to a set of Spark's UnsafeRows.
 
It minimizes allocations by reusing most of the objects in the implementation.
- 
Constructor SummaryConstructorsConstructorDescriptionSparkOrcReader(Schema expectedSchema, org.apache.orc.TypeDescription readSchema) SparkOrcReader(Schema expectedSchema, org.apache.orc.TypeDescription readOrcSchema, Map<Integer, ?> idToConstant) 
- 
Method SummaryModifier and TypeMethodDescriptionorg.apache.spark.sql.catalyst.InternalRowread(org.apache.orc.storage.ql.exec.vector.VectorizedRowBatch batch, int row) Reads a row.voidsetBatchContext(long batchOffsetInFile) 
- 
Constructor Details- 
SparkOrcReader
- 
SparkOrcReader
 
- 
- 
Method Details- 
readpublic org.apache.spark.sql.catalyst.InternalRow read(org.apache.orc.storage.ql.exec.vector.VectorizedRowBatch batch, int row) Description copied from interface:OrcRowReaderReads a row.- Specified by:
- readin interface- OrcRowReader<org.apache.spark.sql.catalyst.InternalRow>
 
- 
setBatchContextpublic void setBatchContext(long batchOffsetInFile) - Specified by:
- setBatchContextin interface- OrcRowReader<org.apache.spark.sql.catalyst.InternalRow>
 
 
-