Class ParquetUtil


  • public class ParquetUtil
    extends java.lang.Object
    • Method Detail

      • footerMetrics

        public static Metrics footerMetrics​(org.apache.parquet.hadoop.metadata.ParquetMetadata metadata,
                                            java.util.stream.Stream<FieldMetrics<?>> fieldMetrics,
                                            MetricsConfig metricsConfig)
      • footerMetrics

        public static Metrics footerMetrics​(org.apache.parquet.hadoop.metadata.ParquetMetadata metadata,
                                            java.util.stream.Stream<FieldMetrics<?>> fieldMetrics,
                                            MetricsConfig metricsConfig,
                                            NameMapping nameMapping)
      • getSplitOffsets

        public static java.util.List<java.lang.Long> getSplitOffsets​(org.apache.parquet.hadoop.metadata.ParquetMetadata md)
        Returns a list of offsets in ascending order determined by the starting position of the row groups.
      • hasNonDictionaryPages

        public static boolean hasNonDictionaryPages​(org.apache.parquet.hadoop.metadata.ColumnChunkMetaData meta)
      • hasNoBloomFilterPages

        public static boolean hasNoBloomFilterPages​(org.apache.parquet.hadoop.metadata.ColumnChunkMetaData meta)
      • readDictionary

        public static org.apache.parquet.column.Dictionary readDictionary​(org.apache.parquet.column.ColumnDescriptor desc,
                                                                          org.apache.parquet.column.page.PageReader pageSource)
      • isIntType

        public static boolean isIntType​(org.apache.parquet.schema.PrimitiveType primitiveType)
      • extractTimestampInt96

        public static long extractTimestampInt96​(java.nio.ByteBuffer buffer)
        Method to read timestamp (parquet Int96) from bytebuffer. Read 12 bytes in byteBuffer: 8 bytes (time of day nanos) + 4 bytes(julianDay)