Skip to content

BIT_PACKING is written by default when Definition or Repetition levels are empty #3307

@willtemperley

Description

@willtemperley

As discussed in apache/arrow#47113 , this package is writing the deprecated BIT_PACKING encoding to the column metadata and the DataPageHeader. This is only seems to happens when rep/def levels are empty. I think this is where it's happening:

public class DevNullValuesWriter extends ValuesWriter {
  ....
  public Encoding getEncoding() {
    return BIT_PACKED;
  }
}

Wouldn't it make sense to just return RLE?

I've seen this in parquet-java 1.15.1 and Overture Maps files using 1.13.1.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions