[discussion] dealing with multiple pyarrow versions #2209

@kevinjqliu

Feature Request / Improvement

Across multiple issues and PRs, I've seen cases where we depend on a specific version of pyarrow.

  • Some features are only available after a certain pyarrow version.
  • Some bugs might only be resolved after a certain pyarrow version.

The easiest solution is to bump the minimum pyarrow version we depend on. But since this library is often used in diverse Python environments, users may need to run it against many different versions of pyarrow.

For example, we bumped the minimum pyarrow version to 17.0.0 in #1621 in order to support filtering pa.RecordBatch directly. The previous solution still works, albeit less efficiently, for pyarrow versions lower than 17.0.0.
I don't have a clear solution, but maybe we can decorate certain functions to branch depending on the installed pyarrow version.

Would love to hear others' thoughts on this!
