Skip to content

Conversation

@zhengruifeng
Copy link
Contributor

What changes were proposed in this pull request?

Support Scalar Arrow Iterator UDF

Why are the changes needed?

to make Arrow UDF suitable for expansive initialization within each partition

Does this PR introduce any user-facing change?

Not yet, the whole Arrow UDF is private for now

How was this patch tested?

New Tests

Was this patch authored or co-authored using generative AI tooling?

No

empty
@xinrong-meng
Copy link
Member

LGTM thank you!

@zhengruifeng
Copy link
Contributor Author

thanks, merged to master

@zhengruifeng zhengruifeng deleted the py_arrow_udf_iter branch June 3, 2025 01:25
yhuang-db pushed a commit to yhuang-db/spark that referenced this pull request Jun 9, 2025
### What changes were proposed in this pull request?
Support Scalar Arrow Iterator UDF

### Why are the changes needed?
to make Arrow UDF suitable for expansive initialization within each partition

### Does this PR introduce _any_ user-facing change?
Not yet, the whole Arrow UDF is private for now

### How was this patch tested?
New Tests

### Was this patch authored or co-authored using generative AI tooling?
No

Closes apache#51018 from zhengruifeng/py_arrow_udf_iter.

Authored-by: Ruifeng Zheng <[email protected]>
Signed-off-by: Ruifeng Zheng <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants