[FEA] Support polars.Expr.str.find_many in cudf-polars #18994

Open
brandon-b-miller opened this issue May 28, 2025 · 1 comment
Labels
cudf.polars Issues specific to cudf.polars feature request New feature or request

Comments

@brandon-b-miller
Contributor

Is your feature request related to a problem? Please describe.
As of branch-25.08, cudf-polars does not support polars.Expr.str.find_many. Running the example from the Polars docs with the GPU engine fails:

import polars as pl
engine = pl.GPUEngine(raise_on_fail=True)

df = pl.DataFrame(
    {
        "values": ["discontent", "rhapsody"],
        "patterns": [
            ["winter", "disco", "onte", "discontent"],
            ["rhap", "ody", "coalesce"],
        ],
    }
).lazy()
df = df.select(pl.col("values").str.find_many("patterns"))


res = df.collect(engine=engine)
print(res)

NotImplementedError: find_many'

Describe the solution you'd like
I'd like the above code to execute on the Polars GPU backend. For each row, this API returns a list of the starting positions of every pattern match found in the string. This seems to be a bit of a mix of cudf::strings::findall and cudf::strings::find_re, and it is also blocked on being able to return a list[str] dtype.
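
To make the expected result concrete, here is a rough pure-Python sketch of the row-wise, non-overlapping behaviour described above. It is not the Polars or libcudf implementation. The name `find_many_reference` is hypothetical, and the matching rule (take the first listed pattern that matches at the leftmost position, then skip past it) is a simplifying assumption that may differ from the engine's tie-breaking in edge cases. It also indexes by Python character position, which only coincides with byte offsets for ASCII input.

```python
from __future__ import annotations

from typing import Optional


def find_many_reference(
    values: list[Optional[str]],
    patterns: list[Optional[list[str]]],
) -> list[Optional[list[int]]]:
    """Hypothetical CPU reference: start positions of non-overlapping matches, per row."""
    out: list[Optional[list[int]]] = []
    for value, row_patterns in zip(values, patterns):
        # Assumption: a null on either side produces a null result row.
        if value is None or row_patterns is None:
            out.append(None)
            continue
        starts: list[int] = []
        i = 0
        while i < len(value):
            # First non-empty pattern (in list order) that matches at position i.
            hit = next(
                (p for p in row_patterns if p and value.startswith(p, i)),
                None,
            )
            if hit is None:
                i += 1
            else:
                starts.append(i)
                i += len(hit)  # skip past the match so results never overlap
        out.append(starts)
    return out


result = find_many_reference(
    ["discontent", "rhapsody"],
    [
        ["winter", "disco", "onte", "discontent"],
        ["rhap", "ody", "coalesce"],
    ],
)
print(result)  # [[0], [0, 5]]
```

A helper like this could serve as a CPU oracle in tests for a GPU implementation, provided the tie-breaking assumption is first checked against the real Polars output.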

Describe alternatives you've considered
N/A

Additional context
#16480

@brandon-b-miller brandon-b-miller added the feature request New feature or request label May 28, 2025
@davidwendt
Contributor

@wence- wence- added the cudf.polars Issues specific to cudf.polars label May 28, 2025