fix(rust): Fix cum_min and cum_max does not preserve inf or -inf values at series start #22896

Athsus · 2025-05-23T04:17:00Z

What’s Changed

Improved cum_min / cum_max handling of floating-point values, including NaN.

This update refines the following:

Value initialization in cum_max_numeric and cum_min_numeric.
It now checks the type: if it is a float, the starting value for the calculation is NaN. Otherwise, it uses the type's minimum possible value for cum_max and its maximum possible value for cum_min.
Correct NaN handling in det_min and det_max.
They now compare values with MinMax::min_ignore_nan and MinMax::max_ignore_nan. For example, if a series starts with NaN, the result also starts with NaN, as expected.
New cum_scan_numeric function.
A new helper function that handles the common scanning logic.
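
To illustrate why the starting value matters, here is a minimal pure-Python model of the scan. It is not the Rust implementation: max_ignore_nan and cum_max below are hypothetical stand-ins that only mimic the behavior described above, and the finite and -inf starting values are assumed alternatives shown for comparison.

```python
import math
import sys


def max_ignore_nan(a, b):
    """Return the larger of a and b, treating NaN as 'no value yet'."""
    if math.isnan(a):
        return b
    if math.isnan(b):
        return a
    return max(a, b)


def cum_max(values, init):
    """Cumulative max over floats; None (null) passes through unchanged."""
    state = init
    out = []
    for v in values:
        if v is None:
            out.append(None)  # nulls do not update the running state
        else:
            state = max_ignore_nan(state, v)
            out.append(state)
    return out


neg_inf = float("-inf")
nan = float("nan")

# A finite sentinel (assumed here for illustration) swallows a leading -inf.
print(cum_max([neg_inf, 0.0, 1.0], init=-sys.float_info.max))

# A -inf start keeps -inf but replaces a leading NaN with -inf.
print(cum_max([nan, 1.0], init=neg_inf))  # [-inf, 1.0]

# A NaN start preserves both a leading -inf and a leading NaN.
print(cum_max([neg_inf, 0.0, 1.0], init=nan))  # [-inf, 0.0, 1.0]
print(cum_max([nan, 1.0], init=nan))  # [nan, 1.0]
```

In this model NaN acts as a neutral starting state: the first real value always replaces it, so no value at the start of the series can be lost to the sentinel.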

Added Test

import polars as pl
from polars.testing import assert_series_equal


def test_cum_agg_with_infs() -> None:
    # confirm that inf values are handled correctly
    s = pl.Series([float("inf"), 0.0, 1.0])
    assert_series_equal(s.cum_min(), pl.Series([float("inf"), 0.0, 0.0]))

    s = pl.Series([float("-inf"), 0.0, 1.0])
    assert_series_equal(s.cum_max(), pl.Series([float("-inf"), 0.0, 1.0]))

Closes #22855.

codecov · 2025-05-23T04:28:45Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 80.31%. Comparing base (f230f12) to head (cc01fb0).
Report is 9 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main   #22896      +/-   ##
==========================================
- Coverage   80.61%   80.31%   -0.31%     
==========================================
  Files        1677     1682       +5     
  Lines      222297   223181     +884     
  Branches     2801     2804       +3     
==========================================
+ Hits       179205   179248      +43     
- Misses      42425    43264     +839     
- Partials      667      669       +2

Athsus · 2025-05-24T12:33:46Z

Hi @orlp, please review and suggest any changes if needed.

orlp · 2025-05-26T13:12:36Z

You can save a lot of repetition by defining

fn cum_scan_numeric<T, F>(ca: &ChunkedArray<T>, reverse: bool, init: T::Native, update: F) -> ChunkedArray<T>
where
    T: PolarsNumericType,
    ChunkedArray<T>: FromIterator<Option<T::Native>>,
    F: Fn(&mut T::Native, Option<T::Native>) -> Option<Option<T::Native>>
{
    let out: ChunkedArray<T> = match reverse {
        false => ca.iter().scan(init, update).collect_trusted(),
        true => ca.iter().rev().scan(init, update).collect_reversed(),
    };
    out.with_name(ca.name().clone())
}

and then using

let init = if <$T>::is_float() { <$T>::nan_value() } else { <$T>::min_value() };
cum_scan_numeric(ca, reverse, init, det_max)

and similarly for the minimum in the dispatch.

Note that your current init value for floats is incorrect: it needs to be NaN. As a test case, try adding a series with NaN as the first element.
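
The helper above can be modeled in plain Python to show how the reverse flag works: scan from the end, then flip the collected output back into the original order. This is a sketch, not the Polars code. cum_scan and det_max here are hypothetical Python stand-ins, update returns a (new_state, emitted) pair instead of mutating the state in place, and the i64 minimum is an assumed example of an integer starting value.

```python
def cum_scan(values, reverse, init, update):
    """Run a stateful scan forwards or backwards over a list.

    update(state, v) must return (new_state, emitted_value).
    """
    seq = reversed(values) if reverse else iter(values)
    state = init
    out = []
    for v in seq:
        state, emitted = update(state, v)
        out.append(emitted)
    if reverse:
        out.reverse()  # restore the original element order
    return out


def det_max(state, v):
    """Integer running max; a null (None) is emitted as-is and skipped."""
    if v is None:
        return state, None
    state = max(state, v)
    return state, state


I64_MIN = -(2**63)  # assumed stand-in for an integer type's min_value()
data = [3, 1, None, 4, 2]

print(cum_scan(data, False, I64_MIN, det_max))  # [3, 3, None, 4, 4]
print(cum_scan(data, True, I64_MIN, det_max))   # [4, 4, None, 4, 2]
```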

orlp

See above.

Athsus · 2025-05-27T09:21:47Z

Thanks! That's very helpful. I've made some new changes on it. @orlp

Athsus · 2025-05-27T09:33:38Z

Wait, I missed applying cum_scan_numeric to cum_sum_numeric and cum_prod_numeric.
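
As a rough illustration of how sum and product fit the same scan shape, the sketch below uses the identity elements 0 and 1 as starting values. It is a pure-Python model with hypothetical names (cum_scan, cum_sum, cum_prod); the actual starting values and null handling in cum_sum_numeric and cum_prod_numeric are assumptions here.

```python
import operator


def cum_scan(values, init, combine):
    """Fold values left to right, emitting the running state at each step."""
    state = init
    out = []
    for v in values:
        if v is None:
            out.append(None)  # assumed: nulls pass through, state unchanged
        else:
            state = combine(state, v)
            out.append(state)
    return out


def cum_sum(values):
    # 0 is the identity for addition, so the first value is emitted as-is.
    return cum_scan(values, 0, operator.add)


def cum_prod(values):
    # 1 is the identity for multiplication.
    return cum_scan(values, 1, operator.mul)


print(cum_sum([1, 2, None, 3]))   # [1, 3, None, 6]
print(cum_prod([1, 2, None, 3]))  # [1, 2, None, 6]
```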

Athsus · 2025-05-27T11:05:02Z

Hi @orlp, I've pushed new commits.
The CI fails, but I don't think my changes are the cause. Since I don't have permission to re-run it, could you re-run it if that's possible and necessary?

crates/polars-ops/src/series/ops/cum_agg.rs

orlp · 2025-05-27T19:14:33Z

Thank you for helping out :)

Athsus · 2025-05-28T05:32:41Z

All good! Much appreciated.

Athsus added 2 commits May 23, 2025 00:57

fix: Issue-22855 cum_min and cum_max does not preserve inf or -inf values at series start

65df46b

add: test on Series.cum_agg with infs

ce47399

github-actions bot added fix Bug fix rust Related to Rust Polars labels May 23, 2025

Athsus marked this pull request as ready for review May 24, 2025 12:32

Athsus requested review from MarcoGorelli, alexander-beedie, c-peters, orlp, reswqa and ritchie46 as code owners May 24, 2025 12:32

orlp self-assigned this May 26, 2025

orlp requested changes May 26, 2025

Athsus added 4 commits May 27, 2025 18:02

refactor: Simplify cumulative aggregation functions, use T::Native::is_float() to simplify dispatch.

57a8c8b

Merge remote-tracking branch 'upstream/main' into fix-22855

ac593d1

refactor: Remove unused Float import from cumulative aggregation module

33fc58c

refactor: Format and clean up code in cumulative aggregation functions, as fmt suggests.

40db697

refactor: Replace manual cumulative aggregation logic in prod and sum with cum_scan_numeric.

b0a2c59

Athsus requested a review from orlp May 27, 2025 11:05

orlp requested changes May 27, 2025

crates/polars-ops/src/series/ops/cum_agg.rs (outdated, resolved)

Athsus added 2 commits May 27, 2025 23:02

refactor: Minimize trait bounds on cum_agg.rs

4708793

refactor: Remove unused Add and MultiAssign

cc01fb0

orlp approved these changes May 27, 2025

orlp merged commit 0e0ab52 into pola-rs:main May 27, 2025
28 checks passed
