
test: Accumulate CPU half-precision sums in float32 #3488

Open
sofinvalery wants to merge 1 commit into ml-explore:main from sofinvalery:sum-float-accum

Conversation

@sofinvalery (Contributor)

Proposed changes

Accumulate CPU float16 and bfloat16 sum reductions in float32, while preserving the output dtype. This fixes precision loss in ops that use sum().

Fixes #3326.
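
For context, here is a minimal sketch of the precision loss in question (illustrative; the exact pre-fix result depends on the backend's reduction order):

```python
import mlx.core as mx

# float16 has a 10-bit mantissa: once a running float16 sum reaches
# 2048, its spacing (ULP) is 2.0, so adding 1.0 no longer changes it.
# A strictly sequential float16 accumulator would stall at 2048 here;
# accumulating in float32 recovers the exact answer.
x = mx.ones(4096, dtype=mx.float16)
print(x.sum())  # expected 4096, with the output dtype still float16
```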

Checklist

Put an x in the boxes that apply.

  • I have read the CONTRIBUTING document
  • I have run pre-commit run --all-files to format my code / installed pre-commit prior to committing changes
  • I have added tests that prove my fix is effective or that my feature works
  • I have updated the necessary documentation (if needed)

@sofinvalery (Contributor, Author)

Made it inline. Feels cleaner.

@zcbenz (Collaborator)


Allocating a new array would carry a heavy performance penalty; the correct way would be to refactor strided_reduce/contiguous_reduce to accumulate in float32 rather than in the output type.

@sofinvalery (Contributor, Author)

Refactored strided_reduce/contiguous_reduce to support accumulation in a separate type.
float16/bfloat16 sums now accumulate in float32 without allocating a temporary array.
Also added tests covering reductions over the full array, contiguous axes, and strided axes.
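
Illustratively, the three reduction paths those tests target (hypothetical shapes; the mapping of axes to kernel paths is my assumption, not taken from the diff):

```python
import mlx.core as mx

x = mx.random.uniform(shape=(7, 33, 129)).astype(mx.float16)

full = x.sum()          # reduce over every element
inner = x.sum(axis=2)   # innermost axis, contiguous in memory
outer = x.sum(axis=0)   # outer axis, strided access pattern
```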

Review comment threads on mlx/backend/cpu/reduce.cpp (two marked outdated)
@angeloskath (Member)

I don't think we should merge this, to be honest. It is expected for floats that you get the accumulation precision of the dtype you are working in.

I am not against choosing higher-precision temporaries internally where possible, but given the current implementation it is actually faster to cast to float32 and then sum (I haven't tried it, but it almost certainly will be).
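
At the Python level, the cast-then-sum route described here would look like the following (a sketch; note it materializes the float32 temporary that zcbenz flagged earlier):

```python
import mlx.core as mx

x = mx.random.uniform(shape=(1 << 20,)).astype(mx.float16)

# Cast up, reduce in float32, then cast the scalar result back down.
y = x.astype(mx.float32).sum().astype(mx.float16)
```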

For the test, by the way, the correct fix is to simply upcast the random numbers before computing their average; we want to test random, after all, not sum.
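
A sketch of that test fix, under assumed names and tolerances (the actual test in the MLX suite may differ):

```python
import mlx.core as mx

key = mx.random.key(0)
samples = mx.random.uniform(shape=(100_000,), dtype=mx.float16, key=key)

# Upcast before averaging so the assertion exercises the random
# number generator, not the precision of a half-precision sum.
mean = samples.astype(mx.float32).mean().item()
assert abs(mean - 0.5) < 0.01
```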

@sofinvalery (Contributor, Author)

Updated the test only. Thanks for the feedback! @zcbenz @angeloskath

@zcbenz (Collaborator)


Thanks for fixing the test! Can you also update .github/actions/test-windows/action.yml to enable the test in CI?

@zcbenz changed the title from "Accumulate CPU half-precision sums in float32" to "test: Accumulate CPU half-precision sums in float32" on May 9, 2026


Development

Successfully merging this pull request may close these issues:

  • "test random uniform" fails for CPU backend

3 participants