The following example shows that `.==` produces incorrect gradients when executed on the GPU within a gradient computation:
```julia
using Metal, CUDA
using Flux

device = gpu_device()
# device = cpu_device()

f = x -> begin
    y = [0, 1, 2] |> device
    mask = y .== 1
    return sum(x[mask])
end

x = Float32[1, 2, 3] |> device
grad = Flux.gradient(f, x)  # should be [0.0, 1.0, 0.0], got [0.0, 0.0, 0.0]
```
Per this discussion, this is due to the specific way broadcasted functions are differentiated on the GPU using ForwardDiff.
The problem can be avoided by replacing

```julia
mask = y .== 1
```

with

```julia
mask = Flux.@ignore_derivatives y .== 1
```
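Applied to the full example above, the workaround looks like this (same setup as before; `@ignore_derivatives` hides the `.==` broadcast from the AD so the mask is treated as a constant):

```julia
using Metal, CUDA
using Flux

device = gpu_device()

f = x -> begin
    y = [0, 1, 2] |> device
    # Mask is computed outside the differentiated graph, so the GPU
    # ForwardDiff broadcast path is never invoked for the comparison.
    mask = Flux.@ignore_derivatives y .== 1
    return sum(x[mask])
end

x = Float32[1, 2, 3] |> device
grad = Flux.gradient(f, x)  # now gives [0.0, 1.0, 0.0] on GPU as well
```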
The problem seems to be automatically circumvented on CPU:

Zygote.jl/src/lib/broadcast.jl, lines 206 to 211 at 1b914d9:

```julia
@adjoint broadcasted(::AbstractArrayStyle, f::F, args...) where {F} = _broadcast_generic(__context__, f, args...)

@inline function _broadcast_generic(__context__, f::F, args...) where {F}
    T = Broadcast.combine_eltypes(f, args)
    # Avoid generic broadcasting in two easy cases:
    if T == Bool
        return (f.(args...), _ -> nothing)
```
but not on GPU:

Zygote.jl/src/lib/broadcast.jl, lines 359 to 363 at 1b914d9:

```julia
# Ordinary broadcasting calls broadcast_forward anyway when certain its' safe,
# so perhaps this can be deleted? Possible edge case here:
# https://github.com/FluxML/Zygote.jl/pull/1018#issuecomment-873629415
@adjoint broadcasted(::AbstractGPUArrayStyle, f, args...) =
    broadcast_forward(f, args...)
```
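For comparison, the CPU fast path can be checked without any GPU: because the broadcast's combined eltype is `Bool`, the adjoint returns `nothing` for the mask and the gradient flows only through `x` (a minimal sketch using plain Zygote, no Flux or GPU packages required):

```julia
using Zygote

function f(x)
    y = [0, 1, 2]
    # Bool-eltype broadcast: the CPU adjoint above short-circuits here
    # and treats the mask as non-differentiable.
    mask = y .== 1
    return sum(x[mask])
end

Zygote.gradient(f, Float32[1, 2, 3])  # ([0.0, 1.0, 0.0],) — the correct result
```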
Because this unexpected behavior is difficult to spot, it may be worth treating as a bug that warrants fixing.