See https://github.com/gpuweb/gpuweb/pull/5141 This requires basic testing: - that all invocations get the same value from the atomic - that it passes uniformity checks (the load is treated as uniform) Ideally we would also modify the memory model tests to stress test the case. (That would likely be on me, as I'm the maintainer of those tests)