Complex Backend #3608
Conversation
- Add Complex as first-class TensorKind alongside Float, Int, Bool
- Add ComplexTensorPrimitive and ComplexElem to Backend trait
- Add complex tensor type aliases and exports

- Add ComplexTensorPrimitive support to NdArray backend
- Implement complex arithmetic and transcendental functions in NdArray backend
- Add autodiff backend wrapper for complex tensors
- Begin enabling support across backend ecosystem

- Add high-level Tensor<B, D, Complex> API with BasicOps and Numeric traits
- Add complex-specific methods: conj(), real(), imag(), magnitude(), phase()
- Add creation utilities: from_parts(), from_polar(), zeros(), ones()
- Start adding test suite covering operations

- Remove non-existent testgen_complex!() macro call that was causing compilation errors
- Add ComplexTensorOps implementations for all backends (tch, candle, cubecl, fusion, router)
- Fix complex tensor assertion logic in CubeCL backend to avoid Float trait requirements
- Add missing transcendental functions (exp, log, sin, cos, tan, sqrt, powc) to Complex tensor API
@laggui
Alternatively, if I make some more stuff in burn-tensor public (such as kind), then I can lift almost everything out of burn-tensor.
laggui
left a comment
@skewballfox for now, I think we should treat the complex backend as an extension. Thus, almost all types, traits, and impls should live only in the burn-complex extension at this time.
The only part that should be added to burn-tensor is the DType variant, since we don't currently have a way to add/support custom dtypes anyway.
I believe the rest can easily live as an extension in a separate crate.
```rust
pub type ComplexTensor<B> = <B as ComplexTensorBackend>::ComplexTensorPrimitive;

pub trait ComplexTensorBackend: Backend {
    /// The inner backend type.
    type InnerBackend: Backend<Device = Self::Device, FloatElem = Self::FloatElem>;
    /// Tensor primitive to be used for all complex operations.
    type ComplexTensorPrimitive: TensorMetadata + 'static;

    /// Returns the real part of a complex tensor.
    fn real(tensor: ComplexTensor<Self>) -> FloatTensor<Self::InnerBackend>;
    /// Returns the imaginary part of a complex tensor.
    fn imag(tensor: ComplexTensor<Self>) -> FloatTensor<Self::InnerBackend>;
    fn to_complex(tensor: FloatTensor<Self::InnerBackend>) -> ComplexTensor<Self>;
}

/// A type-level representation of the kind of a complex tensor.
#[derive(Clone, Debug)]
pub struct Complex;

impl<B: ComplexTensorBackend> TensorKind<B> for Complex {
    type Primitive = B::ComplexTensorPrimitive;

    fn name() -> &'static str {
        "Complex"
    }
}
```

You can easily implement the tensor ops traits for the Complex type, e.g.
```rust
impl<B: ComplexTensorBackend> BasicOps<B> for Complex {
    // ...
}
```

For the element type, I think the make_elem! macro will not work because ToElement does not have to_complex, but that is fine. The macro was mostly there to avoid repeating the implementation; we can either implement the Element trait manually for these types or make the macro a bit more flexible for custom external types. Maybe that will require adding ToComplex (in the complex crate) and implementing it for types that implement ToElement, so we can convert types to/from complex.
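To make the ToComplex idea concrete, here is a minimal, self-contained sketch of what such a conversion trait could look like. Complex32 and the Into<f64> bound are illustrative stand-ins, not burn's actual ToElement machinery:

```rust
/// Stand-in complex scalar for the sketch.
#[derive(Clone, Copy, Debug, PartialEq)]
pub struct Complex32 {
    pub re: f32,
    pub im: f32,
}

/// Conversion into complex, living in the complex crate.
pub trait ToComplex {
    fn to_complex32(self) -> Complex32;
}

// Blanket impl: any real scalar becomes a complex number with a zero
// imaginary part. A real version would bound on ToElement instead.
impl<T: Into<f64>> ToComplex for T {
    fn to_complex32(self) -> Complex32 {
        Complex32 {
            re: self.into() as f32,
            im: 0.0,
        }
    }
}

fn main() {
    assert_eq!(3u8.to_complex32(), Complex32 { re: 3.0, im: 0.0 });
}
```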
Then, for the concrete implementations we can have feature flags similar to burn-vision
#[cfg(feature = "ndarray")]
mod ndarray {
use crate::ComplexTensorBackend;
use burn_ndarray::{
FloatNdArrayElement, IntNdArrayElement, NdArray, NdArrayTensorFloat, QuantElement,
};
impl<E: FloatNdArrayElement, I: IntNdArrayElement, Q: QuantElement> ComplexTensorBackend
for NdArray<E, I, Q>
{
// ...
}
}So we can limit the extension incrementally.
Eventually, we might move it as a core feature/trait, but I believe starting as an extension is the right approach to limit the scope.
CC'ing @nathanielsimard in case you have other thoughts on this subject.
@laggui I'm currently running into issues with Complex{32,64} needing to implement the Element trait for BasicOps. I can't implement ElementConversion and a few others I've duplicated in burn-complex. I think moving the complex element into burn-tensor and leaving everything else in burn-complex is probably the right approach, but I'm open to other suggestions. An alternative is to create a new trait that would be shared by Element and ComplexElement and make that the requirement for BasicOps, but that seems like the wrong approach to take.
This PR has been marked as stale because it has not been updated for over a month.
Still working on this. Just had a busy few weekends. I'm hoping to work on this a little this weekend.
@laggui I think I have a full implementation for split tensors and interleaved for burn-flex; I'm currently trying to rework the test Prakash started. I'm having trouble getting one specific method to work with the types I was using to refer to the inner backend's int tensor primitive.

While it's not quite ready, the only things I think are left for a first pass are to fix the issue with the ambiguous/mismatched associated type blocking powi and to write the tests for both the interleaved and complex versions. If you want to start reviewing, those last few pieces should be done next weekend or the one after that.
@skewballfox sounds good, will try to review this on Wednesday 🙏 |
antimora
left a comment
Took a pass over the diff. Great progress on the scope, and the decorator direction is the right call. A few things grouped by theme so the inline comments aren't swimming in context.
Design (responding to the goals in the PR description)
Goal 3 (ComplexBackend is not a supertrait of Backend) is satisfied syntactically, but in practice BasicOps / Numeric for ComplexKind in base.rs require C: ComplexTensorBackend<InnerBackend = C> + Backend wherever they're used through Tensor<B, Complex>. That effectively excludes SplitBackend<B>, which is PhantomData<B> and never implements Backend. So the "any backend via SplitBackend" story doesn't flow to the public Tensor surface today. Worth deciding whether that constraint belongs on the trait itself or whether SplitBackend needs to implement Backend (even as a thin passthrough).
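To make the constraint problem and the thin-passthrough option concrete, here is a self-contained toy with stand-in traits; burn's real Backend carries many more associated items, so this is shape only:

```rust
use std::marker::PhantomData;

trait Backend {
    type FloatTensorPrimitive;
}

trait ComplexTensorBackend {
    type InnerBackend: Backend;
}

// The decorator: a pure type-level wrapper with no Backend impl of its own.
struct SplitBackend<B: Backend>(PhantomData<B>);

impl<B: Backend> ComplexTensorBackend for SplitBackend<B> {
    type InnerBackend = B;
}

// A bound shaped like the one in base.rs demands the complex backend itself
// be a Backend, which SplitBackend<B> only satisfies if it gets a
// passthrough impl like this one.
impl<B: Backend> Backend for SplitBackend<B> {
    type FloatTensorPrimitive = B::FloatTensorPrimitive;
}

fn usable_from_tensor_api<C: ComplexTensorBackend + Backend>() {}

fn main() {
    struct Nd;
    impl Backend for Nd {
        type FloatTensorPrimitive = Vec<f32>;
    }
    // Compiles only because of the passthrough impl above.
    usable_from_tensor_api::<SplitBackend<Nd>>();
}
```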
The Layout trait is currently carrying one associated type and no methods; DefaultComplexOps ends up with two near-duplicate impls for the two layouts. Until there's a layout-specialized op that actually needs the split (butterfly/FFT), it might be worth folding Layout into ComplexTensorBackend and reintroducing it when a concrete need shows up. As it stands the separation is paying ceremonial cost without a payoff yet.
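A minimal sketch of what that folding could look like, with stand-in names rather than the PR's actual code:

```rust
/// Type-level layout markers.
struct Interleaved;
struct Split;

trait ComplexTensorBackend {
    /// Layout folded in as a plain associated type; a separate Layout trait
    /// only returns once an op genuinely dispatches on it (e.g. FFT butterflies).
    type Layout;
    type ComplexTensorPrimitive: 'static;
}
```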
There are now two parallel Complex types: burn_tensor::Complex32/64 (concrete, derived Pod) and burn_complex::Complex<E> (generic, hand-rolled unsafe impl Pod). They're sound, but the duplication comes back to bite in the element/dtype traits (the ToComplex* vs ToElement split, ElementLimits on Complex<E>, etc.). Picking one and letting DType::Complex32/64 map through it would cut a lot of surface area.
Backing laggui's earlier recommendation: leaving only the DType variant in burn-tensor and pulling the rest into burn-complex still looks right. ComplexKind / BasicOps / Numeric impls currently living in burn-tensor are the main bit keeping the crate boundary fuzzy.
Correctness hotspots
A few real bugs below (inline). The big one is complex_into_imag_data in burn-flex calling the real-extraction helper. There are also several todo!() arms reachable from normal flows (safetensors save, TensorData::assert_approx_eq, fusion IR, Numeric::powi, complex_powf when rhs dtype doesn't match) which would panic rather than fail gracefully.
Tests
The 908-line tests/ops.rs doesn't currently run in any harness: the #[testgen(complex)] attribute is commented out, the export-tests feature is commented out in Cargo.toml, and TestBackend is never defined. cargo test -p burn-complex runs only the four small unit tests in base/element.rs. Worth wiring this up first, because most of the bugs below are the kind a half-decent test suite catches. Also: the SplitBackend code path (~900 LOC) has no tests at all, so the layout-independence claim is unverified end-to-end. Conversion helpers in utils.rs have no tests either, which is exactly where the imag bug came from.
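For reference, the minimal wiring is roughly the following; the feature gate and the f32 NdArray choice are assumptions, not requirements:

```rust
// In burn-complex, gated so the ops tests run against a concrete backend.
#[cfg(all(test, feature = "ndarray"))]
mod tests {
    /// The backend tests/ops.rs would exercise.
    pub type TestBackend = burn_ndarray::NdArray<f32>;
}
```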
Minor / housekeeping
Commented-out stubs in router/fusion/cubecl/remote leave a false impression those backends work with complex. Consider deleting them or gating behind a clearly-unfinished marker. base.rs also has a fair amount of author-scratch comments (//Note:, //TODO: without issue links, a handful of commented-out methods) that would be worth cleaning before a real review pass.
Happy to go deeper on any of these.
```rust
fn complex_exp(tensor: ComplexTensor<SplitBackend<B>>) -> ComplexTensor<SplitBackend<B>> {
    // formula: e^(a + bi) = e^a * (cos(b) + i*sin(b)) = from_polar(e^a, b)
    //TODO: add the checks for corner cases +∞, -∞, and NaN
```
The commented-out URL points to a fork (skewballfox/burn), which will rot. If the TODO is worth keeping, converting it into an upstream issue link is more durable.
I'll create an issue once this is ready to merge, or try to go ahead and fix it after I write the test
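In the meantime, a scalar sketch of the formula in the snippet above helps pin down the expected behavior, including why the TODO's corner-case checks matter; this is just the math, not burn's code:

```rust
// e^(a + bi) = e^a * (cos(b) + i*sin(b)), i.e. from_polar(e^a, b)
fn complex_exp_scalar(re: f64, im: f64) -> (f64, f64) {
    let magnitude = re.exp();
    (magnitude * im.cos(), magnitude * im.sin())
}

fn main() {
    // Euler's identity: e^(iπ) = -1 + 0i
    let (re, im) = complex_exp_scalar(0.0, std::f64::consts::PI);
    assert!((re + 1.0).abs() < 1e-12 && im.abs() < 1e-12);

    // One corner case from the TODO: re = -∞ should collapse the magnitude
    // to 0, but the naive formula yields 0 * NaN = NaN without explicit checks.
    let (re, im) = complex_exp_scalar(f64::NEG_INFINITY, f64::INFINITY);
    assert!(re.is_nan() && im.is_nan());
}
```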
…lated traits and tests Co-authored-by: Copilot <copilot@github.com>
…missing doc strings
…ch when testing burn-complex Co-authored-by: Copilot <copilot@github.com>
```rust
    Self::Elem: Element,
{
    /// The type of the integer tensor primitive associated with this numeric kind.
    type IntTensor: TensorKind<B>;
```
@laggui so I was having trouble resolving non-complex primitives for complex kinds in the Numeric impl. From what I understand, Rust resolves traits by path, so even doing something like making a supertrait of BackendTypes with complex primitives, and making BackendTypes a supertrait of that supertrait, doesn't work: because Self::Primitive is defined at a different level of that path, it coerces to the complex kind's primitive rather than the BackendTypes primitive. I tried multiple ways to work around that other than the supertrait, but ultimately, without equality constraints on associated types (currently unstable), I think we'll have to pin the type for methods that need to resolve to a specific primitive, like I did above.
If there is an alternative I'd love to hear it.
That's a code smell...
It's not clear why this would be required for the ComplexTensorBackend when you also have the + BackendTypes bound. But I wouldn't introduce an explicit associated type on Numeric to mark the IntTensor kind.
I also noticed you moved backend methods to BackendTypes; I thought this was only introduced in #4868 for associated types?
> That's a code smell...
>
> It's not clear why this would be required for the ComplexTensorBackend when you also have the + BackendTypes bound. But I wouldn't introduce an explicit associated type on Numeric to mark the IntTensor kind.
Honestly, I agree, but the only other solution that might work is associated type defaults, which are unstable, and that's only if the issue is the trait resolution path (option 2 below). If you clone the branch locally, revert to 84514e5, and make it so that CBT or ComplexBackend is how you're trying to supply the Self::Primitive, you basically run into this:
```
mismatched types
expected associated type `<C as Backend>::IntTensorPrimitive`
   found associated type `<<C as ComplexTensorBackend>::Layout as base::Layout>::ComplexTensorPrimitive`
an associated type was expected, but a different one was found
```
And that's whether ComplexTensorPrimitive exists in CBT, Layout, or both. Trait resolution works by path: Self::Primitive always resolves to the complex tensor. I think this means one of two things:
- For other backends, the rhs for powi isn't resolving to IntTensorPrimitive; it's resolving to whatever the primitive is for that TensorKind, even if the underlying dtype isn't aligned with the primitive it's being resolved to. If that's the case, then the type mapping in BackendTypes is redundant.
- The issue is with the layering: if Int/Float/etc. are actually distinct mappings at compile time, then this is going to pose an issue for any backend decorator introducing a new tensor kind or a new underlying dtype. The options are either to provide a mapping at the trait level for the ops that need one of the inputs to be a specific type, or to provide a mapping at the trait level for the default primitive and then use BackendTypes for the inputs (see the toy sketch below).
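A self-contained toy of the resolution issue and the "pin the type" workaround, with stand-in traits rather than burn's real ones:

```rust
trait Backend {
    type IntTensorPrimitive;
}

trait Layout {
    type ComplexTensorPrimitive;
}

trait ComplexTensorBackend: Backend {
    type Layout: Layout;
}

// rhs must be the *backend's* int primitive; spelling out the fully
// qualified path is the workaround, since a `Self::Primitive`-style lookup
// resolves along the complex trait's path instead.
fn powi<C: ComplexTensorBackend>(
    _lhs: <<C as ComplexTensorBackend>::Layout as Layout>::ComplexTensorPrimitive,
    _rhs: <C as Backend>::IntTensorPrimitive,
) {
}

struct Nd;
struct Interleaved;

impl Backend for Nd {
    type IntTensorPrimitive = Vec<i64>;
}
impl Layout for Interleaved {
    type ComplexTensorPrimitive = Vec<(f32, f32)>;
}
impl ComplexTensorBackend for Nd {
    type Layout = Interleaved;
}

fn main() {
    powi::<Nd>(vec![(1.0, 2.0)], vec![3]);
}
```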
> I also noticed you moved backend methods to BackendTypes; I thought this was only introduced in #4868 for associated types?
Check the other changes in this commit: we had a few functions that needed device information in addition to type mapping. It didn't pop up in the PR because I wasn't able to test it on a decorator that didn't already implement Backend. I can move them into a separate trait and then propagate that to the parent trait requirements.
It's a bit early, but I could definitely use feedback on what works and doesn't in terms of the design.
The goals are:
Checklist

The cargo run-checks command has been executed.

Related Issues/PRs
Changes
The main changes in relation to the goals are:

- a burn-complex crate that will house a lot of the shared definitions. I'm guessing most of the stuff other than the ComplexTensorBackend trait and the dtype for complex numbers will be moved here
- a ComplexLayout trait that will be implemented on unit structs that indicate what type of complex layout is in use for an implementation, which allows implementors to define functions and traits only meant to be used for a specific data layout.

Testing
TODO
Notes
My current plan is to get all ops implemented for split and burn-flex, write tests for the ops, and then, once it's almost ready to merge, stash the ndarray implementation and make that a separate PR. It's just way easier to implement for flex. Ndarray macros do not spark joy.