
fix: wrong dtype and device in aten.full_like decomposition #3535


Merged
1 commit merged into pytorch:main on May 28, 2025

Conversation

@junstar92 (Contributor) commented May 28, 2025

Description

This PR addresses a bug in the Torch-TensorRT decomposition of torch.ops.aten.full_like.

In the current implementation, the decomposition incorrectly overrides the dtype and device arguments, ignoring explicitly set dtype values and assigning all tensors to the default_device() (typically cuda:0), regardless of the inputs' actual device.

Specifically, the issue occurs in the following decomposition function:

@register_torch_trt_decomposition(
    torch.ops.aten.full_like, registry=TORCH_TRT_DECOMPOSITIONS
)  # type: ignore
def full_like_decomposition(*args, **kwargs) -> torch.Tensor:
    input = args[0]
    shape = args[0].shape
    fill_value = args[1]
    kwargs["dtype"] = input.dtype  # overrides any explicitly passed dtype
    kwargs["device"] = to_torch_device(default_device())  # forces the default device (typically cuda:0)
    return torch.full(shape, fill_value, dtype=kwargs["dtype"], device=kwargs["device"])

This implementation causes two main issues:

  1. Incorrect dtype propagation: Even when torch.full_like(..., dtype=torch.bool) is used in the model, the decomposition overwrites the dtype with input.dtype (e.g., float16), resulting in an incorrect output type.
  2. Device mismatch: When exporting and running models on devices other than cuda:0 (e.g., cuda:1), the decomposition forces outputs onto cuda:0, causing runtime errors or silent bugs due to device mismatch.

To demonstrate the issue, the following test cases are included in this PR:

import torch
from torch.export._trace import _export
from torch_tensorrt.dynamo.lowering import get_decompositions


class MyModel(torch.nn.Module):
    def __init__(self):
        super().__init__()

    def forward(self, x):
        return torch.ones_like(x, dtype=torch.bool)


def test1() -> tuple[bool, str]:
    model = MyModel()
    x = torch.randn(1, 10, dtype=torch.float16)
    y = model(x)
    return y.dtype == torch.bool, f"expected dtype {torch.bool}, and got {y.dtype}"


def test2() -> tuple[bool, str]:
    model = MyModel()
    x = torch.randn(1, 10, dtype=torch.float16)
    ep = _export(model, (x,))
    ep = ep.run_decompositions(get_decompositions(False))
    gm = ep.module()
    y = gm(x)
    return y.dtype == torch.bool, f"expected dtype {torch.bool}, and got {y.dtype}"

def test3() -> tuple[bool, str]:
    device = torch.device("cuda", index=1)
    model = MyModel().to(device)
    x = torch.randn(1, 10, dtype=torch.float16).to(device)
    ep = _export(model, (x,))
    ep = ep.run_decompositions(get_decompositions(False))
    gm = ep.module()
    y = gm(x)
    return y.device == device, f"expected device {device}, and got {y.device}"
    

for test in (test1, test2, test3):
    success, msg = test()
    print(f"{test.__name__}: {'Success' if success else 'Failed'} - {msg}")

Results:

test1: Success - expected dtype torch.bool, and got torch.bool
test2: Failed - expected dtype torch.bool, and got torch.float16
test3: Failed - expected device cuda:1, and got cuda:0
  • test1: Verifies that torch.ones_like returns a tensor with the correct dtype.
  • test2: Shows that the exported model via torch.export(...).run_decompositions(...) fails to preserve dtype.
  • test3: Demonstrates the incorrect device assignment after decomposition when using non-default CUDA devices.

This PR fixes the decomposition logic so that it respects explicitly passed dtype and device values, falling back to values inferred from the input tensor only when they are not provided.
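
For illustration, here is a minimal sketch of the corrected fallback behavior described above (a sketch only; the exact merged code may differ in detail):

@register_torch_trt_decomposition(
    torch.ops.aten.full_like, registry=TORCH_TRT_DECOMPOSITIONS
)  # type: ignore
def full_like_decomposition(*args, **kwargs) -> torch.Tensor:
    input = args[0]
    shape = args[0].shape
    fill_value = args[1]
    # Respect an explicitly passed dtype; otherwise fall back to the input's dtype.
    kwargs["dtype"] = kwargs.get("dtype", None) or input.dtype
    # Place the output on the input tensor's device instead of forcing default_device().
    kwargs["device"] = input.device
    return torch.full(shape, fill_value, dtype=kwargs["dtype"], device=kwargs["device"])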

Type of change

  • Bug fix (non-breaking change which fixes an issue)

Checklist:

  • My code follows the style guidelines of this project (You can use the linters)
  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas and hacks
  • I have made corresponding changes to the documentation
  • I have added tests to verify my fix or my feature
  • New and existing unit tests pass locally with my changes
  • I have added the relevant labels to my PR so that the relevant reviewers are notified

@facebook-github-bot (Contributor)

Hi @junstar92!

Thank you for your pull request and welcome to our community.

Action Required

In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (e.g., your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at [email protected]. Thanks!

@github-actions bot added the component: lowering (Issues re: The lowering / preprocessing passes), component: api [Python] (Issues re: Python API), and component: dynamo (Issues relating to the `torch.compile` or `torch._dynamo.export` paths) labels on May 28, 2025
@facebook-github-bot (Contributor)

Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Meta Open Source project. Thanks!

@narendasan (Collaborator)

@junstar92 thanks for the PR, we will review it!

@peri044 (Collaborator) left a comment

LGTM. Thanks for the fix

@peri044 merged commit ee32da0 into pytorch:main on May 28, 2025
84 checks passed
@junstar92 (Contributor, Author)

@peri044 Thanks for the quick review.

However, I missed that device needs to be handled in the same way as dtype, since full_like also accepts device as an argument:

kwargs["device"] = kwargs.get("device", None) or input.device

Without handling it, a device different from the input's may be used, causing mismatches.
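
A minimal sketch of how the dtype and device fallbacks might read together inside the decomposition (hypothetical; it mirrors the dtype handling above):

kwargs["dtype"] = kwargs.get("dtype", None) or input.dtype
kwargs["device"] = kwargs.get("device", None) or input.device
return torch.full(shape, fill_value, dtype=kwargs["dtype"], device=kwargs["device"])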
Should I open a new PR to address the remaining issue?

@apbose (Collaborator) commented May 29, 2025

Hi @junstar92, you could open a new PR.
