### TL;DR
Runtime optimization in torch-tensorrt is crucial for maximizing model performance in real-world applications.
This story tracks the effort to improve runtime performance.
### Goal(s)
- Understand the overhead in the C++/Python runtime modules and improve inference performance (see the profiling sketch after this list)
- Ensure the optimizations have no or minimal impact on accuracy and resource usage
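For reference, a minimal sketch of how the Python-side runtime overhead could be measured: it compares eager vs. compiled latency with CUDA events and uses `torch.profiler` to separate TensorRT engine execution from the surrounding wrapper time. The model, input shape, precision, and iteration counts below are illustrative placeholders, not taken from the linked PRs/issues.

```python
import torch
import torch_tensorrt

# Placeholder model and input; any CUDA-capable module would do.
model = torch.nn.Sequential(
    torch.nn.Linear(1024, 1024),
    torch.nn.ReLU(),
    torch.nn.Linear(1024, 1024),
).eval().cuda()
example_input = torch.randn(8, 1024, device="cuda")

# Compile with the dynamo frontend; IR and precision choices here are illustrative.
trt_module = torch_tensorrt.compile(
    model,
    ir="dynamo",
    inputs=[example_input],
    enabled_precisions={torch.float16},
)

def time_module(mod, inp, iters=200, warmup=50):
    """Average latency in ms per call, measured with CUDA events."""
    for _ in range(warmup):
        mod(inp)
    torch.cuda.synchronize()
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    start.record()
    for _ in range(iters):
        mod(inp)
    end.record()
    torch.cuda.synchronize()
    return start.elapsed_time(end) / iters

print(f"eager : {time_module(model, example_input):.3f} ms / iter")
print(f"trt   : {time_module(trt_module, example_input):.3f} ms / iter")

# A profiler trace shows how much of each call is spent inside the TensorRT
# execution op versus the Python/C++ runtime wrapper around it.
with torch.profiler.profile(
    activities=[
        torch.profiler.ProfilerActivity.CPU,
        torch.profiler.ProfilerActivity.CUDA,
    ]
) as prof:
    for _ in range(20):
        trt_module(example_input)
print(prof.key_averages().table(sort_by="cuda_time_total", row_limit=10))
```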
### Tasks
- [ ] https://github.com/pytorch/TensorRT/pull/3276
- [ ] https://github.com/pytorch/TensorRT/issues/3277
### Additional context