📖 [Story] Caching and Compile time improvements #2684

@narendasan

Description

TL;DR

There is a collection of features that work together to improve compile times.

Goal(s)

We want to reduce the time it takes to compile models. Some parts, such as Dynamo graph capture, are out of our control, but we can improve the TensorRT side.

Tasks

- [ ] https://github.com/pytorch/TensorRT/issues/2674
- [ ] Weight Refit
- [ ] Engine Caching
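
To make the Engine Caching task concrete, here is a minimal sketch of one way such a cache could work. This issue does not specify a design, so everything here is an assumption for illustration: `EngineCache`, `make_key`, `get_or_build`, the `.engine` file suffix, and the choice to key on a hash of a graph string plus a settings dict are all hypothetical, and the build step is a stand-in rather than a real TensorRT call.

```python
import hashlib
import json
import tempfile
from pathlib import Path
from typing import Callable


class EngineCache:
    """Hypothetical on-disk cache keyed by a hash of the graph and build settings."""

    def __init__(self, cache_dir: str) -> None:
        self.cache_dir = Path(cache_dir)
        self.cache_dir.mkdir(parents=True, exist_ok=True)

    @staticmethod
    def make_key(graph_repr: str, settings: dict) -> str:
        # Sort keys so logically equal settings produce the same hash.
        payload = json.dumps({"graph": graph_repr, "settings": settings}, sort_keys=True)
        return hashlib.sha256(payload.encode("utf-8")).hexdigest()

    def get_or_build(
        self, graph_repr: str, settings: dict, build_fn: Callable[[], bytes]
    ) -> bytes:
        path = self.cache_dir / f"{self.make_key(graph_repr, settings)}.engine"
        if path.exists():
            # Cache hit: reuse the serialized engine and skip the expensive build.
            return path.read_bytes()
        engine = build_fn()  # assumed to return serialized engine bytes
        path.write_bytes(engine)
        return engine


def demo() -> int:
    """Request the same (graph, settings) pair twice; return how many builds ran."""
    build_calls = 0

    def fake_build() -> bytes:
        # Stand-in for the real engine build, which is the slow step.
        nonlocal build_calls
        build_calls += 1
        return b"serialized-engine"

    with tempfile.TemporaryDirectory() as tmp:
        cache = EngineCache(tmp)
        settings = {"precision": "fp16"}
        first = cache.get_or_build("graph-v1", settings, fake_build)
        second = cache.get_or_build("graph-v1", settings, fake_build)
        assert first == second
    return build_calls
```

In this sketch the second request is served from disk, so the build function runs only once. A real key would likely also need to cover anything else that affects the built engine (for example input shapes, TensorRT version, and target device), which is left out here.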

Additional context
