Skip to content

Conversation

@imangohari1
Copy link

@imangohari1 imangohari1 commented Nov 6, 2025

The VLLM_PROMPT_SEQ_BUCKET_MAX is expected to be the size of input if avail, otherwise max_model_len.

Essential Elements of an Effective PR Description Checklist

  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results

Purpose

Test Plan

Test Result

@imangohari1
Copy link
Author

@mgawarkiewicz-intel thanks for the review. what else is needed here to merge this pr?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants