Conversation

@yewentao256 yewentao256 commented Jan 23, 2026

Purpose

  1. Add a num_output_tokens method to avoid len(self.output_token_ids), which materializes a list slice in SlowIncrementalDetokenizer (current code below, followed by a sketch of the new accessor):
class SlowIncrementalDetokenizer(BaseIncrementalDetokenizer):
    @property
    def output_token_ids(self) -> list[int]:
        # Slicing copies the suffix into a new list, so even a plain
        # len(self.output_token_ids) call pays for the copy.
        return (
            self.token_ids
            if not self.prompt_len
            else self.token_ids[self.prompt_len :]
        )
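
A minimal sketch of the new accessor, assuming only the method name given in this PR description (the body is an illustration, not necessarily the PR's exact code):

class SlowIncrementalDetokenizer(BaseIncrementalDetokenizer):
    def num_output_tokens(self) -> int:
        # Compute the count arithmetically; no intermediate list is built.
        return len(self.token_ids) - self.prompt_len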
  2. Accumulate decoded pieces in a list and join them once, instead of repeated string concatenation (see the sketch below).
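
A minimal sketch of the accumulation pattern, with hypothetical names (join_pieces, decoded_pieces) standing in for the PR's actual code:

def join_pieces(decoded_pieces: list[str]) -> str:
    # Appending to a list is O(1) amortized; repeated `text += piece`
    # would copy the accumulated string on every step (O(n^2) overall).
    pieces: list[str] = []
    for piece in decoded_pieces:
        pieces.append(piece)
    return "".join(pieces)  # single O(n) concatenation at the end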

Test

Covered by the existing unit tests:

  tests/tokenizers_/test_detokenize.py
  tests/detokenizer/test_min_tokens.py
  tests/detokenizer/test_stop_string_while_stop_model_terminates.py
  tests/v1/engine/test_fast_incdec_prefix_err.py
  tests/entrypoints/openai/test_serving_tokens.py

CC: @WoosukKwon @njhill

Signed-off-by: yewentao256 <zhyanwentao@126.com>
@yewentao256 yewentao256 added the ready ONLY add when PR is ready to merge/full CI is needed label Jan 23, 2026
@mergify mergify bot added the v1 label Jan 23, 2026

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request introduces performance optimizations to the detokenizer logic. The main changes include adding a num_output_tokens method to avoid creating intermediate list slices when only the length is needed, and accumulating string pieces in a list before performing a single join operation to prevent inefficient repeated string concatenations. These changes directly address the performance goals outlined in the PR description and are well-implemented. The TODO comment regarding inefficiency in BaseIncrementalDetokenizer.update is correctly resolved by the new string accumulation logic. The use of num_output_tokens is consistently applied where appropriate, replacing len(self.output_token_ids) to avoid unnecessary slicing. Overall, the changes improve efficiency without introducing new issues.
