Pull requests: HabanaAI/vllm-fork (forked from vllm-project/vllm)
Add VLLM_ENGINE_PROFILER_SKIP_STEPS to the engine profiler (#2143, opened Nov 19, 2025 by yangulei)
Enable graph mode and add full warmup logic for the DeepSeek OCR model (#2138, opened Nov 17, 2025 by HeJunyan)
[DeepSeek R1] Chunked prefill warmup with chunk size (#2135, opened Nov 14, 2025 by jerrychenhf)
fea(readme): VLLM_PROMPT_SEQ_BUCKET_MAX value update (#2121, opened Nov 6, 2025 by imangohari1)
Workaround for assertion error when embedding with bge-m3 in lazy mode (#2093, opened Oct 28, 2025 by slokesha)
Fix bug where VLLM_SKIP_WARMUP=1 is not recognized in vision_bucket (#2036, opened Oct 15, 2025 by yingjie-han)