Commit f203673
[ModelOpt] Introduce VLLM_MAX_TOKENS_PER_EXPERT_FP4_MOE env var to control blockscale tensor allocation (#18160)
Signed-off-by: Pavani Majety <[email protected]>
1 parent 7d92164
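
The commit title describes an environment variable that bounds how large the FP4 MoE blockscale tensor is allocated. The sketch below illustrates that idea only and is not the code from this commit: the default value, the helper names `max_tokens_per_expert` and `blockscale_shape`, the 16-element scaling block, and the shape formula are all assumptions made for illustration.

```python
import os

# Hypothetical default; the value chosen by the commit is not shown on this page.
ASSUMED_DEFAULT_MAX_TOKENS = 65536
# Assumption: one scale factor per block of 16 elements along the hidden dimension.
ASSUMED_FP4_BLOCK_SIZE = 16


def max_tokens_per_expert(environ=os.environ):
    """Read VLLM_MAX_TOKENS_PER_EXPERT_FP4_MOE, falling back to the assumed default."""
    raw = environ.get("VLLM_MAX_TOKENS_PER_EXPERT_FP4_MOE")
    if raw is None:
        return ASSUMED_DEFAULT_MAX_TOKENS
    value = int(raw)
    if value <= 0:
        raise ValueError("VLLM_MAX_TOKENS_PER_EXPERT_FP4_MOE must be positive")
    return value


def blockscale_shape(num_experts, hidden_size, environ=os.environ):
    """Return an illustrative (rows, cols) shape for a preallocated blockscale buffer."""
    rows = num_experts * max_tokens_per_expert(environ)
    # Ceiling division: number of scaling blocks needed to cover hidden_size.
    cols = -(-hidden_size // ASSUMED_FP4_BLOCK_SIZE)
    return rows, cols


if __name__ == "__main__":
    # Lowering the cap shrinks the buffer that would be allocated up front.
    env = {"VLLM_MAX_TOKENS_PER_EXPERT_FP4_MOE": "1024"}
    print(blockscale_shape(num_experts=8, hidden_size=4096, environ=env))
```

Passing the environment as a parameter keeps the sketch testable without mutating the process environment; a real deployment would set the variable in the shell before launching the server.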
File tree
- vllm
- model_executor/layers/fused_moe

3 files changed, +20 -18 lines changed

Changed file 1 of 3 (+9 -3)

| Hunk | Original lines | New lines | Added | Removed |
|---|---|---|---|---|
| 1 | 1085-1091 | 1085-1090 | 0 | 1 |
| 2 | 1107-1115 | 1106-1121 | 9 | 2 |

Changed file 2 of 3 (+8 -0)

| Hunk | Original lines | New lines | Added | Removed |
|---|---|---|---|---|
| 1 | 117-122 | 117-123 | 1 | 0 |
| 2 | 814-819 | 815-827 | 7 | 0 |

Changed file 3 of 3 (+3 -15)

| Hunk | Original lines | New lines | Added | Removed |
|---|---|---|---|---|
| 1 | 1-6 | 1-5 | 0 | 1 |
| 2 | 271-278 | 270-275 | 0 | 2 |
| 3 | 330-339 | 327-333 | 1 | 4 |
| 4 | 362-369 | 356-362 | 1 | 2 |
| 5 | 378-389 | 371-377 | 1 | 6 |