Skip to content

[RFC]: Doc enhancement #1248

Open
Open
@wangxiyuan

Description

@wangxiyuan

Motivation.

This RFC is used for trace the community work of official doc improvement. Currently, there are 3 important section need update.

  1. Tutorials
  2. User doc
  3. Developer doc

I'll list all the work items below. Everyone is welcome to take the task.

Proposed Change.

Tutorials

Problem:

  1. Moe model guide is missing. For example Qwen3 Moe
  2. No detial parallel case example
  3. QwQ is not very popular.
  4. V1 Engine should be used by default

Propose Change:
After improvment. The content be more clear(required : 313T+64GB):

  • Single NPU(Qwen3-8B) aclgraph mode + eager mode @leo-pony
  • Single NPU(Qwen2.5-VL-7B)eager mode @shen-shanshan
  • Single NPU(Qwen2.5-audio)eager mode @shen-shanshan
  • Single NPU(Qwen3 8B embedding)eager mode @wangxiyuan
  • Multi NPU 2 card (Qwen3 MOE-30B) aclgraph mode + TP2 @leo-pony
  • Multi NPU 4 card (Qwen3 32B) aclgraph mode + TP2 + DP2 + W8A8(optional) @22dimensions
  • Multi Node 2 node (DeepSeek V3 0528 W8A8) TP8+DP2 Graph mode @Potabk
  • Multi Node 4 node (DeepSeek R1) TP8+DP4 Graph mode @MengqingCao
  • Multi Node 8 node (DeepSeek V3 0528 ) 1P1D Graph mode @wangxiyuan

User Guide

user guide should contain the usage for vLLM Ascend

Problem:

  1. A lot of usage is missing.

Propose Change:

Developer Guide

Problem:

  1. There is no feature or code guide for developers at all.

Propose Change:

Metadata

Metadata

Assignees

No one assigned

    Labels

    RFCRequest For Comments

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions