-
Notifications
You must be signed in to change notification settings - Fork 659
[FDConfig] remove engine client args, use fd_config instead #5217
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: develop
Are you sure you want to change the base?
[FDConfig] remove engine client args, use fd_config instead #5217
Conversation
|
Thanks for your contribution! |
| enable_prefix_caching=args.enable_prefix_caching, | ||
| splitwise_role=args.splitwise_role, | ||
| max_processor_cache=args.max_processor_cache, | ||
| config=config, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
这个config命名为fd_config吧?并且函数参数添加FDConfig注解
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
好的
Codecov Report❌ Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## develop #5217 +/- ##
==========================================
Coverage ? 60.44%
==========================================
Files ? 320
Lines ? 39053
Branches ? 5871
==========================================
Hits ? 23604
Misses ? 13582
Partials ? 1867
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
Motivation
由于 FDConfig.postprocess 会对用户传参进行额外的后处理,当前 EngineClient 接收到的参数与 Engine/Worker 实际应用的配置没有完全对齐,存在使用隐患。
Modifications
删除了 EngineClient 的大部分部署参数,仅保留自身请求连接管理的相关参数。在 APIServer 中提前实例化 FDConfig,EngineClient 从 FDConfig 读取所需字段。
Usage or Command
None
Accuracy Tests
None
Checklist
[FDConfig],[APIServer],[Engine],[Scheduler],[PD Disaggregation],[Executor],[Graph Optimization],[Speculative Decoding],[RL],[Models],[Quantization],[Loader],[OP],[KVCache],[DataProcessor],[BugFix],[Docs],[CI],[Optimization],[Feature],[Benchmark],[Others],[XPU],[HPU],[GCU],[DCU],[Iluvatar],[Metax]]pre-commitbefore commit.releasebranch, make sure the PR has been submitted to thedevelopbranch, then cherry-pick it to thereleasebranch with the[Cherry-Pick]PR tag.