Multiple model selection with remote service #1049
…ort multiple models chatqna with wrapper is required
…ong with remote tgi/tei service with instructions
…ital Assistant, Code Generator, Content Summarizer respectively
…ntent exceeds the window height
…and Faqgen response
… adjust to different screen sizes
…n the chatqna prompt section
…different models from dropdown
for more information, see https://pre-commit.ci
…docker compose files Signed-off-by: sgurunat <[email protected]>
for more information, see https://pre-commit.ci
ProductivitySuite/docker_compose/intel/hpu/gaudi/set_env_remote.sh
ProductivitySuite/docker_compose/intel/hpu/gaudi/compose_remote.yaml
ProductivitySuite/docker_compose/intel/hpu/gaudi/compose_tgi_remote.yaml
…d of copying it from local
…om/sgurunat/GenAIExamples into multiple-model-with-remote-service
…s with vLLM Signed-off-by: sgurunat <[email protected]>
…om/sgurunat/GenAIExamples into multiple-model-with-remote-service
Signed-off-by: sgurunat <[email protected]>
…with set_env.sh Signed-off-by: sgurunat <[email protected]>
…M based instructions Signed-off-by: sgurunat <[email protected]>
…ose_remote.yaml file under gaudi folder Signed-off-by: sgurunat <[email protected]>
…t required Signed-off-by: sgurunat <[email protected]>
@sgurunat,
Signed-off-by: sgurunat <[email protected]>
…om/sgurunat/GenAIExamples into multiple-model-with-remote-service
@lvliang-intel - Fixed it
Please check the failed CI: https://github.com/opea-project/GenAIExamples/actions/runs/11795779662/job/32856256339?pr=1049
for more information, see https://pre-commit.ci
Signed-off-by: sgurunat <[email protected]>
…om/sgurunat/GenAIExamples into multiple-model-with-remote-service
git clone https://github.com/opea-project/GenAIComps.git && cd GenAIComps && git checkout "${opea_branch:-"main"}" && cd ../
echo "Build all the images with --no-cache, check docker_image_build.log for details..."
docker compose -f build_vllm.yaml build --no-cache > ${LOG_PATH}/docker_image_build.log
Change build_vllm.yaml to build.yaml, and build only the images required by this test.
Oh yeah, I missed changing it. Updated it now. Thanks.
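For illustration only, the suggested change could look roughly like the sketch below. It assumes `docker compose build` is given an explicit list of services so that only those images are built. The service names are hypothetical placeholders and are not taken from this PR. The command is assembled and printed rather than executed, so the sketch runs without Docker.

```shell
# Hypothetical service names; replace with the services this test actually needs.
service_list="chatqna productivity-suite-ui"

# Fall back to /tmp when LOG_PATH is not set by the CI script (an assumption).
LOG_PATH="${LOG_PATH:-/tmp}"

# Build only the listed services from build.yaml, without using the layer cache.
build_cmd="docker compose -f build.yaml build --no-cache ${service_list}"

# Dry run: print the command instead of executing it.
echo "$build_cmd"

# To run it for real, redirect the output to the build log:
# $build_cmd > "${LOG_PATH}/docker_image_build.log"
```

Passing service names after `build` restricts the build to those services, which is one way to follow the "just build the images required by this test" suggestion.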
docker compose -f build_vllm.yaml build --no-cache > ${LOG_PATH}/docker_image_build.log
docker pull ghcr.io/huggingface/tei-gaudi:latest
docker pull opea/vllm-hpu:latest
You don't need to pull this; it will be built in the CI test.
OK, commented it out.
Signed-off-by: sgurunat <[email protected]>
Signed-off-by: sgurunat <[email protected]>
This PR should be closed, right?

Description
This PR adds multiple-model selection to ProductivitySuite ChatQnA, along with some minor UI enhancements. It also adds docker compose files and instructions for running ProductivitySuite on an Intel Gaudi server with remote TGI/TEI services.
Type of change
New feature (non-breaking change which adds new functionality)
Others (enhancement, documentation, validation, etc.)
New Features:
Add chatqna_wrapper.py along with an updated Dockerfile.wrapper. ChatQnA with the wrapper is required to support multiple models
ProductivitySuite: Add docker compose files for Intel Gaudi server with remote TGI/TEI services, along with instructions
ProductivitySuite UI: Add multiple-model support. Choose different models from a dropdown
Enhancements:
ProductivitySuite UI: Rename ChatQnA, CodeGen, and DocSum to Digital Assistant, Code Generator, and Content Summarizer, respectively
ProductivitySuite UI: Update DocSum to show a vertical scroll bar if the content exceeds the window height
ProductivitySuite UI: Remove the <|eot_id|> string from the Chat, DocSum, and FaqGen responses
ProductivitySuite UI: Update contextWrapper and contextTitle widths to adjust to different screen sizes
ProductivitySuite UI: Always show the system prompt input field so it can be edited in the ChatQnA prompt section
ProductivitySuite UI: Rename max_new_tokens to max_tokens
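
As a rough illustration of the <|eot_id|> cleanup listed above, here is a minimal shell sketch. The actual change lives in the UI code, which is not shown in this thread; the `strip_eot` helper is a hypothetical name used only for this example, and it assumes the marker appears literally in the response text.

```shell
# Hypothetical helper: remove every literal "<|eot_id|>" marker from a response string.
strip_eot() {
  # In a basic regular expression, "<", "|" and ">" are ordinary characters,
  # so this sed expression matches the marker literally and deletes all occurrences.
  printf '%s\n' "$1" | sed 's/<|eot_id|>//g'
}

# Example: a model response that ends with the end-of-turn marker.
strip_eot 'Paris is the capital of France.<|eot_id|>'
```

Deleting the marker before display keeps the end-of-turn token out of the rendered chat, summary, and FAQ text.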