Multiple model selection with remote service #1049

sgurunat · 2024-10-30T07:40:22Z

Description
This PR adds multiple-model selection to ProductivitySuite ChatQnA, along with some minor UI enhancements. It also adds docker compose files and instructions for running ProductivitySuite on an Intel Gaudi server with remote TGI/TEI services.

Type of change
New feature (non-breaking change which adds new functionality)
Others (enhancement, documentation, validation, etc.)
New Features:

Add chatqna_wrapper.py along with an updated Dockerfile.wrapper. The wrapper is required for ChatQnA to support multiple models
ProductivitySuite: Add docker compose files for an Intel Gaudi server with remote TGI/TEI services, along with instructions
ProductivitySuite UI: Add multiple-model support. Different models can be chosen from a dropdown
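
As a rough illustration of the multiple-model idea, a model name chosen in the dropdown could be mapped to the remote service that hosts it. This is only a sketch under stated assumptions: the model names, URLs, `ModelEndpoint`, and `resolve_endpoint` are hypothetical and are not taken from chatqna_wrapper.py.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ModelEndpoint:
    """Where a selectable model is served (all values here are made up)."""
    model_id: str
    base_url: str


# Hypothetical registry: a real deployment would more likely read these
# from environment variables or the compose file than hard-code them.
ENDPOINTS = {
    "model-a": ModelEndpoint("model-a", "http://remote-host-1:8080"),
    "model-b": ModelEndpoint("model-b", "http://remote-host-2:8080"),
}
DEFAULT_MODEL = "model-a"


def resolve_endpoint(requested_model: Optional[str] = None) -> ModelEndpoint:
    """Return the endpoint for the requested model, or the default one.

    Unknown names raise ValueError so that a bad dropdown value is
    reported instead of being silently routed to the wrong service.
    """
    name = requested_model or DEFAULT_MODEL
    try:
        return ENDPOINTS[name]
    except KeyError:
        raise ValueError(f"unknown model: {name!r}") from None
```

With this registry, `resolve_endpoint("model-b").base_url` returns `"http://remote-host-2:8080"`, and calling `resolve_endpoint()` with no argument falls back to the default model.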
Enhancements:

ProductivitySuite UI: Rename ChatQnA, CodeGen, and DocSum to Digital Assistant, Code Generator, and Content Summarizer, respectively
ProductivitySuite UI: Give DocSum a vertical scroll bar when content exceeds the window height
ProductivitySuite UI: Remove the <|eot_id|> string from Chat, DocSum, and FaqGen responses
ProductivitySuite UI: Update contextWrapper and contextTitle widths to adjust to different screen sizes
ProductivitySuite UI: Always show the system prompt input field so it can be edited in the ChatQnA prompt section
ProductivitySuite UI: Rename max_new_tokens to max_tokens
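
Two of the enhancements above are small data transformations: dropping the <|eot_id|> marker from a response and renaming the max_new_tokens parameter to max_tokens. The UI code is not shown in this thread, so the following is only a Python sketch of the behavior; `strip_eot` and `rename_max_tokens` are hypothetical helper names, and the rule that an existing max_tokens value wins is an assumption.

```python
EOT_MARKER = "<|eot_id|>"


def strip_eot(text: str) -> str:
    """Remove every occurrence of the end-of-turn marker from a response."""
    return text.replace(EOT_MARKER, "")


def rename_max_tokens(params: dict) -> dict:
    """Return a copy of params with max_new_tokens renamed to max_tokens.

    Assumption: if both keys are present, the existing max_tokens wins.
    The input dict is left unmodified.
    """
    updated = dict(params)
    if "max_new_tokens" in updated:
        legacy = updated.pop("max_new_tokens")
        updated.setdefault("max_tokens", legacy)
    return updated
```

For example, `strip_eot("Hello<|eot_id|>")` returns `"Hello"`, and `rename_max_tokens({"max_new_tokens": 128})` returns `{"max_tokens": 128}`.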

ProductivitySuite/docker_compose/intel/hpu/gaudi/README.md

ProductivitySuite/docker_compose/intel/hpu/gaudi/compose_remote.yaml

ProductivitySuite/docker_compose/intel/hpu/gaudi/compose_tgi_remote.yaml

…d of copying it from local

…om/sgurunat/GenAIExamples into multiple-model-with-remote-service

…s with vLLM Signed-off-by: sgurunat <[email protected]>

…om/sgurunat/GenAIExamples into multiple-model-with-remote-service

Signed-off-by: sgurunat <[email protected]>

…with set_env.sh Signed-off-by: sgurunat <[email protected]>

…M based instructions Signed-off-by: sgurunat <[email protected]>

…ose_remote.yaml file under gaudi folder Signed-off-by: sgurunat <[email protected]>

…t required Signed-off-by: sgurunat <[email protected]>

lvliang-intel · 2024-11-12T07:21:16Z

@sgurunat,
Please fix this path check issue.

Signed-off-by: sgurunat <[email protected]>

…om/sgurunat/GenAIExamples into multiple-model-with-remote-service

sgurunat · 2024-11-12T14:08:20Z

@lvliang-intel - Fixed it

chensuyue · 2024-11-13T01:24:18Z

Please check the failed CI: https://github.com/opea-project/GenAIExamples/actions/runs/11795779662/job/32856256339?pr=1049

Signed-off-by: sgurunat <[email protected]>

…om/sgurunat/GenAIExamples into multiple-model-with-remote-service

chensuyue · 2024-11-14T08:16:42Z

ProductivitySuite/tests/test_compose_on_gaudi.sh

+    git clone https://github.com/opea-project/GenAIComps.git && cd GenAIComps && git checkout "${opea_branch:-"main"}" && cd ../
+
+    echo "Build all the images with --no-cache, check docker_image_build.log for details..."
+    docker compose -f build_vllm.yaml build --no-cache > ${LOG_PATH}/docker_image_build.log


Change build_vllm.yaml to build.yaml, and build only the images this test requires.

Oh yeah, I missed changing it. Updated now, thanks.

chensuyue · 2024-11-14T08:17:03Z

ProductivitySuite/tests/test_compose_on_gaudi.sh

+    docker compose -f build_vllm.yaml build --no-cache > ${LOG_PATH}/docker_image_build.log
+
+    docker pull ghcr.io/huggingface/tei-gaudi:latest
+    docker pull opea/vllm-hpu:latest


You don't need to pull these; they will be built in the CI test.

OK, commented them out.

Signed-off-by: sgurunat <[email protected]>

chensuyue · 2024-11-18T07:16:44Z

This PR should be closed, right?

lvliang-intel · 2024-11-18T13:22:39Z

This PR is fixed by #1144 and #1149

sgurunat added 9 commits October 29, 2024 12:31

a3ef260 Add chatqna_wrapper.py along with updated Dockerfile.wrapper. To support multiple models chatqna with wrapper is required

1d30bff ProductivitySuite: Add docker compose files for Intel Gaudi server along with remote tgi/tei service with instructions

8ec0f6a ProductivitySuite UI: Update names of ChatQnA, CodeGen, DocSum to Digital Assistant, Code Generator, Content Summarizer respectively

6216b5a ProductivitySuite UI: Update Docsum to have vertical scroll bar if content exceeds the window height

a6e4a7d ProductivitySuite UI: Remove <|eot_id|> string from the Chat, Docsum and Faqgen response

b999077 ProductivitySuite UI: Update contextWrapper and contextTitle width to adjust to different screen sizes

cf96dcc ProductivitySuite UI: Show system prompt input field always to edit in the chatqna prompt section

debdd0f ProductivitySuite UI: Update max_new_tokens into max_tokens in ConversationSlice

0a5584a ProductivitySuite UI: Add multiple models support in ChatQnA. Choose different models from dropdown

sgurunat requested a review from lvliang-intel as a code owner October 30, 2024 07:40

4d9b1da [pre-commit.ci] auto fixes from pre-commit.com hooks; for more information, see https://pre-commit.ci

lvliang-intel reviewed Oct 30, 2024

ProductivitySuite/docker_compose/intel/hpu/gaudi/compose.yaml (outdated)

sgurunat and others added 3 commits October 30, 2024 15:05

594891a removed langchain related environment variables in ProductivitySuite docker compose files Signed-off-by: sgurunat <[email protected]>

0907b42 removed langchain related environment variables in ProductivitySuite docker compose files

2cd3ea7 [pre-commit.ci] auto fixes from pre-commit.com hooks; for more information, see https://pre-commit.ci

letonghan reviewed Oct 31, 2024

ProductivitySuite/docker_compose/intel/hpu/gaudi/set_env_remote.sh (outdated)

jaswanth8888 added this to the v1.1 milestone Nov 7, 2024

0d3ec68 Merge branch 'main' into multiple-model-with-remote-service

ftian1 approved these changes Nov 8, 2024

amberjain1 reviewed Nov 8, 2024

ChatQnA/Dockerfile.wrapper (outdated)

amberjain1 reviewed Nov 8, 2024

ChatQnA/Dockerfile.wrapper (outdated)