Your current environment
Docker image of built using below command:
docker build -f Dockerfile.hpu -t vllm-hpu-env .
🐛 Describe the bug
The vLLM failed to load model on multi gpu when we set the tensor parallelism greater than or equal to 2 on a 8x Gaudi 3 system.
Before submitting a new issue...