Skip to content

Commit 296396e

Browse files
committed
Add test documentation for AMD CPU Docker image
1 parent 7e9217e commit 296396e

File tree

1 file changed

+63
-0
lines changed

1 file changed

+63
-0
lines changed

TEST_AMD_IMAGE.md

Lines changed: 63 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,63 @@
1+
# Testing AMD CPU Compatible Docker Image
2+
3+
This document explains how to test the AMD CPU compatible Docker image built by GitHub Actions.
4+
5+
## Image Location
6+
7+
The image will be available at:
8+
```
9+
ghcr.io/randomm/text-embeddings-inference/text-embeddings-inference-cpu-amd:fix-dockerfile-issue-7e9217e
10+
```
11+
12+
## Quick Test on AMD Server
13+
14+
1. **Pull the image:**
15+
```bash
16+
docker pull ghcr.io/randomm/text-embeddings-inference/text-embeddings-inference-cpu-amd:fix-dockerfile-issue-7e9217e
17+
```
18+
19+
2. **Run with Qwen3 model (the one that was failing):**
20+
```bash
21+
model=Qwen/Qwen3-Embedding-0.6B
22+
volume=$PWD/data
23+
24+
docker run --rm -p 8080:80 -v $volume:/data \
25+
ghcr.io/randomm/text-embeddings-inference/text-embeddings-inference-cpu-amd:fix-dockerfile-issue-7e9217e \
26+
--model-id $model
27+
```
28+
29+
3. **Test the embedding endpoint:**
30+
```bash
31+
curl 127.0.0.1:8080/embed \
32+
-X POST \
33+
-d '{"inputs":"What is Deep Learning?"}' \
34+
-H 'Content-Type: application/json'
35+
```
36+
37+
## Expected Results
38+
39+
**Success indicators:**
40+
- No Intel MKL SGEMM errors in logs
41+
- Model loads successfully
42+
- Embedding requests return valid responses
43+
- No "Parameter 8/13 was incorrect" errors
44+
45+
**Previous error (should be gone):**
46+
```
47+
Intel MKL ERROR: Parameter 8 was incorrect on entry to SGEMM
48+
Intel MKL ERROR: Parameter 13 was incorrect on entry to SGEMM
49+
```
50+
51+
## Key Differences from Original Image
52+
53+
- **No Intel MKL**: Uses generic BLAS libraries
54+
- **AMD Compatible**: Works on AMD processors without CPU vendor detection issues
55+
- **Same Performance**: Should maintain similar performance characteristics
56+
- **Broader Compatibility**: Works on both Intel and AMD CPUs
57+
58+
## Build Status
59+
60+
Check the GitHub Actions build status at:
61+
https://github.com/randomm/text-embeddings-inference/actions
62+
63+
The workflow "Build CPU AMD Docker Image" should show a successful build.

0 commit comments

Comments
 (0)