|
| 1 | +# Testing AMD CPU Compatible Docker Image |
| 2 | + |
| 3 | +This document explains how to test the AMD CPU compatible Docker image built by GitHub Actions. |
| 4 | + |
| 5 | +## Image Location |
| 6 | + |
| 7 | +The image will be available at: |
| 8 | +``` |
| 9 | +ghcr.io/randomm/text-embeddings-inference/text-embeddings-inference-cpu-amd:fix-dockerfile-issue-7e9217e |
| 10 | +``` |
| 11 | + |
| 12 | +## Quick Test on AMD Server |
| 13 | + |
| 14 | +1. **Pull the image:** |
| 15 | + ```bash |
| 16 | + docker pull ghcr.io/randomm/text-embeddings-inference/text-embeddings-inference-cpu-amd:fix-dockerfile-issue-7e9217e |
| 17 | + ``` |
| 18 | + |
| 19 | +2. **Run with Qwen3 model (the one that was failing):** |
| 20 | + ```bash |
| 21 | + model=Qwen/Qwen3-Embedding-0.6B |
| 22 | + volume=$PWD/data |
| 23 | + |
| 24 | + docker run --rm -p 8080:80 -v $volume:/data \ |
| 25 | + ghcr.io/randomm/text-embeddings-inference/text-embeddings-inference-cpu-amd:fix-dockerfile-issue-7e9217e \ |
| 26 | + --model-id $model |
| 27 | + ``` |
| 28 | + |
| 29 | +3. **Test the embedding endpoint:** |
| 30 | + ```bash |
| 31 | + curl 127.0.0.1:8080/embed \ |
| 32 | + -X POST \ |
| 33 | + -d '{"inputs":"What is Deep Learning?"}' \ |
| 34 | + -H 'Content-Type: application/json' |
| 35 | + ``` |
| 36 | + |
| 37 | +## Expected Results |
| 38 | + |
| 39 | +✅ **Success indicators:** |
| 40 | +- No Intel MKL SGEMM errors in logs |
| 41 | +- Model loads successfully |
| 42 | +- Embedding requests return valid responses |
| 43 | +- No "Parameter 8/13 was incorrect" errors |
| 44 | + |
| 45 | +❌ **Previous error (should be gone):** |
| 46 | +``` |
| 47 | +Intel MKL ERROR: Parameter 8 was incorrect on entry to SGEMM |
| 48 | +Intel MKL ERROR: Parameter 13 was incorrect on entry to SGEMM |
| 49 | +``` |
| 50 | + |
| 51 | +## Key Differences from Original Image |
| 52 | + |
| 53 | +- **No Intel MKL**: Uses generic BLAS libraries |
| 54 | +- **AMD Compatible**: Works on AMD processors without CPU vendor detection issues |
| 55 | +- **Same Performance**: Should maintain similar performance characteristics |
| 56 | +- **Broader Compatibility**: Works on both Intel and AMD CPUs |
| 57 | + |
| 58 | +## Build Status |
| 59 | + |
| 60 | +Check the GitHub Actions build status at: |
| 61 | +https://github.com/randomm/text-embeddings-inference/actions |
| 62 | + |
| 63 | +The workflow "Build CPU AMD Docker Image" should show a successful build. |
0 commit comments