Open
Description
I'm testing the library with the following configurations:
Driver Version: 535.183.01
CUDA Version: 12.2
CUDA Toolkit Version: 12.6
GPU: Nvidia H100
With the default settings, running the dsscfg program produces the following output:
Begin testing DSSCFG on the CPU (double precision)
CPU iteration 0 F: -0.349732
CPU iteration 0 F: 0.0192875
CPU iteration 0 F: -0.496264
CPU iteration 100 F: -0.888434
CPU iteration 200 F: -1.00093
CPU iteration 300 F: -1.01115
CPU iteration 400 F: -1.01126
CPU iteration 500 F: -1.01129
Timing: 13.946 ms / iteration
Begin testing DSSCFG with CUDA (double precision)
terminate called after throwing an instance of 'thrust::THRUST_200500_600_610_700_720_750_860_NS::system::system_error'
what(): after reduction step 1: cudaErrorInvalidDevice: invalid device ordinal
Aborted (core dumped)
Then I added 90 to the compute capability list in CMakeLists.txt:
foreach(ComputeCapability 60 61 70 72 75 86 90)
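For context, here is a minimal sketch of why adding 90 may have removed the exception. The H100 has compute capability 9.0, and CUDA binary code (SASS) built for capability X.y only runs on devices of capability X.z with z >= y. The sketch assumes the build embeds SASS only, with no PTX fallback, for each entry in the list; I have not confirmed that this is how the CMakeLists.txt uses the values. The function names `sassCompatible` and `hasKernelImage` are hypothetical and not part of the library.

```cpp
#include <vector>

// Compute capabilities are written as in the CMake list: 86 means 8.6, 90 means 9.0.
// Assumption: only SASS is embedded per entry (no PTX that could be JIT-compiled).

// True if binary code built for capability `built` can run on a device of
// capability `device`: same major version, and device minor >= built minor.
bool sassCompatible(int built, int device) {
    return built / 10 == device / 10 && device % 10 >= built % 10;
}

// True if at least one entry in the build list yields code the device can run.
bool hasKernelImage(const std::vector<int>& builtList, int device) {
    for (int built : builtList) {
        if (sassCompatible(built, device)) {
            return true;
        }
    }
    return false;
}
```

Under that assumption, `hasKernelImage({60, 61, 70, 72, 75, 86}, 90)` is false, while the same call with 90 appended to the list is true. This only speaks to the exception in the first run; it does not explain why the minimization stalls afterwards.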
With that change, the program runs without throwing an exception, but the GPU minimization gets stuck after the initial iterations and does not progress any further:
Begin testing DSSCFG on the CPU (double precision)
CPU iteration 0 F: -0.349732
CPU iteration 0 F: 0.0192875
CPU iteration 0 F: -0.496264
CPU iteration 100 F: -0.888434
CPU iteration 200 F: -1.00093
CPU iteration 300 F: -1.01115
CPU iteration 400 F: -1.01126
CPU iteration 500 F: -1.01129
Timing: 20.0913 ms / iteration
Begin testing DSSCFG with CUDA (double precision)
CUDA iteration 0 F: -0.349732
CUDA iteration 0 F: -0.199307
CUDA iteration 0 F: -0.496264
I'm not sure whether this is because the hardware is not supported.