Add `@async` computations in GPU implementation of the time evolution. Benchmark GPU code with NVIDIA Nsight.