Description
Hoffman et al introduced ChEES-HMC in http://proceedings.mlr.press/v130/hoffman21a/hoffman21a.pdf. It's one answer to the problem of the wastefulness of NUTS when sampling many chains discussed in #228 (comment). When sampling many chains on the GPU with NUTS, the number of steps taken for all chains is equal to the maximum number of steps taken on any one chain before termination (even though for the chains that already terminated, those steps are never used).
ChEES-HMC uses static HMC to sample all chains, where the trajectory length is jittered. The main contribution of the paper is an adaptation routine to tune the step size and trajectory length to try to minimize the autocorrelation of the chains. For the models they checked, they saw higher ESS/evaluation for many chains than with NUTS.
See in particular Algorithm 1.