Kokkos-kernels: Fix #4630

James Willenbring requested to merge mhoemmen:Fix-4630 into develop

Created by: mhoemmen

@trilinos/tpetra @trilinos/belos @vbrunini


Modify Trilinos' snapshot of kokkos-kernels to fix #4630 (closed). We will open a separate PR against kokkos-kernels with the same changes.

Motivation and Context

Kokkos-kernels was not dispatching to cuBLAS for Views in CudaUVMSpace, so operations on those Views were slow. See discussion here:

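For readers unfamiliar with this kind of dispatch problem, here is a minimal, self-contained sketch of how a compile-time eligibility trait can send one memory space to a vendor library while another silently takes a fallback path. It does not use Kokkos or kokkos-kernels; the tag types, the `cublas_eligible` trait, and `chosen_backend` are hypothetical names invented for illustration and are not the actual kokkos-kernels code changed by this PR.

```cpp
#include <type_traits>

// Hypothetical stand-ins for memory spaces (NOT the real Kokkos types).
struct HostSpaceTag {};
struct CudaSpaceTag {};
struct CudaUVMSpaceTag {};

enum class Backend { CuBLAS, Fallback };

// Hypothetical trait: may data in this memory space be handed to cuBLAS?
// The primary template says "no", so any space without a specialization
// falls through to the fallback path.
template <class MemorySpace>
struct cublas_eligible : std::false_type {};

template <>
struct cublas_eligible<CudaSpaceTag> : std::true_type {};

// Assumption for illustration: if this specialization were missing,
// UVM data would quietly use the fallback even though cuBLAS can read it.
template <>
struct cublas_eligible<CudaUVMSpaceTag> : std::true_type {};

// Select the backend at compile time from the memory space alone.
template <class MemorySpace>
constexpr Backend chosen_backend() {
  return cublas_eligible<MemorySpace>::value ? Backend::CuBLAS
                                             : Backend::Fallback;
}

// Compile-time checks of the selection logic.
static_assert(chosen_backend<CudaSpaceTag>() == Backend::CuBLAS, "");
static_assert(chosen_backend<CudaUVMSpaceTag>() == Backend::CuBLAS, "");
static_assert(chosen_backend<HostSpaceTag>() == Backend::Fallback, "");
```

In a design like this, a missing specialization produces no compile error and no wrong results, only a slower code path, which is why such a gap can go unnoticed until someone profiles.
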
Related Issues

How Has This Been Tested?

TpetraCore tests pass in a CUDA static release build.

