Selective disables of a few individual unit tests in Kokkos and KokkosKernels for cuda-debug build on 'white' and 'ride'
Created by: bartlettroscoe
CC: @trilinos/kokkos, @trilinos/kokkos-kernels, @fryeguy52
Description
This PR disables a couple of the more expensive individual unit tests for a Kokkos and KokkosKernels test for just the cuda-debug
build on just 'white'/'ride'. This addresses the last of the errors related to #2827 (closed).
As part of this, I extended TriBITS to support a _EXTRA_ARGS cache var that allows you to append arbitrary command-line arguments to specific tests added with TRIBITS_ADD_TEST(). I used this to add the --gtest_filter
arguments to disable the specific unit tests.
I also included a commit fixing some typos in a README.md file.
Motivation and Context
These tests were timing out at over 10 minutes for this build.
How Has This Been Tested?
I tested this on white
with:
$ bsub -x -Is -q rhel7F -n 16 \
./checkin-test-atdm.sh cuda-debug --enable-packages=Kokkos,KokkosKernels --local-do-all
and it returned:
...
31/35 Test #29: KokkosKernels_sparse_cuda_MPI_1 .................. Passed 235.92 sec
32/35 Test #2: KokkosCore_UnitTest_Cuda_MPI_1 ................... Passed 241.43 sec
33/35 Test #25: KokkosContainers_UnitTest_Serial_MPI_1 ........... Passed 263.52 sec
34/35 Test #33: KokkosKernels_sparse_serial_MPI_1 ................ Passed 358.63 sec
35/35 Test #34: KokkosKernels_graph_serial_MPI_1 ................. Passed 408.13 sec
100% tests passed, 0 tests failed out of 35
Subproject Time Summary:
Kokkos = 814.77 sec*proc (27 tests)
KokkosKernels = 1282.89 sec*proc (8 tests)
Total Test time (real) = 408.15 sec
So that is now under the 600 sec (10 minute) time limit.
Before this change, these two tests timed out with:
...
34/35 Test #33: KokkosKernels_sparse_serial_MPI_1 ................***Timeout 600.07 sec
35/35 Test #25: KokkosContainers_UnitTest_Serial_MPI_1 ...........***Timeout 600.08 sec
94% tests passed, 2 tests failed out of 35
Subproject Time Summary:
Kokkos = 1145.23 sec*proc (27 tests)
KokkosKernels = 1516.06 sec*proc (8 tests)
Total Test time (real) = 600.12 sec
The following tests FAILED:
25 - KokkosContainers_UnitTest_Serial_MPI_1 (Timeout)
33 - KokkosKernels_sparse_serial_MPI_1 (Timeout)
Errors while running CTest
Checklist
-
My commit messages mention the appropriate GitHub issue numbers. -
All new and existing tests passed.