Issue #3336 (Closed)
Created Aug 22, 2018 by James Willenbring (@jmwille, Maintainer)

KokkosKernels and KokkosContainers tests timing out on ATDM waterman builds starting 8/16/2018

Created by: fryeguy52

CC: @trilinos/kokkos-kernels, @kddevin (Trilinos Data Services Product Lead), @bartlettroscoe

Next Action Status

PR #3660, merged on 10/18/2018, disabled the test KokkosContainers_UnitTest_Serial_MPI_1 in the build Trilinos-atdm-waterman-cuda-9.2-debug. All Kokkos and KokkosKernels tests were passing on 10/19/2018.

Description

Some KokkosKernels and KokkosContainers tests are timing out in the debug builds on waterman.

The following tests are failing in the job Trilinos-atdm-waterman-cuda-9.2-debug:

  • KokkosKernels_sparse_serial_MPI_1
  • KokkosContainers_UnitTest_Serial_MPI_1

as shown here

And the following tests are failing in the job Trilinos-atdm-waterman-gnu-debug-openmp:

  • KokkosKernels_sparse_serial_MPI_1
  • KokkosContainers_UnitTest_Serial_MPI_1
  • KokkosKernels_sparse_openmp_MPI_1
  • KokkosKernels_graph_serial_MPI_1

as shown here

Steps to Reproduce

One should be able to reproduce this failure on the machine as described in:

  • https://github.com/trilinos/Trilinos/blob/develop/cmake/std/atdm/README.md

More specifically, the commands given for the system are provided at:

  • https://github.com/trilinos/Trilinos/blob/develop/cmake/std/atdm/README.md#waterman

The exact commands to reproduce this issue should be:


$ cd <some_build_dir>/

$ source $TRILINOS_DIR/cmake/std/atdm/load-env.sh cuda-debug

$ cmake \
  -GNinja \
  -DTrilinos_CONFIGURE_OPTIONS_FILE:STRING=cmake/std/atdm/ATDMDevEnv.cmake \
  -DTrilinos_ENABLE_TESTS=ON -DTrilinos_ENABLE_Kokkos=ON \
  $TRILINOS_DIR

$ make NP=20

$ bsub -x -Is -n 20 ctest -j20
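
To rerun only the tests listed above rather than the full suite, the test names can be joined into a regex for `ctest -R`. The following is a minimal sketch, not part of the original report: the variable names and the `TIMEOUT_SECS` value are assumptions chosen for illustration, and `--timeout` only sets ctest's default timeout, so it may not override a timeout set on an individual test. The script prints the command instead of running it so it can be reviewed first.

```shell
#!/bin/sh
# Tests reported as failing in Trilinos-atdm-waterman-cuda-9.2-debug (from this issue).
tests="KokkosKernels_sparse_serial_MPI_1 KokkosContainers_UnitTest_Serial_MPI_1"

# Join the names into an anchored alternation usable with `ctest -R`.
regex="^($(echo $tests | tr ' ' '|'))\$"

# Assumed value for illustration only; not taken from the ATDM configuration.
TIMEOUT_SECS=1800

# Dry run: print the command rather than executing it.
echo "bsub -x -Is -n 20 ctest -R '$regex' --timeout $TIMEOUT_SECS --output-on-failure"
```

For the Trilinos-atdm-waterman-gnu-debug-openmp build, the same approach should apply after adding KokkosKernels_sparse_openmp_MPI_1 and KokkosKernels_graph_serial_MPI_1 to `tests`.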