Tpetra::CrsGraph: 2- or 3-level thread parallelization of sortAllIndices & mergeAllIndices
Created by: mhoemmen
@trilinos/tpetra Do a 2-level or 3-level thread parallelization of Tpetra::CrsGraph methods sortAllIndices and mergeAllIndices.
This is a "story" because this may call for a thread-parallel segmented sort, or segmented sort-and-merge.
Update (12 Nov 2016): I rewrote this issue to reflect a multiple-step process. See #832. The first step will be a single-level thread parallelization. The second step (likely done at the same time) will be to remove any implicit UVM assumptions that the methods may make. The third step would be this issue, a 2-level or 3-level parallelization that relies on a segmented sort (which does not exist yet; see #662).