Skip to content

Tpetra::MultiVector: Fix #4870 (prefer running norms on device)

James Willenbring requested to merge mhoemmen:Fix-4870 into develop

Created by: mhoemmen

NOTE: Superseded by PR #4881.

@trilinos/tpetra @vbrunini

Description

  • Prefer running MultiVector norms on device, even if it is currently sync'd to host, unless the vector has too few local rows.
  • Factor out the implementation of Tpetra::MultiVector::norm* into a separate function, Tpetra::Details::normImpl. I do ETI for that function, when ETI is enabled. This required adding new ETI macros for iterating over the (Scalar, Node) type combination.

Motivation and Context

  • SPARC performance
  • Build time

Related Issues

Merge request reports