Start working on updating how we do device management
A lot of the C++ stuff (in
spinfel/device) will likely move to a seperate PyPI package soon. This MR makes two changes:
- Get the device count from the CUDA API. I checked and this respects
CUDA_VISIBLE_DEVICESon Cori GPU, Perlmutter, and Summit.
- Remove the
gpu.devices_per_nodesetting -- this is controlled by
RE point 2 above: you need to remove the
[gpu] devices_per_node = ...
section of your own tomls.
This also fixed a bug where the orientation matching code would use the setting rather than