Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add flexible controls of GPU configuration and remove old GPU configuration #97

Merged
merged 27 commits into from
Aug 9, 2023

Conversation

sjsprecious
Copy link
Collaborator

@sjsprecious sjsprecious commented Apr 17, 2023

This PR:

  • updates the configurations for the GPU offload based on the new XML variables in CMEPS (Add new XML variables for flexible controls of GPU configuration ESCOMP/CMEPS#363).
  • removes the old approaches for GPU configurations.
  • updates the module versions and batch script template on Derecho.
  • removes #PBS -V option as it incorrectly populate the NGPUS env variable from the login node to the compute node on Derecho.

Use nvhpc/22.11, cray-mpich/8.1.21, and esmf/8.4.1.b02

	modified:   machines/cmake_macros/gust.cmake
	modified:   machines/config_machines.xml
…_config_cesm/compare/52c06b3..fbc05d6

Update settings on Gust and Casper

	modified:   Depends.nvhpc
	deleted:    Depends.nvhpc-gpu
	deleted:    cmake_macros/nvhpc-gpu.cmake
	deleted:    cmake_macros/nvhpc-gpu_casper.cmake
	modified:   cmake_macros/nvhpc.cmake
	modified:   cmake_macros/nvhpc_casper.cmake
	deleted:    cmake_macros/pgi-gpu.cmake
	deleted:    cmake_macros/pgi-gpu_casper.cmake
	modified:   config_batch.xml
	modified:   config_machines.xml
	deleted:    mpi_run_gpu.casper
…asper and Gust

	modified:   machines/config_machines.xml
	modified:   machines/config_machines.xml
	modified:   machines/config_machines.xml
	modified:   machines/config_machines.xml
load cuda module on Gust for a GPU run

	deleted:    cmake_macros/nvhpc_gust.cmake
	modified:   config_machines.xml
	modified:   machines/config_machines.xml
	modified:   config_batch.xml
	modified:   machines/config_batch.xml
	modified:   machines/config_machines.xml
	modified:   config_machines.xml
	modified:   machines/config_machines.xml
@sjsprecious
Copy link
Collaborator Author

@jedwards4b will issue a separate PR to introduce these changes.

@sjsprecious
Copy link
Collaborator Author

I updated the nvhpc compiler version to 23.5 with a newer version of ESMF8.6.0b03. The GPU run finished successfully on Derecho.

@jedwards4b jedwards4b merged commit fe65a9c into ESMCI:main Aug 9, 2023
@sjsprecious sjsprecious deleted the add_gpu_gust branch August 9, 2023 19:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants