NVIDIA HPC SDK Release Notes (original) (raw)

1. What's New

Welcome to version 24.9 of the NVIDIA HPC SDK, a comprehensive suite of compilers and libraries enabling developers to program the entire HPC platform, from the GPU foundation to the CPU and out through the interconnect. The 24.9 release of the HPC SDK includes component updates as well as important functionality and performance improvements.

2. Release Component Versions

The NVIDIA HPC SDK 24.9 release contains the following versions of each component:

Table 1. HPC SDK Release Components

| | Linux_x86_64 | Linux_aarch64 | | | | | ----------------- | -------------- | --------- | --------- | --------- | | | CUDA 11.8 | CUDA 12.6 | CUDA 11.8 | CUDA 12.6 | | | nvc++ | 24.9 | 24.9 | | | | nvc | 24.9 | 24.9 | | | | nvfortran | 24.9 | 24.9 | | | | nvcc | 11.8.89 | 12.6.37 | 11.8.89 | 12.6.37 | | NCCL | 2.18.5 | 2.18.5 | 2.19.3 | 2.19.3 | | NVSHMEM | 3.0.6 | 3.0.6 | N/A | 3.0.6 | | cuBLAS | 11.11.4.17 | 12.6.0.22 | 11.11.3.6 | 12.6.0.22 | | cuBLASMp | 0.2.1 | 0.2.1 | 0.2.1 | 0.2.1 | | cuFFT | 10.9.0.58 | 11.2.6.28 | 10.9.0.58 | 11.2.6.28 | | cuFFTMp | 11.2.6 | 11.2.6 | N/A | 11.2.6 | | cuRAND | 10.3.0.86 | 10.3.7.37 | 10.3.0.86 | 10.3.7.37 | | cuSOLVER | 11.4.1.48 | 11.6.4.38 | 11.4.1.48 | 11.6.4.38 | | cuSOLVERMp | 0.5.1.0 | 0.5.1.0 | 0.5.1.0 | 0.5.1.0 | | cuSPARSE | 11.7.5.86 | 12.5.2.23 | 11.7.5.86 | 12.5.2.23 | | cuTENSOR | 2.0.2 | 2.0.2 | 2.0.2 | 2.0.2 | | Nsight Compute | 2024.3.0 | 2024.3.0 | | | | Nsight Systems | 2024.5.1 | 2024.5.1 | | | | HPC-X | 2.14 | 2.20 | 2.14 | 2.20 | | OpenBLAS | 0.3.23 | 0.3.23 | | | | Scalapack | 2.2.0 | 2.2.0 | | | | Thrust | 1.15.1 | 2.5.0 | 1.15.1 | 2.5.0 | | CUB | 1.15.1 | 2.5.0 | 1.15.1 | 2.5.0 | | libcu++ | 1.8.1 | 2.5.0 | 1.8.1 | 2.5.0 |

3. Supported Platforms

3.1. Platform Requirements for the HPC SDK

Table 2. HPC SDK Platform Requirements

Architecture Linux Distributions Minimum gcc/glibc Toolchain Minimum CUDA Driver
x86_64 RHEL/CentOS/Rocky 8.0 - 8.10 RHEL/Rocky 9.2 - 9.4 OpenSUSE Leap 15.4 - 15.4 SLES 15SP3, 15SP4, 15SP5, 15SP6 Ubuntu 18.04, 20.04, 22.04, 24.04 Debian 10-12 C99, C11, C++03, C++11, C++14, C++17: 7.5 C++20: 10.1 C++23: 12.1 450.36.06
aarch64 RHEL/CentOS/Rocky 8.0 - 8.10 Rocky 9.2 - 9.3 Ubuntu 20.04, 22.04, 24.04 SLES 15SP6 Amazon Linux 2023 C99, C11, C++03, C++11, C++14, C++17: 7.5 C++20: 10.1 C++23: 12.1 450.36.06

Programs generated by the HPC Compilers for x86_64 processors require a minimum of AVX instructions, which includes Sandy Bridge and newer CPUs from Intel, as well as Bulldozer and newer CPUs from AMD. The HPC SDK includes support for v8.1+ Server Class Arm CPUs that meet the requirements appendix E specified in the SBSA 7.1 specification.

The HPC Compilers are compatible with gcc and g++ and use the GCC C and C++ libraries; the minimum compatible versions of GCC are listed in the table in Section 3. The minimum system requirements for CUDA and NVIDIA Math Library requirements are available in the NVIDIA CUDA Toolkit documentation.

4. Known Limitations

The following are usage recommendations for more effectively using the HPC SDK and its components when unexpected behavior or suboptimal performance is encountered.

5. Deprecations and Changes