NVIDIA HPC SDK Release Notes

1. What's New

Welcome to the 24.3 version of the NVIDIA HPC SDK, a comprehensive suite of compilers and libraries enabling developers to program the entire HPC platform, from the GPU foundation to the CPU and out through the interconnect. The 24.3 release of the HPC SDK includes new features as well as important functionality and performance improvements.

2. Release Component Versions

The NVIDIA HPC SDK 24.3 release contains the following versions of each component:

Table 1. HPC SDK Release Components

| Component | Linux_x86_64, CUDA 11.8 | Linux_x86_64, CUDA 12.3 | Linux_aarch64, CUDA 11.8 | Linux_aarch64, CUDA 12.3 |
| ----------------- | -------------- | ---------- | --------- | ---------- |
| nvc++ | 24.3 | 24.3 | 24.3 | 24.3 |
| nvc | 24.3 | 24.3 | 24.3 | 24.3 |
| nvfortran | 24.3 | 24.3 | 24.3 | 24.3 |
| nvcc | 11.8.89 | 12.3.101 | 11.8.89 | 12.3.101 |
| NCCL | 2.18.5 | 2.18.5 | 2.19.3 | 2.19.3 |
| NVSHMEM | 2.10.1 | 2.10.1 | N/A | 2.10.1 |
| cuBLAS | 11.11.4.17 | 12.3.4.1 | 11.11.3.6 | 12.3.4.1 |
| cuBLASMp | 0.1.2 | 0.1.2 | 0.1.2 | 0.1.2 |
| cuFFT | 10.9.0.58 | 11.0.12.1 | 10.9.0.58 | 11.0.12.1 |
| cuFFTMp | 11.0.14 | 11.0.14 | N/A | 11.0.14 |
| cuRAND | 10.3.0.86 | 10.3.4.101 | 10.3.0.86 | 10.3.4.101 |
| cuSOLVER | 11.4.1.48 | 11.5.4.101 | 11.4.1.48 | 11.5.4.101 |
| cuSOLVERMp | 0.4.3 | 0.4.3 | 0.4.3 | 0.4.3 |
| cuSPARSE | 11.7.5.86 | 12.2.0.103 | 11.7.5.86 | 12.2.0.103 |
| cuTENSOR | 2.0.1 | 2.0.1 | 2.0.1 | 2.0.1 |
| Nsight Compute | 2023.3.1 | 2023.3.1 | 2023.3.1 | 2023.3.1 |
| Nsight Systems | 2024.4.1 | 2024.4.1 | 2024.4.1 | 2024.4.1 |
| OpenMPI | 3.1.5 | 3.1.5 | 3.1.5 | 3.1.5 |
| HPC-X | 2.14 | 2.17.1 | 2.14 | 2.17.1 |
| OpenBLAS | 0.3.23 | 0.3.23 | 0.3.23 | 0.3.23 |
| Scalapack | 2.2.0 | 2.2.0 | 2.2.0 | 2.2.0 |
| Thrust | 1.15.1 | 2.2.0 | 1.15.1 | 2.2.0 |
| CUB | 1.15.1 | 2.2.0 | 1.15.1 | 2.2.0 |
| libcu++ | 1.8.1 | 2.2.0 | 1.8.1 | 2.2.0 |

3. Supported Platforms

3.1. Platform Requirements for the HPC SDK

Table 2. HPC SDK Platform Requirements

| Architecture | Linux Distributions | Minimum gcc/glibc Toolchain | Minimum CUDA Driver |
| ------------ | ------------------- | --------------------------- | ------------------- |
| x86_64 | RHEL/CentOS 7.3 - 7.9; RHEL/CentOS/Rocky 8.0 - 8.7; RHEL 9.2; Fedora 33, 34; OpenSUSE Leap 15.2 - 15.4; SLES 15SP2, 15SP3, 15SP4; Ubuntu 18.04, 20.04, 22.04; Debian 10 | C99: 4.8; C11: 4.9; C++03: 4.8; C++11: 4.9; C++14: 5.1; C++17: 7.1; C++20: 10.1 | 450.36.06 |
| aarch64 | RHEL/CentOS/Rocky 8.0 - 8.7; Rocky 9.2; Ubuntu 20.04, 22.04; SLES 15SP2, 15SP3, 15SP4; Amazon Linux 2 | C99: 4.8; C11: 4.9; C++03: 4.8; C++11: 4.9; C++14: 5.1; C++17: 7.1; C++20: 10.1 | 450.36.06 |

Programs generated by the HPC Compilers for x86_64 processors require, at a minimum, support for AVX instructions. AVX is available on Sandy Bridge and newer CPUs from Intel, as well as Bulldozer and newer CPUs from AMD. The HPC SDK includes support for v8.1+ Server Class Arm CPUs that meet the requirements specified in Appendix E of the SBSA 7.1 specification.
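As an illustration of the AVX requirement, the following is a minimal sketch (not part of the HPC SDK) of one way to check whether a Linux x86_64 host advertises AVX. It assumes the kernel exposes CPU capabilities on `flags` lines in `/proc/cpuinfo`; the helper name `has_avx` is hypothetical.

```python
from pathlib import Path


def has_avx(cpuinfo_text: str) -> bool:
    """Return True if any 'flags' line in /proc/cpuinfo-style text lists 'avx'.

    Hypothetical helper for illustration; it is not an HPC SDK tool.
    """
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        # Match the exact token 'avx', not 'avx2' or 'avx512f'.
        if sep and key.strip() == "flags" and "avx" in value.split():
            return True
    return False


if __name__ == "__main__":
    cpuinfo = Path("/proc/cpuinfo")
    if cpuinfo.exists():
        print("AVX supported:", has_avx(cpuinfo.read_text()))
    else:
        print("/proc/cpuinfo not found; cannot check on this system")
```

On aarch64 hosts `/proc/cpuinfo` has no x86 `flags` line, so the helper returns `False`; the AVX requirement applies only to x86_64 targets.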

The HPC Compilers are compatible with gcc and g++ and use the GCC C and C++ libraries; the minimum compatible versions of GCC are listed in Table 2. The minimum system requirements for CUDA and the NVIDIA Math Libraries are available in the NVIDIA CUDA Toolkit documentation.
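The per-standard GCC minimums in Table 2 can be encoded as a small lookup. The sketch below is illustrative only: the version numbers are copied from Table 2, while the names `MIN_GCC`, `parse_version`, and `gcc_meets_minimum` are hypothetical and not part of the HPC SDK.

```python
# Minimum GCC versions per language standard, copied from Table 2.
MIN_GCC = {
    "C99": (4, 8),
    "C11": (4, 9),
    "C++03": (4, 8),
    "C++11": (4, 9),
    "C++14": (5, 1),
    "C++17": (7, 1),
    "C++20": (10, 1),
}


def parse_version(text: str) -> tuple:
    """Turn a dotted version string such as '9.4.0' into a tuple of ints."""
    return tuple(int(part) for part in text.strip().split("."))


def gcc_meets_minimum(gcc_version: str, standard: str) -> bool:
    """Return True if gcc_version is at least the Table 2 minimum for standard."""
    return parse_version(gcc_version) >= MIN_GCC[standard]


if __name__ == "__main__":
    # Example: compare a system GCC version string against each standard.
    for std in MIN_GCC:
        print(std, gcc_meets_minimum("9.4.0", std))
```

The version string could come from `gcc -dumpfullversion` on the target system; comparing integer tuples avoids the pitfall of string comparison, where "10.1" would sort before "4.8".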

4. Known Limitations

5. Deprecations and Changes