Releases · ROCm/aomp (original) (raw)
rocm-7.2.4
rocm-7.2.3
rocm-7.2.2
rocm-7.2.1
AOMP Release 23.0-0
These are the release notes for AOMP 23.0-0. AOMP uses AMD developer modifications to the upstream LLVM development trunk. These differences are managed in a branch called "amd-staging". This branch is found in a mirror of upstream LLVM found at https://github.com/ROCm/llvm-project. The amd-staging branch is constantly changing as it merges the upstream development trunk with its downstream development updates. The AMD modifications are experimental while under review for the upstream trunk. AOMP uses a snapshot of amd-staging at the commit ids and dates listed below. AOMP also includes builds of related ROCm components. We call AOMP a "standalone" build as it does not use or require ROCm with the exception of the kernel module (amdgpu-dkms) and libdrm which are often part of the Linux distribution. AOMP is isolated from any ROCm installations by installing into /usr/lib/aomp and the use of RPATH for runtime libraries.
For AOMP 23.0-0, the last LLVM trunk commit is a 82447b92c37842b33e708146ba4ec794bb9fae98 on February 12, 2026. The last amd-only commit is ae56654bd26b8475fc000b1a2a8a80557fe7c83f on February 12, 2026. These commits form a frozen branch now called "aomp-23.0-0". See https://github.com/ROCm/llvm-project/tree/aomp-23.0-0.
The integrated ROCm components for this AOMP release were built with ROCM 7.2.0 sources.
This is the 1st AOMP release based on upstream LLVM 23 development.
These are the changes since 22.0-2:
- We now distribute release binaries in a tar file that was built on ManyLinux (AlmaLinux 8)
- Deprecated docs for installing debians/rpms on Ubuntu, SLES, RHEL, and CentOS
- Switched to ROCm 7.2 sources
- This release does not include the hip math libraries.
Flang updates:
- Fix type error when calling EXIT intrinsic
- Fix recursive mapper emission.
- Fix mapping of constant arrays
- Fixes error: 'hlfir.designate' op using value defined outside the region
- Allow assumed-size arrays on USE_DEVICE_ADDR clause
- Allow ALLOC/RELEASE in place of STORAGE in 6.0
- Add FIR and LLVM lowering support for prefetch directive
- Hoist static parallel region allocas to the entry block on the CPU
- Hoist static
allocas emitted by privateinitregions to the allocation IP - Add support for implicit linearization in omp.simd
rocm-7.2.0
AOMP Release 22.0-2
These are the release notes for AOMP 22.0-2. AOMP uses AMD developer modifications to the upstream LLVM development trunk. These differences are managed in a branch called "amd-staging". This branch is found in a mirror of upstream LLVM found at https://github.com/ROCm/llvm-project. The amd-staging branch is constantly changing as it merges the upstream development trunk with its downstream development updates. The AMD modifications are experimental while under review for the upstream trunk. AOMP uses a snapshot of amd-staging at the commit ids and dates listed below. AOMP also includes builds of related ROCm components. We call AOMP a "standalone" build as it does not use or require ROCm with the exception of the kernel module (amdgpu-dkms) and libdrm which are often part of the Linux distribution. AOMP is isolated from any ROCm installations by installing into /usr/lib/aomp and the use of RPATH for runtime libraries.
For AOMP 22.0-2, the last LLVM trunk commit is a0e0775d91859d6ca32efc5fb8644b6281e98593 on January 12, 2026. The last amd-only commit is 7652844035d910d0d9b4b3070ba515903ad3ca19 on January 18, 2026. These commits form a frozen branch now called "aomp-22.0-2". See https://github.com/ROCm/llvm-project/tree/aomp-22.0-2.
The integrated ROCm components for this AOMP release were built with ROCM 7.1.1 sources.
This is the 2nd AOMP release based on upstream LLVM 22 development.
These are the changes since 22.0-1:
- Switch to ROCm 7.1.1 sources
- Added a small patch to Tensile to fix issue with v_dot4 op_sel modifiers
- We recommend using ROCm 7.1 amdgpu-dkms driver with this release
Updates to flang:
- Fix declare target to implicit mapping
- Fix assert SIInstrInfo.cpp:907 AMDGPU::VGPR_32RegClass.contains(SrcReg)
- Defer descriptor mapping for assumed dummy argument types
- Fix negative array indexing with allocatable derived type array maps
- Add standalone tile support
- Implicitly map nested allocatable components in derived types
- Add support for -fno-fast-real-mod (now enabled as part of -ffast-math)
- Enable no-loop kernels
- User defined reductions are now supported
- Improved support for split distribute and parallel
- Add debug type support for procedure pointers
- Swap to attach semantics for descriptor mapping
- Emit module summary for Full LTO to improve link times
- Set REQUIRES flags on program unit symbol
- Fix USM
closesemantics anduse_device_ptr - Fixed flang issue (error: failure in HLFIR intrinsic simplification) with GenASiS
Errata:
- Flang - UMT runtime failure