sci-libs / rocBLAS

AMD's library for BLAS on ROCm

Official package sites : https://github.com/ROCm/rocBLAS ·

v6.3.0 :: 0/6.3 :: gentoo (Masked by set 7566)

Modified
License
BSD
Keywords
~amd64
USE flags
benchmark doc hipblaslt test

v6.1.1-r1 :: 0/6.1 :: gentoo

Modified
License
BSD
Keywords
~amd64
USE flags
benchmark doc test

v5.7.1-r2 :: 0/5.7 :: gentoo

Modified
License
BSD
Keywords
~amd64
USE flags
benchmark doc test

v5.4.2-r3 :: 0/5.4 :: gentoo

Modified
License
BSD
Keywords
~amd64
USE flags
benchmark doc test

v5.1.3-r4 :: 0/5.1 :: gentoo

Modified
License
BSD
Keywords
~amd64
USE flags
benchmark doc test

General

benchmark
Build and install rocblas-bench.
doc
Add extra documentation (API, Javadoc, etc). It is recommended to enable per package instead of globally
hipblaslt
Build with hipBLASLt for some non-batched and strided batched problems on gfx12.
test
Perform rocblas-test to compare the result between rocBLAS and system BLAS.

amdgpu_targets

gfx1010
RDNA GPU, codename navi10, including Radeon RX 5700XT/5700/5700M/5700B/5700XTB/5600XT/5600/5600M, Radeon Pro 5700XT/5700, Radeon Pro W5700X/W5700
gfx1011
RDNA GPU, codename navi12, including Radeon Pro 5600M/V520
gfx1012
RDNA GPU, codename navi14, including Radeon RX 5500XT/5500/5500M/5500XTB/5300/5300M, Radeon Pro 5500XT/5500M/5300/5300M, Radeon Pro W5500X/W5500/W5500M/W5300M
gfx1030
RDNA2 GPU, codename navi21/sienna cichlid, including Radeon RX 6950XT/6900XT/6800XT/6800, Radeon Pro W6800
gfx1031
RDNA2 GPU, codename navi22/navy flounder, including Radeon RX 6750XT/6700XT/6800M/6700M
gfx1100
RDNA3 GPU, codename navi31/plum bonito, including Radeon RX 7900XTX/7900XT, AMD Radeon Pro W7900/W7800
gfx1100
RDNA3 GPU, codename navi31/plum bonito, including Radeon RX 7900XTX/7900XT, AMD Radeon Pro W7900/W7800
gfx1101
RDNA3 GPU, codename navi32, including Radeon RX 7700XT/7800XT
gfx1102
RDNA3 GPU, codename navi33, including Radeon RX 7600/7600M/7600M XT/7700S/7600S, AMD Radeon PRO W7600/W7500
gfx803
Fiji GPU, codename fiji, including Radeon R9 Nano/Fury/FuryX, Radeon Pro Duo, FirePro S9300x2, Radeon Instinct MI8
gfx900
Vega GPU, codename vega10, including Radeon Vega Frontier Edition, Radeon RX Vega 56/64, Radeon RX Vega 64 Liquid, Radeon Pro Vega 48/56/64/64X, Radeon Pro WX 8200/9100, Radeon Pro V320/V340/SSG, Radeon Instinct MI25
gfx906
Vega GPU, codename vega20, including Radeon (Pro) VII, Radeon Instinct MI50/MI60
gfx908
CDNA Accelerator, codename arcturus, including AMD Instinct MI100 Accelerator
gfx90a
CDNA2 Accelerator, codename aldebaran, including AMD Instinct MI200 series Accelerators
gfx940
CDNA3 Accelerator, codename aqua_vangaram, MI300A rev 0
gfx941
CDNA3 Accelerator, codename aqua_vangaram, MI300X rev 0
gfx942
CDNA3 Accelerator, codename aqua_vangaram, MI300A and MI300X rev >=1

llvm_slot

18
Use LLVM 18.
19
Use LLVM 19.

video_cards

amdgpu
VIDEO_CARDS setting to build driver for AMDGPU video cards

dev-cpp / gtest : Google C++ Testing Framework

dev-cpp / msgpack-cxx : MessagePack for C++

dev-util / hip : C++ Heterogeneous-Compute Interface for Portability

llvm-runtimes / openmp : OpenMP runtime library for LLVM/clang compiler

virtual / blas : Virtual for FORTRAN 77 BLAS implementation

sci-libs / composable-kernel : High Performance Composable Kernel for AMD GPUs

sci-libs / hipBLAS : ROCm BLAS marshalling library

sci-libs / miopen : AMD's Machine Intelligence Library

sci-libs / rocSOLVER : Implementation of a subset of LAPACK functionality on the ROCm platform

810700
sci-libs/rocBLAS-4.3.0 fails tests
810721
sci-libs/rocBLAS-5.1.3-r4 does not respect CFLAGS/LDFLAGS
897784
sci-libs/rocBLAS-5.1.3-r2 installs unrecognized ELF file (HARDENED-SYSTEM)
908220
sci-libs/rocBLAS-5.4.2-r3 fails to compile
913371
sci-libs/rocBLAS-5.1.3-r4 installs files into unexpected paths (GLIBC-2.38-SYSTEM)
920554
sci-libs/rocBLAS-5.7.1-r1: TypeError: buildSourceCodeObjectFile() missing 2 required positional arguments: 'outputPath' and 'kernelFile'
922071
sci-libs/rocBLAS-5.7.1-r1 FAILED: Tensile/lib/CMakeFiles/TensileHost.dir/source/hip/HipSolutionAdapter.cpp.o /usr/include/c++/v1/deque:436:67: error: expected ')'
925087
sci-libs/rocBLAS-5.7.1-r1: 40G build size
931093
sci-libs/rocBLAS-5.7.1-r2: error: option 'cf-protection=return' and 'cf-protection=branch' cannot be specified on this target
931674
sci-libs/rocBLAS-5.7.1-r2 fails with IndexError: list index out of range
935047
sci-libs/rocBLAS-6.1.1 fails to compile

7566
Paul Zander
Masked for testing
  • >=dev-build/rocm-cmake-6.3.0
  • >=dev-libs/hipother-6.3.0
  • >=dev-libs/rccl-6.3.0
  • >=dev-libs/rocdbgapi-6.3.0
  • >=dev-libs/rocm-comgr-6.3.0
  • >=dev-libs/rocm-device-libs-6.3.0
  • >=dev-libs/rocm-opencl-runtime-6.3.0
  • >=dev-libs/rocr-runtime-6.3.0
  • >=dev-libs/roct-thunk-interface-6.3.0
  • >=dev-util/Tensile-6.3.0
  • >=dev-util/hip-6.3.0
  • >=dev-util/hipcc-6.3.0
  • >=dev-util/hipify-clang-6.3.0
  • >=dev-util/rocm-smi-6.3.0
  • >=dev-util/rocm_bandwidth_test-6.3.0
  • >=dev-util/rocminfo-6.3.0
  • >=dev-util/roctracer-6.3.0
  • >=sci-libs/composable-kernel-6.3.0
  • >=sci-libs/hipBLAS-6.3.0
  • >=sci-libs/hipBLAS-common-6.3.0
  • >=sci-libs/hipBLASLt-6.3.0
  • >=sci-libs/hipCUB-6.3.0
  • >=sci-libs/hipFFT-6.3.0
  • >=sci-libs/hipRAND-6.3.0
  • >=sci-libs/hipSOLVER-6.3.0
  • >=sci-libs/hipSPARSE-6.3.0
  • >=sci-libs/miopen-6.3.0
  • >=sci-libs/rocBLAS-6.3.0
  • >=sci-libs/rocFFT-6.3.0
  • >=sci-libs/rocPRIM-6.3.0
  • >=sci-libs/rocRAND-6.3.0
  • >=sci-libs/rocSOLVER-6.3.0
  • >=sci-libs/rocSPARSE-6.3.0
  • >=sci-libs/rocThrust-6.3.0
  • >=sci-libs/rocWMMA-6.3.0
  • >=sci-libs/rpp-6.3.0
Repository mirror & CI · gentoo
Merge updates from master
Sv. Lockal · gentoo
sci-libs/rocBLAS: fix sed expression for -m16c removal
Closes: https://bugs.gentoo.org/947599 Signed-off-by: Sv. Lockal <lockalsash@gmail.com> Closes: https://github.com/gentoo/gentoo/pull/40004 Signed-off-by: Sam James <sam@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Repository mirror & CI · gentoo
Merge updates from master
Michał Górny · gentoo
Move sys-libs/libomp → llvm-runtimes/openmp
Signed-off-by: Michał Górny <mgorny@gentoo.org>
Sv. Lockal · gentoo
sci-libs/rocBLAS: add 6.3.0
Also backport fix for 16-bit floats for 6.1.1 Closes: https://bugs.gentoo.org/944820 Signed-off-by: Sv. Lockal <lockalsash@gmail.com> Signed-off-by: Sam James <sam@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Patrick Lauer · gentoo
sci-libs/rocBLAS: Restrict to llvm-18
Explodes violently with llvm-19 Signed-off-by: Patrick Lauer <patrick@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Sv. Lockal · gentoo
sci-libs/rocBLAS: strip unsupported flags for potentially switched compiler
Also remove addpredict in src_configure (not needed after rocm_use_hipcc) Bug: https://bugs.gentoo.org/936099 Signed-off-by: Sv. Lockal <lockalsash@gmail.com> Signed-off-by: Sam James <sam@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Paul Zander · gentoo
sci-libs/rocBLAS: 6.1.1 make Tensile dependency optional
Building with Tensile requires actual hardware present, to avoid breaking CI make this optional. Signed-off-by: Paul Zander <negril.nx+gentoo@gmail.com> Signed-off-by: Sam James <sam@gentoo.org>
Sv. Lockal · gentoo
sci-libs/rocBLAS: add 6.1.1
Changes: * in updated expand-isa-compatibility patch do not coerce gfx1011 and gfx1012 to gfx1010, as Gentoo users can build rocBLAS for gfx1011 and gfx1012 with USE flags * add myself to maintainers Signed-off-by: Sv. Lockal <lockalsash@gmail.com> Signed-off-by: Sam James <sam@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Sv. Lockal · gentoo
sci-libs/rocBLAS: fix doxygen directory
Signed-off-by: Sv. Lockal <lockalsash@gmail.com> Signed-off-by: Sam James <sam@gentoo.org>
Sv. Lockal · gentoo
sci-libs/rocBLAS: disable stripping for *.hsaco files
Signed-off-by: Sv. Lockal <lockalsash@gmail.com> Signed-off-by: Sam James <sam@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Michał Górny · gentoo
Move {dev-util → dev-build}/rocm-cmake
Signed-off-by: Michał Górny <mgorny@gentoo.org>
Sv. Lockal · gentoo
sci-libs/rocBLAS: lock dev-util/hip version, as with hip-6.0 build fails with "no member named 'gcnArch'"
Signed-off-by: Sv. Lockal <lockalsash@gmail.com> Signed-off-by: Sam James <sam@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Benda Xu · gentoo
sci-libs/rocBLAS: re-enable tests.
Signed-off-by: Benda Xu <heroxbd@gentoo.org>
Sv. Lockal · gentoo
sci-libs/rocBLAS: add 5.7.1
Closes: https://bugs.gentoo.org/911816 Signed-off-by: Sv. Lockal <lockalsash@gmail.com> Signed-off-by: Benda Xu <heroxbd@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Benda Xu · gentoo
sci-libs/rocBLAS: drop versioned dependence on rocm-cmake.
Signed-off-by: Benda Xu <heroxbd@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Marek Szuba · gentoo
sci-libs/rocBLAS: adapt for msgpack-cxx-6.0.0
Signed-off-by: Marek Szuba <marecki@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Yiyang Wu · gentoo
sci-libs/rocBLAS: replace msgpack dep with msgpack-cxx
Bug: https://bugs.gentoo.org/893544 Signed-off-by: Yiyang Wu <xgreenlandforwyy@gmail.com> Signed-off-by: Marek Szuba <marecki@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Marek Szuba · gentoo
sci-libs/rocBLAS: block >=dev-libs/msgpack-5.0.0
For anyone who might try to fix this, it's a bit of a funny one - the CMake file which looks for "msgpack" rather than "msgpackc-cxx" is in fact part of dev-util/Tensile, not sci-libs/rocBLAS itself. The reason why it's rocBLAS that fails is that C/C++ bits of Tensile is installed as sources to be incorporated into other projects. Still, it's rocBLAS which has dev-libs/msgpack in its list of dependencies so this is where the version cap goes. Signed-off-by: Marek Szuba <marecki@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Repository mirror & CI · gentoo
Merge updates from master
Yiyang Wu · gentoo
sci-libs/rocBLAS: add 5.4.2
Closes: https://github.com/gentoo/gentoo/pull/29319 Signed-off-by: Yiyang Wu <xgreenlandforwyy@gmail.com> Signed-off-by: Benda Xu <heroxbd@gentoo.org>
Andreas Sturmlechner · gentoo
sci-libs/rocBLAS: drop 5.0.2-r2
Signed-off-by: Andreas Sturmlechner <asturm@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Andreas Sturmlechner · gentoo
sci-libs/rocBLAS: drop 4.3.0, 4.3.0-r1
Signed-off-by: Andreas Sturmlechner <asturm@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Sam James · gentoo
sci-libs/rocBLAS: fix PythonHasVersionUsage
Signed-off-by: Sam James <sam@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Repository mirror & CI · gentoo
Merge updates from master
Benda Xu · gentoo
sci-libs/rocBLAS: relax SLOT dependency on dev-util/hip.
This unlocks the exact SLOT dependency of the lower and higher level ROCm tools to make version bumps easier. Package-Manager: Portage-3.0.30, Repoman-3.0.3 Signed-off-by: Benda Xu <heroxbd@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Yiyang Wu · gentoo
sci-libs/rocBLAS: respect MAKEOPTS when build Tensile libs
Closes: https://github.com/gentoo/gentoo/pull/26285 Closes: https://bugs.gentoo.org/852236 Suggested-by: Alessandro Barbieri <lssndrbarbieri@gmail.com> Signed-off-by: Yiyang Wu <xgreenlandforwyy@gmail.com> Signed-off-by: Benda Xu <heroxbd@gentoo.org>
Yiyang Wu · gentoo
sci-libs/rocBLAS: add 5.1.3, using rocm.eclass
No need to rebuilt rocBLAS when hip upgrade because librocblas is dynamically linked to libhipamd, while llvm SLOT does not matter because it is only used to produce GPU kernels, libclang and libLLVM does not linked in. Signed-off-by: Yiyang Wu <xgreenlandforwyy@gmail.com> Signed-off-by: Benda Xu <heroxbd@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Repository mirror & CI · gentoo
Merge updates from master
YiyangWu · gentoo
sci-libs/rocBLAS-4.3.0: re-enable Tensile library
890148bb81f517304f86f41303534b0abaa7c77c disables compiling Tensile library (maybe due to dev-util/Tensile not properly configured). After correcting paths in Tensile, compiling libraries ican be re-enabled. Also fix test fail due to library not found Closes: https://github.com/gentoo/gentoo/pull/24679 Package-Manager: Portage-3.0.30, Repoman-3.0.3 Signed-off-by: Yiyang Wu <xgreenlandforwyy@gmail.com> Signed-off-by: Benda Xu <heroxbd@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Andrew Ammerlaan · gentoo
sci-libs/rocBLAS: unbundle Tensile
Package-Manager: Portage-3.0.30, Repoman-3.0.3 Signed-off-by: Andrew Ammerlaan <andrewammerlaan@gentoo.org>
YiyangWu · gentoo
sci-libs/rocBLAS: bump version to 5.0.2
Enable nai22 chips (Radeon RX 6700/6700 XT / 6800M, gfx1031) fix test issue when PORTAGE_USERNAME is not portage: "[: too many arguments" Closes: https://bugs.gentoo.org/834672 Package-Manager: Portage-3.0.30, Repoman-3.0.3 Signed-off-by: Yiyang Wu <xgreenlandforwyy@gmail.com> Signed-off-by: Benda Xu <heroxbd@gentoo.org>
YiyangWu · gentoo
sci-libs/rocBLAS: fix LICENSE
The rocBLAS license is BSD, but is mistakenly set to MIT in ebuild Package-Manager: Portage-3.0.30, Repoman-3.0.3 Signed-off-by: Yiyang Wu <xgreenlandforwyy@gmail.com> Closes: https://github.com/gentoo/gentoo/pull/23450 Signed-off-by: Andrew Ammerlaan <andrewammerlaan@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
Sam James · gentoo
sci-libs/rocBLAS: drop trailing full stop from DESCRIPTION
Package-Manager: Portage-3.0.30, Repoman-3.0.3 Signed-off-by: Sam James <sam@gentoo.org>
YiyangWu · gentoo
sci-libs/rocBLAS: fix tests using openblas as cpu reference
Add a patch that classify 6 test suites as known_bug. Upstream will fix this issue in future releases, and they recommand this as solution for current version. Closes: https://github.com/ROCmSoftwarePlatform/rocBLAS/issues/1202 Package-Manager: Portage-3.0.22, Repoman-3.0.3 Signed-off-by: Yiyang Wu <xgreenlandforwyy@gmail.com> Signed-off-by: Andrew Ammerlaan <andrewammerlaan@gentoo.org>
YiyangWu · gentoo
sci-libs/rocBLAS: remove unused cmake variable
__skip_rocmclang is used to avoid configuration error for cmake-3.21.(1|2), which don't exist among ebuilds anymore, so this flag is not recognized. Package-Manager: Portage-3.0.22, Repoman-3.0.3 Signed-off-by: Yiyang Wu <xgreenlandforwyy@gmail.com> Signed-off-by: Andrew Ammerlaan <andrewammerlaan@gentoo.org>
YiyangWu · gentoo
sci-libs/rocBLAS: let TensileCreateLibrary respects MAKEOPTS
Originaly TensileCreateLibrary consumes all cpu detected. This modification will inject $(makeopts_jobs) into TensileCreateLibrary.py so it respects MAKEOPTS Closes: https://bugs.gentoo.org/822828 Package-Manager: Portage-3.0.22, Repoman-3.0.3 Signed-off-by: Yiyang Wu <xgreenlandforwyy@gmail.com> Signed-off-by: Andrew Ammerlaan <andrewammerlaan@gentoo.org>
Repository mirror & CI · gentoo
Merge updates from master
YiyangWu · gentoo
sci-libs/rocBLAS: Next generation BLAS implementation for ROCm
Migrated from ::science, rocBLAS is the core math libraries on ROCm platform for high performance computing on AMDGPU, dependency of MIOpen. Current version 4.3.0, support nearly all major AMD GPUs, especially for GCN, CDNA and RDNA2 (nightly) cards. Closes: https://bugs.gentoo.org/732952 Closes: https://bugs.gentoo.org/804181 Closes: https://github.com/gentoo/gentoo/pull/22065 Package-Manager: Portage-3.0.20, Repoman-3.0.3 Signed-off-by: Yiyang Wu <xgreenlandforwyy@gmail.com> Signed-off-by: Benda Xu <heroxbd@gentoo.org>