Releases · ROCm/rocSPARSE

21 Jan 18:58

rocm-ci

rocm-7.2.0

89dc267

rocSPARSE 4.2.0 for ROCm 7.2.0 Latest

Latest

Added

Added sliced ELL format support to the rocsparse_spmv routine.
Added the rocsparse_sptrsv and rocsparse_sptrsm routines for triangular solve.
Added the --clients-only option to the install.sh and rmake.py scripts to only build the clients for a version of rocSPARSE that is already installed.
Added nnz split algorithm rocsparse_spmv_alg_csr_nnzsplit to rocsparse_spmv. This algorithm might be superior to the existing adaptive algorithm rocsparse_spmv_alg_csr_adaptive when running the computation a small number of times because it avoids paying the analysis cost of the adaptive algorithm.

Changed

Make rocBLAS a requirement when it's requested when building from source. Previously, rocBLAS was not used if it could not be found. To opt out of using rocblas when building from source, use the --no-rocblas option with the install.sh or rmake.py build scripts.

Optimized

Significantly improved the rocsparse_sddmm routine when using CSR format, especially as the number of columns in the dense A matrix (or rows in the dense B matrix) increase.
Improved the user documentation.

Resolved issues

Fix the rmake.py build script to properly handle auto and all options when selecting offload targets.
Fix building rocSPARSE with the install script on centOS 9.
Fix std::fma casting in host routines to properly deduce types. This could have previously caused compilation failures when building from source.

Assets 2

26 Nov 07:19

rocm-ci

rocm-7.1.1

a6c0029

rocsparse 4.1.0 for ROCm 7.1.1

rocSPARSE code for ROCm 7.1.1 did not change. The library was rebuilt for the updated ROCm 7.1.1 stack.

Assets 2

30 Oct 05:52

rocm-ci

rocm-7.1.0

a6c0029

rocSPARSE 4.1.0 for ROCm 7.1.0

Added

Added brain half float mixed precision to rocsparse_axpby where X and Y use bfloat16 and result and the compute type use float.
Added brain half float mixed precision to rocsparse_spvv where X and Y use bfloat16 and result and the compute type use float.
Added brain half float mixed precision to rocsparse_spmv where A and X use bfloat16 and Y and the compute type use float.
Added brain half float mixed precision to rocsparse_spmm where A and B use bfloat16 and C and the compute type use float.
Added brain half float mixed precision to rocsparse_sddmm where A and B use bfloat16 and C and the compute type use float.
Added brain half float mixed precision to rocsparse_sddmm where A and B and C use bfloat16 and the compute type use float.
Added half float mixed precision to rocsparse_sddmm where A and B and C use float16 and the compute type use float.
Added brain half float uniform precision to rocsparse_scatter and rocsparse_gather routines.

Optimized

Improved the user documentation.

Upcoming changes

Deprecate trace, debug, and bench logging using environment variable ROCSPARSE_LAYER.

Assets 2

10 Oct 12:12

rocm-ci

rocm-7.0.2

8946cdc

rocSPARSE 4.0.3 for ROCm 7.0.2

Resolved issues

Resolved an issue causing premature deallocation of internal buffers still in use.

Assets 2

17 Sep 16:37

rocm-ci

rocm-7.0.1

dc63d50

rocsparse 4.0.2 for ROCm 7.0.1

rocSPARSE code for ROCm 7.0.1 did not change. The library was rebuilt for the updated ROCm 7.0.1 stack.

Assets 2

16 Sep 06:32

rocm-ci

rocm-7.0.0

dc63d50

rocSPARSE 4.0.2 for ROCm 7.0.0

Added

Adds SpGEAM generic routine for computing sparse matrix addition in CSR format
Adds v2_SpMV generic routine for computing sparse matrix vector multiplication. As opposed to the deprecated rocsparse_spmv routine, this routine does not use a fallback algorithm if a non-implemented configuration is encountered and will return an error in such a case. For the deprecated routine rocsparse_spmv, the user can enable warning messages in situations where a fallback algorithm is used by either calling upfront the routine rocsparse_enable_debug or exporting the variable ROCSPARSE_DEBUG (with the shell command export ROCSPARSE_DEBUG=1).
Adds half float mixed precision to rocsparse_axpby where X and Y use float16 and result and the compute type use float
Adds half float mixed precision to rocsparse_spvv where X and Y use float16 and result and the compute type use float
Adds half float mixed precision to rocsparse_spmv where A and X use float16 and Y and the compute type use float
Adds half float mixed precision to rocsparse_spmm where A and B use float16 and C and the compute type use float
Adds half float mixed precision to rocsparse_sddmm where A and B use float16 and C and the compute type use float
Adds half float uniform precision to rocsparse_scatter and rocsparse_gather routines
Adds half float uniform precision to rocsparse_sddmm routine
Added rocsparse_spmv_alg_csr_rowsplit algorithm.
Added support for gfx950
Add ROC-TX instrumentation support in rocSPARSE (not available on Windows or in the static library version on Linux).
Added the almalinux OS name to correct the gfortran dependency

Changed

Switch to defaulting to C++17 when building rocSPARSE from source. Previously rocSPARSE was using C++14 by default.

Optimized

Reduced the number of template instantiations in the library to further reduce the shared library binary size and improve compile times
Allow SpGEMM routines to use more shared memory when available. This can speed up performance for matrices with a large number of intermediate products.
Use of the rocsparse_spmv_alg_csr_adaptive or rocsparse_spmv_alg_csr_default algorithms in rocsparse_spmv to perform transposed sparse matrix multiplication (C=alpha*A^T*x+beta*y) resulted in unnecessary analysis on A and needless slowdown during the analysis phase. This has been fixed by skipping the analysis when performing the transposed sparse matrix multiplication.
Improved the user documentation

Resolved issues

Fixed an issue in the public headers where extern "C" was not wrapped by #ifdef __cplusplus, which caused failures when building C programs with rocSPARSE.
Fixed a memory access fault in the rocsparse_Xbsrilu0 routines.
Fixed failures that could occur in rocsparse_Xbsrsm_solve or rocsparse_spsm with BSR format when using host pointer mode.
Fixed ASAN compilation failures
Fixed failure that occurred when using const descriptor rocsparse_create_const_csr_descr with the generic routine rocsparse_sparse_to_sparse. Issue was not observed when using non-const descriptor rocsparse_create_csr_descr with rocsparse_sparse_to_sparse.
Fixed a memory leak in the rocsparse handle

Removed

The deprecated rocsparse_spmv_ex routine
The deprecated rocsparse_sbsrmv_ex, rocsparse_dbsrmv_ex, rocsparse_cbsrmv_ex, and rocsparse_zbsrmv_ex routines
The deprecated rocsparse_sbsrmv_ex_analysis, rocsparse_dbsrmv_ex_analysis, rocsparse_cbsrmv_ex_analysis, and rocsparse_zbsrmv_ex_analysis routines

Upcoming changes

Deprecated the rocsparse_spmv routine. Users should use the rocsparse_v2_spmv routine going forward.
Deprecated rocsparse_spmv_alg_csr_stream algorithm. Users should use the rocsparse_spmv_alg_csr_rowsplit algorithm going forward.
Deprecated the rocsparse_itilu0_alg_sync_split_fusion algorithm. Users should use one of rocsparse_itilu0_alg_async_inplace, rocsparse_itilu0_alg_async_split, or rocsparse_itilu0_alg_sync_split going forward.

Assets 2

24 Sep 14:02

rocm-ci

rocm-6.4.4

8fbfc79

rocSPARSE 3.4.0 for ROCm 6.4.4

rocSPARSE code for ROCm 6.4.4 did not change. The library was rebuilt for the updated ROCm 6.4.4 stack.

Assets 2

07 Aug 14:20

rocm-ci

rocm-6.4.3

8fbfc79

rocSPARSE 3.4.0 for ROCm 6.4.3

rocSPARSE code for ROCm 6.4.3 did not change. The library was rebuilt for the updated ROCm 6.4.3 stack.

Assets 2

21 Jul 16:54

rocm-ci

rocm-6.4.2

8fbfc79

rocSPARSE 3.4.0 for ROCm 6.4.2

rocSPARSE code for ROCm 6.4.2 did not change. The library was rebuilt for the updated ROCm 6.4.2 stack.

Assets 2

20 May 13:16

rocm-ci

rocm-6.4.1

4953add

rocSPARSE 3.4.0 for ROCm 6.4.1

rocSPARSE code for ROCm 6.4.1 did not change. The library was rebuilt for the updated ROCm 6.4.1 stack.

Assets 2

Releases: ROCm/rocSPARSE

rocSPARSE 4.2.0 for ROCm 7.2.0

Added

Changed

Optimized

Resolved issues

Uh oh!

rocsparse 4.1.0 for ROCm 7.1.1

Uh oh!

rocSPARSE 4.1.0 for ROCm 7.1.0

Added

Optimized

Upcoming changes

Uh oh!

rocSPARSE 4.0.3 for ROCm 7.0.2

Resolved issues

Uh oh!

rocsparse 4.0.2 for ROCm 7.0.1

Uh oh!

rocSPARSE 4.0.2 for ROCm 7.0.0

Added

Changed

Optimized

Resolved issues

Removed

Upcoming changes

Uh oh!

rocSPARSE 3.4.0 for ROCm 6.4.4

Uh oh!

rocSPARSE 3.4.0 for ROCm 6.4.3

Uh oh!

rocSPARSE 3.4.0 for ROCm 6.4.2

Uh oh!

rocSPARSE 3.4.0 for ROCm 6.4.1

Uh oh!