rocBLAS 2.39.0 for ROCm 4.3.0

@saadrahim released this 30 Jul 22:51
8328dcc

Optimizations

  • Improved performance of non-batched and batched rocblas_Xgemv for gfx908 when m <= 15000 and n <= 15000
  • Improved performance of non-batched and batched rocblas_sgemv and rocblas_dgemv for gfx906 when m <= 6000 and n <= 6000
  • Improved the overall performance of non-batched and batched rocblas_cgemv for gfx906
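
For readers unfamiliar with the routine, the m and n in the thresholds above are the row and column counts of the matrix passed to gemv. A minimal CPU sketch of the non-transposed, single-precision operation follows, assuming the standard BLAS definition (y = alpha*A*x + beta*y, column-major storage, unit vector strides). The name `reference_sgemv` is hypothetical and not part of rocBLAS; it only illustrates what the optimized kernels compute and says nothing about how they are implemented.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical CPU reference, for illustration only (not a rocBLAS API).
// Computes y = alpha * A * x + beta * y for an m x n matrix A stored
// column-major with leading dimension lda >= m, following the BLAS convention.
// x has n elements and y has m elements; both use unit stride.
void reference_sgemv(int m, int n, float alpha,
                     const std::vector<float>& A, int lda,
                     const std::vector<float>& x,
                     float beta, std::vector<float>& y)
{
    for (int i = 0; i < m; ++i) {
        float acc = 0.0f;
        for (int j = 0; j < n; ++j) {
            // Element (i, j) of a column-major matrix lives at j * lda + i.
            acc += A[static_cast<std::size_t>(j) * lda + i] * x[j];
        }
        y[i] = alpha * acc + beta * y[i];
    }
}
```

A reference like this can serve as a correctness check for small inputs when comparing against GPU results.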

Changed

  • Internal-use-only APIs are now prefixed with rocblas_internal_ and marked as deprecated to discourage their use