Skip to content

OpenBLAS 0.3.26 version

Compare
Choose a tag to compare
@martin-frbg martin-frbg released this 02 Jan 21:27
· 855 commits to release-0.3.0 since this release
6c77e5e

general:

  • improved the version of openblas.pc that is created by the CMAKE build
  • fixed a CMAKE-specific build problems on older versions of MacOS
  • worked around linking problems on old versions of MacOS
  • corrected installation location of the lapacke_mangling header in CMAKE builds
  • added type declarations for complex variables to the MSVC-specific parts of the LAPACK header
  • significantly sped up ?GESV for small problem sizes by introducing a lower bound for multithreading
  • imported additions and corrections from the Reference-LAPACK project:
    • added new LAPACK functions for truncated QR with pivoting (Reference-LAPACK PRs 891&941)
    • handle miscalculation of minimum work array size in corner cases (Reference-LAPACK PR 942)
    • fixed use of uninitialized variables in ?GEDMD and improved inline documentation (PR 959)
    • fixed use of uninitialized variables (and consequential failures) in ?BBCSD (PR 967)
    • added tests for the recently introduced Dynamic Mode Decomposition functions (PR 736)
    • fixed several memory leaks in the LAPACK testsuite (PR 953)
    • fixed counting of testsuite results by the Python script (PR 954)

x86-64:

  • fixed computation of CASUM on SkylakeX and newer targets in the special
    case that AVX512 is not supported by the compiler or operating environment
  • fixed potential undefined behaviour in the CASUM/ZASUM kernels for AVX512 targets
  • worked around a problem in the pre-AVX kernel for GEMV
  • sped up the thread management code on MS Windows

arm64:

  • fixed building of the LAPACK testsuite with Xcode 15 on Apple M1 and newer
  • sped up the thread management code on MS Windows
  • sped up SGEMM and DGEMM on Neoverse V1
  • sped up ?DOT on SVE-capable targets
  • reduced the number of targets in DYNAMIC_ARCH builds by eliminating functionally equivalent ones
  • included support for Apple M1 and newer targets in DYNAMIC_ARCH builds

power:

  • improved the SGEMM kernel for POWER10
  • fixed compilation with (very) old versions of gcc
  • fixed detection of old 32bit PPC targets in CMAKE-based builds
  • added autodetection of the POWERPC 7400 subtype
  • fixed CMAKE-based compilation for PPCG4 and PPC970 targets

loongarch64:

  • added and improved optimized kernels for almost all BLAS functions

md5sums:
bd496a1c81769ed19a161c1f8f904ccd OpenBLAS-0.3.26.tar.gz
f2524d2eaa55e9c2bad4d203401d4c7f OpenBLAS-0.3.26.zip
739d5666e46b046425b932fb83ce5571 OpenBLAS-0.3.26-x86.zip
3b573471bbc7639b896d1aab356b7e57 OpenBLAS-0.3.26-x64.zip
7522e53dfb4c8c3207c191e66de59430 OpenBLAS-0.3.26-x64-64.zip
(note that you need to edit the paths in the openblas.pc and OpenBLASConfig.cmake files of the Windows binary packages to reflect
your installation location, if you plan to have OpenBLAS findable via pkgconfig or cmake on your Windows system)

Download OpenBLAS