Commit Graph

8966 Commits

Author SHA1 Message Date
Martin Kroeker
2891fd8d6d Replace while loop with for 2025-01-08 23:17:45 +01:00
Martin Kroeker
69d3541488 Merge pull request #5055 from fleclairTT/develop
remove fastmath from x280 target
2025-01-07 15:21:41 -08:00
Felix LeClair
d0350a08b7 Merge pull request #1 from fleclairTT/fleclairTT-remove-fast-math
Update Makefile.riscv64
2025-01-07 17:11:13 -05:00
Felix LeClair
05dce05c24 Update Makefile.riscv64
remove fast-math
2025-01-07 17:10:34 -05:00
Martin Kroeker
74be676936 Merge pull request #5049 from rgommers/docs-buildsystem-page
Rewrite the "Build system" documentation page
2025-01-04 14:14:24 -08:00
Martin Kroeker
91d861a0d4 Merge pull request #5048 from barracuda156/powerpc
PPC970: fix scal
2025-01-04 12:34:45 -08:00
Ralf Gommers
f697cfe0d0 docs: improve the rendering of the HarmonyOS build instructions 2025-01-04 21:18:07 +01:00
Ralf Gommers
eda80f436a docs: improve rendering of Windows on Arm instructions 2025-01-04 21:10:43 +01:00
Ralf Gommers
1833e68bee docs: improve rendering of "Runtime variables" page 2025-01-04 20:55:39 +01:00
Ralf Gommers
c0bf48fbf3 docs: remove warning on the Build system page
Content is reviewed fairly carefully, and should be up to the
same standard as the rest of the docs now.
2025-01-04 20:42:10 +01:00
Ralf Gommers
f764d76a4a docs: improve the Makefile dependency graph
Uses Mermaid to render it as a diagram in the html docs.
2025-01-04 20:42:05 +01:00
Ralf Gommers
5aa1845a43 docs: fix two broken links related to MSVC
The doc build is now clean of warnings again.
2025-01-04 20:37:47 +01:00
Ralf Gommers
ed114150d1 docs: add the build variables for BLAS/LAPACK functionality 2025-01-04 20:37:47 +01:00
Ralf Gommers
c526b10b68 docs: add library and symbol name build variables 2025-01-04 20:37:47 +01:00
Ralf Gommers
d4addc0688 docs: improve description of library, data type and toolchain build variables 2025-01-04 20:37:47 +01:00
Ralf Gommers
970e48e9e5 docs: improve readability of the Build system page
This only fixes Markdown syntax, and adds a few headers to bring
some structure into the long list of variables that influence the
build. It does not add or remove variables.
2025-01-04 20:37:47 +01:00
Martin Kroeker
0be97edd7e Merge pull request #5047 from martin-frbg/issue3967
Add a documentation page for the runtime variables
2025-01-04 09:00:22 -08:00
Sergey Fedorov
229efa42ff scal.S: use r11 on 32-bit Darwin on powerpc 2025-01-05 00:31:27 +08:00
Sergey Fedorov
81e1be8d90 Revert "temporarily disable the default S/DSCAL kernel"
This reverts commit 9b9c0aa5c9.
2025-01-04 22:54:54 +08:00
Martin Kroeker
b6c906f805 Add OPENBLAS_L2_SIZE and improve formatting 2025-01-04 13:02:35 +01:00
Martin Kroeker
33b9e5b143 Add a documentation page for the runtime variables 2025-01-03 23:32:34 +01:00
Martin Kroeker
6f9b9e2da8 Merge pull request #5046 from martin-frbg/doc5044
Add build instructions for Huawei's HarmonyOS to the documentation
2025-01-03 13:50:26 -08:00
Martin Kroeker
afd572511f Add build instructions for Huawei's HarmonyOS from #5043 2025-01-03 22:47:38 +01:00
Martin Kroeker
a71041e402 Merge pull request #5045 from martin-frbg/issue5034
Temporarily disable the default S/DSCAL kernel on PPC970
2025-01-03 12:45:39 -08:00
Martin Kroeker
9b9c0aa5c9 temporarily disable the default S/DSCAL kernel 2025-01-03 21:36:46 +01:00
Martin Kroeker
52c7695be1 Merge pull request #5044 from martin-frbg/issue5043
[WIP,Testing] Add CI workflow to test crossbuilding for Huawei HarmonyOS
2025-01-03 11:30:09 -08:00
Martin Kroeker
14e68355d2 Update harmonyos.yml 2025-01-03 18:29:39 +01:00
Martin Kroeker
593427c8a1 Update harmonyos.yml 2025-01-03 18:10:05 +01:00
Martin Kroeker
67bbde71e5 Update harmonyos.yml 2025-01-03 17:47:17 +01:00
Martin Kroeker
108bf599ae Create harmonyos.yml 2025-01-03 17:19:41 +01:00
Martin Kroeker
e4f83d4485 Merge pull request #5041 from martin-frbg/issue2715
Identify all cores in ARM64 autodetection, return fastest TARGET and performance group sizes
2025-01-02 10:22:37 -08:00
Martin Kroeker
7fd73a40dc Fix accidentally dropped cpu ids and add MacOS performance groups 2025-01-02 06:13:07 -08:00
Martin Kroeker
a182251284 fix typo 2025-01-02 00:04:33 +01:00
Martin Kroeker
ed95791618 fix conflicting variables 2025-01-01 23:27:38 +01:00
Martin Kroeker
3c3d1c4849 Identify all cores and select the most performant one as TARGET 2025-01-01 22:21:29 +01:00
Martin Kroeker
be807c98a6 Identify all cores, group by performance and report the fastest TARGET 2025-01-01 21:42:10 +01:00
Martin Kroeker
a63282a688 Merge pull request #5037 from tingboliao/develop
Optimize the nrm2_rvv function to further improve performance.
2024-12-31 21:08:28 +01:00
Martin Kroeker
2f86913209 Merge pull request #5040 from martin-frbg/issue922
Add an install_tests target to facilitate testing cross-compiles
2024-12-31 19:29:19 +01:00
Martin Kroeker
e9ff70b394 Add an install_tests target to facilitate testing on cross-compiled targets 2024-12-31 15:55:13 +01:00
Martin Kroeker
85a33326a1 Merge pull request #5039 from martin-frbg/fixgmakenaming
Fix "make install" creating incorrect names for suffixed libraries in the cmake and pkgconfig files
2024-12-31 15:52:50 +01:00
Martin Kroeker
6ad793d65e Fix naming of suffixed libraries in the cmake and pkgconfig files 2024-12-31 14:34:55 +01:00
Martin Kroeker
0a2d9aaf32 Merge pull request #4982 from svillemot/develop
Restore libsuffix support in pkg-config file
2024-12-31 14:30:49 +01:00
Martin Kroeker
9297c46dfb Merge pull request #5036 from martin-frbg/issue4032
Add a paragraph on "benign" LAPACK-TEST errors to the FAQ document
2024-12-31 08:53:41 +01:00
tingbo.liao
c37509c213 Optimize the nrm2_rvv function to further improve performance.
Signed-off-by: tingbo.liao <tingbo.liao@starfivetech.com>
2024-12-31 08:46:55 +08:00
Martin Kroeker
a1075477c3 Merge pull request #4994 from martin-frbg/issue4886
Disable multithreading in ?TRTRI for small workloads
2024-12-30 23:10:55 +01:00
Martin Kroeker
fff2e214ca Add LAPACK-TEST errors topic 2024-12-30 23:05:17 +01:00
Martin Kroeker
718fb73bd8 Merge pull request #4976 from martin-frbg/m3m_exprec
[WIP]Add better workaround for GEMM3M on GENERIC and re-enable EXPRECISION for x86/x86_64 targets
2024-12-30 18:55:21 +01:00
Martin Kroeker
73527aab3c Merge pull request #5030 from tingboliao/develop
Optimize the zgemm_tcopy_4_rvv function to be compatible with the situations where the vector lengths(vlens) are 128 and 256.
2024-12-30 16:02:46 +01:00
Martin Kroeker
c1258662db Merge branch 'OpenMathLib:develop' into m3m_exprec 2024-12-30 15:58:15 +01:00
Martin Kroeker
36b0fb3aff Merge pull request #5035 from martin-frbg/issue4396
Improve OpenBLASConfig.cmake contents in gmake builds
2024-12-30 09:34:33 +01:00