Commit Graph

8825 Commits

Author SHA1 Message Date
Martin Kroeker
3a63bbabd1 Add compiler version notes and mention the f2c fallback LAPACK 2024-11-22 12:10:56 +01:00
Martin Kroeker
c520ed1916 Merge pull request #4984 from rgommers/docs-link
doc: update README to link to the html docs and fix links
2024-11-22 01:58:45 -08:00
Ralf Gommers
a0131e56e0 doc: update README to link to the html docs and fix links
Also some minor formatting improvements and linking the home page.
2024-11-21 13:57:40 +01:00
Martin Kroeker
18014b04c8 Merge pull request #4979 from martin-frbg/issue4978-2
Remove any optimization flags from DEBUG builds on POWER architecture
2024-11-17 15:11:18 -08:00
Martin Kroeker
9db51f790a Remove any optimization flags from DEBUG builds on POWER architecture 2024-11-17 23:19:58 +01:00
Martin Kroeker
e334b79b47 Merge pull request #4977 from martin-frbg/issue4973
Add dummy implementations of openblas_get/set_affinity for OpenMP builds
2024-11-17 11:06:20 -08:00
Martin Kroeker
4060dd43e3 Add dummy implementations of openblas_get/set_affinity 2024-11-15 15:16:17 -08:00
Martin Kroeker
2e2f952bfb Merge pull request #4975 from martin-frbg/fixup4974
Update Cray compiler options and calling convention in CMake
2024-11-13 23:51:47 -08:00
Martin Kroeker
cea9df3643 Update Cray compiler options and calling convention 2024-11-13 14:56:30 -08:00
Martin Kroeker
3e7e312d7d Merge pull request #4974 from cenewcombe/develop
Corrections for Cray and Nvidia Fortran compiler calling conventions
2024-11-13 14:50:19 -08:00
Caroline Newcombe
10cf06dce1 Merge branch 'OpenMathLib:develop' into develop 2024-11-13 14:05:57 -06:00
Caroline Newcombe
760bf7aa37 Update Fortran return for complex data types (Cray and Nvidia compilers) 2024-11-13 14:05:20 -06:00
Martin Kroeker
2057cd75fd Merge pull request #4971 from martin-frbg/m4init
Add initial autodetection support for Apple M4 as VORTEX with HAVE_SME
2024-11-13 06:14:32 -08:00
Martin Kroeker
50da5c2b34 Add Apple M4 as VORTEX with HAVE_SME 2024-11-12 15:20:21 -08:00
Martin Kroeker
24c5152fbd Merge pull request #4966 from martin-frbg/getrf_indent
Fix misleading indentation warning in getrf
2024-11-06 23:04:02 +01:00
Martin Kroeker
2332ea7e7a fix misleading indentation 2024-11-06 18:35:31 +01:00
Martin Kroeker
794f75a8f7 Merge pull request #4965 from martin-frbg/homebrewjob-2
CI: Update upload-artifacts version in the homebrew workflow again
2024-11-06 18:04:05 +01:00
Martin Kroeker
87a18154ce Update version of upload-artifacts again 2024-11-06 14:44:11 +01:00
Martin Kroeker
453b9e4886 Merge pull request #4961 from h-vetinari/flang
explicitly link to OpenMP
2024-10-31 17:46:55 +01:00
H. Vetinari
d3272e51eb explicitly link to OpenMP 2024-11-01 00:55:50 +11:00
Martin Kroeker
c634114c8a Merge pull request #4960 from martin-frbg/gemmtr
Provide copies of GEMMT under its new name GEMMTR adopted by the Reference BLAS
2024-10-30 18:37:26 +01:00
Martin Kroeker
598bd21991 Merge pull request #4958 from XiWeiGu/x86_64_opt_somatcopy_ct_avx
x86_64: opt somatcopy_ct with AVX
2024-10-30 16:04:00 +01:00
Martin Kroeker
c9d1a3b912 Merge pull request #4954 from XiWeiGu/la64_update_gh_actions
gh-actions: Test LoongArch64 with gcc14 from Ubuntu 24.04
2024-10-30 14:35:57 +01:00
Martin Kroeker
c3e7d08fb4 Copy GEMMT to its new name GEMMTR 2024-10-30 12:56:16 +01:00
Martin Kroeker
0cf656fd3e Add copies of GEMMT under its new name GEMMTR 2024-10-30 12:55:14 +01:00
Martin Kroeker
24b5ccaf4b Merge pull request #4202 from TiborGY/inlines_pt2
Remove some unused inline macro definitions
2024-10-30 12:48:35 +01:00
gxw
73c6a28073 x86_64: opt somatcopy_ct with AVX 2024-10-29 07:06:15 +00:00
Martin Kroeker
f66e6d32c2 Merge pull request #4953 from NickelWenzel/fix_trtrs_return_types
fix: return types of *trtrs routines
2024-10-25 23:29:24 +02:00
Martin Kroeker
a8bb105ed6 Merge pull request #4848 from haampie/fix/cmake-min-version
cmake: set `CMP0042` to `NEW`
2024-10-25 20:59:13 +02:00
Martin Kroeker
0e6a2cc93c bump the minimum_required version instead 2024-10-25 16:47:52 +02:00
TGY
815cb24944 remove unused INLINE macro definitions 2024-10-25 15:29:51 +02:00
Martin Kroeker
ac736820d7 Merge pull request #4955 from cdaley/optimize_gemv_forwarding
Optimize gemv forwarding on ARM64 systems
2024-10-25 13:43:54 +02:00
gxw
8f595382c4 gh-actions: Test LoongArch64 with gcc14 from Ubuntu 24.04 2024-10-25 05:00:54 +00:00
Chris Daley
cb48505251 optimize gemv forwarding on ARM64 systems 2024-10-24 21:05:26 -07:00
nickel
79f4bbd4cd fix: return types of *trtrs routines 2024-10-24 11:20:02 +02:00
Martin Kroeker
72461f1c8c Merge pull request #4950 from ayappanec/fix-aix-build
Fix AIX build
2024-10-23 16:40:02 +02:00
Ayappan Perumal
020cce1068 Fix build issues with gcc compiler as well 2024-10-23 04:24:06 -05:00
Ayappan Perumal
b6ec73e77c Fix AIX build 2024-10-21 07:38:03 -05:00
Martin Kroeker
8a0cd5fcef Merge pull request #4949 from martin-frbg/mingw32-14.2
work around mingw32-gfortran 14.2 miscompiling CBLAS1 tests
2024-10-20 21:52:57 +02:00
Martin Kroeker
4dba6ce6ea work around mingw32-gfortran 14.2 miscompiling CBLAS1 tests 2024-10-20 20:25:06 +02:00
Martin Kroeker
a93ec74e95 Merge pull request #4948 from martin-frbg/fixhavesve
Properly report HAVE_SVE in ARM64 autodetection where applicable
2024-10-18 20:00:42 +02:00
Martin Kroeker
c4bb4e74fc NeoverseN2 has SVE too 2024-10-18 14:50:55 +02:00
Martin Kroeker
86720778ef write HAVE_SVE to config where applicable 2024-10-18 14:14:43 +02:00
Martin Kroeker
016bdb9b0b Merge pull request #4946 from XiWeiGu/la64_omatcopy_lasx
LoongArch64: Opt somatcopy with LASX
2024-10-18 14:03:06 +02:00
gxw
ffaa5765a4 Bench: Add omatcopy 2024-10-18 11:07:52 +08:00
Martin Kroeker
a93897276b Merge pull request #4943 from martin-frbg/update_readme
Update README.md
2024-10-17 21:13:48 +02:00
Martin Kroeker
3fc1225dd6 Merge branch 'OpenMathLib:develop' into update_readme 2024-10-17 21:08:58 +02:00
Martin Kroeker
33078d11e4 stress importance of TARGET setting in DYNAMIC_ARCH builds 2024-10-17 21:07:49 +02:00
Martin Kroeker
15a57598f5 Merge pull request #4944 from ChipKerchner/vectorizeBF16GEMV
[POWER] Vectorize BF16 GEMV
2024-10-17 19:21:07 +02:00
Chip Kerchner
ab71a1edf2 Better VSX. 2024-10-17 08:25:02 -05:00