Martin Kroeker
fc516af155
Merge branch 'develop' into issue5414
2025-10-01 14:12:59 -07:00
Martin Kroeker
2fee943edb
Add CMake build support for IBM Z ( #5440 )
...
* Add ZARCH support, including DYNAMIC_ARCH
2025-09-09 22:18:51 +02:00
Martin Kroeker
426b5f23ed
Add compiler options for VORTEXM4
2025-08-18 14:35:36 -07:00
Martin Kroeker
c504aedca1
Merge pull request #5400 from Mousius/neoversev2-target
...
Add NEOVERSEV2 target support
2025-07-25 15:47:06 +02:00
Martin Kroeker
2f89a5970e
fix NeoverseV2 typo
2025-07-25 15:43:37 +02:00
Chris Sidebottom
87247daadc
Add NEOVERSEV2 target support
...
Did a quick run around to make `TARGET=NEVOERSEV2` build successfully.
Fixes #5385
2025-07-24 12:40:31 +01:00
Martin Kroeker
a5b55f6fe3
remove CBLAS restriction on GEMM_GEMV forwarding
2025-07-24 09:30:58 +02:00
Martin Kroeker
82954ba4ca
Update ?GEMM-to-?GEMV forwarding settings
2025-07-23 23:24:42 +02:00
Chris Sidebottom
f95e7b0e32
Add infrastructure for BGEMM
...
Setting up all the infrastructure for BGEMM support in OpenBLAS, hopefully I found all the right places.
Derived mostly from the previous work done in https://github.com/OpenMathLib/OpenBLAS/pull/5287
Co-authored-by: Ye Tao <ye.tao@arm.com >
2025-07-08 16:22:41 +01:00
Chris Sidebottom
552e1c7a7a
Correct compiler flags for NEOVERSEV1 target
2025-07-07 11:26:36 +00:00
Srangrang
0a967797a1
Add FP16 support for RISCV
2025-05-27 14:34:57 +08:00
Ruiyang Wu
1b0c0f00e9
CMake: Avoid mixed OpenMP linkage
2025-03-26 23:52:13 -04:00
Ruiyang Wu
02fd1df10b
CMake: Pass OpenMP compiler and linker flags through CMake targets
...
Using `OpenMP::OpenMP_LANG` targets for CMake is less error-prone than
passing the compiler and linker flags manually. Furthermore, it allows
the user to customize those flags by setting `OpenMP_LANG_FLAGS`,
`OpenMP_LANG_LIB_NAMES`, and `OpenMP_omp_LIBRARY`.
2025-03-26 23:09:54 -04:00
Vaisakh K V
f66ca05b31
Merge branch 'develop' into topic/sgemm_direct_sme1
2025-02-13 14:54:37 +05:30
Vaisakh K V
d23eb3b93e
Support for SME1 based sgemm_direct kernel for cblas_sgemm level 3 API
...
* Added ARMV9SME target
* Added SGEMM_DIRECT kernel based on SME1
2025-02-13 14:51:21 +05:30
Martin Kroeker
877d5a5be6
Add -O2 to flang flags when building on WoA in Release mode
2025-02-12 17:01:06 +01:00
gxw
1ebcbdbab3
LoongArch64: Fixed the issue of using the old-style TARGET in cmake builds
2025-01-23 09:08:42 +00:00
Martin Kroeker
d78fbe425c
Assume no underline suffixes on symbols when compiling with ifx on Windows
2024-12-23 19:04:50 +01:00
Matthew Thompson
c4e8bac5a5
Fix indent
2024-12-04 12:11:35 -05:00
Matthew Thompson
be19966d3b
Fixes for NAG CMake
2024-12-04 10:52:43 -05:00
Matthew Thompson
2eaf285de5
Use F_COMPILER name
2024-12-04 08:46:40 -05:00
Matthew Thompson
a8b1705dbd
CMake build has wrong PIC flag for NAG
2024-12-04 08:46:40 -05:00
Chip Kerchner
36bd3eeddf
Vectorize BF16 GEMV (VSX & MMA). Use GEMM_GEMV_FORWARD_BF16 (for Power).
2024-10-13 13:46:11 -05:00
Martin Kroeker
b4495a8fb8
Merge branch 'develop' into arm64_cmake_small_matrix_opt
2024-10-03 20:04:52 +02:00
gxw
7087b0a7d0
ARM64: Enable SMALL_MATRIX_OPT when compiling with CMake
2024-09-29 10:31:26 +08:00
gxw
30af9278dc
LoongArch64: Enable cmake cross-compilation
2024-09-29 10:13:30 +08:00
psykose
1265eee85c
fix cmake typo for power10 cc version check
...
fixes 668f48f4fc
2024-08-09 20:38:58 +02:00
Martin Kroeker
cc36db643e
Support new LAPACK build option LAPACK_STRLEN
2024-08-06 17:31:03 +02:00
Martin Kroeker
e8bd97ab4b
add RISCV64 entries for DYNAMIC_ARCH
2024-08-03 23:56:59 +02:00
Martin Kroeker
9eecd0d33b
enable GEMM/GEMV forwarding for riscv and ppc
2024-07-31 23:29:12 +02:00
Chris Sidebottom
b26424c6a2
Allow opt into GEMM -> GEMV forwarding
2024-07-31 13:09:14 +01:00
yamazaki-mitsufumi
821ef34635
Add A64FX to the list of CPUs supported by DYNAMIC_ARCH
2024-07-23 20:44:39 +09:00
Jaap Aarts
cea4abcac0
Fix compiling on mingw
2024-07-04 14:56:16 +02:00
Martin Kroeker
a3f6b13bc9
remove spurious brace
2024-05-16 09:25:53 +02:00
Martin Kroeker
668f48f4fc
Use CMAKE_C_COMPILER_VERSION instead of dumpversion calls ( #4698 )
...
* Use CMAKE_C_COMPILER_VERSION throughout
2024-05-15 23:58:14 +02:00
Martin Kroeker
3d26837a35
Suppress GCC14 error exit in the f2c-converted LAPACK
2024-04-30 19:05:18 +02:00
مهدي شينون (Mehdi Chinoune)
cda55f2fd2
Don't pass -exhaustive-register-search directly to clang compiler
...
`-exhaustive-register-search` is an LLVM code generation flag that shouldn't be passed directly to clang compiler.
2024-04-06 05:54:48 +01:00
Martin Kroeker
52b71a1673
Filter out FFLAGS that flang-new from LLVM18 no longer supports ( #4569 )
...
* Filter out FFLAGS that flang-new from LLVM18 no longer supports
2024-03-22 17:02:39 +01:00
Martin Kroeker
a0e3f77e0b
add FIXED_LIBNAME, PREFIX and SUFFIX
2024-02-15 12:17:38 +01:00
Martin Kroeker
49689fbef7
Add support for compiling SVE kernels with the NVIDIA HPC compiler
2023-08-25 17:11:04 +02:00
Martin Kroeker
ac698cedad
Add compiler options for ARM64 SVE targets in DYNAMIC_ARCH builds
2023-07-05 09:47:49 +02:00
Martin Kroeker
d2144b2981
Add NVHPC
2023-06-09 19:01:15 +02:00
Martin Kroeker
de937b3194
Add clang option to avoid running out of registers in AVX512 assembly
2023-03-17 21:22:37 +01:00
Martin Kroeker
e964ebd0d0
Add compiler option for AVX512-capable Ryzen(4)
2023-02-02 19:04:05 +01:00
Martin Kroeker
a0a4f7c447
Add -mfma to -mavx2 for clang, and add AVX2 declaration for Zen in DYNAMIC_ARCH builds
2022-09-13 22:47:00 +02:00
Martin Kroeker
85fd3c4279
Support compilation with the Cray C and Fortran compilers ( #3712 )
...
* Add support for the Cray Fortran compiler
2022-08-04 20:42:18 +02:00
Martin Kroeker
18b19d135b
C_LAPACK: Fixes to make it compile with MSVC ( #3605 )
...
* Fix f2c-like support functions to compile with MSVC, and
re-enable C_LAPACK for MSVC in CMAKE
* Add MSVC&flang build to Azure CI in order to check C_LAPACK correctness
2022-04-17 17:49:38 +02:00
Martin Kroeker
b7873605d4
Use f2c translations of LAPACK when no Fortran compiler is available ( #3539 )
...
* Add C equivalents of the Fortran routines from Reference-LAPACK as fallbacks, and C_LAPACK variable to trigger their use
2022-04-09 22:38:58 +02:00
Rafael Cardoso Fernandes Sousa
d38110a5ce
Use CMake variables instead of as
2021-12-10 17:46:53 -06:00
Rafael Cardoso Fernandes Sousa
214fbcee15
Fix cmake for power
2021-12-09 08:28:17 -06:00