Commit Graph

8889 Commits

Author SHA1 Message Date
Martin Kroeker
70dddacb9f Merge pull request #5023 from rgommers/fix-warnings
Fix two compiler warnings in `memory.c`
2024-12-18 16:13:12 -08:00
Martin Kroeker
d3cc8c65ed Merge pull request #5022 from tingboliao/develop
Replace the __riscv_vid_v_i32m2 and __riscv_vid_v_i64m2 with __riscv_vid_v_u32m2 and __riscv_vid_v_u64m2 for riscv64-unknown-linux-gnu-gcc compiling.
2024-12-18 14:29:39 -08:00
Ralf Gommers
765ad8bcd2 Fix guard around alloc_hugetlb, fixes compile warning
The warning was:
```
/home/rgommers/code/pixi-dev-scipystack/openblas/OpenBLAS/driver/others/memory.c: At top level:
/home/rgommers/code/pixi-dev-scipystack/openblas/OpenBLAS/driver/others/memory.c:2565:14: warning: 'alloc_hugetlb' defined but not used [-Wunused-function]
 2565 | static void *alloc_hugetlb(void *address){
      |              ^~~~~~~~~~~~~
```

The added define is the same as is already present in the TLS part of
`memory.c`. This follows up on gh-4681.
2024-12-18 09:42:05 +01:00
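The kind of fix described in the commit above can be sketched as follows. This is a minimal illustration, not the actual change to `memory.c`: `DEMO_ALLOC_HUGETLB`, `demo_alloc_hugetlb`, and `demo_collect_allocators` are hypothetical names, and the real preprocessor condition is not quoted in the commit message. The point is that a `static` function and its only call site need to sit under the same guard, otherwise builds that compile out the call site report `-Wunused-function`.

```c
#include <stddef.h>

/* DEMO_ALLOC_HUGETLB is a hypothetical feature macro used only for this
 * sketch; it is not the define that memory.c uses. */
#if defined(DEMO_ALLOC_HUGETLB)
/* Defined only when the feature is enabled, so no unused static function
 * is left behind when the call site below is compiled out. */
static void *demo_alloc_hugetlb(void *address) {
    (void)address;
    return NULL; /* placeholder body; a real allocator would map memory */
}
#endif

typedef void *(*demo_alloc_fn)(void *);

/* Fills 'table' with the allocators available in this build and returns
 * how many entries were written. */
size_t demo_collect_allocators(demo_alloc_fn *table, size_t capacity) {
    size_t n = 0;
    (void)table;    /* avoids unused-parameter warnings when no */
    (void)capacity; /* allocator is compiled in                 */
#if defined(DEMO_ALLOC_HUGETLB) /* same condition as the definition */
    if (n < capacity) {
        table[n++] = demo_alloc_hugetlb;
    }
#endif
    return n;
}
```

Compiling this with `-Wall`, with or without `-DDEMO_ALLOC_HUGETLB`, should produce no unused-function warning, because the definition and the use appear or disappear together.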
Ralf Gommers
48caf2303d Fix build warning about discarding volatile qualifier in memory.c
The warning was:
```
[4339/5327] Building C object driver/others/CMakeFiles/driver_others.dir/memory.c.o
/home/rgommers/code/pixi-dev-scipystack/openblas/OpenBLAS/driver/others/memory.c: In function 'blas_shutdown':
/home/rgommers/code/pixi-dev-scipystack/openblas/OpenBLAS/driver/others/memory.c:3257:10: warning: passing argument 1 of 'free' discards 'volatile' qualifier from pointer target type [-Wdiscarded-qualifiers]
 3257 |     free(newmemory);
      |          ^~~~~~~~~
In file included from /home/rgommers/code/pixi-dev-scipystack/openblas/OpenBLAS/common.h:83,
                 from /home/rgommers/code/pixi-dev-scipystack/openblas/OpenBLAS/driver/others/memory.c:74:
/home/rgommers/code/pixi-dev-scipystack/openblas/.pixi/envs/default/x86_64-conda-linux-gnu/sysroot/usr/include/stdlib.h:482:25: note: expected 'void *' but argument is of type 'volatile struct newmemstruct *'
  482 | extern void free (void *__ptr) __THROW;
      |                   ~~~~~~^~~~~
```

The use of `volatile` for `newmemstruct` seems on purpose, and there are
more such constructs in this file. The warning appeared after gh-4451
and is correct. The `free` prototype doesn't expect a volatile pointer,
hence this change adds a cast to silence the warning.
2024-12-18 08:53:29 +01:00
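The cast described in the commit above can be illustrated with a small standalone sketch. It assumes a pointer to a `volatile`-qualified struct, as in the warning text. `newmemstruct_demo`, its fields, and the `demo_*` functions are hypothetical stand-ins, not the definitions in `memory.c`.

```c
#include <stdlib.h>

/* Hypothetical stand-in for the struct in memory.c; the real fields are
 * not shown in the commit message. */
struct newmemstruct_demo {
    void *addr;
    int   used;
};

/* Pointer to a volatile-qualified object, matching the type named in the
 * compiler note ('volatile struct newmemstruct *'). */
static volatile struct newmemstruct_demo *newmemory_demo;

/* Allocates the demo object; returns 0 on success, -1 on failure. */
int demo_alloc(void) {
    newmemory_demo = malloc(sizeof *newmemory_demo);
    if (newmemory_demo == NULL) {
        return -1;
    }
    newmemory_demo->addr = NULL;
    newmemory_demo->used = 1;
    return 0;
}

/* Reports whether the demo object is currently allocated. */
int demo_is_allocated(void) {
    return newmemory_demo != NULL;
}

/* free() is declared as taking 'void *'. Passing the volatile-qualified
 * pointer directly drops the qualifier implicitly, which GCC reports as
 * -Wdiscarded-qualifiers; the explicit cast makes the conversion
 * intentional and silences the warning. */
void demo_shutdown(void) {
    free((void *)newmemory_demo);
    newmemory_demo = NULL;
}
```

The cast changes nothing at run time. It only tells the compiler that dropping `volatile` at this call is deliberate.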
tingbo.liao
d00cc400b1 Replaced the __riscv_vid_v_i32m2 and __riscv_vid_v_i64m2 with __riscv_vid_v_u32m2 and __riscv_vid_v_u64m2 for riscv64-unknown-linux-gnu-gcc compiling.
Signed-off-by: tingbo.liao <tingbo.liao@starfivetech.com>
2024-12-18 08:38:30 +08:00
Martin Kroeker
229d8a025e Merge pull request #4959 from CDAC-Bengaluru/level-1-sve
SVE Implementation for Level-1 BLAS Routines
2024-12-13 05:20:51 -08:00
SushilPratap04
3368a4e697 Update swap_kernel_sve.c 2024-12-13 16:47:58 +05:30
CDAC-SSDG
dd71e4234a Added Updated swap and rot sve kernels. 2024-12-13 11:15:29 +05:30
CDAC-SSDG
06ffd411a5 Update KERNEL.ARMV8SVE 2024-12-13 11:05:47 +05:30
CDAC-SSDG
41912f9c22 Update CONTRIBUTORS.md 2024-12-13 11:05:10 +05:30
CDAC-SSDG
765850194e Delete kernel/arm64/swap_kernel_sve.c 2024-12-13 11:02:01 +05:30
CDAC-SSDG
c17c19fbcf Delete kernel/arm64/swap_kernel_c.c 2024-12-13 11:01:46 +05:30
CDAC-SSDG
f6416c0e37 Delete kernel/arm64/swap.c 2024-12-13 11:01:32 +05:30
CDAC-SSDG
3b7b74664c Delete kernel/arm64/scal_kernel_sve.c 2024-12-13 11:01:03 +05:30
CDAC-SSDG
95a97012e8 Delete kernel/arm64/scal_kernel_c.c 2024-12-13 11:00:45 +05:30
CDAC-SSDG
5540f2121e Delete kernel/arm64/scal.c 2024-12-13 11:00:12 +05:30
CDAC-SSDG
f62519cc87 Delete kernel/arm64/rot_kernel_sve.c 2024-12-13 10:59:35 +05:30
CDAC-SSDG
10857c9df4 Delete kernel/arm64/rot_kernel_c.c 2024-12-13 10:58:51 +05:30
CDAC-SSDG
b9f51a5cf7 Delete kernel/arm64/rot.c 2024-12-13 10:58:06 +05:30
Martin Kroeker
89f02ed394 Merge pull request #5014 from martin-frbg/issue5013
Add some missed lapack 3.11+ symbols to gensymbol
2024-12-10 23:09:33 -08:00
Martin Kroeker
61d5aec7c1 remove typo 2024-12-11 00:41:56 +01:00
Martin Kroeker
5aea097df0 add missing lapack 3.11+ symbols 2024-12-10 23:52:05 +01:00
Martin Kroeker
72f7b7011c Merge pull request #5009 from martin-frbg/pybenchdoc
DOCS, pybench : Add build notes for Windows and flang from gh Discussion 5008
2024-12-06 02:50:14 -08:00
Martin Kroeker
0f8ff82592 Add build notes for Windows and flang from gh Discussion 5008 2024-12-06 01:35:42 -08:00
Martin Kroeker
81666de4ef Merge pull request #5007 from martin-frbg/issue5006
Revert the NRM2 kernels for NeoverseN2 and ARMV8SVE targets to the generic NEON version
2024-12-05 14:43:03 -08:00
Martin Kroeker
230e665bca Merge pull request #4996 from iha-taisei/sdgemv_sve_unroll
Loop-unrolled transposed [SD]GEMV kernels for A64FX and Neoverse V1
2024-12-05 13:36:47 -08:00
Martin Kroeker
3345007d8f retire the thunderx2 NRM2 kernels due to reported inaccuracies and NAN 2024-12-05 21:12:06 +01:00
Martin Kroeker
5fe983db29 retire the thunderx2 nrm2 kernels for now due to NAN and inaccuracies 2024-12-05 21:09:53 +01:00
Martin Kroeker
5dc4d7dd7e Merge pull request #5005 from martin-frbg/evbarm
Improve support for NetBSD on arm64 (evbarm)
2024-12-05 00:02:58 -08:00
Martin Kroeker
4ba471dd5a Merge pull request #5003 from mathomp4/bugfix/nag-pic
Fixes for NAG Compiler
2024-12-04 15:41:12 -08:00
Martin Kroeker
a791912cbb handle uname returning evbarm on NetBSD 2024-12-04 15:34:57 -08:00
Martin Kroeker
1a6ecda398 utilize /proc/cpuinfo on NetBSD too 2024-12-04 15:32:26 -08:00
Matthew Thompson
c4e8bac5a5 Fix indent 2024-12-04 12:11:35 -05:00
Matthew Thompson
d3b2036d49 Move to use ERROR STOP instead of ABORT 2024-12-04 12:09:24 -05:00
Matthew Thompson
35334ed2ea Fixes for Fortran Standards violations for lapack-netlib 2024-12-04 10:53:05 -05:00
Matthew Thompson
be19966d3b Fixes for NAG CMake 2024-12-04 10:52:43 -05:00
Martin Kroeker
9c5d20187b Merge pull request #4999 from dg0yt/macro-failed
Fix redefinition of FAILED
2024-12-04 07:37:51 -08:00
Matthew Thompson
2eaf285de5 Use F_COMPILER name 2024-12-04 08:46:40 -05:00
Matthew Thompson
a8b1705dbd CMake build has wrong PIC flag for NAG 2024-12-04 08:46:40 -05:00
Martin Kroeker
5f65846691 Merge pull request #4998 from dg0yt/arm-type-function
arm: Declare symbols as .type function
2024-12-04 01:50:53 -08:00
Kai Pastor
93eb42fdc8 Fix redefinition of FAILED 2024-12-03 09:45:04 +01:00
Kai Pastor
dc905636d1 arm: Declare symbols as .type function 2024-12-03 07:42:44 +01:00
Iha, Taisei
4918beecbe Loop-unrolled transposed [SD]GEMV kernels for A64FX and Neoverse V1 2024-12-02 18:46:00 +09:00
Martin Kroeker
0578a89afd Merge pull request #4993 from martin-frbg/issue4991
Translate CMAKE_SYSTEM_NAME in compilations on or for IOS
2024-11-27 02:54:41 -08:00
Martin Kroeker
57a51d74c9 translate CMAKE_SYSTEM_NAME in compilations on or for IOS 2024-11-27 09:52:56 +01:00
Martin Kroeker
35f2e6afe6 Merge pull request #4992 from mmuetzel/ci-msys2
CI (MinGW): Remove CLANG32 environment from build matrix.
2024-11-26 10:09:29 -08:00
Markus Mützel
f5e6b5b5c9 CI (MinGW): Remove CLANG32 environment from build matrix.
The CLANG32 environment is in the process of being removed from MSYS2
currently:
https://www.msys2.org/news/#2024-09-23-starting-to-drop-the-clang32-environment

Remove it from the build matrix ahead of its complete removal from MSYS2.
2024-11-26 15:44:50 +01:00
Martin Kroeker
8e8003a2d1 Merge pull request #4180 from mmuetzel/cmake
CI (MinGW): Remove work-around needed for old versions of LLVM Flang
2024-11-26 06:41:48 -08:00
Martin Kroeker
71963a7bc4 Merge pull request #4985 from CheryDan/RISCV/sched
added optimizations for RISC-V YIELDING
2024-11-26 04:37:38 -08:00
Markus Mützel
7452af4471 CI (MinGW): Remove work-around with NO_AVX512 that was needed for older versions of LLVM Flang. 2024-11-26 13:02:27 +01:00