OpenBLAS/kernel/arm64
Fadi Arafeh f30202b705 Accelerate SVE128 SBGEMM/BGEMM
This accelerates SBGEMM/BGEMM by extending the existing 8x4 kernel to 8x8 (unrolling N by 8 instead of 4).
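
To illustrate why a wider N tile can help, here is a minimal scalar sketch of an 8-row micro-kernel whose tile width nr is a parameter. It is not the kernel from this commit: it uses plain float rather than bf16, no SVE intrinsics, and the names micro_kernel, sweep_panel, MR and NR_MAX are hypothetical. It only shows that moving from nr = 4 to nr = 8 halves the number of times the packed A panel is streamed while producing the same result.

```c
#include <stddef.h>

/* Illustrative tile sizes; MR and NR_MAX are names invented for this sketch. */
enum { MR = 8, NR_MAX = 8 };

/* C[0..MR, 0..nr) += A(MR x k) * B(k x nr), with nr <= NR_MAX.
 * a: packed so the MR values for step p sit at a[p*MR .. p*MR+MR)
 * b: row-major with leading dimension ldb
 * c: column-major with leading dimension ldc */
static void micro_kernel(size_t k, size_t nr, const float *a,
                         const float *b, size_t ldb, float *c, size_t ldc)
{
    float acc[NR_MAX][MR] = {{0.0f}};  /* stands in for vector accumulators */

    for (size_t p = 0; p < k; ++p)
        for (size_t j = 0; j < nr; ++j)
            for (size_t i = 0; i < MR; ++i)
                acc[j][i] += a[p * MR + i] * b[p * ldb + j];

    for (size_t j = 0; j < nr; ++j)
        for (size_t i = 0; i < MR; ++i)
            c[j * ldc + i] += acc[j][i];
}

/* Sweeps one MR-row panel of A across n columns of B in steps of nr.
 * Assumes n is a multiple of nr and B is k x n, row-major.
 * Returns how many times the A panel was streamed. */
static size_t sweep_panel(size_t k, size_t n, size_t nr, const float *a,
                          const float *b, float *c, size_t ldc)
{
    size_t passes = 0;
    for (size_t j0 = 0; j0 < n; j0 += nr, ++passes)
        micro_kernel(k, nr, a, b + j0, n, c + j0 * ldc, ldc);
    return passes;
}
```

For n = 16, sweep_panel makes 4 passes over A with nr = 4 and 2 passes with nr = 8. Whether that reuse translates into a speedup on real hardware depends on register pressure and the instruction mix, which this sketch does not model.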

Open question: is it a good idea to delete the previous 8x4 kernel?

Here are the speedups on a single Neoverse-V2 core (SVE128) compared to the previous state:

Per-shape speedup
  M=N=K=64: SBGEMM 1.164x (16.42%), BGEMM 1.133x (13.30%)
  M=N=K=128: SBGEMM 1.220x (22.02%), BGEMM 1.186x (18.56%)
  M=N=K=256: SBGEMM 1.241x (24.08%), BGEMM 1.235x (23.54%)
  M=N=K=512: SBGEMM 1.240x (23.95%), BGEMM 1.227x (22.75%)
  M=N=K=1024: SBGEMM 1.251x (25.11%), BGEMM 1.232x (23.23%)
  M=N=K=2048: SBGEMM 1.235x (23.47%), BGEMM 1.246x (24.64%)
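
The percentages above are consistent with (speedup - 1) * 100 applied to the unrounded speedup. A small sketch of that arithmetic follows, assuming the speedup is the ratio of old to new run time; the commit message does not say whether time or throughput was measured, and the function names are hypothetical.

```c
/* Assumption: speedup is old run time divided by new run time. */
static double speedup(double t_old, double t_new)
{
    return t_old / t_new;
}

/* Percentage gain corresponding to a speedup factor, e.g. 1.25x gives 25%. */
static double percent_gain(double speedup_factor)
{
    return (speedup_factor - 1.0) * 100.0;
}
```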

Signed-off-by: Fadi Arafeh <fadi.arafeh@arm.com>
2026-03-05 13:50:07 +00:00