Merge pull request #5294 from arnej27959/arnej/fix-arm64-register

Accumulate results in output register explicitly
Martin Kroeker
2025-06-09 23:41:12 -07:00
committed by GitHub

@@ -134,7 +134,7 @@ USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
 " fadd v4.4s, v4.4s, v6.4s \n" \
 " fadd v0.4s, v0.4s, v4.4s \n" \
 " faddp v0.4s, v0.4s, v0.4s \n" \
-" faddp v0.4s, v0.4s, v0.4s \n"
+" faddp "OUT", v0.2s \n"
 #else /* !defined(DSDOT) */
 #define KERNEL_F1 \
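
The hunk swaps the last vector `faddp` of the reduction for the scalar form of `faddp`, which writes its result to `OUT`. The following is a minimal sketch in portable C, not the kernel's code, that models both tails to illustrate that they produce the same sum. The names `vec4f`, `faddp_4s`, `faddp_scalar_2s`, `reduce_old`, `reduce_new`, and `check_tails_agree` are hypothetical. It also assumes that `OUT` expands to the scalar register of the asm output operand, which this hunk does not show.

```c
#include <assert.h>

/* Hypothetical model of a 128-bit register holding four floats (.4s). */
typedef struct { float lane[4]; } vec4f;

/* Models "faddp vd.4s, vn.4s, vm.4s": adjacent pairs of vn fill the low
 * two lanes of the result, adjacent pairs of vm fill the high two lanes. */
static vec4f faddp_4s(vec4f n, vec4f m) {
    vec4f d;
    d.lane[0] = n.lane[0] + n.lane[1];
    d.lane[1] = n.lane[2] + n.lane[3];
    d.lane[2] = m.lane[0] + m.lane[1];
    d.lane[3] = m.lane[2] + m.lane[3];
    return d;
}

/* Models the scalar form "faddp sd, vn.2s": the sum of the two low lanes,
 * written to a scalar destination register. */
static float faddp_scalar_2s(vec4f n) {
    return n.lane[0] + n.lane[1];
}

/* Old tail: two vector faddp steps. The total ends up in every lane of v0,
 * so the caller has to read it implicitly from lane 0 of v0. */
static float reduce_old(vec4f v0) {
    v0 = faddp_4s(v0, v0);
    v0 = faddp_4s(v0, v0);
    return v0.lane[0];
}

/* New tail: one vector faddp, then a scalar faddp whose destination is the
 * output register itself (assumed here to be what OUT refers to). */
static float reduce_new(vec4f v0) {
    v0 = faddp_4s(v0, v0);
    return faddp_scalar_2s(v0);
}

/* Both tails perform the same additions in the same order, so their
 * results should compare equal for any input. */
static void check_tails_agree(vec4f v) {
    assert(reduce_old(v) == reduce_new(v));
}
```

In this model the only difference between the two tails is where the result lands. The old tail leaves it in `v0` and relies on the output being read from there, while the new one names the destination in the instruction, which matches the commit title.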