[PATCH v2 21/34] target/arm: Use float*_maybe_ah_chs in sve_fcadd

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[PATCH v2 21/34] target/arm: Use float_maybe_ah_chs in sve_fcadd_

From:	Richard Henderson
Subject:	[PATCH v2 21/34] target/arm: Use float_maybe_ah_chs in sve_fcadd_
Date:	Tue, 28 Jan 2025 17:38:44 -0800

The construction of neg_imag and neg_real were done to make it easy
to apply both in parallel with two simple logical operations.  This
changed with FPCR.AH, which is more complex than that.

Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
---
 target/arm/tcg/vec_helper.c | 51 +++++++++++--------------------------
 1 file changed, 15 insertions(+), 36 deletions(-)

diff --git a/target/arm/tcg/vec_helper.c b/target/arm/tcg/vec_helper.c
index 9ed04b1b0a..55bac9536f 100644
--- a/target/arm/tcg/vec_helper.c
+++ b/target/arm/tcg/vec_helper.c
@@ -879,27 +879,20 @@ void HELPER(gvec_fcaddh)(void *vd, void *vn, void *vm,
     float16 *d = vd;
     float16 *n = vn;
     float16 *m = vm;
-    uint32_t neg_real = extract32(desc, SIMD_DATA_SHIFT, 1);
-    uint32_t neg_imag = neg_real ^ 1;
+    bool rot = extract32(desc, SIMD_DATA_SHIFT, 1);
     bool fpcr_ah = extract64(desc, SIMD_DATA_SHIFT + 1, 1);
     uintptr_t i;
 
-    /* Shift boolean to the sign bit so we can xor to negate.  */
-    neg_real <<= 15;
-    neg_imag <<= 15;
-
     for (i = 0; i < opr_sz / 2; i += 2) {
         float16 e0 = n[H2(i)];
         float16 e1 = m[H2(i + 1)];
         float16 e2 = n[H2(i + 1)];
         float16 e3 = m[H2(i)];
 
-        /* FPNeg() mustn't flip sign of a NaN if FPCR.AH == 1 */
-        if (!(fpcr_ah && float16_is_any_nan(e1))) {
-            e1 ^= neg_imag;
-        }
-        if (!(fpcr_ah && float16_is_any_nan(e3))) {
-            e3 ^= neg_real;
+        if (rot) {
+            e3 = float16_maybe_ah_chs(e3, fpcr_ah);
+        } else {
+            e1 = float16_maybe_ah_chs(e1, fpcr_ah);
         }
 
         d[H2(i)] = float16_add(e0, e1, fpst);
@@ -915,27 +908,20 @@ void HELPER(gvec_fcadds)(void *vd, void *vn, void *vm,
     float32 *d = vd;
     float32 *n = vn;
     float32 *m = vm;
-    uint32_t neg_real = extract32(desc, SIMD_DATA_SHIFT, 1);
-    uint32_t neg_imag = neg_real ^ 1;
+    bool rot = extract32(desc, SIMD_DATA_SHIFT, 1);
     bool fpcr_ah = extract64(desc, SIMD_DATA_SHIFT + 1, 1);
     uintptr_t i;
 
-    /* Shift boolean to the sign bit so we can xor to negate.  */
-    neg_real <<= 31;
-    neg_imag <<= 31;
-
     for (i = 0; i < opr_sz / 4; i += 2) {
         float32 e0 = n[H4(i)];
         float32 e1 = m[H4(i + 1)];
         float32 e2 = n[H4(i + 1)];
         float32 e3 = m[H4(i)];
 
-        /* FPNeg() mustn't flip sign of a NaN if FPCR.AH == 1 */
-        if (!(fpcr_ah && float32_is_any_nan(e1))) {
-            e1 ^= neg_imag;
-        }
-        if (!(fpcr_ah && float32_is_any_nan(e3))) {
-            e3 ^= neg_real;
+        if (rot) {
+            e3 = float32_maybe_ah_chs(e3, fpcr_ah);
+        } else {
+            e1 = float32_maybe_ah_chs(e1, fpcr_ah);
         }
 
         d[H4(i)] = float32_add(e0, e1, fpst);
@@ -951,27 +937,20 @@ void HELPER(gvec_fcaddd)(void *vd, void *vn, void *vm,
     float64 *d = vd;
     float64 *n = vn;
     float64 *m = vm;
-    uint64_t neg_real = extract64(desc, SIMD_DATA_SHIFT, 1);
-    uint64_t neg_imag = neg_real ^ 1;
+    bool rot = extract32(desc, SIMD_DATA_SHIFT, 1);
     bool fpcr_ah = extract64(desc, SIMD_DATA_SHIFT + 1, 1);
     uintptr_t i;
 
-    /* Shift boolean to the sign bit so we can xor to negate.  */
-    neg_real <<= 63;
-    neg_imag <<= 63;
-
     for (i = 0; i < opr_sz / 8; i += 2) {
         float64 e0 = n[i];
         float64 e1 = m[i + 1];
         float64 e2 = n[i + 1];
         float64 e3 = m[i];
 
-        /* FPNeg() mustn't flip sign of a NaN if FPCR.AH == 1 */
-        if (!(fpcr_ah && float64_is_any_nan(e1))) {
-            e1 ^= neg_imag;
-        }
-        if (!(fpcr_ah && float64_is_any_nan(e3))) {
-            e3 ^= neg_real;
+        if (rot) {
+            e3 = float64_maybe_ah_chs(e3, fpcr_ah);
+        } else {
+            e1 = float64_maybe_ah_chs(e1, fpcr_ah);
         }
 
         d[i] = float64_add(e0, e1, fpst);
-- 
2.43.0

[Prev in Thread]

Current Thread

[Next in Thread]

[PATCH v2 08/34] target/arm: Remove standard_fp_status, (continued)
- [PATCH v2 08/34] target/arm: Remove standard_fp_status, Richard Henderson, 2025/01/28
  - Re: [PATCH v2 08/34] target/arm: Remove standard_fp_status, Philippe Mathieu-Daudé, 2025/01/29
- [PATCH v2 17/34] target/arm: Move float*_ah_chs to vec_internal.h, Richard Henderson, 2025/01/28
- [PATCH v2 19/34] target/arm: Use float*_maybe_ah_chs in sve_ftssel_*, Richard Henderson, 2025/01/28
- [PATCH v2 12/34] target/arm: Remove fp_status_f16_a32, Richard Henderson, 2025/01/28
  - Re: [PATCH v2 12/34] target/arm: Remove fp_status_f16_a32, Philippe Mathieu-Daudé, 2025/01/29
- [PATCH v2 16/34] target/arm: Simplify DO_VFP_cmp in vfp_helper.c, Richard Henderson, 2025/01/28
  - Re: [PATCH v2 16/34] target/arm: Simplify DO_VFP_cmp in vfp_helper.c, Philippe Mathieu-Daudé, 2025/01/29
- [PATCH v2 18/34] target/arm: Introduce float*_maybe_ah_chs, Richard Henderson, 2025/01/28
- [PATCH v2 20/34] target/arm: Use float*_maybe_ah_chs in sve_fcadd_*, Richard Henderson, 2025/01/28
- [PATCH v2 21/34] target/arm: Use float*_maybe_ah_chs in sve_fcadd_*, Richard Henderson <=
- [PATCH v2 23/34] target/arm: Use flags for AH negation in sve_ftmad_*, Richard Henderson, 2025/01/28
- [PATCH v2 22/34] target/arm: Use flags for AH negation in do_fmla_zpzzz_*, Richard Henderson, 2025/01/28
- [PATCH v2 24/34] target/arm: Use flags for AH negation in float*_ah_mulsub_f, Richard Henderson, 2025/01/28
- [PATCH v2 26/34] target/arm: Handle FPCR.AH in gvec_fcmla[hs]_idx, Richard Henderson, 2025/01/28
- [PATCH v2 28/34] target/arm: Split gvec_fmla_idx_* for fmls and ah_fmls, Richard Henderson, 2025/01/28
  - Re: [PATCH v2 28/34] target/arm: Split gvec_fmla_idx_* for fmls and ah_fmls, Peter Maydell, 2025/01/31
- [PATCH v2 25/34] target/arm: Handle FPCR.AH in gvec_fcmla[hsd], Richard Henderson, 2025/01/28
- [PATCH v2 27/34] target/arm: Handle FPCR.AH in sve_fcmla_zpzzz_*, Richard Henderson, 2025/01/28
- [PATCH v2 29/34] Revert "target/arm: Handle FPCR.AH in FMLSL", Richard Henderson, 2025/01/28
- [PATCH v2 30/34] target/arm: Handle FPCR.AH in gvec_fmlal_a64, Richard Henderson, 2025/01/28

Prev by Date: [PATCH v2 20/34] target/arm: Use float*_maybe_ah_chs in sve_fcadd_*
Next by Date: [PATCH v2 23/34] target/arm: Use flags for AH negation in sve_ftmad_*
Previous by thread: [PATCH v2 20/34] target/arm: Use float*_maybe_ah_chs in sve_fcadd_*
Next by thread: [PATCH v2 23/34] target/arm: Use flags for AH negation in sve_ftmad_*
Index(es):
- Date
- Thread

[PATCH v2 21/34] target/arm: Use float*_maybe_ah_chs in sve_fcadd_*

[PATCH v2 21/34] target/arm: Use float_maybe_ah_chs in sve_fcadd_