[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [PATCH v2] target/arm: Speed up aarch64 TBL/TBX
From: |
Peter Maydell |
Subject: |
Re: [PATCH v2] target/arm: Speed up aarch64 TBL/TBX |
Date: |
Fri, 5 Mar 2021 11:32:40 +0000 |
On Thu, 25 Feb 2021 at 09:54, Alex Bennée <alex.bennee@linaro.org> wrote:
>
>
> Richard Henderson <richard.henderson@linaro.org> writes:
>
> > Always perform one call instead of two for 16-byte operands.
> > Use byte loads/stores directly into the vector register file
> > instead of extractions and deposits to a 64-bit local variable.
> >
> > In order to easily receive pointers into the vector register file,
> > convert the helper to the gvec out-of-line signature. Move the
> > helper into vec_helper.c, where it can make use of H1 and clear_tail.
> >
> > Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
>
> Reviewed-by: Alex Bennée <alex.bennee@linaro.org>
> Tested-by: Alex Bennée <alex.bennee@linaro.org>
>
> Looked marginally slower on the (1!) run I did but probably in the noise
> and the generated code looks nicer.
Applied to target-arm.next, thanks.
-- PMM
[Prev in Thread] |
Current Thread |
[Next in Thread] |
- Re: [PATCH v2] target/arm: Speed up aarch64 TBL/TBX,
Peter Maydell <=