Re: [PATCH v2 00/19] target/i386: decoder changes for 8.2

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH v2 00/19] target/i386: decoder changes for 8.2

From:	Richard Henderson
Subject:	Re: [PATCH v2 00/19] target/i386: decoder changes for 8.2
Date:	Thu, 19 Oct 2023 08:44:14 -0700
User-agent:	Mozilla Thunderbird

On 10/19/23 03:46, Paolo Bonzini wrote:

This includes:

- implementing SHA and CMPccXADD instruction extensions

- introducing a new mechanism for flags writeback that avoids a
   tricky failure

- converting the more orthogonal parts of the one-byte opcode
   map, as well as the CMOVcc and SETcc instructions.

Tested by booting several 32-bit and 64-bit guests.

The new decoder produces roughly 2% more ops, but after optimization there
are just 0.5% more and almost all of them come from cmp instructions.
For some reason that I have not investigated, these end up with an extra
mov even after optimization:

                                 sub_i64 tmp0,rax,$0x33
  mov_i64 cc_src,$0x33           mov_i64 cc_dst,tmp0
  sub_i64 cc_dst,rax,$0x33       mov_i64 cc_src,$0x33
  discard cc_src2                discard cc_src2
  discard cc_op                  discard cc_op

It could be easily fixed by not reusing gen_SUB for cmp instructions,
or by debugging what goes on in the optimizer.  However, it does not
result in larger assembly.

This is expected behaviour out of the tcg optimizer. We don't forward-propagate outputsat that point. But during register allocation of the "mov cc_dst,tmp0" opcode, we willsee that tmp0 is dead and re-assign the register from tmp0 to cc_dst without emitting anhost instruction.

r~

[Prev in Thread]

Current Thread

[Next in Thread]

[PATCH v2 07/19] target/i386: introduce flags writeback mechanism, (continued)
- [PATCH v2 07/19] target/i386: introduce flags writeback mechanism, Paolo Bonzini, 2023/10/19
- [PATCH v2 10/19] target/i386: reintroduce debugging mechanism, Paolo Bonzini, 2023/10/19
- [PATCH v2 13/19] target/i386: split eflags computation out of gen_compute_eflags, Paolo Bonzini, 2023/10/19
- [PATCH v2 12/19] target/i386: adjust decoding of J operand, Paolo Bonzini, 2023/10/19
- [PATCH v2 16/19] target/i386: move operand load and writeback out of gen_cmovcc1, Paolo Bonzini, 2023/10/19
- [PATCH v2 15/19] target/i386: move 60-BF opcodes to new decoder, Paolo Bonzini, 2023/10/19
- [PATCH v2 17/19] target/i386: move remaining conditional operations to new decoder, Paolo Bonzini, 2023/10/19
- [PATCH v2 19/19] target/i386: remove gen_op, Paolo Bonzini, 2023/10/19
- [PATCH v2 18/19] target/i386: remove now converted opcodes from old decoder, Paolo Bonzini, 2023/10/19
- Re: [PATCH v2 00/19] target/i386: decoder changes for 8.2, Paolo Bonzini, 2023/10/19
- Re: [PATCH v2 00/19] target/i386: decoder changes for 8.2, Richard Henderson <=

Prev by Date: Re: [PATCH v3 0/7] hw/ppc: SysBus simplifications
Next by Date: Re: [PATCH v2 01/19] target/i386: group common checks in the decoding phase
Previous by thread: Re: [PATCH v2 00/19] target/i386: decoder changes for 8.2
Next by thread: [PATCH] hw/char/mcf_uart: Have mcf_uart_create() return DeviceState
Index(es):
- Date
- Thread