qemu-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH] target/i386: Add new CPU model SierraForest


From: Tao Su
Subject: Re: [PATCH] target/i386: Add new CPU model SierraForest
Date: Wed, 13 Mar 2024 17:02:37 +0800

On Fri, Mar 08, 2024 at 05:36:52PM +0100, Igor Mammedov wrote:
> On Wed,  6 Dec 2023 21:19:23 +0800
> Tao Su <tao1.su@linux.intel.com> wrote:
> 
> > SierraForest is Intel's first generation E-core based Xeon server
> > processor, which will be released in the first half of 2024.
> > 
> > SierraForest mainly adds the following new features based on
> > GraniteRapids:
> > 
> 
> *) Please specify chapter/table where it says that CPU model got
> this features

I got it from table 1-2 in Intel Architecture Instruction Set Extensions
and Future Features (rev 051), I will add to next version.

> 
> > - CMPCCXADD CPUID.(EAX=7,ECX=1):EAX[bit 7]
> > - AVX-IFMA CPUID.(EAX=7,ECX=1):EAX[bit 23]
> > - AVX-VNNI-INT8 CPUID.(EAX=7,ECX=1):EDX[bit 4]
> > - AVX-NE-CONVERT CPUID.(EAX=7,ECX=1):EDX[bit 5]
> 
> > - LAM CPUID.(EAX=7,ECX=1):EAX[bit 26]
> > - LASS CPUID.(EAX=7,ECX=1):EAX[bit 6]
> this implies that series adds these bit but later you say it doesn't,
> it's confusing at best.
> 
> Also you've missed some other features mentioned in spec
> (for example: RDMSRLIST, UIRET, UC-Lock Disable, ENQCMD ...)
> so it makes above list even more inconsistent.
> Why some features were added while others not.

The above spec lists bare-metal features, but the virtualization patches
of some features still aren’t merged in mainline now, e.g. LAM and LASS.
CPU model only contains the features which have already been virtualized,
otherwise the unvirtualized features will trigger warning and be filtered.

>  
> 
> ditto [*] for removed +
> where it says that it's GraniteRapids based.
> 

The "based on" should be misleading, SierraForest and GraniteRapids have
completely different instruction sets and SierraForest is based on
Efficient-core, maybe "compared with" is more accurate. The below features
are captured from SierraForest real machine.

> > and removes the following features based on GraniteRapids:
> > 
> > - HLE CPUID.(EAX=7,ECX=0):EBX[bit 4]
> > - RTM CPUID.(EAX=7,ECX=0):EBX[bit 11]
> > - AVX512F CPUID.(EAX=7,ECX=0):EBX[bit 16]
> > - AVX512DQ CPUID.(EAX=7,ECX=0):EBX[bit 17]
> > - AVX512_IFMA CPUID.(EAX=7,ECX=0):EBX[bit 21]
> > - AVX512CD CPUID.(EAX=7,ECX=0):EBX[bit 28]
> > - AVX512BW CPUID.(EAX=7,ECX=0):EBX[bit 30]
> > - AVX512VL CPUID.(EAX=7,ECX=0):EBX[bit 31]
> > - AVX512_VBMI CPUID.(EAX=7,ECX=0):ECX[bit 1]
> > - AVX512_VBMI2 CPUID.(EAX=7,ECX=0):ECX[bit 6]
> > - AVX512_VNNI CPUID.(EAX=7,ECX=0):ECX[bit 11]
> > - AVX512_BITALG CPUID.(EAX=7,ECX=0):ECX[bit 12]
> > - AVX512_VPOPCNTDQ CPUID.(EAX=7,ECX=0):ECX[bit 14]
> > - LA57 CPUID.(EAX=7,ECX=0):ECX[bit 16]
> > - TSXLDTRK CPUID.(EAX=7,ECX=0):EDX[bit 16]
> > - AMX-BF16 CPUID.(EAX=7,ECX=0):EDX[bit 22]
> > - AVX512_FP16 CPUID.(EAX=7,ECX=0):EDX[bit 23]
> > - AMX-TILE CPUID.(EAX=7,ECX=0):EDX[bit 24]
> > - AMX-INT8 CPUID.(EAX=7,ECX=0):EDX[bit 25]
> > - AVX512_BF16 CPUID.(EAX=7,ECX=1):EAX[bit 5]
> > - fast zero-length MOVSB CPUID.(EAX=7,ECX=1):EAX[bit 10]
> > - fast short CMPSB, SCASB CPUID.(EAX=7,ECX=1):EAX[bit 12]
> > - AMX-FP16 CPUID.(EAX=7,ECX=1):EAX[bit 21]
> > - PREFETCHI CPUID.(EAX=7,ECX=1):EDX[bit 14]
> > - XFD CPUID.(EAX=0xD,ECX=1):EAX[bit 4]
> > - EPT_PAGE_WALK_LENGTH_5 VMX_EPT_VPID_CAP(0x48c)[bit 7]
> 
> 
> > SierraForest doesn't support TSX, so TSX Async Abort(TAA) vulnerabilities
> > don't exist on SierraForest. On KVM side, if host doesn't enumerate RTM
> > or RTM gets disabled, ARCH_CAP_TAA_NO is reported as unsupported. To
> > avoid the confusing warning:
> > warning: host doesn't support requested feature: MSR(10AH).taa-no
> >          [bit 8]
> > 
> > just don't include TAA_NO in SierraForest CPU model.
> 
> I probably missing something.
> 
> If host is Sierra Forest it won't have TSX and so TAA_NO, right?
> If so, then how above warning can be triggered and why SierraForest
> might have when it shouldn't?
> /me confused

Host doesn’t support TSX and RTM but supports TAA_NO. When RTM is not
enabled in host, KVM will not report TAA_NO. So, if adding TAA_NO to CPU
model, it will trigger warning and be filtered because KVM doesn’t report.

> 
> 
> > Currently LAM and LASS are not enabled in KVM mainline yet,  will add
> > them after merged.
> 
> if you add features in this series while kernel side is missing,
> wouldn't they be filtered out since kernel doesn't support  them yet?

Yes, they will be filtered, but they will trigger warning in QEMU.

> 
> > 
> > Signed-off-by: Tao Su <tao1.su@linux.intel.com>
> > ---
> > The new features can be found in Intel ISE[1].
> > LAM has just been accepted by KVM[2].
> > 
> > Although we would like to include all SierraForest features in the first
> > version of the CPU model, SierraForest will be released in the first half
> > of 2024[3], we would want user can have a first usable SierraForest CPU
> > model in the QEMU when they have the hardware in their hand.
> > 
> > [1] https://cdrdv2.intel.com/v1/dl/getContent/671368
> 
> Links are ephemeral, so along with it add full spec name + rev/date
> so that readers could look it up.
> Also move this to the beginning of the main body (above --- line),
> so it won't be gone after commit 

Yes, I will do in the next version, many thanks for your reminder.

Thanks,
Tao

> 
> > [2]
> > https://lore.kernel.org/all/169810442917.2499338.3440694989716170017.b4-ty@google.com/
> > [3]
> > https://www.intel.com/content/www/us/en/newsroom/news/tackling-throughput-computing-sierra-forest.html
> > ---
> >  target/i386/cpu.c | 126 ++++++++++++++++++++++++++++++++++++++++++++++
> >  1 file changed, 126 insertions(+)
> > 
> > diff --git a/target/i386/cpu.c b/target/i386/cpu.c
> > index cd16cb893d..2405c9e407 100644
> > --- a/target/i386/cpu.c
> > +++ b/target/i386/cpu.c
> > @@ -4099,6 +4099,132 @@ static const X86CPUDefinition builtin_x86_defs[] = {
> >              { /* end of list */ },
> >          },
> >      },
> > +    {
> > +        .name = "SierraForest",
> > +        .level = 0x23,
> > +        .vendor = CPUID_VENDOR_INTEL,
> > +        .family = 6,
> > +        .model = 175,
> > +        .stepping = 0,
> > +        /*
> > +         * please keep the ascending order so that we can have a clear 
> > view of
> > +         * bit position of each feature.
> > +         */
> > +        .features[FEAT_1_EDX] =
> > +            CPUID_FP87 | CPUID_VME | CPUID_DE | CPUID_PSE | CPUID_TSC |
> > +            CPUID_MSR | CPUID_PAE | CPUID_MCE | CPUID_CX8 | CPUID_APIC |
> > +            CPUID_SEP | CPUID_MTRR | CPUID_PGE | CPUID_MCA | CPUID_CMOV |
> > +            CPUID_PAT | CPUID_PSE36 | CPUID_CLFLUSH | CPUID_MMX | 
> > CPUID_FXSR |
> > +            CPUID_SSE | CPUID_SSE2,
> > +        .features[FEAT_1_ECX] =
> > +            CPUID_EXT_SSE3 | CPUID_EXT_PCLMULQDQ | CPUID_EXT_SSSE3 |
> > +            CPUID_EXT_FMA | CPUID_EXT_CX16 | CPUID_EXT_PCID | 
> > CPUID_EXT_SSE41 |
> > +            CPUID_EXT_SSE42 | CPUID_EXT_X2APIC | CPUID_EXT_MOVBE |
> > +            CPUID_EXT_POPCNT | CPUID_EXT_TSC_DEADLINE_TIMER | 
> > CPUID_EXT_AES |
> > +            CPUID_EXT_XSAVE | CPUID_EXT_AVX | CPUID_EXT_F16C | 
> > CPUID_EXT_RDRAND,
> > +        .features[FEAT_8000_0001_EDX] =
> > +            CPUID_EXT2_SYSCALL | CPUID_EXT2_NX | CPUID_EXT2_PDPE1GB |
> > +            CPUID_EXT2_RDTSCP | CPUID_EXT2_LM,
> > +        .features[FEAT_8000_0001_ECX] =
> > +            CPUID_EXT3_LAHF_LM | CPUID_EXT3_ABM | CPUID_EXT3_3DNOWPREFETCH,
> > +        .features[FEAT_8000_0008_EBX] =
> > +            CPUID_8000_0008_EBX_WBNOINVD,
> > +        .features[FEAT_7_0_EBX] =
> > +            CPUID_7_0_EBX_FSGSBASE | CPUID_7_0_EBX_BMI1 | 
> > CPUID_7_0_EBX_AVX2 |
> > +            CPUID_7_0_EBX_SMEP | CPUID_7_0_EBX_BMI2 | CPUID_7_0_EBX_ERMS |
> > +            CPUID_7_0_EBX_INVPCID | CPUID_7_0_EBX_RDSEED | 
> > CPUID_7_0_EBX_ADX |
> > +            CPUID_7_0_EBX_SMAP | CPUID_7_0_EBX_CLFLUSHOPT | 
> > CPUID_7_0_EBX_CLWB |
> > +            CPUID_7_0_EBX_SHA_NI,
> > +        .features[FEAT_7_0_ECX] =
> > +            CPUID_7_0_ECX_UMIP | CPUID_7_0_ECX_PKU | CPUID_7_0_ECX_GFNI |
> > +            CPUID_7_0_ECX_VAES | CPUID_7_0_ECX_VPCLMULQDQ |
> > +            CPUID_7_0_ECX_RDPID | CPUID_7_0_ECX_BUS_LOCK_DETECT,
> > +        .features[FEAT_7_0_EDX] =
> > +            CPUID_7_0_EDX_FSRM | CPUID_7_0_EDX_SERIALIZE |
> > +            CPUID_7_0_EDX_SPEC_CTRL | CPUID_7_0_EDX_ARCH_CAPABILITIES |
> > +            CPUID_7_0_EDX_SPEC_CTRL_SSBD,
> > +        .features[FEAT_ARCH_CAPABILITIES] =
> > +            MSR_ARCH_CAP_RDCL_NO | MSR_ARCH_CAP_IBRS_ALL |
> > +            MSR_ARCH_CAP_SKIP_L1DFL_VMENTRY | MSR_ARCH_CAP_MDS_NO |
> > +            MSR_ARCH_CAP_PSCHANGE_MC_NO | MSR_ARCH_CAP_SBDR_SSDP_NO |
> > +            MSR_ARCH_CAP_FBSDP_NO | MSR_ARCH_CAP_PSDP_NO |
> > +            MSR_ARCH_CAP_PBRSB_NO,
> > +        .features[FEAT_XSAVE] =
> > +            CPUID_XSAVE_XSAVEOPT | CPUID_XSAVE_XSAVEC |
> > +            CPUID_XSAVE_XGETBV1 | CPUID_XSAVE_XSAVES,
> > +        .features[FEAT_6_EAX] =
> > +            CPUID_6_EAX_ARAT,
> > +        .features[FEAT_7_1_EAX] =
> > +            CPUID_7_1_EAX_AVX_VNNI | CPUID_7_1_EAX_CMPCCXADD |
> > +            CPUID_7_1_EAX_FSRS | CPUID_7_1_EAX_AVX_IFMA,
> > +        .features[FEAT_7_1_EDX] =
> > +            CPUID_7_1_EDX_AVX_VNNI_INT8 | CPUID_7_1_EDX_AVX_NE_CONVERT,
> > +        .features[FEAT_7_2_EDX] =
> > +            CPUID_7_2_EDX_MCDT_NO,
> > +        .features[FEAT_VMX_BASIC] =
> > +            MSR_VMX_BASIC_INS_OUTS | MSR_VMX_BASIC_TRUE_CTLS,
> > +        .features[FEAT_VMX_ENTRY_CTLS] =
> > +            VMX_VM_ENTRY_LOAD_DEBUG_CONTROLS | VMX_VM_ENTRY_IA32E_MODE |
> > +            VMX_VM_ENTRY_LOAD_IA32_PERF_GLOBAL_CTRL |
> > +            VMX_VM_ENTRY_LOAD_IA32_PAT | VMX_VM_ENTRY_LOAD_IA32_EFER,
> > +        .features[FEAT_VMX_EPT_VPID_CAPS] =
> > +            MSR_VMX_EPT_EXECONLY | MSR_VMX_EPT_PAGE_WALK_LENGTH_4 |
> > +            MSR_VMX_EPT_WB | MSR_VMX_EPT_2MB | MSR_VMX_EPT_1GB |
> > +            MSR_VMX_EPT_INVEPT | MSR_VMX_EPT_AD_BITS |
> > +            MSR_VMX_EPT_INVEPT_SINGLE_CONTEXT | 
> > MSR_VMX_EPT_INVEPT_ALL_CONTEXT |
> > +            MSR_VMX_EPT_INVVPID | MSR_VMX_EPT_INVVPID_SINGLE_ADDR |
> > +            MSR_VMX_EPT_INVVPID_SINGLE_CONTEXT |
> > +            MSR_VMX_EPT_INVVPID_ALL_CONTEXT |
> > +            MSR_VMX_EPT_INVVPID_SINGLE_CONTEXT_NOGLOBALS,
> > +        .features[FEAT_VMX_EXIT_CTLS] =
> > +            VMX_VM_EXIT_SAVE_DEBUG_CONTROLS |
> > +            VMX_VM_EXIT_LOAD_IA32_PERF_GLOBAL_CTRL |
> > +            VMX_VM_EXIT_ACK_INTR_ON_EXIT | VMX_VM_EXIT_SAVE_IA32_PAT |
> > +            VMX_VM_EXIT_LOAD_IA32_PAT | VMX_VM_EXIT_SAVE_IA32_EFER |
> > +            VMX_VM_EXIT_LOAD_IA32_EFER | 
> > VMX_VM_EXIT_SAVE_VMX_PREEMPTION_TIMER,
> > +        .features[FEAT_VMX_MISC] =
> > +            MSR_VMX_MISC_STORE_LMA | MSR_VMX_MISC_ACTIVITY_HLT |
> > +            MSR_VMX_MISC_VMWRITE_VMEXIT,
> > +        .features[FEAT_VMX_PINBASED_CTLS] =
> > +            VMX_PIN_BASED_EXT_INTR_MASK | VMX_PIN_BASED_NMI_EXITING |
> > +            VMX_PIN_BASED_VIRTUAL_NMIS | 
> > VMX_PIN_BASED_VMX_PREEMPTION_TIMER |
> > +            VMX_PIN_BASED_POSTED_INTR,
> > +        .features[FEAT_VMX_PROCBASED_CTLS] =
> > +            VMX_CPU_BASED_VIRTUAL_INTR_PENDING |
> > +            VMX_CPU_BASED_USE_TSC_OFFSETING | VMX_CPU_BASED_HLT_EXITING |
> > +            VMX_CPU_BASED_INVLPG_EXITING | VMX_CPU_BASED_MWAIT_EXITING |
> > +            VMX_CPU_BASED_RDPMC_EXITING | VMX_CPU_BASED_RDTSC_EXITING |
> > +            VMX_CPU_BASED_CR3_LOAD_EXITING | 
> > VMX_CPU_BASED_CR3_STORE_EXITING |
> > +            VMX_CPU_BASED_CR8_LOAD_EXITING | 
> > VMX_CPU_BASED_CR8_STORE_EXITING |
> > +            VMX_CPU_BASED_TPR_SHADOW | VMX_CPU_BASED_VIRTUAL_NMI_PENDING |
> > +            VMX_CPU_BASED_MOV_DR_EXITING | VMX_CPU_BASED_UNCOND_IO_EXITING 
> > |
> > +            VMX_CPU_BASED_USE_IO_BITMAPS | VMX_CPU_BASED_MONITOR_TRAP_FLAG 
> > |
> > +            VMX_CPU_BASED_USE_MSR_BITMAPS | VMX_CPU_BASED_MONITOR_EXITING |
> > +            VMX_CPU_BASED_PAUSE_EXITING |
> > +            VMX_CPU_BASED_ACTIVATE_SECONDARY_CONTROLS,
> > +        .features[FEAT_VMX_SECONDARY_CTLS] =
> > +            VMX_SECONDARY_EXEC_VIRTUALIZE_APIC_ACCESSES |
> > +            VMX_SECONDARY_EXEC_ENABLE_EPT | VMX_SECONDARY_EXEC_DESC |
> > +            VMX_SECONDARY_EXEC_RDTSCP |
> > +            VMX_SECONDARY_EXEC_VIRTUALIZE_X2APIC_MODE |
> > +            VMX_SECONDARY_EXEC_ENABLE_VPID | 
> > VMX_SECONDARY_EXEC_WBINVD_EXITING |
> > +            VMX_SECONDARY_EXEC_UNRESTRICTED_GUEST |
> > +            VMX_SECONDARY_EXEC_APIC_REGISTER_VIRT |
> > +            VMX_SECONDARY_EXEC_VIRTUAL_INTR_DELIVERY |
> > +            VMX_SECONDARY_EXEC_RDRAND_EXITING |
> > +            VMX_SECONDARY_EXEC_ENABLE_INVPCID |
> > +            VMX_SECONDARY_EXEC_ENABLE_VMFUNC | 
> > VMX_SECONDARY_EXEC_SHADOW_VMCS |
> > +            VMX_SECONDARY_EXEC_RDSEED_EXITING | 
> > VMX_SECONDARY_EXEC_ENABLE_PML |
> > +            VMX_SECONDARY_EXEC_XSAVES,
> > +        .features[FEAT_VMX_VMFUNC] =
> > +            MSR_VMX_VMFUNC_EPT_SWITCHING,
> > +        .xlevel = 0x80000008,
> > +        .model_id = "Intel Xeon Processor (SierraForest)",
> > +        .versions = (X86CPUVersionDefinition[]) {
> > +            { .version = 1 },
> > +            { /* end of list */ },
> > +        },
> > +    },
> >      {
> >          .name = "Denverton",
> >          .level = 21,
> > 
> > base-commit: d451e32ce8e1eef1b7d05d9f532113e9618f1fc1
> 



reply via email to

[Prev in Thread] Current Thread [Next in Thread]