qemu-discuss
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: MP tables do not report multiple CPUs in Qemu 6.2.0 on x86 when give


From: Ani Sinha
Subject: Re: MP tables do not report multiple CPUs in Qemu 6.2.0 on x86 when given -smp cpus=n flag
Date: Thu, 20 Jan 2022 15:38:26 +0530 (IST)
User-agent: Alpine 2.20 (OSX 67 2015-01-07)


On Thu, 20 Jan 2022, Ani Sinha wrote:

> +qemu-devel
>
> On Thu, 20 Jan 2022, Ani Sinha wrote:
>
> >
> >
> > On Wed, 19 Jan 2022, Peter Maydell wrote:
> >
> > > On Wed, 19 Jan 2022 at 14:44, Godmar Back <gback@cs.vt.edu> wrote:
> > > > after upgrading to 6.2.0, I observed that code such as MIT's xv6 (see
> > > > [1]) is no longer able to detect multiple CPUs.  Their code works in
> > > > 6.1.1, however.
> > >
> > > Hi; this isn't a great place for reporting QEMU bugs, because
> > > it's more of a user-to-user discussion list. Not all QEMU
> > > developers read it. I've cc'd the ACPI maintainers, who
> > > hopefully may have an idea about what's happening here.
> > > You could also file a bug at
> > > https://gitlab.com/qemu-project/qemu/-/issues
> > >
> > > > I built 6.1.1 from source and 6.2.0 from source and I have also tested
> > > > with CentOS stream's 6.1.1 qemu-kvm and was able to pinpoint this
> > > > change to these 2 versions of qemu. I am using qemu-system-i386
> > > > specifically.
> > > >
> > > > I tried to go through the ChangeLog to see if the `-smp` option was
> > > > deprecated or changed.  I found this note [2] about invalid topologies
> > > > having been removed in 5.2. Here's what I found after long
> > > > experimentation:
> > > >
> > > > The legacy MP tables appear to work only if you specify the longform
> > > > `-smp cpus=4,cores=1,threads=1,sockets=4` in 6.2.0.  If you specify
> > > > `-smp 4` or `-smp cpus=4` it will not work in 6.2.0 (it worked in
> > > > 6.1.1). I am guessing that perhaps the MP tables add entries for each
> > > > socket, but that perhaps the behavior of the shortcuts `-smp n` and
> > > > `-smp cpus=n` was changed to influence the number of cores rather than
> > > > sockets.
> > > >
> > > > In other words, `-smp cpus=n` now means `-smp
> > > > cpus=n,cores=n,threads=1,sockets=1` whereas in 6.1.1 and before it
> > > > meant `-smp cpus=n,cores=1,threads=1,sockets=n`. I note that
> > > > specifying `-smp cpus=4,cores=4,threads=1,sockets=1` in 6.1.1 also
> > > > does not create 4 entries in the legacy MP tables.
> > > >
> >
> > My suspicion is that the following commit might have introduced a
> > regression:
> >
> > commit 86ce2d28fa09d15547b5cabdc264cadfb53a848c
> > Author: Yanan Wang <wangyanan55@huawei.com>
> > Date:   Tue Oct 26 11:46:58 2021 +0800
> >
> >     hw/core/machine: Split out the smp parsing code
> >
> >     We are going to introduce an unit test for the parser smp_parse()
> >     in hw/core/machine.c, but now machine.c is only built in softmmu.
> >
> >     In order to solve the build dependency on the smp parsing code and
> >     avoid building unrelated stuff for the unit tests, move the tested
> >     code from machine.c into a separate file, i.e., machine-smp.c and
> >     build it in common field.
> >
> >     Signed-off-by: Yanan Wang <wangyanan55@huawei.com>
> >     Reviewed-by: Andrew Jones <drjones@redhat.com>
> >     Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
> >     Tested-by: Philippe Mathieu-Daudé <philmd@redhat.com>
> >     Message-Id: <20211026034659.22040-2-wangyanan55@huawei.com>
> >     Acked-by: Eduardo Habkost <ehabkost@redhat.com>
> >     Signed-off-by: Philippe Mathieu-Daudé <philmd@redhat.com>
> >
> > I am still checking.
>
> Yes this patch introduced the behavior of preferring cores over sockets
> post 6.2. I think this is intentional. See the following hunk:
>
> +        if (mc->smp_props.prefer_sockets) {
> +            /* prefer sockets over cores before 6.2 */
> +            if (sockets == 0) {
> +                cores = cores > 0 ? cores : 1;
> +                threads = threads > 0 ? threads : 1;
> +                sockets = maxcpus / (dies * cores * threads);
> +            } else if (cores == 0) {
> +                threads = threads > 0 ? threads : 1;
> +                cores = maxcpus / (sockets * dies * threads);
> +            }
> +        } else {
> +            /* prefer cores over sockets since 6.2 */
> +            if (cores == 0) {
> +                sockets = sockets > 0 ? sockets : 1;
> +                threads = threads > 0 ? threads : 1;
> +                cores = maxcpus / (sockets * dies * threads);
> +            } else if (sockets == 0) {
> +                threads = threads > 0 ? threads : 1;
> +                sockets = maxcpus / (dies * cores * threads);
> +            }
> +        }

Actually I am not quite right. This is the real change which changed the
preference. The previous change was a code re-org that preserved the
behavior:

commit 4a0af2930a4e4f64ce551152fdb4b9e7be106408
Author: Yanan Wang <wangyanan55@huawei.com>
Date:   Wed Sep 29 10:58:09 2021 +0800

    machine: Prefer cores over sockets in smp parsing since 6.2

    In the real SMP hardware topology world, it's much more likely that
    we have high cores-per-socket counts and few sockets totally. While
    the current preference of sockets over cores in smp parsing results
    in a virtual cpu topology with low cores-per-sockets counts and a
    large number of sockets, which is just contrary to the real world.

    Given that it is better to make the virtual cpu topology be more
    reflective of the real world and also for the sake of compatibility,
    we start to prefer cores over sockets over threads in smp parsing
    since machine type 6.2 for different arches.

    In this patch, a boolean "smp_prefer_sockets" is added, and we only
    enable the old preference on older machines and enable the new one
    since type 6.2 for all arches by using the machine compat mechanism.

    Suggested-by: Daniel P. Berrange <berrange@redhat.com>
    Signed-off-by: Yanan Wang <wangyanan55@huawei.com>
    Acked-by: David Gibson <david@gibson.dropbear.id.au>
    Acked-by: Cornelia Huck <cohuck@redhat.com>
    Reviewed-by: Andrew Jones <drjones@redhat.com>
    Reviewed-by: Pankaj Gupta <pankaj.gupta@ionos.com>
    Reviewed-by: Daniel P. Berrangé <berrange@redhat.com>
    Message-Id: <20210929025816.21076-10-wangyanan55@huawei.com>
    Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>

In any case, the behavior change is intended because of the reasons the
above commit outlines.

reply via email to

[Prev in Thread] Current Thread [Next in Thread]