qemu-discuss
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Qemu-discuss] USE QEMU + KVM TO RUN CUSTOM KERNEL


From: giuseppe maugeri
Subject: Re: [Qemu-discuss] USE QEMU + KVM TO RUN CUSTOM KERNEL
Date: Thu, 5 May 2016 09:53:40 +0200

The code reported below with KVM needs 60 minutes to complete aganist 4 minutes without.
I just wondering if with KVM enabled is possible to match the performance of simple Qemu.


#include "lib/lib.h"

     main()
     {
         int i=0;
         for (i=0;i<=900000;i++)
         {
               printf("num=%d \n",i);
         }
        exit(0);
}

Regards,
Giuseppe.


2016-05-05 9:40 GMT+02:00 <address@hidden>:

What is your expecting performance?


Vga of stdvga and cirrus are all software renderer, and their IO addresses were all emulated by virtual memory (i.e. mmap).  The doubled R/W access time seems be reasonable.



從: Qemu-discuss <qemu-discuss-bounces+itria30110=address@hidden> 代表 giuseppe maugeri <address@hidden>
寄件日期: 2016年05月05日 04:47
至: address@hidden
主旨: Re: [Qemu-discuss] USE QEMU + KVM TO RUN CUSTOM KERNEL
 
I tried to launch Qemu with Vga set to cirrus,std and vmware.
Same bad numbers.
Is my only option to patch Qemu + KVM?
I also noted some slowdown in my ata custom driver.
The driver currently supports pio mode only.
Is this mode optimized in Qemu and Qemu + KVM?

Regards,
Giuseppe.

2016-05-04 22:39 GMT+02:00 giuseppe maugeri <address@hidden>:
I tried to launch Qemu with Vga set to cirrus,std and vmware.
Same bad numbers.
Is my only option to patch Qemu + KVM?
I also noted some slowdown in my ata custom driver.
The driver currently supports pio mode only.
Is this mode optimized in Qemu and Qemu + KVM?

Regards,
Giuseppe.

2016-05-04 11:23 GMT+02:00 Jakob Bohm <address@hidden>:
On 04/05/2016 10:50, giuseppe maugeri wrote:
Hi,
i'm developing a custom kernel for x86 platform.So Far i used as virtual environment Simics.
Recently i decided to move to Qemu.
Running my kernel on Qemu I found it tremendously slow even with KVM enabled.

Here a little example:

     #include "lib/lib.h"

     main()
     {
         int i=0;
         for (i=0;i<=900000;i++)
         {
               printf("num=%d \n",i);
         }
        exit(0);
}

In this code printf uses a custom simple system call that switch the kernel in ring mode and writes a string in the text screen video memory at 0xB8000.
Performace are awful in Qemu even with KVM support.

With Simics performance are very close to real hardware.

Some number:
Running code under Linux on real hardware 30 sescs
Running code under my os with qemu with no kvm 4 min
Running code under my os with qemu+kmv 60 minites.

This is my Qemu launch string with KVM:
./qemu-system-x86_64 -enable-kvm -m 1000M -hda /home/peppe/Scrivania/disk.img

KVM module is loaded:
lsmod | grep kvm
kvm_intel 142999 0
kvm 444314 1 kvm_intel

To double check my virtualization environment i loaded a Fedora 13 live image with Qemu + KVM and all is worked as expected.

MY Question is:
Why my code is so slow under KVM?
Need i to perform a special tuning on KVM?
Is KVM SUTED to run custom kernel like mine?

In this example i suppose KVM needs to perform some special trap in order to access to text memory video.
Just because this is a unusual path is it perhaps not optimized?

Have you tried varying the options for the virtual video card?

Maybe the default video card does something like the
following (I am guessing):

qemu alone: Each write to the 128K block at A0000 is special
cased as a call to the video card emulator, which instantly
knows which character cell you wrote.

qemu+kvm: Each write to the 128K block at A0000 triggers a
page fault to the KVM driver which then uses a less efficient
single instruction stepper to execute the faulting
instruction (so the write actually goes through), then tells
qemu which 4K page was modified.  Then qemu has to figure out
which bytes in those 4K were modified in order to figure out
which character cell(s) to need to be redrawn.  Even on a fast
CPU this is not unlikely to consume the observed 25us per
character.

The above mechanisms are probably needed to emulate the
special memory write behavior of VGA 640x480 graphics mode,
and some similar modes, but are not the most efficient way
to emulate a text mode MDA/HGA/CGA/EGA/VGA simple VRAM
buffer if the final output device is a fast display and not
e.g. a tty-protocol "terminal" or a remote X/VNC server.

Choosing a different virtual video card might result in a
different algorithm for managing the A0000/128K block when
in classic text modes.

As for the Fedora image, I don't know if that uses a different
video mode where the qemu code is more optimized (because
Fedora/RedHat are big promoters of KVM and would have
optimized the default cases to work together).



Enjoy

Jakob
-- 
Jakob Bohm, CIO, Partner, WiseMo A/S.  https://www.wisemo.com
Transformervej 29, 2860 Søborg, Denmark.  Direct +45 31 13 16 10
This public discussion message is non-binding and may contain errors.
WiseMo - Remote Service Management for PCs, Phones and Embedded 


本信件可能包含工研院機密資訊,非指定之收件者,請勿使用或揭露本信件內容,並請銷毀此信件。
This email may contain confidential information. Please do not use or disclose it in any way and delete it if you are not the intended recipient.


reply via email to

[Prev in Thread] Current Thread [Next in Thread]