emacs-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Profiling native-compilation


From: Andrea Corallo
Subject: Re: Profiling native-compilation
Date: Fri, 23 Apr 2021 13:46:26 +0000
User-agent: Gnus/5.13 (Gnus v5.13) Emacs/28.0.50 (gnu/linux)

Eli Zaretskii <eliz@gnu.org> writes:

>> From: Andrea Corallo <akrl@sdf.org>
>> Cc: emacs-devel@gnu.org
>> Date: Fri, 23 Apr 2021 08:45:44 +0000
>>
>> AFAIK profiling syncronous native compilation with profile.el should be
>> possible but with one caveat that is: we run the final pass in a
>> subprocess so we'll loose visibility on that, OTOH we can consider final
>> being essentially all libgccjit work.
>
> The "final pass" being what in this case? which function call
> initiates that final pass?

`comp-final'

>> Maybe that's the reason why you see low percentages? Have you profiled a
>> `native-compile' invocation?
>
> Yes, I've profiled the invocation of native-compile.  (Btw, it
> signaled an error near the end.)  Is there a way to invoke native
> compilation in an interactive session while preventing the last pass
> from being run in a subprocess?

Probably for profiling an hacky way would be binding
`byte-native-for-bootstrap' to t.

Another variant I think you'll find of interest is to run binding
`comp-dry-run' to t in order to exclude final from the run, this way
you'll profile essentialy the Lisp part only.  I think this should help
you in geting a more meaningful result from profile.el.

>> > I think this is important because native-compilation is quite slow,
>> > and at least I don't have a clear idea where most of that time is
>> > spent.  Andrea, you said once that most of the time is spent in
>> > libgccjit functions, which I find strange, since those are basically
>> > parts of GCC code, and I don't think I ever saw such slow compilation
>> > times from GCC.
>>
>> Yes I mentioned that but since we changed the layout of generated code
>> for storing immediates GCC got way faster and now the Emacs side is
>> about equally responsible for the time spent in compilation.
>
> Which is why it's important to see where the time is spent, so that we
> could work on perhaps speeding it up.

I've found some not very recent notes and at the time I measured quite
some time spent in SSA renaming `comp-ssa-rename' called by fwprop.
We'll see if it's still the case.

>> One simple approach to get an idea of where time is spent in the
>> compiler is to bind `comp-log-time-report' to get a report pass by pass.
>
> That's a start, but it doesn't help to identify the hot spots.
>
>> Running:
>>
>> ===
>> (require 'comp)
>> (let ((comp-log-time-report t))
>>   (native-compile ".../emacs2/lisp/emacs-lisp/byte-opt.el"))
>> ===
>>
>> I get:
>>
>> Done compiling 
>> .../.emacs.d/eln-cache/28.0.50-7f3f9b99/byte-opt-9c5f25f5-d2005e8f.eln
>> Pass comp-spill-lap took: 0.095720s.
>> Pass comp-limplify took: 0.423704s.
>> Pass comp-fwprop took: 3.334509s.
>> Pass comp-call-optim took: 0.023363s.
>> Pass comp-ipa-pure took: 0.000046s.
>> Pass comp-add-cstrs took: 0.222719s.
>> Pass comp-fwprop took: 2.549254s.
>> Pass comp-tco took: 0.000044s.
>> Pass comp-fwprop took: 0.256151s.
>> Pass comp-remove-type-hints took: 0.025859s.
>> Pass comp-final took: 4.268238s.
>
> That's very fast.  I don't think I see such short times.  Here's my
> result, almost 4 times slower:
>
>   Pass comp-spill-lap took: 1.281250s.
>   Pass comp-limplify took: 1.500000s.
>   Pass comp-fwprop took: 11.078125s.
>   Pass comp-call-optim took: 0.000000s.
>   Pass comp-ipa-pure took: 0.000000s.
>   Pass comp-add-cstrs took: 0.625000s.
>   Pass comp-fwprop took: 8.750000s.
>   Pass comp-tco took: 0.000000s.
>   Pass comp-fwprop took: 0.906250s.
>   Pass comp-remove-type-hints took: 0.000000s.
>   Pass comp-final took: 27.375000s.
>
> Note that I ran this in an unoptimized build of Emacs, so any Lisp,
> bytecode, and calls into primitives are expected to be between 3 and 4
> times slower than in an optimized build.

Yeah as a result I think is expected for an unoptimized build.

> How much elapsed time does it take you to compile comp.el itself?

On my optimized build it's 1m29sec, that's the slowest compilation
unit we have I think.

>> I think would be interesting to sort by priority what I think are the
>> three main ares to work on the native compiler on the post merge phase:
>> compile-time, generated code performance, hardening, documentation.
>
> They are all equally important, IMO.  I hope we can work on them in
> parallel.  I didn't mean to imply that you personally must work on all
> of them (which would then make them more sequential tasks).

Sure, I'm quite bad at parallelizing work so some direction for my
effort will be appreciated once merge is done and we extinguished any
possible fire.

Thanks

  Andrea



reply via email to

[Prev in Thread] Current Thread [Next in Thread]