Re: Hotspot Identification

octave-maintainers

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Hotspot Identification

From:	Daniel J Sebald
Subject:	Re: Hotspot Identification
Date:	Wed, 14 Aug 2019 01:33:44 -0400
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0

On 8/13/19 5:21 PM, Rik wrote:

Here is another possibility.  I find that octave_value_list is often taking
~1% of an particular leaf function.  If I check the annotated code I see
that atomic locking instructions take a very long time.

--- Start Code Annotation ---
octave_value_list::octave_value_list
/home/rik/wip/Projects_Mine/octave-dev/libgui/.libs/liboctgui.so.5.0.0
Samples│    _ZN17octave_value_listC2Ev():
        │    OCTINTERP_API
        │    octave_value_list
        │    {
        │    public:
        │
        │      octave_value_list (void)
        │      push   %rbp
     11 │      mov    %rsp,%rbp
      7 │      push   %r12
      1 │      push   %rbx
        │    _ZN17octave_value_listC1Ev():
      1 │      add    $0x10,%rax
        │    _ZN17octave_value_listC2Ev():
     15 │      mov    %rdi,%rbx
        │    _ZN17octave_value_listC1Ev():
      6 │      mov    %rax,(%rdi)
        │
        │    public:
        │
        │      static octave_idx_type dim_max (void);
        │
        │      explicit dim_vector (void) : rep (nil_rep ())
      1 │    → callq  dim_vector::nil_rep()@plt
      3 │      mov    %rax,0x8(%rbx)
        │      { OCTAVE_ATOMIC_INCREMENT (&(count ())); }
    327 │      lock   addq   $0x1,-0x10(%rax)
        │        : dimensions (), rep (nil_rep ()), slice_data (rep->data),
      1 │    → callq  Array<octave_value
        │          slice_len (rep->len)
     10 │      mov    (%rax),%rdx
      7 │      mov    %rax,0x10(%rbx)
        │      mov    %rdx,0x18(%rbx)
        │      mov    0x8(%rax),%rdx
     14 │      mov    %rdx,0x20(%rbx)
        │          return OCTAVE_ATOMIC_INCREMENT (&m_count);
        │        }
        │
        │        count_type operator++ (int)
        │        {
        │          return OCTAVE_ATOMIC_POST_INCREMENT (&m_count);
    297 │      lock   addl   $0x1,0x10(%rax)
      1 │      mov    vtable for Array<std::__cxx11::basic_string<char,
std::char_traits<char,%rax
        │      add    $0x10,%rax
     16 │      mov    %rax,0x28(%rbx)
        │      explicit dim_vector (void) : rep (nil_rep ())
        │    → callq  dim_vector::nil_rep()@plt
        │      mov    %rax,0x30(%rbx)
        │      { OCTAVE_ATOMIC_INCREMENT (&(count ())); }
    294 │      lock   addq   $0x1,-0x10(%rax)
        │        : dimensions (), rep (nil_rep ()), slice_data (rep->data),
      1 │    → callq  Array<std::__cxx11::basic_string<char,
std::char_traits<char
        │          slice_len (rep->len)

--- End Code Annotation ---

I can change the atomic instructions to ordinary ones by configuring with
--disable-atomic-refcount.  The benchmark runtime drops from 14.1 seconds
to 11.6 seconds (2.5 seconds) which seems important.

Nice work Rik. So the versions where we see a jump in consumption, arethey correlated with the introduction of the GUI? Roughly 4.0? Thatsounds about right.

The requirement for atomic refcounting was introduced by communication with
the GUI.  This brings up a hard question, is there a better way to
implement cross-thread communication?

I believe so. Signals/slots are thread safe. One can declare anoctave_value/octave_value_ptr as a QObject then shuffle it across asignal/slot. All the octave routines return an octave_value or pointeror something. One has to be efficient about it though. On the GUIside, only request the amount of information that would, say, bevisible. E.g., just some small sub-matrix, not a whole 1000 x 1000 matrix.

Dan

[Prev in Thread]

Current Thread

[Next in Thread]

Hotspot identification, Rik, 2019/08/13
- Re: Hotspot identification, John W. Eaton, 2019/08/13
  - Re: Hotspot identification, Rik, 2019/08/13
- Re: Hotspot Identification, Rik, 2019/08/13
  - Re: Hotspot Identification, Daniel J Sebald <=
    - Re: Hotspot Identification, John W. Eaton, 2019/08/14
    - Re: Hotspot Identification, Daniel J Sebald, 2019/08/14
    - Re: atomic references, Rik, 2019/08/14
    - Re: atomic references, John W. Eaton, 2019/08/14
    - Re: move constructors, Rik, 2019/08/14
    - Re: move constructors, John W. Eaton, 2019/08/14
    - Re: move constructors, Rik, 2019/08/14
    - Message not available
    - Re: move constructors, Rik, 2019/08/14
    - Re: move constructors, John W. Eaton, 2019/08/15
    - Re: move constructors, Rik, 2019/08/15

Prev by Date: Re: Hotspot Identification
Next by Date: Re: Hotspot Identification
Previous by thread: Re: Hotspot Identification
Next by thread: Re: Hotspot Identification
Index(es):
- Date
- Thread