reproduce-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[task #15737] slurm - openmpi - (PMIx+libevent+hwloc)


From: Boud Roukema
Subject: [task #15737] slurm - openmpi - (PMIx+libevent+hwloc)
Date: Wed, 12 Aug 2020 13:43:34 -0400 (EDT)
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Firefox/68.0

Follow-up Comment #4, task #15737 (project reproduce):

It seems that more work is still needed to compile openmpi to use libraries
fully within the maneage subsystem.

I have one package that works fine with the openmpi options that are currently
used in Maneage, but another package fails semi-consistently on the same host,
within the same overall Maneage system.

Some of the packages that seem to have been installed from the host system
(CentOS 2.6.32-754.18.2.el6.x86_64), as indicated by the error tracing
messages, appear to be:


    /lib64/libc.so.6
    /lib64/libpthread.so.0
    /usr/lib64/ld-2.17.so
    /usr/lib64/libdl-2.17.so
    /usr/lib64/libm-2.17.so
    /usr/lib64/libutil-2.17.so


Openmpi is a *huge* package. I do not intend to try to solve this any time
soon. (A workaround is to run this particular program in serial mode, which is
an acceptable compromise.)

One possibility, that I might try if there's enough time, would be to update
to a more recent upstream Maneage, which *might* solve this.

However, I think that a proper Maneage install of openmpi, and thorough
testing on task schedulers like _slurm_, should be considered a major task.


    _______________________________________________________

Reply to this item at:

  <https://savannah.nongnu.org/task/?15737>

_______________________________________________
  Message sent via Savannah
  https://savannah.nongnu.org/




reply via email to

[Prev in Thread] Current Thread [Next in Thread]