bug-parallel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: GNU Parallel Bug Reports Feature request: if a SSH node goes down, r


From: Nazgul
Subject: Re: GNU Parallel Bug Reports Feature request: if a SSH node goes down, retry on other nodes
Date: Mon, 14 Dec 2015 00:48:53 +1100

It seems --filter-hosts is a good candidate. However I have two doubts:
  1. Is this a check performed before the distributed executions or is this a policy active throughout the whole life-time of the Parallel process? This makes a difference if the node fails after the check.
  2. If a node fails while executing a command, is that command re-executed on a still active node?

Marco


On 14 December 2015 at 00:20, Ole Tange <address@hidden> wrote:
On Thu, Dec 10, 2015 at 11:17 PM, Nazgul <address@hidden> wrote:

> I am using GNU Parallel with --sshlogin on unreliable nodes - that is, some
> of them become unreachable after an unpredictable amount of time.
>
> When this happens, Parallel sends all the queued commands one after the
> other to the machine (because they finish very quickly with a "No route to
> host", leaving the machine "free").
>
> It would be nice to have a feature so that, instead, remaining threads are
> sent to the machines that are still available.

Did you try --retries and --filterhosts?

/Ole



--
Marco

reply via email to

[Prev in Thread] Current Thread [Next in Thread]