<meta http-equiv="Content-Type" content="text/html; charset=utf-8"><div dir="ltr"><div class="gmail_extra">Specifying MPIEXEC_TIMEOUT is not an option, since execution times vary from job to job. Is there any other solution available within MPICH?</div><div class="gmail_extra"><br></div><div class="gmail_extra">Thanks,</div><div class="gmail_extra">Pranav</div><div class="gmail_extra"><br></div><div class="gmail_extra"><br><div class="gmail_quote">On Thu, Oct 27, 2016 at 10:15 PM, Halim Amer <span dir="ltr">&lt;<a href="mailto:aamer@anl.gov" target="_blank">aamer@anl.gov</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">You can try setting MPIEXEC_TIMEOUT=&lt;timeout value in seconds&gt; to force the job to abort after running for the specified period. Note that this applies to the whole execution, not just the process-launching step.<br>
<br>
Halim<br>
<a href="http://www.mcs.anl.gov/~aamer" rel="noreferrer" target="_blank">www.mcs.anl.gov/~aamer</a><div><div class="gmail-h5"><br>
<br>
On 10/26/16 6:30 PM, Pranav Ladkat wrote:<br>
</div></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div><div class="gmail-h5">
Hi,<br>
<br>
When I run an MPI program on multiple hosts and the executable fails to start<br>
on any one of them (for example, because of a missing library), the other<br>
hosts keep waiting for that process to come up, and the program<br>
hangs forever. Is there a way to set a timeout so that<br>
MPI aborts if not all processes have launched within a given<br>
period?<br>
<br>
Thanks,<br>
Pranav<br>
<br>
<br></div></div>
______________________________<wbr>_________________<br>
discuss mailing list <a href="mailto:discuss@mpich.org" target="_blank">discuss@mpich.org</a><br>
To manage subscription options or unsubscribe:<br>
<a href="https://lists.mpich.org/mailman/listinfo/discuss" rel="noreferrer" target="_blank">https://lists.mpich.org/mailma<wbr>n/listinfo/discuss</a><br>
<br>
</blockquote>
</blockquote></div><br></div></div>