[mpich-discuss] Is there a way to set timeout to mpi process launch?

Pranav Ladkat pranavpr at buffalo.edu
Mon Oct 31 18:18:38 CDT 2016


Specifying MPIEXEC_TIMEOUT would not be possible since the execution times
would vary depending on the different job. Is there any solution available
within mpich?

Thanks,
Pranav


On Thu, Oct 27, 2016 at 10:15 PM, Halim Amer <aamer at anl.gov> wrote:

> You can try setting MPIEXEC_TIMEOUT=<timeout value in seconds> to force
> the job to abort after running for the specified period. This is for the
> whole execution though, not just for the process launching step.
>
> Halim
> www.mcs.anl.gov/~aamer
>
>
> On 10/26/16 6:30 PM, Pranav Ladkat wrote:
>
>> Hi,
>>
>> When I run mpi program on multiple hosts, if executable fails to start
>> on any of the host (due to missing library etc. type of reasons), other
>> hosts just keep waiting for the process to come up. The program just
>> hangs forever. Is there any way to set a timeout in such cases such that
>> MPI should abort if not all processes were launched in a given timeout
>> period?
>>
>> Thanks,
>> Pranav
>>
>>
>> _______________________________________________
>> discuss mailing list     discuss at mpich.org
>> To manage subscription options or unsubscribe:
>> https://lists.mpich.org/mailman/listinfo/discuss
>>
>> _______________________________________________
> discuss mailing list     discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mpich.org/pipermail/discuss/attachments/20161031/f9297a36/attachment.html>
-------------- next part --------------
_______________________________________________
discuss mailing list     discuss at mpich.org
To manage subscription options or unsubscribe:
https://lists.mpich.org/mailman/listinfo/discuss


More information about the discuss mailing list