[mpich-discuss] Is there a way to set timeout to mpi process launch?

Halim Amer aamer at anl.gov
Fri Oct 28 00:15:00 CDT 2016


You can try setting MPIEXEC_TIMEOUT=<timeout value in seconds> to force 
the job to abort after running for the specified period. This is for the 
whole execution though, not just for the process launching step.

Halim
www.mcs.anl.gov/~aamer

On 10/26/16 6:30 PM, Pranav Ladkat wrote:
> Hi,
>
> When I run mpi program on multiple hosts, if executable fails to start
> on any of the host (due to missing library etc. type of reasons), other
> hosts just keep waiting for the process to come up. The program just
> hangs forever. Is there any way to set a timeout in such cases such that
> MPI should abort if not all processes were launched in a given timeout
> period?
>
> Thanks,
> Pranav
>
>
> _______________________________________________
> discuss mailing list     discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss
>
_______________________________________________
discuss mailing list     discuss at mpich.org
To manage subscription options or unsubscribe:
https://lists.mpich.org/mailman/listinfo/discuss


More information about the discuss mailing list