[mpich-discuss] Fault tolerance after MPI_Comm_connect/accept
Matthieu Dorier
matthieu.dorier at irisa.fr
Mon Mar 4 07:56:16 CST 2013
Hi,
I'm connecting two MPI applications, A and B, using MPI_Comm_accept in A and MPI_Comm_connect in B. I would like to know more about the behavior when one application stops (say B): will a communication attempt (e.g. MPI_Send) from a process in A to a process in B crash, return an error, or block?
Is there a way for application A to notice that B has stopped in order to avoid communicating with it?
Thanks.
PS: by the way, for anyone involved in the MPI Forum, an MPI_Comm_iaccept in the MPI-3 standard would have been useful. Something to keep in mind for the next version, maybe ;)
Matthieu Dorier
PhD student at ENS Cachan Brittany and IRISA
http://people.irisa.fr/Matthieu.Dorier