[mpich-discuss] Fault tolerance after MPI_Comm_connect/accept

Matthieu Dorier matthieu.dorier at irisa.fr
Tue Mar 5 09:42:25 CST 2013


Alright, thanks for the answer (and for the ticket).
Cheers,

Matthieu

----- Mail original -----
> De: "Jim Dinan" <dinan at mcs.anl.gov>
> À: discuss at mpich.org
> Envoyé: Mardi 5 Mars 2013 16:33:31
> Objet: Re: [mpich-discuss] Fault tolerance after MPI_Comm_connect/accept
> 
> Hi Mathieu,
> 
> I created an MPI Forum ticket for this:
> 
> https://svn.mpi-forum.org/trac/mpi-forum-web/ticket/365
> 
> In terms of what is guaranteed by the standard, the behavior is
> undefined.  In terms of that MPICH will do, I am not sure, although
> my
> guess is that current MPICH will be unable to continue working after
> such a failure.  You may need to do some testing or read the code to
> find out.
> 
> Cheers,
>   ~Jim.
> 
> On 3/4/13 7:56 AM, Matthieu Dorier wrote:
> > Hi,
> >
> > I'm connecting two MPI applications A and B using MPI_Comm_accept
> > in A
> > and MPI_Comm_connect in B. I would like to know a bit more about
> > the
> > behavior in case one application stops (say B): will a
> > communication
> > attempt (e.g. MPI_Send) from a process from A to a process from B
> > crash?
> > return an error? block?
> > Is there a way for application A to notice that B has stopped in
> > order
> > to avoid communicating with it?
> >
> > Thanks.
> >
> > PS: by the way for whom is involved in the MPI forum, an
> > MPI_Comm_iaccept in the MPI3 standard would have been useful.
> > Something
> > to keep in mind for the next version maybe ;)
> >
> > Matthieu Dorier
> > PhD student at ENS Cachan Brittany and IRISA
> > http://people.irisa.fr/Matthieu.Dorier
> >
> >
> > _______________________________________________
> > discuss mailing list     discuss at mpich.org
> > To manage subscription options or unsubscribe:
> > https://lists.mpich.org/mailman/listinfo/discuss
> >
> _______________________________________________
> discuss mailing list     discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss
> 



More information about the discuss mailing list