[mpich-discuss] Specifying a timeout for MPI_Comm_Accept
Balaji, Pavan
balaji at anl.gov
Mon Sep 8 08:30:38 CDT 2014
Thanks, Matthieu.
— Pavan
On Sep 8, 2014, at 3:17 AM, Matthieu Dorier <matthieu.dorier at irisa.fr> wrote:
> Hi Pavan,
>
> With all the work on in situ analysis/visualization, non-blocking versions of MPI_Comm_accept/connect could become very useful in the future to easily couple simulations and visualization codes.
>
> Here is an example: the VisIt software supports in situ use, but right now the connection between VisIt and the simulation is made through a normal socket. The simulation periodically calls VisItDetectInput to check (in a non-blocking manner) whether VisIt has connected to the simulation.
>
> I recently had to use VisIt with a master-worker type simulation, that is, the master has an MPI_Recv(MPI_ANY_SOURCE...) to wait for workers to finish pieces of work. When using VisIt, this simple MPI_Recv becomes a non-blocking receive plus an active loop alternating between MPI_Testany and VisItDetectInput. If VisIt could leverage non-blocking accept/connect, this active loop could be replaced with an MPI_Waitany on a set of MPI_Requests, one of them being VisIt connecting (the result of an MPI_Comm_iaccept), the others being workers finishing some work.
>
> I'm not a VisIt developer, though. This is just a thought.
>
> Matthieu Dorier
> PhD student at ENS Rennes
> http://people.irisa.fr/Matthieu.Dorier
> ----- Original Message -----
>> From: "Pavan Balaji" <balaji at anl.gov>
>> To: discuss at mpich.org
>> Sent: Sunday, September 7, 2014 00:53:58
>> Subject: Re: [mpich-discuss] Specifying a timeout for MPI_Comm_Accept
>>
>> Hirak,
>>
>> The function definitions for MPI_Comm_accept, etc., are a part of the MPI
>> standard. We cannot change them in MPICH without changing them in the MPI
>> standard first. Changes to the MPI standard go through the MPI Forum, and
>> through a formal proposal and voting process before they get in.
>>
>> FWIW, both a timeout model and a nonblocking connect/accept have been
>> proposed in the past, but they were both voted down. The nonblocking
>> connect/accept proposal was originally done by Josh Hursey
>> (http://www.cs.uwlax.edu/~jjhursey/). I’m planning to revive the ticket,
>> asking more broadly for nonblocking variants of many other operations
>> as well. It’s unclear if/when this will get in, but we can try.
>>
>> The timeout proposal was put together by Jeff Squyres @ Cisco and Fab Tillier
>> @ Microsoft (they are both on this list). I personally thought it was a
>> very elegant proposal, but it was voted down because there was no use case
>> for it at the time, particularly given that there was no standardized fault
>> model in MPI. Once the Fault Tolerance working group gets its proposal in,
>> there might be room to revisit this. But you’ll need to talk to the above
>> mentioned guys to see if they are planning to revive it.
>>
>> Hope that helps.
>>
>> — Pavan
>>
>> On Sep 6, 2014, at 12:35 PM, Roy, Hirak <Hirak_Roy at mentor.com> wrote:
>>
>>> Hi,
>>>
>>> The thread linked at the end of my email shows that there is no way to
>>> specify a timeout in MPI_Comm_accept/connect.
>>> Since the thread is pretty old (2007), I would like to know whether there
>>> has been any development on this since.
>>>
>>> If we still cannot specify a timeout, is there any provision for
>>> non-blocking accept/connect?
>>>
>>> Thanks,
>>> Hirak
>>>
>>>
>>> https://lists.mcs.anl.gov/mailman/htdig/mpich-discuss/2007-April/002159.html
>>> _______________________________________________
>>> discuss mailing list discuss at mpich.org
>>> To manage subscription options or unsubscribe:
>>> https://lists.mpich.org/mailman/listinfo/discuss
>>
>> --
>> Pavan Balaji ✉️
>> http://www.mcs.anl.gov/~balaji
>>
--
Pavan Balaji ✉️
http://www.mcs.anl.gov/~balaji