[mpich-discuss] Fwd: MPICH fault tolerance and resiliency

Halim Amer aamer at anl.gov
Fri May 26 10:16:54 CDT 2017


Sanjeev,

 > More precisely my requirement is suppose I started 4 instances of my
 > application. Now I want to add one more instance dynamically to this set

 From my understanding, dynamic processes would work fine for this case. 
Could you elaborate on why the dynamic process model is not sufficient 
for your needs?

Halim
www.mcs.anl.gov/~aamer

On 5/26/17 9:11 AM, sanjeev s wrote:
> Hi mpich,
>
> I have a requirement where in we need to add start stop application
> instances on the fly before starting a job.Is there any mpich service
> available. I looked through dynamic process model, but its not sufficing
> our need.
>
> More precisely my requirement is suppose I started 4 instances of my
> application. Now I want to add one more instance dynamically to this set
>
> Is there any tool which MPICH supports for fault tolerance behavior?
>
> Thanks
> Sanjeev
>
>
>
> _______________________________________________
> discuss mailing list     discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss
>
_______________________________________________
discuss mailing list     discuss at mpich.org
To manage subscription options or unsubscribe:
https://lists.mpich.org/mailman/listinfo/discuss


More information about the discuss mailing list