[mpich-discuss] Checkpoint/Restart problem

Balaji, Pavan balaji at anl.gov
Sat Apr 2 22:20:29 CDT 2016


Hello,

BLCR checkpointing is no longer supported in MPICH.  The BLCR kernel module hasn't been updated for recent Linux kernels, and most folks have moved to alternative checkpointing infrastructure such as FTI or SCR.  You might want to consider doing that as well.
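
For illustration only, here is a minimal sketch of the application-level approach that libraries like SCR and FTI build on: the program saves its own state to a file and reloads it on restart.  It does not use the SCR or FTI APIs, and every name in it (app_state, save_checkpoint, load_checkpoint) is hypothetical.  It assumes a POSIX system, where rename() replaces an existing file atomically.

```c
#include <stdio.h>

/* Hypothetical application state: whatever is needed to resume the run. */
typedef struct {
    int    iteration;   /* next iteration to execute */
    double values[4];   /* stand-in for the real simulation data */
} app_state;

/* Write the state to a temporary file, then rename() it over `path`.
 * Assumption: POSIX rename(), which replaces the target atomically, so
 * `path` never holds a half-written checkpoint. The data is not fsync'ed,
 * so this does not guarantee durability across a power loss.
 * Returns 0 on success, -1 on failure. */
static int save_checkpoint(const char *path, const app_state *s)
{
    char tmp[512];
    int n = snprintf(tmp, sizeof tmp, "%s.tmp", path);
    if (n < 0 || (size_t)n >= sizeof tmp)
        return -1;

    FILE *f = fopen(tmp, "wb");
    if (!f)
        return -1;

    size_t written = fwrite(s, sizeof *s, 1, f);
    int close_failed = fclose(f);
    if (written != 1 || close_failed != 0) {
        remove(tmp);
        return -1;
    }
    if (rename(tmp, path) != 0) {
        remove(tmp);
        return -1;
    }
    return 0;
}

/* Read a checkpoint back. `*s` is modified only if a complete record was
 * read. Returns 0 on success, -1 if the file is missing or too short. */
static int load_checkpoint(const char *path, app_state *s)
{
    FILE *f = fopen(path, "rb");
    if (!f)
        return -1;

    app_state loaded;
    size_t got = fread(&loaded, sizeof loaded, 1, f);
    fclose(f);
    if (got != 1)
        return -1;

    *s = loaded;
    return 0;
}
```

In an MPI program, each rank would typically use its own file name (for example, one that includes the rank number), call load_checkpoint() once at startup, and call save_checkpoint() every few iterations.  Libraries such as SCR and FTI add the harder parts on top of this, such as coordinating ranks and managing where checkpoint files are stored.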

  -- Pavan

> On Apr 1, 2016, at 9:51 PM, Husen R <hus3nr at gmail.com> wrote:
> 
> Dear all,
> 
> Could anyone please tell me how to manually checkpoint mpiexec?
> I have followed the instructions at this link: https://wiki.mpich.org/mpich/index.php/Checkpointing
> I used the following command to send SIGUSR1, but nothing happened.
> 
> kill -SIGUSR1 [pid of mpiexec]
> 
> thank you in advance
> 
> regards,
> 
> 
> Husen
> _______________________________________________
> discuss mailing list     discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss
