[mpich-discuss] MPICH interact with RTDA

Zhou, Hui zhouh at anl.gov
Fri Oct 26 08:42:06 CDT 2018


Having a manual mode of launching proxies sounds very interesting. I don’t think currently hydra supports it though.

The manually launched proxies need establish communication back to mpiexec.hydra — I suspect the mechanism already exist. I guess the next question is how to reliably detect launch failures other than having mpiexec.hydra hanging forever — maybe that is a valid option.

Hui Zhou

On Oct 23, 2018, at 11:44 PM, Shuwei Zhao <shuweizhao1991 at gmail.com<mailto:shuweizhao1991 at gmail.com>> wrote:

Hi,

I'm trying integrate MPICH with network submission tool called RTDA(RunTime Design Automation - www.rtda.com<http://www.rtda.com/>), I use the mpich-3.2.1a but looks like that mpich doesn't detect the resource manager and launcher, mpich cannot distribute jobs as expected.

Does mpich support interact with RTDA resource manager and launcher?

Since the hydra process manager did tight integration with SGE, LSF, SLURM, PBS, etc. I was thinking for platform it doesn't support - if there is way to submit jobs with loose integration?
( loose integration means that we do the job submission self and run hydra_pmi_proxy on each allocated node manually, instead of mpich finished everything under the hood.
)

Thank you very much,
Shuwei
_______________________________________________
discuss mailing list     discuss at mpich.org<mailto:discuss at mpich.org>
To manage subscription options or unsubscribe:
https://lists.mpich.org/mailman/listinfo/discuss

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mpich.org/pipermail/discuss/attachments/20181026/985f289e/attachment.html>


More information about the discuss mailing list