[mpich-discuss] communication device selection

Balaji, Pavan balaji at anl.gov
Mon May 26 14:06:22 CDT 2014


Hi Yida,

Using SCIF + TCP (+ shared memory, which is implicit) together within the same job is not supported right now.  When you configure with scif,tcp, you can pick either one at runtime, but not both.

We are working on some improvements to this for the next major release (3.2).  Please stay tuned.

Regards,

  — Pavan

On May 26, 2014, at 10:50 AM, Yida Wang <yidawa at gmail.com> wrote:

> Hi Guys,
> 
> During configuration, I put in an optional argument as "--with-device=ch3:nemesis:scif,tcp" to hopefully enable scif communication through PCIe, and use tcp for remote communication.
> 
> However, it looks like that tcp channel is disabled by this setting. That is, when I wanted to communicate between remote Xeon Phis, it returned error saying that "channel initialization failed" and "scif_connect failed", where I assume that tcp channel should be used instead of scif.
> 
> Moreover, when communicating between two remote hosts, it would get stuck probably because it still tried to use scif channel.
> 
> Did I configure the communication device incorrectly?
> 
> Thanks,
> YW
> _______________________________________________
> discuss mailing list     discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss




More information about the discuss mailing list