[mpich-discuss] communication device selection
Balaji, Pavan
balaji at anl.gov
Mon May 26 14:06:22 CDT 2014
Hi Yida,
Using SCIF + TCP (+ shared memory, which is implicit) together within the same job is not supported right now. When you configure with scif,tcp, you can pick either one at runtime, but not both.
We are working on some improvements to this for the next major release (3.2). Please stay tuned.
Regards,
— Pavan
On May 26, 2014, at 10:50 AM, Yida Wang <yidawa at gmail.com> wrote:
> Hi Guys,
>
> During configuration, I put in an optional argument as "--with-device=ch3:nemesis:scif,tcp" to hopefully enable scif communication through PCIe, and use tcp for remote communication.
>
> However, it looks like that tcp channel is disabled by this setting. That is, when I wanted to communicate between remote Xeon Phis, it returned error saying that "channel initialization failed" and "scif_connect failed", where I assume that tcp channel should be used instead of scif.
>
> Moreover, when communicating between two remote hosts, it would get stuck probably because it still tried to use scif channel.
>
> Did I configure the communication device incorrectly?
>
> Thanks,
> YW
> _______________________________________________
> discuss mailing list discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss
More information about the discuss
mailing list