[mpich-discuss] Loopback IPs should be filtered out by MPICH tools

Nevolin, Andrey Andrey.Nevolin at emc.com
Fri Aug 14 06:32:06 CDT 2015


Hi,

I installed MPICH on a cluster deployed in a virtual environment. After that I spent a lot of time trying to figure out why it doesn't work.

I'm new to MPICH (and even to MPI). Hence my suggestions about the system functioning may contradict to the real world. But looks like in my case the problem has the following roots:

1) Looks like MPICH nodes use something similar to 'hostname -i' to discover their own IPs
2) If several IP aliases exist for a single host name, MPICH picks the first of them
3) Order of IP aliases resembles their order in '/etc/hosts/' list
4) VMWare hypervisor automatically adds loop back alias for the host name (when the cluster is deployed by cloning of a single node). And this alias is added as a first line in '/etc/hosts'
5) All of the above means that MPICH discovers loop back IP in my case.

The problem disappeared when I removed loop back alias from '/etc/hosts/'.

Please, change IP discovery process to avoid loop back IP. Or add an explicit requirement to documentation to avoid loop back aliases in '/etc/hosts'.

Thanks,
                --Andrey

@Andrey_Nevolin<https://twitter.com/Andrey_Nevolin>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mpich.org/pipermail/discuss/attachments/20150814/dc06b88d/attachment.html>
-------------- next part --------------
_______________________________________________
discuss mailing list     discuss at mpich.org
To manage subscription options or unsubscribe:
https://lists.mpich.org/mailman/listinfo/discuss


More information about the discuss mailing list