[mpich-discuss] mpich hanging on startup

Orion Poplawski orion at cora.nwra.com
Wed Jan 20 20:25:12 CST 2016


On 01/20/2016 05:05 PM, Orion Poplawski wrote:
> I'm got a strange situation - I'm trying to build hdf5 1.8.16 for Fedora.  The
> Fedora builders do not have network access (/etc/resolv.conf set to
> "nameserver 127.0.0.1" and no local nameserver) for security purposes.  The
> hdf5 t_mpi parallel test hangs on launch with mpich 3.2, but only on the arm
> builders.
>
> The processes seem to deadlock in the MPI code, each blocking in poll(),
> presumably waiting for communication from the other process.


Apparently the Fedora package builds using --with-device=ch3:nemesis on 
x86 and --with-device=ch3:sock on all other platforms, so the arm builds 
are using a different channel.  Is that still a necessary and/or 
recommended configuration?


-- 
Orion Poplawski
Technical Manager                     303-415-9701 x222
NWRA/CoRA Division                    FAX: 303-415-9702
3380 Mitchell Lane                  orion at cora.nwra.com
Boulder, CO 80301              http://www.cora.nwra.com
_______________________________________________
discuss mailing list     discuss at mpich.org
To manage subscription options or unsubscribe:
https://lists.mpich.org/mailman/listinfo/discuss


More information about the discuss mailing list