[mpich-discuss] Parallel test hanging with mpich on rhel7
Orion Poplawski
orion at cora.nwra.com
Mon Feb 10 22:47:26 CST 2014
On 02/06/2014 09:10 PM, Balaji, Pavan wrote:
>
> Thanks. That’s very useful analysis. Would you be willing to try the
> attached patch to see if it solves this issue?
>
> — Pavan
Well, it seems to prevent a hang (although I'm also updating from 3.0.4
to 3.1rc3 so not sure what is all changing here), but it does not run:
============================
Fatal error in MPI_Init: Other MPI error, error stack:
MPIR_Init_thread(467)..............:
MPID_Init(177).....................: channel initialization failed
MPIDI_CH3_Init(70).................:
MPID_nem_init(319).................:
MPID_nem_tcp_init(171).............:
MPID_nem_tcp_get_business_card(418):
MPID_nem_tcp_init(377).............: gethostbyname failed, i-00001ff8
(errno 1)
Fatal error in MPI_Init: Other MPI error, error stack:
MPIR_Init_thread(467)..............:
MPID_Init(177).....................: channel initialization failed
MPIDI_CH3_Init(70).................:
MPID_nem_init(319).................:
MPID_nem_tcp_init(171).............:
MPID_nem_tcp_get_business_card(418):
MPID_nem_tcp_init(377).............: gethostbyname failed, i-00001ff8
(errno 1)
===================================================================================
= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= PID 19673 RUNNING AT i-00001ff8
= EXIT CODE: 1
= CLEANING UP REMAINING PROCESSES
= YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
===================================================================================
--
Orion Poplawski
Technical Manager 303-415-9701 x222
NWRA/CoRA Division FAX: 303-415-9702
3380 Mitchell Lane orion at cora.nwra.com
Boulder, CO 80301 http://www.cora.nwra.com
More information about the discuss
mailing list