[mpich-discuss] Problem Running MPI on cluster

Md. Amjad Hossain mhossai2 at kent.edu
Tue Oct 21 00:09:07 CDT 2014


Hi, I am trying to run a simple hello world program on cluster nodes. I am
running it with the following command but getting errors:

Command:  /usr/lib64/mpich/bin/mpirun -n 4 -f host_file ./mpi_hello_world

Errors:
[mpiexec@md-lin-01.mcs.kent.edu] control_cb (./pm/pmiserv/pmiserv_cb.c:202): assert (!closed) failed
[mpiexec@md-lin-01.mcs.kent.edu] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status
[mpiexec@md-lin-01.mcs.kent.edu] HYD_pmci_wait_for_completion (./pm/pmiserv/pmiserv_pmci.c:197): error waiting for event
[mpiexec@md-lin-01.mcs.kent.edu] main (./ui/mpich/mpiexec.c:331): process manager error waiting for completion


Before running the command, I set the variables MPIRUN (to the MPI directory)
and MPI_HOSTS=host_file. The "host_file" lists four nodes, and they can ssh to
each other without a password.
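
For reference, a Hydra host file for four nodes generally has one host name
per line, optionally followed by ":N" to set how many processes run on that
host. The sketch below only illustrates that layout: apart from md-lin-01,
which appears in the error output, the host names are placeholders and not
the actual contents of host_file.

```text
# Illustrative host file; node names other than md-lin-01 are hypothetical
md-lin-01.mcs.kent.edu
node02.example.org
node03.example.org
node04.example.org
```

With four entries like these and "-n 4", Hydra would normally place one
process on each host.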

The MPICH version I am running is 3.0.4. The MPI code is attached.

Any help in solving this problem would be appreciated.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: mpi_hello_world.c
Type: text/x-csrc
Size: 1354 bytes
Desc: not available
URL: <http://lists.mpich.org/pipermail/discuss/attachments/20141021/88dbb920/attachment.bin>