<html><head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix"><br>
Hi Ramiro,<br>
<br>
Our folks that contributed that netmod are looking into this
issue. In the meantime, I'd suggest trying the MXM netmod. MXM is
a netmod for IB networks using the MXM API instead of Verbs to
interact with the HCA.<br>
<br>
Best,<br>
Antonio<br>
<br>
<br>
On 01/15/2015 05:52 AM, Ramiro Alba wrote:<br>
</div>
<blockquote cite="mid:012b7397d78231a43ac985b58de97f5e@cttc.upc.edu" type="cite">Hi all,
<br>
<br>
I've compiled mpich-3.1.3 on centos 6.5 with the following
options:
<br>
<br>
--enable-fortran=yes \
<br>
--with-device=ch3:nemesis:ib \
<br>
--with-pm=hydra:gforker \
<br>
<br>
and the package 'libibverbs-devel' installed.
<br>
<br>
When I try to run a test hello program using two IB DDR nodes,
using the
<br>
command:
<br>
<br>
mpiexec.hydra -np 16 -bind-to core -launcher rsh -iface ib0 -hosts
jff201,jff202 mpi_hello
<br>
<br>
I've got the errors bellow, even running using with root user.
<br>
<br>
If I compile with:
<br>
<br>
--with-device=ch3:nemesis
<br>
<br>
it works with no errors.
<br>
<br>
I am also using both openmpi and mvapich2 on Infiniband and they
work fine
<br>
<br>
Am I doing something wrong when compiling and/or running?
<br>
Any sugestion is welcomed?
<br>
<br>
Thanks in advance
<br>
Regards
<br>
<br>
##########################################################################
<br>
MPICH OVER IB: RUN ERRORS
<br>
##########################################################################
<br>
Fatal error in MPI_Init: Other MPI error, error stack:
<br>
MPIR_Init_thread(498):
<br>
MPID_Init(177).......: channel initialization failed
<br>
MPIDI_CH3_Init(89)...:
<br>
MPID_nem_init(320)...:
<br>
MPID_nem_ib_init(264): MPID_nem_ib_com_open failed
<br>
Fatal error in MPI_Init: Other MPI error, error stack:
<br>
MPIR_Init_thread(498):
<br>
MPID_Init(177).......: channel initialization failed
<br>
MPIDI_CH3_Init(89)...:
<br>
MPID_nem_init(320)...:
<br>
MPID_nem_ib_init(264): MPID_nem_ib_com_open failed
<br>
[root@jff201 mpich]# mpirun -np 2 -iface eth0 mpi_hello-mpich
<br>
IB device not foundFatal error in MPI_Init: Other MPI error, error
stack:
<br>
MPIR_Init_thread(498):
<br>
MPID_Init(177).......: channel initialization failed
<br>
MPIDI_CH3_Init(89)...:
<br>
MPID_nem_init(320)...:
<br>
MPID_nem_ib_init(264): MPID_nem_ib_com_open failed
<br>
IB device not foundFatal error in MPI_Init: Other MPI error, error
stack:
<br>
MPIR_Init_thread(498):
<br>
MPID_Init(177).......: channel initialization failed
<br>
MPIDI_CH3_Init(89)...:
<br>
MPID_nem_init(320)...:
<br>
MPID_nem_ib_init(264): MPID_nem_ib_com_open failed
<br>
##########################################################################
<br>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">_______________________________________________
discuss mailing list <a class="moz-txt-link-abbreviated" href="mailto:discuss@mpich.org">discuss@mpich.org</a>
To manage subscription options or unsubscribe:
<a class="moz-txt-link-freetext" href="https://lists.mpich.org/mailman/listinfo/discuss">https://lists.mpich.org/mailman/listinfo/discuss</a></pre>
</blockquote>
<br>
<br>
<pre class="moz-signature" cols="72">--
Antonio J. Peña
Postdoctoral Appointee
Mathematics and Computer Science Division
Argonne National Laboratory
9700 South Cass Avenue, Bldg. 240, Of. 3148
Argonne, IL 60439-4847
<a class="moz-txt-link-abbreviated" href="mailto:apenya@mcs.anl.gov">apenya@mcs.anl.gov</a>
<a class="moz-txt-link-abbreviated" href="http://www.mcs.anl.gov/~apenya">www.mcs.anl.gov/~apenya</a></pre>
</body>
</html>