[mpich-discuss] Assertion failed in file src/mpid/ch3/src/ch3u_handle_send_req.c at line 61 (RMA && Derived datatypes)&In-Reply-To=<0A407957589BAB4F924824150C4293EF588072EC at CITESMBX2.ad.uillinois.edu>

Victor Vysotskiy victor.vysotskiy at teokem.lu.se
Tue Nov 11 10:31:01 CST 2014


Dear Pavan,

thank you for hints! Ok, I was able to install MXM locally without root privileges. Then I have configured and compiled MPICH3 with MXM. Everything went smoothly and finally I got a working binaries and libs. Unfortunately, root is still needed because a simple MPI HELLO WORLD code now crashes with the following error message:

%mpirun  -np 2 ./a.out

[1415719636.245549] [n1:37817:0]         shm.c:65   MXM  WARN  Could not open the KNEM device file at /dev/knem : No such file or directory. Won't use knem.

[1415719636.245549] [n1:37818:0]         shm.c:65   MXM  WARN  Could not open the KNEM device file at /dev/knem : No such file or directory. Won't use knem.

[1415719636.263914] [n1:37817:0]      ib_dev.c:443  MXM  ERROR ibv_query_device() returned 38: No such file or directory

[1415719636.264134] [n1:37818:0]      ib_dev.c:443  MXM  ERROR ibv_query_device() returned 38: No such file or directory

Fatal error in MPI_Init: Other MPI error, error stack:

MPIR_Init_thread(498).........:

MPID_Init(187)................: channel initialization failed

MPIDI_CH3_Init(89)............:

MPID_nem_init(320)............:

MPID_nem_mxm_init(157)........:

MPID_nem_mxm_vc_terminate(451): mxm_init failed (Input/output error)

Fatal error in MPI_Init: Other MPI error, error stack:

MPIR_Init_thread(498).........:

MPID_Init(187)................: channel initialization failed

MPIDI_CH3_Init(89)............:

MPID_nem_init(320)............:

MPID_nem_mxm_init(157)........:

MPID_nem_mxm_vc_terminate(451): mxm_init failed (Input/output error)

%ldd a.out
linux-vdso.so.1 =>  (0x00007ffff29ff000)
libmpi.so.12 => /nobackup/global/x_vicvy/mpich3-dev-bin.mxm/lib/libmpi.so.12 (0x00007f9fd4fa1000)
libm.so.6 => /lib64/libm.so.6 (0x00007f9fd4cfe000)
libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f9fd4ae8000)
libc.so.6 => /lib64/libc.so.6 (0x00007f9fd4754000)
libdl.so.2 => /lib64/libdl.so.2 (0x00007f9fd454f000)
libmxm.so.2 => /nobackup/global/x_vicvy/hpcx-v1.2.0-255-icc-MLNX_OFED_LINUX-2.3-1.5.0-redhat6.5/mxm/lib/libmxm.so.2 (0x00007f9fd41f5000)
libz.so.1 => /lib64/libz.so.1 (0x00007f9fd3fdf000)
libibverbs.so.1 => /usr/lib64/libibverbs.so.1 (0x00007f9fd3dd1000)
librt.so.1 => /lib64/librt.so.1 (0x00007f9fd3bc9000)
libgpfs.so => /usr/lib64/libgpfs.so (0x00007f9fd39ba000)
libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f9fd379c000)
libifport.so.5 => /software/apps/intel/composer_xe_2015.0.090/compiler/lib/intel64/libifport.so.5 (0x00007f9fd356f000)
libifcore.so.5 => /software/apps/intel/composer_xe_2015.0.090/compiler/lib/intel64/libifcore.so.5 (0x00007f9fd3239000)
libimf.so => /software/apps/intel/composer_xe_2015.0.090/compiler/lib/intel64/libimf.so (0x00007f9fd2d7e000)
libsvml.so => /software/apps/intel/composer_xe_2015.0.090/compiler/lib/intel64/libsvml.so (0x00007f9fd212f000)
libintlc.so.5 => /software/apps/intel/composer_xe_2015.0.090/compiler/lib/intel64/libintlc.so.5 (0x00007f9fd1ed5000)
/lib64/ld-linux-x86-64.so.2 (0x00007f9fd5499000)
libirng.so => /software/apps/intel/composer_xe_2015.0.090/compiler/lib/intel64/libirng.so (0x00007f9fd1ccd000)

Since ’nemesis:ch3:ib’  is fixed, I will check it out.

With best regards,
Victor.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mpich.org/pipermail/discuss/attachments/20141111/54e7a73f/attachment.html>


More information about the discuss mailing list