[mpich-discuss] MPI_Comm_Spawn + UCX error

Iker Martín Álvarez martini at uji.es
Wed Apr 21 18:20:35 CDT 2021


Hello,

I have been working with the MPI_Comm_spawn function, which works fine
with a plain build of MPICH 3.4.1 whose *configure* step was given only
the "--prefix" argument.

However, when this function was called with another build of MPICH 3.4.1
that uses InfiniBand, an error arose. Am I missing some arguments in the
build step of MPICH when using UCX?

Here is the output of *mpichversion*:
$ mpichversion
MPICH Version:     3.4.1
MPICH Release date: Fri Jan 22 14:17:48 CST 2021
MPICH Device:     ch4:ucx
MPICH configure:  --prefix=/soft/gnu/mpich-3.4.1-ucx --with-device=ch4:ucx --with-ucx=/soft/gnu/ucx-1.11
MPICH CC:  gcc    -O2
MPICH CXX:  g++   -O2
MPICH F77:  gfortran   -O2
MPICH FC:  gfortran   -O2
MPICH Custom Information:

The following is the information about the minimal code that triggers the
error:

Source: https://www.rookiehpc.com/mpi/docs/mpi_comm_spawn.php
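
For readers who cannot open the link, a program of this kind looks roughly
like the sketch below. It is a reconstruction of the usual
MPI_Comm_get_parent / MPI_Comm_spawn pattern, not a verbatim copy of the
linked example; the number of spawned processes (1) and the message printed
by the child are assumptions.

```c
#include <stdio.h>
#include <stdlib.h>
#include <mpi.h>

int main(int argc, char *argv[])
{
    MPI_Init(&argc, &argv);

    /* A process started by mpirun has no parent communicator;
       a process created through MPI_Comm_spawn does. */
    MPI_Comm parent;
    MPI_Comm_get_parent(&parent);

    if (parent == MPI_COMM_NULL) {
        MPI_Comm child;
        int spawn_errcodes[1]; /* one entry per spawned process */

        printf("We are processes spawned directly by you, we now spawn "
               "a new instance of an MPI application.\n");

        /* Collective over MPI_COMM_WORLD: rank 0 is the root and
           launches one new copy of this same executable. */
        MPI_Comm_spawn(argv[0], MPI_ARGV_NULL, 1, MPI_INFO_NULL,
                       0, MPI_COMM_WORLD, &child, spawn_errcodes);
    } else {
        /* Assumed child-side message, for illustration only. */
        printf("I have been spawned by MPI processes.\n");
    }

    MPI_Finalize();
    return EXIT_SUCCESS;
}
```

With "mpirun -np 2", both parent ranks print the first message. The
backtrace below passes through PMPI_Init, so the assertion fails while a
process is still inside MPI_Init.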

Compiling: mpicc mpi_spawn.c

Running: mpirun -np 2 ./a.out

We are processes spawned directly by you, we now spawn a new instance of an
MPI application.

We are processes spawned directly by you, we now spawn a new instance of an
MPI application.

Assertion failed in file src/mpid/ch4/src/ch4_init.c at line 651:
MPIR_Process.comm_parent != NULL

/soft/gnu/mpich-3.4.1-ucx/lib/libmpi.so.12(MPL_backtrace_show+0x39)
[0x7fe15d506d41]

/soft/gnu/mpich-3.4.1-ucx/lib/libmpi.so.12(+0x32eaa8) [0x7fe15d4a6aa8]

/soft/gnu/mpich-3.4.1-ucx/lib/libmpi.so.12(+0x3602f8) [0x7fe15d4d82f8]

/soft/gnu/mpich-3.4.1-ucx/lib/libmpi.so.12(+0x225803) [0x7fe15d39d803]

/soft/gnu/mpich-3.4.1-ucx/lib/libmpi.so.12(PMPI_Init+0xa8) [0x7fe15d39d598]

./a.out(+0x123e) [0x55ece110a23e]

/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xf3) [0x7fe15cfa10b3]

./a.out(+0x114e) [0x55ece110a14e]

Abort(1) on node 0: Internal error