[mpich-discuss] Help resolving hwloc_pci_compare_busids: Assertion `0' failed error

Brice Goglin Brice.Goglin at inria.fr
Fri Sep 4 11:11:05 CDT 2020


Hello

I think we've seen this issue once in the past, but I couldn't find it
in the archives yet.

Anyway, it's not MPICH's fault. Can you open an issue on
github/open-mpi/hwloc, with the output of lspci -vt ? If you can build
hwloc, run lstopo to check whether the issue occurs, and get a gdb
backtrace, it'd be great. I am going to add some debug printf before
this assert to avoid having to run gdb next time.

In the meantime, try setting HWLOC_COMPONENTS=-pci,-linuxio in the
environment so that hwloc's PCI backends are disabled. Hopefully, MPICH
won't fail because of this.

Brice



Le 04/09/2020 à 17:47, David Alexander via discuss a écrit :
> When I execute a distributed copy of MPICH 3.3.2 with a program
> built against that same copy of MPICH I see the following error on
> the target machine:
>
> $ mpiexec -np 2 myprogram
>
> myprogram: /builds/mpich-3.3.2/src/hwloc/hwloc/pci-common.c:259: hwloc_pci_compare_busids: Assertion `0' failed.
> myprogram: /builds/mpich-3.3.2/src/hwloc/hwloc/pci-common.c:259: hwloc_pci_compare_busids: Assertion `0' failed.
>
> The target machine where mpiexec is running on is an SGI with SLES12 and the
> machine that I built on is Centos7.
>
> The library dependency of mpiexec.hydra and myprogram are below:
>
> $ lddtree /installation/bin/mpiexec.hydra 
> mpiexec.hydra => /installation/bin/mpiexec.hydra (interpreter => /lib64/ld-linux-x86-64.so.2)
>    libm.so.6 => /lib64/libm.so.6
>        ld-linux-x86-64.so.2 => /lib64/ld-linux-x86-64.so.2
>    libudev.so.1 => /lib64/libudev.so.1
>        librt.so.1 => /lib64/librt.so.1
>        libcap.so.2 => /lib64/libcap.so.2
>            libattr.so.1 => /lib64/libattr.so.1
>        libdw.so.1 => /lib64/libdw.so.1
>            libelf.so.1 => /lib64/libelf.so.1
>            libz.so.1 => /lib64/libz.so.1
>            liblzma.so.5 => /lib64/liblzma.so.5
>            libbz2.so.1 => /lib64/libbz2.so.1
>        libdl.so.2 => /lib64/libdl.so.2
>        libgcc_s.so.1 => /lib64/libgcc_s.so.1
>    libpciaccess.so.0 => /lib64/libpciaccess.so.0
>    libxml2.so.2 => /lib64/libxml2.so.2
>    libpthread.so.0 => /lib64/libpthread.so.0
>    libc.so.6 => /lib64/libc.so.6
>
> $ lddtree /installation/bin/myprogram
> myprogram => /installation/bin/myprogram (interpreter => /lib64/ld-linux-x86-64.so.2)
>    libHYPRE.so => /installation/bin/../../lib/libHYPRE.so
>    libz.so.1 => /installation/bin/../../lib/libz.so.1
>    libgfortran.so.5 => /installation/bin/../../lib/libgfortran.so.5
>    libquadmath.so.0 => /installation/bin/../../lib/libquadmath.so.0
>    libpthread.so.0 => /lib64/libpthread.so.0
>    libutil.so.1 => /lib64/libutil.so.1
>    libdl.so.2 => /lib64/libdl.so.2
>    librt.so.1 => /lib64/librt.so.1
>    libcurand.so.10 => /installation/bin/../../lib/libcurand.so.10
>    libmpicxx.so.12 => /installation/bin/../../lib/libmpicxx.so.12
>        libudev.so.1 => /lib64/libudev.so.1
>            libcap.so.2 => /lib64/libcap.so.2
>                libattr.so.1 => /lib64/libattr.so.1
>            libdw.so.1 => /lib64/libdw.so.1
>                libelf.so.1 => /lib64/libelf.so.1
>                liblzma.so.5 => /lib64/liblzma.so.5
>                libbz2.so.1 => /lib64/libbz2.so.1
>        libpciaccess.so.0 => /lib64/libpciaccess.so.0
>        libxml2.so.2 => /lib64/libxml2.so.2
>    libmpi.so.12 => /installation/bin/../../lib/libmpi.so.12
>    libpython2.7.so.1.0 => /installation/bin/../../lib/libpython2.7.so.1.0
>    libstdc++.so.6 => /installation/bin/../../lib/libstdc++.so.6
>    libm.so.6 => /lib64/libm.so.6
>    libgomp.so.1 => /installation/bin/../../lib/libgomp.so.1
>    libgcc_s.so.1 => /installation/bin/../../lib/libgcc_s.so.1
>    libc.so.6 => /lib64/libc.so.6
>    ld-linux-x86-64.so.2 => /lib64/ld-linux-x86-64.so.2
>
> Thanks!
> _______________________________________________
> discuss mailing list     discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss


More information about the discuss mailing list