[mpich-discuss] Help resolving hwloc_pci_compare_busids: Assertion `0' failed error
David Alexander
alexanda at txcorp.com
Fri Sep 4 18:08:02 CDT 2020
Thanks! Setting HWLOC_COMPONENTS=-pci,-linuxio worked!
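For anyone hitting the same assertion, here is a minimal sketch of applying the workaround to a single command instead of the whole session. The wrapper name run_without_hwloc_pci is hypothetical; only the HWLOC_COMPONENTS value comes from this thread. It assumes mpiexec forwards the caller's environment to the launched processes, which I believe is the Hydra default.

```shell
# Hypothetical wrapper: run one command with hwloc's PCI and
# Linux I/O discovery components disabled (value taken from this thread).
run_without_hwloc_pci() {
    env HWLOC_COMPONENTS=-pci,-linuxio "$@"
}

# Intended use (not executed here):
#   run_without_hwloc_pci mpiexec -np 2 myprogram
```

Exporting the variable in the shell before calling mpiexec should have the same effect; the wrapper only keeps the setting from leaking into unrelated commands.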
BTW, I am not absolutely sure it was in the MPICH code. I do know
that
$ mpiexec -np 2 hostname
worked and did not give the error, but of course the target
executable "hostname" doesn't perform any MPI operations.
Thanks again!
dave
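In case it helps with the hwloc issue report requested below, here is a rough sketch of collecting the diagnostics in one go. The function name collect_hwloc_report and the output file names are hypothetical. It assumes lspci, lstopo and gdb are on PATH and skips whichever is missing.

```shell
# Hypothetical helper: gather the requested diagnostics into one directory.
collect_hwloc_report() {
    outdir=${1:-hwloc-report}
    mkdir -p "$outdir" || return 1

    # PCI tree as reported by lspci.
    if command -v lspci >/dev/null 2>&1; then
        lspci -vt > "$outdir/lspci-vt.txt" 2>&1
    fi

    # Run lstopo under gdb in batch mode; if the assertion fires,
    # the "bt" command records a backtrace in the same file.
    if command -v lstopo >/dev/null 2>&1 && command -v gdb >/dev/null 2>&1; then
        gdb -batch -ex run -ex bt --args lstopo --of console \
            > "$outdir/lstopo-gdb.txt" 2>&1
    fi
    return 0
}

# Intended use (not executed here):
#   collect_hwloc_report ./hwloc-report
```

The lstopo run should be done without HWLOC_COMPONENTS set, since the workaround disables the code path that fails.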
> On Sep 4, 2020, at 10:11 AM, Brice Goglin via discuss <discuss at mpich.org> wrote:
>
> Hello
>
> I think we've seen this issue once in the past, but I couldn't find it
> in the archives yet.
>
> Anyway, it's not MPICH's fault. Can you open an issue on
> github/open-mpi/hwloc, with the output of lspci -vt ? If you can build
> hwloc, run lstopo to check whether the issue occurs, and get a gdb
> backtrace, it'd be great. I am going to add some debug printf before
> this assert to avoid having to run gdb next time.
>
> In the meantime, try setting HWLOC_COMPONENTS=-pci,-linuxio in the
> environment so that hwloc's PCI backends are disabled. Hopefully, MPICH
> won't fail because of this.
>
> Brice
>
>
>
> On 04/09/2020 at 17:47, David Alexander via discuss wrote:
>> When I execute a distributed copy of MPICH 3.3.2 with a program
>> built against that same copy of MPICH, I see the following error on
>> the target machine:
>>
>> $ mpiexec -np 2 myprogram
>>
>> myprogram: /builds/mpich-3.3.2/src/hwloc/hwloc/pci-common.c:259: hwloc_pci_compare_busids: Assertion `0' failed.
>> myprogram: /builds/mpich-3.3.2/src/hwloc/hwloc/pci-common.c:259: hwloc_pci_compare_busids: Assertion `0' failed.
>>
>> The target machine that mpiexec runs on is an SGI with SLES12, and the
>> machine that I built on is CentOS 7.
>>
>> The library dependencies of mpiexec.hydra and myprogram are below:
>>
>> $ lddtree /installation/bin/mpiexec.hydra
>> mpiexec.hydra => /installation/bin/mpiexec.hydra (interpreter => /lib64/ld-linux-x86-64.so.2)
>> libm.so.6 => /lib64/libm.so.6
>> ld-linux-x86-64.so.2 => /lib64/ld-linux-x86-64.so.2
>> libudev.so.1 => /lib64/libudev.so.1
>> librt.so.1 => /lib64/librt.so.1
>> libcap.so.2 => /lib64/libcap.so.2
>> libattr.so.1 => /lib64/libattr.so.1
>> libdw.so.1 => /lib64/libdw.so.1
>> libelf.so.1 => /lib64/libelf.so.1
>> libz.so.1 => /lib64/libz.so.1
>> liblzma.so.5 => /lib64/liblzma.so.5
>> libbz2.so.1 => /lib64/libbz2.so.1
>> libdl.so.2 => /lib64/libdl.so.2
>> libgcc_s.so.1 => /lib64/libgcc_s.so.1
>> libpciaccess.so.0 => /lib64/libpciaccess.so.0
>> libxml2.so.2 => /lib64/libxml2.so.2
>> libpthread.so.0 => /lib64/libpthread.so.0
>> libc.so.6 => /lib64/libc.so.6
>>
>> $ lddtree /installation/bin/myprogram
>> myprogram => /installation/bin/myprogram (interpreter => /lib64/ld-linux-x86-64.so.2)
>> libHYPRE.so => /installation/bin/../../lib/libHYPRE.so
>> libz.so.1 => /installation/bin/../../lib/libz.so.1
>> libgfortran.so.5 => /installation/bin/../../lib/libgfortran.so.5
>> libquadmath.so.0 => /installation/bin/../../lib/libquadmath.so.0
>> libpthread.so.0 => /lib64/libpthread.so.0
>> libutil.so.1 => /lib64/libutil.so.1
>> libdl.so.2 => /lib64/libdl.so.2
>> librt.so.1 => /lib64/librt.so.1
>> libcurand.so.10 => /installation/bin/../../lib/libcurand.so.10
>> libmpicxx.so.12 => /installation/bin/../../lib/libmpicxx.so.12
>> libudev.so.1 => /lib64/libudev.so.1
>> libcap.so.2 => /lib64/libcap.so.2
>> libattr.so.1 => /lib64/libattr.so.1
>> libdw.so.1 => /lib64/libdw.so.1
>> libelf.so.1 => /lib64/libelf.so.1
>> liblzma.so.5 => /lib64/liblzma.so.5
>> libbz2.so.1 => /lib64/libbz2.so.1
>> libpciaccess.so.0 => /lib64/libpciaccess.so.0
>> libxml2.so.2 => /lib64/libxml2.so.2
>> libmpi.so.12 => /installation/bin/../../lib/libmpi.so.12
>> libpython2.7.so.1.0 => /installation/bin/../../lib/libpython2.7.so.1.0
>> libstdc++.so.6 => /installation/bin/../../lib/libstdc++.so.6
>> libm.so.6 => /lib64/libm.so.6
>> libgomp.so.1 => /installation/bin/../../lib/libgomp.so.1
>> libgcc_s.so.1 => /installation/bin/../../lib/libgcc_s.so.1
>> libc.so.6 => /lib64/libc.so.6
>> ld-linux-x86-64.so.2 => /lib64/ld-linux-x86-64.so.2
>>
>> Thanks!
>> _______________________________________________
>> discuss mailing list discuss at mpich.org
>> To manage subscription options or unsubscribe:
>> https://lists.mpich.org/mailman/listinfo/discuss