[mpich-discuss] Help resolving hwloc_pci_compare_busids: Assertion `0' failed error

David Alexander alexanda at txcorp.com
Fri Sep 4 18:08:02 CDT 2020


Thanks!  Setting HWLOC_COMPONENTS=-pci,-linuxio worked!

BTW, I am not absolutely sure it was in the MPICH code. I do know
that 

$ mpiexec -np 2 hostname

worked and did not give the error, but of course that doesn’t do an
of the MPI operations in the target executable “hostname".

Thanks again!
dave


> On Sep 4, 2020, at 10:11 AM, Brice Goglin via discuss <discuss at mpich.org> wrote:
> 
> Hello
> 
> I think we've seen this issue once in the past, but I couldn't find it
> in the archives yet.
> 
> Anyway, it's not MPICH's fault. Can you open an issue on
> github/open-mpi/hwloc, with the output of lspci -vt ? If you can build
> hwloc, run lstopo to check whether the issue occurs, and get a gdb
> backtrace, it'd be great. I am going to add some debug printf before
> this assert to avoid having to run gdb next time.
> 
> In the meantime, try setting HWLOC_COMPONENTS=-pci,-linuxio in the
> environment so that hwloc's PCI backends are disabled. Hopefully, MPICH
> won't fail because of this.
> 
> Brice
> 
> 
> 
> Le 04/09/2020 à 17:47, David Alexander via discuss a écrit :
>> When I execute a distributed copy of MPICH 3.3.2 with a program
>> built against that same copy of MPICH I see the following error on
>> the target machine:
>> 
>> $ mpiexec -np 2 myprogram
>> 
>> myprogram: /builds/mpich-3.3.2/src/hwloc/hwloc/pci-common.c:259: hwloc_pci_compare_busids: Assertion `0' failed.
>> myprogram: /builds/mpich-3.3.2/src/hwloc/hwloc/pci-common.c:259: hwloc_pci_compare_busids: Assertion `0' failed.
>> 
>> The target machine where mpiexec is running on is an SGI with SLES12 and the
>> machine that I built on is Centos7.
>> 
>> The library dependency of mpiexec.hydra and myprogram are below:
>> 
>> $ lddtree /installation/bin/mpiexec.hydra 
>> mpiexec.hydra => /installation/bin/mpiexec.hydra (interpreter => /lib64/ld-linux-x86-64.so.2)
>>   libm.so.6 => /lib64/libm.so.6
>>       ld-linux-x86-64.so.2 => /lib64/ld-linux-x86-64.so.2
>>   libudev.so.1 => /lib64/libudev.so.1
>>       librt.so.1 => /lib64/librt.so.1
>>       libcap.so.2 => /lib64/libcap.so.2
>>           libattr.so.1 => /lib64/libattr.so.1
>>       libdw.so.1 => /lib64/libdw.so.1
>>           libelf.so.1 => /lib64/libelf.so.1
>>           libz.so.1 => /lib64/libz.so.1
>>           liblzma.so.5 => /lib64/liblzma.so.5
>>           libbz2.so.1 => /lib64/libbz2.so.1
>>       libdl.so.2 => /lib64/libdl.so.2
>>       libgcc_s.so.1 => /lib64/libgcc_s.so.1
>>   libpciaccess.so.0 => /lib64/libpciaccess.so.0
>>   libxml2.so.2 => /lib64/libxml2.so.2
>>   libpthread.so.0 => /lib64/libpthread.so.0
>>   libc.so.6 => /lib64/libc.so.6
>> 
>> $ lddtree /installation/bin/myprogram
>> myprogram => /installation/bin/myprogram (interpreter => /lib64/ld-linux-x86-64.so.2)
>>   libHYPRE.so => /installation/bin/../../lib/libHYPRE.so
>>   libz.so.1 => /installation/bin/../../lib/libz.so.1
>>   libgfortran.so.5 => /installation/bin/../../lib/libgfortran.so.5
>>   libquadmath.so.0 => /installation/bin/../../lib/libquadmath.so.0
>>   libpthread.so.0 => /lib64/libpthread.so.0
>>   libutil.so.1 => /lib64/libutil.so.1
>>   libdl.so.2 => /lib64/libdl.so.2
>>   librt.so.1 => /lib64/librt.so.1
>>   libcurand.so.10 => /installation/bin/../../lib/libcurand.so.10
>>   libmpicxx.so.12 => /installation/bin/../../lib/libmpicxx.so.12
>>       libudev.so.1 => /lib64/libudev.so.1
>>           libcap.so.2 => /lib64/libcap.so.2
>>               libattr.so.1 => /lib64/libattr.so.1
>>           libdw.so.1 => /lib64/libdw.so.1
>>               libelf.so.1 => /lib64/libelf.so.1
>>               liblzma.so.5 => /lib64/liblzma.so.5
>>               libbz2.so.1 => /lib64/libbz2.so.1
>>       libpciaccess.so.0 => /lib64/libpciaccess.so.0
>>       libxml2.so.2 => /lib64/libxml2.so.2
>>   libmpi.so.12 => /installation/bin/../../lib/libmpi.so.12
>>   libpython2.7.so.1.0 => /installation/bin/../../lib/libpython2.7.so.1.0
>>   libstdc++.so.6 => /installation/bin/../../lib/libstdc++.so.6
>>   libm.so.6 => /lib64/libm.so.6
>>   libgomp.so.1 => /installation/bin/../../lib/libgomp.so.1
>>   libgcc_s.so.1 => /installation/bin/../../lib/libgcc_s.so.1
>>   libc.so.6 => /lib64/libc.so.6
>>   ld-linux-x86-64.so.2 => /lib64/ld-linux-x86-64.so.2
>> 
>> Thanks!
>> _______________________________________________
>> discuss mailing list     discuss at mpich.org
>> To manage subscription options or unsubscribe:
>> https://lists.mpich.org/mailman/listinfo/discuss
> _______________________________________________
> discuss mailing list     discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss



More information about the discuss mailing list