[mpich-discuss] MPICH 3.1.4 builds with ib netmod

Huiwei Lu huiweilu at mcs.anl.gov
Mon Apr 27 08:53:32 CDT 2015


Hi Martin,

Thank you for reporting your use of MPICH to us.

If you could send us a simple test case that reproduce the crash in your
workload, then we can use it to fix our code and make it better.

For the question of 3.2b2, the answer is yes! We have been improving MXM
since 3.1.4 (thanks to the contribution of Mellanox). It will be worth
trying the new release to see if fixes the crash in your workload.

Thanks,

--
Huiwei Lu
Postdoc Appointee
Mathematics and Computer Science Division
Argonne National Laboratory
http://www.mcs.anl.gov/~huiweilu/

On Fri, Apr 24, 2015 at 4:38 PM, Martin Cuma <martin.cuma at utah.edu> wrote:

> Hello,
>
> I am getting errors with building -with-device=ch3:nemesis:ib, using GNU,
> Intel or PGI compilers, as:
>
> I am wondering what's going on since this since Intel and GNU worked in
> version 3.1.2. I configure as:
> ../../../srcdir/mpich/3.1.4/configure --prefix=/uufs/
> chpc.utah.edu/sys/installdir/mpich/3.1.4 --enable-romio
> --with-file-system=nfs+ufs --with-mpe -with-device=ch3:nemesis:ib
> --enable-threads=runtime --enable-fast=all
>
> and use RHEL6.6 with stock gcc 4.4.7.
>
> When I reported a similar issue earlier, Pavan suggested to use MXM - I
> tried that and that seems to work, however, perhaps since we run relatively
> old OFED (stock RHEL6-like), which does not come with MXM, I used the one
> from the latest Mellanox HPC-X, and, I am not sure if that's the best idea
> since I see crashes related to communication at certain workloads - which I
> don't see with other MPIs or when using the ib netmod in MPICH.
>
> Would you please also mind commenting on this? Would you expect the just
> released 3.2b2 fare better with MXM than the 3.1.4?
>
> Thanks,
> MC
>
> --
> Martin Cuma
> Center for High Performance Computing
> Department of Geology and Geophysics
> University of Utah
> _______________________________________________
> discuss mailing list     discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mpich.org/pipermail/discuss/attachments/20150427/ed5f7f8d/attachment.html>
-------------- next part --------------
_______________________________________________
discuss mailing list     discuss at mpich.org
To manage subscription options or unsubscribe:
https://lists.mpich.org/mailman/listinfo/discuss


More information about the discuss mailing list