[mpich-discuss] Problem configuring mpich-3.4a3 and 3.4a2 --with-device=ch4:ucx

Zhou, Hui zhouh at anl.gov
Wed Sep 23 18:24:34 CDT 2020


Hi Martin,

Try `--with-pmix=/usr`

--
Hui Zhou


From: "Audet, Martin" <Martin.Audet at cnrc-nrc.gc.ca>
Date: Wednesday, September 23, 2020 at 4:38 PM
To: "discuss at mpich.org" <discuss at mpich.org>
Cc: "Zhou, Hui" <zhouh at anl.gov>
Subject: RE: [mpich-discuss] Problem configuring mpich-3.4a3 and 3.4a2 --with-device=ch4:ucx

Thanks Zhou for your answer.

Now when I add the following options:

--with-pmix-include=/usr/include --with-pmix-lib=/usr/lib64

to my configuration options (just after --with-pmix) it works as expected (i.e. I am able to use the pmix mechanism in Slurm).

However if I don’t provide --with-pmix, even if --with-pmix-include=/usr/include --with-pmix-lib=/usr/lib64 are provided, pmix is not used (and I have to use the pmi2 mechanism in Slurm).

Regards,

Martin Audet


From: Zhou, Hui [mailto:zhouh at anl.gov]
Sent: Wednesday, September 23, 2020 9:47
To: discuss at mpich.org
Cc: Audet, Martin <Martin.Audet at cnrc-nrc.gc.ca>
Subject: Re: [mpich-discuss] Problem configuring mpich-3.4a3 and 3.4a2 --with-device=ch4:ucx

>    ./configure --with-device=ch4:ucx --with-hcoll=/opt/mellanox/hcoll --with-pmix --prefix=/home/publique/depot/mpi/mpich-ch4_ucx-3.4a3 --enable-fast=all --enable-romio --with-file-system=ufs+nfs --enable-shared --enable-sharedlibs=gcc

I believe the offending option is `--with-pmix`, since that defaults to `--with-pmix=yes`. You are supposed to pass in the path to pmix installation, e.g. `--with-pmix=/usr/local`.

I admit this is a bit not obvious.

--
Hui Zhou


From: "Audet, Martin via discuss" <discuss at mpich.org<mailto:discuss at mpich.org>>
Reply-To: "discuss at mpich.org<mailto:discuss at mpich.org>" <discuss at mpich.org<mailto:discuss at mpich.org>>
Date: Tuesday, September 22, 2020 at 9:41 PM
To: "discuss at mpich.org<mailto:discuss at mpich.org>" <discuss at mpich.org<mailto:discuss at mpich.org>>
Cc: "Audet, Martin" <Martin.Audet at cnrc-nrc.gc.ca<mailto:Martin.Audet at cnrc-nrc.gc.ca>>
Subject: [mpich-discuss] Problem configuring mpich-3.4a3 and 3.4a2 --with-device=ch4:ucx

Hello MPICH_Developers,

I am having trouble running the configure script on two latest mpich release (3.4a2 and 3.4a3) when I use the flag --with-device=ch4:ucx (no such problems with ch3:sock or ch3:nemesis). It seems that the generated Makefile often contains two strange compilation and link flags: -Iyes/include  and –Lyes/lib.

To be able to compile the library I had to use the following command to remove those strange flags:

   find . -name Makefile -exec sed -i -e 's/-Iyes\/include\>//g' -e 's/-Lyes\/lib\>//g' {} \;

After this step I am able to compile and the resulting library seems ok and appears to performs well (not extensive testing).

For your information I call the configure script like this:

   ./configure --with-device=ch4:ucx --with-hcoll=/opt/mellanox/hcoll --with-pmix --prefix=/home/publique/depot/mpi/mpich-ch4_ucx-3.4a3 --enable-fast=all --enable-romio --with-file-system=ufs+nfs --enable-shared --enable-sharedlibs=gcc

I had experienced this problem on the following three configurations:

   mpich-3.4a2  CentOS 7.6  MOFED 4.7
   mpich-3.4a3  CentOS 7.6  MOFED 4.7
   mpich-3.4a3  CentOS 7.8  MOFED 4.9

The architecture is x86_64.

Could someone take a look at this problem ?

Thanks,

Martin Audet
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mpich.org/pipermail/discuss/attachments/20200923/c950dde6/attachment-0001.html>


More information about the discuss mailing list