[mpich-discuss] Fwd: MPICH 3.3.1 tests fail

Benson Muite benson_muite at emailplus.org
Fri Oct 4 08:20:34 CDT 2019


Hi,

You would want some of the drivers for the other communication stacks, 
but TCP/IP should already be available on your system. You may want to 
check the hardware available on your server farm to decide best 
communication software stack to use.

Benson

On 10/4/19 4:13 PM, Mark Cunningham via discuss wrote:
> I was unaware that there was a repository in Ubuntu.  So, if there are 
> binaries available, that solves my immediate problem.
>
> But, I was under the impression that the default build would do 
> ch3:nemesis anyway, so isn't the --with-device really a no-op?
> We also have a bigger server farm, which is why I was fiddling with 
> the ofi bit.
> Thanks,
> Mark
>
> On Fri, Oct 4, 2019 at 8:51 AM Benson Muite via discuss 
> <discuss at mpich.org <mailto:discuss at mpich.org>> wrote:
>
>     Ok. Does MPICH from Ubuntu repositories work?
>
>     Have you tried a basic build:
>
>     ./configure --prefix=/home/share/mpich/mpich-3.3.1
>
>
>     On 10/4/19 3:42 PM, Mark Cunningham via discuss wrote:
>>     This is just running in shared memory for the test.
>>
>>
>>     On Fri, Oct 4, 2019 at 8:41 AM Benson Muite via discuss
>>     <discuss at mpich.org <mailto:discuss at mpich.org>> wrote:
>>
>>         Hi Mark,
>>
>>         What interconnect is on your machine?
>>
>>         Regards,
>>
>>         Benson
>>
>>         On 10/4/19 3:37 PM, Mark Cunningham via discuss wrote:
>>>         Oops, sorry I forgot the attachment.
>>>
>>>         ---------- Forwarded message ---------
>>>         From: *Mark Cunningham* <mark.cunningham at ariacoustics.com
>>>         <mailto:mark.cunningham at ariacoustics.com>>
>>>         Date: Fri, Oct 4, 2019 at 8:35 AM
>>>         Subject: MPICH 3.3.1 tests fail
>>>         To: <discuss at mpich.org <mailto:discuss at mpich.org>>
>>>
>>>
>>>         I have built MPICH 3.3.1 on Ubuntu 18.04 with the gcc-7
>>>         compiler suite.  (Build script is provided: build_mpich.)  I
>>>         have built with both --with-device=ch3:nemesis and
>>>         --with-device=ch3:nemesis:ofi
>>>         with the same outcome in the make test.  A dozen of the
>>>         tests fail with timeouts.
>>>         I ran the test on an 8 processor Thinkstation but what would
>>>         be beneficial is some sort of sense of what the test results
>>>         should be. I've browsed the test results that you've posted
>>>         but these appear to be summaries that conclude All Tests
>>>         Passed, without individual timings for each test.
>>>
>>>         In any case, I would appreciate any guidance that you might
>>>         be able to provide.  I have provided the config files and
>>>         the summary.junit.xml file. (On my machine the summary.xml
>>>         file just reports that there is an error on line 9562.)  The
>>>         result of mpiexec -info is:
>>>         ~/build/mpich-3.3.1/src/pm/hydra % ./mpiexec.hydra -info
>>>         HYDRA build details:
>>>             Version:         3.3.1
>>>             Release Date:          Wed Jun  5 14:57:33 CDT 2019
>>>             CC:  gcc
>>>             CXX: g++
>>>             F77: gfortran
>>>             F90: gfortran
>>>             Configure options:         '--disable-option-checking'
>>>         '--prefix=/home/share/mpich/mpich-3.3.1'
>>>         '--with-devce=ch3:nemesis' '--cache-file=/dev/null'
>>>         '--srcdir=.' 'CC=gcc' 'CFLAGS= -O2' 'LDFLAGS=' 'LIBS='
>>>         'CPPFLAGS= -I/home/mark/buildmpich-3.3.1/src/mpl/include
>>>         -I/home/mark/build/mpich-3.3.1/src/mpl/include
>>>         -I/home/mark/build/mpich-3.3.1/src/openpa/src
>>>         -I/hme/mark/build/mpich-3.3.1/src/openpa/src -D_REENTRANT
>>>         -I/home/mark/build/mpich-3.3.1/src/mpi/romio/include'
>>>         'MPLLIBNAME=mpl'
>>>             Process Manager:         pmi
>>>             Launchers available:         ssh rsh fork slurm ll lsf
>>>         sge manual persist
>>>             Topology libraries available:          hwloc
>>>             Resource management kernels available:   user slurm ll
>>>         lsf sge pbs cobalt
>>>             Checkpointing libraries available:
>>>             Demux engines available:         poll select
>>>         ---------------------------------------------------------
>>>         Thanks for your help,
>>>         Mark
>>>
>>>         Mark A Cunningham, PhD
>>>         Senior Scientist
>>>         Applied Research in Acoustics LLC
>>>         Culpeper VA 22701
>>>
>>>
>>>         _______________________________________________
>>>         discuss mailing listdiscuss at mpich.org  <mailto:discuss at mpich.org>
>>>         To manage subscription options or unsubscribe:
>>>         https://lists.mpich.org/mailman/listinfo/discuss
>>         _______________________________________________
>>         discuss mailing list discuss at mpich.org <mailto:discuss at mpich.org>
>>         To manage subscription options or unsubscribe:
>>         https://lists.mpich.org/mailman/listinfo/discuss
>>
>>
>>     _______________________________________________
>>     discuss mailing listdiscuss at mpich.org  <mailto:discuss at mpich.org>
>>     To manage subscription options or unsubscribe:
>>     https://lists.mpich.org/mailman/listinfo/discuss
>     _______________________________________________
>     discuss mailing list discuss at mpich.org <mailto:discuss at mpich.org>
>     To manage subscription options or unsubscribe:
>     https://lists.mpich.org/mailman/listinfo/discuss
>
>
> _______________________________________________
> discuss mailing list     discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mpich.org/pipermail/discuss/attachments/20191004/ead8b8b3/attachment.html>


More information about the discuss mailing list