[mpich-ibm] [mpich-discuss] Problem with MPICH3/OpenPA on IBM P755

Sameh S Sharkawi sssharka at us.ibm.com
Tue Dec 2 17:11:45 CST 2014


I am not familiar with OPA, but I can point out a couple of issues here:

1 - The PAMI installed on the system, which seems to be 1.2, "may not" be
compatible with the MPICH3 code. MPICH3 is only supported on 1.4 and above,
and only on Linux.
2 - --with-file-system=bg+bglockless is definitely the wrong option for
this machine. I would try nfs+ufs.
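
As a rough sketch of the second point, assuming every other configure option from the original report stays unchanged, only the file-system setting would be swapped. The variable names `FILE_SYSTEM` and `FS_OPTION` are made up for illustration:

```shell
# Sketch only: build the replacement file-system option suggested above.
# FILE_SYSTEM and FS_OPTION are illustrative names, not MPICH variables.
FILE_SYSTEM="nfs+ufs"   # replaces bg+bglockless from the original report
FS_OPTION="--with-file-system=${FILE_SYSTEM}"

# Print the option so it can be reviewed before reconfiguring.
echo "$FS_OPTION"

# Then rerun configure with the other options unchanged, e.g.:
#   ./configure <other options as before> "$FS_OPTION"
```

Rerunning configure from a clean build directory avoids picking up cached results from the earlier run.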

Thanks
Sameh


From:    "Balaji, Pavan" <balaji at anl.gov>
To:      "discuss at mpich.org" <discuss at mpich.org>
Cc:      "ibm at lists.mpich.org" <ibm at lists.mpich.org>
Date:    11/27/2014 09:12 AM
Subject: Re: [mpich-ibm] [mpich-discuss] Problem with MPICH3/OpenPA on IBM P755

Hmm.  That's not a good sign.  OPA's configure seems to think that it can
use some inbuilt atomic capability while the compiler/hardware is clearly
not supporting it.

Can you send us the OPA config.log (src/openpa/config.log) so we can look
into it?
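
A quick way to see what OPA's configure decided is to pull the atomic-related checks out of that log before sending it. This is a minimal sketch: the helper name `opa_atomic_checks` is hypothetical, and the search patterns are a guess at what is relevant, not a documented OPA log format.

```shell
# Hypothetical helper: print the lines of a config.log that mention atomic
# primitives. The default path is the one named in the message above.
opa_atomic_checks() {
    log="${1:-src/openpa/config.log}"
    # Case-insensitive match; these patterns are an assumption, widen as needed.
    grep -i -E 'atomic|primitives' "$log"
}

# Usage, from the top of the MPICH build tree:
#   opa_atomic_checks src/openpa/config.log
```

Lines reporting "yes" for a primitive set that the compiler or hardware does not actually support would be the first thing to look at.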

Also cc'ed the IBM folks.

IBM folks, any thoughts?

  -- Pavan

> On Nov 27, 2014, at 6:11 AM, Victor Vysotskiy <victor.vysotskiy at teokem.lu.se> wrote:
>
> Hi,
>
> I am trying to compile the latest ('v3.2a2-24-g4ad367d0') nightly
> snapshot on the IBM P775 machine. MPICH3 is configured with the
> following options:
>
> export OBJECT_MODE=64
> ./configure --enable-f77 --enable-fc --enable-cxx --enable-smpcoll \
>     --with-thread-package=pthreads \
>     --with-pami=/opt/ibmhpc/pecurrent/ppe.pami \
>     --with-pami-lib=/opt/ibmhpc/pecurrent/ppe.pami/lib \
>     --with-pami-include=/opt/ibmhpc/pecurrent/ppe.pami/include64 \
>     --enable-mpe --enable-error-messages=all \
>     CC="xlc_r -q64 -qmaxmem=-1" CPP=/usr/ccs/lib/cpp \
>     CXX="xlC_r -q64 -qmaxmem=-1" F77="xlf_r -q64 -qmaxmem=-1" \
>     FC="xlf90_r -q64 -qmaxmem=-1" CFLAGS="-q64 -qmaxmem=-1" \
>     CXXFLAGS="-q64 -qmaxmem=-1" FCFLAGS="-qmaxmem=-1 -q64" \
>     FFLAGS="-q64 -qmaxmem=-1" OBJECT_MODE=64 AR="ar -X 64" \
>     --with-file-system=bg+bglockless
>
> No problem with compiling it, but there is a problem with running OpenPA
> tests:
>
> Testing simple integer load-linked/store-conditional functionality  -SKIP-
>    LL/SC not available
> Testing simple pointer load-linked/store-conditional functionality  -SKIP-
>    LL/SC not available
> Testing integer LL/SC ABA  -SKIP-
>    LL/SC not available
> Testing pointer LL/SC ABA  -SKIP-
>    LL/SC not available
> Testing integer load/store with 1 thread  *FAILED*
>        at test_primitives.c:371 in test_threaded_loadstore_int()...
> Testing pointer load/store with 1 thread  *FAILED*
>        at test_primitives.c:556 in test_threaded_loadstore_ptr()...
> Testing add with 1 thread  *FAILED*
>        at test_primitives.c:750 in test_threaded_add()...
> Testing incr and decr with 1 thread  PASSED
> Testing decr and test with 1 thread  *FAILED*
>        at test_primitives.c:1093 in test_threaded_decr_and_test()...
> Testing fetch and add with 1 thread  *FAILED*
>        at test_primitives.c:1344 in test_threaded_faa()...
> Testing fetch and add return values with 1 thread  *FAILED*
>        at test_primitives.c:1490 in test_threaded_faa_ret()...
> Testing fetch and incr/decr with 1 thread  PASSED
> Testing fetch and incr return values with 1 thread  *FAILED*
>        at test_primitives.c:1796 in test_threaded_fai_ret()...
> Testing fetch and decr return values with 1 thread  *FAILED*
>        at test_primitives.c:1953 in test_threaded_fad_ret()...
> Testing integer compare-and-swap with 1 thread  *FAILED*
>        at test_primitives.c:2191 in test_threaded_cas_int()...
> Testing pointer compare-and-swap with 1 thread  *FAILED*
>        at test_primitives.c:2346 in test_threaded_cas_ptr()...
> Testing grouped integer compare-and-swap with 1 thread  *FAILED*
>        at test_primitives.c:2506 in test_grouped_cas_int()...
> Testing grouped pointer compare-and-swap with 1 thread  *FAILED*
>        at test_primitives.c:2683 in test_grouped_cas_ptr()...
> Testing integer compare-and-swap fairness with 1 thread  *FAILED*
>        at test_primitives.c:2890 in test_threaded_cas_int_fairness()...
> Testing pointer compare-and-swap fairness with 1 thread  *FAILED*
>        at test_primitives.c:3087 in test_threaded_cas_ptr_fairness()...
> Testing integer swap with 1 thread  *FAILED*
>        at test_primitives.c:3341 in test_threaded_swap_int()...
> Testing pointer swap with 1 thread  *FAILED*
>        at test_primitives.c:3496 in test_threaded_swap_ptr()...
> Testing integer LL/SC stack  -SKIP-
>    LL/SC not available
> Testing pointer LL/SC stack  -SKIP-
>    LL/SC not available
> Testing integer LL/SC stack  -SKIP-
>    LL/SC not available
> Testing pointer LL/SC stack  -SKIP-
>    LL/SC not available
> Testing integer LL/SC stack  -SKIP-
>    LL/SC not available
> Testing pointer LL/SC stack  -SKIP-
>    LL/SC not available
> Testing integer LL/SC stack  -SKIP-
>    LL/SC not available
> Testing pointer LL/SC stack  -SKIP-
>    LL/SC not available
> ***** 16 PRIMITIVES TESTS FAILED! *****
>
> Apparently, a bunch of tests failed. Is there anything to worry about?
> Or can I simply skip these failed tests and use the compiled MPICH3 in
> production?
>
> The software stack used:
> AIX v7.1.0.0
> XLC/XLF compiler v14.01.0000.0008
> POE v1-1.2.0.3
>
> Best,
> Victor.
> _______________________________________________
> discuss mailing list     discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss

--
Pavan Balaji  ✉️
http://www.mcs.anl.gov/~balaji

_______________________________________________
ibm mailing list
ibm at lists.mpich.org
https://lists.mpich.org/mailman/listinfo/ibm