[mpich-ibm] [mpich-discuss] Problem with MPICH3/OpenPA on IBM P755
Sameh S Sharkawi
sssharka at us.ibm.com
Tue Dec 2 17:11:45 CST 2014
I am not familiar with OPA, but I can point out a couple of issues here:
1 - The PAMI installed on the system, which seems to be 1.2, may not be
compatible with the MPICH3 code. MPICH3 is only supported on 1.4 and above,
and only on Linux.
2 - --with-file-system=bg+bglockless is definitely the wrong option for
this machine. I would try nfs+ufs.
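
For illustration only, the configure line from the original report with just the file-system option swapped might look like the sketch below. Everything except the last option is copied from the report as-is; the variable names PAMI and FS_OPT are only for readability and are not part of MPICH. This is untested on this machine and does not address the PAMI version concern in item 1.

```shell
#!/bin/sh
# Sketch: same configure line as in the report, with only the
# file-system option changed. All other options are copied unchanged.
export OBJECT_MODE=64
PAMI=/opt/ibmhpc/pecurrent/ppe.pami   # path taken from the original report
FS_OPT="--with-file-system=nfs+ufs"   # replaces bg+bglockless

set -- ./configure --enable-f77 --enable-fc --enable-cxx --enable-smpcoll \
  --with-thread-package=pthreads \
  --with-pami="$PAMI" --with-pami-lib="$PAMI/lib" \
  --with-pami-include="$PAMI/include64" \
  --enable-mpe --enable-error-messages=all \
  CC="xlc_r -q64 -qmaxmem=-1" CPP=/usr/ccs/lib/cpp \
  CXX="xlC_r -q64 -qmaxmem=-1" F77="xlf_r -q64 -qmaxmem=-1" \
  FC="xlf90_r -q64 -qmaxmem=-1" CFLAGS="-q64 -qmaxmem=-1" \
  CXXFLAGS="-q64 -qmaxmem=-1" FCFLAGS="-qmaxmem=-1 -q64" \
  FFLAGS="-q64 -qmaxmem=-1" OBJECT_MODE=64 AR="ar -X 64" \
  "$FS_OPT"

if [ -x ./configure ]; then
  "$@"                  # run it when started from the top of the source tree
else
  printf '%s\n' "$@"    # otherwise just list the arguments, one per line
fi
```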
Thanks
Sameh
From: "Balaji, Pavan" <balaji at anl.gov>
To: "discuss at mpich.org" <discuss at mpich.org>
Cc: "ibm at lists.mpich.org" <ibm at lists.mpich.org>
Date: 11/27/2014 09:12 AM
Subject: Re: [mpich-ibm] [mpich-discuss] Problem with MPICH3/OpenPA on IBM P755

Hmm. That's not a good sign. OPA's configure seems to think that it can
use some built-in atomic capability that the compiler/hardware clearly
does not support.
Can you send us the OPA config.log (src/openpa/config.log) so we can look
into it?
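
One way to package that log for sending is sketched below. The helper name pack_opa_log and the default output file name are made up for this example; the default input path is the one mentioned above, relative to the MPICH build directory.

```shell
#!/bin/sh
# pack_opa_log is a hypothetical helper, not part of MPICH or OpenPA.
# It compresses the OPA config.log so it can be attached to a reply.
pack_opa_log() {
  log=${1:-src/openpa/config.log}   # default path as given above
  out=${2:-openpa-config.log.gz}    # output name is an arbitrary choice
  if [ ! -f "$log" ]; then
    echo "not found: $log" >&2
    return 1
  fi
  gzip -c "$log" > "$out" && echo "wrote $out"
}
```

Calling pack_opa_log with no arguments from the build directory uses the defaults; a different log path and output name can be passed as the first and second argument.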
Also cc'ed the IBM folks.
IBM folks, any thoughts?
-- Pavan
> On Nov 27, 2014, at 6:11 AM, Victor Vysotskiy
<victor.vysotskiy at teokem.lu.se> wrote:
>
> Hi,
>
> I am trying to compile the latest nightly snapshot
> ('v3.2a2-24-g4ad367d0') on the IBM P775 machine. MPICH3 is configured
> with the following options:
>
> export OBJECT_MODE=64
> ./configure --enable-f77 --enable-fc --enable-cxx --enable-smpcoll \
>   --with-thread-package=pthreads \
>   --with-pami=/opt/ibmhpc/pecurrent/ppe.pami \
>   --with-pami-lib=/opt/ibmhpc/pecurrent/ppe.pami/lib \
>   --with-pami-include=/opt/ibmhpc/pecurrent/ppe.pami/include64 \
>   --enable-mpe --enable-error-messages=all \
>   CC="xlc_r -q64 -qmaxmem=-1" CPP=/usr/ccs/lib/cpp \
>   CXX="xlC_r -q64 -qmaxmem=-1" F77="xlf_r -q64 -qmaxmem=-1" \
>   FC="xlf90_r -q64 -qmaxmem=-1" CFLAGS="-q64 -qmaxmem=-1" \
>   CXXFLAGS="-q64 -qmaxmem=-1" FCFLAGS="-qmaxmem=-1 -q64" \
>   FFLAGS="-q64 -qmaxmem=-1" OBJECT_MODE=64 AR="ar -X 64" \
>   --with-file-system=bg+bglockless
>
> It compiles without problems, but running the OpenPA tests reports
> failures:
>
> Testing simple integer load-linked/store-conditional functionality
-SKIP-
> LL/SC not available
> Testing simple pointer load-linked/store-conditional functionality
-SKIP-
> LL/SC not available
> Testing integer LL/SC ABA
-SKIP-
> LL/SC not available
> Testing pointer LL/SC ABA
-SKIP-
> LL/SC not available
> Testing integer load/store with 1 thread
*FAILED*
> at test_primitives.c:371 in test_threaded_loadstore_int()...
> Testing pointer load/store with 1 thread
*FAILED*
> at test_primitives.c:556 in test_threaded_loadstore_ptr()...
> Testing add with 1 thread
*FAILED*
> at test_primitives.c:750 in test_threaded_add()...
> Testing incr and decr with 1 thread
PASSED
> Testing decr and test with 1 thread
*FAILED*
> at test_primitives.c:1093 in test_threaded_decr_and_test()...
> Testing fetch and add with 1 thread
*FAILED*
> at test_primitives.c:1344 in test_threaded_faa()...
> Testing fetch and add return values with 1 thread
*FAILED*
> at test_primitives.c:1490 in test_threaded_faa_ret()...
> Testing fetch and incr/decr with 1 thread
PASSED
> Testing fetch and incr return values with 1 thread
*FAILED*
> at test_primitives.c:1796 in test_threaded_fai_ret()...
> Testing fetch and decr return values with 1 thread
*FAILED*
> at test_primitives.c:1953 in test_threaded_fad_ret()...
> Testing integer compare-and-swap with 1 thread
*FAILED*
> at test_primitives.c:2191 in test_threaded_cas_int()...
> Testing pointer compare-and-swap with 1 thread
*FAILED*
> at test_primitives.c:2346 in test_threaded_cas_ptr()...
> Testing grouped integer compare-and-swap with 1 thread
*FAILED*
> at test_primitives.c:2506 in test_grouped_cas_int()...
> Testing grouped pointer compare-and-swap with 1 thread
*FAILED*
> at test_primitives.c:2683 in test_grouped_cas_ptr()...
> Testing integer compare-and-swap fairness with 1 thread
*FAILED*
> at test_primitives.c:2890 in test_threaded_cas_int_fairness()...
> Testing pointer compare-and-swap fairness with 1 thread
*FAILED*
> at test_primitives.c:3087 in test_threaded_cas_ptr_fairness()...
> Testing integer swap with 1 thread
*FAILED*
> at test_primitives.c:3341 in test_threaded_swap_int()...
> Testing pointer swap with 1 thread
*FAILED*
> at test_primitives.c:3496 in test_threaded_swap_ptr()...
> Testing integer LL/SC stack
-SKIP-
> LL/SC not available
> Testing pointer LL/SC stack
-SKIP-
> LL/SC not available
> Testing integer LL/SC stack
-SKIP-
> LL/SC not available
> Testing pointer LL/SC stack
-SKIP-
> LL/SC not available
> Testing integer LL/SC stack
-SKIP-
> LL/SC not available
> Testing pointer LL/SC stack
-SKIP-
> LL/SC not available
> Testing integer LL/SC stack
-SKIP-
> LL/SC not available
> Testing pointer LL/SC stack
-SKIP-
> LL/SC not available
> ***** 16 PRIMITIVES TESTS FAILED! *****
>
> Apparently, a bunch of tests failed. Is there anything to worry about?
> Or can I simply skip these failed tests and use the compiled MPICH3 for
> production?
>
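
To compare runs after changing configure options, the status markers in test output like the above can be tallied with a few lines of shell. This is a minimal sketch: summarize_opa is a hypothetical helper name, and it assumes the output was saved to a file or is piped in on stdin.

```shell
#!/bin/sh
# summarize_opa is a hypothetical helper, not part of OpenPA.
# It counts the PASSED, *FAILED* and -SKIP- markers in saved test output,
# read from the files given as arguments or from stdin when none are given.
summarize_opa() {
  awk '
    /\*FAILED\*/ { failed++ }    # matches the literal marker *FAILED*
    /PASSED/     { passed++ }
    /-SKIP-/     { skipped++ }
    END { printf "passed=%d failed=%d skipped=%d\n", passed, failed, skipped }
  ' "$@"
}
```

The final summary line of the test run ("... PRIMITIVES TESTS FAILED!") is not counted, since it lacks the surrounding asterisks of the per-test marker; the failed count should agree with the number it reports.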
> The software stack used:
> AIX v7.1.0.0
> XLC/XLF compiler v14.01.0000.0008
> POE v1-1.2.0.3
>
> Best,
> Victor.
> _______________________________________________
> discuss mailing list discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss
--
Pavan Balaji ✉️
http://www.mcs.anl.gov/~balaji
_______________________________________________
ibm mailing list
ibm at lists.mpich.org
https://lists.mpich.org/mailman/listinfo/ibm