[mpich-discuss] success and failure report for mpich-3.0.2

Pavan Balaji balaji at mcs.anl.gov
Sun Apr 28 17:19:30 CDT 2013


This is not an architecture problem.  MPICH has to be installed on every
machine you launch on, and it looks like some of your machines don't have
it installed.
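
To check that, something along these lines might help.  It is only a
rough sketch: it assumes password-less ssh to each host, and that
mpiexec and hydra_pmi_proxy are the MPICH programs worth looking for.
REMOTE_SH is a made-up override (not an MPICH or ssh setting) so the
function can be tried without ssh.  It prints where a host resolves a
program name through PATH in a non-interactive shell, which can differ
from what you see in a login session.

```sh
#!/bin/sh
# Sketch: show where HOST finds PROG through PATH in a non-interactive shell.
# REMOTE_SH is a hypothetical override for the remote runner; default is ssh.

# resolve_on HOST PROG
# Prints "HOST: PROG -> /path" on success, "HOST: PROG -> NOT FOUND" otherwise.
resolve_on() {
    host=$1
    prog=$2
    # Run the lookup under plain sh so it does not depend on the login shell.
    # PROG is assumed to be a simple name without spaces or quotes.
    if found=$(${REMOTE_SH:-ssh} "$host" "sh -c 'command -v $prog'" 2>/dev/null) \
        && [ -n "$found" ]; then
        printf '%s: %s -> %s\n' "$host" "$prog" "$found"
    else
        printf '%s: %s -> NOT FOUND\n' "$host" "$prog"
        return 1
    fi
}
```

With the hosts from your report, you could then run something like

    for h in sunpc0 sunpc1 linpc1; do resolve_on "$h" hydra_pmi_proxy; done

and repeat it for mpiexec and init_finalize.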

 -- Pavan

On 03/22/2013 10:22 AM US Central Time, Siegmar Gross wrote:
> Hi
> 
> Today I tried to install mpich-3.0.2. I was able to compile and install
> the package for the following architectures:
> 
> openSuSE Linux 12.1, x86_64, Sun C 5.12, 64-bit
> Solaris 10, x86_64, Sun C 5.12, 64-bit
> Solaris 10, sparc, Sun C 5.12, 32-bit
> Solaris 10, sparc, Sun C 5.12, 64-bit
> 
> As described in the RELEASE_NOTES, communication between little-endian
> and big-endian machines is not possible (broken pipe). Unfortunately,
> mixing different operating systems doesn't work either (it seems that
> PATH is not evaluated to find a program).
> 
> sunpc1 fd1026 103 mpiexec -np 2 -host sunpc0,sunpc1 init_finalize
> Hello!
> Hello!
> 
> sunpc1 fd1026 104 mpiexec -np 2 -host sunpc0,linpc1 init_finalize
> [proxy:0:1@linpc1] HYDU_create_process
>   (../../../../mpich-3.0.2/src/pm/hydra/utils/launch/launch.c:74):
>   execvp error on file init_finalize (No such file or directory)
> 
> ========================================================================
> =   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
> =   EXIT CODE: 255
> =   CLEANING UP REMAINING PROCESSES
> =   YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
> ========================================================================
> [proxy:0:0@sunpc0] HYD_pmcd_pmip_control_cmd_cb
>   (../../../../mpich-3.0.2/src/pm/hydra/pm/pmiserv/pmip_cb.c:886):
>   assert (!closed) failed
> [proxy:0:0@sunpc0] HYDT_dmxu_poll_wait_for_event
>   (../../../../mpich-3.0.2/src/pm/hydra/tools/demux/demux_poll.c:77):
>   callback returned error status
> [proxy:0:0@sunpc0] main
>   (../../../../mpich-3.0.2/src/pm/hydra/pm/pmiserv/pmip.c:206):
>   demux engine error waiting for event
> 
> 
> 
> The programs are available on both machines.
> 
> sunpc1 fd1026 105 which init_finalize
> /home/fd1026/SunOS/x86_64/bin/init_finalize
> sunpc1 fd1026 106 ssh linpc1 which init_finalize
> /home/fd1026/Linux/x86_64/bin/init_finalize
> sunpc1 fd1026 107 
> 
> 
> 
> 
> I wasn't able to build the package on the following architectures.
> 
> 
> openSuSE Linux 12.1, x86_64, Sun C 5.12, 32-bit
> openSuSE Linux 12.1, x86_64, GNU C 4.7.1, 32-bit
> ------------------------------------------------
> 
> tyr mpich-3.0.2-Linux.x86_64.32_cc 158 tail log.make.Linux.x86_64.32_cc
>   CC       mplstr.lo
>   CC       mpltrmem.lo
>   CC       mplenv.lo
>   CCLD     libmpl.la
> /usr/lib/libxml2.so: could not read symbols: File in wrong format
> make[2]: *** [libmpl.la] Error 2
> make[2]: Leaving directory `.../src/mpl'
> make[1]: *** [all-recursive] Error 1
> 
> 
> 
> openSuSE Linux 12.1, x86_64, GNU C 4.7.1, 64-bit
> ------------------------------------------------
> 
> tyr mpich-3.0.2-Linux.x86_64.64_gcc 171 tail log.make.Linux.x86_64.64_gcc
>   CC       mpltrmem.lo
>   CC       mplenv.lo
>   CCLD     libmpl.la
> /usr/local/bin/ld: libmpl.so.1: No such file: No such file or directory
> collect2: error: ld returned 1 exit status
> make[2]: *** [libmpl.la] Error 1
> make[2]: Leaving directory `.../src/mpl'
> 
> 
> 
> Solaris 10, x86_64, Sun C 5.12, 32-bit
> --------------------------------------
> 
> The build hangs.
> 
> 
> 
> Solaris 10, x86_64, GNU C 4.7.1, 32-bit
> ---------------------------------------
> 
> tyr mpich-3.0.2-SunOS.x86_64.32_gcc 193 tail -12 log.make.SunOS.x86_64.32_gcc
>   CC       src/mpi/coll/barrier_group.lo
>   CC       src/mpi/coll/helper_fns.lo
> Assembler: helper_fns.c
>         "/var/tmp//ccIdXgrh.s", line 912 : Syntax error
>         Near line: "mov $PREFETCHBLOCK/16, %eax"
>         "/var/tmp//ccIdXgrh.s", line 920 : Syntax error
>         Near line: "mov $PREFETCHBLOCK/8, %eax"
> make[2]: *** [src/mpi/coll/helper_fns.lo] Error 1
> 
> 
> 
> Solaris 10, x86_64, GNU C 4.7.1, 64-bit
> Solaris 10, sparc, GNU C 4.7.1, 32-bit
> Solaris 10, sparc, GNU C 4.7.1, 64-bit
> ---------------------------------------
> 
> tyr mpich-3.0.2-SunOS.x86_64.64_gcc 195 tail log.make.SunOS.x86_64.64_gcc 
>   CXX      src/binding/cxx/initcxx.lo
>   CXXLD    lib/libmpichcxx.la
> ld: fatal: file libmpichcxx.so.10: open failed: No such file or directory
> ld: fatal: file processing errors. No output written to
>   lib/.libs/libmpichcxx.so.10.0.2
> collect2: error: ld returned 1 exit status
> make[2]: *** [lib/libmpichcxx.la] Error 1
> 
> 
> Kind regards
> 
> Siegmar
> 

-- 
Pavan Balaji
http://www.mcs.anl.gov/~balaji


