<html xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
{font-family:Consolas;
panose-1:2 11 6 9 2 2 4 3 2 4;}
@font-face
{font-family:"Times New Roman \(Body CS\)";
panose-1:2 2 6 3 5 4 5 2 3 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
font-size:12.0pt;
font-family:"Calibri",sans-serif;}
span.EmailStyle17
{mso-style-type:personal-compose;
font-family:Consolas;
color:windowtext;
font-weight:normal;
font-style:normal;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:12.0pt;
font-family:"Calibri",sans-serif;}
@page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
{page:WordSection1;}
--></style>
</head>
<body lang="EN-US" link="#0563C1" vlink="#954F72" style="word-wrap:break-word">
<div class="WordSection1">
<p class="MsoNormal"><span style="font-family:Consolas">MPICH Discuss,<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas">So this is an odd one. Thanks to Hui Zhou from this list, I was able to build MPICH 4.0b1. I proceeded to build libraries and my application that I work on. And all seemed to compile just well. I then
ran my model and...crash:<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"> Error opening file species.data -115<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"> NetCDF: Error initializing for parallel access<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas">I did a bit of debugging and found that the crash was due to an environment variable that was set because my application mistakenly thought I was running Intel MPI (mainly because we didn't have detection
for MPICH, so it defaulted to our "default" on this cluster of Intel MPI). When it sees Intel MPI, it sets:<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"> ROMIO_FSTYPE_FORCE="gpfs:"<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas">which we've found is useful when running with Intel MPI on our GPFS system.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas">I whipped up a little netCDF reader executable that essentially does:<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"> call check( nf90_open(FILE_NAME, IOR(NF90_NOWRITE, NF90_MPIIO), comm=MPI_COMM_WORLD, info=MPI_INFO_NULL, ncid=ncid) )<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas">and then:<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"> $ mpirun -np 2 ./simple_xy_rd_mpiio.exe<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"> OX_in(1,1:5): 7.1448361893544643E-008 1.7177123368128377E-007 2.8831971121690003E-007 3.7688394627366506E-007 4.6076729631749913E-007<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"> OX_in(1,1:5): 7.1448361893544643E-008 1.7177123368128377E-007 2.8831971121690003E-007 3.7688394627366506E-007 4.6076729631749913E-007<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"> *** SUCCESS reading example file species.data!<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"> *** SUCCESS reading example file species.data!<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"> $ ROMIO_FSTYPE_FORCE="gpfs:" mpirun -np 2 ./simple_xy_rd_mpiio.exe<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"> NetCDF: Error initializing for parallel access<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"> NetCDF: Error initializing for parallel access<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"> STOP Stopped<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"> STOP Stopped<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas">So, of course the "right" thing to do is not to set that. (Doctor, it hurts when I do this. So stop doing that.)<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas">But it got me wondering, is there perhaps a "better" way I should be building MPICH? Should this flag cause this sort of crash? Or does it mean I build MPICH/ROMIO incorrectly or incompletely (no GPFS
support, say)?<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas">Thanks,<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas">Matt<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-family:Consolas">-- <o:p></o:p></span></p>
<div>
<p class="MsoNormal" style="text-autospace:none"><span style="font-family:Consolas">Matt Thompson, SSAI, Ld Scientific Programmer/Analyst<o:p></o:p></span></p>
<p class="MsoNormal" style="text-autospace:none"><span style="font-family:Consolas">NASA GSFC, Global Modeling and Assimilation Office<o:p></o:p></span></p>
<p class="MsoNormal" style="text-autospace:none"><span style="font-family:Consolas">Code 610.1, 8800 Greenbelt Rd, Greenbelt, MD 20771<o:p></o:p></span></p>
<p class="MsoNormal" style="text-autospace:none"><span style="font-family:Consolas">Phone: 301-614-6712 Fax: 301-614-6246<o:p></o:p></span></p>
</div>
<p class="MsoNormal"><u><span style="font-family:Consolas;color:#0000E9">http://science.gsfc.nasa.gov/sed/bio/matthew.thompson</span></u><o:p></o:p></p>
</div>
</body>
</html>