<meta http-equiv="Content-Type" content="text/html; charset=utf-8"><div dir="ltr"><div>Hi MPICH users<br></div><div><br></div><div>I am trying to run a meteorological simulation with RAMS model (<a href="http://vandenheever.atmos.colostate.edu/vdhpage/rams.php%29">http://vandenheever.atmos.colostate.edu/vdhpage/rams.php)</a>
in a new cluster with CentOs7 in all nodes. We have other applications
running fine in the cluster, sending processes to each node,...
everything seemd to run fine.<br></div><div><br></div><div>But
the RAMS model freezes at its first stage on the parallel run. RAMS
developers recommend using mpich2-1.4.1 as they have successfully
tested. If I try to run a parallel simulation in the master node it runs
fine and starts the required number of processes. If I try to use the
other nodes the simulation freezes, do not stop with any error message;
RAMS model processes appear in all used nodes and in the master node but
any output is created and the usual status messages from RAMS do not
appear on screen.<br></div><div><br></div><div>The commnad line to run the model is<br></div><div>[paco@Llamp RUN]$ time ../misc/mpich2-1.4.1/bin/mpirun -verbose -machinefile mpd.hosts -n 20 ./rams-6.2.03 -f RAMSIN<br></div><div><br></div><div>I have also compiled and tried to run with mpich 3.0.4 but it behaves exactly the same way. </div><div><br></div><div>Log messages with -verbose option for both mpich2-1.4.1 and mpich3.0.4 can be found at<br></div><div><br></div><div>MPICH2-1.4.1: <a href="https://www.dropbox.com/s/6sgkarmsi5vrdfd/RAMS-mpich2-1.4.1.log?dl=0">https://www.dropbox.com/s/6sgkarmsi5vrdfd/RAMS-mpich2-1.4.1.log?dl=0</a><br></div><div>MPICH3.0.4: <a href="https://www.dropbox.com/s/bxvl5q6dy03pgew/RAMS-mpich2-3.0.4.log?dl=0">https://www.dropbox.com/s/bxvl5q6dy03pgew/RAMS-mpich2-3.0.4.log?dl=0</a><br></div><div><br></div><div>cpi example runs fine with both mpich instances<br></div><div><br></div><div>Thanks in advance for your help and best regards</div></div>