<meta http-equiv="Content-Type" content="text/html; charset=utf-8"><div dir="ltr"><div><div><div>Hi<br><br></div>MPICH 3.0.4 was installed on the system because other applications had been tested with that implementation. RAMS model developers specifically recommend mpich2-1.4.1, since they have tested it and run it without any problems. <br></div>How can I debug an MPICH application or save its logs? Although I'm not an expert in MPICH or parallel computing, I need to solve this problem.<br><br></div>Thanks<br></div><br><div class="gmail_quote"><div dir="ltr">On Wed, Apr 12, 2017 at 2:36, Guo, Yanfei (<<a href="mailto:yguo@anl.gov">yguo@anl.gov</a>>) wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi,<br class="gmail_msg">
<br class="gmail_msg">
MPICH 3.0.4 is quite an old version. Can you try your application with the latest MPICH (3.2), available at <a href="http://www.mpich.org/downloads/" rel="noreferrer" class="gmail_msg" target="_blank">http://www.mpich.org/downloads/</a>?<br class="gmail_msg">
<br class="gmail_msg">
Yanfei Guo<br class="gmail_msg">
Postdoctoral Researcher<br class="gmail_msg">
MCS Division, ANL<br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
On 4/11/17, 5:57 AM, "CEAM Meteorología" <<a href="mailto:ceamet@gmail.com" class="gmail_msg" target="_blank">ceamet@gmail.com</a>> wrote:<br class="gmail_msg">
<br class="gmail_msg">
Hi MPICH users<br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
I am trying to run a meteorological simulation with the RAMS model (<a href="http://vandenheever.atmos.colostate.edu/vdhpage/rams.php" rel="noreferrer" class="gmail_msg" target="_blank">http://vandenheever.atmos.colostate.edu/vdhpage/rams.php</a>) on a new cluster with CentOS 7 on all nodes. Other applications run fine on the cluster and send processes to each node, so everything seemed to be working.<br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
But the RAMS model freezes at its first stage in a parallel run. RAMS developers recommend mpich2-1.4.1, which they have tested successfully. If I run a parallel simulation on the master node only, it runs fine and starts the required number of processes.<br class="gmail_msg">
If I try to use the other nodes, the simulation freezes and does not stop with any error message. RAMS processes appear on all the nodes in use and on the master node, but no output is created and the usual RAMS status messages do not appear on screen.<br class="gmail_msg">
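<br class="gmail_msg">
A hedged sketch for diagnosing a freeze like this one: on each node where the processes hang, attach gdb to every RAMS process and print its stack, which can show the call each rank is blocked in. It assumes gdb and pgrep are installed on the nodes, that the user may attach to their own processes, and that the processes are named rams-6.2.03. The names list_pids and dump_backtraces are hypothetical helpers, not part of MPICH or RAMS.<br class="gmail_msg">
<br class="gmail_msg">

```shell
# Hypothetical helper: list PIDs of processes whose name matches exactly
# (assumption: the RAMS executable is called rams-6.2.03 on every node).
list_pids() {
    pgrep -x "$1"
}

# Hypothetical helper: print the stack of every thread of each matching
# process on the local node, without stopping the job permanently.
dump_backtraces() {
    for pid in $(list_pids "$1"); do
        echo "=== PID $pid on $(hostname) ==="
        # -batch makes gdb detach and exit after running the command.
        gdb -p "$pid" -batch -ex "thread apply all bt" 2>&1
    done
}

# Example use on one node while the run is frozen (commented out here):
# dump_backtraces rams-6.2.03 > "backtraces-$(hostname).txt"
```

Running this on the master node and on one of the other nodes, then comparing the two files, is one way to see whether all ranks are waiting in the same place.<br class="gmail_msg">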
<br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
The command line to run the model is<br class="gmail_msg">
<br class="gmail_msg">
[paco@Llamp RUN]$ time ../misc/mpich2-1.4.1/bin/mpirun -verbose -machinefile mpd.hosts -n 20 ./rams-6.2.03 -f RAMSIN<br class="gmail_msg">
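<br class="gmail_msg">
A minimal sketch of one way to save the output of a run like the one above to a file, so it can be inspected or shared after a freeze. It only uses shell redirection and the mpirun options already shown in this thread. The helper name run_logged and the log file name rams-run.log are hypothetical.<br class="gmail_msg">
<br class="gmail_msg">

```shell
# Hypothetical helper: run a command and write both stdout and stderr
# to the given log file. Returns the exit status of the command itself.
run_logged() {
    logfile=$1
    shift
    "$@" >"$logfile" 2>&1
}

# Example use with the command line from this thread (commented out
# because it needs the cluster and the RAMS binary):
# run_logged rams-run.log ../misc/mpich2-1.4.1/bin/mpirun -verbose \
#     -machinefile mpd.hosts -n 20 ./rams-6.2.03 -f RAMSIN
#
# While the job is running (or frozen), follow the log from another terminal:
# tail -f rams-run.log
```

Because the output goes to a file rather than the screen, the log survives even if the job has to be killed by hand.<br class="gmail_msg">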
<br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
I have also compiled and tried to run with MPICH 3.0.4, but it behaves in exactly the same way.<br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
Log messages produced with the -verbose option for both mpich2-1.4.1 and MPICH 3.0.4 can be found at:<br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
MPICH2-1.4.1:<br class="gmail_msg">
<a href="https://www.dropbox.com/s/6sgkarmsi5vrdfd/RAMS-mpich2-1.4.1.log?dl=0" rel="noreferrer" class="gmail_msg" target="_blank">https://www.dropbox.com/s/6sgkarmsi5vrdfd/RAMS-mpich2-1.4.1.log?dl=0</a><br class="gmail_msg">
<br class="gmail_msg">
MPICH3.0.4:<br class="gmail_msg">
<a href="https://www.dropbox.com/s/bxvl5q6dy03pgew/RAMS-mpich2-3.0.4.log?dl=0" rel="noreferrer" class="gmail_msg" target="_blank">https://www.dropbox.com/s/bxvl5q6dy03pgew/RAMS-mpich2-3.0.4.log?dl=0</a><br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
The cpi example runs fine with both MPICH installations.<br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
Thanks in advance for your help and best regards<br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
<br class="gmail_msg">
_______________________________________________<br class="gmail_msg">
discuss mailing list <a href="mailto:discuss@mpich.org" class="gmail_msg" target="_blank">discuss@mpich.org</a><br class="gmail_msg">
To manage subscription options or unsubscribe:<br class="gmail_msg">
<a href="https://lists.mpich.org/mailman/listinfo/discuss" rel="noreferrer" class="gmail_msg" target="_blank">https://lists.mpich.org/mailman/listinfo/discuss</a></blockquote></div>