[mpich-discuss] Torque MPICH jobs stuck

Halim Amer aamer at anl.gov
Wed Aug 30 11:00:51 CDT 2017


Which MPICH version are you using? Have you tried the latest 3.2 version?

If it still fails, can you attach your simple Torque job script here?

Halim
www.mcs.anl.gov/~aamer

On 8/30/17 3:18 AM, Souparno Adhikary wrote:
> I know this is not a proper place to discuss this, but, as the 
> Torque-mpich list seems dead, I can't think of any other place to post this.
> 
> MPICH2 was already installed on the servers, and I installed Torque
> afterwards. I opened the required ports by adding them to the iptables file.
> 
> Torque MPI jobs (even simple ones such as hostname) remain stuck. However,
> the jobs are properly distributed across the nodes, and pbsnodes -a shows
> them in order.
> 
> The sched_log and server_logs files do not show anything unusual.
> Therefore, the problem might be with MPICH2.
> 
> Can you please suggest where I should start troubleshooting?
> 
> Thanks,
> 
> Souparno Adhikary,
> CHPC Lab,
> Department of Microbiology,
> University of Calcutta.
> 
> 
> _______________________________________________
> discuss mailing list     discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss
> 