[mpich-discuss] Regarding issue with MPICH 3.2
Zhou, Hui
zhouh at anl.gov
Fri Oct 9 09:37:50 CDT 2020
Thanks, Neeraj singh.
To confirm: you always see this error with the profiler, but only intermittently without it? When the error shows up, does it appear at the beginning, in the middle, or near the end of the application's run?
--
Hui Zhou
From: Neeraj singh <neeraj844singh at gmail.com>
Date: Friday, October 9, 2020 at 9:22 AM
To: "Zhou, Hui" <zhouh at anl.gov>
Cc: "discuss at mpich.org" <discuss at mpich.org>
Subject: Re: [mpich-discuss] Regarding issue with MPICH 3.2
Yes, but I sometimes face this issue without the profiler as well, with some applications.
Thanks and regards
On Fri, Oct 9, 2020 at 7:27 PM Zhou, Hui <zhouh at anl.gov> wrote:
Hi Neeraj singh,
To clarify, does this error only happen when you try to use the profiler tool?
--
Hui Zhou
From: Neeraj singh via discuss <discuss at mpich.org>
Reply-To: "discuss at mpich.org" <discuss at mpich.org>
Date: Friday, October 9, 2020 at 8:41 AM
To: "discuss at mpich.org" <discuss at mpich.org>
Cc: Neeraj singh <neeraj844singh at gmail.com>
Subject: [mpich-discuss] Regarding issue with MPICH 3.2
I am using the MAQAO profiler tool to collect sampling-based data. I am facing some issues when launching an application across multiple nodes.
Kindly check the problem lines below and advise.
Thanks & Regards
Problem lines:
[mpiexec at gpu7] HYDU_sock_write (utils/sock/sock.c:294): write error (Bad file descriptor)
[mpiexec at gpu7] HYD_pmcd_pmiserv_send_signal (pm/pmiserv/pmiserv_cb.c:177): unable to write data to proxy
[mpiexec at gpu7] ui_cmd_cb (pm/pmiserv/pmiserv_pmci.c:79): unable to send signal downstream
[mpiexec at gpu7] HYDT_dmxu_poll_wait_for_event (tools/demux/demux_poll.c:76): callback returned error status
[mpiexec at gpu7] HYD_pmci_wait_for_completion (pm/pmiserv/pmiserv_pmci.c:198): error waiting for event
[mpiexec at gpu7] main (ui/mpich/mpiexec.c:340): process manager error waiting for completion
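For reference, "Bad file descriptor" in the first problem line is the standard message for errno EBADF, which a write reports when the descriptor it is given is closed or otherwise invalid. The sketch below is not MPICH or Hydra code; it is only an assumed, minimal reproduction of that errno on a pipe (the function name `write_to_closed_fd` is made up for illustration), and it says nothing about why the descriptor was invalid in the run above.

```python
import errno
import os


def write_to_closed_fd():
    """Write to a descriptor that has already been closed (illustrative only)."""
    read_end, write_end = os.pipe()
    # Close both ends so write_end no longer refers to an open file.
    os.close(read_end)
    os.close(write_end)
    try:
        os.write(write_end, b"signal")
    except OSError as exc:
        # The failed write surfaces as OSError carrying the errno value.
        return exc.errno, os.strerror(exc.errno)
    return None, None


code, msg = write_to_closed_fd()
print(code == errno.EBADF, msg)
```

If the same condition applies here, it would suggest that the descriptor mpiexec uses to reach the proxy was already closed when it tried to forward a signal, but that is an assumption to be checked against the answers to the questions above.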