[mpich-discuss] turning off MPI abort messages
Jeff Hammond
jeff.science at gmail.com
Fri Feb 21 12:19:25 CST 2014
>> Just configure MPICH such that snprintf isn't discovered by configure
>> and you won't see these messages.
>>
>> The other solution is to fix PETSc so that people can't crash it so easily ;-)
>
> Here we go again. It is not CRASHING; it has detected an error conditioning and trying to appropriately and cleanly terminate. The reason it needs to use MPI_Abort() is that often detecting error conditions is not a uniformly collective thing.
>
> Printing a suitable error message and ending is not crashing. But with all the badly formatted “error messages” printed by MPICH I can not control at the end it looks like it is crashing.
You're returning a non-zero exit code, which I consider crashing. I
apologize if this definition disagrees with yours. If this is just
gentle cleanup, why not exit with code=0 as Jim suggested already?
Jeff
>> On Thu, Feb 20, 2014 at 3:19 PM, Jim Dinan <james.dinan at gmail.com> wrote:
>>> If you can find a way to call MPI_Finalize instead, you will portably
>>> eliminate these messages.
>>>
>>> A lesser solution would be to provide an error code of 0 (or MPI_SUCCESS) to
>>> MPI_Abort, e.g. MPI_Comm_abort(MPI_COMM_WORLD, MPI_SUCCESS). This would
>>> eliminate the error message that you are getting from the job launcher.
>>> MPICH could be modified to be quiet about the abort when the application
>>> aborts with an error code of MPI_SUCCESS.
>>>
>>> ~Jim.
>>>
>>>
>>> On Thu, Feb 20, 2014 at 12:33 PM, Barry Smith <bsmith at mcs.anl.gov> wrote:
>>>>
>>>>
>>>> Is there any way to turn off MPICH (and others) printing messages about
>>>> MPI_Abort? We have already prepared and presented useful error messages to
>>>> the user about the situation and would like to avoid having these additional
>>>> messages printed (that often make the situation look worse than it is)
>>>>
>>>> Thanks
>>>>
>>>> Barry
>>>>
>>>> application called MPI_Abort(MPI_COMM_WORLD, 56) - process 0
>>>> [cli_0]: aborting job:
>>>> application called MPI_Abort(MPI_COMM_WORLD, 56) - process 0
>>>>
>>>>
>>>> ==================================================================mailto:discuss at mpich.org=================
>>>> = BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
>>>> = EXIT CODE: 56
>>>> = CLEANING UP REMAINING PROCESSES
>>>> = YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
>>>>
>>>> ===================================================================================
>>>>
>>>>
>>>>
>>>>
>>>> _______________________________________________
>>>> discuss mailing list discuss at mpich.org
>>>> To manage subscription options or unsubscribe:
>>>> https://lists.mpich.org/mailman/listinfo/discuss
>>>
>>>
>>>
>>> _______________________________________________
>>> discuss mailing list discuss at mpich.org
>>> To manage subscription options or unsubscribe:
>>> https://lists.mpich.org/mailman/listinfo/discuss
>>
>>
>>
>> --
>> Jeff Hammond
>> jeff.science at gmail.com
>> _______________________________________________
>> discuss mailing list discuss at mpich.org
>> To manage subscription options or unsubscribe:
>> https://lists.mpich.org/mailman/listinfo/discuss
>
> _______________________________________________
> discuss mailing list discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss
--
Jeff Hammond
jeff.science at gmail.com
More information about the discuss
mailing list