[mpich-devel] detecting mpirun mismatch

Jeff Squyres (jsquyres) jsquyres at cisco.com
Tue Apr 22 17:02:27 CDT 2014


Is there something we should put between OMPI and MPICH such that one can detect such a mismatch like this?

Might be worth thinking about.  We get users doing this not infrequently, too.



On Apr 22, 2014, at 5:58 PM, Jeff Hammond <jeff.science at gmail.com> wrote:

> PMI is specific to MPICH-family, right?  The problem that was observed
> had to do with MPICH/OpenMPI mixing.  'mpirun -np 2' led to two copies
> of the application running in serial...
> 
> If PMI implementations are version specific, should they not export
> PMI_VERSION to the environment so bad pairings can be detected?
> 
> Thanks,
> 
> Jeff
> 
> On Tue, Apr 22, 2014 at 4:52 PM, Balaji, Pavan <balaji at anl.gov> wrote:
>> Jeff,
>> 
>> Do you need to know if it’s Hydra or if it’s a PMI server?  Any PMI server can work with any PMI client.  Ideally, they’ll negotiate on the PMI version to use, but a bad client/server might assume a particular version.
>> 
>> You can make sure it’s a PMI server by checking for PMI environment variables.  Either PMI_FD or PMI_PORT will need to be set.
>> 
>>  — Pavan
>> 
>> On Apr 22, 2014, at 4:48 PM, Jeff Hammond <jeff.science at gmail.com> wrote:
>> 
>>> Does anyone know of a good way to detect when the user has built an
>>> MPI application with mpicc from implementation A and tries to launch
>>> with mpirun/mpiexec from implementation B?
>>> 
>>> I know that inside of the application, I can use either
>>> MPI_Get_library_version or preprocessor info to know what
>>> implementation I am using.  But can I discover anything about mpirun
>>> inside of the application?  Is there anything in the environment that
>>> I can query to know if my PM is Hydra, for example?  If not, can that
>>> be added?
>>> 
>>> This is a relatively common user error that leads to very bad things
>>> in some cases so it would be nice to be able to detect it and throw a
>>> warning/error.
>>> 
>>> Thanks,
>>> 
>>> Jeff
>>> 
>>> --
>>> Jeff Hammond
>>> jeff.science at gmail.com
>> 
> 
> 
> 
> -- 
> Jeff Hammond
> jeff.science at gmail.com


-- 
Jeff Squyres
jsquyres at cisco.com
For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/



More information about the devel mailing list