[mpich-discuss] ROMIO filesystem check during MPI_File_open

Jeff Squyres (jsquyres) jsquyres at cisco.com
Mon Mar 24 16:37:10 CDT 2014


On Mar 24, 2014, at 5:17 PM, Rob Latham <robl at mcs.anl.gov> wrote:

> Every process will call ADIO_FileSysType_fncall() in the case where have_nfs_enabled==1 .  Your interpretation of the code is so different from mine that I'm just going to have to past what I'm looking at and you can tell me where I'm wrong.  These lines come from MPICH's ad_fstype.c.  The content is the same in openmpi-1.6.4, just shifted by 25 or so lines:
> 
> 637     ADIO_FileSysType_fncall(filename, &file_system, &myerrcode);
> 638     if (myerrcode != MPI_SUCCESS) {

This is where our disconnect is occurring: _fncall returns myerrcode==MPI_SUCCESS in all cases.

Meaning: the block with the Allreduce is not entered.  This causes everyone to just use the file_system value they got from _fncall: one will have ADIO_UFS, and the rest will have ADIO_NFS.

/Me checks again just to make sure I'm not goofing this up... Yep: I ran this through DDT and verified that process0 gets file_system==ADIO_UFS/myerrcode==MPI_SUCCESS and process1 gets file_system=ADIO_NFS/myerrcode=MPI_SUCCESS.

-- 
Jeff Squyres
jsquyres at cisco.com
For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/




More information about the discuss mailing list