[mpich-discuss] Worries with ROMIO on NFS since commit b4ab2f118d
Eric Chamberland
Eric.Chamberland at giref.ulaval.ca
Tue Nov 21 13:49:33 CST 2017
Hi Mr. Latham,
I have more information now.
When I run my example on NFS, I get the following error:
error #812707360
Other I/O error , error stack:
ADIOI_NFS_READSTRIDED(523): Other I/O error Success
This error is returned by MPI_File_read_all_begin.
When I try on a local disk, everything is fine.
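
A note on the trailing "Success" in the error stack above: it looks like the text that strerror() gives for errno 0 on glibc. That would suggest errno was not set when the error was raised, but this is only one plausible reading and is not confirmed by anything in this report. The sketch below reproduces the message shape in plain C. The helper name format_io_error is hypothetical and is not part of MPICH or ROMIO.

```c
#include <errno.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical helper, NOT part of MPICH or ROMIO.
 * Builds a message with the same "Other I/O error <text>" shape as the
 * error stack above, taking <text> from strerror(saved_errno).
 * Returns the value returned by snprintf. */
int format_io_error(char *buf, size_t len, int saved_errno)
{
    return snprintf(buf, len, "Other I/O error %s", strerror(saved_errno));
}
```

Calling format_io_error(buf, sizeof buf, 0) on a glibc system fills buf with "Other I/O error Success". Other C libraries word errno 0 differently, so the exact text depends on the platform.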
Here are all the files from my current build:
http://www.giref.ulaval.ca/~cmpgiref/dernier_mpich/2017.11.21.05h40m02s_config.log
http://www.giref.ulaval.ca/~cmpgiref/dernier_mpich/2017.11.21.05h40m02s_c.txt
http://www.giref.ulaval.ca/~cmpgiref/dernier_mpich/2017.11.21.05h40m02s_m.txt
http://www.giref.ulaval.ca/~cmpgiref/dernier_mpich/2017.11.21.05h40m02s_mi.txt
http://www.giref.ulaval.ca/~cmpgiref/dernier_mpich/2017.11.21.05h40m02s_mpl_config.log
http://www.giref.ulaval.ca/~cmpgiref/dernier_mpich/2017.11.21.05h40m02s_pm_hydra_config.log
http://www.giref.ulaval.ca/~cmpgiref/dernier_mpich/2017.11.21.05h40m02s_mpiexec_info.txt
I hope this helps to dig further into this issue.
Thanks,
Eric
On 15/11/17 03:55 PM, Eric Chamberland wrote:
> Hi,
>
> We have been compiling against mpich/master every night since August 2016.
>
> Since Nov 8, the mpich/master branch has been failing our nightly build tests.
>
> Here is the Nov 8 config.log:
>
> http://www.giref.ulaval.ca/~cmpgiref/dernier_mpich/2017.11.08.05h36m02s_config.log
>
>
> And here is the Nov 7 config.log:
>
> http://www.giref.ulaval.ca/~cmpgiref/dernier_mpich/2017.11.07.05h36m01s_config.log
>
>
>
> Since Nov 8, one specific ROMIO test hangs indefinitely in optimized
> mode, and in DEBUG mode it triggers a strange (yet to be debugged)
> assertion in our code.
>
> I reran the test manually, and when I write the results to a local disk,
> everything is fine.
>
> However, when I write over *NFS*, the test is faulty.
>
> I have not debugged this far enough yet, but I suspect the problem is
> related to one of:
>
> MPI_File_write_all_begin
> MPI_File_write_all_end
> MPI_File_read_all_begin
> MPI_File_read_all_end
> MPI_File_set_view
> MPI_Type_free
>
> Am I the only one seeing these problems?
>
> Thanks,
> Eric
>
> _______________________________________________
> discuss mailing list discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss