[mpich-discuss] Issue with OrangeFS 2.9.7 direct interface and MPICH 3.3.1 using CH4 device

Kun Feng kfeng1 at hawk.iit.edu
Sun Oct 6 12:20:05 CDT 2019


Hi Min,

If that is the case, please ignore this email. Nothing is wrong without
OrangeFS direct interface. I will try "ch4:ucx". Thank you for the info.

Thanks
Kun


On Sun, Oct 6, 2019 at 10:25 AM Si, Min via discuss <discuss at mpich.org>
wrote:

> Hi Kun,
>
> Can you please try to reproduce the issue in a simple MPI program which
> does not use OrangeFS ? It is hard for the MPICH community to help when
> mixing MPI and OrangeFS together, because we are not OrangeFS experts.
>
> Besides, for InfiniBand networks, you might want to use `ch4:ucx` instead
> of  `ch4:ofi`. But I do not think it causes the failure in your use case.
>
> Best regards,
> Min
>
> On 2019/10/04 12:21, Kun Feng via discuss wrote:
>
> To Whom It May Concern,
>
> Recently, I switched to CH4 device in MPICH 3.3.1 for better network
> performance over the RoCE network we are using.
> I realized that my code fails to run when I use direct interface of
> OrangeFS 2.9.7. It exits without any error. But even simple helloworld
> cannot print anything. It happens only when I enable direct interface of
> OrangeFS by linking -lorangefsposix.
> Could you please help me on this issue?
> Here are some information that might be useful:
> Output of ibv_devinfo of 40Gbps Mellanox ConnectX-4 Lx adapter:
> hca_id: mlx5_0
>         transport:                      InfiniBand (0)
>         fw_ver:                         14.20.1030
>         node_guid:                      248a:0703:0015:a800
>         sys_image_guid:                 248a:0703:0015:a800
>         vendor_id:                      0x02c9
>         vendor_part_id:                 4117
>         hw_ver:                         0x0
>         board_id:                       LNV2430110027
>         phys_port_cnt:                  1
>                 port:   1
>                         state:                  PORT_ACTIVE (4)
>                         max_mtu:                4096 (5)
>                         active_mtu:             1024 (3)
>                         sm_lid:                 0
>                         port_lid:               0
>                         port_lmc:               0x00
>                         link_layer:             Ethernet
>
> hca_id: i40iw0
>         transport:                      iWARP (1)
>         fw_ver:                         0.2
>         node_guid:                      7cd3:0aef:3da0:0000
>         sys_image_guid:                 7cd3:0aef:3da0:0000
>         vendor_id:                      0x8086
>         vendor_part_id:                 14289
>         hw_ver:                         0x0
>         board_id:                       I40IW Board ID
>         phys_port_cnt:                  1
>                 port:   1
>                         state:                  PORT_ACTIVE (4)
>                         max_mtu:                4096 (5)
>                         active_mtu:             1024 (3)
>                         sm_lid:                 0
>                         port_lid:               1
>                         port_lmc:               0x00
>                         link_layer:             Ethernet
> MPICH 3.3.1 configuration command: ./configure --with-device=ch4:ofi
> --with-pvfs2=/home/kfeng/install --enable-shared --enable-romio
> --with-file-system=ufs+pvfs2+zoidfs --enable-fortran=no
> --with-libfabric=/home/kfeng/install
> OrangeFS 2.9.7 configuration command: ./configure
> --prefix=/home/kfeng/install --enable-shared --enable-jni
> --with-jdk=/usr/lib/jvm/java-1.8.0-openjdk-1.8.0.191.b12-0.el7_5.x86_64
> --with-kernel=/usr/src/kernels/3.10.0-862.el7.x86_64
> Make command: mpicc -o ~/hello ~/hello.c -L/home/kfeng/install/lib
> -lorangefsposix
> The verbose outputs of mpiexec are attached.
>
> Thanks
> Kun
>
> _______________________________________________
> discuss mailing list     discuss at mpich.org
> To manage subscription options or unsubscribe:https://lists.mpich.org/mailman/listinfo/discuss
>
>
> _______________________________________________
> discuss mailing list     discuss at mpich.org
> To manage subscription options or unsubscribe:
> https://lists.mpich.org/mailman/listinfo/discuss
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mpich.org/pipermail/discuss/attachments/20191006/74b5053a/attachment.html>


More information about the discuss mailing list