[mpich-discuss] libfabric+psm2 performance
Wesley Bland
work at wesbland.com
Tue Jun 15 08:03:32 CDT 2021
Hey Toni,
I’d be surprised if the performance dropped that much, but you can try --with-device=ch4:ofi:psm2 to convert at least some of the branches from runtime to compile-time. Beyond that, I don’t remember enough about OFI 1.5. There might have been some changes in MPICH over the last year or two that make that version not perform as well…
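For illustration, the configure line from the original message with the psm2-specific device string swapped in might look like the sketch below. PREFIX and LIBFABRIC_DIR are hypothetical placeholders for the paths elided as "..." in the original; the only intended change is the --with-device value.

```shell
#!/bin/sh
# Sketch only: builds the MPICH configure arguments with the psm2-specific
# ch4:ofi device string. Both paths are hypothetical placeholders for the
# values elided as "..." in the original configure line.
PREFIX="${PREFIX:-$HOME/mpich-install}"
LIBFABRIC_DIR="${LIBFABRIC_DIR:-$HOME/libfabric-install}"

# Print one configure argument per line for the given prefix and libfabric dir.
mpich_configure_args() {
    printf '%s\n' \
        "--prefix=$1" \
        "--with-device=ch4:ofi:psm2" \
        "--with-libfabric=$2"
}

# Dry run: show the command instead of executing it.
echo "./configure $(mpich_configure_args "$PREFIX" "$LIBFABRIC_DIR" | tr '\n' ' ')"
```

Once the printed command looks right, it can be run as-is from the MPICH source tree.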
Thanks,
Wes
> On Jun 15, 2021, at 4:46 AM, Antonio Peña via discuss <discuss at mpich.org> wrote:
>
>
> Hi folks,
>
> I'm setting up an MPICH over libfabric over psm2 for MareNostrum (Omni-Path), to try out some ideas.
>
> I've compiled libfabric 1.5 (the last one that compiles on this machine) over opa-psm2-11.2.185, and mpich-3.4.2 + mpich-4.0a1 in both ch3 and ch4 (yes, 4 MPICH variants). There's only psm2 support in libfabric, so no danger of falling back to other providers. ldd confirms my libfabric is linked.
>
> ./fi_info
> provider: psm2
> fabric: psm2
> domain: psm2
> version: 1.5
> type: FI_EP_RDM
> protocol: FI_PROTO_PSMX2
>
> I'm comparing 2-node pt2pt performance against impi/2017.4 using osu microbenchmarks.
>
> While both fi_pingpong and impi give me a max. BW of ~10 MB/s, all mpich versions stick at ~3 MB/s.
>
> Is this expected? I mean, is there so much secret sauce in impi? Or am I likely doing something wrong?
>
> I'm doing fairly plain configures, nothing fancy, e.g.:
> ./configure --prefix=... --with-device=ch4:ofi --with-libfabric=...
>
> I'd appreciate some guidance - my MPICH tweaking is a little rusty :)
>
> Best,
> Toni