[mpich-discuss] libmpi size unusually large when building with CUDA support
Lucas Zacchi de Medeiros
lucasd at cadence.com
Fri Sep 15 09:52:59 CDT 2023
I work on a project that supports different architectures, so we've built 3 separate versions of MPICH 4.1.1 (cuda11.2, rocm5.4.3 and cpu-only with ucx1.14.1)
The cpu-only and the rocm5 libraries are around 50MB each. The CUDA version on the other hand, is close to 1.8GB! There doesn't seem to be anything out of the ordinary with the builds and all versions are working as expected.
This is the output of du -sh performed on the install directories:
First the rocm5 build:
$ du -sh LINUX_gcc9.3_glibc2.28_rocm5.4.3_ucx1.14.1/lib/*
(...)
53M LINUX_gcc9.3_glibc2.28_rocm5.4.3_ucx1.14.1/lib/libmpi.a
40M LINUX_gcc9.3_glibc2.28_rocm5.4.3_ucx1.14.1/lib/libmpi.so.12.3.0
And then the CUDA build:
$ du -sh LINUX_gcc9.3_glibc2.17_cuda11.2_ucx1.14.1/lib/*
(...)
1.8G LINUX_gcc9.3_glibc2.17_cuda11.2_ucx1.14.1/lib/libmpi.a
1.7G LINUX_gcc9.3_glibc2.17_cuda11.2_ucx1.14.1/lib/libmpi.so.12.3.0
This issue really complicates packaging and distribution since we provide both the archive and the shared libraries, I am looking at more than 3GB for MPICH only.
After some investigation, it doesn't seem the problem is coming from our end. Is it possible that something in mpich's build process is causing this excessive file size?
Thanks for the help.
Kind regards,
Lucas
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mpich.org/pipermail/discuss/attachments/20230915/f8edb554/attachment.html>
More information about the discuss
mailing list