<div dir="ltr"><div>If the CPU-only and ROCm builds are the same size, I wonder if ROCm support was compiled in at all. How did you verify that?</div><div><br></div><div>Can you provide the full build commands for each so I can reproduce these builds?</div><div><br></div><div>You might check whether -yaksa-depth=1 (or perhaps 0) changes the library size. There is a performance tradeoff, but if your applications don't use interesting noncontiguous datatypes, it shouldn't matter.</div><div><br></div><div>Jeff</div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Fri, Sep 15, 2023 at 5:53 PM Lucas Zacchi de Medeiros via discuss &lt;<a href="mailto:discuss@mpich.org">discuss@mpich.org</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div class="msg2245223579988298242">
<div lang="EN-US" style="overflow-wrap: break-word;">
<div class="m_2245223579988298242WordSection1">
<p class="MsoNormal" style="margin-bottom:12pt;background:white"><span style="font-size:10.5pt;font-family:'Segoe UI',sans-serif;color:rgb(36,41,47)">I work on a project that supports different architectures, so we've built three separate versions of MPICH 4.1.1 (CUDA 11.2, ROCm 5.4.3, and CPU-only, with UCX 1.14.1).</span></p>
<p class="MsoNormal" style="margin-bottom:12pt;background:white"><span style="font-size:10.5pt;font-family:'Segoe UI',sans-serif;color:rgb(36,41,47)">The CPU-only and ROCm libraries are around 50 MB each. The CUDA version, on the other hand, is close to 1.8 GB! There doesn’t seem to be anything out of the ordinary with the builds, and all versions are working as expected.</span></p>
<p class="MsoNormal" style="background:white"><span style="font-size:10.5pt;font-family:'Segoe UI',sans-serif;color:rgb(36,41,47)">This is the output of </span><span style="font-size:9pt;font-family:Consolas;color:rgb(36,41,47)">du -sh</span><span style="font-size:10.5pt;font-family:'Segoe UI',sans-serif;color:rgb(36,41,47)"> run on the install directories:</span></p>
<p class="MsoNormal" style="margin-bottom:12pt;background:white"><span style="font-size:10.5pt;font-family:'Segoe UI',sans-serif;color:rgb(36,41,47)">First, the ROCm 5 build:</span></p>
<p class="MsoNormal" style="background:white"><span style="font-size:9pt;font-family:Consolas;color:rgb(36,41,47);border:1pt none windowtext;padding:0cm">$ du -sh LINUX_gcc9.3_glibc2.28_rocm5.4.3_ucx1.14.1/lib/*</span></p>
<p class="MsoNormal" style="background:white"><span style="font-size:9pt;font-family:Consolas;color:rgb(36,41,47);border:1pt none windowtext;padding:0cm">(...)</span></p>
<p class="MsoNormal" style="background:white"><span style="font-size:9pt;font-family:Consolas;color:rgb(36,41,47);border:1pt none windowtext;padding:0cm">53M LINUX_gcc9.3_glibc2.28_rocm5.4.3_ucx1.14.1/lib/libmpi.a</span></p>
<p class="MsoNormal" style="background:white"><span style="font-size:9pt;font-family:Consolas;color:rgb(36,41,47);border:1pt none windowtext;padding:0cm">40M LINUX_gcc9.3_glibc2.28_rocm5.4.3_ucx1.14.1/lib/libmpi.so.12.3.0</span></p>
<p class="MsoNormal" style="margin-bottom:12pt;background:white"><span style="font-size:10.5pt;font-family:'Segoe UI',sans-serif;color:rgb(36,41,47)">And then the CUDA build:</span></p>
<p class="MsoNormal" style="background:white"><span style="font-size:9pt;font-family:Consolas;color:rgb(36,41,47);border:1pt none windowtext;padding:0cm">$ du -sh LINUX_gcc9.3_glibc2.17_cuda11.2_ucx1.14.1/lib/*</span></p>
<p class="MsoNormal" style="background:white"><span style="font-size:9pt;font-family:Consolas;color:rgb(36,41,47);border:1pt none windowtext;padding:0cm">(...)</span></p>
<p class="MsoNormal" style="background:white"><span style="font-size:9pt;font-family:Consolas;color:rgb(36,41,47);border:1pt none windowtext;padding:0cm">1.8G LINUX_gcc9.3_glibc2.17_cuda11.2_ucx1.14.1/lib/libmpi.a</span></p>
<p class="MsoNormal" style="margin-bottom:12pt;background:white"><span style="font-size:9pt;font-family:Consolas;color:rgb(36,41,47);border:1pt none windowtext;padding:0cm">1.7G LINUX_gcc9.3_glibc2.17_cuda11.2_ucx1.14.1/lib/libmpi.so.12.3.0<br>
<br>
</span><span style="font-size:10.5pt;font-family:'Segoe UI',sans-serif;color:rgb(36,41,47)">This really complicates packaging and distribution: since we provide both the static archive and the shared library, we are looking at more than 3 GB for MPICH alone.</span></p>
<p class="MsoNormal" style="margin-bottom:12pt;background:white"><span style="font-size:10.5pt;font-family:'Segoe UI',sans-serif;color:rgb(36,41,47)">After some investigation, the problem doesn’t seem to be coming from our end. Is it possible that something in MPICH’s build process is causing this excessive file size?<br>
<br>
Thanks for the help.<br>
<br>
Kind regards,<br>
Lucas</span></p>
</div>
</div>
</div></blockquote></div><br clear="all"><div><br></div><span class="gmail_signature_prefix">-- </span><br><div dir="ltr" class="gmail_signature">Jeff Hammond<br><a href="mailto:jeff.science@gmail.com" target="_blank">jeff.science@gmail.com</a><br><a href="http://jeffhammond.github.io/" target="_blank">http://jeffhammond.github.io/</a></div></div>