<html><head><meta http-equiv="Content-Type" content="text/html charset=iso-8859-1"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; ">That's the correct way to set that environment variable, but I would have to agree with Pavan here. If you're running into the cap on context id's (especially with so few processes) it would seem that you're creating a lot of communicators that you probably don't need. Are you sure that you're freeing them correctly after use?<div><br></div><div>Wesley<br><div><br><div><div>On Jul 8, 2013, at 10:05 AM, Reem Alraddadi <<a href="mailto:raba500@york.ac.uk">raba500@york.ac.uk</a>> wrote:</div><br class="Apple-interchange-newline"><blockquote type="cite"><div dir="ltr"><div><div><div><div>Hi Wesley.<br></div>I wrote the following:<br>mpirun --np 4 --env MPIR_PARAM_CTXID_EAGER_SIZE 1 ./flash4<br></div>but the error still the same. Did I do it in the wrong way??<br><br></div>
Thanks,<br></div>Reem<br><div><div class="gmail_extra"><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<br>
Message: 5<br>
Date: Mon, 8 Jul 2013 08:14:48 -0500<br>
From: Wesley Bland <<a href="mailto:wbland@mcs.anl.gov">wbland@mcs.anl.gov</a>><br>
To: <a href="mailto:discuss@mpich.org">discuss@mpich.org</a><br>
Subject: Re: [mpich-discuss] mpich on Mac os x<br>
Message-ID: <<a href="mailto:8DC984B2-4E4B-4BFE-806E-203463A7A4E4@mcs.anl.gov">8DC984B2-4E4B-4BFE-806E-203463A7A4E4@mcs.anl.gov</a>><br>
Content-Type: text/plain; charset=iso-8859-1<br>
<br>
It seems that you're creating more communicators than MPICH can handle. It's possible that you might be able to get around this by setting the environment variable MPIR_PARAM_CTXID_EAGER_SIZE to something smaller than its default (which is 2). That frees up a few more communicators, but there is a pathological case where even with fewer communicators than the max, MPICH won't be able to agree on a new communicator id when needed. Try changing that environment variable and see if that fixes things.<br>
<br>
Wesley<br>
<br>
On Jul 8, 2013, at 5:33 AM, Reem Alraddadi <<a href="mailto:raba500@york.ac.uk">raba500@york.ac.uk</a>> wrote:<br>
<br>
> Hi all,<br>
> I am using mpich-3.0.4 on Mac os x version 10.7.5 to run FLASH code. It works fine in the beginning of the run and then I got the following error:<br>
><br>
> Fatal error in MPI_Comm_create: Other MPI error, error stack:<br>
> MPI_Comm_create(600).................: MPI_Comm_create(comm=0x84000002, group=0xc8001349, new_comm=0x7fff606a8614) failed<br>
> MPI_Comm_create(577).................:<br>
> MPIR_Comm_create_intra(241)..........:<br>
> MPIR_Get_contextid(799)..............:<br>
> MPIR_Get_contextid_sparse_group(1146): Cannot allocate context ID because of fragmentation (169/2048 free on this process; ignore_id=0)<br>
> Fatal error in MPI_Comm_create: Other MPI error, error stack:<br>
> MPI_Comm_create(600).................: MPI_Comm_create(comm=0x84000002, group=0xc80012b6, new_comm=0x7fff670cc614) failed<br>
> MPI_Comm_create(577).................:<br>
> MPIR_Comm_create_intra(241)..........:<br>
> MPIR_Get_contextid(799)..............:<br>
> MPIR_Get_contextid_sparse_group(1146): Cannot allocate context ID because of fragmentation (316/2048 free on this process; ignore_id=0)<br>
> Fatal error in MPI_Comm_create: Other MPI error, error stack:<br>
> MPI_Comm_create(600).................: MPI_Comm_create(comm=0x84000004, group=0xc800000e, new_comm=0x7fff629d5614) failed<br>
> MPI_Comm_create(577).................:<br>
> MPIR_Comm_create_intra(241)..........:<br>
> MPIR_Get_contextid(799)..............:<br>
> MPIR_Get_contextid_sparse_group(1146): Cannot allocate context ID because of fragmentation (2020/2048 free on this process; ignore_id=0)<br>
> Fatal error in MPI_Comm_create: Other MPI error, error stack:<br>
> MPI_Comm_create(600).................: MPI_Comm_create(comm=0x84000002, group=0xc8000020, new_comm=0x7fff639ae614) failed<br>
> MPI_Comm_create(577).................:<br>
> MPIR_Comm_create_intra(241)..........:<br>
> MPIR_Get_contextid(799)..............:<br>
> MPIR_Get_contextid_sparse_group(1146): Cannot allocate context ID because of fragmentation (2002/2048 free on this process; ignore_id=0<br>
><br>
> Is there a way to fix that ?<br>
><br>
> Thanks,<br>
> Reem<br>
> _______________________________________________<br>
> discuss mailing list <a href="mailto:discuss@mpich.org">discuss@mpich.org</a><br>
> To manage subscription options or unsubscribe:<br>
> <a href="https://lists.mpich.org/mailman/listinfo/discuss" target="_blank">https://lists.mpich.org/mailman/listinfo/discuss</a><br>
<br>
<br></blockquote></div></div></div></div>
_______________________________________________<br>discuss mailing list <a href="mailto:discuss@mpich.org">discuss@mpich.org</a><br>To manage subscription options or unsubscribe:<br><a href="https://lists.mpich.org/mailman/listinfo/discuss">https://lists.mpich.org/mailman/listinfo/discuss</a></blockquote></div><br></div></div></body></html>