[mpich-discuss] Fwd: Can't receive messages

Reuti reuti at staff.uni-marburg.de
Mon Dec 30 04:58:09 CST 2013


Hi,

On 30.12.2013 at 10:58, Matthias Neuer wrote:

> Unfortunately, this does not solve the problem.
> 
> Maybe I should have mentioned that the program hangs after the output shown in my last post: it blocks in MPI_Recv, which never returns.
> 
> By the way, I can log in from one machine to the other using ssh without any problem.

For me it's working even without `fflush`. Do you see anything in the "messages" file on the machine where rank 0 is running?

Just out of curiosity: does adding an MPI_Barrier before the MPI_Finalize change anything for you?

-- Reuti


> Thanks
> 
> On 12/30/2013 06:39 AM, Rajeev Thakur wrote:
>> Try adding an fflush(stdout) after the printf after the recv.
>> 
>> On Dec 29, 2013, at 12:50 PM, Matthias Neuer <mneuer at web.de> wrote:
>> 
>>> 
>>> 
>>> 
>>> -------- Original Message --------
>>> Subject: Can't receive messages
>>> Date: Sun, 29 Dec 2013 19:00:05 +0100
>>> From: Matthias Neuer <mneuer at web.de>
>>> To: discuss at mpich.org
>>> 
>>> Hi.
>>> 
>>> I installed MPICH on 2 machines (called notebook and matze-debian) using
>>> the same source code. The output of mpichversion is identical on both:
>>> 
>>> MPICH Version:          3.0.4
>>> MPICH Release date:     Wed Apr 24 10:08:10 CDT 2013
>>> MPICH Device:           ch3:nemesis
>>> MPICH configure:        --disable-f77 --disable-fc --prefix=/home/matze/mpich-install
>>> MPICH CC:       cc    -O2
>>> MPICH CXX:      c++   -O2
>>> MPICH F77:      no
>>> MPICH FC:       no
>>> 
>>> I wrote a small test program, which I am sending as an attachment.
>>> When I run it on the machine matze-debian, I get the following:
>>> 
>>> Process 0 is on matze-debian
>>> Process 2 is on matze-debian
>>> Process 1 is on notebook
>>> received: Hello, from number 2 on matze-debian
>>> Process 2 has sent the message
>>> Process 1 has sent the message
>>> 
>>> I don't receive the message from process number 1, but the output
>>> suggests that the message was successfully sent. I also checked the
>>> network traffic using wireshark, and matze-debian received a packet in
>>> which I found the sent message. So somehow the program does not
>>> recognize the arrival of the message.
>>> 
>>> When I run the program on a single machine it works.
>>> 
>>> I don't know if this is a programming error or a configuration error.
>>> 
>>> Thanks for your help
>>> 
>>> 
>>> 
>>> <ex1.c>
>>> _______________________________________________
>>> discuss mailing list     discuss at mpich.org
>>> To manage subscription options or unsubscribe:
>>> https://lists.mpich.org/mailman/listinfo/discuss



