[mpich-discuss] Using external load-balancer with mpich

Halim Amer aamer at anl.gov
Fri Oct 28 09:58:54 CDT 2016


I don't know what is this load balancer you are using, but Hydra (the 
process manager) can interact with several standard job schedulers, such 
as SLURM and PBS. You can refer to 
https://wiki.mpich.org/mpich/index.php/Using_the_Hydra_Process_Manager 
for more information about how to use Hydra.

Alternatively, if you want to keep your own scheduler, the only solution 
that comes to mind is for the scheduler to communicate indirectly with 
Hydra through a hosts file. The scheduler publishes the list of hosts in 
a file, say hosts.txt, and this file gets passed to hydra with the -f 
option: mpiexec -f hosts.txt. It is up to you to synchronize the 
scheduler and Hydra properly.

Halim
www.mcs.anl.gov/~aamer

On 10/27/16 11:59 PM, lostfreeman at gmail.com wrote:
> I have a system, that can start and stop machines depending on fleet’s
> current load. Essentially, it must be the one to decide, which machine
> will receive a new task, and optionally start acquire a new one from
> some pool for it.
>
>
>
> For example, I can have two hosts of six currently executing something
> (might be not related to MPICH), and my load balancer machine is aware
> of it. Other 4 are in low-power state to conserve costs. So when I call
> mpiexec, I want to tell it to send all the tasks to the load balancer,
> requesting 2 hosts, so that load balancer then could start two new hosts
> to handle that request. If the initial hosts would not be busy with some
> other task, load balancer would send incoming jobs to them, and kept all
> other four deallocated.
>
>
>
> - Victor
>
>
>
> *From: *Halim Amer <mailto:aamer at anl.gov>
> *Sent: *Thursday, October 27, 2016 9:18 PM
> *To: *discuss at mpich.org <mailto:discuss at mpich.org>
> *Subject: *Re: [mpich-discuss] Using external load-balancer with mpich
>
>
>
> I don't understand what you are trying to do. Can you give an example?
>
>
>
> Halim
>
> www.mcs.anl.gov/~aamer
>
>
>
> On 10/26/16 5:21 PM, lost wrote:
>
>> Can I use an external load balancer with mpiexec by providing a single
>
>> hostname in hosts file with, optionally, some large number for host
>
>> rank, and putting load balancer listening on that hostname and
>
>> forwarding connections to the actual hosts?
>
>>
>
>> I am trying to achieve autoscaling (load balancer tracks liveness of
>
>> hosts and spins up new ones on demand).
>
>>
>
>>
>
>> _______________________________________________
>
>> discuss mailing list     discuss at mpich.org
>
>> To manage subscription options or unsubscribe:
>
>> https://lists.mpich.org/mailman/listinfo/discuss
>
>>
>
> _______________________________________________
>
> discuss mailing list     discuss at mpich.org
>
> To manage subscription options or unsubscribe:
>
> https://lists.mpich.org/mailman/listinfo/discuss
>
>
>
_______________________________________________
discuss mailing list     discuss at mpich.org
To manage subscription options or unsubscribe:
https://lists.mpich.org/mailman/listinfo/discuss


More information about the discuss mailing list