Many thanks to all of you!!
Definitely Nate's approach is a better choice. We are running Slurm 14.03, but Nate's manual is exhaustive enough to recompile even the existing version. (I don't know how we can do this on a running cluster though :) I will most probably go for this solution.
There is a sentence in Nate's answer I don't really understand :
"... using `--clusters` means you have to have your controllers integrated using slurmdbd, ..."
what do you mean by this, Nate?
Carie, I don't actually get how you implemented the hack : did you reduplicate the
class DRMAAJobRunner
under a different name in drmaa.py? And where do you define every next cluster (controller machines)?
Can you give me some more detalis?
Thank you
Nikolay
Hi Nikolay,The slurm branch that John mentioned sounds great! That might be your best bet.I didn't get drmaa to run with multiple clusters with flags, but I did 'assign' different job handlers to different destinations in the drmaa.py runner in Galaxy - but that is a bit of a hacky way to do it.
-Carrie
From: John Chilton <jmchilton@gmail.com>
Date: Monday, February 1, 2016 at 11:02 AM
To: Nikolay Aleksandrov Vazov <n.a.vazov@usit.uio.no>
Cc: "dannon.baker@gmail.com" <dannon.baker@gmail.com>, "galaxy-dev@lists.galaxyproject.org" <galaxy-dev@lists.galaxyproject.org>, Carrie Ganote <cganote@iu.edu>, Nate Coraor <nate@bx.psu.edu>
Subject: Re: Galaxy sending jobs to multiple clusters
Nate has a branch of slurm drmaa that allows specifying a --clustersargument in the native specification this can be used to targetmultiple hosts.
More information can be found here:
Here is how Nate uses it to configure usegalaxy.org:
I guess instead of installing slurm-drmaa for a package manager or thedefault source - you will just need to install Nate's version.
-John
On Wed, Jan 20, 2016 at 1:18 PM, Nikolay Aleksandrov Vazov<n.a.vazov@usit.uio.no> wrote:Hi, John, Dan, Carrie and all others,
I am considering a task of setting up a Galaxy instance which shall sendjobs to more than on cluster at a time. In my case I am using drmaa-pythonand I was wondering if it was possible to configure multiple drmaa runnerseach "pointing" at a different (slurm) control host, e.g.
local
drmaa1
drmaa2
Thanks a lot for your advice
Nikolay
===============Nikolay Vazov, PhDDepartment for Research Computing, University of Oslo