sge & pbs job dependency

1 Jul 2010


      I am looking at the code inside lib/galaxy/jobs. It appears that the 
dependency tracking between jobs is done entirely by Galaxy. In other 
words, the option of e.g. SGE to submit a job with a dependency on 
previously submitted jobs is not used. Galaxy simply waits until the 
dependency jobs have finished, and then it submits the dependent job.
It seems like inefficient to me on a busy cluster. E.g. if a user has 
one "reducer" job in a workflow that depends on a 100 "mapper" jobs, and 
all are submitted at once with an SGE dependency of the mapper on the 
reducer, then the mapper can be at the top of the SGE queue by the time 
all mappers have finished, and run immediately. In the current Galaxy 
implementation, the reducer will only be submitted to SGE queue after 
all mappers have finished, and have to wait for all the jobs in front of 
it (assuming a simple FIFO policy on the  SGE side).
Why is such design necessary?

Andrey Tovchigrechko

James Taylor

tags

participants (2)