Hi, Nate,
Hi Luobin,
Sorry for the delay in response. I would suggest moving this discussion
to the galaxy-dev mailing list since it does not contain any private
data. The wide audience on galaxy-dev may be able to come up with
additional ideas than what we on the Galaxy Team come up with.
Please see my responses inline below.
When this happens, is the job stuck in Galaxy or in the queueing system
Luobin Yang wrote:
> Hi,
>
> Thanks for Martin Dahlo's excellent blog (
> http://mdahlo.blogspot.com/2011/06/galaxy-on-uppmax.html) on making SLURM
> work with GALAXY, I am able to run galaxy jobs on a cluster, but I've got a
> couple of issues:
>
> 1. Sometimes a job is in waiting state and it won't start to run until I
> restart galaxy.
(PBS, SGE, something else?)?
In your config file (universe_wsgi.ini), is track_jobs_in_database =
True or False? Are you using SQLite or another database? If you watch
the Galaxy log, are any exceptions or other errors logged when a job
becomes stuck?
> 2. Sometimes a job is in running state even though it is already finishedMost likely, Galaxy is setting metadata on the job's outputs. You can
> and restarting galaxy can make the job's state change from running to
> finished.
probably speed up this process by setting:
set_metadata_externally = True
in the config file.
Thanks,
--nate
>
> I am wondering what's causing those issues and how they can be fixed.
>
> Thanks,
> Luobin
>