Not-So-Running Jobs
Hello,

I've encountered a strange situation (or at least strange for me): users are submitting jobs, and the jobs start almost immediately. However, the jobs appear to run for a very long time. Checking the report web page, a job looks like the attached image. The status is "running", but the command line is empty, and no program was executed for this job (I checked with "ps ax -H" and looked for python's child processes).

Some technical information: running on Fedora with Python 2.4.3 and PostgreSQL 8.0. The server is loaded but not too loaded (14.4 load average for 16 cores).

Relevant settings from universe_wsgi.ini:

use_threadpool = true
threadpool_workers = 10
local_job_queue_workers = 5
job_scheduler_policy = galaxy.jobs.schedulingpolicy.roundrobin:UserRoundRobin
job_queue_cleanup_interval = 30

Is this normal?

Thanks,
Gordon.
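For reference, the kind of check described here ("ps ax -H", looking for python's child processes) can be scripted. A minimal sketch, assuming GALAXY_PID is a placeholder for the actual Galaxy server process id (written Python 2-style, matching the Python 2.4.3 above):

import subprocess

GALAXY_PID = 12345   # hypothetical placeholder: the Galaxy server's PID

# List the direct children of the Galaxy process with their command
# lines, to compare against what the report page claims is running.
proc = subprocess.Popen(['ps', '--ppid', str(GALAXY_PID), '-o', 'pid,stat,cmd'],
                        stdout=subprocess.PIPE)
print proc.communicate()[0]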
Assaf Gordon wrote:
Users are submitting jobs, and the jobs start almost immediately. However, the jobs appear to run for a very long time. Checking the report web page, a job looks like the attached image.
The status is "running", but the command line is empty, and no program was executed for this job (I checked with "ps ax -H" and looked for python's child-processes).
Some technical information: Running on Fedora with Python 2.4.3, PostgreSQL 8.0.
The server is loaded but not too loaded (14.4 load average for 16 cores).
Hi Gordon,

Two config options could be the cause here:

local_job_queue_workers (default: 5)
cluster_job_queue_workers (default: 3)

The local workers are the threads available for actually running jobs. So that jobs can be tracked in the database, they are moved to the 'running' state before execution. At that point, if there are not enough threads available, they may sit in the local job runner's queue until a thread becomes available.

The cluster workers option defines the number of threads available to run the prepare/finish methods of jobs running via pbs/sge (it does not control the total number of jobs that can run - the cluster scheduler does that). Jobs with long finish methods (e.g. setting metadata on large datasets) could consume all of these threads, preventing new jobs from reaching the pbs/sge queue until threads become available.

Switching to 'running' before execution is probably wrong, since these jobs are not actually running. It probably makes sense to create a new job state for this limbo position.

--nate
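To make the behaviour described here concrete, below is a minimal sketch (not Galaxy's actual code) of a local runner with a fixed worker pool: a job is marked 'running' as soon as it is handed to the runner, but no process starts until one of the local_job_queue_workers threads picks it up. The Job class and its save_state/execute methods are hypothetical stand-ins.

import os
import threading
import Queue

LOCAL_JOB_QUEUE_WORKERS = 5       # the default mentioned above


class Job:
    # Hypothetical stand-in for a Galaxy job record; the real thing
    # lives in the database.
    def __init__(self, command_line):
        self.command_line = command_line
        self.state = 'new'

    def save_state(self):
        # hypothetical: in Galaxy this would be a database write
        print '%s -> %s' % (self.command_line, self.state)

    def execute(self):
        # simplified stand-in for launching the tool as a child process
        os.system(self.command_line)


job_queue = Queue.Queue()


def enqueue(job):
    # The job is marked 'running' as soon as it is handed to the local
    # runner, so it can be tracked in the database ...
    job.state = 'running'
    job.save_state()
    # ... but it now sits in this queue, with no process started and an
    # empty command line in "ps", until a worker thread is free.
    job_queue.put(job)


def worker():
    while True:
        job = job_queue.get()
        job.execute()            # only here does a real process appear
        job.state = 'ok'
        job.save_state()


for i in range(LOCAL_JOB_QUEUE_WORKERS):
    t = threading.Thread(target=worker)
    t.setDaemon(True)
    t.start()

With 5 workers and more than 5 submitted jobs, the extra jobs show as 'running' in the database while no corresponding child process exists, which matches the symptom in the report page screenshot.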
Thank you for the explanation. Continuing my limbo-jobs problems:
The status is "running", but the command line is empty, and no program was executed for this job (I checked with "ps ax -H" and looked for python's child-processes).
Two config options could be the cause here: local_job_queue_workers (default: 5)
The local workers are the threads available for actually running jobs. So that jobs can be tracked in the database, they are moved to the 'running' state before execution. At that point, if there are not enough threads available, they may sit in the local job runner's queue until a thread becomes available.
My Galaxy uses the local scheduler with the Round Robin policy (and 5 local job queue workers). The "Unfinished jobs" report page shows 5 running jobs (really running, with command lines) and several other "limbo-running" jobs (and tons of "new" jobs).

The problem is that the galaxy python process has only 4 child processes (instead of the expected 5). I double-checked by grepping for the command line that the "Unfinished jobs" page shows - it doesn't exist in the process list ($ ps ax -H).

So it appears Galaxy missed the termination of the job, and one queue worker will be forever lost. The only hint I have is that it was a long-running job, and the user canceled it before it was completed (I actually can't tell if it was executed or just limbo-running).

Is there a way to release the queue worker (besides restarting Galaxy)?

Thanks,
Gordon.
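One plausible mechanism for a permanently lost worker - purely speculative here, and not claimed to be what Galaxy actually does - is an unhandled exception escaping the worker loop, for example while tearing down a job the user has just cancelled. In terms of the sketch above, a defensive worker loop would look like this:

def worker():
    while True:
        job = job_queue.get()
        try:
            job.execute()        # hypothetical execution call, as above
            job.state = 'ok'
        except Exception:
            # Without a handler like this, the exception kills the
            # thread, no replacement worker is started, and the pool
            # silently shrinks from 5 workers to 4.
            job.state = 'error'
        job.save_state()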
Assaf Gordon wrote:
My Galaxy uses the local scheduler with the Round Robin policy (and 5 local job queue workers).
The "Unfinished jobs" report page shows 5 running jobs (really running, with command lines) and several other "limbo-running" jobs (and tons of "new" jobs).
New jobs are often dependent on other, still-running jobs, so some backup can be expected (especially if they are waiting on early steps in a workflow).
The problem is that the galaxy python process has only 4 child processes (instead of the expected 5).
I double-checked by grepping for the command line that the "Unfinished jobs" page shows - it doesn't exist in the process list ($ ps ax -H).
So it appears Galaxy missed the termination of the job, and one queue worker will be forever lost. The only hint I have is that it was a long-running job, and the user canceled it before it was completed (I actually can't tell if it was executed or just limbo-running).
It's possible that the job is still running its finish method (perhaps a new 'finishing' state is also in order). This can be a lengthy process for large datasets where setting metadata is complex.
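Pulling the two suggestions in this thread together, the state progression might look something like the sketch below. The name 'waiting' is invented here for the limbo state proposed earlier, and 'finishing' is the name suggested just above; neither existed in Galaxy at the time.

# Hypothetical summary of the job states discussed in this thread;
# 'waiting' and 'finishing' are the proposed additions.
JOB_STATES = {
    'new':       'created, waiting on the scheduler or on other jobs',
    'waiting':   'handed to the job runner, but no worker thread free yet (proposed)',
    'running':   'a worker thread is executing the tool command line',
    'finishing': 'command done, finish method / metadata still being set (proposed)',
    'ok':        'finished successfully',
    'error':     'finished with an error',
}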
Is there a way to release the queue worker (besides restarting Galaxy)?
Currently, no.

--nate