Hello,
We run Galaxy (January 2013 version) in load-balancing mode (5 web managers, 5 job handlers) with Apache, Sun Grid Engine 6.0u4 and CentOS 6.3.
- For the past 2 weeks, some job handler processes have been crashing during Galaxy startup, with the following messages in handlerx.log:
.....
Starting server in PID 13634.
serving on http://127.0.0.1:8091
galaxy.jobs.handler DEBUG 2013-01-29 20:06:48,902 Stopping job 22842:
galaxy.jobs.handler DEBUG 2013-01-29 20:06:48,902 stopping job 22842 in drmaa runner
- The system log files report a segfault in libdrmaa:
kernel: python[13977]: segfault at 0 ip 00007f2811805dc5 sp 00007f27f4aac0a0 error 4 in libdrmaa.so.1.0[7f28116dd000+185000]
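In case it helps with reading the kernel line above, here is a minimal sketch that splits it into its fields and computes the offset of the faulting instruction inside the library (instruction pointer minus the library's load address). The function name parse_segfault and the dictionary keys are made up for illustration and are not part of Galaxy or the drmaa bindings. As far as I understand the format, "segfault at 0" is the address that was accessed (a NULL pointer here), and error 4 indicates a user-mode read of an unmapped page.

```python
import re

# Pattern for the x86-64 Linux kernel segfault line quoted above.
# All numeric fields, including the error code, are printed in hex.
SEGFAULT_RE = re.compile(
    r"segfault at (?P<addr>[0-9a-f]+) ip (?P<ip>[0-9a-f]+) "
    r"sp (?P<sp>[0-9a-f]+) error (?P<err>[0-9a-f]+) "
    r"in (?P<lib>\S+)\[(?P<base>[0-9a-f]+)\+(?P<size>[0-9a-f]+)\]"
)


def parse_segfault(line):
    """Return the parsed fields of a kernel segfault line, or None if no match.

    Hypothetical helper, written only to illustrate the arithmetic.
    """
    match = SEGFAULT_RE.search(line)
    if match is None:
        return None
    ip = int(match.group("ip"), 16)
    base = int(match.group("base"), 16)
    return {
        "library": match.group("lib"),
        "fault_address": int(match.group("addr"), 16),
        "error": int(match.group("err"), 16),
        # Offset of the faulting instruction relative to the library's load address.
        "offset": ip - base,
    }


if __name__ == "__main__":
    line = (
        "kernel: python[13977]: segfault at 0 ip 00007f2811805dc5 "
        "sp 00007f27f4aac0a0 error 4 in libdrmaa.so.1.0[7f28116dd000+185000]"
    )
    info = parse_segfault(line)
    print(info["library"], hex(info["offset"]))
```

The resulting offset could then be looked up with a symbol tool such as addr2line, assuming the installed libdrmaa.so.1.0 still carries symbols.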
Thanks for your help!
Christophe