Hi Dev-Team,
I am seeing a strange error on only one of two identical instances of my Galaxy server, and I cannot reproduce it on the other. A tool runs fine on my test instance but fails on my development instance. The servers are identically configured and running the same build. The handler.log from the successful instance shows the following:
galaxy.jobs DEBUG 2012-08-02 11:31:07,881 (295) Working directory for job is: /hpcdata/galaxy-test/galaxy-setup/galaxy-dist/database/job_working_directory/000/295
galaxy.jobs.handler DEBUG 2012-08-02 11:31:07,881 dispatching job 295 to pbs runner
galaxy.jobs.handler INFO 2012-08-02 11:31:08,083 (295) Job dispatched
galaxy.jobs.runners.pbs DEBUG 2012-08-02 11:31:08,382 (295) submitting file /hpcdata/galaxy-test/galaxy-setup/galaxy-dist/database/pbs/295.sh
galaxy.jobs.runners.pbs DEBUG 2012-08-02 11:31:08,382 (295) command is: perl /hpcdata/galaxy-test/galaxy-setup/galaxy-dist/tools/jax_cnv/cnv-seq.pl --out-basename cnv_seq_result --test /hpcdata/galaxy-test/galaxy-setup/galaxy-dist/database/files/001/dataset_1504.dat --ref /hpcdata/galaxy-test/galaxy-setup/galaxy-dist/database/files/001/dataset_1503.dat --genome-size 2716965481 --p-value 0.001 --bigger-window 1.5 --no-annotate --log2-threshold 0.6; cd /hpcdata/galaxy-test/galaxy-setup/galaxy-dist; /hpcdata/galaxy-test/galaxy-setup/galaxy-dist/set_metadata.sh ./database/files /hpcdata/galaxy-test/galaxy-setup/galaxy-dist/database/job_working_directory/000/295 . /hpcdata/galaxy-test/galaxy-setup/galaxy-dist/universe_wsgi.ini /hpcdata/galaxy-test/galaxy-setup/galaxy-dist/database/tmp/tmplTSKeF /hpcdata/galaxy-test/galaxy-setup/galaxy-dist/database/job_working_directory/000/295/galaxy.json /hpcdata/galaxy-test/galaxy-setup/galaxy-dist/database/job_working_directory/000/295/metadata_in_HistoryDatasetAssociation_689_Oh21hx,/hpcdata/galaxy-test/galaxy-setup/galaxy-dist/database/job_working_directory/000/295/metadata_kwds_HistoryDatasetAssociation_689_xgxlZy,/hpcdata/galaxy-test/galaxy-setup/galaxy-dist/database/job_working_directory/000/295/metadata_out_HistoryDatasetAssociation_689_H0claK,/hpcdata/galaxy-test/galaxy-setup/galaxy-dist/database/job_working_directory/000/295/metadata_results_HistoryDatasetAssociation_689_rdQBs8,,/hpcdata/galaxy-test/galaxy-setup/galaxy-dist/database/job_working_directory/000/295/metadata_override_HistoryDatasetAssociation_689_B0U0MV
galaxy.jobs.runners.pbs DEBUG 2012-08-02 11:31:08,395 (295) queued in default queue as 112605.scyld.localdomain
galaxy.jobs.runners.pbs DEBUG 2012-08-02 11:31:08,776 (295/112605.scyld.localdomain) PBS job state changed from N to R
galaxy.jobs.runners.pbs DEBUG 2012-08-02 11:31:27,117 (295/112605.scyld.localdomain) PBS job state changed from R to C
galaxy.jobs.runners.pbs DEBUG 2012-08-02 11:31:27,118 (295/112605.scyld.localdomain) PBS job has completed successfully
galaxy.jobs DEBUG 2012-08-02 11:31:27,197 finish(): Moved /hpcdata/galaxy-test/galaxy-setup/galaxy-dist/database/job_working_directory/000/295/cnv_seq_result.cnv to /hpcdata/galaxy-test/galaxy-setup/galaxy-dist/database/files/001/dataset_1555.dat as directed by from_work_dir
galaxy.datatypes.metadata DEBUG 2012-08-02 11:31:27,249 loading metadata from file for: HistoryDatasetAssociation 689
galaxy.jobs DEBUG 2012-08-02 11:31:27,382 job 295 ended
galaxy.datatypes.metadata DEBUG 2012-08-02 11:31:27,382 Cleaning up external metadata files
The same log from the failing instance shows the following:
galaxy.jobs DEBUG 2012-08-02 11:54:29,234 (113) Working directory for job is: /hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/database/job_working_directory/000/113
galaxy.jobs.handler DEBUG 2012-08-02 11:54:29,234 dispatching job 113 to pbs runner
galaxy.jobs.handler INFO 2012-08-02 11:54:29,420 (113) Job dispatched
galaxy.jobs.runners.pbs DEBUG 2012-08-02 11:54:29,669 (113) submitting file /hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/database/pbs/113.sh
galaxy.jobs.runners.pbs DEBUG 2012-08-02 11:54:29,669 (113) command is: perl /hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/tools/jax_cnv/cnv-seq.pl --out-basename cnv_seq_result --test /hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/database/files/000/dataset_131.dat --ref /hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/database/files/000/dataset_132.dat --genome-size 2716965481 --p-value 0.001 --bigger-window 1.5 --no-annotate --log2-threshold 0.6; cd /hpcdata/galaxy-dev/galaxy-setup/galaxy-dist; /hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/set_metadata.sh ./database/files /hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/database/job_working_directory/000/113 . /hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/universe_wsgi.ini /hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/database/tmp/tmpgTn7ey /hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/database/job_working_directory/000/113/galaxy.json /hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/database/job_working_directory/000/113/metadata_in_HistoryDatasetAssociation_136_VYG968,/hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/database/job_working_directory/000/113/metadata_kwds_HistoryDatasetAssociation_136_yo_4aK,/hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/database/job_working_directory/000/113/metadata_out_HistoryDatasetAssociation_136_D6HwmI,/hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/database/job_working_directory/000/113/metadata_results_HistoryDatasetAssociation_136_zunslF,,/hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/database/job_working_directory/000/113/metadata_override_HistoryDatasetAssociation_136_FDUxso
galaxy.jobs.runners.pbs DEBUG 2012-08-02 11:54:29,680 (113) queued in default queue as 112609.scyld.localdomain
galaxy.jobs.runners.pbs DEBUG 2012-08-02 11:54:30,163 (113/112609.scyld.localdomain) PBS job state changed from N to R
galaxy.jobs.runners.pbs DEBUG 2012-08-02 11:54:33,298 (113/112609.scyld.localdomain) PBS job state changed from R to C
galaxy.jobs.runners.pbs DEBUG 2012-08-02 11:54:33,298 (113/112609.scyld.localdomain) PBS job has completed successfully
galaxy.jobs DEBUG 2012-08-02 11:54:33,354 finish(): Could not move /hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/database/job_working_directory/000/113/cnv_seq_result.cnv to /hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/database/files/000/dataset_140.dat as directed by from_work_dir
galaxy.jobs DEBUG 2012-08-02 11:54:33,467 job 113 ended
Why would one fail and the other succeed?
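In case it helps narrow this down, here is a minimal sketch of a check that could be run on the development instance as the Galaxy user, using the two paths from the "Could not move" line above. It is not Galaxy code: diagnose_move is a hypothetical helper, and the conditions it tests (missing output file, missing or unwritable destination directory) are only assumptions about why such a move might fail.

```python
import os


def diagnose_move(src, dst):
    """Return a list of plausible reasons a move from src to dst could fail.

    Hypothetical helper for illustration only; it does not reproduce
    whatever Galaxy itself does in finish().
    """
    problems = []

    # The tool may never have written its output into the working directory.
    if not os.path.exists(src):
        problems.append("source does not exist: %s" % src)
    elif not os.access(src, os.R_OK):
        problems.append("source is not readable: %s" % src)

    # The destination directory must exist and be writable by this user.
    dst_dir = os.path.dirname(dst) or "."
    if not os.path.isdir(dst_dir):
        problems.append("destination directory does not exist: %s" % dst_dir)
    elif not os.access(dst_dir, os.W_OK | os.X_OK):
        problems.append("destination directory is not writable: %s" % dst_dir)

    return problems


if __name__ == "__main__":
    # Paths copied from the failing instance's log line.
    src = ("/hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/database/"
           "job_working_directory/000/113/cnv_seq_result.cnv")
    dst = ("/hpcdata/galaxy-dev/galaxy-setup/galaxy-dist/database/"
           "files/000/dataset_140.dat")
    for line in diagnose_move(src, dst) or ["no obvious problem found"]:
        print(line)
```

An empty result would only rule out these particular causes. Note also that the working directory may already have been cleaned up by the time the check runs, in which case a missing source file would not be conclusive.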
Thanks,
Iry