looks like this was some really huge upoad jobs that caused a DOS.
Maybe the pubic instance has some config in place to avoid blowing up in this situation.
While watching the logs during startup i see the memory usage of handler0 go from about 200M to about 4G
right after the simple json module conflict warning. it seems to stop increasing right after it prints the cloudlaunch line.
Soon thereafter it jumps to 7G of RAM at which point it starts swapping.
Then I see a huge number of lines like this (fills my terminal buffer)
galaxy.jobs DEBUG 2014-11-24 21:16:02,205 (159978) Persisting job destination (destination id: gridengine)
galaxy.jobs.handler INFO 2014-11-24 21:16:02,315 (159978) Job dispatched
galaxy.jobs DEBUG 2014-11-24 21:16:02,337 (159979) Working directory for job is: /mnt/galaxy/data/galaxy/job_working_directory/000/159/
159979
galaxy.jobs.handler DEBUG 2014-11-24 21:16:02,352 (159979) Dispatching to gridengine runner
galaxy.tools.deps DEBUG 2014-11-24 21:17:05,076 Building dependency shell command for dependency 'samtools'
galaxy.tools.deps WARNING 2014-11-24 21:17:05,088 Failed to resolve dependency on 'samtools', ignoring
galaxy.jobs DEBUG 2014-11-24 21:17:29,869 (159979) Persisting job destination (destination id: gridengine)
galaxy.jobs.handler INFO 2014-11-24 21:17:30,177 (159979) Job dispatched
galaxy.jobs.runners DEBUG 2014-11-24 21:22:24,137 (159978) command is: python /mnt/galaxy/data/galaxy/galaxy-dist/tools/data_source/upload.py /mnt/galaxy/data/galaxy/galaxy-dist /mnt/galaxy/data/galaxy/tmp/tmpVdupFJ /mnt/galaxy/data/galaxy/tmp/tmpLHXyqa
219868:/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/dataset_219868_files:/mnt/galaxy/data/galaxy/user-data/000/219/dataset_219868.dat 219869:/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/dataset_219869_files:/mnt/galaxy/data/galaxy/user-data/000/219/dataset_219869.dat
219870:/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/dataset_219870_files:/mnt/galaxy/data/galaxy/user-data/000/219/dataset_219870.dat 219871:/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/dataset_219871_files:/mnt/galaxy/data/galaxy/user-data/000/219/dataset_219871.dat
219872:/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/dataset_219872_files:/mnt/galaxy/data/galaxy/user-data/000/219/dataset_219872.dat 219873:/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/dataset_219873_files:/mnt/galaxy/data/galaxy/user-data/000/219/dataset_219873.dat
219874:/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/dataset_219874_files:/mnt/galaxy/data/galaxy/user-data/000/219/dataset_219874.dat 219875:/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/dataset_219875_files:/mnt/galaxy/data/galaxy/user-data/000/219/dataset_219875.dat
219876:/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/dataset_219876_files:/mnt/galaxy/data/galaxy/user-data/000/219/dataset_219876.dat
…
thousands and thousands of lines later...
978/metadata_results_LibraryDatasetDatasetAssociation_27280_rGPnso,,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_override_LibraryDatasetDatasetAssociation_27280_xFpkLn /mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_in_LibraryDatasetDatasetAssociation_27282_EjpcbG,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_kwds_LibraryDatasetDatasetAssociation_27282_wVaFjB,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_out_LibraryDatasetDatasetAssociation_27282_OeHNUy,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_results_LibraryDatasetDatasetAssociation_27282_G1Q9PY,,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_override_LibraryDatasetDatasetAssociation_27282_17V4t6
/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_in_LibraryDatasetDatasetAssociation_27290_Y5X8Ih,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_kwds_LibraryDatasetDatasetAssociation_27290_8E55GA,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_out_LibraryDatasetDatasetAssociation_27290_W11KQm,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_results_LibraryDatasetDatasetAssociation_27290_65u8Ja,,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_override_LibraryDatasetDatasetAssociation_27290_3jbsMb
/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_in_LibraryDatasetDatasetAssociation_27296_DmijyX,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_kwds_LibraryDatasetDatasetAssociation_27296_u1ihbO,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_out_LibraryDatasetDatasetAssociation_27296_MYIvvx,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_results_LibraryDatasetDatasetAssociation_27296_eLwrTR,,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_override_LibraryDatasetDatasetAssociation_27296_UpVqz7
/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_in_LibraryDatasetDatasetAssociation_27321_gSfC2s,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_kwds_LibraryDatasetDatasetAssociation_27321_ienD2I,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_out_LibraryDatasetDatasetAssociation_27321_hUnFKv,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_results_LibraryDatasetDatasetAssociation_27321_zWIzcm,,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_override_LibraryDatasetDatasetAssociation_27321_4KvsAs
/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_in_LibraryDatasetDatasetAssociation_27270_5oWQ0p,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_kwds_LibraryDatasetDatasetAssociation_27270_Rv3rwJ,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_out_LibraryDatasetDatasetAssociation_27270_VyM6CA,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_results_LibraryDatasetDatasetAssociation_27270_3nmQ_S,,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_override_LibraryDatasetDatasetAssociation_27270_ZCTSSJ
/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_in_LibraryDatasetDatasetAssociation_27317_YBjGjl,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_kwds_LibraryDatasetDatasetAssociation_27317_xC8TCo,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_out_LibraryDatasetDatasetAssociation_27317_ygYy0B,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_results_LibraryDatasetDatasetAssociation_27317_WJO3MW,,/mnt/galaxy/data/galaxy/job_working_directory/000/159/159978/metadata_override_LibraryDatasetDatasetAssociation_27317_8MY55s;
sh -c "exit $return_code"
Whaaaa…..
brad
galaxy.web.framework.base DEBUG 2014-11-24 21:15:17,288 Enabling 'workflow' controller, class: WorkflowController
/mnt/galaxy/data/galaxy/galaxy-dist/lib/galaxy/__init__.py:63: UserWarning: Module simplejson was already imported from /mnt/galaxy/data/galaxy/sw/lib/python2.7/site-packages/simplejson/__init__.pyc, but /mnt/galaxy/data/galaxy/galaxy-dist/eggs/simplejson-2.1.1-py2.7-linux-x86_64-ucs2.egg
is being added to sys.path
self.check_version_conflict()
galaxy.web.framework.base DEBUG 2014-11-24 21:15:29,670 Enabling 'cloudlaunch' controller, class: CloudController
galaxy.web.framework.base DEBUG 2014-11-24 21:15:30,620 Enabling 'data_manager' controller, class: DataManager
--
Brad Langhorst, Ph.D.
Applications and Product Development Scientist
I recently upgraded galaxy to the 10/06 unannounced galaxy-dist.
I typically run with 2 web processes and 2 job handlers on a single machine.
It seems that now the job handlers consume all RAM at startup and are killed by the kernel.
Anybody seen something like this?
Bad job? bug?
I’m considering a downgrade to the previous stable release.
Brad
--
Brad Langhorst, Ph.D.
Applications and Product Development Scientist
___________________________________________________________
Please keep all replies on the list by using "reply all"
in your mail client. To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
https://lists.galaxyproject.org/
To search Galaxy mailing lists use the unified search at:
http://galaxyproject.org/search/mailinglists/