Dear Galaxy Team,
we have developed a machine learning tool for gene finding that uses a
lot of small files to pass on its parameters learned on the training
It would be cool if we could put them in a directory in the Galaxy
dataset store, e.g. under the same name as the actual output file. What
do you think is the best idea here? It would be important to be able to
delete the datasets again, as they will become huge.
So here is how it works right now (please correct me if there is a
- the user executes a tool that creates an $output file
- galaxy creates the empty file "dataset_###.dat" in the database dir
- the executed program gets this file passed as a parameter and can
write its info there
- galaxy will display the file upon completion of the tool as soon as
the user clicks on the eye logo
In our case, we would like to have a directory instead of
dataset_###.dat and delete it as soon as the purge_datasets.sh is run,
and the user has deleted it from their history.
One idea would be to replace the file dataset_###.dat with a directory
of the same name. What would Galaxy do when you click on the eye? Would
it be correctly deleted?
Another idea we had was to create a directory without the .dat extension
and somehow modify the purge_datasets.sh script to include such
directories in the deletion process.
What would be easiest for you?
Thanks a lot for your time!