New subject: Relabeling dataset pairs in 'list:paired' collection

13 Feb 2017

      Dear Developers

Is there an existing tool or mechanism that can be used to duplicate a 
"list of pairs" dataset collection, keeping the paired datasets the same 
but relabeling each pair with a new identifier taken from a user 
supplied file or list?

I've cobbled together my own tool to try and do something like this:

https://github.com/pjbriggs/Amplicon_analysis-galaxy/blob/77340d8bb2470a646d...

and while it works, it doesn't feel like a good solution as it creates 
duplicates of the datasets from the first collection and consumes 
additional disk/quota space unnecessarily. (This is particularly 
undesirable as we expect that the input collections might be relatively 
large numbers of FASTQ pairs e.g. 30 or more.)

Looking at some of the 'Collection Operations' tools that come with 
Galaxy, it appears that these are able to create new collections without 
making duplicate datasets, which seems much better. But these tools work 
by directly invoking Python classes from the Galaxy core, so I don't 
know if a similar approach could be used in a non-core tool.

Any advice or suggestions are very welcome! Thanks

Best wishes

Peter

-- 
Peter Briggs peter.briggs@manchester.ac.uk
Bioinformatics Core Facility University of Manchester
B.1083 Michael Smith Bldg Tel: (0161) 2751482

Relabeling dataset pairs in 'list:paired' collection

Peter Briggs

John Chilton

Peter Briggs

tags

participants (2)