I’d like to incorporate some short-read genome
assembly programs into Galaxy. The Velvet assembler is a particular favourite
of our facility and ideally I would like to give our users the option of
running Velvet assemblies on FASTQ datasets. The ideal scenario would be for
the contigs generated by velvet to then be piped into an interface to
interproscan.
However, in the first instance, I’d like to focus on
the velvet interface. My main question centres around whether the format velvet
uses to store information is representable as a composite datatype in Galaxy.
Velvet comprises of two elements – a prep step
(velveth) followed by the actual assembly (velvetg). Velveth generates a directory
with files (the files always have the same names). Velvetg is then run with the
directory containing the relevant files as a parameter.
I have read through the composite datatypes wiki page but I’m
not sure if this form of data storage can be represented in galaxy at the
moment.
Any advice would be much appreciated.
All the very best,
Konrad.
Dr Konrad Paszkiewicz
Exeter Sequencing Service,
Biosciences,
Stocker Road,
University of Exeter,
Exeter EX4 4QD, UK.
http://biosciences.exeter.ac.uk/facilities/sequencing/