I’d use the original fasta file and input it into the 'Fasta Manipulation > Compute Sequence Length' tool
Then, using the output, run the 'Statistics > Summary Statistics for any numerical column' tool on c2.
That will give you all the info you’re after.
Dr. Graham Etherington
Bioinformatics Support Officer,
The Sainsbury Laboratory,
Norwich Research Park,
Norwich NR4 7UH.
Tel: +44 (0)1603 450601
I am attempting to use Galaxy to calculate the mean sequence read length and identify the range of read lengths for my 454 data. The data has already been divided into columns:
I have attempted to use the "Summary Statistics" button, however it appears to only be for numerical data and not sequence data. Is this tool/task available
Thank you in advance,