Dear User, I am trying to use the tool "Compute quality statistics<http://main.g2.bx.psu.edu/tool_runner?tool_id=cshl_fastx_quality_statistics>" in galaxy on Ilumina single reads. The file is 2.3 Gb, fastq format. I have performed Quality format converter on the data set and the format is now qualillumina. Despite of this, galaxy don't recognize any dataset in the workflow to use as input into quality statistics. Any idea why my dataset is not accepted as input? Best, Robin
On Tue, May 3, 2011 at 10:00 AM, Robin Mjelle <robinmjelle@gmail.com> wrote:
Dear User,
I am trying to use the tool "Compute quality statistics" in galaxy on Ilumina single reads. The file is 2.3 Gb, fastq format. I have performed Quality format converter on the data set and the format is now qualillumina. Despite of this, galaxy don't recognize any dataset in the workflow to use as input into quality statistics. Any idea why my dataset is not accepted as input?
Best,
Robin
Looking at the XML wrapper, it expects fastqsanger ONLY. See fastx_toolkit/fastx_quality_statistics.xml It could in theory take fastqillumina or fastqsolexa as well, I thought there was an open bug report on this issue but I can't find it right now. Certainly fastx_clipper.xml was updated to use the -Q 33 switch only for Sanger quality scores, and from a quick check all the other FASTX tools have this fix except fastx_quality_statistics.xml Peter
Hi, I recently sent out an email asking if anyone knew much about analysis of SNPs etc and how to visualise them. I got some very useful answers and planned to return to the problem when I got a better chance to work on this in some depth. I now have the time but, like an idiot, I've accidentally deleted those email replies! So can I please ask again, does anyone have experience of SNP analysis and, especially, visualisation that can hold my hand whilst I work this out (apologies to the ones who replied last time but can you get in touch again !)? Best Wishes, David. __________________________________ Dr David A. Matthews Senior Lecturer in Virology Room E49 Department of Cellular and Molecular Medicine, School of Medical Sciences University Walk, University of Bristol Bristol. BS8 1TD U.K. Tel. +44 117 3312058 Fax. +44 117 3312091 D.A.Matthews@bristol.ac.uk
Hi David, For NGS data, the tools under "NGS: Indel Analysis" can build simple BED files of indels from SAM files that can be viewed in Galaxy (Visualization tab - i.e. Trackster). For viral genomes, you will not be able to use UCSC. The Galaxy 101 tutorial includes some basic SNP analysis and visualization tasks that may help you to orient: http://main.g2.bx.psu.edu/u/aun1/p/galaxy101 You may get some more replies, but to pull up old responses on the galaxy-user and galaxy-dev mailing list, the archives are here: http://lists.bx.psu.edu/pipermail/galaxy-user/ http://lists.bx.psu.edu/pipermail/galaxy-dev/ To search, use google and enter: site:http://lists.bx.psu.edu/pipermail plus enter whatever keywords would identify your questions (perhaps your name?) Other list members are encouraged to reply to David's inquiry with more help again, Best! Jen Galaxy team On 5/3/11 10:03 AM, David Matthews wrote:
Hi,
I recently sent out an email asking if anyone knew much about analysis of SNPs etc and how to visualise them. I got some very useful answers and planned to return to the problem when I got a better chance to work on this in some depth. I now have the time but, like an idiot, I've accidentally deleted those email replies! So can I please ask again, does anyone have experience of SNP analysis and, especially, visualisation that can hold my hand whilst I work this out (apologies to the ones who replied last time but can you get in touch again !)?
Best Wishes, David.
__________________________________ Dr David A. Matthews
Senior Lecturer in Virology Room E49 Department of Cellular and Molecular Medicine, School of Medical Sciences University Walk, University of Bristol Bristol. BS8 1TD U.K.
Tel. +44 117 3312058 Fax. +44 117 3312091
D.A.Matthews@bristol.ac.uk <mailto:D.A.Matthews@bristol.ac.uk>
___________________________________________________________ The Galaxy User list should be used for the discussion of Galaxy analysis and other features on the public server at usegalaxy.org. Please keep all replies on the list by using "reply all" in your mail client. For discussion of local Galaxy instances and the Galaxy source code, please use the Galaxy Development list:
http://lists.bx.psu.edu/listinfo/galaxy-dev
To manage your subscriptions to this and other Galaxy lists, please use the interface at:
-- Jennifer Jackson http://usegalaxy.org http://galaxyproject.org
Hi Robin, Run the FASTQ Groomer on your datafile first, then use the result as input to the Compute quality statistics tool. (Skip using Quality format converter). Hopefully this helps, Jen Galaxy team On 5/3/11 2:00 AM, Robin Mjelle wrote:
Dear User,
I am trying to use the tool "Compute quality statistics <http://main.g2.bx.psu.edu/tool_runner?tool_id=cshl_fastx_quality_statistics>" in galaxy on Ilumina single reads. The file is 2.3 Gb, fastq format. I have performed Quality format converter on the data set and the format is now qualillumina. Despite of this, galaxy don't recognize any dataset in the workflow to use as input into quality statistics. Any idea why my dataset is not accepted as input?
Best,
Robin
___________________________________________________________ The Galaxy User list should be used for the discussion of Galaxy analysis and other features on the public server at usegalaxy.org. Please keep all replies on the list by using "reply all" in your mail client. For discussion of local Galaxy instances and the Galaxy source code, please use the Galaxy Development list:
http://lists.bx.psu.edu/listinfo/galaxy-dev
To manage your subscriptions to this and other Galaxy lists, please use the interface at:
-- Jennifer Jackson http://usegalaxy.org http://galaxyproject.org
participants (4)
-
David Matthews
-
Jennifer Jackson
-
Peter Cock
-
Robin Mjelle