Hi Kory,

The problem with this FASTQ block is that the sequence and quality score identifier lines do not match ('SNPSTER6_0679:1:1:1083:939#0/1 run=100908_SNPSTER6_0679_70929AAXX' vs 'SNPSTER6_0679:1:1:1083:939#0/1'), where the identifier for the sequence line has additional text not found on the identifier for the quality score line, which is not valid for the FASTQ format. Alternatively the quality score identifier line could be only a '+', without the sequence identifier.

The quality score lines appear to be either illumina or solexa, but it is best to check with the source of the data to be sure:
Input ASCII range: 'B'(66) - '_'(95)
Input decimal range: 2 - 31

You'll need to upload valid FASTQ files inorder to work with them in Galaxy. Correct examples of your provided read are:

@SNPSTER6_0679:1:1:1083:939#0/1
NATTTATGGATAGTTGGGTAGTAGGTGTAAATGTATGTGGTAAAAGGCCTAGGAGATTTGTTGATCCAATAAATATGATTAGGGAAACAA
+SNPSTER6_0679:1:1:1083:939#0/1
BIQQIQQQTP[[[[[VVVVQPPPPPTWWWW[[YYTTTOVV____TWVXRWPTQPQWWWWWTOOVV___V_TROOWTWTWTQWQWTTRWRO

or

@SNPSTER6_0679:1:1:1083:939#0/1 run=100908_SNPSTER6_0679_70929AAXX
NATTTATGGATAGTTGGGTAGTAGGTGTAAATGTATGTGGTAAAAGGCCTAGGAGATTTGTTGATCCAATAAATATGATTAGGGAAACAA
+
BIQQIQQQTP[[[[[VVVVQPPPPPTWWWW[[YYTTTOVV____TWVXRWPTQPQWWWWWTOOVV___V_TROOWTWTWTQWQWTTRWRO

or

@SNPSTER6_0679:1:1:1083:939#0/1 run=100908_SNPSTER6_0679_70929AAXX
NATTTATGGATAGTTGGGTAGTAGGTGTAAATGTATGTGGTAAAAGGCCTAGGAGATTTGTTGATCCAATAAATATGATTAGGGAAACAA
+SNPSTER6_0679:1:1:1083:939#0/1 run=100908_SNPSTER6_0679_70929AAXX
BIQQIQQQTP[[[[[VVVVQPPPPPTWWWW[[YYTTTOVV____TWVXRWPTQPQWWWWWTOOVV___V_TROOWTWTWTQWQWTTRWRO

Please let us know if we can be of further assistance.

Thanks for using Galaxy,

Dan


On Feb 2, 2011, at 2:01 PM, Johnson, Kory (NIH/NINDS) [C] wrote:

Hello,
 
My account login is: johnsonko@ninds.nih.gov
 
I am a first time Galaxy user.
 
I have uploaded my sequences as format “fastq” into Galaxy and would like to next use “Groomer” to output Sanger fastq format so to go on with exploring quality via box plot, deciding on a trim length (if any), and map to genome using bwa or bowtie.
 
However, I am running into a problem using “Groomer”.
 
I do not know what format my sequences are per setting the required input parameter.
 
An example of my sequences is as follows:
 
@SNPSTER6_0679:1:1:1083:939#0/1 run=100908_SNPSTER6_0679_70929AAXX
NATTTATGGATAGTTGGGTAGTAGGTGTAAATGTATGTGGTAAAAGGCCTAGGAGATTTGTTGATCCAATAAATATGATTAGGGAAACAA
+SNPSTER6_0679:1:1:1083:939#0/1
BIQQIQQQTP[[[[[VVVVQPPPPPTWWWW[[YYTTTOVV____TWVXRWPTQPQWWWWWTOOVV___V_TROOWTWTWTQWQWTTRWRO
 
… how to tell if you have: “Sanger”, “Solexa”, “Illumina 1.3+”, etc.
 
I have tried to submit to “Groomer” different times using these options one at a time and none return with results.
 
Need help please.
 
Also, what is the expected time for “Groomer” to return results for a file containing 2.7 million reads.
 
Thank you … best,
 
Kory
 
--------------------------------------------
 
Kory R. Johnson, MS, PhD
Sr. Bioinformatics Scientist
 
<image001.jpg>
 
 
Providing Contract Services For:
 
Bioinformatics Section,
Information Technology & Bioinformatics Program,
Division of Intramural Research (DIR),
National Institute of Neurological Disorders & Stroke (NINDS),
National Institutes of Health (NIH),
Bethesda, Maryland
 
Mailing Address:
 
NINDS/NIH
Clinical Center (Building 10)
Office 5S223
9000 Rockville Pike
Bethesda, MD 20892
 
Contact Information:
 
Phone:    301-402-1956
Fax:           301-480-3563
email:       johnsonko@ninds.nih.gov
 
P Green Message:
 
Please consider the environment before printing this e-mail.  Thank you.
 
Important Message:
 
This electronic message transmission contains information intended for the recipient only.  Such that, the information contained herein may be confidential, privaledged, or proprietary.  If you are not the intended recipient, be aware that any disclosure, copying, distribution, or use of this information is strictly prohibited.  If you have received this electronic information in error, please notify the sender immediately by telephone.  Thank you.
 
--------------------------------------------
 
_______________________________________________
galaxy-user mailing list
galaxy-user@lists.bx.psu.edu
http://lists.bx.psu.edu/listinfo/galaxy-user