9 Nov
2011
7:49 a.m.
On Wed, Nov 9, 2011 at 12:03 PM, Bob Harris <rsharris@bx.psu.edu> wrote:
David, in my experience with Illumina sequencing, it looks like the reads at the start of a file have a much higher sequencing error rate. Bob H
Yes, reads at the start and the end of the file come from the edge of the Illumina slide, and tend to be of poorer quality than the reads from the middle. So depending on the purpose in mind, picking 5 million reads from the middle of the file might be fine (and much easier computationally). Peter
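
For what it's worth, taking a block of reads from the middle of the file could be sketched roughly as below. This is only an illustration: it assumes a plain, uncompressed FASTQ file with exactly four lines per read (no wrapped sequence lines), and the function name `copy_middle_reads` and the file names are made up for the example.

```python
from itertools import islice


def copy_middle_reads(in_path, out_path, n_reads):
    """Copy n_reads consecutive reads from the middle of a FASTQ file.

    Assumes strict 4-line FASTQ records (hypothetical helper, not from
    any library). Returns the number of reads actually written.
    """
    # First pass: count the records so we know where the middle is.
    with open(in_path) as handle:
        total = sum(1 for _ in handle) // 4

    # Never ask for more reads than the file holds.
    n_reads = min(n_reads, total)
    # Index of the first read in the centred block.
    start = (total - n_reads) // 2

    # Second pass: stream only the lines belonging to that block.
    with open(in_path) as src, open(out_path, "w") as dst:
        dst.writelines(islice(src, 4 * start, 4 * (start + n_reads)))
    return n_reads
```

A call such as `copy_middle_reads("reads.fastq", "middle.fastq", 5_000_000)` would then write the central 5 million reads. It reads the file twice but never holds more than one line in memory, which is the main reason a contiguous block is so much cheaper than a random sample.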