Hi Larry,

If you are performing RNA-seq analysis, there is no need to filter the data to ensure exact pairs before running Tophat. Later steps will deal with any unpaired sequences.

If you are instead performing variant analysis, then in many cases you will want matched pairs. Join the data first, perform any manipulations that may remove reads entirely (filtering by quality), then split and perform steps that may only trim off ends (if even needed). The FASTQ splitter/joiner are at the top of the "NGS: QC and manipulation" tool group, FASTQ Quality Trimmer is a bit lower down. Use FastQC to understand what thresholds to use, per dataset.

Hopefully this helps,

Jen
Galaxy team




On 7/22/13 9:10 PM, Larry Simpson wrote:
Hi Users

How do I process paired reads from an Ilumina Miseq platform? If I first use "Groomer" and then "Filter by quality", the paired reads get out of sync. I read some responses to similar questions in this Forum  that the paired reads must first be "joined"  before filtering and then split for mapping. There is also a "Fastq interlacer" command to join reads. However, I also read in the Forum that Galaxy requires the filtered paired reads to be of equal length. But would not the filtering process on the joined reads also modify the length of each differently? Or can the  NGS: Picard command "Paired Read Mate Fixer"  be used to re-synchronize the paired reads? 

If there is no way around the  requirement for equal lengths, then I guess that it is really not possible to process paired reads in Galaxy?  But I am sure it is just my stupidity.

Full disclosure: Be kind, I am a novice in this field, just learning Galaxy (which by the way is a fantastic resource!). 

Larry Simpson
UCLA


___________________________________________________________
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using "reply all" in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:

  http://galaxyproject.org/search/mailinglists/

-- 
Jennifer Hillman-Jackson
Galaxy Support and Training
http://galaxyproject.org