Hi Mayank,

The best option I know of is to do the following:

1 - obtain the sequence identifiers for the unmapped reads by filter the SAM file, then cutting them out
2 - convert the original FASTQ file to FASTA - you should get two output, one for the sequences and one for the quality score values
3 - use the tool "Fetch Sequences -> Extract Genomic DNA". The query is the list from #1, the target is the "genome" from #2. Do this twice - once for seqs, once for quals. This means using the target datasets from #2 as Custom Reference genomes - help about how to do this is here:
http://wiki.galaxyproject.org/Support#Custom_reference_genome
4 - combine the FASTA seq and qual files back to FASTQ

If you will be doing this again, then capture the process into a workflow for future use, in a way creating your own "tool".

Hopefully this helps!

Jen
Galaxy

On 7/18/13 9:40 AM, Mayank Tandon wrote:
I was hoping this question had been asked, but I haven't been able to find it.  I want to output the unmapped reads from bowtie as a fastq file for subsequent mapping to other genomes (i.e. the "--un <filename>" option).  I know I can extract the unmapped reads by filtering on the bitwise values in the sam output and converting to fastq with the Picard tool, but I'm using colorspace data and bowtie converts them to letterspace. My understanding (coming mostly from forums and personal discussions) was that the color-to-letter conversion was somehow lossy so mapping the colorspace data directly is always preferable.

So the question is: Is bowtie's '--un' option implemented in Galaxy and if so, how do I access it?

Thanks in advance!


Mayank Tandon


___________________________________________________________
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using "reply all" in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:

  http://lists.bx.psu.edu/listinfo/galaxy-dev

To manage your subscriptions to this and other Galaxy lists,
please use the interface at:

  http://lists.bx.psu.edu/

To search Galaxy mailing lists use the unified search at:

  http://galaxyproject.org/search/mailinglists/

-- 
Jennifer Hillman-Jackson
Galaxy Support and Training
http://galaxyproject.org