operate on genomic intervals

4 May 2012


      Hello,
        I have binned the mouse genome into fragments based on restriction
enzyme cut sites.  So each bin is a fragment flanked by say BamHI.  The
output file is in the interval format: chr# start and end coordinates of
each bin.  I want to count how many times each bin has reads that align to
it.  I mapped my reads using bowtie and generated a dataset (interval
format) for the aligned reads.  I then used join in "operate on genomic
intervals" and asked it to return intervals that innerjoin the "bin file".
The subsequent steps involve grouping and counting and then joining back to
the 1st dataset (BamHI delimited bins).  I have tried this workflow on
small datasets and it worked.  However when I subject my full alignment
file and full BamHI delimited bin file, the tool fails.  I am doing this on
cloud.  Any advice would be appreciated!

Jose

Xianrong Wong

Jennifer Jackson

tags

participants (2)