uploading large files/repeat sequences

22 Mar 2010

      I am trying to see if there are known repeat sequences in my ChIP-seq
data set. These reads are not uniquely alignable, and are therefore excluded
by the ELAND algorithm from the sorted.txt file.  I understand those sequences
are still present in the export.txt file.  I am trying to upload that file to
Galaxy, but have not been able to yet.  Does anyone know the file size
limitation?  Does anyone know the best way to compress such a file to upload
it?  I tried to gzip it, but for some reason the gzipped file has a .gzip.tmp
filename, and I'm not sure if Galaxy can handle this.
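
For reference, here is a minimal sketch of one way to compress the file so
that the output gets a plain .gz name, assuming Python is available on the
machine holding the data.  The helper name gzip_file is hypothetical and only
for illustration, and whether Galaxy accepts the result at this size is not
something this sketch confirms.

```python
import gzip
import shutil
from pathlib import Path


def gzip_file(src, dst=None):
    """Compress src into a gzip file and return the output path.

    gzip_file is a hypothetical helper name. If dst is not given,
    ".gz" is appended to the source name (export.txt -> export.txt.gz).
    """
    src = Path(src)
    dst = Path(dst) if dst else src.with_name(src.name + ".gz")
    # Stream in chunks so a multi-gigabyte file is never held in memory.
    with open(src, "rb") as fin, gzip.open(dst, "wb") as fout:
        shutil.copyfileobj(fin, fout)
    return dst


# Example call (the file name is taken from the post above):
# gzip_file("export.txt")
```

Writing to an explicit output name sidesteps any odd suffix a compression
tool may have left behind, and the original file is left untouched.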

Also, if anyone has any other suggestions on how to analyze the repeated
portion of a ChIP-seq data set, I'd greatly appreciate them.

Keith E. Giles

Ian Donaldson
