Hello,

I have two questions - I apologize if they are trivial..

1) I want to simulate the amount of Illumina sequencing needed to sequence  and assemble a known genome.  Is there a way to randomly pick sequences of a specific length from a genome (either one available online or one I upload)?  Something like "pick 100bp randomly (either strand), move 400-500bp forward and pick another 100bp?"

2) Is there a way to remove redundant sequences from a FASTA file without losing the original sequence names (as happens with "collapse")?

Thanks

Daniel


-- 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Daniel Sher, PhD
Department of Marine Biology
Leon H. Charney School of Marine Sciences
University of Haifa, Mt. Carmel 31905, Haifa, Israel
 
Office +972-4-8240731
Lab    +972-4-8288961
email: dsher@sci.haifa.ac.il