I would like to share a little tool for Galaxy that I wrote.
The tool takes FASTA files as input, keeps the header of each sequence, and randomly shuffles the residues of the sequence itself. The output is therefore a FASTA file whose sequences have the same base composition and length as the originals, but in a shuffled order.
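A minimal Python sketch of the idea, for illustration only. It is not necessarily how the tool itself is implemented, and the function names (`read_fasta`, `shuffle_sequence`, `shuffle_fasta`) as well as the `seed` and `width` parameters are hypothetical:

```python
import random


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line, []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def shuffle_sequence(seq, rng):
    """Return seq with its residues in random order (same composition and length)."""
    residues = list(seq)
    rng.shuffle(residues)
    return "".join(residues)


def shuffle_fasta(lines, seed=None, width=60):
    """Return FASTA lines with headers kept and each sequence shuffled.

    seed (assumed parameter) makes the shuffle reproducible;
    width (assumed parameter) is the output line length.
    """
    rng = random.Random(seed)
    out = []
    for header, seq in read_fasta(lines):
        shuffled = shuffle_sequence(seq, rng)
        out.append(header)
        out.extend(shuffled[i:i + width] for i in range(0, len(shuffled), width))
    return out


if __name__ == "__main__":
    example = [">seq1 test", "ACGTACGTAA", "CCGG", ">seq2", "TTTTGGGG"]
    print("\n".join(shuffle_fasta(example, seed=42)))
```

Note that a plain shuffle like this preserves single-base composition only; it does not preserve dinucleotide or higher-order frequencies.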
It is a very simple idea, but it gives me the opportunity to statistically evaluate certain analyses of “real” libraries against the newly created “in silico” libraries.
I hope some of you find this tool useful. Comments are more than welcome!