Folks,

First, I wanted to thank you for making the datacache available (http://wiki.galaxyproject.org/Admin/Data%20Integration; rsync://datacache.g2.bx.psu.edu). It’s a great resource.

However, what is the best way to stay abreast of changes to what’s in datacache, and understand how these indexes are computed?

We are currently upgrading to bowtie2, but I notice that the bowtie2 indices for mm9, which used to be in

rsync://datacache.g2.bx.psu.edu/indexes/mm9/mm9*/bowtie2_index

have been removed, and only the hg19 genome has bowtie2 indices. Why only that one, and not the others?

Where are the scripts you use to make these indices, in case I want to create bowtie2 indices for other

So, how do I find out *why* they were removed? (Can I safely use the copy I have, or was there a problem with them?)

More generally, how do I understand the policies and logic behind the datacache indices, and be notified of changes, short of running my own periodic rsync/diff?

Finally, since I’m doing “reproducible research” is anything planned for systematically versioning genome indices, so I can easily tell what version of a system (ie, what BWA version) was used to create the index, and be sure that an index will not suddenly disappear.

Thanks,

Curtis

Research Associate/CTSA-Informatics Team

University of Alabama at Birmingham

___________________________________________________________ Please keep all replies on the list by using "reply all" in your mail client. To manage your subscriptions to this and other Galaxy lists, please use the interface at: http://lists.bx.psu.edu/ To search Galaxy mailing lists use the unified search at: http://galaxyproject.org/search/mailinglists/