Getting genomes into a standalone galaxy instance
I've been trying to get genomes into my local galaxy instance to use the "Fetch Sequences - Extract Genomic DNA" tool. I've read plenty of posts about adding custom genomes but I'm just looking to download the standard genomes already included with Galaxy, ie. mm9, hg19, etc. I've already run the updateucsc.sh which created the indexes however, I need to download the actual genome sequences and put them where Galaxy can find them. I'm sure I'm missing a command somewhere. Thanks for the help. Dr. Will Dampier Instructor, Drexel University Assistant Director, Center for Integrated Bioinformatics, School of Biomedical Engineering and Health Systems, Bossone 714, Drexel University, 3143 Chestnut St, Philadelphia, PA 19104
Will;
I've been trying to get genomes into my local galaxy instance to use the "Fetch Sequences - Extract Genomic DNA" tool. I've read plenty of posts about adding custom genomes but I'm just looking to download the standard genomes already included with Galaxy, ie. mm9, hg19, etc.
You'll need to download the genomes you are interested in, index them with faToTwoBit from UCSC: http://hgdownload.cse.ucsc.edu/admin/exe/linux.x86_64/ and then add the locations of the index files to tool-data/alignseq.loc. I've been working on a script to automate this process, which also deals with indexing the genomes for next gen analysis: http://github.com/chapmanb/bcbb/blob/master/ec2/biolinux/data_fabfile.py It's targeted for cloud instances, but you can adjust the amazon_ec2 function to deal with your local setup and customize the scripts to only download and index what you want. Hope this helps, Brad
participants (2)
-
Brad Chapman
-
Will Dampier