Trouble with BLAST+ and .loc file
Dear Galaxy users, I have been trying to upload a blastable database in my local instance of galaxy. I have used the nr database and generated all the nhr, nin, nsq, and nal files. I have also edited the blastdb.loc file in the galaxy-dist/tool-data/ directory and it looks like this: database [build data] path nr_01_Mar_2012 nr 15 Mar 2012 /home/user/Desktop/nr.00/nr Nevertheless when i start galaxy the megablast tool can't recognise the database. Am I missing something? Thank you in advance Makis Ladoukakis
2012/3/21 Makis Ladoukakis <makis4ever@hotmail.com>:
Dear Galaxy users,
I have been trying to upload a blastable database in my local instance of galaxy. I have used the nr database and generated all the nhr, nin, nsq, and nal files. I have also edited the blastdb.loc file in the galaxy-dist/tool-data/ directory and it looks like this:
database [build data] path nr_01_Mar_2012 nr 15 Mar 2012 /home/user/Desktop/nr.00/nr
Nevertheless when i start galaxy the megablast tool can't recognise the database. Am I missing something?
The NR database comes split up into many parts, 00 to 06 currently, and you need to download them all. They are linked by the nr.pal file, which you should also have downloaded. The database is then used via the full name of the nr.pal file (but without the .pal extension). If you are running Galaxy on a server, it is likely your systems administrator can/has setup a shared set of NCBI BLAST databases for all the system users (including Galaxy), to avoid unnecessary copies under /home Note that queries about local Galaxy installations are normally handled via the galaxy-dev mailing list (although perhaps the project needs three lists now given local Galaxy installations are getting more common and not everyone wants to follow the Galaxy development itself). Peter
On Wed, Mar 21, 2012 at 10:21 AM, Peter Cock <p.j.a.cock@googlemail.com> wrote:
2012/3/21 Makis Ladoukakis <makis4ever@hotmail.com>:
Dear Galaxy users,
I have been trying to upload a blastable database in my local instance of galaxy. I have used the nr database and generated all the nhr, nin, nsq, and nal files. I have also edited the blastdb.loc file in the galaxy-dist/tool-data/ directory and it looks like this:
database [build data] path nr_01_Mar_2012 nr 15 Mar 2012 /home/user/Desktop/nr.00/nr
Nevertheless when i start galaxy the megablast tool can't recognise the database. Am I missing something?
The NR database comes split up into many parts, 00 to 06 currently, and you need to download them all. They are linked by the nr.pal file, which you should also have downloaded. The database is then used via the full name of the nr.pal file (but without the .pal extension).
If you are running Galaxy on a server, it is likely your systems administrator can/has setup a shared set of NCBI BLAST databases for all the system users (including Galaxy), to avoid unnecessary copies under /home
Note that queries about local Galaxy installations are normally handled via the galaxy-dev mailing list (although perhaps the project needs three lists now given local Galaxy installations are getting more common and not everyone wants to follow the Galaxy development itself).
Peter
Sorry, I missed something else which is vitally important: The NCBI NR database is a protein database, and should be listed in blastdb_p.loc (which is used by the BLASTP wrapper etc) while blastdb.loc is for nucleotide databases only (and used for the BLASTN/megablast wrapper etc). As you were asking about megablast, you probably want the NCBI NT BLAST database instead (although sometimes confusingly the NCBI can use the names ambiguously, for the file names this is very important). Peter
Dear Peter, Thank you for your reply. You are right I do have all 6 parts of the nr database on the server plus the .pal file in the same directory. The .loc file still is: database [build data] path nr_01_Mar_2012 nr 15 Mar 2012 /path/on/the/server/nr_directory/nr but the megablast tool still doesn't recognise the database. Am I missing something? Thank you in advance, Makis Ladoukakis
Date: Wed, 21 Mar 2012 10:21:16 +0000 Subject: Re: [galaxy-user] Trouble with BLAST+ and .loc file From: p.j.a.cock@googlemail.com To: makis4ever@hotmail.com CC: galaxy-user@lists.bx.psu.edu
2012/3/21 Makis Ladoukakis <makis4ever@hotmail.com>:
Dear Galaxy users,
I have been trying to upload a blastable database in my local instance of galaxy. I have used the nr database and generated all the nhr, nin, nsq, and nal files. I have also edited the blastdb.loc file in the galaxy-dist/tool-data/ directory and it looks like this:
database [build data] path nr_01_Mar_2012 nr 15 Mar 2012 /home/user/Desktop/nr.00/nr
Nevertheless when i start galaxy the megablast tool can't recognise the database. Am I missing something?
The NR database comes split up into many parts, 00 to 06 currently, and you need to download them all. They are linked by the nr.pal file, which you should also have downloaded. The database is then used via the full name of the nr.pal file (but without the .pal extension).
If you are running Galaxy on a server, it is likely your systems administrator can/has setup a shared set of NCBI BLAST databases for all the system users (including Galaxy), to avoid unnecessary copies under /home
Note that queries about local Galaxy installations are normally handled via the galaxy-dev mailing list (although perhaps the project needs three lists now given local Galaxy installations are getting more common and not everyone wants to follow the Galaxy development itself).
Peter
Hi Makis, here is my working nr entry: /media/data/ncbi/blast/db/unpacked_nt/nt<tab>ncbi_nt<tab>/media/data/databases/ncbi/blast/db/unpacked_nt/nt i was not able to see if your tabs are correct. So maybe its just a wrongly formatted loc file. Cheers, Bjoern P.S. You can download preformatted blast-databases directly from http://www.ncbi.nlm.nih.gov/staff/tao/URLAPI/blastdb.html
Dear Galaxy users,
I have been trying to upload a blastable database in my local instance of galaxy. I have used the nr database and generated all the nhr, nin, nsq, and nal files. I have also edited the blastdb.loc file in the galaxy-dist/tool-data/ directory and it looks like this:
database [build data] path nr_01_Mar_2012 nr 15 Mar 2012 /home/user/Desktop/nr.00/nr
Nevertheless when i start galaxy the megablast tool can't recognise the database. Am I missing something?
Thank you in advance Makis Ladoukakis
___________________________________________________________ The Galaxy User list should be used for the discussion of Galaxy analysis and other features on the public server at usegalaxy.org. Please keep all replies on the list by using "reply all" in your mail client. For discussion of local Galaxy instances and the Galaxy source code, please use the Galaxy Development list:
http://lists.bx.psu.edu/listinfo/galaxy-dev
To manage your subscriptions to this and other Galaxy lists, please use the interface at:
-- Björn Grüning Albert-Ludwigs-Universität Freiburg Institute of Pharmaceutical Sciences Pharmaceutical Bioinformatics Hermann-Herder-Strasse 9 D-79104 Freiburg i. Br. Tel.: +49 761 203-4872 Fax.: +49 761 203-97769 E-Mail: bjoern.gruening@pharmazie.uni-freiburg.de Web: http://www.pharmaceutical-bioinformatics.org/
participants (3)
-
Björn Grüning
-
Makis Ladoukakis
-
Peter Cock