Experience with Loading NGS data on standalone instance of galaxy

newer
Default global variables in Galaxy

Abhishek Pratap

20 Jul 2009 20 Jul '09

9:18 p.m.

Hi All I recently came to know about NGS analysis on galaxy during ISMB. Getting excited I tried couple of things basically to play with it. Few comments : I may have interepretted something described below in a wrong way. My apologies before hand. On a standalone installation of galaxy while I was trying to explore one FASTQ(sequence) file. It takes considerable (> 20 min) for a fastq file to get uploaded (2 GB). I am not sure what is the rationale behind that. Ideally I think there should be no need to upload such heavy files into the workspace. They could actually be used straight away by the path specified. Also is there any way to access the scripts for analysis on the command line. I know this undermines the main aim of working with galaxy but rite now I am concerned about the performance/time. I will be happy to discuss more about this in case you have some comments/questions for me. Best, -Abhi ----------------------------- Abhishek Pratap Bioinformatics Software Engineer Institute for Genome Sciences School of Medicine, Univ of Maryland 801, W. Baltimore Street, Baltimore, MD 21209 Ph: (+1)-410-706-2296 www.igs.umaryland.edu/

Show replies by date

Anton Nekrutenko

21 Jul 21 Jul

1:36 p.m.

Abhishek: Let talk. This is the area of active current development. We are looking at implementing a universal fastq-like format or supporting multiple formats. Perhaps we should join efforts in ironing out specifications. anton galaxy team On Jul 20, 2009, at 5:18 PM, Abhishek Pratap wrote:

...

Hi All

I recently came to know about NGS analysis on galaxy during ISMB. Getting excited I tried couple of things basically to play with it.

Few comments : I may have interepretted something described below in a wrong way. My apologies before hand.

On a standalone installation of galaxy while I was trying to explore one FASTQ(sequence) file. It takes considerable (> 20 min) for a fastq file to get uploaded (2 GB). I am not sure what is the rationale behind that. Ideally I think there should be no need to upload such heavy files into the workspace. They could actually be used straight away by the path specified. Also is there any way to access the scripts for analysis on the command line. I know this undermines the main aim of working with galaxy but rite now I am concerned about the performance/time.

I will be happy to discuss more about this in case you have some comments/questions for me.

Best, -Abhi

-----------------------------

Abhishek Pratap

Bioinformatics Software Engineer

Institute for Genome Sciences

School of Medicine, Univ of Maryland

801, W. Baltimore Street, Baltimore, MD 21209

Ph: (+1)-410-706-2296

www.igs.umaryland.edu/ _______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Anton Nekrutenko http://nekrut.bx.psu.edu http://galaxyproject.org

Greg Von Kuster

2:44 p.m.

New subject: Experience with Loading NGS data on standalone instance of galaxy

Hello Abhi, Can you clarify the steps you took that produced the behavior? See my comments below. Anton Nekrutenko wrote:

...

Abhishek:

Let talk. This is the area of active current development. We are looking at implementing a universal fastq-like format or supporting multiple formats. Perhaps we should join efforts in ironing out specifications.

anton galaxy team

On Jul 20, 2009, at 5:18 PM, Abhishek Pratap wrote:

...
Hi All

I recently came to know about NGS analysis on galaxy during ISMB. Getting excited I tried couple of things basically to play with it.

Few comments : I may have interepretted something described below in a wrong way. My apologies before hand.

On a standalone installation of galaxy while I was trying to explore one FASTQ(sequence) file. It takes considerable (> 20 min) for a fastq file to get uploaded (2 GB).

Are you using the Galaxy upload utility to create an item in your history that points to the dataset file on disk? I am not sure what is the rationale

...

...
behind that. Ideally I think there should be no need to upload such heavy files into the workspace.

A data file that originates from a place external to Galaxy must be uploaded into Galaxy so that the disk file can be placed in the location configured in the Galaxy config file. Also, when data is uploaded to Galaxy ( either to a history or a library ), several database table settings are created that are used by various Galaxy features. They could actually be used straight

...

...
away by the path specified.

What do you mean by "the path specified"? Also is there any way to access the

...

...
scripts for analysis on the command line. I know this undermines the main aim of working with galaxy but rite now I am concerned about the performance/time.

You should be able to run any Galaxy tool from the command line as long as you have all of the tool's required binaries in your path. However, running a tool from within Galaxy should generally not be any slower than running it outside of Galaxy, depending, of course, on what you are doing.

...

...
I will be happy to discuss more about this in case you have some comments/questions for me.

Best, -Abhi

-----------------------------

Abhishek Pratap

Bioinformatics Software Engineer

Institute for Genome Sciences

School of Medicine, Univ of Maryland

801, W. Baltimore Street, Baltimore, MD 21209

Ph: (+1)-410-706-2296

www.igs.umaryland.edu/ _______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Anton Nekrutenko http://nekrut.bx.psu.edu http://galaxyproject.org

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Abhishek Pratap

4:55 p.m.

Hi All @Greg : Please find my comments below. On Tue, Jul 21, 2009 at 10:44 AM, Greg Von Kuster<ghv2@psu.edu> wrote:

...

Hello Abhi,

Can you clarify the steps you took that produced the behavior? See my comments below.

Anton Nekrutenko wrote:

...
Abhishek:

Let talk. This is the area of active current development. We are looking at implementing a universal fastq-like format or supporting multiple formats. Perhaps we should join efforts in ironing out specifications.

anton galaxy team

On Jul 20, 2009, at 5:18 PM, Abhishek Pratap wrote:

...
Hi All

I recently came to know about NGS analysis on galaxy during ISMB. Getting excited I tried couple of things basically to play with it.

Few comments : I may have interepretted something described below in a wrong way. My apologies before hand.

On a standalone installation of galaxy while I was trying to explore one FASTQ(sequence) file. It takes considerable (> 20 min) for a fastq file to get uploaded (2 GB).

Are you using the Galaxy upload utility to create an item in your history that points to the dataset file on disk?

Yes that is precisely correct, I am trying to upload a solexa FASTQ file but on a standalone galaxy installation from my local file system.

...

I am not sure what is the rationale

...
...
behind that. Ideally I think there should be no need to upload such heavy files into the workspace.

A data file that originates from a place external to Galaxy must be uploaded into Galaxy so that the disk file can be placed in the location configured in the Galaxy config file. Also, when data is uploaded to Galaxy ( either to a history or a library ), several database table settings are created that are used by various Galaxy features.

They could actually be used straight

Thanks for the clarification but I am not sure this will help a lot of people who are interested to install and run galaxy locally mainly for the following reasons. May be it is just local to me. A. We already one instance of data saved on the local file system B. Making another copy via galaxy will eat away a lot of space in long run. C. The time needed to import the files into galaxy space is huge

...

...
...
away by the path specified.

What do you mean by "the path specified"?

Well what I mean was a way to specify the path of the file/run on the lcoal file system and galaxy could directly pick it up from there rather than uploading it into its own space. Now I understand this might not work based on the way the system was designed.

...

Also is there any way to access the

...
...
scripts for analysis on the command line. I know this undermines the main aim of working with galaxy but rite now I am concerned about the performance/time.

You should be able to run any Galaxy tool from the command line as long as you have all of the tool's required binaries in your path. However, running a tool from within Galaxy should generally not be any slower than running it outside of Galaxy, depending, of course, on what you are doing.

Ok I was under the impression that running from SHELL will eliminate the step of uploading them into galaxy file space. -Abhi

...

...
...
I will be happy to discuss more about this in case you have some comments/questions for me.

Best, -Abhi

-----------------------------

Abhishek Pratap

Bioinformatics Software Engineer

Institute for Genome Sciences

School of Medicine, Univ of Maryland

801, W. Baltimore Street, Baltimore, MD 21209

Ph: (+1)-410-706-2296

www.igs.umaryland.edu/ _______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Anton Nekrutenko http://nekrut.bx.psu.edu http://galaxyproject.org

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Greg Von Kuster

8:26 p.m.

New subject: Experience with Loading NGS data on standalone instance of galaxy

Hello Abishek, We are currently in the process of significantly enhancing the current Galaxy upload utilities, and the new version should eliminate the issue you've raised about the time needed to upload large files via HTTP ( not for making an initial copy of the file in the Galaxy environment ). However, it will probably not be ready for release for a few more weeks, so if you can take advantage of Assaf's script in the meantime, that's great. I can't guarantee that all Galaxy features will function correctly if you do this though. Assaf, have you found that using your script breaks anything? Also, if you upload a file to a library rather than a history, multiple users can "import" the library dataset into their history for analysis, but there is only 1 file on disk ( users are pointing to it from their histories ). But uploading a file to a history will create a new copy of the file each time it is uploaded. Greg Von Kuster Galaxy Development Team Abhishek Pratap wrote:

...

Hi All

@Greg : Please find my comments below.

On Tue, Jul 21, 2009 at 10:44 AM, Greg Von Kuster<ghv2@psu.edu> wrote:

...
Hello Abhi,

Can you clarify the steps you took that produced the behavior? †See my comments below.

...
Abhishek:

Let talk. This is the area of active current development. We are †looking at implementing a universal fastq-like format or supporting †multiple formats. Perhaps we should join efforts in ironing out †specifications.

anton galaxy team

On Jul 20, 2009, at 5:18 PM, Abhishek Pratap wrote:

...
Hi All

I recently came to know about NGS analysis on galaxy during ISMB. Getting excited I tried couple of things basically to play with it.

Few comments : I may have interepretted something described below in a wrong way. My apologies before hand.

On a standalone installation of galaxy while I was trying to explore one FASTQ(sequence) file. It takes considerable (> 20 min) for a fastq file to get uploaded (2 GB). Are you using the Galaxy upload utility to create an item in your history

Anton Nekrutenko wrote: that points to the dataset file on disk?

Yes that is precisely correct, I am trying to upload a solexa FASTQ file but on a standalone galaxy installation from my local file system.

...
...
...
behind that. Ideally I think there should be no need to upload such heavy files into the workspace. A data file that originates from a place external to Galaxy must be uploaded into Galaxy so that the disk file can be placed in the location configured in the Galaxy config file. †Also, when data is uploaded to Galaxy ( either to a history or a library ), several database table settings are created

I am not sure what is the rationale that are used by various Galaxy features.

They could actually be used straight

Thanks for the clarification but I am not sure this will help a lot of people who are interested to install and run galaxy locally mainly for the following reasons. May be it is just local to me.

A. We already one instance of data saved on the local file system B. Making another copy via galaxy will eat away a lot of space in long run. C. The time needed to import the files into galaxy space is huge

...
...
...
away by the path specified. What do you mean by "the path specified"?

Well what I mean was a way to specify the path of the file/run on the lcoal file system and galaxy could directly pick it up from there rather than uploading it into its own space. Now I understand this might not work based on the way the system was designed.

...
Also is there any way to access the

...
...
scripts for analysis on the command line. I know this undermines the main aim of working with galaxy but rite now I am concerned about the performance/time. You should be able to run any Galaxy tool from the command line as long as you have all of the tool's required binaries in your path. †However, running a tool from within Galaxy should generally not be any slower than running it outside of Galaxy, depending, of course, on what you are doing.

Ok I was under the impression that running from SHELL will eliminate the step of uploading them into galaxy file space.

-Abhi

...
...
...
I will be happy to discuss more about this in case you have some comments/questions for me.

Best, -Abhi

-----------------------------

Abhishek Pratap

Bioinformatics Software Engineer

Institute for Genome Sciences

School of Medicine, Univ of Maryland

801, W. Baltimore Street, Baltimore, MD 21209

Ph: (+1)-410-706-2296

www.igs.umaryland.edu/ _______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user Anton Nekrutenko http://nekrut.bx.psu.edu http://galaxyproject.org

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Abhishek Pratap

25 Sep 25 Sep

7:44 p.m.

Hi Greg, Anton and all Just wondering if there has been any progress made on this end. I am sorry I was not able to follow it up on Assaf's suggestion due to other things at work. I did try the latest version of galaxy and looks like the files are still transferred over HTTP before they could be used in the galaxy workspace. Also I would again like to highlight that many labs might want to use the local instance of galaxy and prefer to point to a local path where the file is being stored. That way we will have both the benefits of using a cool GUI and process data stored locally. Let me know if you guys need some feedback or have more questions. I will be happy to discuss them. best, -Abhi On Tue, Jul 21, 2009 at 4:26 PM, Greg Von Kuster <ghv2@psu.edu> wrote:

...

Hello Abishek,

We are currently in the process of significantly enhancing the current Galaxy upload utilities, and the new version should eliminate the issue you've raised about the time needed to upload large files via HTTP ( not for making an initial copy of the file in the Galaxy environment ). However, it will probably not be ready for release for a few more weeks, so if you can take advantage of Assaf's script in the meantime, that's great. I can't guarantee that all Galaxy features will function correctly if you do this though.

Assaf, have you found that using your script breaks anything?

Also, if you upload a file to a library rather than a history, multiple users can "import" the library dataset into their history for analysis, but there is only 1 file on disk ( users are pointing to it from their histories ). But uploading a file to a history will create a new copy of the file each time it is uploaded.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote:

...
Hi All

@Greg : Please find my comments below.

On Tue, Jul 21, 2009 at 10:44 AM, Greg Von Kuster<ghv2@psu.edu> wrote:

...
Hello Abhi,

Can you clarify the steps you took that produced the behavior? †See my comments below.

Anton Nekrutenko wrote:

...
Abhishek:

Let talk. This is the area of active current development. We are †looking at implementing a universal fastq-like format or supporting †multiple formats. Perhaps we should join efforts in ironing out †specifications.

anton galaxy team

On Jul 20, 2009, at 5:18 PM, Abhishek Pratap wrote:

Hi All

...
I recently came to know about NGS analysis on galaxy during ISMB. Getting excited I tried couple of things basically to play with it.

Few comments : I may have interepretted something described below in a wrong way. My apologies before hand.

On a standalone installation of galaxy while I was trying to explore one FASTQ(sequence) file. It takes considerable (> 20 min) for a fastq file to get uploaded (2 GB).

Are you using the Galaxy upload utility to create an item in your history that points to the dataset file on disk?

Yes that is precisely correct, I am trying to upload a solexa FASTQ file but on a standalone galaxy installation from my local file system.

I am not sure what is the rationale

...
...
behind that. Ideally I think there should be no need to upload such

...
heavy files into the workspace.

A data file that originates from a place external to Galaxy must be uploaded into Galaxy so that the disk file can be placed in the location configured in the Galaxy config file. †Also, when data is uploaded to Galaxy ( either to a history or a library ), several database table settings are created that are used by various Galaxy features.

They could actually be used straight

Thanks for the clarification but I am not sure this will help a lot of people who are interested to install and run galaxy locally mainly for the following reasons. May be it is just local to me.

A. We already one instance of data saved on the local file system B. Making another copy via galaxy will eat away a lot of space in long run. C. The time needed to import the files into galaxy space is huge

away by the path specified.

...
...
...
What do you mean by "the path specified"?

Well what I mean was a way to specify the path of the file/run on the lcoal file system and galaxy could directly pick it up from there rather than uploading it into its own space. Now I understand this might not work based on the way the system was designed.

Also is there any way to access the

...
...
scripts for analysis on the command line. I know this undermines the

...
main aim of working with galaxy but rite now I am concerned about the performance/time.

You should be able to run any Galaxy tool from the command line as long as you have all of the tool's required binaries in your path. †However, running a tool from within Galaxy should generally not be any slower than running it outside of Galaxy, depending, of course, on what you are doing.

Ok I was under the impression that running from SHELL will eliminate the step of uploading them into galaxy file space.

-Abhi

...
I will be happy to discuss more about this in case you have some

...
...
comments/questions for me.

Best, -Abhi

-----------------------------

Abhishek Pratap

Bioinformatics Software Engineer

Institute for Genome Sciences

School of Medicine, Univ of Maryland

801, W. Baltimore Street, Baltimore, MD 21209

Ph: (+1)-410-706-2296

www.igs.umaryland.edu/ _______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Anton Nekrutenko http://nekrut.bx.psu.edu http://galaxyproject.org

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Ido M. Tamir

26 Sep 26 Sep

6:27 a.m.

On Friday 25 September 2009 21:44:36 Abhishek Pratap wrote:

...

Hi Greg, Anton and all

Just wondering if there has been any progress made on this end. I am sorry I was not able to follow it up on Assaf's suggestion due to other things at work.

I did try the latest version of galaxy and looks like the files are still transferred over HTTP before they could be used in the galaxy workspace. Also I would again like to highlight that many labs might want to use the local instance of galaxy and prefer to point to a local path where the file is being stored. That way we will have both the benefits of using a cool GUI and process data stored locally.

Let me know if you guys need some feedback or have more questions. I will be happy to discuss them.

If you need a more elaborate solution creating folders per group and user you could take a look at what I have written up here: http://idotamir.blogspot.com/2009/08/adventures-in-galaxy-pt1.html http://idotamir.blogspot.com/2009/09/adventures-in-galaxy-pt2.html The solution given at galaxy-central-importer should just work(tm) and show you what you can do. But just don't merge it into the current version - this will not work. best, ido

Greg Von Kuster

28 Sep 28 Sep

1:28 p.m.

New subject: Experience with Loading NGS data on standalone instance of galaxy

Hello Abhishek, The Galaxy distribution includes the enhancements to which I previously referred for uploading history files. Uploading files to a history now creates a Galaxy job just like any other tool, and can be run on a cluster node, allowing upload of very large files. The initial pass of this work is also completed for uploading to a Data Library, but this enhancement is still in test, so it should soon be available in the distribution. Do you want to avoid having to import at all (e.g. allow Galaxy to refer to datasets that live in their original locations)? This is not currently possible, but if this is what you are looking for, we can consider some additional options on the current upload form, or possibly a new, separate form. Greg Von Kuster Galaxy Development Team Abhishek Pratap wrote:

...

Hi Greg, Anton and all

Just wondering if there has been any progress made on this end. I am sorry I was not able to follow it up on Assaf's suggestion due to other things at work.

I did try the latest version of galaxy and looks like the files are still transferred over HTTP before they could be used in the galaxy workspace. Also I would again like to highlight that many labs might want to use the local instance of galaxy and prefer to point to a local path where the file is being stored. That way we will have both the benefits of using a cool GUI and process data stored locally.

Let me know if you guys need some feedback or have more questions. I will be happy to discuss them.

best, -Abhi

On Tue, Jul 21, 2009 at 4:26 PM, Greg Von Kuster <ghv2@psu.edu <mailto:ghv2@psu.edu>> wrote:

Hello Abishek,

We are currently in the process of significantly enhancing the current Galaxy upload utilities, and the new version should eliminate the issue you've raised about the time needed to upload large files via HTTP ( not for making an initial copy of the file in the Galaxy environment ). However, it will probably not be ready for release for a few more weeks, so if you can take advantage of Assaf's script in the meantime, that's great. ¬†I can't guarantee that all Galaxy features will function correctly if you do this though.

Assaf, have you found that using your script breaks anything?

Also, if you upload a file to a library rather than a history, multiple users can "import" the library dataset into their history for analysis, but there is only 1 file on disk ( users are pointing to it from their histories ). ¬†But uploading a file to a history will create a new copy of the file each time it is uploaded.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote:

Hi All

@Greg : Please find my comments below.

On Tue, Jul 21, 2009 at 10:44 AM, Greg Von Kuster<ghv2@psu.edu <mailto:ghv2@psu.edu>> wrote:

Hello Abhi,

Can you clarify the steps you took that produced the behavior? ‚Ä†See my comments below.

Anton Nekrutenko wrote:

Abhishek:

Let talk. This is the area of active current development. We are ‚Ä†looking at implementing a universal fastq-like format or supporting ‚Ä†multiple formats. Perhaps we should join efforts in ironing out ‚Ä†specifications.

anton galaxy team

On Jul 20, 2009, at 5:18 PM, Abhishek Pratap wrote:

Hi All

I recently came to know about NGS analysis on galaxy during ISMB. Getting excited I tried couple of things basically to play with it.

Few comments : I may have interepretted something described below in a wrong way. My apologies before hand.

On a standalone installation of galaxy while I was trying to explore one FASTQ(sequence) file. It takes considerable (> 20 min) for a fastq file to get uploaded (2 GB).

Are you using the Galaxy upload utility to create an item in your history that points to the dataset file on disk?

Yes that is precisely correct, I am trying to upload a solexa FASTQ file but on a standalone galaxy installation from my local file system.

I am not sure what is the rationale

behind that. Ideally I think there should be no need to upload such heavy files into the workspace.

A data file that originates from a place external to Galaxy must be uploaded into Galaxy so that the disk file can be placed in the location configured in the Galaxy config file. ‚Ä†Also, when data is uploaded to Galaxy ( either to a history or a library ), several database table settings are created that are used by various Galaxy features.

They could actually be used straight

Thanks for the clarification but I am not sure this will help a lot of people who are interested to install and run galaxy locally mainly for the following reasons. May be it is just local to me.

A. We already one instance of data saved on the local file system B. Making another copy via galaxy will eat away a lot of space in long run. C. The time needed to import the files into galaxy space is huge

away by the path specified.

What do you mean by "the path specified"?

Well what I mean was a way to specify the path of the file/run on the lcoal file system and galaxy could directly pick it up from there rather than uploading it into its own space. Now I understand this might not work based on the way the system was designed.

Also is there any way to access the

scripts for analysis on the command line. I know this undermines the main aim of working with galaxy but rite now I am concerned about the performance/time.

You should be able to run any Galaxy tool from the command line as long as you have all of the tool's required binaries in your path. ‚Ä†However, running a tool from within Galaxy should generally not be any slower than running it outside of Galaxy, depending, of course, on what you are doing.

Ok I was under the impression that running from SHELL will eliminate the step of uploading them into galaxy file space.

-Abhi

I will be happy to discuss more about this in case you have some comments/questions for me.

Best, -Abhi

-----------------------------

Abhishek Pratap

Bioinformatics Software Engineer

Institute for Genome Sciences

School of Medicine, Univ of Maryland

801, W. Baltimore Street, Baltimore, MD 21209

Ph: (+1)-410-706-2296

www.igs.umaryland.edu/ <http://www.igs.umaryland.edu/> _______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu> http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Anton Nekrutenko http://nekrut.bx.psu.edu http://galaxyproject.org

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu> http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Matthias Dodt

1:53 p.m.

New subject: Experience with Loading NGS data on standalone instance of galaxy

Hi Guys! Thanks for the reply - I wrote a simple .xml which allows the import of any file which is accessible by the galaxy process (read permission) and is mounted in the current file system of the galaxy server. in the web-interface you have to type in the absolute path+filename manually- It copies the file into galaxy (unix cp used). The uploaded file is assumed to be a text file and has to be set manually to the correct file format- @nate: Thanks! ill have a look at that too- Greetings, mat <tool id="importer_1" name="Cluster file importer"> <description>copies files from import directory into galaxy</description> <command> cp $source $target 2> $log_report </command> <inputs> <param name="source" type="text" label="Absolute path+filename to file on cluster" optional="false" size="60"/> </inputs> <outputs> <data name="target" format="text" label="Imported file" /> <data format="text" name="log_report" label="Detailed log report from importer"/> </outputs> <help> **What it does** This tool imports a file from the cluster into galaxy. The file will be copied into the galaxy environment, the original file remains on the cluster. **!Important advice!** Please make sure the uploaded file is set to the correct file format! Otherwise tools cannot access this file. This can be done by editing the properties of the uploaded file (pen symbol in the history). The default assumed file format is -text- which might be incorrect it most cases. Please keep although in mind that disk space on the galaxy server is limited. **Example** /home/mat/myfile.fa **Support** Feel free to contact us if you cant upload your files into galaxy. </help> </tool>

Abhishek Pratap

6:24 p.m.

HI Greg Thanks for a quick reply and making some requested changes. However I am not still sure if importing NGS data will help in long run. For Centers generating NGS data which could 2-3 T.B / week depending on no. of sequencers I think importing another copy of raw data into galaxy workspace will be asking for lot of disk space. I understand it is a neat way of doing things as it becomes agnostic of the raw data location but might not be the best way for handling huge data in long run for centers like ours. Please correct me if I am wrong. I think we could also have a simple option without having to import the data and just using it for analysis from the current location, also storing results at the same location. That way in future even if the data set is moved analysis also stays with it. Let me know what you feel. I will be happy to know if there are any other smart reasons of importing the data in galaxy workspace just for curiosity sake. Thanks, -Abhi On Mon, Sep 28, 2009 at 9:28 AM, Greg Von Kuster <ghv2@psu.edu> wrote:

...

Hello Abhishek,

The Galaxy distribution includes the enhancements to which I previously referred for uploading history files. Uploading files to a history now creates a Galaxy job just like any other tool, and can be run on a cluster node, allowing upload of very large files. The initial pass of this work is also completed for uploading to a Data Library, but this enhancement is still in test, so it should soon be available in the distribution.

Do you want to avoid having to import at all (e.g. allow Galaxy to refer to datasets that live in their original locations)? This is not currently possible, but if this is what you are looking for, we can consider some additional options on the current upload form, or possibly a new, separate form.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote:

...
Hi Greg, Anton and all

Just wondering if there has been any progress made on this end. I am sorry I was not able to follow it up on Assaf's suggestion due to other things at work.

I did try the latest version of galaxy and looks like the files are still transferred over HTTP before they could be used in the galaxy workspace. Also I would again like to highlight that many labs might want to use the local instance of galaxy and prefer to point to a local path where the file is being stored. That way we will have both the benefits of using a cool GUI and process data stored locally.

Let me know if you guys need some feedback or have more questions. I will be happy to discuss them.

best, -Abhi

On Tue, Jul 21, 2009 at 4:26 PM, Greg Von Kuster <ghv2@psu.edu <mailto: ghv2@psu.edu>> wrote:

Hello Abishek,

We are currently in the process of significantly enhancing the current Galaxy upload utilities, and the new version should eliminate the issue you've raised about the time needed to upload large files via HTTP ( not for making an initial copy of the file in the Galaxy environment ). However, it will probably not be ready for release for a few more weeks, so if you can take advantage of Assaf's script in the meantime, that's great. ¬†I can't guarantee that all Galaxy features will function correctly if you do this though.

Assaf, have you found that using your script breaks anything?

Also, if you upload a file to a library rather than a history, multiple users can "import" the library dataset into their history for analysis, but there is only 1 file on disk ( users are pointing to it from their histories ). ¬†But uploading a file to a history will create a new copy of the file each time it is uploaded.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote:

Hi All

@Greg : Please find my comments below.

On Tue, Jul 21, 2009 at 10:44 AM, Greg Von Kuster<ghv2@psu.edu <mailto:ghv2@psu.edu>> wrote:

Hello Abhi,

Can you clarify the steps you took that produced the behavior? ‚Ä†See my comments below.

Anton Nekrutenko wrote:

Abhishek:

Let talk. This is the area of active current development. We are ‚Ä†looking at implementing a universal fastq-like format or supporting ‚Ä†multiple formats. Perhaps we should join efforts in ironing out ‚Ä†specifications.

anton galaxy team

On Jul 20, 2009, at 5:18 PM, Abhishek Pratap wrote:

Hi All

I recently came to know about NGS analysis on galaxy during ISMB. Getting excited I tried couple of things basically to play with it.

Few comments : I may have interepretted something described below in a wrong way. My apologies before hand.

On a standalone installation of galaxy while I was trying to explore one FASTQ(sequence) file. It takes considerable (> 20 min) for a fastq file to get uploaded (2 GB).

Are you using the Galaxy upload utility to create an item in your history that points to the dataset file on disk?

Yes that is precisely correct, I am trying to upload a solexa FASTQ file but on a standalone galaxy installation from my local file system.

I am not sure what is the rationale

behind that. Ideally I think there should be no need to upload such heavy files into the workspace.

A data file that originates from a place external to Galaxy must be uploaded into Galaxy so that the disk file can be placed in the location configured in the Galaxy config file. ‚Ä†Also, when data is uploaded to

Galaxy ( either to a history or a library ), several database table settings are created that are used by various Galaxy features.

They could actually be used straight

Thanks for the clarification but I am not sure this will help a lot of people who are interested to install and run galaxy locally mainly for the following reasons. May be it is just local to me.

A. We already one instance of data saved on the local file system B. Making another copy via galaxy will eat away a lot of space in long run. C. The time needed to import the files into galaxy space is huge

away by the path specified.

What do you mean by "the path specified"?

Well what I mean was a way to specify the path of the file/run on the lcoal file system and galaxy could directly pick it up from there rather than uploading it into its own space. Now I understand this might not work based on the way the system was designed.

Also is there any way to access the

scripts for analysis on the command line. I know this undermines the main aim of working with galaxy but rite now I am concerned about the performance/time.

You should be able to run any Galaxy tool from the command line as long as you have all of the tool's required binaries in your path. ‚Ä†However, running a tool from within Galaxy should generally not be any slower than running it outside of Galaxy, depending, of course, on what you are doing.

Ok I was under the impression that running from SHELL will eliminate the step of uploading them into galaxy file space.

-Abhi

I will be happy to discuss more about this in case you have some comments/questions for me.

Best, -Abhi

-----------------------------

Abhishek Pratap

Bioinformatics Software Engineer

Institute for Genome Sciences

School of Medicine, Univ of Maryland

801, W. Baltimore Street, Baltimore, MD 21209

Ph: (+1)-410-706-2296

www.igs.umaryland.edu/ <http://www.igs.umaryland.edu/> _______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Anton Nekrutenko http://nekrut.bx.psu.edu http://galaxyproject.org

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Oliver Hofmann

6:31 p.m.

Dear all, to echo what Abhi said: we are also currently looking of ways to automatically import data sets (libraries) into Galaxy without having to manually trigger the import via the administration interface, and ideally while keeping the data in the original place. The idea here is to have multiple tools all point at the original 'source data' without having to replicate terabytes of data. Not quite sure how feasible this is in practice, but it certainly would be incredibly helpful. Best, Oliver On 28 Sep 2009, at 14:24, Abhishek Pratap wrote:

...

HI Greg

Thanks for a quick reply and making some requested changes. However I am not still sure if importing NGS data will help in long run.

For Centers generating NGS data which could 2-3 T.B / week depending on no. of sequencers I think importing another copy of raw data into galaxy workspace will be asking for lot of disk space. I understand it is a neat way of doing things as it becomes agnostic of the raw data location but might not be the best way for handling huge data in long run for centers like ours.

Please correct me if I am wrong. I think we could also have a simple option without having to import the data and just using it for analysis from the current location, also storing results at the same location. That way in future even if the data set is moved analysis also stays with it.

Let me know what you feel. I will be happy to know if there are any other smart reasons of importing the data in galaxy workspace just for curiosity sake.

Thanks, -Abhi

On Mon, Sep 28, 2009 at 9:28 AM, Greg Von Kuster <ghv2@psu.edu> wrote: Hello Abhishek,

The Galaxy distribution includes the enhancements to which I previously referred for uploading history files. Uploading files to a history now creates a Galaxy job just like any other tool, and can be run on a cluster node, allowing upload of very large files. The initial pass of this work is also completed for uploading to a Data Library, but this enhancement is still in test, so it should soon be available in the distribution.

Do you want to avoid having to import at all (e.g. allow Galaxy to refer to datasets that live in their original locations)? This is not currently possible, but if this is what you are looking for, we can consider some additional options on the current upload form, or possibly a new, separate form.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote: Hi Greg, Anton and all

Just wondering if there has been any progress made on this end. I am sorry I was not able to follow it up on Assaf's suggestion due to other things at work.

I did try the latest version of galaxy and looks like the files are still transferred over HTTP before they could be used in the galaxy workspace. Also I would again like to highlight that many labs might want to use the local instance of galaxy and prefer to point to a local path where the file is being stored. That way we will have both the benefits of using a cool GUI and process data stored locally.

Let me know if you guys need some feedback or have more questions. I will be happy to discuss them.

best, -Abhi

On Tue, Jul 21, 2009 at 4:26 PM, Greg Von Kuster <ghv2@psu.edu <mailto:ghv2@psu.edu

...
...
wrote:

Hello Abishek,

We are currently in the process of significantly enhancing the current Galaxy upload utilities, and the new version should eliminate the issue you've raised about the time needed to upload large files via HTTP ( not for making an initial copy of the file in the Galaxy environment ). However, it will probably not be ready for release for a few more weeks, so if you can take advantage of Assaf's script in the meantime, that's great. ¬†I can't guarantee that all Galaxy features will function correctly if you do this though.

Assaf, have you found that using your script breaks anything?

Also, if you upload a file to a library rather than a history, multiple users can "import" the library dataset into their history for analysis, but there is only 1 file on disk ( users are pointing to it from their histories ). ¬†But uploading a file to a history will create a new copy of the file each time it is uploaded.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote:

Hi All

@Greg : Please find my comments below.

On Tue, Jul 21, 2009 at 10:44 AM, Greg Von Kuster<ghv2@psu.edu <mailto:ghv2@psu.edu>> wrote:

Hello Abhi,

Can you clarify the steps you took that produced the behavior? ‚Ä†See my

comments below.

Anton Nekrutenko wrote:

Abhishek:

Let talk. This is the area of active current development. We are ‚Ä†looking

at implementing a universal fastq-like format or supporting ‚Ä†multiple

formats. Perhaps we should join efforts in ironing out ‚Ä†specifications.

anton galaxy team

On Jul 20, 2009, at 5:18 PM, Abhishek Pratap wrote:

Hi All

I recently came to know about NGS analysis on galaxy during ISMB. Getting excited I tried couple of things basically to play with it.

Few comments : I may have interepretted something described below in a wrong way. My apologies before hand.

On a standalone installation of galaxy while I was trying to explore one FASTQ(sequence) file. It takes considerable (> 20 min) for a fastq file to get uploaded (2 GB).

Are you using the Galaxy upload utility to create an item in your history that points to the dataset file on disk?

Yes that is precisely correct, I am trying to upload a solexa FASTQ file but on a standalone galaxy installation from my local file system.

I am not sure what is the rationale

behind that. Ideally I think there should be no need to upload such heavy files into the workspace.

A data file that originates from a place external to Galaxy must be uploaded into Galaxy so that the disk file can be placed in the location configured in the Galaxy config file. ‚Ä†Also, when data is uploaded to

Galaxy ( either to a history or a library ), several database table settings are created that are used by various Galaxy features.

They could actually be used straight

Thanks for the clarification but I am not sure this will help a lot of people who are interested to install and run galaxy locally mainly for the following reasons. May be it is just local to me.

A. We already one instance of data saved on the local file system B. Making another copy via galaxy will eat away a lot of space in long run. C. The time needed to import the files into galaxy space is huge

away by the path specified.

What do you mean by "the path specified"?

Well what I mean was a way to specify the path of the file/run on the lcoal file system and galaxy could directly pick it up from there rather than uploading it into its own space. Now I understand this might not work based on the way the system was designed.

Also is there any way to access the

scripts for analysis on the command line. I know this undermines the main aim of working with galaxy but rite now I am concerned about the performance/time.

You should be able to run any Galaxy tool from the command line as long as you have all of the tool's required binaries in your path. ‚Ä†However, running

a tool from within Galaxy should generally not be any slower than running it outside of Galaxy, depending, of course, on what you are doing.

Ok I was under the impression that running from SHELL will eliminate the step of uploading them into galaxy file space.

-Abhi

I will be happy to discuss more about this in case you have some comments/questions for me.

Best, -Abhi

-----------------------------

Abhishek Pratap

Bioinformatics Software Engineer

Institute for Genome Sciences

School of Medicine, Univ of Maryland

801, W. Baltimore Street, Baltimore, MD 21209

Ph: (+1)-410-706-2296

www.igs.umaryland.edu/ <http://www.igs.umaryland.edu/

...
_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu

...
http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Anton Nekrutenko http://nekrut.bx.psu.edu http://galaxyproject.org

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

-- Research Associate Department of Biostatistics Associate Director Bioinformatics Core Harvard School of Public Health Skype: ohofmann Phone: +1 (617) 365 0984

Ido M. Tamir

29 Sep 29 Sep

8:16 a.m.

Dear Oliver,

...

Not quite sure how feasible this is in practice, but it certainly would be incredibly helpful.

It is quite simple. If you have 7 minutes do the following from the command line: hg clone http://bitbucket.org/ido/galaxy-central-importer gci cd gci hg update -C importer sh setup.sh #sets up basic things ./run.sh #end with ctrl-c after startup completes (serving on ....) ./importer.sh #starts the importer wait for "done importing" ./run.sh then open your browser at localhost:8080/galaxy You can log in with 3 different usernames/passwords: name1@/123456 #can only see group1 libs name2@123456 #can only see group2 libs tamir@/123456 #can see all libs Of course only the file paths get inserted into the db. more on: http://idotamir.blogspot.com best wishes, ido

Greg Von Kuster

2 Oct 2 Oct

2:21 p.m.

New subject: Experience with Loading NGS data on standalone instance of galaxy

Change set 2812 will be included in a release to the distribution today - here are details of a new option that we're hoping will provide what is needed for most labs. Add a new option, 'allow_library_path_paste' that adds a new upload page ("Upload files from file system paths") to the admin-side library upload pages. This form contains a textarea that allows Galaxy admins to paste any number of file system paths (files or directories) from which Galaxy will import library datasets, saving the directory structure (if desired). Since such ability allows admins access to any file on the Galaxy server which is readable by Galaxy's system user, this option is disabled by default, and system administrators should take care in assigning Galaxy administrators when this feature is enabled. Controls on what files are accessible to this tool based on ownership or other properties can be added at a later date if there is sufficient interest for such features. This commit also includes a checkbox on the "Upload directory of files" page (as well as the new "Upload files from file system paths" page above) that will prevent Galaxy from copying data to its files directory (by default, 'database/files/'). This is useful for large library datasets that live in their own managed locations on the file system, this will prevent the existence of duplicate copies of datasets (but means administrators must take care to manage data - moving or removing the data from its Galaxy-external location will render these datasets invalid within Galaxy). One unique feature to be aware of: when using the "Copy data into Galaxy?" checkbox on the "Upload directory of files" page, any symbolic links encountered in the chosen import directory will be made absolute and dereferenced ONCE. This allows administrators to link large datasets to the import directory, rather than having to make full copies, while being able to delete such links after importing. Only the first symlink (the one in the import directory itself) is dereferenced; all others remain. See the following for an example: library_import_dir = /galaxy/import % ls -lR /galaxy/import /galaxy/import: total 6 drwxr-xr-x 2 nate nate 512 Oct 1 11:31 link/ /galaxy/import/link: total 10 lrwxrwxrwx 1 nate nate 71 Oct 1 10:38 1.bed -> ../../../home/nate/galaxy/test-data/1.bed lrwxrwxrwx 1 nate nate 60 Oct 1 10:38 2.bed -> /home/nate/galaxy/test-data/2.bed lrwxrwxrwx 1 nate nate 11 Oct 1 10:38 3.bed -> ../../3.bed lrwxrwxrwx 1 nate nate 35 Oct 1 11:30 4.bed -> ../../galaxy_symlink/test-data/4.bed lrwxrwxrwx 1 nate nate 41 Oct 1 11:31 5.bed -> /galaxy/galaxy_symlink/test-data/5.bed % ls -l /galaxy/3.bed lrwxrwxrwx 1 nate nate 60 Oct 1 10:39 /galaxy/3.bed -> /home/nate/galaxy/test-data/3.bed % ls -l /galaxy/galaxy_symlink lrwxrwxrwx 1 nate nate 44 Oct 1 11:30 /galaxy/galaxy_symlink -> /home/nate/galaxy/ In this example, 1.bed is a relative symbolic link to the real 1.bed. 2.bed is an absolute symlink to the real 2.bed. 3.bed is a relative symlink to ../../3.bed, aka /galaxy/3.bed, which itself is a symlink to the real 3.bed. 4.bed is a relative symlink which follows another symlink (/galaxy/galaxy_symlink) to the real 4.bed. 5.bed is an absolute symlink in the same fashion as 4.bed If the 'link' server directory is chosen on the "Upload directory of files" page, and "Copy data into Galaxy?" is checked "No", the following files will be referenced by Galaxy: /home/nate/galaxy/test-data/1.bed /home/nate/galaxy/test-data/2.bed /galaxy/3.bed /galaxy/galaxy_symlink/test-data/4.bed /galaxy/galaxy_symlink/test-data/5.bed The Galaxy administrator may now safely delete /galaxy/import/link, but should take care not to remove the referenced symbolic links (/galaxy/3.bed, /galaxy/galaxy_symlink). Not all symbolic links are dereferenced because it is assumed that if an administrator links to a path in the import directory which itself is (or contains) links, that is the preferred path for accessing the data. Oliver Hofmann wrote:

...

Dear all,

to echo what Abhi said: we are also currently looking of ways to automatically import data sets (libraries) into Galaxy without having to manually trigger the import via the administration interface, and ideally while keeping the data in the original place. The idea here is to have multiple tools all point at the original 'source data' without having to replicate terabytes of data.

Not quite sure how feasible this is in practice, but it certainly would be incredibly helpful.

Best,

Oliver

On 28 Sep 2009, at 14:24, Abhishek Pratap wrote:

...
HI Greg

Thanks for a quick reply and making some requested changes. However I am not still sure if importing NGS data will help in long run.

For Centers generating NGS data which could 2-3 T.B / week depending on no. of sequencers I think importing another copy of raw data into galaxy workspace will be asking for lot of disk space. I understand it is a neat way of doing things as it becomes agnostic of the raw data location but might not be the best way for handling huge data in long run for centers like ours.

Please correct me if I am wrong. I think we could also have a simple option without having to import the data and just using it for analysis from the current location, also storing results at the same location. That way in future even if the data set is moved analysis also stays with it.

Let me know what you feel. I will be happy to know if there are any other smart reasons of importing the data in galaxy workspace just for curiosity sake.

Thanks, -Abhi

On Mon, Sep 28, 2009 at 9:28 AM, Greg Von Kuster <ghv2@psu.edu> wrote: Hello Abhishek,

The Galaxy distribution includes the enhancements to which I previously referred for uploading history files. Uploading files to a history now creates a Galaxy job just like any other tool, and can be run on a cluster node, allowing upload of very large files. The initial pass of this work is also completed for uploading to a Data Library, but this enhancement is still in test, so it should soon be available in the distribution.

Do you want to avoid having to import at all (e.g. allow Galaxy to refer to datasets that live in their original locations)? This is not currently possible, but if this is what you are looking for, we can consider some additional options on the current upload form, or possibly a new, separate form.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote: Hi Greg, Anton and all

Just wondering if there has been any progress made on this end. I am sorry I was not able to follow it up on Assaf's suggestion due to other things at work.

I did try the latest version of galaxy and looks like the files are still transferred over HTTP before they could be used in the galaxy workspace. Also I would again like to highlight that many labs might want to use the local instance of galaxy and prefer to point to a local path where the file is being stored. That way we will have both the benefits of using a cool GUI and process data stored locally.

Let me know if you guys need some feedback or have more questions. I will be happy to discuss them.

best, -Abhi

On Tue, Jul 21, 2009 at 4:26 PM, Greg Von Kuster <ghv2@psu.edu <mailto:ghv2@psu.edu>> wrote:

Hello Abishek,

We are currently in the process of significantly enhancing the current Galaxy upload utilities, and the new version should eliminate the issue you've raised about the time needed to upload large files via HTTP ( not for making an initial copy of the file in the Galaxy environment ). However, it will probably not be ready for release for a few more weeks, so if you can take advantage of Assaf's script in the meantime, that's great. ¨ÜI can't guarantee that all Galaxy features will function correctly if you do this though.

Assaf, have you found that using your script breaks anything?

Also, if you upload a file to a library rather than a history, multiple users can "import" the library dataset into their history for analysis, but there is only 1 file on disk ( users are pointing to it from their histories ). ¨ÜBut uploading a file to a history will create a new copy of the file each time it is uploaded.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote:

Hi All

@Greg : Please find my comments below.

On Tue, Jul 21, 2009 at 10:44 AM, Greg Von Kuster<ghv2@psu.edu <mailto:ghv2@psu.edu>> wrote:

Hello Abhi,

Can you clarify the steps you took that produced the behavior? ÇƒÜSee my

comments below.

Anton Nekrutenko wrote:

Abhishek:

Let talk. This is the area of active current development. We are ÇƒÜlooking

at implementing a universal fastq-like format or supporting ÇƒÜmultiple

formats. Perhaps we should join efforts in ironing out ÇƒÜspecifications.

anton galaxy team

On Jul 20, 2009, at 5:18 PM, Abhishek Pratap wrote:

Hi All

I recently came to know about NGS analysis on galaxy during ISMB. Getting excited I tried couple of things basically to play with it.

Few comments : I may have interepretted something described below in a wrong way. My apologies before hand.

On a standalone installation of galaxy while I was trying to explore one FASTQ(sequence) file. It takes considerable (> 20 min) for a fastq file to get uploaded (2 GB).

Are you using the Galaxy upload utility to create an item in your history that points to the dataset file on disk?

Yes that is precisely correct, I am trying to upload a solexa FASTQ file but on a standalone galaxy installation from my local file system.

I am not sure what is the rationale

behind that. Ideally I think there should be no need to upload such heavy files into the workspace.

A data file that originates from a place external to Galaxy must be uploaded into Galaxy so that the disk file can be placed in the location configured in the Galaxy config file. ÇƒÜAlso, when data is uploaded to

Galaxy ( either to a history or a library ), several database table settings are created that are used by various Galaxy features.

They could actually be used straight

Thanks for the clarification but I am not sure this will help a lot of people who are interested to install and run galaxy locally mainly for the following reasons. May be it is just local to me.

A. We already one instance of data saved on the local file system B. Making another copy via galaxy will eat away a lot of space in long run. C. The time needed to import the files into galaxy space is huge

away by the path specified.

What do you mean by "the path specified"?

Well what I mean was a way to specify the path of the file/run on the lcoal file system and galaxy could directly pick it up from there rather than uploading it into its own space. Now I understand this might not work based on the way the system was designed.

Also is there any way to access the

scripts for analysis on the command line. I know this undermines the main aim of working with galaxy but rite now I am concerned about the performance/time.

You should be able to run any Galaxy tool from the command line as long as you have all of the tool's required binaries in your path. ÇƒÜHowever, running

a tool from within Galaxy should generally not be any slower than running it outside of Galaxy, depending, of course, on what you are doing.

Ok I was under the impression that running from SHELL will eliminate the step of uploading them into galaxy file space.

-Abhi

I will be happy to discuss more about this in case you have some comments/questions for me.

Best, -Abhi

-----------------------------

Abhishek Pratap

Bioinformatics Software Engineer

Institute for Genome Sciences

School of Medicine, Univ of Maryland

801, W. Baltimore Street, Baltimore, MD 21209

Ph: (+1)-410-706-2296

www.igs.umaryland.edu/ <http://www.igs.umaryland.edu/> _______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Anton Nekrutenko http://nekrut.bx.psu.edu http://galaxyproject.org

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

-- Research Associate Department of Biostatistics Associate Director Bioinformatics Core Harvard School of Public Health Skype: ohofmann Phone: +1 (617) 365 0984

Abhishek Pratap

4:18 p.m.

Hi Greg Many thanks for accommodating our requests in quick time. I will be testing it right away. Have a good weekend. Cheers, -Abhi On Fri, Oct 2, 2009 at 10:21 AM, Greg Von Kuster <ghv2@psu.edu> wrote:

...

Change set 2812 will be included in a release to the distribution today - here are details of a new option that we're hoping will provide what is needed for most labs.

Add a new option, 'allow_library_path_paste' that adds a new upload page ("Upload files from file system paths") to the admin-side library upload pages. This form contains a textarea that allows Galaxy admins to paste any number of file system paths (files or directories) from which Galaxy will import library datasets, saving the directory structure (if desired). Since such ability allows admins access to any file on the Galaxy server which is readable by Galaxy's system user, this option is disabled by default, and system administrators should take care in assigning Galaxy administrators when this feature is enabled. Controls on what files are accessible to this tool based on ownership or other properties can be added at a later date if there is sufficient interest for such features.

This commit also includes a checkbox on the "Upload directory of files" page (as well as the new "Upload files from file system paths" page above) that will prevent Galaxy from copying data to its files directory (by default, 'database/files/'). This is useful for large library datasets that live in their own managed locations on the file system, this will prevent the existence of duplicate copies of datasets (but means administrators must take care to manage data - moving or removing the data from its Galaxy-external location will render these datasets invalid within Galaxy).

One unique feature to be aware of: when using the "Copy data into Galaxy?" checkbox on the "Upload directory of files" page, any symbolic links encountered in the chosen import directory will be made absolute and dereferenced ONCE. This allows administrators to link large datasets to the import directory, rather than having to make full copies, while being able to delete such links after importing. Only the first symlink (the one in the import directory itself) is dereferenced; all others remain. See the following for an example:

library_import_dir = /galaxy/import

% ls -lR /galaxy/import /galaxy/import: total 6 drwxr-xr-x 2 nate nate 512 Oct 1 11:31 link/

/galaxy/import/link: total 10 lrwxrwxrwx 1 nate nate 71 Oct 1 10:38 1.bed -> ../../../home/nate/galaxy/test-data/1.bed lrwxrwxrwx 1 nate nate 60 Oct 1 10:38 2.bed -> /home/nate/galaxy/test-data/2.bed lrwxrwxrwx 1 nate nate 11 Oct 1 10:38 3.bed -> ../../3.bed lrwxrwxrwx 1 nate nate 35 Oct 1 11:30 4.bed -> ../../galaxy_symlink/test-data/4.bed lrwxrwxrwx 1 nate nate 41 Oct 1 11:31 5.bed -> /galaxy/galaxy_symlink/test-data/5.bed

% ls -l /galaxy/3.bed lrwxrwxrwx 1 nate nate 60 Oct 1 10:39 /galaxy/3.bed -> /home/nate/galaxy/test-data/3.bed

% ls -l /galaxy/galaxy_symlink lrwxrwxrwx 1 nate nate 44 Oct 1 11:30 /galaxy/galaxy_symlink -> /home/nate/galaxy/

In this example,

1.bed is a relative symbolic link to the real 1.bed.

2.bed is an absolute symlink to the real 2.bed.

3.bed is a relative symlink to ../../3.bed, aka /galaxy/3.bed, which itself is a symlink to the real 3.bed.

4.bed is a relative symlink which follows another symlink (/galaxy/galaxy_symlink) to the real 4.bed.

5.bed is an absolute symlink in the same fashion as 4.bed

If the 'link' server directory is chosen on the "Upload directory of files" page, and "Copy data into Galaxy?" is checked "No", the following files will be referenced by Galaxy:

/home/nate/galaxy/test-data/1.bed /home/nate/galaxy/test-data/2.bed /galaxy/3.bed /galaxy/galaxy_symlink/test-data/4.bed /galaxy/galaxy_symlink/test-data/5.bed

The Galaxy administrator may now safely delete /galaxy/import/link, but should take care not to remove the referenced symbolic links (/galaxy/3.bed, /galaxy/galaxy_symlink).

Not all symbolic links are dereferenced because it is assumed that if an administrator links to a path in the import directory which itself is (or contains) links, that is the preferred path for accessing the data.

Oliver Hofmann wrote:

...
Dear all,

to echo what Abhi said: we are also currently looking of ways to automatically import data sets (libraries) into Galaxy without having to manually trigger the import via the administration interface, and ideally while keeping the data in the original place. The idea here is to have multiple tools all point at the original 'source data' without having to replicate terabytes of data.

Not quite sure how feasible this is in practice, but it certainly would be incredibly helpful.

Best,

Oliver

On 28 Sep 2009, at 14:24, Abhishek Pratap wrote:

...
HI Greg

Thanks for a quick reply and making some requested changes. However I am not still sure if importing NGS data will help in long run.

For Centers generating NGS data which could 2-3 T.B / week depending on no. of sequencers I think importing another copy of raw data into galaxy workspace will be asking for lot of disk space. I understand it is a neat way of doing things as it becomes agnostic of the raw data location but might not be the best way for handling huge data in long run for centers like ours.

Please correct me if I am wrong. I think we could also have a simple option without having to import the data and just using it for analysis from the current location, also storing results at the same location. That way in future even if the data set is moved analysis also stays with it.

Let me know what you feel. I will be happy to know if there are any other smart reasons of importing the data in galaxy workspace just for curiosity sake.

Thanks, -Abhi

On Mon, Sep 28, 2009 at 9:28 AM, Greg Von Kuster <ghv2@psu.edu> wrote: Hello Abhishek,

The Galaxy distribution includes the enhancements to which I previously referred for uploading history files. Uploading files to a history now creates a Galaxy job just like any other tool, and can be run on a cluster node, allowing upload of very large files. The initial pass of this work is also completed for uploading to a Data Library, but this enhancement is still in test, so it should soon be available in the distribution.

Do you want to avoid having to import at all (e.g. allow Galaxy to refer to datasets that live in their original locations)? This is not currently possible, but if this is what you are looking for, we can consider some additional options on the current upload form, or possibly a new, separate form.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote: Hi Greg, Anton and all

Just wondering if there has been any progress made on this end. I am sorry I was not able to follow it up on Assaf's suggestion due to other things at work.

I did try the latest version of galaxy and looks like the files are still transferred over HTTP before they could be used in the galaxy workspace. Also I would again like to highlight that many labs might want to use the local instance of galaxy and prefer to point to a local path where the file is being stored. That way we will have both the benefits of using a cool GUI and process data stored locally.

Let me know if you guys need some feedback or have more questions. I will be happy to discuss them.

best, -Abhi

On Tue, Jul 21, 2009 at 4:26 PM, Greg Von Kuster <ghv2@psu.edu <mailto:ghv2@psu.edu>> wrote:

Hello Abishek,

We are currently in the process of significantly enhancing the current Galaxy upload utilities, and the new version should eliminate the issue you've raised about the time needed to upload large files via HTTP ( not for making an initial copy of the file in the Galaxy environment ). However, it will probably not be ready for release for a few more weeks, so if you can take advantage of Assaf's script in the meantime, that's great. ¨ÜI can't guarantee that all Galaxy features will function correctly if you do this though.

Assaf, have you found that using your script breaks anything?

Also, if you upload a file to a library rather than a history, multiple users can "import" the library dataset into their history for analysis, but there is only 1 file on disk ( users are pointing to it from their histories ). ¨ÜBut uploading a file to a history will create a new copy of the file each time it is uploaded.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote:

Hi All

@Greg : Please find my comments below.

On Tue, Jul 21, 2009 at 10:44 AM, Greg Von Kuster<ghv2@psu.edu <mailto:ghv2@psu.edu>> wrote:

Hello Abhi,

Can you clarify the steps you took that produced the behavior? ÇƒÜSee my

comments below.

Anton Nekrutenko wrote:

Abhishek:

Let talk. This is the area of active current development. We are ÇƒÜlooking

at implementing a universal fastq-like format or supporting ÇƒÜmultiple

formats. Perhaps we should join efforts in ironing out ÇƒÜspecifications.

anton galaxy team

On Jul 20, 2009, at 5:18 PM, Abhishek Pratap wrote:

Hi All

I recently came to know about NGS analysis on galaxy during ISMB. Getting excited I tried couple of things basically to play with it.

Few comments : I may have interepretted something described below in a wrong way. My apologies before hand.

On a standalone installation of galaxy while I was trying to explore one FASTQ(sequence) file. It takes considerable (> 20 min) for a fastq file to get uploaded (2 GB).

Are you using the Galaxy upload utility to create an item in your history that points to the dataset file on disk?

Yes that is precisely correct, I am trying to upload a solexa FASTQ file but on a standalone galaxy installation from my local file system.

I am not sure what is the rationale

behind that. Ideally I think there should be no need to upload such heavy files into the workspace.

A data file that originates from a place external to Galaxy must be uploaded into Galaxy so that the disk file can be placed in the location configured in the Galaxy config file. ÇƒÜAlso, when data is uploaded to

Galaxy ( either to a history or a library ), several database table settings are created that are used by various Galaxy features.

They could actually be used straight

Thanks for the clarification but I am not sure this will help a lot of people who are interested to install and run galaxy locally mainly for the following reasons. May be it is just local to me.

A. We already one instance of data saved on the local file system B. Making another copy via galaxy will eat away a lot of space in long run. C. The time needed to import the files into galaxy space is huge

away by the path specified.

What do you mean by "the path specified"?

Well what I mean was a way to specify the path of the file/run on the lcoal file system and galaxy could directly pick it up from there rather than uploading it into its own space. Now I understand this might not work based on the way the system was designed.

Also is there any way to access the

scripts for analysis on the command line. I know this undermines the main aim of working with galaxy but rite now I am concerned about the performance/time.

You should be able to run any Galaxy tool from the command line as long as you have all of the tool's required binaries in your path. ÇƒÜHowever, running

a tool from within Galaxy should generally not be any slower than running it outside of Galaxy, depending, of course, on what you are doing.

Ok I was under the impression that running from SHELL will eliminate the step of uploading them into galaxy file space.

-Abhi

I will be happy to discuss more about this in case you have some comments/questions for me.

Best, -Abhi

-----------------------------

Abhishek Pratap

Bioinformatics Software Engineer

Institute for Genome Sciences

School of Medicine, Univ of Maryland

801, W. Baltimore Street, Baltimore, MD 21209

Ph: (+1)-410-706-2296

www.igs.umaryland.edu/ <http://www.igs.umaryland.edu/> _______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Anton Nekrutenko http://nekrut.bx.psu.edu http://galaxyproject.org

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

-- Research Associate Department of Biostatistics Associate Director Bioinformatics Core Harvard School of Public Health Skype: ohofmann Phone: +1 (617) 365 0984

Abhishek Pratap

4:49 p.m.

Hi Greg I have updated my galaxy rep to changeset 2825. I dont see the checkbox on the "Upload File" page. Am I missing something ? Thanks, -Abhi On Fri, Oct 2, 2009 at 10:21 AM, Greg Von Kuster <ghv2@psu.edu> wrote:

...

Change set 2812 will be included in a release to the distribution today - here are details of a new option that we're hoping will provide what is needed for most labs.

Add a new option, 'allow_library_path_paste' that adds a new upload page ("Upload files from file system paths") to the admin-side library upload pages. This form contains a textarea that allows Galaxy admins to paste any number of file system paths (files or directories) from which Galaxy will import library datasets, saving the directory structure (if desired). Since such ability allows admins access to any file on the Galaxy server which is readable by Galaxy's system user, this option is disabled by default, and system administrators should take care in assigning Galaxy administrators when this feature is enabled. Controls on what files are accessible to this tool based on ownership or other properties can be added at a later date if there is sufficient interest for such features.

This commit also includes a checkbox on the "Upload directory of files" page (as well as the new "Upload files from file system paths" page above) that will prevent Galaxy from copying data to its files directory (by default, 'database/files/'). This is useful for large library datasets that live in their own managed locations on the file system, this will prevent the existence of duplicate copies of datasets (but means administrators must take care to manage data - moving or removing the data from its Galaxy-external location will render these datasets invalid within Galaxy).

One unique feature to be aware of: when using the "Copy data into Galaxy?" checkbox on the "Upload directory of files" page, any symbolic links encountered in the chosen import directory will be made absolute and dereferenced ONCE. This allows administrators to link large datasets to the import directory, rather than having to make full copies, while being able to delete such links after importing. Only the first symlink (the one in the import directory itself) is dereferenced; all others remain. See the following for an example:

library_import_dir = /galaxy/import

% ls -lR /galaxy/import /galaxy/import: total 6 drwxr-xr-x 2 nate nate 512 Oct 1 11:31 link/

/galaxy/import/link: total 10 lrwxrwxrwx 1 nate nate 71 Oct 1 10:38 1.bed -> ../../../home/nate/galaxy/test-data/1.bed lrwxrwxrwx 1 nate nate 60 Oct 1 10:38 2.bed -> /home/nate/galaxy/test-data/2.bed lrwxrwxrwx 1 nate nate 11 Oct 1 10:38 3.bed -> ../../3.bed lrwxrwxrwx 1 nate nate 35 Oct 1 11:30 4.bed -> ../../galaxy_symlink/test-data/4.bed lrwxrwxrwx 1 nate nate 41 Oct 1 11:31 5.bed -> /galaxy/galaxy_symlink/test-data/5.bed

% ls -l /galaxy/3.bed lrwxrwxrwx 1 nate nate 60 Oct 1 10:39 /galaxy/3.bed -> /home/nate/galaxy/test-data/3.bed

% ls -l /galaxy/galaxy_symlink lrwxrwxrwx 1 nate nate 44 Oct 1 11:30 /galaxy/galaxy_symlink -> /home/nate/galaxy/

In this example,

1.bed is a relative symbolic link to the real 1.bed.

2.bed is an absolute symlink to the real 2.bed.

3.bed is a relative symlink to ../../3.bed, aka /galaxy/3.bed, which itself is a symlink to the real 3.bed.

4.bed is a relative symlink which follows another symlink (/galaxy/galaxy_symlink) to the real 4.bed.

5.bed is an absolute symlink in the same fashion as 4.bed

If the 'link' server directory is chosen on the "Upload directory of files" page, and "Copy data into Galaxy?" is checked "No", the following files will be referenced by Galaxy:

/home/nate/galaxy/test-data/1.bed /home/nate/galaxy/test-data/2.bed /galaxy/3.bed /galaxy/galaxy_symlink/test-data/4.bed /galaxy/galaxy_symlink/test-data/5.bed

The Galaxy administrator may now safely delete /galaxy/import/link, but should take care not to remove the referenced symbolic links (/galaxy/3.bed, /galaxy/galaxy_symlink).

Not all symbolic links are dereferenced because it is assumed that if an administrator links to a path in the import directory which itself is (or contains) links, that is the preferred path for accessing the data.

Oliver Hofmann wrote:

...
Dear all,

to echo what Abhi said: we are also currently looking of ways to automatically import data sets (libraries) into Galaxy without having to manually trigger the import via the administration interface, and ideally while keeping the data in the original place. The idea here is to have multiple tools all point at the original 'source data' without having to replicate terabytes of data.

Not quite sure how feasible this is in practice, but it certainly would be incredibly helpful.

Best,

Oliver

On 28 Sep 2009, at 14:24, Abhishek Pratap wrote:

...
HI Greg

Thanks for a quick reply and making some requested changes. However I am not still sure if importing NGS data will help in long run.

For Centers generating NGS data which could 2-3 T.B / week depending on no. of sequencers I think importing another copy of raw data into galaxy workspace will be asking for lot of disk space. I understand it is a neat way of doing things as it becomes agnostic of the raw data location but might not be the best way for handling huge data in long run for centers like ours.

Please correct me if I am wrong. I think we could also have a simple option without having to import the data and just using it for analysis from the current location, also storing results at the same location. That way in future even if the data set is moved analysis also stays with it.

Let me know what you feel. I will be happy to know if there are any other smart reasons of importing the data in galaxy workspace just for curiosity sake.

Thanks, -Abhi

On Mon, Sep 28, 2009 at 9:28 AM, Greg Von Kuster <ghv2@psu.edu> wrote: Hello Abhishek,

The Galaxy distribution includes the enhancements to which I previously referred for uploading history files. Uploading files to a history now creates a Galaxy job just like any other tool, and can be run on a cluster node, allowing upload of very large files. The initial pass of this work is also completed for uploading to a Data Library, but this enhancement is still in test, so it should soon be available in the distribution.

Do you want to avoid having to import at all (e.g. allow Galaxy to refer to datasets that live in their original locations)? This is not currently possible, but if this is what you are looking for, we can consider some additional options on the current upload form, or possibly a new, separate form.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote: Hi Greg, Anton and all

Just wondering if there has been any progress made on this end. I am sorry I was not able to follow it up on Assaf's suggestion due to other things at work.

I did try the latest version of galaxy and looks like the files are still transferred over HTTP before they could be used in the galaxy workspace. Also I would again like to highlight that many labs might want to use the local instance of galaxy and prefer to point to a local path where the file is being stored. That way we will have both the benefits of using a cool GUI and process data stored locally.

Let me know if you guys need some feedback or have more questions. I will be happy to discuss them.

best, -Abhi

On Tue, Jul 21, 2009 at 4:26 PM, Greg Von Kuster <ghv2@psu.edu <mailto:ghv2@psu.edu>> wrote:

Hello Abishek,

We are currently in the process of significantly enhancing the current Galaxy upload utilities, and the new version should eliminate the issue you've raised about the time needed to upload large files via HTTP ( not for making an initial copy of the file in the Galaxy environment ). However, it will probably not be ready for release for a few more weeks, so if you can take advantage of Assaf's script in the meantime, that's great. ¨ÜI can't guarantee that all Galaxy features will function correctly if you do this though.

Assaf, have you found that using your script breaks anything?

Also, if you upload a file to a library rather than a history, multiple users can "import" the library dataset into their history for analysis, but there is only 1 file on disk ( users are pointing to it from their histories ). ¨ÜBut uploading a file to a history will create a new copy of the file each time it is uploaded.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote:

Hi All

@Greg : Please find my comments below.

On Tue, Jul 21, 2009 at 10:44 AM, Greg Von Kuster<ghv2@psu.edu <mailto:ghv2@psu.edu>> wrote:

Hello Abhi,

Can you clarify the steps you took that produced the behavior? ÇƒÜSee my

comments below.

Anton Nekrutenko wrote:

Abhishek:

Let talk. This is the area of active current development. We are ÇƒÜlooking

at implementing a universal fastq-like format or supporting ÇƒÜmultiple

formats. Perhaps we should join efforts in ironing out ÇƒÜspecifications.

anton galaxy team

On Jul 20, 2009, at 5:18 PM, Abhishek Pratap wrote:

Hi All

I recently came to know about NGS analysis on galaxy during ISMB. Getting excited I tried couple of things basically to play with it.

Few comments : I may have interepretted something described below in a wrong way. My apologies before hand.

On a standalone installation of galaxy while I was trying to explore one FASTQ(sequence) file. It takes considerable (> 20 min) for a fastq file to get uploaded (2 GB).

Are you using the Galaxy upload utility to create an item in your history that points to the dataset file on disk?

Yes that is precisely correct, I am trying to upload a solexa FASTQ file but on a standalone galaxy installation from my local file system.

I am not sure what is the rationale

behind that. Ideally I think there should be no need to upload such heavy files into the workspace.

A data file that originates from a place external to Galaxy must be uploaded into Galaxy so that the disk file can be placed in the location configured in the Galaxy config file. ÇƒÜAlso, when data is uploaded to

Galaxy ( either to a history or a library ), several database table settings are created that are used by various Galaxy features.

They could actually be used straight

Thanks for the clarification but I am not sure this will help a lot of people who are interested to install and run galaxy locally mainly for the following reasons. May be it is just local to me.

A. We already one instance of data saved on the local file system B. Making another copy via galaxy will eat away a lot of space in long run. C. The time needed to import the files into galaxy space is huge

away by the path specified.

What do you mean by "the path specified"?

Well what I mean was a way to specify the path of the file/run on the lcoal file system and galaxy could directly pick it up from there rather than uploading it into its own space. Now I understand this might not work based on the way the system was designed.

Also is there any way to access the

scripts for analysis on the command line. I know this undermines the main aim of working with galaxy but rite now I am concerned about the performance/time.

You should be able to run any Galaxy tool from the command line as long as you have all of the tool's required binaries in your path. ÇƒÜHowever, running

a tool from within Galaxy should generally not be any slower than running it outside of Galaxy, depending, of course, on what you are doing.

Ok I was under the impression that running from SHELL will eliminate the step of uploading them into galaxy file space.

-Abhi

I will be happy to discuss more about this in case you have some comments/questions for me.

Best, -Abhi

-----------------------------

Abhishek Pratap

Bioinformatics Software Engineer

Institute for Genome Sciences

School of Medicine, Univ of Maryland

801, W. Baltimore Street, Baltimore, MD 21209

Ph: (+1)-410-706-2296

www.igs.umaryland.edu/ <http://www.igs.umaryland.edu/> _______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Anton Nekrutenko http://nekrut.bx.psu.edu http://galaxyproject.org

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

-- Research Associate Department of Biostatistics Associate Director Bioinformatics Core Harvard School of Public Health Skype: ohofmann Phone: +1 (617) 365 0984

Greg Von Kuster

6:53 p.m.

New subject: Experience with Loading NGS data on standalone instance of galaxy

Hello Abhishek, Add this to your universe_wsgi.ini file: allow_library_path_paste = True Then, clicking the down-arrow on the upload form Create new data library datasets ▼ will give you 4 options, 1 of which is: Upload files from file system paths Greg Von Kuster Galaxy Development Team Abhishek Pratap wrote:

...

Hi Greg

I have updated my galaxy rep to changeset 2825. I dont see the checkbox on the "Upload File" page. Am I missing something ?

Thanks, -Abhi

On Fri, Oct 2, 2009 at 10:21 AM, Greg Von Kuster <ghv2@psu.edu> wrote:

...
Change set 2812 will be included in a release to the distribution today - here are details of a new option that we're hoping will provide what is needed for most labs.

Add a new option, 'allow_library_path_paste' that adds a new upload page ("Upload files from file system paths") to the admin-side library upload pages. This form contains a textarea that allows Galaxy admins to paste any number of file system paths (files or directories) from which Galaxy will import library datasets, saving the directory structure (if desired). ¬†Since such ability allows admins access to any file on the Galaxy server which is readable by Galaxy's system user, this option is disabled by default, and system administrators should take care in assigning Galaxy administrators when this feature is enabled. ¬†Controls on what files are accessible to this tool based on ownership or other properties can be added at a later date if there is sufficient interest for such features.

This commit also includes a checkbox on the "Upload directory of files" page (as well as the new "Upload files from file system paths" page above) that will prevent Galaxy from copying data to its files directory (by default, 'database/files/'). ¬†This is useful for large library datasets that live in their own managed locations on the file system, this will prevent the existence of duplicate copies of datasets (but means administrators must take care to manage data - moving or removing the data from its Galaxy-external location will render these datasets invalid within Galaxy).

One unique feature to be aware of: when using the "Copy data into Galaxy?" checkbox on the "Upload directory of files" page, any symbolic links encountered in the chosen import directory will be made absolute and dereferenced ONCE. ¬†This allows administrators to link large datasets to the import directory, rather than having to make full copies, while being able to delete such links after importing. ¬†Only the first symlink (the one in the import directory itself) is dereferenced; all others remain. ¬†See the following for an example:

library_import_dir = /galaxy/import

% ls -lR /galaxy/import /galaxy/import: total 6 drwxr-xr-x ¬† 2 nate ¬† ¬† nate ¬† ¬† ¬† ¬† 512 Oct ¬†1 11:31 link/

/galaxy/import/link: total 10 lrwxrwxrwx ¬† 1 nate ¬† ¬† nate ¬† ¬† ¬† ¬† ¬†71 Oct ¬†1 10:38 1.bed -> ../../../home/nate/galaxy/test-data/1.bed lrwxrwxrwx ¬† 1 nate ¬† ¬† nate ¬† ¬† ¬† ¬† ¬†60 Oct ¬†1 10:38 2.bed -> /home/nate/galaxy/test-data/2.bed lrwxrwxrwx ¬† 1 nate ¬† ¬† nate ¬† ¬† ¬† ¬† ¬†11 Oct ¬†1 10:38 3.bed -> ../../3.bed lrwxrwxrwx ¬† 1 nate ¬† ¬† nate ¬† ¬† ¬† ¬† ¬†35 Oct ¬†1 11:30 4.bed -> ../../galaxy_symlink/test-data/4.bed lrwxrwxrwx ¬† 1 nate ¬† ¬† nate ¬† ¬† ¬† ¬† ¬†41 Oct ¬†1 11:31 5.bed -> /galaxy/galaxy_symlink/test-data/5.bed

% ls -l /galaxy/3.bed lrwxrwxrwx ¬† 1 nate ¬† ¬† nate ¬† ¬† ¬† ¬† ¬†60 Oct ¬†1 10:39 /galaxy/3.bed -> /home/nate/galaxy/test-data/3.bed

% ls -l /galaxy/galaxy_symlink lrwxrwxrwx ¬† 1 nate ¬† ¬† nate ¬† ¬† ¬† ¬† ¬†44 Oct ¬†1 11:30 /galaxy/galaxy_symlink -> /home/nate/galaxy/

In this example,

1.bed is a relative symbolic link to the real 1.bed.

2.bed is an absolute symlink to the real 2.bed.

3.bed is a relative symlink to ../../3.bed, aka /galaxy/3.bed, which itself is a symlink to the real 3.bed.

4.bed is a relative symlink which follows another symlink (/galaxy/galaxy_symlink) to the real 4.bed.

5.bed is an absolute symlink in the same fashion as 4.bed

If the 'link' server directory is chosen on the "Upload directory of files" page, and "Copy data into Galaxy?" is checked "No", the following files will be referenced by Galaxy:

/home/nate/galaxy/test-data/1.bed /home/nate/galaxy/test-data/2.bed /galaxy/3.bed /galaxy/galaxy_symlink/test-data/4.bed /galaxy/galaxy_symlink/test-data/5.bed

The Galaxy administrator may now safely delete /galaxy/import/link, but should take care not to remove the referenced symbolic links (/galaxy/3.bed, /galaxy/galaxy_symlink).

Not all symbolic links are dereferenced because it is assumed that if an administrator links to a path in the import directory which itself is (or contains) links, that is the preferred path for accessing the data.

Oliver Hofmann wrote:

...
Dear all,

to echo what Abhi said: we are also currently looking of ways to automatically import data sets (libraries) into Galaxy without having to manually trigger the import via the administration interface, and ideally while keeping the data in the original place. The idea here is to have multiple tools all point at the original 'source data' without having to replicate terabytes of data.

Not quite sure how feasible this is in practice, but it certainly would be incredibly helpful.

Best,

¬† ¬†Oliver

On 28 Sep 2009, at 14:24, Abhishek Pratap wrote:

...
HI Greg

Thanks for a quick reply and making some requested changes. However I am not still sure if importing NGS data will help in long run.

For Centers generating NGS data which could 2-3 T.B / week depending on no. of sequencers I think importing another copy of raw data into galaxy workspace will be asking for lot of disk space. I understand it is a neat way of doing things as it becomes agnostic of the raw data location ¬†but might not be the best way for handling huge data in long run for centers like ours.

Please correct me if I am wrong. I think we could also have a simple option without having to import the data and just using it for analysis from the current location, also storing results at the same location. That way in future even if the data set is moved analysis also stays with it.

Let me know what you feel. I will be happy to know if there are any other smart reasons of importing the data in galaxy workspace just for curiosity sake.

Thanks, -Abhi

On Mon, Sep 28, 2009 at 9:28 AM, Greg Von Kuster <ghv2@psu.edu> wrote: Hello Abhishek,

The Galaxy distribution includes the enhancements to which I previously referred for uploading history files. ¬†Uploading files to a history now creates a Galaxy job just like any other tool, and can be run on a cluster node, allowing upload of very large files. ¬†The initial pass of this work is also completed for uploading to a Data Library, but this enhancement is still in test, so it should soon be available in the distribution.

Do you want to avoid having to import at all (e.g. allow Galaxy to refer to datasets that live in their original locations)? ¬†This is not currently possible, but if this is what you are looking for, we can consider some additional options on the current upload form, or possibly a new, separate form.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote: Hi Greg, Anton and all

Just wondering if there has been any progress made on this end. I am sorry I was not able to follow it up on Assaf's suggestion due to other things at work.

I did try the latest version of galaxy and looks like the files are still transferred over HTTP before they could be used in the galaxy workspace. Also I would again like to highlight that many labs might want to use the local instance of galaxy and prefer to point to a local path where the file is being stored. That way we will have both the benefits of using a cool GUI and process data stored locally.

Let me know if you guys need some feedback or have more questions. I will be happy to discuss them.

best, -Abhi

On Tue, Jul 21, 2009 at 4:26 PM, Greg Von Kuster <ghv2@psu.edu <mailto:ghv2@psu.edu>> wrote:

¬† Hello Abishek,

¬† We are currently in the process of significantly enhancing the ¬† current Galaxy upload utilities, and the new version should ¬† eliminate the issue you've raised about the time needed to upload ¬† large files via HTTP ( not for making an initial copy of the file in ¬† the Galaxy environment ). However, it will probably not be ready for ¬† release for a few more weeks, so if you can take advantage of ¬† Assaf's script in the meantime, that's great. ¬®√úI can't guarantee ¬† that all Galaxy features will function correctly if you do this though.

¬† Assaf, have you found that using your script breaks anything?

¬† Also, if you upload a file to a library rather than a history, ¬† multiple users can "import" the library dataset into their history ¬† for analysis, but there is only 1 file on disk ( users are pointing ¬† to it from their histories ). ¬®√úBut uploading a file to a history ¬† will create a new copy of the file each time it is uploaded.

¬† Greg Von Kuster ¬† Galaxy Development Team

¬† Abhishek Pratap wrote:

¬† ¬† ¬† Hi All

¬† ¬† ¬† @Greg : Please find my comments below.

¬† ¬† ¬† On Tue, Jul 21, 2009 at 10:44 AM, Greg Von Kuster<ghv2@psu.edu ¬† ¬† ¬† <mailto:ghv2@psu.edu>> wrote:

¬† ¬† ¬† ¬† ¬† Hello Abhi,

¬† ¬† ¬† ¬† ¬† Can you clarify the steps you took that produced the ¬† ¬† ¬† ¬† ¬† behavior? √á∆í√úSee my

¬† ¬† ¬† ¬† ¬† comments below.

¬† ¬† ¬† ¬† ¬† Anton Nekrutenko wrote:

¬† ¬† ¬† ¬† ¬† ¬† ¬† Abhishek:

¬† ¬† ¬† ¬† ¬† ¬† ¬† Let talk. This is the area of active current ¬† ¬† ¬† ¬† ¬† ¬† ¬† development. We are √á∆í√úlooking

¬† ¬† ¬† ¬† ¬† ¬† ¬† at implementing a universal fastq-like format or ¬† ¬† ¬† ¬† ¬† ¬† ¬† supporting √á∆í√úmultiple

¬† ¬† ¬† ¬† ¬† ¬† ¬† formats. Perhaps we should join efforts in ironing out ¬† ¬† ¬† ¬† ¬† ¬† ¬† √á∆í√úspecifications.

¬† ¬† ¬† ¬† ¬† ¬† ¬† anton ¬† ¬† ¬† ¬† ¬† ¬† ¬† galaxy team

¬† ¬† ¬† ¬† ¬† ¬† ¬† On Jul 20, 2009, at 5:18 PM, Abhishek Pratap wrote:

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† Hi All

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† I recently came to know about NGS analysis on galaxy ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† during ISMB. ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† Getting excited I tried couple of things basically ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† to play with it.

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† Few comments : I may have interepretted something ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† described below in a ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† wrong way. My apologies before hand.

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† On a standalone installation of galaxy while I was ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† trying to explore ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† one FASTQ(sequence) file. It takes considerable (> ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† 20 min) for a fastq ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† file to get uploaded (2 GB).

¬† ¬† ¬† ¬† ¬† Are you using the Galaxy upload utility to create an item in ¬† ¬† ¬† ¬† ¬† your history ¬† ¬† ¬† ¬† ¬† that points to the dataset file on disk?

¬† ¬† ¬† Yes that is precisely correct, I am trying to upload a solexa FASTQ ¬† ¬† ¬† file but on a standalone galaxy installation from my local file ¬† ¬† ¬† system.

¬† ¬† ¬† ¬† ¬† I am not sure what is the rationale

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† behind that. Ideally I think there should be no need ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† to upload such ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† heavy files into the workspace.

¬† ¬† ¬† ¬† ¬† A data file that originates from a place external to Galaxy ¬† ¬† ¬† ¬† ¬† must be uploaded ¬† ¬† ¬† ¬† ¬† into Galaxy so that the disk file can be placed in the ¬† ¬† ¬† ¬† ¬† location configured ¬† ¬† ¬† ¬† ¬† in the Galaxy config file. √á∆í√úAlso, when data is uploaded to

¬† ¬† ¬† ¬† ¬† Galaxy ( either ¬† ¬† ¬† ¬† ¬† to a history or a library ), several database table settings ¬† ¬† ¬† ¬† ¬† are created ¬† ¬† ¬† ¬† ¬† that are used by various Galaxy features.

¬† ¬† ¬† ¬† ¬† They could actually be used straight

¬† ¬† ¬† Thanks for the clarification but I am not sure this will help a ¬† ¬† ¬† lot of ¬† ¬† ¬† people who are interested to install and run galaxy locally ¬† ¬† ¬† mainly for ¬† ¬† ¬† the following reasons. May be it is just local to me.

¬† ¬† ¬† A. We already one instance of data saved on the local file system ¬† ¬† ¬† B. Making another copy via galaxy will eat away a lot of space ¬† ¬† ¬† in long run. ¬† ¬† ¬† C. The time needed to import the files into galaxy space is huge

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† away by the path specified.

¬† ¬† ¬† ¬† ¬† What do you mean by "the path specified"?

¬† ¬† ¬† Well what I mean was a way to specify the path of the file/run ¬† ¬† ¬† on the ¬† ¬† ¬† lcoal file system and galaxy could directly pick it up from there ¬† ¬† ¬† rather than uploading it into its own space. Now I understand this ¬† ¬† ¬† might not work based on the way the system was designed.

¬† ¬† ¬† ¬† ¬† Also is there any way to access the

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† scripts for analysis on the command line. I know ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† this undermines the ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† main aim of working with galaxy but rite now I am ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† concerned about the ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† performance/time.

¬† ¬† ¬† ¬† ¬† You should be able to run any Galaxy tool from the command ¬† ¬† ¬† ¬† ¬† line as long as ¬† ¬† ¬† ¬† ¬† you have all of the tool's required binaries in your path. ¬† ¬† ¬† ¬† ¬† √á∆í√úHowever, running

¬† ¬† ¬† ¬† ¬† a tool from within Galaxy should generally not be any slower ¬† ¬† ¬† ¬† ¬† than running it ¬† ¬† ¬† ¬† ¬† outside of Galaxy, depending, of course, on what you are doing.

¬† ¬† ¬† Ok I was under the impression that running from SHELL will eliminate ¬† ¬† ¬† the step of uploading them into galaxy file space.

¬† ¬† ¬† -Abhi

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† I will be happy to discuss more about this in case ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† you have some ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† comments/questions for me.

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† Best, ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† -Abhi

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† -----------------------------

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† Abhishek Pratap

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† Bioinformatics Software Engineer

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† Institute for Genome Sciences

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† School of Medicine, Univ of Maryland

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† 801, W. Baltimore Street, Baltimore, MD 21209

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† Ph: (+1)-410-706-2296

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† www.igs.umaryland.edu/ <http://www.igs.umaryland.edu/> ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† _______________________________________________ ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† galaxy-user mailing list ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

¬† ¬† ¬† ¬† ¬† ¬† ¬† Anton Nekrutenko ¬† ¬† ¬† ¬† ¬† ¬† ¬† http://nekrut.bx.psu.edu ¬† ¬† ¬† ¬† ¬† ¬† ¬† http://galaxyproject.org

¬† ¬† ¬† ¬† ¬† ¬† ¬† _______________________________________________ ¬† ¬† ¬† ¬† ¬† ¬† ¬† galaxy-user mailing list ¬† ¬† ¬† ¬† ¬† ¬† ¬† galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

¬† ¬† ¬† ¬† ¬† ¬† ¬† http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user -- Research Associate ¬† ¬†Department of Biostatistics Associate Director ¬† ¬†Bioinformatics Core ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬†Harvard School of Public Health Skype: ohofmann ¬† ¬† ¬† Phone: +1 (617) 365 0984

Abhishek Pratap

7:24 p.m.

Hi Greg Unfortunately it is not working for me. I made sure I cleared my browser cache before re-viewing it. I have set the option as suggested by you in the universe_wsgi.ini file. -Abhi On Fri, Oct 2, 2009 at 2:53 PM, Greg Von Kuster <ghv2@psu.edu> wrote:

...

Hello Abhishek,

Add this to your universe_wsgi.ini file:

allow_library_path_paste = True

Then, clicking the down-arrow on the upload form

Create new data library datasets ▼

will give you 4 options, 1 of which is:

Upload files from file system paths

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote:

...
Hi Greg

I have updated my galaxy rep to changeset 2825. I dont see the checkbox on the "Upload File" page. Am I missing something ?

Thanks, -Abhi

On Fri, Oct 2, 2009 at 10:21 AM, Greg Von Kuster <ghv2@psu.edu> wrote:

...
Change set 2812 will be included in a release to the distribution today - here are details of a new option that we're hoping will provide what is needed for most labs.

Add a new option, 'allow_library_path_paste' that adds a new upload page ("Upload files from file system paths") to the admin-side library upload pages. This form contains a textarea that allows Galaxy admins to paste any number of file system paths (files or directories) from which Galaxy will import library datasets, saving the directory structure (if desired). ¬†Since such ability allows admins access to any file on the Galaxy server which is readable by Galaxy's system user, this option is disabled by default, and system administrators should take care in assigning Galaxy administrators when this feature is enabled. ¬†Controls on what files are accessible to this tool based on ownership or other properties can be added at a later date if there is sufficient interest for such features.

This commit also includes a checkbox on the "Upload directory of files" page (as well as the new "Upload files from file system paths" page above) that will prevent Galaxy from copying data to its files directory (by default, 'database/files/'). ¬†This is useful for large library datasets that live in their own managed locations on the file system, this will prevent the existence of duplicate copies of datasets (but means administrators must take care to manage data - moving or removing the data from its Galaxy-external location will render these datasets invalid within Galaxy).

One unique feature to be aware of: when using the "Copy data into Galaxy?" checkbox on the "Upload directory of files" page, any symbolic links encountered in the chosen import directory will be made absolute and dereferenced ONCE. ¬†This allows administrators to link large datasets to the import directory, rather than having to make full copies, while being able to delete such links after importing. ¬†Only the first symlink (the one in the import directory itself) is dereferenced; all others remain. ¬†See the following for an example:

library_import_dir = /galaxy/import

% ls -lR /galaxy/import /galaxy/import: total 6 drwxr-xr-x ¬† 2 nate ¬† ¬† nate ¬† ¬† ¬† ¬† 512 Oct ¬†1 11:31 link/

/galaxy/import/link: total 10 lrwxrwxrwx ¬† 1 nate ¬† ¬† nate ¬† ¬† ¬† ¬† ¬†71 Oct ¬†1 10:38 1.bed -> ../../../home/nate/galaxy/test-data/1.bed lrwxrwxrwx ¬† 1 nate ¬† ¬† nate ¬† ¬† ¬† ¬† ¬†60 Oct ¬†1 10:38 2.bed -> /home/nate/galaxy/test-data/2.bed lrwxrwxrwx ¬† 1 nate ¬† ¬† nate ¬† ¬† ¬† ¬† ¬†11 Oct ¬†1 10:38 3.bed -> ../../3.bed lrwxrwxrwx ¬† 1 nate ¬† ¬† nate ¬† ¬† ¬† ¬† ¬†35 Oct ¬†1 11:30 4.bed -> ../../galaxy_symlink/test-data/4.bed lrwxrwxrwx ¬† 1 nate ¬† ¬† nate ¬† ¬† ¬† ¬† ¬†41 Oct ¬†1 11:31 5.bed -> /galaxy/galaxy_symlink/test-data/5.bed

% ls -l /galaxy/3.bed lrwxrwxrwx ¬† 1 nate ¬† ¬† nate ¬† ¬† ¬† ¬† ¬†60 Oct ¬†1 10:39 /galaxy/3.bed -> /home/nate/galaxy/test-data/3.bed

% ls -l /galaxy/galaxy_symlink lrwxrwxrwx ¬† 1 nate ¬† ¬† nate ¬† ¬† ¬† ¬† ¬†44 Oct ¬†1 11:30 /galaxy/galaxy_symlink -> /home/nate/galaxy/

In this example,

1.bed is a relative symbolic link to the real 1.bed.

2.bed is an absolute symlink to the real 2.bed.

3.bed is a relative symlink to ../../3.bed, aka /galaxy/3.bed, which itself is a symlink to the real 3.bed.

4.bed is a relative symlink which follows another symlink (/galaxy/galaxy_symlink) to the real 4.bed.

5.bed is an absolute symlink in the same fashion as 4.bed

If the 'link' server directory is chosen on the "Upload directory of files" page, and "Copy data into Galaxy?" is checked "No", the following files will be referenced by Galaxy:

/home/nate/galaxy/test-data/1.bed /home/nate/galaxy/test-data/2.bed /galaxy/3.bed /galaxy/galaxy_symlink/test-data/4.bed /galaxy/galaxy_symlink/test-data/5.bed

The Galaxy administrator may now safely delete /galaxy/import/link, but should take care not to remove the referenced symbolic links (/galaxy/3.bed, /galaxy/galaxy_symlink).

Not all symbolic links are dereferenced because it is assumed that if an administrator links to a path in the import directory which itself is (or contains) links, that is the preferred path for accessing the data.

Oliver Hofmann wrote:

...
Dear all,

to echo what Abhi said: we are also currently looking of ways to automatically import data sets (libraries) into Galaxy without having to manually trigger the import via the administration interface, and ideally while keeping the data in the original place. The idea here is to have multiple tools all point at the original 'source data' without having to replicate terabytes of data.

Not quite sure how feasible this is in practice, but it certainly would be incredibly helpful.

Best,

¬† ¬†Oliver

On 28 Sep 2009, at 14:24, Abhishek Pratap wrote:

...
HI Greg

Thanks for a quick reply and making some requested changes. However I am not still sure if importing NGS data will help in long run.

For Centers generating NGS data which could 2-3 T.B / week depending on no. of sequencers I think importing another copy of raw data into galaxy workspace will be asking for lot of disk space. I understand it is a neat way of doing things as it becomes agnostic of the raw data location ¬†but might not be the best way for handling huge data in long run for centers like ours.

Please correct me if I am wrong. I think we could also have a simple option without having to import the data and just using it for analysis from the current location, also storing results at the same location. That way in future even if the data set is moved analysis also stays with it.

Let me know what you feel. I will be happy to know if there are any other smart reasons of importing the data in galaxy workspace just for curiosity sake.

Thanks, -Abhi

On Mon, Sep 28, 2009 at 9:28 AM, Greg Von Kuster <ghv2@psu.edu> wrote: Hello Abhishek,

The Galaxy distribution includes the enhancements to which I previously referred for uploading history files. ¬†Uploading files to a history now creates a Galaxy job just like any other tool, and can be run on a cluster node, allowing upload of very large files. ¬†The initial pass of this work is also completed for uploading to a Data Library, but this enhancement is still in test, so it should soon be available in the distribution.

Do you want to avoid having to import at all (e.g. allow Galaxy to refer to datasets that live in their original locations)? ¬†This is not currently possible, but if this is what you are looking for, we can consider some additional options on the current upload form, or possibly a new, separate form.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote: Hi Greg, Anton and all

Just wondering if there has been any progress made on this end. I am sorry I was not able to follow it up on Assaf's suggestion due to other things at work.

I did try the latest version of galaxy and looks like the files are still transferred over HTTP before they could be used in the galaxy workspace. Also I would again like to highlight that many labs might want to use the local instance of galaxy and prefer to point to a local path where the file is being stored. That way we will have both the benefits of using a cool GUI and process data stored locally.

Let me know if you guys need some feedback or have more questions. I will be happy to discuss them.

best, -Abhi

On Tue, Jul 21, 2009 at 4:26 PM, Greg Von Kuster <ghv2@psu.edu <mailto:ghv2@psu.edu>> wrote:

¬† Hello Abishek,

¬† We are currently in the process of significantly enhancing the ¬† current Galaxy upload utilities, and the new version should ¬† eliminate the issue you've raised about the time needed to upload ¬† large files via HTTP ( not for making an initial copy of the file in ¬† the Galaxy environment ). However, it will probably not be ready for ¬† release for a few more weeks, so if you can take advantage of ¬† Assaf's script in the meantime, that's great. ¬®√úI can't guarantee ¬† that all Galaxy features will function correctly if you do this though.

¬† Assaf, have you found that using your script breaks anything?

¬† Also, if you upload a file to a library rather than a history, ¬† multiple users can "import" the library dataset into their history ¬† for analysis, but there is only 1 file on disk ( users are pointing ¬† to it from their histories ). ¬®√úBut uploading a file to a history ¬† will create a new copy of the file each time it is uploaded.

¬† Greg Von Kuster ¬† Galaxy Development Team

¬† Abhishek Pratap wrote:

¬† ¬† ¬† Hi All

¬† ¬† ¬† @Greg : Please find my comments below.

¬† ¬† ¬† On Tue, Jul 21, 2009 at 10:44 AM, Greg Von Kuster<ghv2@psu.edu ¬† ¬† ¬† <mailto:ghv2@psu.edu>> wrote:

¬† ¬† ¬† ¬† ¬† Hello Abhi,

¬† ¬† ¬† ¬† ¬† Can you clarify the steps you took that produced the ¬† ¬† ¬† ¬† ¬† behavior? √á∆í√úSee my

¬† ¬† ¬† ¬† ¬† comments below.

¬† ¬† ¬† ¬† ¬† Anton Nekrutenko wrote:

¬† ¬† ¬† ¬† ¬† ¬† ¬† Abhishek:

¬† ¬† ¬† ¬† ¬† ¬† ¬† Let talk. This is the area of active current ¬† ¬† ¬† ¬† ¬† ¬† ¬† development. We are √á∆í√úlooking

¬† ¬† ¬† ¬† ¬† ¬† ¬† at implementing a universal fastq-like format or ¬† ¬† ¬† ¬† ¬† ¬† ¬† supporting √á∆í√úmultiple

¬† ¬† ¬† ¬† ¬† ¬† ¬† formats. Perhaps we should join efforts in ironing out ¬† ¬† ¬† ¬† ¬† ¬† ¬† √á∆í√úspecifications.

¬† ¬† ¬† ¬† ¬† ¬† ¬† anton ¬† ¬† ¬† ¬† ¬† ¬† ¬† galaxy team

¬† ¬† ¬† ¬† ¬† ¬† ¬† On Jul 20, 2009, at 5:18 PM, Abhishek Pratap wrote:

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† Hi All

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† I recently came to know about NGS analysis on galaxy ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† during ISMB. ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† Getting excited I tried couple of things basically ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† to play with it.

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† Few comments : I may have interepretted something ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† described below in a ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† wrong way. My apologies before hand.

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† On a standalone installation of galaxy while I was ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† trying to explore ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† one FASTQ(sequence) file. It takes considerable (> ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† 20 min) for a fastq ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† file to get uploaded (2 GB).

¬† ¬† ¬† ¬† ¬† Are you using the Galaxy upload utility to create an item in ¬† ¬† ¬† ¬† ¬† your history ¬† ¬† ¬† ¬† ¬† that points to the dataset file on disk?

¬† ¬† ¬† Yes that is precisely correct, I am trying to upload a solexa FASTQ ¬† ¬† ¬† file but on a standalone galaxy installation from my local file ¬† ¬† ¬† system.

¬† ¬† ¬† ¬† ¬† I am not sure what is the rationale

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† behind that. Ideally I think there should be no need ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† to upload such ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† heavy files into the workspace.

¬† ¬† ¬† ¬† ¬† A data file that originates from a place external to Galaxy ¬† ¬† ¬† ¬† ¬† must be uploaded ¬† ¬† ¬† ¬† ¬† into Galaxy so that the disk file can be placed in the ¬† ¬† ¬† ¬† ¬† location configured ¬† ¬† ¬† ¬† ¬† in the Galaxy config file. √á∆í√úAlso, when data is uploaded to

¬† ¬† ¬† ¬† ¬† Galaxy ( either ¬† ¬† ¬† ¬† ¬† to a history or a library ), several database table settings ¬† ¬† ¬† ¬† ¬† are created ¬† ¬† ¬† ¬† ¬† that are used by various Galaxy features.

¬† ¬† ¬† ¬† ¬† They could actually be used straight

¬† ¬† ¬† Thanks for the clarification but I am not sure this will help a ¬† ¬† ¬† lot of ¬† ¬† ¬† people who are interested to install and run galaxy locally ¬† ¬† ¬† mainly for ¬† ¬† ¬† the following reasons. May be it is just local to me.

¬† ¬† ¬† A. We already one instance of data saved on the local file system ¬† ¬† ¬† B. Making another copy via galaxy will eat away a lot of space ¬† ¬† ¬† in long run. ¬† ¬† ¬† C. The time needed to import the files into galaxy space is huge

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† away by the path specified.

¬† ¬† ¬† ¬† ¬† What do you mean by "the path specified"?

¬† ¬† ¬† Well what I mean was a way to specify the path of the file/run ¬† ¬† ¬† on the ¬† ¬† ¬† lcoal file system and galaxy could directly pick it up from there ¬† ¬† ¬† rather than uploading it into its own space. Now I understand this ¬† ¬† ¬† might not work based on the way the system was designed.

¬† ¬† ¬† ¬† ¬† Also is there any way to access the

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† scripts for analysis on the command line. I know ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† this undermines the ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† main aim of working with galaxy but rite now I am ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† concerned about the ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† performance/time.

¬† ¬† ¬† ¬† ¬† You should be able to run any Galaxy tool from the command ¬† ¬† ¬† ¬† ¬† line as long as ¬† ¬† ¬† ¬† ¬† you have all of the tool's required binaries in your path. ¬† ¬† ¬† ¬† ¬† √á∆í√úHowever, running

¬† ¬† ¬† ¬† ¬† a tool from within Galaxy should generally not be any slower ¬† ¬† ¬† ¬† ¬† than running it ¬† ¬† ¬† ¬† ¬† outside of Galaxy, depending, of course, on what you are doing.

¬† ¬† ¬† Ok I was under the impression that running from SHELL will eliminate ¬† ¬† ¬† the step of uploading them into galaxy file space.

¬† ¬† ¬† -Abhi

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† I will be happy to discuss more about this in case ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† you have some ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† comments/questions for me.

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† Best, ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† -Abhi

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† -----------------------------

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† Abhishek Pratap

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† Bioinformatics Software Engineer

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† Institute for Genome Sciences

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† School of Medicine, Univ of Maryland

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† 801, W. Baltimore Street, Baltimore, MD 21209

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† Ph: (+1)-410-706-2296

¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† www.igs.umaryland.edu/ <http://www.igs.umaryland.edu/> ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† _______________________________________________ ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† galaxy-user mailing list ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

¬† ¬† ¬† ¬† ¬† ¬† ¬† Anton Nekrutenko ¬† ¬† ¬† ¬† ¬† ¬† ¬† http://nekrut.bx.psu.edu ¬† ¬† ¬† ¬† ¬† ¬† ¬† http://galaxyproject.org

¬† ¬† ¬† ¬† ¬† ¬† ¬† _______________________________________________ ¬† ¬† ¬† ¬† ¬† ¬† ¬† galaxy-user mailing list ¬† ¬† ¬† ¬† ¬† ¬† ¬† galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

¬† ¬† ¬† ¬† ¬† ¬† ¬† http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

-- Research Associate ¬† ¬†Department of Biostatistics Associate Director ¬† ¬†Bioinformatics Core ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬† ¬†Harvard School of Public Health Skype: ohofmann ¬† ¬† ¬† Phone: +1 (617) 365 0984

Greg Von Kuster

7:30 p.m.

New subject: Experience with Loading NGS data on standalone instance of galaxy

Please type the following in your galaxy install directory, and let me know what you get: hg heads Thanks Abhishek Pratap wrote:

...

Hi Greg

Unfortunately it is not working for me. I made sure I cleared my browser cache before re-viewing it.

I have set the option as suggested by you in the universe_wsgi.ini file.

-Abhi

On Fri, Oct 2, 2009 at 2:53 PM, Greg Von Kuster <ghv2@psu.edu> wrote:

...
Hello Abhishek,

Add this to your universe_wsgi.ini file:

allow_library_path_paste = True

Then, clicking the down-arrow on the upload form

Create new data library datasets ¬†‚ñº

will give you 4 options, 1 of which is:

Upload files from file system paths

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote:

...
Hi Greg

I have updated my galaxy rep to changeset 2825. I dont see the checkbox on the "Upload File" page. Am I missing something ?

Thanks, -Abhi

On Fri, Oct 2, 2009 at 10:21 AM, Greg Von Kuster <ghv2@psu.edu> wrote:

...
Change set 2812 will be included in a release to the distribution today - here are details of a new option that we're hoping will provide what is needed for most labs.

Add a new option, 'allow_library_path_paste' that adds a new upload page ("Upload files from file system paths") to the admin-side library upload pages. This form contains a textarea that allows Galaxy admins to paste any number of file system paths (files or directories) from which Galaxy will import library datasets, saving the directory structure (if desired). ¬¨‚Ä†Since such ability allows admins access to any file on the Galaxy server which is readable by Galaxy's system user, this option is disabled by default, and system administrators should take care in assigning Galaxy administrators when this feature is enabled. ¬¨‚Ä†Controls on what files are accessible to this tool based on ownership or other properties can be added at a later date if there is sufficient interest for such features.

This commit also includes a checkbox on the "Upload directory of files" page (as well as the new "Upload files from file system paths" page above) that will prevent Galaxy from copying data to its files directory (by default, 'database/files/'). ¬¨‚Ä†This is useful for large library datasets that live in their own managed locations on the file system, this will prevent the existence of duplicate copies of datasets (but means administrators must take care to manage data - moving or removing the data from its Galaxy-external location will render these datasets invalid within Galaxy).

One unique feature to be aware of: when using the "Copy data into Galaxy?" checkbox on the "Upload directory of files" page, any symbolic links encountered in the chosen import directory will be made absolute and dereferenced ONCE. ¬¨‚Ä†This allows administrators to link large datasets to the import directory, rather than having to make full copies, while being able to delete such links after importing. ¬¨‚Ä†Only the first symlink (the one in the import directory itself) is dereferenced; all others remain. ¬¨‚Ä†See the following for an example:

library_import_dir = /galaxy/import

% ls -lR /galaxy/import /galaxy/import: total 6 drwxr-xr-x ¬¨‚Ä† 2 nate ¬¨‚Ä† ¬¨‚Ä† nate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† 512 Oct ¬¨‚Ä†1 11:31 link/

/galaxy/import/link: total 10 lrwxrwxrwx ¬¨‚Ä† 1 nate ¬¨‚Ä† ¬¨‚Ä† nate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä†71 Oct ¬¨‚Ä†1 10:38 1.bed -> ../../../home/nate/galaxy/test-data/1.bed lrwxrwxrwx ¬¨‚Ä† 1 nate ¬¨‚Ä† ¬¨‚Ä† nate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä†60 Oct ¬¨‚Ä†1 10:38 2.bed -> /home/nate/galaxy/test-data/2.bed lrwxrwxrwx ¬¨‚Ä† 1 nate ¬¨‚Ä† ¬¨‚Ä† nate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä†11 Oct ¬¨‚Ä†1 10:38 3.bed -> ../../3.bed lrwxrwxrwx ¬¨‚Ä† 1 nate ¬¨‚Ä† ¬¨‚Ä† nate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä†35 Oct ¬¨‚Ä†1 11:30 4.bed -> ../../galaxy_symlink/test-data/4.bed lrwxrwxrwx ¬¨‚Ä† 1 nate ¬¨‚Ä† ¬¨‚Ä† nate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä†41 Oct ¬¨‚Ä†1 11:31 5.bed -> /galaxy/galaxy_symlink/test-data/5.bed

% ls -l /galaxy/3.bed lrwxrwxrwx ¬¨‚Ä† 1 nate ¬¨‚Ä† ¬¨‚Ä† nate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä†60 Oct ¬¨‚Ä†1 10:39 /galaxy/3.bed -> /home/nate/galaxy/test-data/3.bed

% ls -l /galaxy/galaxy_symlink lrwxrwxrwx ¬¨‚Ä† 1 nate ¬¨‚Ä† ¬¨‚Ä† nate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä†44 Oct ¬¨‚Ä†1 11:30 /galaxy/galaxy_symlink -> /home/nate/galaxy/

In this example,

1.bed is a relative symbolic link to the real 1.bed.

2.bed is an absolute symlink to the real 2.bed.

3.bed is a relative symlink to ../../3.bed, aka /galaxy/3.bed, which itself is a symlink to the real 3.bed.

4.bed is a relative symlink which follows another symlink (/galaxy/galaxy_symlink) to the real 4.bed.

5.bed is an absolute symlink in the same fashion as 4.bed

If the 'link' server directory is chosen on the "Upload directory of files" page, and "Copy data into Galaxy?" is checked "No", the following files will be referenced by Galaxy:

/home/nate/galaxy/test-data/1.bed /home/nate/galaxy/test-data/2.bed /galaxy/3.bed /galaxy/galaxy_symlink/test-data/4.bed /galaxy/galaxy_symlink/test-data/5.bed

The Galaxy administrator may now safely delete /galaxy/import/link, but should take care not to remove the referenced symbolic links (/galaxy/3.bed, /galaxy/galaxy_symlink).

Not all symbolic links are dereferenced because it is assumed that if an administrator links to a path in the import directory which itself is (or contains) links, that is the preferred path for accessing the data.

Oliver Hofmann wrote:

...
Dear all,

to echo what Abhi said: we are also currently looking of ways to automatically import data sets (libraries) into Galaxy without having to manually trigger the import via the administration interface, and ideally while keeping the data in the original place. The idea here is to have multiple tools all point at the original 'source data' without having to replicate terabytes of data.

Not quite sure how feasible this is in practice, but it certainly would be incredibly helpful.

Best,

¬¨‚Ä† ¬¨‚Ä†Oliver

On 28 Sep 2009, at 14:24, Abhishek Pratap wrote:

...
HI Greg

Thanks for a quick reply and making some requested changes. However I am not still sure if importing NGS data will help in long run.

For Centers generating NGS data which could 2-3 T.B / week depending on no. of sequencers I think importing another copy of raw data into galaxy workspace will be asking for lot of disk space. I understand it is a neat way of doing things as it becomes agnostic of the raw data location ¬¨‚Ä†but might not be the best way for handling huge data in long run for centers like ours.

Please correct me if I am wrong. I think we could also have a simple option without having to import the data and just using it for analysis from the current location, also storing results at the same location. That way in future even if the data set is moved analysis also stays with it.

Let me know what you feel. I will be happy to know if there are any other smart reasons of importing the data in galaxy workspace just for curiosity sake.

Thanks, -Abhi

On Mon, Sep 28, 2009 at 9:28 AM, Greg Von Kuster <ghv2@psu.edu> wrote: Hello Abhishek,

The Galaxy distribution includes the enhancements to which I previously referred for uploading history files. ¬¨‚Ä†Uploading files to a history now creates a Galaxy job just like any other tool, and can be run on a cluster node, allowing upload of very large files. ¬¨‚Ä†The initial pass of this work is also completed for uploading to a Data Library, but this enhancement is still in test, so it should soon be available in the distribution.

Do you want to avoid having to import at all (e.g. allow Galaxy to refer to datasets that live in their original locations)? ¬¨‚Ä†This is not currently possible, but if this is what you are looking for, we can consider some additional options on the current upload form, or possibly a new, separate form.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote: Hi Greg, Anton and all

Just wondering if there has been any progress made on this end. I am sorry I was not able to follow it up on Assaf's suggestion due to other things at work.

I did try the latest version of galaxy and looks like the files are still transferred over HTTP before they could be used in the galaxy workspace. Also I would again like to highlight that many labs might want to use the local instance of galaxy and prefer to point to a local path where the file is being stored. That way we will have both the benefits of using a cool GUI and process data stored locally.

Let me know if you guys need some feedback or have more questions. I will be happy to discuss them.

best, -Abhi

On Tue, Jul 21, 2009 at 4:26 PM, Greg Von Kuster <ghv2@psu.edu <mailto:ghv2@psu.edu>> wrote:

¬¨‚Ä† Hello Abishek,

¬¨‚Ä† We are currently in the process of significantly enhancing the ¬¨‚Ä† current Galaxy upload utilities, and the new version should ¬¨‚Ä† eliminate the issue you've raised about the time needed to upload ¬¨‚Ä† large files via HTTP ( not for making an initial copy of the file in ¬¨‚Ä† the Galaxy environment ). However, it will probably not be ready for ¬¨‚Ä† release for a few more weeks, so if you can take advantage of ¬¨‚Ä† Assaf's script in the meantime, that's great. ¬¨¬Æ‚àö√∫I can't guarantee ¬¨‚Ä† that all Galaxy features will function correctly if you do this though.

¬¨‚Ä† Assaf, have you found that using your script breaks anything?

¬¨‚Ä† Also, if you upload a file to a library rather than a history, ¬¨‚Ä† multiple users can "import" the library dataset into their history ¬¨‚Ä† for analysis, but there is only 1 file on disk ( users are pointing ¬¨‚Ä† to it from their histories ). ¬¨¬Æ‚àö√∫But uploading a file to a history ¬¨‚Ä† will create a new copy of the file each time it is uploaded.

¬¨‚Ä† Greg Von Kuster ¬¨‚Ä† Galaxy Development Team

¬¨‚Ä† Abhishek Pratap wrote:

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Hi All

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† @Greg : Please find my comments below.

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† On Tue, Jul 21, 2009 at 10:44 AM, Greg Von Kuster<ghv2@psu.edu ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† <mailto:ghv2@psu.edu>> wrote:

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Hello Abhi,

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Can you clarify the steps you took that produced the ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† behavior? ‚àö√°‚àÜ√≠‚àö√∫See my

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† comments below.

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Anton Nekrutenko wrote:

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Abhishek:

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Let talk. This is the area of active current ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† development. We are ‚àö√°‚àÜ√≠‚àö√∫looking

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† at implementing a universal fastq-like format or ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† supporting ‚àö√°‚àÜ√≠‚àö√∫multiple

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† formats. Perhaps we should join efforts in ironing out ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ‚àö√°‚àÜ√≠‚àö√∫specifications.

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† anton ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† galaxy team

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† On Jul 20, 2009, at 5:18 PM, Abhishek Pratap wrote:

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Hi All

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† I recently came to know about NGS analysis on galaxy ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† during ISMB. ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Getting excited I tried couple of things basically ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† to play with it.

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Few comments : I may have interepretted something ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† described below in a ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† wrong way. My apologies before hand.

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† On a standalone installation of galaxy while I was ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† trying to explore ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† one FASTQ(sequence) file. It takes considerable (> ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† 20 min) for a fastq ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† file to get uploaded (2 GB).

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Are you using the Galaxy upload utility to create an item in ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† your history ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† that points to the dataset file on disk?

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Yes that is precisely correct, I am trying to upload a solexa FASTQ ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† file but on a standalone galaxy installation from my local file ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† system.

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† I am not sure what is the rationale

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† behind that. Ideally I think there should be no need ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† to upload such ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† heavy files into the workspace.

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† A data file that originates from a place external to Galaxy ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† must be uploaded ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† into Galaxy so that the disk file can be placed in the ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† location configured ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† in the Galaxy config file. ‚àö√°‚àÜ√≠‚àö√∫Also, when data is uploaded to

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Galaxy ( either ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† to a history or a library ), several database table settings ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† are created ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† that are used by various Galaxy features.

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† They could actually be used straight

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Thanks for the clarification but I am not sure this will help a ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† lot of ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† people who are interested to install and run galaxy locally ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† mainly for ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† the following reasons. May be it is just local to me.

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† A. We already one instance of data saved on the local file system ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† B. Making another copy via galaxy will eat away a lot of space ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† in long run. ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† C. The time needed to import the files into galaxy space is huge

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† away by the path specified.

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† What do you mean by "the path specified"?

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Well what I mean was a way to specify the path of the file/run ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† on the ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† lcoal file system and galaxy could directly pick it up from there ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† rather than uploading it into its own space. Now I understand this ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† might not work based on the way the system was designed.

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Also is there any way to access the

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† scripts for analysis on the command line. I know ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† this undermines the ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† main aim of working with galaxy but rite now I am ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† concerned about the ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† performance/time.

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† You should be able to run any Galaxy tool from the command ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† line as long as ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† you have all of the tool's required binaries in your path. ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ‚àö√°‚àÜ√≠‚àö√∫However, running

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† a tool from within Galaxy should generally not be any slower ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† than running it ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† outside of Galaxy, depending, of course, on what you are doing.

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Ok I was under the impression that running from SHELL will eliminate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† the step of uploading them into galaxy file space.

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† -Abhi

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† I will be happy to discuss more about this in case ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† you have some ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† comments/questions for me.

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Best, ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† -Abhi

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† -----------------------------

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Abhishek Pratap

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Bioinformatics Software Engineer

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Institute for Genome Sciences

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† School of Medicine, Univ of Maryland

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† 801, W. Baltimore Street, Baltimore, MD 21209

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Ph: (+1)-410-706-2296

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† www.igs.umaryland.edu/ <http://www.igs.umaryland.edu/> ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† _______________________________________________ ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† galaxy-user mailing list ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Anton Nekrutenko ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† http://nekrut.bx.psu.edu ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† http://galaxyproject.org

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† _______________________________________________ ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† galaxy-user mailing list ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user -- Research Associate ¬¨‚Ä† ¬¨‚Ä†Department of Biostatistics Associate Director ¬¨‚Ä† ¬¨‚Ä†Bioinformatics Core ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä†Harvard School of Public Health Skype: ohofmann ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Phone: +1 (617) 365 0984

Abhishek Pratap

7:37 p.m.

Here it is :

...

hg heads

changeset: 2824:d97f4e86be45 tag: tip parent: 2823:2c0c81150dbd parent: 2821:04a753865407 user: Anton Nekrutenko <anton@bx.psu.edu> date: Fri Oct 02 11:02:14 2009 -0400 summary: merge -Abhi On Fri, Oct 2, 2009 at 3:30 PM, Greg Von Kuster <ghv2@psu.edu> wrote:

...

Please type the following in your galaxy install directory, and let me know what you get:

hg heads

Thanks

Abhishek Pratap wrote:

...
Hi Greg

Unfortunately it is not working for me. I made sure I cleared my browser cache before re-viewing it.

I have set the option as suggested by you in the universe_wsgi.ini file.

-Abhi

On Fri, Oct 2, 2009 at 2:53 PM, Greg Von Kuster <ghv2@psu.edu> wrote:

...
Hello Abhishek,

Add this to your universe_wsgi.ini file:

allow_library_path_paste = True

Then, clicking the down-arrow on the upload form

Create new data library datasets ¬†‚ñº

will give you 4 options, 1 of which is:

Upload files from file system paths

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote:

...
Hi Greg

I have updated my galaxy rep to changeset 2825. I dont see the checkbox on the "Upload File" page. Am I missing something ?

Thanks, -Abhi

On Fri, Oct 2, 2009 at 10:21 AM, Greg Von Kuster <ghv2@psu.edu> wrote:

...
Change set 2812 will be included in a release to the distribution today - here are details of a new option that we're hoping will provide what is needed for most labs.

Add a new option, 'allow_library_path_paste' that adds a new upload page ("Upload files from file system paths") to the admin-side library upload pages. This form contains a textarea that allows Galaxy admins to paste any number of file system paths (files or directories) from which Galaxy will import library datasets, saving the directory structure (if desired). ¬¨‚Ä†Since such ability allows admins access to any file on the Galaxy server which is readable by Galaxy's system user, this option is disabled by default, and system administrators should take care in assigning Galaxy administrators when this feature is enabled. ¬¨‚Ä†Controls on what files are accessible to this tool based on ownership or other properties can be added at a later date if there is sufficient interest for such features.

This commit also includes a checkbox on the "Upload directory of files" page (as well as the new "Upload files from file system paths" page above) that will prevent Galaxy from copying data to its files directory (by default, 'database/files/'). ¬¨‚Ä†This is useful for large library datasets that live in their own managed locations on the file system, this will prevent the existence of duplicate copies of datasets (but means administrators must take care to manage data - moving or removing the data from its Galaxy-external location will render these datasets invalid within Galaxy).

One unique feature to be aware of: when using the "Copy data into Galaxy?" checkbox on the "Upload directory of files" page, any symbolic links encountered in the chosen import directory will be made absolute and dereferenced ONCE. ¬¨‚Ä†This allows administrators to link large datasets to the import directory, rather than having to make full copies, while being able to delete such links after importing. ¬¨‚Ä†Only the first symlink (the one in the import directory itself) is dereferenced; all others remain. ¬¨‚Ä†See the following for an example:

library_import_dir = /galaxy/import

% ls -lR /galaxy/import /galaxy/import: total 6 drwxr-xr-x ¬¨‚Ä† 2 nate ¬¨‚Ä† ¬¨‚Ä† nate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† 512 Oct ¬¨‚Ä†1 11:31 link/

/galaxy/import/link: total 10 lrwxrwxrwx ¬¨‚Ä† 1 nate ¬¨‚Ä† ¬¨‚Ä† nate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä†71 Oct ¬¨‚Ä†1 10:38 1.bed -> ../../../home/nate/galaxy/test-data/1.bed lrwxrwxrwx ¬¨‚Ä† 1 nate ¬¨‚Ä† ¬¨‚Ä† nate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä†60 Oct ¬¨‚Ä†1 10:38 2.bed -> /home/nate/galaxy/test-data/2.bed lrwxrwxrwx ¬¨‚Ä† 1 nate ¬¨‚Ä† ¬¨‚Ä† nate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä†11 Oct ¬¨‚Ä†1 10:38 3.bed -> ../../3.bed lrwxrwxrwx ¬¨‚Ä† 1 nate ¬¨‚Ä† ¬¨‚Ä† nate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä†35 Oct ¬¨‚Ä†1 11:30 4.bed -> ../../galaxy_symlink/test-data/4.bed lrwxrwxrwx ¬¨‚Ä† 1 nate ¬¨‚Ä† ¬¨‚Ä† nate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä†41 Oct ¬¨‚Ä†1 11:31 5.bed -> /galaxy/galaxy_symlink/test-data/5.bed

% ls -l /galaxy/3.bed lrwxrwxrwx ¬¨‚Ä† 1 nate ¬¨‚Ä† ¬¨‚Ä† nate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä†60 Oct ¬¨‚Ä†1 10:39 /galaxy/3.bed -> /home/nate/galaxy/test-data/3.bed

% ls -l /galaxy/galaxy_symlink lrwxrwxrwx ¬¨‚Ä† 1 nate ¬¨‚Ä† ¬¨‚Ä† nate ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä†44 Oct ¬¨‚Ä†1 11:30 /galaxy/galaxy_symlink -> /home/nate/galaxy/

In this example,

1.bed is a relative symbolic link to the real 1.bed.

2.bed is an absolute symlink to the real 2.bed.

3.bed is a relative symlink to ../../3.bed, aka /galaxy/3.bed, which itself is a symlink to the real 3.bed.

4.bed is a relative symlink which follows another symlink (/galaxy/galaxy_symlink) to the real 4.bed.

5.bed is an absolute symlink in the same fashion as 4.bed

If the 'link' server directory is chosen on the "Upload directory of files" page, and "Copy data into Galaxy?" is checked "No", the following files will be referenced by Galaxy:

/home/nate/galaxy/test-data/1.bed /home/nate/galaxy/test-data/2.bed /galaxy/3.bed /galaxy/galaxy_symlink/test-data/4.bed /galaxy/galaxy_symlink/test-data/5.bed

The Galaxy administrator may now safely delete /galaxy/import/link, but should take care not to remove the referenced symbolic links (/galaxy/3.bed, /galaxy/galaxy_symlink).

Not all symbolic links are dereferenced because it is assumed that if an administrator links to a path in the import directory which itself is (or contains) links, that is the preferred path for accessing the data.

Oliver Hofmann wrote:

...
Dear all,

to echo what Abhi said: we are also currently looking of ways to automatically import data sets (libraries) into Galaxy without having to manually trigger the import via the administration interface, and ideally while keeping the data in the original place. The idea here is to have multiple tools all point at the original 'source data' without having to replicate terabytes of data.

Not quite sure how feasible this is in practice, but it certainly would be incredibly helpful.

Best,

¬¨‚Ä† ¬¨‚Ä†Oliver

On 28 Sep 2009, at 14:24, Abhishek Pratap wrote:

> HI Greg > > Thanks for a quick reply and making some requested changes. However I > am > not still sure if importing NGS data will help in long run. > > For Centers generating NGS data which could 2-3 T.B / week depending > on > no. of sequencers I think importing another copy of raw data into > galaxy > workspace will be asking for lot of disk space. I understand it is a > neat > way of doing things as it becomes agnostic of the raw data location > ¬¨‚Ä†but > might not be the best way for handling huge data in long run for > centers > like ours. > > Please correct me if I am wrong. I think we could also have a simple > option without having to import the data and just using it for > analysis > from > the current location, also storing results at the same location. That > way in > future even if the data set is moved analysis also stays with it. > > Let me know what you feel. I will be happy to know if there are any > other > smart reasons of importing the data in galaxy workspace just for > curiosity > sake. > > Thanks, > -Abhi > > On Mon, Sep 28, 2009 at 9:28 AM, Greg Von Kuster <ghv2@psu.edu> > wrote: > Hello Abhishek, > > The Galaxy distribution includes the enhancements to which I > previously > referred for uploading history files. ¬¨‚Ä†Uploading files to a > history > now > creates a Galaxy job just like any other tool, and can be run on a > cluster > node, allowing upload of very large files. ¬¨‚Ä†The initial pass of > this > work is > also completed for uploading to a Data Library, but this enhancement > is > still in test, so it should soon be available in the distribution. > > Do you want to avoid having to import at all (e.g. allow Galaxy to > refer > to datasets that live in their original locations)? ¬¨‚Ä†This is not > currently > possible, but if this is what you are looking for, we can consider > some > additional options on the current upload form, or possibly a new, > separate > form. > > > Greg Von Kuster > Galaxy Development Team > > > Abhishek Pratap wrote: > Hi Greg, Anton and all > > Just wondering if there has been any progress made on this end. I am > sorry I was not able to follow it up on Assaf's suggestion due to > other > things at work. > > I did try the latest version of galaxy and looks like the files are > still > transferred over HTTP before they could be used in the galaxy > workspace. > Also I would again like to highlight that many labs might want to use > the > local instance of galaxy and prefer to point to a local path where > the > file > is being stored. That way we will have both the benefits of using a > cool GUI > and process data stored locally. > > Let me know if you guys need some feedback or have more questions. I > will > be happy to discuss them. > > best, > -Abhi > > On Tue, Jul 21, 2009 at 4:26 PM, Greg Von Kuster <ghv2@psu.edu > <mailto:ghv2@psu.edu>> wrote: > > ¬¨‚Ä† Hello Abishek, > > ¬¨‚Ä† We are currently in the process of significantly enhancing the > ¬¨‚Ä† current Galaxy upload utilities, and the new version should > ¬¨‚Ä† eliminate the issue you've raised about the time needed to > upload > ¬¨‚Ä† large files via HTTP ( not for making an initial copy of the > file in > ¬¨‚Ä† the Galaxy environment ). However, it will probably not be > ready for > ¬¨‚Ä† release for a few more weeks, so if you can take advantage of > ¬¨‚Ä† Assaf's script in the meantime, that's great. ¬¨¬Æ‚àö√∫I can't > guarantee > ¬¨‚Ä† that all Galaxy features will function correctly if you do this > though. > > ¬¨‚Ä† Assaf, have you found that using your script breaks anything? > > ¬¨‚Ä† Also, if you upload a file to a library rather than a history, > ¬¨‚Ä† multiple users can "import" the library dataset into their > history > ¬¨‚Ä† for analysis, but there is only 1 file on disk ( users are > pointing > ¬¨‚Ä† to it from their histories ). ¬¨¬Æ‚àö√∫But uploading a file to > a history > ¬¨‚Ä† will create a new copy of the file each time it is uploaded. > > ¬¨‚Ä† Greg Von Kuster > ¬¨‚Ä† Galaxy Development Team > > > > ¬¨‚Ä† Abhishek Pratap wrote: > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Hi All > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† @Greg : Please find my comments below. > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† On Tue, Jul 21, 2009 at 10:44 AM, Greg Von > Kuster<ghv2@psu.edu > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† <mailto:ghv2@psu.edu>> wrote: > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Hello Abhi, > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Can you clarify the steps you took that > produced the > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† behavior? ‚àö√°‚àÜ√≠‚àö√∫See my > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† comments below. > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Anton Nekrutenko wrote: > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Abhishek: > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Let talk. This is the area > of active current > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† development. We are > ‚àö√°‚àÜ√≠‚àö√∫looking > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† at implementing a universal > fastq-like format or > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† supporting > ‚àö√°‚àÜ√≠‚àö√∫multiple > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† formats. Perhaps we should > join efforts in ironing > out > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† > ‚àö√°‚àÜ√≠‚àö√∫specifications. > > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† anton > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† galaxy team > > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† On Jul 20, 2009, at 5:18 > PM, Abhishek Pratap > wrote: > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Hi All > > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† I recently came > to know about NGS analysis > on galaxy > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† during ISMB. > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Getting excited > I tried couple of things > basically > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† to play with > it. > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Few comments : > I may have interepretted > something > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† described below > in a > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† wrong way. My > apologies before hand. > > > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† On a standalone > installation of galaxy while > I was > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† trying to > explore > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† one > FASTQ(sequence) file. It takes > considerable (> > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† 20 min) for a > fastq > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† file to get > uploaded (2 GB). > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Are you using the Galaxy upload utility > to create an > item in > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† your history > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† that points to the dataset file on > disk? > > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Yes that is precisely correct, I am trying to > upload a solexa > FASTQ > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† file but on a standalone galaxy installation from > my local > file > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† system. > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† I am not sure what is the rationale > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† behind that. > Ideally I think there should be > no need > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† to upload such > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† heavy files > into the workspace. > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† A data file that originates from a > place external to > Galaxy > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† must be uploaded > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† into Galaxy so that the disk file can > be placed in the > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† location configured > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† in the Galaxy config file. > ‚àö√°‚àÜ√≠‚àö√∫Also, when data is > uploaded to > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Galaxy ( either > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† to a history or a library ), several > database table > settings > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† are created > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† that are used by various Galaxy > features. > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† They could actually be used straight > > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Thanks for the clarification but I am not sure this > will help > a > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† lot of > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† people who are interested to install and run galaxy > locally > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† mainly for > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† the following reasons. May be it is just local to > me. > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† A. We already one instance of data saved on the > local file > system > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† B. Making another copy via galaxy will eat away a > lot of space > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† in long run. > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† C. The time needed to import the files into galaxy > space is > huge > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† away by the > path specified. > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† What do you mean by "the path > specified"? > > > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Well what I mean was a way to specify the path of > the file/run > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† on the > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† lcoal file system and galaxy could directly pick it > up from > there > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† rather than uploading it into its own space. Now I > understand > this > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† might not work based on the way the system was > designed. > > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Also is there any way to access the > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† scripts for > analysis on the command line. I > know > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† this undermines > the > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† main aim of > working with galaxy but rite now > I am > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† concerned about > the > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† > performance/time. > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† You should be able to run any Galaxy > tool from the > command > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† line as long as > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† you have all of the tool's required > binaries in your > path. > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ‚àö√°‚àÜ√≠‚àö√∫However, running > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† a tool from within Galaxy should > generally not be any > slower > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† than running it > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† outside of Galaxy, depending, of > course, on what you are > doing. > > > > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Ok I was under the impression that running from > SHELL will > eliminate > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† the step of uploading them into galaxy file space. > > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† -Abhi > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† I will be happy > to discuss more about this > in case > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† you have some > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† > comments/questions for me. > > > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Best, > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† -Abhi > > > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† > ----------------------------- > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Abhishek Pratap > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Bioinformatics > Software Engineer > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Institute for > Genome Sciences > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† School of > Medicine, Univ of Maryland > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† 801, W. > Baltimore Street, Baltimore, MD > 21209 > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Ph: > (+1)-410-706-2296 > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† > www.igs.umaryland.edu/ > <http://www.igs.umaryland.edu/> > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† > _______________________________________________ > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† galaxy-user > mailing list > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† > galaxy-user@bx.psu.edu > <mailto:galaxy-user@bx.psu.edu> > > > http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Anton Nekrutenko > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† http://nekrut.bx.psu.edu > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† http://galaxyproject.org > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† > _______________________________________________ > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† galaxy-user mailing list > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† galaxy-user@bx.psu.edu > <mailto:galaxy-user@bx.psu.edu> > > ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† > http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user > > > > > > > > > _______________________________________________ > galaxy-user mailing list > galaxy-user@bx.psu.edu > http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

-- Research Associate ¬¨‚Ä† ¬¨‚Ä†Department of Biostatistics Associate Director ¬¨‚Ä† ¬¨‚Ä†Bioinformatics Core ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä†Harvard School of Public Health Skype: ohofmann ¬¨‚Ä† ¬¨‚Ä† ¬¨‚Ä† Phone: +1 (617) 365 0984

Greg Von Kuster

7:46 p.m.

New subject: Experience with Loading NGS data on standalone instance of galaxy

Ok, you have the change set. Perhaps you are trying to upload into a history rather than a Data Library? This new feature is only available for uploading into a Data Library. Can you confirm? I'm not sure what else could be causing you to not see the 4 options on the drop-down menu for uploading dataset into a Data Library. Greg Abhishek Pratap wrote:

...

Here it is :

...
hg heads

changeset: 2824:d97f4e86be45 tag: tip parent: 2823:2c0c81150dbd parent: 2821:04a753865407 user: Anton Nekrutenko <anton@bx.psu.edu> date: Fri Oct 02 11:02:14 2009 -0400 summary: merge

-Abhi

On Fri, Oct 2, 2009 at 3:30 PM, Greg Von Kuster <ghv2@psu.edu> wrote:

...
Please type the following in your galaxy install directory, and let me know what you get:

hg heads

Thanks

Abhishek Pratap wrote:

...
Hi Greg

Unfortunately it is not working for me. I made sure I cleared my browser cache before re-viewing it.

I have set the option as suggested by you in the universe_wsgi.ini file.

-Abhi

On Fri, Oct 2, 2009 at 2:53 PM, Greg Von Kuster <ghv2@psu.edu> wrote:

...
Hello Abhishek,

Add this to your universe_wsgi.ini file:

allow_library_path_paste = True

Then, clicking the down-arrow on the upload form

Create new data library datasets ¬¨‚Ä†‚Äö√±¬∫

will give you 4 options, 1 of which is:

Upload files from file system paths

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote:

...
Hi Greg

I have updated my galaxy rep to changeset 2825. I dont see the checkbox on the "Upload File" page. Am I missing something ?

Thanks, -Abhi

On Fri, Oct 2, 2009 at 10:21 AM, Greg Von Kuster <ghv2@psu.edu> wrote:

...
Change set 2812 will be included in a release to the distribution today - here are details of a new option that we're hoping will provide what is needed for most labs.

Add a new option, 'allow_library_path_paste' that adds a new upload page ("Upload files from file system paths") to the admin-side library upload pages. This form contains a textarea that allows Galaxy admins to paste any number of file system paths (files or directories) from which Galaxy will import library datasets, saving the directory structure (if desired). ¬¨¬®‚Äö√Ñ‚Ä†Since such ability allows admins access to any file on the Galaxy server which is readable by Galaxy's system user, this option is disabled by default, and system administrators should take care in assigning Galaxy administrators when this feature is enabled. ¬¨¬®‚Äö√Ñ‚Ä†Controls on what files are accessible to this tool based on ownership or other properties can be added at a later date if there is sufficient interest for such features.

This commit also includes a checkbox on the "Upload directory of files" page (as well as the new "Upload files from file system paths" page above) that will prevent Galaxy from copying data to its files directory (by default, 'database/files/'). ¬¨¬®‚Äö√Ñ‚Ä†This is useful for large library datasets that live in their own managed locations on the file system, this will prevent the existence of duplicate copies of datasets (but means administrators must take care to manage data - moving or removing the data from its Galaxy-external location will render these datasets invalid within Galaxy).

One unique feature to be aware of: when using the "Copy data into Galaxy?" checkbox on the "Upload directory of files" page, any symbolic links encountered in the chosen import directory will be made absolute and dereferenced ONCE. ¬¨¬®‚Äö√Ñ‚Ä†This allows administrators to link large datasets to the import directory, rather than having to make full copies, while being able to delete such links after importing. ¬¨¬®‚Äö√Ñ‚Ä†Only the first symlink (the one in the import directory itself) is dereferenced; all others remain. ¬¨¬®‚Äö√Ñ‚Ä†See the following for an example:

library_import_dir = /galaxy/import

% ls -lR /galaxy/import /galaxy/import: total 6 drwxr-xr-x ¬¨¬®‚Äö√Ñ‚Ä† 2 nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† 512 Oct ¬¨¬®‚Äö√Ñ‚Ä†1 11:31 link/

/galaxy/import/link: total 10 lrwxrwxrwx ¬¨¬®‚Äö√Ñ‚Ä† 1 nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä†71 Oct ¬¨¬®‚Äö√Ñ‚Ä†1 10:38 1.bed -> ../../../home/nate/galaxy/test-data/1.bed lrwxrwxrwx ¬¨¬®‚Äö√Ñ‚Ä† 1 nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä†60 Oct ¬¨¬®‚Äö√Ñ‚Ä†1 10:38 2.bed -> /home/nate/galaxy/test-data/2.bed lrwxrwxrwx ¬¨¬®‚Äö√Ñ‚Ä† 1 nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä†11 Oct ¬¨¬®‚Äö√Ñ‚Ä†1 10:38 3.bed -> ../../3.bed lrwxrwxrwx ¬¨¬®‚Äö√Ñ‚Ä† 1 nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä†35 Oct ¬¨¬®‚Äö√Ñ‚Ä†1 11:30 4.bed -> ../../galaxy_symlink/test-data/4.bed lrwxrwxrwx ¬¨¬®‚Äö√Ñ‚Ä† 1 nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä†41 Oct ¬¨¬®‚Äö√Ñ‚Ä†1 11:31 5.bed -> /galaxy/galaxy_symlink/test-data/5.bed

% ls -l /galaxy/3.bed lrwxrwxrwx ¬¨¬®‚Äö√Ñ‚Ä† 1 nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä†60 Oct ¬¨¬®‚Äö√Ñ‚Ä†1 10:39 /galaxy/3.bed -> /home/nate/galaxy/test-data/3.bed

% ls -l /galaxy/galaxy_symlink lrwxrwxrwx ¬¨¬®‚Äö√Ñ‚Ä† 1 nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä†44 Oct ¬¨¬®‚Äö√Ñ‚Ä†1 11:30 /galaxy/galaxy_symlink -> /home/nate/galaxy/

In this example,

1.bed is a relative symbolic link to the real 1.bed.

2.bed is an absolute symlink to the real 2.bed.

3.bed is a relative symlink to ../../3.bed, aka /galaxy/3.bed, which itself is a symlink to the real 3.bed.

4.bed is a relative symlink which follows another symlink (/galaxy/galaxy_symlink) to the real 4.bed.

5.bed is an absolute symlink in the same fashion as 4.bed

If the 'link' server directory is chosen on the "Upload directory of files" page, and "Copy data into Galaxy?" is checked "No", the following files will be referenced by Galaxy:

/home/nate/galaxy/test-data/1.bed /home/nate/galaxy/test-data/2.bed /galaxy/3.bed /galaxy/galaxy_symlink/test-data/4.bed /galaxy/galaxy_symlink/test-data/5.bed

The Galaxy administrator may now safely delete /galaxy/import/link, but should take care not to remove the referenced symbolic links (/galaxy/3.bed, /galaxy/galaxy_symlink).

Not all symbolic links are dereferenced because it is assumed that if an administrator links to a path in the import directory which itself is (or contains) links, that is the preferred path for accessing the data.

Oliver Hofmann wrote: > Dear all, > > > to echo what Abhi said: we are also currently looking of ways to > automatically import data sets (libraries) into Galaxy without having > to > manually trigger the import via the administration interface, and > ideally > while keeping the data in the original place. The idea here is to have > multiple tools all point at the original 'source data' without having > to > replicate terabytes of data. > > Not quite sure how feasible this is in practice, but it certainly > would > be > incredibly helpful. > > Best, > > ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä†Oliver > > > > > On 28 Sep 2009, at 14:24, Abhishek Pratap wrote: > >> HI Greg >> >> Thanks for a quick reply and making some requested changes. However I >> am >> not still sure if importing NGS data will help in long run. >> >> For Centers generating NGS data which could 2-3 T.B / week depending >> on >> no. of sequencers I think importing another copy of raw data into >> galaxy >> workspace will be asking for lot of disk space. I understand it is a >> neat >> way of doing things as it becomes agnostic of the raw data location >> ¬¨¬®‚Äö√Ñ‚Ä†but >> might not be the best way for handling huge data in long run for >> centers >> like ours. >> >> Please correct me if I am wrong. I think we could also have a simple >> option without having to import the data and just using it for >> analysis >> from >> the current location, also storing results at the same location. That >> way in >> future even if the data set is moved analysis also stays with it. >> >> Let me know what you feel. I will be happy to know if there are any >> other >> smart reasons of importing the data in galaxy workspace just for >> curiosity >> sake. >> >> Thanks, >> -Abhi >> >> On Mon, Sep 28, 2009 at 9:28 AM, Greg Von Kuster <ghv2@psu.edu> >> wrote: >> Hello Abhishek, >> >> The Galaxy distribution includes the enhancements to which I >> previously >> referred for uploading history files. ¬¨¬®‚Äö√Ñ‚Ä†Uploading files to a >> history >> now >> creates a Galaxy job just like any other tool, and can be run on a >> cluster >> node, allowing upload of very large files. ¬¨¬®‚Äö√Ñ‚Ä†The initial pass of >> this >> work is >> also completed for uploading to a Data Library, but this enhancement >> is >> still in test, so it should soon be available in the distribution. >> >> Do you want to avoid having to import at all (e.g. allow Galaxy to >> refer >> to datasets that live in their original locations)? ¬¨¬®‚Äö√Ñ‚Ä†This is not >> currently >> possible, but if this is what you are looking for, we can consider >> some >> additional options on the current upload form, or possibly a new, >> separate >> form. >> >> >> Greg Von Kuster >> Galaxy Development Team >> >> >> Abhishek Pratap wrote: >> Hi Greg, Anton and all >> >> Just wondering if there has been any progress made on this end. I am >> sorry I was not able to follow it up on Assaf's suggestion due to >> other >> things at work. >> >> I did try the latest version of galaxy and looks like the files are >> still >> transferred over HTTP before they could be used in the galaxy >> workspace. >> Also I would again like to highlight that many labs might want to use >> the >> local instance of galaxy and prefer to point to a local path where >> the >> file >> is being stored. That way we will have both the benefits of using a >> cool GUI >> and process data stored locally. >> >> Let me know if you guys need some feedback or have more questions. I >> will >> be happy to discuss them. >> >> best, >> -Abhi >> >> On Tue, Jul 21, 2009 at 4:26 PM, Greg Von Kuster <ghv2@psu.edu >> <mailto:ghv2@psu.edu>> wrote: >> >> ¬¨¬®‚Äö√Ñ‚Ä† Hello Abishek, >> >> ¬¨¬®‚Äö√Ñ‚Ä† We are currently in the process of significantly enhancing the >> ¬¨¬®‚Äö√Ñ‚Ä† current Galaxy upload utilities, and the new version should >> ¬¨¬®‚Äö√Ñ‚Ä† eliminate the issue you've raised about the time needed to >> upload >> ¬¨¬®‚Äö√Ñ‚Ä† large files via HTTP ( not for making an initial copy of the >> file in >> ¬¨¬®‚Äö√Ñ‚Ä† the Galaxy environment ). However, it will probably not be >> ready for >> ¬¨¬®‚Äö√Ñ‚Ä† release for a few more weeks, so if you can take advantage of >> ¬¨¬®‚Äö√Ñ‚Ä† Assaf's script in the meantime, that's great. ¬¨¬®¬¨√Ü‚Äö√†√∂‚àö‚à´I can't >> guarantee >> ¬¨¬®‚Äö√Ñ‚Ä† that all Galaxy features will function correctly if you do this >> though. >> >> ¬¨¬®‚Äö√Ñ‚Ä† Assaf, have you found that using your script breaks anything? >> >> ¬¨¬®‚Äö√Ñ‚Ä† Also, if you upload a file to a library rather than a history, >> ¬¨¬®‚Äö√Ñ‚Ä† multiple users can "import" the library dataset into their >> history >> ¬¨¬®‚Äö√Ñ‚Ä† for analysis, but there is only 1 file on disk ( users are >> pointing >> ¬¨¬®‚Äö√Ñ‚Ä† to it from their histories ). ¬¨¬®¬¨√Ü‚Äö√†√∂‚àö‚à´But uploading a file to >> a history >> ¬¨¬®‚Äö√Ñ‚Ä† will create a new copy of the file each time it is uploaded. >> >> ¬¨¬®‚Äö√Ñ‚Ä† Greg Von Kuster >> ¬¨¬®‚Äö√Ñ‚Ä† Galaxy Development Team >> >> >> >> ¬¨¬®‚Äö√Ñ‚Ä† Abhishek Pratap wrote: >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Hi All >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† @Greg : Please find my comments below. >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† On Tue, Jul 21, 2009 at 10:44 AM, Greg Von >> Kuster<ghv2@psu.edu >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† <mailto:ghv2@psu.edu>> wrote: >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Hello Abhi, >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Can you clarify the steps you took that >> produced the >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† behavior? ‚Äö√†√∂‚àö¬∞‚Äö√†√ú‚àö‚â†‚Äö√†√∂‚àö‚à´See my >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† comments below. >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Anton Nekrutenko wrote: >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Abhishek: >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Let talk. This is the area >> of active current >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† development. We are >> ‚Äö√†√∂‚àö¬∞‚Äö√†√ú‚àö‚â†‚Äö√†√∂‚àö‚à´looking >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† at implementing a universal >> fastq-like format or >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† supporting >> ‚Äö√†√∂‚àö¬∞‚Äö√†√ú‚àö‚â†‚Äö√†√∂‚àö‚à´multiple >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† formats. Perhaps we should >> join efforts in ironing >> out >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >> ‚Äö√†√∂‚àö¬∞‚Äö√†√ú‚àö‚â†‚Äö√†√∂‚àö‚à´specifications. >> >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† anton >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† galaxy team >> >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† On Jul 20, 2009, at 5:18 >> PM, Abhishek Pratap >> wrote: >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Hi All >> >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† I recently came >> to know about NGS analysis >> on galaxy >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† during ISMB. >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Getting excited >> I tried couple of things >> basically >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† to play with >> it. >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Few comments : >> I may have interepretted >> something >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† described below >> in a >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† wrong way. My >> apologies before hand. >> >> >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† On a standalone >> installation of galaxy while >> I was >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† trying to >> explore >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† one >> FASTQ(sequence) file. It takes >> considerable (> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† 20 min) for a >> fastq >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† file to get >> uploaded (2 GB). >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Are you using the Galaxy upload utility >> to create an >> item in >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† your history >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† that points to the dataset file on >> disk? >> >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Yes that is precisely correct, I am trying to >> upload a solexa >> FASTQ >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† file but on a standalone galaxy installation from >> my local >> file >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† system. >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† I am not sure what is the rationale >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† behind that. >> Ideally I think there should be >> no need >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† to upload such >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† heavy files >> into the workspace. >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† A data file that originates from a >> place external to >> Galaxy >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† must be uploaded >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† into Galaxy so that the disk file can >> be placed in the >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† location configured >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† in the Galaxy config file. >> ‚Äö√†√∂‚àö¬∞‚Äö√†√ú‚àö‚â†‚Äö√†√∂‚àö‚à´Also, when data is >> uploaded to >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Galaxy ( either >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† to a history or a library ), several >> database table >> settings >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† are created >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† that are used by various Galaxy >> features. >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† They could actually be used straight >> >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Thanks for the clarification but I am not sure this >> will help >> a >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† lot of >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† people who are interested to install and run galaxy >> locally >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† mainly for >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† the following reasons. May be it is just local to >> me. >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† A. We already one instance of data saved on the >> local file >> system >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† B. Making another copy via galaxy will eat away a >> lot of space >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† in long run. >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† C. The time needed to import the files into galaxy >> space is >> huge >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† away by the >> path specified. >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† What do you mean by "the path >> specified"? >> >> >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Well what I mean was a way to specify the path of >> the file/run >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† on the >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† lcoal file system and galaxy could directly pick it >> up from >> there >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† rather than uploading it into its own space. Now I >> understand >> this >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† might not work based on the way the system was >> designed. >> >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Also is there any way to access the >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† scripts for >> analysis on the command line. I >> know >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† this undermines >> the >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† main aim of >> working with galaxy but rite now >> I am >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† concerned about >> the >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >> performance/time. >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† You should be able to run any Galaxy >> tool from the >> command >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† line as long as >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† you have all of the tool's required >> binaries in your >> path. >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ‚Äö√†√∂‚àö¬∞‚Äö√†√ú‚àö‚â†‚Äö√†√∂‚àö‚à´However, running >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† a tool from within Galaxy should >> generally not be any >> slower >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† than running it >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† outside of Galaxy, depending, of >> course, on what you are >> doing. >> >> >> >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Ok I was under the impression that running from >> SHELL will >> eliminate >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† the step of uploading them into galaxy file space. >> >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† -Abhi >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† I will be happy >> to discuss more about this >> in case >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† you have some >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >> comments/questions for me. >> >> >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Best, >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† -Abhi >> >> >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >> ----------------------------- >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Abhishek Pratap >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Bioinformatics >> Software Engineer >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Institute for >> Genome Sciences >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† School of >> Medicine, Univ of Maryland >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† 801, W. >> Baltimore Street, Baltimore, MD >> 21209 >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Ph: >> (+1)-410-706-2296 >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >> www.igs.umaryland.edu/ >> <http://www.igs.umaryland.edu/> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >> _______________________________________________ >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† galaxy-user >> mailing list >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >> galaxy-user@bx.psu.edu >> <mailto:galaxy-user@bx.psu.edu> >> >> >> http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Anton Nekrutenko >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† http://nekrut.bx.psu.edu >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† http://galaxyproject.org >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >> _______________________________________________ >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† galaxy-user mailing list >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† galaxy-user@bx.psu.edu >> <mailto:galaxy-user@bx.psu.edu> >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >> http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user >> >> >> >> >> >> >> >> >> _______________________________________________ >> galaxy-user mailing list >> galaxy-user@bx.psu.edu >> http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user > -- > Research Associate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä†Department of Biostatistics > Associate Director ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä†Bioinformatics Core > ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† > ¬¨¬®‚Äö√Ñ‚Ä†Harvard School of Public Health > Skype: ohofmann ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Phone: +1 (617) 365 0984 > > >

Abhishek Pratap

7:54 p.m.

I think that could be it. I am using the "Upload File " option under the "Get Data" from the left hand menu. Sorry for the confusion if I am doing something that I am not supposed to. How do we upload data to data library ? -Abhi On Fri, Oct 2, 2009 at 3:46 PM, Greg Von Kuster <ghv2@psu.edu> wrote:

...

Ok, you have the change set. Perhaps you are trying to upload into a history rather than a Data Library? This new feature is only available for uploading into a Data Library. Can you confirm? I'm not sure what else could be causing you to not see the 4 options on the drop-down menu for uploading dataset into a Data Library.

Greg

Abhishek Pratap wrote:

...
Here it is :

...
hg heads

changeset: 2824:d97f4e86be45 tag: tip parent: 2823:2c0c81150dbd parent: 2821:04a753865407 user: Anton Nekrutenko <anton@bx.psu.edu> date: Fri Oct 02 11:02:14 2009 -0400 summary: merge

-Abhi

On Fri, Oct 2, 2009 at 3:30 PM, Greg Von Kuster <ghv2@psu.edu> wrote:

...
Please type the following in your galaxy install directory, and let me know what you get:

hg heads

Thanks

Abhishek Pratap wrote:

...
Hi Greg

Unfortunately it is not working for me. I made sure I cleared my browser cache before re-viewing it.

I have set the option as suggested by you in the universe_wsgi.ini file.

-Abhi

On Fri, Oct 2, 2009 at 2:53 PM, Greg Von Kuster <ghv2@psu.edu> wrote:

...
Hello Abhishek,

Add this to your universe_wsgi.ini file:

allow_library_path_paste = True

Then, clicking the down-arrow on the upload form

Create new data library datasets ¬¨‚Ä†‚Äö√±¬∫

will give you 4 options, 1 of which is:

Upload files from file system paths

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote:

...
Hi Greg

I have updated my galaxy rep to changeset 2825. I dont see the checkbox on the "Upload File" page. Am I missing something ?

Thanks, -Abhi

On Fri, Oct 2, 2009 at 10:21 AM, Greg Von Kuster <ghv2@psu.edu> wrote: > > Change set 2812 will be included in a release to the distribution > today > - > here are details of a new option that we're hoping will provide what > is > needed for most labs. > > Add a new option, 'allow_library_path_paste' that adds a new upload > page > ("Upload files from file system paths") to the admin-side library > upload > pages. > This form contains a textarea that allows Galaxy admins to paste any > number > of > file system paths (files or directories) from which Galaxy will > import > library > datasets, saving the directory structure (if desired). > ¬¨¬®‚Äö√Ñ‚Ä†Since such > ability > allows admins access to any file on the Galaxy server which is > readable > by > Galaxy's system user, this option is disabled by default, and system > administrators should take care in assigning Galaxy administrators > when > this > feature is enabled. ¬¨¬®‚Äö√Ñ‚Ä†Controls on what files are accessible > to this > tool > based > on ownership or other properties can be added at a later date if > there > is > sufficient interest for such features. > > This commit also includes a checkbox on the "Upload directory of > files" > page > (as well as the new "Upload files from file system paths" page above) > that > will > prevent Galaxy from copying data to its files directory (by default, > 'database/files/'). ¬¨¬®‚Äö√Ñ‚Ä†This is useful for large library > datasets that > live > in > their own managed locations on the file system, this will prevent the > existence > of duplicate copies of datasets (but means administrators must take > care > to > manage data - moving or removing the data from its Galaxy-external > location > will render these datasets invalid within Galaxy). > > One unique feature to be aware of: when using the "Copy data into > Galaxy?" > checkbox on the "Upload directory of files" page, any symbolic links > encountered in the chosen import directory will be made absolute and > dereferenced ONCE. ¬¨¬®‚Äö√Ñ‚Ä†This allows administrators to link > large > datasets to > the > import directory, rather than having to make full copies, while being > able > to > delete such links after importing. ¬¨¬®‚Äö√Ñ‚Ä†Only the first symlink > (the one > in > the > import directory itself) is dereferenced; all others remain. > ¬¨¬®‚Äö√Ñ‚Ä†See > the > following > for an example: > > library_import_dir = /galaxy/import > > % ls -lR /galaxy/import > /galaxy/import: > total 6 > drwxr-xr-x ¬¨¬®‚Äö√Ñ‚Ä† 2 nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† nate > ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† 512 > Oct ¬¨¬®‚Äö√Ñ‚Ä†1 11:31 link/ > > /galaxy/import/link: > total 10 > lrwxrwxrwx ¬¨¬®‚Äö√Ñ‚Ä† 1 nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† nate > ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† > ¬¨¬®‚Äö√Ñ‚Ä†71 Oct ¬¨¬®‚Äö√Ñ‚Ä†1 10:38 1.bed -> > ../../../home/nate/galaxy/test-data/1.bed > lrwxrwxrwx ¬¨¬®‚Äö√Ñ‚Ä† 1 nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† nate > ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† > ¬¨¬®‚Äö√Ñ‚Ä†60 Oct ¬¨¬®‚Äö√Ñ‚Ä†1 10:38 2.bed -> > /home/nate/galaxy/test-data/2.bed > lrwxrwxrwx ¬¨¬®‚Äö√Ñ‚Ä† 1 nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† nate > ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† > ¬¨¬®‚Äö√Ñ‚Ä†11 Oct ¬¨¬®‚Äö√Ñ‚Ä†1 10:38 3.bed -> > ../../3.bed > lrwxrwxrwx ¬¨¬®‚Äö√Ñ‚Ä† 1 nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† nate > ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† > ¬¨¬®‚Äö√Ñ‚Ä†35 Oct ¬¨¬®‚Äö√Ñ‚Ä†1 11:30 4.bed -> > ../../galaxy_symlink/test-data/4.bed > lrwxrwxrwx ¬¨¬®‚Äö√Ñ‚Ä† 1 nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† nate > ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† > ¬¨¬®‚Äö√Ñ‚Ä†41 Oct ¬¨¬®‚Äö√Ñ‚Ä†1 11:31 5.bed -> > /galaxy/galaxy_symlink/test-data/5.bed > > % ls -l /galaxy/3.bed > lrwxrwxrwx ¬¨¬®‚Äö√Ñ‚Ä† 1 nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† nate > ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† > ¬¨¬®‚Äö√Ñ‚Ä†60 Oct ¬¨¬®‚Äö√Ñ‚Ä†1 10:39 > /galaxy/3.bed -> > /home/nate/galaxy/test-data/3.bed > > % ls -l /galaxy/galaxy_symlink > lrwxrwxrwx ¬¨¬®‚Äö√Ñ‚Ä† 1 nate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† nate > ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† > ¬¨¬®‚Äö√Ñ‚Ä†44 Oct ¬¨¬®‚Äö√Ñ‚Ä†1 11:30 > /galaxy/galaxy_symlink > -> /home/nate/galaxy/ > > In this example, > > 1.bed is a relative symbolic link to the real 1.bed. > > 2.bed is an absolute symlink to the real 2.bed. > > 3.bed is a relative symlink to ../../3.bed, aka /galaxy/3.bed, which > itself > is > a symlink to the real 3.bed. > > 4.bed is a relative symlink which follows another symlink > (/galaxy/galaxy_symlink) to the real 4.bed. > > 5.bed is an absolute symlink in the same fashion as 4.bed > > If the 'link' server directory is chosen on the "Upload directory of > files" > page, and "Copy data into Galaxy?" is checked "No", the following > files > will > be > referenced by Galaxy: > > /home/nate/galaxy/test-data/1.bed > /home/nate/galaxy/test-data/2.bed > /galaxy/3.bed > /galaxy/galaxy_symlink/test-data/4.bed > /galaxy/galaxy_symlink/test-data/5.bed > > The Galaxy administrator may now safely delete /galaxy/import/link, > but > should > take care not to remove the referenced symbolic links (/galaxy/3.bed, > /galaxy/galaxy_symlink). > > Not all symbolic links are dereferenced because it is assumed that if > an > administrator links to a path in the import directory which itself is > (or > contains) links, that is the preferred path for accessing the data. > > > > Oliver Hofmann wrote: >> >> Dear all, >> >> >> to echo what Abhi said: we are also currently looking of ways to >> automatically import data sets (libraries) into Galaxy without >> having >> to >> manually trigger the import via the administration interface, and >> ideally >> while keeping the data in the original place. The idea here is to >> have >> multiple tools all point at the original 'source data' without >> having >> to >> replicate terabytes of data. >> >> Not quite sure how feasible this is in practice, but it certainly >> would >> be >> incredibly helpful. >> >> Best, >> >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä†Oliver >> >> >> >> >> On 28 Sep 2009, at 14:24, Abhishek Pratap wrote: >> >>> HI Greg >>> >>> Thanks for a quick reply and making some requested changes. However >>> I >>> am >>> not still sure if importing NGS data will help in long run. >>> >>> For Centers generating NGS data which could 2-3 T.B / week >>> depending >>> on >>> no. of sequencers I think importing another copy of raw data into >>> galaxy >>> workspace will be asking for lot of disk space. I understand it is >>> a >>> neat >>> way of doing things as it becomes agnostic of the raw data location >>> ¬¨¬®‚Äö√Ñ‚Ä†but >>> might not be the best way for handling huge data in long run for >>> centers >>> like ours. >>> >>> Please correct me if I am wrong. I think we could also have a >>> simple >>> option without having to import the data and just using it for >>> analysis >>> from >>> the current location, also storing results at the same location. >>> That >>> way in >>> future even if the data set is moved analysis also stays with it. >>> >>> Let me know what you feel. I will be happy to know if there are any >>> other >>> smart reasons of importing the data in galaxy workspace just for >>> curiosity >>> sake. >>> >>> Thanks, >>> -Abhi >>> >>> On Mon, Sep 28, 2009 at 9:28 AM, Greg Von Kuster <ghv2@psu.edu> >>> wrote: >>> Hello Abhishek, >>> >>> The Galaxy distribution includes the enhancements to which I >>> previously >>> referred for uploading history files. ¬¨¬®‚Äö√Ñ‚Ä†Uploading files >>> to a >>> history >>> now >>> creates a Galaxy job just like any other tool, and can be run on a >>> cluster >>> node, allowing upload of very large files. ¬¨¬®‚Äö√Ñ‚Ä†The initial >>> pass of >>> this >>> work is >>> also completed for uploading to a Data Library, but this >>> enhancement >>> is >>> still in test, so it should soon be available in the distribution. >>> >>> Do you want to avoid having to import at all (e.g. allow Galaxy to >>> refer >>> to datasets that live in their original locations)? >>> ¬¨¬®‚Äö√Ñ‚Ä†This is not >>> currently >>> possible, but if this is what you are looking for, we can consider >>> some >>> additional options on the current upload form, or possibly a new, >>> separate >>> form. >>> >>> >>> Greg Von Kuster >>> Galaxy Development Team >>> >>> >>> Abhishek Pratap wrote: >>> Hi Greg, Anton and all >>> >>> Just wondering if there has been any progress made on this end. I >>> am >>> sorry I was not able to follow it up on Assaf's suggestion due to >>> other >>> things at work. >>> >>> I did try the latest version of galaxy and looks like the files are >>> still >>> transferred over HTTP before they could be used in the galaxy >>> workspace. >>> Also I would again like to highlight that many labs might want to >>> use >>> the >>> local instance of galaxy and prefer to point to a local path where >>> the >>> file >>> is being stored. That way we will have both the benefits of using a >>> cool GUI >>> and process data stored locally. >>> >>> Let me know if you guys need some feedback or have more questions. >>> I >>> will >>> be happy to discuss them. >>> >>> best, >>> -Abhi >>> >>> On Tue, Jul 21, 2009 at 4:26 PM, Greg Von Kuster <ghv2@psu.edu >>> <mailto:ghv2@psu.edu>> wrote: >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† Hello Abishek, >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† We are currently in the process of significantly >>> enhancing the >>> ¬¨¬®‚Äö√Ñ‚Ä† current Galaxy upload utilities, and the new version >>> should >>> ¬¨¬®‚Äö√Ñ‚Ä† eliminate the issue you've raised about the time >>> needed to >>> upload >>> ¬¨¬®‚Äö√Ñ‚Ä† large files via HTTP ( not for making an initial copy >>> of the >>> file in >>> ¬¨¬®‚Äö√Ñ‚Ä† the Galaxy environment ). However, it will probably >>> not be >>> ready for >>> ¬¨¬®‚Äö√Ñ‚Ä† release for a few more weeks, so if you can take >>> advantage of >>> ¬¨¬®‚Äö√Ñ‚Ä† Assaf's script in the meantime, that's great. >>> ¬¨¬®¬¨√Ü‚Äö√†√∂‚àö‚à´I can't >>> guarantee >>> ¬¨¬®‚Äö√Ñ‚Ä† that all Galaxy features will function correctly if >>> you do this >>> though. >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† Assaf, have you found that using your script breaks >>> anything? >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† Also, if you upload a file to a library rather than a >>> history, >>> ¬¨¬®‚Äö√Ñ‚Ä† multiple users can "import" the library dataset into >>> their >>> history >>> ¬¨¬®‚Äö√Ñ‚Ä† for analysis, but there is only 1 file on disk ( users >>> are >>> pointing >>> ¬¨¬®‚Äö√Ñ‚Ä† to it from their histories ). ¬¨¬®¬¨√Ü‚Äö√†√∂‚àö‚à´But >>> uploading a file to >>> a history >>> ¬¨¬®‚Äö√Ñ‚Ä† will create a new copy of the file each time it is >>> uploaded. >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† Greg Von Kuster >>> ¬¨¬®‚Äö√Ñ‚Ä† Galaxy Development Team >>> >>> >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† Abhishek Pratap wrote: >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Hi All >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† @Greg : Please find my >>> comments below. >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† On Tue, Jul 21, 2009 at >>> 10:44 AM, Greg Von >>> Kuster<ghv2@psu.edu >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† <mailto:ghv2@psu.edu>> >>> wrote: >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> Hello Abhi, >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> Can you clarify the steps you took that >>> produced the >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> behavior? ‚Äö√†√∂‚àö¬∞‚Äö√†√ú‚àö‚â†‚Äö√†√∂‚àö‚à´See my >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> comments below. >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> Anton Nekrutenko wrote: >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Abhishek: >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Let talk. This is the area >>> of active current >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† development. We are >>> ‚Äö√†√∂‚àö¬∞‚Äö√†√ú‚àö‚â†‚Äö√†√∂‚àö‚à´looking >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† at implementing a universal >>> fastq-like format or >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† supporting >>> ‚Äö√†√∂‚àö¬∞‚Äö√†√ú‚àö‚â†‚Äö√†√∂‚àö‚à´multiple >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† formats. Perhaps we should >>> join efforts in ironing >>> out >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ‚Äö√†√∂‚àö¬∞‚Äö√†√ú‚àö‚â†‚Äö√†√∂‚àö‚à´specifications. >>> >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† anton >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† galaxy team >>> >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† On Jul 20, 2009, at 5:18 >>> PM, Abhishek Pratap >>> wrote: >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Hi All >>> >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† I recently came >>> to know about NGS analysis >>> on galaxy >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† during ISMB. >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Getting excited >>> I tried couple of things >>> basically >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† to play with >>> it. >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Few comments : >>> I may have interepretted >>> something >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† described below >>> in a >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† wrong way. My >>> apologies before hand. >>> >>> >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† On a standalone >>> installation of galaxy while >>> I was >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† trying to >>> explore >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† one >>> FASTQ(sequence) file. It takes >>> considerable (> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† 20 min) for a >>> fastq >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† file to get >>> uploaded (2 GB). >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> Are you using the Galaxy upload utility >>> to create an >>> item in >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> your history >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> that points to the dataset file on >>> disk? >>> >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Yes that is precisely >>> correct, I am trying to >>> upload a solexa >>> FASTQ >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† file but on a standalone >>> galaxy installation from >>> my local >>> file >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† system. >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† I >>> am not sure what is the rationale >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† behind that. >>> Ideally I think there should be >>> no need >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† to upload such >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† heavy files >>> into the workspace. >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† A >>> data file that originates from a >>> place external to >>> Galaxy >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> must be uploaded >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> into Galaxy so that the disk file can >>> be placed in the >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> location configured >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† in >>> the Galaxy config file. >>> ‚Äö√†√∂‚àö¬∞‚Äö√†√ú‚àö‚â†‚Äö√†√∂‚àö‚à´Also, when data is >>> uploaded to >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> Galaxy ( either >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† to >>> a history or a library ), several >>> database table >>> settings >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> are created >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> that are used by various Galaxy >>> features. >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> They could actually be used straight >>> >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Thanks for the clarification >>> but I am not sure this >>> will help >>> a >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† lot of >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† people who are interested to >>> install and run galaxy >>> locally >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† mainly for >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† the following reasons. May >>> be it is just local to >>> me. >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† A. We already one instance >>> of data saved on the >>> local file >>> system >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† B. Making another copy via >>> galaxy will eat away a >>> lot of space >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† in long run. >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† C. The time needed to import >>> the files into galaxy >>> space is >>> huge >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† away by the >>> path specified. >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> What do you mean by "the path >>> specified"? >>> >>> >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Well what I mean was a way >>> to specify the path of >>> the file/run >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† on the >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† lcoal file system and galaxy >>> could directly pick it >>> up from >>> there >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† rather than uploading it >>> into its own space. Now I >>> understand >>> this >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† might not work based on the >>> way the system was >>> designed. >>> >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> Also is there any way to access the >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† scripts for >>> analysis on the command line. I >>> know >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† this undermines >>> the >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† main aim of >>> working with galaxy but rite now >>> I am >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† concerned about >>> the >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> performance/time. >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> You should be able to run any Galaxy >>> tool from the >>> command >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> line as long as >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> you have all of the tool's required >>> binaries in your >>> path. >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ‚Äö√†√∂‚àö¬∞‚Äö√†√ú‚àö‚â†‚Äö√†√∂‚àö‚à´However, running >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† a >>> tool from within Galaxy should >>> generally not be any >>> slower >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> than running it >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> outside of Galaxy, depending, of >>> course, on what you are >>> doing. >>> >>> >>> >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Ok I was under the >>> impression that running from >>> SHELL will >>> eliminate >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† the step of uploading them >>> into galaxy file space. >>> >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† -Abhi >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† I will be happy >>> to discuss more about this >>> in case >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† you have some >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> comments/questions for me. >>> >>> >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Best, >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† -Abhi >>> >>> >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ----------------------------- >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Abhishek Pratap >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Bioinformatics >>> Software Engineer >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Institute for >>> Genome Sciences >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† School of >>> Medicine, Univ of Maryland >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† 801, W. >>> Baltimore Street, Baltimore, MD >>> 21209 >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Ph: >>> (+1)-410-706-2296 >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> www.igs.umaryland.edu/ >>> <http://www.igs.umaryland.edu/> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> _______________________________________________ >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† galaxy-user >>> mailing list >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> galaxy-user@bx.psu.edu >>> <mailto:galaxy-user@bx.psu.edu> >>> >>> >>> http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Anton Nekrutenko >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† http://nekrut.bx.psu.edu >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† http://galaxyproject.org >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> _______________________________________________ >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† galaxy-user mailing list >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† galaxy-user@bx.psu.edu >>> <mailto:galaxy-user@bx.psu.edu> >>> >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >>> http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user >>> >>> >>> >>> >>> >>> >>> >>> >>> _______________________________________________ >>> galaxy-user mailing list >>> galaxy-user@bx.psu.edu >>> http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user >> >> -- >> Research Associate ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä†Department of >> Biostatistics >> Associate Director ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä†Bioinformatics Core >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >> ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† >> ¬¨¬®‚Äö√Ñ‚Ä†Harvard School of Public Health >> Skype: ohofmann ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† ¬¨¬®‚Äö√Ñ‚Ä† Phone: +1 >> (617) 365 0984 >> >> >>

Greg Von Kuster

8 p.m.

New subject: Experience with Loading NGS data on standalone instance of galaxy

Set yourself up as a Galaxy admin, in universe_wsgi.ini, add your galaxy account ( email ) to the following: # this should be a comma-separated list of valid Galaxy users admin_users = you@youremail.edu You'll see an Admin link in the top Galaxy menu bar when you restart your Galaxy sever after making this change. Click on it, and you be presented with teh Galaxy Admin UI. Look for the "Manage data libraries" link in th left panel, create a new data library, and you can upload datasets to using 1 of 4 options, including this latest option just introduced. Greg Von Kuster Galaxy Development Team Abhishek Pratap wrote:

...

I think that could be it. I am using the "Upload File " option under the "Get Data" from the left hand menu. Sorry for the confusion if I am doing something that I am not supposed to. How do we upload data to data library ?

-Abhi

On Fri, Oct 2, 2009 at 3:46 PM, Greg Von Kuster <ghv2@psu.edu> wrote:

...
Ok, you have the change set. ¬†Perhaps you are trying to upload into a history rather than a Data Library? ¬†This new feature is only available for uploading into a Data Library. ¬†Can you confirm? ¬†I'm not sure what else could be causing you to not see the 4 options on the drop-down menu for uploading dataset into a Data Library.

Greg

Abhishek Pratap wrote:

...
Here it is :

...
hg heads

changeset: ¬† 2824:d97f4e86be45 tag: ¬† ¬† ¬† ¬† tip parent: ¬† ¬† ¬†2823:2c0c81150dbd parent: ¬† ¬† ¬†2821:04a753865407 user: ¬† ¬† ¬† ¬†Anton Nekrutenko <anton@bx.psu.edu> date: ¬† ¬† ¬† ¬†Fri Oct 02 11:02:14 2009 -0400 summary: ¬† ¬† merge

-Abhi

On Fri, Oct 2, 2009 at 3:30 PM, Greg Von Kuster <ghv2@psu.edu> wrote:

...
Please type the following in your galaxy install directory, and let me know what you get:

hg heads

Thanks

Abhishek Pratap wrote:

...
Hi Greg

Unfortunately it is not working for me. I made sure I cleared my browser cache before re-viewing it.

I have set the option as suggested by you in the universe_wsgi.ini file.

-Abhi

On Fri, Oct 2, 2009 at 2:53 PM, Greg Von Kuster <ghv2@psu.edu> wrote:

...
Hello Abhishek,

Add this to your universe_wsgi.ini file:

allow_library_path_paste = True

Then, clicking the down-arrow on the upload form

Create new data library datasets ¬¨¬®‚Äö√Ñ‚Ä†‚Äö√Ñ√∂‚àö¬±¬¨‚à´

will give you 4 options, 1 of which is:

Upload files from file system paths

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote: > Hi Greg > > I have updated my galaxy rep to changeset 2825. I dont see the > checkbox on the "Upload File" page. Am I missing something ? > > Thanks, > -Abhi > > On Fri, Oct 2, 2009 at 10:21 AM, Greg Von Kuster <ghv2@psu.edu> wrote: >> Change set 2812 will be included in a release to the distribution >> today >> - >> here are details of a new option that we're hoping will provide what >> is >> needed for most labs. >> >> Add a new option, 'allow_library_path_paste' that adds a new upload >> page >> ("Upload files from file system paths") to the admin-side library >> upload >> pages. >> This form contains a textarea that allows Galaxy admins to paste any >> number >> of >> file system paths (files or directories) from which Galaxy will >> import >> library >> datasets, saving the directory structure (if desired). >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†Since such >> ability >> allows admins access to any file on the Galaxy server which is >> readable >> by >> Galaxy's system user, this option is disabled by default, and system >> administrators should take care in assigning Galaxy administrators >> when >> this >> feature is enabled. ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†Controls on what files are accessible >> to this >> tool >> based >> on ownership or other properties can be added at a later date if >> there >> is >> sufficient interest for such features. >> >> This commit also includes a checkbox on the "Upload directory of >> files" >> page >> (as well as the new "Upload files from file system paths" page above) >> that >> will >> prevent Galaxy from copying data to its files directory (by default, >> 'database/files/'). ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†This is useful for large library >> datasets that >> live >> in >> their own managed locations on the file system, this will prevent the >> existence >> of duplicate copies of datasets (but means administrators must take >> care >> to >> manage data - moving or removing the data from its Galaxy-external >> location >> will render these datasets invalid within Galaxy). >> >> One unique feature to be aware of: when using the "Copy data into >> Galaxy?" >> checkbox on the "Upload directory of files" page, any symbolic links >> encountered in the chosen import directory will be made absolute and >> dereferenced ONCE. ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†This allows administrators to link >> large >> datasets to >> the >> import directory, rather than having to make full copies, while being >> able >> to >> delete such links after importing. ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†Only the first symlink >> (the one >> in >> the >> import directory itself) is dereferenced; all others remain. >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†See >> the >> following >> for an example: >> >> library_import_dir = /galaxy/import >> >> % ls -lR /galaxy/import >> /galaxy/import: >> total 6 >> drwxr-xr-x ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† 2 nate ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† nate >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† 512 >> Oct ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†1 11:31 link/ >> >> /galaxy/import/link: >> total 10 >> lrwxrwxrwx ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† 1 nate ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† nate >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†71 Oct ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†1 10:38 1.bed -> >> ../../../home/nate/galaxy/test-data/1.bed >> lrwxrwxrwx ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† 1 nate ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† nate >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†60 Oct ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†1 10:38 2.bed -> >> /home/nate/galaxy/test-data/2.bed >> lrwxrwxrwx ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† 1 nate ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† nate >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†11 Oct ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†1 10:38 3.bed -> >> ../../3.bed >> lrwxrwxrwx ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† 1 nate ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† nate >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†35 Oct ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†1 11:30 4.bed -> >> ../../galaxy_symlink/test-data/4.bed >> lrwxrwxrwx ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† 1 nate ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† nate >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†41 Oct ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†1 11:31 5.bed -> >> /galaxy/galaxy_symlink/test-data/5.bed >> >> % ls -l /galaxy/3.bed >> lrwxrwxrwx ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† 1 nate ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† nate >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†60 Oct ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†1 10:39 >> /galaxy/3.bed -> >> /home/nate/galaxy/test-data/3.bed >> >> % ls -l /galaxy/galaxy_symlink >> lrwxrwxrwx ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† 1 nate ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† nate >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†44 Oct ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†1 11:30 >> /galaxy/galaxy_symlink >> -> /home/nate/galaxy/ >> >> In this example, >> >> 1.bed is a relative symbolic link to the real 1.bed. >> >> 2.bed is an absolute symlink to the real 2.bed. >> >> 3.bed is a relative symlink to ../../3.bed, aka /galaxy/3.bed, which >> itself >> is >> a symlink to the real 3.bed. >> >> 4.bed is a relative symlink which follows another symlink >> (/galaxy/galaxy_symlink) to the real 4.bed. >> >> 5.bed is an absolute symlink in the same fashion as 4.bed >> >> If the 'link' server directory is chosen on the "Upload directory of >> files" >> page, and "Copy data into Galaxy?" is checked "No", the following >> files >> will >> be >> referenced by Galaxy: >> >> /home/nate/galaxy/test-data/1.bed >> /home/nate/galaxy/test-data/2.bed >> /galaxy/3.bed >> /galaxy/galaxy_symlink/test-data/4.bed >> /galaxy/galaxy_symlink/test-data/5.bed >> >> The Galaxy administrator may now safely delete /galaxy/import/link, >> but >> should >> take care not to remove the referenced symbolic links (/galaxy/3.bed, >> /galaxy/galaxy_symlink). >> >> Not all symbolic links are dereferenced because it is assumed that if >> an >> administrator links to a path in the import directory which itself is >> (or >> contains) links, that is the preferred path for accessing the data. >> >> >> >> Oliver Hofmann wrote: >>> Dear all, >>> >>> >>> to echo what Abhi said: we are also currently looking of ways to >>> automatically import data sets (libraries) into Galaxy without >>> having >>> to >>> manually trigger the import via the administration interface, and >>> ideally >>> while keeping the data in the original place. The idea here is to >>> have >>> multiple tools all point at the original 'source data' without >>> having >>> to >>> replicate terabytes of data. >>> >>> Not quite sure how feasible this is in practice, but it certainly >>> would >>> be >>> incredibly helpful. >>> >>> Best, >>> >>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†Oliver >>> >>> >>> >>> >>> On 28 Sep 2009, at 14:24, Abhishek Pratap wrote: >>> >>>> HI Greg >>>> >>>> Thanks for a quick reply and making some requested changes. However >>>> I >>>> am >>>> not still sure if importing NGS data will help in long run. >>>> >>>> For Centers generating NGS data which could 2-3 T.B / week >>>> depending >>>> on >>>> no. of sequencers I think importing another copy of raw data into >>>> galaxy >>>> workspace will be asking for lot of disk space. I understand it is >>>> a >>>> neat >>>> way of doing things as it becomes agnostic of the raw data location >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†but >>>> might not be the best way for handling huge data in long run for >>>> centers >>>> like ours. >>>> >>>> Please correct me if I am wrong. I think we could also have a >>>> simple >>>> option without having to import the data and just using it for >>>> analysis >>>> from >>>> the current location, also storing results at the same location. >>>> That >>>> way in >>>> future even if the data set is moved analysis also stays with it. >>>> >>>> Let me know what you feel. I will be happy to know if there are any >>>> other >>>> smart reasons of importing the data in galaxy workspace just for >>>> curiosity >>>> sake. >>>> >>>> Thanks, >>>> -Abhi >>>> >>>> On Mon, Sep 28, 2009 at 9:28 AM, Greg Von Kuster <ghv2@psu.edu> >>>> wrote: >>>> Hello Abhishek, >>>> >>>> The Galaxy distribution includes the enhancements to which I >>>> previously >>>> referred for uploading history files. ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†Uploading files >>>> to a >>>> history >>>> now >>>> creates a Galaxy job just like any other tool, and can be run on a >>>> cluster >>>> node, allowing upload of very large files. ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†The initial >>>> pass of >>>> this >>>> work is >>>> also completed for uploading to a Data Library, but this >>>> enhancement >>>> is >>>> still in test, so it should soon be available in the distribution. >>>> >>>> Do you want to avoid having to import at all (e.g. allow Galaxy to >>>> refer >>>> to datasets that live in their original locations)? >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†This is not >>>> currently >>>> possible, but if this is what you are looking for, we can consider >>>> some >>>> additional options on the current upload form, or possibly a new, >>>> separate >>>> form. >>>> >>>> >>>> Greg Von Kuster >>>> Galaxy Development Team >>>> >>>> >>>> Abhishek Pratap wrote: >>>> Hi Greg, Anton and all >>>> >>>> Just wondering if there has been any progress made on this end. I >>>> am >>>> sorry I was not able to follow it up on Assaf's suggestion due to >>>> other >>>> things at work. >>>> >>>> I did try the latest version of galaxy and looks like the files are >>>> still >>>> transferred over HTTP before they could be used in the galaxy >>>> workspace. >>>> Also I would again like to highlight that many labs might want to >>>> use >>>> the >>>> local instance of galaxy and prefer to point to a local path where >>>> the >>>> file >>>> is being stored. That way we will have both the benefits of using a >>>> cool GUI >>>> and process data stored locally. >>>> >>>> Let me know if you guys need some feedback or have more questions. >>>> I >>>> will >>>> be happy to discuss them. >>>> >>>> best, >>>> -Abhi >>>> >>>> On Tue, Jul 21, 2009 at 4:26 PM, Greg Von Kuster <ghv2@psu.edu >>>> <mailto:ghv2@psu.edu>> wrote: >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Hello Abishek, >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† We are currently in the process of significantly >>>> enhancing the >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† current Galaxy upload utilities, and the new version >>>> should >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† eliminate the issue you've raised about the time >>>> needed to >>>> upload >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† large files via HTTP ( not for making an initial copy >>>> of the >>>> file in >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† the Galaxy environment ). However, it will probably >>>> not be >>>> ready for >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† release for a few more weeks, so if you can take >>>> advantage of >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Assaf's script in the meantime, that's great. >>>> ¬¨¬®¬¨¬Æ¬¨¬®‚àö√ú‚Äö√Ñ√∂‚àö‚Ä†‚àö‚àÇ‚Äö√†√∂‚Äö√†¬¥I can't >>>> guarantee >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† that all Galaxy features will function correctly if >>>> you do this >>>> though. >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Assaf, have you found that using your script breaks >>>> anything? >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Also, if you upload a file to a library rather than a >>>> history, >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† multiple users can "import" the library dataset into >>>> their >>>> history >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† for analysis, but there is only 1 file on disk ( users >>>> are >>>> pointing >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† to it from their histories ). ¬¨¬®¬¨¬Æ¬¨¬®‚àö√ú‚Äö√Ñ√∂‚àö‚Ä†‚àö‚àÇ‚Äö√†√∂‚Äö√†¬¥But >>>> uploading a file to >>>> a history >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† will create a new copy of the file each time it is >>>> uploaded. >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Greg Von Kuster >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Galaxy Development Team >>>> >>>> >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Abhishek Pratap wrote: >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Hi All >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† @Greg : Please find my >>>> comments below. >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† On Tue, Jul 21, 2009 at >>>> 10:44 AM, Greg Von >>>> Kuster<ghv2@psu.edu >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† <mailto:ghv2@psu.edu>> >>>> wrote: >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> Hello Abhi, >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> Can you clarify the steps you took that >>>> produced the >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> behavior? ‚Äö√Ñ√∂‚àö‚Ä†‚àö‚àÇ‚Äö√†√∂¬¨‚àû‚Äö√Ñ√∂‚àö‚Ä†‚àö√∫‚Äö√†√∂‚Äö√¢‚Ä†‚Äö√Ñ√∂‚àö‚Ä†‚àö‚àÇ‚Äö√†√∂‚Äö√†¬¥See my >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> comments below. >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> Anton Nekrutenko wrote: >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Abhishek: >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Let talk. This is the area >>>> of active current >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† development. We are >>>> ‚Äö√Ñ√∂‚àö‚Ä†‚àö‚àÇ‚Äö√†√∂¬¨‚àû‚Äö√Ñ√∂‚àö‚Ä†‚àö√∫‚Äö√†√∂‚Äö√¢‚Ä†‚Äö√Ñ√∂‚àö‚Ä†‚àö‚àÇ‚Äö√†√∂‚Äö√†¬¥looking >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† at implementing a universal >>>> fastq-like format or >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† supporting >>>> ‚Äö√Ñ√∂‚àö‚Ä†‚àö‚àÇ‚Äö√†√∂¬¨‚àû‚Äö√Ñ√∂‚àö‚Ä†‚àö√∫‚Äö√†√∂‚Äö√¢‚Ä†‚Äö√Ñ√∂‚àö‚Ä†‚àö‚àÇ‚Äö√†√∂‚Äö√†¬¥multiple >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† formats. Perhaps we should >>>> join efforts in ironing >>>> out >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ‚Äö√Ñ√∂‚àö‚Ä†‚àö‚àÇ‚Äö√†√∂¬¨‚àû‚Äö√Ñ√∂‚àö‚Ä†‚àö√∫‚Äö√†√∂‚Äö√¢‚Ä†‚Äö√Ñ√∂‚àö‚Ä†‚àö‚àÇ‚Äö√†√∂‚Äö√†¬¥specifications. >>>> >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† anton >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† galaxy team >>>> >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† On Jul 20, 2009, at 5:18 >>>> PM, Abhishek Pratap >>>> wrote: >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Hi All >>>> >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† I recently came >>>> to know about NGS analysis >>>> on galaxy >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† during ISMB. >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Getting excited >>>> I tried couple of things >>>> basically >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† to play with >>>> it. >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Few comments : >>>> I may have interepretted >>>> something >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† described below >>>> in a >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† wrong way. My >>>> apologies before hand. >>>> >>>> >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† On a standalone >>>> installation of galaxy while >>>> I was >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† trying to >>>> explore >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† one >>>> FASTQ(sequence) file. It takes >>>> considerable (> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† 20 min) for a >>>> fastq >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† file to get >>>> uploaded (2 GB). >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> Are you using the Galaxy upload utility >>>> to create an >>>> item in >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> your history >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> that points to the dataset file on >>>> disk? >>>> >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Yes that is precisely >>>> correct, I am trying to >>>> upload a solexa >>>> FASTQ >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† file but on a standalone >>>> galaxy installation from >>>> my local >>>> file >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† system. >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† I >>>> am not sure what is the rationale >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† behind that. >>>> Ideally I think there should be >>>> no need >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† to upload such >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† heavy files >>>> into the workspace. >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† A >>>> data file that originates from a >>>> place external to >>>> Galaxy >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> must be uploaded >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> into Galaxy so that the disk file can >>>> be placed in the >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> location configured >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† in >>>> the Galaxy config file. >>>> ‚Äö√Ñ√∂‚àö‚Ä†‚àö‚àÇ‚Äö√†√∂¬¨‚àû‚Äö√Ñ√∂‚àö‚Ä†‚àö√∫‚Äö√†√∂‚Äö√¢‚Ä†‚Äö√Ñ√∂‚àö‚Ä†‚àö‚àÇ‚Äö√†√∂‚Äö√†¬¥Also, when data is >>>> uploaded to >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> Galaxy ( either >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† to >>>> a history or a library ), several >>>> database table >>>> settings >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> are created >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> that are used by various Galaxy >>>> features. >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> They could actually be used straight >>>> >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Thanks for the clarification >>>> but I am not sure this >>>> will help >>>> a >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† lot of >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† people who are interested to >>>> install and run galaxy >>>> locally >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† mainly for >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† the following reasons. May >>>> be it is just local to >>>> me. >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† A. We already one instance >>>> of data saved on the >>>> local file >>>> system >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† B. Making another copy via >>>> galaxy will eat away a >>>> lot of space >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† in long run. >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† C. The time needed to import >>>> the files into galaxy >>>> space is >>>> huge >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† away by the >>>> path specified. >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> What do you mean by "the path >>>> specified"? >>>> >>>> >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Well what I mean was a way >>>> to specify the path of >>>> the file/run >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† on the >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† lcoal file system and galaxy >>>> could directly pick it >>>> up from >>>> there >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† rather than uploading it >>>> into its own space. Now I >>>> understand >>>> this >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† might not work based on the >>>> way the system was >>>> designed. >>>> >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> Also is there any way to access the >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† scripts for >>>> analysis on the command line. I >>>> know >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† this undermines >>>> the >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† main aim of >>>> working with galaxy but rite now >>>> I am >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† concerned about >>>> the >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> performance/time. >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> You should be able to run any Galaxy >>>> tool from the >>>> command >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> line as long as >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> you have all of the tool's required >>>> binaries in your >>>> path. >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ‚Äö√Ñ√∂‚àö‚Ä†‚àö‚àÇ‚Äö√†√∂¬¨‚àû‚Äö√Ñ√∂‚àö‚Ä†‚àö√∫‚Äö√†√∂‚Äö√¢‚Ä†‚Äö√Ñ√∂‚àö‚Ä†‚àö‚àÇ‚Äö√†√∂‚Äö√†¬¥However, running >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† a >>>> tool from within Galaxy should >>>> generally not be any >>>> slower >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> than running it >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> outside of Galaxy, depending, of >>>> course, on what you are >>>> doing. >>>> >>>> >>>> >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Ok I was under the >>>> impression that running from >>>> SHELL will >>>> eliminate >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† the step of uploading them >>>> into galaxy file space. >>>> >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† -Abhi >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† I will be happy >>>> to discuss more about this >>>> in case >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† you have some >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> comments/questions for me. >>>> >>>> >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Best, >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† -Abhi >>>> >>>> >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ----------------------------- >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Abhishek Pratap >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Bioinformatics >>>> Software Engineer >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Institute for >>>> Genome Sciences >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† School of >>>> Medicine, Univ of Maryland >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† 801, W. >>>> Baltimore Street, Baltimore, MD >>>> 21209 >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Ph: >>>> (+1)-410-706-2296 >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> www.igs.umaryland.edu/ >>>> <http://www.igs.umaryland.edu/> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> _______________________________________________ >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† galaxy-user >>>> mailing list >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> galaxy-user@bx.psu.edu >>>> <mailto:galaxy-user@bx.psu.edu> >>>> >>>> >>>> http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Anton Nekrutenko >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† http://nekrut.bx.psu.edu >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† http://galaxyproject.org >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> _______________________________________________ >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† galaxy-user mailing list >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† galaxy-user@bx.psu.edu >>>> <mailto:galaxy-user@bx.psu.edu> >>>> >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>>> http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> _______________________________________________ >>>> galaxy-user mailing list >>>> galaxy-user@bx.psu.edu >>>> http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user >>> -- >>> Research Associate ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†Department of >>> Biostatistics >>> Associate Director ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†Bioinformatics Core >>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† >>> ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä†Harvard School of Public Health >>> Skype: ohofmann ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† ¬¨¬®¬¨¬Æ‚Äö√Ñ√∂‚àö√ë‚Äö√Ñ‚Ä† Phone: +1 >>> (617) 365 0984 >>> >>> >>>

Matthias Dodt

4 Nov 4 Nov

9:11 a.m.

New subject: Experience with Loading NGS data on standalone instance of galaxy

Hi! One more question according change set 2812: Add a new option, 'allow_library_path_paste' that adds a new upload page ("Upload files from file system paths") to the admin-side library upload pages. I am not sure where to add this option- i added to universe_wsgi.ini: allow_library_path_paste = True allow_library_path_paste = /somepath/ both didnt work- how can i enable the "allow_library_path_paste" option? thanks! mat Greg Von Kuster schrieb:

...

Change set 2812 will be included in a release to the distribution today - here are details of a new option that we're hoping will provide what is needed for most labs.

Add a new option, 'allow_library_path_paste' that adds a new upload page ("Upload files from file system paths") to the admin-side library upload pages. This form contains a textarea that allows Galaxy admins to paste any number of file system paths (files or directories) from which Galaxy will import library datasets, saving the directory structure (if desired). Since such ability allows admins access to any file on the Galaxy server which is readable by Galaxy's system user, this option is disabled by default, and system administrators should take care in assigning Galaxy administrators when this feature is enabled. Controls on what files are accessible to this tool based on ownership or other properties can be added at a later date if there is sufficient interest for such features.

This commit also includes a checkbox on the "Upload directory of files" page (as well as the new "Upload files from file system paths" page above) that will prevent Galaxy from copying data to its files directory (by default, 'database/files/'). This is useful for large library datasets that live in their own managed locations on the file system, this will prevent the existence of duplicate copies of datasets (but means administrators must take care to manage data - moving or removing the data from its Galaxy-external location will render these datasets invalid within Galaxy).

One unique feature to be aware of: when using the "Copy data into Galaxy?" checkbox on the "Upload directory of files" page, any symbolic links encountered in the chosen import directory will be made absolute and dereferenced ONCE. This allows administrators to link large datasets to the import directory, rather than having to make full copies, while being able to delete such links after importing. Only the first symlink (the one in the import directory itself) is dereferenced; all others remain. See the following for an example:

library_import_dir = /galaxy/import

% ls -lR /galaxy/import /galaxy/import: total 6 drwxr-xr-x 2 nate nate 512 Oct 1 11:31 link/

/galaxy/import/link: total 10 lrwxrwxrwx 1 nate nate 71 Oct 1 10:38 1.bed -> ../../../home/nate/galaxy/test-data/1.bed lrwxrwxrwx 1 nate nate 60 Oct 1 10:38 2.bed -> /home/nate/galaxy/test-data/2.bed lrwxrwxrwx 1 nate nate 11 Oct 1 10:38 3.bed -> ../../3.bed lrwxrwxrwx 1 nate nate 35 Oct 1 11:30 4.bed -> ../../galaxy_symlink/test-data/4.bed lrwxrwxrwx 1 nate nate 41 Oct 1 11:31 5.bed -> /galaxy/galaxy_symlink/test-data/5.bed

% ls -l /galaxy/3.bed lrwxrwxrwx 1 nate nate 60 Oct 1 10:39 /galaxy/3.bed -> /home/nate/galaxy/test-data/3.bed

% ls -l /galaxy/galaxy_symlink lrwxrwxrwx 1 nate nate 44 Oct 1 11:30 /galaxy/galaxy_symlink -> /home/nate/galaxy/

In this example,

1.bed is a relative symbolic link to the real 1.bed.

2.bed is an absolute symlink to the real 2.bed.

3.bed is a relative symlink to ../../3.bed, aka /galaxy/3.bed, which itself is a symlink to the real 3.bed.

4.bed is a relative symlink which follows another symlink (/galaxy/galaxy_symlink) to the real 4.bed.

5.bed is an absolute symlink in the same fashion as 4.bed

If the 'link' server directory is chosen on the "Upload directory of files" page, and "Copy data into Galaxy?" is checked "No", the following files will be referenced by Galaxy:

/home/nate/galaxy/test-data/1.bed /home/nate/galaxy/test-data/2.bed /galaxy/3.bed /galaxy/galaxy_symlink/test-data/4.bed /galaxy/galaxy_symlink/test-data/5.bed

The Galaxy administrator may now safely delete /galaxy/import/link, but should take care not to remove the referenced symbolic links (/galaxy/3.bed, /galaxy/galaxy_symlink).

Not all symbolic links are dereferenced because it is assumed that if an administrator links to a path in the import directory which itself is (or contains) links, that is the preferred path for accessing the data.

Oliver Hofmann wrote:

...
Dear all,

to echo what Abhi said: we are also currently looking of ways to automatically import data sets (libraries) into Galaxy without having to manually trigger the import via the administration interface, and ideally while keeping the data in the original place. The idea here is to have multiple tools all point at the original 'source data' without having to replicate terabytes of data.

Not quite sure how feasible this is in practice, but it certainly would be incredibly helpful.

Best,

Oliver

On 28 Sep 2009, at 14:24, Abhishek Pratap wrote:

...
HI Greg

Thanks for a quick reply and making some requested changes. However I am not still sure if importing NGS data will help in long run.

For Centers generating NGS data which could 2-3 T.B / week depending on no. of sequencers I think importing another copy of raw data into galaxy workspace will be asking for lot of disk space. I understand it is a neat way of doing things as it becomes agnostic of the raw data location but might not be the best way for handling huge data in long run for centers like ours.

Please correct me if I am wrong. I think we could also have a simple option without having to import the data and just using it for analysis from the current location, also storing results at the same location. That way in future even if the data set is moved analysis also stays with it.

Let me know what you feel. I will be happy to know if there are any other smart reasons of importing the data in galaxy workspace just for curiosity sake.

Thanks, -Abhi

On Mon, Sep 28, 2009 at 9:28 AM, Greg Von Kuster <ghv2@psu.edu> wrote: Hello Abhishek,

The Galaxy distribution includes the enhancements to which I previously referred for uploading history files. Uploading files to a history now creates a Galaxy job just like any other tool, and can be run on a cluster node, allowing upload of very large files. The initial pass of this work is also completed for uploading to a Data Library, but this enhancement is still in test, so it should soon be available in the distribution.

Do you want to avoid having to import at all (e.g. allow Galaxy to refer to datasets that live in their original locations)? This is not currently possible, but if this is what you are looking for, we can consider some additional options on the current upload form, or possibly a new, separate form.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote: Hi Greg, Anton and all

Just wondering if there has been any progress made on this end. I am sorry I was not able to follow it up on Assaf's suggestion due to other things at work.

I did try the latest version of galaxy and looks like the files are still transferred over HTTP before they could be used in the galaxy workspace. Also I would again like to highlight that many labs might want to use the local instance of galaxy and prefer to point to a local path where the file is being stored. That way we will have both the benefits of using a cool GUI and process data stored locally.

Let me know if you guys need some feedback or have more questions. I will be happy to discuss them.

best, -Abhi

On Tue, Jul 21, 2009 at 4:26 PM, Greg Von Kuster <ghv2@psu.edu <mailto:ghv2@psu.edu>> wrote:

Hello Abishek,

We are currently in the process of significantly enhancing the current Galaxy upload utilities, and the new version should eliminate the issue you've raised about the time needed to upload large files via HTTP ( not for making an initial copy of the file in the Galaxy environment ). However, it will probably not be ready for release for a few more weeks, so if you can take advantage of Assaf's script in the meantime, that's great. ¨ÜI can't guarantee that all Galaxy features will function correctly if you do this though.

Assaf, have you found that using your script breaks anything?

Also, if you upload a file to a library rather than a history, multiple users can "import" the library dataset into their history for analysis, but there is only 1 file on disk ( users are pointing to it from their histories ). ¨ÜBut uploading a file to a history will create a new copy of the file each time it is uploaded.

Greg Von Kuster Galaxy Development Team

Abhishek Pratap wrote:

Hi All

@Greg : Please find my comments below.

On Tue, Jul 21, 2009 at 10:44 AM, Greg Von Kuster<ghv2@psu.edu <mailto:ghv2@psu.edu>> wrote:

Hello Abhi,

Can you clarify the steps you took that produced the behavior? ÇƒÜSee my

comments below.

Anton Nekrutenko wrote:

Abhishek:

Let talk. This is the area of active current development. We are ÇƒÜlooking

at implementing a universal fastq-like format or supporting ÇƒÜmultiple

formats. Perhaps we should join efforts in ironing out ÇƒÜspecifications.

anton galaxy team

On Jul 20, 2009, at 5:18 PM, Abhishek Pratap wrote:

Hi All

I recently came to know about NGS analysis on galaxy during ISMB. Getting excited I tried couple of things basically to play with it.

Few comments : I may have interepretted something described below in a wrong way. My apologies before hand.

On a standalone installation of galaxy while I was trying to explore one FASTQ(sequence) file. It takes considerable (> 20 min) for a fastq file to get uploaded (2 GB).

Are you using the Galaxy upload utility to create an item in your history that points to the dataset file on disk?

Yes that is precisely correct, I am trying to upload a solexa FASTQ file but on a standalone galaxy installation from my local file system.

I am not sure what is the rationale

behind that. Ideally I think there should be no need to upload such heavy files into the workspace.

A data file that originates from a place external to Galaxy must be uploaded into Galaxy so that the disk file can be placed in the location configured in the Galaxy config file. ÇƒÜAlso, when data is uploaded to

Galaxy ( either to a history or a library ), several database table settings are created that are used by various Galaxy features.

They could actually be used straight

Thanks for the clarification but I am not sure this will help a lot of people who are interested to install and run galaxy locally mainly for the following reasons. May be it is just local to me.

A. We already one instance of data saved on the local file system B. Making another copy via galaxy will eat away a lot of space in long run. C. The time needed to import the files into galaxy space is huge

away by the path specified.

What do you mean by "the path specified"?

Well what I mean was a way to specify the path of the file/run on the lcoal file system and galaxy could directly pick it up from there rather than uploading it into its own space. Now I understand this might not work based on the way the system was designed.

Also is there any way to access the

scripts for analysis on the command line. I know this undermines the main aim of working with galaxy but rite now I am concerned about the performance/time.

You should be able to run any Galaxy tool from the command line as long as you have all of the tool's required binaries in your path. ÇƒÜHowever, running

a tool from within Galaxy should generally not be any slower than running it outside of Galaxy, depending, of course, on what you are doing.

Ok I was under the impression that running from SHELL will eliminate the step of uploading them into galaxy file space.

-Abhi

I will be happy to discuss more about this in case you have some comments/questions for me.

Best, -Abhi

-----------------------------

Abhishek Pratap

Bioinformatics Software Engineer

Institute for Genome Sciences

School of Medicine, Univ of Maryland

801, W. Baltimore Street, Baltimore, MD 21209

Ph: (+1)-410-706-2296

www.igs.umaryland.edu/ <http://www.igs.umaryland.edu/> _______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

Anton Nekrutenko http://nekrut.bx.psu.edu http://galaxyproject.org

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu <mailto:galaxy-user@bx.psu.edu>

http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

-- Research Associate Department of Biostatistics Associate Director Bioinformatics Core Harvard School of Public Health Skype: ohofmann Phone: +1 (617) 365 0984

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

-- ------------------------------------------------ Matthias Dodt Scientific Programmer at Bioinformaitcs platform AG Dieterich Berlin Institute for Medical Systems Biology at the Max-Delbrueck-Center for Molecular Medicine Robert-Roessle-Strasse 10, 13125 Berlin, Germany fon: +49 30 9406 4261 email: matthias.dodt@mdc-berlin.de

Hiram Clawson

21 Jul 21 Jul

8:35 p.m.

New subject: Experience with Loading NGS data on standalone instance of galaxy

Good Afternoon Anton: I've written a fastq parser here. As I did that I discovered that existing fastq formats are ambiguous and not really identifiable from their data. Are you thinking of making a new fastq specification ? --Hiram - UCSC genome browser Anton Nekrutenko wrote:

...

Let talk. This is the area of active current development. We are looking at implementing a universal fastq-like format or supporting multiple formats. Perhaps we should join efforts in ironing out specifications.

anton galaxy team

Anton Nekrutenko

8:39 p.m.

Hiram: Can you share the code? Which baseQ scaling are you using (Sanger| Solexa)? SOliD support? Thanks, anton On Jul 21, 2009, at 4:35 PM, Hiram Clawson wrote:

...

I've written a fastq parser here.

Anton Nekrutenko http://nekrut.bx.psu.edu http://galaxyproject.org

Hiram Clawson

8:50 p.m.

New subject: Experience with Loading NGS data on standalone instance of galaxy

kent source tree directories: src/utils/fastqToFa/ and src/utils/faToFastq/ I've never used the faToFastq program. The fastqToFa takes arguments to decide the baseQ score scheme. See usage message below. I haven't tested the -solexa scoring method option. --Hiram fastqToFa - Convert from fastq to fasta format. usage: fastqToFa [options] in.fastq out.fa options: -nameVerify='string' - for multi-line fastq files, 'string' must match somewhere in the sequence names in order to correctly identify the next sequence block (e.g.: -nameVerify='Supercontig_') -qual=file.qual.fa - output quality scores to specifed file (default: quality scores are ignored) -qualSizes=qual.sizes - write sizes file for the quality scores -noErrors - warn only on problems, do not error out (specify -verbose=3 to see warnings -solexa - use Solexa/Illumina quality score algorithm (instead of Phread quality) -verbose=2 - set warning level to get some stats output during processing Anton Nekrutenko wrote:

...

Hiram:

Can you share the code? Which baseQ scaling are you using (Sanger|Solexa)? SOliD support?

Thanks,

anton

On Jul 21, 2009, at 4:35 PM, Hiram Clawson wrote:

...
I've written a fastq parser here.

John Obenauer

9:43 p.m.

Dr. Nekrutenko, I've found the fq_all2std.pl script distributed with Maq to be very helpful for converting file formats to a standardized Fastq. Because it's so small (10K) I'll attach it to this email for anyone interested. Also, I'll copy its usage notes here so you can see the formats it converts: Usage: fq_all2std.pl <command> <in.txt> Command: scarf2std Convert SCARF format to the standard/Sanger FASTQ fqint2std Convert FASTQ-int format to the standard/Sanger F ASTQ sol2std Convert Solexa/Illumina FASTQ to the standard FAS TQ std2sol Convert standard FASTQ to Solexa/Illumina FASTQ ( simplified) fa2std Convert FASTA to the standard FASTQ seqprb2std Convert .seq and .prb files to the standard FASTQ fq2fa Convert various FASTQ-like format to FASTA export2sol Convert Solexa export format to Solexa FASTQ export2std Convert Solexa export format to Sanger FASTQ csfa2std Convert AB SOLiD read format to Sanger FASTQ std2qual Convert standard FASTQ to .seq+.qual instruction Explanation to different format example Show examples of various formats Note: Read/quality sequences MUST be presented in one line. John Obenauer On Tuesday 21 July 2009 03:39:16 pm Anton Nekrutenko wrote:

...

Hiram:

Can you share the code? Which baseQ scaling are you using (Sanger| Solexa)? SOliD support?

Thanks,

anton

On Jul 21, 2009, at 4:35 PM, Hiram Clawson wrote:

...
I've written a fastq parser here.

Anton Nekrutenko http://nekrut.bx.psu.edu http://galaxyproject.org

_______________________________________________ galaxy-user mailing list galaxy-user@bx.psu.edu http://mail.bx.psu.edu/cgi-bin/mailman/listinfo/galaxy-user

-- John Obenauer, Ph.D. Bioinformatics Group Leader Information Sciences Department St. Jude Children's Research Hospital 262 Danny Thomas Place Mail Stop 312 Memphis, TN 38105 (901) 595-3188 john.obenauer@stjude.org Email Disclaimer: www.stjude.org/emaildisclaimer

5736

Age (days ago)

5843

Last active (days ago)

List overview

Download

26 comments

8 participants

participants (8)

Abhishek Pratap
Anton Nekrutenko
Greg Von Kuster
Hiram Clawson
Ido M. Tamir
John Obenauer
Matthias Dodt
Oliver Hofmann