Performing operations on tab delimited text files
Hi,

My name is Lanelle Edwards, and I am a new Galaxy user. I uploaded a tab-delimited text file to my history. However, the first column of the imported data holds the chromosome number, start position, and end position ALL IN ONE COLUMN, in the format chrI:1000000-2000000.

My question is: how can I split these values into separate columns using Galaxy tools? For example, one column for chromosome number, one for start position, and one for end position.

I would really appreciate the help.

Thanks,
Lanelle Edwards
Lab Manager
University of Maryland
Hello Lanelle,

A combination of text manipulations can split the combined column and put the data into interval format. A good step-by-step example, including a video tutorial, is in the Using Galaxy paper and its supplement, available here: http://main.g2.bx.psu.edu/u/galaxyproject/p/using-galaxy-2012

Protocol #2 has examples of loading four files. The third file, starting with dataset #3, is the one where a single column combining chrom, start, and stop is manipulated to produce distinct columns, which are then assigned standard datatypes. That example has ".." instead of "-" between the start and stop, so adjust that part of the manipulation to match your data when you come to it.

Hopefully this helps!

Jen
Galaxy team

On 10/3/12 11:28 AM, Lanelle Oni Edwards wrote:
Hi,

My name is Lanelle Edwards, and I am a new Galaxy user. I uploaded a tab-delimited text file to my history. However, the first column of the imported data holds the chromosome number, start position, and end position ALL IN ONE COLUMN, in the format chrI:1000000-2000000.

My question is: how can I split these values into separate columns using Galaxy tools? For example, one column for chromosome number, one for start position, and one for end position.

I would really appreciate the help.
Thanks,
Lanelle Edwards
Lab Manager
University of Maryland
-- Jennifer Jackson http://galaxyproject.org
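For readers who want to see the transformation itself, the column split described above can be sketched in a few lines of Python. This is not the Galaxy workflow from Protocol #2, only an illustration of the same text manipulation done outside Galaxy. The function names `split_region` and `split_first_column` and the sample line are hypothetical, and the sketch assumes the region is always the first tab-separated field and uses either "-" or ".." between start and stop.

```python
import re

# Matches a region such as "chrI:1000000-2000000" or "chrI:1000000..2000000".
REGION = re.compile(r"^([^:\s]+):(\d+)(?:-|\.\.)(\d+)$")


def split_region(region):
    """Split 'chrom:start-end' (or 'chrom:start..end') into (chrom, start, end)."""
    match = REGION.match(region.strip())
    if match is None:
        raise ValueError(f"unrecognized region: {region!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def split_first_column(line):
    """Replace the first tab-separated field of a line with three fields."""
    fields = line.rstrip("\n").split("\t")
    chrom, start, end = split_region(fields[0])
    # Remaining columns are passed through unchanged.
    return "\t".join([chrom, str(start), str(end)] + fields[1:])


# Hypothetical sample line; the trailing columns are made up for illustration.
print(split_first_column("chrI:1000000-2000000\tgeneA\t42"))
```

The sketch copies the start and end values as they are. Whether the start needs shifting depends on the coordinate convention of the source file and of the interval format being targeted, so check that before relying on the output.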