Hello Rami, The tool "Operate on Genomic Intervals -> Get flanks" will return reference genomic sequence upstream/downstream from a set of coordinates, which is essentially what the UCSC Genome Browser is doing when you use the export promoters option. This tool requires that you already have a set of genomic intervals (a data track) from an annotation data source. UCSC is only one choice. Please see all directly linked data sources listed under "Get Data". Or, locate your own and load via URL (http/ftp) or a file that you have already downloaded to your computer/server (use ftp if large). Please note that this tool will function with most (but not all) genomes already part of the native reference genome set in Galaxy and must be assigned. To assign, use the "Edit Attributes" form by clicking on the pencil icon for the dataset. Best! Jen On 7/25/11 11:55 AM, Rami Al-Ouran wrote:
Thank you very much for the quick response.
Does that mean there is no direct method in Galaxy (without using UCSC) to get promoters from a list of gene symbols ?
Thank you, Rami
On 7/25/2011 2:51 PM, Jennifer Jackson wrote:
Hello Rami,
The UCSC genome browser will result in one promoter sequence per transcript, as many genes will have many associated transcripts.
Depending on which track you are obtaining data from, a "representative" transcript for each gene may or may not be notated. If using "UCSC Genes", the tables associated with the primary table knownGenes to examine are knownIsoforms and knownCanonical, which will show how transcripts are clustered. You can link these tables together using the output option "selected fields from primary and related tables".
Alternatively, you may already have already selected a set of transcripts (not genes) to obtain data from. In that case, enter the identifiers directly into the UCSC Table browser's "identifiers" field when performing the query (paste or upload text file). Or pull all into Galaxy and join with your identifier list to subset the results. Identifiers must be in the same format as used by the track of interest for this method to work.
If you are having trouble with the Table browser, the UCSC Genome team can help through their mailing list at http://genome.ucsc.edu
Hopefully this helps to explain the data,
Best,
Jen Galaxy team
On 7/25/11 10:25 AM, Rami Al-Ouran wrote:
Hello,
Is there a way in galaxy to retrieve the promoter sequences for a list of genes. I tried using UCSC genome browser, but in many cases it keeps giving more than one promoter sequence per gene.
Thank you, Rami ___________________________________________________________ The Galaxy User list should be used for the discussion of Galaxy analysis and other features on the public server at usegalaxy.org. Please keep all replies on the list by using "reply all" in your mail client. For discussion of local Galaxy instances and the Galaxy source code, please use the Galaxy Development list:
http://lists.bx.psu.edu/listinfo/galaxy-dev
To manage your subscriptions to this and other Galaxy lists, please use the interface at:
-- Jennifer Jackson http://usegalaxy.org/ http://galaxyproject.org/