genomic coordinates to Enterz gene ids
I have a list of around 3000 genomic coordinates of mouse genome and I would iike to extract gene name Enetrz gene ids Is it possible to extract this information in galaxy. Thanks Kanwar
Hi Kanwar, Both UCSC and Biomart have this data. So you could use the tools in "Get Data", pull over the complete set for the genome, then identify overlapping regions versus your coordinates with the tool "Operate on Genomic Intervals -> Join". For UCSC, the track is UCSC Genes (the linked table knownToLocusLink has Entrez ID). For Biomart, you can select the Entrez ID from the list of optional fields to include per species (be sure to avoid any long text description fields or you may get an HTML import error, or test and omit if this comes up). UCSC also has an intersection function in their Table Browser. Send the Galaxy dataset over as a custom track then intersect with the UCSC Genes track. The Table browser's user guide has instructions. Hopefully one of these works out for you! Jen Galaxy team On 6/14/13 3:18 PM, shamsher jagat wrote:
I have a list of around 3000 genomic coordinates of mouse genome and I would iike to extract gene name Enetrz gene ids Is it possible to extract this information in galaxy.
Thanks
Kanwar
___________________________________________________________ The Galaxy User list should be used for the discussion of Galaxy analysis and other features on the public server at usegalaxy.org. Please keep all replies on the list by using "reply all" in your mail client. For discussion of local Galaxy instances and the Galaxy source code, please use the Galaxy Development list:
http://lists.bx.psu.edu/listinfo/galaxy-dev
To manage your subscriptions to this and other Galaxy lists, please use the interface at:
To search Galaxy mailing lists use the unified search at:
-- Jennifer Hillman-Jackson Galaxy Support and Training http://galaxyproject.org
participants (2)
-
Jennifer Jackson
-
shamsher jagat