Hi Galaxy Team,
I'm trying to use GATK DepthOfCoverage, and to give a BAM file of alined
results as input.
My steps were:
1. BWA alignment
when I'm trying to give 3 (rmdup results) as an input to GATK
DepthOfCoverage, an error appears: "
Sequences are not currently available for the specified build."
What did I do wrong?
Thanks Jen, I will try it tomorrow.
I installed Blast2Go from the Toolshed in my local instance of Galaxy
and when I try to run it I get the following error:
Index file named 'blast2go.loc' is required by tool but not available.
I logged as admin and the installation did not gave me any error. From
galaxy.util.shed_util DEBUG 2012-03-23 17:23:37,088 Installing repository
galaxy.util.shed_util DEBUG 2012-03-23 17:23:37,088 Cloning
destination directory: blast2go
requesting all changes
adding file changes
added 2 changesets with 6 changes to 3 files
updating to branch default
3 files updated, 0 files merged, 0 files removed, 0 files unresolved
galaxy.util.shed_util DEBUG 2012-03-23 17:23:42,798 Updating cloned
repository to revision "7b53cc52e7ed"
Anyway, I was thinking to use it because most of my differentially
expressed genes are unknown. I was thinking to use Blast2GO to get them at
least clustered in functional groups. I am not sure if that would be the
best approach to find what might be the function of these genes. I also
checked the list of public services that might have this tool, and Berkeley
BOP is listed, but it seems that they no longer have the server or it was
down when I checked (or the link is broken http://galaxy.berkeleybop.org/).
On Fri, Mar 23, 2012 at 8:43 AM, Jennifer Jackson <jen(a)bx.psu.edu> wrote:
> Hello Luciano,
> There is no single tool do to this operation (although there has been some
> discussion about including one in the Tool Shed), but the same information
> can be obtained by using a combination of existing tools.
> First, start by converting both starting datasets to interval format.
> mapped reads:
> - for TopHat output, "NGS: SAM Tools -> Convert SAM to interval"
> - for GFF file (convert to tabular if necessary), subtract "1"
> from the start position's value using tool "Text Manipulation ->
> - cut columns chrom, new start, stop, strand, name, and score from
> this result file using "Text Manipulation -> Cut"
> - set the data type to "interval" using the 'Edit attributes form
> (pencil icon)
> Next, use a tool in the group "Operate on Genomic Intervals" to compare
> these intervals for overlap. The tool "Cluster" with the option "Find" is
> mostly likely the one you will want to use.
> As a final step, summarize the data by feature using the tool "Join,
> Subtract and Group -> Group".
> Hopefully this helps,
> Galaxy team
> On 3/19/12 4:36 PM, Luciano Cosme wrote:
>> I was wondering if there is any tool on Galaxy were I can obtain a
>> table with how many reads have been mapped to a given sample and to a
>> given gene (for example, use a Tophat output and use a GFF file to
>> obtain the table). I am using HTSeq to get it (htseq-count). There is
>> also GenomicRanges and easyRNASeq packages in bioconductor.
>> Thank you.
>> The Galaxy User list should be used for the discussion of
>> Galaxy analysis and other features on the public server
>> at usegalaxy.org. Please keep all replies on the list by
>> using "reply all" in your mail client. For discussion of
>> local Galaxy instances and the Galaxy source code, please
>> use the Galaxy Development list:
>> To manage your subscriptions to this and other Galaxy lists,
>> please use the interface at:
We are working on developing NGS data analysis pipeline. Does your institution have a training program where one or two people from my team can be trained on NGS data analysis, particularly de novo genome assembly?
Dr. Aarti Desai | Domain Specialist - Life Sciences Domain
aarti_desai(a)persistent.co.in<mailto:firstname.lastname@example.org> | Cell: +91-9673009492 | Tel: +91-20-30236348
Persistent Systems Ltd. | Partners in Innovation | www.persistentsys.com<http://www.persistentsys.com/>
This e-mail may contain privileged and confidential information which is the property of Persistent Systems Ltd. It is intended only for the use of the individual or entity to which it is addressed. If you are not the intended recipient, you are not authorized to read, retain, copy, print, distribute or use this message. If you have received this communication in error, please notify the sender and delete all copies of this message. Persistent Systems Ltd. does not accept any liability for virus infected mails.
How do I run Tophat and RNA-Seq analysis using the GRCH37- embl 66 genome? I noticed there is no input for this genome version.
Can I construct a reference genome from the following embl format source sequences: ftp://ftp.ensembl.org/pub/release-66/embl/homo_sapiens/, and map it against my RNAseq data?
Karthik Srinivasan | Senior Application Engineer
P:HYPERLINK "tel:+912242554282"+912242554282 | M:HYPERLINK "tel:+919987014704"+919987014704
Oracle Health Sciences Global Business Unit
6'th Floor, Silver Metropolis, W.E.Highway, Goregaon(E) | 400063 Mumbai