Thank you for the suggestions. Ultimately, I would like to compare gene(isoform) expression between two groups of 10 animals with one lane per animal. I am using the public server to practice with some small data sets right now, but will be getting the real data very soon and plan on using an Amazon Cloud account to actually do the analysis. I can see now that this approach is going to be met with some difficulty with the current state of the data volume restrictions and limited functionality of Galaxy for Cuffcompare/diff. Can you comment any further on the timeline of the availability of the full functionality of these programs?
I did not mean to imply this, only that the Cloud folks have their own process and schedule for rolling out changes, and I do not know their schedule.You seemed to suggest they will be available on the public server before they are available on the Cloud?
Also, for the time being, would you mind clarifying for me what you mean by repeatedly merging Cufflinks outputs? I imagine using Tophat to map the reads and find splice junctions and assembling transcripts using Cufflinks for each of the 20 animals. Are you talking about running the Cufflinks GTF output through Cuffcompare, which allows two GTF files in Galaxy, and merging that output(the union file) with the third Cufflinks file and so on for all ten animals? Then do the same thing for the other group of ten animals, and then comparing the two for a rough idea of the differences?
I guess I'm wondering how far I will be able to get with the analysis as things stand on the Cloud or the public server....
This is a question that Enis can comment on.I also need to come up with a strategy to work around the 1000Gb space limit, as with 20 samples of 25 million reads and repeatedly generating files I think it will get used up quickly....