On 04/07/2011 11:40 PM, Ryan Golhar wrote:
Hi all - So, I been asked to provide specs for a production Galaxy system to support approximately 20-30 users. Most of these users are new to bioinformatics and very new to NGS. I'm targeting a user base that will use a light to moderate amount of NGS data.
I've looked at the the Produce Server Wiki page stuff, but I'm curious what everyone else is using or recommends? How big of a compute cluster, how much storage, proxy/web server configurations, etc, etc.
If you had to deploy a production system, based on what you know, what would you choose?
Hi Ryan I would go for a single (multicore) box. With just 20-30 users who are 'new to bioinformatics' you will hardly ever have more than 3 users using Galaxy at the same time - you can always limit the number of concurrent galaxy jobs in the universe_wsgi.ini file ('local_job_queue_workers'). Since you are expecting NGS data, having the right amount of RAM would be my biggest concern. What do you mean by "light to moderate amount of NGS data"? are you talking about the number of samples to process or are you talking about the individual size of the sample. The latter will have an impact on the required amount of RAM. On the other hand both will have an impact on the amount of storage required. You have to make the calculations for required storage and RAM first, but this is independent of whether you use Galaxy or not. The only risk when offering NGS tools via galaxy it might be to easy to run them resulting in a lot of 'garbage' or redundant NGS processing. That's why it is important to disable anonymous access so you can track who is doing what. Using external authentication is very handy. However, it does restrict you to users already in you 'network'. We are using it, and it is sometimes annoying, as I can't have temporary guest accounts - our IT guys would have to create a new 'member' of our institute for every guest.... Hope this helps, Hans
Ryan
___________________________________________________________ Please keep all replies on the list by using "reply all" in your mail client. To manage your subscriptions to this and other Galaxy lists, please use the interface at: