Thanks Peter. My next question is, I have found that VCF files don't get split properly as the header is not included in the second file as is usually required by tools (such as vcf-subset). I have read the code and am happy to implement this functionality but am not to sure where this would best be done. I see a class Text ( data ) which looks like every datatype is sent to. Would it be best to implement a VCF class which is called when the datatype is VCF? Cheers, Ed On Wed, Oct 31, 2012 at 12:35 PM, Peter Cock <p.j.a.cock@googlemail.com>wrote:
On Tue, Oct 30, 2012 at 11:20 PM, Edward Hills <ehills666@gmail.com> wrote:
Hi Galaxy-Team,
After reading a message on this mailing list about the job splitter I began to investigate what and how this is used. Unfortunately I have been unable to find any documentation on your website for it.
Am I blind and missing it or is it yet to be properly documented?
Sorry if this turns to be out a pointless exercise, but it would be extremely useful for my Galaxy development.
Cheers, Ed
Hi Ed,
To enable this you need to add a <parallelism> tag to the tool's XML file, and enable the feature in universe_wsgi.ini with something like this:
use_tasked_jobs = True local_task_queue_workers = 4
I'm not aware of any documentation, I've been mostly working from the Python source code in order to get it to work on the BLAST+ wrappers and some of my other tool wrappers. In all the cases I've used there is a single FASTA file being split, sometimes some common input files which are unchanged, and a single output file being merged.
Peter