Re: [galaxy-user] LASTZ arguments in "Full parameter list"-mode

6 Jun 2011

      Howdy, Uwe,

(I don't speak for the galaxy group, but I watch the list for lastz- 
related messages.)
...
Is it possible to specify more arguments in "Full parameter list"- 
mode, particularly those that're implicitely described in YASRA-mode  
(match/mismatch rewards and step-size (Z/step))?
As you indicate, the galaxy wrapper for lastz only provides access to  
certain combinations of arguments.  At present, the only ways I know  
that you would be able to access other lastz parameters would be  
either to (1) write your own custom galaxy wrapper or (2) install  
lastz and run it from the command line.

Kelly Vincent is planning to update the galaxy wrapper some time this  
summer, so this is something she's likely to address with that update.
...
However, heading to "Full parameter list"-mode leads to excessive  
CPU-use, because (presumably) the step-size (default=1?) is left  
unset.
You are correct that the default step is 1.  (defaults are shown at  
the bottom of each command section in the lastz readme file which can  
be found at www.bx.psu.edu/~rsharris/lastz or at the Miller Lab  
website)  Depending on your data, a step of 1 may be overly sensitive  
causing lastz to spend a more time than necessary.
...
Also, it'd be great to be able to alter not only gap penalties, but  
match/mismatch rewards as well. (Obviously a mismatch would cause  
the same problem).
Ideally I think the wrapper interface should allow you to point at a  
lastz scoring file, which can contain all the scoring parameters.   
This is an oversight-- the thinking when the wrapper was written was  
that the yasra settings would be sufficient (and work well) and would  
simplify the user's choices, making it more likely that the user would  
choose settings appropriate for their data.

I've discussed this with Kelly some, this morning.  It's not clear  
whether the wrapper would be better if it allowed each scoring option  
to be set as a separate field, or if they were incorporated in a file,  
or if (somehow) they were one of your galaxy history items.  Do you  
have any thoughts on that?
...
I'm dealing with 454 reads and it's crucial to the scenario to have  
the 5' ends aligned properly (in terms of sensitivity), so all the  
YASRA-templates comprising heavy gap penalties perform fairly poor  
as soon as there's a gap nearby the 5' end.
The idea behind having such severe gap penalties is that 454 often  
incorrectly calls the length of homopolymer runs, introducing what  
will look like short gaps.  As used within yasra (an assembler from  
the Miller Lab, not in galaxy) these settings are probably  
appropriate.  But this is not an ideal general solution because it can  
keep us from discovering true gaps.  A better solution would probably  
be to have less-severe gap penalties, with an additional context- 
related gap penalty (or reward) for gaps at homopolymer runs.   
However, it would be costly to add this to the alignment core inside  
lastz.

The problem with gaps or mismatches close to the end of a read is  
discussed in the lastz readme file, in the section "Y-drop Mismatch  
Shadow".  The situation can can be improved by using the --noytrim  
option.  This option was added to lastz after the wrapper was written,  
and so is not currently available from galaxy.  --noytrim tells lastz  
to accept a lower-than-maximal-scoring alignment if it can reach the  
end of the read.  I intend to add that into the yasra settings, but  
changing those raises an issue of backward compatibility that I need  
to resolve.

I hope that helps.  Please post a reply if I've left something  
unanswered or if you have other thoughts on this.

Bob H

On Jun 6, 2011, at 6:47 AM, Appelt, Uwe wrote:
...
Hi @All,
short version:
Is it possible to specify more arguments in "Full parameter list"- 
mode, particularly those that're implicitely described in YASRA-mode  
(match/mismatch rewards and step-size (Z/step))?
long version:
I'm dealing with 454 reads and it's crucial to the scenario to have  
the 5' ends aligned properly (in terms of sensitivity), so all the  
YASRA-templates comprising heavy gap penalties perform fairly poor  
as soon as there's a gap nearby the 5' end. However, heading to  
"Full parameter list"-mode leads to excessive CPU-use, because  
(presumably) the step-size (default=1?) is left unset. Also, it'd be  
great to be able to alter not only gap penalties, but match/mismatch  
rewards as well. (Obviously a mismatch would cause the same problem).
Examples:
AAAAAAAAAA (target)
AAAAATAAAA (query)
=> I need the alignment boundary to be found at position 1, rather  
than behind the mismatched T and this cleary doesn't work as long as  
the mismatch penalty is too large (same applies for gap-open/extend  
penalties).
Thanks in advance and Cheers,
Uwe
___________________________________________________________
The Galaxy User list should be used for the discussion of
Galaxy analysis and other features on the public server
at usegalaxy.org.  Please keep all replies on the list by
using "reply all" in your mail client.  For discussion of
local Galaxy instances and the Galaxy source code, please
use the Galaxy Development list:
http://lists.bx.psu.edu/listinfo/galaxy-dev
To manage your subscriptions to this and other Galaxy lists,
please use the interface at:
http://lists.bx.psu.edu/

Re: [galaxy-user] LASTZ arguments in "Full parameter list"-mode

Bob Harris