
Howdy, Uwe, (I don't speak for the galaxy group, but I watch the list for lastz- related messages.)
Is it possible to specify more arguments in "Full parameter list"- mode, particularly those that're implicitely described in YASRA-mode (match/mismatch rewards and step-size (Z/step))?
As you indicate, the galaxy wrapper for lastz only provides access to certain combinations of arguments. At present, the only ways I know that you would be able to access other lastz parameters would be either to (1) write your own custom galaxy wrapper or (2) install lastz and run it from the command line. Kelly Vincent is planning to update the galaxy wrapper some time this summer, so this is something she's likely to address with that update.
However, heading to "Full parameter list"-mode leads to excessive CPU-use, because (presumably) the step-size (default=1?) is left unset.
You are correct that the default step is 1. (defaults are shown at the bottom of each command section in the lastz readme file which can be found at www.bx.psu.edu/~rsharris/lastz or at the Miller Lab website) Depending on your data, a step of 1 may be overly sensitive causing lastz to spend a more time than necessary.
Also, it'd be great to be able to alter not only gap penalties, but match/mismatch rewards as well. (Obviously a mismatch would cause the same problem).
Ideally I think the wrapper interface should allow you to point at a lastz scoring file, which can contain all the scoring parameters. This is an oversight-- the thinking when the wrapper was written was that the yasra settings would be sufficient (and work well) and would simplify the user's choices, making it more likely that the user would choose settings appropriate for their data. I've discussed this with Kelly some, this morning. It's not clear whether the wrapper would be better if it allowed each scoring option to be set as a separate field, or if they were incorporated in a file, or if (somehow) they were one of your galaxy history items. Do you have any thoughts on that?
I'm dealing with 454 reads and it's crucial to the scenario to have the 5' ends aligned properly (in terms of sensitivity), so all the YASRA-templates comprising heavy gap penalties perform fairly poor as soon as there's a gap nearby the 5' end.
The idea behind having such severe gap penalties is that 454 often incorrectly calls the length of homopolymer runs, introducing what will look like short gaps. As used within yasra (an assembler from the Miller Lab, not in galaxy) these settings are probably appropriate. But this is not an ideal general solution because it can keep us from discovering true gaps. A better solution would probably be to have less-severe gap penalties, with an additional context- related gap penalty (or reward) for gaps at homopolymer runs. However, it would be costly to add this to the alignment core inside lastz. The problem with gaps or mismatches close to the end of a read is discussed in the lastz readme file, in the section "Y-drop Mismatch Shadow". The situation can can be improved by using the --noytrim option. This option was added to lastz after the wrapper was written, and so is not currently available from galaxy. --noytrim tells lastz to accept a lower-than-maximal-scoring alignment if it can reach the end of the read. I intend to add that into the yasra settings, but changing those raises an issue of backward compatibility that I need to resolve. I hope that helps. Please post a reply if I've left something unanswered or if you have other thoughts on this. Bob H On Jun 6, 2011, at 6:47 AM, Appelt, Uwe wrote:
Hi @All,
short version: Is it possible to specify more arguments in "Full parameter list"- mode, particularly those that're implicitely described in YASRA-mode (match/mismatch rewards and step-size (Z/step))?
long version: I'm dealing with 454 reads and it's crucial to the scenario to have the 5' ends aligned properly (in terms of sensitivity), so all the YASRA-templates comprising heavy gap penalties perform fairly poor as soon as there's a gap nearby the 5' end. However, heading to "Full parameter list"-mode leads to excessive CPU-use, because (presumably) the step-size (default=1?) is left unset. Also, it'd be great to be able to alter not only gap penalties, but match/mismatch rewards as well. (Obviously a mismatch would cause the same problem).
Examples: AAAAAAAAAA (target) AAAAATAAAA (query)
=> I need the alignment boundary to be found at position 1, rather than behind the mismatched T and this cleary doesn't work as long as the mismatch penalty is too large (same applies for gap-open/extend penalties).
Thanks in advance and Cheers, Uwe ___________________________________________________________ The Galaxy User list should be used for the discussion of Galaxy analysis and other features on the public server at usegalaxy.org. Please keep all replies on the list by using "reply all" in your mail client. For discussion of local Galaxy instances and the Galaxy source code, please use the Galaxy Development list:
http://lists.bx.psu.edu/listinfo/galaxy-dev
To manage your subscriptions to this and other Galaxy lists, please use the interface at: