Hi Peter,
I worked on it today and I found the reason of this bug. In the
upload.py tool, the existence of dataset.space_to_tab is tested. In
the old version of the data library, when the option was unchecked,
the value space_to_tab was None and the test worked. But in the new
version the value of space_to_tab is "False", it's a string and the
test "if dataset.space_to_tab" returns True.
It is possible to fix it with a little correction of the
"/lib/galaxy/webapps/galaxy/api/lda_datasets.py".
I join a patch with this email.
I can propose a pull request if you want.
Best,
Fabien
--
Fabien Mareuil | Centre d'Informatique pour la Biologie
fabien.mareuil@pasteur.fr
| Institut Pasteur
25,28 rue du Docteur Roux 75015 Paris, France
On 21/03/2016 09:45, fmareuil wrote:
Hi
Peter,
I tested it on the last version of the dev and master branches.
step-by-step :
I click on the "shared data/Data Libraries", I select my library
and I click on the button to upload a file and I select the option
"from User Directory". I select my fastqsanger file (sequences
headers are like this :
@HWI-D00395:188:HYGNFADXX:2:1108:1315:1965 1:N:0:ATGAGC
with a space between @HWI-D00395:188:HYGNFADXX:2:1108:1315:1965
and 1:N:0:ATGAGC and I import it (with or without the auto-detect
option).
When I look the file in a history, the space between
@HWI-D00395:188:HYGNFADXX:2:1108:1315:1965 and 1:N:0:ATGAGC has
been replaced by a tab.
I didn't have this problem with "shared data/ Data Libraries
deprecated".
Thank a lot for your help,
Best,
Fabien
On 18/03/2016 18:59, Peter Cock wrote:
Can you provide more details - in
particular the version of Galaxy you
are running? A step-by-step description of how you did the data
import
might be useful too for the Galaxy team to try to reproduce
this.
Peter
On Fri, Mar 18, 2016 at 5:34 PM, Fabien Mareuil
<fmareuil@pasteur.fr> wrote:
Hi Peter,
Thank a lot for this information, I didn't found this option
for the new
version of the Data Libraries.
This could explain why there is this conversion.
I tested it with an automatic format detection and when I set
explicitly the
format but in the two cases a tab was added.
Fabien
Le 18/03/2016 17:04, Peter Cock a écrit :
Hi Fabien,
There is a Galaxy upload option to turn spaces into tabs,
useful for
tables of data. Is that being ticked by mistake? That would
explain
this problem.
Also I would check if the FASTQ file format is being
detected
automatically, or set explicitly (e.g. fastqsanger)? Doing
that ought
to prevent any changes.
Peter
On Fri, Mar 18, 2016 at 3:26 PM, Fabien Mareuil
<fmareuil@pasteur.fr>
wrote:
Hi,
We got a problem with the new version of Data libraries.
When we upload a
fastq file from a User Directory, a tabulation character
is added in the
header in replacement of a space character.
from @HWI-D00395:188:HYGNFADXX:2:1108:1315:1965
1:N:0:ATGAGC to
@HWI-D00395:188:HYGNFADXX:2:1108:1315:1965
1:N:0:ATGAGC
This behavior causes an error with cutadapt 1.6 (maybe
with other tools)
because it doesn't work with tab character. With the
direct upload tool
or
the deprecated version of Data Libraries no tab character
is added. Do
you
have some idea to correct this problem? Thanks a lot Best
regards --
Fabien Mareuil | Centre d'Informatique pour la Biologie
fabien.mareuil@pasteur.fr | Institut Pasteur 25,28 rue du
Docteur Roux
75015
Paris, France
___________________________________________________________
Please keep all replies on the list by using "reply all"
in your mail client. To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
https://lists.galaxyproject.org/
To search Galaxy mailing lists use the unified search at:
http://galaxyproject.org/search/mailinglists/
___________________________________________________________
Please keep all replies on the list by using "reply all"
in your mail client. To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
https://lists.galaxyproject.org/
To search Galaxy mailing lists use the unified search at:
http://galaxyproject.org/search/mailinglists/