1) Could it be made possible to load generalised tabular data from a file so that column headings are recognised as column headings rather than appearing as the first row of the table? This looks possible in general, since more specific file types (e.g. gtf) are displayed with column headings, so the data structures evidently support it.
2) With data sets that do have column headings, is there any reason why the 'Join two Datasets' tool couldn't retain the column headings from both sets in the final output rather than just the first one?
3) Could the 'Join two Datasets' tool allow selection of the columns by name rather than by number?
To me these three changes would make it much easier for our users to manipulate and join tables of genes and gene annotations, but I'm new to Galaxy and maybe there are reasons why it doesn't work this way?
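To make points 2 and 3 concrete, here is a minimal sketch of the behaviour I have in mind: joining two tab-separated tables on columns chosen by name and keeping the headings from both inputs. This is not the code of the 'Join two Datasets' tool; the function name `join_by_name` and the sample column names are made up for illustration, and it assumes the first row of each table is the header.

```python
import csv
import io


def join_by_name(left_text, right_text, left_key, right_key):
    """Inner-join two tab-separated tables on named columns.

    Hypothetical helper for illustration only; assumes the first row
    of each table holds the column headings.
    """
    left = list(csv.reader(io.StringIO(left_text), delimiter="\t"))
    right = list(csv.reader(io.StringIO(right_text), delimiter="\t"))
    left_header, left_rows = left[0], left[1:]
    right_header, right_rows = right[0], right[1:]

    # Look the key columns up by heading rather than by number.
    left_idx = left_header.index(left_key)
    right_idx = right_header.index(right_key)

    # Index the right-hand rows by their key value.
    by_key = {}
    for row in right_rows:
        by_key.setdefault(row[right_idx], []).append(row)

    # The output header keeps the headings from both tables.
    joined = [left_header + right_header]
    for row in left_rows:
        for match in by_key.get(row[left_idx], []):
            joined.append(row + match)
    return joined


genes = "gene_id\tsymbol\nG1\tTP53\nG2\tBRCA1\n"
annotations = "gene\tdescription\nG2\tDNA repair\nG3\tunknown\n"
result = join_by_name(genes, annotations, "gene_id", "gene")
```

Here only the G2 row appears in both tables, so `result` holds the combined header followed by that one joined row.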
Justin
Hi Justin,
You are not alone!
Galaxy has limited understanding of column headers in a # line, but there is room for improvement here certainly. I'm sure there are open issues on some/all of these points but Trello is being very slow and the only link I could find in my emails is:
https://trello.com/card/554-show-column-names-headers-or-first-entry-in-colu...
Peter
On Tue, Dec 15, 2015 at 6:06 PM, Justin Powell justin_galaxy@jacp1.com wrote:
- Could it be made possible to load generalised tabular data from file with column headings being recognised as column headings rather than appearing as the first row of the table? It looks like this is in general possible as more specific file types (e.g. gtf) do get displayed with column headings, so the data structures obviously support that.
- With data sets that do have column headings, is there any reason why the 'Join two Datasets' tool couldn't retain the column headings from both sets in the final output rather than just the first one?
- Could the 'Join two Datasets' tool allow selection of the columns by name rather than by number?
To me these three changes would make it much easier for our users to manipulate and join tables of genes and gene annotations, but I'm new to Galaxy and maybe there are reasons why it doesn't work this way?
Justin
Please keep all replies on the list by using "reply all" in your mail client. To manage your subscriptions to this and other Galaxy lists, please use the interface at: https://lists.galaxyproject.org/
To search Galaxy mailing lists use the unified search at: http://galaxyproject.org/search/mailinglists/
Thanks. Is there a format for a # line that will create column names when the file is uploaded? I couldn't find this in the docs.
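In case it helps to be concrete, this is a minimal sketch of what I am assuming a "# line" means: the first line of the tab-separated file starts with `#` and is followed by the column names. That is only my guess at the format, and I do not know whether Galaxy actually picks the names up on upload. The helper name `comment_out_header` is made up for illustration.

```python
def comment_out_header(text):
    """Prefix the first line of a tab-separated table with '#'.

    Hypothetical helper; assumes the first line holds the column
    names. Whether Galaxy reads such a line as headings is an
    unconfirmed assumption.
    """
    lines = text.splitlines(keepends=True)
    # Leave the file alone if it is empty or already has a '#' line.
    if lines and not lines[0].startswith("#"):
        lines[0] = "#" + lines[0]
    return "".join(lines)


table = "gene_id\tsymbol\nG1\tTP53\n"
commented = comment_out_header(table)
```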
Best regards,
Justin
On 15 December 2015 at 18:55, Peter Cock p.j.a.cock@googlemail.com wrote:
Hi Justin,
You are not alone!
Galaxy has limited understanding of column headers in a # line, but there is room for improvement here certainly. I'm sure there are open issues on some/all of these points but Trello is being very slow and the only link I could find in my emails is:
https://trello.com/card/554-show-column-names-headers-or-first-entry-in-colu...
Peter
galaxy-dev@lists.galaxyproject.org