Dear Nate,
We're trying to implement different tools to handle sequences generated by Megabace/AB1 sequencers.
The very first step in this process is to load the chromatograms provided by the machines.
The thing is these machines produce an individual file per sequence and the phred software (base caller) receive as parameters a compressed file or a directory path.
Since galaxy 'get data' feature loads only individual files and we're not able to implement phred to do such things, what options are left?
Thank you so much,
Adhemar



On 07/29/2010 01:57 PM, Eric Aguiar wrote:


-------- Original Message --------
Subject: Re: [galaxy-dev] Help about zip upload (esd files)
Date: Wed, 28 Jul 2010 12:09:06 -0400
From:
To: Eric Aguiar <eric@cebio.org>
CC:  List 


Eric Aguiar wrote:
> Dear,
> 
> I followed all steps in the galaxy tutorial, but I didn't have success.
> I'm trying to create a datatype for megabase chromatograms (.esd) very 
> similar to the Ab1 ones.
> 
> Here is my configurations.
> 
> 
> *1 - Creating datatypes in datatypes_conf.xml*
> 
>  <datatype extension="zip" type="galaxy.datatypes.binary:Esd" 
> mimetype="application/zip" display_in_upload="true"/>
> <datatype extension="esd" type="galaxy.datatypes.binary:Esd" 
> mimetype="application/octet-stream" display_in_upload="true"/>
> 
> *2 - Defining types in lib/galaxy/datatypes/binary.py*
> 
> class Esd( Binary ):
>     """Class describing an ab1 binary sequence file"""
>     file_ext = "esd"
> 
>     def set_peek( self, dataset, is_multi_byte=False ):
>         if not dataset.dataset.purged:
>             dataset.peek  = "Binary chromatograms sequence file"
>             dataset.blurb = data.nice_size( dataset.get_size() )
>         else:
>             dataset.peek = 'file does not exist'
>             dataset.blurb = 'file purged from disk'
>     def display_peek( self, dataset ):
>         try:
>             return dataset.peek
>         except:
>             return "Binary esd sequence file (%s)" % ( data.nice_size( 
> dataset.get_size() ) )
> 
> 
> class Zip( Binary ):
>     """Class describing a zip archive of binary sequence files"""
>     file_ext = "zip"
> 
>     def set_peek( self, dataset, is_multi_byte=False ):
>         if not dataset.dataset.purged:
>             zip_file = zipfile.ZipFile( dataset.file_name, "r" )
>             num_files = len( zip_file.namelist() )
>             dataset.peek  = "Archive of %s binary sequence files" % ( 
> str( num_files ) )
>             dataset.blurb = data.nice_size( dataset.get_size() )
>         else:
>             dataset.peek = 'file does not exist'
>             dataset.blurb = 'file purged from disk'
>     def display_peek( self, dataset ):
>         try:
>             return dataset.peek
>         except:
>             return "Binary sequence file archive (%s)" % ( 
> data.nice_size( dataset.get_size() ) )
>     def get_mime( self ):
>         """Returns the mime type of the datatype"""
>         return 'application/zip'
> 
> 
> When I'm going to send the file in zip format (.esd files compressed),it 
> shows me the following error:
> *"An error occurred running this job: Invalid 'File Format' for archive 
> consisting of binary files - use 'Binseq.zip'"*

Hi Eric,

Have you updated to a recent revision of Galaxy?  Binseq.zip and the 
code that prevents uploading zip files has been removed.  Currently, 
though, a zip file can only contain one member.

--nate

> 
> I tried somethings, but I don't have success.
> 
> Thank you,
> -- 
> 
> 
> ------------------------------------------------------------------------
> 
> _______________________________________________
> galaxy-dev mailing list
> galaxy-dev@lists.bx.psu.edu
> http://lists.bx.psu.edu/listinfo/galaxy-dev



-- 
Adhemar Zerlotini Neto, PhD
FIOCRUZ - Fundação Oswaldo Cruz
CPqRR - Centro de Pesquisas René Rachou
http://www.cpqrr.fiocruz.br
http://www.cebio.org
+55 31 3337 3863
+55 31 3337 3649