GenePeeks is hiring a Computational Geneticist. The position description is
list below. We are also hiring a tools programmer and I will send another
note with that position description.
GenePeeks is a unique genetic information company with a mission to help
families protect the health of their future children. We are searching for a
highly skilled scientist to join our growing team and help develop
leading-edge technologies in a fast-paced, early-stage environment.
* - Manipulate and analyze whole genome and high complexity genotype data,
including SNPs and Copy Number Variations (CNVs) from raw microarray data
* - Develop computational and data mining approaches to combine genotype
information, molecular profiling data and other phenotypic information
derived from NCBI and other online databases.
* - Develop protocols to predict the probability and severity of particular
diseases and conditions related to SNP, CNV and other structural variations.
* - Participate in implementing a system to test for and identify those
* - Test and assure the quality of automated analysis systems
* - Efficiently manage experimental data and data workflows
* - Monitor trends and innovations in the fast-moving world of genomic
data, analytics, storage,
* and clinical diagnostics
* - Contribute to the ongoing design and development of GenePeeks
* - Other duties as assigned
* Our ideal candidate would have the following training and technical
* - Ph.D. in genetics, biostatistics, computer science or related field, or
* - Familiarity with and experience using related publicly available
scientific datasources and tools
* (OMIM, DBSNP, GeneReviews, UCSC Genome Browser, Galaxy, HapMap)
* - Ability to implement tools using APIs that automatically interrogate
publicly available scientific
* datasources and tools such as OMIM, DBSNP, GeneReviews, UCSC Genome
* Galaxy, HapMap
* - Familiarity with next-gen data analysis and whole genome data
* - Research experience with and /or knowledge of clinical aspects of the
genetic causes of
* Autism, Schizophrenia and other neurological disorders
* - Experience analyzing SNP, CNV, GWAS (or NGS) data
* - Experience using automated bioinformatics tools
* - Experience working with whole genome sequence data
* - Proficiency with scripting language such as Python or R
* - Familiarity with 1000 Genomes Project or similar exposure
* General Requirements:
* - Ability and motivation to work independently
* - Ability to interact proactively with a small group
* - Strong written and verbal communication skills
* - Desire to work in a fast-paced, loosely structured business environment
- Sense of humor
* - Cambridge, MA or New York, NY - Some travel may be required
* Contact Information:
* If you are interested in this role and fulfill the requirements, please
send your CV and cover letter to jobs(a)genepeeks.com
Adam S. Cohn
Chief Technology Officer / Chief Operations Officer
175 Varick Street, New York, NY
(703) 473-8811 (mobile)
It seems like tophat2 does not support "--transcriptome-mismatches" option
but Galaxy is still configured to call tophat2 with this option when run
with "Full parameter setting" option within Galaxy. Is there new wrapper
with this fix? If not, how to remove this option from Galaxy so that
tophat2 is not call with this option. I am assuming that this is specified
somewhere in the wrapper.
I'm finishing up installing Galaxy on our local cluster.
I'm at this step regarding setting up reference genomes:
But it looks like a whole lot of manual work that needs to be done by
a system administrator. I was kind of expecting this would be
something end users would be able to do. I'm actually not sure we
have the resources for a sys admin to install all of these reference
genomes every time someone needs one.
I guess here are my questions:
Am I misunderstanding the process?
Is there a different way that normal users can add reference genomes instead?
Is there anything in the works that lets users easily add these?
Is there a way to use NGS tools without installing reference genomes?
I downloaded and extracted the .chain files for liftOver and edited the .loc file. After restarting Galaxy I have the option to convert my files to the respective coordinates but when running the job I get the same error for each.
"Traceback (most recent call last): File "/steno-internal/projects/galaxy/galaxy-dist/tools/extract/liftOver_wrapper.py", line 81, in raise Exception, 'Exception caught attempting conversion: ' + str( e ) Exception: Exception caught attempting conversion"
I also found this in the handler.log file:
"galaxy.tools DEBUG 2012-12-20 11:37:09,902 Error opening galaxy.json file: [Errno 2] No such file or directory: '/steno-internal/projects/galaxy/galaxy-dist/database/job_working_directory/000/98/galaxy.json' "
Do you know how could this be sorted?
Thanks in advance,
I'm looking into integrating some R scripts into Galaxy, which create
some graphics / plots. We are interested in integrating those in galaxy
somehow, or otherwise presenting them to the user in an as easy way as
I wanted to check if there is some support for this kind of
functionality there already, or if I'll need to hack something together
(I couldn't really find anything here on the mailing list).
Developer at SNIC-UPPMAX www.uppmax.uu.se
Developer at Dept of Pharm Biosciences www.farmbio.uu.se
I have a bacterial genome from ncbi and woulld like to extract seq from the
corresponding fasta file of bacterial genome. Since i have list of
coordinates so would not be possible to extract one by one. Is there any
interface within galxy that i can use.
I just installed a galaxy server and ran into something that may be a
deliberate feature or a genuine bug when trying to install a package
from the main tool shed. I got the Admin tab working, went to the
section dealing with tool sheds and selected "Search for valid tools" in
the Galaxy main too shed. I was able to find the tool I wanted (NCBI
Blast+, and I found multiple revisions) but I never found anything
remotely like an "Install" button. There are bare-bones instructions
for a manual install of the tool, but nothing automatic.
However, if I start with "Browse valid repositories" and _then_ search
for "ncbi blast" (and not "ncbi blast+"), I find a single revision that
has a "Preview and install" option and an "Install into local Galaxy"
So the only question here is whether this behavior is a bug or a
feature. In other words, should I be able to automatically install
packages using my first approach, or does Galaxy prevent this deliberately?
David Gene Morgan
320C Simon Hall
Indiana University Bloomington
812 856 1457 (office)
812 856 3221 (EM lab)
Is there a way to directly copy files from galaxy with the proper file
name? I'm currently using the galaxy API to download files via HTTP, but it
is very slow and would like to speed up the process of copying files
directly, but am having a hard time getting the proper file names.
Are there any suggestions on how to copy files directly from galaxy without
HTTP and with the proper filenames?
# Looking up files via API
$ ./display.py <api_key>
misc_blurb: 10,028,615 lines
misc_info: uploaded txt file
# Downloading files using a URL like
Are there (or were there) any known problems with Galaxy under
Microsoft Internet Explorer 9? One of our biologists is now having
problems with our local Galaxy server, probably after updating his
Specifically new history entries do not seem to automatically update
from queued to running to done, and also the "peep" functionality to
expand/collapse a history entry had stopped working. This was on a
large history of about 100 entries.
We installed Firefox 10, logged into Galaxy, and it was fine with this
A quick test on the public Galaxy with a single history entry seemed
to work in both browsers, so perhaps this is something you've already
fixed in the latest Galaxy - or perhaps it depends on the size of the
Any ideas? Has anyone else seen something similar?
We installed a local galaxy a couple of days ago following the basic
installation instruction (http://wiki.galaxyproject.org/Admin/Get%20Galaxy).
The installation went well, all the tools are listed on the left. I can
also upload data to Galaxy. However, an error occurs when I try to run
RNA-seq analysis. I am thinking the tools are not configured properly. Can
you give me tips to configure NGS tools? An example file with setup syntax
will help as well.