[CPM-SPIRE-L] PhD offer: Search engine for genomic sequencing data
Pierre Peterlongo
pierre.peterlongo at inria.fr
Wed May 20 00:14:03 PDT 2020
Dear all,
Our PhD offer remains available, do not hesitate to apply!
Job Offer – Search engine for genomic sequencing data
We are currently witnessing a deep knowledge revolution due to the
availability of exponentially expanding DNA sequence databases. This is
made possible by the continuous acceleration of DNA sequencing
throughput. Sequencing data is accumulating faster than Moore’s Law,
bringing fundamental new insights, conjecture, and understanding, with
impacts in medicine, agronomy and ecology. Today, the Sequence Read
Archive <https://www.ncbi.nlm.nih.gov/sra> raw data archive stores more
than 10^16 nucleotides, in the form of short sequences (<1000 bp) which
represent fragments from generally unknown genomic location (the “reads”).
Currently there exists no way to query this treasure of information.
Today, it would be unthinkable to access the Internet without powerful
search engines. However, this is precisely the current situation for raw
read archives, where precious data sleep undisturbed in rarely-opened
drawers. In this project we propose to develop a new scaling
breakthrough, allowing users to directly query sequencing data on the
fly in order to tap into the largest underexploited resource in life
sciences.
In the framework of the broader SeqDigger ANR project
<https://www.cesgo.org/seqdigger/>, we propose to design and propose new
indexing schemes, scaling up very large DNA collection (assembled or
not), and offering a way to query in real time input sequences of
interest. The recruited PhD student will explore existing methods,
mainly based on Bloom Filters, and will propose new algorithmic solutions.
Required skills
Candidates must have strong interest and expertise in algorithmics, data
structures and C++ implementation.
Knowledge in genomics and biology will be highly appreciated but is not
a prerequisite.
Practical information
The recruited PhD student will work in the GenScale
<https://team.inria.fr/genscale/> team at Inria Rennes, France. She/he
will work in close collaboration with the SeqDigger partners (Pasteur
Institute Paris, CEA Genoscope, MIO) and external collaborators (EBI
<https://www.ebi.ac.uk/>, Bielefeld University
<https://www.techfak.uni-bielefeld.de/~stoye/>, …). Short stays at EBI,
Cambridge, are expected.
*Remuneration*: net salary approx 1600€ per month.
Starting date is flexible. PhD could start as early as possible.
Informal enquiries are encouraged. Please contact
pierre.peterlongo at inria.fr
<https://www.cesgo.org/seqdigger/job-offer-search-engine-for-genomic-sequencing-data/>
Applicants should send a full CV, a cover letter explaining their
motivation, experience, and achievements, at least 2 references with
contact information, and their date of availability
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://fenris.cs.ucr.edu/pipermail/cpm-spire-l/attachments/20200520/e053d78d/attachment.html>
More information about the CPM-SPIRE-L
mailing list