I am a Genome Data Scientist who works at the interfaces of statistics, discrete mathematics and pattern recognition on the one hand and molecular biology and biomedicine on the other hand. On the theoretical end, I am particularly interested in data mining, sequence analysis and machine learning, with a special focus on Markovian and latent variable models. Biomedical questions I address relate to issues in computational genomics, in particular in the areas of pathogen, cancer and single cell biology. Genotyping and phasing variants from next- and third-generation sequencing data is a special focus. I am also concerned with computational pan-genomics, that is, to try to make sense out of -- soon -- millions of sequenced genomes, and with translating genome sequencing data into information that one can make use of in clinical practice, using machine learning methods.
Tenured Principal Investigator in the Life Sciences Group, Centrum Wiskunde & Informatica, Amsterdam
2010 - 2014
Tenure-Track Principal Investigator in the Life Sciences Group, Centrum Wiskunde & Informatica, Amsterdam
2009 - 2010
Postdoctoral Fellow, Department of Mathematics, University of California at Berkeley. Advisor: Lior Pachter.
2007 - 2009
Postdoctoral Fellow, School of Computing Science, Simon Fraser University. Advisors: Martin Ester, S.Cenk Sahinalp
1999 - 2007
Research Assistant at the Center for Applied Computer Science (ZAIK), Cologne, Germany. Advisor: Ulrich Faigle
"Qualification Professeur des Universites"
Habilitation Equivalent France, January 2010
University of Cologne, Germany, 1999
[ Thesis ]
Private Donation, D. DesJardins, Google Inc.
For postdoctoral research in the Department of Mathematics at UC Berkeley
Postdoctoral Fellowship, Pacific Institute of the Mathematical Sciences