I am a genome data scientist working at the interface of statistics, discrete mathematics and pattern recognition on the one hand, and molecular biology and biomedicine on the other. I joined CWI in 2010, coming from UC Berkeley, where I was a postdoctoral fellow in the Laboratory for Mathematical and Computational Biology in the Department of Mathematics, and I received tenure at CWI in 2014. Prior to UC Berkeley, I was a postdoctoral fellow in the Data Mining and Computational Laboratories of the School of Computing Science at Simon Fraser University in Vancouver. I received my PhD in 2006 from the University of Cologne, Germany. On the theoretical side, I am particularly interested in data mining, sequence analysis and machine learning, with a special focus on Markovian and latent variable models. The biomedical questions I address lie in computational genomics, in particular in pathogen, cancer and single-cell biology. Genotyping and phasing variants from next- and third-generation sequencing data is a special focus. I also work on computational pan-genomics, that is, on making sense of what will soon be millions of sequenced genomes, and on using machine learning methods to translate genome sequencing data into information that can be used in clinical practice.


  • Statistical Models for Structural Genetic Variants in the Genome of the Netherlands

Professional activities

  • committeeMember
    Member Program Committee - Research in Computational Molecular Biology [RECOMB]
  • committeeMember
    Member Program Committee - Intelligent Systems for Molecular Biology [ISMB], 2013, 2014, 2016, 2017
  • committeeMember
    Member Program Committee - European Conference on Computational Biology [ECCB], 2017
  • committeeMember
    Member Program Committee - ACM Conference on Bioinformatics, Computational Biology and Biomedicine [ACM-BCB], 2012, 2017
  • chair
    Publicity Chair - Research in Computational Molecular Biology [RECOMB]
  • chair
    Program Faculty - Computational Genomics Summer Institute [CGSI], UCLA
  • organizer
    Workshop, "Data Structures in Bioinformatics" [DSB]
  • committeeMember
    Founding Member - Consortium, "Computational Pangenomics"
  • committeeMember
    Member Consortium - "The Genome of the Netherlands"
  • committeeMember
    Member Award Selection Committee - BioSB
  • grant
    Summer Semester Stipends, IPAM, UCLA, 2016, 2017
  • grant
    NWO and KNAW Workshop Grant, including sponsorships from industry