Description

Leader of the group Database Architectures: Stefan Manegold.

We are the Database Architectures (DA) research group of CWI, a leading data systems research group active in the broad area of data management systems and infrastructure for data science. Our group has a strong international reputation in academia and industry for pioneering column-store technology, fast compression methods, vectorized query execution, online query-driven indexing (cracking), adaptive caching, and the integration of statistical languages and analysis into database management systems.


We develop, distribute and maintain the MonetDB open-source system, and we have spawned multiple spin-off companies, including Data Distilleries, VectorWise and MonetDB Solutions. Our team also operates a self-built cluster, SciLens, that – unlike many other computer clusters – is bandwidth-optimized and thus better suited as a data-science infrastructure. We pride ourselves on revealing the real problems in our discipline and coming up with revolutionary solutions that are frequently ahead of their time.

Vacancies

PhD Students in the areas of Big Data Management and Analysis Architectures and Data Science Engineering Technologies

Three fully funded PhD positions are available to work under the direction of Prof. Stefan Manegold on big data management technology. The positions focus on hardware-conscious data structures and algorithms in distributed and cloud environments, and on the integration of data mining and machine learning into large-scale data management systems.

Current projects with external funding

  • Capturing the Laws of Data Nature
  • Cross-Industry Predictive Maintenance Optimization Platform (CIMPLO)
  • Data Mining on High Volume Simulation Output (DAMIOSO)
  • Databricks CWI Research Agreement (Databricks)
  • LAD: Layered Astronomical Databases (LAD)
  • Process mining for multi-objective online control (PROMIMOOC)
  • The SciLens-II Infrastructure, Big Data at work (Scilens-II)

Related partners

  • BMW Munich
  • Databricks
  • LIACS Institute
  • MonetDB B.V.
  • Tata Steel