Description
Leader of the group Database Architectures: Stefan Manegold.
We, the Database Architectures (DA) research group of CWI, are well known as a top data systems research group, active in the broad area of data (management) systems and infrastructure for supporting data science. Our research group has a strong international reputation in academia and industry for pioneering column store technology, fast compression methods, vectorized query execution, on-line query-driven indexing (cracking), adaptive caching, and integration of statistical languages and analysis in database management systems.
We develop, distribute and maintain the MonetDB open-source system, and we have spawned multiple spin-off companies, including Data Distilleries, VectorWise and MonetDB Solutions. Our team also operates a self-built cluster, SciLens, that – unlike many other computer clusters – is bandwidth-optimized and thus better suited as a data-science infrastructure. We pride ourselves on revealing the real problems in our discipline and coming up with revolutionary solutions that are frequently ahead of their time.
Vacancies
PhD Students in the areas of Big Data Management and Analysis Architectures and Data Science Engineering Technologies
Three fully funded PhD positions are available to work under the direction of Prof. Stefan Manegold on big data management technology, with a particular focus on hardware-conscious data structures and algorithms in distributed and cloud environments, as well as the integration of data mining and machine learning into large-scale data management systems.
News

Hannes Mühleisen in media about insecure wifi in trains
Hannes Mühleisen, a postdoctoral researcher at CWI, recently found that he could see too much information from the wifi network in NS trains, which pass close to his home, a houseboat in Amsterdam near Central Station. The network information is not encrypted.

Veni grants for Daniel Dadush and Hannes Mühleisen
The Netherlands Organisation for Scientific Research (NWO) has awarded Veni grants to Daniel Dadush and Hannes Mühleisen of CWI. The funding allows these researchers, who have recently obtained their PhD, to conduct independent research and develop their ideas for a period of three years.

Big Data at High Performance
Computer hardware systems have evolved from monolithic machines in which each component performed one specific task, to complex systems with a wide range of heterogeneous components such as spinning disks, SSDs, RAM, and CPUs.

CWI spin-off MonetDB Solutions teams up with Numascale
CWI spin-off company MonetDB Solutions and the Norwegian appliance company Numascale have joined forces to develop database appliances that make big data analytics easier and more affordable for companies and organizations.
Members
Associated Members
Publications
- Angles Rojas, R, Arenas, M, Barceló, P, Boncz, P.A, Fletcher, G, Gutierrez, C, … Voigt, H. (2018). G-CORE a core for future graph query languages. In Proceedings of the ACM SIGMOD International Conference on Management of data.
- Kipf, A, Lang, H, Pandey, V.N, Persa, R, Boncz, P.A, Neumann, T, & Kemper, A. (2018). Adaptive geospatial joins for modern hardware.
- Boncz, P.A, Anatiotis, A.-C, & Kläbe, S. (2017). JCC-H: Adding Join Crossing Correlations with skew to TPC-H. In Performance Evaluation and Benchmarking for the Analytics Era (pp. 103–119). doi:10.1007/978-3-319-72401-0_8
- Raasveldt, M, & Mühleisen, H.F. (2017). Don't hold my data hostage - A case for client protocol redesign. In Proceedings of the VLDB Endowment (pp. 1022–1033).
- Sidirourgos, E, & Mühleisen, H.F. (2017). Scaling column imprints using advanced vectorization. Presented at the Thirteenth International Workshop on Data Management on New Hardware. doi:10.1145/3076113.3076120
- Rozenberg, E, & Boncz, P.A. (2017). Faster across the PCIe bus: A GPU library for lightweight decompression including support for patched compression schemes. In Proceedings of the International Workshop on Data Management on New Hardware. doi:10.1145/3076113.3076122
- CWI gaat samenwerking aan met Databricks [CWI enters into a collaboration with Databricks] - executive-people.nl - 29-04-2017. (2017).
- Database-onderzoek en vooruitlopen op de toekomst [Database research and staying ahead of the future] - computable.nl - 25-04-2017. (2017).
- CWI en Databricks gaan samenwerken [CWI and Databricks to collaborate] - engineerslonline.nl - 25-04-2017. (2017).
- Database-onderzoek en voorlopen op de toekomst [Database research and being ahead of the future] - Computable - 31-03-2017. (2017).
Software
MonetDB: high-performance query processing against very large databases
MonetDB is a relational database management system (DBMS) providing high performance on complex queries against large databases.
Current projects with external funding
- Capturing the Laws of Data Nature
- Cross-Industry Predictive Maintenance Optimization Platform (CIMPLO)
- Data Mining on High Volume Simulation Output (DAMIOSO)
- Databricks CWI Research Agreement (Databricks)
- LAD: Layered Astronomical Databases (LAD)
- Process mining for multi-objective online control (PROMIMOOC)
- The SciLens-II Infrastructure, Big Data at work (Scilens-II)
Related partners
- BMW Munich
- Databricks
- LIACS Institute
- MonetDB B.V.
- Tata Steel