Description
Leader of the group Database Architectures: Stefan Manegold.
We, the Database Architectures (DA) research group of CWI, are well known as a top data systems research group, active in the broad area of data (management) systems and infrastructure for supporting data science. Our research group has a strong international reputation in academia and industry for pioneering column store technology, fast compression methods, vectorized query execution, on-line query-driven indexing (cracking), adaptive caching, and integration of statistical languages and analysis in database management systems.
We develop, distribute and maintain the MonetDB open-source system, and we have spawned multiple spin-off companies, including Data Distilleries, VectorWise and MonetDB Solutions. Our team also operates a self-built cluster, SciLens, that – unlike many other computer clusters – is bandwidth-optimized and thus better suited as a data-science infrastructure. We pride ourselves on revealing the real problems in our discipline and coming up with revolutionary solutions that are frequently ahead of their time.
Vacancies
No vacancies currently.
News

CWI en Databricks lanceren samenwerking in aanwezigheid van minister Kamp
Op 25 april 2017 lanceren CWI en het big data analyse- en data science software-bedrijf Databricks een nieuwe samenwerking. Dat gebeurt bij de Hannover Messe, in aanwezigheid van Minister Kamp van Economische Zaken.

CWI & Databricks: Big Data in Amsterdam
Amsterdam wants to play a leading international role in the development of data science research. In Big Data Amsterdam, Financieele Dagblad journalist Job Woudt interviews Amsterdam Data Science researchers on the functioning of the ecosystem where companies and knowledge institutions in Amsterdam collaborate in the area of Big Data.

Making a tedious search a breeze: parallel query execution in multi-core systems
When you request information from a huge database, you might want to grab a cup of coffee and sit back, because your request may take a while. How can we optimize searches for information, specifically in a multi-core processing unit? The answer may lie in parallel queries, which run simultaneously on different processors.

Martin Kersten appointed ACM Fellow
CWI fellow Martin Kersten has been appointed as one of the 2016 fellows of the Association of Computing Machinery (ACM).
Members
Associated Members
Publications
-
Lang, H, Beischl, A, Leis, V, Boncz, P.A, Neumann, T, & Kemper, A. (2020). Tree-Encoded Bitmaps. In Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 937–967). doi:10.1145/3318464.3380588
-
Raasveldt, M. (2020, June 9). Integrating analytics with relational databases. SIKS Dissertation Series.
-
Ghita, B, Gomes Tomé, D, & Boncz, P.A. (2020). White-box compression: Learning and exploiting compact table representations. In Proceedings of the Conference on Innovative Data Systems Research.
-
Kipf, A, Lang, H, Pandey, V.N, Persa, R.A, Anneser, C, Zacharatou, E.T, … Kemper, A. (2020). Adaptive main-memory indexing for high-performance point-polygon joins. In Advances in Database Technology - EDBT (pp. 347–358). doi:10.5441/002/edbt.2020.31
-
Kruit, B.B, He, H., H, & Urbani, J. (2020). Tab2Know: Building a Knowledge Base from Tables in Scientific Papers. In Lecture Notes in Computer Science/Lecture Notes in Artificial Intelligence. doi:10.1007/978-3-030-62419-4_20
-
Gubner, T.K, Gomes Tomé, D, Lang, H, & Boncz, P.A. (2019). Fluid co-processing: GPU Bloom-filters for CPU joins. In Proceedings of the 15th International Workshop on Data Management on New Hardware (pp. 9:1–9:10). doi:10.1145/3329785.3329934
-
De Leo, D, & Boncz, P.A. (2019). Fast concurrent reads and updates with PMAs. In Proceedings of the Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA) 2019. doi:10.1145/3327964.3328497
-
Boncz, P.A, Manegold, S, Ailamaki, A, Deshpande, A, & Kraska, T (Eds.). (2019). Proceedings of the 2019 International Conference on Management of Data. In P.A Boncz, S Manegold, A Ailamaki, A Deshpande, & T Kraska (Eds.), .
-
Kipf, A, Vorona, D., Müller, J, Kipf, T, Radke, B, Leis, V, … Kemper, A. (2019). Estimating cardinalities with deep sketches. In Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 1937–1940). doi:10.1145/3299869.3320218
-
Raasveldt, M, & Mühleisen, H.F. (2019). [Demo] DuckDB: An embeddable analytical database. In Proceedings of the ACM SIGMOD International Conference on Management of Data (pp. 1981–1984). doi:10.1145/3299869.3320212
Software
MonetDB: high-performance query processing against very large databases
MonetDB is a relational database management system (DBMS) providing high performance on complex queries against large databases.
Current projects with external funding
-
RelationalAI-CWI Research Agreement ()
-
Actian Research Grant II (ACTIAN II)
-
Cross-Industry Predictive Maintenance Optimization Platform (CIMPLO)
-
Data Mining on High Volume Simulation Output (DAMIOSO)
-
Databricks CWI Research Agreement (Databricks)
-
Facebook Research Grant (Facebook)
-
Research agreeement CWI - Databricks - vervolg contract (None)
-
Process mining for multi-objective online control (PROMIMOOC)
-
Structure-aware Querying & Information Retrieval on Evolving Large Graphs (SQIREL-GRAPHS)
Related partners
-
Actian Corporation
-
BMW Munich
-
Databricks
-
LIACS Institute
-
MonetDB B.V.
-
Neo Technology AB
-
OBI4wan B.V.
-
RelationalAI
-
Spinque
-
Tata Steel
-
WizeNoze B.V.
-
Facebook