Description
Leader of the group Database Architectures: Stefan Manegold.
We, the Database Architectures (DA) research group of CWI, are well known as a top data systems research group, active in the broad area of data (management) systems and infrastructure for supporting data science. Our research group has a strong international reputation in academia and industry for pioneering column store technology, fast compression methods, vectorized query execution, on-line query-driven indexing (cracking), adaptive caching, and integration of statistical languages and analysis in database management systems.
We develop, distribute and maintain the MonetDB open-source system, and we have spawned multiple spin-off companies, including Data Distilleries, VectorWise and MonetDB Solutions. Our team also operates a self-built cluster, SciLens, that – unlike many other computer clusters – is bandwidth-optimized and thus better suited as a data-science infrastructure. We pride ourselves on revealing the real problems in our discipline and coming up with revolutionary solutions that are frequently ahead of their time.
Vacancies
No vacancies currently.
News
Current events
New national bi-weekly seminar: The Dutch Seminar on Data Systems Design (DSDSD)
- 2021-03-26T16:15:00+01:00
- 2021-03-26T17:45:00+01:00
New national bi-weekly seminar: The Dutch Seminar on Data Systems Design (DSDSD)
Start: 2021-03-26 16:15:00+01:00 End: 2021-03-26 17:45:00+01:00
CWI has initiated a new national bi-weekly seminar called the Dutch Seminar on Data Systsems Design (DSDSD).
DSDSD is co-organized with TUE, TU Delft, and Uva and brings together researchers in data systems. The format is a 90-minute session with two talks, on Friday late afternoon; so DSDSD marks the start of weekend, but is time-wise also friendly for US based speakers. In each seminar, there is one invited speaker from abroad and one local speaker. There have been two seminars already, this Friday is the third.
The website is
https://dsdsd.da.cwi.nl
(here people can get on the mailing list, which distributes the zoom links) and DSDSD also tweets at https://twitter.com/dsdsdnl
We have had two succesful seminars so far, the first with Semih Salihoglu (University of Waterloo) and Hannes Muehleisen (CWI) and the second with Andy Pavlo (Carnegie Mellon University) and Peter Boncz. Friday is the third, with Viktor Leis (University of Erlangen) and Sam Ansmink (CWI). In these seminars, we see 60-70 attendees so far. In the following weeks there will be talks also by TUE and TU Delft.
Members
Associated Members
Publications
-
Ding, M, Chen, S, Makrynioti, K, & Manegold, S. (2021). Progressive join algorithms considering user preference.
-
Makrynioti, K. (2020). Evaluation of query modifications in progressive data processing.
-
Kruit, B.B, Boncz, P.A, & Urbani, J. (2020). Extracting N-ary Facts from Wikipedia Table Clusters. In International Conference on Information and Knowledge Management, Proceedings (pp. 655–664). doi:10.1145/3340531.3412027
-
Ghit, B, Poggi, N, Rosen, J, Xin, R, & Boncz, P.A. (2020). SparkFuzz: Searching correctness regressions in modern query engines. In Proceedings of the Workshop on Testing Database Systems, DBTest 2020. doi:10.1145/3395032.3395327
-
Fuchs, P, Boncz, P.A, & Ghit, B. (2020). Edgeframe: Worst-case optimal joins for graph-pattern matching in spark. In Proceedings of the 3rd ACM SIGMOD Joint International Workshop on Graph Data Management Experiences and Systems and Network Data Analytics, GRADES-NDA 2020. doi:10.1145/3398682.3399162
-
Gubner, T.K, Leis, V, & Boncz, P.A. (2020). Efficient query processing with Optimistically Compressed Hash Tables & Strings in the USSR. In Proceedings of the IEEE International Conference on Data Engineering (ICDE) (pp. 301–312). doi:10.1109/ICDE48307.2020.000
-
Raasveldt, M, & Mühleisen, H.F. (2020). Data Management for Data Science - Towards Embedded Analytics. In Proceedings of the Conference on Innovative Data Systems Research.
-
Gomes Tomé, D, & Boncz, P.A. (2020). Redesigning query engines for white-box compression. In CEUR Workshop Proceedings.
-
Holanda, P.T, Raasveldt, M, Manegold, S, & Mühleisen, H.F. (2019). Progressive Indexes: Indexing for interactive data analysis. Proceedings of the VLDB Endowment, 12(13), 2366–2378. doi:10.14778/3358701.3358705
-
Kruit, B.B, Boncz, P.A, & Urbani, J. (2019). Extracting novel facts from tables for Knowledge Graph completion. In Proceedings of ISWC 2019 (pp. 364–381). doi:10.1007/978-3-030-30793-6_21
Software
MonetDB: high-performance query processing against very large databases
MonetDB is a relational database management system (DBMS) providing high performance on complex queries against large databases.
Current projects with external funding
-
RelationalAI-CWI Research Agreement ()
-
Actian Research Grant II (ACTIAN II)
-
Cross-Industry Predictive Maintenance Optimization Platform (CIMPLO)
-
Data Mining on High Volume Simulation Output (DAMIOSO)
-
Databricks CWI Research Agreement (Databricks)
-
Lokale Digitale Competentie centra (DCC) (DCC-NWO-I)
-
Facebook Research Grant (Facebook)
-
Research agreeement CWI - Databricks - vervolg contract (None)
-
Process mining for multi-objective online control (PROMIMOOC)
-
RelationalAI-CWI Research Grant Agreement (RelationalAI)
-
Structure-aware Querying & Information Retrieval on Evolving Large Graphs (SQIREL-GRAPHS)
-
Ursa-CWI Research Grant (Ursa)
Related partners
-
Actian Corporation
-
BMW Munich
-
Databricks
-
LIACS Institute
-
MonetDB B.V.
-
Neo Technology AB
-
OBI4wan B.V.
-
RelationalAI
-
Spinque
-
Tata Steel
-
WizeNoze B.V.
-
Facebook
-
Ursa Computing Inc.