News & Events
[Fall 2023] Survey and Evaluation of Database Management System Extensibility (Abi Kim)
Database management system (DBMS) extensibility is a feature which enables users to extend the DBMS with user software. However, the DBMS extensibility environment is fraught with perils, and DBMS developers have to resort to unspecified methods of developing extensions, including copying core DBMS source code and casing between different versions of the DBMS. Extending a DBMS to support new functionality is challenging due to the tight coupling between the system's internal components. This thesis studies and evaluates the design of Read More
[Fall 2023] Viewing Collaborative Editing Through a Databases Lens (Martin Kleppmann)
Software that allows several users to collaboratively edit a document, such as Google Docs, has traditionally been ignored by the databases community. This is surprising, because managing the edits to a text document, spreadsheet, vector graphics file, etc. is very much a data management problem, albeit with a data model that is very different from that supported by most databases. Collaboration software needs replication, concurrency control, and data layouts for efficient storage and retrieval. In our work on Automerge, a Read More
[Fall 2023] Snowflake Tech Talk (Bowei Chen)
Snowflake internals tech talk. Read More
On Embedding Database Management System Logic in Operating Systems via Restricted Programming Environments (Matt Butrovich)
The rise in computer storage and network performance means that disk I/O and network communication are often no longer bottlenecks in database management systems (DBMSs). Instead, the overheads associated with operating system (OS) services (e.g., system calls, thread scheduling, and data movement from kernel- space) limit query processing responsiveness. To avoid these overheads, user-space applications prioritizing performance over simplicity can elide these software layers with a kernel-bypass design. However, extracting benefits from kernel-bypass frameworks is challenging, and the libraries are Read More
ML⇄DB Seminar Series — Fall 2023
Pittsburgh, PA — The Carnegie Mellon University Database Research Group celebrates the grand convergence of data storage and computational mastery with the ML⇄DB Seminar Series (Machine Learning for Databases + Databases for Machine Learning). Each speaker will present the implementation details of their respective systems and examples of the technical challenges they faced when working with real-world customers. CMU-DB's weekly meetings (Mondays @ 4:30pm EST) are available to the public on Zoom. Non-CMU affiliated members of the general public are Read More
[ML⇄DB 2023] pgvector: Stylish Hierarchical Navigable Small World Indexes! (Jonathan Katz)
This talk is part of the ML⇄DB Seminar Series. Zoom Link: https://cmu.zoom.us/j/91461275681 (Passcode 177332) Read More
[ML⇄DB 2023] Chroma: Vector Database Straight From the Tenderloin! (Jeff Huber)
This talk is part of the ML⇄DB Seminar Series. Zoom Link: https://cmu.zoom.us/j/91461275681 (Passcode 177332) Read More
[ML⇄DB 2023] Alibaba: Domain Knowledge Augmented AI for Databases (Jian Tan)
One goal of applying AI for databases is to make the systems easier to use, e.g., natural language to SQL conversion (NL2SQL), and more efficient to operate, e.g., DevOps root cause analysis (RCA). Although scaling up general models and datasets with less hand engineering have achieved unprecedented successes in various applications, we argue that utilizing domain knowledge to augment AI for databases can provide an efficient and effective solution. Specifically, we use two production systems, SQL Bridge (NL2SQL) and ShapleyIQ Read More
[ML⇄DB 2023] Milvus: A Purpose-Built Vector Data Management System! (Li Liu)
This talk is part of the ML⇄DB Seminar Series. Zoom Link: https://cmu.zoom.us/j/91461275681 (Passcode 177332) Read More
[ML⇄DB 2023] Akamas: The Database Tuner of the Future! (Stefano Cereda)
This talk is part of the ML⇄DB Seminar Series. Zoom Link: https://cmu.zoom.us/j/91461275681 (Passcode 177332) Read More