Archived Events
[Spring 2023] 15-721 Final Project Presentations
Carnegie Mellon University is thrilled to announce the upcoming final project presentations for the Advanced Database Systems course for the Spring 2023 semester. This eagerly awaited event showcases the hard work and innovation of the university's talented students as they present their cutting-edge projects to an audience of esteemed guests and fellow students. The presentations promise to deliver captivating insights into the future of database systems. CMU's Advanced Database Systems course, renowned for its comprehensive curriculum and focus on industry-relevant... [Read More]
Return of the Database Machines? Towards a Hardware-Software Approach for High-performance Databases (Jignesh Patel)
Analytic database applications have an insatiable appetite for higher performance. In the past, a large part of this appetite was met by leveraging the gift of Moore’s Law. However, the slowing down of Moore’s Law now requires a new approach. Fortunately, the hardware landscape is currently undergoing a Cambrian explosion of new architectures. In this talk, I will describe how one class of architecture may provide part of the answer to our search for future high-performance database systems. This architecture... [Read More]
MS Thesis Defense: Extendable Rule-Based Action Generation for Self-Driving Database Systems (Mike Xu)
Database management systems (DBMSs) have become more complex to meet increasingly demanding usage. To owners and operators, the need for a self-driving DBMS that can automatically tune and optimize itself without human intervention is apparent now more than ever. Such a self-driving DBMS considers a set of candidate actions to apply to reach a configuration that improves performance for a given workload. Furthermore, the DBMS would continuously adjust the configuration in anticipation of changing workloads and data distributions. Efforts to... [Read More]
MS Thesis Defense: High Performance DBMS Design for Intelligent Query Scheduling (Deepayan Patra)
Decades of research in the field of database management systems (DBMSs) have focused on improving system performance with impressive results. Modern analytical databases take advantage of innovative methods such as vectorization and compilation to improve single query performance, use supporting data structures such as indexes or views to reduce data access requirements, and support the execution of multiple queries in parallel while maintaining necessary isolation guarantees. We propose a new line of work with workload and architecture-aware scheduling algorithms to... [Read More]
[15-445/645] Fall 2022 Live Call-in Q&A Lecture
For the final lecture in CMU's Introduction to Database Systems (Fall 2022) course, we are allowing anyone to call in with their database questions. The lecture will be livestreamed via Youtube and you will be able to ask your questions to Prof. Andy Pavlo directly. Livestream: https://youtu.be/MxOKUt6LeeU Audience Call-in: https://cmu.zoom.us/j/99783788428?pwd=R2ZSd2x0SUFnRlNIak5TVk5ubmFjQT09 (Must have Zoom account) [Read More]
[¡Databases! 2022] CrateDB: Distributed SQL Database Built on Top of Lucene (Marios Trivyzas)
CrateDB is a SQL, open-source, distributed database that makes storage and analysis of massive amounts of data simple and efficient. It offers effective data handling due to a high degree of scalability and availability, real-time query performance, and extensible data models. In this talk, we will discuss the fundamental concepts of CrateDB and highlight what makes it unique in comparison with other databases. In particular, we will cover topics such as dynamic schemas, Lucene index, distributed query execution, logical replication... [Read More]
[¡Databases! 2022] SplinterDB: A Key-Value Store for Modern Storage Devices (Alex Conway)
We built SplinterDB to address two trends. The first is the modern storage hardware offers significantly higher bandwidth and lower latency. The second is that modern applications at VMware and elsewhere store fine-grain data, such as metadata. Current state-of-the-art key-value stores such as RocksDB fail to fully exploit the capabilities of these devices on these types of workloads. SplinterDB is designed around a novel data structure, the mapped Bε-tree, which combines ideas from the theory of external memory hash tables... [Read More]
[¡Databases! 2022] TigerBeetle: Magical Memory Tour! (Joran Dirk Greef)
TigerBeetle is an open source distributed financial accounting database designed for mission critical safety and performance to track financial transactions at scale. TigerBeetle is coded to NASA’s Power of Ten Rules for Safety Critical Code. All memory is statically allocated at startup for predictable and efficient resource usage. Function arguments and return values are verified at runtime by over three thousand assertions. Deterministic Simulation Testing accelerates the maturation process of TigerBeetle’s VSR consensus protocol and LSM storage engine, through fault... [Read More]
[¡Databases! 2022] Gaia: Direct Database Access without Database APIs (Tengiz Kharatishvili)
GAIA database is an unusual engine - it offers a direct access model and supports memory pointers, applications are able to create and navigate complex data structures like graphs with no or very little API overhead: the engine implements transactional memory view with full transactional durability - whatever you do in memory gets captures and persistent in an efficient transparent manner. You get ACID properties of a database but with the look and feel of a "regular" C/C++ code that... [Read More]
[¡Databases! 2022] RisingWave: Reinventing(?!) Stream Processing in the Cloud Era (Yingjun Wu)
RisingWave is a cloud-native streaming database. Different from existing streaming systems, RisingWave fully leverages the modern cloud infrastructure to achieve high performance and scalability at a low cost. In this talk, I will walk you through the detailed designs of RisingWave, and discuss how we adopt some old ideas to build a next-generation streaming system. This talk is part of the ¡Databases! – A Database Seminar Series. Zoom Link: https://cmu.zoom.us/j/94466872009 (Passcode 424050) [Read More]