Jun 7
2021
[Vaccination 2021] PostgreSQL Optimizer Methodology (Robert Haas)
- Speaker:
- Robert Haas
- System:
- PostgreSQL
In this talk, I'll describe at a high level how the PostgreSQL query planner approaches join planning, and how it gathers and uses statistics. Without losing sight of the fact that these algorithms generally work, I want to highlight some of the annoying cases where they break down, and the problems that they can cause for users and developers.... Read More
May 24
2021
[Vaccination 2021] MonetDB: Scale Up Before You Scale Out (Sir Martin Kersten)
- Speaker:
- Sir Martin Kersten
- System:
- MonetDB
MonetDB is the pioneering open-source main-memory oriented column store developed in a research setting and spinning out into the enterprise market to make a (performance) difference. MonetDB innovates at all layers of a DBMS, e.g. a storage model based on vertical fragmentation, a modern CPU-tuned query execution architecture, automatic and self-tuning indexes, run-time query optimization, and modular software architecture. In... Read More
May 17
2021
[Vaccination 2021] Fast Materialized Views for Fast Websites (Malte Schwarzkopf)
- Speaker:
- Malte Schwarzkopf
- System:
- Noria
Modern web applications require fast reads of query results over user data. In practice, they use complex, brittle, and tricky-to-manage caching layers to achieve this performance. In this talk, I will discuss how we built a new database system, Noria, from the ground up around the paradigm of materialized view maintenance via incremental streaming dataflow. Noria combines eager and... Read More
May 10
2021
[Vaccination 2021] The Design of InfluxDB IOx: An In-Memory Columnar Database Written in Rust with Apache Arrow (Paul Dix)
I'll talk about the design of InfluxDB IOx, the future core of InfluxDB, an open source time series database. It's an in-memory columnar database that uses object storage for persistence. It's written in Rust and is built on top of Apache Arrow. Unlike previous versions of InfluxDB, IOx supports standards-compliant SQL and the Postgres dialect in particular. This is... Read More
May 3
2021
[Vaccination 2021] Under the Hood of an Exadata Transaction – How Did We Harness the Power of Persistent Memory? (Jia Shi)
Persistent memory is a new silicon technology, adding a distinct storage tier of performance, capacity, and price between DRAM and Flash. The persistent memory is physically present on the memory bus of the storage server, resulting in reads at memory speed, much faster than flash. Writes are persistent, surviving power cycles, unlike DRAM. Oracle has engineered Exadata Smart PMEM Cache... Read More
Apr 26
2021
[Vaccination 2021] Separation of Storage and Compute for Transactions and Analytics (Joyo Victor)
- Speaker:
- Joyo Victor
- System:
- SingleStore
Separation of Storage and Compute, à la Snowflake or BigQuery, gives enormous benefits in terms of flexibility, scalability and durability. This talk presents a detailed architecture differentiated on low latency small writes. This talk is part of the Vaccination Database Tech Talk Seminar Series. Zoom Link: https://cmu.zoom.us/j/94112059546 (Password 809013) Read More
Apr 19
2021
[Vaccination 2021] Deterministic Database Management in Mission-Critical Applications (Andrei Gorine)
- Speaker:
- Andrei Gorine
- System:
- eXtremeDB
Mission- and safety-critical systems software designs embody key characteristics for which temporal correctness is essential. Deterministic, predictable, and fully controllable software components that complement modern real-time operating systems offerings are in demand. It is commonly believed by software developers that meeting timing requirements is a matter of sufficiently increasing system throughput. However, research and industry projects have often brought forward... Read More
Apr 12
2021
[Vaccination 2021] LeanStore: In-Memory Data Management Beyond Main Memory (Viktor Leis)
- Speaker:
- Viktor Leis
- System:
- LeanStore
LeanStore is a high-performance OLTP storage engine optimized for many-core CPUs and NVMe SSDs. The goal of the project is to achieve performance comparable to in-memory systems when the data set fits into RAM, while being able to fully exploit the bandwidth of fast NVMe SSDs for large data sets. In this talk, I will present most of the important... Read More
Apr 5
2021
[Vaccination 2021] Query Processing in Google BigQuery (Hossein Ahmadi + Aleksandras Surna)
- Speakers:
- Hossein Ahmadi, Aleksandras Surna
- System:
- BigQuery
Google BigQuery is a serverless, scalable, and cost-effective cloud data warehouse. In this talk, we give an overview of distributed query execution in BigQuery and present various query optimization techniques used. In particular, we will discuss the dynamic query execution primitives built into BigQuery. This talk is part of the Vaccination Database Tech Talk Seminar Series. Zoom Link: https://cmu.zoom.us/j/94112059546... Read More
Mar 29
2021
[Vaccination 2021] FASTER: Efficient State Management for the Modern Edge-Cloud (Badrish Chandramouli)
- Speaker:
- Badrish Chandramouli
- System:
- FASTER
Managing state efficiently in modern applications written for the cloud and edge is hard. In the FASTER project, we have been creating building blocks such as FasterKV and FasterLog to alleviate this problem using techniques such as epoch protection, tiered storage, and asynchronous recoverability. In this talk, we describe these components and how we have been evolving the project over... Read More