Archived Events

Archived Events

May 17

2021

May 17 2021
[Vaccination 2021] Fast Materialized Views for Fast Websites (Malte Schwarzkopf)
Speaker:
Malte Schwarzkopf
System:
Noria
Video:
YouTube

Modern web applications require fast reads of query results over user data. In practice, they use a complex, brittle, and tricky-to-manage caching layers to achieve this performance. In this talk, I will discuss how we built a new database system, Noria, from the ground up around the paradigm of materialized view maintenance via incremental streaming dataflow. Noria combines eager and... Read More

May 10

2021

May 10 2021
[Vaccination 2021] The Design of InfluxDB IOx: An In-Memory Columnar Database Written in Rust with Apache Arrow (Paul Dix)
Speaker:
Paul Dix
System:
InfluxDB
Video:
YouTube

I'll talk about the design of InfluxDB IOx, the future core of InfluxDB, an open source time series database. It's an in-memory columnar database that uses object storage for persistence. It's written in Rust and is built on top of Apache Arrow. Unlike previous versions of InfluxDB, IOx supports standards compliant SQL and the Postgres dialect in particular. This is... Read More

May 3

2021

May 3 2021
[Vaccination 2021] Under the Hood of an Exadata Transaction – How Did We Harness the Power of Persistent Memory? (Jia Shi)
Speaker:
Jia Shi
System:
Oracle
Video:
YouTube

Persistent memory is a new silicon technology, adding a distinct storage tier of performance, capacity, and price between DRAM and Flash. The persistent memory is physically present on the memory bus of the storage server resulting in reads at memory speed, much faster than flash. Writes are persistent, surviving power cycles, unlike DRAM. Oracle has engineered Exadata Smart PMEM Cache... Read More

Apr 26

2021

Apr 26 2021
[Vaccination 2021] Separation of Storage and Compute for Transactions and Analytics (Joyo Victor)
Speaker:
Joyo Victor
System:
SingleStore
Video:
YouTube

Separation of Storage and Compute, ala Snowflake or BigQuery, gives enormous benefits in terms of flexibility, scalability and durability. This talk presents a detailed architecture differentiated on low latency small writes. This talk is part of the Vaccination Database Tech Talk Seminar Series. Zoom Link: https://cmu.zoom.us/j/94112059546 (Password 809013) Read More

Apr 19

2021

Apr 19 2021
[Vaccination 2021] Deterministic Database Management in Mission-Critical Applications (Andrei Gorine)
Speaker:
Andrei Gorine
System:
eXtremeDB
Video:
YouTube

Mission- and safety-critical systems software designs embody key characteristics for which temporal correctness is essential. Deterministic, predictable, and fully controllable software components that complement modern real-time operating systems offerings are in demand. It is commonly believed by software developers that meeting timing requirements is a matter of sufficiently increasing system throughput. However, research, and industry projects have often brought forward... Read More

Apr 12

2021

Apr 12 2021
[Vaccination 2021] LeanStore: In-Memory Data Management Beyond Main Memory (Viktor Leis)
Speaker:
Viktor Leis
System:
LeanStore
Video:
YouTube

LeanStore is a high-performance OLTP storage engine optimized for many-core CPUs and NVMe SSDs. The goal of the project is to achieve performance comparable to in-memory systems when the data set fits into RAM, while being able to fully exploit the bandwidth of fast NVMe SSDs for large data sets. In this talk, I will present most of the important... Read More

Apr 5

2021

Apr 5 2021
[Vaccination 2021] Query Processing in Google BigQuery (Hossein Ahmadi + Aleksandras Surna)
Speakers:
Hossein Ahmadi , Aleksandras Surna
System:
BigQuery
Video:
YouTube

Google BigQuery is a serverless, scalable, and cost effective cloud data warehouse. In this talk, we give an overview of distributed query execution in BigQuery and present various query optimization techniques used. In particular, we will discuss the dynamic query execution primitives built into BigQuery. This talk is part of the Vaccination Database Tech Talk Seminar Series. Zoom Link: https://cmu.zoom.us/j/94112059546... Read More

Mar 29

2021

Mar 29 2021
[Vaccination 2021] FASTER: Efficient State Management for the Modern Edge-Cloud (Badrish Chandramouli)
Speaker:
Badrish Chandramouli
System:
FASTER
Video:
YouTube

Managing state efficiently in modern applications written for the cloud and edge is hard. In the FASTER project, we have been creating building blocks such as FasterKV and FasterLog to alleviate this problem using techniques such as epoch protection, tiered storage, and asynchronous recoverability. In this talk, we describe these components and how we have been evolving the project over... Read More

Mar 22

2021

Mar 22 2021
[Vaccination 2021] NoisePage: The Self-Driving Database Management System (Lin Ma)
Speaker:
Lin Ma
System:
NoisePage
Video:
YouTube

Database management systems (DBMSs) are an important part of modern data-driven applications. However, they are notoriously difficult to deploy and administer. There are existing methods that recommend physical design or knob configurations for DBMSs. But most of them require humans to make final decisions and decide when to apply changes. The goal of a self-driving DBMS is to remove the... Read More

Mar 16

2021

Mar 16 2021
[PDL] Package Queries: Scalable Prescriptive Analytics Close to the Data (Matteo Brucato)
Speaker:
Matteo Brucato

Decision making is central to a broad range of domains, including finance, transportation, healthcare, the travel industry, robotics, and engineering. It is often found at the very final step of business analytics--prescriptive analytics--to allow businesses to transform a rich understanding of data, typically provided by advanced predictive models, into actionable decisions. Modeling and solving these problems have relied on application-specific... Read More