Events

Speaker:: Gian Merlino
Date:: Mon Mar 1, 2021 @ 04:30pm EDT
Location:: Zoom (https://cmu.zoom.us/j/94112059546?pwd=TUFNMk96Z1h4bG9CSHh6czlpVktuQT09)
Title:: Inside Apache Druid's Storage and Query Engine
System:: Druid
Video:: YouTube

Talk Info:

Apache Druid is an open-source columnar database known for high performance at scale; its largest deployments comprise thousands of servers. But no matter the scale, high performance starts with good fundamentals. This talk will dive into those fundamentals by exploring the inner workings of a single data server. We'll cover how Apache Druid stores data, what kinds of compression it uses, how it indexes data, how the storage engine is linked with the query processing engine, and how the system handles resource management and multithreading. Together, all these pieces enable Apache Druid to process billions of records per second on a single data server. This talk is part of the Vaccination Database Tech Talk Seminar Series.

Zoom Link: https://cmu.zoom.us/j/94112059546 (Password 809013)

Bio:

Gian is a cofounder and CTO at Imply, a San Francisco-based technology startup that provides a real-time analytics platform built on Apache Druid. He is also a committer and PMC member on the Apache Druid project. He holds a BS in Computer Science from Caltech.

More Info: https://db.cs.cmu.edu/seminar2021/#db4
