Events

Events

[Building Blocks] Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine (Andrew Lamb)

Date

Mon Sep 23, 2024

Time

04:30pm EST

Location

ZOOM

Speaker

Andrew Lamb

Apache DataFusion is a fast, embeddable, and extensible query engine written in Rust that uses Apache Arrow as its memory model. In this talk we explain DataFusion in more detail and describe the types of data centric systems it is used to build. We will also review its high level architecture and feature set, discussing tradeoffs and performance between DataFusion’s modularity vs more common tightly coupled design.

This talk is part of the Database Building Blocks Seminar Series.

Zoom Link: https://cmu.zoom.us/j/95283696582 (Passcode 787637)

Bio:
Staff Engineer at InfluxData. Apache Arrow and Apache DataFusion PMC. In a past life worked on Vertica and Oracle.