Events

Events

[Building Blocks] Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine (Andrew Lamb)

Speaker:
Andrew Lamb
Date:
Mon Sep 23, 2024 @ 04:30pm EDT
Date:
Mon Sep 23, 2024
Time:
04:30pm EDT
Location:
https://cmu.zoom.us/j/95283696582?pwd=dn4nharXNC7lu3WCdCXdE2dYWfBB0u.1Zoom
Title:
Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine
System:
DataFusion
Video:
YouTube

Talk Info:

Apache DataFusion is a fast, embeddable, and extensible query engine written in Rust that uses Apache Arrow as its memory model. In this talk we explain DataFusion in more detail and describe the types of data centric systems it is used to build. We will also review its high level architecture and feature set, discussing tradeoffs and performance between DataFusion’s modularity vs more common tightly coupled design.

This talk is part of the Database Building Blocks Seminar Series.

Zoom Link: https://cmu.zoom.us/j/95283696582 (Passcode 787637)

Bio:

Staff Engineer at InfluxData. Apache Arrow and Apache DataFusion PMC. In a past life worked on Vertica and Oracle.