[Building Blocks] Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine (Andrew Lamb)
Date
Time
Location
Speaker
Apache DataFusion is a fast, embeddable, and extensible query engine written in Rust that uses Apache Arrow as its memory model. In this talk we explain DataFusion in more detail and describe the types of data centric systems it is used to build. We will also review its high level architecture and feature set, discussing tradeoffs and performance between DataFusion’s modularity vs more common tightly coupled design.
This talk is part of the Database Building Blocks Seminar Series.
Zoom Link: https://cmu.zoom.us/j/95283696582 (Passcode 787637)
Bio:
Staff Engineer at InfluxData. Apache Arrow and Apache DataFusion PMC. In a past life worked on Vertica and Oracle.