Sign up for the
Database Group Seminar
Mailing List
Jan
21
2025
12:00pm EST
GHC 8115
SplitSQL: Practical Pushdown Cache for DataLake Analytics (Xiangpeng Hao)
Modern data analytics embrace a disaggregated architecture which decouples storage, cache, and compute into network-connected independent components. With disaggregated cache, a key design decision is whether to push down query predicates to the cache server. Without predicate pushdown, the cache must send all data to compute nodes, creating network bottlenecks. With predicate pushdown, the cache server evaluates predicates on cached data, but its limited computational resources become the bottleneck. In this talk, we introduce SplitSQL, a pushdown cache system with... Read More