Archived Events

Mar 13 2025
Redesigning Blockchains: SSD-optimized Verifiable Databases and Beyond (Daniel Lin-Kit Wong)
Speaker:
Daniel Lin-Kit Wong
System:
QMDB

Blockchains are decentralized ledgers that replace trusted central authorities with verifiable distributed consensus. This decentralization has resulted in blockchains effectively becoming ‘slow and expensive computers’, but there are huge opportunities for architectural optimization across the entire blockchain software stack. We begin this talk by outlining the scaling challenges from a systems researcher’s perspective, and discussing the bottlenecks faced in computation,... Read More

Mar 10 2025
[SQL Death] Pipe Syntax in SQL: SQL for the 21st Century
Speaker:
Jeff Shute
System:
GoogleSQL
Video:
YouTube

SQL has been extremely successful as the de facto standard language for working with data. Virtually all mainstream database-like systems use SQL as their primary query language. But SQL is an old language with significant design problems, making it difficult to learn, difficult to use, and difficult to extend. Many have observed these challenges with SQL, and proposed solutions involving... Read More

Feb 24 2025
[SQL Death] Apache Pinot Query Optimizer
Speakers:
Yash Mayya, Gonzalo Ortiz
System:
Pinot
Video:
YouTube

Apache Pinot is a distributed real-time OLAP database, part of a fast-growing segment designed for large-scale, user-facing analytics. Its primary query language is SQL, and it excels at low-latency queries, high throughput, and fresh data. Currently, Pinot supports two SQL dialects, and we are building a compatibility layer to enable pluggable time-series query languages, with Uber's M3 and PromQL as... Read More

Feb 17 2025
[SQL Death] Towards Sanity in Query Languages
Speakers:
Viktor Leis, Thomas Neumann
System:
Technical University of Munich
Video:
YouTube

The relational model has stood the test of time and is the foundation of most database systems. But let's be honest -- its success is not because of SQL, but in spite of it. SQL's syntax is arcane, inconsistent, and bears little resemblance to the actual execution semantics of queries. Worse yet, SQL is not even a true standard -- every... Read More

Feb 10 2025
[SQL Death] Larry Ellison was Right (kinda)! TypeScript Stored Procedures for the Modern Age
Speaker:
James Cowling
System:
Convex
Video:
YouTube

No one uses SQL to write business logic. It's written in a programming language with libraries, tests, type safety, and expressive syntax. Traditionally this was the domain of a backend team, who’d try to build enough functionality to keep the frontend team happy without breaking the database. This model hasn’t kept up with the needs of full stack developers though,... Read More

Jan 21 2025
SplitSQL: Practical Pushdown Cache for DataLake Analytics (Xiangpeng Hao)
Speaker:
Xiangpeng Hao
System:
DataFusion

Modern data analytics embrace a disaggregated architecture which decouples storage, cache, and compute into network-connected independent components. With disaggregated cache, a key design decision is whether to push down query predicates to the cache server. Without predicate pushdown, the cache must send all data to compute nodes, creating network bottlenecks. With predicate pushdown, the cache server evaluates predicates on cached... Read More

Dec 9 2024
[Building Blocks] Implement, Integrate and Extend a Query Engine (Ruihang Xia)
Speaker:
Ruihang Xia
System:
GreptimeDB
Video:
YouTube

GreptimeDB uses Apache DataFusion and many other common building blocks in its implementation. This talk will focus on managing the query aspect of a (time-series) database across various parts. We have extended DataFusion to implement PromQL, add syntactic sugar to SQL, cooperate with external secondary indexes, write domain-specific optimizer rules, etc. Each of the above is extended in a different... Read More

Dec 2 2024
[Building Blocks] Apache OpenDAL: One Layer, All Storage (Xuanwo)
Speaker:
Xuanwo
System:
OpenDAL
Video:
YouTube

Apache OpenDAL is an Open Data Access Layer that enables seamless interaction with diverse storage services, guided by its mission of "One Layer, All Storage" and core tenets of being open, solid, fast, and extensible to serve various users from infrastructure builders to application developers. In this talk, we will explain OpenDAL in more detail and describe the abstractions it... Read More

Nov 25 2024
Amazon Redshift: re-innovating cloud analytics
Speaker:
Ippokratis Pandis
System:
Redshift

In 2013, Amazon Web Services revolutionized the data warehousing industry by launching Amazon Redshift, the first fully-managed, petabyte-scale, enterprise-grade cloud data warehouse. Amazon Redshift made it simple and cost-effective to efficiently analyze large volumes of data using existing business intelligence tools. This cloud service was a significant leap from the traditional on-premise data warehousing solutions, which were expensive, not elastic,... Read More

Nov 20 2024
The Rise of Data Streaming Platforms
Speaker:
Jun Rao
System:
Confluent

Apache Kafka and Apache Flink are powering a new category of data infrastructure called the data streaming platform (DSP). This provides an opportunity for each enterprise to take action on what’s happening in its business in real time. I will first provide an overview of DSP. DSP has both similarities to and differences from database systems. I will show how existing database... Read More