News & Events
[¡Databases! 2022] Gaia: Direct Database Access without Database APIs (Tengiz Kharatishvili)
GAIA database is an unusual engine - it offers a direct access model and supports memory pointers, applications are able to create and navigate complex data structures like graphs with no or very little API overhead: the engine implements transactional memory view with full transactional durability - whatever you do in memory gets captures and persistent in an efficient transparent manner. You get ACID properties of a database but with the look and feel of a "regular" C/C++ code that Read More
[¡Databases! 2022] RisingWave: Reinventing(?!) Stream Processing in the Cloud Era (Yingjun Wu)
RisingWave is a cloud-native streaming database. Different from existing streaming systems, RisingWave fully leverages the modern cloud infrastructure to achieve high performance and scalability at a low cost. In this talk, I will walk you through the detailed designs of RisingWave, and discuss how we adopt some old ideas to build a next-generation streaming system. This talk is part of the ¡Databases! – A Database Seminar Series. Zoom Link: https://cmu.zoom.us/j/94466872009 (Passcode 424050) Read More
EdgeDB: Replacing SQL and Improving on the Relational Database Model (Michael Sullivan)
EdgeDB is a new database built around an evolution of the relational model that we call "graph-relational". In the graph-relational model, data is represented as strongly typed objects containing set-valued scalar properties and links to other objects. Missing values are represented in the language as empty sets (no NULL!), and have consistent semantics. The query language, EdgeQL, supports convenient fetching and modification of nested data. EdgeQL queries are composable: queries can be used without restriction as subexpressions and bound to Read More
[¡Databases! 2022] Neon: Serverless PostgreSQL! (Heikki Linnakangas)
Neon is a Postgres cloud service that separates compute and storage. Separation of compute and storage makes it possible to scale the components independently, and instant startup without a lengthy restore. Amazon Aurora pioneered the separation of compute and storage for OLTP databases. Neon uses the same concept of applying the WAL stream to reconstruct database state in the storage layer. However, Neon takes that concept further, and retains the WAL in a pre-processed format, making it possible to reconstruct Read More
[¡Databases! 2022] Litestream: Making Single-Node Deployments Cool Again (Ben Johnson)
SQLite has long been regarded as an incredibly reliable, fast, & easy-to-use database in the world of personal devices such as laptops & phones. However, it's never gained much traction in the world of web applications because it's built as a single-node database. Litestream adds simple, cheap streaming replication to SQLite to expand the use cases that the database can be used for. Litestream provides the missing disaster recovery tooling to make SQLite a viable database for many concurrent, production Read More
[¡Databases! 2022] Rockset: High Performance Queries with Dynamically Typed SQL (Ben Hannel)
This talk is part of the ¡Databases! – A Database Seminar Series. Zoom Link: https://cmu.zoom.us/j/94466872009 (Passcode 424050) Read More
[¡Databases! 2022] Odyssey: PostgreSQL Connection Proxy! (Andrey Borodin)
In hypertext world connection proxies is a must for many decades now. And in many cases this idea works for databases too! Today almost any busy OLTP Postgres instance have to use some sort of proxy: old but gold PgBouncer, scalable Odyssey or entirely new SPQR\PgCat. In this talk I'll discuss what allows proxies to be useful, what Postgres hackers do to fix this and architecture of proxy that I maintain - Odyssey. Odyssey is a scalable multi-threaded connection pooler Read More
[¡Databases! 2022] Architecture Insight of OceanBase: A Distributed SQL Database (Charlie Yang)
OceanBase Database, is an open-source, distributed Hybrid Transactional/Real-time Operational Analytics (HTAP) database management system that has set new world records in both the TPC-C and TPC-H benchmark tests. OceanBase Database starts from 2010, and it's built from scratch by Alibaba & Ant Group, it provides features of an enterprise-level database, including the native distributed architecture, financial-grade high availability, transparent horizontal scalability, real-time analytics, multi-tenancy, high cost-efficiency and compatibility with MySQL syntax. OceanBase has been serving all of the critical systems Read More
[¡Databases! 2022] Snowflake Iceberg Tables, Streaming Ingest, and Unistore! (Ashish Motivala)
Why settle for 1 cool db talk when you can get 3? Snowflake is pushing the boundaries of what a unified cloud data platform can do. Today we'll talk about how Snowflake can be combined with open standards like Apache Iceberg, hard tech to stream data into Snowflake and bring transactional and analytical workloads together in a single platform. Apache Iceberg is an open source project that provides a way to represent a table as files on the cloud. Iceberg Read More
[¡Databases! 2022] Umbra: A Disk-Based System with In-Memory Performance (Thomas Neumann)
The increases in main-memory sizes over the last decade have made pure in-memory database systems feasible, and in-memory systems offer unprecedented performance. However, DRAM is still relatively expensive, and the growth of main-memory sizes has slowed down. In contrast, the prices for SSDs have fallen substantially in the last years, and their read bandwidth has increased to gigabytes per second. This makes it attractive to combine a large in-memory buffer with fast SSDs as storage devices, combining the excellent performance Read More