- Aerospike
- Akamas
- AlloyDB
- ApertureDB
- Arrow
- Berkeley DB
- BlazingDB
- Brytlyt
- Chaos Mesh
- Citus
- CockroachDB
- Convex
- CrateDB
- Databricks
- Datometry
- dbt
- Delta Lake
- Dremio
- DSQL
- DVMS
- EraDB
- eXtremeDB
- Fauna
- Featureform
- Firebolt
- Fluss
- Gaia
- GlareDB
- GoogleSQL
- GreptimeDB
- Heron
- Iceberg
- InfluxDB
- kdb
- ksqlDB
- LeanStore
- LMDB
- MapD
- Materialize
- Milvus
- MonetDB
- Mooncake
- MySQL
- Neon
- Noria
- OceanBase
- Oracle
- OxQL
- Pinecone
- PlanetScale
- PostgresML
- PRQL
- QMDB
- QuestDB
- Redshift
- RisingWave
- Rockset
- rqlite
- Samza
- SingleStore
- SLOG
- Snowflake
- SpiceDB
- SplinterDB
- SQL Server
- SQLite
- Stardog
- Striim
- Swarm64
- Technical University of Munich
- TiDB
- TileDB
- Tokutek
- Umbra
- Vertica
- VoltDB
- Weaviate
- XTDB
- YugabyteDB
- AirFlow
- Alibaba
- Anna
- APOLLO
- Azure Cosmos DB
- BigQuery
- Bodo
- Cassandra
- Chroma
- ClickHouse
- Confluent
- CouchDB
- CrocodileDB
- DataFusion
- Datomic
- Debezium
- Dolt
- Druid
- DuckDB
- EdgeDB
- Exon
- FASTER
- FeatureBase
- Feldera
- Fluree
- FoundationDB
- Gel
- Google Spanner
- Greenplum
- HarperDB
- Hudi
- Impala
- Jepsen
- Kinetica
- LanceDB
- Litestream
- Malloy
- MariaDB
- MemSQL
- Modin
- MongoDB
- MotherDuck
- Napa
- NoisePage
- NuoDB
- OpenDAL
- OtterTune
- ParadeDB
- Pinot
- Polaris
- PostgreSQL
- Qdrant
- QuasarDB
- RavenDB
- RelationalAI
- RocksDB
- RonDB
- SalesForce
- ScyllaDB
- sled
- Smooth
- Spice.ai
- Splice Machine
- SQL Anywhere
- SQLancer
- SQream
- StarRocks
- Summingbird
- Synnada
- TerminusDB
- TigerBeetle
- TimescaleDB
- Trino
- Velox
- Vitesse
- Vortex
- WiredTiger
- Yellowbrick
May 17, 2021
[Vaccination 2021] Fast Materialized Views for Fast Websites (Malte Schwarzkopf)
- Speaker: Malte Schwarzkopf
- System: Noria
- Video: YouTube
Modern web applications require fast reads of query results over user data. In practice, they use complex, brittle, and tricky-to-manage caching layers to achieve this performance. In this talk, I will discuss how we built a new database system, Noria, from the ground up around the paradigm of materialized view maintenance via incremental streaming dataflow. Noria combines eager and...
May 10, 2021
[Vaccination 2021] The Design of InfluxDB IOx: An In-Memory Columnar Database Written in Rust with Apache Arrow (Paul Dix)
- Speaker: Paul Dix
- System: InfluxDB
- Video: YouTube
I'll talk about the design of InfluxDB IOx, the future core of InfluxDB, an open source time series database. It's an in-memory columnar database that uses object storage for persistence. It's written in Rust and is built on top of Apache Arrow. Unlike previous versions of InfluxDB, IOx supports standards-compliant SQL and the Postgres dialect in particular. This is...
May 3, 2021
[Vaccination 2021] Under the Hood of an Exadata Transaction – How Did We Harness the Power of Persistent Memory? (Jia Shi)
- Speaker: Jia Shi
- System: Oracle
- Video: YouTube
Persistent memory is a new silicon technology, adding a distinct storage tier of performance, capacity, and price between DRAM and flash. The persistent memory is physically present on the memory bus of the storage server, resulting in reads at memory speed, much faster than flash. Writes are persistent, surviving power cycles, unlike DRAM. Oracle has engineered Exadata Smart PMEM Cache...
Apr 26, 2021
[Vaccination 2021] Separation of Storage and Compute for Transactions and Analytics (Joyo Victor)
- Speaker: Joyo Victor
- System: SingleStore
- Video: YouTube
Separation of storage and compute, à la Snowflake or BigQuery, gives enormous benefits in terms of flexibility, scalability, and durability. This talk presents a detailed architecture differentiated on low-latency small writes. This talk is part of the Vaccination Database Tech Talk Seminar Series. Zoom Link: https://cmu.zoom.us/j/94112059546 (Password 809013)
Apr 19, 2021
[Vaccination 2021] Deterministic Database Management in Mission-Critical Applications (Andrei Gorine)
- Speaker: Andrei Gorine
- System: eXtremeDB
- Video: YouTube
Mission- and safety-critical systems software designs embody key characteristics for which temporal correctness is essential. Deterministic, predictable, and fully controllable software components that complement modern real-time operating system offerings are in demand. It is commonly believed by software developers that meeting timing requirements is a matter of sufficiently increasing system throughput. However, research and industry projects have often brought forward...
Apr 12, 2021
[Vaccination 2021] LeanStore: In-Memory Data Management Beyond Main Memory (Viktor Leis)
- Speaker: Viktor Leis
- System: LeanStore
- Video: YouTube
LeanStore is a high-performance OLTP storage engine optimized for many-core CPUs and NVMe SSDs. The goal of the project is to achieve performance comparable to in-memory systems when the data set fits into RAM, while being able to fully exploit the bandwidth of fast NVMe SSDs for large data sets. In this talk, I will present most of the important...
Apr 5, 2021
[Vaccination 2021] Query Processing in Google BigQuery (Hossein Ahmadi + Aleksandras Surna)
- Speakers: Hossein Ahmadi, Aleksandras Surna
- System: BigQuery
- Video: YouTube
Google BigQuery is a serverless, scalable, and cost-effective cloud data warehouse. In this talk, we give an overview of distributed query execution in BigQuery and present various query optimization techniques used. In particular, we will discuss the dynamic query execution primitives built into BigQuery. This talk is part of the Vaccination Database Tech Talk Seminar Series. Zoom Link: https://cmu.zoom.us/j/94112059546...
Mar 29, 2021
[Vaccination 2021] FASTER: Efficient State Management for the Modern Edge-Cloud (Badrish Chandramouli)
- Speaker: Badrish Chandramouli
- System: FASTER
- Video: YouTube
Managing state efficiently in modern applications written for the cloud and edge is hard. In the FASTER project, we have been creating building blocks such as FasterKV and FasterLog to alleviate this problem using techniques such as epoch protection, tiered storage, and asynchronous recoverability. In this talk, we describe these components and how we have been evolving the project over...
Mar 22, 2021
[Vaccination 2021] NoisePage: The Self-Driving Database Management System (Lin Ma)
- Speaker: Lin Ma
- System: NoisePage
- Video: YouTube
Database management systems (DBMSs) are an important part of modern data-driven applications. However, they are notoriously difficult to deploy and administer. There are existing methods that recommend physical design or knob configurations for DBMSs. But most of them require humans to make final decisions and decide when to apply changes. The goal of a self-driving DBMS is to remove the...
Mar 16, 2021
[PDL] Package Queries: Scalable Prescriptive Analytics Close to the Data (Matteo Brucato)
- Speaker: Matteo Brucato
Decision making is central to a broad range of domains, including finance, transportation, healthcare, the travel industry, robotics, and engineering. It is often found at the very final step of business analytics--prescriptive analytics--to allow businesses to transform a rich understanding of data, typically provided by advanced predictive models, into actionable decisions. Modeling and solving these problems have relied on application-specific...