- Aerospike
- Alibaba
- Anna
- APOLLO
- Azure Cosmos DB
- BigQuery
- Bodo
- Cassandra
- Chroma
- ClickHouse
- Confluent
- CouchDB
- CrocodileDB
- DataFusion
- Datomic
- Debezium
- Dremio
- DuckDB
- EdgeDB
- Exon
- FASTER
- FeatureBase
- Firebolt
- FoundationDB
- Gel
- Google Spanner
- Greenplum
- HarperDB
- Impala
- Jepsen
- Kinetica
- LanceDB
- Litestream
- Malloy
- MariaDB
- MemSQL
- Modin
- MongoDB
- Napa
- NoisePage
- NuoDB
- OpenDAL
- OtterTune
- ParadeDB
- Pinot
- PostgresML
- PRQL
- QMDB
- QuestDB
- Redshift
- RisingWave
- Rockset
- rqlite
- Samza
- SingleStore
- SLOG
- Snowflake
- SpiceDB
- SplinterDB
- SQL Server
- SQLite
- Stardog
- Striim
- Swarm64
- Technical University of Munich
- TiDB
- TileDB
- Tokutek
- Umbra
- Vertica
- VoltDB
- WiredTiger
- YugabyteDB
- Akamas
- AlloyDB
- ApertureDB
- Arrow
- Berkeley DB
- BlazingDB
- Brytlyt
- Chaos Mesh
- Citus
- CockroachDB
- Convex
- CrateDB
- Databricks
- Datometry
- dbt
- Dolt
- Druid
- DVMS
- EraDB
- eXtremeDB
- Fauna
- Featureform
- Fluree
- Gaia
- GlareDB
- GoogleSQL
- GreptimeDB
- Heron
- InfluxDB
- kdb
- ksqlDB
- LeanStore
- LMDB
- MapD
- Materialize
- Milvus
- MonetDB
- MySQL
- Neon
- Noria
- OceanBase
- Oracle
- OxQL
- Pinecone
- PlanetScale
- PostgreSQL
- Qdrant
- QuasarDB
- RavenDB
- RelationalAI
- RocksDB
- RonDB
- SalesForce
- ScyllaDB
- sled
- Smooth
- Spice.ai
- Splice Machine
- SQL Anywhere
- SQLancer
- SQream
- StarRocks
- Summingbird
- Synnada
- TerminusDB
- TigerBeetle
- TimescaleDB
- Trino
- Velox
- Vitesse
- Weaviate
- Yellowbrick
- Aerospike
- AlloyDB
- APOLLO
- Berkeley DB
- Bodo
- Chaos Mesh
- ClickHouse
- Convex
- CrocodileDB
- Datometry
- Debezium
- Druid
- EdgeDB
- eXtremeDB
- FeatureBase
- Fluree
- Gel
- GoogleSQL
- HarperDB
- InfluxDB
- Kinetica
- LeanStore
- Malloy
- Materialize
- Modin
- MySQL
- NoisePage
- OceanBase
- OtterTune
- Pinecone
- PostgresML
- Qdrant
- QuestDB
- RelationalAI
- Rockset
- SalesForce
- SingleStore
- Smooth
- SpiceDB
- SQL Anywhere
- SQLite
- StarRocks
- Swarm64
- TerminusDB
- TileDB
- Trino
- Vertica
- Weaviate
- YugabyteDB
- Akamas
- Anna
- Arrow
- BigQuery
- Brytlyt
- Chroma
- CockroachDB
- CouchDB
- Databricks
- Datomic
- Dolt
- DuckDB
- EraDB
- FASTER
- Featureform
- FoundationDB
- GlareDB
- Greenplum
- Heron
- Jepsen
- ksqlDB
- Litestream
- MapD
- MemSQL
- MonetDB
- Napa
- Noria
- OpenDAL
- OxQL
- Pinot
- PostgreSQL
- QMDB
- RavenDB
- RisingWave
- RonDB
- Samza
- sled
- Snowflake
- Splice Machine
- SQL Server
- SQream
- Striim
- Synnada
- TiDB
- TimescaleDB
- Umbra
- Vitesse
- WiredTiger
- Alibaba
- ApertureDB
- Azure Cosmos DB
- BlazingDB
- Cassandra
- Citus
- Confluent
- CrateDB
- DataFusion
- dbt
- Dremio
- DVMS
- Exon
- Fauna
- Firebolt
- Gaia
- Google Spanner
- GreptimeDB
- Impala
- kdb
- LanceDB
- LMDB
- MariaDB
- Milvus
- MongoDB
- Neon
- NuoDB
- Oracle
- ParadeDB
- PlanetScale
- PRQL
- QuasarDB
- Redshift
- RocksDB
- rqlite
- ScyllaDB
- SLOG
- Spice.ai
- SplinterDB
- SQLancer
- Stardog
- Summingbird
- Technical University of Munich
- TigerBeetle
- Tokutek
- Velox
- VoltDB
- Yellowbrick
- Aerospike
- Anna
- Azure Cosmos DB
- Bodo
- Chroma
- Confluent
- CrocodileDB
- Datomic
- Dremio
- EdgeDB
- FASTER
- Firebolt
- Gel
- Greenplum
- Impala
- Kinetica
- Litestream
- MariaDB
- Modin
- Napa
- NuoDB
- OtterTune
- Pinot
- PRQL
- QuestDB
- RisingWave
- rqlite
- SingleStore
- Snowflake
- SplinterDB
- SQLite
- Striim
- Technical University of Munich
- TileDB
- Umbra
- VoltDB
- YugabyteDB
- Akamas
- ApertureDB
- Berkeley DB
- Brytlyt
- Citus
- Convex
- Databricks
- dbt
- Druid
- EraDB
- Fauna
- Fluree
- GlareDB
- GreptimeDB
- InfluxDB
- ksqlDB
- LMDB
- Materialize
- MonetDB
- Neon
- OceanBase
- OxQL
- PlanetScale
- Qdrant
- RavenDB
- RocksDB
- SalesForce
- sled
- Spice.ai
- SQL Anywhere
- SQream
- Summingbird
- TerminusDB
- TimescaleDB
- Velox
- Weaviate
- Alibaba
- APOLLO
- BigQuery
- Cassandra
- ClickHouse
- CouchDB
- DataFusion
- Debezium
- DuckDB
- Exon
- FeatureBase
- FoundationDB
- Google Spanner
- HarperDB
- Jepsen
- LanceDB
- Malloy
- MemSQL
- MongoDB
- NoisePage
- OpenDAL
- ParadeDB
- PostgresML
- QMDB
- Redshift
- Rockset
- Samza
- SLOG
- SpiceDB
- SQL Server
- Stardog
- Swarm64
- TiDB
- Tokutek
- Vertica
- WiredTiger
- AlloyDB
- Arrow
- BlazingDB
- Chaos Mesh
- CockroachDB
- CrateDB
- Datometry
- Dolt
- DVMS
- eXtremeDB
- Featureform
- Gaia
- GoogleSQL
- Heron
- kdb
- LeanStore
- MapD
- Milvus
- MySQL
- Noria
- Oracle
- Pinecone
- PostgreSQL
- QuasarDB
- RelationalAI
- RonDB
- ScyllaDB
- Smooth
- Splice Machine
- SQLancer
- StarRocks
- Synnada
- TigerBeetle
- Trino
- Vitesse
- Yellowbrick
Apr 19
2021
[Vaccination 2021] Deterministic Database Management in Mission-Critical Applications (Andrei Gorine)
- Speaker:
- Andrei Gorine
- System:
- eXtremeDB
- Video:
- YouTube
Mission- and safety-critical systems software designs embody key characteristics for which temporal correctness is essential. Deterministic, predictable, and fully controllable software components that complement modern real-time operating systems offerings are in demand. It is commonly believed by software developers that meeting timing requirements is a matter of sufficiently increasing system throughput. However, research, and industry projects have often brought forward... Read More
Apr 12
2021
[Vaccination 2021] LeanStore: In-Memory Data Management Beyond Main Memory (Viktor Leis)
- Speaker:
- Viktor Leis
- System:
- LeanStore
- Video:
- YouTube
LeanStore is a high-performance OLTP storage engine optimized for many-core CPUs and NVMe SSDs. The goal of the project is to achieve performance comparable to in-memory systems when the data set fits into RAM, while being able to fully exploit the bandwidth of fast NVMe SSDs for large data sets. In this talk, I will present most of the important... Read More
Apr 5
2021
[Vaccination 2021] Query Processing in Google BigQuery (Hossein Ahmadi + Aleksandras Surna)
- Speakers:
- Hossein Ahmadi , Aleksandras Surna
- System:
- BigQuery
- Video:
- YouTube
Google BigQuery is a serverless, scalable, and cost effective cloud data warehouse. In this talk, we give an overview of distributed query execution in BigQuery and present various query optimization techniques used. In particular, we will discuss the dynamic query execution primitives built into BigQuery. This talk is part of the Vaccination Database Tech Talk Seminar Series. Zoom Link: https://cmu.zoom.us/j/94112059546... Read More
Mar 29
2021
[Vaccination 2021] FASTER: Efficient State Management for the Modern Edge-Cloud (Badrish Chandramouli)
- Speaker:
- Badrish Chandramouli
- System:
- FASTER
- Video:
- YouTube
Managing state efficiently in modern applications written for the cloud and edge is hard. In the FASTER project, we have been creating building blocks such as FasterKV and FasterLog to alleviate this problem using techniques such as epoch protection, tiered storage, and asynchronous recoverability. In this talk, we describe these components and how we have been evolving the project over... Read More
Mar 22
2021
[Vaccination 2021] NoisePage: The Self-Driving Database Management System (Lin Ma)
- Speaker:
- Lin Ma
- System:
- NoisePage
- Video:
- YouTube
Database management systems (DBMSs) are an important part of modern data-driven applications. However, they are notoriously difficult to deploy and administer. There are existing methods that recommend physical design or knob configurations for DBMSs. But most of them require humans to make final decisions and decide when to apply changes. The goal of a self-driving DBMS is to remove the... Read More
Mar 16
2021
[PDL] Package Queries: Scalable Prescriptive Analytics Close to the Data (Matteo Brucato)
- Speaker:
- Matteo Brucato
Decision making is central to a broad range of domains, including finance, transportation, healthcare, the travel industry, robotics, and engineering. It is often found at the very final step of business analytics--prescriptive analytics--to allow businesses to transform a rich understanding of data, typically provided by advanced predictive models, into actionable decisions. Modeling and solving these problems have relied on application-specific... Read More
Mar 15
2021
[Vaccination 2021] HarperDB’s Data Storage Journey: From File System to LMDB (Kyle Bernhardy)
- Speaker:
- Kyle Bernhardy
- System:
- HarperDB
- Video:
- YouTube
HarperDB is a distributed database with hybrid SQL and NoSQL functionality in one, accessed via a REST API. Known as a structured object store with SQL capabilities, or NewSQL. HarperDB leverages a logical structure enabling ACID compliant efficient storage and retrieval without inconsistency, race conditions, or utilizing in-memory indexing. HarperDB is fully indexed and runs on any device from edge... Read More
Mar 8
2021
[Vaccination 2021] Novel Design Choices in Apache CouchDB (Adam Kocoloski)
- Speaker:
- Adam Kocoloski
- System:
- CouchDB
- Video:
- YouTube
Apache CouchDB is a JSON document store with a native HTTP API, server-side JavaScript indexing, and active/active data replication across flexible configurations of server instances that are free to come and go as they please. Under the hood the DBMS is implemented largely in Erlang and features copy-on-write B-trees, hash histories for automatic revision tracking of individual records, and a... Read More
Mar 1
2021
[Vaccination 2021] Inside Apache Druid’s Storage and Query Engine (Gian Merlino)
- Speaker:
- Gian Merlino
- System:
- Druid
- Video:
- YouTube
Apache Druid is an open-source columnar database known for high performance at scale; its largest deployments comprise thousands of servers. But no matter the scale, high performance starts with good fundamentals. This talk will dive into those fundamentals by exploring the inner workings of a single data server. We'll cover how Apache Druid stores data, what kinds of compression it... Read More
Feb 22
2021
[Vaccination 2021] Citus: Distributed PostgreSQL as an Extension (Marco Slot)
- Speaker:
- Marco Slot
- System:
- Citus
- Video:
- YouTube
One of the defining characteristics of PostgreSQL is its extensibility, which enables developers to add new database functionality without forking from the original project. Citus is an open source PostgreSQL extension that transforms PostgreSQL into a distributed database. The goal of Citus is to make the versatile set of data processing capabilities in PostgreSQL available at any scale. Citus can... Read More