- Aerospike
- Akamas
- AlloyDB
- ApertureDB
- Arrow
- Berkeley DB
- BlazingDB
- Brytlyt
- Chaos Mesh
- Citus
- CockroachDB
- Convex
- CrateDB
- Databricks
- Datometry
- dbt
- Delta Lake
- Dremio
- DSQL
- DVMS
- EraDB
- eXtremeDB
- Fauna
- Featureform
- Firebolt
- Fluss
- Gaia
- GlareDB
- GoogleSQL
- GreptimeDB
- Heron
- Iceberg
- InfluxDB
- kdb
- ksqlDB
- LeanStore
- LMDB
- MapD
- Materialize
- Milvus
- MonetDB
- Mooncake
- MySQL
- Neon
- Noria
- OceanBase
- Oracle
- OxQL
- Pinecone
- PlanetScale
- PostgresML
- PRQL
- QMDB
- QuestDB
- Redshift
- RisingWave
- Rockset
- rqlite
- Samza
- SingleStore
- SLOG
- Snowflake
- SpiceDB
- SplinterDB
- SQL Server
- SQLite
- Stardog
- Striim
- Swarm64
- Technical University of Munich
- TiDB
- TileDB
- Tokutek
- Umbra
- Vertica
- VoltDB
- Weaviate
- XTDB
- YugabyteDB
- AirFlow
- Alibaba
- Anna
- APOLLO
- Azure Cosmos DB
- BigQuery
- Bodo
- Cassandra
- Chroma
- ClickHouse
- Confluent
- CouchDB
- CrocodileDB
- DataFusion
- Datomic
- Debezium
- Dolt
- Druid
- DuckDB
- EdgeDB
- Exon
- FASTER
- FeatureBase
- Feldera
- Fluree
- FoundationDB
- Gel
- Google Spanner
- Greenplum
- HarperDB
- Hudi
- Impala
- Jepsen
- Kinetica
- LanceDB
- Litestream
- Malloy
- MariaDB
- MemSQL
- Modin
- MongoDB
- MotherDuck
- Napa
- NoisePage
- NuoDB
- OpenDAL
- OtterTune
- ParadeDB
- Pinot
- Polaris
- PostgreSQL
- Qdrant
- QuasarDB
- RavenDB
- RelationalAI
- RocksDB
- RonDB
- SalesForce
- ScyllaDB
- sled
- Smooth
- Spice.ai
- Splice Machine
- SQL Anywhere
- SQLancer
- SQream
- StarRocks
- Summingbird
- Synnada
- TerminusDB
- TigerBeetle
- TimescaleDB
- Trino
- Velox
- Vitesse
- Vortex
- WiredTiger
- Yellowbrick
- Aerospike
- Alibaba
- ApertureDB
- Azure Cosmos DB
- BlazingDB
- Cassandra
- Citus
- Confluent
- CrateDB
- DataFusion
- dbt
- Dolt
- DSQL
- EdgeDB
- eXtremeDB
- FeatureBase
- Firebolt
- FoundationDB
- GlareDB
- Greenplum
- Heron
- Impala
- kdb
- LanceDB
- LMDB
- MariaDB
- Milvus
- MongoDB
- MySQL
- NoisePage
- OceanBase
- OtterTune
- Pinecone
- Polaris
- PRQL
- QuasarDB
- Redshift
- RocksDB
- rqlite
- ScyllaDB
- SLOG
- Spice.ai
- SplinterDB
- SQLancer
- Stardog
- Summingbird
- Technical University of Munich
- TigerBeetle
- Tokutek
- Velox
- VoltDB
- WiredTiger
- YugabyteDB
- AirFlow
- AlloyDB
- APOLLO
- Berkeley DB
- Bodo
- Chaos Mesh
- ClickHouse
- Convex
- CrocodileDB
- Datometry
- Debezium
- Dremio
- DuckDB
- EraDB
- FASTER
- Featureform
- Fluree
- Gaia
- Google Spanner
- GreptimeDB
- Hudi
- InfluxDB
- Kinetica
- LeanStore
- Malloy
- Materialize
- Modin
- Mooncake
- Napa
- Noria
- OpenDAL
- OxQL
- Pinot
- PostgresML
- Qdrant
- QuestDB
- RelationalAI
- Rockset
- SalesForce
- SingleStore
- Smooth
- SpiceDB
- SQL Anywhere
- SQLite
- StarRocks
- Swarm64
- TerminusDB
- TileDB
- Trino
- Vertica
- Vortex
- XTDB
- Akamas
- Anna
- Arrow
- BigQuery
- Brytlyt
- Chroma
- CockroachDB
- CouchDB
- Databricks
- Datomic
- Delta Lake
- Druid
- DVMS
- Exon
- Fauna
- Feldera
- Fluss
- Gel
- GoogleSQL
- HarperDB
- Iceberg
- Jepsen
- ksqlDB
- Litestream
- MapD
- MemSQL
- MonetDB
- MotherDuck
- Neon
- NuoDB
- Oracle
- ParadeDB
- PlanetScale
- PostgreSQL
- QMDB
- RavenDB
- RisingWave
- RonDB
- Samza
- sled
- Snowflake
- Splice Machine
- SQL Server
- SQream
- Striim
- Synnada
- TiDB
- TimescaleDB
- Umbra
- Vitesse
- Weaviate
- Yellowbrick
- Aerospike
- AlloyDB
- Arrow
- BlazingDB
- Chaos Mesh
- CockroachDB
- CrateDB
- Datometry
- Delta Lake
- DSQL
- EraDB
- Fauna
- Firebolt
- Gaia
- GoogleSQL
- Heron
- InfluxDB
- ksqlDB
- LMDB
- Materialize
- MonetDB
- MySQL
- Noria
- Oracle
- Pinecone
- PostgresML
- QMDB
- Redshift
- Rockset
- Samza
- SLOG
- SpiceDB
- SQL Server
- Stardog
- Swarm64
- TiDB
- Tokutek
- Vertica
- Weaviate
- YugabyteDB
- AirFlow
- Anna
- Azure Cosmos DB
- Bodo
- Chroma
- Confluent
- CrocodileDB
- Datomic
- Dolt
- DuckDB
- Exon
- FeatureBase
- Fluree
- Gel
- Greenplum
- Hudi
- Jepsen
- LanceDB
- Malloy
- MemSQL
- MongoDB
- Napa
- NuoDB
- OtterTune
- Pinot
- PostgreSQL
- QuasarDB
- RelationalAI
- RonDB
- ScyllaDB
- Smooth
- Splice Machine
- SQLancer
- StarRocks
- Synnada
- TigerBeetle
- Trino
- Vitesse
- WiredTiger
- Akamas
- ApertureDB
- Berkeley DB
- Brytlyt
- Citus
- Convex
- Databricks
- dbt
- Dremio
- DVMS
- eXtremeDB
- Featureform
- Fluss
- GlareDB
- GreptimeDB
- Iceberg
- kdb
- LeanStore
- MapD
- Milvus
- Mooncake
- Neon
- OceanBase
- OxQL
- PlanetScale
- PRQL
- QuestDB
- RisingWave
- rqlite
- SingleStore
- Snowflake
- SplinterDB
- SQLite
- Striim
- Technical University of Munich
- TileDB
- Umbra
- VoltDB
- XTDB
- Alibaba
- APOLLO
- BigQuery
- Cassandra
- ClickHouse
- CouchDB
- DataFusion
- Debezium
- Druid
- EdgeDB
- FASTER
- Feldera
- FoundationDB
- Google Spanner
- HarperDB
- Impala
- Kinetica
- Litestream
- MariaDB
- Modin
- MotherDuck
- NoisePage
- OpenDAL
- ParadeDB
- Polaris
- Qdrant
- RavenDB
- RocksDB
- SalesForce
- sled
- Spice.ai
- SQL Anywhere
- SQream
- Summingbird
- TerminusDB
- TimescaleDB
- Velox
- Vortex
- Yellowbrick
Sep 26
2022
[¡Databases! 2022] Rockset: High Performance Queries with Dynamically Typed SQL (Ben Hannel)
- Speaker:
- Ben Hannel
- System:
- Rockset
- Video:
- YouTube
This talk is part of the ¡Databases! – A Database Seminar Series. Zoom Link: https://cmu.zoom.us/j/94466872009 (Passcode 424050) Read More
Sep 19
2022
[¡Databases! 2022] Snowflake Iceberg Tables, Streaming Ingest, and Unistore! (Ashish Motivala)
- Speakers:
- Nileema Shingte , Tyler Jones, Ashish Motivala
- System:
- Snowflake
- Video:
- YouTube
Why settle for 1 cool db talk when you can get 3? Snowflake is pushing the boundaries of what a unified cloud data platform can do. Today we'll talk about how Snowflake can be combined with open standards like Apache Iceberg, hard tech to stream data into Snowflake and bring transactional and analytical workloads together in a single platform. Apache... Read More
Sep 12
2022
[¡Databases! 2022] Umbra: A Disk-Based System with In-Memory Performance (Thomas Neumann)
- Speaker:
- Thomas Neumann
- System:
- Umbra
- Video:
- YouTube
The increases in main-memory sizes over the last decade have made pure in-memory database systems feasible, and in-memory systems offer unprecedented performance. However, DRAM is still relatively expensive, and the growth of main-memory sizes has slowed down. In contrast, the prices for SSDs have fallen substantially in the last years, and their read bandwidth has increased to gigabytes per second.... Read More
May 2
2022
[Vaccination 2022] IO in PostgreSQL: Past, Present, Future (Andres Freund)
- Speaker:
- Andres Freund
- System:
- PostgreSQL
- Video:
- YouTube
PostgreSQL traditionally has handled IO in a fairly minimal way, relying on the operating system more than most other databases. This talk will discuss why PostgreSQL mostly got away with that so far, why current hardware trends (NVMe with very high bandwidth / low latency, cloud storage with high latency but good random / concurrent read behaviour) require changing course... Read More
Apr 25
2022
[Vaccination 2022] RonDB: A Key-Value Store with SQL Capabilities and LATS Properties (Mikael Ronström)
- Speaker:
- Mikael Ronström
- System:
- RonDB
- Video:
- YouTube
RonDB is a key-value store with SQL capabilities and LATS (Latency/Availability/Throughput/ScalableStorage) properties. It is based on MySQL NDB Cluster that is used in extremely available applications such as universal data storage for mobile operators for many billions of subscribers. It is also used in gaming applications, financial applications and other areas. The main focus of RonDB in Hopsworks is as... Read More
Apr 18
2022
[Vaccination 2022] Velox: An Open-source Unified Execution Engine (Deepak Majeti)
- Speaker:
- Deepak Majeti
- System:
- Velox
- Video:
- YouTube
Data keeps getting bigger, processing keeps getting more and more complex but the hardware does not get faster. We need to reconsider efficiency from the ground up. While these data processing systems handle various workloads (e.g. “batch”, “analytical”, “streaming”, “AI/ML”), they employ common features such as functions, joins, filter-pushdown, sorting, grouping, projections, etc… A shared library that provides optimized implementations... Read More
Apr 11
2022
[Vaccination 2022] QuestDB: Fast Open Source Time Series Database (Vlad Ilyushchenko)
- Speaker:
- Vlad Ilyushchenko
- System:
- QuestDB
- Video:
- YouTube
In this talk, we will discuss major technical challenges developers face when dealing with time series data and QuestDB's design principles that are meant to solve these challenges. We will then go through QuestDB's performance focused architecture and cover topics like storage model, transactions, in-order and out-of-order ingestion, concurrency control, and network interfaces. This talk is part of the Vaccination... Read More
Apr 4
2022
[Vaccination 2022] Yellowbrick: An Elastic Data Warehouse on Kubernetes (Mark Cusack)
- Speaker:
- Mark Cusack
- System:
- Yellowbrick
- Video:
- YouTube
Yellowbrick is an elastic SQL data warehouse with a design centered on efficiency, high concurrency and performance. The database management system is composed from a set of Kubernetes-orchestrated containers. Kubernetes provides the single-source-of-truth for system configuration and state, and manages all warehousing lifecycle operations. In this session, I'll provide an overview of Yellowbrick and its microservices architecture, and focus on... Read More
Mar 28
2022
[Vaccination 2022] Design and Implementation of the RelationalAI Knowledge Graph Management System (Martin Bravenboer)
- Speaker:
- Martin Bravenboer
- System:
- RelationalAI
- Video:
- YouTube
RelationalAI is the next-generation database platform for new intelligent data applications based on relational knowledge graphs. The Relational Knowledge Graph Management System (KGMS) complements the modern data stack by allowing data applications to be implemented relationally and declaratively, leveraging knowledge/semantics for reasoning, graph analytics, relational machine learning, and mathematical optimization workloads. RelationalAI as a relational and cloud native system fits... Read More
Mar 24
2022
Hyperscale Data Processing with Network-centric Designs (Qizhen Zhang)
- Speaker:
- Qizhen Zhang
Today's largest data processing workloads are hosted in cloud data centers. Due to exponential data growth and the end of Moore's Law, these workloads have ballooned to the hyperscale level, encompassing billions to trillions of data items per query spread across hundreds to thousands of servers connected by the data center network. These massive scales fundamentally challenge the designs of... Read More