Quarantine Database Tech Talks — Spring 2020

It is a pandemic. Life is a mess. There is no end in sight. None of us knows how long it will be until it is our turn to catch the 'rona. Instead of living in fear and huffing bleach, you should focus on what really matters in life: databases.

Given this, the "Quarantine Database Tech Talks" is an on-line seminar series at Carnegie Mellon University with leading developers and researchers of database systems. Each speaker will present the implementation details of their respective systems and examples of the technical challenges that they faced when working with real-world customers.

All talks are on-line and open to the public via Zoom. You do not need to be a current CMU student to attend. Drifters are especially welcome.

Videos will be posted after each talk: https://www.youtube.com/playlist?list=PLSE8ODhjZXjagqlf1NxuBQwaMkrHXi-iz

  • Time: Mondays @ 5:00pm ET
  • Location: Zoom (Must Be Authenticated)
  • Organizers: Andy Pavlo
This seminar series is held in conjunction with the following groups at Carnegie Mellon:

Schedule

| Date | System | Speaker | Talk Title |
|---|---|---|---|
| Apr 20 | DuckDB | Mark Raasveldt | DuckDB – The SQLite for Analytics |
| Apr 27 | Anna | Chenggang Wu | Anna: a KVS for Any Scale |
| May 11 | ClickHouse | Robert Hodges | Introducing ClickHouse – the fastest data warehouse you’ve never heard of |
| May 18 | APOLLO | Jinho Jung | APOLLO: Automatic Detection and Diagnosis of Performance Regressions in Database Systems |
| Jun 1 | Materialize | Arjun Narayan | Building Materialize, a Streaming SQL Database powered by Timely Dataflow |
| Jun 8 | SQLancer | Manuel Rigger | Finding Logic Bugs in Database Management Systems |
| Jun 15 | Vitesse | CK Tan | Deepgreen DB: Greenplum at Speed |
| Jun 22 | Chaos Mesh | Siddon Tang | Testing Cloud-Native Databases with Chaos Mesh |
| Jul 6 | Dolt | Oscar Batori, Zach Musgrave | Another Relational Database, Why and How |
| Jul 13 | Cassandra | Jim McCollom, Jeff Carpenter | Astra: How we built a Cassandra-as-a-Service |
| Jul 20 | Rockset | Dhruba Borthakur | Rockset: Realtime Indexing for fast queries on massive semi-structured data |
| Jul 27 | Jepsen | Kyle Kingsbury | Black-box Isolation Checking with Elle |
| Aug 3 | YugabyteDB | Karthik Ranganathan | YugabyteDB: Bringing Together the Best of Amazon Aurora and Google Spanner |
| Aug 10 | Splice Machine | Daniel Gómez Ferro, Yi Xia | Splice Machine – An HTAP DB at Scale |
| Aug 17 | TerminusDB | Gavin Mendel-Gleason | TerminusDB: Building a Native Revision Control DB from Scratch |
| Aug 24 | ScyllaDB | Avi Kivity | ScyllaDB – No-Compromise Performance |
| Aug 31 | PlanetScale | Sugu Sougoumarane | PlanetScale: Query Planning for a Sharded System like Vitess |
| Sep 14 | CrocodileDB | Aaron Elmore | CrocodileDB: Resource Efficient Database Execution |
| Sep 21 | Snowflake | Jiaqi Yan | Query Optimization at Snowflake |
| Sep 28 | CockroachDB | Rebecca Taft | CockroachDB’s Query Optimizer |
| Oct 5 | Arrow | Wes McKinney | Apache Arrow Flight: Accelerating Columnar Dataset Transport |
| Oct 12 | Databricks | Cheng Lian, Maryann Xue | Databricks: A Deep Dive into Spark SQL’s Catalyst Optimizer |
| Oct 19 | FoundationDB | Markus Pilman | FoundationDB or: How I Learned to Stop Worrying and Trust the Database |
| Oct 26 | Datometry | Lyublena Antova | Datometry Hyper-Q: Virtualizing the World’s Enterprise Data Warehouses |
| Nov 2 | MySQL | Norvald H. Ryeng | Refactoring Query Processing in MySQL |
| Nov 9 | EraDB | Todd Persen | EraDB: Designing Systems for Cardinality and Dimensionality |
| Nov 16 | Fauna | Matt Freels | Fauna: Lessons Learned Building a Real World, Calvin-based System |
| Nov 23 | ksqlDB | Matthias J. Sax | ksqlDB: A Stream-Relational Database System |
| Nov 30 | SQL Server | Nico Bruno, Cesar Galindo-Legaria | The Cascades Framework for Query Optimization at Microsoft |
| Dec 14 | TiDB | Xiaoyu Ma | TiDB – On the Long Journey of HTAP |

Sponsors