Archived Events

Archived Events

Sep 17

2024

Sep 17 2024
Industry Affiliates Program Visit 2024 – Day 2

The second day of Carnegie Mellon University's Database Industry Affiliate Program (IAP) Visit Day, held in the Gates-Hillman Center, shifts focus to the industry side, featuring a series of informative sessions presented by member companies. These sessions offer companies the opportunity to showcase their latest innovations, products, and challenges in the database space, while also highlighting potential career opportunities for... Read More

Sep 16

2024

Sep 16 2024
Industry Affiliates Program Visit 2024 – Day 1

The first day of Carnegie Mellon University's Database Industry Affiliate Program (IAP) Visit Day takes place in the Gates-Hillman Center and is focused on showcasing cutting-edge research in the field of databases. The day is filled with a series of research talks delivered by faculty and students from the university's database group. These presentations provide an in-depth look at the... Read More

Sep 12

2024

Sep 12 2024
[Fall 2024] Advancing Database Performance and Capabilities at Snowflake
Speakers:
Dan Sotolongo, Bowei Chen
System:
Snowflake

This talk presents recent research and development at Snowflake aimed at pushing the boundaries of database performance and functionality. In the first section, we will introduce a series of optimizations designed to accelerate query execution within Snowflake’s platform. We will discuss the technical challenges associated with developing general-purpose optimizations and balancing performance improvements across a wide range of workloads. The... Read More

Sep 10

2024

Sep 10 2024
[Fall 2024] Databricks: Introduction to Mosaic AI Vector Search
Speaker:
Ankit Vij
System:
Databricks

This tech talk will deep dive into some of the most interesting challenges being solved at Databricks. Read More

Aug 21

2024

Aug 21 2024
LSM Management and Using LSM Immutability for Data Virtualization (Vaibhav Arora)
Speaker:
Vaibhav Arora

LSM (Log-Structured Merge) trees are now the bedrock of many storage engines and datastores like RocksDB, HBase, Cassandra etc. They provide the ability to avoid random-writes, and provide immutability. Data is organized in multiple-levels that are exponentially increasing in size. Each data mutation writes a new version of an object, and background processes named merge/compaction continuously remove the unused versions,... Read More

Jun 26

2024

Jun 26 2024
Leveraging Generative AI with Oracle AI Vector Search (Shasank Chavan)
Speaker:
Shasank Chavan
System:
Oracle

AI Vector Search in Oracle 23ai is a new, transformative way to intelligently search through your unstructured business data efficiently, and accurately, by using AI techniques to match on the semantics, or meaning, of the underlying data. With the inclusion of a new VECTOR datatype, new approximate search indexes, and new SQL operators and extensions, enterprise companies can quickly and... Read More

Apr 24

2024

Apr 24 2024
[Spring 2024] Beyond SQL: Dataframes in the Database (Devin Petersohn)
Speaker:
Devin Petersohn
System:
Modin

Dataframes are popular tools for interacting with and exploring data, but they are not as well understood nor as deeply studied as databases. Python's pandas. and Apache Spark are two of the most popular dataframes in use by data practitioners, but even these are extremely different from each other in terms of guarantees and user expectations. In this talk, we... Read More

Apr 17

2024

Apr 17 2024
[Spring 2024] Manufacturing AI Applications (Anthony Tomasic)
Speaker:
Anthony Tomasic

Developing AI applications is costly and difficult and recent trends have only intensified these challenges. Developers use a bottom-up approach, focusing on the nitty-gritty of integration and infrastructure, which leads to a complex "blob" of code. Changes to this blob are risky due to the intricate web of dependencies. Fort Alto has fundamentally rethought the application development process with a... Read More

Apr 5

2024

Apr 5 2024
PhD Defense: On Embedding Database Management System Logic in Operating Systems via Restricted Programming Environments (Matt Butrovich)
Speaker:
Matt Butrovich

The rise in computer storage and network performance means that disk I/O and network communication are often no longer bottlenecks in database management systems (DBMSs). Instead, the overheads associated with operating system (OS) services (e.g., system calls, thread scheduling, and data movement from kernel-space) limit query processing responsiveness. User-space applications can elide these overheads with a kernel-bypass design. However, extracting... Read More

Mar 14

2024

Mar 14 2024
[Spring 2024] Towards a Systematic Framework for Index Structure Design (Dong Xie)
Speaker:
Dong Xie

Index structures are at the database management systems' core to facilitate efficient data access. Due to the constant changes in application requirements and hardware trends, people are going through exhaustive and painstaking work designing/tailoring new index structures to catch up. In this talk, I will show a vision of a systematic index structure design framework that will allow index designers... Read More