News & Events
Amazon Redshift: re-innovating cloud analytics
In 2013, Amazon Web Services revolutionized the data warehousing industry by launching Amazon Redshift, the first fully-managed, petabyte-scale, enterprise-grade cloud data warehouse. Amazon Redshift made it simple and cost-effective to efficiently analyze large volumes of data using existing business intelligence tools. This cloud service was a significant leap from the traditional on-premise data warehousing solutions, which were expensive, not elastic, and required significant expertise to tune and operate. Customers embraced Amazon Redshift and it became the fastest growing service in Read More
The Rise of Data Streaming Platforms
Apache Kafka and Apache Flink are powering a new category of data infrastructure called data streaming platform (DSP). This provides an opportunity for each enterprise to take actions on what’s happening in its business in real time. I will first provide an overview of DSP. DSP has both similarities and differences to database systems. I will show how existing database technologies can be used in this new platform and some of the unique problems that DSP needs to solve. I Read More
Evolution of the Storage Engine for Spanner, an Exabyte-scale Database System
I'll describe the design of Spanner's new storage engine, Ressi, which replaced untyped sorted string tables (inherited from Bigtable) with a strongly typed SQL-native representation. Live migration of 6 exabytes of data and multiple billion-user products to the new engine posed unique challenges. Sound methodology from experimental computer science was the key to its success. The simplicity and power of declarative queries combined with strongly consistent transactional semantics has scaled to many thousands of machines running an aggregate of over Read More
Snowflake, and why the Cloud reshaped the analytics industry
Snowflake was the first data warehouse designed from scratch to take advantage of Cloud economics. We'll talk about what that means, why it was such a big deal, and how its design differs from the approaches taken by similar systems. Stay until the end for some bonus content on how Snowflake is bringing stream processing into the DBMS. Zoom link: https://cmu.zoom.us/my/jignesh Read More
AI Vector Search in the Oracle Database
AI Vector Search in Oracle Database is a new, transformative way to intelligently, efficiently, and accurately search business data by using AI techniques to search data by semantics, or meaning. With the inclusion of a new VECTOR data type, new approximate search indexes, and new SQL operators and extensions, enterprise companies can quickly and easily leverage AI Vector Search to build modern, generative AI applications in just a few lines of SQL. And with this simplicity comes power, as AI Read More
Engineering Your Own Path: From University to Universal Impact (Camille Fournier)
SCS Distinguished Alumni / Bruce Nelson Distinguished Lecture Read More
[DB Seminar] JSON Relational Duality: Converging the worlds of Objects, Documents, and Relational
The "Object-Relational Impedance Mismatch" has been a multi-decade problem for developers, and past solutions have all had various tradeoffs that have compromised efficiency or consistency. JSON Relational Duality is a breakthrough capability that combines the best aspects of the Document model and the Relational models without the drawbacks of either model. This session will provide an overview and deep dive into the inner workings of JSON Relational Duality. We will also discuss some of the benefits of being able to Read More
Industry Affiliates Program Visit 2024 – Day 2
The second day of Carnegie Mellon University's Database Industry Affiliate Program (IAP) Visit Day, held in the Gates-Hillman Center, shifts focus to the industry side, featuring a series of informative sessions presented by member companies. These sessions offer companies the opportunity to showcase their latest innovations, products, and challenges in the database space, while also highlighting potential career opportunities for students. Attendees, including faculty, students, and other participants, can engage directly with company representatives to learn about real-world applications of Read More
Industry Affiliates Program Visit 2024 – Day 1
The first day of Carnegie Mellon University's Database Industry Affiliate Program (IAP) Visit Day takes place in the Gates-Hillman Center and is focused on showcasing cutting-edge research in the field of databases. The day is filled with a series of research talks delivered by faculty and students from the university's database group. These presentations provide an in-depth look at the latest advancements in database technologies, methodologies, and applications. Attendees, including industry partners, gain valuable insights into innovative projects, ongoing research, Read More
Announcing CMU’s Database Industry Affiliates Program
Pittsburgh, PA – The Carnegie Mellon Database Group is pleased to announce the launch of its new Industry Affiliates Program (IAP), designed to create stronger ties between academia and the tech industry. Through this initiative, industry leaders will collaborate with the group to drive cutting-edge research, contribute to database innovation, and help shape the next generation of database engineers. Members of the IAP have exclusive access to unique student recruitment opportunities, early-stage research, and an annual workshop aimed at solving Read More