Archived Events

Archived Events

Oct 16

2017

Oct 16 2017
[DB Seminar] Fall 2017: Angela Jiang
Speaker:
Angela Jiang

Mainstream adaptively merges the video stream processing of concurrent applications sharing fixed edge resources to maximize aggregate result quality. Mainstream’s approach enables partial-DNN compute sharing among applications using DNNs (deep neural networks) that are fine-tuned from the same base model, decreasing aggregate per-frame compute time. Moreover, since the choice depends on the mix of applications running on an edge node,... Read More

Oct 12

2017

Oct 12 2017
Smooth Storage : A Distributed Storage System for Managing Structured Time-series Data at Two Sigma (Saurabh Goel)
Speaker:
Saurabh Goel
System:
Smooth
Video:
YouTube

Smooth is a distributed storage system for managing structured time series data at Two Sigma. Smooth's design emphasizes scale, both in terms of size and aggregate request bandwidth, reliability and storage efficiency. It is optimized for large parallel streaming read/write accesses over provided time ranges. Smooth has a clear separation between the metadata and data layers, and supports multiple pluggable... Read More

Oct 9

2017

Oct 9 2017
[DB Seminar] Fall 2017: CMU-DB Research Projects Overview
Speaker:
Andy Pavlo

Andy will regale the team with a discussion of the various research projects that are currently ongoing this semester. He will then muse about various papers that he wants to write within the next year followed by a group discussion. Read More

Sep 25

2017

Sep 25 2017
[DB Seminar] Fall 2017: Ben Darnell (CockroachDB)
Speaker:
Ben Darnell
System:
CockroachDB

Distributed consensus algorithms like Paxos and Raft provide an important building block for distributed systems, but there's a lot more that goes into a resilient and scalable distributed database. CockroachDB's key-value layer is built on many independent and overlapping Raft consensus groups. In this talk I'll explain why we built it this way, and some of the expected and unexpected... Read More

Sep 21

2017

Sep 21 2017
Autopiloting #realtime Stream Processing in Heron (Karthik Ramasamy)
Speaker:
Karthik Ramasamy
System:
Heron
Video:
YouTube

Several enterprises have been producing data not only at high volume but also at high velocity. Many daily business operations depend on real-time insights, therefore real-time processing of the data is gaining significance. Hence there is a need for a scalable infrastructure that can continuously process billions of events per day the instant the data is acquired. To achieve real... Read More

Sep 18

2017

Sep 18 2017
[DB Seminar] Fall 2017: Nick Katsipoulakis
Speaker:
Nick Katsipoulakis

Stream processing has become the dominant processing model for monitoring and real-time analytics. Modern Parallel Stream Processing Engines (pSPEs) have made it feasible to increase the performance in both monitoring and analytical queries by parallelizing a query’s execution and distributing the load on multiple workers. A determining factor for the performance of a pSPE is the partitioning algorithm used to... Read More

Sep 14

2017

Sep 14 2017
InfluxDB Storage Engine Internals (Paul Dix)
Speaker:
Paul Dix
System:
InfluxDB
Video:
YouTube

InfluxDB is an open source time series database written in Go. This talk will introduce how InfluxDB structures time series data and what makes it different from other use cases like OLTP. We'll then go into the internals of the storage engine we wrote from scratch, the Time Structured Merge Tree, heavily inspired by LSM trees. In addition to the... Read More

Sep 11

2017

Sep 11 2017
[DB Seminar] Fall 2017: Joy Arulraj
Speaker:
Joy Arulraj

For the first time in 25 years, a new non-volatile memory (NVM) category is being created that is expected to be 1000 times faster than current durable storage devices. The advent of NVM will fundamentally change the dichotomy between memory and durable storage in database systems (DBMSs). These new NVM devices are almost as fast as DRAM, but all writes... Read More

May 22

2017

May 22 2017
[DB Seminar] Spring 2017: Yingjun Wu
Speaker:
Yingjun Wu

The emergence of large main memories and massively parallel processors has triggered the development of multi-core main-memory database management systems (DBMSs). Although the reduction of disk accesses results in low single-thread transaction execution time, scaling these systems on multi-core machines remains notoriously difficult. In particular, the concurrent processing of a large number of transactions can bring about significant performance bottlenecks.... Read More

May 15

2017

May 15 2017
[DB Seminar] Spring 2017: Priya Govindan
Speaker:
Priya Govindan

The structure of real-world complex networks has long been an area of interest, and one common way to describe the structure of a network has been with the k-core decomposition. The core number of a node can be thought of as a measure of its centrality and importance, and is used by applications such as community detection, understanding viral spreads,... Read More