[DB Seminar] Fall 2019 DB Group: Ankur Goyal
In this talk, we will discuss the challenges of supporting traditional database operations (ingestion, schema design, querying) to the domain of visual data, i.e. images, videos, PDFs, etc. Recent innovations in artificial intelligence have enabled computers to "see" the same structure in visual data as humans. As a result, new systems are emerging which enable users and applications to manage and query this information, as databases have successfully done for more traditional structured data. We will start by motivating... Read More
Fall 2019: Pat Helland (SalesForce)
This talk is a summary of my soon to be released column in ACM Queue titled "Write Amplification versus Read Perspiration". In this short discussion, we observe that there is a strong pattern in which writing data incurs and obligation to do more work to make it easy to read that data later. We frequently talk about write amplification to describe the extra work we do in many cases to ease reading. We propose the nomenclature read perspiration to describe... Read More
Fall 2019: Rohit Agrawal (SalesForce)
In this talk we discuss LSM compression for a KV store. In our KV store, we write to an underlying shared storage system that models data as named extents (up to 2GB) and variable-length fragments contained within the extent. Fragments are max of 1MB and are the atomic unit of read and write. Our KV store reads fragments into 64K buffers for scanning and random reads. Our compression has two facets: key-compression and fragment-compression. Key-compression is particularly effective because the... Read More
[DB Seminar] Fall 2019 DB Group: Alex Smola (Amazon)
In this talk I will give a sample of some of the research done at AWS. In particular I will talk about some recent results in Reinforcement Learning using a combined on-policy and off-policy approach to obtain rapidly converging and sample efficient algorithms. The key idea in this work is to use propensity scoring and effective sample size reweighting to obtain an optimization algorithm that converges rapidly and that takes advantage of a large replay buffer. Secondly, I will discuss... Read More
[DB Seminar] Summer 2019 DB Group: Andy Pavlo
The current research trend is on developing "learned" components to supplement and replace legacy components in database management systems (DBMSs). Such learned components use machine learning (ML) methods to identify non-trivial trends and correlations in the DBMS's runtime behavior. They then use this information to create execution strategies and data structures that are tailored to the application's access patterns. The hope is that learned components will enable new optimizations that are not possible today because the complexity of managing DBMSs... Read More
[DB Seminar] Summer 2019 DB Group: Chenyao Lou
Title: NEVER use mmap for your database Abstract: MMAP can be used as the buffer pool manager for DBMSs. But is it good to use mmap for DBMSs? Chenyao is going to share evaluations for mmap, pitfalls in mmap, and methods to make mmap safe in existing DBMSs. Read More
[DB Seminar] Summer 2019: Lucas Lersch (TU Dresden)
Non-volatile memory technologies (NVM) enable persistent media to be directly accessed by the CPU through its caches. The biggest challenge introduced by NVM is the little control the application has when persisting data. This stems from the fact that it is not possible to prevent data from being evicted from the CPU cache to NVM at arbitrary points in time, possibly leading to corruption. To deal with this problem, work so far treat NVM either like storage (by always writing... Read More
[DB Seminar] Summer 2019 DB Group: Amadou Ngom
Amadou will present this paper in this meeting: Title: SkinnerDB: Regret-Bounded Query Evaluation via Reinforcement Learning Authors: Immanuel Trummer, Junxiong Wang, Deepak Maram, Samuel Moseley, Saehan Jo, Joseph Antonakakis Read More
[DB Seminar] Summer 2019 DB Group: Vivian Huang
Vivian will present this paper in this meeting: Title: Plan Stitch: Harnessing the Best of Many Plans Authors: Bailu Ding, Sudipto Das, Wentao Wu, Surajit Chaudhuri, Vivek Narasayya Read More
[DB Seminar] Summer 2019 DB Group: Perf Tutorial
In this DB group meeting, we are going to watch this perf tutorial together: https://www.youtube.com/watch?v=nXaxk27zwlk Read More