Archived Events

Archived Events

Jan 28

2019

Jan 28 2019
[DB Seminar] Spring 2019 Reading Group: Dana Van Aken
Speaker:
Dana Van Aken

Dana will present the following paper in this meeting: Title: Automated Performance Management for the Big Data Stack Authors: Anastasios Arvanitis, Shivnath Babu, Eric Chu, Adrian Popescu, Alkis Simitsis, Kevin Wilkinson Read More

Dec 10

2018

Dec 10 2018
[DB Seminar] Fall 2018: Tianyu Li, Matt Butrovich, Sivaprasad Sudhir
Speakers:
Tianyu Li, Matt Butrovich, Sivaprasad Sudhir

Project 1: Storage Engine (Tianyu Li & Matt Butrovich) In this talk, we will discuss the work we've done on terrier's storage engine over the semester. We will cover the implementation of write-ahead logging and our proposed model for recovery, implementation of indexes, and our roadmap for the storage engine next semester. The immediate future direction for the storage work... Read More

Dec 3

2018

Dec 3 2018
[DB Seminar] Fall 2018: Ethan Zhang (VoltDB)
Speaker:
Ethan Zhang
System:
VoltDB

Following from the idea that "one size no longer fits for all", a family of "NewSQL" specialized databases arose. To handle OLTP, researchers at MIT and Brown (and a few other places) built H-Store, a distributed, shared-nothing, in-memory database that got rid of locking, latching, buffering, and logging, beating the performance of traditional OLTP RDBMSs by nearly two orders of... Read More

Nov 30

2018

Nov 30 2018
[DB Seminar] Fall 2018: Lin Ma
Speaker:
Lin Ma

n the last two decades, both researchers and vendors have built advisory tools to assist database administrators (DBAs) in various aspects of system tuning and physical design. Most of this previous work, however, is incomplete because they still require humans to make the final decisions about any changes to the database and are reactionary measures that fix problems after they... Read More

Nov 29

2018

Nov 29 2018
The Swarm64 Data Accelerator (S64 DA): Processing OLAP Workloads of Open-Source SQL-Databases with CPU+FPGA Cooperative Computing (Karsten Rönner)
Speaker:
Karsten Rönner
System:
Swarm64
Video:
YouTube

Online Analytical Processing (OLAP) of very large data sets and/or high-velocity data is a workload that strains all parts of a compute system: storage bandwidth, IO-subsystem throughput, main-memory bandwidth, instruction-level concurrency and thread-parallelism. Swarm64 seeks to improve the effective throughput and the compute efficiency of OLAP workloads by adding FPGAs as additional compute element to standard compute servers. The hard-... Read More

Nov 5

2018

Nov 5 2018
[DB Seminar] Fall 2018: Yihan Sun
Speaker:
Yihan Sun

Modern query-heavy applications of database systems especially require minimal delays to OLAP queries, as well as allowing the lasted OLTP updates to be visible in time. A popular mechanism for fast response to OLAP queries is to use snapshot isolation (SI) for multi-version concurrency control (MVCC), as it allows readers to make progress regardless of concurrent writers. Many other optimizations... Read More

Nov 1

2018

Nov 1 2018
Using GPU Databases to Build the Next Generation of Artificial Intelligence (Richard Heyns)
Speaker:
Richard Heyns
System:
Brytlyt
Video:
YouTube

In this talk, we will cover how the implementation of GPU database management systems are different than CPU database systems and provide evidence that shows how much of the performance gains with these systems are achieved via just GPUs. We will also discuss how we are solving the problems of tomorrow – making AI smarter, faster and more intuitive with... Read More

Oct 25

2018

Oct 25 2018
The Design & Implementation of BlazingDB: An Open-Source GPU-Accelerated Database Management System (Felipe Aramburu)
Speaker:
Felipe Aramburu
System:
BlazingDB
Video:
YouTube

BlazingDB has spent the past six months working on an open-source project (libgdf) alongside Anaconda and Nvidia. Libgdf is a library of computational primitives on top of a memory layout which is similar to Apache Arrow but optimized for GPUs. We have created a distributed, GPU-accelerated ETL pipeline that takes a user from reading data in Parquet, to performing SQL... Read More

Oct 22

2018

Oct 22 2018
[DB Seminar] Fall 2018: Lin Ma
Speaker:
Lin Ma

In this talk, I will present the progress on the self-driving database project. This is a 10-min practice talk for the PDL retreat. Read More

Oct 15

2018

Oct 15 2018
[DB Seminar] Fall 2018: System Design Planning II

We will discuss the ongoing work on porting the LLVM engine to the new system. Read More