Archived Events

Archived Events

Feb 22

2016

Feb 22 2016
[DB Seminar] Spring 2016: Round table discussion
Speaker:
DB group members

This week, we will have round table discussion. We will talk about on-going research, and paper submissions. Read More

Feb 15

2016

Feb 15 2016
[DB Seminar] Spring 2016: Wei Dai
Speaker:
Wei Dai

In this talk I will first give a brief overview of Petuum which encompasses a set of distributed machine learning principles as well as our open-sourced implementations. By discussing the the high level ideas and performance highlights, I hope to show that Big ML systems can benefit greatly from ML-rooted statistical and algorithmic insights. In the second part I will... Read More

Feb 8

2016

Feb 8 2016
[DB Seminar] Spring 2016: Jun Woo Park
Speaker:
Jun Woo Park

Traditional sketches, such as the Bloom filter, the CountMin sketch, and the Space-Saving sketch, estimate set membership, frequency counts, or moments of scalar random variables. In this paper, we extend these approaches to a new family of sketches that approximate moments of vectorial random variables that satisfy convex polytope constraints. One application is the Semidefinite sketch, a succinct way to... Read More

Feb 1

2016

Feb 1 2016
[DB Seminar] Spring 2016: Daniel Chino
Speaker:
Daniel Chino

Finding previously unknown patterns that frequently occur on time series is a core task of mining time series. These patterns are known as time series motifs and are essential to associate events and meaningful occurrences within the time series. In this work we propose a method based on a trie data structure, that allows a fast and accurate time series... Read More

Jan 11

2016

Jan 11 2016
[DB Seminar] Spring 2016: Alex Beutel
Speaker:
Alex Beutel

Read More

Dec 14

2015

Dec 14 2015
[DB Seminar] Fall 2015: Yifei Ma
Speaker:
Yifei Ma

Many modern information access problems involve highly complex patterns that cannot be handled by traditional keyword based search. Active Search is an emerging paradigm that helps users quickly find relevant information by efficiently collecting and learning from user feedback. We consider active search on graphs, where the nodes represent the set of instances users want to search over and the... Read More

Dec 7

2015

Dec 7 2015
[DB Seminar] Fall 2015: Zeyuan Shang
Speaker:
Zeyuan Shang

Finding real-world applications and workloads is the bane of every database researcher. To overcome this problem, we present the Carnegie Mellon Database Application Catalog (DBAC). The DBAC finds database applications from on-line source code repositories (e.g., GitHub) and then automatically installs them in a virtual machine sandbox. It then submits requests to the application that cause it to interact with... Read More

Nov 30

2015

Nov 30 2015
[DB Seminar] Fall 2015: Huanchen Zhang
Speaker:
Huanchen Zhang

Using indexes for query execution is crucial for achieving high performance in modern on-line transaction processing databases. For a main-memory database, however, these indexes consume a large fraction of the total memory available and are thus a major source of storage overhead of in-memory databases. To reduce this overhead, we propose using a two-stage index: The first stage ingests all... Read More

Nov 12

2015

Nov 12 2015
Oracle Berkeley DB, an Open Source Embedded Key/Value Database System (Lauren Foutz)
Speaker:
Lauren Foutz
System:
Berkeley DB
Video:
YouTube

In 1991, graduate students at the University of California, Berkeley created an improved database engine library for Unix they named Berkeley DB (BDB). When the up and coming web browser company Netscape requested that the authors extend and improve the library, Sleepycat Software was born to maintain BDB. In the following decades BDB has been deployed millions of times and... Read More