Archived Events

Archived Events

May 9

2016

May 9 2016
[DB Seminar] Spring 2016: Lin Ma
Speaker:
Lin Ma

In-memory database management systems (DBMSs) outperform disk-oriented systems for on-line transaction processing (OLTP) workloads. But this improved performance is only achievable when the database is smaller than the amount of physical memory available in the system. To overcome this limitation, some in-memory DBMSs can move cold data out of volatile DRAM to secondary storage. Such data appears as if it... Read More

May 3

2016

May 3 2016
[PDL Visit Day 2016] Siying Dong (Facebook)
Speaker:
Siying Dong
System:
RocksDB

RocksDB is an embedded persistent key-value store for low-latency and high-throughput workload. It has been adapted to a wide range of workloads, including RocksDB as an embedded DBMS and as storage engines of other DBMS systems. Our benchmarks show RocksDB can achieve 126K random reads per second on flash and 7 million random reads per second on memory. RocksDB also... Read More

May 3

2016

May 3 2016
[PDL Visit Day 2016] Thomas Baby (Oracle)
Speaker:
Thomas Baby
System:
Oracle

The IT industry today is undergoing a revolutionary change in how customers deploy and configure their compute resources. Driven by the demand to reduce costs, both in capital and operation expense, these customers are turning to CLOUD or HYBRID-CLOUD solutions. These customers span the spectrum from very small startup businesses to Fortune 500 companies across regions and industries. Oracle Corporation... Read More

May 3

2016

May 3 2016
[PDL Visit Day 2016] Shasank Chavan (Oracle)
Speaker:
Shasank Chavan
System:
Oracle

The Database In-Memory (DBIM) Option by Oracle is an industry-first dual format in-memory database that maintains transactional consistent data in both row and columnar formats. This unique architecture enables analytic and OLTP workloads to coexist simultaneously, bringing together the best of both worlds. DBIM is the fastest growing database option since its release in 2014, achieving great success with customer... Read More

May 2

2016

May 2 2016
[DB Seminar] Spring 2016: Huanchen Zhang
Speaker:
Huanchen Zhang

Using indexes for query execution is crucial for achieving high performance in modern on-line transaction processing databases. For a main-memory database, however, these indexes consume a large fraction of the total memory available and are thus a major source of storage overhead of in-memory databases. To reduce this overhead, we propose using a two-stage index: The first stage ingests all... Read More

Apr 28

2016

Apr 28 2016
Murat Demirbas (University at Buffalo)
Speaker:
Murat Demirbas

Work on theory of distributed systems abstract away from the physical-clock time and use the notion of logical clocks for ordering events in asynchronous distributed systems. Practice of distributed systems, on the other hand, employ loosely synchronized clocks using NTP in a best-effort manner without any guarantees. Recently, we introduced a third option: hybrid clocks. Hybrid clocks combine the best... Read More

Apr 25

2016

Apr 25 2016
[DB Seminar] Spring 2016: Miguel Araujo
Speaker:
Miguel Araujo

Miguel will give a practice talk on his thesis proposal. Abstract: The identification of anomalies and communities of nodes in real-world graphs has applications in widespread domains, from the automatic categorization of wikipedia articles or websites to bank fraud detection. While recent and ongoing research is supplying tools for the analysis of simple unlabeled data, it is still a challenge to find patterns and anomalies in large labeled... Read More

Apr 18

2016

Apr 18 2016
[DB Seminar] Spring 2016: Dana Van Aken
Speaker:
Dana Van Aken

Database management system (DBMS) configuration tuning is an essential aspect of any data-intensive application effort. But this is historically a difficult task because DBMSs have hundreds of configuration "knobs" that control everything in the system, such as the amount of memory to use for caches and how often data is written to storage. The problem with these knobs is that... Read More

Apr 15

2016

Apr 15 2016
Monte Zweben (Splice Machine)
Speaker:
Monte Zweben
System:
Splice Machine

This talk describes the Splice Machine RDBMS designed to power today’s new class of modern applications that require high scalability and high-availability while simultaneously executing OLTP and OLAP workloads. Splice Machine is a full ANSI SQL database that is ACID compliant, supports secondary indexes, constraints, triggers, and stored procedures. It uses a unique, distributed snapshot isolation algorithm that preserves transactional... Read More

Apr 14

2016

Apr 14 2016
Yi Pan (Apache Samza @ LinkedIn)
Speaker:
Yi Pan
System:
Samza

This talk will provide an overview of LinkedIn's distributed stream processing platform, including Samza/Kafka/Databus. It will first cover the high level scenarios for stream processing in LinkedIn, followed by detailed requirements around scalability, re-processing, accuracy of results, and easy programmability; then we will focus on the requirements on stateful stream processing applications and explain how Samza's state management allows us... Read More