Archived Events

Archived Events

Nov 10

2014

Nov 10 2014
DB Seminar [Fall 2014]: Jianquan Liu
Speaker:
Jianquan Liu

In this talk, Dr. Liu will briefly introduce the related research topics that are currently conducted at the Central Research Laboratories of NEC Corporation, such as big data processing. He will then focus on the introduction to a commercial level demo system for surveillance video search, named Wally, which will be exhibited at the ACM Multimedia 2014. Wally is a... Read More

Nov 3

2014

Nov 3 2014
DB Seminar [Fall 2014]: Nobu Furukawa
Speaker:
Nobu Furukawa

Abstract: Improving student productivity in online learning depends on designing learning environments based on principles derived from learning science research into how people learn.  Students master a skill by solving the sequence of practice exercises related to the skill.  The initial development of a skills model on a course, defining skills and associate them with exercises, heavily relies on human... Read More

Oct 30

2014

Oct 30 2014
The Future of Databases is Not a Database (Ori Herrnstadt) CANCELLED
Speaker:
Ori Herrnstadt
System:
FoundationDB

We all get excited about the next technical capability. In-memory - cool; scalable - even cooler; vector based execution, real-time code generation, etc etc. But do these really tackle the most important problems that will lead the next generation of databases? In this presentation Ori will present FoundationDB - a fault-tolerant, scalable and transactional K/V store, and the languages he... Read More

Oct 27

2014

Oct 27 2014
DB Seminar [Fall 2014]: Yuto Yamaguchi
Speaker:
Yuto Yamaguchi

The location pro les of social media users are valuable for various applications, such as marketing and real-world anal- ysis. As most users do not disclose their home locations, the problem of inferring home locations has been well stud- ied in recent years. In fact, most existing methods perform batch inference using static (i.e., pre-stored) social media contents. However, social... Read More

Oct 24

2014

Oct 24 2014
Eliminating Unscalable Communication in Transaction Processing, Toward Bionic Databases (Ippokratis Pandis)
Speaker:
Ippokratis Pandis
System:
Impala

On-line transaction processing (OLTP) is one of the two most important enterprise data management applications. Transaction processing workloads typically exhibit high concurrency and provide ample opportunities for parallel execution by multicore hardware. Unfortunately, due to the characteristics of the application, transaction processing systems must moderate and coordinate communication between independent agents. As a result, transaction processing systems cannot always convert... Read More

Oct 22

2014

Oct 22 2014
Pitt/CMU DB Meetup – Spyros Blanas (Ohio State)
Speaker:
Spyros Blanas

Web data are commonly processed using thousands of CPU cores, and large-scale scientific simulations are quickly approaching the one million CPU core mark. At this scale, the barrier to efficient data analysis is commonly the limited bandwidth to the disk. The growing main memory capacities allow data to be intelligently reduced, analyzed and transformed in situ, before being written to... Read More

Oct 22

2014

Oct 22 2014
Impala: A Modern, Open-Source SQL Engine for Hadoop (Ippokratis Pandis)
Speaker:
Ippokratis Pandis
System:
Impala

The Cloudera Impala project is pioneering the next generation of Hadoop capabilities: the convergence of fast SQL queries with the capacity, scalability, and flexibility of a Hadoop cluster. With Impala, the academic and Hadoop communities now have an open-sourced codebase that helps query data stored in HDFS and Apache HBase in real time, using familiar SQL syntax. In contrast with... Read More

Oct 20

2014

Oct 20 2014
DB Seminar [Fall 2014]: Neil Shah
Speaker:
Neil Shah

Abstract: How can we detect suspicious users in large online networks? Online popularity of a user or product (via follows, page-likes, etc.) can be monetized on the premise of higher ad click-through rates or increased sales. Web services and social networks which incentivize popularity thus suffer from a major problem of fake connections from link fraudsters looking to make a... Read More

Oct 17

2014

Oct 17 2014
Jimeng Sun (Georgia Institute of Technology)
Speaker:
Dr. Jimeng Sun

Predictive modeling plays an important role in biomedical research. Thanks to the explosion of Electronic Heart Records (EHR), the interest of building predictive models using EHR data has skyrocketed in recent years. However, the methodologies for develop a predictive model are still labor intensive and ad-hoc. Such rudimentary approaches have hindered the quality and throughput of healthcare and biomedical research.... Read More

Oct 16

2014

Oct 16 2014
State-of-the-Art Database Index Maintenance (Bradley C. Kuszmaul)
Speaker:
Bradley C. Kuszmaul
System:
Tokutek
Video:
YouTube

This talk will discuss how B-trees, Log-Structured Merge Trees and Streaming B-trees operate, and what is their asymptotic performance. Part of the "Seven Databases in Seven Weeks" Seminar Series: http://db.cs.cmu.edu/seminar2014 Read More