Database Group Meeting (April 21, 2014)

Event Date: Monday April 21, 2014
Event Time: 04:30pm
Location: GHC 8102
Speaker: Vagelis Papalexakis

Title: Turbo-SMT: Accelerating Coupled Sparse Matrix-Tensor Factorizations By 200x

How can we correlate the neural activity in the human brain, as it responds to typed words, with properties of those words (like 'edible' or 'fits in hand')? In short, we want to find latent variables that jointly explain both the brain activity and the behavioral responses. This is one of many settings of the Coupled Matrix-Tensor Factorization (CMTF) problem.
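
For readers new to the problem, CMTF is commonly posed as a joint least-squares fit. The following is only a sketch of that common formulation, assuming a rank-R CP (PARAFAC) model for the tensor and a factor matrix shared along the coupled mode. The symbols (X, Y, A, B, C, D, R and the dimension names) are notation assumed here for illustration and are not taken from the talk abstract.

```latex
% Assumed notation (illustrative, not from the abstract):
%   \mathcal{X} \in \mathbb{R}^{I \times J \times K} : tensor (e.g. nouns x voxels x subjects)
%   Y \in \mathbb{R}^{I \times M}                    : matrix (e.g. nouns x properties)
%   A \in \mathbb{R}^{I \times R}                    : factor shared along the coupled (nouns) mode
%   a_r, b_r, c_r                                    : r-th columns of A, B, C
\min_{A,\,B,\,C,\,D}\;
  \Bigl\| \mathcal{X} - \sum_{r=1}^{R} a_r \circ b_r \circ c_r \Bigr\|_F^2
  \;+\;
  \bigl\| Y - A D^{\top} \bigr\|_F^2
```

Because the same A appears in both terms, each of its R columns is a latent variable that must explain the tensor and the matrix at once, which is the "jointly explain" requirement described above.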

Can we accelerate any CMTF solver so that it runs within a few minutes instead of tens of hours to a day, while maintaining good accuracy? We introduce Turbo-SMT, a meta-method capable of doing exactly that: it speeds up any CMTF algorithm by up to 200x and yields up to a 65-fold increase in sparsity, with accuracy comparable to the baseline.

We apply Turbo-SMT to BrainQ, a dataset consisting of a (nouns, brain voxels, human subjects) tensor and a (nouns, properties) matrix, with coupling along the nouns dimension. Turbo-SMT is able to find meaningful latent variables, as well as to predict brain activity with competitive accuracy.

This is joint work with Tom Mitchell, Nikos Sidiropoulos, Christos Faloutsos, Partha Talukdar, and Brian Murphy. The paper will appear at SDM 2014.