[DB Seminar] Fall 2016: Kijung Shin

Event Date: Monday December 5, 2016
Event Time: 04:45pm EDT
Location: GHC 8102
Speaker: Kijung Shin

Title: CoreScope: Graph Mining Using K-Core Analysis

How do the k-core structures of real-world graphs look like? What are the common patterns and the anomalies?  How can we use them for algorithm design and applications? A k-core is the maximal subgraph where all vertices have degree at least k. This concept has been applied to such diverse areas as hierarchical structure analysis, graph visualization, and graph clustering. In this talk, we explore pervasive patterns that are related to k-cores and emerging in graphs from several diverse domains.

Our discoveries are as follows: (1) Mirror Pattern: coreness of vertices (i.e., maximum k such that each vertex belongs to the k-core) is strongly correlated to their degree. (2) Core-Triangle Pattern: degeneracy of a graph (i.e., maximum k such that the k-core exists in the graph) obeys a 3-to-1 power law with respect to the count of triangles. (3) Structured Core Pattern: degeneracy-cores are not cliques but have non-trivial structures such as core-periphery and communities.

Our algorithmic contributions show the usefulness of these patterns. (1) Core-A, which measures the deviation from Mirror Pattern, successfully finds anomalies in real-world graphs complementing densest-subgraph based anomaly detection methods. (2) Core-D, a single-pass streaming algorithm based on Core-Triangle Pattern, accurately estimates the degeneracy of billion-scale graphs up to 7 faster than a recent multi-pass algorithm.(3) Core-S, inspired by Structured Core Pattern, identifies influential spreaders up to 17 faster than top competitors with comparable accuracy.