G-Finder: Approximate Attributed Subgraph Matching

201938 citationsJournal Article

Authors

Lihui Liu · University of Illinois Urbana-Champaign

Boxin Du · University of Illinois Urbana-Champaign

Jiejun Xu · HRL Laboratories (United States)

Hanghang Tong · University of Illinois Urbana-Champaign

Abstract

Subgraph matching is a core primitive across a number of disciplines, ranging from data mining, databases, information retrieval, computer vision to natural language processing. Despite decades of efforts, it is still highly challenging to balance between the matching accuracy and the computational efficiency, especially when the query graph and/or the data graph are large. In this paper, we propose an index-based algorithm (G-FINDER) to find the top-k approximate matching subgraphs. At the heart of the proposed algorithm are two techniques, including (1) a novel auxiliary data structure (LOOKUP-TABLE) in conjunction with a neighborhood expansion method to effectively and efficiently index candidate vertices, and (2) a dynamic filtering and refinement strategy to prune the false candidates at an early stage. The proposed G-FINDER bears some distinctive features, including (1) generality, being able to handle different types of inexact matching (e.g., missing nodes, missing edges, intermediate vertices) on node attributed and/or edge attributed graphs or multigraphs; (2) effectiveness, achieving up to 30% Fl-Score improvement over the best known competitor; and (3) efficiency, scaling near-linearly w.r.t. the size of the data graph as well as the query graph.

Topics & Keywords

Graph Theory and Algorithms Advanced Graph Neural Networks Network Packet Processing and Optimization

UN Sustainable Development Goals

Quality Education

Publication Details

DOI: 10.1109/bigdata47090.2019.9006525

Field-Weighted Citation Impact: 1.84