GraphSAC: Detecting anomalies in large-scale graphs

10/21/2019
by   Vassilis N. Ioannidis, et al.
28

A graph-based sampling and consensus (GraphSAC) approach is introduced to effectively detect anomalous nodes in large-scale graphs. Existing approaches rely on connectivity and attributes of all nodes to assign an anomaly score per node. However, nodal attributes and network links might be compromised by adversaries, rendering these holistic approaches vulnerable. Alleviating this limitation, GraphSAC randomly draws subsets of nodes, and relies on graph-aware criteria to judiciously filter out sets contaminated by anomalous nodes, before employing a semi-supervised learning (SSL) module to estimate nominal label distributions per node. These learned nominal distributions are minimally affected by the anomalous nodes, and hence can be directly adopted for anomaly detection. Rigorous analysis provides performance guarantees for GraphSAC, by bounding the required number of draws. The per-draw complexity grows linearly with the number of edges, which implies efficient SSL, while draws can be run in parallel, thereby ensuring scalability to large graphs. GraphSAC is tested under different anomaly generation models based on random walks, clustered anomalies, as well as contemporary adversarial attacks for graph data. Experiments with real-world graphs showcase the advantage of GraphSAC relative to state-of-the-art alternatives.

READ FULL TEXT
research
06/02/2023

GAD-NR: Graph Anomaly Detection via Neighborhood Reconstruction

Graph Anomaly Detection (GAD) is a technique used to identify abnormal n...
research
02/27/2020

Semi-supervised Anomaly Detection on Attributed Graphs

We propose a simple yet effective method for detecting anomalous instanc...
research
04/17/2021

Unveiling Anomalous Edges and Nominal Connectivity of Attributed Networks

Uncovering anomalies in attributed networks has recently gained populari...
research
10/07/2020

Anomaly Detection in Large Labeled Multi-Graph Databases

Within a large database G containing graphs with labeled nodes and direc...
research
04/27/2019

Exploring Information Centrality for Intrusion Detection in Large Networks

Modern networked systems are constantly under threat from systemic attac...
research
07/28/2023

BOURNE: Bootstrapped Self-supervised Learning Framework for Unified Graph Anomaly Detection

Graph anomaly detection (GAD) has gained increasing attention in recent ...
research
04/01/2014

A Kernel-Based Nonparametric Test for Anomaly Detection over Line Networks

The nonparametric problem of detecting existence of an anomalous interva...

Please sign up or login with your details

Forgot password? Click here to reset