Applications of PageRank to Function Comparison and Malware Classification

10/11/2018
by   Michael A. Slawinski, et al.
0

We classify .NET files as either benign or malicious by examining certain directed graphs extracted from the files via decompilation. Each graph is viewed probabilistically as a Markov chain where each node heuristically represents the possible state of the running file, and by computing the PageRank vector (Perron vector with transport) we can assign a probability measure over the nodes of the given graph. We train a random forest with features derived from computing Lebesgue antiderivatives of functions defined over the vertex sets of the graphs listed above against the PageRank measure. The model was trained on 2.5 million samples of .NET and has an accuracy of 98.3% on test data. The median time needed for decompilation and scoring was 24ms.

READ FULL TEXT
research
11/03/2017

Decentralised firewall for malware detection

This paper describes the design and development of a decentralized firew...
research
09/26/2018

Classification of malware based on file content and characteristics

In general, the industry of malware has come to be a market which brings...
research
06/09/2023

AVScan2Vec: Feature Learning on Antivirus Scan Data for Production-Scale Malware Corpora

When investigating a malicious file, searching for related files is a co...
research
02/25/2022

Kron Reduction and Effective Resistance of Directed Graphs

In network theory, the concept of the effective resistance is a distance...
research
12/14/2020

SOREL-20M: A Large Scale Benchmark Dataset for Malicious PE Detection

In this paper we describe the SOREL-20M (Sophos/ReversingLabs-20 Million...
research
06/25/2020

A metric on directed graphs and Markov chains based on hitting probabilities

The shortest-path, commute time, and diffusion distances on undirected g...

Please sign up or login with your details

Forgot password? Click here to reset