Hidden Markov Models with Random Restarts vs Boosting for Malware Detection

07/17/2023
by   Aditya Raghavan, et al.
0

Effective and efficient malware detection is at the forefront of research into building secure digital systems. As with many other fields, malware detection research has seen a dramatic increase in the application of machine learning algorithms. One machine learning technique that has been used widely in the field of pattern matching in general-and malware detection in particular-is hidden Markov models (HMMs). HMM training is based on a hill climb, and hence we can often improve a model by training multiple times with different initial values. In this research, we compare boosted HMMs (using AdaBoost) to HMMs trained with multiple random restarts, in the context of malware detection. These techniques are applied to a variety of challenging malware datasets. We find that random restarts perform surprisingly well in comparison to boosting. Only in the most difficult "cold start" cases (where training data is severely limited) does boosting appear to offer sufficient improvement to justify its higher computational cost in the scoring phase.

READ FULL TEXT

page 8

page 9

page 11

page 13

page 14

research
07/04/2021

Machine Learning for Malware Evolution Detection

Malware evolves over time and antivirus must adapt to such evolution. He...
research
03/13/2022

A Comparison of Static, Dynamic, and Hybrid Analysis for Malware Detection

In this research, we compare malware detection techniques based on stati...
research
01/06/2019

Malware Detection Using Dynamic Birthmarks

In this paper, we explore the effectiveness of dynamic analysis techniqu...
research
06/27/2022

Multifamily Malware Models

When training a machine learning model, there is likely to be a tradeoff...
research
03/03/2021

Malware Classification with GMM-HMM Models

Discrete hidden Markov models (HMM) are often applied to malware detecti...
research
03/07/2021

On Ensemble Learning

In this paper, we consider ensemble classifiers, that is, machine learni...
research
10/25/2017

Malware Detection by Eating a Whole EXE

In this work we introduce malware detection from raw byte sequences as a...

Please sign up or login with your details

Forgot password? Click here to reset