Precise Energy Consumption Measurements of Heterogeneous Artificial Intelligence Workloads

12/03/2022
by René Caspart, et al.

With the rise of AI in recent years and the increasing complexity of its models, the growing demand for computational resources is starting to pose a significant challenge. The need for higher compute power is being met with increasingly potent accelerators and the use of large compute clusters. However, the gain in prediction accuracy from large models trained on distributed and accelerated systems comes at the price of a substantial increase in energy demand, and researchers have started questioning the environmental friendliness of such AI methods at scale. Consequently, energy efficiency plays an important role for AI model developers and infrastructure operators alike. The energy consumption of AI workloads depends on both the model implementation and the utilized hardware. Therefore, accurate measurements of the power draw of AI workflows on different types of compute nodes are key to algorithmic improvements and the design of future compute clusters and hardware. To this end, we present measurements of the energy consumption of two typical applications of deep learning models on different types of compute nodes. Our results indicate that 1. deriving energy consumption directly from runtime is not accurate; instead, the composition of the compute node needs to be taken into account; 2. neglecting accelerator hardware on mixed nodes leads to disproportionately high energy consumption; 3. the energy consumption of model training and inference should be considered separately: while training on GPUs outperforms all other node types in both runtime and energy consumption, inference on CPU nodes can be comparably efficient. One advantage of our approach is that the information on energy consumption is available to all users of the supercomputer, enabling an easy transfer to other workloads and raising user awareness of energy consumption.
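The abstract's first point is that energy should be measured from the actual power draw of the hardware rather than inferred from runtime alone. The following sketch illustrates that idea for a single GPU only; it is not the authors' measurement setup. It assumes the `pynvml` package is installed, samples board power via NVML in a background thread, and integrates it over the duration of a workload. The helper name `measure_gpu_energy`, the sampling interval, and the device index are arbitrary choices for this example, and CPU, memory, and whole-node power are deliberately out of scope.

```python
import threading
import time

import pynvml


def measure_gpu_energy(workload, interval_s=0.1, device_index=0):
    """Run `workload()` and return (result, estimated GPU energy in joules).

    Power is sampled via NVML in a background thread and integrated with a
    trapezoidal rule. Only GPU board power is captured, not the consumption
    of the full compute node.
    """
    pynvml.nvmlInit()
    handle = pynvml.nvmlDeviceGetHandleByIndex(device_index)

    samples = []  # list of (timestamp in s, power in W)
    stop = threading.Event()

    def sampler():
        while not stop.is_set():
            # nvmlDeviceGetPowerUsage reports milliwatts; convert to watts.
            power_w = pynvml.nvmlDeviceGetPowerUsage(handle) / 1000.0
            samples.append((time.time(), power_w))
            time.sleep(interval_s)

    thread = threading.Thread(target=sampler, daemon=True)
    thread.start()
    try:
        result = workload()
    finally:
        stop.set()
        thread.join()
        pynvml.nvmlShutdown()

    # Integrate power over time to obtain energy in joules.
    energy_j = sum(
        0.5 * (p0 + p1) * (t1 - t0)
        for (t0, p0), (t1, p1) in zip(samples, samples[1:])
    )
    return result, energy_j


if __name__ == "__main__":
    # `train_one_epoch` is a placeholder for any training or inference step.
    train_one_epoch = lambda: time.sleep(5)
    _, joules = measure_gpu_energy(train_one_epoch)
    print(f"Estimated GPU energy: {joules:.1f} J ({joules / 3600:.4f} Wh)")
```

Comparing such measured energy against a naive runtime-times-TDP estimate makes the abstract's point concrete: two workloads with equal runtime can draw very different power depending on how well they utilize the node's accelerators.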


