Machine-Learning Compression for Particle Physics Discoveries

10/20/2022
by   Jack H. Collins, et al.
0

In collider-based particle and nuclear physics experiments, data are produced at such extreme rates that only a subset can be recorded for later analysis. Typically, algorithms select individual collision events for preservation and store the complete experimental response. A relatively new alternative strategy is to additionally save a partial record for a larger subset of events, allowing for later specific analysis of a larger fraction of events. We propose a strategy that bridges these paradigms by compressing entire events for generic offline analysis but at a lower fidelity. An optimal-transport-based β Variational Autoencoder (VAE) is used to automate the compression and the hyperparameter β controls the compression fidelity. We introduce a new approach for multi-objective learning functions by simultaneously learning a VAE appropriate for all values of β through parameterization. We present an example use case, a di-muon resonance search at the Large Hadron Collider (LHC), where we show that simulated data compressed by our β-VAE has enough fidelity to distinguish distinct signal morphologies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/09/2021

Efficient Data Compression for 3D Sparse TPC via Bicephalous Convolutional Autoencoder

Real-time data collection and analysis in large experimental facilities ...
research
03/23/2023

Clinically Relevant Latent Space Embedding of Cancer Histopathology Slides through Variational Autoencoder Based Image Compression

In this paper, we introduce a Variational Autoencoder (VAE) based traini...
research
06/11/2020

Extreme data compression while searching for new physics

Bringing a high-dimensional dataset into science-ready shape is a formid...
research
05/03/2018

Polynomial data compression for large-scale physics experiments

The new generation research experiments will introduce huge data surge t...
research
08/04/2022

Background Modeling for Double Higgs Boson Production: Density Ratios and Optimal Transport

We study the problem of data-driven background estimation, arising in th...
research
11/09/2017

Deep Neural Networks for Physics Analysis on low-level whole-detector data at the LHC

There has been considerable recent activity applying deep convolutional ...
research
07/27/2023

Online Clustered Codebook

Vector Quantisation (VQ) is experiencing a comeback in machine learning,...

Please sign up or login with your details

Forgot password? Click here to reset