Smart caching in a Data Lake for High Energy Physics analysis

08/02/2022
by   Tommaso Tedeschi, et al.
0

The continuous growth of data production in almost all scientific areas raises new problems in data access and management, especially in a scenario where the end-users, as well as the resources that they can access, are worldwide distributed. This work is focused on the data caching management in a Data Lake infrastructure in the context of the High Energy Physics field. We are proposing an autonomous method, based on Reinforcement Learning techniques, to improve the user experience and to contain the maintenance costs of the infrastructure.

READ FULL TEXT
research
03/14/2022

Deploying in-network caches in support of distributed scientific data sharing

The importance of intelligent data placement, management, and analysis h...
research
02/03/2022

Astronomical data organization, management and access in Scientific Data Lakes

The data volumes stored in telescope archives is constantly increasing d...
research
02/26/2019

Rucio - Scientific Data Management

Rucio is an open source software framework that provides scientific coll...
research
07/04/2019

Development of a data infrastructure for a global data and analysis center in astroparticle physics

Nowadays astroparticle physics faces a rapid data volume increase. Meanw...
research
05/12/2021

A Survey on Reinforcement Learning-Aided Caching in Mobile Edge Networks

Mobile networks are experiencing tremendous increase in data volume and ...
research
11/03/2017

Toward real-time data query systems in HEP

Exploratory data analysis tools must respond quickly to a user's questio...

Please sign up or login with your details

Forgot password? Click here to reset