MACK: Mismodeling addressed with contrastive knowledge

Bibliographic Details
Authors: Liam Rankin Sheldon, Dylan Sheldon Rankin, Philip Harris
Format: Article
Language: English
Published: SciPost 2025-05-01
Series: SciPost Physics
Online Access: https://scipost.org/SciPostPhys.18.5.150
Description
Summary: The use of machine learning methods in high energy physics typically relies on large volumes of precise simulation for training. As machine learning models become more complex they can become increasingly sensitive to differences between this simulation and the real data collected by experiments. We present a generic methodology based on contrastive learning which is able to greatly mitigate this negative effect. Crucially, the method does not require prior knowledge of the specifics of the mismodeling. While we demonstrate the efficacy of this technique using the task of jet-tagging at the Large Hadron Collider, it is applicable to a wide array of different tasks both in and out of the field of high energy physics.
ISSN: 2542-4653