Accelerating domain-aware electron microscopy analysis using deep learning models with synthetic data and image-wide confidence scoring

Abstract The integration of machine learning (ML) models enhances the efficiency, affordability, and reliability of feature detection in microscopy, yet their development and applicability are hindered by the dependency on scarce and often flawed manually labeled datasets with a lack of domain aware...

Full description

Saved in:
Bibliographic Details
Main Authors: M. J. Lynch, R. Jacobs, G. A. Bruno, P. Patki, D. Morgan, K. G. Field
Format: Article
Language:English
Published: Nature Portfolio 2025-08-01
Series:npj Computational Materials
Online Access:https://doi.org/10.1038/s41524-025-01756-6
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract The integration of machine learning (ML) models enhances the efficiency, affordability, and reliability of feature detection in microscopy, yet their development and applicability are hindered by the dependency on scarce and often flawed manually labeled datasets with a lack of domain awareness. We addressed these challenges by creating a physics-based synthetic image and data generator, resulting in an ML model that achieves comparable precision (0.86), recall (0.63), F1 scores (0.71), and engineering property predictions (R 2 = 0.82) to a model trained on human-labeled data. We enhanced both models by using feature prediction confidence scores to derive an image-wide confidence metric, enabling simple thresholding to eliminate ambiguous and out-of-domain images, resulting in performance boosts of 5–30% with a filtering-out rate of 25%. Our study demonstrates that synthetic data can eliminate human reliance in ML and provides a means for domain awareness in cases where many feature detections per image are needed.
ISSN:2057-3960