-
21
Hybrid deep learning framework based on EfficientViT for classification of gastrointestinal diseases
Published 2025-07-01“…At the same time, it includes the capacity of the ViT model to recognize the context of images of the GI tract for the detection of slight disease patterns and precursors of disease diffusion. Furthermore, we designed a dual-block in which input is divided into two parts (q1, q2) to better optimize the model q1 processed through an EfficientNet for local details and a q2 through encoder block for capturing the global dependencies, which enables EfficientViT to pay attention to multiple image regions simultaneously. …”
Get full text
Article -
22
Multimodal diffusion framework for collaborative text image audio generation and applications
Published 2025-07-01“…Abstract This paper presents a novel framework for collaborative generation across text, image, and audio modalities using an enhanced diffusion model architecture. We introduce a Hierarchical Cross-modal Alignment Network that establishes unified representations while preserving modality-specific characteristics, and a Cross-modal Conditional Diffusion Model that enables flexible generation pathways through innovative conditional embedding and attention-guided mechanisms. …”
Get full text
Article -
23
CaloDREAM – Detector response emulation via attentive flow matching
Published 2025-03-01Get full text
Article -
24
Menstrual cycle inspired latent diffusion model for image augmentation in energy production
Published 2025-05-01“…This paper introduces menstrual cycle-inspired latent diffusion model (MCI-LDM), a novel framework that addresses these challenges with three key modifications. …”
Get full text
Article -
25
Advancing Persistent Character Generation: Comparative Analysis of Fine-Tuning Techniques for Diffusion Models
Published 2024-09-01“…It excels in low VRAM contexts due to its targeted fine-tuning of low-rank matrices within cross-attention layers, enabling faster training and efficient parameter tweaking. …”
Get full text
Article -
26
Modeling Homogeneous, Stratified, and Diffusion Combustion in Hydrogen SI Engines Using the Wiebe Approach
Published 2025-06-01“…This study further develops the model by accounting for the combined influence of the mixture composition and engine speed, mixture stratification, and the effects of injection and ignition parameters on premixed and diffusion combustion. Special attention is given to combustion modeling in an engine with single injection and jet-guided operation.…”
Get full text
Article -
27
SVPDSA: Selective View Perception Data Synthesis With Annotations Using Lightweight Diffusion Network
Published 2025-01-01“…A compressed Unet-based diffusion model, pre-trained on the LAION-5B dataset, serves as the foundation for efficient text-to-image synthesis. …”
Get full text
Article -
28
Denoising diffusion probabilistic models for addressing data limitations in chest X-ray classification
Published 2024-01-01“…To address these challenges, there has been a growing interest in the use of deep generative models to create synthetic training data, with denoising diffusion probabilistic models (DDPMs) recently gaining attention for their ability to produce realistic and high-quality images. …”
Get full text
Article -
29
SGM-EMA: Speech Enhancement Method Score-Based Diffusion Model and EMA Mechanism
Published 2025-05-01“…This paper proposes a U-Net architecture using a score-based diffusion model and an efficient multi-scale attention mechanism (EMA) for the speech enhancement task. …”
Get full text
Article -
30
Swin-Diff: a single defocus image deblurring network based on diffusion model
Published 2025-02-01“…Our results validate the effectiveness of combining diffusion models with hierarchical attention mechanisms for high-quality defocus blur removal.…”
Get full text
Article -
31
Contour wavelet diffusion – a fast and high-quality facial expression generation model
Published 2024-12-01“…To address these limitations, we propose a contour wavelet diffusion model that accelerates both training and inference speeds. …”
Get full text
Article -
32
Bibliometric cartography of data science: a large-scale analysis on knowledge integration and diffusion
Published 2025-07-01“…The diffusion process is driven by both viral and broadcasting adopters, with the former facilitating early-stage dissemination, although a “damping” effect emerges as dissemination efficiency gradually declines. …”
Get full text
Article -
33
Dynamically Tunable Multidimensional Feature Focusing and Diffusion Networks for Water Surface Debris Detection
Published 2025-01-01“…Furthermore, a Focal-Diffuse Feature Pyramid Network (FD-FPN) was introduced to accurately capture and integrate key feature information through focused feature fusion techniques while utilizing cross-scale diffusion analysis to efficiently transfer and enhance feature information across different scales. …”
Get full text
Article -
34
A Mathematical Survey of Image Deep Edge Detection Algorithms: From Convolution to Attention
Published 2025-07-01“…Beginning with Sobel and Canny’s kernel-based approaches, we trace the shift to data-driven CNNs like Holistically Nested Edge Detection (HED) and Bidirectional Cascade Network (BDCN), which leverage multi-scale supervision and achieve ODS (Optimal Dataset Scale) scores 0.788 and 0.806, respectively. Attention mechanisms, as in EdgeNAT (ODS 0.860) and RankED (ODS 0.824), enhance global context, while generative models like GED (ODS 0.870) achieve state-of-the-art precision via diffusion and GAN frameworks. …”
Get full text
Article -
35
A Lightweight Conditional Diffusion Segmentation Network Based on Deformable Convolution for Surface Defect Detection
Published 2025-01-01“…Surface defect detection is crucial to industrial manufacturing and research for surface defects has drawn much attention. However, defects in industrial environment are very diverse. …”
Get full text
Article -
36
Attention-enhanced residual U-Net: lymph node segmentation method with bimodal MRI images
Published 2025-06-01“…The DWI and T2 images were fused and inputted into U-Net. The efficient channel attention (ECA) module was added to U-Net. …”
Get full text
Article -
37
Enhancing Atmospheric Turbulence Phase Screen Generation with an Improved Diffusion Model and U-Net Noise Generation Network
Published 2025-04-01“…Additionally, a self-attention module strengthens the model’s ability to learn phase screen features. …”
Get full text
Article -
38
DCT-DiffPose: A Lightweight Diffusion Model With Multi-Hypothesis for 3D Human Pose Estimation
Published 2025-01-01“…3D human pose estimation is a crucial task in computer vision with extensive applications, yet it remains challenging due to depth ambiguity and constraints on computational efficiency. In this paper, we propose DCT-DiffPose, a novel framework that integrates a diffusion model with Confidence and Consistency-based Multi-Hypothesis Aggregation (CCMA). …”
Get full text
Article -
39
Evaluating Nanofiltration and Reverse Osmosis Membranes for Pharmaceutically Active Compounds Removal: A Solution Diffusion Model Approach
Published 2024-11-01“…To address these issues, nanofiltration (NF) and reverse osmosis (RO) membrane technologies have gained attention. This study aims to evaluate the performance of NF and RO membranes in removing TrOCs from wastewater and develop a predictive model using the Solution Diffusion Model. …”
Get full text
Article -
40
MGLI-Former: a multi-scale and global-local information interactive attention transformer for urban shantytown extraction
Published 2024-12-01“…Shantytowns, characterized by poor living conditions and simple houses, necessitate efficient extraction and analysis for urban planning. …”
Get full text
Article