-
341
Computational design exploration of rocket nozzle using deep reinforcement learning
Published 2025-03-01“…Additionally, the use of the Single-Step Proximal Policy Optimization (SSPPO) algorithm enhances the exploration of nozzle geometries by maximizing aerodynamic performance while balancing computational efficiency. …”
Article -
342
MODELS OF STRATEGIC COMMUNICATION FOR ENSURING SOCIAL COHESION IN DE-OCCUPIED REGIONS
Published 2025-06-01“…It is proposed to consider strategic communication as a key instrument of state policy in the context of hybrid threats and post-conflict transformation. …”
Article -
343
Volatility Spillover Between the Carbon Market and Traditional Energy Market Using the DGC-t-MSV Model
Published 2024-11-01“…This study employed the dynamic conditional correlation algorithm and incorporated the temporal dynamics of spillover effect to enhance the Multivariate Stochastic Volatility (MSV) model. …”
Article -
344
Digital Intelligence Pathology Platform and Its Service Pattern
Published 2025-04-01“…Building on proprietary research achievements, we propose a tripartite middleware architecture comprising data, algorithm, and service platforms. The system architecture integrates standardized data management, AI-driven analytical modules, and interoperable service interfaces to optimize pathological workflows. …”
Article -
345
LiDAR point cloud denoising for individual tree extraction based on the Noise4Denoise
Published 2025-01-01
Article -
346
Enhanced Q learning and deep reinforcement learning for unmanned combat intelligence planning in adversarial environments
Published 2025-08-01“…By integrating an attention mechanism and an adaptive reward mechanism, the algorithm effectively fuses image data, sensor data, and intelligent information, enabling collaborative multimodal data processing. …”
Article -
347
Research on Ship Heave Motion Compensation Control Under Complex Sea State Environment Based on Improved Reinforcement Learning
Published 2025-07-01“…Within this process, the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm assumes a central role as the core control strategy. …”
Article -
348
Exploring the opportunities and challenges of using large language models to represent institutional agency in land system modelling
Published 2025-03-01“…The LLM agents provide simulated reasoning and policy action output. The agents' performance is benchmarked against two baseline scenarios: one without policy interventions and another implementing optimal policy actions determined through a genetic algorithm. …”
Article -
349
A Hierarchical Reinforcement Learning Framework for Multi-Agent Cooperative Maneuver Interception in Dynamic Environments
Published 2025-06-01“…At the low level, an improved prioritized experience replay multi-agent deep deterministic policy gradient algorithm (PER-MADDPG) is designed, integrating curriculum learning and prioritized experience replay mechanisms to effectively enhance the interception success rate against complex maneuvering targets. …”
Article -
350
Online Self-Organizing Network Control with Time Averaged Weighted Throughput Objective
Published 2018-01-01“…To maximize the mean time averaged weighted throughput of the jobs through the network, we propose a reinforcement learning algorithm with time averaged reward to deal with the control decision model and obtain a control policy integrating the jobs routing selection strategy and the jobs sequencing strategy. …”
Article -
351
UAV spatiotemporal crowdsourcing resource allocation based on deep reinforcement learning
Published 2025-01-01“…Our results show that the SAC algorithm achieves faster convergence speed and better solutions than existing state-of-the-art methods, such as the twin delayed deep deterministic policy gradient (TD3) and the deep deterministic policy gradient (DDPG) algorithms. …”
Article -
352
Demand-Adapting Charging Strategy for Battery-Swapping Stations
Published 2025-07-01“…Battery tests were conducted to assess charging time variability, and traffic density measurements were collected in the city of Valencia across multiple days to provide a realistic scenario, while real-time data of the electricity cost is integrated into the control proposal. The results show that incorporating traffic and electricity price forecasts into the control algorithm can reduce electricity costs by up to 11% and decrease associated CO₂ emissions by more than 26%.…”
Article -
353
Machine learning-based detection of medical service anomalies: Kazakhstan’s health insurance data
Published 2025-06-01
Article -
354
Deep reinforcement learning enhanced PID control for hydraulic servo systems in injection molding machines
Published 2025-07-01“…In particular, this work innovatively integrates the DDPG algorithm with an auxiliary servo valve structure for PID parameter optimization and dynamic performance enhancement, offering new ideas and technical pathways for adaptive control of complex hydraulic systems.…”
Article -
355
Optimal Power Flow for High Spatial and Temporal Resolution Power Systems with High Renewable Energy Penetration Using Multi-Agent Deep Reinforcement Learning
Published 2025-04-01“…A heterogeneous multi-agent proximal policy optimization (H-MAPPO) DRL algorithm is introduced for multi-area power systems. …”
Article -
356
A predictive framework using advanced machine learning approaches for measuring and analyzing the impact of synthetic agrochemicals on human health
Published 2025-05-01“…However, the incorporation of machine learning algorithms for accurate risk evaluation and predictive modeling is still underexplored, requiring novel solutions. …”
Article -
357
What patients and caregivers want to know when consenting to the use of digital behavioral markers
Published 2024-12-01
Article -
358
Research on malicious code variants detection based on texture fingerprint
Published 2014-08-01“…In the detection phase, according to the generation policy for malicious code texture fingerprint, the prototype system for texture fingerprint extraction and detection is constructed by employing the integrated weight method to multi-segmented texture fingerprint similarity matching to detect variants and unknown malicious codes. …”
Article -
359
Real-time torque distribution simulation of parallel hybrid vehicle engine
Published 2025-08-01“…This study aims to develop a high-precision, robust torque distribution model to enhance energy utilization while addressing interference from environmental noise and extreme temperatures. Methods: A real-time torque distribution model integrates three core components: a Markov Decision Process framework transforms torque allocation into a mathematical optimization problem; the Proximal Policy Optimization algorithm enhanced with Prioritized Experience Replay dynamically generates control strategies; and Fiber Bragg Grating sensors achieve millisecond-level torque measurement by correlating shaft strain forces with wavelength shifts. …”
Article -
360
Multi-Objective-Based Multi-Heterogeneous-Agent Deep Reinforcement Learning for Minimization of Voltage Deviation and Operation Cost in Active Distribution System
Published 2025-01-01“…The proposed framework employs a multi-objective optimization approach, integrating three advanced algorithms: multi-agent proximal policy optimization (MAPPO), multi-agent asynchronous actor-critic (MAA2C), and multi-agent twin delayed deep deterministic policy gradient (MATD3). …”
Article