Showing 341 - 360 results of 617 for search 'Policy integration algorithm'
  1. 341

    Computational design exploration of rocket nozzle using deep reinforcement learning by Aagashram Neelakandan, Arockia Selvakumar Arockia Doss, Natrayan Lakshmaiya

    Published 2025-03-01
    “…Additionally, the use of the Single-Step Proximal Policy Optimization (SSPPO) algorithm enhances the exploration of nozzle geometries by maximizing aerodynamic performance while balancing computational efficiency. …”
    Article
  2. 342

    MODELS OF STRATEGIC COMMUNICATION FOR ENSURING SOCIAL COHESION IN DE-OCCUPIED REGIONS by Tetiana Lushahina

    Published 2025-06-01
    “…It is proposed to consider strategic communication as a key instrument of state policy in the context of hybrid threats and post-conflict transformation. …”
    Article
  3. 343

    Volatility Spillover Between the Carbon Market and Traditional Energy Market Using the DGC-t-MSV Model by Jining Wang, Renjie Zeng, Lei Wang

    Published 2024-11-01
    “…This study employed the dynamic conditional correlation algorithm and incorporated the temporal dynamics of spillover effect to enhance the Multivariate Stochastic Volatility (MSV) model. …”
    Article
  4. 344

    Digital Intelligence Pathology Platform and Its Service Pattern by Xiaohong Chen, Liu Liu, Yajua Niu, Xiaoliang Liu, Xiaohai Li, Jianhua Zhou, Junpu Wang

    Published 2025-04-01
    “…Building on proprietary research achievements, we propose a tripartite middleware architecture comprising data, algorithm, and service platforms. The system architecture integrates standardized data management, AI-driven analytical modules, and interoperable service interfaces to optimize pathological workflows. …”
    Article
  5. 345
  6. 346

    Enhanced Q learning and deep reinforcement learning for unmanned combat intelligence planning in adversarial environments by Xu Jianhong, Liang Gongqian

    Published 2025-08-01
    “…By integrating an attention mechanism and an adaptive reward mechanism, the algorithm effectively fuses image data, sensor data, and intelligent information, enabling collaborative multimodal data processing. …”
    Article
  7. 347

    Research on Ship Heave Motion Compensation Control Under Complex Sea State Environment Based on Improved Reinforcement Learning by ZHANG Qin, ZHOU Jingyi, WANG Xingyue, HU Xiong

    Published 2025-07-01
    “…Within this process, the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm assumes a central role as the core control strategy. …”
    Article
  8. 348

    Exploring the opportunities and challenges of using large language models to represent institutional agency in land system modelling by Y. Zeng, C. Brown, J. Raymond, M. Byari, R. Hotz, M. Rounsevell

    Published 2025-03-01
    “…The LLM agents provide simulated reasoning and policy action output. The agents' performance is benchmarked against two baseline scenarios: one without policy interventions and another implementing optimal policy actions determined through a genetic algorithm. …”
    Article
  9. 349

    A Hierarchical Reinforcement Learning Framework for Multi-Agent Cooperative Maneuver Interception in Dynamic Environments by Qinlong Huang, Yasong Luo, Zhong Liu, Jiawei Xia, Ming Chang, Jiaqi Li

    Published 2025-06-01
    “…At the low level, an improved prioritized experience replay multi-agent deep deterministic policy gradient algorithm (PER-MADDPG) is designed, integrating curriculum learning and prioritized experience replay mechanisms to effectively enhance the interception success rate against complex maneuvering targets. …”
    Article
  10. 350

    Online Self-Organizing Network Control with Time Averaged Weighted Throughput Objective by Zhicong Zhang, Shuai Li, Xiaohui Yan

    Published 2018-01-01
    “…To maximize the mean time averaged weighted throughput of the jobs through the network, we propose a reinforcement learning algorithm with time averaged reward to deal with the control decision model and obtain a control policy integrating the jobs routing selection strategy and the jobs sequencing strategy. …”
    Article
  11. 351

    UAV spatiotemporal crowdsourcing resource allocation based on deep reinforcement learning by Yaxi LIU, Xulong LI, Jiahao HUO, Wei HUANGFU

    Published 2025-01-01
    “…Our results show that the SAC algorithm achieves faster convergence speed and better solutions than existing state-of-the-art methods, such as the twin delayed deep deterministic policy gradient (TD3) and the deep deterministic policy gradient (DDPG) algorithms. …”
    Article
  12. 352

    Demand-Adapting Charging Strategy for Battery-Swapping Stations by Benjamín Pla, Pau Bares, Andre Aronis, Augusto Perin

    Published 2025-07-01
    “…Battery tests were conducted to assess charging time variability, and traffic density measurements were collected in the city of Valencia across multiple days to provide a realistic scenario, while real-time data of the electricity cost is integrated into the control proposal. The results show that incorporating traffic and electricity price forecasts into the control algorithm can reduce electricity costs by up to 11% and decrease associated CO₂ emissions by more than 26%.…”
    Article
  13. 353
  14. 354

    Deep reinforcement learning enhanced PID control for hydraulic servo systems in injection molding machines by Xiaoxi Hao, Zengmiao Xin, Weizhuo Huang, Sicheng Wan, Guangfan Qiu, Tianlei Wang, Zhu Wang

    Published 2025-07-01
    “…In particular, this work innovatively integrates the DDPG algorithm with an auxiliary servo valve structure for PID parameter optimization and dynamic performance enhancement, offering new ideas and technical pathways for adaptive control of complex hydraulic systems.…”
    Article
  15. 355

    Optimal Power Flow for High Spatial and Temporal Resolution Power Systems with High Renewable Energy Penetration Using Multi-Agent Deep Reinforcement Learning by Liangcai Zhou, Long Huo, Linlin Liu, Hao Xu, Rui Chen, Xin Chen

    Published 2025-04-01
    “…A heterogeneous multi-agent proximal policy optimization (H-MAPPO) DRL algorithm is introduced for multi-area power systems. …”
    Article
  16. 356

    A predictive framework using advanced machine learning approaches for measuring and analyzing the impact of synthetic agrochemicals on human health by Sahezpreet Singh, Puneet Kaur, Inderdeep Kaur, Gurpreet Singh, Satinder Kaur, Parminder Kaur

    Published 2025-05-01
    “…Although the incorporation of machine learning algorithms for accurate risk evaluation and predictive modeling is still underexplored, requiring novel solutions. …”
    Article
  17. 357
  18. 358

    Research on malicious code variants detection based on texture fingerprint by Xiao-guang HAN, Wu QU, Xuan-xia YAO, Chang-you GUO, Fang ZHOU

    Published 2014-08-01
    “…In the detection phase, according to the generation policy for malicious code texture fingerprint, the prototype system for texture fingerprint extraction and detection is constructed by employing the integrated weight method to multi-segmented texture fingerprint similarity matching to detect variants and unknown malicious codes. …”
    Article
  19. 359

    Real-time torque distribution simulation of parallel hybrid vehicle engine by Jing Wang

    Published 2025-08-01
    “…This study aims to develop a high-precision, robust torque distribution model to enhance energy utilization while addressing interference from environmental noise and extreme temperatures. Methods: A real-time torque distribution model integrates three core components: a Markov Decision Process framework transforms torque allocation into a mathematical optimization problem; the Proximal Policy Optimization algorithm enhanced with Prioritized Experience Replay dynamically generates control strategies; and Fiber Bragg Grating sensors achieve millisecond-level torque measurement by correlating shaft strain forces with wavelength shifts. …”
    Article
  20. 360

    Multi-Objective-Based Multi-Heterogeneous-Agent Deep Reinforcement Learning for Minimization of Voltage Deviation and Operation Cost in Active Distribution System by Anurak Deanseekeaw, Watcharakorn Pinthurat, Boonruang Marungsri

    Published 2025-01-01
    “…The proposed framework employs a multi-objective optimization approach, integrating three advanced algorithms: multi-agent proximal policy optimization (MAPPO), multi-agent asynchronous actor-critic (MAA2C), and multi-agent twin delayed deep deterministic policy gradient (MATD3). …”
    Article