Showing results 401 - 420 of 617 for search 'Policy integration algorithm'
  2. 402

    Research on Long-Term Scheduling Optimization of Water–Wind–Solar Multi-Energy Complementary System Based on DDPG by Zixing Wan, Wenwu Li, Mu He, Taotao Zhang, Shengzhe Chen, Weiwei Guan, Xiaojun Hua, Shang Zheng

    Published 2025-07-01
    “…Then, a long-term optimization scheduling model is established with the goal of maximizing the absorption of clean energy, and it is converted into a Markov Decision Process (MDP). Next, the DDPG algorithm is employed with a noise dynamic adjustment mechanism to optimize the policy in continuous action spaces, yielding the optimal long-term scheduling strategy for the water–wind–solar multi-energy complementary system. …”
    Article
  4. 404

    Three-Dimensional Path-Following Control of a Robotic Airship with Reinforcement Learning by Chunyu Nie, Zewei Zheng, Ming Zhu

    Published 2019-01-01
    “…To ensure the control adaptability without dependence on an accurate airship dynamic model, a Q-Learning algorithm is directly adopted for learning the action policy of actuator commands, and the controller is trained online based on actual motion. …”
    Article
  5. 405

    Autonomous agents: Augmenting visual information with raw audio data. by Enoch Solomon

    Published 2025-01-01
    “…Experimental evaluations were conducted employing Deep Q Networks (DQN) and Proximal Policy Optimization (PPO) algorithms within ViZDoom and Unity reinforcement learning environments. …”
    Article
  6. 406

    Estimation of High Spatial Resolution CO₂ Concentration in China from 2010 to 2022 Based on Multi-Source Carbon Satellite Data by Shanzhao Cai, Heng Dong, Bo Zhang, Huan Huang

    Published 2025-05-01
    “…To address this, this study effectively integrates XCO₂ data retrieved from the GOSAT and OCO-2 satellites using atmospheric profile adjustment and spatial grid integration techniques. …”
    Article
  7. 407

    The association between ozone exposure and blood pressure in a general Chinese middle-aged and older population: a large-scale repeated-measurement study by Chen Tang, Yiqin Zhang, Jingping Yi, Zhonghua Lu, Xianfa Xuan, Hanxiang Jiang, Dongbei Guo, Hanyu Xiang, Ting Wu, Jianhua Yan, Siyu Zhang, Yuxin Wang, Jie Zhang

    Published 2024-11-01
    “…The findings provide preliminary evidence for the impact of O₃ exposure on BP regulation and underscore the urgent need to reassess public health policies in response to O₃ pollution.…”
    Article
  8. 408

    A Comprehensive Survey on AI in Counter-Terrorism and Cybersecurity: Challenges and Ethical Dimensions by Ioannis Syllaidopoulos, Klimis S. Ntalianis, Ioannis Salmon

    Published 2025-01-01
    “…However, critical concerns such as algorithmic bias, data quality limitations, and governance challenges introduce significant obstacles to their deployment. …”
    Article
  9. 409

    Coordinated Volt/VAR Control in Distribution Networks Considering Demand Response via Safe Deep Reinforcement Learning by Dong Hua, Fei Peng, Suisheng Liu, Qinglin Lin, Jiahui Fan, Qian Li

    Published 2025-01-01
    “…Next, a safe deep reinforcement learning (SDRL) algorithm is proposed, incorporating a novel Lagrange multiplier update mechanism to ensure that the control policies adhere to safety constraints during the learning process. …”
    Article
  10. 410

    Research Status and Development Trends of Deep Reinforcement Learning in the Intelligent Transformation of Agricultural Machinery by Jiamuyang Zhao, Shuxiang Fan, Baohua Zhang, Aichen Wang, Liyuan Zhang, Qingzhen Zhu

    Published 2025-06-01
    “…Meanwhile, this paper identifies three major challenges facing DRL in agricultural contexts: the difficulty of dynamic path planning in unstructured environments, constraints imposed by edge computing resources on algorithmic real-time performance, and risks to policy reliability and safety under human–machine collaboration conditions. …”
    Article
  11. 411

    Modeling Chaotic Behavior of Chittagong Stock Indices by Shipra Banik, Mohammed Anwer, A. F. M. Khodadad Khan

    Published 2012-01-01
    “…We have used well-known models such as the genetic algorithm (GA) model and the adaptive network fuzzy integrated system (ANFIS) model as soft computing forecasting models. …”
    Article
  12. 412

    Novel deep reinforcement learning based collision avoidance approach for path planning of robots in unknown environment. by Raed Alharthi, Iram Noreen, Amna Khan, Turki Aljrees, Zoraiz Riaz, Nisreen Innab

    Published 2025-01-01
    “…Reinforcement learning can address these issues using its action feedback and reward policies. This research presents a novel Q-learning-based reinforcement algorithm with deep learning integration. …”
    Article
  14. 414

    Hub-and-spoke network design to optimize government subsidy costs for maritime transportation in Indonesia by Windra Priatna Humang, Djoko Prijo Utomo, Hasriwan Putra, Dedy Arianto, Rutma Pujiwat, Sucipto, Dwi Phalita Upahita, Nur Fitriana, Maharani Almira Salsabilla, Yustina Niken Raharina Hendra, Asep Yayat Nurhidayat, Mega Novetrishka Putri, Mohamad Ivan Aji Saputro, Amelia Santoso, Ivan Kristianto Singgih, Dina Natalia Prayogo, Indri Hapsari, Olyvia Novawanda, Stefanus Soegiharto, Jerry Agus Arlianto

    Published 2025-06-01
    “…This study advances the body of knowledge by presenting an empirical, data-driven framework for optimizing Indonesia's maritime logistics infrastructure. The results inform policy recommendations on subsidy restructuring, port infrastructure development, and the integration of real-time data analytics to enhance the sustainability and efficiency of maritime transport operations.…”
    Article
  15. 415

    Research Progress and Prospect of Green Infrastructure with Public Health Promotion Function by Tongyu LI, Junyi LIXU, Binxia XUE, Yan (USA) SONG

    Published 2025-07-01
    “…To manage the multidimensionality of GI research, cluster analysis is performed using a Word2Vec model combined with a K-means algorithm to integrate different GI forms into a coherent classification system. Results: The results show that GI can be clearly divided into different categories, such as urban green spaces and parks, high-interaction spaces, trees in built-up areas, water management and biofiltration systems, community and residential greening, green roofs and facades, linear green networks, and broader macro-GI strategies. …”
    Article
  16. 416

    Optimizing Fairness and Spectral Efficiency With Shapley-Based User Prioritization in Semantic Communication by Moirangthem Tiken Singh, Adnan Arif, Rabinder Kumar Prasad, Bikramjit Choudhury, Chandan Kalita, Sikdar Md. S. Askari

    Published 2025-01-01
    “…The Shapley-based approach outperforms established methods, including the Hungarian algorithm, reinforcement learning algorithms like Deep Q-Network (DQN) and Proximal Policy Optimization (PPO), as well as conventional 4G and 5G resource allocation strategies. …”
    Article
  17. 417

    Intelligent Wireless Power Scheduling for Lunar Multienergy Systems: Deep Reinforcement Learning for Real-Time Adaptive Beam Steering and Vehicle-to-Grid Energy Optimization by Thomas Tongxin Li, Shuangqi Li, Cynthia Xin Ding, Zhaoyao Bao, Mohannad Alhazmi

    Published 2025-01-01
    “…Future work will explore the integration of hybrid energy storage models, quantum-inspired optimization for real-time decision-making, and predictive beamforming algorithms to further enhance the reliability and efficiency of lunar energy networks.…”
    Article
  19. 419

    A Deep Reinforcement Learning Method with a Low Intercept Probability in a Netted Synthetic Aperture Radar by Longhao Xie, Ziyang Cheng, Ming Li, Huiyong Li

    Published 2025-07-01
    “…The powers in multiple moments are optimized using the DRL proximal policy optimization algorithm with the designed reward and observation. …”
    Article
  20. 420

    Energy efficient control of indoor environments under time-varying multi-parameter uncertainty by Jianhao ZHAO, Hua SONG, Xinyuan NAN, Xin CAI

    Published 2024-12-01
    “…To solve the problem that the current indoor environment is affected by a variety of time-varying parameters with large uncertainty and the existing control equipment cannot adaptively adjust the operating power according to the indoor environment, which has caused a great waste of energy, the method of integrating the prioritized experience replay (PER) into the deep deterministic policy gradient (DDPG) is adopted. …”
    Article