-
361
-
362
-
363
Multi-Agent Deep Reinforcement Learning Cooperative Control Model for Autonomous Vehicle Merging into Platoon in Highway
Published 2025-04-01“…To enhance training efficiency, we develop a dual-layer multi-agent maximum Q-value proximal policy optimization (MAMQPPO) method, which extends the multi-agent PPO algorithm (a policy gradient method ensuring stable policy updates) by incorporating maximum Q-value action selection for platoon gap control and discrete command generation. …”
Get full text
Article -
364
Enhancing Port Shipping Synergy Through Bayesian Network: A Case of Major Chinese Ports
Published 2025-05-01“…Five leverage points stand out: customer engagement in green supply chains, perceived service quality, port digital information integration, multilateral trading maturity, and strict policy enforcement. …”
Get full text
Article -
365
AI-enabled obstetric point-of-care ultrasound as an emerging technology in low- and middle-income countries: provider and health system perspectives
Published 2025-07-01“…First, AI-enabled POCUS elicited concerns around algorithmic accuracy and compromised clinical acumen due to over-reliance on AI, but an interest in gestational age automation. …”
Get full text
Article -
366
Carbon co-benefits of digital economy and green finance: empirical evidence from China
Published 2025-07-01“…The study provides China and other emerging economies seeking to promote sustainable development through digital-green integration with policy-relevant implications.…”
Get full text
Article -
367
Learning-based locomotion control fusing multimodal perception for a bipedal humanoid robot
Published 2025-03-01“…In this paper, visual information is added to the locomotion control problem of humanoid robot, and a three-stage multi-objective constraint policy distillation optimization algorithm is innovantly proposed. …”
Get full text
Article -
368
GTrXL-SAC-Based Path Planning and Obstacle-Aware Control Decision-Making for UAV Autonomous Control
Published 2025-04-01“…To address these issues, this paper integrates DRL with the Transformer architecture to propose the GTrXL-SAC (gated Transformer-XL soft actor critic) algorithm. …”
Get full text
Article -
369
Collaborative carbon emission peak actions in urban agglomerations: multi-agent reinforcement learning analysis of the urban agglomerations of Beijing-Tianjin-Hebei and Yangtze Riv...
Published 2025-05-01“…The socioeconomic actions of cities within an urban agglomeration are coordinated using the QMIX algorithm, which employs value function factorization to integrate local decisions and feedback on reduced emissions. …”
Get full text
Article -
370
Innovative Business Models Towards Sustainable Energy Development: Assessing Benefits, Risks, and Optimal Approaches of Blockchain Exploitation in the Energy Transition
Published 2025-08-01“…Furthermore, according to the results, technological and legal risks are the most significant, followed by political, economic, and social risks, while environmental risks of blockchain integration are not as important. Strategies to address risks relevant to blockchain exploitation include ensuring policy alignment, emphasising economic feasibility, facilitating social inclusion, prioritising security and interoperability, consulting with legal experts, and using consensus algorithms with low energy consumption. …”
Get full text
Article -
371
A systematic review of AI-powered collaborative learning in higher education: Trends and outcomes from the last decade
Published 2025-01-01“…This review aims to integrate the current state and future opportunities of AI-enhanced collaborative learning within a higher education context to inform educators, researchers, and policy makers in pursuit of improving teaching and learning practices.…”
Get full text
Article -
372
Secure Latency-Aware Task Offloading Using Federated Learning and Zero Trust in Edge Computing for IoMT
Published 2025-01-01“…Within FL, an improved on-policy temporal difference control algorithm is leveraged for local model training. …”
Get full text
Article -
373
Human-in-the-loop control strategy for IoT-based smart thermostats with Deep Reinforcement Learning
Published 2025-05-01“…A key focus of this research is enhancing the adaptability of agents’ behavior by implementing a more generic and flexible Markov Decision Process (MDP) to promote policy generalization across diverse scenarios. The study explores the challenges of transferring control behaviors from simulation environments to real-world settings, examining the performance across different thermal zones and evaluating the integration flexibility of the control strategy within building systems. …”
Get full text
Article -
374
Secure Transmission for RIS-Assisted Downlink Hybrid FSO/RF SAGIN: Sum Secrecy Rate Maximization
Published 2025-03-01“…Then, an alternating iterative framework is proposed for a joint solution using the simulated annealing algorithm, semi-definite programming, and the designed deep deterministic policy gradient (DDPG) algorithm. …”
Get full text
Article -
375
The Impact of AI Software on Financial Transactions
Published 2025-01-01“…This paper explores AI’s applications in quantitative trading, risk forecasting, and intelligent customer interactions, demonstrating its ability to optimize decision-making and reduce operational costs. However, the integration of AI also raises significant concerns, including data security risks, algorithmic opacity, and increased market volatility, as evidenced by incidents like the 2010 “Flash Crash” and recent AI-driven stock fluctuations. …”
Get full text
Article -
376
The Gendered Well-being Assessment: addressing trauma, complex needs & social determinants of health
Published 2025-08-01“…By recognising protective factors like self-efficacy and social support, the GWA moves beyond harm mitigation to promote resilience and flourishing. Its structured, algorithm-based approach informs personalised support plans, transforming service provision and policy. …”
Get full text
Article -
377
Analysis of Encrypted Network Traffic for Enhancing Cyber-security in Dynamic Environments
Published 2024-12-01“…User selection is accomplished through robust Deep Reinforcement Learning with the Tabu Search (DRL-TS) algorithm, while channel selection is optimized through rigorous training employing Proximal Policy Optimization (PPO). …”
Get full text
Article -
378
-
379
Intelligent optimization method of fracturing parameters for shale oil reservoirs in Jimsar Sag, Junggar Basin, NW China
Published 2025-06-01“…A policy gradient-genetic-particle swarm algorithm is designed, which can adaptively adjust the inertia weights and learning factors in the iterative process, significantly improving the optimization ability of the optimization strategy. …”
Get full text
Article -
380
Optimization strategy of UAV‐ARIS assisted vehicular communication system
Published 2024-11-01“…In addition, a deep deterministic policy gradient (DDPG) algorithm is utilized for the optimization problem, and achieves convergence in continuous action space. …”
Get full text
Article