-
81
Decision-Time Learning and Planning Integrated Control for the Mild Hyperbaric Chamber
Published 2025-06-01“…Furthermore, a decision-time planning algorithm is developed and the planning process is further guided by incorporating a value network and an enhanced online policy. …”
Get full text
Article -
82
From humans to algorithms: A sociotechnical framework of workplace surveillance
Published 2025-12-01“…Current frameworks also underplay the influence of digital technology and fail to account for the impact modern digital technologies, such as artificial intelligence algorithms, have on shaping the characteristics of surveillance. …”
Get full text
Article -
83
RS-DRL-based offloading policy and UAV trajectory design in F-MEC systems
Published 2025-04-01Get full text
Article -
84
-
85
Reverse Engineering Segment Routing Policies and Link Costs With Inverse Reinforcement Learning and EM
Published 2025-01-01“…This study delves into the inverse problem of a general type of SR and attempts to infer the SR policies given expert traffic traces. To this end, we propose MoME, a Mixture-of-Experts (MoE) model using the Maximum Entropy Inverse Reinforcement Learning (MaxEnt-IRL) framework that is capable of incorporating diverse features (e.g., router, link and context) and capturing complex relationships in the link cost, in combination with an Expectation-Maximization (EM) based iterative algorithm that jointly infers link costs and SR policy classes. …”
Get full text
Article -
86
Surface Defect Detection for Small Samples of Particleboard Based on Improved Proximal Policy Optimization
Published 2025-04-01“…The proposed method is based on the proximal policy optimization (PPO) algorithm of the Actor-Critic framework, and defect detection is achieved by performing a series of scaling and translation operations on the mask. …”
Get full text
Article -
87
Decentralized Voltage and Var Control of Active Distribution Network Based on Parameter-Sharing Deep Reinforcement Learning
Published 2025-01-01“…By allowing agents to share parts of their neural network, the proposed Parameter Sharing - twin-delay deep deterministic policy gradient algorithm improves the stability and efficiency of voltage regulation. …”
Get full text
Article -
88
-
89
Clustering Analysis of the Energy Mix in Romania Using K-Means Algorithm
Published 2023-08-01Get full text
Article -
90
An Exploratory Application of Machine Learning Algorithms in Estimating Net Salaries in Romania
Published 2025-06-01“…The paper contributes to the integration and use of artificial intelligence methods in macroeconomic forecasting and labor market analysis. …”
Get full text
Article -
91
Economic Development of the Latin American Integration Association: Trends and Prospects
Published 2021-12-01“…The testing of the author’s methodology aimed at studying the economic development of LAIA member countries allows us to argue about the operability of the algorithm, which made it possible to establish the low efficiency of the integration policy of Latin American countries, which, despite the significant level of national wealth, does not allow us to form a trend towards economic growth, a decrease in dependence on the export of natural capital, and an increase in the standard of living of the population. …”
Get full text
Article -
92
A Reinforcement Learning Approach to Personalized Asthma Exacerbation Prediction Using Proximal Policy Optimization
Published 2025-01-01“…The model achieved 96.60% accuracy, 95.79% precision, 96.65% recall, and 95.92% F1-score, outperforming baseline RL algorithms such as Deep Q-Learning (92.21% accuracy), Advantage Actor-Critic (94.34% accuracy), and Trust Region Policy Optimization (95.12% accuracy). …”
Get full text
Article -
93
Heterogeneous Multi-Agent Task Planning Method in Complex Marine Environment
Published 2025-01-01“…The task allocation framework combines the proximal policy optimization algorithm with experience replay to train an Actor network, ensuring stable iterative updates of task allocation policies toward high-reward directions. …”
Get full text
Article -
94
Adaptive RFID Data Scheduling Using Proximal Policy Optimization for Reducing Data Processing Latency
Published 2025-01-01“…This paper presents a novel approach for dynamically offloading data using deep reinforcement learning, specifically employing the Proximal Policy Optimization (PPO) algorithm. The proposed method utilizes a central controller equipped with the PPO model to make intelligent, real-time reader selection decisions based on environmental factors such as reader load, tag mobility, and network conditions. …”
Get full text
Article -
95
Research on the policy inconsistency, network motifs and low carbon effects for municipal solid waste management
Published 2025-12-01“…This study combines the policy consistency formula, a four-node network motif evolution algorithm, and the Exponential Random Graph Model (ERGM), analyzing MSWM green network motifs with carbon emission data. …”
Get full text
Article -
96
Explainable post hoc portfolio management financial policy of a Deep Reinforcement Learning agent.
Published 2025-01-01“…In this work, driven by the motivation of making DRL explainable, we developed a novel Explainable DRL (XDRL) approach for PM, integrating the Proximal Policy Optimization (PPO) DRL algorithm with the model agnostic explainable machine learning techniques of feature importance, SHAP and LIME to enhance transparency in prediction time. …”
Get full text
Article -
97
RETRACTED: Utilizing Generative Design Algorithms for Innovative Structural Engineering Solutions
Published 2024-01-01Get full text
Article -
98
Investigating the hydrogen renaissance in the global energy transition with AI integration
Published 2025-04-01“…In storage, AI-driven algorithms are improving the management of hydrogen in large-scale storage systems, helping to mitigate issues such as leakage and optimizing storage conditions. …”
Get full text
Article -
99
Prospects of integrative and hybrid approaches to assessing undeclared work
Published 2024-12-01“…The article considers the prospects for the formation of a new methodology for researching undeclared work in order to develop an evidence-based policy on to minimize it. In the context of European integration, Ukraine has an obligation to join European initiatives to combat undeclared work and make this activity a systemic component of the relevant state policy. …”
Get full text
Article -
100
Federated Reinforcement Learning in Stock Trading Execution: The FPPO Algorithm for Information Security
Published 2025-01-01“…This paper presents the Federated Proximal Policy Optimization (FPPO) algorithm, an adaptive trade execution framework that leverages joint reinforcement learning. …”
Get full text
Article