Scenario-Driven Evaluation of Autonomous Agents: Integrating Large Language Model for UAV Mission Reliability

The Internet of Drones (IoD) integrates autonomous aerial platforms with security, logistics, agriculture, and disaster relief. Decision-making in IoD suffers in real-time adaptability, platform interoperability, and scalability. Conventional decision frameworks with heuristic algorithms and narrow...

Full description

Saved in:

Bibliographic Details
Main Author:	Anıl Sezgin
Format:	Article
Language:	English
Published:	MDPI AG 2025-03-01
Series:	Drones
Subjects:	Internet of Drones large language models centralized decision-making autonomous systems retrieval-augmented generation
Online Access:	https://www.mdpi.com/2504-446X/9/3/213
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	The Internet of Drones (IoD) integrates autonomous aerial platforms with security, logistics, agriculture, and disaster relief. Decision-making in IoD suffers in real-time adaptability, platform interoperability, and scalability. Conventional decision frameworks with heuristic algorithms and narrow Artificial Intelligence (AI) falter in complex environments. To mitigate these, in this study, an augmented decision model is proposed, combining large language models (LLMs) and retrieval-augmented generation (RAG) for enhancing IoD intelligence. Centralized intelligence is achieved by processing environment factors, mission logs, and telemetry, with real-time adaptability. Efficient retrieval of contextual information through RAG is merged with LLMs for timely, correct decision-making. Contextualized decision-making vastly improves adaptability in uncertain environments for a drone network. With LLMs and RAG, the model introduces a scalable, adaptable IoD operations solution. It enables the development of autonomous aerial platforms in industries, with future work in computational efficiency, ethics, and extending operational environments. In-depth analysis with the collection of drone telemetry logs and operational factors was conducted. Decision accuracy, response time, and contextual relevance were measured to gauge system effectiveness. The model’s performance increased remarkably, with a BLEU of 0.82 and a cosine similarity of 0.87, proving its effectiveness for operational commands. Decision latency averaged 120 milliseconds, proving its suitability for real-time IoD use cases.
ISSN:	2504-446X

Scenario-Driven Evaluation of Autonomous Agents: Integrating Large Language Model for UAV Mission Reliability

Similar Items