Nonlinear Waveform Sensing for Cognitive Radar Based on Reinforcement Learning

Cognitive radar automatically adjusts its waveform via ceaseless interaction with the environment and learning from the experience. Compared with the linear frequency modulation (LFM) that has been commonly adopted in cognitive radars, the nonlinear FM (NLFM) signal has more flexible frequency varia...

Full description

Saved in:
Bibliographic Details
Main Authors: Peikun Zhu, Xu Si, Jiachen Han, Jing Liang
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10838705/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1825207057343578112
author Peikun Zhu
Xu Si
Jiachen Han
Jing Liang
author_facet Peikun Zhu
Xu Si
Jiachen Han
Jing Liang
author_sort Peikun Zhu
collection DOAJ
description Cognitive radar automatically adjusts its waveform via ceaseless interaction with the environment and learning from the experience. Compared with the linear frequency modulation (LFM) that has been commonly adopted in cognitive radars, the nonlinear FM (NLFM) signal has more flexible frequency variation and small time delay–Doppler coupling. In this work, we propose an NLFM cognitive radar based on reinforcement learning for target sensing. Specifically, a radar waveform selection framework is proposed via the interactive multimodel. It embraces the Riccati equation and Riccati-like iterative calculations to obtain the prediction error covariance (PEC) and the prediction Bayesian Cramér–Rao lower bound (PBCRLB), respectively, which are used to guide the optimal waveform design. With PEC or PBCRLB, an entropy reward Q-learning method is also proposed for joint waveform parameter selection (JWPS) and pure waveform parameter selection from the NLFM library. Simulations show that both the time complexity and tracking accuracy of PEC-based Q-learning JWPS outperform that of the PBCRLB method. Furthermore, PXIe-5785 is utilized to construct a cognitive radar platform and conduct field experiments for nonlinear waveform sensing, which confirms that nonlinear waveforms are more effective than linear waveforms in target localization.
format Article
id doaj-art-d8ae7c015bbf4310af3a5e681beaf0b2
institution Kabale University
issn 1939-1404
2151-1535
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj-art-d8ae7c015bbf4310af3a5e681beaf0b22025-02-07T00:00:14ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing1939-14042151-15352025-01-01184821483510.1109/JSTARS.2025.352865910838705Nonlinear Waveform Sensing for Cognitive Radar Based on Reinforcement LearningPeikun Zhu0https://orcid.org/0000-0001-7482-2996Xu Si1https://orcid.org/0000-0003-1279-5667Jiachen Han2https://orcid.org/0000-0002-4811-5048Jing Liang3https://orcid.org/0000-0002-0860-6563School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaSchool of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaSchool of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaSchool of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu, ChinaCognitive radar automatically adjusts its waveform via ceaseless interaction with the environment and learning from the experience. Compared with the linear frequency modulation (LFM) that has been commonly adopted in cognitive radars, the nonlinear FM (NLFM) signal has more flexible frequency variation and small time delay–Doppler coupling. In this work, we propose an NLFM cognitive radar based on reinforcement learning for target sensing. Specifically, a radar waveform selection framework is proposed via the interactive multimodel. It embraces the Riccati equation and Riccati-like iterative calculations to obtain the prediction error covariance (PEC) and the prediction Bayesian Cramér–Rao lower bound (PBCRLB), respectively, which are used to guide the optimal waveform design. With PEC or PBCRLB, an entropy reward Q-learning method is also proposed for joint waveform parameter selection (JWPS) and pure waveform parameter selection from the NLFM library. Simulations show that both the time complexity and tracking accuracy of PEC-based Q-learning JWPS outperform that of the PBCRLB method. Furthermore, PXIe-5785 is utilized to construct a cognitive radar platform and conduct field experiments for nonlinear waveform sensing, which confirms that nonlinear waveforms are more effective than linear waveforms in target localization.https://ieeexplore.ieee.org/document/10838705/Cognitive radar (CR)entropy reward Q-learning (ERQL)nonlinear frequency modulation (NLFM)target sensingwaveform selection
spellingShingle Peikun Zhu
Xu Si
Jiachen Han
Jing Liang
Nonlinear Waveform Sensing for Cognitive Radar Based on Reinforcement Learning
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Cognitive radar (CR)
entropy reward Q-learning (ERQL)
nonlinear frequency modulation (NLFM)
target sensing
waveform selection
title Nonlinear Waveform Sensing for Cognitive Radar Based on Reinforcement Learning
title_full Nonlinear Waveform Sensing for Cognitive Radar Based on Reinforcement Learning
title_fullStr Nonlinear Waveform Sensing for Cognitive Radar Based on Reinforcement Learning
title_full_unstemmed Nonlinear Waveform Sensing for Cognitive Radar Based on Reinforcement Learning
title_short Nonlinear Waveform Sensing for Cognitive Radar Based on Reinforcement Learning
title_sort nonlinear waveform sensing for cognitive radar based on reinforcement learning
topic Cognitive radar (CR)
entropy reward Q-learning (ERQL)
nonlinear frequency modulation (NLFM)
target sensing
waveform selection
url https://ieeexplore.ieee.org/document/10838705/
work_keys_str_mv AT peikunzhu nonlinearwaveformsensingforcognitiveradarbasedonreinforcementlearning
AT xusi nonlinearwaveformsensingforcognitiveradarbasedonreinforcementlearning
AT jiachenhan nonlinearwaveformsensingforcognitiveradarbasedonreinforcementlearning
AT jingliang nonlinearwaveformsensingforcognitiveradarbasedonreinforcementlearning