Speech enhancement method based on multi-domain fusion and neural architecture search

In order to further improve the self-learning and noise reduction ability of speech enhancement model, a speech enhancement method based on multi-domain fusion and neural architecture search was proposed.The multi-spatial domain mapping and fusion mechanism of speech signals were designed to realize...

Full description

Saved in:
Bibliographic Details
Main Authors: Rui ZHANG, Pengyun ZHANG, Chaoli SUN
Format: Article
Language:zho
Published: Editorial Department of Journal on Communications 2024-02-01
Series:Tongxin xuebao
Subjects:
Online Access:http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2024018/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841540030202380288
author Rui ZHANG
Pengyun ZHANG
Chaoli SUN
author_facet Rui ZHANG
Pengyun ZHANG
Chaoli SUN
author_sort Rui ZHANG
collection DOAJ
description In order to further improve the self-learning and noise reduction ability of speech enhancement model, a speech enhancement method based on multi-domain fusion and neural architecture search was proposed.The multi-spatial domain mapping and fusion mechanism of speech signals were designed to realize the mining of real complex number correlation.Based on the characteristics of convolution pooling of the model, a complex neural architecture search mechanism was proposed, and the speech enhancement model was constructed efficiently and automatically through the designed search space, search strategy and evaluation strategy.In the comparison and generalization experiment between the optimal speech enhancement model and the baseline model, the two indexes of PESQ and STOI increase by 5.6% compared with the optimal baseline model, and the number of model parameters is the lowest.
format Article
id doaj-art-32fd5f95ef0e42008ba209ec46b0665a
institution Kabale University
issn 1000-436X
language zho
publishDate 2024-02-01
publisher Editorial Department of Journal on Communications
record_format Article
series Tongxin xuebao
spelling doaj-art-32fd5f95ef0e42008ba209ec46b0665a2025-01-14T06:22:10ZzhoEditorial Department of Journal on CommunicationsTongxin xuebao1000-436X2024-02-014522523959383535Speech enhancement method based on multi-domain fusion and neural architecture searchRui ZHANGPengyun ZHANGChaoli SUNIn order to further improve the self-learning and noise reduction ability of speech enhancement model, a speech enhancement method based on multi-domain fusion and neural architecture search was proposed.The multi-spatial domain mapping and fusion mechanism of speech signals were designed to realize the mining of real complex number correlation.Based on the characteristics of convolution pooling of the model, a complex neural architecture search mechanism was proposed, and the speech enhancement model was constructed efficiently and automatically through the designed search space, search strategy and evaluation strategy.In the comparison and generalization experiment between the optimal speech enhancement model and the baseline model, the two indexes of PESQ and STOI increase by 5.6% compared with the optimal baseline model, and the number of model parameters is the lowest.http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2024018/speech enhancement modelcomplex spatial domain mappingmulti-domain fusioncomplex neural archi-tecture searchlow-cost evaluation
spellingShingle Rui ZHANG
Pengyun ZHANG
Chaoli SUN
Speech enhancement method based on multi-domain fusion and neural architecture search
Tongxin xuebao
speech enhancement model
complex spatial domain mapping
multi-domain fusion
complex neural archi-tecture search
low-cost evaluation
title Speech enhancement method based on multi-domain fusion and neural architecture search
title_full Speech enhancement method based on multi-domain fusion and neural architecture search
title_fullStr Speech enhancement method based on multi-domain fusion and neural architecture search
title_full_unstemmed Speech enhancement method based on multi-domain fusion and neural architecture search
title_short Speech enhancement method based on multi-domain fusion and neural architecture search
title_sort speech enhancement method based on multi domain fusion and neural architecture search
topic speech enhancement model
complex spatial domain mapping
multi-domain fusion
complex neural archi-tecture search
low-cost evaluation
url http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2024018/
work_keys_str_mv AT ruizhang speechenhancementmethodbasedonmultidomainfusionandneuralarchitecturesearch
AT pengyunzhang speechenhancementmethodbasedonmultidomainfusionandneuralarchitecturesearch
AT chaolisun speechenhancementmethodbasedonmultidomainfusionandneuralarchitecturesearch