Research on the construction of intelligent art design system based on multimodal perception and generative AI

Abstract This paper introduces an intelligent art design system powered by generative AI to address the cross-modal semantic disconnection in art creation. By integrating symbolic cognitive constraints and dynamic rule injection mechanisms, the system enhances both semantic accuracy and aesthetic qu...

Full description

Saved in:
Bibliographic Details
Main Authors: Hang Hang, Zhengdong Li
Format: Article
Language:English
Published: Springer 2025-08-01
Series:Discover Applied Sciences
Subjects:
Online Access:https://doi.org/10.1007/s42452-025-07513-0
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849225973551071232
author Hang Hang
Zhengdong Li
author_facet Hang Hang
Zhengdong Li
author_sort Hang Hang
collection DOAJ
description Abstract This paper introduces an intelligent art design system powered by generative AI to address the cross-modal semantic disconnection in art creation. By integrating symbolic cognitive constraints and dynamic rule injection mechanisms, the system enhances both semantic accuracy and aesthetic quality in generative models. A cultural semantic knowledge graph is developed to map cultural metaphors to visual representations, while a symbolic attention encoder ensures semantic consistency during image generation. Additionally, a differentiable aesthetic rule validator adjusts the output in real-time to optimize the alignment with both artistic intent and aesthetic standards. Experimental results show that the proposed system outperforms existing models in key areas such as cultural symbol mapping (92.5% coverage), symmetry, and color harmony, delivering more accurate and aesthetically pleasing results. The system has been applied in real-world art creation, significantly improving the efficiency and creativity of artists by providing culturally relevant and visually coherent references. The findings highlight the system’s potential as an innovative tool for generative art, bridging the gap between semantic accuracy and aesthetic quality in the creative process.
format Article
id doaj-art-3747065ed6ef4d4c9ea34f25c4ee90bc
institution Kabale University
issn 3004-9261
language English
publishDate 2025-08-01
publisher Springer
record_format Article
series Discover Applied Sciences
spelling doaj-art-3747065ed6ef4d4c9ea34f25c4ee90bc2025-08-24T11:45:09ZengSpringerDiscover Applied Sciences3004-92612025-08-017912810.1007/s42452-025-07513-0Research on the construction of intelligent art design system based on multimodal perception and generative AIHang Hang0Zhengdong Li1Hefei Vocational and Technical CollegeAnhui Xinhua UniversityAbstract This paper introduces an intelligent art design system powered by generative AI to address the cross-modal semantic disconnection in art creation. By integrating symbolic cognitive constraints and dynamic rule injection mechanisms, the system enhances both semantic accuracy and aesthetic quality in generative models. A cultural semantic knowledge graph is developed to map cultural metaphors to visual representations, while a symbolic attention encoder ensures semantic consistency during image generation. Additionally, a differentiable aesthetic rule validator adjusts the output in real-time to optimize the alignment with both artistic intent and aesthetic standards. Experimental results show that the proposed system outperforms existing models in key areas such as cultural symbol mapping (92.5% coverage), symmetry, and color harmony, delivering more accurate and aesthetically pleasing results. The system has been applied in real-world art creation, significantly improving the efficiency and creativity of artists by providing culturally relevant and visually coherent references. The findings highlight the system’s potential as an innovative tool for generative art, bridging the gap between semantic accuracy and aesthetic quality in the creative process.https://doi.org/10.1007/s42452-025-07513-0Generative AI art creationCross-modal semantic breaksSymbolic cognitive constraintsDynamic rule injectionCultural metaphorical instructions
spellingShingle Hang Hang
Zhengdong Li
Research on the construction of intelligent art design system based on multimodal perception and generative AI
Discover Applied Sciences
Generative AI art creation
Cross-modal semantic breaks
Symbolic cognitive constraints
Dynamic rule injection
Cultural metaphorical instructions
title Research on the construction of intelligent art design system based on multimodal perception and generative AI
title_full Research on the construction of intelligent art design system based on multimodal perception and generative AI
title_fullStr Research on the construction of intelligent art design system based on multimodal perception and generative AI
title_full_unstemmed Research on the construction of intelligent art design system based on multimodal perception and generative AI
title_short Research on the construction of intelligent art design system based on multimodal perception and generative AI
title_sort research on the construction of intelligent art design system based on multimodal perception and generative ai
topic Generative AI art creation
Cross-modal semantic breaks
Symbolic cognitive constraints
Dynamic rule injection
Cultural metaphorical instructions
url https://doi.org/10.1007/s42452-025-07513-0
work_keys_str_mv AT hanghang researchontheconstructionofintelligentartdesignsystembasedonmultimodalperceptionandgenerativeai
AT zhengdongli researchontheconstructionofintelligentartdesignsystembasedonmultimodalperceptionandgenerativeai