Research on the construction of intelligent art design system based on multimodal perception and generative AI

Abstract This paper introduces an intelligent art design system powered by generative AI to address the cross-modal semantic disconnection in art creation. By integrating symbolic cognitive constraints and dynamic rule injection mechanisms, the system enhances both semantic accuracy and aesthetic qu...

Full description

Saved in:
Bibliographic Details
Main Authors: Hang Hang, Zhengdong Li
Format: Article
Language:English
Published: Springer 2025-08-01
Series:Discover Applied Sciences
Subjects:
Online Access:https://doi.org/10.1007/s42452-025-07513-0
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract This paper introduces an intelligent art design system powered by generative AI to address the cross-modal semantic disconnection in art creation. By integrating symbolic cognitive constraints and dynamic rule injection mechanisms, the system enhances both semantic accuracy and aesthetic quality in generative models. A cultural semantic knowledge graph is developed to map cultural metaphors to visual representations, while a symbolic attention encoder ensures semantic consistency during image generation. Additionally, a differentiable aesthetic rule validator adjusts the output in real-time to optimize the alignment with both artistic intent and aesthetic standards. Experimental results show that the proposed system outperforms existing models in key areas such as cultural symbol mapping (92.5% coverage), symmetry, and color harmony, delivering more accurate and aesthetically pleasing results. The system has been applied in real-world art creation, significantly improving the efficiency and creativity of artists by providing culturally relevant and visually coherent references. The findings highlight the system’s potential as an innovative tool for generative art, bridging the gap between semantic accuracy and aesthetic quality in the creative process.
ISSN:3004-9261