Toward Enhancing LightWeight GAN for Text-Guided Generation of Animated Character Faces

Traditional text-guided image generation methods primarily focus on modifying existing images or altering specific elements, which limits their applicability. This paper introduces a significant enhancement to the LightWeight-GAN model, originally designed for image generation from random noise, by...

Full description

Saved in:
Bibliographic Details
Main Authors: Sameh Zarif, Abdalfatah Najjar, Khalid Mohamed Amin, Abdullah Alharbi, Wail S. Elkilani, Mahmoud A. Shawky, Marian Wagdy
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/11119659/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Traditional text-guided image generation methods primarily focus on modifying existing images or altering specific elements, which limits their applicability. This paper introduces a significant enhancement to the LightWeight-GAN model, originally designed for image generation from random noise, by transforming it into a text-guided generative system. The proposed approach enables the generation of high-quality animated character faces directly from textual descriptions. To achieve this, we incorporate a mapping network that refines textual inputs before feeding them into the generator, ensuring more precise feature representation. Additionally, we integrate contrastive language-image pretraining (CLIP) to verify the generated images and enforce stronger alignment between textual prompts and visual outputs. The model is trained on a specialized facial dataset, demonstrating its ability to generate semantically accurate and visually compelling character faces. Extensive experiments using the CartoonSet dataset validate the effectiveness of our approach, achieving an FID score of 29.8 across 10 000 generated images. These improvements significantly outperform existing text-to-image generation models, making our system a promising tool for applications in game development, animation, and virtual reality.
ISSN:2169-3536