Dual Generative Network with Discriminative Information for Generalized Zero-Shot Learning

Zero-shot learning is dedicated to solving the classification problem of unseen categories, while generalized zero-shot learning aims to classify the samples selected from both seen classes and unseen classes, in which “seen” and “unseen” classes indicate whether they can be used in the training pro...

Full description

Saved in:
Bibliographic Details
Main Authors: Tingting Xu, Ye Zhao, Xueliang Liu
Format: Article
Language:English
Published: Wiley 2021-01-01
Series:Complexity
Online Access:http://dx.doi.org/10.1155/2021/6656797
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832550551565369344
author Tingting Xu
Ye Zhao
Xueliang Liu
author_facet Tingting Xu
Ye Zhao
Xueliang Liu
author_sort Tingting Xu
collection DOAJ
description Zero-shot learning is dedicated to solving the classification problem of unseen categories, while generalized zero-shot learning aims to classify the samples selected from both seen classes and unseen classes, in which “seen” and “unseen” classes indicate whether they can be used in the training process, and if so, they indicate seen classes, and vice versa. Nowadays, with the promotion of deep learning technology, the performance of zero-shot learning has been greatly improved. Generalized zero-shot learning is a challenging topic that has promising prospects in many realistic scenarios. Although the zero-shot learning task has made gratifying progress, there is still a strong deviation between seen classes and unseen classes in the existing methods. Recent methods focus on learning a unified semantic-aligned visual representation to transfer knowledge between two domains, while ignoring the intrinsic characteristics of visual features which are discriminative enough to be classified by itself. To solve the above problems, we propose a novel model that uses the discriminative information of visual features to optimize the generative module, in which the generative module is a dual generation network framework composed of conditional VAE and improved WGAN. Specifically, the model uses the discrimination information of visual features, according to the relevant semantic embedding, synthesizes the visual features of unseen categories by using the learned generator, and then trains the final softmax classifier by using the generated visual features, thus realizing the recognition of unseen categories. In addition, this paper also analyzes the effect of the additional classifiers with different structures on the transmission of discriminative information. We have conducted a lot of experiments on six commonly used benchmark datasets (AWA1, AWA2, APY, FLO, SUN, and CUB). The experimental results show that our model outperforms several state-of-the-art methods for both traditional as well as generalized zero-shot learning.
format Article
id doaj-art-53be854bd0774cca8f5aec08724e4827
institution Kabale University
issn 1076-2787
1099-0526
language English
publishDate 2021-01-01
publisher Wiley
record_format Article
series Complexity
spelling doaj-art-53be854bd0774cca8f5aec08724e48272025-02-03T06:06:30ZengWileyComplexity1076-27871099-05262021-01-01202110.1155/2021/66567976656797Dual Generative Network with Discriminative Information for Generalized Zero-Shot LearningTingting Xu0Ye Zhao1Xueliang Liu2School of Computer and Information, Hefei University of Technology, Hefei 230000, ChinaSchool of Computer and Information, Hefei University of Technology, Hefei 230000, ChinaSchool of Computer and Information, Hefei University of Technology, Hefei 230000, ChinaZero-shot learning is dedicated to solving the classification problem of unseen categories, while generalized zero-shot learning aims to classify the samples selected from both seen classes and unseen classes, in which “seen” and “unseen” classes indicate whether they can be used in the training process, and if so, they indicate seen classes, and vice versa. Nowadays, with the promotion of deep learning technology, the performance of zero-shot learning has been greatly improved. Generalized zero-shot learning is a challenging topic that has promising prospects in many realistic scenarios. Although the zero-shot learning task has made gratifying progress, there is still a strong deviation between seen classes and unseen classes in the existing methods. Recent methods focus on learning a unified semantic-aligned visual representation to transfer knowledge between two domains, while ignoring the intrinsic characteristics of visual features which are discriminative enough to be classified by itself. To solve the above problems, we propose a novel model that uses the discriminative information of visual features to optimize the generative module, in which the generative module is a dual generation network framework composed of conditional VAE and improved WGAN. Specifically, the model uses the discrimination information of visual features, according to the relevant semantic embedding, synthesizes the visual features of unseen categories by using the learned generator, and then trains the final softmax classifier by using the generated visual features, thus realizing the recognition of unseen categories. In addition, this paper also analyzes the effect of the additional classifiers with different structures on the transmission of discriminative information. We have conducted a lot of experiments on six commonly used benchmark datasets (AWA1, AWA2, APY, FLO, SUN, and CUB). The experimental results show that our model outperforms several state-of-the-art methods for both traditional as well as generalized zero-shot learning.http://dx.doi.org/10.1155/2021/6656797
spellingShingle Tingting Xu
Ye Zhao
Xueliang Liu
Dual Generative Network with Discriminative Information for Generalized Zero-Shot Learning
Complexity
title Dual Generative Network with Discriminative Information for Generalized Zero-Shot Learning
title_full Dual Generative Network with Discriminative Information for Generalized Zero-Shot Learning
title_fullStr Dual Generative Network with Discriminative Information for Generalized Zero-Shot Learning
title_full_unstemmed Dual Generative Network with Discriminative Information for Generalized Zero-Shot Learning
title_short Dual Generative Network with Discriminative Information for Generalized Zero-Shot Learning
title_sort dual generative network with discriminative information for generalized zero shot learning
url http://dx.doi.org/10.1155/2021/6656797
work_keys_str_mv AT tingtingxu dualgenerativenetworkwithdiscriminativeinformationforgeneralizedzeroshotlearning
AT yezhao dualgenerativenetworkwithdiscriminativeinformationforgeneralizedzeroshotlearning
AT xueliangliu dualgenerativenetworkwithdiscriminativeinformationforgeneralizedzeroshotlearning