A large dataset covering the Chinese national sign language for dual-view isolated sign language recognition

Abstract Isolated Sign Language Recognition (ISLR), which seeks to automatically align sign videos with corresponding glosses, has recently gained considerable attention from the artificial intelligence community. This technology has the potential to bridge the communication gap between hearing peop...

Full description

Saved in:
Bibliographic Details
Main Authors: Peng Jin, Hongkai Li, Jun Yang, Yazhou Ren, Yuhao Li, Lilan Zhou, Jin Liu, Mei Zhang, Xiaorong Pu, Siyuan Jing
Format: Article
Language:English
Published: Nature Portfolio 2025-04-01
Series:Scientific Data
Online Access:https://doi.org/10.1038/s41597-025-04986-x
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850146403382198272
author Peng Jin
Hongkai Li
Jun Yang
Yazhou Ren
Yuhao Li
Lilan Zhou
Jin Liu
Mei Zhang
Xiaorong Pu
Siyuan Jing
author_facet Peng Jin
Hongkai Li
Jun Yang
Yazhou Ren
Yuhao Li
Lilan Zhou
Jin Liu
Mei Zhang
Xiaorong Pu
Siyuan Jing
author_sort Peng Jin
collection DOAJ
description Abstract Isolated Sign Language Recognition (ISLR), which seeks to automatically align sign videos with corresponding glosses, has recently gained considerable attention from the artificial intelligence community. This technology has the potential to bridge the communication gap between hearing people and the deaf community. However, the development of ISLR is hindered by the scarcity of sign language datasets. Moreover, existing ISLR datasets are limited by their provision of a single perspective, which makes hand gesture occlusion difficult to handle. In addition, existing Chinese ISLR datasets, such as DEVISIGN and NMFs-CSL, fail to cover the entire vocabulary of Chinese National Sign Language (CNSL). This greatly obstructs the application of ISLR in the real world. To address these challenges, we introduce a novel word-level sign language dataset for ISLR that encompasses the entire CNSL vocabulary, comprising 6,707 unique signs. Moreover, it provides two perspectives of signers: the front side and the left side. There are ten signers involved in sign video recording, and the processes of sign video recording, annotation and quality assurance were rigorously controlled. To the best of our knowledge, this dataset is the first dual-view Chinese sign language dataset for ISLR that covers all the sign words in CNSL.
format Article
id doaj-art-e2027bb00c454108a7b26c17a8a38adb
institution OA Journals
issn 2052-4463
language English
publishDate 2025-04-01
publisher Nature Portfolio
record_format Article
series Scientific Data
spelling doaj-art-e2027bb00c454108a7b26c17a8a38adb2025-08-20T02:27:50ZengNature PortfolioScientific Data2052-44632025-04-0112111010.1038/s41597-025-04986-xA large dataset covering the Chinese national sign language for dual-view isolated sign language recognitionPeng Jin0Hongkai Li1Jun Yang2Yazhou Ren3Yuhao Li4Lilan Zhou5Jin Liu6Mei Zhang7Xiaorong Pu8Siyuan Jing9Sichuan Province Key Laboratory of Philosophy and Social Science for Language Intelligence in Special Education, Leshan Normal UniversitySchool of Computer Science and Engineering, University of Electronic Science and Technology of ChinaSichuan Province Key Laboratory of Philosophy and Social Science for Language Intelligence in Special Education, Leshan Normal UniversitySchool of Computer Science and Engineering, University of Electronic Science and Technology of ChinaSchool of Computer Science and Engineering, University of Electronic Science and Technology of ChinaSichuan Province Key Laboratory of Philosophy and Social Science for Language Intelligence in Special Education, Leshan Normal UniversitySichuan Province Key Laboratory of Philosophy and Social Science for Language Intelligence in Special Education, Leshan Normal UniversitySichuan Province Key Laboratory of Philosophy and Social Science for Language Intelligence in Special Education, Leshan Normal UniversitySchool of Computer Science and Engineering, University of Electronic Science and Technology of ChinaSichuan Province Key Laboratory of Philosophy and Social Science for Language Intelligence in Special Education, Leshan Normal UniversityAbstract Isolated Sign Language Recognition (ISLR), which seeks to automatically align sign videos with corresponding glosses, has recently gained considerable attention from the artificial intelligence community. This technology has the potential to bridge the communication gap between hearing people and the deaf community. However, the development of ISLR is hindered by the scarcity of sign language datasets. Moreover, existing ISLR datasets are limited by their provision of a single perspective, which makes hand gesture occlusion difficult to handle. In addition, existing Chinese ISLR datasets, such as DEVISIGN and NMFs-CSL, fail to cover the entire vocabulary of Chinese National Sign Language (CNSL). This greatly obstructs the application of ISLR in the real world. To address these challenges, we introduce a novel word-level sign language dataset for ISLR that encompasses the entire CNSL vocabulary, comprising 6,707 unique signs. Moreover, it provides two perspectives of signers: the front side and the left side. There are ten signers involved in sign video recording, and the processes of sign video recording, annotation and quality assurance were rigorously controlled. To the best of our knowledge, this dataset is the first dual-view Chinese sign language dataset for ISLR that covers all the sign words in CNSL.https://doi.org/10.1038/s41597-025-04986-x
spellingShingle Peng Jin
Hongkai Li
Jun Yang
Yazhou Ren
Yuhao Li
Lilan Zhou
Jin Liu
Mei Zhang
Xiaorong Pu
Siyuan Jing
A large dataset covering the Chinese national sign language for dual-view isolated sign language recognition
Scientific Data
title A large dataset covering the Chinese national sign language for dual-view isolated sign language recognition
title_full A large dataset covering the Chinese national sign language for dual-view isolated sign language recognition
title_fullStr A large dataset covering the Chinese national sign language for dual-view isolated sign language recognition
title_full_unstemmed A large dataset covering the Chinese national sign language for dual-view isolated sign language recognition
title_short A large dataset covering the Chinese national sign language for dual-view isolated sign language recognition
title_sort large dataset covering the chinese national sign language for dual view isolated sign language recognition
url https://doi.org/10.1038/s41597-025-04986-x
work_keys_str_mv AT pengjin alargedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT hongkaili alargedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT junyang alargedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT yazhouren alargedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT yuhaoli alargedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT lilanzhou alargedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT jinliu alargedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT meizhang alargedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT xiaorongpu alargedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT siyuanjing alargedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT pengjin largedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT hongkaili largedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT junyang largedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT yazhouren largedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT yuhaoli largedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT lilanzhou largedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT jinliu largedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT meizhang largedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT xiaorongpu largedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition
AT siyuanjing largedatasetcoveringthechinesenationalsignlanguagefordualviewisolatedsignlanguagerecognition