Identifying key genes in COPD risk via multiple population data integration and gene prioritization.

Chronic obstructive pulmonary disease (COPD) is a highly prevalent disease, making it a leading cause of death worldwide. Several genome-wide association studies (GWAS) have been conducted to identify loci associated with COPD. However, different ancestral genetic compositions for the same disease a...

Full description

Saved in:
Bibliographic Details
Main Authors: Afeefa Zainab, Hayato Anzawa, Kengo Kinoshita
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2024-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0305803
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850033509282873344
author Afeefa Zainab
Hayato Anzawa
Kengo Kinoshita
author_facet Afeefa Zainab
Hayato Anzawa
Kengo Kinoshita
author_sort Afeefa Zainab
collection DOAJ
description Chronic obstructive pulmonary disease (COPD) is a highly prevalent disease, making it a leading cause of death worldwide. Several genome-wide association studies (GWAS) have been conducted to identify loci associated with COPD. However, different ancestral genetic compositions for the same disease across various populations present challenges in studies involving multi-population data. In this study, we aimed to identify protein-coding genes associated with COPD by prioritizing genes for each population's GWAS data, and then combining these results instead of performing a common meta-GWAS due to significant sample differences in different population cohorts. Lung function measurements are often used as indicators for COPD risk prediction; therefore, we used lung function GWAS data from two populations, Japanese and European, and re-evaluated them using a multi-population gene prioritization approach. This study identified significant single nucleotide variants (SNPs) in both Japanese and European populations. The Japanese GWAS revealed nine significant SNPs and four lead SNPs in three genomic risk loci. In comparison, the European population showed five lead SNPs and 17 independent significant SNPs in 21 genomic risk loci. A comparative analysis of the results found 28 similar genes in the prioritized gene lists of both populations. We also performed a standard meta-analysis for comparison and identified 18 common genes in both populations. Our approach demonstrated that trans-ethnic linkage disequilibrium (LD) could detect some significant novel associations and genes that have yet to be reported or were missed in previous analyses. The study suggests that a gene prioritization approach for multi-population analysis using GWAS data may be a feasible method to identify new associations in data with genetic diversity across different populations. It also highlights the possibility of identifying generalized and population-specific treatment and diagnostic options.
format Article
id doaj-art-de9c10e486f942c78a13781f548fc3ff
institution DOAJ
issn 1932-6203
language English
publishDate 2024-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-de9c10e486f942c78a13781f548fc3ff2025-08-20T02:58:11ZengPublic Library of Science (PLoS)PLoS ONE1932-62032024-01-011911e030580310.1371/journal.pone.0305803Identifying key genes in COPD risk via multiple population data integration and gene prioritization.Afeefa ZainabHayato AnzawaKengo KinoshitaChronic obstructive pulmonary disease (COPD) is a highly prevalent disease, making it a leading cause of death worldwide. Several genome-wide association studies (GWAS) have been conducted to identify loci associated with COPD. However, different ancestral genetic compositions for the same disease across various populations present challenges in studies involving multi-population data. In this study, we aimed to identify protein-coding genes associated with COPD by prioritizing genes for each population's GWAS data, and then combining these results instead of performing a common meta-GWAS due to significant sample differences in different population cohorts. Lung function measurements are often used as indicators for COPD risk prediction; therefore, we used lung function GWAS data from two populations, Japanese and European, and re-evaluated them using a multi-population gene prioritization approach. This study identified significant single nucleotide variants (SNPs) in both Japanese and European populations. The Japanese GWAS revealed nine significant SNPs and four lead SNPs in three genomic risk loci. In comparison, the European population showed five lead SNPs and 17 independent significant SNPs in 21 genomic risk loci. A comparative analysis of the results found 28 similar genes in the prioritized gene lists of both populations. We also performed a standard meta-analysis for comparison and identified 18 common genes in both populations. Our approach demonstrated that trans-ethnic linkage disequilibrium (LD) could detect some significant novel associations and genes that have yet to be reported or were missed in previous analyses. The study suggests that a gene prioritization approach for multi-population analysis using GWAS data may be a feasible method to identify new associations in data with genetic diversity across different populations. It also highlights the possibility of identifying generalized and population-specific treatment and diagnostic options.https://doi.org/10.1371/journal.pone.0305803
spellingShingle Afeefa Zainab
Hayato Anzawa
Kengo Kinoshita
Identifying key genes in COPD risk via multiple population data integration and gene prioritization.
PLoS ONE
title Identifying key genes in COPD risk via multiple population data integration and gene prioritization.
title_full Identifying key genes in COPD risk via multiple population data integration and gene prioritization.
title_fullStr Identifying key genes in COPD risk via multiple population data integration and gene prioritization.
title_full_unstemmed Identifying key genes in COPD risk via multiple population data integration and gene prioritization.
title_short Identifying key genes in COPD risk via multiple population data integration and gene prioritization.
title_sort identifying key genes in copd risk via multiple population data integration and gene prioritization
url https://doi.org/10.1371/journal.pone.0305803
work_keys_str_mv AT afeefazainab identifyingkeygenesincopdriskviamultiplepopulationdataintegrationandgeneprioritization
AT hayatoanzawa identifyingkeygenesincopdriskviamultiplepopulationdataintegrationandgeneprioritization
AT kengokinoshita identifyingkeygenesincopdriskviamultiplepopulationdataintegrationandgeneprioritization