RCFGL: Rapid Condition adaptive Fused Graphical Lasso and application to modeling brain region co-expression networks.
Inferring gene co-expression networks is a useful process for understanding gene regulation and pathway activity. The networks are usually undirected graphs where genes are represented as nodes and an edge represents a significant co-expression relationship. When expression data of multiple (p) gene...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Public Library of Science (PLoS)
2023-01-01
|
| Series: | PLoS Computational Biology |
| Online Access: | https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1010758&type=printable |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849706525039263744 |
|---|---|
| author | Souvik Seal Qunhua Li Elle Butler Basner Laura M Saba Katerina Kechris |
| author_facet | Souvik Seal Qunhua Li Elle Butler Basner Laura M Saba Katerina Kechris |
| author_sort | Souvik Seal |
| collection | DOAJ |
| description | Inferring gene co-expression networks is a useful process for understanding gene regulation and pathway activity. The networks are usually undirected graphs where genes are represented as nodes and an edge represents a significant co-expression relationship. When expression data of multiple (p) genes in multiple (K) conditions (e.g., treatments, tissues, strains) are available, joint estimation of networks harnessing shared information across them can significantly increase the power of analysis. In addition, examining condition-specific patterns of co-expression can provide insights into the underlying cellular processes activated in a particular condition. Condition adaptive fused graphical lasso (CFGL) is an existing method that incorporates condition specificity in a fused graphical lasso (FGL) model for estimating multiple co-expression networks. However, with computational complexity of O(p2K log K), the current implementation of CFGL is prohibitively slow even for a moderate number of genes and can only be used for a maximum of three conditions. In this paper, we propose a faster alternative of CFGL named rapid condition adaptive fused graphical lasso (RCFGL). In RCFGL, we incorporate the condition specificity into another popular model for joint network estimation, known as fused multiple graphical lasso (FMGL). We use a more efficient algorithm in the iterative steps compared to CFGL, enabling faster computation with complexity of O(p2K) and making it easily generalizable for more than three conditions. We also present a novel screening rule to determine if the full network estimation problem can be broken down into estimation of smaller disjoint sub-networks, thereby reducing the complexity further. We demonstrate the computational advantage and superior performance of our method compared to two non-condition adaptive methods, FGL and FMGL, and one condition adaptive method, CFGL in both simulation study and real data analysis. We used RCFGL to jointly estimate the gene co-expression networks in different brain regions (conditions) using a cohort of heterogeneous stock rats. We also provide an accommodating C and Python based package that implements RCFGL. |
| format | Article |
| id | doaj-art-5349a71db846473b9386cce6052cd0e2 |
| institution | DOAJ |
| issn | 1553-734X 1553-7358 |
| language | English |
| publishDate | 2023-01-01 |
| publisher | Public Library of Science (PLoS) |
| record_format | Article |
| series | PLoS Computational Biology |
| spelling | doaj-art-5349a71db846473b9386cce6052cd0e22025-08-20T03:16:11ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582023-01-01191e101075810.1371/journal.pcbi.1010758RCFGL: Rapid Condition adaptive Fused Graphical Lasso and application to modeling brain region co-expression networks.Souvik SealQunhua LiElle Butler BasnerLaura M SabaKaterina KechrisInferring gene co-expression networks is a useful process for understanding gene regulation and pathway activity. The networks are usually undirected graphs where genes are represented as nodes and an edge represents a significant co-expression relationship. When expression data of multiple (p) genes in multiple (K) conditions (e.g., treatments, tissues, strains) are available, joint estimation of networks harnessing shared information across them can significantly increase the power of analysis. In addition, examining condition-specific patterns of co-expression can provide insights into the underlying cellular processes activated in a particular condition. Condition adaptive fused graphical lasso (CFGL) is an existing method that incorporates condition specificity in a fused graphical lasso (FGL) model for estimating multiple co-expression networks. However, with computational complexity of O(p2K log K), the current implementation of CFGL is prohibitively slow even for a moderate number of genes and can only be used for a maximum of three conditions. In this paper, we propose a faster alternative of CFGL named rapid condition adaptive fused graphical lasso (RCFGL). In RCFGL, we incorporate the condition specificity into another popular model for joint network estimation, known as fused multiple graphical lasso (FMGL). We use a more efficient algorithm in the iterative steps compared to CFGL, enabling faster computation with complexity of O(p2K) and making it easily generalizable for more than three conditions. We also present a novel screening rule to determine if the full network estimation problem can be broken down into estimation of smaller disjoint sub-networks, thereby reducing the complexity further. We demonstrate the computational advantage and superior performance of our method compared to two non-condition adaptive methods, FGL and FMGL, and one condition adaptive method, CFGL in both simulation study and real data analysis. We used RCFGL to jointly estimate the gene co-expression networks in different brain regions (conditions) using a cohort of heterogeneous stock rats. We also provide an accommodating C and Python based package that implements RCFGL.https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1010758&type=printable |
| spellingShingle | Souvik Seal Qunhua Li Elle Butler Basner Laura M Saba Katerina Kechris RCFGL: Rapid Condition adaptive Fused Graphical Lasso and application to modeling brain region co-expression networks. PLoS Computational Biology |
| title | RCFGL: Rapid Condition adaptive Fused Graphical Lasso and application to modeling brain region co-expression networks. |
| title_full | RCFGL: Rapid Condition adaptive Fused Graphical Lasso and application to modeling brain region co-expression networks. |
| title_fullStr | RCFGL: Rapid Condition adaptive Fused Graphical Lasso and application to modeling brain region co-expression networks. |
| title_full_unstemmed | RCFGL: Rapid Condition adaptive Fused Graphical Lasso and application to modeling brain region co-expression networks. |
| title_short | RCFGL: Rapid Condition adaptive Fused Graphical Lasso and application to modeling brain region co-expression networks. |
| title_sort | rcfgl rapid condition adaptive fused graphical lasso and application to modeling brain region co expression networks |
| url | https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1010758&type=printable |
| work_keys_str_mv | AT souvikseal rcfglrapidconditionadaptivefusedgraphicallassoandapplicationtomodelingbrainregioncoexpressionnetworks AT qunhuali rcfglrapidconditionadaptivefusedgraphicallassoandapplicationtomodelingbrainregioncoexpressionnetworks AT ellebutlerbasner rcfglrapidconditionadaptivefusedgraphicallassoandapplicationtomodelingbrainregioncoexpressionnetworks AT lauramsaba rcfglrapidconditionadaptivefusedgraphicallassoandapplicationtomodelingbrainregioncoexpressionnetworks AT katerinakechris rcfglrapidconditionadaptivefusedgraphicallassoandapplicationtomodelingbrainregioncoexpressionnetworks |