Comprehensive analysis of computational approaches in plant transcription factors binding regions discovery

Transcription factors (TFs) are regulatory proteins which bind to a specific DNA region known as the transcription factor binding regions (TFBRs) to regulate the rate of transcription process. The identification of TFBRs has been made possible by a number of experimental and computational techniques...

Full description

Saved in:
Bibliographic Details
Main Authors: Jyoti, Ritu, Sagar Gupta, Ravi Shankar
Format: Article
Language:English
Published: Elsevier 2024-10-01
Series:Heliyon
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2405844024151717
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846170160411443200
author Jyoti
Ritu
Sagar Gupta
Ravi Shankar
author_facet Jyoti
Ritu
Sagar Gupta
Ravi Shankar
author_sort Jyoti
collection DOAJ
description Transcription factors (TFs) are regulatory proteins which bind to a specific DNA region known as the transcription factor binding regions (TFBRs) to regulate the rate of transcription process. The identification of TFBRs has been made possible by a number of experimental and computational techniques established during the past few years. The process of TFBR identification involves peak identification in the binding data, followed by the identification of motif characteristics. Using the same binding data attempts have been made to raise computational models to identify such binding regions which could save time and resources spent for binding experiments. These computational approaches depend a lot on what way they learn and how. These existing computational approaches are skewed heavily around human TFBRs discovery, while plants have drastically different genomic setup for regulation which these approaches have grossly ignored. Here, we provide a comprehensive study of the current state of the matters in plant specific TF discovery algorithms. While doing so, we encountered several software tools’ issues rendering the tools not useable to researches. We fixed them and have also provided the corrected scripts for such tools. We expect this study to serve as a guide for better understanding of software tools’ approaches for plant specific TFBRs discovery and the care to be taken while applying them, especially during cross-species applications. The corrected scripts of these software tools are made available at https://github.com/SCBB-LAB/Comparative-analysis-of-plant-TFBS-software.
format Article
id doaj-art-694a46c3b3f24f1c983136197688f45f
institution Kabale University
issn 2405-8440
language English
publishDate 2024-10-01
publisher Elsevier
record_format Article
series Heliyon
spelling doaj-art-694a46c3b3f24f1c983136197688f45f2024-11-12T05:19:53ZengElsevierHeliyon2405-84402024-10-011020e39140Comprehensive analysis of computational approaches in plant transcription factors binding regions discovery Jyoti0 Ritu1Sagar Gupta2Ravi Shankar3Studio of Computational Biology & Bioinformatics, The Himalayan Centre for High-throughput Computational Biology, (HiCHiCoB, A BIC Supported by DBT, India), Biotechnology Division, CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT), Palampur, (HP), 176061, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, IndiaStudio of Computational Biology & Bioinformatics, The Himalayan Centre for High-throughput Computational Biology, (HiCHiCoB, A BIC Supported by DBT, India), Biotechnology Division, CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT), Palampur, (HP), 176061, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, IndiaStudio of Computational Biology & Bioinformatics, The Himalayan Centre for High-throughput Computational Biology, (HiCHiCoB, A BIC Supported by DBT, India), Biotechnology Division, CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT), Palampur, (HP), 176061, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, IndiaStudio of Computational Biology & Bioinformatics, The Himalayan Centre for High-throughput Computational Biology, (HiCHiCoB, A BIC Supported by DBT, India), Biotechnology Division, CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT), Palampur, (HP), 176061, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, Uttar Pradesh, 201002, India; Corresponding author. author. Studio of Computational Biology & Bioinformatics, The Himalayan Centre for High-throughput Computational Biology, (HiCHiCoB, A BIC supported by DBT, India), Biotechnology Division CSIR-Institute of Himalayan Bioresource Technology (CSIR-IHBT), Palampur, (HP), 176061, India.Transcription factors (TFs) are regulatory proteins which bind to a specific DNA region known as the transcription factor binding regions (TFBRs) to regulate the rate of transcription process. The identification of TFBRs has been made possible by a number of experimental and computational techniques established during the past few years. The process of TFBR identification involves peak identification in the binding data, followed by the identification of motif characteristics. Using the same binding data attempts have been made to raise computational models to identify such binding regions which could save time and resources spent for binding experiments. These computational approaches depend a lot on what way they learn and how. These existing computational approaches are skewed heavily around human TFBRs discovery, while plants have drastically different genomic setup for regulation which these approaches have grossly ignored. Here, we provide a comprehensive study of the current state of the matters in plant specific TF discovery algorithms. While doing so, we encountered several software tools’ issues rendering the tools not useable to researches. We fixed them and have also provided the corrected scripts for such tools. We expect this study to serve as a guide for better understanding of software tools’ approaches for plant specific TFBRs discovery and the care to be taken while applying them, especially during cross-species applications. The corrected scripts of these software tools are made available at https://github.com/SCBB-LAB/Comparative-analysis-of-plant-TFBS-software.http://www.sciencedirect.com/science/article/pii/S2405844024151717Transcription factorsTranscription factor binding regionsMachine learningDeep learningTF-DNA binding specificity
spellingShingle Jyoti
Ritu
Sagar Gupta
Ravi Shankar
Comprehensive analysis of computational approaches in plant transcription factors binding regions discovery
Heliyon
Transcription factors
Transcription factor binding regions
Machine learning
Deep learning
TF-DNA binding specificity
title Comprehensive analysis of computational approaches in plant transcription factors binding regions discovery
title_full Comprehensive analysis of computational approaches in plant transcription factors binding regions discovery
title_fullStr Comprehensive analysis of computational approaches in plant transcription factors binding regions discovery
title_full_unstemmed Comprehensive analysis of computational approaches in plant transcription factors binding regions discovery
title_short Comprehensive analysis of computational approaches in plant transcription factors binding regions discovery
title_sort comprehensive analysis of computational approaches in plant transcription factors binding regions discovery
topic Transcription factors
Transcription factor binding regions
Machine learning
Deep learning
TF-DNA binding specificity
url http://www.sciencedirect.com/science/article/pii/S2405844024151717
work_keys_str_mv AT jyoti comprehensiveanalysisofcomputationalapproachesinplanttranscriptionfactorsbindingregionsdiscovery
AT ritu comprehensiveanalysisofcomputationalapproachesinplanttranscriptionfactorsbindingregionsdiscovery
AT sagargupta comprehensiveanalysisofcomputationalapproachesinplanttranscriptionfactorsbindingregionsdiscovery
AT ravishankar comprehensiveanalysisofcomputationalapproachesinplanttranscriptionfactorsbindingregionsdiscovery