Genome characterization through dichotomic classes: An analysis of the whole chromosome 1 of A. thaliana

In this article we show how dichotomic classes, binary variables naturally derived from a new mathematical model of the genetic code, can be used in order to characterize different parts of the genome. In particular, we analyze and compare different parts of whole chromosome 1 of Arabidopsis thalian...

Full description

Saved in:
Bibliographic Details
Main Authors: Enrico Properzi, Simone Giannerini, Diego Luis Gonzalez, Rodolfo Rosa
Format: Article
Language:English
Published: AIMS Press 2012-11-01
Series:Mathematical Biosciences and Engineering
Subjects:
Online Access:https://www.aimspress.com/article/doi/10.3934/mbe.2013.10.199
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832590107059683328
author Enrico Properzi
Simone Giannerini
Diego Luis Gonzalez
Rodolfo Rosa
author_facet Enrico Properzi
Simone Giannerini
Diego Luis Gonzalez
Rodolfo Rosa
author_sort Enrico Properzi
collection DOAJ
description In this article we show how dichotomic classes, binary variables naturally derived from a new mathematical model of the genetic code, can be used in order to characterize different parts of the genome. In particular, we analyze and compare different parts of whole chromosome 1 of Arabidopsis thaliana: genes, exons, introns, coding sequences (CDS), intergenes, untranslated regions (UTR) and regulatory sequences. In order to accomplish the task we encode each sequence in the 3 possible reading frames according to the definitions of the dichotomic classes (parity, Rumer and hidden). Then, we perform a statistical analysis on the binary sequences. Interestingly, the results show that coding and non-coding sequences have different patterns and proportions of dichotomic classes. This suggests that the frame is important only for coding sequences and that dichotomic classes can be useful to recognize them. Moreover, such patterns seem to be more enhanced in CDS than in exons. Also, we derive an independence test in order to assess whether the percentages observed could be considered as an expression of independent random processes. The results confirm that only genes, exons and CDS seem to possess a dependence structure that distinguishes them from i.i.d sequences. Such informational content is independent from the global proportion of nucleotides of a sequence. The present work confirms that the recent mathematical model of the genetic code is a new paradigm for understanding the management and the organization of genetic information and is an innovative tool for investigating informational aspects of error detection/correction mechanisms acting at the level of DNA replication.
format Article
id doaj-art-35adf65987c2419882e0c17bd66559ae
institution Kabale University
issn 1551-0018
language English
publishDate 2012-11-01
publisher AIMS Press
record_format Article
series Mathematical Biosciences and Engineering
spelling doaj-art-35adf65987c2419882e0c17bd66559ae2025-01-24T02:25:25ZengAIMS PressMathematical Biosciences and Engineering1551-00182012-11-0110119921910.3934/mbe.2013.10.199Genome characterization through dichotomic classes: An analysis of the whole chromosome 1 of A. thalianaEnrico Properzi0Simone Giannerini1Diego Luis Gonzalez2Rodolfo Rosa3Dipartimento di Scienze Statistiche, Università di Bologna, Via delle Belle Arti 41, 40126, BolognaDipartimento di Scienze Statistiche, Università di Bologna, Via delle Belle Arti 41, 40126, BolognaDipartimento di Scienze Statistiche, Università di Bologna, Via delle Belle Arti 41, 40126, BolognaDipartimento di Scienze Statistiche, Università di Bologna, Via delle Belle Arti 41, 40126, BolognaIn this article we show how dichotomic classes, binary variables naturally derived from a new mathematical model of the genetic code, can be used in order to characterize different parts of the genome. In particular, we analyze and compare different parts of whole chromosome 1 of Arabidopsis thaliana: genes, exons, introns, coding sequences (CDS), intergenes, untranslated regions (UTR) and regulatory sequences. In order to accomplish the task we encode each sequence in the 3 possible reading frames according to the definitions of the dichotomic classes (parity, Rumer and hidden). Then, we perform a statistical analysis on the binary sequences. Interestingly, the results show that coding and non-coding sequences have different patterns and proportions of dichotomic classes. This suggests that the frame is important only for coding sequences and that dichotomic classes can be useful to recognize them. Moreover, such patterns seem to be more enhanced in CDS than in exons. Also, we derive an independence test in order to assess whether the percentages observed could be considered as an expression of independent random processes. The results confirm that only genes, exons and CDS seem to possess a dependence structure that distinguishes them from i.i.d sequences. Such informational content is independent from the global proportion of nucleotides of a sequence. The present work confirms that the recent mathematical model of the genetic code is a new paradigm for understanding the management and the organization of genetic information and is an innovative tool for investigating informational aspects of error detection/correction mechanisms acting at the level of DNA replication.https://www.aimspress.com/article/doi/10.3934/mbe.2013.10.199dichotomic classesarabidopsis thalianagenetic codestatistical tests.
spellingShingle Enrico Properzi
Simone Giannerini
Diego Luis Gonzalez
Rodolfo Rosa
Genome characterization through dichotomic classes: An analysis of the whole chromosome 1 of A. thaliana
Mathematical Biosciences and Engineering
dichotomic classes
arabidopsis thaliana
genetic code
statistical tests.
title Genome characterization through dichotomic classes: An analysis of the whole chromosome 1 of A. thaliana
title_full Genome characterization through dichotomic classes: An analysis of the whole chromosome 1 of A. thaliana
title_fullStr Genome characterization through dichotomic classes: An analysis of the whole chromosome 1 of A. thaliana
title_full_unstemmed Genome characterization through dichotomic classes: An analysis of the whole chromosome 1 of A. thaliana
title_short Genome characterization through dichotomic classes: An analysis of the whole chromosome 1 of A. thaliana
title_sort genome characterization through dichotomic classes an analysis of the whole chromosome 1 of a thaliana
topic dichotomic classes
arabidopsis thaliana
genetic code
statistical tests.
url https://www.aimspress.com/article/doi/10.3934/mbe.2013.10.199
work_keys_str_mv AT enricoproperzi genomecharacterizationthroughdichotomicclassesananalysisofthewholechromosome1ofathaliana
AT simonegiannerini genomecharacterizationthroughdichotomicclassesananalysisofthewholechromosome1ofathaliana
AT diegoluisgonzalez genomecharacterizationthroughdichotomicclassesananalysisofthewholechromosome1ofathaliana
AT rodolforosa genomecharacterizationthroughdichotomicclassesananalysisofthewholechromosome1ofathaliana