|
PRINTER
VERSION
|
|
Genes: Only dna chain segments, or functional units of information? Jorge Barragán © Copyright 2004 Jorge Barragán. © Copyright 2004 Research Institute on Human Evolution. All rights reserved.
INTRODUCTION |
|
The genome capacity self-organize, or to guide the organism self-organization, is a condition that respects any model of the same one (1) (2) (3) (4) (5). Even the same human genome project, assumes that the organism genes set demands for himself such condition (6) (7) (8) (9) (10). The model whose accuracy will be evaluated, is the one that happens of the atractor call theory, developed by Stuart Kauffman. (1) One is a model that leaves from the base to consider that a proportional relation between the amount of DNA or value "c" exists in the alive beings, and the genes number that they have (all models do not do it) (2) (3) (9) (10). The genome would behave like a NK2 type random boolean network. That means, networks of "n" elements, in which the one state anyone of them depends on (or it is determined by) been on other two elements of the network (K=2). In this network type, the order emerges from spontaneous way. The system will display in addition, the capacity to draw for disturbances (homeostasis). The self-organization and the homeostasis, are two fundamental characteristics of the biological physical systems (the alive beings) (11). It is remarkable that if the network outside type N equal K (N=K) , its behavior would be chaotic, and if outside NK1 "would crystallize" in a high degree of order that it would prevent to draw for disturbances him (to adapt to the possible disturbances). The systems of this type, become stabilized (they cross all his possible expression guidelines) after an cycles number equal to the square root of "n" ( OBJECTIVES |
|
To verify if the observed thing in the reality, agrees with the hoped thing by the model predictions as far as the different possible cellular types number in the organism. To analyze the relation between the genes number found by the human genome project, and the different cellular types number whereupon we counted in the organism, such are the quantitative aspects of the human genome project whose analysis refers the work title. To propose a new concept of gene, like a functional unit of information, beyond the classical definition of "DNA chain segments that codify the information for the polypeptides synthesis (structural molecular units)". DESIGN AND RESULTS |
|
The model´s data, as far as genes number and the one of different possible cellular types, will be compared with observed in the human genome project. In order to evaluate an aspect of model´s accuraccy as it is his calibration, and the relation between different cellular types number and the findings from the genome project, the test of Hosmer-Lemeshow will be used. The model (1) supposes a base of 100,000 to 136,900 genes (value of "n"), reason why if different cellular types number is square root of n, the same one would go up to around a value between the 316 and 370 different cells types in the organism. The data coming from the human genome project, confirm that we would count on genes number included between the 30,000 and the 40.000 (12). It happens of real identification and genes count, and not of its "mathematical estimation by model". Following the formula The different cellular types number that they are in the organism, is of 200 to 254 according to the last counts that the histology offers (13) (14) (15) (1). One is an interest data, at time of evaluating the model´s predictions, and the human genome project´s estimations. The statistical analysis sample that the model is valid (p<0.005), in the same way that the prediction on different cellular types number from the genome project, also is in the correct magnitude order (p<0.005). However both predictions only are statistical approaches to reality. It isn´t 100,000 the genes number, neither is 173 the different celullar types number. DISCUSSION |
|
The results demonstrate that the model overestimate the genes number to the supposition that our genome is constituted by about 100,000 elements. The genes number count carried out in the human genome project, gives account of not more of about 30,000 to 40,000 elements. The model predicts then, 316 to 370 different cellular types existence, while that human genome project´s data is deduced that must not more have 173 to 200 different cellular types.
But the formula
This characteristic is not a "theoretical device", but an observation of the reality, a characteristic of the essence or intrinsic nature of our genomic system. And it is to hope that they fulfill so much the model, as the human genome project. But the real number of different cellular types whereupon counts the organism, is 200 to 254. This takes to consider that the model´s evaluation, could show the existence of an anomaly (from an epistemologyc point of view) in the theoretical genetics bases. Identifying and recounting DNA chain segments that codify the information for (16) (17) the polypeptides synthesis, the human genome project adjusts to the gene definition like structural molecular unit carrier of information. But its count of 30,000 to 40,000 genes, takes to wait for 173 to us to 200 cellular types, number that does not respond to the real observation of 200 to 254 different cellular types (200 it isn´t heigth above of different cellular types, but its lower). The analyzed genomic model, supposes the existence of about 100,000 or more of these molecular structural units. Number that does not correspond with the real values found by the human genome project. In addition, the model predicts with base in the false premise to count on about 100,000 (or little more) genes, than different cellular types number would be of 316 or 370. Number that does not agree either with the real count of different cellular types presents in the organism. The model overestimate of different cellular types number that they are possible to be found in the organism. But it not must to that the knowed formula square root of n = River Basins ( One would be three genome objective characteristics, but opposed to each other. So that arrived east point, it is worth to question itself what conclusions can be extracted of these opposed facts, or better still, will be able to be unified these genomic characteristics under a same and only explanation? CONCLUSIONS |
|
One possible conclusions is, to assume that since we counted with not more than 30,000 or 40,000 structural molecular units carriers of information (genes), the genome does not behave like a self-regulated NK2 type network, reason why the formula square root of n = River Basins (
Another possible conclusion is perhaps that, is not correct the value of 254 different cellular types, and the notion is due to review of which we considered functional properties and morphologic differentials between the cells. This would allow to consider like the findings of the human genome project with respect to genes number valid (30,000 to 40,000) and the genomic self-regulation (
Another possible conclusion would be, to consider that the molecular structural units, or the DNA chain segments called genes, are not in carrying themselves of information, but that the same one is from the interaction between products of these molecular structures, so that the genes were in fact functions or carrying functional units of information. In such case, the finding of 30,000 to 40,000 structural units (human genome project), would be compatible with the notion of the genome as a self-regulated network ( REFERENCES |
|
|
|
It is forbidden the reproduction of this contents without the authorization of their authors or HumanEvol.com.
Zurbano 72, 4ºJ. 28010, Madrid (Spain). Telephone: (+34) 91 3086746 / E-mail: contact@humanevol.com |