The distribution of genes in the human genome
- PMID: 2055469
- DOI: 10.1016/0378-1119(91)90364-h
The distribution of genes in the human genome
Abstract
Previous investigations on the human genome determined: (i) the base compositions (GC levels) and the relative amounts of its isochore families; (ii) the compositional correlations (i.e., the correlations between GC levels) between third codon positions of a set of genes and the DNA fractions in which the genes were localized; and (iii) the compositional correlations between (a) third and first + second codon positions, as well as that between (b) introns and exons from the set of 'localized genes' and from all the coding sequences and genes (genomic sequences of exons + introns) available in gene banks. Here, we have shown that the correlations (iii, a and b) for 'localized genes' and genes from the bank are in full agreement, indicating that the former set is representative of the latter. We have then used the data (i) and the correlation (ii) to estimate the distribution of genes in isochore families. We have found that 34% of the genes are located in the GC-poor isochores (which represent 62% of the genome), 38% in the GC-rich isochores (31% of the genome) and 28% in the GC-richest isochores (3% of the genome). There is, therefore, a compositional gradient of gene concentration in the human genome. The gene density in the GC-richest 3% of the genome is about eight times higher than in the GC-rich 31%, and about 16 times higher than in the GC-poorest 62%.
Similar articles
-
The compositional properties of human genes.J Mol Evol. 1991 Jun;32(6):493-503. doi: 10.1007/BF02102651. J Mol Evol. 1991. PMID: 1908020
-
The vertebrate genome: isochores and evolution.Mol Biol Evol. 1993 Jan;10(1):186-204. doi: 10.1093/oxfordjournals.molbev.a039994. Mol Biol Evol. 1993. PMID: 8450755 Review.
-
Correlations between the compositional properties of human genes, codon usage, and amino acid composition of proteins.J Mol Evol. 1991 Jun;32(6):504-10. doi: 10.1007/BF02102652. J Mol Evol. 1991. PMID: 1908021
-
Statistical analysis of vertebrate sequences reveals that long genes are scarce in GC-rich isochores.J Mol Evol. 1995 Mar;40(3):308-17. doi: 10.1007/BF00163235. J Mol Evol. 1995. PMID: 7723057
-
Recombination explains isochores in mammalian genomes.Trends Genet. 2003 Mar;19(3):128-30. doi: 10.1016/S0168-9525(03)00021-0. Trends Genet. 2003. PMID: 12615004 Review.
Cited by
-
The bimodal distribution of genic GC content is ancestral to monocot species.Genome Biol Evol. 2014 Dec 19;7(1):336-48. doi: 10.1093/gbe/evu278. Genome Biol Evol. 2014. PMID: 25527839 Free PMC article.
-
ChIPr: accurate prediction of cohesin-mediated 3D genome organization from 2D chromatin features.Genome Biol. 2024 Jan 12;25(1):15. doi: 10.1186/s13059-023-03158-7. Genome Biol. 2024. PMID: 38217027 Free PMC article.
-
Reverse polarization in amino acid and nucleotide substitution patterns between human-mouse orthologs of two compositional extrema.DNA Res. 2007 Aug 31;14(4):141-54. doi: 10.1093/dnares/dsm015. Epub 2007 Sep 25. DNA Res. 2007. PMID: 17895298 Free PMC article.
-
Identification and physical localization of useful genes and markers to a major gene-rich region on wheat group 1S chromosomes.Genetics. 2001 Apr;157(4):1735-47. doi: 10.1093/genetics/157.4.1735. Genetics. 2001. PMID: 11290727 Free PMC article.
-
The GC-heterogeneity of teleost fishes.BMC Genomics. 2008 Dec 24;9:632. doi: 10.1186/1471-2164-9-632. BMC Genomics. 2008. PMID: 19108743 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Medical
Miscellaneous