Australasian and Southeast Asian Tissue Typing Association
30th scientific meeting 22-24 November 2006, Chiangmai, Thailand
Implications of fractal organization of DNA on disease risk genomic mapping and immune function analysis
Malcolm J. Simons (Haplomic Technologies, Melbourne)
Andras J. Pellionisz (HelixoMetry, Sunnyvale, California)
Genetic risk for over 100 diseases has been localized to the MHC. While many of these associations have been established for more than 30 years, the mechanisms remain virtually unknown. In the last century, it was expected that gene characterisation, projected to culminate with the completion of the Human Genome Project, would at last reveal all its causal secrets. With the realization that genetic risk can not be attributed to genes alone, the era of Genetics, with exclusive focus on genes, and dismissal of non-protein coding DNA as “Junk”, has now ended. In this new century of Post-Genetics attention is being increasingly directed to non-coding intron and intergenic contributions to genetic phenomena; Genetics beyond Genes. Now a major challenge is to understand the ‘language’ of all DNA, including non-protein coding sequence. One possibility, first considered in the 1990’s, is that DNA is fractal, with the corollary that sequence similarity search algorithms assuming Euclidean geometry may fail to reveal sequence patterns with non-Euclidean properties. Very recently, Rigoutsos and colleagues at IBM reported the presence of over 66 million motifs in the human genome with over 128,000 repetitive short-length sequences. This strongly suggests a multi-component composition of the genome involving non-coding as well as coding elements.
Beyond characterization of DNA sequence patterns, the task will also be to discern functional implications. A fractal approach, first proposed by one of us (AJP, 2002) causally connected fractal structure with fractal processes. Application to the prokaryote Mycoplasma genitalium, as the smallest genome of free living organisms, showed that whole genome accords with the Zipf/Mandelbrot parabolic fractal distribution (personal communication, AJP). This establishes a fractality in the Mycoplasma genome.
In M.genitalium, near-contiguous iterative repetitions of fractosets composing fractogems occur in the intergenic sequences. An immunological phenomenon that directly impinges on genomic organization involves these intergenic repeat elements, which are in coding gene-reading frame but lack an open reading frame. Antigenic phase variation occurs as a result of homologous recombination between FractoGem-containing intergenic sequence and coding genes, in turn subserving escape from immune surveillance and chronicity of infection.
We are now applying to the MHC these genomic fractal analysis tools which the two of us earlier espoused (2006). In the first instance we are examining the 300 kb psoriasis risk region. The goal is to determine whether any fractal composition of multi-SNP haplotypes reveals disease associated risks greater than those observed with recombinant haplotype analysis, thereby refining the mapping of disease genetic risk.
Pellionisz, A. (2002) FractoGene: Utility to use self-similar repetitions in the language-like genetic information as fractal sets. US Patent Application (Aug. 1st, 2002)
Rigoutsos, Isidore, Tien Huynh, Kevin Miranda, Aristotelis Tsirigos, Alice McHardy, and Daniel Platt (2006) Short blocks from the noncoding parts of the human genome have instances within nearly all known genes and relate to biological processes PNAS April 25, 2006 vol. 103 no. 17 6605-6610 |
Simons, M, Pellionisz, A;(2006) Genomics, morphogenesis and biophysics: Triangulation of Purkinje cell development. The Cerebellum; 5(1): 27-35