Vodka! Jean Claude Perez, the golden ratio, dragon curve fractals and musical design in “junk DNA”

_{scordova
April 25, 2014

'Junk DNA', Intelligent Design

10}_{Categories
'Junk DNA'
Intelligent Design}

Share: Facebook; Twitter/X; LinkedIn; Flipboard; Print; Email

Jean Claude Perez is a self-organizational theorist, he is not a creationist. He has also published papers with an occasional visitor to UD, Andras Pellionisz.

If the mathematical/musical patterns Perez has found in DNA are improbable relative to laws of physics and chemistry, then he may have found yet another design feature of DNA, and this feature is found by combining coding DNA with non-coding DNA and viewing it holistically.

Here is the simplest explanation I found of his work:

When cells replicate, they count the total number of letters in the DNA strand of the daughter cell. If the letter counts don’t match certain exact ratios, the cell knows that an error has been made. So it abandons the operation and kills the new cell.

Failure of this checksum mechanism causes birth defects and cancer.

Jean-Claude Perez discovered an evolutionary mathematical matrix in DNA, based on the Golden Ratio 1.618

Dr. Jean-Claude Perez started counting letters in DNA. He discovered that these ratios are highly mathematical and based on “Phi”, the Golden Ratio 1.618. This is a very special number, sort of like Pi. Perez’ discovery was published in the scientific journal Interdisciplinary Sciences / Computational Life Sciences in September 2010.

Jean-Claude Perez discovered an evolutionary mathematical matrix in DNA, based on the Golden Ratio 1.618

Before I tell you about it, allow me to explain just a little bit about the genetic code.

DNA has four symbols, T, C, A and G. These symbols are grouped into letters made from combinations of 3 symbols, called triplets. There are 4x4x4=64 possible combinations.

So the genetic alphabet has 64 letters. The 64 letters are used to write the instructions that make amino acids and proteins.

Perez somehow figured out that if he arranged the letters in DNA according to a T-C-A-G table, an interesting pattern appeared when he counted the letters.

He divided the table in half as you see below. He took single stranded DNA of the human genome, which has 1 billion triplets. He counted the population of each triplet in the DNA and put the total in each slot:

When he added up the letters, the ratio of total white letters to black letters was 1:1. And this turned out to not just be roughly true. It was exactlytrue, to better than one part in one thousand, i.e. 1.000:1.000.

Then Perez divided the table this way:

Perez discovered that the ratio of white letters to black letters is exactly 0.690983, which is (3-Phi)/2. Phi is the number 1.618, the “Golden Ratio.”

He also discovered the exact same ratio, 0.690983, when he divided the table the following two alternative ways:

Perez discovered that the ratio of white letters to black letters is exactly 0.690983, which is (3-Phi)/2. Phi is the number 1.618, the “Golden Ratio.”

He also discovered the exact same ratio, 0.690983, when he divided the table the following two alternative ways:

Above: Total ratio of white:black letters = 1:1

So for three ways of dividing the table, the ratio of white to black is 1.000:1.000.

And for the other three ways of dividing it, the ratio is 0.690983 or (3-Phi)/2.

When you overlay these 6 symmetries on top of each other, you get a set of mathematical stairs with 32 golden steps. Then an absolutely fascinating geometrical pattern emerges: The “Dragon Curve” which is well known in fractal geometry. Here it is, labeled with DNA letters in descending frequency:

You can see other non-DNA, computer generated versions of this same curve here.

Other interesting facts:
•Similar patterns with variations on these same rules are seen across a range of 20 different species. From the AIDS virus to bacteria, primates and humans
•Each character in DNA occurs a precise number of times, and each has a twin. TTT and AAA are twins and appear the most often; they’re the DNA equivalent of the letter E.
•This pattern creates a stair step of 32 frequencies, a specific frequency for each pair.
•The number of triplets that begin with a T is precisely the same as the number of triplets that begin with A (to within 0.1%).
•The number of triplets that begin with a C is precisely the same as the number of triplets that begin with G.
•The genetic code table is fractal – the same pattern repeats itself at every level. The micro scale controls conversion of triplets to amino acids, and it’s in every biology book. The macro scale, newly discovered by Dr. Perez, checks the integrity of the entire organism.
•Perez is also discovering additional patterns within the pattern.

I am only giving you the tip of the iceberg. There are other rules and layers of detail that I’m omitting for simplicity. Perez presses forward with his research; more papers are in the works, and if you’re able to read French, I recommend his book “Codex Biogenesis” and his French website. Here is an English translation.

(By the way, he found some of his most interesting data in what used to be called “Junk DNA.” It turns out to not be junk at all.)

OK, so what does all this mean?
•Copying errors cannot be the source of evolutionary progress, because if that were true, eventually all the letters would be equally probable.
•This proves that useful evolutionary mutations are not random. Instead, they are controlled by a precise Evolutionary Matrix to within 0.1%
•When organisms exchange DNA with each other through Horizontal Gene Transfer, the end result still obeys specific mathematical patterns
•DNA is able to re-create destroyed data by computing checksums in reverse – like calculating the missing contents of a page ripped out of a novel.

No man-made language has this kind of precise mathematical structure. DNA is a tightly woven, highly efficient language that follows extremely specific rules. Its alphabet, grammar and overall structure are ordered by a beautiful set of mathematical functions.

More interesting factoids:

The most common pair of letters (TTT and AAA) appears exactly 1/13X as often as all the letters combined – consistently, the genomes of humans and chimpanzees.

If you put the 32 most common triplets in Group 1 and the 32 least common triplets in Group 2, the ratio of letters in Group1:Group2 is exactly 2:1. And since triplet counts occur in symmetrical pairs (TTT-AAA, TAT-ATA, etc), you can group them into four groups of 16.

When you put those four triplet populations on a graph, you get the peace symbol:

dna_peace_symbol

Does this precise set of rules and symmetries appear random or accidental to you?

My friend, this is how it is possible for DNA to be a code that is self-repairing, self-correcting, self-re-writing and self-evolving. It reveals a level of engineering and sophistication that human engineers could only dream of. Most of all, it’s elegant.

Cancer has sometimes been described as “evolution run amok.” Dr. Perez has noted interesting distortions of this matrix in cancer cells. I strongly suspect that new breakthroughs in cancer research are hidden in this matrix.

I submit to you that the most productive research that can possibly be conducted in medicine and computer science is intensive study of the DNA Evolution Matrix. Like I said, this is just the tip of the iceberg.

There is so much more here to discover!

When we develop computer languages based on DNA language, they will be capable of extreme data compression, error correction, and yes, self-evolution. Imagine: Computer programs that add features and improve with time. All by themselves.

What would that be like?

Perry Marshall

Shift Frequency Jean Claude Perez

Elizabeth Liddle with her love of music might actually appreciate the possibility that life and musical notes are possibly intertwined.

I can’t do justice to the topic by quoting snippets, so I will have to provide a link:
Phi and Music DNA Here is diagram from that article:

Recall the discussion of ID and tuning of musical instruments here.

Perez bio is in creation wiki:

Jean-Claude Perez, Ph.D., is a French interdisciplinary scientist born on June 26, 1947 in Bassens, Gironde near Bordeaux (France). An engineer and French scholar from Bordeaux university[1], Perez worked principally with IBM in both the areas of Biomathematics and Artificial Intelligence (the first time, showing evidence of high level self-organization in cellular automata networks [2] and the second time creating neural networks with “FRACTAL CHAOS”(Fractal geometry), his holographic-like memory system and novelty detector). Then, in 1990, Jean-claude Perez published strong links between the world of fractals and numbers of the Fibonacci sequence which are based on the Golden ratio[3]. In this last area, with the “DNA supracode”[4], he proved that DNA coding for genes is structured by proportions related to Fibonacci numbers[5] [6] [7]. He verified this discovery in the field of the HIV genome by partnerships with Professor Luc Montagnier[8], the discoverer of the HIV virus. He has worked for 20 years in the fields of whole genome numerical analysis and numerical decoding of genes as coding or non-coding DNA sequences (as demonstrated particularly by the last publications: Five last publications/conferences ).
Particularly, in “Interdisciplinary Science” September 2010 issue, J.C. Perez published a peer-reviewed paper proving that the whole human genome codon populations are managed by a “DRAGON fractal paper folding curve” fine-tuned around the “Golden ratio”. Particularly, this main paper entitled “Codon populations in single-stranded whole human genome DNA are fractal and fine-tuned by the Golden Ratio 1.618.” shows that the Universal Genetic Code Table not only maps codons to amino acids, but serves as a global checksum matrix at the whole genome macro-structural scale.[9]

Golden ratio.jpg

Draft of the paper Final published paper
A complete summary of J.C Perez’s research was published in Pellionisz A., Graham R., Pellionisz P., Perez J.: Genome Function of the Cerebellum: Geometric Unification of Neuroscience and Genomics. In: Manto M., Gruol D., Schmahmann J., Koibuchi N., Rossi F. (Ed.) Handbook of the Cerebellum and Cerebellar Disorders: SpringerReference (www.springerreference.com). Springer-Verlag Berlin Heidelberg, -1. DOI: 10.1007/SpringerReference_310386 2012-03-12 14:15:14 UTC. Details in [10] and full paper available in [11]

In october 2013 jean-claude perez published a peer reviewed major article in [APPLIED MATHEMATICS]http://www.scirp.org/journal/am/ (BIOMATHEMATICS issue) entitled “The 3 genomic numbers discovery”. This article show – for the first way – that complete human genome single stranded DNA constitutes a WHOLE… [12]

http://creationwiki.org/Jean-claude_Perez

The “Vodka” designation means a highly speculative topic.

Comments

I have just carried out three experiments to verify the work done by Perez. I am happy to announce that his findings have been confirmed. I have created a video showing each of the experiments. You can view the video by going to Youtube and entering the phrase "Mathematical Patterns within DNA" Craig Paardekoopercraig373_{November 12, 2016
November
11
Nov
12
12
2016
06:29 AM
6
06
29
AM
PDT}

ERRATUM 2 last remarks on Gordon Davisson remarks 12 and 15: Post 12: you are wrong because precisely i analyse single stranded dna whole genome sequence where there is no reason to find trivial crick watson pairing symmetries… Post 15: i meet inventor of ISOCHORES PR GIORGIO BERNARDI 25 years ago in his lab paris sorbonne… There are no links between my analysis at codpn population scale and ISOCHORES ratios… We no not considere here the same level of genomic infpamation: him. Ratiod CG by species. Me. Codons tripletsjean-claude perez_{May 3, 2014
May
05
May
3
03
2014
03:10 AM
3
03
10
AM
PDT}

2 last remarks on Gordon Davisson remarks 12 and 15: Post 12: you are wrong because precisely i analyse single stranded dna whole genome sequence where there id no reason to find trivial crick watson pairing symmetries... Post 15: i meet inventor of ISOCHORES PR GIORGIO BERNARDI 25 years ago in his lab paris sorbonne... There are no links between my analysis at codpn population scale and ospchorrs ratips... We no not considere here the same level of henomic infprmation: him. Ratiod cg by species. Me. Codons tripletsjean-claude perez_{May 3, 2014
May
05
May
3
03
2014
03:01 AM
3
03
01
AM
PDT}

Contrarly The first pi analyse of QUARK MIST be studied seriously!jean-claude perez_{May 2, 2014
May
05
May
2
02
2014
10:17 AM
10
10
17
AM
PDT}

To Gordon: An important topic showing you run in error: precisely if i analyse SINGLE STRANDED DNA the trivial crick watson base pairing disappear . Then your analyse is wrong in the native article we discuss this topic... I m sorry...jean-claude perez_{May 2, 2014
May
05
May
2
02
2014
10:14 AM
10
10
14
AM
PDT}

Dears Pietr, Joe or Gordon, I'm sorry for your unapropried comments on SANDWALK of the Sal Cordova entry entitled Vodka! Jean Claude Perez, the golden ratio, dragon curve fractals and musical design in “junk DNA”... The reason is that all (ALL) their comments were done without reading the basic original article: I suggest you reading the original basic peer review article of 2010 published in Interdisciplinary Science: http://fr.scribd.com/doc/95641538/Codon-Populations-in-Single-stranded-Whole-Human-Genome-DNA-Are-Fractal-and-Fine-tuned-by-the-Golden-Ratio-1-618 and my 2013 peer review article: http://www.scirp.org/journal/PaperInformation.aspx?paperID=37457#.U2Mwlfl_trAjean-claude perez_{May 1, 2014
May
05
May
1
01
2014
10:50 PM
10
10
50
PM
PDT}

Quark1 Free Curiisity is on of the first qualities to do best and real ...research in science.. Like you do there!!jean-claude perez_{April 30, 2014
April
04
Apr
30
30
2014
12:12 PM
12
12
12
PM
PDT}

Hello Jean-Claude Perez No, I did this only for fun and I like math.quark1_{April 30, 2014
April
04
Apr
30
30
2014
11:27 AM
11
11
27
AM
PDT}

Quark1: intersting perhaps: our devise of researchers and not of numerologists: in french sorry: "il faut separer le bon grain de l'ivraie"jean-claude perez_{April 30, 2014
April
04
Apr
30
30
2014
10:41 AM
10
10
41
AM
PDT}

Quark1: I precise a bit: there is here the distance between SCIENCE and... NUMEROLOGY !jean-claude perez_{April 30, 2014
April
04
Apr
30
30
2014
09:31 AM
9
09
31
AM
PDT}

quark1: no comment... Here is the real limit of a real Scientific!jean-claude perez_{April 30, 2014
April
04
Apr
30
30
2014
09:14 AM
9
09
14
AM
PDT}

Hello again. Golden ratio formula: Other way to find the thi number. (((TTA)-(TTC))^2 + ((TCT)-(TAT))^2 + ((TGC) - (TGA))^2 + ((TAG)-(TCG))^2)/10^16 = 1,618 General Quadrilateral The general quadrilateral can be find from this numbers. We name the angles with the corners like e,f,v and w and the two diagonals for E and F. The length of the general quadrilateral can be named as A, B, C, and D The length can be calculate as A = ((TTT)-(TTC))+((TTG)-(TTA)) B = ((TTT)-(TCT))+((TGT)-(TAT)) C = ((TGT)-(TGC))+((TGG)-(TGA)) D = ((TTG)-(TCG))+((TGG)-(TAG)) The total angle for this geometry is 2pi, and we want to use the cosinus law to calculate the angles. But we need the diagonal for that and if we look at the numbers in the inner circle which is (TCC),(TAC),(TCA) and (TAA), this numbers maybe can be corresponding to the diagonals. One can see some kind of structure of this thing. However use that the diagonal is E = (TAC) and F = (TCA) e = cos^-1((E^2-A^2-C^2)/2AC) = 1,6371 f = cos^-1((E^2-B^2-D^2)/2BD) = 2,011 v = cos^-1((F^2-C^2-D^2)/2CD) = 0,83 w = cos^-1((F^2-A^2-B^2)/2AB) = 1,799 Thereafter, e + f + v + w = 2pi For this case I only get an error on 0,1%. Other cases this will be between 1 to 2 %. Probability other geometrical figures can be find, I´m nearly to find one formula for the circle from this thing but so far the error is about over 5 %.quark1_{April 30, 2014
April
04
Apr
30
30
2014
08:54 AM
8
08
54
AM
PDT}

https://uncommondescent.com/intelligent-design/professor-larry-moran-poses-five-questions-for-the-id-movement/#comment-498347jean-claude perez_{April 30, 2014
April
04
Apr
30
30
2014
07:56 AM
7
07
56
AM
PDT}

more data in: https://uncommondescent.com/intelligent-design/professor-larry-moran-poses-five-questions-for-the-id-movement/#comment-498347jean-claude perez_{April 30, 2014
April
04
Apr
30
30
2014
07:53 AM
7
07
53
AM
PDT}

Gordon, on your last main question: "Finally, is there any reason to regard the similarity between the GC/AT ratio and (3–Phi)/2 as anything other than a numeric coincidence? Especially since it varies a great deal between different regions of the genome, different chromosomes, and different species? Especially since there are many other formulae involving Phi that it could have come out close to (e.g. Phi/3, Phi/2, Phi-1, 4-2*Phi, 2*Phi/5, etc…)" 2 responses: 1/ In my 2013 article, it is show the universal nature of this value 'quarks etc...° 2/ applying this same analysis to: -whole human genome ==> (3-Phi)/2 -lower scale chromosome is chr4: ratio = 1/Phi -highter scale chromosome is chr19: ratio = 1/Phi + 1/Pi strange no? Then analysing by others approachs chr4 shows that - if there is "design" - the right research way must be... HUMAN CHROMOSOME4jean-claude perez_{April 30, 2014
April
04
Apr
30
30
2014
12:08 AM
12
12
08
AM
PDT}

Gordon, ALL your questions are fine and natural. But all find response in my 2 peer published articles referenced here: the 2010 article: http://fr.scribd.com/doc/95641538/Codon-Populations-in-Single-stranded-Whole-Human-Genome-DNA-Are-Fractal-and-Fine-tuned-by-the-Golden-Ratio-1-618 the 2013 article: http://file.scirp.org/Html/4-7401586_37457.htmjean-claude perez_{April 29, 2014
April
04
Apr
29
29
2014
06:09 PM
6
06
09
PM
PDT}

Gordon, I sent you an e-mail from my personal account. You'll get an e-mail from the CEU Admin "Hwang" as well. I really don't need your personal e-mail, just something address where I can privately transmit you username and password. I don't expect there will be a lot of traffic on that discussion board, and that is by design. I prefer it to be a repository of research. Salscordova_{April 29, 2014
April
04
Apr
29
29
2014
03:33 PM
3
03
33
PM
PDT}

Sal, thanks for your offer. I don't like to publish my regular email address anywhere spammer-searchable, but if you drop me a note at -------, I'll send you my real address. [e-mail edited to protect commenter privacy]Gordon Davisson_{April 29, 2014
April
04
Apr
29
29
2014
03:08 PM
3
03
08
PM
PDT}

Jean-Claude Perez, thank you for joining us! I have some questions that I'd appreciate if it you could answer: First, is there anyplace where you detail exactly how you performed your various analyses? I earlier used the description in the "Phi and Music in DNA" (by Jordi Solà-Soler), but it appears that doesn't match your method. For example, in the "The Number « 4 »" section of your Beijing paper, the counts for TTT and AAA are close, but not exactly equal (while Jordi's method will always give exactly equal counts for these triplets). How exactly did you do your counting? Second, it looks to me like you're reading a lot of meaning into features of the data that don't really seem (to me) to show anything significant. For example, in the "-III- Evidence of 2 « attractors »:« 1 » and « (3-Phi)/2 »" section of your Beijing paper, you make a great deal of the fact that ratio of triplets in the even and odd halves of the table (and even and odd octants, etc) are very close to 1. But isn't this just a trivial result of DNA base pairing rules (every T on one strand matches an A on the other, and every C on one matches a G on the other), together with a lack of bias for either half of the pair to be concentrated on one strand? In other words, if you counted the bases on both strands, DNA pairing guarantees that the number of Ts will match the number of As, and the number of Cs will match the number of Gs. And unless there's some consistent difference between the two strands that led, for example, most of the As to be in one strand and the matching Ts to be in the other, the counts should nearly match for each individual strand as well. Just as we see. (Note that your "codon level generalization of Chargaff's second rule" can also be explained the same way.) As for the ratios that come out to (3–Phi)/2, aren't these just the GC/AT ratios, filtered through the triplet table in various ways? Finally, is there any reason to regard the similarity between the GC/AT ratio and (3–Phi)/2 as anything other than a numeric coincidence? Especially since it varies a great deal between different regions of the genome, different chromosomes, and different species? Especially since there are many other formulae involving Phi that it could have come out close to (e.g. Phi/3, Phi/2, Phi-1, 4-2*Phi, 2*Phi/5, etc...)Gordon Davisson_{April 29, 2014
April
04
Apr
29
29
2014
02:53 PM
2
02
53
PM
PDT}

Dear SCORDOVA, your last question: "I have my doubts as well, and I was waiting to see if anyone else would chime in. My main issue is how did Perez do an alignment so as to identify triplets. There are huge “NNNN” segments in fasta files, so how does he know where to start recognizing triplets?" my response: on the about billion codon triplets within whole human genome, NNN undetermined nucleotides are unsignifiant. Then to be sure of consistency of analyses, we have analysed the 3 coden reading frames, compressing NNN bases. Then, the 3 kinds of results are highly similar... The perfect proportions fine tuning around golden ratio etc... are statistically unaffected by undetermined bases. MEANWHILE, in the CODEX BIOGENESIS book, we show how accuracy increase withe the progress of human genome project comparing the successives releases of HGP, with NNN decreasing at each release. THEN our ratios converge at each new release... Here is a DYNAMIC proof of our discovery! Please, see CODEX BIOGENESIS pp 155 fig 12.3 comparing Human Genome Project releases od April 2001, Nivember 2002 and August 2003. http://www.amazon.co.uk/Codex-Biogenesis-harmonies-g%C3%A9nome-latome/dp/2874340448jean-claude perez_{April 29, 2014
April
04
Apr
29
29
2014
02:12 PM
2
02
12
PM
PDT}

Complete jc perez interdisciplinary Creation Wiki BIOGRAPHY in: http://creationwiki.org/Jean-claude_Perezjean-claude perez_{April 29, 2014
April
04
Apr
29
29
2014
01:08 PM
1
01
08
PM
PDT}

Full J.C. Perez's interdisciplinary biography in CreationWiki: http://creationwiki.org/Jean-claude_Perezjean-claude perez_{April 29, 2014
April
04
Apr
29
29
2014
01:05 PM
1
01
05
PM
PDT}

Dear GORDON DAVIDSON, you are right talking on PI: if the concensus ratio for whole human genome is (3-Phi)/2, the limits lower chromosomes are: chr4 1/Phi and chr19 1/Phi + 1/Pi !!!! please see details in my BEIJING conference here: http://fr.scribd.com/doc/57828784/jcperezBeijing032011jean-claude perez_{April 29, 2014
April
04
Apr
29
29
2014
11:08 AM
11
11
08
AM
PDT}

Gordon all this work was reproduced by Dr Jordi sola soler http://www.sacred-geometry.es/en/content/phi-and-music-dna for human chromosome1 I have all data results... details in my book CODEX BIOGENESIS http://www.amazon.co.uk/Codex-Biogenesis-harmonies-g%C3%A9nome-latome/dp/2874340448jean-claude perez_{April 29, 2014
April
04
Apr
29
29
2014
05:44 AM
5
05
44
AM
PDT}

more details in: Welcome... http://golden-ratio-in-dna.blogspot.com/ https://sites.google.com/site/codexbiogenesis/ http://fr.scribd.com/doc/57828784/jcperezBeijing032011 http://www.scribd.com/jean_claude_perez/documents http://file.scirp.org/Html/4-7401586_37457.htm http://www.scribd.com/doc/95641538/Codon-Populations-in-Single-stranded-Whole-Human-Genome-DNA-Are-Fractal-and-Fine-tuned-by-the-Golden-Ratio-1-618 https://plus.google.com/u/0/+jeanclaudePerez/aboutjean-claude perez_{April 29, 2014
April
04
Apr
29
29
2014
04:22 AM
4
04
22
AM
PDT}

Gordon, I publicly express my thanks for your meticulous work. I'm indebted to you: https://uncommondescent.com/news/many-thanks-to-gordon-davisson-and-joe-felsenstein-for-review-and-criticism-of-my-ud-article/scordova_{April 28, 2014
April
04
Apr
28
28
2014
09:02 AM
9
09
02
AM
PDT}

Follow on: https://uncommondescent.com/junk-dna/request-for-help-verifying-non-random-3mer-pattern-in-human-chromosome-1/scordova_{April 28, 2014
April
04
Apr
28
28
2014
02:33 AM
2
02
33
AM
PDT}

Gordon, If it is okay with you: 1. I can try to get you a copy of Perez orginal paper, I have several papers I'll be getting in also, and I can make them available to you. 2. I'd like to give you an account at Creation Evolution University to record some of our more technical interactions rather than at UD. It will be a better way of archiving some of what you have to say without it going quickly into cyber oblivion. What you have to say is to valuable, and it's awfully hard to search for some of your comments in the quarter million or so at UD. I have my doubts as well, and I was waiting to see if anyone else would chime in. My main issue is how did Perez do an alignment so as to identify triplets. There are huge "NNNN" segments in fasta files, so how does he know where to start recognizing triplets?scordova_{April 27, 2014
April
04
Apr
27
27
2014
11:03 PM
11
11
03
PM
PDT}

Sal, I don't think there's any question that there are many types (/levels) of non-randomness in genome sequences. In addition to the ones you mentioned, I'll throw isochores (regions with similar GC content) on the list. And note that they're related to other forms of non-randomness, for example gene-rich regions tend to have high GC content, while gene deserts have low GC content. I think there are two different questions here, though: first, whether natural processes (i.e. evolution) can account for the various forms of non-randomness, and second whether Perez has identified a significant form of non-randomness. The first question is huge (in a sense, it's the question of ID) and complicated, and I'll mostly duck it here. Well, ok, a really quick summary of my opinion: some of the non-randomness can be accounted for by known processes and mechanisms, and some will be accounted for by processes and mechanisms we've yet to fully work out. I haven't seen a convincing case either way for whether all non-randomness will eventually be explained by evolutionary mechanisms, so I don't try to draw implications from either assumption. As for the second question, it's also a bit hard to answer, because I don't have access to the original paper, data, and analysis method. There's some stuff at golden-ratio-in-dna.blogspot.com that might be by Perez, but I frankly find it incomprehensible. The discussion in "Phi and Music in DNA" is more readable, but I'm not sure it's describing Perez's analysis method correctly. If the description in "Phi and Music in DNA" is correct, I think my criticism is obviously correct (to the point of not needing to be tested). There is one small correction I need to make, though: bases near the end of a strand will not be counted as many times as I described. A base right at the end of a strand can only be the first or last of a triplet, so it only contributes to 2 entries in the table. Similarly, a base one from the end will only contribute to 4 entries. Bases two or more from the end contribute to the full 6 entries, as I described. Note, however, that the paired base also contributes the same amount, so the symmetry is maintained even in these cases. BTW, there's another, more visible way to see the symmetry due to the analysis method: suppose that at a particular position in the genome, there's an ACG triplet. It obviously gets counted in the ACG bin. In the reverse order, it also gets counted in the GCA bin. The paired triplet on the other strand, TGC, similarly gets counted in the TGC and (in reverse) CGT bins. This means that the ACG, GCA, TGC, and CGT bins should all have exactly the same count. And if you look at the figure you included, they do: 96112792. Similarly, CAT, TAC, GTA, and ATG all have 169023944; ATT, TTA, TAA, and AAT all have 260313647; etc. Note that if the analysis didn't count both strands in both directions with all three possible starting offsets, we'd expect these symmetry groups to have the same counts on average, but have different statistical variations from that average. With all 12 possibilities counted, these variations are smoothed out and vanish, but if the 12-way overcounting wasn't done we'd see at least some variation. Maybe not very much, though: if non-random features appear at random starting offsets on random strands, they'll average out and produce near-symmetries in the table. That may sound backward, so I'd better also point out that in this table arrangement, uniformity shows up as symmetries in the table, and non-uniformity (i.e. deviations from uniform randomness) show up as asymmetries. For example, the 3-codon periodicity (actually, correlations between nearby bases in general) show up in the table as differences between bins that are anagrams of each other. Thus the difference in counts between CAT (169023944), CTA (149333215), and ACT (202932695) is an indicator of non-randomness in the sequence. Thus, the symmetries he points out (equalities between different rows and columns) are actually an indication of a lack of pattern in the DNA sequence (if anything). Note that this does lead to something that could be tested: generate a random sequence of ACGT with a slight bias toward AT, and see if the various permutation, reorder, and complement bins come out with nearly the same counts. I feel pretty secure that they will... I'm pretty sure we can also rule out the checksum claim. If it were correct, mutations wouldn't happen (they do), and the different chromosomes would all have the same CG ratios (they don't; look at figure 1 of the isochore map paper, and compare chromosomes 17, 19, and 22 with 4, 5, 13, and X).Gordon Davisson_{April 27, 2014
April
04
Apr
27
27
2014
10:48 PM
10
10
48
PM
PDT}

Where I was heading with all this is that I think there are non-random pattern in the DNA as a whole including the "junk DNA". It makes possible this: http://www.asknature.org/strategy/0146a4de195dde317e3ce62870a3544a#.U12sm41OXrc Whether Perez is right or wrong, the quest for non-random patterns is on. How Chargaff's rules didn't detect codon bias or 3 periodicity is an important question. Something has to give! Now if the physical "fractal globules" in http://www.asknature.org/strategy/0146a4de195dde317e3ce62870a3544a#.U12sm41OXrc have connection to the sequences, then this is also important, and it also means "junk DNA" has significance.scordova_{April 27, 2014
April
04
Apr
27
27
2014
06:30 PM
6
06
30
PM
PDT}

1 2 Next

You must be logged in to post a comment.

Leave a Reply