Uncommon Descent Serving The Intelligent Design Community

Double debunking: Glenn Williamson on human-chimp DNA similarity and genes unique to human beings

Share
Facebook
Twitter
LinkedIn
Flipboard
Print
Email

Computer programmer Glenn Williamson now claims that ICR geneticist Jeff Tomkins made an elementary error when using the nucmer program to show that human and chimp DNA are only 88% similar. Williamson also asserts that 60 de novo protein coding genes said to be unique to human beings have very similar counterparts in apes, contrary to claims made last year by Dr. Cornelius Hunter, who is an adjunct professor of biophysics at Biola University.

What Dr. Tomkins allegedly got wrong

As readers of my recent post, Human and chimp DNA: They really are about 98% similar, will recall, Glenn Williamson demolished Dr. Tomkins’s original claim, made back in 2013, that human and chimp DNA are only about 70% similar. Williamson’s detailed takedown of Dr. Tomkins’s 70% similarity figure can be accessed here. In short: the version of the BLAST computer algorithm used by Tomkins contained a bug which invalidated his results. Dr. Tomkins responded by performing a new study which came up with a similarity figure of 88% – still far below the 98% similarity figure commonly claimed in textbooks for human and chimp DNA. Tomkins arrived at that figure by using a version of the BLAST algorithm which did not contain the bug, and in my last post, I pointed out the errors identified by Glenn Williamson in Dr. Tomkins’ new paper, relating to BLAST.

But to give credit where credit is due, Dr. Tomkins didn’t rely on just one computer program to come up with his 88% figure; he relied on three. In addition to BLAST, Dr. Tomkins made use of two other programs: nucmer and LASTZ. Creation scientist Jay Wile described these programs in a recent post discussing Dr. Tomkins’ work:

The nucmer program’s results agreed with the unbugged BLAST results: on average the human and chimpanzee genomes are 88% similar. The LASTZ program produced a lower average similarity (73%), which indicates that perhaps LASTZ has a bug or is not optimized for such comparisons, since its results are very close to the results Dr. Tomkins got with the bugged version of BLAST.

In today’s post, I’ll discuss the flaws identified by Glenn Williamson in Dr. Tomkins’s comparisons that were made using the nucmer program.

Basic methodological errors?

As we saw in yesterday’s post on Uncommon Descent, Glenn Williamson claims that Dr. Tomkins’s new study makes some fundamental errors in the way it performs the BLASTN analysis. Now, however, Williamson has gone further, and identified some very basic errors in the way Dr. Tomkins obtained his results from the nucmer program. What Williamson has shown is that even when human chromosome 20 is compared with itself, the calculation method used by Dr. Tomkins when running the “nucmer” program would imply (absurdly) that it is less than 90% similar to itself!

I have been in email correspondence with Glenn Williamson over the past 24 hours, and he kindly allowed me to publish his responses, as well as some emails he sent to Dr. Tomkins. Here’s an excerpt from his first email to me.

Hi Vincent,

I’ve only just seen your post on UD, and I thought I’d fill you in on where we are at with one of the other comparisons (“nucmer”) Jeff did in his recent paper. Basically what he is doing in this comparison is taking every single alignment for each query sequence (as opposed to taking just the best alignment) and taking the average of all those. Obviously all the repeat motifs will find many matches across each chromosome, but only one of those will be (putatively) homologous. If you can follow the email thread from the bottom, hopefully you can understand the issue.

I’m currently running a nucmer job with human chromosome 20 being compared to itself, just to show the absurdity of his calculation method. I should have the results by tomorrow.

I subsequently emailed him, and asked if he could tell me about the results:

I would greatly appreciate it if you would let me know about your results, after you finish running your nucmer job. I was also wondering if you would allow me to quote excepts from your correspondence in a forthcoming post on UD.

Glenn Williamson replied:

Hey,

Yup, no problems quoting any of the emails…

The first nucmer job I ran took 37 hours (human 20 to chimp 20), and this current “control” job (human 20 to human 20) has taken 37 hours as of right now, so it should finish soon. It will take a couple of hours to put all the results together, so should have something by tonight.

It wasn’t long before I heard from Glenn Williamson again:

It’s done!

And human chromosome 20 is only 88.86% identical to human chromosome 20! 🙂

Unix commands, if you care:

awk ‘NR>5 { print $7″\t”$8″\t”$10 }’ control.coords > control.tab
awk ‘{ sum += ($1 + $2) / 2; prod += ($1 + $2) / 2 * $3 } END { print prod; print sum; print prod / sum }’ control.tab

Output:

1.71549e+09
1.52439e+11
88.8601

So basically the alignments covered 1.715Gb for a chromosome that is only 64Mb long (27x coverage). There were 4.8 million individual alignments …

So there we have it. If Dr. Tomkins is right, then not only is chimpanzee DNA only 88% similar to our own, but human DNA is only 89% similar to itself!

Do human beings really have 60 de novo protein-coding genes with no counterparts in apes?

But there was more – much more. In my original email to Glenn Williamson, I had expressed curiosity over a comment he made on a January 2014 post titled, Chinese Researchers Demolish Evolutionary Pseudo-Science, over at Dr. Cornelius Hunter’s Website, Darwin’s God, in which Williamson expressed skepticism over Dr. Hunter’s claim that no less than 60 protein-coding orphan genes had been identified in human DNA which had no counterpart in chimpanzees. To support his claim, Dr. Hunter cited a 2011 PLOS study by Dong-Dong Wu, David M. Irwin and Ya-Ping Zhang, titled De Novo Origin of Human Protein-Coding Genes. Here is the authors’ summary of their paper (emphases mine – VJT):

The origin of genes can involve mechanisms such as gene duplication, exon shuffling, retroposition, mobile elements, lateral gene transfer, gene fusion/fission, and de novo origination. However, de novo origin, which means genes originate from a non-coding DNA region, is considered to be a very rare occurrence. Here we identify 60 new protein-coding genes that originated de novo on the human lineage since divergence from the chimpanzee, supported by both transcriptional and proteomic evidence. It is inconsistent with the traditional view that the de novo origin of new genes is rare. RNA–seq data indicate that these de novo originated genes have their highest expression in the cerebral cortex and testes, suggesting these genes may contribute to phenotypic traits that are unique to humans, such as development of cognitive ability. Therefore, the importance of de novo origination needs greater appreciation.

Commenting on the paper, Dr. Hunter remarked (bold emphases mine – VJT):

A 2011 paper out of China and Canada, for example, found 60 protein-coding genes in humans that are not in the chimp. And that was an extremely conservative estimate. They actually found evidence for far more such genes, but used conservative filters to arrive at 60 unique genes. Not surprisingly, the research also found evidence of function, for these genes, that may be unique to humans.

If the proteins encoded by these genes are anything like most proteins, then this finding would be another major problem for evolutionary theory. Aside from rebuking the evolutionist’s view that the human-chimp genome differences must be minor, 6 million years simply would not be enough time to evolve these genes.

In fact, 6 billion years would not be enough time. The evolution of a single new protein, even by evolutionists’ incredibly optimistic assumptions, is astronomically unlikely, even given the entire age of the universe to work on the problem.

Note the claim that Dr. Hunter is making here: “60 protein-coding genes in humans that are not in the chimp.” But as we’ll see, these genes do have virtually identical counterparts in chimps, even if they are noncoding.

So, how many ORFan genes do humans really have?

In his comment, Glenn Williamson responded to Dr. Hunter’s claim that humans have 60 protein-coding genes that are “not in the chimp” by pointing out that the first of these 60-odd genes actually has a counterpart in chimpanzee DNA which is 98% identical to the human gene (emphasis mine – VJT):

“So how many ORFan genes are actually in humans???”

Depends what you call an ORFan gene. I looked at the paper that Cornelius cites as having 60 de novo protein coding genes.

Now, granted that I only looked at the very first one (“ZNF843”), it was quite easy to find the corresponding DNA on the chimpanzee chromosome, with approximately 98.5% identity.

So whether it is protein-coding in humans and non-coding in everything else doesn’t really concern me. What concerns me is whether it has an evolutionary history: clearly this one does.

Like I said, I’ve only done one. I’d happily take bets on the majority of these de novo genes having an evolutionary history (chimpanzee > 95% and/or gorilla > 90%).

Any takers?

I had only come across this exchange in the last couple of days, while surfing the Net, and my curiosity was piqued. So I wrote back to Williamson:

By the way, I was intrigued with your work on orphan genes, and I thought I’d have a look at the 60 genes mentioned by Cornelius Hunter in a post he wrote last year. However, I don’t have any experience in this area. Can you tell me how to go about running these comparisons?

Orphan genes – did Dr. Hunter get his facts wrong?

Glenn Williamson’s reply was very helpful – and it pulled no punches. He accused Dr. Hunter of getting his facts wrong about ORFan genes (emphasis mine – VJT):

As for Orphan genes, I assume you mean this comment? http://darwins-god.blogspot.com.au/2014/01/chinese-researchers-demolish.html?showComment=1421299517820#c1105680265537141676

There are a couple of points to be made here. First is that Cornelius fundamentally misunderstands what an orphan gene is and what an ORF(an) is – they are not equivalent terms. A true orphan gene should be called a “taxonomically restricted gene” (TRG), and no trace of its evolutionary history can be found outside a particular taxonomic group. These would be examples of de novo evolution. With respect to humans and chimpanzees, I don’t know of any TRGs that exist in either genome (with respect to the other), and if there were, I would then check the other great apes to see if it was likely that this gene was deleted in one of the genomes (rather than evolved out of nothing in 6mn years!).

Good point. Williamson continued:

An ORFan gene usually refers to a putative protein coding gene. “Putative” because these are generally the result of a computer program trying to find long open reading frames, and if it finds something over a certain length (300bp? 400bp?) then, since a long open reading frame is quite unlikely, the program thinks that this open reading frame is evolutionarily conserved, and it might be conserved because it codes for an important protein. Have a read of Eric Lander’s paper – http://www.ncbi.nlm.nih.gov/pubmed/18040051 – where he says we should be removing these ORFs from the gene database unless and until we can actually find their corresponding proteins.

Glad we got that point cleared up. So, what about those 60 protein-coding genes in humans which Dr. Hunter claimed are not found in the chimp? Here’s what Williamson wrote back to me:

So, these 60-odd genes that Cornelius brings up, he is claiming that they must have evolved de novo:

“In fact, 6 billion years would not be enough time. The evolution of a single new protein, even by evolutionists’ incredibly optimistic assumptions, is astronomically unlikely, even given the entire age of the universe to work on the problem.”

And that’s why I checked the first one on the list, just to demonstrate that it was in the chimpanzee genome (at 98.5% identity). So if this gene codes for a protein in humans, maybe we just haven’t found the protein in chimps. Maybe it codes for a protein in humans, and there was a single mutation that caused it not to be translated in chimps. Maybe it doesn’t actually code for a protein in humans at all? (Although I think we can check that). In any case, it’s not an example of de novo evolution – it’s not an orphan gene in the sense of being taxonomically restricted.

As to how to do the work yourself .. let me send this one off first and I’ll start another email 🙂

For my part, I am somewhat skeptical about Williamson’s speculation that these genes got switched off in the lin leading to chimpanzees – especially in view of the discovery of three undoubted cases of de novo genes in human beings where the ancestral sequence in apes was noncoding. But given the striking 98% similarities between these genes and their non-coding counterparts in apes, I would also urge caution about Dr. Hunter’s claim that even billions of years would not have been long enough for these protein-coding genes to have evolved. If they were evolving from scratch, yes; but if they were evolving from 98% identical counterparts, I wouldn’t be so sure about that.

I learn how to do a BLAST comparison

In his next email, Glenn Williamson kindly informed me how to do a BLAST comparison, and how he obtained his results for ZNF843, which was the first of the 60 de novo protein coding genes cited by Dr. Hunter in his 2014 post. In his response to Dr. Hunter, Williamson had reported that “it was quite easy to find the corresponding DNA on the chimpanzee chromosome, with approximately 98.5% identity.” Here’s what he wrote to me:

Alright, I’ll run you through a simple BLAST search on the Ensembl website. Although, if you want to do some serious BLASTing, then you probably should install the software on your own machine, and download the genomes onto your hard drive.

Anyway, go to:

http://www.ensembl.org/index.html

and stick the name of the gene: ZNF843 into the search box. That should get you to here:

http://asia.ensembl.org/Homo_sapiens/Gene/Summary?db=core;g=ENSG00000176723;r=16:31432593-31443160

On the left hand side, there should be an “Export Data” tab. Click it. Deselect all the checkboxes (we just want the raw DNA) and hit “Next”. Hit the “Text” button, and then just Copy the whole output, starting with the “>blah blah blah”. Now, at the top left of the page is the “BLAST/BLAT” tool. Click it.

Paste the copied DNA into the box, make sure you search against the chimpanzee genome (i.e. uncheck the human genome) and then run the search – using the default parameters should be fine for now.

The results can be found here:

http://www.ensembl.org/Homo_sapiens/Tools/Blast/Ticket?tl=mQCTv8YnFRQKB0Kx

Unfortunately the results are given in chunks, and if you want to get an exact number, stick them in Excel and work it out. But if you just want to look at it on the website, click on the “Genomic Location” header to sort them in that order, scroll down to chromosome 16, and you’ll see that it covers the vast majority of the 10.5kb of query DNA, and the matches are around 98.5%-99.5%. Rough guess for the overall identity (including some small indels) is about 98.5%.

If you need help just email me back and I’ll see what I can do. I gotta run now tho 🙂

And here’s what Williamson got when he ran the BLAST comparison on his computer:

I ran it on my local machine:

#!/bin/sh

QRY=”ZNF843.fa”
SBJ=”${HOME}/Data/Ensembl/chimp/Pan_troglodytes.CHIMP2.1.4.dna.chromosome.16.fa”

blastn -query ${QRY} -subject ${SBJ} -max_hsps 1 -outfmt ’10 qseqid qstart qend sstart send nident pident qlen length’

Output:

16,1,10568,31611859,31601307,10375,97.62,10568,10628

So, only 97.62% identity for that one … 0.57% of the alignment is indels. Boooooooooooooo.

So, for the first of the alleged 60 “de novo” protein coding genes cited by Dr. Hunter (“ZNF843″), Glenn Williamson managed to locate some corresponding DNA on the chimpanzee chromosome, which was approximately 98% identical. Are these genes without an evolutionary history? I hardly think so!

More good news – the results for all the other genes are already in!

In his most recent email, Glenn Williamson shared further good tidings: comparisons for the other 59 genes have already been done!

Just looking into that 2011 paper a little further – they’ve already done all the work for us!

http://journals.plos.org/plosgenetics/article/asset?unique&id=info:doi/10.1371/journal.pgen.1002379.s009
http://journals.plos.org/plosgenetics/article/asset?unique&id=info:doi/10.1371/journal.pgen.1002379.s011

These are the 60 “de novo” genes, and their alignments with chimpanzee and orang-utan 🙂

I’ve had a look at the output, and even to my untutored eye, it’s obvious that any claims that these “de novo” genes are not found in the DNA of chimps and other apes are flat-out wrong. They have virtually identical counterparts on the chimpanzee and orang-utan genomes, even if these are non-protein coding.

Some cautionary remarks about the 2011 paper cited by Dr. Hunter

The 2011 paper by Wu et al. which was cited by Dr. Hunter was critiqued in another article in PLOS Genetics (7(11): e1002381. doi:10.1371/journal.pgen.1002381, published 10 November 2011), titled,
De Novo Origins of Human Genes by Daniele Guerzoni and Aoife McLysaght. The authors felt that the estimate of 60 de novo human-specific genes in the paper by Wu et al. was based on rather lax criteria. What’s more, they seemed confident that the genes could have evolved:

In this issue of PLoS Genetics, Wu et al. [15] report 60 putative de novo human-specific genes. This is a lot higher than a previous, admittedly conservative, estimate of 18 such genes [13], [16]. The genes identified share broad characteristics with other reported de novo genes [13]: they are short, and all but one consist of a single exon. In other words, the genes are simple, and their evolution de novo seems plausible. The potential evolution of complex features such as intron splicing and protein domains within de novo genes remains somewhat puzzling. However, features such as proto-splice sites may pre-date novel genes [9], [17], and the appearance of protein domains by convergent evolution may be more likely than previously thought [2].

The operational definition of a de novo gene used by Wu et al. [15] means that there may be an ORF (and thus potentially a protein-coding gene) in the chimpanzee genome that is up to 80% of the length of the human gene (for about a third of the genes the chimpanzee ORF is at least 50% of the length of the human gene). This is a more lenient criterion than employed by other studies, and this may partly explain the comparatively high number of de novo genes identified. Some of these cases may be human-specific extensions of pre-existing genes, rather than entirely de novo genes — an interesting, but distinct, phenomenon.

In a 2009 paper titled Recent de novo origin of human protein-coding genes (Genome Research 2009, 19: 1752-1759), David Knowles and Aoife McLysaght presented evidence for the de novo origin of at least three human protein-coding genes since the divergence with chimp, and estimated that there may be 18 such genes in the human genome, altogether. Here’s what they said about the three genes they identified:

Each of these genes has no protein-coding homologs in any other genome, but is supported by evidence from expression and, importantly, proteomics data. The absence of these genes in chimp and macaque cannot be explained by sequencing gaps or annotation error. High-quality sequence data indicate that these loci are noncoding DNA in other primates. Furthermore, chimp, gorilla, gibbon, and macaque share the same disabling sequence difference, supporting the inference that the ancestral sequence was noncoding over the alternative possibility of parallel gene inactivation in multiple primate lineages.

Note the wording: “Each of these genes has no protein-coding homologs in any other genome.” Nevertheless, the genes have non-coding counterparts in the DNA of apes: “High-quality sequence data indicate that these loci are noncoding DNA in other primates.”

Whether these genes could have evolved naturally from their ape counterparts is a question I’ll leave for the experts to sort out. One thing I do know, however: they are not “new” in the sense that layfolk would construe that term – that is, functioning genes which have no counterparts in the DNA of apes. Clearly, they do have very similar counterparts in apes, even if those counterparts are non-coding.

Conclusion

Well, I think that’s about enough new revelations for one day, so I shall stop there. It seems to me that any claims that humans have a large number of “de novo” genes with no counterparts in the DNA of chimpanzees and other apes should be treated with extreme caution. In fact, I wouldn’t bet on our having any de novo protein-coding genes having no counterparts in apes, after that takedown.

We already have very good arguments demonstrating the impossibility of proteins having evolved via an undirected process, thanks to the excellent work of Dr. Douglas Axe – see, for instance, his excellent article, The Case Against a Darwinian Origin of Protein Folds. It seems to me that arguments based on de novo genes alleged to exist in human beings, with no counterparts in apes, have much weaker evidential support, and that Intelligent Design proponents would be better off not using them.

But perhaps those who are feeling adventurous might like to take up Glenn Williamson on his 2014 wager:

I’d happily take bets on the majority of these de novo genes having an evolutionary history (chimpanzee > 95% and/or gorilla > 90%).

Any takers?

Well? Is anyone feeling lucky?

POSTSCRIPT: Readers may be interested to know that Dr. Ann Gauger has written a very balanced post titled, Orphan Genes—A Guide for the Perplexed. In her post, Dr. Gauger defines orphan genes as ” those open reading frames that lack identifiable sequence similarity to other protein-coding genes.” Note the word “protein-coding.” She raises the possibility that “they are uniquely designed for species- and clade-specific functions” but draws no firm conclusions.

Comments
Dr. Torley, this article that Dr. Nelson posted today may interest you
“Evolution, in the sense of common descent, is not a theory of similarity. Linnaeus, Cuvier, and Agassiz knew all about similarity, yet they denied common descent. Evolution is a theory of transformation.”,,, Paul Nelson - What Evolution Is, and What It's Not - October 30, 2015 http://www.evolutionnews.org/2015/10/what_evolution100501.html Common Descent? - Some Insurmountable Problems for gradualism/transformation https://docs.google.com/document/d/1BBU4GVEPIxDDSre6YLqU5zbaXVdSk4RRMD8F7GU3DPM/edit
bornagain
October 30, 2015
October
10
Oct
30
30
2015
04:18 PM
4
04
18
PM
PDT
Dr. Hunter, When I asked, “How do you explain the fact that these genes have non-coding counterparts in chimpanzees which are 98% similar?”, you replied, "The fact that there are homologous ORFs of these genes in the chimp and ape is an enormous problem for evolution." I have to say I'm a little surprised by this leap of logic. If humans had genes for which there was nothing homologous in the DNA of chimpanzees or other apes, or for that matter other organisms, that would indeed be an "enormous problem for evolution." But when we find homologs which are 98% similar, obviously the problem is considerably reduced. You write that ORFs need to be regulated and recruited, but in that case, Thick Python's argument that "the underlying framework for that regulation – be it transcription factor, or some other mechanism – is likely to have 98% of its DNA already in the genome" would still apply. I am not arguing that orphan genes weren't designed. I honestly don't know whether they were or not. But I'd like to ask you a question. Which do you think is the greater leap, in evolutionary terms: (i) the leap from a random string of bases to a non-coding DNA string which is 98% similar to a so-called "de novo" gene, or (ii) the leap from this 98%-similar homolog to a gene which codes for a protein? If the latter, why? If you believe that the gene was designed, then why not the 98%-similar homolog? And if you believe that the non-coding DNA wasn't designed, then doesn't it seem a bit odd to say that design was required to tweak the last 2%? So far, the most sensible suggestion I've seen coming from Intelligent Design advocates is one by gpuccio at comment #55 above:
In the case on new protein coding genes which are protein coding only in humans, but which have large homologies with non coding sequences in primates, that is exactly my idea of one modality about how new protein coding genes may come into existence: by the gradual engineering of non coding sequences into a coding sequence. Some examples have been described, and transposons could have an important role in the process. Which can only be interpreted, obviously, as an intelligent design process.
Would this be your position?vjtorley
October 30, 2015
October
10
Oct
30
30
2015
03:23 PM
3
03
23
PM
PDT
@Peer, #472:
my question was: what did you run…WU or NCBI….?
And I answered it: "Jeff Tomkins used NCBI BLAST, so clearly I have to use it ...".
I would like to see both results.
I'm sure you would, but that's no small amount of work just to satisfy your curiosity. I've already put two analyses out there for scrutiny (BLAST and nucmer) and I'm not aware of any criticisms of them in this thread that have held up (please give post #'s, if you think otherwise!). If you can get Tomkins to address the seven points above, I will gladly do a WU-BLAST comparison.ThickPython
October 30, 2015
October
10
Oct
30
30
2015
03:00 PM
3
03
00
PM
PDT
@ big snake , that was not my question. my question was: what did you run...WU or NCBI....? I would like to see both results.Peer
October 30, 2015
October
10
Oct
30
30
2015
02:21 PM
2
02
21
PM
PDT
@BornAgain:
Peer, kf has offered to host a guest post by you on this subject.
Yes, in line with BornAgain's post in #380, I would love to see a guest post with "sufficient feedback from Dr Tomkins himself", and would also love to see Peer's rebuttal as well. Clearly I've sought feedback from Tomkins but the silence is deafening. What I'd like to see an explanation of: 1. Why Tomkins chose the "ungapped" parameter, and then how he includes those results in his calculation. Specifically he needs to address the example where an indel is right in the middle of two sequences, and why he includes this as a 50% result. 2. What result he obtains when he corrects for #1. 3. Why he continued to use "ungapped" in his most recent paper, when my paper directly pointed out this flaw to him 12 months ago. 4. Why Tomkins included ALL matches for a given query in his nucmer analysis rather than just taking the best match. 5. What result he obtains when he corrects for #4. 6. Why his previous work took only the best match into his calculation, and why he has confirmed in personal correspondence that this is the correct practice. 7. What feedback he gave to the editor (Andrew Snelling) that caused him to reject my paper for publication. I don't think I'm the only one that would like answers to these questions.ThickPython
October 30, 2015
October
10
Oct
30
30
2015
02:12 PM
2
02
12
PM
PDT
@Peer, #468:
What program did you run? NCBI Blast or WU Blast? If you used NCBI Blast…that would explain a lot.
Yes, it explains EVERYTHING. Jeff Tomkins used NCBI BLAST, so clearly I have to use it to demonstrate the bug in it that gave him his 70% result. Please let me know your thoughts on the choice of software, and I'll pass them on to Dr Jeffrey P. Tomkins.ThickPython
October 30, 2015
October
10
Oct
30
30
2015
01:49 PM
1
01
49
PM
PDT
Peer, kf has offered to host a guest post by you on this subject.bornagain
October 30, 2015
October
10
Oct
30
30
2015
08:41 AM
8
08
41
AM
PDT
@Big Snake... What program did you run? NCBI Blast or WU Blast? If you used NCBI Blast...that would explain a lot.Peer
October 30, 2015
October
10
Oct
30
30
2015
04:57 AM
4
04
57
AM
PDT
@Mung:
And yet you asserted that the probability of obtaining those results was “exactly zero.” You were obviously wrong. Now all I have to do is replicate Tomkin’s results and I win a million bucks. Right?
You sir, are an imbecile. The reason he got those results was due to the bug in the software. Please try to keep up.ThickPython
October 29, 2015
October
10
Oct
29
29
2015
09:27 PM
9
09
27
PM
PDT
Mung: Who on earth are you alleging got those results? ThickPython: Jeff Tomkins did. Check out the pretty graph in his 2013 paper. And yet you asserted that the probability of obtaining those results was "exactly zero." You were obviously wrong. Now all I have to do is replicate Tomkin's results and I win a million bucks. Right?Mung
October 29, 2015
October
10
Oct
29
29
2015
08:35 PM
8
08
35
PM
PDT
@Mung:
Who on earth are you alleging got those results?
Jeff Tomkins did. Check out the pretty graph in his 2013 paper.
p.s. If your goal is “Counter-apologetics” then UD might not be the right place for you.
Fixed.ThickPython
October 29, 2015
October
10
Oct
29
29
2015
07:52 PM
7
07
52
PM
PDT
Box, yes, (@ 448 & 449), I was going to say something along the lines of 'poetry' too, but Andre nailed it before me. https://uncommondescent.com/intelligent-design/double-debunking-glenn-williamson-on-human-chimp-dna-similarity-and-genes-unique-to-human-beings/#comment-585481bornagain
October 29, 2015
October
10
Oct
29
29
2015
07:41 PM
7
07
41
PM
PDT
Bornagain, talking about DNA-repair ... did you happen to read about "topoisomerases"; see Talbott quote in bottom section post #499 ? Utterly phenomenal isn't it? :) // - - - -
Andre: Talbott’s paper is poetry…..
Good to hear that you enjoyed it!Box
October 29, 2015
October
10
Oct
29
29
2015
07:34 PM
7
07
34
PM
PDT
OT: OK, dumb question, but how did random mutations to DNA build such elaborate overlapping random mutation repair mechanisms to prevent random mutations from happening to DNA in the first place? Does such a clear contradiction even make sense any where else in science save for the fantasy land that is Darwinian 'science'?
New class of DNA repair enzyme discovered - October 29, 2015 Excerpt: This year's Nobel Prize in chemistry was given to three scientists who each focused on one piece of the DNA repair puzzle. Now a new study, reported online Oct. 28 in the journal Nature, reports the discovery of a new class of DNA repair enzyme. When the structure of DNA was first discovered, scientists imagined it to be extremely chemically stable, which allowed it to act as a blueprint for passing the basic traits of parents along to their offspring. Although this view has remained prevalent among the public, biologists have since learned that the double helix is in fact a highly reactive molecule that is constantly being damaged and that cells must make unceasing repair efforts to protect the genetic information that it contains.,,, "More than 10,000 DNA damage events occur each day in every cell in the human body that must be repaired for DNA to function properly," said first author Elwood Mullins, a postdoctoral research associate in the Eichman lab. The newly discovered DNA repair enzyme is a DNA glycosylase, a family of enzymes discovered by Tomas Lindahl, who received this year's Nobel prize for recognizing that these enzymes removed damaged DNA bases through a process called base-excision repair. It was the first of about 10 different DNA repair pathways that biologists have identified to date.,,, "Our discovery shows that we still have a lot to learn about DNA repair, and that there may be alternative repair pathways yet to be discovered. It certainly shows us that a much broader range of DNA damage can be removed in ways that we didn't think were possible," said Eichman http://phys.org/news/2015-10-class-dna-enzyme.html
bornagain
October 29, 2015
October
10
Oct
29
29
2015
07:23 PM
7
07
23
PM
PDT
ThickPython: The chimpanzee non-coding DNA is the “98%”. The “98%” is the chimpanzee non-coding DNA. The human protein coding gene is the “100%”. The “100%” is the human protein coding gene. Please continue.
No, you continue. Some questions: What is the 98% non-coding DNA doing there? How did it get there? If it is subjected to random mutations what are the odds that it will ever get to "100%"? If it will get to 100% will there be matching regulation in place? If so, where does it come from? What are the odds? ...Box
October 29, 2015
October
10
Oct
29
29
2015
07:11 PM
7
07
11
PM
PDT
semi related is the podcast from ENV yesterday: podcast - Molecular Data Wreak Havoc on the Tree of Life http://www.discovery.org/multimedia/audio/2015/10/molecular-data-wreak-havoc-on-the-tree-of-life/ On this episode of ID the Future, hear about a recent article in the science magazine Nautilus (Evolution, You’re Drunk: DNA studies topple the ladder of complexity) that shows animal phylogenetic trees conflicting sharply with genetic data. As Casey Luskin points out, “When Darwinian theory tells us that crucial and complex features like brains or nervous systems evolved independently — or almost as weirdly, evolved and were repeatedly lost throughout life’s history — maybe, it’s time for the “ghost of teleology” to make an appearance in the form of common design.”bornagain
October 29, 2015
October
10
Oct
29
29
2015
06:51 PM
6
06
51
PM
PDT
ThickPython:
The probability of getting these results – where 200bp slices have 62% identity and 100 bp slices have 24% identity – is exactly zero.
Who on earth are you alleging got those results? p.s. If your goal is "Counter-apologetics" then UD might not be the right place for you.Mung
October 29, 2015
October
10
Oct
29
29
2015
06:45 PM
6
06
45
PM
PDT
Vincent: “Also, why would it take more than 6 billion years to get from 98% to 100%?” Cornelius: I never said it would. [...] 6 billion years is insufficient to evolve a protein coding gene from non coding DNA.
The chimpanzee non-coding DNA is the "98%". The "98%" is the chimpanzee non-coding DNA. The human protein coding gene is the "100%". The "100%" is the human protein coding gene. Please continue.ThickPython
October 29, 2015
October
10
Oct
29
29
2015
06:37 PM
6
06
37
PM
PDT
VJ: "Finally, what do you make of this paper in “Nature”? http://www.nature.com/nature/j.....11184.html" Actually this is along the lines of my speculation above. As usual the paper is cast into an evolutionary perspective. For example, the abstract makes this absurd conclusion: "Our work illustrates that evolution exploits seemingly dispensable sequences to generate adaptive functional innovation." There is no special evidence for evolution or common descent here. They write: "Here we formalize an evolutionary model," in spite of the fact that what they describe is far from evolutionary theory. They describe a whole bunch of molecular machinery operating on a whole bunch of DNA, with regulation, to construct protein genes in response to environmental challenges. That is not evolution.Cornelius Hunter
October 29, 2015
October
10
Oct
29
29
2015
04:17 PM
4
04
17
PM
PDT
VJ: “how do you explain the fact that these genes have non-coding counterparts in chimpanzees which are 98% similar? I think that’s a legitimate question.” The fact that there are homologous ORFs of these genes in the chimp and ape is an enormous problem for evolution. As for an explanation of this fact, that question is beyond current science. We don't even have basic data to answer such a question. Of course we can speculate. For instance, genetic regulation spans a wide range of time scales (think strategic to tactical) and associated mechanisms. Perhaps this is an example of regulation on the strategic end of the scale, where an ORF is recruited to be integrated with a gene, when needed. Under that idea non coding DNA has raw materials in the form of ORFs, or exons, parts of exons, etc., that can be recruited if/when needed. Obviously that is highly speculative. “Also, why would it take more than 6 billion years to get from 98% to 100%?” I never said it would and, unfortunately, this question reveals an ignorance of basic genetics and why the OP is groundless. What I explained was that 6 billion years is insufficient to evolve a protein coding gene from non coding DNA. It doesn’t matter if you have an ORF to begin with.Cornelius Hunter
October 29, 2015
October
10
Oct
29
29
2015
03:56 PM
3
03
56
PM
PDT
Dr. Hunter: Re your assertion (based on a 2011 paper by Wu et al.) that humans have "60 protein-coding genes in humans that are not in the chimp," how do you explain the fact that these genes have non-coding counterparts in chimpanzees which are 98% similar? I think that's a legitimate question. Also, why would it take more than 6 billion years to get from 98% to 100%? Finally, what do you make of this paper in "Nature"? http://www.nature.com/nature/journal/v487/n7407/full/nature11184.html Cheers.vjtorley
October 29, 2015
October
10
Oct
29
29
2015
02:53 PM
2
02
53
PM
PDT
Thanks Mung. Not a lot to report, though I do have a new website surveying evolution's predictions and their failures: https://sites.google.com/site/darwinspredictions/homeCornelius Hunter
October 29, 2015
October
10
Oct
29
29
2015
12:52 PM
12
12
52
PM
PDT
Dr. Hunter, big fan of your books. Anything new in the works that you can share?Mung
October 29, 2015
October
10
Oct
29
29
2015
10:04 AM
10
10
04
AM
PDT
Mung (451): Well evolving a protein-coding gene from a non coding sequence is yet another one of those probabilities which is difficult to estimate precisely because it is so extremely unlikely. And note that when evolutionists appeal to things like mobile genetic elements to do the heavy lifting then we are no longer talking about evolution (in spite of what they would like to claim).Cornelius Hunter
October 29, 2015
October
10
Oct
29
29
2015
09:50 AM
9
09
50
AM
PDT
ThickPython:
The point is that you intimated in your article that these de novo genes basically evolved out of nothing – “6 billion years would not be enough time” – and that’s clearly not what the paper is about.
I think he was calculating the probability with a zero in the numerator, not that they "evolved out of nothing."Mung
October 29, 2015
October
10
Oct
29
29
2015
08:18 AM
8
08
18
AM
PDT
Box Talbott's paper is poetry.....
Most of this noncoding DNA was at first dismissed as “junk” — meaningless evolutionary detritus accumulated over the ages. At best it was viewed as a kind of bag of spare parts, borne by cells from one generation to another for possible employment in future genomic innovations. But that’s an awful amount of junk for a cell to have to lug around, duplicate at every cell division, and otherwise manage on a continuing basis.
I hope Prof Moran has read it......Andre
October 29, 2015
October
10
Oct
29
29
2015
05:44 AM
5
05
44
AM
PDT
//follow-up #448// More S.L.Talbott: — on histones and gene expression:
The canonical nucleosome core particle is a complex of histone proteins, each of which has a flexible, filamentary "tail". This tail can be modified through the addition of several different chemical groups — acetyl, methyl, phosphate, ubiquitin, and so on — at any of various locations along its length. A great variety of enzymes can apply and remove these chemical groups, and the groups themselves play a role in attracting a stunning array of gene regulatory proteins that restructure chromatin or otherwise help choreograph the drama of gene expression. After a few histone tail modifications were found to be rather distinctly associated with active or repressed genes, the forlorn hope arose that we would discover a precise, combinatorial “histone code”. It would provide a fixed and reliable key enabling us to predict the consequences of any arrangement of modifications (Strahl and Allis 2000). But this was to ignore the nearly infinite variety of all those contextual factors that blend their voices in concert with the histone modifications. In the plastic organism, what goes on at the local level is always shaped and guided by a larger, coherent context — a context that surely has meaning, but (as in all natural languages) never an absolutely fixed grammar. And, in fact, while overwhelming evidence for a meaningful, gene-regulatory conversation involving histone modifications has emerged, there is little to suggest a rigid code. Shelley Berger of Philadelphia’s Wistar Institute, noting that a single tail modification “recruits numerous proteins whose regulatory functions are not only activating but also repressing” and that “many of these marks have several, seemingly conflicting roles”, summarized the situation this way: Although [histone] modifications were initially thought to be a simple code, a more likely model is of a sophisticated, nuanced chromatin ‘language’ in which different combinations of basic building blocks yield dynamic functional outcomes. (Berger 2007) And (leaving aside the jarring reference to building blocks) how could it be otherwise? Each histone tail modification re-shapes the physical and electrical structure of the local chromatin, shifting the pattern of interactions among nucleosome, DNA, and associated protein factors. To picture this situation concretely — as opposed to remaining within the straightjacket of code — is immediately to realize that it cannot be captured in purely digital terms. A sculptor does not try to assess the results of a stroke of the hammer as a choice among the possibilities of a digital logic. Berger envisions histone modifications as participating in “an intricate ‘dance’ of associations”. There is much much more. The histones making up a nucleosome core particle can themselves be exchanged for noncanonical, or variant histones, which also have recognizable — but not strictly encoded — relations to gene expression. Histones can even be removed from a core particle altogether, leaving it “incomplete”. And certain energy-expending proteins can slide core particles along the DNA, changing their position. We saw above that a shift of position by as little as two or three base pairs can mark the difference between gene activation or repression, as can changes in the rotational orientation of the DNA on the face of the histone core particle. And finally (to artificially limit our consideration): the tails — no doubt depending at least in part on the various modifications and protein associations mentioned earlier — can thread themselves through the encircling double helix, perhaps either loosening it from the core particle or holding it more firmly in place. But some of those same tails are also thought to establish nucleosome-to-nucleosome contacts, helping to compact a stretch of chromatin and repress gene expression.
— DNA in knots:
With so much concerted movement going on — not to mention the coiling and packing and unpacking of chromosomes mentioned earlier — how does the cell keep all those “miles of string in the tennis ball” from getting hopelessly tangled? In this case we at least know some of the players addressing the problem. For example, there are enzymes called “topoisomerases”, whose task is to help manage the spatial organization of chromosomes. Demonstrating a spatial insight and dexterity that might amaze those of us who have struggled to sort out tangled masses of thread, these enzymes manage to make just the right local cuts to the strands in order to relieve strain, allow necessary movement of individual genes or regions of the chromosome and prevent a hopeless mass of knots. Some topoisomerases cut just one strand of the double helix, allow it to wind or unwind around the other strand, and then reconnect the severed ends. This alters the supercoiling of the DNA. Other topoisomerases cut both strands, pass a loop of the chromosome through the gap thus created, and then seal the gap again. (Imagine trying this with miles of string crammed into a tennis ball!) I don't think anyone would claim to have the faintest idea how this is actually managed in a meaningful, overall, contextual sense, although great and fruitful efforts are being made to analyze isolated local forces and “mechanisms”.
Box
October 29, 2015
October
10
Oct
29
29
2015
05:09 AM
5
05
09
AM
PDT
ThickPython, Like many people, you seem to have a very abstract (and overly simplistic) view of DNA and gene-expression. How to picture these things? Talbott’s writings helped me a lot:
Talbott: If you arranged the DNA in a human cell linearly, it would extend for nearly two meters. How do you pack all that DNA into a cell nucleus just five or ten millionths of a meter in diameter? According to the usual comparison it’s as if you had to pack 24 miles (40 km) of extremely thin thread into a tennis ball. Moreover, this thread is divided into 46 pieces (individual chromosomes) averaging, in our tennis-ball analogy, over half a mile long. Can it be at all possible not only to pack the chromosomes into the nucleus, but also to keep them from becoming hopelessly entangled? Obviously it must be possible, however difficult to conceive — and in fact an endlessly varied packing and unpacking is going on all the time. The first thing to realize is that chromosomes do not consist of DNA only. Their actual substance, an intricately woven structure of DNA, RNA, and protein, is referred to as chromatin. Histone proteins, several of which can bind together in the form of an extremely complex histone core particle, are the single most prominent constituent of this chromatin. Every cell contains numerous such core particles — there are some 30 million in a typical human cell — and the DNA double helix, after wrapping a couple of times around one of them, typically extends for a short stretch and then wraps around another one. The core particle with its DNA is referred to as a nucleosome, and between 75 and 90 percent of our DNA is wrapped up in nucleosomes. But that’s just the first level of packing; it accounts for relatively little of the overall condensation of the chromosomes. If you twist a long, double-stranded rope, you will find the rope beginning to coil upon itself, and if you continue to twist, the coils will coil upon themselves, and so on without particular limit, depending on the fineness and length of the rope. Something like this supercoiling happens with the chromosome, mediated in part by the histone core particles. As a result the core particles, and the DNA along with them, become tightly packed almost beyond comprehension, in a dense, three-dimensional geometry that researchers have yet to visualize in any detail. This highly condensed state, characterizing great stretches of every chromosome, contrasts with other, relatively uncondensed stretches known as open chromatin. With that background, we can gain our first glimpse of the concerted dynamism in which genes participate. At any one time — and with the details depending on the tissue type and stage of the organism’s development, among other things — some parts of every chromosome are heavily condensed while others are open. Every overall configuration represents a unique balance between constrained and liberated expression of our total complement of 21,000 genes This is because the transcription of genes generally requires an open state; genes in condensed chromatin are largely silenced. The supercoiling has another direct, more localized role in gene expression. Think again of twisting a rope: depending on the direction of your twist, the two strands of the helix will either become more tightly wound around each other or will be loosened and unwound. (This is independent of the supercoiling, which occurs in either case.) And if, taking a double-stranded rope in hand, you insert a pencil between the strands and force it in one direction along the rope, you will find the strands winding ever more tightly ahead of the pencil’s motion and unwinding behind. Recall, then, that the enzyme (RNA polymerase) responsible for transcribing DNA into RNA must separate the two strands as it moves along a gene sequence. This is much easier if the supercoiling of the chromatin has already loosened the strands — and harder if the strands are tightened. So in this way the variations in supercoiling along the length of a chromosome either encourage or discourage the transcription of particular genes. Moreover, by virtue of its own activity in moving along the DNA and separating the two strands, RNA polymerase (like the pencil) tends to unwind the strands in the chromosomal region behind it, rendering that region, too, more susceptible to gene expression. There are proteins that detect such changes in torsion propagating along chromatin, and they read the changes as “suggestions” about helping to activate nearby genes (Lavelle 2009; Kouzine et al. 2008). Picture the situation concretely. Every bodily activity or condition presents its own requirements for gene expression. Whether you are running or sleeping, starving or feasting, getting aroused or calming down, suffering a flesh wound or recovering from pneumonia — in all cases the body and its different cells have specific, almost incomprehensibly complex and changing requirements for differential expression of thousands of genes. And one thing necessary for achieving this expression in all its fine detail is the properly choreographed performance of the chromosomes. This performance cannot be captured with an abstract code. Interacting with its surroundings, the chromosome belongs as much to a living activity as any other element in its cellular environment. Maybe instead of summoning the image of a rope, I should have invoked a snake, coiling, curling, and sliding over a landscape that is itself in continual movement. (...) Nucleosomes will sometimes move — or be moved (the distinction between actor and acted upon is forever obscured in the living cell) — rhythmically back and forth along the DNA, shifting between alternative positions in order to enable multiple transcription passes over a gene. In stem cells a process some have called “histone modification pulsing” results in the continual application and removal of both gene-repressive and gene-activating modifications of nucleosomes. In this way a delicate balance is maintained around genes involved in development and cell differentiation. The genes are kept, so to speak, in a finely poised state of “suspended readiness”, so that when the decision to specialize is finally taken, the repressive modifications can be quickly lifted, leading to rapid gene expression (Gan et al. 2007).
Box
October 29, 2015
October
10
Oct
29
29
2015
02:27 AM
2
02
27
AM
PDT
TP (446): "If you’ve admitted to that error, I must have missed it …" Actually you have made a series of false accusations about a two-year old post which describes a research study of human de novo genes. There was no "error" in the post. The error, as you have repeatedly proven, is in your ignorance of genetics. And you prove it once again: "The point is that you intimated in your article that these de novo genes basically evolved out of nothing" The post said no such thing, except in your imagination. In spite of that, and in spite of it being pointed out to you, you continue with the false accusations. I would have hoped for a retraction, but you won't even stop the falsehoods, though you are backpedaling by saying the post merely "intimated" what you imagined. So now I'm guilty of "intimating." " '6 billion years would not be enough time' " It wouldn't be, not by evolution. "No idea, and I don’t know why I would be expected to know that. Do you know how they evolved? In any case, it’s not the point." That is convenient. You have "no idea" because, in fact, this is a problem for evolution. Suddenly you need a noncoding sequence to evolve into a functional ORF with no natural selection to help. It is absurd and, yes, 6 billion years would not be enough time, or trillion, etc. And so the evolutionist looks the other way and says it doesn't matter. "Do you know how they evolved?" I know that they could not have evolved, unless you call on the multiverse to bail you out.Cornelius Hunter
October 28, 2015
October
10
Oct
28
28
2015
11:56 PM
11
11
56
PM
PDT
@Cornelius, #442:
A transcription factor? Amazing. All an evolutionist needs is a transcription factor and wala, you have regulation.
"voila". And what will you say if 500bp upstream and downstream of these genes also shows 98%-ish identity?
OK, tell us this, how did those Ape/Orangutan/Human homologous non coding sequences evolve in the first place. Random drift or what?
No idea, and I don't know why I would be expected to know that. Do you know how they evolved? In any case, it's not the point. The point is that you intimated in your article that these de novo genes basically evolved out of nothing - "6 billion years would not be enough time" - and that's clearly not what the paper is about. If you've admitted to that error, I must have missed it ...ThickPython
October 28, 2015
October
10
Oct
28
28
2015
11:20 PM
11
11
20
PM
PDT
1 4 5 6 7 8 21

Leave a Reply