Uncommon Descent Serving The Intelligent Design Community

Homologies, differences and information jumps

Share
Facebook
Twitter
LinkedIn
Flipboard
Print
Email

shark-553666_1280In recent posts, I have been discussing some important points about the reasonable meaning of homologies and differences in the proteome in the course of natural history. For the following discussion, just to be clear, I will accept a scenario of Common Descent (as explained in many recent posts) in the context of an ID approach. I will also accept the very reasonable concept that neutral or quasi-neutral random variation happens in time, and that negative (purifying) selection is the main principle which limits random variation in functional sequences.

My main points are the following:

  1. Given those premises, homologies through natural history are certainly an indicator of functional constraints, because they mean that some sequence cannot be significantly transformed by random variation. Another way to express this concept is that variation in a functional sequence with strong functional constraints is not neutral, but negative, and therefore negative selection will in mot cases suppress variation and conserve the functional sequence through time. This is a very important point, because it means that strong homologies through time point to high functional complexity, and therefore to design. I have used this kind of argument, for example, for proteins like the beta chain of ATP synthase (highly conserved from LUCA to humans) and Histone H3 (highly conserved in all eukaryotes).
  2. Differences between homologues, instead, can have two completely different meanings:
  •  2a) They can be the result of accumulating neutral variation in parts of the molecule which are not functionally constrained
  • 2b) They can be the expression of differences in function in different species and contexts

I do believe that both 2a and 2b happen and have an important role in shaping the proteome. 2b, in particular, is often underestimated. It is also, in many cases, a very good argument for ID.

 

Now, I will try to apply this reasoning to one example. I have chosen a regulatory protein, one which is not really well understood, but which has certainly an important role in epigenetic regulation. The protein is called “Prickle”, and we will consider in particular the one known as “Prickle 1”. It has come to my attention trough an interesting paper linked by Dionisio (to whom go my sincere thanks and appreciation):

Planar polarization of Vangl2 in the vertebrate neural plate is controlled by Wnt and Myosin II signaling

In brief, Prickle is a molecule implied, among other things, in planar polarization events and in the regulation of neural system in vertebrates.

Let’s have a look at the protein. From Wikipedia:

Prickle is part of the non-canonical Wnt signaling pathway that establishes planar cell polarity.[2] A gain or loss of function of Prickle1 causes defects in the convergent extension movements of gastrulation.[3] In epithelial cells, Prickle2 establishes and maintains cell apical/basal polarity.[4] Prickle1 plays an important role in the development of the nervous system by regulating the movement of nerve cells.[5

And:

Mutations in Prickle genes can cause epilepsy in humans by perturbing Prickle function.[12] One mutation in Prickle1 gene can result in Prickle1-Related Progressive Myoclonus Epilepsy-Ataxia Syndrome.[2] This mutation disrupts the interaction between prickle-like 1 and REST, which results in the inability to suppress REST.[2] Gene knockdown of Prickle1 by shRNA or dominant-negative constructs results in decreased axonal and dendritic extension in neurons in the hippocampus.[5] Prickle1 gene knockdown in neonatal retina causes defects in axon terminals of photoreceptors and in inner and outer segments.[5]

The human protein is 831 AAs long.

Its structure is interesting: according to Uniprot, in the first part of the molecule we can recognize 4 domains:

1 PET domain:  AAs 14 – 122

3 LIM zinc-binding doamins:  AAs 124 – 313

In the rest of the sequence (AAs 314 – 831) no known domain is recognized.

Here is the FASTA sequence of the human protein, divided in the two parts (red: 4 domain part; blue: no domain part):

 

>sp|Q96MT3|PRIC1_HUMAN Prickle-like protein 1 OS=Homo sapiens GN=PRICKLE1 PE=1 SV=2
MPLEMEPKMSKLAFGCQRSSTSDDDSGCALEEYAWVPPGLRPEQIQLYFACLPEEKVPYV
NSPGEKHRIKQLLYQLPPHDNEVRYCQSLSEEEKKELQVFSAQRKKEALGRGTIKLLSRA
VMHAVCEQCGLKINGGEVAVFASRAGPGVCWHPSCFVCFTCNELLVDLIYFYQDGKIHCG
RHHAELLKPRCSACDEIIFADECTEAEGRHWHMKHFCCLECETVLGGQRYIMKDGRPFCC
GCFESLYAEYCETCGEHIGVDHAQMTYDGQHWHATEACFSCAQCKASLLGCPFLPKQGQI
YCSKTCSLGEDVHASDSSDSAFQSARSRDSRRSVRMGKSSRSADQCRQSLLLSPALNYKF
PGLSGNADDTLSRKLDDLSLSRQGTSFASEEFWKGRVEQETPEDPEEWADHEDYMTQLLL
KFGDKSLFQPQPNEMDIRASEHWISDNMVKSKTELKQNNQSLASKKYQSDMYWAQSQDGL
GDSAYGSHPGPASSRRLQELELDHGASGYNHDETQWYEDSLECLSDLKPEQSVRDSMDSL
ALSNITGASVDGENKPRPSLYSLQNFEEMETEDCEKMSNMGTLNSSMLHRSAESLKSLSS
ELCPEKILPEEKPVHLPVLRRSKSQSRPQQVKFSDDVIDNGNYDIEIRQPPMSERTRRRV
YNFEERGSRSHHHRRRRSRKSRSDNALNLVTERKYSPKDRLRLYTPDNYEKFIQNKSARE
IQAYIQNADLYGQYAHATSDYGLQNPGMNRFLGLYGEDDDSWCSSSSSSSDSEEEGYFLG
QPIPQPRPQRFAYYTDDLSSPPSALPTPQFGQRTTKSKKKKGHKGKNCIIS

So, this is a very interesting situation, which is not so rare. We have the first part of the sequence (313 AAs) which configures well known and conserved domains, while “the rest”(517 AAs)  is apparently not understood in terms of structure and function.

So, to better understand what all this could mean, I have blasted those two parts of the human molecule separately.

(Those who are not interested in the technical details, can choose here to go on to the conclusions  🙂 )

The first part of the sequence (AAs 1 – 313) shows no homologies in prokaryotes. So, we are apparently in the presence of domains which appear in eukaryotes.

In fungi, we find some significant, but weak, homologues. The best hit is an expect of 2e-21, with 56 identities and 93 positives (99.4 bits).

Multicellular organisms have definitely stronger homologies:

C. elegans:  144 identities, 186 positives, expect 2e-90 (282 bits)

Drosophila melanogaster:  202 identities, 244 positives, expect 5e-152 (447 bits)

Let’s go to non vertebrate chordates:

Cephalochordata (Branchiostoma floridae):  222 identities, 256 positives, expect 6e-165 (484 bits)

Tunicata (Ciona intestinalis): 196 identities, 241 positives, expect 2e-149 (442 bits)

Now, vertebrates:

Cartilaginous fishes (Callorhincus milii): 266 identities, 290 positives, expect 0.0 (588 bits)

Bony fishes (Lepisosteus oculatus): 274 identities, 292 positives, expect 0.0 (598 bits)

Mammals (Mouse): 309 identities, 312 positives, expect 0.0 (664 bits)

IOWs, what we see here is that the 4 domain part of the molecule, absent in prokaryotes, is already partially observable in single celled eukaryotes, and is strongly recognizable in all multicellular beings. It is interesting that homology with the human form is not very different between drosophila and non vertebrate chordates, while there is a significant increase in vertebrates, and practical identity already in mouse. That is a very common pattern, and IMO it can be explained as a mixed result of different functional constraints and neutral evolution in different time splits.

Now, let’s go to “the rest” of the molecule: AAs 314 – 831 (518 AAs). No recognizable domains here.

What is the behaviour of this sequence in natural history?

Again, let’s start again from the human sequence and blast it.

With Prokaryotes: no homologies

With Fungi: no homologies

C. elegans: no homologies

Drosophila melanogaster: no homologies

Let’s go to non vertebrate chordates:

Cephalochordata (Branchiostoma floridae):  no significant homologies

Tunicata (Ciona intestinalis): no significant homologies

So, there is no significant homology in the whole range of eukaryotes, excluding vertebrates and including chordates which are not vertebrates.

Now, what happens with vertebrates?

Here are the numbers:

Cartilaginous fishes (Callorhincus milii): 350 identities, 429 positives, expect 0.0 (597 bits)

Bony fishes (Lepisosteus oculatus): 396 identities, 446 positives, expect 0.0 (662 bits)

Mammals (Mouse): 466 identities, 491 positives, expect 0.0 (832 bits)

IOWs, what we see here is that the no domain part of the molecule is practically non existent in prokaryotes, in single celled eukaryotes and in all multicellular beings which are not vertebrates. In vertebrates, the sequence is not only present in practically all vertebrates, but it is also extremely conserved, from sharks to humans. So, we have a steep informational jump from non chordates and non vertebrate chordates, where the sequence is practically absent, to the very first vertebrates, where the sequence is already highly specific.

What does that mean from an ID point of view? It’s simple:

a) The sequence of 517 AAs which represents the major part of the human protein must be reasonably considered highly functional, because it is strongly conserved throughout vertebrate evolution. As we have said in the beginning, the only reasonable explanation for high conservation throughout a span of time which must be more than 400 million years long is the presence of strong functional constraints in the sequence.

b) The sequence and its function, whatever it may be (but it is probably an important regulatory function) is highly specific of vertebrates.

We have here a very good example of a part of a protein which practically appears in vertebrates while it is absent before, and which is reasonably highly functional in vertebrates.

So, to sum up:

  1. Prickle 1 is a functional protein which is found in all eukaryotes.
  2. The human sequence can be divided in two parts, with different properties.
  3. The first part, while undergoing evolutionary changes, is rather well conserved in all eukaryotes. Its function can be better understood, because it is made of known domains with known structure.
  4. The second part does not include any known domain or structure, and is practically absent in all eukaryotes except vertebrates.
  5. In vertebrates, it is highly conserved and almost certainly highly functional. Probably as a regulatory epigenetic sequence.
  6. For its properties, this second part, and its functional sequence, are a very reasonable object for a strong design inference.

 

I have added a graph to show better what is described in the conclusions, in particular the information jump in vertebrates for the second part of the sequence:

Graph3

Note: Thanks to the careful checking of Alicia Cartelli, I have corrected a couple of minor imprecisions in the data and the graph (see posts #83 and #136). Thank you, Alicia, for your commitment. The sense of the post, however, does not change.

Those who are interested in the evolutionary behaviour of protein Prickle 2 could give a look at my posts #127 and #137.

Comments
So during the same time that some of the chordates were evolving a bony spine, a protein now known to be involved in spinal development also evolved? Wow! Groundbreaking!Alicia Cartelli
February 3, 2016
February
02
Feb
3
03
2016
05:42 PM
5
05
42
PM
PDT
Dionisio: The first part of the protein, with its domains, shows a rather "gradual" increase in its similarity to the human sequence as time splits become shorter. As discussed, in that case it is more difficult to distinguish between neutral variation and functional tweaking of the molecule in different species. There are tools, obviously: a detailed study of structure and function of the different forms, Ka/Ks analysis, and so on. However, I have not looked at those aspects (and I am not sure at all that we have enough information to get reasonable inferences about that issue), because my focus here was mainly on the evolutionary history of the second part of the protein.gpuccio
February 3, 2016
February
02
Feb
3
03
2016
03:43 PM
3
03
43
PM
PDT
GaryGaulin: Alternative exlanation to what? I have given the facts that support the design inference based on dFSCI. You may agree or not, but I have given facts, and in great detail. What facts have you given to support any other kind of explanation? Just to know.gpuccio
February 3, 2016
February
02
Feb
3
03
2016
03:38 PM
3
03
38
PM
PDT
Gpuccio, as you say: An alternative explanation (what you call “dFSCI”) should be supported by explicit facts to be scientifically considered. And it is not.GaryGaulin
February 3, 2016
February
02
Feb
3
03
2016
02:58 PM
2
02
58
PM
PDT
gpuccio @24 I think I understood your explanation about the conclusion inferred from the big jump seen so clearly in your graph related to the protein second part (blue lines). However, my long (mouthful) question @21 is in reference to the first part of the protein that you described in the graph using the red lines.Dionisio
February 3, 2016
February
02
Feb
3
03
2016
01:56 PM
1
01
56
PM
PDT
Your ID argument seems to be that because we can’t point to a plausible evolutionary history (given the supposed time constraints and complexity of the sequence) of a section of protein in primitive vertebrates, that it must have been designed.
Grow up Zachriel. Please.Mung
February 3, 2016
February
02
Feb
3
03
2016
12:46 PM
12
12
46
PM
PDT
Zachriel: OK to your OKs. Many million years is fine with me. Many. Not "too many", however, in an evolutionary sense. Certainly less than the 400+ million years of evolution of vertebrates. You say: "Genomes sequencing is not nearly as complete as made out to be. Also, it’s possible that the sequence was present in the common ancestor, but lost in a particular lineage." OK, that's the standard defense. Is that your best? "Your ID argument seems to be that because we can’t point to a plausible evolutionary history (given the supposed time constraints and complexity of the sequence) of a section of protein in primitive vertebrates, that it must have been designed." You know my argument very well. My argument, which is essentially the ID argument, is that high levels of dFSCI (functionally specified information, in digital form) arise empirically only from design, and point to a design inference. In all the cases we know about. My argument is that 600 bits of functional information is vastly, vastly beyond the 500 bit threshold which is usually considered as Universal Probability Bound. My argument is that, in the presence of such facts, design is the best inference. An alternative explanation (what you call "a plausible evolutionary history") should be supported by explicit facts to be scientifically considered. And it is not. Therefore, dFSCI at its best is exhibited by the sequence we are discussing, and a design inference remains by far the best explanation for its origin. This is my argument, and always has been.gpuccio
February 3, 2016
February
02
Feb
3
03
2016
10:09 AM
10
10
09
AM
PDT
gpuccio: a, b, c, d, e Okay. gpuccio: f We don't claim any particular expertise. Early chordates left few fossils, though vertebrate fossils have been found during the Cambrian Explosion. The split between protostomes and deuterostomes apparently occurred well before the Cambrian Explosion. Molecular clocks show a divergence of urochordates long before the Cambrian Explosion. See Blair & Hedges, Molecular Phylogeny and Divergence Times of Deuterostome Animals, Molecular Biology and Evolution 2005. Molecular clocks are probably not terribly accurate at that distance, but it's safe to say there were many millions of years involved. gpuccio: g) Genomes sequencing is not nearly as complete as made out to be. gpuccio: h) Your ID argument seems to be that because we can’t point to a plausible evolutionary history (given the supposed time constraints and complexity of the sequence) of a section of protein in primitive vertebrates, that it must have been designed.Zachriel
February 3, 2016
February
02
Feb
3
03
2016
09:39 AM
9
09
39
AM
PDT
Mung: Zachriel is doing his best. The fact that this is his best, however, is really significant. :)gpuccio
February 3, 2016
February
02
Feb
3
03
2016
08:39 AM
8
08
39
AM
PDT
Zachriel: The point is: conservation is a matter of sequence homology. Of course it is a matter of degree: that's why in a blast it is measured in bits, and expressed as a bit score and an expect value, which are exactly the measures that I have considered in mt reasoning. And of course it is a matter of time: that's why I have compared blasts with different organisms which are references to specific times in natural history. But the main point is: conservation is conservation of the sequence, and the results of a blast measure the degree of conservation between two sequences. Do you agree? Let's go to my argument. The correct way of describing it is the following: a) We have a sequence which is highly conserved in vertebrates, and apparently not present before. b) It is part of a protein which, instead, is present before, and whose first part shows, well recognizable homologues in all eukaryotes. c) As the sequence in the second part is highly conserved in vertebrates, it is very reasonable to infer that it is functional, and under purifying selection, from its appearance in vertebrates up to humans (about 400 million years of evolution). d) As the sequence is functional and conserved, we can reasonably infer that its functional complexity is very high: in particular, we can take the difference in bit score between two specific times of natural history as a gross measure of the functional complexity, IOWs of the improbability to get that sequence by chance. e) The highest difference in bit score, for our sequence, is observed at its appearance, at the split between tunicata and vertebrates. At that point, and before the following split between cartilaginous fishes and bony fishes, the sequence has already acquired about 600 bits of its functional complexity. That's what I have called an "information jump". f) That information jump happens in a rather narrow window time: I leave it to you, who certainly are more expert than I am of these things, the task to suggest how wide it is (between the tunicata - vertebrates split and the cartilaginous fishes - bony fishes split). g) There is obviously no trace at all in the proteome of any intermediate sequences, as shown by the results of the blasts, least of all of "a plausible evolutionary history ". h) And so it is: the appearance in natural history (a rather sudden appearance) of a functional sequence, which is then highly conserved, and whose functional complexity at its appearance can be quantified at about 600 bits of specific functional information, is a very strong argument for a design inference. This is correct.gpuccio
February 3, 2016
February
02
Feb
3
03
2016
08:37 AM
8
08
37
AM
PDT
Reading further, your ID argument seems to be that because we can’t point to a plausible evolutionary history of a section of protein in primitive vertebrates, that it must have been designed. Is this correct?
Grow up Zachriel. Please.Mung
February 3, 2016
February
02
Feb
3
03
2016
08:23 AM
8
08
23
AM
PDT
gpuccio: I think there is great confusion in what you say. Perhaps. We'll reboot. gpuccio: When we say that a sequence is conserved, we mean just that simple fact: the sequence is conserved. Conservation is a matter of degree and time. Take a gene under purifying selection. The gene duplicates, and the copy experiences some modest changes, undergoes directional selection, then stabilizes with a different function under purifying selection. We look at the sequence and recognize the homology and the phylogeny. Reading further, your ID argument seems to be that because we can't point to a plausible evolutionary history of a section of protein in primitive vertebrates, that it must have been designed. Is this correct?Zachriel
February 3, 2016
February
02
Feb
3
03
2016
07:21 AM
7
07
21
AM
PDT
Zachriel: I think there is great confusion in what you say. "If the function can change, then they aren’t conserved." ??? What do you mean? When we say that a sequence is conserved, we mean just that simple fact: the sequence is conserved. The statement that a sequence is conserved between two species simply means that you find strong homology between two or more sequences in the two species. The fact that a sequence is conserved through enough evolutionary time points, according to evolutionary theory, to purifying selection. Purifying selection implies function in terms of reproductive fitness. Function linked to a conserved sequence implies functional constraint in that sequence: it means that, if the sequence changes, reproductive fitness is reduced. That's how purifying selection works. So, you must distinguish between the different consepts: a) Sequence conservation is just sequence conservation. You observe it by observing sequences, not by reasoning about function (in science, that is called an observed fact). b) According to evolutionary theory, we infer function for the sequences which are conserved, in terms of reproductive fitness. That is how purifying selection is supposed to work. We infer function because we know that a sequence, if not functional, changes through time because of neutral variation. c) For my reasoning here, it is irrelevant what the function is. The only relevant point is that the sequence is conserved, and therefore functional, and therefore under functional constraints. You say: "Yes. Paralogs form by gene duplication, so one or both can drift, contrary to your statement that the “sequence cannot be significantly transformed by random variation”. Hemoglobin is such an example." But that's exactly because globins are less functionally constrained at the sequence level, and therefore can undergo sequence variation retaining function. Indeed, in my discussion with Eric Anderson here: https://uncommondescent.com/atheism/ba77-and-a-vid-on-foxp-123-molecular-trees-vs-dawkins-claim-of-you-get-the-same-family-tree/ I have used exactly myoglobin as an example of protein which changes at sequence level, while retaining the same function. I post here, for your consideration, the pertinent part from my post #30 in that thread:
I have said many times that, IMO, the strongest argument at the molecular level for CD is not the homologies, but the differences. My point is: I accept that neutral variation happens, if there is not a strong functional constraint which translates into negative (purifying) selection. IOWs, take histone H3, a 136 AAs protein which is practically the same in all eukaryotes: it does not change, except for really trivial differences, throughout something like 1 – 2 billion years (impossible to say exactly when eukaryotes first appeared). Is that an argument for CD? Yes and no. It is an argument for a very strong functional constraint on histone H3, and if we accept CD it is an argument for extremely strong purifying selection on the protein. IOWs, it is an argument for the designed origin of the protein (a sequence of 136 AAs which has to be, in some way, almost exactly that specific sequence has a lot of functional information in it). But the fans of common design could simply argue that, exactly because there is such a functional constraint, a designer who redesigns each living species can only use that sequence. So, simple strong homology and conservation is not the best argument fro CD. Now, take myoglobin. Human myoglobin is 154 AAs long. Globins are rather “simple” globular proteins, whose 3D structure is rather well conserved even in presence of great differences in sequence. Now, if we blast human myoglobin against chimp, we have 153/154 identities (99%), expect 2e-108. IOWs, almost the same sequence. And a few million years of chronological split. Let’s go to mouse. Against human, we have 129/154 identities (89%), expect 1e-89. The two proteins are still very similar, but less. And we have 80 – 100 million years of chronological split. Let’s go to bony fishes. We have 71/149 identities (48%), with an expect of 6e-37. And a time split of about 400 million years. And so on. Now, my point is: these molecules are rather similar. They do more or less the same thing in different species. They have similar 3d structure. OK, there could be different functional constraint to explain some of the sequence differences, but… frankly, the best explanation for the growing differences at sequence level between homologue molecules with the same function and 3d structure is simply: neutral variation through time. And indeed, as the time split grows, so grows the sequence difference. This is a good argument for CD: not the homologies, but the differences in similar molecules, differences which are proportional to time separation between species.
I hope that helps. Finally, you say: "2b is standard adaptation. A protein which might be under strong purifying selection in one context may evolve when put in a different context, e.g. angiogenin. Not sure how you distinguish this from design." 2b means that the sequence changes, and acquires a new function (or simply a function). The design inference in that case is linked to the functional complexity of that function. In a sense, the appearance of a new functional sequence, like the second part of the Prickle molecule sequence in vertebrates, is a case of 2b: that sequence did not exist in previous species, indeed other sequences were present in previous species associated with the conserved "first part". Then, suddenly, in vertebrates, a new long sequence appears, and then it remains highly conserved, which proves that it is unsed purifying selection, and therefore functional. It's as simple as that.gpuccio
February 2, 2016
February
02
Feb
2
02
2016
10:33 PM
10
10
33
PM
PDT
Zachriel: If the function can change, then they aren’t conserved. Sure it is. You're making an assumption and begging the question.Mung
February 2, 2016
February
02
Feb
2
02
2016
02:41 PM
2
02
41
PM
PDT
gpuccio: Remember that proteins with different general functions can share conserved domains, whose strict function, however, usually remains similar. If the function can change, then they aren't conserved. gpuccio: However, do you agree that if a sequence resists change by neutral variation, it can only be because it is under purifying selection, and therefore functional? Yes. Paralogs form by gene duplication, so one or both can drift, contrary to your statement that the "sequence cannot be significantly transformed by random variation". Hemoglobin is such an example. gpuccio: 2. Differences between homologues, instead, can have two completely different meanings: 2a) They can be the result of accumulating neutral variation in parts of the molecule which are not functionally constrained 2b) They can be the expression of differences in function in different species and contexts 2b is standard adaptation. A protein which might be under strong purifying selection in one context may evolve when put in a different context, e.g. angiogenin. Not sure how you distinguish this from design.Zachriel
February 2, 2016
February
02
Feb
2
02
2016
01:48 PM
1
01
48
PM
PDT
Mung: Thank you! I don't know if the process can be automated (probably yes), but the method is not too complex. I would look preferentially at regulatory proteins, especially Transcription Factors, which often have large parts of the molecule which are unexplained, while the DNA binding site is usually conserved and corresponds to well known motifs. The important point is to verify if the non explained parts are conserved, and in what range of natural history. That can easily be done with a few blasts with reference organisms or groups of organisms. I will try to look for some more examples, in the future. The Prickle protein is in no way special: it's just an example which I looked at recently.gpuccio
February 2, 2016
February
02
Feb
2
02
2016
12:58 PM
12
12
58
PM
PDT
Nice post. Any opinion on whether it would be possible to automate a process to identify other similar sequences? Would we first search for sequences of some minimum length? Then look within that sequence for a sub-sequence of some minimum length where there is no known something but where the same sequences also contains one of more sub-sequences of some known something?Mung
February 2, 2016
February
02
Feb
2
02
2016
09:03 AM
9
09
03
AM
PDT
EugeneS: Thank you for the kind comment.gpuccio
February 2, 2016
February
02
Feb
2
02
2016
08:26 AM
8
08
26
AM
PDT
KF: I have just given one example, but that kind of information jump is rather common. Both eukaryotes versus prokaryotes, and vertebrates versus other metazoa, are critical nodes where a lot of information is gained at sequence level. And there are many others. A common mistake is to underestimate information jumps which happen just in parts of a molecule, and that is one point that I wanted to stress in my post.gpuccio
February 2, 2016
February
02
Feb
2
02
2016
08:24 AM
8
08
24
AM
PDT
Zachriel: My #1 is that homologues with sequences which are strongly conserved can only be explained as the result of purifying selection, and therefore of functional constraints on the sequence. Do you disagree? Paralogs can be different things, and I don't see how they contradict the principle I have stated. Could you maybe give examples? Remember that proteins with different general functions can share conserved domains, whose strict function, however, usually remains similar. However, do you agree that if a sequence resists change by neutral variation, it can only be because it is under purifying selection, and therefore functional? It seems that this is a fundamental principle in evolutionary theory.gpuccio
February 2, 2016
February
02
Feb
2
02
2016
08:20 AM
8
08
20
AM
PDT
gpuccio: I know you love cryptic and deep statements, but could you please be kind and explain better what you mean? Not sure if it is so cryptic, much less deep. A paralog is a homolog that has a different function than the ancestor, which seems to contradict your #1 above.Zachriel
February 2, 2016
February
02
Feb
2
02
2016
08:11 AM
8
08
11
AM
PDT
GPuccio, As usual, a very good and very plainly written OP. Could I suggest to the people who run the blog to develop the search functionality a bit further e.g. to be able to look for OPs written by an author. That would be really handy.EugeneS
February 2, 2016
February
02
Feb
2
02
2016
08:10 AM
8
08
10
AM
PDT
GP, a 500+ bit info jump is of course just what the doctor ordered; dear (and most gentle) physician. KFkairosfocus
February 2, 2016
February
02
Feb
2
02
2016
07:33 AM
7
07
33
AM
PDT
Dionisio: I believe that strong functional constraints which point to a strict target space are the best indicator of design. In the case of the second part of the Prickle proteins, the sudden jump in informational content in a relatively small evolutionary time is a very reliable indicator of a designed origin. I am not aware of any other reasonable explanation for such a scenario.gpuccio
February 2, 2016
February
02
Feb
2
02
2016
06:36 AM
6
06
36
AM
PDT
Zachriel: Welcome to the discussion: after all, one word is better than nothing! :) I know you love cryptic and deep statements, but could you please be kind and explain better what you mean? Thank you in advance.gpuccio
February 2, 2016
February
02
Feb
2
02
2016
06:33 AM
6
06
33
AM
PDT
gpuccio: 1. Given those premises, homologies through natural history are certainly an indicator of functional constraints, because they mean that some sequence cannot be significantly transformed by random variation. Paralogs.Zachriel
February 2, 2016
February
02
Feb
2
02
2016
06:03 AM
6
06
03
AM
PDT
gpuccio As far as you know, what would be the most detailed illustrated explanation of at least a hypothetical scenario for the different functional constraints and neutral evolution that occurred on the protein prickle from Prokaryotes to fungi, then to c. elegans, an later to Drosophila melanogaster, that I could look at to learn about this? Thank you.Dionisio
February 2, 2016
February
02
Feb
2
02
2016
05:25 AM
5
05
25
AM
PDT
Dionisio at #19: Yes.gpuccio
February 2, 2016
February
02
Feb
2
02
2016
04:03 AM
4
04
03
AM
PDT
gpuccio @17
[...] those known domains show some “graduality” of modification in eukaryotes [...]
is the above quoted statement associated with the below one taken from your OP?
[...] it can be explained as a mixed result of different functional constraints and neutral evolution in different time splits [...]
Dionisio
February 2, 2016
February
02
Feb
2
02
2016
03:17 AM
3
03
17
AM
PDT
The added graph does illustrate what you explained so well. Definitely a picture is worth a thousand words. :) Thank you. BTW, that impressive jump could easily win an Olympic gold medal, but also it could raise suspicion about doping, thus prompting some serious folks to demand an investigation to search for a valid explanation. :)Dionisio
February 2, 2016
February
02
Feb
2
02
2016
03:07 AM
3
03
07
AM
PDT
1 30 31 32 33

Leave a Reply