Uncommon Descent Serving The Intelligent Design Community

The spliceosome: a molecular machine that defies any non-design explanation.

Share
Facebook
Twitter
LinkedIn
Flipboard
Print
Email

OK, let’s start with a very simple fact: eukaryotic genes have introns.

IOWs, they are not continuous. They are made of exons and introns: exon – intron – exon – intron – exon and so on. Exons code for the protein. Introns don’t.

So, when the content of the gene is copied to the mRNA, introns must be cut away, and only exons are retained, in order to be translated, so that the mature mRNA can be transferred to the cytoplasm and translated by the ribosome.

This process of removing introns is called splicing.

Now, a few clarifications:

a) Introns exist in prokaryotes too, but they are rather rare. For our purposes, we will only discuss introns in eukaryotes.

b) Introns exist in many different types of genes. For our purposes, we will discuss only those in protein coding genes.

c) The origin and possible function of introns is, still, a mystery.

d) Introns are usually longer than exons. In humans, for example, they amount to approximately 35% of the whole geneome, vs about 1.5% of coding exons.

e) However, the amount and length of introns can vary a lot in different organims. An extreme example is yeast (s. cerevisiae), whose genome contains a very small amount of introns (about 250 out of about 6250 genes).

When the gene if transcribed, both exons and introns are transcribed. A 5′ UTR (Untranslated region) and 3′ UTR, is also part of the pre-mRNA.

 

Fig. 1

Pre-mRNA is the first form of RNA created through transcription in protein synthesis. The pre-mRNA lacks structures that the messenger RNA (mRNA) requires. First all introns have to be removed from the transcribed RNA through a process known as splicing. Before the RNA is ready for export, a Poly(A)tail is added to the 3’ end of the RNA and a 5’ cap is added to the 5’ end. – By Nastypatty (Own work) [CC BY-SA 4.0 (https://creativecommons.org/licenses/by-sa/4.0)], via Wikimedia Commons

So the question is: how are introns removed from pre-mRNA? IOWs, how is splicing achieved?

And there is more. As everybody probably knows, splicing is not always done in the same way. Different isoforms of the same protein can be obtained by alternative splicing, and they can have functional differences. I will not go into details about that, but here is the Wikipedia page about alternative splicing:

https://en.wikipedia.org/wiki/Alternative_splicing

So, how is splicing done, and how is alternative splicing regulated?

We know much about the first question, very little about the second.

There are three ways to perform splicing:

  1. Spliceosomal splicing
  2. Self-splicing
  3. tRNA splicing

The last two modalities are rare, and can be found both in prokaryotes and eukaryotes. I will not discuss them here.

So, the subject of this OP is spliceosomal splicing, which is restricted to eukaryotes.

Moreover, I will discuss only the major spliceosome, which is responsible for the vast majority of splicing in eukaryotes. It must be said, however, that also a minor spliceosome exists, and that it acts in a minority of cases.

So, the spliceosome.

The first important point is:

It is an amazing molecular machine. Even more, it is an amazing molecular cycle, involving many different stages each of which is an amazing molecular machine.

Let’s see. Here is a figure which summarizes the main stages of the spliceosome cycle:

 

Fig. 2

 

Spliceosomal splicing cycle – By JBrain [CC BY-SA 2.0 de (https://creativecommons.org/licenses/by-sa/2.0/de/deed.en)], via Wikimedia Commons

To make it simple, the spliceosome units are built upon 5 specific RNAs, called small nuclear RNAs (snRNA). These are, in humans:

U1 (164 bases), U2 (187 bases), U4 (145 bases), U5 (116 bases), U6 (107 bases)

They are transcribed from multiple gene copies. While their sequences are not particularly conserved (U6 being the most conserved of all), their secondary structure seems to be very conserved.

snRNAs are very important in the spliceosome, because they seem to be responsible for the catalytic activities.

Each of the 5 snRNAs forms a complex with proteins, and the complex takes the name of snRNP. The whole spliceosome includes at least 145 different proteins, maybe more, some of which are still not well known.

I will mention here some of the most important:

 

U1 snRNP:

U1 snRNP 70 kDa  (P08621, 437 AAs)

U1 snRNP A (P09012, 282 AAs)

U1 snRNP C  (P09234, 159 AAs)

Sm proteins: 7 small proteins (76 – 240 AAs) which form the “Sm core” ring in spliceosome subunits U1, U2, U4, U5, a ring which hosts the specific snRNA molecule.

 

U2 snRNP:

U2 snRNP A’ (P09661, 255 AAs)

U2 snRNP  B” (P08579, 225 AAs)

SF3a120 (Q15459, 793 AAs)

SF3a66 (Q15428, 464 AAs)

SF3a60 (Q12874, 501 AAs)

SF3b155 (O75533, 1304 AAs)

SF3b145 (Q13435,  895 AAs)

SF3b130 (Q15393, 1217 AAs)

SF3b49 (Q15427, 424 AAs)

SF3b14a/p14 (Q9Y3B4, 125 AAs)

SF3b10 (Q9BWJ5, 86 AAs)

Sm proteins

 

U4/U6 snRNP:

Prp3 (O43395, 683 AAs)

Prp31 (Q8WWY3, 499 AAs)

Prp4 (O43172, 522 AAs)

Cyph ( O43447, 177 AAs)

15.5 K (P55769, 128 AAs)

Sm proteins (for the U4 snRNA)

Lsm proteins: a number of proteins similar to Sm proteins (usually Lsm 2-8), which form a specific ring for the U6 snRNA.

 

U5 snRNP:

Prp8 (Q6P2Q9, 2335 AAs)

BRR2 (O75643, 2136 AAs)

Snu114 (Q15029, 972 AAs)

Prp6 (O94906, 941 AAs)

Prp28 (Q9BUQ8, 820 AAs)

52 K (O95400, 341 AAs)

40 K (Q96DI7, 357 AAs)

Sm proteins

 

Additional proteins in the U4/U6/U5 complex:

hSnu66 (O43290, 800 AAs)

hSad1 (Q53GS9, 565 AAs)

27 K (Q8WVK2, 155 AAs)

 

OK, these are only the main components, and the best understood. We are still far from the sum total of 145/150 proteins which are involved in the spliceosome cycle.

But how does it work?

Always in brief. Here is a typical exon-intron structure:

Fig. 3

 

Parts of an intron – By miguelferig (Own work) [Public domain], via Wikimedia Commons

 

GU, A and AG are nucleotides almost universally conserved in all introns, approximately at the positions shown in the figure, and which have a fundamental role in splicing. However, the real stuff is much more complex than this (see the splicing code section). GU (n. 4 in the figure) is near to the 5′ end of the intron, AG (n. 1) near to the 3′ end. The A (n. 3 in the figure) is called “the branch point”. The py-py-py (n. 2 in the figure) is the “polypyrimidine tract”.

a) The U1 subunit binds to the GU sequence at the 5′ splice site in the intron

b) The U2 subunit binds to the “branch point”.

c) The U4/U6/U5 binds to the complex.

d) Numeorus further modifications take place, causing the formation of a “lariat” (including the intron), which is then cleaved, while the two exons are ligated.

I will spare you the many complexities in all the various steps, which are well summarized (in a very simplified way) in this Wikipedia page:

RNA splicing

See in particular the “Formation and activity” section.

Or, if you like more detail, here:

Spliceosome Structure and Function

And here is a very good video on the whole splicing process in yeast:

 

 

Now, if somebody still has doubts about the complexity of this molecular machine/process, let’s consider some important aspects.

  • 1. The spliceosome is a molecular machine which appears in eukaryotes.

I quote from this paper (in the abstract):

Origin and evolution of spliceosomal introns

There is no indication that any prokaryote has ever possessed a spliceosome or introns in protein-coding genes, other than relatively rare mobile self-splicing introns.

The following Table shows how some of the main proteins involved in the spliceosome activity show practically no trace of homology in prokaryotes. I have included also included in the table two examples  which show low homologies due to some domain which is already expressed in prokaryotes: Prp4, whose 209 bits of homology are due to a specific domain, WD40, and Prp28, which exhibits 313 bits linked to the DEXDc domain. The point is: many of the spliceosome proteins are complete novelties in eukaryotes, but not all of them.

 

 Protein Bacteria Archea
U1 snRNP 70 kDa  (P08621, 437 AAs) 67
U2 SF3b130 (Q15393, 1217 AAs) 43.5
U4/U6 hPrp3 (O43395, 683 AAs)
U4/U6/U5 hSnu66 (O43290, 800 AAs)
U5 Prp8 (Q6P2Q9, 2335 AAs)
U5 Prp6 (O94906, 941 AAs) 100
U4/U6 hPrp4 (O43172, 522 AAs) 209 150
U/5 Prp28 (Q9BUQ8, 820 AAs) 313 286

 

I will analyze in more detail one of the most important proteins in the spliceosome, Prp8, in the last part of this OP

(Just a technical note: if you blast Prp8, you will find 3 hits which are obviously  an error due to unverified sequences, probably cases of contamination).

  • 2. The spliceosome is a molecular machine which is universally present in eukaryotes.

All eukaryotes have introns, even if in very different degrees, and the spliceosome, even if in some organisms parts of the spliceosome complex can be lost. For a more detailed discussion, look at the Rogozin paper quoted above (Origin and evolution of spliceosomal introns), in particular the section:

Intron density, size and distribution in protein coding genes across the eukaryote domain

I quote this important conclusion:

As pointed out above, despite the existence of numerous, diverse intron-poor genomes, eukaryotes do not lose the “last” intron or the spliceosome although degradation of the spliceosome including loss of many components does occur, e.g. in yeast. The only firmly established exception is the tiny genome of a nucleomorph (an extremely degraded intracellular symbiont of algae) that has lost both all the introns and the spliceosome [7]; preliminary genomic data indicate that all introns might have been lost also in a microsporidium, a highly degraded intracellular parasite distantly related to Fungi [54].

So, whe can conclude that both introns and the spliceosome are a universal feature of eukaryotes, the few exceptions being simply cases of loss of information.

  • 3. The spliceosome is a molecular machine whose information is extremely conserved throughout eukaryotes, up to humans.

I have already mentioned that the 5 RNAs which form the core of the spliceosome are not extremely conserved at sequence level, even if they are extremely conserved at structure level.

However, many of the proteins that compose the spliceosome show an amazing sequence conservation throughout eukaryotes. Now, even if we cannot be certain of when eukaryoyes really emerged, and of their early evolutionary history (both issues being at present highly controversial), we can reasonably assume that protein sequences which are highly conserved in all eukaryotes have been conserved for something like 2 billion years (more or less). As anybody who has followed my previous OPs about information conservation in vertebrates certainly knows, that is an evolutionary time frame which certainly allows us to equal conservation to extremely high functional constraint.

But how conserved are spliceosome proteins? We can analyze a few of them with my usual methodology: looking at human conserved information. The results shows that many proteins involved in the spliceosome are amazingly conserved in all eukaryotes. While there are a few cases which have a rather different evolutionary history, this is by far the most common behaviour for spliceosomal proteins.

Here is a sample of some important sequences that show high homology with the human form in all major groups of single celled eukaryotes. The 5 groups of single celled organisms chosen here, indeed, cover rather well the whole range of single celled eukaryotes.

 

Fig. 4

 

These four important proteins, as shown, have an amazing amount of information shared with the human form, ranging from more than 1000 to more than 4000 bits. In bits per AA, the range goes from 0.88 to 1.80 bits per aminoacid (baa).  As can be seen, the highest homology is found in fungi, as expected, because fungi are the most likely ancestors of metazoa. The lowest homologies are observed in Naegleria (Excavata) or in Alveolata.

Of course, these proteins remain highly similar to the human form in the following evolutionary history in Metazoa.

So, we can safely state that most spliceosomal proteins, while emerging almost entirely in eukaryotes and showing only trivial homologies with prokaryotes, were probably already universally present in the Last Universal Eukaryotic Ancestor (LECA), and in a form already very similar at sequence level to what we observe in metazoa and in humans.

  • 4. The spliceosome is a wonderful example of irreducibly complexity.

OK, we have already said that splicing can be achieved in at least three different ways. For example, bacterial introns, although rare, are of the self-splicing type. So, we know that the generic function of splicing introns can be implemented in different ways.

But eukaryotic introns are of the spliceosomal type, and they are spliced only by the spliceosome.

We have also said that a minor spliceosome also exists. It shares some featues with the major spliceosome, but it is a different structure and acts on different, and much rarer, introns.

So, for the vast majority of eukaryotic introns, the major spliceosome, and only the major spliceosome, can effectively accomplish the splicing.

Now, I don’t mean here that the major spliceosome must always be absolutely complete, with all its 150 proteins, to be able to work. That’s not what I mean when I say that it is irreducibly complex.

Maybe in some organisms the spliceosome can be partially defective, and still work. It is difficult to say, because we still don’t understand the role of all the components of the spliceosome.

But however, as far as we can understand, most of the principal features must be present, because, as we have seen, the splicing is the result of a complex cycle, involving the RNAs and the subunits, and all stages are essential to the final result.

So, the spliceosome is certainly highly irreducibly complex, even if we may not be able to clearly identify the essential nucleus of molecules which is absolutely necessary to the minimal function.

Moreover, the spliceosome would be useless if spliceosomal introns did not exist, with their properties and code (see later), and spliceosomal introns could not exist if the spliceosome were not there to splice them, because otherwise transcription would be completely ineffective. So, in that sense, spliceosomal introns and the spliceosome are a good example of chicken egg paradox, or we could also say that they form an irreducibly complex system at a higher level. And remember, the whole system seems to have been already present, very much similar to its current form in humans, in LECA, as we have seen.

But, of course, our darwinist friends will simply say that they have co-evolved!  🙂

Moreover, we can and should ask ourselves: why is the spliceosome so complex? The answer is not easy, because we still understand very little, but it is certainly related to the complexity of the splicing code, and to the fundamental issue of alternative splicing.

The Splicing code.

To splice introns by localizing 5 (or a few more) conserved nucleotides and then cutting at the ends of a lariat and rejoining the two exons is certainly a complex task, but apparently not so complex  that one of the biggest and most impressing known molecular machines is needed for that. But the simple truth is that recognizing and appropriately splicing all introns is a much more complex task than that.

That is due to the simple fact that conserved nucleotides at the ends are not a sufficient signal to identify the segment that has to be spliced, and that a lot of other components (not always well understood) are necessary to that, and that the splicing is not made always in the same way, and that alternative splicing is a very powerful tool for transcription regulation.

The subject is very complex, and I will not deal with it in depth. However, Those interested could look at this recent review:

The splicing code.

Unfortunately, the paper is paywalled, but the abstract is very informative.

The complexity of the splicing code can give us some insight about the true reasons for the complexity of the spliceosome, and definitely supposrts the idea that the whole system, introns, splicing code and spliceosome, is irreducibly complex.

The Prp8 protein.

I will add a few words about this proteins, which is probably the most amazing component of the spliceosome, and well represents its essential features.

This protein has many amazing charactertistics:

a) It is, as far as I can say, the longest protein in the whole spliceosomal system, and a very long protein indeed: 2335 AAs.

b) It is completely absent in prokaryotes.

c) It is extremely conserved in eukaryotes, probably the most conserved protein in the whole spliceosomal complex (see Fig. 4). In Naegleria, we have 3345 bits of homology with the human protein, corresponding to 1.4 baa, while in fungi we have the amazing result of 4211 bits of human-conserved information, corresponding to 1.8 baa. IOWs, in fungi we already find almost 90% of the functional information present in human Prp8 (remember, the highest possible bitscore is about 1.2 baas), corresponding to 1995 identities (86%) and 2155 positives (92%): a result which is incredibly rare, considering that such information has been conserved for about 2 billion years.

Just to emphasize the importance of this fact, here is the blast between the human protein and the best hit in fungi (Basidiobolus meristosporus):

 

 

d) It is a protein which is extremely important for the function of the spliceosome. Here is a very good paper about this protein and its functional relevance:

Prp8 protein: At the heart of the spliceosome

I quote from the conclusions:

Prp8p is central to the expression of all nuclear intron-containing mRNAs. In higher eukaryotes, it is responsible for processing thousands of transcripts in alternative splicing pathways, and in both U2 and U12 spliceosomes. It is important for the pathology of human disease, as all eukaryotic pathogens and parasites require Prp8p to functionally express their genes, in some cases via the trans-spliceosome. Retinitis Pigmentosa, a human genetic disorder that causes progressive blindness, positions Prp8p as a target for therapeutic medicine.

Moreover, another way to assess the functional constraints of a protein is to check its tolerance to polymorphisms in humans. That can be done consulting a very recent and useful database, the ExAC browser, which reports data from about 60000 human genomes.

ExAC gives two important metrics to assess how much a protein is tolerant to polymorphisms and variants. I quote here from the site FAQ:

What are the constraint metrics?

For synonymous and missense, we created a signed Z score for the deviation of observed counts from the expected number. Positive Z scores indicate increased constraint (intolerance to variation) and therefore that the gene had fewer variants than expected. Negative Z scores are given to genes that had a more variants than expected.

For LoF, we assume that there are three classes of genes with respect to tolerance to LoF variation: null (where LoF variation is completely tolerated), recessive (where heterozygous LoFs are tolerated), and haploinsufficient (where heterozygous LoFs are not tolerated). We used the observed and expected variants counts to determine the probability that a given gene is extremely intolerant of loss-of-function variation (falls into the third category). The closer pLI is to one, the more LoF intolerant the gene appears to be. We consider pLI >= 0.9 as an extremely LoF intolerant set of genes.

Now, for Prp8 we get the following results:

Constraint
from ExAC
Expected
no. variants
Observed
no. variants
Constraint
Metric
Synonymous 366.6 460 z = -3.02
Missense 920.2 266 z = 10.55
LoF 92.6 8 pLI = 1.00

That means the following:

a) The number of observed missense variants is so low vs expected (266 vs 920.2) that the z value is 10.55 (IOWs, 10.55 standard deviations, more than 10 sigma). Believe me, this is a really exceptional result, most proteins are much more tolerant to missense variants.

b) The probability of loss of function is 1. That means that even heterozygous LoF is absolutely not tolerated.

This is an extremely functional molecule, if ever there was one!

A brief conclusion.

To sum up the meaning of this rather long discussion, I would simply say:

  1. The introns – spliceosome system is a molecular machine of amazing complexity. I have touched only its main aspects in this OP, but believe me, there are layers of complexity there that would require a whole treatise and that I have not even started to mention here.
  2. Whoever can really believe that all this can be explained by some RV + NS model is, IMO, really admirable for his faith in a wrong paradigm.
Comments
gpuccio, Interesting analysis. Thanks. What common molecule did the proteins HBB and Prp8 come from?Dionisio
January 8, 2018
January
01
Jan
8
08
2018
01:59 PM
1
01
59
PM
PDT
Dionisio (and all interested): Again about Prp8 and its really extraordinary sequence conservation. I have compared Prp8 with another well known protein, the beta chain of hemoglobin (HBB). This is a small globular protein (147 AAs in humans) with a very important function. However, its sequence-function relationship is not extremely strict, and the protein is known to undergo some important neutral variation in long evolutionary times. So, I have taken HBB and Prp8 and compared their evolutionary history by blasting the two proteins in Danio rerio (a bony fish) and in humans, and computing the KaKs for both couples of proteins. Comparing a bony fish with humans gives us an evolutionary time of something less than 400 million years, the assumed time of the fish-tetrapods split, which is a good time frame to evaluate KaKs. Here are the results. First, the blasts: HBB (human vs Danio Rerio, 147 AAs): 172 bits 76 identities (51%) 106 positives (71%) bits per aminoacid: 1.170068 Prp8 (human vs Danio Rerio, 2335 AAs): 4786 bits 2286 identities (98%) 2319 positives (98%) bits per aminoacid: 2.049679 Now, look at the big difference between these two proteins. HBB is certainly conserved, but in about 390 million years its sequence retains "only" 51% identities, while Prp8 retains 98% of its sequence. The difference in bitscore is tremendous (4786 vs 172), but of course it is also due to the difference in length. But if we use the bits per aminoacid metrics, which corrects for length, the difference is still amazing: 2.049679 vs 1.170068 IOWs the functional specificity, or density, is double in Prp8 if compared to HBB! Now let's go to the KaKs ratio. I will remind here, for the convenience of all, that Ka is the number of nonsynonymous substitutions per non-synonymous site (IOWs, those substitutions which change the encoded aminoacid), while Ks is the number of synonymous substitutions per synonymous site (IOWs, those substitutions which do not change the encoded aminoacid), in the same period. Here is the simple Wikipedia page: https://en.wikipedia.org/wiki/Ka/Ks_ratio from which I quote: "The Ka/Ks ratio is used to infer the direction and magnitude of natural selection acting on protein coding genes. A ratio greater than 1 implies positive or Darwinian selection (driving change); less than 1 implies purifying or stabilizing selection (acting against change); and a ratio of exactly 1 indicates neutral (i.e. no) selection" and "The Ka/Ks ratio is a more powerful test of the neutral model of evolution than many others available in population genetics as it requires fewer assumptions." So, just to simplify: Forget the values above 1 (they are very rarely seen). Values near to 1 indicate neutral variation in the whole sequence (non synonymous mutations happen at the same rate as synonymous mutations) Values significantly lower than 1 indicate negative (purifying) selection: non synonymous mutations are antagonized by negative selection. In proteins where the sequence is conserved, we expect values lower than 1. To compute the KaKs I have used the R package seqinR, and the function kaks. So, let's see: HBB (Danio rerio vs human): Ka = 0.3799238 Ks = 1.399608 Ka/Ks = 0.2714501 This value is well below 1, so we can say that HBB has been under negative (purifying) selection in the last 390 million years. But let's go to Prp8: Prp8 (Danio rerio vs human): Ka = 0.0139506 Ks = 1.482462 Ka/Ks = 0.009410427 Two important points: a) The rate of synonymous substitutions (Ks) is absolutely comparable in the two proteins: HBB vs Prp8 1.399608 vs 1.482462 This means that the rate of neutral variation is rather constant. This is, as I have said many times, the best proof of molecular common descent. b) The rate of non synonymous substitutions (Ka), instead, is strikingly different. In both proteins, indeed, it is lower than the Ks, but in Prp8 it is much lower (almost 30 times lower!): HBB vs Prp8 0.3799238 vs 0.0139506 As a consequence, the Ka/Ks ratio is almost 30 times lower in Prp8: HBB vs Prp8 0.2714501 vs 0.009410427 IOWs, both proteins are under purifying selection, but the purifying selection, as measured by the low Ka/Ks, is almost 30 times stronger in Prp8! So, there can be no doubts about the functional specificity of the Prp8 sequence!gpuccio
January 8, 2018
January
01
Jan
8
08
2018
12:29 PM
12
12
29
PM
PDT
@315 correction: lncRNA involved in intron splicing?Dionisio
January 8, 2018
January
01
Jan
8
08
2018
07:23 AM
7
07
23
AM
PDT
lncRNA involved in alternate splicing?
LncRNAs can also regulate intron splicing of the sense transcripts by masking splicing sites through its complementary sequences. For example, alternative splicing competitor lncRNA (ASCO-lncRNA) can hijack nuclear speckle RNA-binding protein (NSR) to alter splicing patterns of transcripts in response to auxin in Arabidopsis [...]
Genome-wide screening and characterization of long non-coding RNAs involved in flowering development of trifoliate orange (Poncirus trifoliata L. Raf.). Wang CY1, Liu SR1, Zhang XY1, Ma YJ1, Hu CG1, Zhang JZ1 Sci Rep. 2017 Feb 24;7:43226. doi: 10.1038/srep43226.
The known -not the unknown- clearly points to complex functionally specified informational complexityDionisio
January 8, 2018
January
01
Jan
8
08
2018
07:19 AM
7
07
19
AM
PDT
@313: That makes sense. Thanks.Dionisio
January 8, 2018
January
01
Jan
8
08
2018
02:47 AM
2
02
47
AM
PDT
Dionisio: "How did you single out Prp8?" I sorted out Prp8 essentially because it was the longest protein, it had no sequence homologues in prokaryotes and it was extremely conserved in eukaryotes. Yes, I looked at most other proteins in my list too, and I could see that, while those features were common to most of them, Prp8 was certainly the one which best exhibited them. While I went on deepening my understanding of the details of the spliceosome, I also realized that there were a lot of structural and functional reasons to consider Prp8 as the key protein in the spliceosome. So, this is IMO a very good demonstration that following the tracks of functional information by the methodology I use is a very good and effective way to detect extremely objective functional features! :)gpuccio
January 8, 2018
January
01
Jan
8
08
2018
01:58 AM
1
01
58
AM
PDT
OP brief summary:
So, the subject of this OP is spliceosomal splicing, which is restricted to eukaryotes. So, the spliceosome. A few important points about it:
It is an amazing molecular machine. Even more, it is an amazing molecular cycle, involving many different stages each of which is an amazing molecular machine.
1. The spliceosome is a molecular machine which appears in eukaryotes. 2. The spliceosome is a molecular machine which is universally present in eukaryotes. 3. The spliceosome is a molecular machine whose information is extremely conserved throughout eukaryotes, up to humans. 4. The spliceosome is a wonderful example of irreducibly complexity.
The Splicing code. The Prp8 protein.
a) It is, as far as I can say, the longest protein in the whole spliceosomal system, and a very long protein indeed: 2335 AAs. b) It is completely absent in prokaryotes. c) It is extremely conserved in eukaryotes, probably the most conserved protein in the whole spliceosomal complex d) It is a protein which is extremely important for the function of the spliceosome.
Dionisio
January 8, 2018
January
01
Jan
8
08
2018
01:21 AM
1
01
21
AM
PDT
gpuccio, In the OP figure #2 Prp24 is shown in the center. Prp3, Prp31, Prp4 are listed under U4/U6 snRNP. Prp8, Prp6, Prp28 are listed under U5 snRNP. Then you wrote:
The Prp8 protein. I will add a few words about this proteins, which is probably the most amazing component of the spliceosome, and well represents its essential features. This protein has many amazing charactertistics
How did you single out Prp8? Did you look at the other Prp* proteins mentioned above too? Thanks.Dionisio
January 8, 2018
January
01
Jan
8
08
2018
01:05 AM
1
01
05
AM
PDT
@295: "the spliceosome is a unique protein-directed metalloribozyme." :)Dionisio
January 8, 2018
January
01
Jan
8
08
2018
12:32 AM
12
12
32
AM
PDT
@295: "RNA elements account for more than half of the total molecular mass in the bacterial or mammalian ribosome, but less than 10% in the human spliceosome." :)Dionisio
January 8, 2018
January
01
Jan
8
08
2018
12:29 AM
12
12
29
AM
PDT
@295: "The placement and conformation of the RNA elements in the spliceosome are determined in large part by surrounding protein components." :)Dionisio
January 8, 2018
January
01
Jan
8
08
2018
12:26 AM
12
12
26
AM
PDT
@295: “...the overall structure and operational principles of the spliceosome are markedly different from those of the group II introns...” "...the catalytic activity of the spliceosome, but not of the group II intron, is directed by the protein components.” :)Dionisio
January 7, 2018
January
01
Jan
7
07
2018
06:27 PM
6
06
27
PM
PDT
@294: "The rigid protein components include the bulk of the central spliceosomal protein Prp8" Did we read about this protein Prp8 somewhere else in this thread? :)Dionisio
January 7, 2018
January
01
Jan
7
07
2018
05:00 PM
5
05
00
PM
PDT
Professor Arthur Hunt posted his last comment in this discussion @164 last December 26, 2017. Since then gpuccio has addressed to him 17 comments with very insightful explanations. Perhaps professor Hunt is getting ready to respond to gpuccio? We look forward to reading his next comments.Dionisio
January 7, 2018
January
01
Jan
7
07
2018
12:43 PM
12
12
43
PM
PDT
Discussion between AH and GP Index of posted comments: AH @25 …….……. GP @28 AH @50 AH @51 …….……. GP @54 AH @56 * …….……. GP @60 …….……. GP @69 …….……. GP @75 …….……. GP @86 …….……. GP @98 …….……. GP @106 …….……. GP @118 …….……. GP @127 …….……. GP @129 AH @130 …….……. GP @136 …….……. GP @138 …….……. GP @146 …….……. GP @162 AH @164 …….……. GP @167 …….……. GP @176 …….……. GP @182 …….……. GP @198 …….……. GP @200 …….……. GP @201 …….……. GP @210 …….……. GP @211 …….……. GP @212 …….……. GP @231 …….……. GP @242 …….……. GP @253 ** …….……. GP @254 ** …….……. GP @263 …….……. GP @268 …….……. GP @294 …….……. GP @295 …….……. GP @296 …….……. GP @297 AH is the distinguished professor Arthur Hunt GP is the author of the excellent OP that started this discussion thread (*) first publicly admitted mistake @56 (**) not addressed to professor A. Hunt directly (to be continued…)Dionisio
January 7, 2018
January
01
Jan
7
07
2018
12:30 PM
12
12
30
PM
PDT
Upright BiPed @301:
GP at 295. "Boom"
:) :) :)Dionisio
January 7, 2018
January
01
Jan
7
07
2018
12:20 PM
12
12
20
PM
PDT
The spliceosomal proteins PPIH and PRPF4 exhibit bi-partite binding https://www.researchgate.net/profile/Tara_Davis/publication/319986193_The_spliceosomal_proteins_PPIH_and_PRPF4_exhibit_bi-partite_binding/links/5a135513aca27217b5a2f32f/The-spliceosomal-proteins-PPIH-and-PRPF4-exhibit-bi-partite-binding.pdfDionisio
January 7, 2018
January
01
Jan
7
07
2018
12:16 PM
12
12
16
PM
PDT
GP at 295. BoomUpright BiPed
January 7, 2018
January
01
Jan
7
07
2018
11:48 AM
11
11
48
AM
PDT
Any news from professor Arthur Hunt yet?Dionisio
January 7, 2018
January
01
Jan
7
07
2018
10:44 AM
10
10
44
AM
PDT
@297: Wow! Another interesting paper! Nice catch! Fishing season is on! :)Dionisio
January 7, 2018
January
01
Jan
7
07
2018
10:43 AM
10
10
43
AM
PDT
@294, 295: Journal of Molecular Biology Volume 429, Issue 17, 18 August 2017, Pages 2640-2653 Very interesting paper. Timely catch!Dionisio
January 7, 2018
January
01
Jan
7
07
2018
09:35 AM
9
09
35
AM
PDT
Arthur Hunt (and all interested): But neo-darwinists, as we very well know, are not easily discouraged! :) Here is, for example, a very recent just-so story about the evolution of our spliceosome, with some interesting connotations, even in the abstract: Domestication of self-splicing introns during eukaryogenesis: the rise of the complex spliceosomal machinery. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5709842/
Abstract The spliceosome is a eukaryote-specific complex that is essential for the removal of introns from pre-mRNA. It consists of five small nuclear RNAs (snRNAs) and over a hundred proteins, making it one of the most complex molecular machineries. Most of this complexity has emerged during eukaryogenesis, a period that is characterised by a drastic increase in cellular and genomic complexity. Although not fully resolved, recent findings have started to shed some light on how and why the spliceosome originated. In this paper we review how the spliceosome has evolved and discuss its origin and subsequent evolution in light of different general hypotheses on the evolution of complexity. Comparative analyses have established that the catalytic core of this ribonucleoprotein (RNP) complex, as well as the spliceosomal introns, evolved from self-splicing group II introns. Most snRNAs evolved from intron fragments and the essential Prp8 protein originated from the protein that is encoded by group II introns. Proteins that functioned in other RNA processes were added to this core and extensive duplications of these proteins substantially increased the complexity of the spliceosome prior to the eukaryotic diversification. The splicing machinery became even more complex in animals and plants, yet was simplified in eukaryotes with streamlined genomes. Apparently, the spliceosome did not evolve its complexity gradually, but in rapid bursts, followed by stagnation or even simplification. We argue that although both adaptive and neutral evolution have been involved in the evolution of the spliceosome, especially the latter was responsible for the emergence of an enormously complex eukaryotic splicing machinery from simple self-splicing sequences.
Emphasis mine. No comment! :)gpuccio
January 7, 2018
January
01
Jan
7
07
2018
09:34 AM
9
09
34
AM
PDT
Arthur Hunt (and all interested): This is interesting and pertinent too: Intron specificity in pre-mRNA splicing. https://link.springer.com/article/10.1007%2Fs00294-017-0802-8 Paywall.
Abstract The occurrence of spliceosomal introns in eukaryotic genomes is highly diverse and ranges from few introns in an organism to multiple introns per gene. Introns vary with respect to their lengths, strengths of splicing signals, and position in resident genes. Higher intronic density and diversity in genetically complex organisms relies on increased efficiency and accuracy of spliceosomes for pre-mRNA splicing. Since intron diversity is critical for functions in RNA stability, regulation of gene expression and alternative splicing, RNA-binding proteins, spliceosomal regulatory factors and post-translational modifications of splicing factors ought to make the splicing process intron-specific. We recently reported function and regulation of a ubiquitin fold harboring splicing regulator, Sde2, which following activation by ubiquitin-specific proteases facilitates excision of selected introns from a subset of multi-intronic genes in Schizosaccharomyces pombe (Thakran et al. EMBO J, https://doi.org/10.15252/embj.201796751, 2017). By reviewing our findings with understandings of intron functions and regulated splicing processes, we propose possible functions and mechanism of intron-specific pre-mRNA splicing and suggest that this process is crucial to highlight importance of introns in eukaryotic genomes.
gpuccio
January 7, 2018
January
01
Jan
7
07
2018
09:16 AM
9
09
16
AM
PDT
Arthur Hunt (and all interested): From the same paper quoted at #294, a very interesting conclusion, absolutely pertinent to our discussion:
As previously suggested [3], the splicing active site of the spliceosome is strikingly similar to that of the group II self-splicing intron [35,36]. These similarities include the position and coordination of the two catalytic metal ions, the fine features of metal-coordinating nucleotides, and the catalytic triplex [3,35,36]. Similar to pre-mRNA splicing, the group IIB self-splicing intron also has an intron lariat intermediate [76]. These preliminary analyses strongly argue for a common ancestry between the spliceosome and the group II introns [77–79]. However, despite these similarities, the overall structure and operational principles of the spliceosome are markedly different from those of the group II introns [35,36]. The overall structure of the U2, U5, or U6 snRNA does not have a mimicry in the group II introns. The placement and conformation of the RNA elements in the spliceosome are determined in large part by surrounding protein components. Unlike the group II introns [79], the splicing active site of the spliceosome is accommodated by a positively charged catalytic cavity on the surface of Prp8 and supported by a number of splicing factors throughout the two steps of transesterification [35,36]. The assembled spliceosome refers to at least seven distinct states, whose inter-conversion is driven by conserved ATPases/helicases [9,10]. In summary, the catalytic activity of the spliceosome, but not of the group II intron, is directed by the protein components. The spliceosome is a central RNP enzyme in all eukaryotic cells. The unique structural features of the spliceosome also sharply contrast those of the other central RNP enzyme—the ribosome. Compared with the spliceosome, the ribosome is considerably more stable with a large subunit and a small subunit, both of which are amenable to crystallization and structural investigation [80–82]. RNA elements account for more than half of the total molecular mass in the bacterial or mammalian ribosome, but less than 10% in the human spliceosome. For this and all other reasons discussed previously, the spliceosome is a unique protein-directed metalloribozyme.
Emphasis mine.gpuccio
January 7, 2018
January
01
Jan
7
07
2018
09:00 AM
9
09
00
AM
PDT
Arthur Hunt (and all interested): Here is a very recent update on the molecular functions of the spliceosome: The Spliceosome: A Protein-Directed Metalloribozyme http://www.sciencedirect.com/science/article/pii/S0022283617303479?via%3Dihub Regarding irreducible complexity, the authors find a very interesting sub-core of the spliceosome which remains remarkably constant throughout the whole spliceosomal cycle:
A rigid core of the spliceosome Despite its dynamic changes during each splicing cycle, the spliceosome—from the Bact to the ILS complex—contains a rigid core of about 20 discrete components [35,36,42–47] (Fig. 2a). These components include the entire U6 snRNA, the bulk of U5 snRNA, a portion of U2 snRNA, and at least 17 proteins. The location and the overall conformation of these components remain nearly indistinguishable among the structurally characterized yeast Bact[42,43], C [44,45], C* [46,47], and ILS [35,36] complexes. Consequently, all RNA components of the splicing active site—ISL of U6 snRNA, loop I of U5 snRNA, and helix I of the U2/U6 duplex—remain largely unchanged in all spliceosomal complexes throughout the splicing reaction (Fig. 2b). The rigid protein components include the bulk of the central spliceosomal protein Prp8 (Spp42 in S. pombe), Snu114 (Cwf10 in S. pombe) and the heptameric Sm complex of the U5 snRNP, Cef1 (Cdc5 in S. pombe), and Syf2 of the NTC, and six components of the NTR including Prp45, Prp46 (Cwf1 in S. pombe), Cwc2 (Cwf2 in S. pombe), Ecm2 (Cwf5 in S. pombe), Bud31 (Cwf14 in S. pombe), and Cwc15 (Cwf15 in S. pombe) (Fig. 2a). These proteins are organized around the three snRNA elements, stabilizing their rigid conformation, and remain largely static in the Bact, C, C?, and ILS complexes
Emphasis mine.gpuccio
January 7, 2018
January
01
Jan
7
07
2018
08:57 AM
8
08
57
AM
PDT
Dionisio: "There’s no other side in the battlefield." That's absolutely correct: no other game in town. The fight is between RV + NS and design. Nothing else makes any sense. So, as long as the third way people go on stating that RV + NS can't do it, they are really arguing in favour of design. Even if they don't like it! :)gpuccio
January 6, 2018
January
01
Jan
6
06
2018
03:37 PM
3
03
37
PM
PDT
Upright BiPed @288: All they do is claim 'natural' design, 'natural' engineering, but their former colleagues in the neo-Darwinian party don't forgive them for having used such forbidden words as 'design' even if it's sugarcoated with the adjective 'natural' in front. They third way folks crossed the demarcation line and now are being shot at by the neo-Darwinian border patrols. They either recant their newly adopted ideas and go back to where they were before or change the term 'natural' in front of 'design' to 'intelligent' and join the ID group. There's no other side in the battlefield.Dionisio
January 6, 2018
January
01
Jan
6
06
2018
12:38 PM
12
12
38
PM
PDT
gpuccio @287:
If they try to put away, or even simply minimize, the role of NS, as those third way freaks seem so eager to do, they are left literally with nothing: no explanations at all, of anything. The probabilistic barriers that NS certainly cannot overcome become then absolute barriers, that they don’t try to climb in any way, not even just going through the motions. So, design remains the only, solitary king of biological explanations. Are these third way guys our friends or our enemies? I really don’t know. And, frankly, I don’t like not to know.
They seem to be alone in the middle of the battlefield, hence they are strongly criticized by both contending sides. The neo-Darwinian folks go beyond criticism and practically call the third way folks 'defectors' and 'traitors' in their own terms. That's not polite at all. Perhaps that's the price the third way folks have to pay for buttering their bread on both sides? :)Dionisio
January 6, 2018
January
01
Jan
6
06
2018
12:15 PM
12
12
15
PM
PDT
gpuccio @286: Apparently after a quick look I rushed to the wrong conclusion and incorrectly placed the two websites in the same category. Thanks for the correction.Dionisio
January 6, 2018
January
01
Jan
6
06
2018
11:58 AM
11
11
58
AM
PDT
gpuccio @285: Agree completely. I like science, specially biology, which is becoming the queen of science, heavily relying on other branches of science: mathematics, physics, chemistry. Modern biology is closely associated with information technology, cybernetics, control systems. The fact that the big data problem in biology is turning more serious than in physics confirms the enormous complexity of the biological systems and testifies about the beauty of modern biology. Unfortunately, many folks out there prefer to waste time on political fights, instead of engaging in polite scientific discussion, like you do here.Dionisio
January 6, 2018
January
01
Jan
6
06
2018
11:42 AM
11
11
42
AM
PDT
1 2 3 4 5 6 14

Leave a Reply