RNA Processing
cap
cap
cap poly(A)
cap poly(A)
mGpppNmpNm
AAUAAA A A
A
polyadenylation
mGpppNmpNm A
A
A
3’
intron 1
Pre-mRNA
2’OH-A branch site adenosine
exon 1 exon 2
5’ G-p-G-U
- A-G-p-G 3’
intron 1 Splicing
intermediate
U-G-5’-p-2’-A
A
exon 1 exon 2
5’ G-OH
O 3’ A-G-p-G
A - 3’
U-G-5’-p-2’-A
A
3’ G-A
Spliced mRNA
exon 1 exon 2
5’ G-p-G 3’
Recognition of splice sites
• invariant GU and AG dinucleotides at intron ends
• donor (upstream) and acceptor (downstream) splice sites
are within conserved consensus sequences
donor (5’) splice site branch site acceptor (3’) splice site
G/GUAAGU..................…A.......…YYYYYNYAG/G
U1 U2
= hnRNP proteins
Spliceosome assembly
intron 1
Step 1: binding of U1
and U2 snRNPs
U2
2’OH-A
exon 1 exon 2
5’ U1
G-p-G-U
- A-G-p-G 3’
intron 1 Step 2: binding of U4, U5, U6
U2 2’OH-A
exon 1
U4 U6 exon 2
5’
U5
G-p-G-U
- A-G-p-G 3’
U1
Step 3: U1 is released,
intron 1
then U4 is released
2’OH-A
U2
exon 1
U6 exon 2
5’ G-p-G-U
- U5 A-G-p-G 3’
Step 4: U6 binds the 5’ splice site and
the two splicing reactions occur,
catalyzed by U2 and U6 snRNPs
intron 1
2’OH-A
U6 U2
U-G-5’-p-2’-A
A
mRNA 3’ G-A U5
5’ G-p-G 3’
Frequency of bases in each position of the splice sites
Donor sequences
exon intron
%A 30 40 64 9 0 0 62 68 9 17 39 24
%U 20 7 13 12 0 100 6 12 5 63 22 26
%C 30 43 12 6 0 0 2 9 2 12 21 29
%G 19 9 12 73 100 0 29 12 84 9 18 20
A G G U A A G U
Acceptor sequences
intron exon
%A 15 10 10 15 6 15 11 19 12 3 10 25 4 100 0 22 17
%U 51 44 50 53 60 49 49 45 45 57 58 29 31 0 0 8 37
%C 19 25 31 21 24 30 33 28 36 36 28 22 65 0 0 18 22
%G 15 21 10 10 10 6 7 9 7 7 5 24 1 0 100 52 25
Y Y Y Y Y Y Y Y Y Y Y N Y A G G
Polypyrimidine track (Y = U or C; N = any nucleotide)
Mutations that disrupt splicing
• βo-thalassemia - no β-chain synthesis
• β+-thalassemia - some β-chain synthesis
Intron 2 acceptor site βο mutation: no use of mutant site; use of cryptic splice site in intron 2
Translation of the retained
portion of intron 2 results in
Exon 1 Exon 2 premature termination of
Intron 1 translation due to a stop
codon within the intron, 15
codons from
Intron 2 cryptic acceptor site: UUUCUUUCAG/G the cryptic splice site
Donor site: /GU AG/: Normal acceptor site (used 10% of the time in β+ mutant)
Translation of the retained portion of intron 1 results in termination at a stop codon in intron 1
Exon 1 β+ mutation creates a new donor splice site: use of both sites
Exon 2 Exon 3
Intron 2
/GU: Normal donor site (used 60% of the time when exon 1 site is mutated)
The incorrect splicing results in a frameshift and translation terminates at a stop codon in exon 2
Patterns of alternative exon usage
• one gene can produce several (or numerous) different
but related protein species (isoforms)
Cassette
Mutually exclusive
Alternative promoters
The Troponin T (muscle protein) pre-mRNA
is alternatively spliced to give rise to
64 different isoforms of the protein