Genomics Proteomics
Proteomics is multidisciplinary
Protein
Biochemistry
Analytical
Biology
Chemistry
Proteomics
Molecular
Bioinformatics
Biology
Proteomics Research
•Basic research:
To understand the molecular mechanisms
underlying life.
•Applied research:
Clinical testing for proteins associated with
pathological states (e.g. cancer).
Applications of Proteomics
Medical Signal Disease
Microbiology Transduction Mechanisms
Protein
Drug Discovery Glycoyslation
Expression
Profiling Post-
Proteome
Target ID translational Phosphorylation
Mining
Modifications
Differential Display Proteolysis
Proteomics
Yeast Genomics Yeast two-hybrid
Protein-
Affinity Purified Functional protein Co-precipitation
Protein Complexes Proteomics Interactions
Mouse Knockouts Structural Phage Display
Proteomics
Hbβ Hbα
O2
Hbα Hbβ
hemoglobin
Sickle cell disease is caused
by a single amino acid change.
Normal Hbβ Mutated Hbβ
ATG GTG CAC CTG ACT CCT GAG GAG … ATG GTG CAC CTG ACT CCT GTG GAG …
M V H L T P E E… M V H L T P V E…
Summary – what is proteomics?
•Proteomics is multidisciplinary
R
Variable
CH H2N OH H N OH
H
H2O
N H
H O R2 O
H Base R1 C C H C C
Dipeptide
H2N H N OH
Peptide Bond
Each amino acid has unique
chemical properties.
basic
acidic
Histidine Aspartate
Glutamate
Lysine
Arginine
non-polar hydrophobic
Valine
Proline
Alanine Isoleucine Phenylalanine
Leucine Methionine Tryptophan
polar hydrophilic
Serine Cysteine
Glutamine
Glycine Tyrosine Asparagine Threonine
Proteins are chains of amino acids.
O
C OH
N H
H
Short chains of amino acids are
called peptides.
N H
Proteins are polypeptide molecules H
that contain many peptide subunits.
Gene
Nucleus 3’
Messenger
Trp Ribonucleic Acid
tRNA (mRNA)
Ala
tRNA
Met
Amino Acid-
tRNA
Met transfer
5’ Ribosome
Large Subunit Ala RNA
Met Trp
Empty tRNA
Empty tRNA
A U G G C C U G G U A G
Small Subunit
Cytoplasm Ribonucleotides A G C U
http://www.path.cam.ac.uk/~mrc7/igs/mikeimages.html
Proteins arrive at their final
structure in an ordered fashion
• Protein isolation
• Protein separation
• Protein identification
Protein Isolation
How are proteins isolated?
• Mechanical Methods
– grinding – break open cell
– centrifugation – remove insoluble debris
• Chemical Methods
– detergent – breaks open cell compartments
– reducing agent – breaks specific protein
bonds
– heat – break peptide bonds to “linearize”
protein
Protein isolation procedure
Transfer to tube
“pure” protein
solution
Recover supernatant Keep solution for gel analysis
Protein X
“pure” protein
solution
Isolated Protein X
Summary – protein isolation
SDS-PAGE
Why separate proteins?
Tube 1 Tube 2
Increased Complexity Decreased Complexity
Decreased Protein ID Increased Protein ID
How to separate proteins?
• TEMED - catalyst
O
Bisacrylam ide
(cross-linking agent)
H H
N N
Polymerization
N N
O O
TEMED
(catalyst)
Am monium persulfate
(free radical initiator) SO4
Polyacrylamide
(non-toxic)
Polyacrylamide
C ON2H C ON2H
Polyacrylamide
(non-toxic) O O
NH NH
Bis-acrylamide
C H2 cross links C H2
NH NH
O C ON2H
O
C ONH
Sodium dodecyl sulfate - SDS
The anionic detergent SDS unfolds or
denatures proteins
• Uniform charge/mass
ratio
One-dimensional polyacrylamide
gel electrophoresis (SDS-PAGE)
Cathode (-)
Anode (+)
Standard Sample1 Sample2
During SDS-PAGE proteins separate
according to their molecular weight
Cathode (-)
150 kDa
100 kDa
75 kDa
50 kDa
37 kDa
25 kDa
20 kDa
Bromophenol
Anode (+) Blue dye front
Standard Sample1 Sample2
Image of Real SDS-PAG
Cathode
250 kiloDaltons
150 kDa
100 kDa
75 kDa
50 kDa
37 kDa
25 kDa
20 kDa
Anode
Separation of Protein X
Cathode (-)
150 kDa
100 kDa
75 kDa
50 kDa
37 kDa
150 kDa
100 kDa
75 kDa
50 kDa
2nd dimension
37 kDa
SDS-PAGE
25 kDa
20 kDa
11 kDa
2-DG
kDa 3 4 5 6 7 8 9 10 pI
100
75
mass
50
25
3 4 5 6 7 8 9 10
150 kDa
100 kDa
75 kDa
50 kDa
20 kDa
11 kDa
Protein X
25 kDa
pI 5
1-DGE vs. 2-DGE
1-DGE (SDS-PAGE) 2-DGE
• High reproduciblity • Modest reproducibility
• Quick/Easy • Slow/Demanding
• Separates solely based • Separates based on pI and
on size size
• Modest resolution, • High resolution, not
dependent on complexity dependent on complexity
of sample of sample
Summary – protein separation
•Protein separation takes advantage physical
properties such as isoelectric point and molecular
weight
mass spectrometry
Peptide mass intact protein x
fingerprinting protein digestion
intensity
Measure peptide masses -
“Weigh” the peptides in a m/z
mass spectrometer
mass
952.0984
1895.9057
Match peptide masses to 1345.6342
899.8743
protein or nucleotide 2794.9761
Protein X
????????K?????R????????
How does mass spectrometry
identify unknown proteins?
Basics of mass spectrometry
????????K
?????R
????????
Mass Spectrometer
We then “weigh” these peptides
with a Mass Spectrometer
????????K 1106.55 Da
?????R 692.31 Da
????????
1002.37Da
Mass of peptides should be compared to
theoretical masses of known peptides
????????K = 1106.55 Da
?????R = 692.31 Da
???????? = 1002.37Da
Computation of theoretical masses of
known peptides known
Computer Peptides
Proteome = all protein sequences •
•
WEGETMILK
ADEMTYEK
1106.55
1105.23
• PLMEHGAK 1089.50
• LMEHHH 782.25
• ASTEER 692.31
• DMGEYIILES 1056.92
• EGEDMPAFY 1002.35
• CYHGMEI 984.36
• EFPKLYSEK 900.56
• YSEPYSSIIR 1102.34
• IESPLMIA 864.35
• AEFLYSR 600.21
• DLMILIYR 864.97
• METHIPEEK 795.36
• KISSMER 513.21
• PEPTIDEK 456.23
simulated Trypsin •
•
YMEPSATFGHR
GHLMEDFSAC
995.46
896.35
• HHFAASTR 564.88
• ALPMESS 469.12
Mass of peptides compared to
theoretical masses of all peptides
known, using a computer program.
Computer Peptides
• WEGETMILK 1106.55
• ADEMTYEK 1105.23
• PLMEHGAK 1089.50
????????K = 1106.55 Da •
•
LMEHHH
ASTEER
782.25
692.31
• DMGEYIILES 1056.92
• EGEDMPAFY 1002.35
• CYHGMEI 984.36
?????R = 692.31 Da •
•
EFPKLYSEK
YSEPYSSIIR
900.56
1102.34
• IESPLMIA 864.35
• AEFLYSR 600.21
• DLMILIYR 864.97
• METHIPEEK 795.36
???????? = 1002.37Da
• KISSMER 513.21
• PEPTIDEK 456.23
• MANYCQWS 792.15
• TYSMEDGHK 678.46
• YMEPSATFGHR 995.46
• GHLMEDFSAC 896.35
• HHFAASTR 564.88
• ALPMESS 469.12
Mass of peptides matched to
theoretical masses known peptides,
using a computer program.
Computer Peptides
• WEGETMILK 1106.55
• ADEMTYEK 1105.23
• PLMEHGAK 1089.50
????????K = 1106.55 Da •
•
LMEHHH
ASTEER
782.25
692.31
• DMGEYIILES 1056.92
• EGEDMPAFY 1002.35
• CYHGMEI 984.36
?????R = 692.31 Da •
•
EFPKLYSEK
YSEPYSSIIR
900.56
1102.34
• IESPLMIA 864.35
• AEFLYSR 600.21
• DLMILIYR 864.97
• METHIPEEK 795.36
• KISSMER 513.21
???????? = 1002.37Da • PEPTIDEK 456.23
• MANYCQWS 1002.37
• TYSMEDGHK 678.46
• YMEPSATFGHR 995.46
• GHLMEDFSAC 896.35
• HHFAASTR 564.88
• ALPMESS 469.12
The unknown peptides have been
identified
????????K?????R????????
????????K?????R????????
????????K?????R????????
WEGETMILK AFTEER MANYCQWS
Summary – tools to study proteins?
-Interdisciplinary research