• No results found

Introduction to Proteomics

N/A
N/A
Protected

Academic year: 2022

Share "Introduction to Proteomics"

Copied!
46
0
0

Loading.... (view fulltext now)

Full text

(1)

1

Introduction to Proteomics

Åsa Wheelock, Ph.D.

Division of Respiratory Medicine &

Karolinska Biomics Center [email protected]

In: Systems Biology and the Omics Cascade, Karolinska Institutet, June 9-13, 2008

(2)

Focus of course: Tools for data analysis Your analysis is no better than data you have collected...

The goals of this proteomics overview:

• Understand possibilities & limitations

• Pros and cons of different method

• Sources of variance in proteomics

• Take advantage of proteomics core facilities

• Perform proteomics collaborations

(3)

3

Proteomics publications in Pubmed

0 500 1000 1500 2000 2500 3000

1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006

First ”proteomics”

publication

(4)

Why Proteins?!?

• Business end of the cell

• Detailed information with limited efforts

– As compared to metabolomics

TRANSCRIPTOMICS

PROTEOME

Limited info from mRNA

Detailed info

Robust technology

• Relatively robust methods available

(5)

5

Proteomics Methodology

• No ”protein PCR”

– 4 nucleotides vs 20+ amino acids

– Post-translational modifications (PTM) 3 MAIN PROTEOMICS PLATFORMS

• Gel based methods

• Shotgun methods (mass spec-based)

”chromatography-based”, ”gel-free”

• Array based (antibody based)

(6)

Gel based: 2-Dimensional Electrophoresis

Isoelectric point (pI)

Net charge

3 4 5 6 7 8 9 10 11 pH +2

+1 0 -1 -2

log Mw

Molecular weight

SEPARATION VISUALISATION

QUANTIFICATION Stoichiometric protein stain => 3rd dimension

(7)

7

Image acquisition

Image acquisition using fluorescent scanner

(8)

Quantitative image analysis

1. Detect spots

2. Match spots across gels 3. Quantify spot volumes

Pixel intensity => 3rd dimension Spot volume = protein quantity

(9)

9

Protein identification

⇒ Mass spectrometry

(MALDI-TOF/TOF) Trypsin digestion

⇒ DATABASE SEARCH => IDENTIFICATION

(Swiss-prot, EnSemble) (statistical probability)

(10)

Protein Identification

Protein Trypsine digestion Peptides EXPERIMENTAL Peptide masses

DTHKSEIAHRFK DLGEEHFKGLVL IAFSQYLQQCPF DEHVKLVNELTE FAKTCVADESHA GCEKSLHTLFGD

MS analysis

(11)

11

Peptide mass mapping

LHTLFGDR

MS/MS analysis=>

sequence information MS analysis=>

peptide masses

Statistical matching

DLGEEHFK database

search

DTHKSEIAHRFK DLGEEHFKGLVL IAFSQYLQQCPF DEHVKLVNELTE FAKTCVADESHA GEKSLHTLFGRE LCKVASLRET

Homology search Validate statistical hit

(12)

Shotgun vs. Gel-based proteomics

Adapted from Patterson and Aebersold, Nature Genetics 2003, 33:311-23. Fig. 3 extract

Separate Quantify

Separate (LC) Digest

Digest

MS/MS

MS/MS (2DE)

ID

Quantify

(13)

13

Semi-quantitative proteomics

Both 2DE and MS-based methods NOT quantitative by nature

Co-separation: 2 samples => ratios

Tags => Semi-quantitative proteomics

(14)

Shotgun isotope-tagging :

Isotope coded affinity tag (ICAT)

(15)

15

Multiplexing in 2DE: DIGE Multiplexing in 2DE: DIGE

-- Differential Gel ElectrophoresisDifferential Gel Electrophoresis

Protein abundance = 532/633 signal ratio Co-separation by 2DE

Direct ratiometric normalization Spot quantification

Statistical analysis Protein visualization (super-imposable images)

λex 633nm ⇒ Cy5 λex 532nm ⇒ Cy3

Treated sample:

Covalently labeled (Cy3)

Control sample:

Covalently labeled (Cy5)

(16)

Semi-quantitative proteomics

Both 2DE and MS-based methods NOT quantitative by nature

Co-separation: 2 samples => ratios

Tags => Semi-quantitative proteomics

Pooled internal standard + 2-3 samples =>

(17)

17

Internal Standards in 2DE: DIGE Internal Standards in 2DE: DIGE

Protein abundance = relative to IS Co-separation by 2DE

Direct ratiometric normalization Spot quantification

Statistical analysis Protein visualization (super-imposable images)

λex 633nm ⇒ Cy5 λex 532nm ⇒ Cy3

Treated Sample:

Covalently labeled (Cy3)

Control Sample:

Covalently labeled (Cy5)

λex 488nm ⇒ Cy2

Pooled Internal standard (Cy2)

(18)

Proteomics in Pubmed

0 500 1000 1500 2000 2500 3000

1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006

proteomics 2DE

(19)

19

Sample A

Trypsin digestion (peptides)

31 114

PRG

30 115

PRG

29 116

PRG

28 117

PRG

Multiplexing in MS: iTRAQ

- isobaric Tag for Relative and Absolute Quantitation

Sample B Sample C Sample D

a b c d

iTRAQ labelling

MS/MS m/z

ID

Quantification

(20)

Differential labelling opens up new possibilities

• Cysteine oxidative states

• Identify peptides on plasma

membrane surface

(21)

21

2D or not 2D?

2D or not 2D?

Gel-based methods: 2-D electrophoresis + soluble proteins

+ post-translational modifications

(22)

Post-translational modifications

”Spot trains”

Intact proteins

(23)

23

2D or not 2D?

2D or not 2D?

Gel-based methods: 2-D electrophoresis + soluble proteins

+ post-translational modifications

- technical variance, time consuming

MS-based (Gel-free) methods: ICAT, iTRAQ + membrane proteins

+ low abundance proteins

(24)

Extremes of physiochemical properties: Peptides

• Charge

- pI range from 3-12

• Size

- Mw range of 5 – 500,000 kDa

• Hydrophobicity

(25)

25

2D or not 2D?

2D or not 2D?

Gel-based methods: 2-D electrophoresis + soluble proteins

+ post-translational modifications

- technical variance, time consuming

MS-based (Gel-free) methods: ICAT, iTRAQ + membrane proteins

+ low abundance proteins - expensive, data intense

(26)

Shotgun approcahes and gel-

based approaches complementary

SHOTGUN GEL-BASED

(27)

27

日本 Japan

103

100 101 102 103 104 105 106 107 108 109 1010 1011 1012

COPIES of each PROTEIN

Osaka 大阪

Kyoto 京都

Kobe 神戸

104

1012 DYNAMIC RANGE

(28)

- 400 reported PTMs

0 1 2 3 4 5 6 7 8 9 10 11 12

Phosphorylation sites

7 6 5 4 3 2 1

Post-translational Modifications (PTMs)

(29)

29

Variance in 2DE

• BIOLOGICAL VARIANCE

• Experimental variance

– Pre-fractionation, isolation & labelling of proteins – Protein staining

• Technical variance

– Gel-to-gel variation in 2DE – Image acquisition (scanner)

• Post-experimental variance

– Software-induced variance – User dependant variance

(30)

Variance in 2DE

• BIOLOGICAL VARIANCE

•• Experimental varianceExperimental variance

– Pre-fractionation, isolation & labelling of proteins – Protein staining

• Technical variance

– Gel-to-gel variation in 2DE – Image acquisition (scanner)

• Post-experimental variance

(31)

31

Remember that variance adds up:

Multiple-step method is not your friend...

10% 40%

(32)

Variance in 2DE

• BIOLOGICAL VARIANCE

• Experimental variance

– Pre-fractionation, isolation & labelling of proteins – Protein staining

• Technical variance

– Gel-to-gel variation in 2DE – Image acquisition (scanner)

• Post-experimental variance

(33)

33

Technical variance in 2DE

Solubilization Rehydration Isoelectric focusing

SDS-PAGE

Load IPG strip Protein visualization

Recovery?

Recovery?

Inhomogenous electric field?

Gel-to-gel variations?

-SH modifications?

Staining artifacts?

Background fluorescence?

Biological variance

(34)

Tools to reduce variance

Technical variance

• Internal standard:

– DIGE

• Software algorithms:

– Background subtraction – Normalization

(35)

35

Dynamic range of scanner

16 bit pixel resolution (216 ∼ 65,000 ~ 105) Make sure you are using the entire range!

(36)

Variance in 2DE

• BIOLOGICAL VARIANCE

• Experimental variance

– Pre-fractionation, isolation & labelling of proteins – Protein staining

• Technical variance

– Gel-to-gel variation in 2DE – Image acquisition (scanner)

• Post-experimental variance

(37)

37

2DE analysis software

• Main purpose: match and quantify spots

• Normalization: reduce gel-to-gel variation

• Background subtraction:

– Reduce background noise – Increase signal/noise ratio – Increase sensitivity

(38)
(39)

39

(40)

Global Background Subtraction Global Background Subtraction

PDQuest

PDQuest: Floating/Rolling Ball: Floating/Rolling Ball

(41)

41

Software induced variance Software induced variance

Expression; No background subtraction

0 25 50 75 100

1 51 101 151 201 251 301 351 401 451

CV (%) n=5

Expression; No background subtraction

0 25 50 75 100

1 51 101 151 201 251 301 351 401 451

CV (%) n=5

Average CV=4.6%

Average CV=10%

PDQuest PG200

PD Quest; Power mean

0 25 50 75 100

1 51 101 151 201 251 301 351

CV (%) n=5

PD Quest; Power mean

0 25 50 75 100

1 51 101 151 201 251 301 351

CV (%) n=5

Software variance up to 30% of technical variance

(42)

Applications of proteomics Applications of proteomics

BIOMARKER DISCOVERY

• Biomarker of disease & susceptibility

CLINICAL APPLICATIONS

• Pharmaceutical target identification

• Improved diagnostics

MECHANISTIC STUDIES

• Protein-protein interactions

• Protein adduction /Altered protein expression

(43)

43

Proteomics in the future

• Improved sensitivity

– Currently: scratching the surface – laser capture microdissection

• Protein microarrays

– Antibody arrays (e.g. for cytokines)

– Tissue microarrays (Peter Nilsson, Friday)

• In vivo subcellular localization assays

• Protein amplicifation method?

– i.e. ”protein-PCR”

(44)

Focus on INTERPRETING data,INTERPRETING not on ACQUIRING data.ACQUIRING

Pathway Analysis

• Integrate data from omics cascade

• Integrate heatmap with biological pathways

Proteomics in the NEAR future...

(45)

45

Preview of coming attractions

Preview of coming attractions KEGG &

KEGG & KegArray KegArray

(46)

Take home messages...

...keep your variance down and your dynamic range up!

...keep your false positives down,

References

Related documents

Williams’ criticism of the gold standard model moved from the specific nature of its assumptions and his interpretation echoed many arguments which we have already found in

In accordance with our data, RNA-Seq transcriptome data of tumor tissue from 20 melanomas revealed that patients with a long-term survival benefit from anti–CTLA-4 treatment

The talks also feature agenda items that put Russia on the defensive, such as a pledge not to use force (a pledge that Russia has not given, though Georgia has) or the implementation

The above discussion deals with the requirement that Mea- suRouting must operate within the confines of the TE policy. The other equally important challenge is that any MeasuRout-

*) Sekretariatet för arkivering och publicering av forskningsdata, Data Curation Unit (DCU).. Det går att markera flera svarsalternativ. Nej, jag hanterar inte känsliga data..

We convened a group of three occupational therapy students from Western Michigan University who are nearing the end of their programs and three occupational therapists who

Far more significant indications of the intention in due course com- pletely to nationalize ocean space are: (a) the refusal by the UNCLOS majority to set clear

The secondary objectives are to determine the perceived and expected levels of customer service the retail customers receive from the pharmaceutical wholesaler and measure