• No results found

Analysis of the MAPK pathways, in silico and in vitro

N/A
N/A
Protected

Academic year: 2020

Share "Analysis of the MAPK pathways, in silico and in vitro"

Copied!
223
0
0

Loading.... (view fulltext now)

Full text

(1)

LEABHARLANN CHOLAISTE NA TRIONOIDE, BAILE ATHA CLIATH

TRINITY COLLEGE LIBRARY DUBLIN

OUscoil Atha Cliath

The University of Dublin

Terms and Conditions of Use of Digitised Theses from Trinity College Library Dublin

Copyright statement

All material supplied by Trinity College Library is protected by copyright (under the Copyright and

Related Rights Act, 2000 as amended) and other relevant Intellectual Property Rights. By accessing

and using a Digitised Thesis from Trinity College Library you acknowledge that all Intellectual Property

Rights in any Works supplied are the sole and exclusive property of the copyright and/or other I PR

holder. Specific copyright holders may not be explicitly identified. Use of materials from other sources

within a thesis should not be construed as a claim over them.

A non-exclusive, non-transferable licence is hereby granted to those using or reproducing, in whole or in

part, the material for valid purposes, providing the copyright owners are acknowledged using the normal

conventions. Where specific permission to use material is required, this is identified and such

permission must be sought from the copyright holder or agency cited.

Liability statement

By using a Digitised Thesis, I accept that Trinity College Dublin bears no legal responsibility for the

accuracy, legality or comprehensiveness of materials contained within the thesis, and that Trinity

College Dublin accepts no liability for indirect, consequential, or incidental, damages or losses arising

from use of the thesis for whatever reason. Information located in a thesis may be subject to specific

use constraints, details of which may not be explicitly described. It is the responsibility of potential and

actual users to be aware of such constraints and to abide by them. By making use of material from a

digitised thesis, you accept these copyright and disclaimer provisions. Where it is brought to the

attention of Trinity College Library that there may be a breach of copyright or other restraint, it is the

policy to withdraw or take down access to a thesis while the issue is being resolved.

Access Agreement

By using a Digitised Thesis from Trinity College Library you are bound by the following Terms &

Conditions. Please read them carefully.

(2)

Analysis of the MARK pathways,

in silico and

in vitro.

Thesis subm itted to the University of Dublin

for the degree of Doctor of Philosophy

By

Daniel Richard Caffrey

Department of Biochemistry

Trinity College Dublin

(3)
(4)

Declaration

This thesis is submitted by the undersigned to the University o f Dubiin for

the examination of Doctorate in Philosophy. The work herein is entirely my

own and has not been submitted as an exercise for a degree to any other

university. The library of Trinity College has my permission to lend or copy

this thesis upon request.

(5)

Summary

1

Acknowledgments

iii

Abbreviations

iv

List of Figures

V

List of Tables

vii

Chapter 1

1

Introduction

1

A Bioinformatic approach

1

Protein evolution and Sequence analysis

1

Neutral and adaptive protein evolution

2

Phylogenetic trees

4

Predicting functional sites in proteins

4

THE MAPK pathvi^ays

8

O bjective of thesis

11

Chapter 2

15

Introduction

15

Material and Methods

17

Sequences used in analysis

17

Multiple sequence alignments.

18

Phylogenetic tree reconstruction.

18

Bootstrapping

21

Additional information

21

Results

22

Term inology and evolutionary interpretation

22

Evolution of the MAPK family

22

Evolution of the MEK family

36

Evolution of the MEKK family

48

Evolution of the STE20/PAK family

60

Discussion

72

Yeast hyper-osm olarity and mammalian stress pathway evolution

72

Novel substrate specificities tend to evolve over long periods

75

An evolutionary model for the yeast pherom one and the mammalian ERK pathway

76

The yeast hypo-osm olarity pathway and the mammalian oxidative stress pathway

78

The yeast sporulation pathway

79

Conclusions

80

(6)

Introduction

82

Materials and methods

87

Multiple alignm ent and Prediction of ancestral sequences

87

Calculation of Burst After Duplication (BAD) scores

88

Calculation of subfam ily scores (SS) which ignore ancestral nodes

92

Analysis of BADT and SST predictions versus experimental evidence

93

Solvent accessibility and residue contacts

94

Results

96

Comparison of p38 and ERK

96

Comparison of p38 and JNK

111

Relationships between BAD / BADT scores and structure

124

Sites identified in MKK3/6, MKK4, and MKK7

126

Discussion

138

C h a p t e r 4

143

Introduction

143

Materials and Methods

146

Plasmids

146

Plasmid purification for Cloning

146

DNA Ligation and transform ation into competent cells

147

PCR conditions for Chimera construction

148

Chimera construction

151

Plasmid purification for transfection

154

Culture and transfection OF 293RI cells

155

Bradford assay

157

Immuno-precipitation protein kinase assay

157

Western blot procedure for detection of substrate phosphorylation

159

Western blot procedure for detection of flag-tagged protein expression

160

Results

161

Chimeric proteins require upstream activation in order to activate downstream targets

166

Activation of chim eric MAPKs by upstream MKKs.

173

Inhibition of MAPK signal by over-expression of JIP.

184

Discussion

186

C h a p t e r 5

iss

Final conclusions and com m ents

198

C h a p t e r 6

201

References

201

(7)

Appendix II

List of sup p liers

207

(8)

Summary

This thesis takes a combined computational and experimental approach to

study the MARK pathways. These proteins were chosen as they are highly

conserved in both sequence and function across all eukaryotes.

An evolutionary analysis demonstrated that the MARK pathways in

animals arose largely, by a co-ordinated set of gene duplications. This involved

the JNK and p38 pathways at the MARK and MKK levels. These pathways are

primarily involved in stress and immune responses. The gene duplications

occurred after the divergence of animals from fungi. However, a more ancient

cross talk involving STE11 has been retained in both animals and fungi.

A novel prediction method was developed in an attempt to delineate the

residues that were conferring pathway specificity between JNK and p38. The

method compared predicted ancestral residues at the gene duplication node with

the first animal speciation node, to identify sites of interest. The method was also

applied to predicting residues conferring functional differences between p38 and

ERK. This allowed a partial assessment of the method, as several experiments

analysing p38 and ERK specificity had been previously published. It was found

that the method improved differentiation between subfamily specific sites and

other sites, compared with an almost identical method that did not use ancestral

information. There were other predicted sites that represented either false

positives or uncharacterised sites. The prediction method was also applied to the

MKK components of the p38 and JNK pathways. It was found that all predictions,

(9)

also distinct results. This suggested that a similar region of these proteins were

involved in protein-protein interactions.

The p38 and JNK predictions were used to design p38/JNK chimeric

proteins. Some of these chimeras failed to express in cells and their role could

not be assessed. However, a key residue was identified as being important for

both upstream and downstream specificity in the JNK pathway. Introduction of

additional residues often disturbed this specificity, suggesting that they may have

been indirectly involved but not essential to subfamily specificity. The results

suggested that the prediction method was useful but also likely to give many

(10)

Acknowledgements

This work would not have been realised without the kind help and guidance of

(11)

Abbreviations

ERK

Extracellular Regulated Kinase

JNK

c-Jun-N-terminal Kjnase

MEK

MAPK/ERK Kinase

MEKK

MEK Kinase

MKK

Mitogen Activated Protein Kinase Kinase

MARK

Mitogen Activated Protein Kinase

PAK

p21 Activated Kinase

SAPK

Stress Activated Protein Kinase

IL1

Interleukin

1

.

DMSO

Dimethyl sulphoxide

DTT

Dithiothreitol

EDTA

Etylene-diamine-tetra-acetic acid

PCS

Foetal calf serum

(12)

List of Figures

Figure 1.1

A proposed model of residue types involved in protein-protein interactions.

7

Figure 1.2

A schematic diagram of the MARK pathways for S.

cerevisiae.

12

Figure 1.3

A schematic diagram of the MARK pathways for animals.

13

Figure 1.4

The three dimensional structures for p38 (A), JNK (B), and ERK (C).

14

Figure 2.1

Alignm ent of the MARK family.

34

Figure 2.2

Rhylogenetic tree for the MARK family.

35

Figure 2.3

Alignm ent for the MEK family.

46

Figure 2.4

Rhylogenetic tree for the t\/IEK family.

47

Figure 2.5

.Alignment for the MEKK family.

58

Figure 2,6

Phylogenetic tree for the MEKK family.

59

Figure 2.7

Alignment for the Ste20-PAK family.

70

Figure 2.8

Rhylogenetic tree for the STE20-RAK family.

71

Figure 3.1

Schematic diagram of the stress Induced MAR kinase pathways in

animals and yeast at the MARK and MKK levels.

85

Figure 3.2

Rhylogenetic trees for the JN K/p38 pathway.

86

Figure 3.3

Example of ancestral residues with multiple probabilities for a site

91

Figure 3.4

Alignm ent of sequences used in comparison between ERK and p38.

106

Figure 3.5

Prediction of sites conferring functional differences between ERK

and p38.

107

Figure 3.6

Prediction of sites conferring functional differences between ERK

and p38 as adapted from [29],

108

Figure 3.7

Alignm ent of JNK, p38, and ERK with structural features.

109

Figure 3.8

Alignm ent of sequences used in comparison between JNK and p38.

119

Figure 3.9:

Prediction of regions that have evolved functional differences in JNK

since its duplication from the Ik ' ancestor.

120

Figure 3.10

Prediction of regions that have evolved functional differences in p38

since its duplication from the 1k"ancestor.

121

Figure 3.11

Prediction of regions conferring functional differences between JNK

(13)

122

133

134

135

136

137

142

145

165

169

170

176

177

178

179

180

181

182

183

185

191

192

193

194

195

196

and p38.

Alignm ent of sequences used in comparison between MKKs.

Prediction of regions that have evolved functional differences in MKK7

since its duplication from the 2k"ancestor.

Prediction of regions that have evolved functional differences in MKK4

since its duplication from the 2k” ancestor.

Prediction of regions that have evolved functional differences in

MKK3/6 since its duplication from the 2k' ancestor.

Prediction of regions conferring functional differences between

MKK7, MKK4, and tVlKK3/6.

Carbon alpha trace of the p38 structure.

Schematic diagrams of the MAPK chimeras.

Aligned sequences for chimeric (C1-C9) and wild type proteins as

inferred from DNA sequence.

Substrate specificity of p38, JNK and JNK/p38 chimeras.

Residues differing between p38 and JNK/p38 chimeras.

Activation of p38 and JNK by various MKKs.

Activation of p38, JNK, and

p38/JNK chim eras by MKK3.

Activation of p38, JNK, and

p38/JNK chimeras by MKKS.

Activation of p38, JNK, and

p38/JNK chimeras by MKK4.

Activation of p38, JNK, and

p38/JNK chimeras by MKK7.

Activation of C5 by various MKKs.

Activation of C2 by various MKKs.

Activation of C4 by various MKKs.

The effect of JBD on p38, JNK, and JN K/p38 chim era activity.

Schem atic summ ary of experimental results.

(14)
(15)

List of Tables

Table 3.1

Sequences used in analysis

Table 3.2

Relationships between evolutionary change in physicochemical

properties and structural function for selected residues across

ERK (top) and p38 (bottom).

Table 3.3

Relationships between evolutionary change in physicochemical

properties and structural function for selected residues across

JNK (top) and p38 (bottom).

Table 3.4

The relationship between BADT values and exposed/buried for

JNK and p38.

Table 4.1

Primers used in study.

Table 4.2

The PCR conditions for chimera construction.

Table 4.3

Residues that differ between chimera 9 and chimera 2.

Table 4.4

Residues that differ between chimera 2 and chimera 4.

95

110

123

125

149

150

171

172

(16)

Chapter 1

Introduction

A Bioinformatic approach

One of the main challenges facing biologists, is to extract meaningful

knowledge from the wealth of information that is generated from various genomic

and proteomic projects. For example, it is relatively easy to sequence a genome,

but more difficult to identify the genes from the rest of the DNA [1], It is also

difficult to predict the function of a protein encoded by a gene unless it has

sequence similarity with another protein that has already been biochemically

characterised. It is likely that the most successful post genomic science will

involve a team of biologists and computational biologists, where a computer

based prediction will be made based on a biologists prior knowledge. This

prediction will in turn be tested by the biologist and reported back so as

eventually a reliable predictive model will exist for biologists to exploit. Similarly,

this project attempts to take a combined computational and experimental

approach.

Protein evolution and Sequence analysis

The majority of techniques applied in the following chapters are founded

on our knowledge of protein evolution and sequence analysis. The first matrices

for amino acid replacements were derived in the 1970’s [2]. PAM (Percentage

Accepted Mutations) matrices are based on the assumption that all sites in a

(17)

protein evolve independently and are essentially a matrix of weights that is

derived from how often different amino acids replace other amino acids as

observed in related proteins. In other words, matrices allow for a quantitative

measure of evolutionary distances between proteins and tell us how likely it is

that a particular residue will be replaced by another. Several other matrices have

been constructed since the pioneering work of Margaret Dayhoff. Weights have

been constructed on the basis of chemical and structural features of the different

amino acids [3 , 4]. The BLOSUM matrices [5] were constructed from alignments

of different percentage identities, such that a BLOSUM 45 (derived from

alignment of 45% identity) is more tolerant of non-conservative substitutions than

a BLOSUM 80 (derived from alignment of 80% identity). Almost all sequence

analysis programs use a substitution matrix, and their role is paramount.

The sequence families that were originally identified by Dayhoff relied

heavily on manual editing and were not automated. It was only in 1987, that

progressive multiple alignment schemes were devised [6-8] and a year later

before such programs became available for the microcomputer [9, 10]. All of

these methods use a substitution matrix to align their sequences. Since then,

multiple alignments have been used to various applications. For example, to

search for distantly related proteins [11], to predict secondary structure [12], to

predict protein 3D structure, (reviewed in [13 ]), to reconstruct phylogenetic trees

[14], and to predict contact maps [15].

Neutral and adaptive protein evolution

(18)

reconstructed from multiple alignments. Tree reconstruction is founded on some

fundamental theories of evolution that were developed in the 1960’s. As

sequence data for haemoglobins and cytochrome c became available, it was

observed that the rate of amino acid substitution was

approximately the

same

across different mammals [16-18]. This approximately constant rate of evolution

is referred to as the molecular clock. It is an important observation, as these

constant rates facilitate reconstruction of phylogenetic relationships between

different species and gene families. This in part led to the neutral mutation

hypothesis [19]. It argues that the majority of molecular changes in evolution are

the result of neutral mutations or random genetic drift rather than natural

selection. Of course, there are striking exceptions, where Darwinian selection is

the dominant form of evolution, e.g. the binding sites of the MHC molecules [20].

Positive selection or convergent evolution can cause the incorrect reconstruction

of a tree, e.g. the lysozyme family [21, 22]. In the case of the lysozymes, they

function as a bacteriolytic enzyme in both ruminents (e.g. cows) and colobine

monkeys (e.g. langur). This required that the proteins adapt to the harsh

environment of the stomach. Thus, lysozyme had undergone similar selective

changes in both ruminents and colobine monkeys such that the colobine

sequences cluster with ruminents rather than primates in the wrong tree.

However, it is likely that the mutation rates for most proteins are approximately

constant with only slight periodic changes in rates that will usually be reflected in

the branch length of the reconstructed tree. These changes in rates may reflect a

relaxation in constraint, particularly in a duplicated gene that is free to evolve as

(19)

the other copy remains conserved so that it can perform its function.

Phylogenetic trees

There are several methodologies for constructing trees, such as maximum

parsimony methods e.g. [23], maximum likelihood methods e.g. [24], and

distance based methods e.g. [14]. The latter method is used to reconstruct the

trees in this project, and generally performs well when the distances are

estimated accurately. However, accurate estimation of distances can be difficult

when the rate varies greatly among sites [25]. The primary application of trees in

this work is to distinguish between paralogues (arose by gene duplcation) and

orthologues (arose by speciation events). It is assumed that in general,

paralogues will have slightly different functions and that orthologues will have the

same function. It also allows investigation of the possibility that interacting

proteins arise by co-ordinated gene duplication events (as reviewed in [26 ]).

Predicting functional sites in proteins

This project also uses multiple alignment to predict residues conferring

functional differences between related proteins. Other groups have looked at

similar problems. In general, the underlying assumption is that a functionally

important site will be conserved through out evolution. The Valencia group

extended their idea of correlated mutations [15] to the problem of predicting sites

involved in protein-protein interactions [27]. Correlated mutations are essentially

sites that co-evolve due to their functional interaction. For example, a mutation in

one site will cause a compensatory mutation in another site that interacts with it.

The same group also developed a method for identifying tree determinant

(20)

residues [28]. These are essentially residues that are conserved within a

subfamily but differ from other subfamilies. Livingstone and Barton took a similar

approach, although their methodology for measuring differences between

subfamilies was more vague and was implemented in AMAS [29]. A similar

method using profiles and entropy as a measure of conservation has also been

described [30]. Armon

et al used phylogenetic information (evenly distributed

taxonomic sampling) to get a more informed measure of conservation for

predicting functional sites [31]. An alternative approach was taken by Gallet

et al

who analysed hydrophobicity distribution along linear stretches of sequences to

predict interacting sites [32]. Jones ef a/claim 66% accuracy in predicting the

"surface patch" of the protein that forms the protein-protein interface. [33].

Lichtarge ef a/developed a method that attempts to predict binding sitesby

mapping alignment information to a known three-dimensional structure [34].

Despite the encouraging results that these different methods yielded, it is clear

that predicting sites involved in protein-protein interactions is not trivial. A recent

analysis of 621 protein-protein interfaces by Glaser ef a/suggests that the

residue composition does not differ significantly from other regions of the protein

[35], They also looked at residue-residue contact preferences and their

observations were similar to previous studies. The most common residue pairing

was between 2 hydrophobic residues. They found that arginine and tryptophan

were the most common hydrophobic-charged pairing. The interaction of two

oppositely charged residues (salt-bridge) are also quite common, and previous

studies suggest that these account for 13% of residue interactions [36], Contacts

(21)

between two positive residues were more common than contacts between two

negative residues. There is an average of 18 Water molecules found between the

protein-protein interface, these often form hydrogen bonds between the protein-

protein interface, thus increasing the complexity of the protein-protein interaction

[37]. Taken together, these studies suggest that accurate prediction of functional

or binding sites in proteins will be a difficult task. A proposed model of the

different types of residues involved in protein-protein interaction is shown in

Figure 1.1

(22)

Conserved

for family

Conserved

for

subfamily

Correlated/

compensator

residues

[image:22.529.108.516.36.416.2]

Variable

Figure

1 . 1:

A proposed of model of residue types involved in protein - protein interactions.

The most critical residues are likely to be conserved across the fam ily (triangles) or conserved

across a subfam ily (squares). Other interacting sites will be variable across a fam ily but their

interacting proteins will have compensatory mutations (diamond/wedge) to maintain the

interaction. It is likely that variable residues are less important for interactions. It is possible that

conserved residues interact with variable residues but this is probably quite rare. It is not clear if

indirect interactions through water molecules are more prominent in conserved or variable

residues.

(23)

THE MARK pathways

The Mitogen Activated Protein Kinase (MARK) signalling pathways were

chosen as the proteins to model for a number of reasons. The signalling

pathways, the specificity of their protein-protein interactions, and biochemistry

are reasonably well characterised and are summarised in Figures 1.2 and 1.3.

Sequences are available from a large and diverse number of eukaryotes, but

have remained relatively conserved. This means that they can be aligned to

provide a rich source of evolutionary information that can be potentially exploited.

The high degree of similarity is reflected in the three dimensional structures that

are illustrated in Figure 1.4. Despite the availability of structures for the MAPKs,

the multiple alignment is the primary source of information used in the analysis.

This is due to the simple fact that the number sequences exceeds the number of

structures. Indeed, this will be the case for any protein family, thus the need to

develop tools that can extract important information form such alignments.

A typical MARK pathway (MEKK->MEK->MARK) consists of a

serine/threonine MARK that is activated by its upstream dual specificity MARK /

ERK Kinase (MEK), which is activated by its upstream serine / threonine MEK

Kinase (MEKK). The MARK modules appear to be fundamental in relaying a

diverse set of signals into the cell. Depending on the cell type or organism a

MARK pathway can be activated by a diverse range of extra-cellular signals and

the downstream effect is also context dependent. Extracellular signals such as

pheromones, mitogens, osmotic stress, UV stress, and pro-inflammatory

(24)

number of activators. Likewise, the best known effects include mitosis and

programmed cell death, but it is clear that many genes are increased and

decreased in expression and the physiological effects will be cell and context

dependent. Some of the characterised examples are outlined below.

There are five distinct pathways within the budding yeast

S.cerevisiae.

In

mammals, there are several more MAP kinases. These cascades are

summarised in Figure 1.2 and Figure 1.3 and reviewed in [38-41]. The resulting

activation of the MAPKs allows them to bind and phosphorylate a wide variety of

substrates. Their translocation to the nucleus and subsequent activation of

transcription factors is probably documented best [42-44]. Other substrates of

MAPKs include the cytoplasmic phosphlipase A2 [45], and the EGF receptor [46],

The yeast pheromone and filamentation MAPK pathways are almost

identical with the exception of their preference at the MAPK level for FUS3 and

KSS1 respectively [47, 48]. Kinases belonging to these pathways share closest

sequence similarity with the animal classical (mitogenic) pathway, but the

physiological activators and responses are not necessarily similar [42]. For

example, mitogens and growth factors activate the classical pathway that leads

to multiple differentiation and developmental processes, while the pheromone

activated cascade causes cell arrest, morphogenic changes, and transcription of

genes necessary for mating. In yeast, the serine/threonine kinase STE20 has

been shown to act upstream of the MEKK STE11 [49]. It was proposed that

CDC42 is upstream of this pathway [50, 51]. An alternative view is that STE20

may be activated in the pheromone pathway by the Gpy protein complex via the

(25)

adapter protein STE5 and that CDC42 binding to STE20 is only essential for

pseudohyphal growth and cytokinesis [49]. In animals, the STE20 relatives are

called P21-Activated protein Kinases (PAKs) and were shown to interact with

Rac and human CDC42 to varying extents [52-54], The PAKs are likely to

activate MEKK1/2/3 (similarity to STE11). However, subsequent activation of the

classical pathway in this context is unclear as it appears that MEKK1 then

activates the p38 and JNK pathways [52, 55-60]. In the classical pathway, the

small G-protein RAS recruits a RAF kinase to the cell membrane, thus resulting

in its activation [61-64]. The activated RAF acts as a MEKK in the classical

pathway but sequence similarity with MEKK1/2/3/4 in not significant.

Interestingly, PAK3 has been shown to activate RAF [65].

Like the yeast hyper-osmolarity pathway, the animal JNK and p38

pathways are also activated by high salt concentrations [66]. It is unclear if the

upstream osmo-sensing components are ancestrally shared. In addition, the p38

and JNK pathways are activated by LPS, UV light, and cytokines [66]. The yeast

hyper-osmolarity pathway leads to activation of transcription factors that initiate

transcription of glycerol phosphate dehydrogenase, while the known targets of

JNK and p38 are necessary for mitotic arrest, expression of cytokines, and

apoptosis [42, 44].

The yeast hypo-osmolarity pathway is activated by PKC1, which lies

downstream of Rho1 [67]. PKC may by-pass BCK1 in the activation of this

pathway [68]. It is not known what genes are expressed as a result of this

pathway being activated. There are no clear orthologues identified in animals to

(26)

date and activation of the mammalian MARK pathways by PKC isoforms may

simply reflect its promiscuity

in vitro [69, 70].

Other potential activators of animal MARK pathways include: Mixed

lineage kinases (MLK3/SPRK/PTK-1) [71, 72], (DLK/MUK/LZK/ZPK) [73-75],

(MST/MLK2) [76], Ste20-like oxidant stress response kinase-1 (S0K-1/YSK-1)

[77, 78], Tpl-2 [79], TGPp activate kinase (TAK1) [80], and Apoptosis signal-

regulating kinase 1 (ASK1) [81]. Meanwhile, more recently discovered MAPK

components such as the MEK5-ERK5 pathway appear to be involved in

response to hyper-osmolarity, oxidative stress and EGF [82, 83]. Similarly, ERK3

and ERK4 are relatively uncharacterised [84, 85].

Objective of thesis

This project attempts to understand the evolution of the MARK pathways.

The evolutionary model is put in the context of kinase and pathway function.

Sites that confer functional differences between kinase subfamilies are then

predicted by exploiting the evolutionary model. The predictions are then tested

by standard experimental procedures.

(27)

S cerevisiae

Filam entous

Pheromone

H yper-osm olari^

H ypo-osm olarity

Sporulation

??? Pheromones H ig h $ a ll concentration L o w 5 alt concentration ???

Level

K K K K

i

M E K K

i

M E K

MAPK

1

1

( s i B O

1

X ^ S T E 1 1 _ SSK22/2

C ^ S T E ? PBS2

i

* C ^ r u s 3 ^ H O G l

RHOl,

PKC

BCKl

4

SPSl

[image:27.528.72.500.114.401.2]

T

Figure

1.2:

A schematic diagram of the MAPK pathways for S.

cerevisiae.

Kinases that are studied in this thesis are surrounded by ovals, other kinases are

in triangles, small G proteins are in pentagons, and arrows denote

phosphorylation. Many other activators of the MAPK modules have been omitted

for simplicity. Reviews or individual citations within the text should be consulted

for a more comprehensive description of the pathways, as experimental evidence

for activation of substrates is often ambiguous.

(28)

Mammals

C la s s ic a l J N K P38 O x id a tiv e G icw tli factors Sbfiss siiitTuh andpio-irifUimBtorycytokuies EGF, oxidative stress

RAS

Level

PAK.

KKKK

RAF

MEKK

MEKK1/2J

ASKl

MEKK4

MEK.

M EK1/2V V M K K V VMKK^

I

MAPK

Figure 1.3:

A schematic diagram of the i\/IAPK pathways for animals.

Kinases

that are studied in this thesis are surrounded by ovals, other kinases are in

triangles, small G proteins are in pentagons, and arrows denote phosphorylation.

Many other activators of the MAPK modules have been omitted for simplicity.

Reviews or individual citations within the text should be consulted for a more

comprehensive description of the pathways, as experimental evidence for

activation of substrates is often ambiguous.

[image:28.528.44.477.63.377.2]
(29)
[image:29.526.35.516.49.613.2]
(30)

Chapter 2

Introduction

The first objective of the thesis was to establish the evolutionary

relationship between the kinases at different levels. This would allow the

distinction between orthologues and paralogues, that is, proteins that have arisen

by speciation events and those that have arisen by gene duplication respectively.

The reasoning behind this is that orthologues (often but not always) have similar

function, while paralogues often allow for novel or slightly different function to be

acquired.

The second objective was to establish if gene duplication patterns at

different levels in the cascade correlated with pathway and substrate specificity.

There are many examples of two interacting proteins being duplicated so that

there are two sets of interacting proteins, as reviewed in [26].

Thirdly, there are several so-called isoforms in the MAPK pathways that

have subtle differences in function. It was hoped that evolutionary analysis might

provide insight into their functional differences.

Briefly, a select number of kinases from each level and pathway were

searched against a copy of the non-redundant protein database to identify

candidate homologues. The kinases from each level were aligned using clustalw.

Alignments were then edited manually to ensure a correct and optimal alignment

was used for construction of evolutionary trees. Upon tree construction,

sequences that appeared to be evolving abnormally (long-branch attraction) were

(31)

omitted to allow reconstruction of a more stable and accurate tree. A bootstrap

analysis was performed as an assessment of the reliability of the tree

construction.

(32)

Material and Methods

Sequences used in analysis

The catalytic domains of 35 kinases belonging to the PAK-MEKK-MEK-

MAPK pathways were searched against the Non-redundant Protein database at

NCBI on November 1®'1998 using the NETBLAST program [86]. The search

data-set consisted of

Dictyostelium discoideum: ERK1 (SP P42525);

Saccharomyces cerevisiae: H0G1 (SP P32465), FUSS (SP P16892), KKQ1 (SP

P36005), KSS1 (SP P14681), SLT2 (SP Q00772), SMK1 (SP P41808), MKK1

(SP P32490), MKK2 (SP P32491), PBS2 (SP P08018), STE7 (SP P06784),

SSK2 (SP P53599), SSK22 (SP P25390), CLA4 (SP P48562), K 0L3 (SP

Q12469), SPS1 (SP P08458), STE20 (SP Q03497);

Schizosaccharomyces

pombe : BYR2 (SP P28829):Canc//da

albicans: STE20 (SP Q92212);

Homo

sapiens: ERK5 (SP Q13164), MEK2 (SP P36507), MKK3 (SP P46734), MEKK3

(SP Q99759), ASK1 (GB D84476), PAK1 (GB U51120), PAK3 (GB U25975) ;

xenopus laevis: p38 (SP P47812), MEK2 (SP Q07192);

Caenorhabditis elegans

SUR1 (SP P39745);

Mus muscuiis

(SP P47809), MEKK1 (SP P53349),

MEKK2 (SP Q61083), MEKK3 (SP Q61084), MEKK4a (GB U85607);

Rattus

norvegicus: PAK2 (SP Q62829). All proteins with P values of 1e-65, 1e-50, 1e-

40, 1e-50 or less (a raw score of approximately 250++) for MAPK, MEK, MEKK,

or PAK searches respectively were fetched locally from the

EMBI_/SWISSPROT/PIR databases at the Irish National Centre for

Bioinformatics.

(33)

Multiple sequence alignments.

Residues that were N and C terminal to the kinase catalytic domain could

not be aligned very well and were consequently removed as this would distort

tree construction. The catalytic domain for each kinase was identified using

previously described criteria as a guide [87], The catalytic domain of each protein

was aligned to a structural alignment that was held fixed using the

CLUSTALW1.7 package [88]. The structural alignment was based on the crystal

structures of the catalytic domains for Rat ERK2, Human Insulin receptor, Human

CDK2, Mouse p38, Human SRC, Mouse cAMP, Rabbit Phosphorylase B Kinase,

S Pombe casein Kinase, Rat Casein Kinase (SP P27703 [89]; SP P06213 [90];

SP P06213 [91]: SP P24941 [92]; SP P47811 [93]; SP P12931 [94], SP P05132

[95], SP P00518 [96]; SP P40233 [97]).

Phylogenetic tree reconstruction.

A neighbour-joining tree was drawn for the entire dataset using the

CLUSTALW tree drawing option [14]. The tree consisted of four major groupings

that coincided with the MAPK family, the MEK family, the MEKK family, and the

STE20-PAK family. Alignments for MAPK, MEK, MEKK, and STE20-PAK families

were extracted from the large alignment. To reduce errors in the tree

construction, the alignments were edited by hand before neighbour-joining trees

were constructed using the bootstrap option in CLUSTALW. All trees were

constructed with gaps excluded and corrected for multiple substitutions. Human

CDK2 (SP P24941) was used to root the MAPK and the MEK tree. The MEKK

(34)

and STE20-PAK trees were rooted with yeast STE20 (SP Q03497) and STE11

(SP P23561) respectively. Similar, redundant, outlying, unalignable, or fast

evolving sequences were removed and the trees were re-constructed using the

same options described above. The final trees were derived from the following

sequences that can be accessed at the Genbank™/EMBL (GB), Swissprot (SW),

and PIR (PIR) databases: SMK1_SC (SP P41808), MKC1_CA (SP P43068),

MPS1_MG (GB AF020316), SPM1_SP (SP Q92398), SLT2_SC (SP Q00772),

ERK5_HS (SP Q13164), SUR1_CE (SP P39745), ERKA_DM (SP P40417),

ERK1_HS (SP P27361), ERK2_HS (SP P28482), FUS3_SC (SP P16892),

KSS1_SC (SP PI 4681), SPK1_SP (SP P27638), ERK1_CA (SP P28869),

MAPK_PNC (GB AF043941), FSMAPK_NH (GB U52963), PMK1_MG (GB

U70134), ERK1_DD (SP P42525), MPK6_AT (SP Q39026), MAPK_PS (SP

Q06060), NTF4_NT (SP Q40532), MPK_AS (PIR S56638), MPK3_AT (SP

Q39023), MAPK1_PC (GB Y12785), NTF6_NT (SP Q40531), MPK5_AT

(Q39025), MPK4_AT (SP Q39024), MMK2_MS (SP Q40353), MPK7_AT (SP

Q39027), MPK1_AT (SP Q39021), MPK2_AT (SP Q39022), STY1_SP (SP

Q09892), H0G1_CA (SP Q92207), H0G1_SC (SP P32485), H0G2_ZR (GB

AB012088), H0G1_ZR (GB AB012146), B0218_CE (GB U58752), F42G8.3_CE

(GB AF038618), P38G_HS (SP P53778), P38D_HS (GB Y10488), P38_DM (GB

U86867), P38_CC (SP Q90336), P38_XL (SP P47812), P38A_HS (SP Q16539),

P38B_HS (SP Q15759), P38B2_HS (GB AF031135), B0478_CE (GB U57054),

JNK_DM (GB U73196), JNK2_HS (SP P45984), JNK1_HS (SP P45983),

JNK3_RN (SP P49187), MEK1_ZM (GB U83625), MEK1_LE (GB AJ000728),

(35)

MEK1_AT ( GB AF000977), MKK1_SC (SP P32490), MKK2_SC (SP P32491),

K08A8_CE (GB U38377), HEP_DM (SP Q23977), YR62_CE (SP Q20347),

VZC3741_GE (GB Z95122), MEK2_XL (SP Q07192), MKK4_HS (SP P45985),

MKK4_MM (SP P47809), R03G5_CE (GB U51994), MKK3_MM (SP 009110)

MKK3_HS (SP P46734), MKK6_HS (SP P52564), MKK6_MM (SP P70236),

PBS2_SC (SP P08018), WIS1_SP (SP P33886), MEK5 ANIMAL MEK5B_HS

(GB U71087), MEK5_RN (SP Q62862), MEK2_CE (PIR A56466), DSOR_DM

(SP Q24324), MEK2_RN (SP P36506), MEK2_HS (SP Q63932), MEK1_XL (SP

Q05116), MEK1_HS (SP Q02750), BYR1_SP (SP PI 0506), FUZ7_UM (GB

U07801), STE7_SC (SP P06784), STE7_CA (SP P46599), CDC7_SP (SP

P41892), CDC15_SC (SP P27636), MEKK4A_MM (GB U85607), MTK1_HS (GB

AF002715), SSK22_SC (SP P25390), SSK2_SC (SP P53599), WAK1_SP (GB

Y11989), WIN1+_SP (GB AJ223190), F9A6_CE (GB U41994), ASK1_HS (GB

D84476), MEKK5_DM (PIR JC4673), ATMEKK1_AT (GB D50468), MAP3KG_AT

(GB D50468), PMK1_SP (SP Q10407), BCK1_KL (GB AJ005079), BCK1_SC

(SP Q01389), BYR2_SP (SP P28829), NRC1_NCGB (AF034090), STE11_SC

(SP P23561), MEKK2_MM (SP Q61083), MEKK3_MM (SP Q61084),

MEKK3_HS (SP Q99759), MEKK1_HS (GB AF042838), MEKK1_MM (SP

Q62925), NPK1_NT ( PIR A48084), ANP1_AT (GB AB000796), NRK1_SC (SP

P38692), STE20_DD (GB U51923), DDPAK_DD (GB Y10158), MIHCK_AC (GB

U67056), SHK2_SP (SP Q10056), CLA4_CA (GB U87996), CLA4_SC (SP

P48562), SHK1_SP (GB L41552), PAK1_SP (SP P50527), STE20_SC (SP

Q03497), STE20_CA (SP Q92212), DPAK_DM (GB U49446), PAKLIKE_CE (GB

(36)

D83215), CEPAK_CE (GB U63744), PAK1_XL (GB AF000239), PAK1_RN (SP

P35465), PAK1_HS (SP Q13153), PAK3_HS (SP Q13177), PAK3_RN (SP

Q64303), PAK2_MM (SP Q61036), PAK2_RN (SP Q64303), LOK_MM (GB

D89728), KIAA020_HS (GB D86959), SLK_MM (GB AF039574), ZC_CE (GB

U55363), SIK1_AT (GB U96613), PRKSD_SD (GB Y13101), KRS1_RN (GB

U60206), KRS2_HS (GB U60207), SPAC9_SP (GB Z98763), SEVERIN_DD (GB

AF059534), T19A5_CE (GB U53153), S0K1_HS (GB X99325), YSK1_HS (GB

D63780), MST3_HS (GB AF024636), SPS1_SC (SP P08458), and SPAC_SP

(GB Z99259).

Bootstrapping

Bootstrapping is a standard statistical technique used to assess the

confidence in an inferred branching order. The bootstrap percentage corresponds

to the level of statistical confidence in the branching order, assuming that amino

acid positions are independent. This may be violated when correlated changes

occur or when there are alignment errors. In general, a bootstrap of 70% is likely

to be correct at the 95% level [98]

Additional inform ation

All sequences included and excluded from the analysis, the alignments,

their respective trees, plus additional information can be found at

http://acer.gen.tcd.ie/~dcaffrey/mapk/

(37)

Results

Terminology and evolutionary interpretation

Sequences are defined as orthologous, when the duplication arose via

speciation. Thus, a yeast kinase and a mammalian kinase are orthologous if they

were encoded by the same gene in their common ancestor, and non-orthologous

if they were encoded by two separate genes in their common ancestor. In

inferring relationships, it was generally assumed that mammalian kinases did not

have yeast orthologues that were deleted, in order to keep the interpretation

simple. In order to help interpret the trees, we arbitrarily chose a kinase that

would lie outside of the main cluster. This outgroup provides a root for the

sequences of interest, which indicates the branching order within the group since

the divergence from the last common ancestor.

Evolution of the MARK family

The MARK alignment (Figure 2.1) consists of 52 sequences along with

human CDK2. The CDK2 sequence was included to indicate the root of the

MARK tree that is shown in Figure 2.2. The branch length represents the inferred

number of amino acid replacements over evolutionary time. The rate of evolution

is more or less constant across the different lineages. Exceptions such as KKQ1

do not appear to represent alignment error, so that this acceleration may reflect

reduced functional constraints or else adaptation to a novel function.. The tree

falls into two broad groups, one including H0G1 FUNGI group, the animal groups

(38)

for p38 and JNK (F42G8 ANIMAL, P38DG ANIMAL, P38AB ANMAL, JNK

ANIMAL); the other containing everything else.

The H0G1 FUNGI group clusters with the F42G8 ANIMAL, P38DG

ANIMAL, P38AB ANIMAL, and JNK ANIMAL groups (70% bootstrap). Since the

JNK and p38 clusters form a very distinct group from the H0G1 FUNGI group

(96% bootstrap), this suggests that a H0G1 like progenitor duplicated to give the

present day JNK and p38 found in animals. For the single budding yeast H0G1

gene there are at least seven human relatives. While the greatest sequence

identity is with the p38 isoforms, this is simply because the JNK genes have

undergone a more rapid evolution. It is clear that this accelerated evolution in

JNK occurred after the divergence from yeast but before the

C. elegans

and

mammalian split. This accelerated evolution in JNK most likely represents a

selection for novel function, given our biochemical knowledge of JNK in each

organism.

The absence of any animal sequences along with the SMK1 FUNGI and

the SLT2 FUNGI cluster suggest that SMK1 duplicated from SLT2 since the

animal/fungal split, although this grouping is weakly supported. Similarly, it

appears that KKQ1 and SLT2 duplicated after the animal/fungal split. ERK5

ANIMAL does not cluster strongly with any of the yeast members. This suggests

that either the tree topology is incorrect so that its yeast orthologue is one of the

ungrouped yeast kinases (such as SLT2 FUNGI) or a deletion of the yeast

orthologue occurred. The duplication of KSS1 and FUS3 appears to have

occurred after the animal/fungal split (92% bootstrap), but before the speciation

(39)

of many fungal groups. Thus, it is unlikely that there is a mannmalian orthologue

for both KSS1 and FUSS of the filamentous and pheromone pathways

respectively. Instead, the ERK1 ANIMAL group is orthologous to both KSS1 and

FUSS.

(40)

B0478_CE ... ... ... YQNLRLIG- - SGAQG1 VC SAF DT...

JNK_DM ... ....YINLRPIG--SGAQGIVCAAYDT--...

JNKl.HS --- --- YQNLKPIG--SGAQGIVCAAYDA...

JNK^_RN -... YQNLKPIG--SGAQGIVCAAYDA-...

JNK2_HS -...- - YQOLKPIG- -SGAQGIVCAAFDT...

B0218_HS -.... -... ... YINLTPIG-

-TGAYGTVCAAECT---F42G8 . 3_CE -... YNSLKPLG--EGAYGWCTAEYE-...

P3 8G_HS -... ... YRDLQPVG- -SGAYGAVCSAVDG-...

P3 8D_HS --- --- --- YVSPTHVG--SGAYGSVCSAIDK ....

P3 8_XL -.... --- --- YQNLTPVG- SGAYGSVCSSFDT ...

-P38A_HS -.... YQNLSPVG-

-SGAYGSVCAAFDT---P38_CC -...

YQNLSPVG--SGAYGTVCSAYDE---P3 8B_HS LQGLRPVG - - SGAYGSVCSAYDA...

P38_I»! -... -...

YQGLQPVG--SGAYGQVSKAWR---H0G2_ZR -... -... YTDLNPVG-

-MGAFGLVCSATDT---HOGl.ZR YTDLNPVG-

-MGAFGLVCSATDT---H0G1_SC --- ---

---YNDLNPVG--MGAFGLVCSATDT---H0G1_CA ... ... ...YTELNPVG- -MGAFGLVCSAVDR- -

-STYl.SP -... -YSDLQPIG-

-MGAFGLVCSAKDQ---ERK5_HS -... ...YEI lETIG- -NGAYGWSSARRR...

ERK1_HS -... YTQLQYIG- - EGAYGMVSSAYDH -...

ERK2_HS -YTNLSYIG- -EGAYGMVCSAYDN...

ERKA_DM -...

YIKLAYIG--EGAYGMWSADDT---SUR1_CE -... -... YVNLSYIG- -EGAYGMVASALDT ...

FSMAPK_NH -... - -YDIQDWG- -EGAYGWCSAIHK...

PMK1_MG -... -... Y DIQ D W G - - EGAYGWCSAIHK...

MAPK_PNC YEILDV1G--EGAYGIVCSAIHK...

ERK1_CA -... - - YQILEIVG- -EGAYGIVCSAIHK...

KSS:_SC ... YKLVDLIG--EGAYGTVCSAIHK...

SPKl.SP -...-YEMINLIG-'QGAYGWCAALHK...

FUS3_SC ... FQLKSLLG- -EGAYGWCSATHK ....

ERKi_DD YSIVKCIG--HGAYGWCSAKDN...

MAPK_PS RPPIMPIG--KGAYGIVCSAHNS...

NTF4_NT

-KPPIMPIG--KGAYGIVCSALNS---MPK6_AT KPPIMPIG--KGAYGIVCSAMNS... ...

MPK3.AT -... RPPIIPIG--RGAYGIVCSVLDT...

MAPKi_PC RPPIMPIGRGAYGIVCSIMNT...

-MPK^AS -...

QPPIMPIG--RGAYGIVCSVMNF---MPK4_AT -... - - VPPLRPIG-

-RGAYGIVCAATNS---MMK2_MS ... VPPIRSVG-

-RGAYGIVCAAVNA---MPK5_AT

VPPIRPIG--RGAYGFVCPAVDS---NTF6_NT

IPPIQPVG--RGAYGMVCCATNS---MPK1_AT

YMP1KPIG--RGAYGWCSSVNS---MPK2_AT

YMPIKPIG--RGAYGWCSSVNR.-MPK7_AT

--YVPIKPIG--RGAYGWCSSINR---MPS 1_MG -... -...-... YTVTKELG - - QGAYGIVCAAVNN ...

-SPM1_SP ... ...

...FKWKELG--QGAYGIVCAARNVAS---MKC1_CA -... FKIVKELG--HGAYGIVCSAKYDNGSKKVPDS

SLT2_SC ... ...FQLIKEIG--HGAYGIVCSARFAEA...

KKQ1_SC ... FHLTGKIG--RGSHSLICSSTYTES...

SMK1_SC -... -.... - YEIIQFLG- - KGAYGTVCSVKFKGR...

P38B2_HS ... LQGLRPVG--SGAYGSVCSAYDAR...

CDK2_HS MENFQKVEKIG--EGTYGWYKARNK...

(41)
(42)

-B0478_CE FONVTHAKRAYRELKLMSLVN-JNK_DM FQNVTHAKRAYREFKLMKLVN-JNK1_HS FQNQTHAKRAYRELVLMKCVN-JNK3..RN -FQNOTHAKRAYRELVLMKCVN-JNK2_HS FQNOTllAKRAYRELVLLKCVN-B0218_HS FQSIIHARRTYRELRLLRCMC-F42G8.3_CE FQSTIHAKRTYRELKLLRTLQ--P38G_HS FQSELFAKLAYRELRLLKHMR--P38D_HS FQSEIFAKRAYRELLLLKHMQ--P38_XL FQSIIHAKRTYRELRLLKHMK--P38A_HS FQSIIHAKRTYRELRLLKHMK-P3 8_CC FQSIIKAKRTYRELRLLKHMK-P38B_HS FQSLIHARRTYRELRLLKHLK-P38_DM FQSAVHAKRTYRELRLLKHMA--H0G2_ZR FSTAVLAKRTYRELKLLKHLR--H0G1_ZR FSTAVLAKRTYRELKLLKHLR--H0G1_SC FSTAVLAKRTYRELKLLKHLR--H0G1_CA FSTSVLAKRTYRELKLLKHLK-STY1_SP FSTPVLAKRTYRELKLLKHLR-• ERK5_HS AFDWTNAKRTLRELKILKHFK--ERK1_HS FEHQTYCQRTLREIQILLRFR--ERK2_HS FEHQTYCQRTLREIKILLRFR--ERKA_DM -FEHQTYCORTLREITILTRFK--SUR1_CE FEHQTFCQRTLREIKILNRFK-• FSMAPK_NH FDHSMFCLRTLREMKLLRYFN-• PMK1_MG FDHSMFCLRTLREMKLLRYFN--MAPK.PNC FDHSMFCLRTLREMKLLRYFN-• ERK1_CA FERSMLCLRTLRELKLLKHFN--KSS1_SC FSKKLFVTRTIREIKLLRYFHE-SPK1_SP -FNHPVFCLRTLREIKLLRHFR- • FUS3_SC FDKPLFALRTLREIKILKHFK-• ERK1_DD AFDNLKDTKRTLREIHLLRHFK--MAPK_PS AFDNKIDAKRTLREIKLVRHMD--NTF4_NT ---- AFDNKIDAKRTLREIKLLRHMD--MPK6_AT .... AFDNKIDAKRTLREIKLLRHMD--MPK3_AT .... AFDNHMDAKRTLREIKLLRHLD--MAPK1_PC .... AFDNYMDAKRTLREIKLLRHLD--MPK_AS ---- AFDNNMDAKRTLREIKLLRHLD--MPK4_AT AFDNIIDAKRTLREIKLLKHMD--MMK2_MS AFDNRIDAKRTLREIKLLRHMD--MPK5_AT AFDNKVDAKRTLREIKLLRHLE--NTF6_NT AFENRIDAKRTLREIKLLSHMD--MPK1_AT VYENR1DALRTLRELKLLRHLR--MPK2_AT VFENRIDALRTLRELKLLRHLR--MPK7_AT VFENRVDALRTLRELKLLRHVR--MPS1_MG VFSKKILAKRALREIKLLQHFRG-SPMl.SP VFSKSILTKRALREIKLLIHFRN-MKC1_CA IFSKNILCKRALRELKLLQFFRG-SLT2_SC VFSKTLLCKRSLRELKLLRHFRG-KKQ1_SC AFGNKLSCKRTLRELKLLRHLRG-SMK1_SC IFNKEILLKRAIRELKFMNFFKG-P38B2_HS FQSLIHARRTYRELRLLKHLK--CDK2_HS

TETEGVPSTAIREISLLKELN--•HKNIIGILNCFTPQKK... LDEFNDL ■ HKNIIGLLNAFTPORN... LEEFQDV -HKNIIGLLNVFTPQKS... LEEFQDV ■HKNIISLLNVFTPQKT... --LEEFQDV ■HKNIISLLNVFTPQKT-.... --LEEFQDV ■HENIIDLLDVFTPNEN--- VNDIEDV •HDNVLBMIDVFTPDPD... ASSLNNV -HENVIGLLDVFTPDET--... LDDFTDF -HENVIGLLDVFTPASS- -.... -LRNFYDF -HENVIGLLDVFSPAKS... FEEFNDV •HENVIGLLDVFTPARS--... LEEFNDV -HENVIGLLDVFTPATS--... LEEFNDV •HENVIGLLDVFTPATS--... lEDFSEV •HENVIGLLDIFHPHPA---- NGSLENFQQV •HENLICLQDIFLSP--- LED---•HENLICLQDIFLSP HENLICLQDIFLSP... LED---•HENLITLDDIFISP--- LED---HENIISLSDIFISP ... FED--•HDNIIAIKDILRPTVP--- YGEPKSV •HENVIGIRDILRAST LEAMRDV -HEN IIGINDIIRAPT... IEQMKDV •HENIIDIRDILRVDS IDQMRDV •HENIINIQEIIRSET... VDSLKDI HENIISILDIQKPRN... YESFNEV •HENIISILDIQKPRS... YETFNEV •HENIISILDIQQPQD--- FESFSEV HENIISILAIQRPIN...-YESFNEI HEN 11SILDK VR PVS... IDKLNAV HENIISILDILPPPS... YQELEDV HENIITIFNIQRPDS... FENFNBV HENLISIKDILKPNSK... EQFEDV HENWAIRDIVPPPQR.... ....EVFNDV HENIVAIRDIIPPPQR.... EAFNDV HENIVAIRDIIPPPLR... N AFNDV HENIIAIRDWPPPLR... RQFSDV HENVIAITDVIPPPLR... REFTDV HENIVGLRDVIPPSIP... QSFNDV HENVIAVKDIIKPPQR... ENFNDV HENVMS1KDIIRPPQK--- ENFNHV HENVWIKDIIRPPKK...---EDFVDV HENIIKIKDIVRPPDR--- EEFNDV HENVIALKDVMMPIHK... MSFKDV HENWALKDVMMANHK--- RSFKDV HENVIALKDVMLPANR... TSFKDV HRNITCLYDMDIPRPD... NFNET HRNITCIYDLDIINPY--- NFNEV HKNITCLYDLDIIPNP... MTGEFNEI HKNITCLYDMDIVFYP--- DGSINGL HPNIVWLFDTDIVFYP---- N---GALNGV HKNIVNLIDLEIVTSS-... PYDGL HENVIGLLDVFTPATS---lEDFSEV HPNIVKLLDVIHTEN ... KL

(43)

B0478_CE JNK_DM JNK1_HS JNK1.RN JNK2_HS B0218_HS F42G8.3_C) P38G_HS P38D_HS P38_XL P38A_HS P38_CC P38B_HS P38_DM HOG2_2R H0G1_ZR H0G1_SC H0G1_CA STY1_SP ERK5_HS ERK1_HS ERK2_HS ERKA_DM SUR1_CE FSMAPK.NH PMK1_MG MAPK_PNC ERK1_CA KSS1_SC SPK1_SP njs3_sc ERK1_DD MAPK_PS NTF4_NT MPK6_AT MPK3_AT MAPK1_PC MPK_AS MPK4.AT MMK2_MS MPK5_AT NTF6_NT MPK1_AT MPK2_AT MPK7_AT MPS1_MG SPMl.SP MKC1_CA SLT2_SC KKQ1_SC SMK1_SC P38B2_HS CDK2_HS

YIVMELMDANLCQVIQMD- -... LDHERLSYLLYQMLCGIR- - -... YLVMELMDANLCQVIQMD... -LDHDRMSYLLYQMLCGIK-.. YIVMELMDANLCOVIQME...LDHERMSYLLYQMLCGIK... YLVMELMDANLCQVIQME...LDHERMSYLLYQMLSAIK-... . YLVMELMDANLCQVIHME... -LDHERMSYLLYQMLCGIK-... YFVSMLMGADLSNILKIQR--- --- LNDDHIQFLVYQILRGLK-... YFVSVLMGSDLQNIMKIQR... LTDEQIQLLIYQVLRGLKYMSHQNFNS YLVMPFMGTDLGKLMKHEK... --LGEDRIQFLVYQMMKGLR---YLVMPFMQTDLQKIMGMEF... -SEEKIQYLVYQMLKGLK... YLVTHLMGADLNNIVKCQK... LTDDHVQFLIYQILRGLK... -YLVTHLMGADLNNIVKCQK... LTDDHVQFLIYQILRGLK--... YLVTHLMGADLNNIVKCQK-...LTDDHVQFLIYQILRGLK... YLVTTLMGADLNNIVKCQAGAHQG---ARLALDEHVQFLVYQLLRGLK---YLVTHLMDADLNNIIRMQH ... LSDDHVQFLVYQILRGLK---YFVTELQGTDLHRLLQTRP--... LEKQFVQYFLYQILRGLK---YFVTELQGTDLHRLLQTRP...LEKQFVQYFLYQILRGLK-.... - - -YFVTELQGTDLHRLLQTRP-... LEKQFVQYFLYQILRGLK-... YFVNELQGTDLHRLLNSRP...LEKQFIQYFTYQIMRGLK---YFVTELLGTDLHRLLTSRP- --- LETQFIQYFLYQILRGLK---YWLDLMESDLHQIIHSSQP- ... LTLEHVRYFLYQLLRGLK---YIVQDLMETDLYKLLKSQQ... LSNDHICYFLYQILRGLK---YIVQDLMETDLYKLLKTQH-... LSNDHICYFLYQILRGLK... YIVQCLMETDLYKLLKTQR... LSNDHICYFLYQILRGLK.... .... YIVQCLMETDLYKLLKTQK--- -LSNDHVCYFLYQILRGLK---YLIQELMETDMHRAIRTQD-... -LSDDHCQYFIYQTLRALK... . YLIQELMETDMHRVIRTQD --- LSYDHCQYFIYQTLRALK.... .... YLIQELMETDMHRVIRTQD... LSDDHCQYFIYQILRALK... YLIQELMETDLHRVIRTQN... LSDDHIQYFIYQTLRALK.... .... YLVEELMETDLQKVINNQNSG---... FSTLSDDHVQYFTYQILRALK... YIVQELMETDLYRVIRSQP--- LSDDHCQYFTYQILRALK.... .... YIIQELMQTDLHRVISTQM... LSDDHIQYFIYQTLRAVK... . YIVSELMDTDLHQIITSPQP... LSDDHCQYFVYQMLRGLK... YIAYELMDTDLHQIIRSNQ...ALSEEHCQYFLYQILRGLK... YIAYELMDTDLHQIIRSNQG...LSEEHCQYFLYQILRGLK... YIAYELMDTDLHQIIRSNQ...ALSEEHCQYFLYQILRGLK... YISTELMDTDLHQIIRSNQS- - -.... LSEEHCQYFLYQLLRGLK... YIATELMDTDLHQIIRSNQG... . . LSEEHCQYFLYQLLRGLK... YIATELMDTDLHHIIRSNQE- -... LSEEHCQYFLYQLLRGLK... YIVYELMDTDLHQIIRSNQP... LTDDHCRFFLYQLLRGLK... YIVSELMDTDLHQIIRSNQP--...MTDDHCRYFVYQLLRGLK... YIVFELMDTDLHQI1RSNQS... LNDDHCQYFLYQI LRGLK... YIVYELMDTDLHQIIRSSQ... ALTDDHCQYFLYQLLRGLK-- -...* YLVYELMDTDLHQIIKSSQR--... -LSNDHCQYFLFQLLRGLK... YLVYELMDTDLHQIIKSSQ...VLSNDHCQYFLFQLLRGLK... YLVYELMDTDLHQIIKSSQS--- LSDDHCKYFLFQLLRGLK... YLYEELMECDLAAHRSGQP... LTDAHFQSFIYQILCGLK...-YIYEELMEADLNAIIKSGQP... LTDAHFQSFIYQILCGLK---YLYEELMECDMHQIIRSGQP- -... ... LSDQHYQSFIYQVLCGLN... YLYEELMECDMHQIIKSGQP--- LTDAHYQSFTYQILCGLK---YLYEELMECDLSQIIRSEQR... ... LEDAHFQSFIYQILCALK---YCYQELIDYDLAKVIHSSVQ... LSEFHIKYFLYQILCGLK---YLVTTLMGADLNNIVKCQA... LSDEHVQFLVYQLLRGLK---YLVFEFLHQDLKKFMDASALTG...

(44)

B0478_CE HLHSA-GIIHRDLK--PSNIWR... -... JNK_DM -... HLHSA-GIIHRDLK--PSNIWK... . JNK1_HS -... HLHSA-GIIHRDLK- - P S N I W K ... -... JNK3..RN HLHSA-GI1HRDLK--PSNIWK... ... ... JNK2_HS -...- -HLHSA-GIIHRDLK- -PSNIWK ... .... B0218_HS YIHSA-DIIHRDLK--PSNIAVN-... F42G8.3_CE TIILKKLMHPFQRRNTRFRLYIHSA-GIIHRDLK--PSNIAVNERCEVKVFLSFSQLSFL P38G_HS YIHAA-GIIHRDLK--PGNLAVN---- ---P38D_HS YIHSA-GWHRDLK--PGNLAVN... ... ... P38_XL YIHSA-GIIHRDLK--PSNLAVN... ... P38A_HS --- ---YIHSA-DIIHRDLK--PSNLAVN... P38_CC YIHSA-DIIHRDLK--PSNLAVN... ... -.... P38B_HS YIHSAGIIHRDLKPSNVAVN... -P38_DM ---... -... YIHSA-GVIHRDLK- -PSNIAVN ... H0G2_ZR --- ---YVHSA-GVIHRDLK--PSNILIN-... H0G1_2R --- --- ---YVHSA-GVIHRDLK--PSNILIN ... H0G1_SC --- --- YVHSA-GVI HRDLK--PSNI LIN---H0G1_CA ... ... YIHSA-GVIHRDLK--PSNILIN... STY1_SP ... ... FVHSA-GV1HRDLK--PSNILIN... ERK5_HS --- --- YMHSA-QVIHRDLK--PSNLLVN... ERK1_HS -.... -... YIHSA-NVLHRDLK- - PSNLLIN ---- -... ERK2_HS -... YIHSA-NVLHRDLK--PSNLLLN... ERKA_DM -... -YIHSA-NVLHRDLK- - PSNLLLN ... SUR1_CE -... - -YIHSA-NVLHRDLK- - PSNLLLN... -... FSMAPK_NH -... - - AMHSA-NVLHRDLK- - PSNLLLN ... PMKl.MG AMHSA-NVLHRDLK--PSNLLLN... MAPK_PNC ... AMHSA-DILHRDLK--PSNLLLN... ERK1_CA ... - - AMHSA-NVLHRDLK- - PSNLLLN- -.... ... KSS1_SC ... SIHSA-OVIHRDIK--PSNLLLN... ... SPK1_SP -... - -AMHSA-GWHRDLK- - PSNLLLN... FUS3_SC VLHGS-NVIHRDLK--PSNLLIN... ... ERKl.DD ... ... HIHSA-NVLHRDLK--PSNLLIN... MAPK_PS -.... --YIHSA-NVLHRDLK--PSNLLLN... NTF4_NT ---... YIHSA-NVLHRDLK--PSNLLLN -... MPK6_AT -... YIHSA-NVLHRDLK- - PSNLLLN... MPK3_AT .. ... -YIHSA-NI IHRDLK--PSNLLLN-... MAPK1_PC -...YIHSA-NI IHRDLK--PSNLLLN... MPK_AS ... ... YIHSA-NVIHRDLK- - PSNLLLN...

MPK4_AT -YVHSA-NVLHRDLK--PSNLLLN...

MMK2_MS ... - YVHSA-NVLHRDLK - - PSNLLLN... MPK5_AT ... ... YIHSA-NVLHRDLK--PSNLLLN... ... NTF6_NT ... ... YVHSA-NVLHRDLK--PSNLLLN-- ---- ---MPKl^AT ... ... YIHSA-NILHRDLK- - PGNLLVN... --- ---MPK2_AT ... ... - - - YIHSA-NILHRDLK- - PGNLLVN... MPK7_AT ... YLHSA-NILHRDLK- - PGNLLVN... -...-.... MPS 1_HG -... YIHSA-NVLHRDLK- - PGNLLVN... -... SPM1_SP YIHSA-NVIHRDLK--PGNLLVN---- ---MKC1_CA ... ....FIHSA-DVLHRDLK--PGNLLVN... -... SLT2_SC ... ... YIHSA-DVLHRDLK--PGNLLVN ... ... KKQ1_SC -YIHSA-NVLHCDLK--PKNLLVN... ... SMK1_SC YIHSA-DVIHRDLK--PGNILCT... ... P3 8B2_HS -...YIHSA-GI IHRDLK--PSNVAVN... CDK2_HS ... FCHSH-RVLHRDLK- - PQNLLIN... -... ...

(45)

B0478_CE JNK_DM JNK1_HS JNK3_RN JNK2_HS B0218_HS F42G8.3_C1

P3 8G_HS

P38D_HS

P38_XL

P3 8A_HS

P3 6_CC

P38B_HS P3B_DM H0G2_ZR HCX31_ZR H0G1_SC H0G1_CA STY1_SP ERK5_HS ERK1_HS ERK2_HS ERKA_DM SUR1_CE FSMAPK_NH PMK1_MG MAPK_PNC ERK1_CA KSS1_SC SPK1_SP FUS3„SC ERK1_DD KAPK_PS

N T K 4 _ N T

MPK6.AT MPK3.AT MAPKi.PC MPK_AS MPK4_AT MMK2_MS MPK5_AT NTF6_NT MPK1_AT MPK2_AT MPK7_AT MPSl.MG SPMl.SP MKC1_CA SLT2.SC KKQ1_SC SMK1_SC P38B2_HS CDK2_HS

SDCTLKILDFGLAR --TAIE-A--... -... FMMTPYWT

ADCTLKILDFGLAR... -TAGT-T... FMMTPYWT

SDCTLKILDFGLAR...TAGT-S ... -... FMMTPYWT

S DCTLKILDFG L AR... TAGT - S... ...- FMMT P Y W T

SDCTLKILDFGLAR... -TACT-N - -.-... FMMTPYWT

EDCELKILDFGLAR... (JTDS- E... -... MTGYVAT

ILSFFKILDFGLAR--....AQDA- E--- -... MTGYVAT

EDCELKILDFGLAR... QADS-E- -... -.... -... M T G Y W T

EDCELKILDFGLAR...KADA-E--- M T G Y W T

EDCELKILDFGLAR- -... HTDE-E... -... MTGYVAT

EDCELKILDFGLAR...HTDD-E... -... MTGYVAT

EDCELKILDFGLAR...HTDD- E... -... MTGYVAT

EDCELRILDFGLAR---QADE-E--... -... MTGYVAT

EDCELRILDFGLAR- - PTEN-E... -...-... MTGYVAT

ENCDLKICDFGLAR... IQDP-Q- -... -... MTGYVST

ENCDLKICDFGLAR---lODP-Q- -... MTGYVST

ENCDLKICDFGLAR--- IQDP-Q- -... -...MTGYVST

ENCDLKICDFGLAR---LQDP-Q--- -MTGYVST

ENCDLKICDFGLAR---IQDP-Q--- -MTGYVST

ENCELKIGDFGMAR---GLCTSPAEHQY --- F-MTEYVAT

TTCDLKICDFGLAR--....lADPEHDHT... F-LTEYVAT

TTCDLKICDFGLAR--- VADPDHDHTG-... -... F-LTEYVAT

KTCDLKICDFGLAR---lADPEHDHTG... F-LTEYVAT

TTCDLKICDFGLAR---VTDPQTDHTG... ...F - LTEYVAT

ANCDLKVCDFGLAR-...SAASQEDNSG--- F-MTEYVAT

ANCDLKVCDFGLAR-...SAASQENNSG... F-MTEYVAT

ANCDLKVCDFGLAR-... SAVSTEDSSS... -.... F-MTEYVAT

SNCDLKICDFGLAR...SIASQEDNYG-... -... F-MTEYVAT

SNCDLKVCDFGLAR--- CLASSSDSRETLVG... F-MTEYVAT

ANCDLKVADFGLAR...STTAQGGNPG-... F-MTEYVAT

SNCDLKVCDFGLAR---IIDESAADNSEPTG(MSG... MTEYVAT

EDCLLKICDLGLAR...VED.... ATHQG... F-MTEYVAT

ANCDLKICDFGLAR...VTS.... ETD... F-MTEYWT

ANCDLKICDFGLAR...VTS.... ETD... F - M T E Y W T

ANC DLKIC DFGLAR -.... VTS.... ESD... F - MTE Y W T

ANCDLKICDFGLAR...PTS.... END... F-MTEYWT

ANCDLKICDFGLAR... HNT DDE ... -... F-MTEYWT

ANCDLKICDFGLAR...PSS.... ESD---... M-MTEYWT

ANCDLKLGDFGLAR... TKS ETD... -... F-MTEYWT

ANCDLKIGDFGLAR... TTS ETD--... -... F-MTEYWT

SNCDLKITDFGLAR... TTS ETE... - -Y-MTEYWT

ANCDLKICDFGLAR...TTS.... EAD... F-MTEYWT

ANCDLKICDFGLAR... ASNT KGQ... -... F-MTEYWT

ANCDLKICDFGLAR--- TSNT--- KGQ... F-MTEYWT

ANCDLKICDFGLAR... TSQG NEQ... -... F-MTEYWT

ADCELKICDFGLAR... -GFSVDPEENAG-... Y-MTEYVAT

ADCELKICDFGLAR... GCSENPEENPG... F-MTEYVAT

ADCELKICDFGLAR... GFSENPDENAG... -... F-MTEYVAT

ADCQLK ICDFGLAR -....GYSENPVENSQ --- F-LTEYVAT

SDCQLKICNFGLSC... SYSENHKVNDG... -... --F-IKGYITS

LNGCLKICDFGLAR--- GIHAGFFKCHSTVQP--- H-ITNYVAT

EDCELRILDFGLAR--- QADEE... -... MTGYVAT

TEGAIKLADFGLAR.... - - AFGVPVRT... YTH E W T

(46)
(47)

C-KYYSTAVDIWS--LGCIFAEMVTRR-ALFPGDSEID-B0478_CE QWTRIIEO--- L--- GTP---DRSFLERLQPTV--RN... -.... -... JNK_EW QVmKI lEQ L GTP- - -SPSFMQRLQPTV- - RN... -... JNK1_HS QWNKVIEQ--- L--- GTP---CPEFMKKLQPTV--RT... JNK3 _RN QWNKVIEQ-L- - - OTP- - -CPEFMKKLQPTV- - KN... ... JNK2_HS OWNKVIEQ L GTPSAEFMKKLQPTVRN... .... ... -B0218_HS QLTRIMSV--- T --- GTP--- DEEFLKKISSEE-ARN... ... F4 2G8. 3_CE QLTKIMSV--- V --- GTP---KEEFWSKIQSEE-ARN-.... ... P3 BG_HS QLKEIMKV--- T --- GTP---PAEFVQRLOSDE-AKN... P3 8D_HS QLTQILKV--- T --- GVP---GTEFVQKLNDKA-AKS... P3 8_XL QLKLILRL--- V --- GTP---EPELLQKISSEA-ARN... -.... P3 8A_HS QLKLILRL--- V --- GTP---GAELLKKISSES-ARN... ... P38_CC QLQQIMRL--- T --- GTP- - - PASLISRMPSHE-ART... -...-... P36B_HS QLKRIMEV--- V --- GTP---SPEVLAKISSEH-ART... P38_DM QLNLIMEM--- L--- GTP-- - PAEFLKKISSES-ARS-.... -... H0G2_ZR QFSIITDL--- L--- GSP---PRDVINTICSENTLK... ... H0G1_ZR QFSIITDL--- L--- GSP---PRDVIITICSEDTLK... H0G1_SC QFSIITDL--- L--- GSP---PKDVINTICSENTLK--... H0G1_CA QFSIITEL--- L--- GSP---PADVIDT1CSENTLR--- -STY1_SP QFSIITEL--- L--- GTP---PMEVIETICSKNTLR-- ---ERK5_HS QLQLIMMV--- L--- GTP---SPAVIQAVGAER-VRA... ... ERK1_HS QLNHILGI----L--- GSP---SQEDLNCIINMK-ARN-... ... ... ERK2_HS QLNHILGI--- L--- GSP---SQEDLNCIINLK-ARN... ERKA_DM QLNHILGV--- L--- GSP SRDDLECIINEK-ARJJ-... ... . SUR1_CE QLNLILAV--- V --- GSP---SNADLQCIINDK-ARS... ... FSMAPK.NH QLTLILDV--- L --- GTP---TMEDYYGIKSRR-ARE--... PMK1_MG QLTLILDV--- L --- GTP---TMEDYYGIKSRR-ARE... -.... MAPK_PNC QLMLILDV---- L--- GTP---TMEDYYGIKSRR-ARE... ERK1_CA QLWLIMEV--- L --- GTP-- NMEDYYNIKSKR-ARE... ... KSSl. SC QLWLILEV--- L--- GTP- - -SFEDFNQIKSKR-AKEYIA--- ---SPK1_SP QITLILNI--- L--- GTP---TMDDFSRIKSAR-ARK--- ---FUS3_SC QLLLIFGI--- 1--- GTP- --HSDNDLRCIESP-RAR... -... ERK1_DD QITLIIET--- 1--- GSP---SEEDICNIANEQ-ARQ... MAPK_PS QLRLLMEL--- I--- GTP---SEADLGFLNEN--AKRY... NTF4_NT QLRLLMEL--- I--- GTP---SEAEMEFLNEN--AKRY... MPK6_AT QLRLLMEL--- I--- GTP---SEEELEFLNEN--AKRY... MPK3.AT QMRLLTEL--- L --- GTP---TESDLGFTHNED-AKR... ... MAPK1_PC QMRLLTEL--- L--- GSP---TEADLGFVRNED-AKR... MPK_AS QMRLITEV--- I--- GTP---TDDDLGFIRNED-ARR-- ---MPK4.AT QLRLITEL--- 1--- GSP---DDSSLGFLRSDN-ARR... .... MMK2_MS QLRLVTEL--- I--- GSP-- -DDASLGFLRSEN-ARR... MPK5_AT QLKLITEL--- 1--- GSP---DGASLEFLRSAN-GGK... NTF6_NT QLGLI lAL---- L--- GSP- - -EDSDLGFLRSDN-ARK... MPK1_AT QLKLIVNI---- 1--- GSQ REEDLEFIVNPK-AKR... MPK2_AT QIKLIINI---- L--- GSQ REEDLEFIDNPK-AKR ... .... MPK7_AT QLKLIINV--- V --- GSQ---QESDIRFIDNPK-ARR... MPS1_MG QLNQILHI---- L--- GTP NEETLSRIGSPR-AQE- ---SPMl.SP QLNLILHQ---- L--- GTP DEETLSHISSSR-AQE... MKC1_CA QLNQILMI----L --- GTP-- PESTLQRIGSHR-AQN--- ---SLT2_SC QLNQILQV--- L--- GTP- - - PDETLRRIGSKN-VQD... -... .... KKQ1_SC HLNHILQI--- L--- GTP---PEETLQEIASQK-VYN--- -... SMK1_SC QIFEIIKV L GTP DKDILIKFGTIKAWNLG... -P38B2_HS QLKRIMEV--- V--- GTP---SPEVLAKISSEH-ART.... .... ... CDK2_HS QLFRIFRT---- L--- GTP DEWWPGVTSMP-DYK---

(48)

B0478_CE JNK_DM JNK1_HS JNK3_RN JNK2_HS B0218_HS F42G8.3_CE P38G_HS P38D_HS P38_XL P38A_HS P38_CC P38B_HS P38_DM H0G2_ZR H0G1_ZR H0G1_SC H0G1_CA STY1_SP ERK5_HS ERK1_HS ERK2_HS ERKA_DM SUR1_CE FSMAPK_NH PMK1_MG MAPK_PNC ERKl.CA KSS1_SC SPK1_SP FUS3_SC ERK1_DD MAPK_PS NTF4_NT MPK6_AT MPK3_AT MAPK1_PC MPK_AS MPK4_AT MMK2_MS MPK5_AT NTF6_NT MPKl.AT MPK2_AT MPK7_AT MPS1_MG SPMl.SP MKC1_CA SLT2_SC KKQ1_SC SMK1_SC P38B2_HS CDK2_HS

YVENRPRYQATPFEVLFSDNMFPMTADSS RLTGAQARDLLSRMLVI ---YVENRPRYTGYSFDRLFPDGLFPNDNNQNS--RRKASDARNLLSKMLVI ---YVENRPKYAGYSFEKLPPDVLFPADSEHN---KLKASQARDLLSKMLVI ---YVENRPKYAGLTFPKLFPDSLFPADSEHN---KLKASQARDLLSKMLVI •--YVENRPKYPGIKFEELFPDWIFPSESERD---KIKTSQARDLLSKMLVI - - -YIRNLPKMTRRDFKRLFAQAT--- PQAIDLLEKMLHL - - -YIKNRSPIIRODFVTLFPMAS--- PYALELLEMMLIL YMKGLPELEKKDFASILTNAS ... PLAVNLLEKMLVL YIOSLPQTPRKDFTOLFPRAS--- PQAADLLEKMLEL - • - YIQSLPYMPKMNFEDVFLGAN ... PQAVDLLEKMLVL - - - YIQSLTQMPKMNFANVFIGAN---PLAVDLLEKMLVL - - - YINSLPQMPKRNFSEVFIGAN--- PQAVDLLEKMLVL - - - YIOSLPPMPQKDLSSIFRGAN ... PLAIDLLGRMLVL - - - YIQSLPPMKGRSFKNVFKNAN--- PLAIDLLEKMLEL - - -FVTSLPHRDPVPFQERFKTVE--- PDAVOLLRRMLVF - - -FVTSLPHRDPVPFQERFKAVE--- PDAVDLLGRMLVF ---FVTSLPHRDPIPFSERFKTVE...PDAVDLLEKMLVF - - -FVQSLPHRDPIPFSERFASCT- -... HVEPEAIDLLAKLLVF FVQSLPQKEKVPFAEKFKNAD...PDAIDLLEKMLVF - - - YIQSLPPRQPVPWETVYPGAD ... RQALSLLGRMLRF - - -YLQSLPSKTKVAWAKLFPKSD-... SKALDLLDRMLTF - - - YLLSLPHKNKVPWNRLF PNAD... SKALDLLDKMLTF - - - YLESLPFKPNVPWAKLFPNAD ... ALALDLLGKMLTF YLISLPHKPKQPWARLYPGAD...PRALDLLDKMLTF ---YIRSLPFKKKVPFRTLFPKTS ... DLALDLLEKLLAF YIRSLPFKKKVPFRTLFPQDF --- GSRLDLLEKLLAF - - -YIRSLPFKKRVSFASIFPRAN... -.... PLALDLLEKLLAF ---YIRSLPFCKKIPFSELFANTNNNTSTSNTGGRTNINPLALDLLEKLLIF

NLPMRPPLPWETVWS-KTDLN- -... -... PDMIDLLDKMLQF - - -YIKSLPFTPKVSFKALFPQAS- -...- PDAIDLLEKLLTF - -EYIKSLPMYPAAPLEKMFPRVN ... -PKGIDLLQRMLVF -FIRSLNMGNQPKVNFANMFPKAN...-PDAIDLLERMLYF 1RQLPLYRRQSFOEKFPHVH... PEAIDLVEKMLTF IRQLPLYRROSFVEKFPHVN---...PAAIDLVEKMLTF IRQLPPYPRQSITDKFPTVH... PLAIDLIEKMLTF YIRQLPNFPRQPLAKLFSHVN... PMAIDLVDRMLTF ---FILQLPRHPRQPLROLYPQVH--- PLAIDLIDKMLTF - - - YMRHLPQFPRRPFPGQFPKVQ... PAALDLIERMLTF - - - YVRQLPQYPRQNFAARFPNMS... -.... AGAVDLLEKMLVF - - - YVRQLPQYPKQNFSARFPNMS-... -... PGAVDLLEKMLIF YVXELPKFPRQNFSARFPSMN ... STAIDLLEKMLVF - - -YVKHLPRVPRHPFSQKFPDVS---PLALDLAERMLVF YIRSLPYSPGMSLSRLYPCAH--- VLAIDLLQKMLVF ---YIESLPYSPGISFSRLYPGAN... VLAIDLLQKILVL FIKSLPYSRGTHLSNLYPQAN---...PLAIDLLQRMLVF YVRNLPFMAKKPFPTLFPNAN ... PDALDLLDRMLAF YVRSLPKQRPIPFETNFPKAN -... PLALDLLAKLLAF YVRSLPITRKASYEELFPDAN... PLALDLLERMLTL YIHQLGFIPKVPFVNLYPNAN... -... SQALDLLEQMLAF YIFQFGNIPGRSFESILPGAN---PEALELLKKMLEF KNSNNPVYKKIPWSNIFPFAS--- HEAINLIESLLHW ---YIQSLPPMPQKDLSSIFRGAN-...PLAIDLLGRMLVL PSFPKWARQDFSKWPPLD... EDGRSLLSQMLHY

(49)

B0478_CE DPERRIS-JNK^DM DPEQRIS-JNK1_HS DASKRIS JNK3_ RN DPAKBIS-JNK2_HS DPDKRIS B0218_HS DPDRRPT-F4 2G8.3_CE DPDRRIS-P3 8

Figure

Figure 1 . 1 : A proposed of model of residue types involved in protein -  protein interactions.
Figure 1.2: A schematic diagram of the MAPK pathways for S. cerevisiae.
Figure 1.3: A schematic diagram of the i\/IAPK pathways for animals. Kinases
Figure 1.4; The three dimensional structures for p38 (A), JNK (B), and ERK
+7

References

Related documents

In a study on the political culture in united Germany published in 2010, Russell Dalton and Steven Weldon traced the results of surveys from the first two decades following

Thus, the results showed that the application of ultrasonic vibration on the metal tube filled with foam, also contribute in the reduction of compressive load and better

One definition of view for data could be used for several roles and phases (Table I). Also if the data entity has no limitation related to the roles then the data entity could

Nick Volluci and Harrison Tucker, the author, were chosen to assist on this project in order to provide a high quality and methodical product that catered to the special

The patients, all with traumatic full thickness corneal wounds, were divided into two groups (n = 30 each), compris- ing a non-contact lens group in which all eyes were

High epiregulin (EREG) gene expression plus K-ras wild-type (WT) status as predictors of cetuximab benefit in the treatment of advanced colorectal cancer

sources with high star formation rates are less common than are optically-selected galax- ies, the integrated light emitted by the submillimeter sources and by the

Therefore, the higher prevalence of knee and shoulder pain and the higher crude prevalence of daily pain in women with diabetes compared with women without diabetes could be