H-Ras and Its Oncogenic Mutants Ras G12V and Ras Q61L

(1)

ABSTRACT

HOLSENBECK, STEPHANIE LEAH. H-RAS AND ITS ONCOGENIC MUTANTS RAS G12V AND RAS Q61L. (Under the direction of Dr. Carla Mattos.)

The H-Ras protein is a GTPase important to cell cycle and differentiation. Mutations in this protein have been associated with 30% of cancers. A better

(2)

H-RAS AND ITS ONCOGENIC MUTANTS RAS G12V AND RAS Q61L

By

STEPHANIE LEAH HOLSENBECK

A thesis submitted to the Graduate Faculty of North Carolina State University in partial fulfillment of the

requirements for the Degree of Master of Science

BIOCHEMISTRY

Raleigh

(3)

DEDICATION

(4)

BIOGRAPHY

(5)

ACKNOWLEDGEMENTS

(6)

TABLE OF CONTENTS

List of Tables . . . vi List of Figures . . . vii 1. Introduction and Background . . . 1-11 2. Materials and Methods . . . 11

2.1 Cloning of H-Ras 166 into pET-21a(+) . . . 11-13 2.2 Mutation of H-Ras 166 . . . 13-14 2.3 Western Blot . . . 15 2.4 Mutation of Triple Stop Codon . . . 15-16

2.5 Expression and Isolation of H-Ras 166 protein . . . 16 2.6 Anion Exchange Chromatography . . . 16-17 2.7 Size Exclusion Chromatography . . . 17 2.8 High Resolution Anion Exchange Chromatography . . . 17 2.9 Exchange Reaction . . . 17-18 2.10 Crystallization (performed by Greg Buhrman) . . . 18 2.11 Crystal Soaking in Glycerol (performed by Greg Buhrman) . . . 18-19

2.12 Structure Refinement of Q61L in Glycerol . . . 19 3. Results and Discussion . . . 19

3.1 Isolation of H-Ras 166 for Greater Yield and Purification . . . 19-21 3.2 Superimposition and Root Mean Square Deviation . . . 21-23 3.3 Aqueous vs. Q61L in Glycerol . . . 24-26 3.4 Specific Comparison of the H-Ras Mutant Soaked in Glycerol

(7)

LIST OF TABLES

Results and Discussion

1. Data collection and refinement statistics for Ras/GMPPNP Wild-Type and Ras/GMPPNP Q61L mutant in aqueous solution, Ras/GMPPNP Q61L mutant in70% glycerol . . . 23 2. Statistics of comparison molecules superimposed onto the Q61L

(8)

LIST OF FIGURES

Introduction and Background

1. A dendogram of the G-Protein Superfamily . . . 4 2. Ras cycle of GTP/GDP interaction . . . 4 3. Diagram of conserved sequences found in GTPases and the relative

location within the protein . . . 5 4. Schematic of the general farensylation/palmitoylation pathway that

H-Ras 166 undergoes in order to be anchored to the membrane . . . 6 5. H-Ras/GTP activates the MEK, PI3-Kinase, RaldGDS, and PLC

Pathways . . . 8 6. Q61L H-Ras 166 featuring the Switch I and Switch II regions . . . . 9 Results and Discussion

7. SDS-PAGE of QFF Fractions and Western Blot . . . 20 8. SDS-PAGE and Western Blot of QFF fractions after triple stop

codon . . . 21 9. Comparison of Gel Filtration and QHP fractions . . . 21 10. RMSD of mutant in aqueous/mutant in glycerol and the mutant in glycerol/wild-type in glycerol with accessible surface area of the

mutant in aqueous solution . . . 24 11. Alternate conformations of Arginine 102 . . . 26 12. Comparison of Asparagine 26 side chain conformations . . . 26 13. General position of the placement of glycerol molecules in the

R32 crystal space group of the H-Ras mutant in glycerol and the wild-type H-Ras in glycerol . . . 29 14. Wild-type and mutant comparison at residue 61 . . . 30 15. RMSD plots of the H-Ras mutant in aqueous solution/H-Ras

mutant in glycerol (blue) and the H-Ras mutant in glycerol/

(9)

Introduction and Background:

H-Ras is a GTPase involved in cellular signaling and workings in eukaryotes.

Protein-protein interactions are necessary for signaling and a myriad of other cellular

processes. Protein-protein interfaces can be studied through a number of techniques,

including Nuclear Magnetic Resonance (NMR) and X-Ray Crystallography. The present

study uses a technique based on X-Ray Crystallography called Multiple Solvent Crystal

Structure (MSCS) to determine interaction sites within Ras (7, 8, 9).

The MSCS uses various organic solvents to locate and characterize protein binding

sites, probe plasticity in protein structure, and study hydration patterns (7, 8, 9). With this

technique, probing of mutant protein binding sites may lead to discovery of changes in

binding sites and related function. The objective of the following study was to express the

mutant forms of H-Ras, form protein crystals, and use MSCS techniques to identify areas of

potential protein-protein interaction. A comparison of the X-Ray crystal structure of the

mutants to that of the wild-type, in different organic solvent soaks, reveals differences along

the surfaces of the proteins responsible for changes in protein-protein interaction and may

also contain clues to the aberrant function of the mutants. The study of H-Ras within this

thesis reveals only the crystalline structure of the protein soaked in one organic solvent,

glycerol. Further organic soaks are needed to show possible function of the mutant within

this protein as well as potential sites of protein-protein interaction (7, 8, 9).

The MSCS method first requires cross-linking of a crystalline protein with

glutaraldehyde. The crystal is then soaked in a variety of organic solvents, which represent

functional groups that might be present in a larger binding molecule. Each protein structure

(10)

specific organic solvent molecule. The structures and the solvent molecules are then

superimposed onto each other. The “ligand” or in this case the organic solvent molecule

tends to bind areas with access to hydrophobic residues. Mattos et. al. (2006) used elastase

to develop the MSCS method. Extensive mapping of the protein surface was possible due to

the method of superimposing several structures. The structure of Elastase was solved in

100% acetonitrile, 95% acetone, 55% dimethylfomanide, 80% ethanol, 80% isopropanol,

80% 1,2-hexenediol, and 40% trifluoroethanol (9). After superposition of the protein

structure/solvents, it was shown that the organic molecules clustered around the active site

and a few crystal contacts. However, the crystal contacts did not have as many solvent

molecules as the active sites (9). In general, an area with six different organic solvent

molecules interacting with the protein typifies a hydrophobic interaction point as seen in the

S1 site of elastase (9). Where the molecules bind is important but also the type of molecule

that binds is vital in the characterizing of the protein surface.

The type of organic molecules at the site reveals traits of the binding site. This can

show specificity of the binding site for certain types of ligands. The MSCS method can also

show the plasticity of the protein binding site and the functions of water molecules within

the context of the surface of the binding site. MSCS method is useful in many ways because

it does not require prior knowledge of a protein’s properties or of an inhibitor or substrate.

Identification of the hydrophobic or interactive areas helps in determining protein

structure-function relationships, protein/protein or protein-ligand interaction, and the design of ligands

for disease control (9)_{. This method is being used to study GTPases, which are necessary for}

(11)

The G-proteins (or GTPases) are found in a wide range of eukaryotes, from yeast to

humans. The superfamily is comprised of five groups: Ras, Rho, Rab, Sar1/Arf, and Ran

families (see Figure1). The five sub-groups of the superfamily provide a large range of

cellular functions. Although there is a significant amount of cross-talk between the

signaling pathways involving GTPases, each group has been identified with specific types of

functional roles. The Ras family members mainly regulate gene expression. The Rho

family regulates cytoskeletal reorganization and gene expression. Rab and Sarf1/Arf

members regulate intracellular vesicle movement while Ran regulates nucleocytoplasmic

trafficking during G1, S, and G2 phases of the cell cycle (17, 18). All of these GTPases are

categorized as members due to a basic molecular “switch” involving interaction with GTP

and GDP.

Ras-GTPases function through the use of guanine nucleotide exchange factors

(GEFs) to catalyze the conversion of Ras to the GTP-bound state, and GTPase activating

proteins (GAPs), which accelerate the rate of hydrolysis of bound GTP to GDP (11, 15, 18).

Without GAP the rate of Ras activity is 4.2 x 10 -4 s -1 (15). GAP increases the rate of GTP

hydrolysis by stabilizing the transition state (18)_{. Figure 2 shows how the GTPases function}

(12)

Figure 1. A dendogram of the G-Protein Superfamily. In Red, the Ran and Rab families. Green is the Sar1/Arf family. The Ras family is in yellow, while the Rho family is seen in blue (18)_.

(13)

The conserved sequences found throughout G-proteins (including Ras) gives insight

into the potential function of these areas and reasons for conservation (see Figure 3). The

N-terminal end contains a conserved Lysine 5 residue, although the reason for conservation is

not known. The N-terminal domain has the first four beta strands and two alpha helices. It

also contains phosphate (the P-loop) and magnesium ion binding loops. The boxes

GxxxxGK and DXXG are the conserved sequence motifs for the phosphate and magnesium

respectively (18). The C-terminal domain contains the other two beta sheets and alpha

helices. Binding loop motifs, NKXD and EXSAX, bind the guanine nucleotide base. The

C-terminus also has a cysteine containing motif at the end that is necessary for post

translational modification.

Figure 3. Diagram of conserved sequences found in GTPases and the relative location within the protein (18)_.

This sequence is truncated and does not show the CAAX motif.

The post-translational modifications of the cysteine motif anchor the protein to the

plasma membrane. Each of the Ras proteins takes a different route to its location in the

plasma membrane, with each functioning in a distinct microdomain (12). H-Ras (and the

other Ras proteins) are anchored to the membrane through hydrophobic modifications such

as isoprenylation and palmitoylation. H-Ras undergoes four post-translational modifications

(14)

C-terminal cysteine and undergoes prenylation catalyzed by the farnesyltransferase. The last

three terminal amino acids are removed through an endoprotease. A methyltransferase adds

a methyl group to the S-farnesylated cysteine and the palmitoyltransferase performs the

palmitoylation of one or two cysteines (20).

Figure 4. Schematic of the general farensylation/palmitoylation pathway that H-Ras166 undergoes in order to be anchored to the membrane. The protein must be anchored to the plasma membrane before becoming functional in activation pathways (13)_.

This anchors the H-Ras to the plasma-membrane. From here, H-Ras activates kinase

cascades through interaction with GTP, which allows the binding of the effector molecule.

The GTP-bound Ras protein constitutes the “on” state and allows the Ras protein to interact

with its effector molecules. The switch mechanism involves a

guanine-nucleotide-dependent conformational change in Switch I and Switch II domains of the Ras protein,

(15)

molecules then activate kinase cascades specific to the effector molecule/Ras complex. The

Ras/GTP complex interacts with multiple downstream effector molecules and regulates

different pathways. One such pathway includes mitogen activated protein kinases

(MAPKs). This pathway controls cellular growth and cytokine receptor signaling within

eukaryotic organisms. In the MAPK module each enzyme phosphorylates and activates the

next member within the cascade (21). The typical MAPK cascade contains two other

proteins: the MAPK kinase kinase (MEKK) that activates MAPK kinase (MEK), which then

activates the MAPK. This module must be activated by a GTP/RAS/Effector interaction (13,

21)_.

Induction of activation is by a variety of extracellular signals that activate receptors

with intrinsic or associated tryrosine kinase activity. Phosphotyrosines serve as docking

sites for adaptor proteins, which in turn can recruit SOS (21). An extracellular signal

activates the trasmembrane receptors, and in the case of Ras, the SH3 domain of Grb2 binds

to a proline-rich region of the GEF, Son of Sevenless (SOS). These receptors have no

intrinsic protein tryrosine kinase activity but harbor sites for tyrosine phosphorylation, thus

activating the cascade through their phosphotyrosine residues with “adaptor” molecules.

The SOS increases the association of Ras with GTP and the GTP/Ras binds to RAF (a

MEKK) isoforms which targets Raf to the membrane (21). Here the protein kinase activity

increases by phosphorylation. This kinase cascade is vital in all eukaryotic organisms.

Figure 5 shows the Ras activated MAPK and JNK pathways. These pathways are important

(16)

Figure 5. H-Ras/GTP activates the MEK, PI3-Kinase, RaldGDS, and PLC pathways. The image above shows the MEK pathway to cellular proliferation and differentiation (23)_.

The ras genes were discovered due to the high oncogenic potential when transduced

with Harvey and Kirsten murine sarcoma viruses (5). Variants of the two genes, H-Ras and

K-Ras, and N-ras, an isoform found in neuroblastoma cells, are transforming genes when

mutated, and these mutations are present in many human and animal tumors (16). The genes

encode proteins that control regulatory pathways responsible for normal cellular

proliferation and differentiation (12). In thirty percent of human tumors, one of the three

endogenous Ras genes, mentioned above, has a somatic point mutation (1). These mutations

can cause cancer in the pancreas (90%), colon (50%), lung (30%), and thyroid (2, 18). The

point mutations result in the accumulation of the Ras GTP-bound form (3). Oncogenic

mutant forms of H-Ras include the change of Glutamine 61 to Leucine or Arginine and

(17)

locations in H-Ras function to cause oncogenic (bound to GTP) variation by not allowing

GTP hydrolysis. Both the intrinsic and the GAP catalyzed reaction is abolished, although

GAP is still able to bind the RAS/GTP molecule (1).

The catalytic domain of H-Ras is comprised of five alpha helices with a central six

stranded beta sheet. Ten loops (named one to ten respectively) connect the alpha helices and

beta strands. Three areas vital for GTP binding in H-Ras are Switch I, Switch II, and the

Phosphate Binding Loop (P-loop). Residues 30-38 (in Loop 2) make up the Switch I region

and residues 60-72 (Loop 4 and part of Alpha helix 2) make up the Switch II region. Switch

I and Switch II are involved in catalysis of GTP. The effector binding area (or where GAP

binds) is made up of amino acids 32-40, encompassing part of Loop 2 and part of the second

beta strand. The effector region, as seen below, overlaps with the majority of the Switch I

region (10).

(18)

Upon binding of the GAP, the Switch I and Switch II regions become much more

ordered. The P-loop is made of residues 11-14 and is involved with phosphate binding

along with loops one, two, and four. Loops eight and ten are involved in binding the

guanine nucleotide. The effector binding region for H-Ras contains residues 32-40, which is

comprised of part of loop two and part of beta strand two. Theses regions are involved in

the catalysis of GTP and interaction with the GAP. The switch regions change the most in

conformation when going from the GTP to GDP-bound forms or vice versa. Switch I moves

away from the nucleotide pocket, with Tyrosine 32 completely directed away from the

nucleotide. The Switch II N-terminal portion moves away from the nucleotide pocket as

well, with alpha helix two unwinding one turn (10). The GAP molecule binds the GTP-bound

Ras through the effector region, greatly restricting the flexibility of the switch regions. The

GAP stabilizes the Switch II region, allowing for glutamine 61 to participate in catalysis.

Arginine 789 (from GAP) contributes to the active site by neutralizing the developing

charge in the transition state (11, 14, 18). In the catalytic region there are amino acids that when

mutated, cause oncogenesis by keeping the Ras switch in the “on” position. Residue 12

(Glycine) is oncogenic when replaced with a valine. This mutation physically blocks the

catalytic site. The 61 residue (Glutamine) also becomes oncogenic when replaced with

Leucine. Glutamine 61 interacts with a water molecule responsible for attacking the gamma

phosphate during catalysis. With leucine in place this reaction cannot take place (19).

Burhman and Mattos (2006) report a new crystal form of H-Ras166 having symmetry of the

R32 space group. This is the crystal form used to determine all of the structures discussed in

(19)

conformation, directly over the nucleotide, which is also seen in non-catalytic crystals of

Rap2A/GTP (4).

This thesis will first outline the materials and methods used in the study and then present

the results with a discussion of the structural features observed for Q61L H-Ras/GMPPNP (a

non-hydrolyzable GTP analog) in glycerol. The Q61L refers to the mutation of the

Glutamine 61 residue to Leucine. The Results and Conclusion section will compare the

Q61L H-Ras/GMPPNP crystal soaked in 70% glycerol to the wild-type H-Ras/GMPPNP

protein crystal soaked in 70% glycerol. It will also compare the Q61L H-Ras/GMPPNP

crystal in aqueous solution and in glycerol.

Materials and Methods:

Cloning of H-Ras 166 into pET-21a(+)

The Mattos lab received a clone for the catalytic domain of H-Ras in a PAT plasmid

from the Campel lab (UNC, Chapel Hill). The catalytic domain (H-Ras 166) corresponds to

the domain after truncation of twenty residues from the C-terminus, making the functional

protein easier to use in experimentation. However, the PAT plasmid is not well known and

a complete sequence was not available. The Ras gene was therefore transferred to the

pET-21a(+) plasmid, a well known and studied construct. Transferring the gene began by

introducing NdeI and BamHI cut sites into the PAT vector in order to retrieve the H-Ras 166

sequence. The primers are seen below:

Upper Primer:

5’- GGAGGTCCATATGACGGAATATAAGCTGG -3’ Lower Primer:

(20)

The NdeI primer cut at CA-TATG and the BamHI primer at G-GATCC. The PCR

reaction used 1uL of PAT plasmid DNA, 36.75ul of distilled water, 1ul of 10mM dNTP

mix, 3ul of each respective primer (5pmol/ul), 5ul of Nova Taq buffer with MgCl2, and

0.25ul of Nova Taq DNA polymerase. For detailed instructions see Novagen Nova Taq

DNA Polymerase and Kits. Briefly, the polymerase chain reaction (PCR) puts the cut sites

into either side of the H-Ras 166 protein that is within the PAT plasmid. The recombinant

PAT DNA, using 1% agarose gel containing 0.5ug/ml eithidium bromide, was gel purified.

The eithidium bromide allows for visualization of the DNA under ultra violet (UV) light.

The plasmid was isolated from the gel using Quiagen Spin Miniprep. The mutated PAT

plasmid and the pET-21a(+) plasmid we digested using 1.5ul of NdeI, 0.5ul of BamHI,

Buffer for BamHI (being similar to that of NdeI), 0.5ul bovine serum albumin (BSA), the

respective pET or PAT DNA, and distilled water up to 30ul. A tube without DNA served as

a negative control and all of the tubes were placed in a 37◦C incubator for one and a half

hours. Dye and running buffer were added to each tube and then they were run on a 1%

agarose gel containing 0.5ug/ml of ethidium bromide. The digested pET vector and H-Ras

166 (containing 547 nuclotides due to the addition of the cut site primers) were isolated.

This produced sticky ends that were complimentary to the ends of the H-Ras 166 protein

sequence which allowed annealing to occur. Annealing the H-Ras 166 gene to the pET

plasmid required T4 DNA ligase, T4 DNA buffer, distilled water, and a 2:1 ratio of the

insert (H-Ras 166 sequence) to the vector (the pET-21a(+)). The protocol is found in

Stratagene Ligation manual. The components were placed in the PCR machine, which was

(21)

The plasmid was then placed into the XL-Blue cells via the transformation protocol

supplied by the Quick Change II Site-Directed Mutagenesis Kit. The cells were selectively

grown on Lauria broth agar plates with carbenicillin, overnight at 37°C. This selects only

the cells that took in the recombinant plasmid. A stab of the cells from the plate are then

grown in 200ml Lauria Broth (with the appropriate amount of antibiotic) overnight at 37°C

(in the shaking incubator at 225rpm). There was now enough of the plasmid DNA to isolate

and place into storage cells (T10) for a stock of wild-type H-Ras 166 (pET-21a(+)) and to

use for mutagenesis. The new H-Ras 166/pET21a(+) was isolated through the Quiagen

microcetrifuge miniprep. Gel electrophoresis confirmed the addition of H-Ras 166 to the

pET-21 plasmid in comparison to the original pET-21 vector. The next step was to

introduce the mutations into the H-Ras 166.

Mutation of H-Ras 166

Mutagenesis primers were designed to introduce valine (instead of the wild-type

Glycine) into the twelfth amino acid position (G12V) and in a separate experiment, a

Leucine for a Glutamine at the sixty-first amino acid position (Q61L). Bother were single

mutants. The primers were designed using the program found at

http://labtools.stratgene.com/QC(22). Shown below are the primers for G12V and Q61L.

G12V Sense

5’- GTGGTGGGCGCTGTAGGCGTGGGAAAG -3’ Antisense

5’- CTTTCCCACGCCTACAGCGCCCACCAC -3’ Q61L

Sense

5’- GGATACCGCCGGCCTAGAGGAGTACAGCG -3’ Antisense

(22)

QuickChange II Site Directed Mutagenesis was the protocol followed for

mutagenesis. Briefly, 5ul of 10X reaction buffer, 2.0ul of the plasmid DNA containing the

wild-type H-ras 166 sequence, 10ng of each respective mutagenesis primer, 1ul of dNTPs,

and then distilled water brought up to a volume of 50ul were mixed. After the contents of

the PCR-mutagenesis tubes warmed up, 1 ul Pfu Ultra was added, using the “hot start”

method. Pfu Ultra is the enzyme used to catalyze the mutagenesis reaction. The PCR

program selected followed the manufacturers’ protocol. Afterwards, 1 ul of Dpn was added

to each reaction tube, with gentle pippeting of the mixture. The tubes were spun down using

the microcentrifuge for one minute. They were then placed in the 37°C incubator for one

hour, digesting the parental template DNA. The mutated plasmids were then transformed

into the XL1-Blue supercompetent cells. The cells were grown on carbenicillen selective

solid Lauria broth plates in a 37°C incubator overnight. The next day, a colony from the

overnight culture, was grown in ampicillin inoculated Lauria broth overnight at 37°C while

shaking (225rpm). The mutated plasmid was isolated from the Lauria broth culture using

Quiagen microcentrifuge miniprep kit. The DNA was then sent off for sequencing, which

confirmed the mutation to the H-Ras 166 sequence. The plasmid was transformed into the

expression vector (Rosetta cells) in the manner described by Novagen. The cells grew on

agar plates in a 37°C incubator overnight. A sample of the culture was then placed into

240ml of Lauria broth and grown overnight in a shaking incubator (225rpm) at 37°C. The

next day, 40ml aliquots of the overnight culture were placed into one liter flasks of sterile

(23)

Western Blot

Identical SDS-PAGE gels are run using fractions from the protein expression or

fractions from later purification steps. One gel is dyed with Comassie Blue and used as a

control for the Western Blot. The other gel is soaked with the transfer membrane and pads

in transfer buffer (25mM Tris pH8.3, 190mM Glycine, and 20% methanol). They are

soaked and washed three times for ten minutes each (while gently shaking). The proteins

from the gel are transferred to the membrane at 25volts for fourteen minutes. The

membrane is placed in blocking buffer (5%milk, 10mM Tris pH7.5, 100mM NaCl, and

0.1% Tween) for one hour at 37°C on a gentle shake. The block is taken off and the

membrane incubated with the primary antibody. The primary antibody was Anti-Pan-Ras

Ab-1 Mouse from Calbiochem and specific for H-Ras. The antibody (5ug/ml) was placed in

10ml of blocking buffer and added to the membrane. This was placed into the incubator at

37°C, shaking for forty-five minutes. Afterwards, the primary antibody was removed and

the secondary antibody added (Immuno Pure Antibody, Rabbit Anti, Mouse IgG). This

secondary antibody is sensitive to alkaline phosphatase. After incubating with the antibody

for forty-five minutes (37°C) the membrane was developed with alkaline phosphatase

solution.

Mutation for Triple Stop Codon

Mutagenesis primers were designed to introduce a triple stop codon to the end of the

protein sequence. This insures less read-through and more of the complete protein.

These triple stop codon mutant sequences were used for the mutagenesis, described in the

next section. Below are the primers for the triple stop codon.

(24)

5’- CCGGCAGCACTGATGATGAGGATCCGAATTCGAG- 3’ Antisense

5’- CTCGAATTCGGATCCTCATCATCAGTGCTGCCGG -3’

Expression and Isolation of H-Ras 166 protein

Six liters were inoculated with cells containing one of the H-Ras mutants (and the

triple stop codon) and placed into the shaker (225rpm) at 37°C. The cells were grown until

they reached an optical density (O.D.) of 0.8 to 1.0 (log phase). The measurements were

taken in a spectrophotometer set with a wavelength (λ) equal to six hundred nanometers.

The cells were then inoculated with 1mM Isopropyl-β-D-Thiogalactopyranoside (IPTG) and

the temperature turned down to 32°C, to allow for expression of the H-Ras166 protein. The

cells expressed the protein for four to eight hours. The cells were then centrifuged down

into a pellet. The H-Ras 166 mutated protein was then isolated from within the cells as the

following describes. The cells were resuspended using 50 ml cellular resuspension solution.

This solution contains 20XTris/MgCl2, 5M NaCl, 1mM DTT, 20mM GDP, 200mM

Glycerol, and distilled water. Protease inhibitors including antipain, E34, Pefabloc,

Pepstatin, and Benzamidine were added to the mixture. The resuspension solution was

sonicated for thirty seconds and then cooled for thirty seconds. This step was repeated six

times with the solution on ice. The sonicated mixture spun down at 7,000 rpm for twenty

minutes. The soluble liquid fraction was isolated and placed in an ice bath. Slowly 100ul of

10% PEI was added and mixed at 4◦C for one hour. Afterwards, the solution spun down at

15,000 rpm for ten minutes. This gets rid of any heavy material and DNA.

Anion Exchange Chromatography

The supernatant from the previous step was applied to the Q Sepharose Fast Flow

(25)

Amersham. Unbound material was eluted off using a 20mMTris/MgCl2 buffer, pH 7.5. The

bound protein was removed through a linear salt gradient. The samples were tested for

protein through SDS-PAGE. The protein runs in the gel at about 15kD to 20kD and elutes

off the column around 30% to 40% of the elution buffer. Gel Filtration (or size exclusion

chromatography) purified the protein further.

Size Exclusion Chromatography

The isolated fractions from the previous step were spun down to concentrate the

sample to less than two milliliters. A centrifugal filter device from Millipore had a size

exclusion of 5kD and concentrated down the protein sample. The solution was applied to

the 100 Sephadex gel filtration column and eluted off with a 150mM NaCl buffer. Samples

of the fractions were run on the SDS-PAGE which revealed the fractions containing protein.

These fractions were isolated and further purified through the 5ml Q Sepharose HP column.

High Resolution Anion Exchange Chromatography

The collected fractions were diluted, from the previous step, down to 70mM NaCl.

The protein was applied to the equilibrated Q Sepharose HP Column. The protein was then

eluted off over 110 ml, 0-35% of the elution buffer. SDS-PAGE located the fractions with

purified protein and these were collected and concentrated down to less than two milliliters.

Exchange Reaction

The exchange reaction removes salt from the purified protein and binds GMPPNP to

stabilize the protein in the active conformation. The protein was passed over a desalting

column that is pre-equilibrated with 200mM Amonium Sulfate/ 32mM Tris (pH 8.0) buffer.

The fractions were collected and the tubes with protein determined through the Bovine

(26)

10mM GMPPNP was added to the total protein solution. The protein (plus GMPPNP)

incubates with alkaline phosphatase (AP) beads (20mg protein/100 U tube AP beads) for

forty-five minutes to one hour at 37°C while shaking. Afterwards, 20mM MgCl2 is added

and the beads are centrifuged (two minutes, 4°C). The supernatant was poured over a

desalting column pre-equilibrated with 20mM Hepes/20mM MgCl2 buffer. The tubes with

protein are then identified through the BSA Assay. The protein fractions are pooled and

concentrated down to 15-20mg/ml. The purified protein is then used to set up trays to

produce Q61L H-Ras 166 crystals.

Crystallization (performed by Greg Buhrman)

Q61L H-Ras crystals were grown in the R32 space group through the hanging drop

vapor diffusion method using protein concentrated down to 15-20mg/ml. The reservoir

solution consisted of 800 ul (0.2 M CaCl2. 24-26 % PEG3350) and 200 ul stabilization

buffer (20mM Hepes/20mM MgCl2) with fresh DTT (4). The drop consisted of 2ul of

protein and 2ul of the reservoir solution and the crystals grew in approximately 5-10 days.

Crystal Soaking in Glycerol and Diffraction Data Collection (performed by Greg Buhrman)

Crystals were soaked in glycerol by first transferring the crystals to 50ul of

stabilization buffer, described above, within the sitting drop plate. The crystals were washed

with stabilization buffer three times and then transferred into a crosslinking buffer

(stabilization buffer with 1% glutaraldehyde). The crystals soaked for thirty minutes to one

hour and then the crosslinking buffer was exchanged with stabilization buffer. Glycerol was

added in increments to a value of 70% glycerol. The crystals were soaked for 90-180

minutes and then flash frozen in liquid nitrogen (4). Diffraction data were collected at the

(27)

and the exposure time was 1-3 seconds, an oscillation angle of one degree was used during

data collection in a total of 180 frames. The crystal to detector distance was 120mm and the

HKL2000 program was used to index and scale the data (4).

Structure Refinement of Q61L in Glycerol

The Crystallography and NMR System (CNS) program was used for reciprocal space

refinement, with 10% of the unique reflections removed for the Rfree calculations (4). A

protein model of wild-type H-Ras 166 was used for 30 rounds of rigid body refinement at

2.5 Angstroms (Å). This was followed by 30 rounds of rigid body refinement at 2.0Å.

Simulated annealing, energy minimization, and B-factor refinement were used in CNS

before the calculation of the 2Fo-Fc and Fo-Fc electron density maps (4). Manual rebuilding

was done in the program O and addition of water molecules, ions, GMPPNP, and glycerol

were also added during successive rounds of refinement. Coordinate files for water

molecules were taken from previously published structures found in the Protein Data Bank

(PDB). Buhrman et. al. (2005) also used the wild-type Ras 166 from crystals having

symmetry of the space group R32 to phase the Q61L H-Ras 166 mutant (4).

Results and Discussion:

Isolation of H-Ras 166 for Greater Yield and Purification

The sequences of G12V and Q61L H-Ras 166 in pET were correct, but there was a

problem in the isolation of the protein. After purification of the protein through the QFF

column, SDS-PAGE gels were run to determine the fractions that contained the H-Ras 166

protein (about 20 kDa). The gels below show the fractions possibly containing the protein

(Figure 7). However, another band ran at about the same size but slightly bigger. In order

(28)

Western Blot was run. The protein that was originally collected is seen in the bright bands

at the bottom, closer to the 15kDa standard. The top bands, closer to the 20kDa standard,

however, were also H-Ras. Upon review, a triple stop codon was added to the end of the

protein sequence to insure read-through was not occurring.

20kD

15kD

19 20 23 24 2

0kD 15kD

S 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 S 28 29 30 31 32 33 34 35

Figure 7. SDS-PAGE of QFF Fractions and Western Blot. Upper image is the SDS-PAGE of the fractions containing H-Ras protein after QFF purification. The numbers represent the fraction from which the sample was collected. The green circle indicates the H-Ras protein fractions run on the Western Blot that produced single bands (Fractions 19 and 20). Correspondingly, the red circle indicates the fractions found in the Western Blot that produced the dual bands (Fractions 23 and 24).

Upon addition of the triple stop codon, the protein was found entirely in the lower

band (Figure 8). This resulted in an increase in yield of purified H-Ras 166 for

crystallization.

(29)

Figure 8. SDS-PAGE and Western Blot of QFF fractions after triple stop codon. The left hand image

represents the fractions containing H-Ras, with the triple stop codon, collected after QFF purification. The red circle indicated the H-Ras seen in the corresponding Western Blot.

S 21 22 23 24 25 26 27 28 29 30 31 32 33 34 S 21 22 23 24 25 26 27 28 29 30 31 32 33 34

Both mutants were constructed with a triple stop codon to insure no read-through

was occurring. However, the protein was still not pure at the end of the S-100 Gel Filtration

run as seen in the following gel. An extra QHP column was added to further purify the

protein in order to more easily produce crystals (Figure 9). Addition of the triple stop codon

and an extra QHP column ensured the acquisition of highly pure H-Ras protein.

Gel Filtration QHP

Figure 9. Comparison of Gel Filtration and QHP fractions. The left hand figure shows contamination after gel filtration. The right gel shows the protein fractions after collection from the QHP column.

S 51 52 53 54 55 56 57 58 59 60 S 16 17 18 19 20 21 22 23 24 25 26 27 28

Superimposition and Root Mean Square Deviation:

(30)

structures of the mutant H-Ras in aqueous solution and the wild-type H-Ras in glycerol (4).

Table 2 shows the data and statistics for the superposition for the latter two protein

molecules onto the mutant H-Ras in glycerol.

All superimpositions and Root Mean Square Deviation (R.M.S.D.) calculations were

done with LSQMAN (Upsala Software Factory). The brute force algorithm was used to

superimpose the mutant H-Ras in glycerol with the wild-type H-Ras in glycerol and the

mutant H-Ras in aqueous solution. Root mean square deviations per residue for each

superimposition were calculated with a global fitting algorithm. Visual inspection of the

structures was done manually in O and PyMol.

The comparison of the mutant H-Ras in glycerol with the wild-type in glycerol is to

determine the differences in the position of glycerol molecules and what changes are caused

by the mutation. The comparison of the mutant H-Ras in glycerol with the mutant in

aqueous solution was to determine if structural changes were due to the mutation or to the

(31)

Table 1. Data collection and refinement statistics for R32 crystal space group of Ras/GMPPNP Wild-Type in 70% glycerol and Ras/GMPPNP Q61L mutant in aqueous solution, Ras/GMPPNP Q61L mutant in70% glycerol.

Solvent

(concentration)

Aqueous Solution of Q61L Mutant

Glycerol (70%) Wild-Type Glycerol (70%) Q61L Mutant

Space Group R32 R32 R32

Unit Cell a = 88.95 b = 88.95 c = 134.02Å a = b = 90° g = 120°

a = 89.28 b = 89.28 c = 134.70Å a = b = 90° g = 120°

a = 89.157 b = 89.157 c = 135.09Å a = b = 90° c = 120°

Temperature (K) 100 100 100

Resolution (Å) 2.0 1.6 2.0

# Reflections 13,810 26,775 11,707

Completeness (%) 98.3 97.4 88.5

Rfactor/Rfree (%) 19.7/23.4 20.0/21.5 18.9/21.2

Bond Length (Å) 0.005 0.004 0.008

Bond Angle (°) 1.1 1.1 1.3

# Protein Atoms 1,303 1,295 1,322

# Nucleotide Atoms 32 32 32

# Magnesium

Molecules 1 1 1

# Calcium Molecules 3 3 2

# Water Molecules 128 93 124

# Organic Solvent

Molecules 0 12 7

Table 2. Statistics of comparison molecules superimposed onto the Q61L H-Ras (R32 Space Group) in 70% Glycerol. ** MI = (1+Nmatch)/([1+W*RMS]*[1+Nmin]).

Molecules

Superimposed onto Q61L H-Ras in 70% Glycerol

Aqueous Solution of Q61L Mutant

Glycerol

(70%) Wild-Type

RMS Distance (Å) 0.220 0.181

RMS*Nmin/Nmatch 0.220 0.181

MI** 0.81988 0.84673

Maiorov-Crippen RHO

(0-2) 0.01522 0.01253

Relative RMSD 0.01422 0.01171

Normalized RMSD (Å) 0.175 0.124

RMS delta B for

Matched Atoms (Å2₎ 11.298 3.791

Corr. Coefficient

(32)

Comparison of Q61L Aqueous vs. Q61L in Glycerol:

Glycerol is well-known to have a stabilizing effect on protein structure and is widely

used as a stabilizing agent (4, 7, 8, 9). As seen previously with other organic solvents, this

stabilizing effect results in a disorder to order transition within certain mobile elements of

protein structure (4, 7, 8, 9). This was also seen in the mutant H-Ras in glycerol structure. On

comparing the mutant in glycerol to the H-Ras mutant in aqueous solution, the change

becomes more apparent. In Figure 10, comparison of the R.M.S.D. per residue between

aqueous and glycerol soaked mutant structures clearly shows that the largest changes in

protein structure occur in solvent exposed regions of the protein. Some of these regions are

highly flexible and disordered in the aqueous structure, but become ordered to alternate

conformations in the presence of glycerol. Two specific examples are Asparagine 26 and

Arginine 102.

RMSD & Accessible Surface Area vs. Residue Number

0 10 20 30 40 50 60

1 8 15 22 29 36 43 50 57 64 71 78 85 92 99 106 113 120 127 134 141 148 155 162

Residue Number A c ce ss ib le S u rf ac e A rea 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 R MSD ( A ) Acc.SurfaceAreaQ61LAqueous RMSD Q61LAQ/Q61LGly RMSDQ61LGly/WildTypeGly

(33)

Arginine 102 adopts a more buried conformation, forming a salt bridge with Aspartic

acid 69 (3.52Å) seen in Figure11. Aspartic acid 69 is more ordered due to this interaction,

whereas the structure of the mutant in aqueous solution, the Aspartic acid 69 was modeled

as an Alanine due to lack of density. A similar ordering effect within Switch II is seen for

residues Glutamic acid 63 and Serine 65. The change is seen in the glycerol mutant even

though there is no glycerol molecule nearby, suggesting that the ordering effect is due to

bulk properties of glycerol, such as lowering the dielectric which causes the protein

charge/charge interaction to be more favorable (4, 7, 8, 9).

In the case of Asparagine 26, alternate conformations are due to specific interactions

with bound glycerol molecules. The conformation seen in the wild-type in glycerol is

shown in Figure 12, with two glycerol molecules nearby. One glycerol is more solvent

exposed and the other more buried. The presence of the two glycerol molecules in the

wild-type structure seems to keep the side chain of Asparagine 26 in a more buried conformation,

similar to the conformation found in aqueous solution. The Asparagine 26 of the mutant in

glycerol also shares this conformation. However, unlike the wild-type, the mutant is lacking

the more solvent exposed glycerol. This change allows for the alternate conformation of

Asparagine 26 that is not found in the wild-type (see Figure 12).

(34)

Figure 11. Alternate conformation of Arginine 102. Note the salt bridge formed in the conformation of Arginine102 of the mutant in glycerol, not seen in the aqueous mutant. The second conformation of Arginine 102 in the glycerol mutant is similar to the aqueous mutant.

(35)

Specific comparison of the H-Ras mutant soaked in glycerol vs. the H-Ras mutant in aqueous solution:

The side chains of residues Lysine 147, Leucine 120, Aspartic acid 30, and Tyrosine

32 show conformational shifts due to the presence of glycerol molecules (average R.M.S.D.

for these four residues 0.22Å versus overall R.M.S.D. = 0.17Å). Even larger differences

occur for residues Tyrosine 64, Isoleucine 36, and Aspartic acid 38 due to the presence of

two glycerol molecules near the switch regions (average R.M.S.D. for these three residues

0.23Å versus overall R.M.S.D. 0.17Å). The conformational changes observed for these

residues towards Leucine 61 tightens the interaction of the H-Ras mutant in glycerol with

Tyrosine 32. Other solvent exposed residue that differ between the aqueous and glycerol

soaked structures include Aspartic acid 54, Arginine 41, Glutamic acid 3 and Lysine 5.

These residues all have good electron density in the glycerol structure. The average

R.M.S.D. between the mutant H-Ras in glycerol and the mutant in H-Ras in aqueous

solution structures is 0.17Å, while the average R.M.S.D. between mutant H-Ras in glycerol

and the wild-type H-Ras in glycerol structures is 0.12Å, supporting the notion that changes

in the solvent environment have a greater effect on overall protein structure than changes

due to a specific mutation.

Comparison of Q61L and Wild-Type H-Ras in 70% Glycerol:

The wild type and Q61L mutant Ras proteins are very similar in structure with few

major differences. Most of the differences between the two structures occur in the dynamic

Switch II region. The overall R.M.S.D. for the wild-type H-Ras soaked in glycerol and the

mutant H-Ras soaked in glycerol is 0.12Å. The wild-type H-Ras has twelve glycerol

(36)

part, to the higher resolution of the wild-type structure (1.6Å) to the mutant H-Ras in

glycerol structure (2.0Å). Five glycerol molecules bound to the H-Ras mutant in glycerol

have corresponding glycerol molecules in the wild-type H-Ras in glycerol structure

(glycerol molecules of the mutant Ras: 1, 2, 3, 4, 5 with a, b, c, d, e of the wild-type

H-Ras in Figure 13). Three of these are found in the effector binding pocket adjacent to the

nucleotide (see Figure 13). The other two glycerol molecules in the mutant and the other

seven in the wild-type Ras are in different positions and resulting in slightly different protein

side chain conformations. The most striking difference between the H-Ras mutant in

glycerol and the wild-type H-Ras soaked in glycerol is the absence of a glycerol at the site of

mutation.

Effector Binding Pocket

Many of the glycerol molecules are found near the effector binding pocket. Four are

near the GMPPNP molecule in both the wild-type H-Ras in glycerol (Fig.13 glycerols b, e,

d, and h) and in the mutant H-Ras in glycerol (Fig. 13 glycerols 2, 4, 5, and 6) structures,

with three being in overlapping positions (glycerols 2, 4, 5 and b, d, e). Two glycerol

molecules, found in slightly different locations in the mutant H-Ras and wild-type H-Ras

structures are still both near the GMPPNP pocket (see Figure 13). Some interesting glycerol

binding sites were also found in other areas in each structure.

Site of Mutation

The glycerol seen adjacent to Glutamine 61 of the wild-type H-Ras soaked in

glycerol is absent in the mutant H-Ras in glycerol. The glycerol (Fig. 14) is 3.58Å away

from the Glutamine 61 residue (see Figure 14). The polar side chain of the Glutamine 61

(37)

in glycerol does not favor that interaction. Figure 15 shows the R.M.S.D.s of the residues

for the mutant H-Ras in glycerol with wild-type H-Ras in glycerol and the mutant H-Ras in

glycerol with the mutant H-Ras in aqueous solution. Focusing on the individual residues in

the mutant H-Ras in glycerol with the wild-type H-Ras in glycerol, there is a higher

R.M.S.D. for Alanine 66 and Methionine 67, within the Switch II region.

l

3 c 5 e

6

4 d f

h 1 a

j

7

i g

2 b

m

(38)

A B

Figure 14. Wild-type and mutant comparison at residue 61.(a) Image of the glycerol molecule found near the Glutamine 61 in wild-type but not in the mutant H-Ras in glycerol. The wild-type H-Ras glycerol and Glutamine 61 residue are seen in salmon. (b) This is a figure in O that shows Glutamine 61 (wild-type H-Ras in glycerol) interacting with the glycerol molecule. The two water molecules of Leucine 61 in the mutant H-Ras can be seen in the same area as the glycerol molecule.

The Alanine 66 and Methionine 67 are exposed and the side chains do not have any

electron density, therefore the RMSD is higher in the aqueous structure. The Methionine 1

residue has complete electron density in the mutant H-Ras soaked in glycerol structure, but

in the comparable wild-type structures the residue was replaced with Alanine due to lack of

electron density. The difference between the two causes a large R.M.S.D. value. However,

the residues within the Switch II (residues 60-72, Avg R.M.S.D. = 0.33Å), have an overall

higher R.M.S.D. value than those seen in Switch I (residues 30-38, Avg R.M.S.D. = 0.13Å),

reflecting the general disorder of the Switch II region. This only reveals the major

differences of the two proteins in glycerol, with the Switch II being more disordered.

Comparison of the R.M.S.D. values of the mutant H-Ras in glycerol/wild-type H-Ras in

glycerol and the mutant in glycerol/mutant H-Ras in aqueous solution shows a similar

pattern within the switch regions. However, the C-terminal residues after Switch II, are

(39)

RMSD vs Residue Number 0 0.2 0.4 0.6 0.8 1 1.2 1.4

1 10 19 28 37 46 55 64 73 82 91 100 109 118 127 136 145 154 163 Residue Number RM SD ( A ) AqueousQ61L/GlcerolQ61L GlycerolQ61L/GlycerolWildType Figure 15. R.M.S.D. plots of the Ras mutant in aqueous solution/Ras mutant in glycerol (blue) and the H-Ras mutant in glycerol/wild-type H-H-Ras in glycerol (magenta).

The R.M.S.D. values between the H-Ras structures in the presence of glycerol (Total

Avg = 0.12Å) are lower than those between the aqueous structure and the H-Ras mutant in

glycerol (Total Avg = 0.17Å). Switch I (30-38) R.M.S.D.s for the structures in glycerol

(Switch I: Avg. R.M.S.D. = 0.13Å) is lower than that of the aqueous (Switch I: Avg.

R.M.S.D. = 0.18Å) but the Switch II residues (60-72) have the same average RMSD for the

mutant H-Ras in glycerol with the wild-type H-Ras in glycerol as that of the mutant H-Ras

in aqueous solution with the mutant H-Ras in glycerol (0.33Å). The overall changes seen

within the H-Ras protein is mainly due to the solvent and not the mutation. However, within

the Switch II region, this becomes unclear.

Conclusions:

The overall changes seen in the H-Ras mutant in glycerol is due mainly to the

(40)

glycerol to the protein in aqueous solution. However, the mutation comparison with

wild-type in glycerol does reveal several interesting facts. The glycerol molecule located near the

Glutamine 61 in the wild-type and not in the mutated protein is interesting considering most

organic molecules are attracted to regions of hydrophobicity. It is also noteworthy that

while most of the glycerol molecules found in the mutant structure and in the wild-type were

in separate areas, there were several places where the glycerols were superimposed, most

notably in the GMPPNP pocket. This region is important for binding the downstream

effector molecules and being involved in the catalysis of GTP to GDP. The conservation of

glycerol molecules to this area denotes the importance of this region. This is also similar to

results seen in Mattos et.al. (2006) where organic solvents tend to bind in areas of the

protein that are involved as binding sites to effector molecules. Future research will involve

more organic solvent soaks to the Q61L Ras crystal and review of the regions of interaction

within the protein, giving a broader understanding of the structure function relationship of

the mutated Ras protein. This may reveal important areas within the protein that can be used

(41)

REFERENCES

1. H.R. Bourne, D.A. Sanders, F. McCormick. Nature. 349:117–127 (1991).

2. J. L. Bos, Cancer Res. 49, 4682-4689 (1989).

3. J. Boguski et al. Nature 409, 922-927 (2001).

4. G. Buhrman, Title, Dissertation PhD., North Carolina State University, (2005).

5. J. J. Harvey, Nature 204, 1104-1105 (1964).

6. U. Krengel, Cell 62, 539-548 (1990).

7. C. Mattos and D. Ringe, Nat.Biotechnol. 14, 595-599 (1996).

8. C. Mattos and D. Ringe, Curr.Opin.Struct.Biol. 11, 761-764 (2001).

9. C. Mattos et al. JMB In Press, 1-12 (2006).

10. M. Milburn et al. Science 247, 939-945 (1990).

11. R. Mittal, M. R. Ahmadian, R. S. Goody, A. Wittinghofer, Science 273, 115-117 (1996).

12. G.W. Reuther and C.J. Der, Curr. Opin. Cell Biol. 12, 157-165 (2000).

13. E.K. Rowinsky et al. J. Clin.Oncol. 17, 3631-3652 (1999).

14. A. Scheidig, C. Burmester, and R.S. Goody, Structure 7(11), 1311-1324 (1999).

15. I. Schlichting et al., Nature 345, 309-315 (1990).

16. K. Shimizu et al., Proc.Natl.Acad.Sci.U.S.A 80, 2112-2116 (1983).

17. Y. Takai et. al., Mol.Biol.Cell. 11(5), 1875-1886 (2000).

18. Y. Takai et. al., Phys. Rev. 81(1), 154-188 (2001).

19. A. Valencia et al. Biochemistry, 30(19), 4637-4646 (1991).

(42)

21. http://www.biochemweb.org/signaling.shtml

22. http://labtools.stratgene.com/QC

23. http://www.cellsignal.com