• No results found

Structural Bioinformatics (C3210) Experimental Methods for Macromolecular Structure Determination

N/A
N/A
Protected

Academic year: 2021

Share "Structural Bioinformatics (C3210) Experimental Methods for Macromolecular Structure Determination"

Copied!
41
0
0

Loading.... (view fulltext now)

Full text

(1)

Structural Bioinformatics

(C3210)

Experimental Methods for

Macromolecular Structure

(2)

2

Introduction

Knowing the exact 3D-structure of bio-molecules is essential for any attempt to understand how they work and how they interact with each other. Unfortunately, getting to the structure of such macromolecules in the lab is not easy. X-ray crystallography,

Nuclear Magnetic Resonance (NMR) spectroscopy, cryo-electron microscopy and neutron diffraction are the four main techniques that provide protein structures experimentally.

The distribution by source of the structures available in the protein data bank is indicated in the diagram below (neutron diffraction not shown - it was approx. 66 structures in PDB in 2013).

PDB distribution in 2013: X-ray: 83373 (88%) NMR: 10079 (11%) EM: 650 (0.7%) ND: 66 (0.01%)

(3)

3

X-ray Crystallography

X-ray crystallography is a technique in which the pattern

generated by the diffraction of X-rays passing through the single crystal lattice of a given material is recorded and then analyzed to reveal the 3D structure of the molecules composing the lattice. X-ray crystallography involves the following steps:

 protein production  growing of crystal

 measurement of diffraction data

 electron density calculation  structure determination and

(4)

4

Protein Production and Purification

The first step is to get large amounts of protein we want to

crystallize. This is generally done by cloning the protein in various expression systems such as microorganism, bacteria, plant etc.... The protein is then purified, since only highly purified solutions

can give rise to crystals and not every protein can be purified to a sufficient degree.

(5)

5

Molecular Cloning

Molecular cloning refers to the procedure of isolating a defined DNA sequence and obtaining multiple copies of it in vivo. It can also be used to produce protein codded by the DNA sequence. The procedure consists of the following steps:

 preparation of protein coding DNA (isolation

from fragmented cell DNA)

 inclusion of the DNA fragment into plasmid

(cyclic DNA)

 transformation of DNA fragment into bacteria  cultivation of bacteria cells that include

required gene

 isolation of protein from bacteria cells and its purification (cell disruption, precipitation,

(6)

6

Growing of Single Crystal

Many approaches are used to produce crystals but the general method is to start with a solution of the protein and to slowly

change the conditions so that saturation is exceeded. This can be done by changing parameters such as ionic strength (by adding sallts), pH, temperature of the solution. Supersaturation has to be reached slowly so that only a few nucleation sites are created

which, with time, will grow larger.

Thousands of solution conditions may be tried before finding one that succeeds in crystallizing the molecules, and ultimate success is never guaranteed.

(7)

7

Methods of Growing Protein Crystal

The most common method is as follows. A protein is dissolved in a buffer solution and precipitant (salt or polyethyleneglycol) is added very slowly to reduce the solubility close to spontaneous precipitation. Then water is slowly removed by controlled

evaporation resulting in precipitation. If left undisturbed for a few days ideally a few large crystals will grow.

There are various methods implementing this approach. The most common is hanging drop - vapor diffusion method - a drop of

protein solution is suspended over a reservoir containing buffer and precipitant (of higher concentration than in the drop). Water diffuses from the drop to the solution leaving the drop with

optimal crystal growth conditions. Sitting drop - vapor diffusion

method is similar only different placement of a drop is used.

hanging

(8)

8

Crystalization Screening

 Different proteins crystallize under different conditions

(concentrations of protein and precipitant, pH, temperature ...) which is difficult to find

 Crystallization screening is the search for conditions that allow

crystal growth in a multi-component solution

 Automated systems are used to prepare solutions with different

concentrations of protein, precipitant and other components

 Mosquito Crystal robot automates high density 96-well hanging

drop plate preparation; see video:

http://www.youtube.com/user/ttplabtech#p/u/1/gNolATSi3Qo http://www.youtube.com/user/ttplabtech#p/u/3/gi3zcUyBLz4

(9)

9

Properties of Protein Crystals

 The protein crystals are not as dense packed as crystals of small

molecules. They are loosely packed, with large solvent-filled holes and channels that may occupy up to 50% of the crystal volume. Protein crystal are very fragile.

 The protein crystals must always be kept in contact with their

mother liquor or saturated vapour of the liquor even during

exposition to X-ray beams, so crystals are usually closed in glass capillary during measurement (see figure).

 Alternatively, a crystal can be cooled to very low temperature

(approx 100 K) which preserve the crystal in the X-ray beam

<- Packing of protein molecules in crystal. Black areas are filled with water.

-> Protein crystal in sealed capillary prepared for X-ray measurement.

(10)

10

The Single Crystal

 The crystal for X-rays measurement must be a crystalline solid

in which the crystal lattice of the sample is pure, continuous and unbroken.

 The size of a protein crystal must be approximately 0.2-0.5 mm

in all three dimensions to diffract strongly enough to achieve the resolution needed for a successful 3D structure determination.

 The basic unit of a crystal is called the unit cell, which is defined

by three principal axes (a, b and c) and three angles (alpha, beta and gamma).

(11)

11

Collecting the Diffraction Data

The crystal is bombarded by X-rays, and the interference

between rays scattered by different parts of the crystal give rise to diffraction. This diffraction pattern is dependent on the 3D

(12)

12

X-ray Difractiometer

 When X-rays encounters a crystal lattice, electrons diffract the

X-rays, which results in a diffraction pattern.

 In four-cycle diffractometer the crystal is mounted in a

goniometer head located at the centre of the diffractometer and irradiated with X-ray beam. Intensity of diffracted X-ray radiation is measured by detector for different crystal rotations.

 See video: http://www.youtube.com/watch?v=eRa42zzBkpw

http://www.youtube.com/watch?v=yNjPPYirwGA&feature=related http://www.youtube.com/watch?v=LU6lExqbxHs&feature=related

(13)

13

X-ray Generation

Evacuated cathode-ray tube:

 Electrons traverse the evacuated tube from cathode to anode

and generate X-ray by high energy collision with anode

 A high-energy electron collided with and displace an electron

from a low-lying ortbital in a target metal atom of anode. Then an electron from a higher orbital drops into the resulting

vacancy, emitting its excess energy as an X-ray photon

(14)

14

X-ray Generation

Synchrotron radiation:

 Synchrotron are particles accelerators that circulate electron

beams at speeds close to the speed of light

 Whenever electron beam is forced to change direction,

electromagnetic radiation is released

 Produced radiation covers broad range of wavelengths about 1 A  X-ray radiation is two orders of magnitude more intense, so

measurements are much faster than with evacuated cathode-ray tube radiation

 Synchrotrons are big and expensive devices  Web links: http://www.esrf.eu/AboutUs/GuidedTour

http://www.diamond.ac.uk/Home/Media/Links.html

 Video:

http://video.moglik.com/v/2xzLxYE18C8/Tour-of-the-I16-Experimental-Hutch-with-the-.htm http://www.youtube.com/watch?v=_xv_GwCgpAc

Cathode-ray tube Scheme of synchrotron Synchrotron in Grenoble

ESFR, Grenoble, France

(15)

15

Electron Density Calculation

 The diffraction pattern is analyzed by mathematical and

computation methods (Fourier transform analysis) to produce an

electron density map. These maps show contour lines of electron density.

 Since electrons more or less surround atoms uniformly, it is

possible to determine where atoms are located

 Unfortunately, since hydrogen has only one electron, it is difficult to map hydrogens

(16)

16

Electron Density Maps at Different Resolution

 Quality of density maps is described by its resolution

 Very low resolution 8 to 6 Å: solvent/protein boundary visible,

may see helices or beta sheet (unlikely), no sign of ligands

 Low resolution 6 to 3.5Å: secondary structure visible

 Medium resolution 3.5 to 2.4Å: side chains visible, some still

blurred, unambiguous secondary structure, ligands clearly visible, some water (primary solvation shell) visible

 Medium high resolution 2.4 to 2.0Å: most side chains

unambiguously defined

 High resolution 2.0 to 1.6Å: very accurate ligand-protein

interactions, multiple side chain conformations, secondary solvation shell visible

(17)

17

Structure Determination and Refinement

Structure determination:

 Electron maps are interpreted and fitted to a 3D structure model  For low and medium resolution electron maps, information abut

residues sequence is necessary, because some amino acids have very similar side-chain electron density

 Structure model can be build manually using interactive

graphics system or automatically using software tools

Structure refinement:

 The atomic model must be further refined. Original model is

used to generate modified models using molecular dynamics -based method or other methods. For these models, theoretical diffraction patterns are calculated and compared to

experimental data. This process continues until good agreement with experimental data is achieved.

(18)

18

Structure Validation

 The quality of the structure is validated by comparing various

parameters with typical values in proteins

 Distribution of backbone dihedral angles (Ramachandran plot)

should correspond to values observed in majority of proteins

 Distribution of hydrophobic and hydrophilic amino acids is also

useful indicator. Hydrophobic acids dominates in inferior of the protein while hydrophobic proteins are more common on protein surface.

(19)

19

Structure Quality Estimation

At the end of the process, the 3D structure of the macromolecule is obtained as a list of coordinates (x, y and z). The overall quality of the coordinates is usually measured by two quantities:

 The resolution - this is the ability to resolve two points separated

by some distance from each other. The higher the resolution, the "sharper" our view of the structure. To obtain structures of reasonable quality measurement should have resolution have resolution less than 2.0 A

 The crystallographic R factor - it measures the agreement

between observed diffraction patterns and those calculated from determined structure. The value 0.0 corresponds to perfect agreement of calculated and observed intensities. A well-refined structure of a macromolecule has an R factor not larger 0.2.

|Fobs| is derived from the measured intensity of a reflection in the diffraction pattern, and |Fcalc| is the intensity of the same reflection calculated from the current model. The sum is over points in space.

(20)

20

The Advantages of X-ray Crystallography

X-ray crystallography has major advantages, making it the most practical method for bio-molecule structure determination. It

provides accurate atomic resolution models and allows scientists to resolve relatively large structures and complexes. Recent

examples are structures of viral capsides and the ribosome, each composed of tens of thousands of atoms.

(21)

21

The Limitations of X-ray Crystallography

X-ray crystallography also has some major limitations:

 The information provides only one snapshot of the protein that does not reflect its dynamic behavior

 Contacts between molecules in the crystal (crystal packing) and

the dense packing might affect the structures

 The procedure is very slow

 There are many macromolecules which are difficult to crystalize,

such as proteins with significant hydrophobic portions (membrane proteins) or highly flexible proteins

 Positions of hydrogens are difficult to obtain (can be determined

(22)

22

NMR Spectroscopy

Nuclear magnetic resonance (NMR) is a physical phenomenon

based upon the magnetic properties of the nuclei of some atoms. NMR spectroscopy of macromolecules was developed in the mid-1980s enabling the elucidation of 3D structures of

macromolecules in solution. The first NMR structures deposited at the PDB started to appear in the early 1990s.

(23)

23

Nuclear Magnetic Resonance

 Some atomic nuclei, such as 1H, 13C, 15N and 31P have magnetic

moment cause by their spin

 When the nuclei are placed in a strong external static magnetic field they align with or against an applied field.

 Exposition of the nuclei to a second oscillating magnetic field of specific (resonance) frequency can cause the spin to reverse its orientation.

 The resonance frequency depends on strength of static

magnetic field. This external magnetic field is partially shielded by electron cloud around nuclei. Because local electronic

environment differs for different atoms, effective field acting on the nuclei is also different that causes different resonant

frequencies

 Difference between resonant

frequency of specific nuclei and reference frequency is called

chemical shift

 Chemical shift depends on

electronic environment of an atom;

higher values of the shift correspond to higher electronic shielding

(24)

24

NMR Spectroscopy

 NMR spectroscopy is experimental method based on

measurement of resonant frequencies (chemical shift respectively) of atomic nuclei in molecules

 Because atoms in specific atomic groups have characteristic

chemical shifts, NMR spectra can be used to identify atom groups of molecule

 Measurement of 1H is the most common

 Measurement of 13C is also common but much less sensitive

mainly because 13C isotope forms of about 1% only of C isotopes

(the most common isotope is 12C).

 Small molecules and biological macromolecules (proteins, DNA)

are usually measured in liquid state (solution) although solid state measurement is also possible (e.g. in material research)

(25)

25

NMR Instrumentation

 Current NMR spectrometers utilize high-field superconducting magnets producing magnetic field up to 24 T and working with frequency of RF field up to 1000 MHz

 The sample (approx. 300 – 600 microlitres with 0.1 – 3 mM of

protein) is positioned in the magnetic field and excited via pulsations in the radio frequency input circuit. The realigned magnetic fields induce a radio signal in the output circuit which is used to generate the output signal.

(26)

26

NMR in CEITEC

(27)

27

Spin-Spin Coupling

 Effective field acting on nuclei is influenced by magnetic field due to

spin of neighbouring nuclei. This effect is called spin-spin coupling.

 Spin-spin coupling give rise to the peak splitting in NMR spectra  This effect is observable only between protons on covalently

bonded atoms and only if the distance between nuclei is less than or equal to three bond lengths. This is because in spin-spin coupling the magnetic field of neighbouring nuclei does not act directly but through electron cloud surrounding the nuclei.

 Spin-spin coupling is never caused by neighbouring atoms that are

bonded to the same atom and have identical electronic

environment (e.g. H in CH2 group does not split peak of the second H)

 Number of new peaks and their

distance bring information about atoms connectivity because they occur only between protons on covalently bonded atoms.

 Information about torsional angles

can also be obtained from spin-spin coupling measurements.

(28)

28

(29)

29

Nuclear Overhauser Effect

 Nuclear Overhauser Effect (NOE) is another interaction between

spins of neighbouring nuclei that can occur between non-covalently bonded nuclei which are close in space

 NOE differs from spin-spin coupling in the respect that NOE is observed through space, not through bonds

 This interaction is distance dependent, usually only protons

separated by less than 5 or 6 Å can be observed

 Because the nuclei distance can be derived from the observed

NOEs, precise three-dimensional structure of the molecule can be reconstructed.

 NOE measurements play important role in determination of 3D

structure of biological macromolecules because many of atoms in these big structures are covalently distant while they can be close in a 3D space

(30)

30

2D NMR

 Classical NMR spectra is useless for analyzing the 3D structure of

macromolecules due to the spectral overlap of hundreds of protons. To solve this problem, special technique called 2D NMR was

developed which produces 2D spectra with better separation of individual peaks

 Measurement of 2D NMR spectra is based on application of two

radiofrequency pulses separated in time. The time is changing during series of measurement. For each time different classical spectrum is measured.

(31)

31

2D NMR

 3D NMR is improvement which is based on the same principles as

2D spectra, but two time differences between RF pulses are varying.

 Various methods of 2D/3D NMR use specific series of RF pulses to

obtain spectra with desired information. Correlated spectroscopy (COSY) gives rise to cross peaks corresponding to the covalent structure (spin-spin couplings) while in NOESY cross peaks are obtained from the NOE effect.

(32)

32

Interpretation of Protein NMR Spectra

 Peaks in the spectra must be assigned to hydrogen atoms in specific

residues in protein

 Spin-spin coupling measurements (e.g. COSY 2D NMR) enable to

identify to which amino acid type individual H atom belong, its position within side-chain and give information about dihedral angles of side chain atoms. But it does not provide information about sequence order of amino acids.

 The sequence information can be obtained from NOE measurements

(e.g. NOESY 2D NMR)

 Information obtained from NMR is not sufficient for direct

assessment of 3D structure. First, 3D models consistent with the set of constraints (dihedral angles, distances) are generated. These

models (usually several tens) can be refined by calculating NOEs from the model and fitting against the measured NOE intensities, as well as by optimizing for more favorable structural properties.

(33)

33

NMR Spectra and Dynamics of Molecules

 NMR spectrum of biological macromolecules is measured in liquid

phase (solution of protein)

 Molecule in solution has very dynamic behaviour thus the structure

of molecule is not rigid (compare with rigid structure in X-ray crystallography)

 Measured data correspond to weighted average of all structures in

solution.

 This complicates assessment of 3D structure form 3D data but

brings information about dynamic behaviour of the molecule

 Reconstruction of 3D structure from NMR data usually provides few

(34)

34

The Advantages and Limitations of NMR

Advantages:

 The main advantage of NMR spectroscopy over X-ray

crystallography is that there is no need to have the sample in a crystalline state (which restricts the applicability of X-ray

crystallography).

 Further information about the dynamics of the molecule in

solution can be deduced. Motions that occur on a time-scale ranging from about 10 picoseconds to about 10 nanoseconds can be detected.

Limitations:

 The precision of the technique is still not as high as in X-ray

crystallography.

 Only relatively small bio-molecules can be solved, for proteins usually smaller than 300 residues

 Rapid developments in equipment and software has led

(35)

35

The Advantages and Limitations of NMR

Advantages:

 The main advantage of NMR spectroscopy over X-ray

crystallography is that there is no need to have the sample in a crystalline state (which restricts the applicability of X-ray

crystallography).

 Further information about the dynamics of the molecule in

solution can be deduced. Motions that occur on a time-scale ranging from about 10 picoseconds to about 10 nanoseconds can be detected.

Limitations:

 The precision of the technique is still not as high as in X-ray

crystallography.

 Only relatively small bio-molecules can be solved, for proteins usually smaller than 300 residues

 Rapid developments in equipment and software has led

(36)

36

Electron Microscopy

Electron microscopy (EM) is another approach that provides information about the 3D structure of macromolecules. It has recently become popular in structural biology as it can provide molecular structural data about large molecular complexes.

(37)

37

Principles of Electron Microscope

 Electron microscope is based on similar principles as light

microscopes but electron beam is used instead of light beam

 It has much higher magnification or resolving power than a regular

light microscope, allowing it to see very small objects such as cells and large molecules

 Source of illumination is a cathode that emits electrons at the top of

the column

 Since electrons are scattered by collisions with air molecules,

(38)

38

Cryo-electron Microscopy

 Important breakthrough in solving bio-molecule structures by

EM, is cryo-electron microscopy ("cryo-em")

 The images are created with an electron microscope equipped

with a "cold stage". This means that the samples are scanned in a frozen state, usually liquid nitrogen temperatures (77 K = -196 ˚C).

 This procedure protects samples from radiation damage and

keeps them in a native state

 A version of cryo-electron microscopy is cryo-electron

tomography (CET) where a 3D reconstruction of a sample is created from tilted 2D images (also measured at cryogenic temperatures, either liquid nitrogen or helium).

(39)

39

The Advantages of Electron Microscopy

 EM is able to resolve very large biological complexes not

accessible or X-ray crystallography

 Structures determined by EM can serve as an initial reference

for interpretation of X-ray diffraction patterns

 Structures from EM can be combined with X-ray data to obtain

(40)

40

The Limitations of Electron Microscopy

The limitation of EM is the structural resolution that can be

achieved. The current resolution (about 3.5 Å) is not sufficient to determine the location of the amino-acid side chains.

(41)

41

Neutron Diffraction

 Neutron diffraction is based on scattering of neutrons on atoms nuclei. Nuclear reactors are used as a source of neutrons.

 The technique is similar to X-ray diffraction but due to the

different scattering properties of neutrons versus x-rays complementary information can be obtained

 Because hydrogen nuclei are strong scatterers for neutrons, the position of hydrogen (or deuterium) atoms in a crystal structure can be determined far more precisely with neutron diffraction than with X-ray diffraction

 Deuterium atoms is easier to detect then 1H atoms

 The studies with deuterium can be used to identify the presence

of H/D exchange sites (which usually corresponds to acidic hydrogens)

 Typical resolution of neutron diffraction is between 1.5 and 4.0 A

(i.e. individual atoms are visible including hydrogens)

 Disadvantage of the method is necessity to have access to

References

Related documents

Given that the HDTTCA samples a comparatively small data set (200 patients) for experts ’ opinions, HDTTCA used all experts ’ opinions as the training set and took 30 random samples

when operated on by operation R , each of the terms in the spatial wavefunction yield the same eigenvalue: r i r j r k. The set is the basis for an irreducible representation so

In conclusion, serum levels of IL-6 are raised in chemical warfare veterans with pulmonary complications due to sulfur mustard exposure, and this inflammatory marker has a direct

The information provided in this Safety Data Sheet is correct to the best of our knowledge, information and belief at the date of its publication. The information given is designed

Type of Infection Chloroquine prophylaxis Chloroquine- IPTp SP-IPTp Chloroquine prophylaxis vs. SP-IPTp Chloroquine- IPTp vs. SP-IPTp Chloroquine prophylaxis vs. SP-IPTp

Overall, in this study we have found a higher rate of health service usage, higher incidence of injuries and seizures, and low level of recorded alcohol related interventions prior to

For example, many studies demonstrate robust effects of differences in dietary fat intake on circulating cholesterol levels, and Mendelian randomization studies demonstrate that

Thus the title of the project is “Cultivating the Practice of Missional Hospitality for Small Groups at West Houston Church of Christ.” Chapter 1 introduces the project by