• No results found

1 General Introduction

1.3 Methods for characterizing protein-carbohydrate interactions

1.3.2 Protein crystallography

1.3.2.2 X-ray diffraction

X-rays are electromagnetic radiation ranging from 0.1 to 100 Γ… and can be produced by bombarding a metal target, most often copper, with an electron beam that is generated by a heated filament and accelerated by an electric field. As a result, a high-energy electron then collides with the metal target displacing an electron of the metal from a low-lying orbital and making an electron from a higher orbital to drop to the vacated one. This transition of the electron from an M-shell to a K-shell results in the emission of the excess electrons’ energy in the form of an X-ray photon112.

24

By exposing a single crystal of a protein to a monochromated X-rays beam, the X-rays are scattered by the atoms present in the crystal and a set of diffraction spots, called reflections, is recorded on the detector originating the diffraction pattern. Each reflection spot in the diffraction pattern results from a monochromatic wave, constructively scattered by all equivalent lattice points that fulfil Bragg’s Law (Equation 1.1 and Figure 1.10),

𝑛𝑛 πœ†πœ† = 2 𝑑𝑑. sin πœƒπœƒ (1.1)

that describes the relationship between the angle of the incident beam πœƒπœƒ and the wavelength πœ†πœ†, where d is the minimum spacing between equivalent planes in the crystal, i e. the maximum resolution π‘‘π‘‘π‘šπ‘šπ‘šπ‘šπ‘šπ‘š= πœ†πœ† 2 sin πœƒπœƒβ„ π‘šπ‘šπ‘Žπ‘Žπ‘₯π‘₯.

Figure 1.10. Bragg's Law defines the relationship between the angle of the incident radiation, ΞΈ, the distance between the planes of a crystal, d, and the wavelength of the incident radiation, Ξ». The waves

of incident monochromatic radiation are reflected by the parallel equidistant planes of a crystal. When the difference in optical path between the scattered waves takes a multiple of the wavelength, constructive interference occurs, and a diffraction spot is produced.

The intensity of the reflection is recorded by the detector and corresponds to the intensity of this constructive wave. The diffraction data contains information from all atoms in the structure and is obtained as a list of reflection intensities with hkl positions (πΌπΌβ„Žπ‘™π‘™π‘˜π‘˜)64,65,114,115, defined by the crystal

planes that originated each reflection. At this point, the unit cell parameters and space group can be determined, and a full data collection experiment is performed. Afterwards, all the collected diffraction images are integrated and the intensity and hkl position of each reflection is extracted. The power of the crystal to divert the X-ray photons is what dictates the high resolution limit of the diffraction data set and the global quality of the data set65. This quality is assessed by calculation

of a series of important parameters: the value of I/Οƒ(I), which corresponds to the signal-to-noise ratio; the completeness, which corresponds to the percentage of the reflections relative to the total number of reflections that could be measured for that crystal, and should be greater than 90%; the redundancy or multiplicity, which is the number of observations per reflection (that is, the number of times the same reflection was measured); the Rmerge (Equation 1.2) and Rp.m.i (Equation 1.3) factors, which compare the intensities measured for the various reflections, and should be as low as possible since equivalent reflections (related by symmetry) must have similar intensity values; and the CC1/2 (correlation coefficient between random half data sets116), that is

25

generally close to 1 at low resolution and falls to near zero at higher resolution as the intensities become weaker. Individually, these parameters do not indicate the resolution limits, but are used globally to assess the quality of a data set.

(1.2) π‘…π‘…π‘šπ‘šπ‘šπ‘šπ‘šπ‘šπ‘šπ‘šπ‘šπ‘š =βˆ‘ βˆ‘ |πΌπΌπ‘šπ‘š(β„Žπ‘˜π‘˜π‘™π‘™) βˆ’ 𝐼𝐼̅(β„Žπ‘˜π‘˜π‘™π‘™)| π‘šπ‘š π‘šπ‘š=1 β„Žπ‘˜π‘˜π‘˜π‘˜ βˆ‘ βˆ‘β„Žπ‘˜π‘˜π‘˜π‘˜ π‘šπ‘šπ‘šπ‘š=1πΌπΌπ‘šπ‘š(β„Žπ‘˜π‘˜π‘™π‘™) (1.3) 𝑅𝑅𝑝𝑝.π‘šπ‘š.π‘šπ‘š.=βˆ‘ οΏ½1 (𝑛𝑛 βˆ’ 1)⁄ βˆ‘ |πΌπΌπ‘šπ‘š(β„Žπ‘˜π‘˜π‘™π‘™) βˆ’ 𝐼𝐼̅(β„Žπ‘˜π‘˜π‘™π‘™)| π‘šπ‘š π‘šπ‘š=1 β„Žπ‘˜π‘˜π‘˜π‘˜ βˆ‘ βˆ‘β„Žπ‘˜π‘˜π‘˜π‘˜ π‘šπ‘šπ‘šπ‘š=1πΌπΌπ‘šπ‘š(β„Žπ‘˜π‘˜π‘™π‘™)

Nowadays, due to the brightness of X-rays from a synchrotron facility, which is a thousand times greater than that from a laboratory X-ray generator of fixed-wavelength, most structures are solved by using synchrotron radiation64. Synchrotrons have the additional advantage of providing

tunable radiation, which can be modulated to different wavelengths of interest. Synchrotron facilities have contributed to the structural characterization of protein-carbohydrate interactions of most carbohydrate-recognising protein families117.

1.3.2.3 3D structure determination

To solve the structure of a protein or protein-ligand complex, the electron density that surrounds all the atoms of the macromolecule in the crystal must be calculated, and structure factors and phase information are necessary, as expressed by the electron density equation (Equation 1.4):

(1.4)

𝜌𝜌(π‘₯π‘₯, 𝑦𝑦, 𝑧𝑧) = (1/𝑉𝑉) οΏ½|πΉπΉβ„Žπ‘˜π‘˜π‘˜π‘˜| β„Žπ‘˜π‘˜π‘˜π‘˜

βˆ™ 𝑒𝑒2πœ‹πœ‹π‘šπ‘šπ›Όπ›Όβ„Žπ‘˜π‘˜π‘˜π‘˜βˆ™ 𝑒𝑒[βˆ’2πœ‹πœ‹π‘šπ‘š(β„Žπ‘₯π‘₯+π‘˜π‘˜π‘˜π‘˜+π‘˜π‘˜π‘™π‘™)]

where |πΉπΉβ„Žπ‘˜π‘˜π‘˜π‘˜|, known as the amplitude of the structure factor, is obtained from the intensities of

each reflection, measured experimentally (|πΉπΉβ„Žπ‘˜π‘˜π‘˜π‘˜| = οΏ½πΌπΌβ„Žπ‘˜π‘˜π‘˜π‘˜), 𝛼𝛼 is the phase angle of the scattered

wave, π‘₯π‘₯𝑗𝑗𝑦𝑦𝑗𝑗𝑧𝑧𝑗𝑗 is the position of atom 𝑗𝑗 in the unit cell and 𝑉𝑉 is the volume of the unit cell.

However, from the dataset collected phase angle values cannot be obtained.This is what it is known as the Phase Problem in crystallography. While the intensities and the structure factors are directly measured in the diffraction experiment, phase angle values need to be correctly estimated in order to obtain the electron density map64,65.

Three methods are generally used to determine the phases. Single or Multiple Isomorphous Replacement (SIR/MIR), that uses heavy atoms such as Au, Pt, and Hg usually soaked into the native crystals. Multiwavelength Anomalous Dispersion (MAD) and Single-wavelength Anomalous Dispersion (SAD), that exploit the anomalous dispersion (or scattering) effect of specific atoms, typically selenium, recurring to selenomethionyl proteins in which methionines are

26

replaced by selenomethionine upon protein expression. SIRAS and MIRAS are additional methods that combine Isomorphous Replacement and anomalous scattering. Ultimately, the Molecular Replacement (MR) method can be applied if the structure of the same protein or a similar one, with at least 30% amino acid sequence identity, has already been solved64,65.MR is

of particular importance for protein-ligand complexes, as the unliganded-structure phases can be combined with the diffraction intensities of the crystal with the bound ligand to calculate its electron density map65.