--not necessarily a protein! (all proteins are polypeptides, but the converse is not true)

(1)

PEPTIDE BONDS AND POLYPEPTIDES

OLIGOPEPTIDE:

--chain containing only a few amino acids (see tetrapaptide, Fig 5.9)

POLYPEPTIDE CHAINS:

--many amino acids joined together

--not necessarily a protein! (all proteins are polypeptides, but the converse is not true) •a protein has a specific amino acid sequence that is defined by a gene

amino acids in a polypeptide are called amino acid residues

polypeptides (and proteins) have a front end (amino terminus or N-terminus) and a back end (carboxyl terminus or C-terminus)

•most proteins contain 50-2000 amino acids

•mean molecular weight of an amino acid is 110 (see Problem #1) so MW of proteins could be 5500 to220,000 (ball park numbers)

Polypeptides as polyampholytes (substance whose groups have both acidic and basic groups)

•they have many weakly acidic and basic groups (Fig 5.11)

•even a small shift in pH can significantly affect the structure and interactions of a protein molecule

Molecular weight does not have units mass has units called daltons

(2)

mass of above proteins would be 5500 daltons to 220,000 daltons (how many kD?)

•PEPTIDE BONDS:

•amide bond between α-amino and α-carboxyl groups •structure solved by Pauling and Corey

•No rotation around peptide bond...a rigid, planer unit (Fig 5.12) so the -C=O and -N-H bonds are nearly parallel (O, C, N, H usually coplanar)

1. x-ray crystallographic studies of synthetic peptides indicated that distance of C-N bond was shorter that would be expected if it had only single bond character, so must have double bond properties

2. rotations are permitted only around the α carbon

3. depending on the R group, a constraint can be imposed on these rotations by steric hindrance

4. Therefore, trans form usually favored (-X-Pro can be cis althought trans still favored; prolyl cis-trans isomerases an important new class of regulatory enzymes

STABILITY AND FORMATION OF THE PEPTIDE BOND •formed by dehydration (loss of H2O) (Fig 5.8)

•requires input of free energy, about +10 kJ/mol (hydrolysis favored but very slow w/o catalyst, so peptides are stable, similar to nucleic acids)

•can be hydrolyzed in hot 6N HCl (see below) or by proteolytic enzymes or proteases (see Table 5.4) that often cleave at specific residues

(3)

•since synthesis unfavorable, like with nucleic acids, amino acids are activated by an ATP-driven reaction before incorporation into proteins = coupling each amino acid to the 3’ end of its tRNA to yield the aminoacyl-tRNA, catalyzed by aminoacyl-tRNA synthetases, and ATP is hydrolyzed to AMP (Fig 5.19)

PRIMARY STRUCTURE, proteins of defined sequence

•Primary structure is the specific sequence of amino acids occuring in a protein defined by genes

•Fred Sanger sequenced the first protein in 1953: bovine insulin 1st Nobel, because it showed that:

the sequence is precisely defined only L-amino acids are found they are linked by peptide bonds

•thousands of proteins have now been sequenced each is unique

•Why is the primary structure important? helpful in elucidating mechanism of action

determines the three dimensional structure, which confers biological function •rules that govern protein folding are being discovered by studying the relationship

between primary structure and three dimensional structure

very important to an emerging area of medicine: molecular pathology

•one amino acid change in the primary structure can lead to abnormal function and disease

(4)

examples: sickle cell anemia and cancer CHARACTERIZATION OF PRIMARY STRUCTURE •Examples of primary structure:

Mb myoglobin (3D Fig 5.1, primary Fig 5.14) and see Table 5A.1 for how to isolate and purify Mb)

Insulin (Fig 5.15 and 5.21) Insulin has disulfide bonds: •disulfide bonds

cross-links between chains or within a chain

formed through oxidation between pairs of cys side chains product of the oxidation called cystine

PROTEIN PURIFICATION

For example, from < 0.1% of starting material (total protein) to 98% pure 1. Requirements:

Assay

Good supply of starting material

Overexpession in E coli or other organism not without problems...not as easy as the book makes it sound

Patience

(5)

Abundant proteins

Proteins that have some unique property General procedures

1. Stabilization:

Use buffers at appropriate pH to prevent denaturation and/or degradation

Purification often done in cold because proteases less active...not so important if using HPLC which is very fast

Protease inhibitors are often used to prevent degradation Minimize foaming

Keep concentrated, proteins usually are more stable in a concentrated solution. Especially important for long-term storage.

Usually stored at -20°C to -80°C Assays

1. Use a direct assay if available, as for an enzyme, or a coupled reaction if a detectable product is not made by a direct assay

Amount of product formed is proportional to the amount of enzyme present 2. Use antibodies to detect purification, if can be made through reverse genetics Separation Techniques

Homogenation

Subcellular fractionation Differential centrifugation

(6)

Separation on the basis of charge, polarity, size, binding specificity, temperature stability and other properties

Solubility

Solubilize appropriate fraction: Salt step ("salting out")

Different proteins precipitate at different salt concentrations "ammonium sulfate cut"

Chromatography Ion exchange

DEAE cellulose (or agarose) = anion exchange, CM cellulose = cation exchange Reverse Phase = hydrophobic interaction chromatography, nonpolar alkyl groups

attached to matrix Affinity

Gel filtration (Fig 5A.5) Silica beads with pores HPLC

Gel electrophoresis

See one band on two different kinds of gels!! PAGE (Native)

SDS-PAGE (Denaturing) Isoelectric Focusing

(7)

Determination of amino acid compostition •hydrolysis of protein (usually in 6N HCl at 110°)

•hydrolysate (amino acids) is separated by ion exchange chromatography a column of beads that separates molecules on the basis of charge there are cation exchange (-) columns and anion exchange (+) columns

•amount of each amino acid is then quantitated by reaction with ninhydrin (Fig 5B.2) (darkness of resulting blue color is proportional to the amount of amino acid that is present); Or Fig 5B.1 use a single column amino acid analyzer, can be done by HPLC

•identification of amino and carboxy terminal residues •amino terminal residue:

rxn with dabsyl or dansyl chloride

forms dabsylated or dansylated derivative which can be identified by chromatography after cleavage in 6N HCl

destroys peptide so only amino term. amino acid can be identified •carboxy terminal residue:

rxn with carboxypeptidase A

removes carboxy terminal residue

composition can then be redetermined to see what's missing Cut up the protein and sequence the pieces...

(8)

all cut after the indicated amino acid(s) trypsin: lys or arg

clostripain: arg

chymotrypsin: phe, trp, tyr thermolysin: leu, ile, val CNBr: met

more in your book •Sequence the pieces:

Edman degradation (Fig 5C.1) done these days by machine react with PITH and cleave

does not detroy protein so can do over and over analyze PTH-derivative

also analyze what's left

see example of sequencing of the B chain of insulin in Fig 5C.2 and locating the disulfide bonds in Fig 5C.4

MS-MS

tandem mass spectroscopy inject peptides into first sector

(9)

moves into second sector

smashed into random, different smaller pieces by collisions with inert gas

pieces move to detector via a strong magnetic field which allows the determination of the mass of each piece