SECTION TWO : THE MSP-7 GENE AND PROTEIN
3.6 Secondary Protein Structure and Pattern Database Predictions of MSP-
The MSP-7 amino acid sequence was entered into a variety o f secondary
structure prediction programs and used to search pattern databases to identify conserved
motifs present in MSP-7.
To determine whether the MSP-7 precursor was targeted to the merozoite surface
by a signal sequence and if MSP-7 had any transmembrane domains, the 351 residue
sequence was submitted to signal sequence and trans-membrane prediction programs.
Psort II and SignalP, using networks trained on eukaryotic data, gave differing
results. Psort II, using a combination o f PSG and von Heijne’s method, did not predict a
N-terminal signal peptide, instead using the ^-nearest neighbour (A:-NN) algorithm to
predict the following probabilities o f location for MSP-7; 34.8 % nuclear, 21.7 %
extracellular / cell surface, 17.4 % mitochondrial, 13.0 % cytoplasmic, 8.7 % vacuolar,
4.3 % vesicles o f secretory system. However, Signal P predicted a putative signal
sequence o f 27 amino acids, with the most likely cleavage site between residues 27 (Ser)
and 28 (Thr) (Figure 3.8). Until a signal prediction program trained on protozoan or
Plasmodium signal sequences is available, and since MSP-722 has been located on the
merozoite surface suggesting it is targeted to the plasma membrane (McBride and Heidrich 1987; Stafford et ah 1996), the Signal P prediction can be assumed to be
correct. Similarity o f the predicted MSP-7 signal sequence with other malarial surface
proteins associated with the MSP-1 complex, such as MSP-6 remains to be elucidated.
TMpred and DAS transmembrane prediction servers denominate a possible
transmembrane helix between residues 2 to 21. Assuming MSP-7 has a signal sequence
o f 27 residues and not a transmembrane domain at the N-terminus, the resulting 324
amino acid protein would have a predicted molecular mass o f 38.5 kDa and an estimated
pi o f 4.52.
3,6.1 Secondary Protein Structure
A combination o f the Chou-Fasman and Robson-Gamier secondary structure
prediction programs available on Mac Vector 6.5, and the Profile fed neural network
Chapter Three : The Merozoite Surface Protein - 7 Gene and Protein 133
predict the secondary structure and solvent accessibility for iÿMSP-7. As no homologues
were identified in SWIS-PROT the expected accuracy o f the PHD prediction programs
was only between 62 - 6 6 %. The secondary structure o f MSP-7 is predominantly alpha
helical with two small regions o f beta sheet and several beta turns indicating the protein
is highly fiexible (See Figure 3.12).
SMART sub-program SEG predicted that residues: 7 - 20, 37 - 52, 62 - 75, 92 -
113, 328 - 339 are o f low compositional complexity whilst Prospero predicted the
presence o f two internal repeats between residues 117 - 252, and 125 - 262.
MSP-7 is mainly hydrophilic, with a negative charge cluster at residues 94 - 148,
which may form the core backbone o f the structure. Psort II predicted a coiled-coil
region fi*om residues 234 (Lys) to 261 (Tyr) with a hydrophobic region in the coil
perhaps a putative region for interaction with MSP-1.
3.6.2 Pattern Database Searches
The MSP-7 amino acid sequence was used as the query sequence to search a
number o f secondary or pattern databases for conserved motife that could give an
indication o f function for MSP-7.
PHD and ScanProsite searches o f the PROSITE database predicted the presence
in MSP-7 o f a number o f patterns including N-glycosylation, kinase phosphorylation and
N-myristoylation sites, although these are probably felse positives. Additionally a
PrintsScan search o f the PRINTS fingerprint database predicted the presence o f
tropomyosin (residues 6 6 - 84, 229 - 258, and 332 - 358), and alpha-tubulin (residues
236 - 249, and 2 9 9 -3 1 5 ) motifs in MSP-7.
A PRODOM domain search performed by PHD recovered no significant
homologous domains, whilst a Pfem search for Hidden Markov Models and a NCBI-CD
BLOCKS + and PRINTS 23.0 databases for similarity to highly conserved aligned
motifs but no significant matches were obtained.
These results do not predict a known function for MSP-7. This may be because
the motifs present in MSP-7 remain to be discovered or that the programs used in the
analysis are not sophisticated or comprehensive enough to deal with protozoan motifs.
3.6.3 Sequence Conservation Between / y MSP-7 Processing Sites and Other Merozoite Surface Proteins
The P f MSP-722 cleavage site shows similarity to protease cleavage sites at the
start o f MSP-130 (MSP-1 primary processing site) and MSP-636 (Figure 3.13A). This
suggests that the protease responsible for these cleavages has a similar specificity. Since
the cleavages are all predicted to occur at the same stage o f merozoite development and
the C-terminal products are known to be complexed together in the shed MSP-1
complex, it is possible that the same protease is responsible for these cleavages. Once an inhibitor o f the protease SERPH that is predicted to be responsible for primary MSP-1
processing is discovered, this hypothesis can be tested (Blackman 2000).
The P f MSP-722-19 N-terminal sequences, especially the N-terminal region of
MSP-7i9, show similarity to a motif in the MSP-2 gene family (Figure 3.13B). This
requires further analysis o f MSP-2 processing and binding sites before any hypothesis
concerning this motif can be made with confidence.