CopyrightX 1988, AmericanSociety for
Microbiology
Purification and Structural
Characterization
of the Putative
gag-pol
Protease of Human Immunodeficiency Virus
ERIK P.
LILLEHOJ,l
F. H. RICK SALAZAR,2 ROBERT J. MERVIS,3 MICHAEL G. RAUM,4 HARDY W. CHAN,2NAFEES AHMAD,3 ANDSUNDARARAJAN VENKATESAN3*
Laboratory ofMolecular Microbiology3 and BiologicalResourcesBranch,4NationalInstitute ofAllergyandInfectious Diseases, Bethesda, Maryland 20892;ProgramResources, Inc., Frederick Cancer ResearchFacility, Frederick,
Maryland 217011; andInstituteof Bio-Organic Chemistry, SyntexResearch, Palo Alto, California 943032 Received 28 January 1988/Accepted 25 April 1988
We have purifieda 10,774-daltonprotein from human immunodeficiency virus (HIV)type 1 that isencoded in the protease domain of thepolopen reading frame (ORF). Radiochemical amino acid microsequencing identified 12 aminoacids from the stretch of 39 N-terminal residues ofthisprotein,beginning withaPQITLW
sequenceatposition 69 ofthepol ORF. Radiosequencing of selected tryptic peptides oftheproteinidentified 11additional residues (Leu-9 and Val-2) in six peptides encompassingtheentiremoleculeof 99 residues. A protein of similar sizeand identical N-terminalsequence(determined through the first 39 residues)waspresent
amongtheprocessed HIVpolgeneproducts in Escherichia coliwhich expressed the entire HIVpolORF. The
Cterminus ofboththe viraland E. coli-expressed proteinswasinferredtobecontiguous with the N terminus of the p64-p5l reverse transcriptase onthe basis oftryptic mapping and specific immunoreactivity withan
antiserumagainstadodecapeptide locatedupstreamof thereversetranscriptase. Thus, theinitial processing ofthepolprecursorthatgeneratesthe native proteaseisapparently preservedacrossphylogenetic barriers.
Although the purified viralproteaselacked measurable proteolytic activity, the bacterialextractswerecapable
ofprocessinganHIVgagprecursorprotein synthesized in E. coli.
Genetic and biochemical studies have established the requirement ofa specific viral protease for the expression andprocessing of the gagand polgeneproducts of several
retroviruses (4, 7, 19, 20).Among the retroviruses studiedso
far, a viral protease is synthesized as part of the gag or
gag-pol polyprotein, encoded either within the gag orpol
openreading frame (ORF)orbyaseparateORFoverlapping bothgagandpol (10-12, 15, 21). Viralproteasesof 13 and14 kilodaltons (kDa) have been purified from murine leukemia virus and bovine leukemia virus and extensively character-ized(21, 22).
Manyof the mature internalstructuralproteins of human immunodeficiency virus (HIV) type 1 are derivedfrom the
proteolytic processingoftwoprimary translation products of 55 and of 180 to 200 kDa corresponding to the gag and
gag-pol polyproteins, respectively (8, 9, 13, 16).Two forms (p64 andp51) of HIVreverse transcriptase (RT)and a p32
protein(the putativeviralintegration protein) juxtaposed to the C-terminal side of the p64 RT have already been
se-quenced (13, 17, 18). Although acandidateproteinencoded
bytheN-terminal 167 residues of thepolgenethatoverlaps the gag ORF by 82 residues had not been discovered in isolated virions orinfected cells,thestructural homologyof this domain of thepol ORFwith other retroviral proteases prompted us to analyze extracellular virions for viral
pro-teins using antisera against peptides corresponding to the
gagand
pol
ORFs.Byuseoflimitedaminoacidsequencing,we have identified a ca. 10-kDa protein from the virus
particles andEscherichiacoliextracts expressingthe entire HIVpolORF andpossessinggagprocessing activity. Dur-ingthecourse of thiswork, Debouck etal. havereported a
gag-specific proteolytic activity associated with a 10-kDa protein from E. coli extracts which express the protease domain of the HIVgenome (5).
*Correspondingauthor.
The gag and gag-pol gene products were analyzed by
immunoprecipitation of intracellular viral proteins labeled under steady-state or pulse-chase conditions with either
pooled sera from acquired immunodeficiency syndrome (AIDS) patients orrabbit hyperimmune sera raised against E.colifusion proteins containing discrete structural domains of thegagandpolORFs.Labeledextracellular viralproteins were purified by immunoprecipitation, and their partial
N-terminal sequences were determined. Figure 1A illus-trates the mappositions of these various gag (S.
Venkate-san, unpublished data) and pol (13, 18) gene products. During theseearly studies, asmall(ca. 9to10kDa) protein
was consistently visualized among the immunoprecipitates with pooled sera from AIDS patients (Fig. 1B). It was
occasionally immunoadsorbed by gag antisera and was
presumed to represent a gag gene product. Direct N-ter-minal radiosequence analysisof this moiety revealed equi-molar abundance oftwodifferentsequences. Onewas iden-tified as beginning at position 378 of the gag ORF and
probably representedaprocessed productof the C-terminal
p15 gagprotein (Fig. 1A). The other sequence determined for the 9-to 10-kDaproteinwastentatively localized begin-ningat69 residues from thebeginningof the
pol
ORF.The 9- to 10-kDa moiety was electroeluted from
acryl-amidegels andchromatographed on aDEAE-cellulose col-umn. Successive N-terminalprotein sequencingof two
pro-tein fractionselutingat0.2 and 0.5 M NaClwasundertaken. The N-terminal sequencethrough 30 degradative cycles of the 0.5 M NaClfractionexactlycoincided with astretch of residues starting at position 378 of the gag ORF (R. J. Mervisetal.,submittedforpublication). Anotherproteinof 9to10kDawaseluted with 0.2 M NaCl. Toavoidconfusion, the 0.2 M NaCl-eluted protein will be referred to as plO, while the gag protein eluting with 0.5 M NaCl will be referred to as p9. Multiple sequence analysis of the plO protein yielded an unambiguous sequence. Figure 2a illus-3053
on November 10, 2019 by guest
http://jvi.asm.org/
A
B
L
kDa M
M,L
M,V
34
6
34_
9f 7
]env
p64
p55
p41
---4
9
DRY GEL AUTORAD/OGRAM
FtUORPOGRAM FIG. 1. Polypeptides encoded by the gag and pol genes ofHIV. ArecombinantHIV strain
generated
previously by
transfection ofan infectiousproviral DNA,pNL432, into SW480 cells(1)wasused in all thestudies.Forviruslabeling experiments,
100mlofA3.01cellsin RPMI1640medium(106cells perml)wasinfected with1mlof virus inoculum(105to106 50% tissure culture infective doseunits)and labeled with[35S]Met
and theindicated3H-labeled
amino acidasdescribedpreviously (13).Theextracellular viruswaspurified by
precipitation
with polyethylene glycol6000 (13). In someexperiments, the virus wasfurther purified bycentrifugal
banding
at35,000
rpmfor 90 min inaBeckman SW40rotor at the60% interphase ofadiscontinuoussucrosegradient (60, 35,and20%[wt/vol]in20 mMTris
hydrochloride [pH
7.5], 1mM EDTA[TE]buffer).Theviruswassolubilizedbythe addition ofNP-40and TritonX-100to0.5% and
dialyzed
twiceagainst
500 volumesof TE buffer with detergentsbeforeelectrophoreticorchromatographic analysis. (A) Schematic illustration of the HIV genome and the relevant ORFs. The viral polypeptides derived from the gag and pol ORFs andvisualized in the infected cellsorpurified virus areindicated in the lower part. The180- to 200-kDagag-pol precursorwasidentified andmappedwith definedantisera inpulse-chaseexperiments ofinfected cells ortransfectants expressingaseries ofoverlapping gag-pol deletion plasmids(9; S. Venkatesan, unpublished data). My, myristyl groupfound at the Ntermini ofp55, p4la,andp17.Mapping ofthep4laandp4lbgagproteinsis basedonimmunoprecipitationand limited protein sequenceanalysis (Mervisetal.,submitted). Alsonotethat theassigned cleavage siteseparating p9andp7is tentative. The position oftheproteasedomain is alsoshown.(B) Viralextracts(0.4ml)from2 x 108A3.01cellswereincubated for4hat4°Cwith10,ul of pooled antiserum frompatients with AIDS. The immune complexes wererecovered bybindingto protein A-Sepharose, exhaustively rinsed, and eluted for electrophoresis under reducing anddenaturing conditions on15 or 10 to20%gradient polyacrylamide gelsin SDS. Resultsobtainedwith10 to20%gradient acrylamide gelsinSDSareshown.LanesM; M,L;M,V; andLillustrate results obtainedwith virus labeled with therespective aminoacid(s) (single-lettercode).
trates the results
obtained
when plO was labeledwith
Met, Leu, Lys, or Val. Theexperimentally
determined occur-rences of Leu at positions 5, 10, 19, 23, 24, 33, and 38; Lys atpositions
14 and 20; Met at position 36; and Val atpositions
11and 32precisely corresponded
totheir
positions
within a domain of the deduced sequence of the pol gene startingat residue 69. The probability value for the random occurrenceof suchasequence wascalculated (13) tobe less than 1.042 x
10'-,
thus ruling out the possibility that this sequence wasderived from contaminating cellular proteins.Since direct
N-terminal sequence analysis was reliable for only 35 to 40 degradative cycles, we sought to obtain the internal sequence of the plO protein from its tryptic peptides. Individual peptides labeled with either Met and Leu or Met and Val were purified by reverse-phase high-performanceliquid
chromatography (RP HPLC), and their N-terminal sequences were determined. The results obtained with the Met-and Leu-labeled tryptic peptides are shown in Fig. 2b. Not all the tryptic peptides predicted by the deducedse-quence of this region of the pol ORF were recovered by HPLC. This
might
have been dueto thehydrophobic
char-acter of some of the internaltryptic peptides.
Six Leu-labeledpeptides
that wereultimately recovered and
se-quenced
are identified within the deduced sequenceof the protease domain. Forinstance,
thepeptide
contained in HPLCpeak
L2waslocalized between residues 83 and 88 of thepolORF,
that inpeak
L3 was between residues 77 and 82, that inpeak
L7 wasbetweenresidues 89 and 109, that inpeak
L8wasbetweenresidues 69 and 76, and that inpeak
L9 was between residues 114 and 123. The sequence of Val-labeledtryptic peptides
further confirmed the peptide assign-mentsshown inFig.
2.The HPLC-eluted viral
protein
had no demonstrable pro-teaseactivity despite
experimental
variationsof
protein concentration(0.8
to 3.6mg/ml),
pH (2.5 to 7.0), ionicstrength (O
to0.5 MNaCl),
incubation time(1to48h),
and temperature(22
or37°C),
conditions under which proteases of mammalian(22)
and avian(2, 6, 19)
retroviruses arep16
on November 10, 2019 by guest
http://jvi.asm.org/
[image:2.612.51.555.66.362.2]a
00% 0
2
0_
0
z
on
('4 0
_-b
8
6
4
2
4
2
2
1.5 1.0
0.5 o 0
U'
1.5 (A
1.0 0
0.5 X
0-%
0
_-1
CYCLE NUMBER
P4
p34RioPROTEASE
UL6 II I
L2I2
LI7III II I LIO69 76 2 Ua 109-113 1231-15 136 155 167
69 so 100
. . . .. . .
I'QI'I.WQRPr.VTIKIGGQLKEALLI)T(;ADI)TVIEEIMSI.P(;RWKI'u(MI
1,8 L L2 L7
[image:3.612.166.469.69.468.2]120 140 167
... ...
I.9 i. ''
FIG. 2. N-terminalsequenceanalysis of the intact HIV protease and its tryptic peptides. Extracellular viral proteins eluted from the gel after immunoprecipitationwerefurther purified by adsorption to a DEAE-cellulose column (1.0 by 5.0 cm) in TE buffer containing 0.5% NP-40 and0.5% Triton X-100, and the column was eluted batchwise with TE buffer plus detergents containing 0.1, 0.2, 0.5, and 1.0 MNaCl.The individual fractionswereanalyzed by SDS-PAGE. Limited N-terminal sequencing of all fractions was undertaken. The 0.2 M NaCl fraction thatyieldedahomogeneous sequenceofthecandidateprotease wasmixed with 1.0 mg of crystalline bovine immunoglobulin G, reduced, alkylated, and exhaustively digested with tolylsulfonyl phenylalanyl chloromethyl ketone-treated trypsin. The tryptic digests were fractionated by RPHPLC (14) on aC18column (Supelco Inc.) with a linear gradient of 0 to 100% aqueous acetonitrile containing0.1% trifluoroacetic acid. Samples were measuredfor radioactivity, and selected peaks wereconcentrated by lyophilization and processedfor N-terminalsequencedetermination. (a)Radiochemical sequencedeterminationofthe plOprotein(0.2 MNaCl-DEAEfraction) labeled with
[35S]Met
and[3H]Leu
(A),[35S]Met
and[3H]Val
(B),or[3H]Lys(C). Viral bandswerevisualized bywetgel autoradiography and recovered by electroelution (14). Automated N-terminal microsequence analysis by sequential Edman degradationwasperformed withaBeckman 890M sequenatorand the Beckman0.1 MQuadrolprogram042386 (3). Onthebasis of thereleaseof the individual amino acids through39cycles, auniqueHIVproteinsequence in theproteasedomainwasidentified(topline). Theoccurrence of[35S]Met
atcycles1,2, and 10inprofile B wasattributedto aminorcontaminant which constitutedless than5%oftheprotease-specific sequenceonthebasis ofarepetitive yield of96%. (b) Tryptic peptides of the putativeplO HIV protease labeled with[35S]Met
and either[3H]Leu
or[3H]Val
werefractionatedbyRP HPLC, and their partial N-terminal amino acidsequencesweredeterminedandaligned withinthepredictedtryptic peptides(L2, L3, L7,L8, andL10)of theproteasedomain. The residues identifiedby sequencing oftheselectedpeptidesareindicatedbyasterisks.active.
Asanalternative,
weanalyzed
theproteaseactivity
in extracts
of
E.coli expressing
the HIV polORF,
since these extractshadsubstantial
RTactivity
andreadily
detect-able amountsof
maturep64-pSi
and p34 HIVpol
proteins,
suggesting
that thepol
ORF geneproduct
wasprocessed
in thesecells,
presumably by
the HIVprotease. The crude E. coli extracts werescreened for
agag proteaseactivity
with apurified preparation
ofatruncated HIV"gag"
precursorprotein expressed
inE. coli.Thissynthetic
moleculewas alacZ fusion
protein
consisting
of six residues of lacZ fol-lowedby
a stretch of 348 HIVgag-encoded
amino acidsstarting
with residue 57 of the gag ORF.Cleavage
of this"gag"
substrate at the N terminus of the maturep24
gagprotein
wouldgenerate a ca. 32-kDaproduct
containing
the p24 and the N-terminal residues of the distalp9
gagprotein.
The
enzymatic
reactions wererun for 2or4h,
electropho-resedon 15%
polyacrylamide gels
insodiumdodecyl
sulfate(SDS),
andanalyzed
by staining
with Coomassie brilliantI p- I
6
on November 10, 2019 by guest
http://jvi.asm.org/
1 2 3 4 kDa
B
974
68.0
,43.0
4--gag"SUBSTRAIE
+-gag"
-25.7
- 18.4
[image:4.612.55.296.76.261.2]-14.3
FIG. 3. Enzymaticassayof theE.coli-expressedprotease.
Bac-terially expressed protease was partially purified from E. coli transformedbyarecombinantpUC8 plasmid carryinga
3,746-base-pair BglII-SalI HIV proviral DNA fragnment (obtained from a
subgenomicHIVproviralDNAplasmid, pBenn2 [1]) containingthe
entirepolandsorORFs. The E. coli transformantscontainingthe
HIVpol ORF fused to the lacZ gene were induced with 1 mM
isopropyl-p-D-thiogalactopyranoside
for4 h and then extracted ina buffercontaining20 mM Trishydrochloride (pH 7.8),1.0 MNaCl, 0.5%NP-40,and 0.5% Triton X-100.After clarification(10,000x g, 10min),thesupernatantswereassayedforRTactivityanddialyzed exhaustively against low-salt (0.1 M NaCl) extraction buffer by using tubingwithacutoff size of 2to3 kDa. ForassayingtheHIV protease,anE.coli fusionprotein containingapartof the HIIVgag ORF was used. A 1,087-base-pair MboI-BgII fragment (corre-spondingtonucleotides 1016 to2098 of the HIVgenome) of HIVproviralDNAcontainingthe middle one-third of thegagORFwas fusedtothe lacZgeneofpUC8,andprotein expressionwasinduced with
isopropyl-p-D-thiogalactopyranoside.
A 42-kDaE. coli HIV gagfusionprotein containingthe C-terminal 57 residues ofp17,the entirep24, and the N-terminal 60 residues ofp9wasidentified by immunoblottingwithgag-specificantisera. Thisproteinwaspurified fromthe cellextracts tonearhomogeneity bytwosuccessivecycles of RP HPLC as detailed above. The 42-kDa gag protein was incubatedatroomtemperaturewith variousamountsofanextractofE. coli whichexpressedthepolORF. All reactionswereanalyzed
by SDS-PAGE and staining with Coomassie brilliant blue (A) or
immunoblottingwithagag-specificmonoclonalantibody (0058)and
125"-labeled
proteinA(B). (A)Lanes: 1 and2,mockassayreactions containing only 15 ,ug of"gag" substrate incubated for 2or4 h,respectively; 3 and 4,incubationscontaining only 10 p.l ofextract
fromE. coliexpressingthepolORF for 2or4h, respectively;5and 6, completereactioncontaining 10 of E. coli-expressedenzyme
plus15 jigof"gag"substrate incubated for 2or4h, respectively.
Aprominent10-kDa bandseennearthe bottom of thegelin E.coli
extractsexpressingthepol ORFprobably represents theputative
protease. (B) "gag" substrate(5 ,ug)wasincubated for 4 h with 10
(lane 1),2.5(lane 2),or0(lane3),u1 of E.coliextractexpressingthe polORF. The substratealone is shown inlane 4. The positions of
the42-kDa"gag"substrateand the 32-kDa"gag"reactionproduct areindicatedbythearrows. Thereactions in panels A and Bwere
analyzedondifferentgels.
blue(Fig. 3A)andimmunoblotting witha mousemonoclonal
antibody (0058) raised against theHIV p24gagprotein(Fig.
3B). The crude extractswere capable of cleaving the HIV
"gag" substrate, yieldinga32-kDaproduct (Fig. 3A, lanes5
and6,andB, lanes 1and2).Incontrast,neither the purified substrate northecrudeextractalone produced this reaction product. Also, E. coliextractsexpressing thevectorplasmid alone hadnoenzymatic activity.
Partially purified
bacterialextractsexpressing
the HIVpol
ORF
hadareadily
detectableband of 10 kDa(Fig.
3A,
lanes 3through
6). The extracts were fractionatedby RPHPLC, andthe HIVproteins
in individual fractions were detectedby immunoblotting
withpooled
sera from AIDSpatients
(Fig.
4A1 and2).
The bulk of the viralproteins eluting
with the acetonitrilegradient
included bands of 64 and 51kDa,asexpected
for theHIV RT.Indeed,
the fractions enriched for thep64
andp51
bands had the maximal RTactivity (Fig.
4A1,
fraction30).
Two additionalsmaller viralproteins
of 10 and of 15 to 20 kDa werepreferentially
eluted with anisopropanol gradient (Fig. 4A2).
These twoproteins
alsospecifically
immunoreacted with a rabbit antiserum raisedagainst
asynthetic peptide corresponding
tothe C terminus of theputative
viralprotease
(residues
154 to 167 of the HIVpol
ORF; Fig. 4B).
None of the individual HPLCfractions,
however,
hadreproducible
gagprotease
activity
when the assay described abovewas used.To
obtain
purified
E.coli-expressed plO protein
for aminoacid
sequencing,
the bacterialextractslabeledwith[35S]Met
and
[3H]Leu
or[3H]Val
werefractionatedby
RP HPLC and individualfractions
wereimmunoprecipitated
withpooled
antisera
from
AIDSpatients
and resolvedby
SDS-polyacryl-amide
gel electrophoresis (PAGE) (Fig.
4C).Twoproteins
of 10 kDa(Fig.
4C, arrow)
and of 15to20kDa, eluting
atca.40% and 55%
isopropanol, respectively,
immunoreacted with thepooled
sera.Rabbit antisera raisedagainst peptides
corresponding
toHIVpol-encoded
residues 71 to 128(data
notshown)
or 154to 167(Fig.
4B)
alsospecifically
reacted with theseproteins.
TheplO
protein,
labeledwith Met and Leuor Met andVal,
waselectroeluted,
and its N-terminal sequence was determined. This sequence was identical to that of the N terminus of the HIVplO
protein.
Limited sequenceanalysis
of thelarger species (15
to 20kDa)
also revealed the same N terminus(data
notshown). Although
the
C terminus
of neitherprotein
wasdetermined,
the fact thatboth
proteins
immunoreacted with rabbit antiseraagainst
peptides
corresponding
to thepol-encoded
residues between 71 and 128or154 and 167 localized theirsequences withina99-residue domain
between codons 69 and 167 of the HIVpol ORF. Neither of
theproteins
immunoreacted
with monoclonal antibodiesagainst
the HIV RT sequence(data
not
shown).
HPLC
fractions enriched for
bacterially expressed
plO
lacked
reproducible
proteaseactivity, probably
because
of low recovery, poorstability,
orextensive denaturation.
The
following
lines of evidencestrongly
support thecandidacy
ofplO
astheauthentic
HIVprotease.(i)
Theproteaseactivity
could be
detected
only
in E.coli
expressing
the
HIVpol
ORF
andnotin E.coli
carrying
thevectorplasmid
alone.
(ii)
The gag
processing activity
wasabolished
by
antibodies
directed
against
allorpartof
the proteasedomain
(data
notshown).
(iii)
Bacterialexpression
of
a small DNAfragment
encompassing
the proteasedomain
yielded
a10-kDaprotein
that
processed
gag precursor(5).
(iv)
pol
ORFdeletions
spanning
the N terminusof
theprotease oreliminating
the entire protease domain abolished the appearanceof
thep64-p5i
HIV
RTconcomitant with thedisappearance
of
the 10-kDaprotein
inE.coli (R.
Swanstrom, personal
commu-nication).
The
relatively
lowyield
of the plOprotein from
virusprecluded
C terminusanalysis by carboxypeptidase
degra-dation.
Unlabeled, HPLC-purified,
E.coli-expressed
plO
was not amenable todigestion, either.
The deduced se-quence of thepol ORF
wouldpredict
a 12-residuetryptic
fragment
between residues 156 and 167 if theplO
and RT1 23 4 5 6
A
Ak.
qw#.
F"'I
0,
on November 10, 2019 by guest
http://jvi.asm.org/
50 55 60 65 70
B 1 2 3 4
5
kDa
- 46.0 - 30.0 - 21.5
do
- 14.340 0
o
.0
20
ir
so0
50l
o0
a
s
C
IHPLC 45 46 47 48 4950 51 52 53 54 55 56 57 58 59 60 6152 FRACTION
.Ifls
_ _ ____.,s_
_/-0
i.
1 69 168 1015
pol ORF H2N-FFRED-PQITLWORPLVTIKIGGOLKEALLDTGADD-- PISP-- ED-COOH A AA A AA
FIG. 4. Purificationand amino acidsequencedeterminationofE.coli-expressed protease.Thedialysateof HIVpolORF-expressingE. coliextracts(asdescribedin the legendtoFig. 3)wasappliedtoa20-mlcolumn of DEAE-cellulose in0.1 MNaCl, and theproteinswere
elutedby batchwisesalt gradients of 0.2, 0.5, and1.0 MNaCl. AfterRTanalysis and immunoblotting, theplO-enriched0.1to0.2 MNaCl
fractionswereconcentrated byacetoneprecipitation, dissolved in6 Mguanidine hydrochloride containing 0.1%trifluoroacetic acid(TFA),
andresolvedby RPHPLCon aVydac C4 column developed withalineargradient of 0to30%aqueousacetonitrile-0.1%TFAfor 30min followedby isocraticelution for10minanda60-mingradient of30to60% acetonitrile-0.1%TFA.Thiswasfollowed by isocratic elution for
10min with20%aqueousisopropanol-0. 1%TFAanda20to100%linear gradientofisopropanol-0.1% TFA for60min.Fractions(2 mleach)
werecollected, and samples of selected fractionswereelectrophoresed and immunoblotted with pooledserafrom AIDSpatients. (Panels Al
andA2)SDS-PAGEprofiles of E. coliextractswhichexpressthe UIIVpolORF resolved by RP HPLC. The discontinuous linear acetonitrile
(Al)andisopropanol (A2) gradientsaredenoted by lines drawnacrossthe panels. The proteinswereelectroblotted and screened with pooled
serafromAIDS patientsand 1251I-labeled protein A. Selected fractionswereassayedforRTactivity,and the RTprofile(fractions 25to34)
isillustrated byagraph in panel Al. (B) Immunoblot detection of the twoforms oftheputative HIVprotease expressed inE. coli.Two separateisopropanol gradient fractionscontaining thetwoforms of the putative HIVprotease(isopropanol gradient fraction58, lanes 2and
4; isopropanol gradient fraction64, lanes 3 and 5) were lyophilized and rerun on 17.5% polyacrylamide gels in SDS, electroblotted to
nitrocellulose,andimmunoreacted witharabbit hyperimmuneserumagainsta syntheticpeptide containing residues 154to167of the HIV
pol ORF (lanes 2 and 3) or pooled sera from AIDS patients (lanes 4 and 5). The isopropanol fractions 58 and 64 were mixed and
electrophoresed (lane1)and reacted with nonimmune rabbitserum.(C)Immunoprecipitation ofradiolabeled bacterialextractswithpooled sera from AIDS patients. A 100-ml sample ofanE. coli culture expressing the HIVpol ORF was induced with 1 mM
isopropyl-P-D-thiogalactopyranosidefor 30minand labeled for 2 h withacombination of[35S]Met(1mCi) and[3H]Leuor[3H]Val(5mCi).Bacterialextracts
were prepared and fractionatedby RPHPLCasdescribed above. Selected isopropanol gradient fractions were immunoprecipitated with pooledserafrom AIDSpatients and resolvedbySDS-PAGE.Theputativeprotease(arrow)waselectroeluted, and its N-terminalsequence wasdetermined(seethelegendtoFig.2);theresiduesidentifiedareindicated by thecaretswithin the relevant region of the deducedsequence of theHIVpolORF belowtheautoradiogram.Thenumbers abovetheproteinsequencerefertothe residues(1to1012)of the HIVpol ORF.
were contiguous and if the protease were cleaved after
Arg-155 (denoted by! inFig. 2b).Afragment of this sizewas not observed among the tryptic digestion products.
How-ever, the experimentally determined positions of Leu and Val withina large peptide, L10 (probably overlapping
Arg-155), coincidedwiththeirpositionsinthepol ORFbetween Lys-138 and Arg-155. Among different HIV isolates and otherretroviral proteases, there isaconserved domain of 6 to9 residues surrounding an invariant Arg atposition 155. We believe, therefore, that this Arg might reside within a
hydrophobic pocket and be shielded from tryptic attack. Since both the viral and bacterial proteins immunoreacted with antiserum raisedagainsta 14-residue peptidebetween
residues154and 167of thepol ORF, the C terminus of the protease is most likely contiguous with the N terminus of RT.Onthisbasis the HIV proteasewasassumedtocontain 99 amino acids witha calculated molecular mass of10,774
andtoberelativelyrichinbasic residueswithacalculated
pl
of 9.83. It had a canonical -DTG- sequence (conserved
among all retroviral gag proteases and generic aspartyl proteinases) atposition 25. Theprotein washighly
homolo-gous to retroviral proteases over a 13-residue region
cen-teredaround the -DTG- sequence.Two other lesserregions ofhomology with other retroviral proteases were centered around amino acidpositions52and 87(correspondingtothe polORF residues 120 and 155, respectively).
Thefollowing three types ofmechanisms have been de-scribed for theexpressionofthe
pol
ORF of retroviruses: (i) suppression ofan amber codon between the gag andpol
ORFs of murine leukemia virus and feline leukemia virus(21); (ii)asingleribosomalframeshiftnearthe start of the
pol
ORF of Rous sarcoma virus and HIV (10, 12); and (iii) doubleframeshiftingatthejunctionsof thegag-Xlpro
andX/ pro-polORFs of mouse mammary tumor virus (11). Sincethe HIV proteaseiscontainedwhollywithin the
pol
ORF, itislikely thatframeshifting occurs eithernearthe beginning ofthe
pol
ORForimmediately upstream of theprotease Nterminus. Frameshiftingatthe former sitehas recentlybeen
Al 1A
A2 69
30 14
kDa
- 200
- 92.5
- 69.0
-46.0
-30.0
- 21.5
- 14.3
on November 10, 2019 by guest
http://jvi.asm.org/
[image:5.612.63.569.66.307.2]demonstrated in vitro (10). Whatever the mechanism of gag-pol precursor synthesis, the initial cleavage of the protease moiety from the precursor is apparently an auto-catalytic process, asevidenced by the presenceof identical Ntermini for both the viral andE. coli-expressed proteins. Weare grateful to Malcolm A.Martin forsupport and encourage-ment. The help of Charles E. Buckler in computer analysis is acknowledged. Malcolm A. Martin and Arnold Rabson are also thanked forcriticalreview of themanuscript.
LITERATURE CITED
1. Adachi, A., H. E.Gendelman, S. Koenig, T. Folks, R. Willey, A. Rabson, and M. A. Martin. 1986.Production of acquired immu-nodeficiency syndrome-associated retrovirus inhumanand non-humancellstransfected withaninfectious molecular clone. J. Virol. 59:284-291.
2. Alexander, F., J. Leis, D. A. Soltis, R. M. Crowl, W. Danho, M. S. Poonian, Y.-C. E.Pan, and A. M. Skalka. 1987. Proteo-lytic processing of aviansarcomaandleukosis virusespol-endo recombinant proteins revealsanotherpolgenedomain.J. Virol. 61:534-542.
3. Coligan,J. E., F. T. Gates III, E. S.Kimball,and W. L.Maloy. 1983. Radiochemical sequence analysis ofbiosynthetically la-beled proteins.MethodsEnzymol. 91:413-444.
4. Crawford, S.,andS. P.Goff. 1985. Adeletion mutationin the 5' partofthepol geneof Moloney murine leukemia virusblocks proteolytic processing of thegagand pol polyproteins.J.Virol. 53:899-907.
5. Debouck, C., J. G. Gorniak, J. E. Strickler, T. D. Meek, B. W. Metcalf, and M. Rosenberg. 1987. Human immunodeficiency virus protease expressed in Escherichia coli exhibits autopro-cessing and specific maturation of the gag precursor. Proc. Natl.Acad. Sci. USA84:8903-8906.
6. Dittmar, K. J., and K.Moelling. 1978.Biochemicalpropertiesof p15-associatedproteaseinanavian RNA tumorvirus. J. Virol. 28:106-118.
7. Eisenman, R.N., W. S. Mason, and M. Linial. 1980. Synthesis andprocessing of polymerase proteins of wild-type andmutant avian retroviruses. J. Virol.36:62-78.
8. Farmerie, W. G., D. D. Loeb, N. C. Casavant, C. A. Hutchison III, M. H. Edgell, and R. Swanstrom. 1987. Expression and processing ofthe AIDS virus reverse transcriptase in Esche-richiacoli. Science 236:305-308.
9. Gendelman, H. E., T. S. Theodore, R. Willey, J. McCoy, A. Adachi, R. J. Mervis, S. Venkatesan, and M. A. Martin. 1987. Molecular characterization of a polymerase mutant human immunodeficiencyvirus. Virology 160:323-329.
10. Jacks, T., M. D. Power, F. R. Masiarz, P. A. Luciw, P. J. Barr, and H. E. Varmus. 1987. Characterization ofribosomal
frame-shiftingin HIV-1gag-polexpression.Nature(London) 331:280-283.
11. Jacks, T., K. Townsley, H. E. Varmus, andJ. Majors. 1987. Two efficient ribosomalframeshiftingevents arerequiredfor synthe-sis ofmousemammary tumorvirusgag-related polyproteins. Proc. Natl.Acad. Sci. USA 84:4298-4302.
12. Jacks, T., and H. E. Varmus. 1985. Expression ofthe Rous sarcomaviruspolgenebyribosomalframeshifting. Science230: 1237-1242.
13. Lightfoote, M. M., J. E. Coligan, T. M. Folks, A. S. Fauci,M.A. Martin, and S. Venkatesan. 1986.Structural characterizationof reverse transcriptase and endonuclease polypeptides of the acquired immunodeficiency syndrome retrovirus. J. Virol. 60: 771-775.
14. Lillehoj, E. P., N. B. Myers, D. R. Lee, T. H.Hansen, andJ.E. Coligan. 1985.Structural definition ofafamily of Ld-like mole-cules distributedamongfourofsevenhaplotypescompared.J. Immunol. 135:1271-1275.
15. Rice, N. R., R. M.Stephens, A. Burny, and R. V.Gilden.1985. The gag and pol genes of bovine leukemia virus: nucleotide sequence andanalysis. Virology 142:357-377.
16. Robey, W. G., B. Safai, S. Oroszlan, L.0. Arthur, M. A. Gonda, R. C. Gallo, and P. J. Fischinger. 1985. Characterization of envelope andcore structural geneproducts of HTLV-III with serafromAIDSpatients. Science 228:593-595.
17. Steimer, K.S., K. W. Higgins, M. A. Powers, J. C.Stephans, A. Gyenes, C.George-Nascimento, P. A.Luciw, P. J. Barr,R. A. Hallewell, and R. Sanchez-Pescador. 1986. Recombinant poly-peptide fromtheendonuclease region of the acquiredimmune deficiency syndrome retrovirus polymerase (po) gene detects serumantibodiesinmostinfected individuals.J.Virol.58:9-16. 18. Veronese, F. D., T. D.Copeland, A. L. DeVico, R. Rahman, S. Oroszlan,R. C.Gallo, and M. G.Sarngadharan. 1986. Charac-terization ofhighly immunogenic p66/p5l as the reverse tran-scriptase of HTLV-III/LAV. Science 231:1289-1291.
19. Vogt, V. M., A. Wight, and R. Eisenman. 1979. Invitrocleavage of avianretrovirusgagproteins by viralproteasep15. Virology 98:154-167.
20. Witte,0.N., and D. Baltimore. 1978.Relationship of retrovirus polyprotein cleavagestovirionmaturation studied with temper-ature-sensitivemurineleukemia virusmutants.J.Virol. 26:750-761.
21. Yoshinaka, Y.,I. Katoh, T. D. Copeland, and S. Oroszlan. 1985. Murineleukemia virusproteaseis encoded by the gag-polgene and issynthesized throughsuppression ofanambertermination codon. Proc. Natl. Acad. Sci. USA 82:1618-1622.
22. Yoshinaka, Y.,I. Katoh, T. D. Copeland, G. W. Smythers, and S. Oroszlan. 1986.Bovine leukemiavirusprotease:purification, chemical analysis, and in vitro processing of gag precursor polyproteins.J. Virol. 57:826-832.