Conditional Value-at-Risk and Average Value-at-Risk: Estimation and Asymptotics

(1)

MPRA

Munich Personal RePEc Archive

Conditional Value-at-Risk and Average

Value-at-Risk: Estimation and

Asymptotics

So Yeon Chun and Alexander Shapiro and Stan Uryasev

School of Industrial and Systems Engineering, Georgia Institute of

Technology, epartment of Industrial and Systems Engineering,

University of Florida

3 April 2011

Online at

https://mpra.ub.uni-muenchen.de/33115/

(2)

Value-at-Risk: Estimation and Asymptotics

SoYeonChun*

SchoolofIndustrialandSystemsEngineering,GeorgiaInstituteofTechnology,[email protected] Alexander Shapiro

y

SchoolofIndustrialandSystemsEngineering,GeorgiaInstituteofTechnology,[email protected] StanUryasev

DepartmentofIndustrialandSystemsEngineering,UniversityofFlorida,uryasev@u .edu

Wediscusslinearregressionapproachestoestimationoflawinvariantconditionalriskmeasures.Two esti-mation proceduresare considered and compared; oneis based onresidual analysis of the standard least squares method and the otheris in the spiritof the M-estimationapproach used inrobust statistics.In particular,Value-at-RiskandAverageValue-at-Riskmeasuresarediscussedindetails.Largesample statis-ticalinferenceoftheestimatorsisderived.Furthermore,nitesamplepropertiesoftheproposedestimators are investigated andcompared withtheoretical derivations inanextensive MonteCarlo study. Empirical resultsonthereal-data(dierent nancialasset classes) arealsoprovidedtoillustratetheperformance of theestimators.

Keywords: Value-at-Risk,AverageValue-at-Risk,linearregression,leastsquaresresiduals,M-estimators, quantileregression,conditionalriskmeasures,lawinvariantriskmeasures,statistical inference

1. Introduction

Innancialindustry,sell-sideanalystsperiodicallypublishrecommendationsof underlying securi-tieswithtargetprices(e.g.,GoldmanSach'sConvictionBuyList).Thoserecommendationsre ect speciceconomicconditionsandin uenceinvestors'decisionsandthuspricemovements.However, thistypeof analysisdoesnotprovideriskmeasuresassociatedwithunderlyingcompanies.Wesee similar phenomenainthe buy-side analysisas well. Each analystor team covers dierent sectors (e.g.,AirlinesVS Semi-conductors)andtypicallymakesseparaterecommendationsforthe portfo-liomanagerswithoutassociatedrisk measures.However, riskmeasures of thecoveringcompanies

ResearchofthisauthorwaspartlysupportedbytheNSFawardDMS-0914785. y

(3)

are one of themost important factors for investment decision making. In this paper, we consider ways to estimate risk measures for a single asset at given market conditions. These information couldbe usefulforinvestorsandportfoliomanagerstocompareprospective securitiesandtopick thebest.Forexample,whenportfoliomanagersexpectthecrudeoilpricetohike(duetoin ation or geo-politicalcon icts), they couldselect securitiesless sensitive to oilprice movements in the airlineindustry.

In order to formalize our discussion, let us introduce the following setting. Let (;F) be a measurablespaceequippedwithprobabilitymeasureP.AmeasurablefunctionY :!R iscalled arandomvariable.WithrandomvariableY,weassociateanumber(Y)towhichwe referasrisk measure.Weassumethat\smallerisbetter",i.e.,betweentwopossiblerealizationsofrandomdata, weprefertheonewithsmallervalueof().Theterm\riskmeasure"issomewhatunfortunatesince itcanbe confusedwith theprobabilitymeasure.Moreover, inapplicationsoneoftentriestoreach acompromisebetweenminimizingtheexpectation(i.e.,minimizingonaverage)andcontrollingthe associatedrisk.Thus,someauthorsusetheterm\mean-riskmeasure",or\acceptabilityfunctional" (e.g.P ugandRomisch2007).Forhistoricalreasons,weuseherethe\riskmeasure"terminology. Formally, risk measure is a function :Y !R dened on an appropriate space Y of random variables.For example, in someapplications it is naturalto use the space Y=L

p

(;F;P), with p2[1;1), of randomvariableshavingnitep-th ordermoments.

It was suggestedin Artzneret al. (1999) that a \good" risk measure shouldhave the following properties(axioms),andsuch risk measureswerecalledcoherent.

(A1) Monotonicity:If Y;Y 0

2Y andY Y 0

,then (Y)(Y 0 ). (A2) Convexity: (tY +(1 t)Y 0 )t(Y)+(1 t)(Y 0 ) for allY;Y 0 2Y andallt2[0;1].

(A3) TranslationEquivariance:Ifa2R andY 2Y,then (Y +a)=(Y)+a. (A4) Positive Homogeneity:If t0andY 2Y,then (tY)=t(Y).

(4)

The notation Y Y 0

means that Y(!) Y 0

(!) for a.e. ! 2 . We may refer, e.g., to DetlefsenandScandolo (2005), Weber (2006), FollmerandSchied (2011) fora further discussion of mathematicalpropertiesof risk measures.

An importantexampleof risk measuresis theValue-at-Riskmeasure

V @R (Y)=infft:F Y (t)g; (1) where 2(0;1) and F Y

(t)=Pr(Y t) is the cumulative distribution function (cdf) of Y, i.e., V @R

(Y)=F 1 Y

() isthe leftside-quantileof the distributionof Y.This risk measuresatises axioms(A1),(A3) and(A4),butnot(A2), andhenceisnotcoherent.Another importantexample is theso-calledAverageValue-at-Risk measure, which canbe dened as

AV @R (Y)=inf t2R t+(1 ) 1 E[ Y t] + (2)

(cf.,RockafellarandUryasev 2002),orequivalently

AV @R (Y)= 1 1 Z 1 V @R (Y)d: (3) Note that AV @R (Y) is nite i E[ Y] +

<1. Therefore, it is natural to use the space Y = L

1

(;F;P)ofrandomvariableshavingniterstordermomentfortheAV @R

riskmeasure.The Average Value-at-Risk measure is also called the Conditional Value-at-Risk or Expected Short-fall measure. (Since we discuss here \conditional" variants of risk measures, we use the Average Value-at-Riskratherthan ConditionalValue-at-Riskterminology.)

TheValue-at-RiskandAverageValue-at-Riskmeasures arewidelyusedtomeasureandmanage risk in the nancial industry (see, e.g., Jorion 2003, DuÆeandSingleton 2003, Gaglianoneet al. 2011,forthenancialbackgroundandvariousapplications).Notethatintheabovetwoexamples, risk measures are functions of the distribution of Y. Such risk measures are called law invari-ant. Law invariant risk measures have been studiedextensively inthe nancial risk management literature (e.g., Acerbi 2002, Frey andMcNeil 2002, Scaillet 2004, FermanianandScaillet 2005, ChenandTang 2005, Zhu andFukushima 2009, JacksonandPerraudin 2000, Berkowitzet al.

(5)

2002,Bluhmet al.2002,andreferencetherein).Sometimes,wewritea lawinvariantriskmeasure asa function(F) of cdfF.

Nowletusconsiderasituationwherethereexistsinformationcomposedofeconomicandmarket variables X

1 ;:::;X

k

which canbe consideredas a set of predictors fora variableof interestY. In thatcase,onecanbeinterestedinestimationofariskmeasureofY conditionalonobservedvalues of predictors X

1 ;:::;X

k

. For example, suppose we want to measure (predict) the risk of a single assetgivenspeciceconomicconditionsrepresentedbymarketindexandinterestrates.Then,for arandomvectorX=(X 1 ;:::;X k ) T

ofrelevantpredictors,theconditionalversionofalawinvariant riskmeasure,denoted(YjX)or

jX

(Y),isobtainedbyapplyingtotheconditionaldistribution of Y given X. In particular,V @R

(YjX) is the -quantile of the conditional distribution of Y givenX,and AV @R (YjX)= 1 1 Z 1 V @R (YjX)d: (4)

Recently,severalresearchershavepaid attentiontoestimationoftheconditionalriskmeasures. For the conditional Value-at-Risk, ChernozhukovandUmantsev (2001) used a polynomial type regression quantile model and EngleandManganelli (2004) proposed the model which species the evolution of the quantile over time using a special type of autoregressive processes. In both models, unknown parameterswere estimatedbyminimizingthe regression quantile lossfunction. ForconditionalAverageValue-at-Risk,Scaillet(2005)andCaiandWang(2008)utilized Nadaraya-Watson(NW)typenonparametricdoublekernelestimationwhilePeracchi andTanase(2008) and Leoratoet al. (2010) usedthe semiparametricmethod.

In this paper, we discuss procedures forestimationof conditionalrisk measures. Especially,we willpayattentiontoestimationofconditionalValue-at-RiskandAverageValue-at-Riskmeasures. We assume thefollowinglinearmodel(linear regression)

Y = 0 + T X+"; (5) where 0 and=( 1 ;:::; k ) T

are(unknown)parametersofthemodelandtheerror(noise)random variable"isassumedtobeindependentofrandomvectorX.Meaningofthemodel(5)isthatthere

(6)

isatrue(population)value 0

;

of therespectiveparametersforwhich(5)holds.Sometimes,we willwritethisexplicitlyandsometimessuppressthisinthe notation.

Let () be a law invariant risk measuresatisfying axiom(A3) (TranslationEquivariance),and

jX

() be its conditionalanalogue. Note that because of the independenceof "and X,it follows that

jX

(")=(").Together withaxiom(A3), thisimplies

jX (Y)= jX ( 0 + T X+")= 0 + T X+ jX (")= 0 + T X+("): (6) Since 0 +(")=("+ 0

), we can set (")=0 by adding a constant to the error term. In that case, for the truevalues of the parameters,we have

jX (Y)= 0 + T

X. Hence, the question is howtoestimatethese(true) values

0

;

of therespectiveparameters.

This paperis organizedasfollows.InSection2,wedescribetwodierent estimationprocedures fortheconditionalriskmeasures;oneisbasedonresidualsoftheleastsquaresestimationprocedure andtheotherisbasedontheM-estimationapproach.Asymptoticpropertiesofbothestimatorsare provided inSection3. In Section4,we investigate thenite sampleandasymptoticpropertiesof theconsideredestimators.We presentMonteCarlosimulationresultsunder dierent distribution assumptions of the error term. Later, we illustrate the performance of dierent methods on the real data (dierent nancial asset classes) in Section 5. Finally, Section6 gives some conclusion remarks andsuggestionsforfutureresearch directions.

2. Basic Estimation Procedures

Suppose that we have N observations (datapoints) (Y i

;X i

), i=1;:::;N, which satisfy the linear regressionmodel(5),i.e.,

Y i = 0 + T X i +" i ; i=1;:::;N: (7)

We assume that:(i)X i

,i=1;:::;N, areiid(independent identicallydistributed)randomvectors, andwriteX forrandomvectorhavingthesamedistributionasX

i

,(ii)theerrors" 1

;::;" N

are iid withnitesecondordermomentsandindependentof X

i .We denoteby 2 =Var[" i ]the common varianceof theerror terms.

(7)

There aretwobasicapproachestoestimationofthetruevaluesof 0

and.One approachisto apply thestandardLeast Squares(LS) estimationprocedureandthen to make an adjustment of the estimateof the interceptparameter

0

.That is,let ~ 0

and ~

be the least squares estimators of therespectiveparameters ofthe linearmodel(7) and

e i :=Y i ~ 0 ~ T X i ; i=1;:::;N; (8)

be the correspondingresiduals. By the standard theory of the LS method, we have that ~ 0

and ~

are unbiased estimatorsof the respective parametersof the linearmodel (5) provided E["]=0. Therefore,weneedtomake thecorrection

~ 0

+(")oftheinterceptestimator.Ifweknewthetrue values"

1 ;:::;"

N

oftheerrorterm,wecouldestimate(")byreplacingthecdfF " of"byitsempirical estimate ^ F ";N associated with" 1 ;:::;" N ,i.e.,toestimate(F " ) by( ^ F ";N

).Sincetruevalues ofthe error term are unknown, it is a naturalidea to replace"

1 ;:::;"

N

by the residual values e 1

;:::;e N

. Hence, we use the estimator

~ 0 +( ^ F e;N ), where ^ F e;N

is the empirical cdf of the residual values, i.e.,

^ F

e;N

is thecdfof theprobabilitydistributionassigningmass1=N toeachpointe i

,i=1;:::;N (seesection3.1 for furtherdiscussion).We referto thisestimationapproach asthe Least Squares Residuals (LSR)method.

Analternativeapproachisbasedonthefollowingidea.Supposethatwecanconstructafunction h(y;) of y2R and 2R, convex in , such that the minimizer of E

F

[h(Y;)] will be equal to (F),i.e.,(F)=argmin

E

F

[h(Y;)]: Since(Y +a)=(Y)+aforanya2R, itfollowsthat the function h(y;) should be of the form h(y;)= (y ) forsome convex function :R !R. We referto () asthe error function.Therefore,we needtoconstructan errorfunction such that

(F)=argmin

E F

[ (Y )]: (9)

This isequivalenttosolving theequation

E F

[(Y )]=0; (10)

where(t):= 0

(t).Note thattheerrorfunction () couldbe nondierentiable,inwhichcasethe correspondingderivativefunction()is discontinuous.Thatis,thefunction()ismonotonically nondecreasing.

(8)

The correspondingestimators ^ 0 and ^

aretakenassolutionsof theoptimizationproblem

Min 0 ; N X i=1 Y i 0 T X i : (11)

In the statistics literature, such estimators are called M-estimators (the terminology which we will follow) and for an appropriate choice of the error function, this is the approach of robust regression(Huber1981).FortheV @R

risk measure,theerrorfunctionis readilyavailable(recall that [t] + =maxf0;tg): (t):=[t] + +(1 )[ t] + : (12)

The corresponding robust regression approach is known as the quantile regression method (cf. Koenker2005).

Forcoherentriskmeasures,thesituationismoredelicate.Letusmakethefollowingobservations. Suppose that therepresentation (9) holds.Let F

1 andF

2

be two cdfsuchthat (F 1

)=(F 2

)=. Thenitfollowsby(9) (by(10))that(tF

1

+(1 t)F 2

)=foranyt2[0;1]. Thisis quite astrong necessary condition for existence of a representation of the form (9). It certainly doesn't hold for the AV @R

, 2(0;1), risk measure. Indeed, consider the following probabilitydistributions F 1 :=Æ a + 1 2 (1 )(Æ b +Æ d ),F 2 :=Æ c +(1 )Æ (b+d)=2 and 1 2 (F 1 +F 2 )= 1 2 Æ a + 1 4 (1 )Æ b + 1 2 Æ c + 1 4 (1 )Æ d + 1 2 (1 )Æ (b+d)=2 ; whereÆ x

denotesmeasureofmassoneatx,2( 1 2

;1)anda<b<c<daresuchthatc< 1 2

(b+d).It isstraightforwardtocalculatethatAV @R

(F 1 )=AV @R (F 2 )= 1 2 (b+d).However,since2( 1 2 ;1), AV @R 1 2 (F 1 +F 2 ) = 1 4 b+2c(1 ) 1 +2d > 1 4 (b+2c+2d)> 1 4 (2b+c+2d)> 1 2 (b+d):

These arguments areduetoGneiting(2009).

Thisshowsthatforgeneralcoherentriskmeasures,possibilityofconstructingthecorresponding M-estimatorsisratherexceptional,andsuch estimatorscertainlydo notexistfortheAV @R

risk measure. Nevertheless, it is possibleto construct the following approximations (this construction is essentiallyduetoRockafellaretal. (2008)).

(9)

Proposition 1. Let j :R!R, j=1;:::;r, be convex functions, j 2R be such that P r j=1 j =1 and E(Y):= inf 2R r ( E " r X j=1 j (Y j ) # : r X j=1 j j =0 ) : (13) Moreover, let S j (Y) be a minimizer ofE[ j

(Y )] over2R. Then S(Y):= P r j=1 j S j (Y) is a minimizerof E(Y ) over2R.

Proof Considertheproblem

Min ; E " r X j=1 j (Y j ) # s:t: r X j=1 j j =0: (14)

By makingchangeof variables j

=+ j

, j=1;:::;r,we canwritethisprobleminthe form

Min ; E " r X j=1 j (Y j ) # s:t: r X j=1 j j =: (15) SinceS j (Y) is a minimizerof E[ j (Y j )], it follows that j =S j (Y), i=1;::;r,=S(Y), is an optimalsolutionof problem(15).This completestheprof.

In particular,we canconsiderfunctions j

() oftheform (12), i.e.,

j (t):= j [t] + +(1 j )[ t] + ; (16) for some j 2 (0;1), j = 1;:::;r. Then S j (Y) = V @R j

(Y) and hence the risk measure P r j=1 j V @R j

(Y) isa minimizerof E(Y ).We canview P r j=1 j V @R j (Y) asa discretization of theintegral 1 1 R 1 V @R

(Y)d ifwe set :=(1 )=rand take

j :=(1 ) 1 ; j :=+(j 0:5);j=1;:::;r: (17) For thischoiceof j , j

,andby formula(3),we have that

AV @R (Y)= 1 1 Z 1 V @R (Y)d r X j=1 j V @R j (Y)=S(Y): (18)

Considernowthe problem

Min ; E(Y 0 T X): (19)

(10)

By thedenition(13) of E(),wecanwritethis probleminthefollowingequivalent form Min ; 0 ; E h P r j=1 j (Y 0 T X i ) i s:t: P r j=1 j j =0: (20)

The so-calledSampleAverage Approximation(SAA)of thisproblemis

Min ; 0 ; 1 N N X i=1 r X j=1 j (Y i 0 T X i i ) s:t: r X j=1 j j =0: (21)

The above problem (21) can be formulated as a linear programming problem. Following Rockafellaretal. (2008),we considerthe followingestimators.

Mixed quantile estimator for AV @R (Yjx) We refer to 0 + T

x as the mixed quantile estimator of AV @R (Yjx), where (; 0 ; ) is an optimalsolutionof problem(21).

Thisideacanbeextendedtoalargerclassoflawinvariantrisk measures.Forexample,consider a riskmeasure

(Y):=cE[ Y]+(1 c)AV @R

(Y) (22)

for some constants c2[0;1] and 2(0;1). Recall that the minimizerof E[(Y t) 2

] is t

=E[Y]. Therefore, by taking functions

0 (t):=t 2 , functions j (t) of the form (16), j , and j given in (17),we canconstructthecorrespondingerrorfunction

E(Y):= inf 2R r+1 ( E " 0 (Y 0 )+ r X j=1 j (Y j ) # :c 0 + r X j=1 (1 c) j j =0 ) : (23)

As anotherexample,considerrisk measuresof theform

(Y):= Z 1 0 AV @R (Y)d(); (24)

where is a probability measure on the interval [0;1). By a result due to Kusuoka (2001), this measures forma class ofthe comonotelawinvariantcoherentrisk measures.By (3),we canwrite suchrisk measureas

(Y)= Z 1 Z 1 (1 ) 1 V @R (Y)dd()= Z 1 w()V @R (Y)d; (25)

(11)

wherew():= R 0 (1 ) 1

d(). Suchriskmeasuresarealsocalledspectralrisk measures(Acerbi 2002).By makinga discretizationof theabove integral(25),we canproceedasabove.

It could be remarked here that while the LSR approach is quite general, the approach based on mixingM-estimatorsissomewhatrestrictive.Constructingan appropriateerrorfunction fora particularriskmeasure couldbe quiteinvolved.

3. Large Sample Statistical Inference

In the previous section, we formulated two approaches, the LSR estimators and mixed M -estimators, toestimationof thetrue (population)valuesof parameters

0

;

of the linearmodel (5) such that (")=0.For theV @R

risk measure,the correspondingM-estimators ^ 0 and ^ are takenas solutionsof theoptimization problem(11), withthe error function(12), andreferredto asthequantileregressionestimators.FortheAV @R

riskmeasureandmoregenerallycomonotone riskmeasuresoftheform(25),weconstructedthecorrespondingmixedquantileestimators;

0 ; . In thissection, we discuss statisticalpropertiesof these estimators.In particular,we address the questionofwhichofthesetwoestimationproceduresismoreeÆcientbycomputingcorresponding asymptoticvariances.

3.1. Statistical Inference ofLeast Squares ResidualEstimators The linearmodel(7) canbewrittenas

Y =X[ 0 ;]+ ; (26) where Y =(Y 1 ;:::;Y N ) T

is N 1 vector of responses, X is N (k+1) data matrix of predictor variableswithrows (1;X

T i

),i=1;:::;N,(i.e.,rstcolumnofX iscolumnof ones),=( 1

;:::; k

) T vectorof parametersand=("

1 ;::;" N ) T isN1 vectorof errors.By [ 0 ;], wedenote(k+1)1 vector ( 0 ; T ) T

.We assume that theconditions (i)and (ii),specied atthe beginningof section 2,hold.ItisalsopossibletoviewdatapointsX

i

asdeterministic.Inthatcase, weassume thatX hasfullcolumn rankk+1.

Let ~ and

~

(12)

Recallthattheseestimatorsaregivenby[ ~ 0 ; ~ ]=(X T X) 1 X T Y,vectorofresidualse:=Y X[ ~ 0 ; ~ ] is givenby e=(I N H)Y =(I N H); where I N

is the NN identitymatrix, and H=X(X T

X) 1

X T

is the so-calledhat matrix. Note that trace(H)=k+1 andwe have that

" i e i =[1;X T i ](X T X) 1 X T ; i=1;:::;N: (27) If we knewerrors" 1 ;::;" N

, we couldestimate(") bythe correspondingsampleestimatebased on theempiricalcdf ^ F ";N ()=N 1 N X i=1 I [" i ;1) (); (28) where I A

() denotes the indicator function of set A. However, the true values of the errors are unknown.Therefore,intheLSRapproachwe replacethembytheresidualscomputedbytheleast squaresmethod andhence estimate(")byemployingtherespectiveempiricalcdf

^ F e;N () instead of ^ F ";N ().

The rstnaturalquestion is whetherthe LSRestimatorsare consistent,i.e., convergew.p.1 to their truevalues asthe samplesizeN tendsto innity.It iswellknownthat, under thespecied assumptions, theLS estimators

~ 0

and ~

are consistent, with ~ 0

beingconsistent under the con-dition E["]=0. The question of consistency of empirical estimates of law invariant coherent risk measures was studiedin Wozabal andWozabal (2009). It was shown that, under mild regularity conditions,such estimatorsareconsistent.Inparticular,theconsistencyholdsforthecomonotone riskmeasures oftheform (25),i.e.,(

^ F ";N ) convergesw.p.1 to(F " ) asN!1.It isalsopossible toshowthatthedierence(

^ F ";N ) ( ^ F e;N

)tendsw.p.1tozeroandhence( ^ F e;N )convergesw.p.1 to(F "

) aswell. A rigorousproofof thiscouldbe quite technical andwill be beyondthescope of thispaper.

We have thatthe LS estimator h ~ 0 ; ~ i

asymptoticallyhasnormaldistributionwith the asymp-totic covariance matrix N

1 2 1 , where :=E[ X]; :=E XX T and := 1 T : Conse-quently, for a given x, the estimate

~ 0 +x T ~

asymptotically has normal distribution with the asymptoticvarianceN 1 2 [1;x T ] 1 [1;x T ] T .

(13)

We also have that random vectors ( ~ 0 ; ~

) and e are uncorrelated. Therefore, if errors " i

have normaldistribution,thenvectors(

~ 0

; ~

)andejointlyhaveamultivariatenormaldistributionand thesevectorsareindependent.Consequently,

~ 0 +x T ~ and( ^ F e;N

)areindependent.Fornonnormal distribution,thisindependenceholdsasymptoticallyandthusasymptotically

~ 0 +x T ~ and( ^ F e;N ) areuncorrelated.

Asymptotics of empirical estimators of law invariant coherent risk measures were studied in P ugandWozabal (2010) and Shapiroetal. (2009, section 6.5). Derivation of the asymptotic varianceof(

^ F

";N

),foragenerallawinvariantriskmeasure,couldbequiteinvolved.Letusconsider two important cases of the V @R

and AV @R

risk measures. We give below a summary of basic results,fora moretechnical discussionwe refertotheAppendix.

In caseof :=V @R ,theLSRestimateof V @R (")becomes [ V @R (e):= ^ F 1 e;N ()=e (dNe) ; (29) wheree (1) :::e (N)

areorderstatistics(i.e.,numberse 1

;:::;e N

arrangedintheincreasingorder), anddaedenotesthesmallestintegera.SupposethatthecdfF

"

()hasnonzerodensityf " ()=F 0 " () atF 1 " () andlet ! 2 := (1 ) [f " (F 1 " ())] 2 : (30) LSR estimator of V @R (Yjx) Consider the LSR estimator

~ 0 +x T ~ + [ V @R (e) of V @R

(Yjx ). Suppose that the set of population -quantiles is a singleton. Then the LSR estimator is a consistent estimator of V @R

(Yjx), andtheasymptoticvarianceof thisestimatorcanbeapproximatedby

N 1 ! 2 + 2 [1;x T ] 1 [1;x T ] T : (31)

Detailedderivationofabove asymptoticsis discussedinAppendix A. Forthe:=AV @R

risk measure,the LSRestimateofAV @R (")is givenby \ AV @R (e)=inf t2R n t+ 1 (1 )N P N i=1 [e i t] + o = [ V @R (e)+ 1 (1 )N P N i=1 h e i [ V @R (e) i + =e (dNe) + 1 P N i=dNe+1 e (i) e (dNe) : (32)

(14)

LSR estimator of AV @R

(Yjx ) ConsidertheLSRestimator

~ 0 +x T ~ + \ AV @R (e)ofAV @R

(Yjx).Thisestimatorisconsistent anditsasymptoticvarianceis givenby

N 1 2 + 2 [1;x T ] 1 [1;x T ] T ; (33) where 2 :=(1 ) 2 Var [" V @R (")] + ,:= 1 T ;:=E[X]and:=E XX T .

The aboveasymptoticsare discussedin AppendixB.

Remark 1. Itshouldberememberedthattheaboveapproximatevariancesareasymptoticresults. SupposeforthemomentthatN<(1 )

1

.ThendNe=N andhence [ V @R (")=maxf" 1 ;:::;" N g. Consequently " i [ V @R (") +

=0 foralli=1;:::;N,andhence

\ AV @R (")= [ V @R (")=maxf" 1 ;:::;" N g:

Inthatcasetheaboveasymptoticsareinappropriate.Inorderfortheseasymptoticstobe reason-able,N shouldbesignicantlybiggerthan(1 )

1 .

LSRapproachcanbeeasilyappliedtoaconsiderablylargerclassoflawinvariantriskmeasures. Forexample,letusconsidertheentropicriskmeasure(Y):=

1 logE[ e

Y

],where>0isa pos-itive constant.This riskmeasuresatises axioms(A1){(A3),butitis notpositivelyhomogeneous (see Giesecke andWeber 2008, for the general discussion of utility-based shortfallrisk including entropic riskmeasure).The empiricalestimateof (")is

( ^ F ";N )= 1 log N 1 P N i=1 e " i : (34)

Ofcourse,asitwasdiscussedabove,theerrors" i

shouldbereplacedbytherespectiveresidualse i

in theconstructionof thecorrespondingLSRestimators.Byusinglinearizationse

"

=1+"+o(") andlog(1+x)=x+o(x),we obtainthat N

1=2 [(

^ F

";N

) (")]converges indistributiontonormal withzeromeanandvariance

2

(15)

3.2. Statistical Inference ofQuantile and Mixed Quantile Estimators

As it was discussed in section2, the quantile regression is a particular caseof the M-estimation method with the error function () of the form (12). By the Law of Large Numbers (LLN), we have that N

1

times the objective function in (11) converges (pointwise) w.p.1 to the function ( 0 ;):=E (Y 0 T X) .We alsohave ( 0 ;) =E 0 + T X+" 0 T X =E[ (" ( 0 0 ) ( ) T X)]: (35)

Under mildregularityconditions,derivativesof ( 0

;)canbetakeninsidetheintegral (expec-tation)andhence

r 0 ( 0 ;) =E[r 0 (" ( 0 0 ) ( ) T X)] = E[ 0 (" ( 0 0 ) ( ) T X)]; (36) r ( 0 ;) =E[r (" ( 0 0 ) ( ) T X)] = E[ 0 (" ( 0 0 ) ( ) T X)X]: (37)

Since" andX are independent,we obtainthat derivativesof ( 0 ;) arezeros at( 0 ; ) ifthe followingconditionholds

E[ 0

(")]=0: (38)

Since function (;) is convex, it follows that if condition (38) holds, then (;) attains its minimumat( 0 ; ).Iftheminimizer( 0 ;

)isunique,thentheestimator( ^ 0 ; ^ )convergesw.p.1 to the population value (

0 ; ) as N !1, i.e., ( ^ 0 ; ^ ) is a consistent estimator of ( 0 ; ) (cf. Huber1981). Thatis,(38) isthebasic conditionforconsistencyof (

^ 0 ; ^ ). Fortheerrorfunction (12) ofthe quantile regression,we have

0 (t)= 1ift<0; ift>0: (39)

(Note that here the error function (t) is not dierentiable at t=0 and its derivative 0

(t) is discontinuous at t=0. Nevertheless, all arguments can go through provided that the error term hasa continuousdistribution.)Consequently,

E[ 0

(16)

andhencecondition (38) holds i F "

(0)=, or equivalentlyF 1 "

()=0 provided thisquantile is unique. In that case, the estimator (

^ 0

; ^

) is consistent if the population value 0

is normalized such that V @R

(")=0. That is,for this errorfunction, ^ 0 + ^ T

x is a consistent estimator of the conditionalValue-at-RiskV @R

(Yjx)of Y givenX=x. It is also possible to derive asymptotics of the estimator (

^ 0

; ^

). That is, suppose that the cdf F

"

() has nonzero density f " () =F 0 " () at F 1 " () and consider ! 2 dened in (30). Then N 1=2 h ^ 0 0 ; ^ i

converges in distribution to normal with zero mean vector and covariance matrix(cf., Koenker2005) ! 2 [1;x T ] 1 [1;x T ] T ; (41) i.e.,N 1

timesthe matrixgivenin(41) is theasymptoticcovariancematrixof h ^ 0 ; ^ i .

Remark 2. Note that by LLN, we have that N 1 P N i=1 X i andN 1 P N i=1 X i X T i converge w.p.1 as N!1 tothe vector andmatrix respectivelyandthat

T

is thecovariancematrix of X. In case of deterministic X

i

, we simply dene vector and matrix as the respective limitsof N 1 P N i=1 X i and N 1 P N i=1 X i X T i

, assumingthat suchlimitsexist. It follows then that N

1 X

T

X! asN!1.

Themixedquantileestimator 0 + T

xcanbejustiedbythefollowingarguments.Wehavethat an optimal solution (; 0 ;

) of problem (21) converges w.p.1 as N !1 to theoptimal solution ( ? ; ? 0 ; ?

)ofproblem(20),provided(20)hasuniqueoptimalsolution.Becauseofthelinearmodel (5),we canwriteproblem(20) as

Min ; 0 ; E h P r j=1 j ("+ 0 0 +( ) T X j ) i s:t: P r j=1 j j =0; (42) where 0 and

are population values of the parameters. Similarto the proof of Proposition1, bymakingchangeof variables

j =

0 +

j

,j=1;:::;r,we canwrite problem(42) in thefollowing equivalentform Min ; ; E h P r j=1 j ("+ 0 j +( ) T X) i s:t: P r j=1 j j = 0 : (43)

(17)

−4

−2

0

2

4 −25

−20

−15

−10

−5

0

5

10

15

20 Standard Normal Quantiles

Quantiles of Input Sample

(a)N(0,1)vs.t(3)

−4

−2

0

2

4 −10

−5

0

5

10

15 Standard Normal Quantiles

Quantiles of Input Sample

(b)N(0,1)vs.CN(1,9)

−4

−2

0

2

4 −10

0

10

20

30

40

50

60

70 Standard Normal Quantiles

Quantiles of Input Sample

(c)N(0,1)vs.LN(0,1) Figure1 NormalQ-Qplotfor dierenterrordistributions

It follows thatif r X j=1 j V @R j (")=0; (44) then ( ? 0 ; ? )=( 0 ; ). Thatis, 0 + T x is a consistent estimatorof P r j=1 j V @R j (Yjx). Con-sequentlyfor j and j

givenin (17), we canuse 0 + T xas anapproximationof AV @R (Yjx). Asymptoticsofthemixedquantileestimatorsaremoreinvolved.Theseasymptoticsarediscussed inAppendix C.

4. Simulation Study

Toillustratetheperformanceoftheconsideredestimators,weperformtheMonteCarlosimulations whereerrors(innovations)inlinearmodel(7)are generatedfrom followingdierent distributions; (1) StandardNormal (denotedas N(0;1)),(2)Student's tdistributionwith 3 degreesof freedom (denotedast(3)),(3)SkewedContaminatedNormalwherestandardnormalis contaminatedwith 20% N(1;9) errors (denoted as CN(1;9)), (4) Log-Normal with parameter 0 and 1 (denotedas LN(0;1)).Notethaterrordistributions(2)-(4)areheavy-tailedincontrasttothenormalerrorsas showninFigure4.Infact,nancialinnovationsoftenfollowheavy-taileddistributions.Weconsider =0:9;0:95;0:99, samplesize N =500;1000;2000 andR=500 replicationsfor each sample size. ConditionalValue-at-Risk(VaR)andAverageValue-at-Risk(AVaR)areestimatedandcompared

(18)

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2 −2

−1

0

1

2

3

4

5

6

7 x

_k

conditional VaR

MAE(QVaR)=0.4771, MAE(RVaR)=0.2145

True

QVaR

RVaR

(a)ConditionalVaR

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

0

1

2

3

4

5

6

7

8 x

_k

conditional AVaR

MAE(QAVaR)=0.6336, MAE(RAVaR)=0.2466

True

QAVaR

RAVaR

(b)ConditionalAVaR Figure2 ConditionalVaRandAVaR:Truevs.Estimated(ErrorsCN(1;9),=0:95,N=1000)

withtrue(theoretical)valuesatgiven500testpointsx k

(k=1;2;:::;500),whichareequallyspaced between -2and2 foreach replication.Estimatorsobtainedfrom dierent methodsarecomputed; quantile based estimator (referred to as \QVaR") and LSR estimator (referred to as \RVaR") for the conditional VaR, mixed quantile estimator (referred to as \QAVaR") and LSRestimator (referredtoas\RAVaR") fortheconditionalAVaR (as describedinSection2).

Figure 4 displays an example of estimation results where solid line is true (theoretical) VaR (AVaR), dash-circle line is QVaR (QAVaR), and dash-cross line is RVaR (RAVaR) given test points x

k

. In this example, errors follow CN(1;9), = 0:95 and N = 1000. In Fig-ure 4-(a), RVaR estimates are closer to true VaR values as Mean Absolute Error (MAE) conrms (MAE(QVaR)=0.4771 vs. MAE(RVaR)=0.2145). Performance of both estimators are worse for AVaR, yet RAVaR estimates are still closer to true AVaR values than QAVaR (MAE(QAVaR)=0.6336vs.MAE(RAVaR)=0.2466)asshowninFigure4-(b).

To compareestimatorsunder dierent errordistributions,MAE (averaged over alltest points) and variance of MAE (in parenthesis) across 500 replications are obtainedas shown in Table 1. Regardless of the error distributions,RVaR (RAVaR) works better than QVaR (QAVaR); MAE and the varianceof MAE are smaller. As we can expect,both estimators perform better for the

(19)

Table1 MAEfordierenterrordistributions=0:95;N=1000(averagedoveralltestpoints) Error QVaR RVaR QAVaR RAVaR

N(0;1) 0.0762 0.0575 0.0990 0.0674 (0.0037) (0.0020) (0.0058) (0.0026) t(3) 0.1758 0.1290 0.4255 0.3232 (0.0188) (0.0095) (0.0808) (0.0623) CN(1;9) 0.3006 0.1955 0.3844 0.2311 (0.0563) (0.0225) (0.0882) (0.0316) LN(0;1) 0.3905 0.2670 0.8957 0.6432 (0.0959) (0.0430) (0.3896) (0.2481)

QAVaR−N(0,1)

RAVaR−N(0,1)

QAVaR−t(3)

RAVaR−t(3)

QAVaR−CN(1,9) RAVaR−CN(1,9) QAVaR−LN(0,1) RAVaR−LN(0,1)

0

0.5

1

1.5

2

2.5

3 MAE (Mean Absolute Error)

Estimator − Error Distribution

Figure3 MAEforconditionalAVaRgivenx=1:006underdierenterrordistributions(=0:95,N=1000)

conditionalVaR thanAVaR.

Figure 3 presents box-plots for both estimators (QAVaR and RAVaR) given x=1:006 across 500replications.Findingsaresimilar totheonefrom Table1; therearesomeevidence tosuggest that RAVaR hassmaller MAEthan QAVaR. Also, RAVaR is more stable thanQAVaR (MAE of QAVaRismorespread).Notethatbothestimatorsworkbetterfornormaldistributionsthanother heavy-tailed distributions.We couldobserve thesimilarpattern forconditionalVaR.

(20)

Table2 MAEfordierentsamplesizeN with=0:95(averagedoveralltestpoints) Error Estimator N=500 N =1000 N =2000 N(0;1) QVaR 0.1129 0.0762 0.0569 RVaR 0.0849 0.0575 0.0418 QAVaR 0.1390 0.0990 0.0737 RAVaR 0.0992 0.0674 0.0498 t(3) QVaR 0.2420 0.1758 0.1277 RVaR 0.1785 0.1290 0.0942 QAVaR 0.5385 0.4255 0.3207 RAVaR 0.4517 0.3232 0.2085 CN(1;9) QVaR 0.4322 0.3006 0.2180 RVaR 0.2928 0.1955 0.1447 QAVaR 0.5471 0.3844 0.2658 RAVaR 0.3373 0.2311 0.1636 LN(0;1) QVaR 0.5814 0.3905 0.2959 RVaR 0.4095 0.2670 0.1975 QAVaR 1.1986 0.8957 0.7275 RAVaR 0.9503 0.6432 0.4754

Table2illustratessamplesizeeectonMAEofestimators.Asexpected,bothestimatorsperform betterassamplesizeincreases.MAEofRVaR(RAVaR)isstillsmallerthanthatofQVaR(QAVaR) across allsamplesizes.

Next, we obtain asymptotic variances (derived in Section3) andcompare that with empirical (nitesample)variancesofbothestimators.Figure4reportsasymptoticandnitesample eÆcien-cies of bothestimators forthe conditional VaR where R=500, anderror follows N(0;1) (results aresimilarforothererrordistributions).InFigure4-(a),weseethatasymptoticvarianceofRVaR (dash-dot line)is smallerthan that of QVaR (solid line) except atx

k

near 0.In fact,asymptotic variance is aected by how far x

k

is away from 0 (which is the mean of explanatory variable in thesimulation);whenx

k

is furtherfromthemean,thedierencebetweenasymptoticvariancesof bothestimators is bigger. Figure4-(b) provides empirical variance of both estimatorsacross 500 replications.Empirical variance of RVaR is (equal or) smaller than that of QVaR at allx

k . Fig-ure4-(c)andFigure4-(d)compareasymptoticvariancestoempiricalvariancesofbothestimators. Itisclearthatasymptoticvariancesaretoprovideagoodapproximationtotheempiricalonesfor bothestimators.

(21)

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

0

0.005

0.01

0.015

0.02

0.025

0.03

0.035 x

_k

Variance

QVaR

RVaR

(a)Asymptoticvariance

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

0

0.005

0.01

0.015

0.02

0.025

0.03

0.035 x

_k

Variance

QVaR

RVaR

(b)Empiricalvariance

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

0

0.005

0.01

0.015

0.02

0.025

0.03

0.035 x

_k

Variance

Asymptotic

Empirical

(c) QVaR:Asymptoticvs.Empirical

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

0

0.005

0.01

0.015

0.02

0.025

0.03

0.035 x

_k

Variance

Asymptotic

Empirical

(d)RVaR:Asymptoticvs.Empirical

Figure4 ConditionalVaR:asymptoticandempiricalvariance(ErrorN(0;1),=0:95,N=1000,R=500)

Figure 4 illustrates asymptotic and empirical variances of both estimators for AVaR. Insights obtainedfromtheresultsaresimilartotheVaRcase.However,Figure4-(c)indicatesthatempirical variancesofQAVaRarelargerthanasymptoticvariances,especiallywhenx

k

isfarfromthemean. For this case, asymptoticeÆciency of QAVaR may not very informative on its behavior innite sample. Results are similarfor othererror distributions except t(3). Whenthe error follows t(3), asymptotic (empirical) variances of QAVaR are smaller than that of RAVaR except when x

k is closetothe boundary(asshowninFigure4).

TofurtherinvestigatethenitesampleeÆcienciesandrobustnessofbothestimatorscomparedto theasymptoticones,weprovideempiricalcoverageprobabilities(CP)ofatwo-sided95%(nominal)

(22)

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

0

0.005

0.01

0.015

0.02

0.025

0.03

0.035 x

_k

Variance

QAVaR

RAVaR

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

0

0.005

0.01

0.015

0.02

0.025

0.03

0.035 x

_k

Variance

QAVaR

RAVaR

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

0

0.005

0.01

0.015

0.02

0.025

0.03

0.035 x

_k

Variance

Asymptotic

Empirical

(c)QAVaR:Asymptoticvs.Empirical

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

0

0.005

0.01

0.015

0.02

0.025

0.03

0.035 x

_k

Variance

Asymptotic

Empirical

(d)RAVaR:Asymptoticvs.Empirical

Figure5 ConditionalAVaR:asymptoticandempiricalvariance(ErrorN(0;1),=0:95,N=1000,R=500)

condence interval (CI) in Table 3 (dierence between CP and 0.95 is given in parentheses). For each replication, the empirical condence interval is calculated from the sample version of asymptoticvariance(whenappliedtothevaluesof an observedsampleof a givensize).Then,for given x

k

, the proportion of the 500 replications where the obtained condenceinterval contains thetrue(theoretical)valueiscalculated,andtheseproportionsareaveragedacross alltestpoints. For N(0;1) and CN(1;9) error distributions, the resulting CP of RVaR (RAVaR) is very close to 0.95 whileempirical CI for QVaR (QAVaR) under-covers (resulting CP is smaller than 0.95). For t(3) and LN(0;1) error distributions,CP of RVaR (RAVaR) drops, yet maintains somewhat adequateCPwhichisalotbetterthanCPofQVaR(QAVaR).CIofQAVaRunder-coversseriously

(23)

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35 x

_k

Variance

QAVaR

RAVaR

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35 x

_k

Variance

QAVaR

RAVaR

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35 x

_k

Variance

Asymptotic

Empirical

(c)QAVaR:Asymptoticvs.Empirical

−2

−1.5

−1

−0.5

0

0.5

1

1.5

2

0

0.05

0.1

0.15

0.2

0.25

0.3

0.35 x

_k

Variance

Asymptotic

Empirical

(d)RAVaR:Asymptoticvs.Empirical

Figure6 ConditionalAVaR:asymptoticandempiricalvariance(Errort(3),=0:95,N=1000,R=500)

(resultingCP is about0.7)andthisindicatesQAVaR proceduremay be veryunstable andneeds rather wider CI than other estimators to overcome its sensitivity. Note that RVaR (RAVaR) is moreconservativethanQVaR (QAVaR) regardlessof theerrordistributions.

Wecoulddrawsimilarconclusionsforothersamplesizesand values.Thatis,RVaR(RAVaR) performsbetterandprovidesstableresultsthanQVaR(QAVaR)underdierenterrordistributions.

5. Illustrative Empirical Examples

In thissection, we demonstrate consideredmethods toestimateconditional VaR andAVaR with realdata; dierent nancialasset classes.Let usrstpresent an example of CreditDefaultSwap

(24)

Table3 Coverage probabilitywith=0:95;N=1000(averagedoveralltestpoints) Error QVaR RVaR QAVaR RAVaR

N(0;1) 0.9167 0.9551 0.8442 0.9552 (0.0333) (-0.0051) (0.1058) (-0.0052) t(3) 0.9044 0.9269 0.7088 0.9080 (0.0456) (0.0231) (0.2412) (0.0420) CN(1;9) 0.9262 0.9428 0.8824 0.9548 (0.0238) (0.0072) (0.0676) (-0.0048) LN(0;1) 0.9185 0.9276 0.6930 0.9185 (0.0315) (0.0224) (0.2570) (0.0315)

(CDS). CDS is the most popular credit derivative in the rapidly growing credit markets (see FitchRatings2006, fora detailedsurvey of the creditderivativesmarket). CDScontractprovides insuranceagainsta defaultbya particularcompany,a poolofcompanies,orsovereign entity.The rate of payments made per year bythe buyer is known as the CDS spread (in basis points). We focusontherisk ofCDStrading(longorshortposition)ratherthanontheuseofaCDStohedge credit risk. The CDS dataset obtainedfrom Bloomberg consists of 1006 daily observations from January2007toJanuary2011.LetthedependentvariableY bedailypercent change,(Y(t+1) Y(t))=Y(t)100,ofBankofAmericaCorp(NYSE:BAC)5-yearCDSspread,explanatoryvariables X

1

bedailyreturnofBACstockprice,andX 2

bedailypercentchangeofgeneric5-yearinvestment gradeCDX spread (CDX.IG). We use the term \percent change"ratherthan return becausethe return of CDScontract is not same as the return of CDSspread (e.g., see O'KaneandTurnbull 2003,for an overview of CDSvaluationmodels).Residuals obtainedfrom thisdatasetare heavy-taileddistributed(similartoFigure4-(b)).

Figure 7 shows estimatedconditionalVaR (RVaR) of BAC CDS spread percentchange (result ofQVaRissimilar).Sinceonecantake eithershortorlongposition,we presentbothtailriskwith allvaluesof whichranges from0.01 to0.99;<0:5correspondstothelefttail(shortposition) and right tail (long position), otherwise. It is clear that RVaR of certain dates are much higher (lower) than normal level due to the dierent daily economic conditions re ectedby BAC stock price and CDX spread. This indicates the specic (daily) economic conditions should be taken accountfortheaccurateestimationofrisk,andthereforeemphasizetheimportanceof conditional

(25)

Figure7 EstimatedconditionalVaR(RVaR)forBACCDSspreadpercentchangefor=0:01;:::;0:99

risk measures.Note that givena specicdate,estimatedRVaR curve alongthedierent values is asymmetricsincethedistributionof CDSspreadpercentchange isnot symmetric.

To compare the prediction performance of both estimators, we forecast 603 one-day-ahead (tomorrow's)VaR(AVaR)giventhecurrent(today's)valueofexplanatoryvariablesusingarolling window of the previous 403 days. Figure 8 presents forecasting results of QVaR and RVaR with =0:05on603out-of-sample.Bothestimatorsshowsimilarbehaviors,butRVaRseemslittlemore stable. Following ideas in McNeilandFrey(2000) and Leorato etal. (2010), \violation event" is said to occur whenever observed CDS spread percent change falls below the predicted VaR (we can nda few violation events from Figure8). Also, the forecasterror of AVaR is dened as the dierence between the observed CDS spread percent change and the predicted AVaR under the violationevent.Bydenition,theviolationeventprobabilityshouldbeclosetoandtheforecast error should be close to zero. Table 4 presents the prediction performance(violation event prob-abilityfor VaR,mean andMAEof forecasterror forAVaR inparenthesis)of bothestimatorsfor =0:01 and 0:05. In-sample statistics show that both estimators tthe data well; the violation

(26)

09/08

01/09

06/09

11/09

04/10

08/10

01/11

−50

−40

−30

−20

−10

0

10

20

30 Date

conditional VaR (percent change)

CDS

QVaR

09/08

01/09

06/09

11/09

04/10

08/10

01/11

−50

−40

−30

−20

−10

0

10

20

30 Date

conditional VaR (percent change)

CDS

RVaR

Figure8 RiskpredictionofBACCDS:QVaRandRVaR(=0.05)

Table4 RiskpredictionperformanceofBACCDS In-sample Event(%) Mean MAE QVaR(QAVaR) 0.01 0.9950 (0.1965) (1.3118)

RVaR(RAVaR) 0.01 0.9950 (-0.8630) (2.8183) QVaR(QAVaR) 0.05 4.9751 (0.2287) (2.5016) RVaR(RAVaR) 0.05 4.9751 (-0.0269) (2.8090) Out-of-sample Event(%) Mean MAE QVaR(QAVaR) 0.01 0.8292 (1.4546) (2.4421)

RVaR(RAVaR) 0.01 0.8292 (1.1052) (4.0615) QVaR(QAVaR) 0.05 3.6484 (1.3740) (3.1099) RVaR(RAVaR) 0.05 4.4776 (-0.3722) (3.3681)

event probabilitiesare very close to and forecast errors are very small. Out-of-sample perfor-mances of both estimatorsare very similar for =0:01, even thoughthe forecast errorsincrease a little compared to in-sample cases. For =0:05, RVaR (RAVaR) seems perform better; event probabilitiesare closerto 0.05andforecasterrorsaresmaller.

Next, we applyconsideredmethodstoone of theUS equities; InternationalBusinessMachines Corp (NYSE). The dataset contains 1722 daily observation from December 2005 to December

(27)

Table5 RiskpredictionperformanceofIBMstock In-sample Event(%) Mean MAE QVaR(QAVaR) 0.01 1.0180 (-0.1305) (0.5727)

RVaR(RAVaR) 0.01 0.9397 (-0.3481) (0.8926) QVaR(QAVaR) 0.05 5.0117 (0.0468) (1.0204) RVaR(RAVaR) 0.05 4.9334 (-0.0225) (1.1579) Out-of-sample Event(%) Mean MAE QVaR(QAVaR) 0.01 2.3511 (0.6171) (1.1028)

RVaR(RAVaR) 0.01 1.8809 (0.5023) (0.6827) QVaR(QAVaR) 0.05 6.7398 (0.4787) (1.3086) RVaR(RAVaR) 0.05 6.1129 (0.4778) (1.2387)

2010.Let the dependent variableY be the daily logreturn, 100*log(Y(t+1)/Y(t)),of IBM stock price, explanatory variables X

1

be the log return of S&P 500 index, and X 2

be the lagged log return.Similar toCDS example,we forecast638 one-day-ahead (tomorrow's) VaR (AVaR) given the current (today's) value of explanatory variables using a rolling window of the previous 639 days.Residualsobtainedfrom thisdatasetareheavy-taileddistributed.Table4 comparestherisk prediction performance of IBM stock return. Bothestimators perform well for in-sample predic-tion. For out-of-sample prediction, both estimators behave similarly for =0:05, but violation event probability is largerthan 0.05.For=0:01, RVaR (RAVaR) seems a bit better,but event probabilityexceeds0.01.

Finally, we illustrate how crude oil price had impacted the US airlines' risk as we mentioned in Section 1. Crude oil prices had continuedto rise since May 2007 and peaked alltime high in July2008,rightbeforethebrinkoftheUSnancialsystemcollapse.Wecomparethemovementof estimatedVaRforthreeairlinestocksgivencrudeoilpricechange;DeltaAirlines,Inc(NYSE:DAL), American Airlines,Inc(NYSE:AMR), andSouthwestAirlines Co(NYSE:LUV). Figure9 depicts RVaR movement with =0:05 from May2007 to July 2008 (QVaR shows similar patterns).For easy comparison, we standardize all units relative to the starting date. As we can see, crude oil pricehadjumped150%duringthistimespan.On theotherhand,RVaR of LUVincreasedabout 15% whilethat of AMR increased120% andthat of DALincreased 90% (in magnitude). Infact, dierent airlines have dierent strategies to hedge the risk on oil price uctuations and this in turnaectstheriskofairlines'stockmovement.Forexample,SouthwestAirlinesiswellknownfor

(28)

05/07

07/07

09/07

12/07

02/08

05/08

07/08

−150

−120

−90

−60

−30

0

30

60

90

120

150 Crude Oil

DAL

AMR

LUV

Figure9 Airlineequities:RVaRconditionaloncrudeoilprice(=0.05)

hedging crudeoil pricesaggressively.On the other hand, Delta Airlines does little hedge against crudeoilprice,butoperates alot of international ights. American Airlinesdoesnothave strong hedging againstcrude oilprice either, andoperates less international ights than Delta Airlines. Our estimationresults conrmtherm specicrisk regardingcrudeoilprice uctuations.

6. Conclusions

Value-at-Risk and Average Value-at-Risk are widely used measures of nancialrisk. In order to accuratelyestimateriskmeasures,takingintoaccountthespeciceconomicconditions,we consid-ered two estimationprocedures for conditionalrisk measures;oneis based on residual analysisof thestandardleastsquaresmethod(LSRestimator)andtheotherisbasedonmixedM-estimators (mixedquantileestimator).Largesamplestatisticalinferencesof bothestimatorsarederivedand compared. Inaddition, nitesamplepropertiesof bothestimatorsareinvestigatedand compared aswell.MonteCarlosimulationresults,under dierent error distributions,indicatethat the LSR estimatorsperformbetterthantheir(mixed)quantilescounterparts.Ingeneral,MAEand asymp-totic/empiricalvarianceof theLSRestimatorsaresmallerthanthatof quantilebasedestimators.

(29)

We alsoobservethat asymptoticvarianceof estimatorsapproximatesthe nitesampleeÆciencies wellforreasonablesamplesizes usedin practice.However, we mayneed moresamplesto guaran-teean acceptableeÆciencyof thequantilebasedestimatorforAverageValue-at-risk comparedto other estimators. Prediction performances on the real data example suggest similar conclusions. Infact,residualbasedestimatorscanbe calculatedeasilyandthereforetheLSRmethodcouldbe implemented eÆciently in practice. Moreover, LSR method can be easily applied to the general class of law invariant risk measures. In this study, we assume a static model with independent error distributions.Extension of consideredestimationprocedures incorporatingdierent aspects of (dynamic)timeseriesmodelscouldbe an interestingtopicforthefurtherstudy.

Appendix A: Asymptotics for LSR Estimatorof V @R

(Yjx)

Suppose, forthe sake of simplicity, that support of the distributionof X i is bounded,i.e., X i is boundedw.p.1. SinceN 1 X T

X convergesw.p.1 toandby(27),wehavethat

j" i e i jO p (N 1 ) N X j=1 " j :

We can assumehere thatE[ " i ]=0,andhence P N j=1 " j =O p (N 1=2 ). Itfollowsthat " (dNe) e (dNe) =O p (N 1=2 ): (45)

Supposenowthatthesetof population-quantilesisasingleton.Then ^ F

1 "

()convergesw.p.1 tothe populationquantileF

1 "

()=V @R

("), and henceby (45), we have that e (dNe) converges in probabilityto F 1 " (). That is, [ V @R

(e) is a consistent estimator of V @R

("), and hencethe estimator ~ 0 +x T ~ + [ V @R

(e)isa consistentestimatorof V @R

(Yjx).

Let us consider the asymptotic eÆciency of the residual based V @R estimator. It is known that ~ 0 +x T ~

is an unbiased estimator of the true expected value 0 + x T and N 1=2 h ~ 0 0 +x T ( ~ ) i

convergesin distributiontonormalwith zeromeanandvariance

2 [1;x T ] 1 [1;x T ] T : (46) Also, N 1=2 " (dNe) V @R (")

converges indistributiontonormalwith zeromeanandvariance

! 2 := (1 ) [f " (F 1 ())] 2 ; (47)

(30)

provided thatdistributionof "hasnonzerodensityf " () atthe quantile F 1 " ().

Let us also estimate the asymptotic variance of the right hand side of (27). We have that N timesvarianceof thesecondterm intheright handsideof (27) canbe approximatedby

2 E [1;X T i ] 1 [1;X T i ] T = 2 (k+1):

Wealsohavethatrandomvectors( ~ 0

; ~

)andeareuncorrelated.Therefore,iferrors" i

havenormal distribution, then vectors (

~ 0

; ~

) and e have jointlya multivariatenormal distributionandthese vectorsareindependent.Consequently,

~ 0 +x T ~ and [ V @R

(e)areindependent.Fornotnecessarily normaldistribution,thisindependenceholdsasymptoticallyandthusasymptotically

~ 0 +x T ~ and [ V @R

(e)areuncorrelated.

Now,wecancalculatetheasymptoticcovarianceofthecorrespondingterms " (dNe) V @R (") and " (dNe) e (dNe) as 2 k +1 2

.Thus,asymptoticvarianceoftheresidualbasedV @R

estima-torcan be approximatedas

N 1 ! 2 + 2 [1;x T ] 1 [1;x T ] T : (48)

Appendix B: Asymptotics for LSR Estimatorof AV @R (Yjx) Theestimator \ AV @R

(e)canbecomparedwiththecorrespondingrandomvariablewhichisbased on theerrorsinsteadofresiduals

\ AV @R ("):=inf t2R n t+ 1 (1 )N P N i=1 [" i t] + o = [ V @R (")+ 1 (1 )N P N i=1 h " i [ V @R (") i + = " (dNe) + 1 (1 )N P N i=dNe+1 " (i) " (dNe) : (49) Note that \ AV @R

(")isnotan estimatorsince errors" i are unobservable. By (45), we have that [ V @R (") [ V @R (e) =O p (N 1=2 ) (50)

and it is known that \ AV @R

(") converges w.p.1 to AV @R

(") as N!1, provided that " has a nite rst order moment. It follows that

\ AV @R

(e) converges in probability to AV @R ("), and hence ~ +x T ~ + \

(31)

Lets discuss asymptotic properties of the residual based AV @R

estimator. As it was pointed out in Appendix A , random vectors (

~ 0

; ~

) and e are uncorrelated, and hence asymptotically ~ 0 +x T ~ and \ AV @R

(e) are independent and hence uncorrelated. Assuming that -quantile of F

"

() isunique,wehave byDelta theorem

\ AV @R (e)=V @R (")+(1 ) 1 N 1 N X i=1 [e i V @R (")] + +o p (N 1=2 ) (51) and \ AV @R (")=V @R (")+(1 ) 1 N 1 N X i=1 [" i V @R (")] + +o p (N 1=2 ): (52)

Equation(52)leadstothefollowingasymptoticresult(cf.Trindadeet al.2007,Shapiroetal.2009, section6.5.1) N 1=2 \ AV @R (") AV @R (") D !N(0; 2 ); (53) where 2 =(1 ) 2 Var [" V @R (")] +

. Moreover, ifdistributionof " hasnonzerodensityf " () atV @R ("), then E \ AV @R (") AV @R (")= 1 2Nf " (V @R (")) +o(N 1 ): (54)

From the equation (51) and (52), the asymptoticvariance of

\ AV @R (") \ AV @R (e) can be boundedby(1 ) 1 N 2 2 k+1

andwe canapproximatetheasymptoticcovarianceof the cor-respondingterms, \ AV @R (") AV @R (") and \ AV @R (") \ AV @R (e) as (1 ) 1 N 2 2 k +1 2 . Thus,asymptoticvarianceof theresidualbasedAV @R

estimatorcanbe approximatedas

N 1 2 + 2 [1;x T ] 1 [1;x T ] T : (55)

Appendix C: Asymptoticsfor the Mixed QuantileEstimator

It is possible to derive asymptotics of the mixed quantile estimator. For the sake of simplicity, let us start with a sample estimate of S(X), with

j

and j

, j=1;:::;r, given in (17). That is, let X

1 ;:::;X

N

be an iid sample (data) of the random variable X, and X (1)

:::X (N)

be the corresponding order statistics. Then the corresponding sampleestimate is obtainedby replacing

(32)

thetruedistributionF ofXbyitsempiricalestimate ^ F.Consequently,(1 ) 1 S(X)isestimated by (1 ) 1 r X j=1 j ^ F 1 ( j )= 1 r r X j=1 X (dN j e) : (56)

This canbe comparedwith thefollowingestimatorof AV @R

(X) basedon sampleversion of (2): X (dNe) + 1 (1 )N P N i=dNe+1 X (i) X (dNe) = 1 N dNe (1 )N X (dNe) + 1 (1 )N P N i=dNe+1 X (i) : (57)

Assuming that N is an integerandtakingr:=(1 )N, we obtainthat the right handsides of (56) and(57) are thesame.

Asymptotic variance of the mixed quantile estimator can be calculated as follows. Consider problem(43).The optimalsolution ofthat problemis

? = , ? j = 0 +V @R j (")= 0 +F 1 " ( j );j=1;:::;r; and ? 0 = P r j=1 j ? j = 0

. Assume that " has continuous distribution with cdf F "

() and density function f

"

(). Then conditional on X, the asymptotic covariance matrix of the corresponding sample estimator ( ;) of ( ? ; ? ) is N 1 H 1 H 1

, where H is the Hessian matrix of second orderpartialderivativesof E

h P r j=1 j ("+ 0 j +( ) T X) i atthepoint ( ? ; ? ),andis thecovariancematrixof therandom vector

Z:= r X j=1 r j "+ 0 j +( ) T X ;

where the gradients are taken with respect to (;) at (;)=( ? ; ? ) (e.g., Shapiro 1989). We have r X j=1 r j "+ 0 j +( ) T X = r X j=1 0 j "+ 0 j +( ) T X ! X; r j j "+ 0 j +( ) T X = 0 j "+ 0 j +( ) T X ; with 0 j () is givenin(39). Note that E[ 0 (" F 1 " ( j

(33)

E

ZZ T

and we can compute = E XX T T , where =E h P r j=1 0 j (" F 1 " ( j )) i 2 , =[ 1 ;:::; r ]with j =E " r X i=1 0 i " F 1 " ( i ) ! 0 j " F 1 " ( j ) X # ;j=1;:::;r; and ij =E h 0 i (" F 1 " ( i )) 0 j (" F 1 " ( j )) i ,i;j=1;:::;r. The Hessianmatrix H canbecomputedasH=

E XX T F F T D ,where = P r j=1 j with j = @E h 0 j ("+ 0 ? j +t) i @t t=0 = @[ j (1 F " (F 1 " ( j ) t))+( j 1)F " (F 1 " ( j ) t)] @t t=0 = j f " (F 1 " ( j )) (1 j )f " (F 1 " ( j ))=f " (F 1 " ( j )); j=1;:::;r; F =[F 1 ;:::;F r ]with F j = j E[X],j=1;:::;r, andD=diag( 1 ;:::; r ). Since 0 = T , we have that 0 + T x=[x T ; T ][

;], and hence the asymptotic variance of 0 + 0 T x isgivenbyN 1 [x T ; T ]H 1 H 1 [x ;]. References

Acerbi,C.2002. Spectralmeasuresofrisk:acoherentrepresentationofsubjectiveriskaversion. Journalof Banking &Finance 261505{1518.

Artzner,P., F.Delbaen, J.-M.Eber,D. Heath.1999. Coherentmeasures of risk. Mathematical Finance 9 203{228.

Berkowitz,J,M Pritsker,M Gibson,H Zhou. 2002. How accurateare value-at-riskmodelsat commercial banks. Journal ofFinance 571093{1111.

Bluhm, Christian, Ludger Overbeck, Christoph Wagner. 2002. An Introduction to Credit Risk Modeling (Chapman&Hall/CrcFinancial Mathematics Series). 1sted.Chapman andHall/CRC.

Cai, Z., X. Wang. 2008. Nonparametric estimation of conditional var and expected shortfall. Journal of Econometrics 147(1)120{130.

Chen, S. X., C. Y. Tang. 2005. Nonparametric inference of value-at-riskfor dependent nancial returns. JournalofFinancial Econometrics 3(2)227{255.

(34)

Chernozhukov, V., L. Umantsev. 2001. Conditional value-at-risk: Aspects of modeling and estimation. Empirical Economics 26(1)271{292.

Detlefsen,K.,G.Scandolo.2005. Conditionalanddynamicconvexriskmeasures.FinanceandStochastics . DuÆe,D.,K.J.Singleton.2003.CreditRisk:Pricing,Measurement,andManagement.Princeton,Princeton

UniversityPress.

Engle,R. F., S Manganelli.2004. Caviar:Conditional autoregressivevalueat riskby regressionquantiles. JournalofBusiness&EconomicStatistics 22367{381.

Fermanian, J.D., O. Scaillet. 2005. Sensitivity analysis of var and expected shortfall for portfolios under nettingagreements. Journalof BankingandFinance 29927{958.

FitchRatings.2006. Globalcredit derivativessurvey.

Follmer,H.,A.Schied.2011.Stochasticnance:Anintroductionindiscretetime. WalterdeGruyter&Co., Berlin.

Frey,R.,A. J.McNeil.2002. Varandexpectedshortfallin portfoliosofdependentcreditrisks:Conceptual andpracticalinsights. JournalofBanking &Finance 26(7)1317{1334.

Gaglianone,W.,L.Lima,O.Linton,SmithD. 2011. Varandexpectedshortfallin portfoliosof dependent creditrisks:Conceptualandpracticalinsights.JournalofBusiness&EconomicStatistics 29150{160. Giesecke,K.,S. Weber.2008. Measuringthe riskoflargelosses. Journal of InvestmentManagement 6(4)

1{15.

Gneiting, T. 2009. Evaluating point forecasts. Institut fur Angewandte Mathematik, Universitpreprint, preprint.

Huber,P.J.1981. RobustStatistics. Wiley,NewYork.

Jackson,Patricia, William Perraudin. 2000. Regulatory implications of credit risk modelling. Journal of Banking &Finance 24(1-2)1{14.

Jorion,P.2003. Financial Risk Manager Handbook. 2nded. Wiley,NewYork. Koenker,R.2005. QuantileRegression. CambridgeUniversityPress,Cambridge,UK.

Leorato,S.,F.Peracchi,A.V.Tanase.2010. Asymptoticallyecientestimationoftheconditionalexpected shortfall. EIEFWorkingPapersSeries1013,EinaudiInstituteforEconomicandFinance(EIEF).

(35)

McNeil,A.J.,R.Frey.2000.Estimationoftail-relatedriskmeasuresforheteroscedasticnancialtimeseries: Anextremevalueapproach. Journalof Empirical Finance 7271{300.

O'Kane, D., S. Turnbull. 2003. Valuation of credit default swaps. Quantitative credit research quarterly, LehmanBrothers.

Peracchi,F.,A.V.Tanase.2008.Onestimatingtheconditionalexpectedshortfall.AppliedStochasticModels inBusinessandIndustry 24471493.

P ug,G., W. Romisch. 2007. Modeling, Measuring and Managing Risk. World ScienticPublishing Co., London.

P ug,G.,N.Wozabal.2010. Asymptoticdistributionoflaw-invariantriskfunctionals. Financeand Stochas-tics 14397{418.

Rockafellar, R. T., S. Uryasev. 2002. Conditional value-at-risk for general lossdistributions. Journal of Banking&Finance 26(7)1443{1471.

Rockafellar,R.T.,S. Uryasev,M.Zabarankin.2008. Risktuning withgeneralizedlinearregression. Math-ematics ofOperationsResearch 33712{729.

Scaillet, O. 2004. Nonparametric estimation and sensitivity analysis of expected shortfall. Mathematical Finance 14(1)115{129.

Scaillet, O.2005. Nonparametricestimationofconditionalexpected shortfall. InsuranceandRisk Manage-mentJournal 72639{660.

Shapiro, A. 1989. Asymptotic properties of statistical estimators in stochastic programming. Annals of Statistics 17841{858.

Shapiro,A.,D.Dentcheva,A.Ruszczynski.2009.LecturesonStochasticProgramming:ModelingandTheory. SIAM,Philadelphia.

Trindade,A., S.Uryasev,A. Shapiro,G.Zrazhevsky.2007. Financial predictionwithconstrainedtailrisk. JournalofBanking &Finance 313524{3538.

Weber,S. 2006. Distribution-invariantriskmeasures,information,anddynamic consistency. Mathematical Finance 16(2)419{441.

Wozabal, D., N. Wozabal. 2009. Asymptotic consistency of risk functionals. Journal of Nonparametric Statistics 21(8) 977{990.

(36)

Zhu, S., M. Fukushima. 2009. Worst-case conditional value-at-risk with application to robust portfolio management. Operations Research 57(5)1155{1168.