Unformatted Document Excerpt
Coursehero >>
Florida >>
University of Florida >>
GEO 6938
Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
Course Hero has millions of student submitted documents similar to the one
below including study guides, practice problems, reference materials, practice exams, textbook help and tutor support.
Application An of Extreme Value Theory for Measuring Risk
Manfred Gilli, Evis Kllezi e
Department of Econometrics, University of Geneva and FAME CH1211 Geneva 4, Switzerland
Abstract Many fields of modern science and engineering have to deal with events which are rare but have significant consequences. Extreme value theory is considered to provide the basis for the statistical modelling of such extremes. The potential of extreme value theory applied to financial problems has only been recognized recently. This paper aims at introducing the fundamentals of extreme value theory as well as practical aspects for estimating and assessing statistical models for tail-related risk measures. Key words: Extreme Value Theory, Generalized Pareto Distribution, Generalized Extreme Value Distribution, Quantile Estimation, Risk Measures, Maximum Likelihood Estimation, Profile Likelihood Confidence Intervals
1
Introduction
The last years have been characterized by significant instabilities in financial markets worldwide. This has led to numerous criticisms about the existing risk management systems and motivated the search for more appropriate methodologies able to cope with rare events that have heavy consequences. The typical question one would like to answer is: "If things go wrong, how wrong can they go? " The problem is then how can we model the rare phenomena that lie outside the range of available observations. In such a situation
Supported by the Swiss National Science Foundation (projects 1252481.97 and 1214056900.99/1). We are grateful to an anonymous referee for corrections and comments and thank Elion Jani and Agim Xhaja for their suggestions. Email addresses: Manfred.Gilli@metri.unige.ch, e Evis.Kellezi@metri.unige.ch (Manfred Gilli, Evis Kllezi).
Preprint submitted to Elsevier Science
8 February 2003
it seems essential to rely on a well founded methodology. Extreme value theory (EVT) provides a firm theoretical foundation on which we can build statistical models describing extreme events. In many fields of modern science, engineering and insurance, extreme value theory is well established (see e.g. Embrechts et al. (1999), Reiss and Thomas (1997)). Recently, more and more research has been undertaken to analyze the extreme variations that financial markets are subject to, mostly because of currency crises, stock market crashes and large credit defaults. The tail behaviour of financial series has, among others, been discussed in Koedijk et al. (1990), Dacorogna et al. (1995), Loretan and Phillips (1994), Longin (1996), Danielsson and de Vries (1997b), Kuan and Webber (1998), Straetmans (1998), McNeil (1999), Jondeau and Rockinger (1999), Rootz`n and e Klppelberg (1999), Neftci (2000) and McNeil and Frey (2000). An interestu ing discussion about the potential of extreme value theory in risk management is given in Diebold et al. (1998). This paper deals with the behavior of the tails of financial series. More specifically, the focus is on the use of extreme value theory to assess tail related risk; it thus aims at providing a modelling tool for modern risk management. Section 2 presents some definitions of common risk measures which provide the general background for practical applications. Section 3 reviews the fundamental results of extreme value theory used to model the distributions underlying the risk measures. In Section 4, a practical application is presented where the observations of thirty-one years of daily returns on an index representing the Swiss market are analyzed. In particular, the loss tail is modelled and point and interval estimates of the tail risk measures presented in Section 2 are computed. Finally, in Section 5, a brief analysis of out-of-sample performance of the model is presented which then suggests that the approach is robust and therefore useful.
2
Risk Measures
Some of the most frequent questions concerning risk management in finance involve extreme quantile estimation. This corresponds to the determination of the value a given variable exceeds with a given (low) probability. A typical example of such measures is the Value-at-Risk (VaR). Other less frequently used measures are the expected shortfall (ES) and the return level. 2
VaR Calculation VaR is generally defined as the risk capital sufficient, in most instances, to cover losses from a portfolio over a holding period of a fixed number of days. Suppose a random variable X with continuous distribution function F models losses or negative returns on a certain financial instrument over a certain time horizon. VaR can then be defined as the p-th quantile of the distribution F VaRp = F -1 (1 - p), (1)
where F -1 is the so-called quantile function defined as the inverse of the distribution function F . For internal risk control purposes, most of the financial firms compute a 5% VaR over a one-day holding period. The Basle accord proposed that VaR for the next 10 days and p = 1%, based on a historical observation period of at least 1 year (220 days) of data, should be computed and then multiplied by the `safety factor' 3. The safety factor was introduced because the normal hypothesis for the profit and loss distribution is widely recognized as unrealistic. Expected Shortfall Another informative measure of risk is the expected shortfall (ES) or the tail conditional expectation which estimates the potential size of the loss exceeding VaR. The expected shortfall is defined as the expected size of a loss that exceeds VaR ESp = E(X | X > VaRp ). (2) Return Level If H is the distribution of the maxima observed over successive non overlapping 1 k periods of equal length, the return level Rn = H -1 (1 - k ) is the level expected to be exceeded in one out of k periods of length n.
3
Extreme Value Theory
When modelling the maxima of a random variable, extreme value theory plays the same fundamental role as the Central Limit theorem plays when modelling sums of random variables. In both cases, the theory tells us what the limiting distributions are. Generally there are two related ways of identifying extremes in real data. Let 3
us consider a random variable which may represent daily losses or returns. The first approach then considers the maximum (or minimum) the variable takes in successive periods, for example months or years. These selected observations constitute the extreme events, also called block (or per-period) maxima. In the left panel of Figure 1, the observations X2 , X5 , X7 and X11 represent these block maxima for four periods with three observations.
. . . .. . . .. ... . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
X . 2
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
X7
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . .. . .. . ... . .
X11
. . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . .
X5
. . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
u
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . .
. ....... ....... . . ...
. . . . . . X . . . . 1. . . . . . . . . . ... ... ... ... ... . .... .... .... .... .... .... .... ... ... . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . .
X2
. . 9 . . . . . . . . . . . . X . . . . . 11 . . . . . . . . . . . . . . . . . . . . X8. . . . . . . . . . . . . . . .... .... .... .... .... .... .... ... ... ... ... ... ... ... ... ... . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ... . . . . . . ....... ....... . . . .
. . . .
X7
X
1
2
3
4
Fig. 1. Block-maxima (left panel) and excesses over a threshold u (right panel).
The second approach focuses on the realizations which exceed a given (high) threshold. The observations X1 , X2 , X7 , X8 , X9 and X11 in the right panel of Figure 1, all exceed the threshold u and constitute extreme events. The block maxima method is the traditional method used to analyze data with seasonality as for instance hydrological data. However, threshold methods use data more efficiently and, for that reason, seem to become the choice method in recent applications. In the following subsections, the fundamental theoretical results underlying the block maxima and the threshold method are presented.
3.1
Distribution of Maxima (GEV)
The limit law for the block maxima, which we denote by Mn , with n the size of the subsample (block), is given by the following theorem: Theorem 1 (Fisher and Tippett (1928), Gnedenko (1943)) Let (Xn ) be a sequence of i.i.d. random variables. If there exist constants cn > 0, dn R and some non-degenerate distribution function H such that Mn - dn d H, cn 4
then H belongs to one of the three standard extreme value distributions:
0,
Frchet: e
(x) = (x) =
Weibull: Gumbel:
e-x- , x > 0 -(-x) e ,x0 1,
x0
> 0,
> 0,
x>0
(x) = e-e-x , x R.
The shape of the probability density functions for the standard Frchet, Weibull e and Gumbel distributions is given in Figure 2.
0.6 0.4 0.2 0 -1
... ... . .. . . . . . . . . . . . .. . .. . .. . . . . . . . . . . . . . . . . . . . . . . . .. . .. . . .. . .. . . . .. .. . . . .. .. . ... . . ... . ... . ... . ... . .... . .... . ..... . ...... . . ........ ......... . . ... . . .... ...
0.6 0.4 0.2 0
=1.5
. .... .. .. . . .. .. .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . .. . . .. . .. . .. . .. . . . ... . ... . . . . . .... .... . . . ............ ............ ............... ............... . . .
0.6 0.4 0.2
........... ............ ... ... ... ... ... .. ... .. ... ... .. .. ... ... .. ... .. ... ... .. ... .. .... .... . .. .... .... ..... .. ...... .. . ........ ......... ... ... ...... . . .... ................ ...............
=1.5
0
0
1
2
4
-4
-2
0
1
-2 -1
0
1
3
Fig. 2. Densities for the Frchet, Weibull and Gumbel functions. e
Jenkinson and von Mises suggested the following one-parameter representation
-(1+x)-1/ e e
-e-x
H (x) =
if = 0 if = 0
(3)
of these three standard distributions, with x such that 1 + x > 0. This generalization, known as the generalized extreme value (GEV) distribution, is obtained by setting = -1 for the Frchet distribution, = --1 for the e Weibull distribution and by interpreting the Gumbel distribution as the limit case for = 0. As in general we do not know in advance the type of limiting distribution of the sample maxima, the generalized representation is particularly useful when maximum likelihood estimates have to be computed. 5
3.2 Distribution of Exceedances (GPD) The theory in theorem 1 strongly underlies the approach where we consider the distribution of exceedances. This method is also called the peak over threshold (POT) method. Our problem is illustrated in Figure 3 where we consider an (unknown) distribution function F of a random variable X. We are interested in estimating the distribution function Fu of values of x above a certain threshold u.
1 F (x)
. . .. . . .. ... ........................................................................... ...................................................................... .... . .. . .... .. .. . ......... ..... ..... . . .. ... .... ... . .. . . ... ... . ... . ... . . . . . . . ... . . ... . . . . . . . . . . .. . . . . . . . . . .. .. . . . . . . .. . . .. . . . . . . . . . .. . . . . . . . . . . . .. . .. . . . . . . . . . . . . . . . . . . . . . . . . .. . . . . . . . .. . . .. . . . . . . . .. . . ... . . ... . . . ... .... ...... ... . .
1 Fu (y)
Fu
0
u
xF
x
. . . .. . .. .. .. .......................................................................... .......................................................................... . .. . . .. ........... . . . . .. .. ............. .. .. .. .......... . ......... . ...... ...... . .... . .... . .... ... . .. .. . ... ... . . ... ... . .. . .. . .. . .. . . . .. . . . .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ... . . . ...... ... . .
0
xF - u
y
Fig. 3. Distribution function F and conditional distribution function Fu .
The distribution function Fu is called the conditional excess distribution function (cedf) and is defined as Fu (y) = P (X - u y | X > u), 0 y xF - u (4)
where X is a random variable, u is a given threshold, y = x-u are the excesses and xF is the right endpoint of F . We verify that Fu can be written in terms of F , i.e. Fu (y) = F (u + y) - F (u) F (x) - F (u) = . 1 - F (u) 1 - F (u) (5)
The realizations of the random variable X lie mainly between 0 and u and therefore the estimation of F in this interval generally poses no problems. The estimation of the portion Fu however might be difficult as we have in general very little observations in this area. At this point EVT can prove very helpful as it provides us with a powerful result about the cedf which is stated in the following theorem: 6
Theorem 2 (Pickands (1975), Balkema and de Haan (1974)) For a large class of underlying distribution functions F the conditional excess distribution function Fu (y), for u large, is well approximated by Fu (y) G, (y), where
1 - 1 + y
u ,
-1/
for y [0, (xF - u)] if 0 and y [0, - ] if < 0. G, is the so-called generalized Pareto distribution (GPD). If x is defined as x = u + y the GPD can also be expressed as a function of x, i.e. G, (x) = 1 - (1 + (x - u)/)-1/ . Figure 4 illustrates the shape of the generalized Pareto distribution G, (x) when , called the shape parameter or tail index, takes a negative, a positive and a zero value. The scaling parameter is kept equal to one.
1
. . .. .. . .. ............................................................ ........ .................................................. .. . . .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ... . . . . ... .....
G, (y) =
if = 0 if = 0
1 - e-y/
(6)
1
= -.5
0.5
0.5
0
-/ 4
0
2
8
y
0
. . .. .. . .. .............................................. ........... .............................................. .......... . .. . . ... . ... .. .. .. .. . .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ..... ... . . ....
1
=0
0.5
0
2
4
8
y
0
. . .. . .. .. ................................................... ....................................... .......... ....... ........... .......... .......... ..... ..... .... .... ... .. .. . .. .. . .. . .. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .... . . ... .....
= .5
0
2
4
8
y
Fig. 4. Shape of the generalized Pareto distribution G, for = 1.
As, in general, one cannot fix an upper bound for financial losses we see from Figure 4 that only distributions with shape parameter > 0 are suited to model fat tailed distributions. We will now derive analytical expressions for VaRp and ESp defined respectively in (1) and (2). First we isolate F (x) from (5) F (x) = (1 - F (u)) Fu (y) + F (u) and replacing Fu by the GPD and F (u) by the estimate (n - Nu )/n, where n is the total number of observations and Nu the number of observations above the threshold u, we have F (x) =
Nu n 1 - 1 + (x - u) -1/
+ 1-
Nu n
7
which simplifies to F (x) = 1 -
Nu n 1 + (x - u) -1/
.
(7)
Inverting (7) for a given probability p gives VaRp = u +
n p Nu -
-1 .
(8)
Let us now rewrite the expected shortfall as ESp = VaRp + E(X - VaRp | X > VaRp ) where the second term on the right is the mean of the excess distribution FVaRp (y) over the threshold VaRp . It is known that the mean excess function for the GPD with parameter < 1 is e(z) = E(X - z | X > z) = + z , 1- + z > 0 . (9)
This function gives the average of the excesses of X over varying values of z. A general result concerning the existence of moments is that if X follows a GPD then, for all integers r such that r < 1/, the r first moments exist. 1 Similarly, given the definition (2) for the expected shortfall and using expression (9), for z = VaRp - u and X representing the excesses y over u we obtain ESp = VaRp + VaRp - u + (VaRp - u) = + . 1- 1- 1- (10)
4
Modelling the Fat Tails of Stock Returns
Our aim is to model the tail of the distribution of a market index negative movements in order to estimate extreme quantiles. We analyze the daily returns of the Swiss market represented by the "Credit Suisse General" index 2 for the period from 1011969 to 1121999. The application has been executed in a MATLAB 5.x programming environment. The files containing the data and the code can be downloaded from the URL www.unige.ch/ses/ metri/gilli/evtrm/evtrm_mm.html. Figure 5 shows the plot of the n = 8065 observed daily returns.
See Embrechts et al. (1999), page 165. This index has the longest history among indices representing the Swiss market. Data have been extracted from DataStream.
2 1
8
10 0 -10 1971 1974 1976 1979 1982 1984 1987 1990 1993 1995 1998
Fig. 5. Daily returns of the Credit Suisse General index.
A first information about the behaviour of the returns can be obtained by standardizing, i.e. centering and reducing them. In Figure 6, we reproduce the histogram of the standardized returns.
0.015 0.01 0.005 0 -10 -5 0 5 10 15
Fig. 6. Tails of the standardized returns (lower part of histogram).
We will now use the theory introduced in the previous chapter to analyze and model the losses. As, for convenience, we want the losses to be in the right tail, in the following we have changed the sign of the returns so that positive values correspond to losses. First, we will model the exceedances over a given threshold which will enable us to estimate high quantiles and the corresponding expected shortfall. Second, we will consider the distribution of the so called block maxima, which then allows the determination of the return level.
4.1
The Peak Over Threshold (POT) Method
Despite the appealing theoretical framework EVT provides, small sample issues pose problems when it comes to statistical inference. The main problem is the selection of the threshold u. Theory tells us that u should be high in order to satisfy Theorem 2, but the higher the threshold the less observations are left for the estimation of the parameters of the tail distribution function. 9
So far, no automatic algorithm with satisfactory performance for the selection of the threshold u is available. The issue of determining the fraction of data belonging to the tail is treated by Danielsson et al. (1997), Danielsson and de Vries (1997a) and Dupuis (1998) among others. Tools from exploratory data analysis prove helpful in approaching this problem and we will present them together with our application. Let us consider the sample distribution function Fn (xn ) which, for a set of n i observations, given in increasing order xn xn , is defined as n 1 Fn (xn ) = i i n i = 1, . . . , n. (11)
The sample distribution function corresponding to the right tail of our data set is given in Figure 7. Our goal is to estimate the functional form of this portion of the distribution.
1 0.98 0.96 2 4 6 8 10 12 14
Fig. 7. Right portion of the sample distribution for our data.
Graphical Data Exploration Tools Quantile plots (QQplots) can be used to distinguish visually between different distribution functions. Figure 8 shows the sample quantiles plotted against the GPD quantiles. Knowing that for financial data = 1/ takes values in the range [3, 4] (see e.g. (Embrechts et al., 1999, p. 291)) the GPD quantiles are computed with = 0.3.
20
10
0
0
2
4
6
8
10
12
14
Fig. 8. QQplot of sample quantiles exceeding 1 against the corresponding quantiles of GPD distribution.
The picture strongly suggests that the hypotheses that our data follow a GPD distribution is acceptable. 10
Another graphical tool that is helpful for the selection of the threshold u is the sample mean excess plot defined by the points (u, en (u)) , xn < u < xn n 1 (12)
and where the sample mean excess function is defined as en (u) = - u) , n-k+1
n n i=k (xi
k = min{i | xn > u}, i
where n - k + 1 is the number of observations exceeding the threshold u. The sample mean excess function is an estimate of the mean excess function e(u). For the GPD it has been defined in (9) and is linear. Figure 9 shows the sample mean excess plot corresponding to our data. From a closer inspection of the plot in the right panel, which zooms the function for a smaller range of values for u, we suggest trying the value u = 1.56 and u = 3 for the threshold as they are located at the beginning of a portion of the sample mean excess plot which is roughly linear.
4 3.5 3 2.5 2 1.5 1 0.5 0 0 2 u 4 6 8
2 1.8 1.6 1.4 1.2 1 0.8 1.5 2 u 2.5 3
Fig. 9. Sample mean excess plot.
Maximum Likelihood Estimation Given the theoretical presented results in the previous section, we know that the distribution of the observations above the threshold in the tail should be a generalized Pareto distribution (GPD). This is confirmed by the QQ-plot in Figure 8. Different methods can be used to estimate the parameters of the GPD. 3 We use the maximum likelihood estimation method.
These are the maximum likelihood estimation, the method of moments, the method of probability-weighted moments and the elemental percentile method. For comparisons and detailed discussions about their use for fitting the GPD to data, see Hosking and Wallis (1987), Grimshaw (1993), Tajvidi (1996a), Tajvidi (1996b) and Castillo and Hadi (1997).
3
11
For a sample y = {y1 , . . . , yn } the log-likelihood function L(, | y) for the GPD is the logarithm of the joint density of the n observations
-n log -n log
L(, | y) =
- -
1 1
+1
n i=1
n i=1
log 1 + yi if = 0
yi
if = 0.
According to our interpretation of the sample mean excess plot, we computed ^ the values and which maximize the log-likelihood function for the two ^ samples corresponding to a threshold of u = 1.56 and u = 3. We obtained ^ ^ ^ the estimates = 0.35 and = 0.67 for u = 1.56 and = 0.33 and = 0.96 ^ for u = 3. We observe that the shape parameter varies very little between the two values of u and we therefore choose the threshold u = 1.56 leaving 247 observations in the tail instead of 56 for u = 3. Again we may use a QQplot to visually check whether the data points satisfy the GPD assumption. The left panel in Figure 10 shows the plot of the sample quantiles against G,^ quantiles, from where we can conclude that the fit is ^ satisfactory.
14 12 10
0.995 1
8 6 4
0.985 0.99
2 0 0 5 10 15
0.98 0 5 10 15
Fig. 10. QQplot of sample quantiles against G,^ quantiles (left panel) and GPD ^ fitted to the 247 exceedances above the threshold u = 1.56 (right panel).
The GPD fitted to the Nu = 247 exceedances above the threshold u = 1.56 is plotted in the right panel in Figure 10. High quantiles may now be directly read in the plot or computed from equation (8) where we replace the parameters by their estimates VaRp = u +
^ ^ n p Nu ^ -
-1 ,
^ = 0.
(13)
For instance, if we choose p = 0.01 we can compute VaR 0.01 = 2.48 and ES 0.01 = 4.0. 12
Confidence Intervals If we admit that large-sample theory holds for our estimates, we can construct confidence intervals for the parameters and by inverting the likelihood ratio test. 4 As only exceedances enter the estimation procedure, the estimates rely on very small data sets. For this reason one cannot always rely on the asymptotic optimality properties of the maximum likelihood estimators. Tajvidi (1996a) investigates the performance of several bootstrap and likelihoodbased methods for constructing confidence intervals for the parameters and quantiles of the GPD. His conclusion is that the profile likelihood confidence intervals should be corrected for the small sample size using the methodology of Lawley (1956) and that they are comparable to the bias corrected and accelerated bootstrap confidence intervals 5 in terms of accuracy. In addition to single confidence intervals on the parameters, we also consider joint confidence intervals. For example, a joint interval for the parameters and is given by the contour at level 2 of the relative log-likelihood function ,2 ^^ defined as L(, ) - L(, ). To construct single confidence intervals, we need to compute the profile loglikelihood functions. For example, in the case we need an interval estimate for , the profile log-likelihood function is L () = max L(, ) and the 1 - confidence interval is given by values of satisfying ^^ L () > L(, ) -
1 2 2 ,1
where 2 is the 1- quantile of the 2 distribution with 1 degree of freedom. ,1 Figure 11 shows the single and joint confidence intervals for the estimated parameters. In order to further explore the reliability of the confidence intervals we applied the bootstrap method to generate 1000 samples. For each sample, we estimated ^ ^ the parameters and plotted the pairs (i , i ), i = 1, . . . , 1000. These points are plotted in Figure 11 together with the the log-likelihood based confidence intervals. We can verify that about 5% lie outside the 95% joint confidence region computed with the likelihood ratio test. For the marginal distributions we observe that about 13.5% of the estimates of and 18% of the estimates of lie outside the maximum likelihood confidence intervals for a single parameter. We can also observe that the density of the bootstrap estimates differs from
4 5
See for instance Azzalini (1996) for an introduction. See Efron and Tibshirani (1993) or Shao and Tu (1995).
13
the log-likelihood based confidence interval and further investigation about the statistical properties of the estimates would be needed.
1 0.9 0.8 0.7 0.6 0.5 0.4 0.2 0.4 0.6
Fig. 11. Single and joint 95% confidence intervals for and for the POT method. Dots represent 1000 bootstrap estimates.
The empirical marginal distributions of the bootstrap values for and as well as the corresponding empirical distribution of VaR 0.01 and ES 0.01 are reproduced in the plots in Figure 12.
6 4 2 0 6 4 2 0 2.2 2.4 2.6 2.8 VaR0.001 0 0.2 0.4 0.6 0.8
6 4 2 0 2 ES0.001 0.4 0.6 0.8 1
1
0
3
3.5
4
4.5
5
5.5
^ Fig. 12. Empirical bootstrap marginal distributions for (upper left), (upper ^ right), VaR 0.01 (lower left) and ES 0.01 (lower right) for the POT method.
Table 1 summarizes the point estimates, the maximum likelihood (ML) and the bootstrap (BS) confidence intervals of the marginal distributions. 14
Table 1 Point estimates and 95% maximum likelihood (ML) and bootstrap (BS) confidence intervals for the POT method.
^ ^ VaR 0.01 ES 0.01
Lower bound BS ML 0.16 0.23 0.52 0.57 2.33 2.34 3.47 3.53
Point estimate ML 0.35 0.67 2.48 4.00
Upper bound ML BS 0.51 0.52 0.79 0.85 2.65 2.66 5.03 4.67
The results in Table 1 indicate that with probability 0.01 the tomorrow's loss will exceed the value 2.48% and that the corresponding expected loss, that is the average loss in situations where the losses exceed 2.48%, is 4.00%. These point estimates are completed with 95% confidence intervals. Thus the expected loss will, in 95 out of 100 cases, lie between 3.47% and 4.67%. It is interesting to note that the upper bound of the confidence interval for the parameter is such that the first order moment is finite. This guarantees that the estimated expected shortfall, which is a conditional first moment, exists. Log-likelihood based confidence intervals for VaRp can be obtained by using a reparameterized version of GPD defined as a function of and VaRp :
-1
G,VaRp (y) =
1- 1+ 1-
n Nu
pe
y VaRp -u
- n p -1 Nu VaRp -u
y
=0 =0
.
The corresponding probability density function is
- 1 -1
g,VaRp (y) =
-
- n -1 p Nu (VaRp -u) y n p e VaRp -u Nu VaRp -u
1+
- n -1 p Nu VaRp -u
y
=0 =0
.
Figure 13 these likelihood based confidence regions obtained by using this reparameterized version of GPD. 15
0.7
0
0.6
-2
0.5
-4 -6 -8 -10 -12 2 2.5 VaR0.01 3 3.5
0.4 0.3 0.2 0.1 0 2.2 2.4 VaR0.01 2.6 2.8
Fig. 13. Left panel: Relative profile log-likelihood function and confidence interval for ^ VaRp . Right panel: Single and joint confidence intervals at level 95% for and VaRp . Dots represent 1000 bootstrap estimates for the parameters of the POT method.
Similarly, using the following reparameterization for = 0
-
-1 G,ESp = 1 - 1 + y (ESp - u)(1 - )
n p Nu
+
- 1
-1 -1 + g,ESp = y 1 + (1 - )(ESp - u) (ESp - u)(1 - )
n p Nu n p Nu
+
-
-
- 1 -1
we compute a log-likelihood based confidence interval for the expected shortfall ESp . In Figure 14 we see that the log-likelihood confidence interval for ESp is not symmetric with respect to ES 0.01 .
0.7
0
0.6
-2 -4 -6
0.5 0.4 0.3 0.2
-8 -10 3
0.1
4 ES0.01
5
6
0 3
4 ES0.01
5
6
Fig. 14. Left panel: Relative profile log-likelihood function and confidence interval ^ for ESp . Right panel: Single and joint confidence intervals at level 95% for and ESp . Dots represent 1000 bootstrap estimates for the parameters of the POT method.
We observe that in both Figures 13 and 14 the number of bootstrap estimates lying outside the 95% likelihood joint confidence intervals is 49 out of 1000. 16
4.2 Method of Block Maxima We now apply the block maxima method to our daily return data. For this method the delicate point is the appropriate choice of the periods defining the blocks. The calendar naturally suggests periods like months, quarters, etc. In order to avoid seasonal effects, we choose yearly periods which are likely to be sufficiently large for Theorem 1 to hold. Thus our sample has been divided into 31 non-overlapping sub-samples, each of them containing the daily returns of the successive calendar years. Therefore not all our blocks are of exactly the same length. The absolute value of the minimum return in each of the blocks constitute the data points in the sample of minima M which is used to estimate the generalized extreme value distribution (GEV). Figure 15 plots the yearly minima and maxima of our daily returns.
10 0 -10 1971 1974 1976 1979 1982 1984 1987 1990 1993 1995 1998
Fig. 15. Yearly minima and maxima of the daily returns of the Credit Suisse General index.
The standard GEV defined in (3) is the limiting distribution of normalized extrema. Given that in practice we do not know the true distribution of the returns and, as a result, we do not have any idea about the norming constants cn and dn , we use the three parameter specification x-
] - , - [ ] - , [ ] - [
<0 =0 >0
H,, (x) = H
x D,
D=
(14)
of the GEV, which is the limiting distribution of the unnormalized maxima. The two additional parameters and are the location and the scale parameters representing the unknown norming constants. The log-likelihood function we maximize with respect to the three unknown 17
parameters is L(, , ; x) =
i
log(h(xi )),
xi M x-
-1/
(15)
where h(, , ; x) = 1 x- 1+
-1/-1
exp - 1 +
is the probability density function if = 0 and 1 + x- > 0. If = 0 the function h is h(, , ; x) = 1 x- exp - exp - exp - x- .
^ ^ ^ The log-likelihood estimates we obtain are = 0.28, = 1.37 and = 2.65. In Figure 16, we give the plot of the sample distribution and the corresponding fitted GEV distribution.
1 0.8 0.6 0.4 0.2 0 0
5
10
15
Fig. 16. Sample distribution of yearly maxima and corresponding fitted GEV distribution.
In practice the quantities of interest are not the parameters themselves, but the quantiles, also called return levels, of the estimated GEV. The return level Rk is the level we expect to be exceeded in one out of k one year periods:
-1 1 Rk = H,, (1 - k ).
Substituting the parameters , and by their estimates we get ^ Rk =
- ^
^ ^
1 1 - - log(1 - k ) 1 ) k
^ -
=0 =0
.
(16)
^ ^ - log - log(1 -
^ Taking for example k = 10, we obtain for our data R10 = 6.94, which means that the maximum loss observed during a period of one year will exceed 6.94% in one out of ten years on average. 18
Confidence Intervals As already discussed in the case of the GPD estimation, it may prove useful to approach the quantile estimation problem by directly reparameterizing the GEV distribution as a function of the unknown return level Rk . To achieve this, we isolate from equation (16) and substitute it into H,, defined ^ in (14). The GEV distribution function then becomes
for x D defined as
H,,Rk =
exp -
1 (1 - k )
(x
1 - Rk ) + - log(1 - k )
k
- -1/
=0 =0
exp - x-R
and we can directly obtain maximum likelihood estimates for Rk . The profile log-likelihood function can then be used to compute separate or joint confidence intervals for each of the parameters. For example, in the case where the parameter of interest is Rk , the profile log-likelihood function will be defined as L (Rk ) = max L(, , Rk ).
,
D = ] - , [
] - ,
Rk -
1 - log(1 - k )
-
[ <0 =0 >0
] Rk - - log(1 - 1 ) - , [ k
2 where , 1 refers to the (1 - )level quantile of the 2 distribution with 1 ^^ ^ degree of freedom. The function L (Rk ) - L (, , Rk ) is called the relative profile log-likelihood and is plotted in the left panel of Figure 17. For = 0.05 the interval estimate for R10 is [5.29, 11.63].
The confidence interval we then derive includes all values of Rk satisfying the condition 1 ^^ ^ L (Rk ) - L(, , Rk ) > - 2 2 1 ,
In the case where a joint confidence region on two parameters, say and Rk is needed, the profile log-likelihood function, which in this case is a surface, is defined as L (, Rk ) = max L(, , Rk ). In this case the confidence region is defined as the contour at the level - 1 2 2 2 , of the relative profile log-likelihood function ^^ ^ L (, Rk ) - L(, , Rk ). In the right panel of Figure 17, we reproduce the confidence regions at level ^ ^ 95% for the parameters and Rk . We also generated 1000 bootstrap samples 19
and computed the bootstrap confidence intervals for , , and R10 . The pairs (R10 , ) are plotted in the right panel of Figure 17.
0 -1 -2 -3 -4 -5 -6 -7 4 6 8 R10 10 12 14
-0.5 4 6 8 10 R10 12 14 16 0 0.5 1
Fig. 17. Left panel: Relative profile log-likelihood and 95% confidence interval for ^ R10 estimated with the method of block maxima. Right panel: Joint confidence ^ ^ region for and R10 at level 95%. In both panels the maximum likelihood estimates are marked with the symbol .
The empirical marginal distributions of the bootstrap values for , , and R10 are reproduced in the plots in Figure 18.
4 2 1 0 2 1 0.4 0.2 0 R10 0 0.5 1 0 0.5 1 1.5 2 2.5 3 2
0 1.5
2
2.5
3
3.5
4
4
6
8
10
12
Fig. 18. Empirical marginal distributions of the bootstrap estimates.
Table 2 summarizes the point estimates, the maximum likelihood (ML) and the bootstrap (BS) confidence intervals for the reparameterized GEV distribution.
Table 2 Point estimates and 95% maximum likelihood (ML) and bootstrap (BS) confidence intervals.
^ ^ ^ R10
Lower bound BS ML -.11 0.02 0.91 1.09 5.05 5.29
Point estimate ML 0.28 1.37 6.94
Upper bound ML BS 0.58 0.60 1.84 1.86 11.63 9.26
20
5
Concluding Remarks
We have illustrated how extreme value theory can be used to model tail-related risk measures such as value-at-risk, expected shortfall and return level. In order to assess the reliability of the methods presented in the previous sections, we did some out-of-sample analysis by repeating all computations for a subsample of the data covering the period from 1969 to 1989. For the POT method, Table 3 reproduces the point estimates and the bootstrap confidence intervals obtained from the 5477 first observations defining the subsample.
Table 3 Point estimates for the POT method and 95% maximum likelihood (ML) and bootstrap (BS) confidence intervals corresponding to the period 19691989.
^ ^ VaR 0.01 ES 0.01
Lower bound BS ML 0.12 0.24 0.44 0.48 2.02 2.03 3.01 3.07
Point estimate ML 0.41 0.60 2.17 3.60
Upper bound ML BS 0.65 0.68 0.77 0.83 2.33 2.34 5.41 4.78
We observe that the estimated values differ very little from values reported in Table 1 which correspond to the estimates obtained from the whole sample. We also compared VaR estimated with the POT method with the VaR proposed by the Basle accord. Assuming the normal distribution for the observations until 1989, the 1% lower quantile is 1.95. Multiplying this value by 3 gives 5.86, whereas in our calculation the upper bound for the expected shortfall is 4.78. Clearly the POT method provides more accurate information. As far as the estimation of GEV is concerned, the estimated parameters over ^ ^ ^ the 21 first yearly maximum losses are = 0.33, = 1.2 and = 2.36. The 10 corresponding return level R is 6.4, which means that yearly maximum losses will exceed the value 6.4% once in ten years on average. In Figure 19, we verify that in the out-of-sample period this value is exceeded twice, which is not in contradiction with our model. 21
10 0 -10 1971 1976 1982 1987 1993 1998
Fig. 19. Out of sample comparisons of the return level R10 (horizontal dashed line). Observed minima exceeding R10 are marked with a star. GEV distribution is estimated over yearly minima from 1969 to 1989 (marked with +).
Finally, it might be interesting to show how the model allows extrapolation beyond the sample. Using the entire sample, we computed R100 = 15.4%, i.e. the level we expect to be exceeded only in one year every century. This value varies only insignificantly whether the full or the subsample is considered. The models also tells us that the 1987 crash is likely to happen once in 60 years. This is obtained by computing H(13.1), where H is the GEV distribution with ^ ^ ^ parameters = 0.33, = 1.2 and = 2.36. From the POT model, we conclude that the "once a century loss" of 15.4% predicted by GEV occurs once every 60 years. We obtain this result by computing the probability p = 1 - F (15.4) where F is the estimated GPD. Such an event will happen on average every 1/p days which in our case gives 60 years. Similarly using the POT method we can compute that the 1987 crash loss is likely to be exceeded every 37 years. We should mention that the events forecasted by the POT and block maxima method are not exactly the same, which in part explains the difference concerning the results of 60 and 100 years.
References Azzalini, A. (1996). Statistical Inference Based on the Likelihood. Chapman and Hall, London. Balkema, A. A. and de Haan, L. (1974). Residual life time at great age. Annals of Probability, 2:792804. Castillo, E. and Hadi, A. (1997). Fitting the Generalized Pareto Distribution to Data. Journal of the American Statistical Association, 92(440):1609 1620. Dacorogna, M. M., Mller, U. A., Pictet, O. V., and de Vries, C. G. (1995). u 22
The distribution of extremal foreign exchange rate returns in extremely large data sets. Preprint, O&A Research Group. Danielsson, J., de Haan, L., Peng, L., and de Vries, C. (1997). Using a Bootstrap Method to Choose the Sample Fraction in Tail Index Estimation. Mimeo. Danielsson, J. and de Vries, C. (1997a). Beyond the Sample: Extreme Quantile and Probability Estimation. Mimeo. Danielsson, J. and de Vries, C. (1997b). Value-at-Risk and Extreme Returns. Working Paper. Diebold, F. X., Schuermann, T., and Stroughair, J. D. (1998). Pitfalls and opportunities in the use of extreme value theory in risk management. In Refenes, A.-P., Burgess, A., and Moody, J., editors, Decision Technologies for Computational Finance, pages 312. Kluwer Academic Publishers. Dupuis, D. J. (1998). Exceedances over high thresholds: A guide to threshold selection. Extremes, 1(3):251261. Efron, B. and Tibshirani, R. J. (1993). An Introduction to the Bootstrap. Chapman & Hall, New York. Embrechts, P., Klppelberg, C., and Mikosch, T. (1999). Modelling Extremal u Events for Insurance and Finance. Applications of Mathematics. Springer. 2nd ed.(1st ed., 1997). Fisher, R. and Tippett, L. H. C. (1928). Limiting forms of the frequency distribution of largest or smallest member of a sample. Proceedings of the Cambridge philosophical society, 24:180190. Gnedenko, B. V. (1943). Sur la distribution limite du terme d'une srie e alatoire. Annals of Mathematics, 44:423453. e Grimshaw, S. (1993). Computing the Maximum Likelihood Estimates for the Generalized Pareto Distribution to Data. Technometrics, 35(2):185191. Hosking, J. R. M. and Wallis, J. R. (1987). Parameter and quantile estimation for the generalised Pareto distribution. Technometrics, 29:339349. Jondeau, E. and Rockinger, M. (1999). The tail behavior of stock returns: Emerging versus mature markets. Mimeo, HEC and Banque de France. Koedijk, K. G., Schafgans, M., and de Vries, C. (1990). The Tail Index of Exchange Rate Returns. Journal of International Economics, 29:93108. Kuan, C. H. and Webber, N. (1998). Valuing Interest Rate Derivatives Consistent with a Volatility Smile. Working Paper, University of Warwick. Lawley, D. N. (1956). A General Method for Approximating to the Distribution of Likeligood Ratio Criteria. Biometrika, 43:295303. Longin, F. M. (1996). The Assymptotic Distribution of Extreme Stock Market Returns. Journal of Business, 69:383408. Loretan, M. and Phillips, P. (1994). Testing the covariance stationarity of heavy-tailed time series. Journal of Empirical Finance, 1(2):211248. McNeil, A. J. (1999). Extreme value theory for risk managers. In Internal Modelling and CAD II, pages 93113. RISK Books. McNeil, A. J. and Frey, R. (2000). Estimation of tail-related risk measures for heteroscedastic financial time series: an extreme value approach. Journal of 23
Empirical Finance, 7(34):271300. Neftci, S. N. (2000). Value at risk calculations, extreme events, and tail estimation. Journal of Derivatives, pages 2337. Pickands, J. I. (1975). Statistical inference using extreme value order statistics. Annals of Statististics, 3:119131. Reiss, R. D. and Thomas, M. (1997). Statistical Analysis of Extreme Values with Applications to Insurance, Finance, Hydrology and Other Fields. Birkhuser Verlag, Basel. a Rootz`n, H. and Klppelberg, C. (1999). A single number can't hedge against e u economic catastrophes. Ambio, 28(6):550555. Preprint. Shao, J. and Tu, D. (1995). The Jackknife and Bootstrap. Springer Series in Statistics. Springer. Straetmans, S. (1998). Extreme financial returns and their comovements. Ph.D. Thesis, Tinbergen Institute Research Series, Erasmus University Rotterdam. Tajvidi, N. (1996a). Confidence Intervals and Accuracy Estimation for Heavytailed Generalized Pareto Distribution. Thesis article, Chalmers University of Technology. www.maths.lth.se/matstat/staff/nader/. Tajvidi, N. (1996b). Design and Implementation of Statistical Computations for Generalized Pareto Distributions. Technical Report, Chalmers University of Technology. www.maths.lth.se/matstat/staff/nader/.
24
Find millions of documents on Course Hero - Study Guides, Lecture Notes, Reference Materials, Practice Exams and more.
Course Hero has millions of course specific materials providing students with the best way to expand
their education.
Below is a small sample set of documents:
University of Florida - GEO - 6938
1WHY EXTREME VALUE THEORY?1.1 A Simple Extreme Value ProblemMany statistical tools are available in order to draw information concerning specific measures in a statistical distribution. In this textbook, we focus on the behaviour of the extreme values
University of Florida - GEO - 6938
Spatial analysis of density dependent pattern in coniferous forest stands*Janet Franklin 1, Joel Michaelsen 1 & Alan H. Strahler2*, *1Department of Geography, University of California, Santa Barbara, California, 93106; 2Department of Geology and Geograp
University of Florida - GEO - 6938
Generalized extreme value distribution - Wikipedia.http:/en.wikipedia.org/wiki/Extreme_value_distri.Your continued donations keep Wikipedia running!Generalized extreme value distributionFrom Wikipedia, the free encyclopedia(Redirected from Extreme va
University of Florida - GEO - 6938
GeoDa: An Introduction to Spatial Data AnalysisLuc Anselin, Ibnu Syabri and Youngihn Kho Spatial Analysis Laboratory Department of Agricultural and Consumer Economics University of Illinois, Urbana-Champaign Urbana, IL 61801 USAanselin@uiuc.edu, syabri@
University of Florida - GEO - 6938
Geographical Analysis ISSN 0016-7363GeoDa: An Introduction to Spatial Data AnalysisLuc Anselin1, Ibnu Syabri2, Youngihn Kho11Spatial Analysis Laboratory, Department of Geography, University of Illinois, Urbana, IL, 2Laboratory for Spatial Computing an
University of Florida - GEO - 6938
Exploring Spatial Data with GeoDaTM : A WorkbookLuc AnselinSpatial Analysis Laboratory Department of Geography University of Illinois, Urbana-Champaign Urbana, IL 61801http:/sal.agecon.uiuc.edu/Center for Spatially Integrated Social Sciencehttp:/www.
University of Florida - GEO - 6938
Available online at www.sciencedirect.comEconomics Letters 99 (2008) 585 590 www.elsevier.com/locate/econbaseFunctional forms for the negative binomial model for count dataWilliam Greene Department of Economics, Stern School of Business, New York Univ
University of Florida - GEO - 6938
Geographical Processes and the Analysis of Point Patterns: Testing Models of Diffusion by Quadrat Sampling Author(s): D. W. Harvey Source: Transactions of the Institute of British Geographers, No. 40 (Dec., 1966), pp. 81-95 Published by: Blackwell Publish
University of Florida - GEO - 6938
Some Methodological Problems in the Use of the Neyman Type A and the Negative Binomial Probability Distributions for the Analysis of Spatial Point Patterns Author(s): David Harvey Source: Transactions of the Institute of British Geographers, No. 44 (May,
University of Florida - GEO - 6938
Landscape Ecology 15: 467478, 2000. 2000 Kluwer Academic Publishers. Printed in the Netherlands.467Lacunarity analysis of spatial pattern: A comparisonM.R.T. DaleDepartment of Biological Sciences, University of Alberta, Edmonton, Alberta, T6G 2E9, Can
University of Florida - GEO - 6938
GEO 6938 Advanced Quantitative Methods for Spatial Analysis Spring 2012 Timothy J. Fik, Ph.D. Associate Professor Department of Geography University of Florida e-mail: "fik@ufl.edu"Selected Topics include. Point-Pattern/Pattern Analysis & Modeling Cluste
University of Florida - GEO - 6938
Lab #1. Point Pattern Analysis using Quadrat counts Carry out a point pattern analysis using quadrat counts based on grid cells superimposed on a given study (a two dimensional surface) containing a spatial distribution of points that represent the locati
University of Florida - GEO - 6938
Poisson Regression. continuedIn the PR model, the mean and variance V are assumed/restricted to be equal.something that rarely occurs in practice (as real data almost always rejects this restriction when tested).Typically, the variance is greater than t
University of Florida - GEO - 6938
Spatial Diffusion & Pattern AnalysisFive general types of spatial diffusion processes.3 2 11. Expansion Diffusion a simple outward expansion from the source (covering a larger, more extensive area over time).Study area2. Relocation Diffusion the move
University of Florida - GEO - 6938
Nearest-Neighbor MethodsDefining "connectivity" between points Point data can be used in various ways to measure the degree to which the point pattern exhibits spatial autocorrelation.But first, care must be taken in describing the nature of connectivit
University of Florida - GEO - 6938
Analysis of Pattern Measuresat the Local/Regional Scale9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19. 20.Local Moran's I The Score Statistic Also Getis G-statistic Tango's CF (which we will review later) 2 Cumulative test Maximum 2 test Local Quadrat Test
University of Florida - GEO - 6938
Identification of Local Clusters for Count Data: A Model-Based Moran's I TestTonglin Zhang and Ge LinPurdue University and West Virginia University February 14, 2007Department of Statistics, Purdue University, 250 North University Street,West Lafayette
University of Florida - GEO - 6938
Chapter 4 Modelling Counts - The Poisson and Negative Binomial RegressionIn this chapter, we discuss methods that model counts. In a longitudinal setting, these counts typically result from the collapsing repeated binary events on subjects measured over
University of Florida - GEO - 6938
Notes on the Negative Binomial DistributionJohn D. Cook October 28, 2009Abstract These notes give several properties of the negative binomial distribution. 1. Parameterizations 2. The connection between the negative binomial distribution and the binomia
University of Florida - GEO - 6938
On Model Fitting Procedures for Inhomogeneous Neyman-Scott ProcessesYongtao GuanJuly 31, 2006ABSTRACTIn this paper we study computationally efficient procedures to estimate the second-order parameters for a class of inhomogeneous Neyman-Scott processe
University of Florida - GEO - 6938
Spatial AutocorrelationGeography 683 - Introduction to Geographic AnalysisSpatial AutocorrelationGuoxiang Ding Department of Geography1155 Derby Hall Phone: 292-2704 Email: ding.45@osu.edu First law of geography: "everything is related to everything
University of Florida - GEO - 6938
Overdispersion and Poisson RegressionRichard Berk John MacDonald Department of Statistics Department of Criminology University of Pennsylvania November 19, 2007Abstract This article discusses the use of regression models for count data. A claim is often
University of Florida - GEO - 6938
Spatial AutocorrelationMoran's I Geary's C Arthur J. Lembo, Jr. Salisbury UniversitySpatial Autocorrelation First law of geography: "everything is related to everything else, but near things are more related than distant things" Waldo Tobler Many geog
University of Florida - GEO - 6938
Analysing spatial point patterns in RAdrian Baddeley CSIRO and University of Western Australia Adrian.Baddeley@csiro.au adrian@maths.uwa.edu.au Workshop Notes Version 3 October 2008 Copyright c CSIRO 2008Abstract This is a detailed set of notes for a wo
University of Florida - GEO - 6938
136Poisson Regression Analysis13. Poisson Regression AnalysisWe have so far considered situations where the outcome variable is numeric and Normally distributed, or binary. In clinical work one often encounters situations where the outcome variable is
University of Florida - GEO - 6938
Parametric Test Quadrat AnalysisEquations taken from Rogerson, 2001.i=m i =1s2 = (xs2 xi- x )2m -1 m -1 (VMR - 1) z= 2 m is the number of quadrats, x is the mean of the number of points per quadrat, s2 is the variance of the number of points per
University of Florida - GEO - 6938
AN INTRODUCTION TO QUADRAT ANALYSISR.W.ThomasISSN 0306-6142ISBN 0 902246 66 6 1977 R.W. ThomasCONCEPTS AND TECHNIQUES IN MODERN GEOGRAPHY No. 12CATMOG(Concepts and Techniques in Modern Geography) CATMOG has been created to fill a teaching need in th
University of Florida - GEO - 6938
Rate Transformations and SmoothingLuc Anselin Nancy Lozano Julia KoschinskySpatial Analysis Laboratory Department of Geography University of Illinois, Urbana-Champaign Urbana, IL 61801 http:/sal.uiuc.edu/Revised Version, January 31, 2006Copyright c 20
University of Florida - GEO - 6938
Change Detection Thresholds: Alternative Statistical Approaches to Detecting Temporal Change in Spatial PatternsPeter A. Rogerson Daikwon Han Ikuho Yamada Department of Geography National Center for Geographic Information and Analysis University at Buffa
University of Florida - GEO - 6938
The author(s) shown below used Federal funds provided by the U.S. Department of Justice and prepared the following final report: Document Title: Author(s): Document No.: Date Received: Award Number: Crime Analysis Geographic Information System Services: A
University of Florida - GEO - 6938
Spatial AutocorrelationMorans I Gearys C Arthur J. Lembo, Jr. Salisbury UniversitySpatial Autocorrelation First law of geography: everything is related to everything else, but near things are more related than distant things Waldo Tobler Many geograph
University of Florida - GEO - 6938
SaTScan User GuideTMfor version 8.0By Martin Kulldorff February, 2009 http:/www.satscan.org/ContentsIntroduction . 4 The SaTScan Software . 4 Download and Installation . 5 Test Run . 5 Sample Data Sets. 5 Statistical Methodology.
University of Florida - GEO - 6938
Further Methods for Point Pattern AnalysisBailey and Gatrell Chapter 4Variations in Populationn Certain types of events will exhibit clustering due to heterogeneity in the underlying distribution e.g disease cases or crimes will tend to cluster where t
University of Florida - GEO - 6938
Non-technical Overview of Geospatial Statistical MethodsGIS/Mapping and Census Data Second Annual Census Workshop Series Workshop 3: Spatial Statistics, Spatial Research & Confidential Census DataNew York Census Research Data Center (CRDC) Baruch Colleg
University of Florida - GEO - 6938
Package `spatstat'December 21, 2011Version 1.25-1 Date 2011-12-21 Title Spatial Point Pattern analysis, model-fitting, simulation, tests Author Adrian Baddeley <Adrian.Baddeley@csiro.au> and Rolf Turner <r.turner@auckland.ac.nz> with substantial contrib
University of Florida - GEO - 6938
Andrei Rogers and Norbert G. GomarStatistical inference in Quadrat AnalysisThe growing recognition of the need for establishing a systematic and quantitative means for describing and analyzing, the spatial dispersion of activities in urban areas has gen
University of Florida - GEO - 6938
Biometrical Journal 50 (2008) 1, 4357 DOI: 10.1002/bimj.20061033943Parameter Estimation and Model Selection for Neyman-Scott Point ProcessesUshio Tanaka1, Yosihiko Ogata*, 1, 2, and Dietrich Stoyan31 2 3The Graduate University for Advanced Studies, M
University of Florida - GEO - 4167
Review of Matrix AlgebraMatrices A matrix is a rectangular or square array of values arranged in rows and columns. An m n matrix A, has m rows and n columns, and has a general form of a11 a = 21 . am1 a12 a22 . am 2 . a1n . a2 n . . . amn mn mn1Exa
University of Florida - GEO - 4167
University of Florida - GEO - 4167
Geographically Weighted RegressionA Tutorial on using GWR in ArcGIS 9.3Martin Charlton A Stewart FotheringhamNational Centre for Geocomputation National University of Ireland Maynooth Maynooth, County Kildare, Ireland http:/ncg.nuim.ieThe authors grat
University of Florida - GEO - 4167
GEOGRAPHICALLY WEIGHTED REGRESSIONWHITE PAPERMARTIN CHARLTON A STEWART FOTHERINGHAMNational Centre for Geocomputation National University of Ireland Maynooth Maynooth, Co Kildare, IRELANDMarch 3 2009The authors gratefully acknowledge support from a S
University of Florida - GEO - 4167
Lab#1, Spring 2012 (25 points) GEO 4167/GEO 6161 Intermediate Quantitative Methods (Fik) Name: _ Score: _ Instructions: Complete this lab to the best of your abilities. Attach your work sheets, relevant computer output, results, and write-up to this cover
University of Florida - GEO - 4167
Polynomial regressionDaniel Borcard, Dpartement de sciences biologiques, Universit de Montral Reference: Legendre and Legendre (1998) p. 526A variant form of multiple regression can be used to fit a nonlinear model of an explanatory variable x (or sever
University of Florida - GEO - 4167
Board of the Foundation of the Scandinavian Journal of Statistics 2004. Published by Blackwell Publishing Ltd, 9600 Garsington Road, Oxford OX4 2DQ, UK and 350 Main Street, Malden, MA 02148, USA Vol 31: 515534, 2004Functional Coefficient Regression Mode
University of Florida - GEO - 4167
AN INTRODUCTION TO TREND SURFACE ANALYSISD.UnwinISSN 0305-6142 ISBN 0 902246 51 8 1978 David J. UnwinCONCEPTS AND TECHNIQUES IN MODERN GEOGRAPHY No. 5CATMOG(Concepts and Techniques in Modern Geography) CATMOG has been created to fill a teaching need
University of Florida - GEO - 4167
Intermediate Quantitative MethodsTimothy J. Fik Associate Professor GEO 4167 section #6647 (undergraduate) GEO 6161 section #8377 (graduate)Credit hours: 3Thursdays (periods 2-4): 8:30-11:30AM Location: TUR 3012 SPRING 2012Intermediate Quantitative Me
University of Florida - GEO - 4167
More on the Reliability, Precision, and Performance of the regression model and its estimated parameters. As the least-squares coefficient/parameter estimates ( j's) and the SRF's ability to explain variation in the dependent variable (Y) can vary from sa
University of Florida - GEO - 4167
II. Testing for Multicollinearity When two or more independent variables in a regression model are highly correlated with one another (or collinear), they will contribute "redundant" explanatory information. Hence, not all of those independent variables
University of Florida - GEO - 4167
Recall our recent Reading Assignments. Read and review: (a) the technical appendix in your textbook on Matrix approach to LS regression. Basic Econometrics by D. Gujarati, 2007, 4th edition. and/or (b) the posted Matrix Algebra review and the Matrix Appro
University of Florida - GEO - 4167
Extending Linear Regression: Weighted Least Squares, Heteroskedasticity, Local Polynomial Regression36-350, Data Mining 23 October 2009Contents1 Weighted Least Squares 2 Heteroskedasticity 2.1 Weighted Least Squares as a Solution to Heteroskedasticity
University of Florida - GEO - 4167
OLS Under Heteroskedasticity Testing for HeteroskedasticityHeteroskedasticity and Weighted Least SquaresWalter Sosa-EscuderoEcon 507. Econometric Analysis. Spring 2009April 14, 2009Walter Sosa-EscuderoHeteroskedasticity and Weighted Least SquaresOL
University of Florida - GEO - 4167
Regression Analysis Tutorial183LECTURE / DISCUSSION Weighted Least SquaresEconometrics Laboratory C University of California at Berkeley C 22-26 March 1999Regression Analysis Tutorial184IntroductionIn a regression problem with time series data (whe
University of Florida - GEO - 4167
Intermediate Quantitative MethodsTimothy J. Fik Associate Professor GEO 4167 section #6647 (undergraduate) GEO 6161 section #8377 (graduate)Credit hours: 3Thursdays (periods 2-4): 8:30-11:30AM Location: TUR 3012 SPRING 2012Intermediate Quantitative Me
University of Florida - AST - 1002
UFIDQ1 9.2 8.25 4.5 8 9.85 5.5 9.1 10 7.5 4.5 9.85 6 3.5 7 6.35 10 9 7.5 9.5 5.25 6.75 5 5.75 2.5 5.25 3.25 6.1 7 6.5 9.1 5 3.25 6.5 8.75 9 3.5 10 5 4.1 5.1 4.5 6.7501713653 03291993 03891805 05193165 09669612 11156163 11161338 11314038 11334031 1139879
University of Florida - AST - 1002
1/19/12discoveredinNov2011~600lyfromEarth P=290daysThe1stexoplanetorbiAngwithintheGoldilockzonearoundaSunlikestarReviewonLecture2WhyPtolemy'sEpicycleModelwasagoodtheory? WhyPtolemy'sEpicycleModelwasnotagood theory? Inwhataspect,Kepler'sModelissuperio
University of Florida - AST - 1002
1/19/12ImportantNo/ce1stQuizonJan26(1weekfromtoday) about10~15problems mul/plechoice+T,F+answering +simplemath? itwillcoverChap0.2. Tipsforstudyingthetextbook.Exoplanets51Pegasib*1stexoplanetdiscovered (1995)orbi/ngaSunlikestar CentralStar(51Pegasi)
University of Florida - AST - 1002
1/24/12Observa-onProjectI:Observingthe FullCycleoftheMoonAim:Understandingtherela-vemo-ons betweentheMoon&Sunbyobserving 1)theloca-onoftheMoonintheskyata fixedobserva-on-me 2)thephaseoftheMoon Due:1weekbeforetheFinalExamObserva-onProjectI:Observingthe
University of Florida - AST - 1002
1/27/12ReviewL02L051.BeginningoftheModernAstronomy Aristotle,Ptolemy,Copernicus,Kepler,Newton 2.Exoplanets(examples&generalproperMes) mass,eccentricity,distancefromhoststars,numberofmembers &layout 3.DetecMonMethodsofExoplanets directimagingwithAO radia
University of Florida - AST - 1002
What'supUniverse? TheStrongestSolarFlareIn2012,arewedoomed?UnderstandingOurWorld,SolarSystemChap48kpclyrSolarSystemLayout(1)30AU 100AU 105AULaunchedin1977 V=20,000m/sAsofAug2006OortCloudisahypothePcalshellwhichiscomposedofnumerous cometlikebodie
University of Florida - AST - 1002
EarthMoonSystemPlanetEarthP=365days d=1AU =5,500kg/m3 6,387kmMoon=3,300kg/m3 1,738kmChap5StudyingEarth:LandscapesStudyingEarth:OverallStructure6mainlayersofEarth1)MetallicCores(ironcore) 2)Mantle(Silicatemantle) 3)Crust 4)Atmosphere 5)Trophospher
University of Florida - AST - 1002
2/10/12Reminder!ObservingProjectsReviewonLecture78TextbookChap45Keyconcepts Q.UnderstandingtheoverallproperKesoftheSolarSystem Layout,Orbits,ChemicalcomposiKon.etc Q.Understandingthebasicfeaturesofthenebulartheoryof SolarSystemformaKon Q.Understanding