This preview has intentionally blurred sections. Sign up to view the full version.
View Full Document
Unformatted text preview: University of California, Los Angeles Department of Statistics Statistics C173/C273 Instructor: Nicolas Christou Introduction • What is geostatistics? Geostatistics is concerned with estimation and prediction for spatially continuous phe nomena, using data obtained at a limited number of spatial locations. Here, with phenomena we mean the distribution in a two or threedimensional space of one or more random variables called regionalized variables . The phenomenon for which the regionalized variables are referred to it is called regionalization . For example, the dis tribution of mineral ore grades in the threedimensional space. Or the distribution of ozone, etc. • History: The term geostatistics was coined by Georges Matheron (1962). Matheron and his colleagues (at Fontainebleau, France) used this term in prediction for problems in the mining industry. The prefix “geo” concerns data related to earth. • Today, geostatistical methods are applied in many areas beyond mining such as soil science, epidemiology, ecology, forestry, meteorology, astronomy, corps science, envi ronmental sciences, and in general where data are collected at geographical locations (spatial locations). • The spatial locations throughout the course will be denoted with s 1 ,s 2 , ··· ,s n and the spatial data collected at these locations will be denoted with z ( s 1 ) ,z ( s 2 ) , ··· ,z ( s n ). Spatial locations are determined by their coordinates ( x,y ). We will mainly focus in twodimensional space data. • Very important in the analysis of spatial data is the distance between the data points. We will use mostly Euclidean distances. Suppose data point s i has coordinates ( x i ,y i ) and data point s j has coordinates ( x j ,y j ). The Euclidean distance between points s i and s j is given by: d ij = q ( x i x j ) 2 + ( y i y j ) 2 Other forms of distances can be used (greatcircle distance, azimuth distance, travel distance from point to point, time needed to get from point to point, etc.). 1 • The problem: Present and explain the distribution of the random function Z ( s ) : s ∈ D Predict the value of the function Z ( s ) at spatial location s (in other words the value z ( s )) using the observed data vector z ( s 1 ) ,z ( s 2 ) , ··· ,z ( s n ) (see figure be low). ● ● ● ● ● ● ● ● 62 64 66 68 70 72 74 128130132134136138140 x coordinate y coordinate s0 s1 s2 s3 s4 s5 s6 s7 Environmental protection agencies set maximum thresholds for harmful substances in the soil, atmosphere, and water. Therefore given the data we should also like to know the probabilities that the true values exceed these thresholds at unsampled locations....
View
Full
Document
This note was uploaded on 02/11/2012 for the course STATS c173/c273 taught by Professor Nicolaschristou during the Spring '11 term at UCLA.
 Spring '11
 NicolasChristou
 Statistics

Click to edit the document details