Documents Found!
As seen in
Less Work, Better Grades
Join
Course Hero
Access
best resources
Ace
your classes
Ace your courses with Course Hero!

Submit your homework question or assignment here:
352 Tutors are online
 
We are so confident that you will love our service, we will answer your first homework question for FREE!
*  Attach Assignment (optional):
 
Study Smarter, Score Higher
 
Document Content (unformatted)
Course Hero has millions of student submitted documents similar to the one below including study guides, homework solutions, papers, exam answer keys and textbook solutions.
NOTES 8/30/2004 Logistics Logistics, o ce hour times, the course website, relevant texts, and a list of topics to be covered are given on the course syllabus. This syllabus can be obtained from Mark van der Laan or Alan Hubbard. Course evaluation will be based on attendance, lecture notes (each student will have to transcribe one or more lectures), a midterm, and a nal poster project. The nal project will be an application of causal inference methodology from the class to real or simulated data, and students will be allowed to work in small groups. The three big questions Before a researcher comes to a statistician for help with data analysis, he or she should be able to answer the following questions. (1) What is my data, and what is my population of interest? (2) What is my model? (What assumptions can I make about the data generating distribution?) (3) What is the parameter of interest? (What would I like to know about the population?) A statistician s job is to then help the researcher estimate the parameter of interest. Causal inference problems fall into this three question framework, but the answers to (1), (2), and (3) are di erent from what one would nd in more traditional elds of statistics, as we describe below. 1. What is the data in causal inference problems? We will be interested in longitudinal data, where each subject is followed over time, and we de ne the following variables. A(t). This denotes the treatment given to a subject at time t. Y(t). This denotes some outcome of interest, measured at time t. X(t). This include Y (t), as well as time-dependent (and baseline) covariates measured on the subject. Note that A(t), Y (t), X(t) can be possibly multivariate. De ne A as (A(t) : t 0), the process giving the value of A(t) for each t, and de ne Y and X similarly. The observed data in causal inference problems is then n i.i.d. copies of (A, X) P0 . If the reader is unfamiliar with the notation i.i.d., feel free to consult any introductory statistics text. Here P0 is the (unknown) data generating distribution, which assigns probabilities to members of the population of interest. As an example, consider an AIDS study. Suppose n patients are selected at random from an AIDS registry, and each is followed up for a period of time. Here P0 is the probability distribution putting equal mass on each sample of n distinct subjects from the registry (this approximates i.i.d. sampling if the registry size is very large compared to n), and the population of interest consists of all members of the registry. We might have A(t) represent the collection of medications being prescribed to the patient at time t, and the outcome Y (t) might represent the viral load at time t or an indicator of whether the patient is still alive at time t. Here X(t) could include baseline measurements such as the patient s sex, age, and income, as well as time-dependent covariates such as the patient s CD4 count at time t. 2. What is the model in causal inference problems? Causal inference is the study of counterfactuals, which are the outcomes that would have been observed had the treatment somehow been di erent. Speci cally, let X a be a counterfactual, and represent the process X that would have been observed had the treatment been set at A = a. When A = a, we refer to the observed X a = X A as the factual. We will assume that X A = X, and this is called the consistency assumption. It states that the observed data is equal to what we would have observed in the counterfactual world had the treatment been set to the observed treatment. In the AIDS example, suppose a represents 1 no treatment being given. Then for a given patient, X a represents the covariate process that would have occured if, contrary to fact, no treatment had been given to that patient. The idea of counterfactuals raises philosophical issues that have been discussed at least since the time of David Hume, because in the real world each subject is only assigned one treatment process. In causal inference problems, our model will assume the F U LL existence of counterfactuals. If represents the support of A, then we refer to X = (X a : a ) as the full data. For the estimation procedures discussed in this class to be e ective, we must make additional F U LL assumptions on the distribution of X (the full data model), and the conditional distribution of A, given F U LL X , such as the sequential randomization or no unmeasured confounding assumptions, and these will be formally de ned in subsequent lectures. The choice of models are typically heavily driven by the parameter of interest. Our general phylosophy is that one should make model assumptions on the parameter of interest, but try to minimize assumptions on the nuisance parameters. Marginal structural models, models for direct and indirect e ects, and history adjusted marginal structural models (three topics covered in this class) are just di erent models on (conditional) distributions of counterfactuals, describing how these conditional distributions change with a change in treatment regime a. 3. What is the parameter of interest in causal inference? In causal inference problems, parameters of interest are called causal parameters. Typically they are F U LL , but in history adjusted marginal structural models functions of the data generating distribution of X they will be functions of conditional distributions of the counterfactual outcome Ya , given an observed past. Usually these parameters will be related to how the outcome process Y a varies with a, and how this variation is modi ed by the covariates. For instance, in a marginal structural models our model might assume that E[Ya (t)] = m(t, a, ) for some known function m( ) and unknown Euclidean parameter . In this case, would be the causal parameter, and we would have to nd a way to estimate it from the observed data. Note F U LL ), causal inference can that because the observed data (A, X) is a strict subset of the full data (A, X be treated as a missing data problem. This will be heavily exploited in the following lectures. The general estimating function approach for censored/missing data structures as described in van der Laan, Robins (2002), and presented in this course, corresponds with rst nding procedures (i.e., full data estimating functions) that can estimate the causal parameter from the full data, and then map these to procedures (i.e., observed data estimating functions) that estimate the causal parameter from the observed data. Causal graphs, 9/1/2004 Notes. In point treatment studies we are interested in studying the causal e ect of treatment on outcome of interest. Let A denotes the treatment and Y the outcome variables respectively. Other covariates X 1 , X2 , . . . , Xm are also collected from the patients in the study, requiring adjustment of the causal e ect for the covariates. In some situations, the set of causal assumptions on the random variables is known before the study begins and can be represented in the form of a causal graph. Before we proceed to give a formal de nition of this concept, we introduce the notation using the causal graph in g. 1 as an example. There are 5 variables in this hypothetical study - X1 , X2 , X3 , A and Y , which are represented in the graph as vertices. Directed edges represent causal dependence (of which the statistical conditional dependence is a special case), thus Y is causally dependent on A, X1 and X3 ; X2 is dependent on X1 , etc. We de ne P A (X) to be the set of all random variables Z in the graph which have a direct arrow going into the node X. In other words P A (X) is the set of parent nodes of vertex X, i.e. all those vertices in the causal graph that have a directed edge that ends in X. For example P A (Y ) = {A, X 1 , X3 } and P A (X1 ) = . Loops are not allowed, which is equivalent to requiring that the causal graph is DAG (directed acyclic graph). De nition 1. A causal graph G for the set of R.V. (X1 , X2 , . . . , Xm , A, Y ) in a point treatment study is a DAG de ning a set of causal assumptions: Xi = A= Y= fi (P A (Xi ) , i ) , i = 1, . . . , m, fA (P A (A) , A ) , fY (P A (Y ) , Y ) , 2 Figure 1. Example of a causal graph with three covariates. for some deterministic functions fi , i = 1, . . . , m, fA and fY , and i , i = 1, . . . , m, A and Y are random variables called (exogenous) errors satisfying the following assumptions i P A (Xi ) , A P A (A) , Y P A (Y ). Note that the causal graphs just assumes that nodes are certain functions of parents-nodes and exogenous error variables, but it assumes nothing about the functional form of these deterministic functions. However, in practice, if one aims to estimate these unknown deterministic functions, then, by the curse of dimensionality, one will parameterize each of these functions with a set of parameters such as coe cients of linear/logistic regression models. The causal graph can involve unmeasured variables: that is, A, Y or X 1 , . . . , Xm can be subject to missingness or (right-)censoring. In this case one views the causal graph as a set of assumptions on the full-data random vector being the collection of nodes in the causal graph: in thus, this case the causal graph does not make any assumptions about the conditional distribution of censoring/missingness variables, given the full data X F U LL (X1 , . . . , Xm , A, Y ). Specifying the whole DAG is usually hard in practice. Given such a DAG it is now possible to identify a causal e ect of one node on another node in the graph from the distribution/density of the full-data X F U LL = (X1 , . . . , Xm ), A, Y . In addition, if the observed data is O = (C, X F U LL ) for some known function of a censoring variable C and X F U LL (this is the most general de nition of a censored/missing data structure), and one assumes that the conditional distribution of C, given X F U LL , satis es coarsening at random (see e.g., van der Laan, Robins, 2002, for literature overview and de nitions), then one can often identify from the observed data distribution the full data distribution X F U LL , and thereby the wished causal e ects. To understand what variables in the graph can be completely missing (i.e., they are not observed on any subject in the sample) while still having coarsening at random and (thereby) identi cation of the wished causal e ect is an interesting problem, and area of research. In class we we will present an alternative counterfactual approach exists that does not require full speci cation of a causal graph, but, does only require knowing which variables are pre-treatment, but does also need to assume that there are no unmeasured confounders. Once we have the causal graph we can identify from the distribution of X F U LL the counterfactual distribution P (Ya = y) which is the marginal distr. of Y when treatment is set at level A = a. If the interest is only in the e ect of A on Y , all covariates connected to A only through an undirected path that includes Y should be ignored. One important issue in the study of causal graphs is what s the minimal subset of covariates that is su cient to identify the counterfactual distribution of Ya ; related to this is the question of what s the minimal set of confounders that needs to be strati ed upon in a point treatment study. De nition 2. A set of edges connecting two vertices A and Y is called a back-door path if X s.t. X P A (A) and there is a undirected path between X and Y . A vertex in a back-door path is called a collider if the path edges incident with this vertex are incoming (i.e. having their direction towards the vertex). A confounding between A and Y is present if a back-door path with no colliders connecting A and Y . For example, there are two back-door paths present in g. 1 - A X1 Y and A X2 X3 Y . 3 How to nd the likelihood of the data given a causal graph? Using the chain rule for factoring the joint likelihood of discrete R.V. (Z1 , Z2 , . . . , Zd ): d (1) P (Z1 , Z2 , . . . , Zd ) = P (Z1 ) i=2 P (Zi |Z1 , . . . , Zi 1 ) , together with the conditional dependence between variables implied by the causal graph P (Z i |Z1 , . . . , Zi 1 ) = P (Zi |P A(Zi )), we obtain: d P (Z1 , Z2 , . . . , Zd ) = i=1 P (Zi |P A(Zi )) . When applying this to a single observation in a point-treatment study with discrete R.V. (X 1 , X2 , . . . , Xm , A, Y ) we obtain: P (X1 = x1 , . . . , Xm = xm , A = a, Y = y) = m (2) = j=1 P (Xj = xj |P A (Xj )) P (A = a|P A (A)) P (Y = y|P A (Y )) . Assuming that each of the functional relationships in de nition 1 is parameterized as a (e.g., linear) regression in the parent nodes with known (or up till a nite dimensional parameter) conditional distribution of the error-term, given the parent-nodes, we can express the joint probability in 2 as a function of an unknown parameter vector (e.g., the collection of node-speci c regression coe cients), which represents now a parametric model and for the density/likelihood of X F U LL . One can estimate the unknown parameters with the maximum likelihood estimator obtained by maximizn ing the likelihood i=1 P (X F U LL = xF U LL ) of an observed sample xF U LL , i = 1, . . . , n. Typically, this can i i be carried out with standard software (e.g. implementations of generalized linear regression). Given the causal graph and its estimated functional relations, one can now de ne the distribution of Y a as the distribution one obtains by xing the treatment variable A = a in the system of equations de ned by the node-speci c equations, and generating all nodes accordingly. Example 1. The causal graph in this example is speci ed in g. 2. Figure 2. Another example of a causal graph with three covariates. Assuming continuous distr. for all R.V in the causal graph, the density for a single observation can be written as follows: (3) f (X1 , X2 , X3 , A, Y ) = f (X3 ) f (A|X3 ) f (X2 |A) f (Y |A, X1 , X3 ) f (X1 |A) . 4 Now let s nd the counterfactual distribution for A = a: (1) Erase f (A|P A (A)) from 3. This is equivalent to performing an incision to the graph in g. 2 that removes both vertex A and the edges incident to A ( g. 3); (2) Set A = a in all functions where A belongs to the parents set; (3) De ne fa (X1 , X2 , X3 , Y ) = f (X3 ) f (X2 |A = a) f (Y |A = a, X1 , X3 ) f (X1 |A = a) ; (4) Integrate out X1 , X2 and X3 in fa (X1 , X2 , X3 , Y ) to get the counterfactual density fa (Y ). Figure 3. Graph from example 1 after incision of A. Example 2. This is how Pearl de ned the counterfactual probability distribution fa (Y ). Hence, if A has two levels - treatment (a = 1) and control (a = 0), and the causal parameter (or e ect) of interest is the treatment di erence, one needs to calculate the quantity Ef1 (Y ) Ef0 (Y ). The above method for doing causal inference can be summarized as follows. Using standard software we can do maximum likelihood estimation to t the functional forms of each node in the causal graph, thus estimating the parameters in each functional relationships in def. 1. Subsequently, given the maximum likelihood estimator of the unknown parameters, we can identify corresponding estimates of the treatment speci c disstribution of (Ya , X1a , . . . , Xma ) by Monte-Carlo simulation for each choice of treatment level a. If the functions in def. 1 are parameterized using exible regression functions, one can employ dataadaptive model selection methods such as cross-validation or penalized likelihood methods. Such methods provide tools to decide data adaptively how exible the parametrization should be. Clearly, if the number of parameters is larger than the sample size n, then the maximum likelihood estimator of the unknown parameter vector becomes too variable or ill de ned, and thereby our estimate of the treatment speci c distribution is too variable as well. That is, the size/dimension/complexity of the model needs to be data dependent. An important research area is the development of methods which data adaptively selects models for the purpose of estimating a particular parameter (such as in our case, the causal e ect of treatment on the outcome Y ) of interest. To estimate the variability of the estimates of the causal parameters, (non-) parametric bootstrap is usually employed. This involves resampling repeatedly n observations from the actual sample (nonparametric bootstrap) or from a t of the true probability distribution of the data (parametric bootstrap). Is it possible to make a choice between di erent causal graphs based exclusively on the data from a study? Short answer is, No . Consider a simple causal graph with 2 R.V. X1 , X2 , where the true causal graph and 2 data generating distribution is: X1 X2 , X1 N 1 , 1 and X2 = X1 + c. If we would choose between the only two possible causal graphs X1 X2 and X1 X2 the one with the largest corresponding tted 2 likelihood (with the second model assuming X1 = X2 + c , X2 N 2 , 2 , and 2 , 2 estimated from the data generated using the true data distribution) we ll get that approximately 50% of the time we pick the wrong causal graph. In general, if all nodes (A, Y, X1 , . . . , Xm ) are discrete valued, then all possible causal graphs of X F U LL give the same corresponding tted maximum likelihood estimate of the distribution of 5 X F U LL , if we use for all causal graphs the nonparametric model (that is, do not assume any parametric form for the functional relations). In this case, the causal-graph speci c maximum likelihood is completely at in the choice of causal graph. Consequently, any variability in the causal-graph speci c maximum likelihood values across causal-graphs is NOT due to changes in the causal graphs, but it is due to the fact that di erent parametrized causal graphs result in di erent statistical models for X F U LL , and one might approximate the true distribution of X F U LL better than another. For example, if in truth X1 X2 , and the conditional mean of X1 , given X2 happens to be linear in X2 , then a wrong causal graph X2 X1 with corresponding linear normal regression assumption X1 N (betaX2 , 2 ), will likely give a higher maximum likelihood value then a correct causal graph X1 X2 with corresponding assumption X2 is exponential with (X1 ) = X1 . 6
Find millions of documents here - Study Guides, Homework Solutions, Papers, Exam Answer Keys and more. Course Hero has millions of course related materials that will enable you to learn better, faster and get an A in all your courses.
Below is a small sample set of documents:

Berkeley >> STAT >> 210b (Spring, 2008)
Data Adaptive Estimation of the Treatment Specic Mean in Causal Inference R-package cvDSA Yue Wang Division of Biostatistics Nov. 2004 Nov. 8, 2004 1 Outlines Introduction: Data structure and Marginal Structural Model. Estimation Road map Choice...
Berkeley >> STAT >> 210b (Spring, 2008)
Multiple Testing Procedures Examples and Software Implementation Multiple Testing in Action Examples From New Book Multiple Testing Procedures with Applications to Genomics (2007). S. Dudoit and M. J. van der Laan. Multiple Testing Software R packa...
Berkeley >> STAT >> 210b (Spring, 2008)
Targeted MLE for Variable Importance and Causal Effect with Clinical Trial and Observational Data Mark van der Laan works.bepress.com/mark_van_der_laan Division of Biostatistics, University of California, Berkeley Outline Standard approaches...
Berkeley >> STAT >> 210b (Spring, 2008)
Multivariate Statistical Methods in Genomics PH 243 A, MW 12-2, 2305 Tolman Instructor: Mark van der Laan Oce: Haviland Hall 108, tel: 643-9866 website: www.stat.berkeley.edu/ laan Technical reports at www.bepress.com/ucbbiostat/ email: laan@stat.ber...
Berkeley >> STAT >> 210b (Spring, 2008)
Super Learning Eric Polley Joint work with Mark van der Laan and Alan Hubbard e-mail: ecpolley@berkeley.edu PH 246C (Berkeley) Super Learning PH 246C 1 / 27 Outline 1 Cross-Validation Denitions Oracle Inequalities 2 Super Learning Denitions...
Berkeley >> STAT >> 210b (Spring, 2008)
Super Learning in Prediction HIV Example Mark van der Laan www.bepress.com/ucbbiostat Division of Biostatistics, University of California, Berkeley Outline Super Learning in Prediction of HIV Phenotype based on HIV Genotype Scientific Goal Predic...
Berkeley >> STAT >> 210b (Spring, 2008)
Empirical Processes - Introduction April 8, 2005 Notation and Basic Setup We will assume that O1 , ., On P i.i.d. Pn is the empirical distribution. P f means Ep f (O) = f dP , likewise Pn f = f dPn . F will represent a set of real-valued functions...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Curie-Weiss and KMT Strong Embedding 1 Lecture 14 Lecture date: Sept 28, 2007 Scribe: John Zhu 1 Some nal remarks about the Curie-Weiss concentration 1 n In the Curie Weiss model, if m() = i , then +t n exp nt2 . 4(...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 25, Quiz 4 Solutions May 9, 2006 1. (a) If H0 is rejected, it means that the data is inconsistent with H0 . Thus the scale is (almost certainly) out of calibration, so the answer is (ii). (b) If H0 is not rejected, it means that the data is co...
Berkeley >> STAT >> 25 (Fall, 2008)
Game Theory, Alive Yuval Peres i The author would like to cordially thank Alan Hammond, for scribing the rst draft; Gabor Pete, Yun Long and Peter Ralph for scribing the revisions; Yelena Shvets for pictures and editing; Ranjit Samra of rojaysorigi...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 155 Fall 08 Practice Final December 9, 2008 Time: 3 hours. Please show all steps. 1. Consider a game of nim with inital conguration (48, 23, 74, 10) and with the restriction that at most three chips can be removed from a pile in a single move....
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 18 Lecture date: Oct 8, 2007 Scribe: Guy Bresler 1 Tusndys Lemma a Lemma 1 (Tusndys Lemma, Lemma 3.10 in handout) Let 1 , . . . , n be i.i.d. a symmetric 1 random variables and S...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 134 Section 2 Homework 3 (Due: Wednesday, 9/27) Problems from Pitmans book: Section 2.1: 1, 5, 7. Section 3.2: 3, 6, 13. 1 ...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 25, Homework # 2 Solution February 25, 2006 1. n!. If n = 10, then this is 3628800. 2. 310 . 3. (a) S = {(1, 1), (1, 2), (1, 3), (1, 4), (1, 5), (1, 6), (2, 1), (2, 2), (2, 3), (2, 4), (2, 5), (2, 6), (3, 1), (3, 2), (3, 3), (3, 4), (3, 5), (3...
Berkeley >> STAT >> 25 (Fall, 2008)
A new method of normal approximation Sourav Chatterjee (UCB) Sourav Chatterjee A new method of normal approximation Central limit theorems Classical CLT: If X1 , . . . , Xn are independent random variables with zero mean and nite variance and (.),...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications Fall 2007 Lecture 23 Lecture date: Oct. 19, 2007 Scribe: Partha Dey Recall that, in the Sherrington Kirkpatrick model, the probability of a conguration = (i )N {1, +1}N is i=1 P() = 1 ZN e...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 25, Homework # 3 Solution February 25, 2006 1. P (A B c ) = P (A) + P (B c ) P (A B c ) = P (A) + P (B c ) P (A)P (B c ) = P (A) + (1 P (B) P (A)(1 P (B) = 1 P (B) + P (A)P (B). 2. Let R1 be the event that engine 1 is running. Let R2 b...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 15 Lecture date: Oct 1, 2007 Scribe: Arnab Sen Proof of Lemma 1 (contd.) Recall that K is set of all probability measure on Rn such that x(x) = 0 and exp , x (dx) exp(b 2 ) Rn...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 13 Lecture date: September 26, 2007 Scribe: Joel Meord Recall the following theorem from the previous lecture. Theorem 1 Suppose (X, X ) is an exchangeable pair, F(X, X ) is an ant...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 200A Practice Problem Set 1 1. Let = R, F = all subsets so that either A or Ac is countable, P (A) = 0 in the rst case and = 1 in the second. Show that (, F, P ) is a probability space. 2. Let (, F, P ) be a probability space and let A1 A2 A3...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 20 Lecture date: Oct 12, 2007 Scribe: Tanya Gordeeva 1 Proof of Lemma 3.9 Recall lemma 3.9 from the handout: Lemma 1 (Lemma 3.9) There exist universal constants C, K, and 0 such ...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications Fall 2007 Lecture 7 Lecture date: Sept. 12, 2007 Scribe: Partha Dey 1 Method of Exchangeable Pair d Suppose (W, W ) is an exchangeable pair of random variables, i.e. (W, W ) = (W , W ), an...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 134 Section 2 Homework 3 (Due: Wednesday, 10/4) Problems from Pitmans book: Section 3.1: 2, 15. Section 3.3: 3, 12. Section 3.4: 1, 10 1 ...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 134 Section 2 Homework 6 (Due: Friday, 10/27) Problems from Pitmans book: Section 4.1: 2, 12. Section 4.4: 1, 3, 6, 7. Section 4.5: 6, 8. 1 ...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 9 Lecture date: Sep 17, 2007 Scribe: Tanya Gordeeva 1 Proof of the Hoeding combinatorial CLT Recall the Hoeding CLT from the previous lecture: Theorem 1 Suppose (aij )1i,jn is an...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 12 Lecture date: September 24, 2007 Scribe: Richard Liang 1 Steins method for concentration inequalities The purpose of this lecture will be to prove the following theorem. Theor...
Berkeley >> STAT >> 25 (Fall, 2008)
1. A loses if he gets no sixes. This can happen in 56 ways. Total number of ways = 66 . Therefore P (A loses) = 56 0.335. 66 B loses if the number of sixes is either 0 or 1. Now, P (B gets 0 sixes) = 512 /612 . Again, B can get exactly one six if i...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 25, Quiz 2 Solutions March 23, 2006 Note from the TA: All the relevant formulas will be provided for quizes in the future. 1. Linear combinations of normaly distributed varaibles are normaly distributed. The mean is 2 2 2 X+Y = X +Y = 90. Sinc...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 27 Lecture date: Oct 29, 2007 Scribe: Arnab Sen In the previous lectures, for the SK model with h = 0, < 1/2 ( though the result actually holds for h = 0, < 1), we proved that th...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 19 Lecture date: Oct. 10, 2007 Scribe: Laura Derksen 1 Let Tsundys lemma for a given total sum a i.i.d. symmetric random variables taking values in {1, 1} and let Sn = be a unifor...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 200A Fall 2007 Homework 5 1. Suppose X1 , . . . , Xn i.i.d. N (, c2 ), where c is a known constant. Compute the MLE of . 2. A company is produces xi items in the ith week, and Yi of them turn out to be defective. Here xi is deterministic and Yi...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 155 Fall 08 Solution to HW 6 1. Suppose player I has m pure strategies. Since the game is symmetric, player II also has m pure strategies. Let m K = m = {p = (p1 , . . . , pm ) : pi 0 for all i, and i=1 pi = 1}. Then K is a closed, convex, ...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 5 Lecture date: Sept 7, 2007 Scribe: Guy Bresler 1 Continuation of Stein Bound In the previous lecture we proved bounds on f and its derivatives, f satisfying f (x) xf (x) = g(x...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 25, Quiz 4 May 4, 2006 1 Formulas The least squares line for y vs x is y = 0 + 1 x. 1 = Given points (x1 , y1 ), . . . , (xn , yn ) P n x y i=1 (xi )(yi ) n x2 i=1 (xi ) P 0 = y 1 x The residual for the ith data point is ei =...
Berkeley >> STAT >> 25 (Fall, 2008)
A generalization of the Lindeberg principle Sourav Chatterjee 1 X = (X1, . . . , Xn) and Y = (Y1, . . . , Yn) are independent random vectors. f : Rn R is a C 3 function. If, say, Xis and Yis are all independent 2 and EXi = EYi, EXi = EYi2, th...
Berkeley >> STAT >> 25 (Fall, 2008)
(Updated on November 5, 2008.) SOURAV CHATTERJEE Assistant Professor of Statistics UNIVERSITY OF CALIFORNIA AT BERKELEY 367 Evans Hall Berkeley, CA 94720-3860 United States Born: November 1979, Calcutta, India. Nationality: Indian 1-510-225-5152 (Ce...
Berkeley >> STAT >> 25 (Fall, 2008)
Gravitational allocation to Poisson points Sourav Chatterjee joint work with Ron Peled Yuval Peres Dan Romik S. Chatterjee, R. Peled, Y. Peres, D. Romik Gravitational allocation to Poisson points Allocation rules Let be a discrete subset of Rd ...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 16 Lecture date: Oct 3, 2007 Scribe: Anand Sarwate 1 The general plan of attack Recall the setup from the last lecture: we have iid 1 symmetric random variables 1 , 2 , . . . , a...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT 25 Homework # 3 1. Suppose A and B are two independent events. Derive a formula for P (A B c ) in terms of P (A) and P (B). 2. Airplanes usually have multiple engines so that a backup engine can be used if one breaks down. Suppose a certain ai...
Berkeley >> STAT >> 25 (Fall, 2008)
A The Comprehensive L TEX Symbol List Scott Pakin <pakin@uiuc.edu> 29 September 2003 Abstract A This document lists 2826 symbols and the corresponding L TEX commands that produce them. Some A X 2 system; others require fonts and packages of these sy...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 155 Fall 08 HW 5 Due date: 10/30/2008 Dont just write down the answer. Please show all steps. Grading scheme: 2 points for fully correct answer, 1 point for partially correct answer, 0 point for completely wrong answer. 1. Game of chicken. Two ...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 26 Lecture date: October 26, 2007 Scribe: Richard Liang 1 Annealed CLT for the Hamiltonian of the SherringtonKirkpatrick model We begin by recalling the setting of the previous l...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 11 Lecture date: Sep 21, 2007 Scribe: Maximilian Kasy For E the set of edges in the lattice Z2 , let (e )eE be i.i.d. nonnegative edge weights. Dene tn := inf{ e : P is a path from...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 200A Solution Set 1 1. Clearly, F, because the empty set is countable. Also, it is clear from the denition that if A F, then Ac F. Now suppose A1 , A2 , . . . F. There can be two cases: (i) If Ai is countable for each i, then i Ai is counta...
Berkeley >> STAT >> 25 (Fall, 2008)
Solutions to practice midterm # 2 1. (a) As discussed in class, Xt . = t In the particular data we are considering, we have t = 100 and Xt = 20, and thus = 20/100 = 0.2. (b) Again, this was discussed in class. From the denition of the Poisson pro...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT 25 Homework # 2 1. Suppose that in a certain class, there are n students, and they are ranked from 1 to n according to their total scores in a certain course. How many possible rankings can there be? What is this number if, say, n = 10? 2. If i...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 134 Section 2 Homework 5 (Due: Wednesday, 10/11) Problems from Pitmans book: Section 3.4: 12. Section 3.5: 1, 2, 9. Chapter 3 Review Exercises: 7, 10. Additional question: Suppose there are n individuals living in a certain town. Let Xn be the n...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 155 Fall 08 HW 1 Due date: 9/11/2008 Dont just write down the answer. Please show all steps. Grading scheme: 2 points for fully correct answer, 1 point for partially correct answer, 0 point for completely wrong answer. 1. Recall that the subtra...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 200A Practice Problem Set 3 1. Suppose X1 , . . . , Xn i.i.d. N (, c2 ), where c is a known constant. Compute the MLE of and nd out the asymptotic distribution of n(n ). 2. Suppose X1 , . . . , Xn i.i.d. P oi(). Compute the MLE of . What i...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 25, Quiz 3 Solutions April 7, 2006 1. (a) Let A and B be the concentrations of particles per mL in suspensions A and B, respectively. Then A = 55 18.3 mL1 and B = 48 = 16 mL1 . Thus, an estimate 3 3 1 for the dierence is A B = 2.3 mL . ...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 30 Lecture date: Nov. 5, 2007 Scribe: Laura Derksen We will again consider the SK model with parameters and h. Let 1 l1 = N and 1 r1 = N N N g1j j , j=2 g1j j (1 q) 1 . j=2 ...
Berkeley >> STAT >> 25 (Fall, 2008)
Stat 25, Homework # 1 Solution February 25, 2006 1. (a) Mean = (Sum of scores)/24 26.583. (b) Listed in ascending order, the scores are 0, 7, 11, 15, 18, 19, 24, 25, 25, 25, 27, 28, 29, 29, 29, 30, 30, 35, 36, 39, 39, 39, 39, 40 The number of score...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 8 Lecture date: Sept. 14, 2007 Scribe: Laura Derksen 1 Exchangeable pairs Recall that a pair of random variables is called exchangeable if (W, W ) and (W , W ) are equal in distr...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 21 Lecture date: Oct 15, 2007 Scribe: Chris Haulk Exercise Let X = (Xi,j )1i<jn denote the Erds-Renyi random graph G(n, p). Let o f (X) = #triangles(X) E(#triangles(X). Obtain con...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 28 Lecture date: Oct 31, 2007 Scribe: Anand Sarwate 1 A recap For < 1/2 in the Sherrington-Kirkpatric (S-K) model, we showed a bound on the overlap R12 : E (R12 q)2k (Ck)k , N...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 24 Lecture date: October 22, 2007 Scribe: Joel Meord Continuing with the Sherrington-Kirkpatrick model. q [0, 1], 0, h R E where z N(0, 1). Exercise 1 (From the previous lectu...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 10: Concentration Inequalities Lecture date: September 19, 2007 Scribe: Chris Haulk 1 Concentration Inequalities Suppose X is a random variable and m is a constant (usually the m...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 1 Lecture date: Aug 27, 2007 Scribe: David Rosenberg 1 Introduction Steins method was invented in early 70s by Charles Stein as a method for proving central limit theorems. 1.1 ...
Berkeley >> STAT >> 25 (Fall, 2008)
STAT C206A / MATH C223A : Steins method and applications 1 Lecture 34 Lecture date: November 16, 2007 Scribe: Joel Meord Size Bias Transformations Size bias transformations are a classical topic in probability, with applications to sampling theory...
Berkeley >> HISTORY >> 103d (Fall, 2008)
History 103D Spring 2006 Room TBA War and Mediation: the Vietnam Wars in History, Film, and Story Professor Diane Shaver Clemens, 3223 Dwinelle Hall , Office Hours Th 4-5:30pm or by appt 642-1102 (message), 642-1971 (history dept ), athena1@berkeley...
Berkeley >> HISTORY >> 124b (Summer, 2008)
History 124B - Mr. Abrams Spring 2006 Office Hrs.: 11-12 Tu by appointment 11 Office: 3219 Dwinelle (Tel: 642-1971; -2611) E-mail: abramsr@berkeley.edu The United States Since 1940 Tu,Th 9:30Rm. 2060 VLSB There are no formal prerequisites f...
Berkeley >> HISTORY >> 149b (Fall, 2008)
History R1 The Practice of History Spring 2005 Prof. David Henkin, Prof. Maureen Miller GSIs: Kavita Datla, Mary Doyno, Mary Panelo, Ellen Yutzy Lectures: Tuesday/Thursday 3:30-5pm, 159 Mulford What exactly are we doing when we study History? This...
Berkeley >> HISTORY >> R1 (Fall, 2008)
History R1 The Practice of History Spring 2005 Prof. David Henkin, Prof. Maureen Miller GSIs: Kavita Datla, Mary Doyno, Mary Panelo, Ellen Yutzy Lectures: Tuesday/Thursday 3:30-5pm, 159 Mulford What exactly are we doing when we study History? This...
Berkeley >> HISTORY >> 171c (Fall, 2008)
History 171C FINAL Identifications Study Sheet Maxim Litvinov Spanish Civil War Molotov-Ribbentrop Pact (Soviet-German Non-aggression Pact) Russo-Finish War Operation Barbarrossa Battle of Moscow Battle of Stalingrad Yalta Conference Marshall Plan T...
Berkeley >> HISTORY >> 172 (Fall, 2008)
When Do Scientists Become Entrepreneurs? The Social Structural Antecedents of Commercial Activity in the Academic Life Sciences1 Toby E. Stuart Columbia Business School Waverly W. Ding University of California, Berkeley The authors examine the condi...
Berkeley >> HISTORY >> 172 (Fall, 2008)
Joseph Farrell farrell@econ.berkeley.edu http:/emlab.berkeley.edu/users/farrell Professor of Economics, University of California, Berkeley, 1991Chair of the Competition Policy Center, Affiliated Professor, Haas School of Business, U. C. Berkeley, 199...
Berkeley >> HISTORY >> 172 (Fall, 2008)
Vertical Arrangements, Market Structure, and Competition: An Analysis of Restructured U.S. Electricity Markets James B. Bushnell, Erin T. Mansur, and Celeste Saravia August 6, 2007 Abstract This paper examines vertical arrangements in electricity m...
Berkeley >> HISTORY >> 172 (Fall, 2008)
Christopher A. Hennessy Curriculum Vitae Walter A. Haas School of Business University of California, Berkeley Education Princeton University, Ph.D. Economics, 2001. (Adviser: Patrick Bolton) Princeton University, Master of Public Affairs, 1994. Swar...
Berkeley >> HISTORY >> 172 (Fall, 2008)
Revised 08/16/08 Laura J. Kray PERSONAL INFORMATION Organizational Behavior & Industrial Relations Group Haas School of Business University of California, Berkeley ACADEMIC POSITIONS 2007-present Harold Furst Chair of Management Philosophy and Value...
Berkeley >> HISTORY >> 172 (Fall, 2008)
Nicole Bastian Johnson University of California, Berkeley Walter A. Haas School of Business 545 Student Service Building # 1900 Berkeley, CA 94720. Phone: 510-642-6590 njohnson@haas.berkeley.edu Current Position 2005-Present Assistant Professor, Wal...
What are you waiting for?