(Oppenheimer et al., 2009). The probable net effect is that we were much more liberal in ourinclusion criteria in Experiments 1a and 1b than we were in Experiment 2, and yet we obtainedhighly consistent results.Furthermore, recall that confidence interval construction is hypothesized to benefit the accuracyof best estimates by improving the recruitment of relevant evidence pertinent to testing theequivalent of best- and worst-case scenarios or multiple viewpoints (Hemming et al., 2018).Presumably, the benefit afforded to best-estimate accuracy depends on how accurately thepreceding intervals are constructed. Following this line of reasoning, one might also expect thecorrelation between best-estimate accuracy and (calibrated) confidence interval accuracy to bestronger if interval construction preceded best-estimate construction than if the best estimateswere constructed first. However, we do not find support for that prediction either. Acrossexperiments and question types, the correlation between GMAE for the best estimates and theproportion of correct responses in the calibrated confidence interval wasr(289) =−.40 (p< .001)when best estimates were elicited first andr(257) =−.34 (p< .001) when they were elicited afterthe confidence intervals were constructed. The difference is not significant,z=−0.79,p= .21.From a practical perspective, the present results do not indicate the utility of prior confidenceinterval construction for improving the accuracy of best estimates. As we noted in theIntroduction, improving the accuracy of judgments represents a major effort that has importantimplications for several domains. An important consideration in these efforts is cost. IBBEprotocols require significant additional time (e.g.., in the case of the four-step method, threeadditional judgments). Thus, even a small benefit may not justify the added effort, given thatother methods that require a similar number of elicitations have yielded large improvements inprobability judgment accuracy. Notwithstanding the risks associated with internal meta-analyses(Ueno, Fastrich & Murayama, 2016; Vosgerau, Simonsohn, Nelson & Simmons, 2019), it isuseful to estimate the overall effect size of the elicitation order manipulation we conductedacross three experiments. There is a small positive effect (Cohen’sd= 0.285, 95% CI [0.117,0.453]) of the modified four-step method we tested on the accuracy of best estimates.1Moreover,in some elicitation contexts, such as decision analysis (Clemen, 1996; von Winterfeldt &Edwards, 1986), it may be highly desirable, if not necessary, to collect lower- and upper-boundestimates, in which case there may be a small benefit to following the ordering prescribed byIBBE protocols. However, if the aim of the method is to improve best estimates, then query1Estimation of the confidence interval ondwas computed using the implementation ofprocedures by Smithson (2001) provided by Wuensch (2012).
Upload your study docs or become a
Course Hero member to access this document
Upload your study docs or become a
Course Hero member to access this document
End of preview. Want to read all 29 pages?
Upload your study docs or become a
Course Hero member to access this document
Term
Summer
Professor
NoProfessor
Tags
Credible interval, ICAR