708
Views
14
CrossRef citations to date
0
Altmetric
Articles

The Brier Rule Is not a Good Measure of Epistemic Utility (and Other Useful Facts about Epistemic Betterness)

&
Pages 576-590 | Received 30 Apr 2015, Accepted 28 Oct 2015, Published online: 14 Dec 2015
 

ABSTRACT

Measures of epistemic utility are used by formal epistemologists to make determinations of epistemic betterness among cognitive states. The Brier rule is the most popular choice (by far) among formal epistemologists for such a measure. In this paper, however, we show that the Brier rule is sometimes seriously wrong about whether one cognitive state is epistemically better than another. In particular, there are cases where an agent gets evidence that definitively eliminates a false hypothesis (and the probabilities assigned to the other hypotheses stay in the same ratios), but where the Brier rule says that things have become epistemically worse. Along the way to this ‘elimination experiment’ counter-example to the Brier rule as a measure of epistemic utility, we identify several useful monotonicity principles for epistemic betterness. We also reply to several potential objections to this counter-example.

Notes

1 Don Fallis and Dennis Whitcomb [Citation2009] consider how epistemic values beyond truth might fit into the framework of epistemic consequentialism.

2 For the sake of simplicity, in this paper we set aside the possibility that the different hypotheses might have different degrees of informativeness. We also set aside the possibility that the different false hypotheses might have different degrees of verisimilitude. See Graham Oddie [Citation2014] for a discussion of formal models of verisimilitude or ‘truthlikeness’.

3 Since we are focusing here on the degree to which an agent's cognitive state gets at the truth, we might speak instead about measures of inaccuracy and about minimizing expected inaccuracy as many formal epistemologists do (e.g. Joyce [Citation2009] and Leitgeb and Pettigrew [Citation2010a]).

4 Strictly speaking, Joyce's result is even stronger than this. For measures of epistemic utility that have certain independently desirable properties (such as propriety and separability), Joyce shows that, for any credences c that are not coherent, there are coherent credences c′ that have a higher epistemic utility than c regardless of which hypothesis happens to be true (see Pettigrew [Citation2013: 900–2]). In other words, he uses utility dominance rather than expected utility maximization to vindicate probabilism.

5 Goldman [Citation1999: 115–23] argues that conditionalizing on new evidence maximizes objectively expected epistemic utility and not just subjectively expected epistemic utility. But, as we discuss in the following section, the measure of epistemic utility that he uses to prove this result is not a proper scoring rule. So, it is not an appropriate measure of epistemic utility. Moreover, Goldman's result about conditionalization does not hold for any bounded proper scoring rule [Fallis and Liddell Citation2002].

6 Note that this statement of propriety is not restricted to coherent credences. Also, some epistemologists (e.g. Oddie [Citation1997: 539] and Joyce [Citation2009: 276]) require that measures of epistemic utility be strictly proper scoring rules. That is, , for all r and s. All of the scoring rules under discussion in this paper are strictly proper.

7 Note that the following statements of these rules presuppose that credences are over a partition. See Joyce [Citation2009: 275] for statements of these rules for credences over a Boolean algebra. We discuss the issue of partitions versus Boolean algebras below. Also, the Brier rule is often given as a measure of inaccuracy (with the sign reversed) rather than as a measure of epistemic utility, as it is here. It is sometimes referred to as the quadratic rule.

8 For coherent credences, this sum is minimized when a probability of 1/n is assigned to all of the hypotheses in the partition. It is maximized when a probability of 1 is assigned to one of the hypotheses.

9 Since they handle this term a bit differently, the two rules do give a different weight to this epistemic cost.

10 If the probability assigned to the true hypothesis and the probability assigned to a false hypothesis increase, epistemic utility may increase or decrease depending on the details of the case.

11 In ‘A Strange Thing about the Brier Score’, Brian Knab and Miriam Schoenfield [Citation2015] suggest that ‘falsity distributions don't matter.’ Lewis and Fallis [Citation2014] argue that they do matter. But this debate is orthogonal to our concerns here. It is the specific way in which the Brier rule handles falsity distributions that we object to in this paper.

12 Like other PSRs, the Brier rule is typically used as a method for eliciting probability estimates [Bickel Citation2007]. Our criticism, though, is just of the Brier rule as a measure of epistemic utility.

13 According to the Brier rule, u1(r) = 0.625 and u1(s) = 0.333. According to the logarithmic rule, u1(r) = −0.301 and u1(s) = −0.477. According to the spherical rule, u1(r) = 0.817 and u1(s) = 0.577.

14 Since the probabilities have to sum to 1 for coherent credences, the other probabilities cannot all be exactly the same. That is, if ri is different than si, then some of the other probabilities in r and s have to be different as well.

15 According to the Brier rule, u1(r) = 0.875 and u1(s) = 0.625. According to the logarithmic rule, u1(r) = −0.125 and u1(s) = −0.301. According to the spherical rule, u1(r) = 0.949 and u1(s) = 0.817.

16 This claim is trivial for the logarithmic rule. Proofs of this claim for the Brier rule and for the spherical rule are in the Appendix. It may not be the case, though, that all PSRs endorse this principle.

17 According to the Brier rule, u1(r) = 0.500 and u1(s) = 0.333. According to the logarithmic rule, u1(r) = −0.301 and u1(s) = −0.477. According to the spherical rule, u1(r) = 0.707 and u1(s) = 0.577.

18 At least, it represents progress unless ‘the Bayesian agent has been so unfortunate as to assign the true hypothesis a zero prior’ [Earman Citation1992: 163]. Since we assume that the hypotheses under consideration are jointly exhaustive, the true hypothesis is included. In addition, in our ‘elimination experiment’ examples, the agent starts out assigning a non-zero probability to each of the hypotheses.

19 See Godfrey-Smith [Citation2011: 1294–5] and Fallis [Citation2015: 384–6] for further discussion of why these philosophers have themselves been misled.

20 If an agent shifts from s to r on the basis of evidence e, then rk/sk = pr(hk|e)/pr(hk) = pr(e|hk)/pr(e). So, ri/sirj/sj if and only if pr(e|hi) ≥ pr(e|hj). Thus, according to Ian Hacking's ‘Law of Likelihoods’, e supports hi at least as much as it does hj if and only if ri/sirj/sj.

21 More formally, what M3 says is that, if rj < sj for some ji, and there is a real number α such that αrk = sk for all kj, then ui(r) > ui(s).

22 This claim is again trivial for the logarithmic rule. A proof of this claim for the spherical rule is in the Appendix.

23 As noted above, all other things being equal, epistemic utility on the Brier rule decreases as the total probability assigned to the false hypotheses is concentrated on fewer false hypotheses. In many cases, including our ‘elimination experiment’ counter-example, this ‘epistemic cost’ is (according to the Brier rule) enough to outweigh the epistemic benefit of an increase in the probability assigned to the true hypothesis. Knab and Schoenfield [Citation2015] give a different example that also displays this effect. In offering an analysis of misleading evidence, Lewis and Fallis [Citation2014] independently discuss a very similar case. However, these examples are not as obviously in conflict with scientific practice as our ‘elimination experiment’ counter-example is.

24 A full Boolean algebra includes conjunctions, as well as disjunctions, of the basic hypotheses. However, since we are assuming here that the hypotheses under consideration are mutually exclusive, any conjunctions must be assigned a probability of zero.

25 It is possible to modify some of these principles so that they are applicable to a full Boolean algebra of hypotheses in which more than one hypothesis can be true. For instance, we could rewrite M4 as M4*: If ri/sirj/sj for all i and j such that hi is true and hj is false, then r is epistemically at least as good as s. Although this principle is true, it is far too weak for our purposes. Elimination experiments are always epistemically beneficial, but M4* does not entail this.

26 Thanks to an anonymous referee for suggesting this sort of objection.

27 The standard versions of the logarithmic rule and the spherical rule (as stated above) do not vindicate probabilism. Joyce [Citation2009: 275] proposes alternative versions of these rules that do. Unfortunately, unlike the standard versions, Joyce's versions of these rules are (just like the Brier rule) subject to ‘elimination experiment’ counter-examples. (His version of the logarithmic rule does say that r = (1/3, 2/3, 0) is epistemically better than s = (1/4, 1/2, 1/4) when h1 is true, but it says that s = (1/10, 8/10, 1/10) is epistemically better than r = (1/9, 8/9, 0) when h1 is true.) So, vindicating probabilism while avoiding ‘elimination experiment’ counter-examples might not be trivial.

28 We would like to thank Kobus Barnard, David Black, Kenny Easwaran, Branden Fitelson, Will Fleisher, Martin Frické, Peter Godfrey-Smith, Simon Goldstein, Stephen Hetherington, Terry Horgan, Jenann Ismael, James Joyce, Gerrard Liddell, Kay Mathiesen, Richard Pettigrew, Daniel Rubio, Jonah Schupbach, Elliott Sober, Julia Staffel, Dan Zelinski, and two anonymous referees for helpful feedback on earlier versions of this material. Much of this work was carried out while the first author was a visiting fellow at the Tanner Humanities Center at the University of Utah.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 53.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 94.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.