2.8 Plausibility Measures


2.8 Plausibility Measures

I conclude this chapter by considering an approach that is a generalization of all the approaches mentioned so far. This approach uses what are called plausibility measures, which are unfortunately not the same as the plausibility functions used in the Dempster-Shafer approach (although plausibility functions are instances of plausibility measures). I hope that the reader will be able to sort through any confusion caused by this overloading of terminology.

The basic idea behind plausibility measures is straightforward. A probability measure maps sets in an algebra over a set W of worlds to [0, 1]. A plausibility measure is more general; it maps sets in to some arbitrary partially ordered set. If Pl is a plausibility measure, Pl(U) denotes the plausibility of U.If Pl(U) Pl(V), then V is at least as plausible as U. Because the ordering is partial, it could be that the plausibility of two different sets is incomparable. An agent may not be prepared to order two sets in terms of plausibility.

Formally, a plausibility space is a tuple S = (W, , PI), where W is a set of worlds, is an algebra over W, and Pl maps sets in to some set D of plausibility values partially ordered by a relation D (so that D is reflexive, transitive, and antisymmetric). D is assumed to contain two special elements, D and D, such that D D d D D for all d D. As usual, the ordering <D is defined by taking d1 < D d2 if d1 D d2 and d1 d2. I omit the subscript D from D, < D, D, and D whenever it is clear from context.

There are three requirements on plausibility measures. The first two just enforce the standard convention that the whole space gets the maximum plausibility and the empty set gets the minimum plausibility ( and ). The third requirement says that a set must be at least as plausible as any of its subsets; that is, plausibility respects subsets.

  • Pl1. Pl() = .

  • Pl2. Pl(W) = .

  • Pl3. If U V, then Pl(U) Pl(V).

Clearly probability measures, lower and upper probabilities, inner and outer measures, Dempster-Shafer belief functions and plausibility functions, and possibility and necessity measures are all instances of plausibility measures, where D = [0, 1], = 0, = 1, and D is the standard ordering on the reals. Ranking functions are also instances of plausibility measures; in this case, D = *, = , = 0, and the ordering * is the opposite of the standard ordering on *; that is, x * y if and only if y x under the standard ordering.

In all these cases, the plausibility values are totally ordered. But there are also cases of interest where the plausibility values are not totally ordered. Two examples are given by starting with a partial preorder on W as in Section 2.7. The partial preorders e and s derived from can be used to define plausibility measures, although there is a minor subtle issue. Given , consider the plausibility space (W, 2W, Ple). Roughly speaking, Ple is the identity, and Pl(U) Ple(V) iff U e V. There is only one problem with this. The set of plausibility values is supposed to be a partial order, not just a preorder.

One obvious way around this problem is to allow the order D of plausibility values to be a preorder rather than a partial order. There would be no conceptual difficulty in doing this, and in fact I do it (briefly) for technical reasons in Section 5.4.3. I have restricted to partial orders here partly to be consistent with the literature and partly because there seems to be an intuition that if the likelihood of U is at least as great as that of V, and the likelihood of V is as great as that of U, then U and V have equal likelihoods. In any case, the particular problem of capturing e using plausibility measures can easily be solved. Define an equivalence relation ~ on 2W by taking U ~ V if U e V and V e U. Let [U] consist of all the sets equivalent to U; that is, [U] ={U : U ~ U}. Let W/~ = {[U]: U W}. Define a partial order on W/~ in the obvious way: [U] [V] iff U e V. It is easy to check that this order on W/~ is well-defined and makes W/~ a partial order (Exercise 2.54). Now taking E = W/~ and defining Ple(U) = [U]gives a well-defined plausibility measure. Exactly the same technique works for s.

For a perhaps more interesting example, suppose that is a set of probability measures on W. Both * and * give a way of comparing the likelihood of two subsets U and V of W. These two ways are incomparable; it is easy to find a set of probability measures on W and subsets U and V of W such that * (U) < * (V) and * (U) > * (V) (Exercise 2.55(a)). Rather than choosing between * and *, it is possible to associate a different plausibility measure with that captures both. Let Dint ={(a, b) :0 a b 1} (the int is for interval) and define (a, b) (a, b) iff b a. This puts a partial order on Dint, with Dint = (0, 0) and Dint = (1, 1). Define PI*, *(U) = (* (U), * (U)), . Thus, PI*, * associates with a set U two numbers that can be thought of as defining an interval in terms of the lower and upper probability of U. It is easy to check that PI*, * (U) PI*, * (V) if the upper probability of U is less than or equal to the lower probability of V. Clearly PI*, * satisfies Pl1–3, so it is indeed a plausibility measure, but one that puts only a partial (pre)order on events. A similar plausibility measure can be associated with a belief/plausibility function and with an inner/outer measure.

The trouble with *, *, and even Pl*,* is that they lose information. Example 2.3.4 gives one instance of this phenomenon; the fact that μ(r) = μ(b) for every measure μ 4 is lost by taking lower and upper probabilities. It is easy to generate other examples. For example, it is not hard to find a set of probability measures and subsets U, V of W such that μ(U) μ(V) for all μ 4 and μ(U) < μ(V) for some μ , but *(U) = *(V) and *(U) = *(V). Indeed, there exists an infinite set of probability measures such that μ(U) < μ(V) for all μ but * (U) = *(V) and *(U) = *(V) (Exercise 2.55(b)). If all the probability measures in agree that U is less likely than V, it seems reasonable to conclude that U is less likely than V. However, none of the plausibility measures *, *, or Pl*,* will necessarily draw this conclusion.

Fortunately, it is not hard to associate yet another plausibility measure with that does not lose this important information (and does indeed conclude that U is less likely than V ).

To explain this representation, it is easiest to consider first the case that is finite. Suppose = {μ1,,μn}. Then the idea is to define Pl(U) = (μ1(U, , μn(U)). That is, the plausibility of a set U is represented as a tuple, consisting of the probability of U according to each measure in . The ordering on tuples is pointwise: (a1, , an) (b1, , bn) if ai bi for i = 1, , n. There are two minor problems with this approach, both easily fixed. The first is that a set is unordered. Although the subscripts suggest that μ1 is the "first" element in , there is no first element in . On the other hand, there really is a first element in a tuple. Which probability measure in should be first, second, and so on? Another minor problem comes if consists of an uncountable number of elements; it is not clear how to represent the set of measures in as a tuple.

These problems can be dealt with in a straightforward way. Let D consist of all functions from to [0, 1]. The standard pointwise ordering on functions—that is, f g if f (μ) g(μ) for all μ —gives a partial order on D. Note that D is the function f : [0, 1] such that f (μ) = 0 for all μ and D is the function g such that g(μ) = 1 for all μ . For U W, let fU be the function such that fU(μ) = μ(U) for all μ . Define the plausibility measure Pl by taking Pl(U) = fU. Thus, Pl(U) Pl(V) iff μ(U) μ(V) for all μ . It is easy to see that fθ = D and fw D. Clearly Pl satisfies Pl1–3. Pl1 and Pl2 follow since Pl(θ) = fθ = D and Pl(W) = fW = D, while Pl3 holds because if U V, then μ(U) μ(V) for all μ . Note that if = {μ1,,μn}, then Pl(U) is the function f such that f(μi) = μi(U). This function can be identified with the tuple (μ1(U), , μn(U)).

To see how this representation works, consider Example 2.3.2 (the example with a bag of red, blue, and yellow marbles). Recall that this was modeled using the set 2 = {μa: a [0,.7]} of probabilities, where μa(red) = .3, μa(blue) = a, and μa(yellow) = .7 a. Then, for example, Pl2 (blue) = fblue, where fblue(μa) = μa(blue) = a for all a [0, .7]. Similarly,

  • Pl2 (red) = fred, where fred(μa) = .3,

  • Pl2 (yellow) = fyellow, where fyellow(μa) = .7 a,

  • Pl2 (red, blue) = fred,blue, where f{red, blue}(μa) = .3 + a.

The events yellow and blue are incomparable with respect to Pl2 since fyellow and fblue are incomparable (e.g., fyellow(μ.7) < fblue(μ.7) while fyellow(μ0)>fblue(μ0)).

On the other hand, consider the sets 3 and 4 from Example 2.3.4. Recall that 3 = {μ : μ(blue) .5, μ(yellow) .5}, and 4 ={μ : μ(b = μ(y)}. It is easy to check that Pl4 (blue) = Pl4 (yellow), while Pl3 (blue) and Pl3 (yellow) are incomparable.

This technique for defining a plausibility measure that represents a set of probability measures is quite general. The same approach can be used essentially without change to represent any set of plausibility measures as a single plausibility measure.

Plausibility measures are very general. Pl1–3 are quite minimal requirements, by design, and arguably are the smallest set of properties that a representation of likelihood should satisfy. It is, of course, possible to add more properties, some of which seem quite natural, but these are typically properties that some representation of uncertainty does not satisfy (see, e.g., Exercise 2.57).

What is the advantage of having this generality? This should hopefully become clearer in later chapters, but I can make at least some motivating remarks now. For one thing, by using plausibility measures, it is possible to prove general results about properties of representations of uncertainty. That is, it is possible to show that all representations of uncertainty that have property X also have property Y. Since it may be clear that, say, possibility measures and ranking functions have property X, then it immediately follows that both have property Y ; moreover, if Dempster-Shafer belief functions do not have property X, the proof may well give a deeper understanding as to why belief functions do not have property Y.

For example, it turns out that a great deal of mileage can be gained by assuming that there is some operation on the set of plausibility values such that Pl(U V) = Pl(U) Pl(V) if U and V are disjoint. (Unfortunately, the discussed here has nothing to do with the defined in the context of Dempster's Rule of Combination. I hope that it will be clear from context which version of is being used.) If such an exists, then Pl is said to be additive (with respect to ). Probability measures, possibility measures, and ranking functions are all additive. In the case of probability measures, is +; in the case of possibility measures, it is max; in the case of ranking functions, it is min. For the plausibility measure PlP, is essentially pointwise addition (see Section 3.9 for a more careful definition). However, belief functions are not additive; neither are plausibility functions, lower probabilities, or upper probabilities. There exist a set W, a belief function Bel on W, and pairwise disjoint subsets U1, U2, V1, V2 of W such that Bel(U1) = Bel(V1), Bel(U2) = Bel(V2), but Bel(U1 U2) Bel(V1 V2) (Exercise 2.56). It follows that there cannot be a function such that Bel(U V) = Bel(U) Bel(V). Similar arguments apply to plausibility functions, lower probabilities, and upper probabilities. Thus, in the most general setting, I do not assume additivity. Plausibility measures are of interest in part because they make it possible to investigate the consequences of assuming additivity. I return to this issue in Sections 3.9, 5.3, and 8.4.




Reasoning About Uncertainty
Reasoning about Uncertainty
ISBN: 0262582597
EAN: 2147483647
Year: 2005
Pages: 140

flylib.com © 2008-2017.
If you may any questions please contact us: flylib@qtcs.net