aggregation functions and information fusion modeling
play

Aggregation functions and information fusion. Modeling decisions - PowerPoint PPT Presentation

Aggregation functions and information fusion. Modeling decisions Vicen c Torra Universitat de Sk ovde (HiS, Sweden) July, 2017 Summary Aggregation functions There is life beyond the (weighted) mean Important concepts: the


  1. Aggregation functions • Goals of data aggregation ( goals of the area ): ◦ Formalization of the aggregation process ◦ Definition of new functions (and impossibility results!) ◦ Selection of functions (methods to decide which is the most appropriate function in a given context) ◦ Parameter determination Vicen¸ c Torra; Modeling decisions 2017 14 / 91

  2. Aggregation functions • Goals of data aggregation ( goals of the area ): ◦ Formalization of the aggregation process ◦ Definition of new functions (and impossibility results!) ◦ Selection of functions (methods to decide which is the most appropriate function in a given context) ◦ Parameter determination ◦ Study of existing methods: ◦ Caracterization of functions ◦ Determination of the modeling capabilities of the functions ◦ Relation between operators and parameters (how parameters influence the result: dictatorship?, sensitivity to data → index) . Vicen¸ c Torra; Modeling decisions 2017 14 / 91

  3. Aggregation: II. on the definition of aggregation functions Vicen¸ c Torra; Modeling decisions 2017 15 / 91

  4. Aggregation functions Definition of aggregation functions: • 1. Definition from properties: E.g., results consistent with changes of scale: C ( ra 1 , . . . , ra n ) = r C ( a 1 , . . . , a n ) • 2. Heuristic definition: from functions to properties E.g., after some testing we decide to use xxxxx : properties? • 3. Definition from examples E.g., find C that approximates the examples properties function examples Vicen¸ c Torra; Modeling decisions 2017 16 / 91

  5. Aggregation functions • 1. Definition from properties properties − → function Vicen¸ c Torra; Modeling decisions 2017 17 / 91

  6. Aggregation functions • 1. Definition from properties properties − → function • Some alternatives (1.a) Expressing properties as equations: functional equations Vicen¸ c Torra; Modeling decisions 2017 17 / 91

  7. Aggregation functions • 1. Definition from properties properties − → function • Some alternatives (1.a) Expressing properties as equations: functional equations (1.b) Aggregation of a 1 , a 2 , . . . , a N ∈ D , as the datum c which is at a minimum distance from a i : � c { d ( c, a i ) } , C ( a 1 , a 2 , . . . , a N ) = arg min a i d is a distance over D . Vicen¸ c Torra; Modeling decisions 2017 17 / 91

  8. Aggregation functions • Functional equations (case 1.a). ◦ What is a functional equation ? • Equations where the unknown are functions ◦ Example. Cauchy equation (a well known functional equation): φ ( x + y ) = φ ( x ) + φ ( y ) ◦ find φ ! Vicen¸ c Torra; Modeling decisions 2017 18 / 91

  9. Aggregation functions • Functional equations (case 1.a). ◦ What is a functional equation ? • Equations where the unknown are functions ◦ Example. Cauchy equation (a well known functional equation): φ ( x + y ) = φ ( x ) + φ ( y ) ◦ find φ ! ◦ Solution: For continuous φ , φ ( x ) = αx for an arbitrary value for α Vicen¸ c Torra; Modeling decisions 2017 18 / 91

  10. Aggregation functions • Functional equations (case 1.a). Example I in aggregation. ◦ Distribute s euros among m projects according to the opinion of N experts · · · · · · Proj 1 Proj 2 Proj j Proj m x 1 x 1 x 1 x 1 · · · · · · E 1 1 2 m j x 2 x 2 x 2 x 2 E 2 · · · · · · 1 2 j m . . . . . . . . . . . . x i x i x i x i E i · · · · · · 1 2 j m . . . . . . . . . . . . x N x N x N x N · · · · · · E N 1 2 j m DM f 1 ( x 1 ) f 2 ( x 2 ) · · · f j ( x j ) · · · f m ( x m ) Vicen¸ c Torra; Modeling decisions 2017 19 / 91

  11. Aggregation functions • Functional equations (case 1.a). Example I in aggregation. ◦ The general solution of the system (Prop. 3.11) for m > 2 f j : [0 , s ] N → R + for j = { 1 , · · · , m } m m � � x j = s implies that f j ( x j ) = s j =1 j =1 f j ( 0 ) = 0 for j = 1 , · · · , m Vicen¸ c Torra; Modeling decisions 2017 20 / 91

  12. Aggregation functions • Functional equations (case 1.a). Example I in aggregation. ◦ The general solution of the system (Prop. 3.11) for m > 2 f j : [0 , s ] N → R + for j = { 1 , · · · , m } m m � � x j = s implies that f j ( x j ) = s j =1 j =1 f j ( 0 ) = 0 for j = 1 , · · · , m is given by N � f 1 ( x ) = f 2 ( x ) = · · · = f m ( x ) = f (( x 1 , x 2 , . . . , x N )) = α i x i , i =1 where α 1 , · · · , α N are nonnegative constants satisfying � N i =1 α i = 1 , but are otherwise arbitrary. Vicen¸ c Torra; Modeling decisions 2017 20 / 91

  13. Aggregation functions • Functional equations (case 1.a). Example II in aggregation. ◦ (Prop. 4.17) An operator C is separable in terms of a unique monotone increasing g (that is, C ( a 1 , . . . , a N ) = g ( a 1 ) ◦ · · · ◦ g ( a N ) ) (with ◦ continuous, associative, and cancellative) Vicen¸ c Torra; Modeling decisions 2017 21 / 91

  14. Aggregation functions • Functional equations (case 1.a). Example II in aggregation. ◦ (Prop. 4.17) An operator C is separable in terms of a unique monotone increasing g (that is, C ( a 1 , . . . , a N ) = g ( a 1 ) ◦ · · · ◦ g ( a N ) ) (with ◦ continuous, associative, and cancellative) N � C ( a 1 , . . . , a N ) = φ − 1 ( φ ( g ( a i ))) i =1 and satisfies unanimity C ( a, . . . , a ) = a if and only if it is of the form (quasi-arithmetic mean) N C ( a 1 , . . . , a N ) = φ − 1 ( 1 � φ ( a i )) . N i =1 Vicen¸ c Torra; Modeling decisions 2017 21 / 91

  15. Aggregation functions • Functional equations (case 1.a). Quasi-arithmetic means Name Generator function C ( a 1 , . . . , a N ) � N i =1 x i Arithmetic mean φ ( x ) = x N � � N N i =1 x i Geometric mean φ ( x ) = log x N N Harmonic mean φ ( x ) = 1 /x � N 1 i =1 xi � � N i =1 x 2 φ ( x ) = x 2 Root-mean-square i N � � N i =1 x α α φ ( x ) = x α Root-mean-power i N � � N i =1 e xi φ ( x ) = e x � Exponential mean log N �� − 1 � � N i =1 c 1 /xi � φ ( x ) = c 1 /x Radical mean log c N i =1 x xi � N m s.t. m m = φ ( x ) = x x Basis-exponential mean i N i =1 x 1 /xi � N m s.t. m 1 /m = φ ( x ) = x 1 /x Basis-radical mean i N Vicen¸ c Torra; Modeling decisions 2017 22 / 91

  16. Aggregation functions • Functional equations (case 1.a). Example III in aggregation. ◦ (Prop. 4.20) An operator C is separable in terms of a unique monotone increasing g (that is, C ( a 1 , . . . , a N ) = g ( a 1 ) ◦ · · · ◦ g ( a N ) ) (with ◦ continuous, associative, and cancellative) N � C ( a 1 , . . . , a N ) = φ − 1 ( φ ( g ( a i ))) i =1 and satisfies unanimity C ( a, . . . , a ) = a and positive homogeneity C ( ra 1 , . . . , ra N ) = r C ( a 1 , . . . , a N ) Vicen¸ c Torra; Modeling decisions 2017 23 / 91

  17. Aggregation functions • Functional equations (case 1.a). Example III in aggregation (cont.) ◦ if and only if C is either the root-mean-power N C ( a 1 , . . . , a N ) = ( 1 � a α i ) 1 /α N i =1 with parameter α � = 0 ( RMP α ) or the geometric mean N � a i ) 1 /N . C ( a 1 , . . . , a N ) = ( i =1 Vicen¸ c Torra; Modeling decisions 2017 24 / 91

  18. Aggregation functions • Functional equations (case 1.a). Example III in aggregation (cont.) ◦ if and only if C is either the root-mean-power N C ( a 1 , . . . , a N ) = ( 1 � a α i ) 1 /α N i =1 with parameter α � = 0 ( RMP α ) or the geometric mean N � a i ) 1 /N . C ( a 1 , . . . , a N ) = ( i =1 • Remember Ex. 4.21. Assessing performance of Java runtime systems. Only RMP and GM are acceptable !! (Note: lim α → 0 RMP α = GM ) • Root-mean-powers, known as r th power mean, generalized mean. Vicen¸ c Torra; Modeling decisions 2017 24 / 91

  19. Aggregation functions • Functional equations (case 1.a). Example IV in aggregation. ◦ (Prop. 4.24) When we add the equation of reciprocity C (1 /a 1 , . . . , 1 /a N ) = 1 / C ( a 1 , . . . , a N ) the only operator C satisfying all conditions is the geometric mean N � a i ) 1 /N . C ( a 1 , . . . , a N ) = ( i =1 Vicen¸ c Torra; Modeling decisions 2017 25 / 91

  20. Aggregation functions • Functional equations (case 1.a). Root-mean-powers ( r th power mean, generalized mean) Name α C ( a 1 , . . . , a N ) � N i =1 x i Arithmetic mean α = 1 N � � N i =1 x 2 Root-mean-square α = 2 i N N Harmonic mean α = − 1 � N 1 i =1 xi Vicen¸ c Torra; Modeling decisions 2017 26 / 91

  21. Aggregation functions • Example (case (b)): Consider the following expression � c { d ( c, a i ) } , C ( a 1 , a 2 , . . . , a N ) = arg min a i where a i are numbers from R and d is a distance on D . Then, Vicen¸ c Torra; Modeling decisions 2017 27 / 91

  22. Aggregation functions • Example (case (b)): Consider the following expression � c { d ( c, a i ) } , C ( a 1 , a 2 , . . . , a N ) = arg min a i where a i are numbers from R and d is a distance on D . Then, 1. When d ( a, b ) = ( a − b ) 2 , C is the arithmetic mean I.e., C ( a 1 , a 2 , . . . , a N ) = � N i =1 a i /N . 2. When d ( a, b ) = | a − b | , C is the median I.e., the median of a 1 , a 2 , . . . , a N is the element which occupies the central position when we order a i . 3. When d ( a, b ) = 1 iff a = b , C is the plurality rule (mode or voting). I.e., C ( a 1 , a 2 , . . . , a N ) selects the element of R with a largest frequency among elements in ( a 1 , a 2 , . . . , a N ) . Vicen¸ c Torra; Modeling decisions 2017 27 / 91

  23. Aggregation functions • 2. Heuristic definition: from functions to properties function − → properties ◦ We can use functional equations for this purpose: characterizations of functions ◦ We have propositions with “if and only if” eq1, eq2, eq3 if and only if aggr. function so, given a function we know the fundamental properties of the function. E.g., geometric mean: a fundamental property is reciprocity • Note: characterizations are not unique Vicen¸ c Torra; Modeling decisions 2017 28 / 91

  24. Aggregation: III. from the weighted mean to fuzzy integrals Vicen¸ c Torra; Modeling decisions 2017 29 / 91

  25. Aggregation: III. from the weighted mean to fuzzy integrals 1. An example Vicen¸ c Torra; Modeling decisions 2017 30 / 91

  26. Aggregation: example Example. A and B teaching a tutorial+training course w/ constraints • The total number of sessions is six. • Professor A will give the tutorial, which should consist of about three sessions; three is the optimal number of sessions; a difference in the number of sessions greater than two is unacceptable. • Professor B will give the training part, consisting of about two sessions. • Both professors should give more or less the same number of sessions. A difference of one or two is half acceptable; a difference of three is unacceptable. Vicen¸ c Torra; Modeling decisions 2017 31 / 91

  27. Aggregation: example Example. Formalization • Variables ◦ x A : Number of sessions taught by Professor A ◦ x B : Number of sessions taught by Professor B • Constraints ◦ the constraints are translated into ⋆ C 1 : x A + x B should be about 6 ⋆ C 2 : x A should be about 3 ⋆ C 3 : x B should be about 2 ⋆ C 4 : | x A − x B | should be about 0 ◦ using fuzzy sets, the constraints are described ... Vicen¸ c Torra; Modeling decisions 2017 32 / 91

  28. Aggregation: example Example. Formalization • Constraints ◦ if fuzzy set µ 6 expresses “about 6,” then, we evaluate “ x A + x B should be about 6” by µ 6 ( x A + x B ) . → given µ 6 , µ 3 , µ 2 , µ 0 , ◦ Then, given a solution pair ( x A , x B ) , the degrees of satisfaction: ⋆ µ 6 ( x A + x B ) ⋆ µ 3 ( x A ) ⋆ µ 2 ( x B ) ⋆ µ 0 ( | x A − x B | ) Vicen¸ c Torra; Modeling decisions 2017 33 / 91

  29. Aggregation: example Example. Formalization • Membership functions for constraints µ 2 µ 0 µ 3 µ 6 1 2 3 4 5 6 7 Vicen¸ c Torra; Modeling decisions 2017 34 / 91

  30. Aggregation: example Example. Application alternative Satisfaction degrees Satisfaction degrees ( x A , x B ) ( µ 6 ( x A + x B ) , µ 3 ( x A ) , C 1 C 2 C 3 C 4 µ 2 ( x B ) , µ 0 ( | x A − x B | ) ) (2 , 2) ( µ 6 (4) , µ 3 (2) , µ 2 (2) , µ 0 (0) ) 0 0 . 5 1 1 (2 , 3) ( µ 6 (5) , µ 3 (2) , µ 2 (3) , µ 0 (1) ) 0 . 5 0 . 5 0 . 5 0 . 5 (2 , 4) ( µ 6 (6) , µ 3 (2) , µ 2 (4) , µ 0 (2) ) 1 0 . 5 0 0 . 5 (3 . 5 , 2 . 5) ( µ 6 (6) , µ 3 (3 . 5) , µ 2 (2 . 5) , µ 0 (1) ) 1 0 . 5 0 . 5 0 . 5 (3 , 2) ( µ 6 (5) , µ 3 (3) , µ 2 (2) , µ 0 (1) ) 0 . 5 1 1 0 . 5 (3 , 3) ( µ 6 (6) , µ 3 (3) , µ 2 (3) , µ 0 (0) ) 1 1 0 . 5 1 Vicen¸ c Torra; Modeling decisions 2017 35 / 91

  31. Aggregation: III. from the weighted mean to fuzzy integrals 2. WM, OWA, and WOWA operators Vicen¸ c Torra; Modeling decisions 2017 36 / 91

  32. Aggregation: WM, OWA, and WOWA operators • Operators ◦ Weighting vector (dimension N ): v = ( v 1 ...v N ) iff v i ∈ [0 , 1] and � i v i = 1 ◦ Arithmetic mean (AM : R N → R ): AM ( a 1 , ..., a N ) = (1 /N ) � N i =1 a i ◦ Weighted mean (WM: R N → R ): WM p ( a 1 , ..., a N ) = � N i =1 p i a i ( p a weighting vector of dimension N ) ◦ Ordered Weighting Averaging operator (OWA: R N → R ): N � OWA w ( a 1 , ..., a N ) = w i a σ ( i ) , i =1 where { σ (1) , ..., σ ( N ) } is a permutation of { 1 , ..., N } s. t. a σ ( i − 1) ≥ a σ ( i ) , and w a weighting vector. Vicen¸ c Torra; Modeling decisions 2017 37 / 91

  33. Aggregation: WM, OWA, and WOWA operators Example. Application • Let us consider the following situation: ◦ Professor A is more important than Professor B ◦ The number of sessions equal to six is the most important constraint (not a crisp requirement) ◦ The difference in the number of sessions taught by the two professors is the least important constraint WM with p = ( p 1 , p 2 , p 3 , p 4 ) = (0 . 5 , 0 . 3 , 0 . 15 , 0 . 05) . Vicen¸ c Torra; Modeling decisions 2017 38 / 91

  34. Aggregation: WM, OWA, and WOWA operators Example. Application • WM with p = ( p 1 , p 2 , p 3 , p 4 ) = (0 . 5 , 0 . 3 , 0 . 15 , 0 . 05) . alternative Aggregation of the Satisfaction degrees WM ( x A , x B ) WM p ( C 1 , C 2 , C 3 , C 4 ) (2 , 2) WM p (0 , 0 . 5 , 1 , 1) 0.35 (2 , 3) WM p (0 . 5 , 0 . 5 , 0 . 5 , 0 . 5) 0.5 (2 , 4) WM p (1 , 0 . 5 , 0 , 0 . 5) 0.675 (3 . 5 , 2 . 5) WM p (1 , 0 . 5 , 0 . 5 , 0 . 5) 0.75 (3 , 2) WM p (0 . 5 , 1 , 1 , 0 . 5) 0.725 (3 , 3) WM p (1 , 1 , 0 . 5 , 1) 0.925 Vicen¸ c Torra; Modeling decisions 2017 39 / 91

  35. Aggregation: WM, OWA, and WOWA operators Example. Application • Compensation: how many values can have a bad evaluation • One bad value does not matter: OWA with w = (1 / 3 , 1 / 3 , 1 / 3 , 0) (lowest value discarded) alternative Aggregation of the Satisfaction degrees OWA ( x A , x B ) OWA w ( C 1 , C 2 , C 3 , C 4 ) (2 , 2) OWA w (0 , 0 . 5 , 1 , 1) 0.8333 (2 , 3) OWA w (0 . 5 , 0 . 5 , 0 . 5 , 0 . 5) 0.5 (2 , 4) OWA w (1 , 0 . 5 , 0 , 0 . 5) 0.6666 (3 . 5 , 2 . 5) OWA w (1 , 0 . 5 , 0 . 5 , 0 . 5) 0.6666 (3 , 2) OWA w (0 . 5 , 1 , 1 , 0 . 5) 0.8333 (3 , 3) OWA w (1 , 1 , 0 . 5 , 1) 1.0 Vicen¸ c Torra; Modeling decisions 2017 40 / 91

  36. Aggregation: WM, OWA, and WOWA operators • Weighted Ordered Weighted Averaging WOWA operator (WOWA : R N → R ): WOWA p , w ( a 1 , ..., a N ) = � N i =1 ω i a σ ( i ) where ω i = w ∗ ( � j ≤ i p σ ( j ) ) − w ∗ ( � j<i p σ ( j ) ) , with σ a permutation of { 1 , ..., N } s. t. a σ ( i − 1) ≥ a σ ( i ) , and w ∗ a nondecreasing function that interpolates the points { ( i/N, � j ≤ i w j ) } i =1 ,...,N ∪ { (0 , 0) } . w ∗ is required to be a straight line when the points can be interpolated in this way. Vicen¸ c Torra; Modeling decisions 2017 41 / 91

  37. Aggregation: WM, OWA, and WOWA operators • Construction of the w ∗ quantifier (a) (b) ( ) w w N N w w 2 2 ! 1 w 1 w 1 ! 1 � 0 � p � (1) p � (1) 1 = N 1 = N ::: 1 = N 0 0 0 0 p p p p p � (1) � (1) � (2) � ( N ) � (1) • Rationale for new weights ( ω i , for each value a i ) in terms of p and w . ◦ If a i is small, and small values have more importance than larger ones, increase p i for a i (i.e., ω i ≥ p σ ( i ) ). (the same holds if the value a i is large and importance is given to large values) ◦ If a i is small, and importance is for large values, ω i < p σ ( i ) (the same holds if a i is large and importance is given to small values). Vicen¸ c Torra; Modeling decisions 2017 42 / 91

  38. Aggregation: WM, OWA, and WOWA operators • The shape of the function w ∗ gives importance ◦ (a) to large values ◦ (b) to medium values ◦ (c) to small values ◦ (d) equal importance to all values (a) (b) (c) (d) Vicen¸ c Torra; Modeling decisions 2017 43 / 91

  39. Aggregation: WM, OWA, and WOWA operators Example. Application • Importance for constraints as given above: p = (0 . 5 , 0 . 3 , 0 . 15 , 0 . 05) • Compensation as given above: w = (1 / 3 , 1 / 3 , 1 / 3 , 0) (lowest value discarded) → WOWA with p and w . alternative Aggregation of the Satisfaction degrees WOWA ( x A , x B ) WOWA p , w ( C 1 , C 2 , C 3 , C 4 ) (2 , 2) WOWA p , w (0 , 0 . 5 , 1 , 1) 0.4666 (2 , 3) WOWA p , w (0 . 5 , 0 . 5 , 0 . 5 , 0 . 5) 0.5 (2 , 4) WOWA p , w (1 , 0 . 5 , 0 , 0 . 5) 0.8333 (3 . 5 , 2 . 5) WOWA p , w (1 , 0 . 5 , 0 . 5 , 0 . 5) 0.8333 (3 , 2) WOWA p , w (0 . 5 , 1 , 1 , 0 . 5) 0.8 (3 , 3) WOWA p , w (1 , 1 , 0 . 5 , 1) 1.0 Vicen¸ c Torra; Modeling decisions 2017 44 / 91

  40. Aggregation: WM, OWA, and WOWA operators • Properties ◦ The WOWA operator generalizes the WM and the OWA operator. ◦ When p = (1 /N . . . 1 /N ) , OWA WOWA p , w ( a 1 , ..., a N ) = OWA w ( a 1 , ..., a N ) for all w and a i . ◦ When w = (1 /N ... 1 /N ) , WM WOWA p , w ( a 1 , ..., a N ) = WM p ( a 1 , ..., a N ) for all p and a i . ◦ When w = p = (1 /N ... 1 /N ) , AM WOWA p , w ( a 1 , ..., a N ) = AM ( a 1 , ..., a N ) Vicen¸ c Torra; Modeling decisions 2017 45 / 91

  41. Aggregation: III. from the weighted mean to fuzzy integrals 3. Choquet integral Vicen¸ c Torra; Modeling decisions 2017 46 / 91

  42. Choquet integrals • In the WM, a single weight is used for each element I.e., p i = p ( x i ) (where, x i is the information source that supplies a i ) → when we consider a set A ⊂ X , weight of A ??? Vicen¸ c Torra; Modeling decisions 2017 47 / 91

  43. Choquet integrals • In the WM, a single weight is used for each element I.e., p i = p ( x i ) (where, x i is the information source that supplies a i ) → when we consider a set A ⊂ X , weight of A ??? . . . fuzzy measures µ ( A ) Vicen¸ c Torra; Modeling decisions 2017 47 / 91

  44. Choquet integrals • Example. ◦ We need to evaluate students (who is best?) using marks in three subjects X= { Mathematics, Physics, Literature } ( M , P , L ) ◦ p M = 0 . 4 , p P = 0 . 4 , p L = 0 . 2 . In the WM, a single weight is used for each element I.e., p i = p ( x i ) (where, x i is the information source that supplies a i ) → when we consider a set A ⊂ X , weight of A ??? ◦ p(Mathematics, Physics) ? Vicen¸ c Torra; Modeling decisions 2017 48 / 91

  45. Choquet integrals • Example. ◦ We need to evaluate students (who is best?) using marks in three subjects X= { Mathematics, Physics, Literature } ( M , P , L ) ◦ p M = 0 . 4 , p P = 0 . 4 , p L = 0 . 2 . In the WM, a single weight is used for each element I.e., p i = p ( x i ) (where, x i is the information source that supplies a i ) → when we consider a set A ⊂ X , weight of A ??? ◦ p(Mathematics, Physics) ? . . . fuzzy measures µ ( A ) Vicen¸ c Torra; Modeling decisions 2017 48 / 91

  46. Choquet integrals • fuzzy measures µ ( A ) : X ⊂ X = { M, P, L } P M L Vicen¸ c Torra; Modeling decisions 2017 49 / 91

  47. Choquet integrals • fuzzy measures µ ( A ) Formally, ◦ Fuzzy measure ( µ : ℘ ( X ) → [0 , 1] ), a set function satisfying (i) µ ( ∅ ) = 0 , µ ( X ) = 1 (boundary conditions) (ii) A ⊆ B implies µ ( A ) ≤ µ ( B ) (monotonicity) Vicen¸ c Torra; Modeling decisions 2017 50 / 91

  48. Choquet integrals • fuzzy measures µ ( A ) Formally, ◦ Fuzzy measure ( µ : ℘ ( X ) → [0 , 1] ), a set function satisfying (i) µ ( ∅ ) = 0 , µ ( X ) = 1 (boundary conditions) (ii) A ⊆ B implies µ ( A ) ≤ µ ( B ) (monotonicity) ◦ Similar to probability or standard (additive) measures, but additivity condition is removed replaced by monotonicity Vicen¸ c Torra; Modeling decisions 2017 50 / 91

  49. Choquet integrals • fuzzy measures µ ( A ) Formally, ◦ Fuzzy measure ( µ : ℘ ( X ) → [0 , 1] ), a set function satisfying (i) µ ( ∅ ) = 0 , µ ( X ) = 1 (boundary conditions) (ii) A ⊆ B implies µ ( A ) ≤ µ ( B ) (monotonicity) ◦ Similar to probability or standard (additive) measures, but additivity condition is removed replaced by monotonicity ◦ Why? to represent redundancy and support (for A ∩ B = ∅ ) µ ( A ∪ B ) < µ ( A ) + µ ( B ) µ ( A ∪ B ) > µ ( A ) + µ ( B ) Vicen¸ c Torra; Modeling decisions 2017 50 / 91

  50. Choquet integrals • Fuzzy measures µ ( A ) : Example with X = { M, P, L } 1. Boundary conditions: µ ( ∅ ) = 0 , µ ( { M, P, L } ) = 1 2. Relative importance of scientific versus literary subjects: µ ( { M } ) = µ ( { P } ) = 0 . 45 , µ ( { L } ) = 0 . 3 3. Redudancy between mathematics and physics: µ ( { M, P } ) = 0 . 5 < µ ( { M } ) + µ ( { P } ) 4. Support between literature and scientific subjects: µ ( { M, L } ) = µ ( { P, L } ) = 0 . 9 > µ ( { P } ) + µ ( { L } ) = 0 . 45 + 0 . 3 = 0 . 75 µ ( { M, L } ) = µ ( { P, L } ) = 0 . 9 > µ ( { M } ) + µ ( { L } ) = 0 . 45 + 0 . 3 = 0 . 75 Vicen¸ c Torra; Modeling decisions 2017 51 / 91

  51. Choquet integrals • Now, we have a fuzzy measure µ ( A ) then, how aggregation proceeds? ⇒ fuzzy integrals as e.g. the Choquet integral Vicen¸ c Torra; Modeling decisions 2017 52 / 91

  52. Choquet integrals • In WM, we combine a i w.r.t. weights p i . → a i is the value supplied by information source x i . Formally Vicen¸ c Torra; Modeling decisions 2017 53 / 91

  53. Choquet integrals • In WM, we combine a i w.r.t. weights p i . → a i is the value supplied by information source x i . Formally ◦ X = { x 1 , . . . , x N } is the set of information sources ◦ f : X → R + the values supplied by the sources → then a i = f ( x i ) Thus, N N � � WM p ( a 1 , ..., a N ) = p i a i = p i f ( x i ) = WM p ( f ( x 1 ) , ..., f ( x N )) i =1 i =1 Vicen¸ c Torra; Modeling decisions 2017 53 / 91

  54. Choquet integrals • Choquet integral of f w.r.t. µ (alternative notation, CI µ ( a 1 , . . . , a N ) /CI µ ( f ) ) N � � ( C ) fdµ = [ f ( x s ( i ) ) − f ( x s ( i − 1) )] µ ( A s ( i ) ) , i =1 where s in f ( x s ( i ) ) is a permutation so that f ( x s ( i − 1) ) ≤ f ( x s ( i ) ) for i ≥ 1 , f ( x s (0) ) = 0 , and A s ( k ) = { x s ( j ) | j ≥ k } and A s ( N +1) = ∅ . • Alternative expressions (Proposition 6.18): N � � ( C ) fdµ = f ( x σ ( i ) )[ µ ( A σ ( i ) ) − µ ( A σ ( i − 1) )] , i =1 N � � ( C ) fdµ = f ( x s ( i ) )[ µ ( A s ( i ) ) − µ ( A s ( i +1) )] , i =1 where σ is a permutation of { 1 , . . . , N } s.t. f ( x σ ( i − 1) ) ≥ f ( x σ ( i ) ) , where A σ ( k ) = { x σ ( j ) | j ≤ k } for k ≥ 1 and A σ (0) = ∅ Vicen¸ c Torra; Modeling decisions 2017 54 / 91

  55. Choquet integrals • Different equations point out different aspects of the CI fdµ = � N (6.1) ( C ) � i =1 [ f ( x s ( i ) ) − f ( x s ( i − 1) )] µ ( A s ( i ) ) , 0 a s (1) a s (2) a s (3) a s (4) a s (5) µ ( A s (2) ) µ ( A s (1) ) = { x s (1) , · · · , x s ( N ) } µ ( A s (4) ) = { x s (4) , · · · , x s ( N ) } fdµ = � N (6.2) ( C ) � i =1 f ( x σ ( i ) )[ µ ( A σ ( i ) ) − µ ( A σ ( i − 1) )] , Vicen¸ c Torra; Modeling decisions 2017 55 / 91

  56. Choquet integrals � • fdµ = (for additive measures) (6.5) � x ∈ X f ( x ) µ ( { x } ) (6.6) � R i =1 b i µ ( { x | f ( x ) = b i } ) (6.7) � N i =1 ( a i − a i − 1 ) µ ( { x | f ( x ) ≥ a i } ) (6.8) � N � � i =1 ( a i − a i − 1 ) 1 − µ ( { x | f ( x ) ≤ a i − 1 } ) (a) (b) (c) b i b i a i b i − 1 b i − 1 a i − 1 x 1 x N x 1 x N x 1 { x | f ( x ) = b i } { x | f ( x ) ≥ a i } x • Among (6.5), (6.6) and (6.7), only (6.7) satisfies internality. Vicen¸ c Torra; Modeling decisions 2017 56 / 91

  57. Choquet integrals • Properties of CI ◦ Horizontal additive because CI µ ( f ) = CI µ ( f ∧ c ) + CI µ ( f + c ) ( f = ( f ∧ c ) + f + c is a horizontal additive decomposition of f ) where, f + c is defined by (for c ∈ [0 , 1] ) � 0 if f ( x ) ≤ c f + c = f ( x ) − c if f ( x ) > c. f + f ∧ c f c c Vicen¸ c Torra; Modeling decisions 2017 57 / 91

  58. Choquet integrals • Definitions ( X a reference set, f, g functions f, g : X → [0 , 1] ) ◦ f < g when, for all x i , f ( x i ) < g ( x i ) ◦ f and g are comonotonic if, for all x i , x j ∈ X , f ( x i ) < f ( x j ) imply that g ( x i ) ≤ g ( x j ) ◦ C is comonotonic monotone if and only if, for comonotonic f and g , f ≤ g imply that C ( f ) ≤ C ( g ) ◦ C is comonotonic additive if and only if, for comonotonic f and g , C ( f + g ) = C ( f ) + C ( g ) • Characterization. Let C satisfy the following properties ◦ C is comonotonic monotone ◦ C is comonotonic additive ◦ C (1 , . . . , 1) = 1 Then, there exists µ s.t. C ( f ) is the CI of f w.r.t. µ . Vicen¸ c Torra; Modeling decisions 2017 58 / 91

  59. Choquet integrals • Properties ◦ WM, OWA and WOWA are particular cases of CI. ⋆ WM with weighting vector p is a CI w.r.t. µ p ( B ) = � x i ∈ B p i ⋆ OWA with weighting vector w is a CI w.r.t. µ w ( B ) = � | B | i =1 w i ⋆ WOWA with w.v. p and w is a CI w.r.t. µ p , w ( B ) = w ∗ ( � x i ∈ B p i ) ◦ Any CI with a symmetric measure is an OWA operator. ◦ Any CI with a distorted probability is a WOWA operator. ◦ Let A be a crisp subset of X ; then, the Choquet integral of A with respect to µ is µ ( A ) . Here, the integral of A corresponds to the integral of its characteristic function, or, in other words, to the integral of the function f A defined as f A ( x ) = 1 if and only if x ∈ A . Vicen¸ c Torra; Modeling decisions 2017 59 / 91

  60. Aggregation: III. from the weighted mean to fuzzy integrals 4. Weighted minimum and maximum Vicen¸ c Torra; Modeling decisions 2017 60 / 91

  61. Weighted Minimum and Weighted Maximum • Possibilistic weighting vector (dimension N ): v = ( v 1 ...v N ) iff v i ∈ [0 , 1] and max i v i = 1 . • Weighted minimum (WMin: [0 , 1] N → [0 , 1] ): WMin u ( a 1 , ..., a N ) = min i max( neg ( u i ) , a i ) (alternative definition can be given with v = ( v 1 , . . . , v N ) where v i = neg ( u i ) ) • Weighted maximum (WMax: [0 , 1] N → [0 , 1] ): WMax u ( a 1 , ..., a N ) = max i min( u i , a i ) Vicen¸ c Torra; Modeling decisions 2017 61 / 91

  62. Weighted Minimum and Weighted Maximum • Only operators in ordinal scales (max, min, neg ) are used in WMax and WMin . • neg is completely determined in an ordinal scale Proposition 6.36. Let L = { l 0 , . . . , l r } with l 0 < L l 1 < L · · · < L l r ; then, there exists only one function, neg : L → L , satisfying (N1) if x < L x ′ then neg ( x ) > L neg ( x ′ ) for all x, x ′ in L . (N2) neg ( neg ( x )) = x for all x in L . This function is defined by neg ( x i ) = x r − i for all x i in L • Properties. For u = (1 , . . . , 1) ◦ WMIN u = min ◦ WMAX u = max Vicen¸ c Torra; Modeling decisions 2017 62 / 91

  63. Aggregation: III. from the weighted mean to fuzzy integrals 5. Sugeno integral Vicen¸ c Torra; Modeling decisions 2017 63 / 91

  64. Sugeno integral • Sugeno integral of f w.r.t. µ (alternative notation, SI µ ( a 1 , . . . , a N ) /SI µ ( f ) ) � ( S ) fdµ = max i =1 ,N min( f ( x s ( i ) ) , µ ( A s ( i ) )) , where s in f ( x s ( i ) ) is a permutation so that f ( x s ( i − 1) ) ≤ f ( x s ( i ) ) for i ≥ 2 , and A s ( k ) = { x s ( j ) | j ≥ k } . • Alternative expression (Proposition 6.38): max min( f ( x σ ( i ) ) , µ ( A σ ( i ) )) , i where σ is a permutation of { 1 , . . . , N } s.t. f ( x σ ( i − 1) ) ≥ f ( x σ ( i ) ) , where A σ ( k ) = { x σ ( j ) | j ≤ k } for k ≥ 1 Vicen¸ c Torra; Modeling decisions 2017 64 / 91

  65. Sugeno integral • Graphical interpretation of Sugeno integrals (a) (b) ( ) f ( x ) f ( x ) s ( i ) � ( A ) s ( i ) R ( S ) f d� f ( x ) f ( x ) s ( i ) � ( A ) s ( i ) � ( A ) Vicen¸ c Torra; Modeling decisions 2017 65 / 91

  66. Sugeno integral • Properties ◦ WMin and WMax are particular cases of SI ⋆ WMax with weighting vector u is a SI w.r.t. µ wmax ( A ) = max a i ∈ A u i . u ⋆ WMin with weighting vector u is a SI w.r.t. µ wmin ( A ) = 1 − max a i / ∈ A u i . u Vicen¸ c Torra; Modeling decisions 2017 66 / 91

  67. Fuzzy integrals • Fuzzy integrals that generalize Choquet and Sugeno integrals ◦ The fuzzy t-conorm integral ◦ The twofold integral Vicen¸ c Torra; Modeling decisions 2017 67 / 91

  68. Aggregation: IV. fuzzy measures Vicen¸ c Torra; Modeling decisions 2017 68 / 91

  69. Fuzzy measures • Definition: (i) µ ( ∅ ) = 0 , µ ( X ) = 1 (boundary conditions) (ii) A ⊆ B implies µ ( A ) ≤ µ ( B ) (monotonicity) • Difficulty: ◦ 2 | X | − 2 values (because µ ( ∅ ) = 0 , µ ( X ) = 1 ) • Solution: Families of fuzzy measures (to reduce complexity) Vicen¸ c Torra; Modeling decisions 2017 69 / 91

  70. Fuzzy measures • Examples of families: ◦ ⊥ -Decomposable Fuzzy Measures ( ⊥ a t-conorm) µ ( A ∪ B ) = µ ( A ) ⊥ µ ( B ) . Therefore, for a given A : ⊥ x i ∈ A v ( x i ) ◦ Sugeno λ -measures (for λ > − 1 ) µ ( A ∪ B ) = µ ( A ) + µ ( B ) + λµ ( A ) µ ( B ) ⊥ -decomposable for ⊥ ( x, y ) = min (1 , x + y + λxy ) . ◦ Extensively used in computer vision applications Vicen¸ c Torra; Modeling decisions 2017 70 / 91

  71. Fuzzy measures • Examples of families: ◦ Distorted probabilities ( P : probability, f : increasing function) µ ( A ) = f ( P ( A )) ◦ Well known in economics (decision) ◦ m -dimensional distorted probability X 1 , . . . , X m , P i , f µ ( A ) = f ( P 1 ( A ∩ X 1 ) , P 2 ( A ∩ X 2 ) , · · · , P m ( A ∩ X m )) . Vicen¸ c Torra; Modeling decisions 2017 71 / 91

  72. Aggregation: V. preference relations (MCDM: social choice) Vicen¸ c Torra; Modeling decisions 2017 72 / 91

  73. Aggregation for preference relations • Social choice ◦ studies voting rules, and how the preferences of a set of people can be aggregated to obtain the preference of the set. Vicen¸ c Torra; Modeling decisions 2017 73 / 91

  74. Aggregation for preference relations • Given preference relations, how aggregation is built? ◦ Formalization of preferences with > an = (preference, indiference) ◦ F ( R 1 , R 2 , . . . , R N ) to denote aggregated preference Vicen¸ c Torra; Modeling decisions 2017 74 / 91

  75. Aggregation for preference relations • Given preference relations, how aggregation is built? ◦ Formalization of preferences with > an = (preference, indiference) ◦ F ( R 1 , R 2 , . . . , R N ) to denote aggregated preference ◦ Problems (I): consider ⋆ R 1 : x > y > z ⋆ R 4 : y > z > x ⋆ R 5 : z > x > y → simple majority rule: u > v if most prefer u to v ⋆ x > y, y > z, z > x (intransitive!!: x > y, y > z but not x > z ) ◦ Problems (II): → Arrow impossibility theorem Vicen¸ c Torra; Modeling decisions 2017 74 / 91

  76. Aggregation for preference relations • Given preference relations, how aggregation is built? • Axioms of Arrow impossibility theorem C0 Finite number of voters and more than one Number of alternatives more or equal to three C1 Universality: Voters can select any total preorder 1 C2 Transitivity: The result is a total preorder C3 Unanimity: If all agree on x better than y , then x better than y in the social preference C4 Independence of irrellevant alternatives: the social preference of x and y only depends on the preferences on x and y C5 No-dictatorship: No voter can be a dictatorship • There is no function F that satisfies all C0-C5 axioms 1 either x � y or y � x Vicen¸ c Torra; Modeling decisions 2017 75 / 91

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend