Estimating Variance under Estimating Mean . . . Interval and Fuzzy - - PowerPoint PPT Presentation

estimating variance under
SMART_READER_LITE
LIVE PREVIEW

Estimating Variance under Estimating Mean . . . Interval and Fuzzy - - PowerPoint PPT Presentation

Estimating Variance . . . Case of Measurement . . . Case of Expert . . . Estimating Mean . . . Estimating Variance under Estimating Mean . . . Interval and Fuzzy Estimating Variance . . . Estimating Variance . . . Uncertainty: Estimating


slide-1
SLIDE 1

Estimating Variance . . . Case of Measurement . . . Case of Expert . . . Estimating Mean . . . Estimating Mean . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Acknowledgments Title Page ◭◭ ◮◮ ◭ ◮ Page 1 of 11 Go Back Full Screen Close Quit

Estimating Variance under Interval and Fuzzy Uncertainty: Parallel Algorithms

Karen Villaverde

Department of Computer Science New Mexico State University Las Cruces, NM 88003, USA email kvillave@cs.nmsu.edu

Gang Xiang

Philips Healthcare Informatics El Paso, Texas email gxiang@acm.org

slide-2
SLIDE 2

Estimating Variance . . . Case of Measurement . . . Case of Expert . . . Estimating Mean . . . Estimating Mean . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Acknowledgments Title Page ◭◭ ◮◮ ◭ ◮ Page 2 of 11 Go Back Full Screen Close Quit

1. Estimating Variance under Uncertainty

  • Computing statistics is important: traditional data pro-

cessing starts with computing population mean and population variance: E = 1 n ·

n

  • i=1

xi, V = 1 n ·

n

  • i=1

(xi − E)2.

  • Traditional approach: assumes that we know the exact

values xi.

  • In practice: these values come either from measure-

ments or from expert estimates.

  • Uncertainty: in both cases, we get only approximations
  • xi to the actual (unknown) values xi.
  • Result: we only get approximate valued

E and V .

  • Question: what is the accuracy of these approxima-

tions?

slide-3
SLIDE 3

Estimating Variance . . . Case of Measurement . . . Case of Expert . . . Estimating Mean . . . Estimating Mean . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Acknowledgments Title Page ◭◭ ◮◮ ◭ ◮ Page 3 of 11 Go Back Full Screen Close Quit

2. Case of Measurement Uncertainty

  • The result

x of the measurement is, in general, different from the (unknown) actual value x: ∆x

def

= x − x = 0.

  • Upper bound ∆ is usually supplied by the manufac-

turer: |∆x| ≤ ∆.

  • Interval uncertainty: x ∈ [

x − ∆, x + ∆].

  • Probabilistic approach: often, we know probabilities of

different values of ∆x.

  • How these probabilities are determined: by comparing

with standard measuring instrument (SMI).

  • Cases when we do not know probabilities:

– cutting-edge measurements; – manufacturing.

  • Resulting problem: find the ranges E and V of E and V .
slide-4
SLIDE 4

Estimating Variance . . . Case of Measurement . . . Case of Expert . . . Estimating Mean . . . Estimating Mean . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Acknowledgments Title Page ◭◭ ◮◮ ◭ ◮ Page 4 of 11 Go Back Full Screen Close Quit

3. Case of Expert Uncertainty

  • Situation: an expert use natural language.
  • Example: “most probably, the value of the quantity is

between 6 and 7, but it is somewhat possible to have values between 5 and 8”.

  • Natural formalization: for every i, a fuzzy set µi(xi).
  • Resulting problem: given fuzzy numbers xi, find the

fuzzy numbers for E and V .

  • Reduction to interval case: the α-cut for C(x1, . . . , xn)

is equal to the range of C when xi are in the corre- sponding α-cuts: xi ∈ xi(α).

  • Conclusion: for each characteristic C(x1, . . . , xn), it is

sufficient to be able to compute the range C(x1, . . . , xn)

def

= {C(x1, . . . , xn) | x1 ∈ x1, . . . , xn ∈ xn}.

slide-5
SLIDE 5

Estimating Variance . . . Case of Measurement . . . Case of Expert . . . Estimating Mean . . . Estimating Mean . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Acknowledgments Title Page ◭◭ ◮◮ ◭ ◮ Page 5 of 11 Go Back Full Screen Close Quit

4. Estimating Mean under Interval Uncertainty: What Is Known

  • Fact: the arithmetic average E(x1, . . . , xn) is an in-

creasing function of x1, . . . , xn.

  • Conclusions:

– the smallest possible value E of E is attained when each value xi is the smallest possible (xi = xi); – the largest possible value E of E is attained when xi = xi for all i.

  • Resulting formulas: the range E of E is equal to

[E(x1, . . . , xn), E(x1, . . . , xn)], i.e., to E = [E, E] = 1 n · (x1 + . . . + xn), 1 n · (x1 + . . . + xn)

  • .
slide-6
SLIDE 6

Estimating Variance . . . Case of Measurement . . . Case of Expert . . . Estimating Mean . . . Estimating Mean . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Acknowledgments Title Page ◭◭ ◮◮ ◭ ◮ Page 6 of 11 Go Back Full Screen Close Quit

5. Estimating Mean under Interval Uncertainty: Par- allelization

  • General problem: for large n, the corresponding algo-

rithm may require a large computation time.

  • Possible solution: if we have several (p) processors, we

may speed up computations by parallelization.

  • Case of the mean – reminder:

E = [E, E] = 1 n · (x1 + . . . + xn), 1 n · (x1 + . . . + xn)

  • .
  • Parallelization:

– divide n elements into p groups of n/p elements; – each of p processors computes the sum of all the ele- ments from the corresponding group in time O(n/p); – then, we add p subsums.

  • Resulting computation time: O(n/p + log(p)).
slide-7
SLIDE 7

Estimating Variance . . . Case of Measurement . . . Case of Expert . . . Estimating Mean . . . Estimating Mean . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Acknowledgments Title Page ◭◭ ◮◮ ◭ ◮ Page 7 of 11 Go Back Full Screen Close Quit

6. Estimating Variance under Interval Uncertainty: What is Known

  • Problem: compute the range V = [V , V ] of the vari-

ance V over interval data xi ∈ [ xi − ∆i, xi + ∆i].

  • Known: there is a polynomial-time algorithm for com-

puting V .

  • In general: computing V is NP-hard.
  • In many practical situations: there are efficient algo-

rithms for computing V .

  • Example: consider narrowed intervals [x−

i , x+ i ], where

x−

i def

= xi − ∆i n and x+

i def

= xi + ∆i n .

  • Case: no two narrowed intervals are proper subsets of
  • ne another, i.e., [x−

i , x+ i ] ⊆ (x− j , x+ j ) for all i and j.

  • For this case: there exists an O(n · log(n)) time algo-

rithm for computing V .

slide-8
SLIDE 8

Estimating Variance . . . Case of Measurement . . . Case of Expert . . . Estimating Mean . . . Estimating Mean . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Acknowledgments Title Page ◭◭ ◮◮ ◭ ◮ Page 8 of 11 Go Back Full Screen Close Quit

7. Estimating Variance Under Interval Uncertainty: Main Idea

  • Reminder: V = M − E2, where M = 1

n ·

n

  • i=1

x2

i and

E = 1 n ·

n

  • i=1

xi.

  • Main lemma: if we sort the narrowed intervals in lexi-

cographic order, then V is attained at one of the vectors x = (x1, . . . , xk, xk+1, . . . , xn).

  • Conclusion: for some k, we have V = Mk − E2

k, where

Mk = M k + M k, Ek = Ek + Ek, M k = 1 n ·

k

  • i=1

(xi)2, M k = 1 n ·

n

  • i=k+1

(xi)2, Ek = 1 n ·

k

  • i=1

xi, Ek = 1 n ·

n

  • i=k+1

xi.

slide-9
SLIDE 9

Estimating Variance . . . Case of Measurement . . . Case of Expert . . . Estimating Mean . . . Estimating Mean . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Acknowledgments Title Page ◭◭ ◮◮ ◭ ◮ Page 9 of 11 Go Back Full Screen Close Quit

8. Estimating Variance Under Interval Uncertainty: Resulting Algorithm

  • First, we sort the intervals; this takes time O(n·log(n)).
  • Then, for every k, we compute M k, M k, Ek, Ek, and

Vk = (M k + M k) − (Ek + Ek)2 : – computing the values for k = 0 takes linear time O(n); – then, we update in O(1) steps for each k, e.g., M k+1 = M k + 1 n · (xk+1)2.

  • Finally, we find the largest of the values V0, . . . , Vn+1;

this takes O(n) time.

  • Total time O(n · log(n)) + O(n) + n · O(1) + O(n) =

O(n · log(n)).

slide-10
SLIDE 10

Estimating Variance . . . Case of Measurement . . . Case of Expert . . . Estimating Mean . . . Estimating Mean . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Acknowledgments Title Page ◭◭ ◮◮ ◭ ◮ Page 10 of 11 Go Back Full Screen Close Quit

9. Estimating Variance Under Interval Uncertainty: Parallel Algorithm If we have p < n processors, then we can:

  • on Stage 1, sort n values in time O

n · log(n) p + log(n)

  • ;
  • on Stage 2, compute all n + 1 values M k, M k, Ek, Ek

(and hence Vk) in time in O(n/p + log(p));

  • on Stage 3, compute the maximum of V0, . . . , Vn in time

O(n/p + log(p)). Overall, we thus need time (since p ≤ n): O n · log(n) p + log(n)

  • +O

n p + log(p)

  • +O

n p + log(p)

  • =

O n · log(n) p + log(n)

  • .
slide-11
SLIDE 11

Estimating Variance . . . Case of Measurement . . . Case of Expert . . . Estimating Mean . . . Estimating Mean . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Estimating Variance . . . Acknowledgments Title Page ◭◭ ◮◮ ◭ ◮ Page 11 of 11 Go Back Full Screen Close Quit

10. Acknowledgments This work was supported in part

  • by an internal grant from New Mexico State University,
  • by NSF grants HRD-0734825, EAR-0225670, and EIA-

0080940, and

  • by Texas Department of Transportation contract No.

0-5453. The authors are thankful to the anonymous referees for valuable suggestions.