processing quantities with
play

Processing Quantities with Result for Addition . . . Heavy-Tailed - PowerPoint PPT Presentation

Need for Data Processing Need to Estimate . . . Traditional Statistical . . . Heavy-Tailed . . . Processing Quantities with Result for Addition . . . Heavy-Tailed Distribution of Case of a General . . . Case of the Product . . . Measurement


  1. Need for Data Processing Need to Estimate . . . Traditional Statistical . . . Heavy-Tailed . . . Processing Quantities with Result for Addition . . . Heavy-Tailed Distribution of Case of a General . . . Case of the Product . . . Measurement Uncertainty: General Asymptotics . . . What If We Only Have . . . How to Estimate Home Page the Tails of the Results Title Page of Data Processing ◭◭ ◮◮ ◭ ◮ capek 1 and Vladik Kreinovich 2 Michal Holˇ Page 1 of 16 1 Centre of Excellence IT4Innovations, University of Ostrava, Go Back Institute for Research and Applications of Fuzzy Modeling, Ostrava, Czech Republic, michal.holcapek@osu.cz Full Screen 2 University of Texas at El Paso El Paso, Texas 79968, USA, vladik@utep.edu Close Quit

  2. Need for Data Processing Need to Estimate . . . 1. Need for Data Processing Traditional Statistical . . . • We are often interested in the values of a quantity y Heavy-Tailed . . . which is not easy to measure directly, e.g.: Result for Addition . . . Case of a General . . . – tomorrow’s weather, Case of the Product . . . – distance to a faraway planet, General Asymptotics . . . – amount of oil in an oil well. What If We Only Have . . . • In such situations in which we cannot measure y di- Home Page rectly , we can often measure y indirectly , i.e.: Title Page – measure auxiliary quantities x 1 , . . . , x n related to ◭◭ ◮◮ the desired quantity y by a known relation ◭ ◮ y = f ( x 1 , . . . , x n ); Page 2 of 16 – use the results � x 1 , . . . , � x n of measuring the quanti- Go Back ties x i to compute the estimate � y = f ( � x n ). x 1 , . . . , � Full Screen • The process of computing � y = f ( � x 1 , . . . , � x n ) is known as data processing. Close Quit

  3. Need for Data Processing Need to Estimate . . . 2. Need to Estimate Uncertainty of the Result of Traditional Statistical . . . Data Processing Heavy-Tailed . . . • Measurements are never 100% accurate. Result for Addition . . . Case of a General . . . • In general, the measurement results � x i are somewhat Case of the Product . . . def different from the actual values x i : ∆ x i = � x i − x i � = 0. General Asymptotics . . . • Since � x i � = x i , the estimate � y = f ( � x 1 , . . . , � x n ) is, in gen- What If We Only Have . . . eral, different from the actual value y = f ( x 1 , . . . , x n ). Home Page • Often, there is additional difference since the depen- Title Page dence between y and x i is only approximately known. ◭◭ ◮◮ • It is therefore important to gauge how much the actual ◭ ◮ value y can differ from this estimate. Page 3 of 16 • In other words, we need to gauge the uncertainty of Go Back the result of data processing. Full Screen Close Quit

  4. Need for Data Processing Need to Estimate . . . 3. Traditional Statistical Approach Traditional Statistical . . . • Usually, there are many different (and independent) Heavy-Tailed . . . factors which contribute to the measurement error. Result for Addition . . . Case of a General . . . • Due to Central Limit Theorem, the distr. of the joint Case of the Product . . . effect of numerous independent factors is ≈ normal. General Asymptotics . . . • To describe a normal distribution, it is sufficient to What If We Only Have . . . know the mean µ and the standard deviation σ . Home Page • If µ � = 0, we can compensate for this bias, so each ∆ x i Title Page is normally distributed with mean 0 and st. dev. σ i . ◭◭ ◮◮ • The measurement errors ∆ x i are usually small, so ◭ ◮ n � ∂f ∆ y = f ( � x n ) − f ( x 1 , . . . , x n ) ≈ · ∆ x i , x 1 , . . . , � Page 4 of 16 ∂x i i =1 Go Back • Thus, ∆ y is normally distributed with 0 mean and vari- � ∂f � 2 � Full Screen n ance σ 2 = · σ 2 i . ∂x i Close i =1 Quit

  5. Need for Data Processing Need to Estimate . . . 4. Heavy-Tailed Distributions Traditional Statistical . . . • In practice, the probability distribution of the mea- Heavy-Tailed . . . surement error is often different from normal. Result for Addition . . . Case of a General . . . • In many such situations, the variance is infinite. Case of the Product . . . • Such distributions are called heavy-tailed . General Asymptotics . . . • Mandelbrot (of fractal fame) found that price fluctua- What If We Only Have . . . Home Page tions follows the Pareto power-law Title Page ρ ( x ) = A · x − α , α ≈ 2 . 7 . ◭◭ ◮◮ • For this empirical value α , variance is infinite. ◭ ◮ • We need to estimate ∆ y for the case when distributions Page 5 of 16 for ∆ x i are heavy-tailed. Go Back Full Screen Close Quit

  6. Need for Data Processing Need to Estimate . . . 5. Result for Addition y = f ( x 1 , x 2 ) = x 1 + x 2 Traditional Statistical . . . • For addition, ∆ y = ∆ x 1 + ∆ x 2 . Heavy-Tailed . . . Result for Addition . . . • Let us assume that: Case of a General . . . – the measurement error ∆ x 1 of the first input has a Case of the Product . . . tail with asymptotics ρ 1 (∆ x 1 ) ∼ A 1 · | ∆ x 1 | − α 1 ; General Asymptotics . . . – the measurement error ∆ x 2 of the second input has What If We Only Have . . . a tail with asymptotics ρ 2 (∆ x 2 ) ∼ A 2 · | ∆ x 2 | − α 2 , Home Page • Then the tail for ∆ y has the asymptotics Title Page ◭◭ ◮◮ ρ (∆ y ) ∼ A · | ∆ y | − α , with α = min( α 1 , α 2 ) . ◭ ◮ Page 6 of 16 Go Back Full Screen Close Quit

  7. Need for Data Processing Need to Estimate . . . 6. Case of a General Linear Combination Traditional Statistical . . . • Let us assume that: Heavy-Tailed . . . � m Result for Addition . . . • y = a 0 + a i · x i ; and Case of a General . . . i =1 • the measurement error ∆ x i of the i -th input has a Case of the Product . . . tail with asymptotics General Asymptotics . . . What If We Only Have . . . ρ i (∆ x i ) ∼ A i · | ∆ x i | − α i . Home Page • Then the tail for ∆ y has the asymptotics Title Page ρ (∆ y ) ∼ A · | ∆ y | − α with α = min( α 1 , . . . , α m ) . ◭◭ ◮◮ ◭ ◮ Page 7 of 16 Go Back Full Screen Close Quit

  8. Need for Data Processing Need to Estimate . . . 7. Case of the Product y = f ( x 1 , x 2 ) = x 1 · x 2 : Result Traditional Statistical . . . • Let us assume that: Heavy-Tailed . . . Result for Addition . . . – the measurement error ∆ x 1 of the first input has a Case of a General . . . tail with asymptotics ρ 1 (∆ x 1 ) ∼ A 1 · | ∆ x 1 | − α 1 ; Case of the Product . . . – the measurement error ∆ x 2 of the second input has General Asymptotics . . . a tail with asymptotics ρ 2 (∆ x 2 ) ∼ A 2 · | ∆ x 2 | − α 2 . What If We Only Have . . . • Then, ρ (∆ y ) ∼ A · | ∆ y | − α with α = min( α 1 , α 2 ). Home Page • Similar formulas hold for an arbitrary combination Title Page � m x a i y = a 0 · i : ◭◭ ◮◮ i =1 ◭ ◮ – when the meas. error ∆ x i of the the i -th input has a tail with asymptotics ρ i (∆ x i ) ∼ A i · | ∆ x i | − α i , Page 8 of 16 – then the tail for ∆ y has the asymptotics Go Back ρ (∆ y ) ∼ A · | ∆ y | − α with α = min( α 1 , . . . , α m ). Full Screen Close Quit

  9. Need for Data Processing Need to Estimate . . . 8. Epistemic vs Aleatory Uncertainty Traditional Statistical . . . • The main objective of this paper is to deal with mea- Heavy-Tailed . . . surement (epistemic) uncertainty. Result for Addition . . . Case of a General . . . • However, the same formula can be used if we have Case of the Product . . . aleatory uncertainty. General Asymptotics . . . • For example, we can use these formulas to analyze what What If We Only Have . . . happens if: Home Page – we have a population of two-job individuals; Title Page – we know the distribution ρ 1 ( x 1 ) of first salaries; ◭◭ ◮◮ – we know the distribution ρ 2 ( x 2 ) of second salaries; ◭ ◮ – we know that these distributions are independent, Page 9 of 16 and Go Back – we want to find the distribution of the total salary Full Screen y = x 1 + x 2 . Close Quit

  10. Need for Data Processing Need to Estimate . . . 9. General Asymptotics Remains a Challenge Traditional Statistical . . . • For a normal distribution, Prob( | ∆ x i | > 6 σ i ) ≈ 10 − 6 %. Heavy-Tailed . . . Result for Addition . . . • Such large deviations can be safely ignored, so mea- Case of a General . . . surement errors ∆ x i are small. Case of the Product . . . • So, we can approximate the dependence y = f ( x 1 , . . . , x n ) General Asymptotics . . . by linear terms in its Taylor expansion. What If We Only Have . . . • For ρ (∆ x ) ≈ A · | ∆ x | − α with α = 2, the probability of Home Page ∆ x exceeding 6 σ is ≈ 6 − 2 ≈ 3%: quite probable. Title Page • Even deviations of size 100 σ are possible: they occur ◭◭ ◮◮ once every 10,000 trials. ◭ ◮ • For such large deviations, we can no longer ignore quadratic Page 10 of 16 or higher order terms. Go Back • So, we can no longer reduce any smooth function to its linear approximation. Full Screen • Each smooth function has to be treated separately. Close Quit

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend