SLIDE 1

Parameter Estimation

Saravanan Vijayakumaran sarva@ee.iitb.ac.in

Department of Electrical Engineering Indian Institute of Technology Bombay

October 21, 2013

SLIDE 2

Motivation

SLIDE 3

System Model used to Derive Optimal Receivers

[Block diagram: s(t) → Channel → y(t)]

y(t) = s(t) + n(t)

where s(t) is the transmitted signal, y(t) the received signal, and n(t) the noise.

This is a simplified system model. It does not account for:

  • Propagation Delay
  • Carrier Frequency Mismatch Between Transmitter and Receiver
  • Clock Frequency Mismatch Between Transmitter and Receiver

SLIDE 4

Why Study the Simplified System Model?

  • Consider the effect of propagation delay

[Block diagram: s(t) → Channel → y(t)]

y(t) = s(t − τ) + n(t)

  • If the receiver can estimate τ, the simplified system model is valid
  • Receivers estimate propagation delay, carrier frequency and clock frequency before demodulation
  • Once these unknown parameters are estimated, the simplified system model is valid
  • Then why not study parameter estimation first?
  • Hypothesis testing is easier to learn than parameter estimation
  • Historical reasons

SLIDE 5

Parameter Estimation

SLIDE 6

Parameter Estimation

  • Hypothesis testing was about making a choice between discrete states of nature
  • Parameter or point estimation is about choosing from a continuum of possible states

Example

  • Consider a manufacturer of clothes for newborn babies
  • She wants her clothes to fit at least 50% of newborn babies. Clothes can be loose but not tight. She also wants to minimize the material used.
  • Since babies are made up of a large number of atoms, their length is a Gaussian random variable (by the Central Limit Theorem): Baby Length ∼ N(µ, σ²)
  • Only knowledge of µ is required to achieve her goal of a 50% fit, since the mean of a Gaussian is also its median
  • But µ is unknown and she is interested in estimating it
  • What is a good estimator of µ? If she wants her clothes to fit at least 75% of the newborn babies, is knowledge of µ enough? (See the sketch below.)
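As a quick sketch of this example (the population values below are assumed for illustration, not taken from the slides), a simulation shows that the sample mean estimates µ, while a 75% fit also requires σ, since the 75th percentile of a Gaussian is µ + z₀.₇₅ σ:

```python
import numpy as np
from scipy.stats import norm

rng = np.random.default_rng(0)

# Hypothetical population parameters (assumed for illustration only)
mu_true, sigma_true = 50.0, 2.5   # newborn length in cm

# Observe the lengths of 1000 babies
lengths = rng.normal(mu_true, sigma_true, size=1000)

# The sample mean estimates mu; a garment of this length fits ~50% of babies
mu_hat = lengths.mean()

# For a 75% fit the 75th percentile mu + z_0.75 * sigma is needed,
# so sigma must be estimated as well; knowledge of mu alone is not enough
sigma_hat = lengths.std()
length_75 = mu_hat + norm.ppf(0.75) * sigma_hat

print(f"mu_hat = {mu_hat:.2f} cm, 75%-fit length = {length_75:.2f} cm")
```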

SLIDE 7

System Model for Parameter Estimation

  • Consider a family of distributions

    Y ∼ P_θ, θ ∈ Λ

    where the observation vector Y ∈ Γ ⊆ ℝⁿ and Λ ⊆ ℝᵐ is the parameter space. θ itself can be a realization of a random variable Θ

Example

Y ∼ N(µ, σ²) where µ and σ are unknown. Here Γ = ℝ, θ = [µ σ]ᵀ, and Λ = ℝ². The parameters µ and σ can themselves be random variables.

  • The goal of parameter estimation is to find θ given Y
  • An estimator is a function from the observation space to the parameter space, θ̂ : Γ → Λ

SLIDE 8

Which is the Optimal Estimator?

  • Assume there is a cost function C : Λ × Λ → ℝ such that C[a, θ] is the cost of estimating the true value of θ as a
  • Examples of cost functions for scalar θ (sketched in code below):

    Squared Error    C[a, θ] = (a − θ)²
    Absolute Error   C[a, θ] = |a − θ|
    Threshold Error  C[a, θ] = 0 if |a − θ| ≤ ∆, 1 if |a − θ| > ∆
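For concreteness, the three cost functions can be written as Python functions (a minimal sketch; the default ∆ value is an arbitrary assumption):

```python
# A minimal sketch of the three cost functions for scalar theta
# (the default tolerance delta below is an arbitrary assumption)

def squared_error(a: float, theta: float) -> float:
    return (a - theta) ** 2

def absolute_error(a: float, theta: float) -> float:
    return abs(a - theta)

def threshold_error(a: float, theta: float, delta: float = 0.1) -> float:
    # Costs 0 inside the tolerance band and 1 outside it
    return 0.0 if abs(a - theta) <= delta else 1.0
```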

SLIDE 9

Which is the Optimal Estimator?

  • Suppose that the parameter θ is the realization of a random variable Θ
  • With an estimator θ̂ we associate a conditional cost or risk conditioned on θ:

    r_θ(θ̂) = E_θ[ C[θ̂(Y), θ] ]

  • The average risk or Bayes risk is given by

    R(θ̂) = E[ r_Θ(θ̂) ]

  • The optimal estimator is the one which minimizes the Bayes risk

SLIDE 10

Which is the Optimal Estimator?

  • Given that

    r_θ(θ̂) = E_θ[ C[θ̂(Y), θ] ] = E[ C[θ̂(Y), Θ] | Θ = θ ]

    the average risk or Bayes risk is given by

    R(θ̂) = E[ C[θ̂(Y), Θ] ]
          = E[ E[ C[θ̂(Y), Θ] | Y ] ]
          = ∫ E[ C[θ̂(Y), Θ] | Y = y ] p_Y(y) dy

  • The optimal estimate for θ can be found by minimizing, for each Y = y, the posterior cost E[ C[θ̂(y), Θ] | Y = y ]

SLIDE 11

Minimum-Mean-Squared-Error (MMSE) Estimation

  • Consider a scalar parameter θ
  • C[a, θ] = (a − θ)²
  • The posterior cost is given by

    E[ (θ̂(y) − Θ)² | Y = y ] = θ̂(y)² − 2 θ̂(y) E[Θ | Y = y] + E[Θ² | Y = y]

  • Differentiating the posterior cost with respect to θ̂(y) and setting the derivative 2θ̂(y) − 2E[Θ | Y = y] to zero, the Bayes estimate is

    θ̂_MMSE(y) = E[Θ | Y = y]

SLIDE 12

Example: MMSE Estimation

  • Suppose X and Y are jointly Gaussian random variables
  • Let the joint pdf be given by

    p_XY(x, y) = (1 / (2π |C|^(1/2))) exp( −(1/2) (s − µ)ᵀ C⁻¹ (s − µ) )

    where s = [x y]ᵀ, µ = [µ_x µ_y]ᵀ, and

    C = [ σ_x²       ρσ_xσ_y
          ρσ_xσ_y    σ_y²    ]

  • Suppose Y is observed and we want to estimate X
  • The MMSE estimate of X is

    X̂_MMSE(y) = E[ X | Y = y ]

  • The conditional density of X given Y = y is

    p(x|y) = p_XY(x, y) / p_Y(y)

SLIDE 13

Example: MMSE Estimation

  • The conditional density of X given Y = y is a Gaussian density with mean

    µ_X|y = µ_x + (σ_x / σ_y) ρ (y − µ_y)

    and variance σ²_X|y = (1 − ρ²) σ_x²

  • Thus the MMSE estimate of X given Y = y is

    X̂_MMSE(y) = µ_x + (σ_x / σ_y) ρ (y − µ_y)
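A small simulation (parameter values assumed for illustration, not from the slides) can sanity-check this formula: the average of X over draws whose Y falls near a fixed y should approach the closed-form estimate.

```python
import numpy as np

rng = np.random.default_rng(1)

# Illustrative parameters (assumed, not from the slides)
mu_x, mu_y = 1.0, -2.0
sigma_x, sigma_y, rho = 2.0, 3.0, 0.8

# Draw jointly Gaussian (X, Y) pairs
cov = [[sigma_x**2, rho * sigma_x * sigma_y],
       [rho * sigma_x * sigma_y, sigma_y**2]]
xs, ys = rng.multivariate_normal([mu_x, mu_y], cov, size=200_000).T

# Compare the empirical conditional mean E[X | Y ≈ y0] with the formula
y0 = 0.5
near = np.abs(ys - y0) < 0.05
empirical = xs[near].mean()
closed_form = mu_x + (sigma_x / sigma_y) * rho * (y0 - mu_y)

print(f"empirical = {empirical:.3f}, formula = {closed_form:.3f}")
```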

SLIDE 14

Maximum A Posteriori (MAP) Estimation

  • In some situations, the conditional mean may be difficult to compute
  • An alternative is to use MAP estimation
  • The MAP estimator is given by

θ̂_MAP(y) = argmax_θ p(θ|y)

where p is the conditional density of Θ given Y.

  • It can be obtained as the optimal estimator for the threshold cost function

    C[a, θ] = 0 if |a − θ| ≤ ∆, 1 if |a − θ| > ∆

    for small ∆ > 0 (a grid-based sketch follows below)
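As a minimal sketch of MAP estimation (assuming a toy Gaussian prior and Gaussian likelihood that are not given in the slides), the posterior can be evaluated on a grid and its argmax taken:

```python
import numpy as np

rng = np.random.default_rng(2)

# Assumed toy model (illustrative): Theta ~ N(0, 1), Y | Theta = t ~ N(t, sigma^2)
sigma = 2.0
theta_true = rng.normal(0.0, 1.0)
y = rng.normal(theta_true, sigma)

# Evaluate the log posterior log p(theta|y) up to a constant on a grid:
# log p(y|theta) + log p(theta)
grid = np.linspace(-5.0, 5.0, 10_001)
log_post = -0.5 * (y - grid) ** 2 / sigma**2 - 0.5 * grid**2

theta_map = grid[np.argmax(log_post)]

# For this Gaussian-Gaussian model the posterior is Gaussian, so the MAP
# estimate should match the closed-form posterior mean y / (1 + sigma^2)
print(f"grid MAP = {theta_map:.3f}, closed form = {y / (1 + sigma**2):.3f}")
```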

SLIDE 15

Maximum A Posteriori (MAP) Estimation

  • For the threshold cost function, we have¹

    E[ C[θ̂(y), Θ] | Y = y ] = ∫_{−∞}^{∞} C[θ̂(y), θ] p(θ|y) dθ
                            = ∫_{−∞}^{θ̂(y)−∆} p(θ|y) dθ + ∫_{θ̂(y)+∆}^{∞} p(θ|y) dθ
                            = ∫_{−∞}^{∞} p(θ|y) dθ − ∫_{θ̂(y)−∆}^{θ̂(y)+∆} p(θ|y) dθ
                            = 1 − ∫_{θ̂(y)−∆}^{θ̂(y)+∆} p(θ|y) dθ

  • The Bayes estimate is obtained by maximizing the integral in the last equality

¹Assume a scalar parameter θ for illustration

SLIDE 16

Maximum A Posteriori (MAP) Estimation

[Figure: the posterior density p(θ|y) with the interval from θ̂(y) − ∆ to θ̂(y) + ∆ shaded]

  • The shaded area is the integral ∫_{θ̂(y)−∆}^{θ̂(y)+∆} p(θ|y) dθ
  • To maximize this integral, the location of θ̂(y) should be chosen to be the value of θ which maximizes p(θ|y)

SLIDE 17

Maximum A Posteriori (MAP) Estimation

[Figure: the posterior density p(θ|y) with the shaded interval centered at its mode, θ̂_MAP(y)]

  • This argument is not airtight, as p(θ|y) may not be symmetric at the maximum
  • But the MAP estimator is widely used as it is easier to compute than the MMSE estimator

SLIDE 18

Maximum Likelihood (ML) Estimation

  • The ML estimator is given by

    θ̂_ML(y) = argmax_θ p(y|θ)

    where p is the conditional density of Y given Θ.

  • It is the same as the MAP estimator when the prior probability distribution of Θ is uniform:

    θ̂_MAP(y) = argmax_θ p(θ|y) = argmax_θ p(θ, y) / p(y) = argmax_θ p(y|θ) p(θ) / p(y)

    With p(θ) constant, the maximizer is that of p(y|θ)

  • It is also used when the prior distribution is not known

SLIDE 19

Example 1: ML Estimation

  • Suppose we observe Y_i, i = 1, 2, . . . , M such that

    Y_i ∼ N(µ, σ²)

    where the Y_i's are independent, µ is unknown and σ² is known

  • The ML estimate is given by

    µ̂_ML(y) = (1/M) Σ_{i=1}^{M} y_i
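A quick numerical check (illustrative values assumed): minimizing the negative Gaussian log-likelihood over µ should reproduce the sample mean.

```python
import numpy as np
from scipy.optimize import minimize_scalar

rng = np.random.default_rng(3)

# Illustrative values (assumed): mu is unknown, sigma is known
mu_true, sigma, M = 4.0, 1.5, 500
y = rng.normal(mu_true, sigma, size=M)

# Negative log-likelihood in mu (constants dropped); minimizing it is
# equivalent to maximizing the likelihood
def nll(mu):
    return np.sum((y - mu) ** 2) / (2 * sigma**2)

mu_numeric = minimize_scalar(nll).x
print(f"numerical ML = {mu_numeric:.4f}, sample mean = {y.mean():.4f}")
```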

SLIDE 20

Example 2: ML Estimation

  • Suppose we observe Y_i, i = 1, 2, . . . , M such that

    Y_i ∼ N(µ, σ²)

    where the Y_i's are independent and both µ and σ² are unknown

  • The ML estimates are given by

    µ̂_ML(y) = (1/M) Σ_{i=1}^{M} y_i

    σ̂²_ML(y) = (1/M) Σ_{i=1}^{M} (y_i − µ̂_ML(y))²
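A minimal sketch with assumed values, computing both closed-form estimates:

```python
import numpy as np

rng = np.random.default_rng(4)

# Illustrative values (assumed): both mu and sigma^2 are unknown
mu_true, sigma_true, M = 4.0, 1.5, 500
y = rng.normal(mu_true, sigma_true, size=M)

# Closed-form ML estimates from the slide
mu_ml = y.mean()
var_ml = np.mean((y - mu_ml) ** 2)   # divides by M, not M - 1

print(f"mu_ml = {mu_ml:.4f}, var_ml = {var_ml:.4f}")
```

Note that σ̂²_ML divides by M rather than M − 1, so it is a biased (though consistent) estimate of σ².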

SLIDE 21

Example 3: ML Estimation

  • Suppose we observe Y_i, i = 1, 2, . . . , M such that

    Y_i ∼ Bernoulli(p)

    where the Y_i's are independent and p is unknown

  • The ML estimate of p is given by

    p̂_ML(y) = (1/M) Σ_{i=1}^{M} y_i
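A short check with an assumed value of p; the comment sketches why the sample mean maximizes the Bernoulli likelihood:

```python
import numpy as np

rng = np.random.default_rng(5)

# Illustrative value (assumed): p is unknown
p_true, M = 0.3, 1000
y = rng.binomial(1, p_true, size=M)

# The log-likelihood is sum(y) log p + (M - sum(y)) log(1 - p);
# setting its derivative to zero gives p = sample mean
p_ml = y.mean()
print(f"p_ml = {p_ml:.4f}")
```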

SLIDE 22

Example 4: ML Estimation

  • Suppose we observe Y_i, i = 1, 2, . . . , M such that

    Y_i ∼ Uniform[0, θ]

    where the Y_i's are independent and θ is unknown

  • The ML estimate of θ is given by

    θ̂_ML(y) = max(y_1, y_2, . . . , y_{M−1}, y_M)
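A short sketch with an assumed θ; the comment notes why the maximum observation maximizes the likelihood:

```python
import numpy as np

rng = np.random.default_rng(6)

# Illustrative value (assumed): theta is unknown
theta_true, M = 7.0, 1000
y = rng.uniform(0.0, theta_true, size=M)

# The likelihood is theta^(-M) for theta >= max(y) and 0 otherwise,
# and it is decreasing in theta, so the maximizer is max(y)
theta_ml = y.max()
print(f"theta_ml = {theta_ml:.4f}  (never exceeds the true theta)")
```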

SLIDE 23

Reference

  • Chapter 4, An Introduction to Signal Detection and Estimation, H. V. Poor, Second Edition, Springer-Verlag, 1994

SLIDE 24

Thanks for your attention
