SLIDE 11 11
HMM/N-gram-based Model
- Expectation-Maximum Training
( )
( )
( ) ( ) ( )
( ) ( ) [ ]
( ) ( )
( ) [ ]
log ˆ ˆ ˆ ˆ log ˆ ˆ ˆ log ˆ ) ˆ , (
∑ ∑ ∑ ∑ ∑ ∑ ∑
∈ ∈ ∈
= = = Φ
Q n q k n k j j n k n Q n q n k n n Q n q n k n
m |k,D q P m D |j, q P m D |k, q P k|D P |k,D q P D | q P D k| P D |k, q P ,k|D q P D , k|q P D D
( ) ( )
( ) [ ]
( ) ( ) ( ) ( )
l m G m G m G m D |j, q P m D |k, q P G l m D |j, q P m D |k, q P m m D D m l m |k,D q P m D |j, q P m D |k, q P D D
k k Q n q j j n k n k Q n q j j n k n k k i i Q n q k k n j j n k n
− = = = = ⇒ = = + = ∂ Φ ′ ∂ − + = Φ ′
∑ ∑ ∑ ∑ ∑ ∑ ∑ ∑
∈ ∈ ∈
... .... ˆ ˆ ˆ ˆ Assume ˆ ˆ ˆ ˆ 1 ) ˆ , ( 1 log ˆ ˆ ˆ ˆ ) ˆ , (
2 2 1 1
( ) ( ) ( ) ( ) ( ) ( ) ˆ
ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ ˆ Q m D |j, q P m D |k, q P m D |j, q P m D |k, q P m D |j, q P m D |k, q P m G l
Q q j j n k n s Q q j j n s n Q q j j n k n k s s
n n n
∑ ∑ ∑ ∑ ∑ ∑ ∑ ∑
∈ ∈ ∈
= = ∴ − = ∴
Q function
normalization constraints using Lagrange multipliers empirical distribution the model
l
k
G