Multitask Learning with Low-Level Auxiliary Tasks
for Speech Recognition
Shubham Toshniwal, Hao Tang, Liang Lu, Karen Livescu
Toyota Technological Institute at Chicago
Multitask Learning with Low-Level Auxiliary Tasks 1 Traditional - - PowerPoint PPT Presentation
for Speech Recognition Shubham Toshniwal , Hao Tang, Liang Lu, Karen Livescu Toyota Technological Institute at Chicago Multitask Learning with Low-Level Auxiliary Tasks 1 Traditional automatic speech recognition (ASR) systems are modular.
Toyota Technological Institute at Chicago
. . .
‘‘recognize speech’’
1
3
4
5
y1 y2 x1 x2 x3 x4 x5 x6 x7 x8 xT GO CharDec (Lc)
6
y1 y2 x1 x2 x3 x4 x5 x6 x7 x8 xT GO CharDec (Lc)
6
y1 y2 x1 x2 x3 x4 x5 x6 x7 x8 xT GO CharDec (Lc)
6
$z_1$ $z_1$ $z_2$ PhoneDec $(L_{p}^{Dec})$ PhoneCTC $(L_{p}^{CTC})$ GO
y1 y2 x1 x2 x3 x4 x5 x6 x7 x8 xT GO CharDec (Lc)
p ),
p )
1 2 Lc
7
PhoneCTC $(L_{p}^{CTC})$
GO
PhoneDec (LDec
p
) z1 z2 z1 y1 y2 x1 x2 x3 x4 x5 x6 x7 x8 xT GO CharDec (Lc)
p ),
p )
1 2 Lc
7
$z_1$ $z_1$ $z_2$ PhoneDec $(L_{p}^{Dec})$ GO
PhoneCTC (LCTC
p
) y1 y2 x1 x2 x3 x4 x5 x6 x7 x8 xT GO CharDec (Lc)
p ),
p )
1 2 Lc
7
GO
PhoneDec (LDec
p
) z1 z2 z1 PhoneCTC (LCTC
p
) y1 y2 x1 x2 x3 x4 x5 x6 x7 x8 xT GO CharDec (Lc)
p ),
p )
2(Lc + Lp). 7
z1 z2 z1 PhoneDec (LDec
p
) State (Ls) s2 s1 s3 s4 s5 s6 s7 s8 sT PhoneCTC (LCTC
p
) y1 y2 x1 x2 x3 x4 x5 x6 x7 x8 xT GO CharDec (Lc) GO
3(Lc + Lp + Ls). 8
9
10
10
10
10
10
11
11
11
11
12
12
12
13
13
13
13
13