Introduction to HTK Toolkit
Berlin Chen 2004
Reference:
- Steve Young et al. The HTK Book. Version 3.2, 2002.
Outline
- An Overview of HTK
- HTK Processing Stages
- Data Preparation Tools
- Training Tools
- Testing Tools
2004 SP - Berlin Chen
recognition output
HFoo -T 1 -C Config1 -f 34.3 -a -s myfile file1 file2
LPC        linear prediction filter coefficients
LPREFC     linear prediction reflection coefficients
LPCEPSTRA  LPC cepstral coefficients
LPDELCEP   LPC cepstra plus delta coefficients
MFCC       mel-frequency cepstral coefficients
MELSPEC    linear mel-filter bank channel outputs
DISCRETE   vector quantized data
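The parameter kind listed above is stored, together with the frame period, in the 12-byte header of every HTK parameter file. The sketch below decodes such a header in Python. The numeric codes for the base kinds and the qualifier bits are written down from memory of the HTK Book and should be treated as assumptions to check against your HTK version; the function name `parse_htk_header` is hypothetical.

```python
import struct

# Base kind codes and qualifier bits (octal), as recalled from the HTK Book.
# Treat both tables as assumptions and verify them against your HTK version.
BASE_KINDS = ["WAVEFORM", "LPC", "LPREFC", "LPCEPSTRA", "LPDELCEP", "IREFC",
              "MFCC", "FBANK", "MELSPEC", "USER", "DISCRETE", "PLP"]
QUALIFIERS = [("_E", 0o100), ("_N", 0o200), ("_D", 0o400), ("_A", 0o1000),
              ("_C", 0o2000), ("_Z", 0o4000), ("_K", 0o10000), ("_0", 0o20000)]


def parse_htk_header(header_bytes, byte_order=">"):
    """Decode the 12-byte header: nSamples, sampPeriod, sampSize, parmKind.

    byte_order is ">" for big-endian files and "<" for little-endian ones.
    """
    n_samples, samp_period, samp_size, parm_kind = struct.unpack(
        byte_order + "iihh", header_bytes[:12])
    base = parm_kind & 0o77  # the low 6 bits hold the base kind
    name = BASE_KINDS[base] if base < len(BASE_KINDS) else "UNKNOWN(%d)" % base
    for suffix, bit in QUALIFIERS:
        if parm_kind & bit:
            name += suffix
    return {
        "n_frames": n_samples,
        "frame_period_s": samp_period / 1e7,  # stored in units of 100 ns
        "bytes_per_frame": samp_size,
        "kind": name,
    }
```

For example, a header whose parameter-kind field has the MFCC code plus the E, D, A and Z bits set decodes to `MFCC_E_D_A_Z`; the order of the qualifiers in the printed name is fixed by the table, not by the file.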
[Figure: observations O1, O2, ..., ON aligned to states s1, s2, s3 (state indices 1, 2, ..., N); labels: global mean, cluster 1 mean, cluster 2 mean]
Right-context-dependent modeling: ㄓ (j), ㄜ (e), ㄠ (au); (j_a), (j_e)
[Figure labels: lexicon/dictionary, word network, HMMs, feature file, label file]
in units of 100 nanoseconds
HCOPY -T 00001 -C ..\config\HCOPY.fig
- user-defined wave format
- specify the pcm and coefficient files and their respective directories
- specify the detailed configuration for feature extraction
- file header (set to 0 here)
- 2 bytes per sample
- in accordance with the sampling rate: 1e7/16000
- Z (zero mean), E (energy), D (delta), A (delta-delta)
- 10e-3 * 1e7
- Hamming window
- pre-emphasis
- number of filter bank channels
- liftering setting
- number of cepstral coefficients
- 32e-3 * 1e7
- Intel PC byte order
- level of trace information
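The three arithmetic expressions in the annotations above (1e7/16000, 10e-3 * 1e7 and 32e-3 * 1e7) all convert a duration into HTK's time unit of 100 ns. The short Python sketch below works them out. It assumes that they correspond to the HTK configuration parameters SOURCERATE, TARGETRATE and WINDOWSIZE, which is the usual reading but is not stated on the slide; the helper name `to_htk_units` is hypothetical.

```python
HTK_UNITS_PER_SECOND = 10_000_000  # HTK expresses times in units of 100 ns


def to_htk_units(seconds):
    """Convert a duration in seconds to HTK's 100 ns time units."""
    return round(seconds * HTK_UNITS_PER_SECOND)


sampling_rate_hz = 16000

# Assumed mapping to configuration parameters (not stated on the slide).
source_rate = HTK_UNITS_PER_SECOND / sampling_rate_hz  # sample period (SOURCERATE)
target_rate = to_htk_units(10e-3)                      # 10 ms frame shift (TARGETRATE)
window_size = to_htk_units(32e-3)                      # 32 ms analysis window (WINDOWSIZE)

# The same quantities expressed in samples at 16 kHz.
samples_per_shift = round(target_rate / source_rate)
samples_per_window = round(window_size / source_rate)

print(source_rate, target_rate, window_size, samples_per_shift, samples_per_window)
```

At 16 kHz a 32 ms window therefore covers 512 samples and successive frames start 160 samples apart.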
HCompV -C ..\Config\Config.fig -m -S ..\script\HCompV.scp -M ..\Global_pro_hmm_def39 ..\HTK_pro_hmm_def39\pro_39_m1_s1
- The prototype 1-state HMM with zero mean and variance of value 1 (last argument)
- The resultant prototype HMM, with the global mean and variance setting (-M)
- The mean will be updated (-m)
- A list of coefficient files (-S)
Generate prototype HMMs with different state numbers
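To make "the global mean and variance setting" concrete, here is a minimal Python sketch of the statistic involved: one mean and one variance per feature dimension, pooled over every frame of every training utterance. It is an illustration only, not HCompV's code. The function name and the input layout (a list of utterances, each a list of feature vectors) are hypothetical, and the divide-by-N variance estimate is an assumption.

```python
def global_mean_and_variance(utterances):
    """Pool all frames of all utterances and return (mean, variance) per dimension.

    utterances: iterable of utterances, each a list of equal-length feature vectors.
    """
    count = 0
    total = None     # running sum of x per dimension
    total_sq = None  # running sum of x*x per dimension
    for frames in utterances:
        for frame in frames:
            if total is None:
                total = [0.0] * len(frame)
                total_sq = [0.0] * len(frame)
            for d, x in enumerate(frame):
                total[d] += x
                total_sq[d] += x * x
            count += 1
    if count == 0:
        raise ValueError("no frames given")
    mean = [t / count for t in total]
    # Assumption: maximum-likelihood (divide by N) variance estimate.
    variance = [sq / count - m * m for sq, m in zip(total_sq, mean)]
    return mean, variance
```

Every state of every prototype HMM would then start from the same mean and variance, which is why this kind of initialisation needs no label boundaries.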
remove this row for all proto HMMs
HHEd -C ..\Config\ConfigHHEd.fig -d ..\Init_pro_hmm -M ..\Init_pro_hmm_mixture ..\Script\HEdCmd.scp ..\Script\rcdmodel_sil
- HHEd configuration (-C)
- Dir of the proto HMMs (-d)
- Dir of the resultant HMMs (-M)
- HMM model list (last argument)
Annotations on the edit command file:
- the mixture splitting command
- the resultant mixture number
- the states of a specific model to be processed
Annotation on the model list file:
- list of the models to be trained
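The idea behind the mixture splitting command can be sketched in a few lines of Python. As far as I recall from the HTK Book, HHEd repeatedly picks the component with the largest weight, halves its weight, and perturbs the two copies' means by plus and minus 0.2 standard deviations. The sketch below follows that description for diagonal-covariance Gaussians; it is not HHEd's implementation, and the dictionary layout and function names are hypothetical.

```python
def split_heaviest_mixture(mixtures, perturb=0.2):
    """Split the heaviest component into two copies with perturbed means.

    mixtures: list of dicts with keys 'weight', 'mean' and 'var' (diagonal).
    perturb: mean offset in standard deviations (0.2 is assumed from the HTK Book).
    """
    idx = max(range(len(mixtures)), key=lambda i: mixtures[i]["weight"])
    heavy = mixtures[idx]
    half = heavy["weight"] / 2.0
    offsets = [perturb * v ** 0.5 for v in heavy["var"]]
    plus = {"weight": half,
            "mean": [m + o for m, o in zip(heavy["mean"], offsets)],
            "var": list(heavy["var"])}
    minus = {"weight": half,
             "mean": [m - o for m, o in zip(heavy["mean"], offsets)],
             "var": list(heavy["var"])}
    return mixtures[:idx] + [plus, minus] + mixtures[idx + 1:]


def mix_up(mixtures, target):
    """Keep splitting until the state has `target` components."""
    mixtures = list(mixtures)
    while len(mixtures) < target:
        mixtures = split_heaviest_mixture(mixtures)
    return mixtures
```

Because each split only redistributes one component's weight, the weights still sum to one afterwards; the new components become distinct only after further re-estimation with HERest.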
HERest -T 00001 -t 100 -v 0.000000001 -C ..\Config\Config.fig -L ..\label -X rec -d ..\Init_pro_hmm_mixture
HERest -T 00001 -t 100 -v 0.000000001 -C ..\Config\Config.fig -L ..\label -X rec -d ..\Rest_E
……
- Dir of the initial models (-d)
- Dir in which to look up the corresponding label files (-L)
- Cut-off value of the variance (-v)
- Pruning threshold (-t)
- List of the coefficient files of the training data
- List of the models to be trained
Boundary information of the segments of HMM models (will not be used for HERest)
A label file of a training utterance
List of the models to be trained
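A label file in HTK's native format holds one segment per line: an optional start time, an optional end time, and the label name, with times given in units of 100 ns. The Python sketch below reads one such line and converts the boundaries to seconds. It is a simplified reader under stated assumptions: it ignores any score or auxiliary fields after the name, it cannot handle labels that are themselves purely numeric, and the function name is hypothetical.

```python
def parse_label_line(line):
    """Parse '[start [end]] name' and return (start_s, end_s, name).

    Missing times are returned as None. Times are converted from
    100 ns units to seconds. Limitation: a purely numeric label name
    would be mistaken for a time stamp.
    """
    fields = line.split()
    times = []
    while fields and len(times) < 2 and fields[0].lstrip("-").isdigit():
        times.append(int(fields.pop(0)))
    if not fields:
        raise ValueError("no label name found in %r" % line)
    start = times[0] / 1e7 if len(times) >= 1 else None
    end = times[1] / 1e7 if len(times) == 2 else None
    return start, end, fields[0]


print(parse_label_line("0 2500000 sil"))
```

A line without time stamps parses to `(None, None, name)`, which matches the case where boundary information is absent or simply not needed for embedded re-estimation.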
HCOPY -T 00001 -C ..\Config\Config.fig -S ..\script\HCopy_Test.scp
HVite -C ..\Config\Config.fig
- The file name extension for the search/recognition network
- Set the output label file format: no score information and no word information
- The search/recognition network generated by the HParse command
- A list of the testing utterances
- A list to look up the constituent INITIAL/FINAL models for the composite syllable models
- Dir from which to load the HMM models
- Dir in which to save the output label files
- A list to look up the constituent INITIAL/FINAL models for the composite syllable models
- The search/recognition network before performing the HParse command
- Network annotations: loop; a composite syllable model; regular expression
HParse SYL_WORD_NET SYL_WORD_NET.netparsed
The search/recognition network generated by HParse command
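The network given to HParse is written as a regular-expression-like grammar in which `|` separates alternatives and `< >` encloses a part that may repeat one or more times. As a hedged illustration of the "loop" over composite syllable models, the Python sketch below generates such a grammar from a list of syllable names. The syllable names, the surrounding `sil` models and the function name are all hypothetical; the actual SYL_WORD_NET file from the slides is not reproduced here.

```python
def build_syllable_loop_grammar(syllables, silence="sil"):
    """Return HParse grammar text: silence, a loop over syllables, silence.

    '$syllable = a | b ;' defines a variable of alternatives and
    '< ... >' marks one or more repetitions in HParse's notation.
    """
    alternatives = " | ".join(syllables)
    return "$syllable = %s;\n( %s < $syllable > %s )\n" % (
        alternatives, silence, silence)


# Hypothetical syllable names, for illustration only.
print(build_syllable_loop_grammar(["ba", "pa", "ma"]))
```

Writing the returned text to a file and running HParse on it, as in the command above, would yield the expanded lattice that the recogniser loads.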
HResults -C ..\Config\Config.fig -T 00020 -X rec -e ??? sil -L ..\Syllable
- Ignore the silence label "sil" (-e ??? sil)
- The file name extension of the label files (-X rec)
- Dir in which to look up the reference label files (-L)
- A list of the label files generated by the recognition process
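HResults aligns each recognised label sequence with its reference and reports percent correct, (N - D - S) / N, and accuracy, (N - D - S - I) / N, where N is the number of reference labels and D, S and I are the deletions, substitutions and insertions. The Python sketch below reproduces that scoring with a dynamic-programming alignment. The costs of 10 for a substitution and 7 for an insertion or deletion are the values I recall from the HTK Book and should be read as assumptions; the function name is hypothetical, and ties between equally cheap alignments may be broken differently from HResults.

```python
def score_labels(ref, rec, ignore=("sil",), sub_cost=10, del_cost=7, ins_cost=7):
    """Align rec against ref and return hit/error counts plus %Correct and Accuracy."""
    ref = [x for x in ref if x not in ignore]  # mimic ignoring "sil"
    rec = [x for x in rec if x not in ignore]
    n, m = len(ref), len(rec)
    if n == 0:
        raise ValueError("reference has no labels to score")
    # best[i][j] = (cost, hits, subs, dels, ins) for ref[:i] versus rec[:j]
    best = [[None] * (m + 1) for _ in range(n + 1)]
    best[0][0] = (0, 0, 0, 0, 0)
    for i in range(n + 1):
        for j in range(m + 1):
            if i == 0 and j == 0:
                continue
            candidates = []
            if i > 0:  # reference label left unmatched: deletion
                c, h, s, d, k = best[i - 1][j]
                candidates.append((c + del_cost, h, s, d + 1, k))
            if j > 0:  # recognised label left unmatched: insertion
                c, h, s, d, k = best[i][j - 1]
                candidates.append((c + ins_cost, h, s, d, k + 1))
            if i > 0 and j > 0:  # match or substitution
                c, h, s, d, k = best[i - 1][j - 1]
                if ref[i - 1] == rec[j - 1]:
                    candidates.append((c, h + 1, s, d, k))
                else:
                    candidates.append((c + sub_cost, h, s + 1, d, k))
            best[i][j] = min(candidates, key=lambda t: t[0])
    _, hits, subs, dels, ins = best[n][m]
    return {"N": n, "H": hits, "S": subs, "D": dels, "I": ins,
            "correct": 100.0 * hits / n,
            "accuracy": 100.0 * (hits - ins) / n}
```

Accuracy can be lower than percent correct, and even negative, because insertions are penalised only in the accuracy figure.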