t Prr r t r - - PowerPoint PPT Presentation
t Prr r t r - - PowerPoint PPT Presentation
t Prr r t r t t ss
❙tr✉❝t✉r❡❞ Pr❡❞✐❝t✐♦♥
❂ ❥♦✐♥t ♣r❡❞✐❝t✐♦♥ ✇✐t❤ ❛ ❥♦✐♥t ❧♦ss ❊①❛♠♣❧❡✿ P❛rt ♦❢ ❙♣❡❡❝❤ ❚❛❣❣✐♥❣ P✐❡rr❡ ❱✐♥❦❡♥ ✱ ✻✶ ②❡❛rs ♦❧❞ Pr♦♣❡r ◆✳ Pr♦♣❡r ◆✳ ❈♦♠♠❛ ◆✉♠❜❡r ◆♦✉♥ ❆❞❥✳
❙tr✉❝t✉r❡❞ Pr❡❞✐❝t✐♦♥
❂ ❥♦✐♥t ♣r❡❞✐❝t✐♦♥ ✇✐t❤ ❛ ❥♦✐♥t ❧♦ss ❊①❛♠♣❧❡✿ P❛rt ♦❢ ❙♣❡❡❝❤ ❚❛❣❣✐♥❣ P✐❡rr❡ ❱✐♥❦❡♥ ✱ ✻✶ ②❡❛rs ♦❧❞ Pr♦♣❡r ◆✳ Pr♦♣❡r ◆✳ ❈♦♠♠❛ ◆✉♠❜❡r ◆♦✉♥ ❆❞❥✳
❍♦✇ ❝❛♥ ②♦✉ ❜❡st ❞♦ str✉❝t✉r❡❞ ♣r❡❞✐❝t✐♦♥❄
❲❡ ❝❛r❡ ❛❜♦✉t✿ ✶✳ Pr♦❣r❛♠♠✐♥❣ ❝♦♠♣❧❡①✐t②✳ ▼♦st str✉❝t✉r❡❞ ♣r❡❞✐❝t✐♦♥s ❛r❡ ♥♦t ❛❞❞r❡ss❡❞ ✇✐t❤ str✉❝t✉r❡❞ ❧❡❛r♥✐❣♥ ❛❧❣♦r✐t❤♠s✱ ❜❡❝❛✉s❡ ✐t ✐t t♦♦ ❝♦♠♣❧❡① t♦ ❞♦ s♦✳ ✷✳ Pr❡❞✐❝t✐♦♥ ❛❝❝✉r❛❝②✳ ■t ❤❛❞ ❜❡tt❡r ✇♦r❦ ✇❡❧❧✳ ✸✳ ❚r❛✐♥ s♣❡❡❞✳ ❉❡❜✉❣✴❞❡✈❡❧♦♣♠❡♥t ♣r♦❞✉❝t✐✈✐t② ✰ ♠❛①✐♠✉♠ ❞❛t❛ ✐♥♣✉t✳ ✹✳ ❚❡st s♣❡❡❞✳ ❆♣♣❧✐❝❛t✐♦♥ ❡✣❝✐❡♥❝②
❍♦✇ ❝❛♥ ②♦✉ ❜❡st ❞♦ str✉❝t✉r❡❞ ♣r❡❞✐❝t✐♦♥❄
❲❡ ❝❛r❡ ❛❜♦✉t✿ ✶✳ Pr♦❣r❛♠♠✐♥❣ ❝♦♠♣❧❡①✐t②✳ ▼♦st str✉❝t✉r❡❞ ♣r❡❞✐❝t✐♦♥s ❛r❡ ♥♦t ❛❞❞r❡ss❡❞ ✇✐t❤ str✉❝t✉r❡❞ ❧❡❛r♥✐❣♥ ❛❧❣♦r✐t❤♠s✱ ❜❡❝❛✉s❡ ✐t ✐t t♦♦ ❝♦♠♣❧❡① t♦ ❞♦ s♦✳ ✷✳ Pr❡❞✐❝t✐♦♥ ❛❝❝✉r❛❝②✳ ■t ❤❛❞ ❜❡tt❡r ✇♦r❦ ✇❡❧❧✳ ✸✳ ❚r❛✐♥ s♣❡❡❞✳ ❉❡❜✉❣✴❞❡✈❡❧♦♣♠❡♥t ♣r♦❞✉❝t✐✈✐t② ✰ ♠❛①✐♠✉♠ ❞❛t❛ ✐♥♣✉t✳ ✹✳ ❚❡st s♣❡❡❞✳ ❆♣♣❧✐❝❛t✐♦♥ ❡✣❝✐❡♥❝②
❍♦✇ ❝❛♥ ②♦✉ ❜❡st ❞♦ str✉❝t✉r❡❞ ♣r❡❞✐❝t✐♦♥❄
❲❡ ❝❛r❡ ❛❜♦✉t✿ ✶✳ Pr♦❣r❛♠♠✐♥❣ ❝♦♠♣❧❡①✐t②✳ ▼♦st str✉❝t✉r❡❞ ♣r❡❞✐❝t✐♦♥s ❛r❡ ♥♦t ❛❞❞r❡ss❡❞ ✇✐t❤ str✉❝t✉r❡❞ ❧❡❛r♥✐❣♥ ❛❧❣♦r✐t❤♠s✱ ❜❡❝❛✉s❡ ✐t ✐t t♦♦ ❝♦♠♣❧❡① t♦ ❞♦ s♦✳ ✷✳ Pr❡❞✐❝t✐♦♥ ❛❝❝✉r❛❝②✳ ■t ❤❛❞ ❜❡tt❡r ✇♦r❦ ✇❡❧❧✳ ✸✳ ❚r❛✐♥ s♣❡❡❞✳ ❉❡❜✉❣✴❞❡✈❡❧♦♣♠❡♥t ♣r♦❞✉❝t✐✈✐t② ✰ ♠❛①✐♠✉♠ ❞❛t❛ ✐♥♣✉t✳ ✹✳ ❚❡st s♣❡❡❞✳ ❆♣♣❧✐❝❛t✐♦♥ ❡✣❝✐❡♥❝②
❍♦✇ ❝❛♥ ②♦✉ ❜❡st ❞♦ str✉❝t✉r❡❞ ♣r❡❞✐❝t✐♦♥❄
❲❡ ❝❛r❡ ❛❜♦✉t✿ ✶✳ Pr♦❣r❛♠♠✐♥❣ ❝♦♠♣❧❡①✐t②✳ ▼♦st str✉❝t✉r❡❞ ♣r❡❞✐❝t✐♦♥s ❛r❡ ♥♦t ❛❞❞r❡ss❡❞ ✇✐t❤ str✉❝t✉r❡❞ ❧❡❛r♥✐❣♥ ❛❧❣♦r✐t❤♠s✱ ❜❡❝❛✉s❡ ✐t ✐t t♦♦ ❝♦♠♣❧❡① t♦ ❞♦ s♦✳ ✷✳ Pr❡❞✐❝t✐♦♥ ❛❝❝✉r❛❝②✳ ■t ❤❛❞ ❜❡tt❡r ✇♦r❦ ✇❡❧❧✳ ✸✳ ❚r❛✐♥ s♣❡❡❞✳ ❉❡❜✉❣✴❞❡✈❡❧♦♣♠❡♥t ♣r♦❞✉❝t✐✈✐t② ✰ ♠❛①✐♠✉♠ ❞❛t❛ ✐♥♣✉t✳ ✹✳ ❚❡st s♣❡❡❞✳ ❆♣♣❧✐❝❛t✐♦♥ ❡✣❝✐❡♥❝②
❍♦✇ ❝❛♥ ②♦✉ ❜❡st ❞♦ str✉❝t✉r❡❞ ♣r❡❞✐❝t✐♦♥❄
❲❡ ❝❛r❡ ❛❜♦✉t✿ ✶✳ Pr♦❣r❛♠♠✐♥❣ ❝♦♠♣❧❡①✐t②✳ ▼♦st str✉❝t✉r❡❞ ♣r❡❞✐❝t✐♦♥s ❛r❡ ♥♦t ❛❞❞r❡ss❡❞ ✇✐t❤ str✉❝t✉r❡❞ ❧❡❛r♥✐❣♥ ❛❧❣♦r✐t❤♠s✱ ❜❡❝❛✉s❡ ✐t ✐t t♦♦ ❝♦♠♣❧❡① t♦ ❞♦ s♦✳ ✷✳ Pr❡❞✐❝t✐♦♥ ❛❝❝✉r❛❝②✳ ■t ❤❛❞ ❜❡tt❡r ✇♦r❦ ✇❡❧❧✳ ✸✳ ❚r❛✐♥ s♣❡❡❞✳ ❉❡❜✉❣✴❞❡✈❡❧♦♣♠❡♥t ♣r♦❞✉❝t✐✈✐t② ✰ ♠❛①✐♠✉♠ ❞❛t❛ ✐♥♣✉t✳ ✹✳ ❚❡st s♣❡❡❞✳ ❆♣♣❧✐❝❛t✐♦♥ ❡✣❝✐❡♥❝②
❆ ♣r♦❣r❛♠ ❝♦♠♣❧❡①✐t② ❝♦♠♣❛r✐s♦♥
1 10 100 1000 CRFSGD CRF++ S-SVM Search lines of code for POS
100 101 102 103 Training Time (minutes) 0.88 0.90 0.92 0.94 0.96 0.98 Accuracy (per tag)
95.7 96.6 95.0 95.8 95.7 96.1 90.7 96.1 1m 10m 30m 1h
Part of speech tagging (tuned hps)
VW Search VW Search (own fts) VW Classification CRF SGD CRF++
- Str. Perceptron
Structured SVM Str.SVM (DEMI-DCD)
NER POS
50 100 150 200 250 300
285 133 218 129 24 5.7 98 13 5.6 14 5.3
Prediction (test-time) Speed
VW Search VW Search (own fts) CRF SGD CRF++
- Str. Perceptron
Structured SVM
- Str. SVM (DEMI-DCD)
Thousands of Tokens per Second
❆♥ ♦✉t❧✐♥❡
✶✳ ❍♦✇❄
✶✳✶ Pr♦❣r❛♠♠✐♥❣ ✶✳✷ ▲❡❛r♥✐♥❣ t♦ ❙❡❛r❝❤ ✶✳✸ ❊q✉✐✈❛❧❡♥❝❡ ✶✳✹ ❖♣t✐♠✐③❛t✐♦♥s
✷✳ ❖t❤❡r ❘❡s✉❧ts
❍♦✇ ❞♦ ②♦✉ ♣r♦❣r❛♠❄
❙❡q✉❡♥t✐❛❧❴❘❯◆✭❡①❛♠♣❧❡s✮
✶✿ ❢♦r ✐ = ✶ t♦ ❧❡♥✭❡①❛♠♣❧❡s✮ ❞♦ ✷✿
♣r❡❞✐❝t✐♦♥ ← ♣r❡❞✐❝t✭❡①❛♠♣❧❡s❬✐❪✱ ❡①❛♠♣❧❡s❬✐❪✳❧❛❜❡❧✮
✸✿
✐❢ ♦✉t♣✉t✳❣♦♦❞ t❤❡♥
✹✿
♦✉t♣✉t ✓ ✬ ✬ ✓ ♣r❡❞✐❝t✐♦♥
✺✿
❡♥❞ ✐❢
✻✿ ❡♥❞ ❢♦r
■♥ ❡ss❡♥❝❡✱ ✇r✐t❡ t❤❡ ❞❡❝♦❞❡r✱ ♣r♦✈✐❞✐♥❣ ❛ ❧✐tt❧❡ ❜✐t ♦❢ s✐❞❡ ✐♥❢♦r♠❛t✐♦♥ ❢♦r tr❛✐♥✐♥❣✳
❍♦✇ ❞♦ ②♦✉ ♣r♦❣r❛♠❄
❙❡q✉❡♥t✐❛❧❴❘❯◆✭❡①❛♠♣❧❡s✮
✶✿ ❢♦r ✐ = ✶ t♦ ❧❡♥✭❡①❛♠♣❧❡s✮ ❞♦ ✷✿
♣r❡❞✐❝t✐♦♥ ← ♣r❡❞✐❝t✭❡①❛♠♣❧❡s❬✐❪✱ ❡①❛♠♣❧❡s❬✐❪✳❧❛❜❡❧✮
✸✿
✐❢ ♦✉t♣✉t✳❣♦♦❞ t❤❡♥
✹✿
♦✉t♣✉t ✓ ✬ ✬ ✓ ♣r❡❞✐❝t✐♦♥
✺✿
❡♥❞ ✐❢
✻✿ ❡♥❞ ❢♦r
■♥ ❡ss❡♥❝❡✱ ✇r✐t❡ t❤❡ ❞❡❝♦❞❡r✱ ♣r♦✈✐❞✐♥❣ ❛ ❧✐tt❧❡ ❜✐t ♦❢ s✐❞❡ ✐♥❢♦r♠❛t✐♦♥ ❢♦r tr❛✐♥✐♥❣✳
❙❡q❴❉❡t❡❝t✐♦♥✭❡①❛♠♣❧❡s✱ ❢❛❧s❡❴♥❡❣❛t✐✈❡❴❧♦ss✮
▲❡t ♠❛①❴❧❛❜❡❧ ❂ ✶✱ ♠❛①❴♣r❡❞✐❝t✐♦♥ ❂ ✶ ❢♦r ✐ = ✶ t♦ ❧❡♥✭❡①❛♠♣❧❡s✮ ❞♦ ♠❛①❴❧❛❜❡❧ ← ♠❛①✭♠❛①❴❧❛❜❡❧✱ ❡①❛♠♣❧❡s❬✐❪✳❧❛❜❡❧✮ ❡♥❞ ❢♦r ❢♦r ✐ = ✶ t♦ ❧❡♥✭❡①❛♠♣❧❡s✮ ❞♦ ♠❛①❴♣r❡❞✐❝t✐♦♥ ← ♠❛①✭♠❛①❴♣r❡❞✐❝t✐♦♥✱ ♣r❡❞✐❝t✭❡①❛♠♣❧❡s❬✐❪✱ ❡①❛♠♣❧❡s❬✐❪✳❧❛❜❡❧✮✮ ❡♥❞ ❢♦r ✐❢ ♠❛①❴❧❛❜❡❧ ❃ ♠❛①❴♣r❡❞✐❝t✐♦♥ t❤❡♥ ❧♦ss✭❢❛❧s❡❴♥❡❣❛t✐✈❡❴❧♦ss✮ ❡❧s❡ ✐❢ ♠❛①❴❧❛❜❡❧ ❁ ♠❛①❴♣r❡❞✐❝t✐♦♥ t❤❡♥ ❧♦ss✭✶✮ ❡❧s❡ ❧♦ss✭✵✮ ❡♥❞ ✐❢ ❡♥❞ ✐❢
❙❡q❴❉❡t❡❝t✐♦♥✭❡①❛♠♣❧❡s✱ ❢❛❧s❡❴♥❡❣❛t✐✈❡❴❧♦ss✮
▲❡t ♠❛①❴❧❛❜❡❧ ❂ ✶✱ ♠❛①❴♣r❡❞✐❝t✐♦♥ ❂ ✶ ❢♦r ✐ = ✶ t♦ ❧❡♥✭❡①❛♠♣❧❡s✮ ❞♦ ♠❛①❴❧❛❜❡❧ ← ♠❛①✭♠❛①❴❧❛❜❡❧✱ ❡①❛♠♣❧❡s❬✐❪✳❧❛❜❡❧✮ ❡♥❞ ❢♦r ❢♦r ✐ = ✶ t♦ ❧❡♥✭❡①❛♠♣❧❡s✮ ❞♦ s♥❛♣s❤♦t✭✐✱ ✫✐✮ s♥❛♣s❤♦t✭✐✱ ✫♠❛①❴♣r❡❞✐❝t✐♦♥✮ ♠❛①❴♣r❡❞✐❝t✐♦♥ ← ♠❛①✭♠❛①❴♣r❡❞✐❝t✐♦♥✱ ♣r❡❞✐❝t✭❡①❛♠♣❧❡s❬✐❪✱ ❡①❛♠♣❧❡s❬✐❪✳❧❛❜❡❧✮✮ ❡♥❞ ❢♦r ✐❢ ♠❛①❴❧❛❜❡❧ ❃ ♠❛①❴♣r❡❞✐❝t✐♦♥ t❤❡♥ ❧♦ss✭❢❛❧s❡❴♥❡❣❛t✐✈❡❴❧♦ss✮ ❡❧s❡ ✐❢ ♠❛①❴❧❛❜❡❧ ❁ ♠❛①❴♣r❡❞✐❝t✐♦♥ t❤❡♥ ❧♦ss✭✶✮ ❡❧s❡ ❧♦ss✭✵✮ ❡♥❞ ✐❢ ❡♥❞ ✐❢
❍♦✇ ❞♦❡s ✐t ✇♦r❦❄
❆♥ ❆♣♣❧✐❝❛t✐♦♥ ♦❢ ✏▲❡❛r♥✐♥❣ t♦ ❙❡❛r❝❤✑ ❛❧❣♦r✐t❤♠s ✭❡✳❣✳ ❙❡❛r♥✴❉❆❣❣❡r✮✳ ❉❡❝♦❞❡r r✉♥ ♠❛♥② t✐♠❡s ❛t tr❛✐♥ t✐♠❡ t♦ ♦♣t✐♠✐③❡ ♣r❡❞✐❝t✭✳✳✳✮ ❢♦r ❧♦ss✭✳✳✳✮ ✉s✐♥❣ ♦♣t✐♦♥❛❧ s♥❛♣s❤♦t✭✳✳✳✮ ♦♣t✐♠✐③❛t✐♦♥✳
❆ ❙❡❛r❝❤ ❙♣❛❝❡
Start State
❆ ❙❡❛r❝❤ ❙♣❛❝❡
Start State
❆ ❙❡❛r❝❤ ❙♣❛❝❡
Start State
❆ ❙❡❛r❝❤ ❙♣❛❝❡
Start State
❆ ❙❡❛r❝❤ ❙♣❛❝❡
Start State
❆ ❙❡❛r❝❤ ❙♣❛❝❡
End States Start State
❆ ❙❡❛r❝❤ ❙♣❛❝❡
End States Start State Predict Loss
▲❡❛r♥✐♥❣ t♦ ❙❡❛r❝❤
End States Start State Rollout
▲❡❛r♥✐♥❣ t♦ ❙❡❛r❝❤
End States Start State Rollout Collapse
▲❡❛r♥✐♥❣ t♦ ❙❡❛r❝❤
End States Start State Loss Rollout Collapse
▲❡❛r♥✐♥❣ t♦ ❙❡❛r❝❤ ❉❡t❛✐❧s
❘♦❧❧♦✉ts r❡q✉✐r❡ ❡①✐st✐♥❣ ♣♦❧✐❝②✳ ❚❤❡ ♣♦❧✐❝② ❢♦r♠✿ π(①) = αtπ✵ + (✶ − αt)πt ✇❤❡r❡ αt = (✶ − ǫ)t ✐s ❛ st♦❝❤❛st✐❝ ✐♥t❡r♣♦❧❛t✐♦♥ ❛♥❞ πt ✐s t❤❡ ♣♦❧✐❝② tr❛✐♥❡❞ ♦♥ t − ✶ s❛♠♣❧❡s
- ✐✈❡♥ ❢❡❛t✉r❡s ①s ❛♥❞ ❛ ❧♦ss ❢♦r ❡❛❝❤ ❛❝t✐♦♥ ❧❛ ❢♦r♠
❝♦st✲s❡♥s✐t✐✈❡ tr❛✐♥✐♥❣ ❡①❛♠♣❧❡ (①s, ❧✶, ❧✷, ...❧❦) ✉s❡❞ t♦ ✉♣❞❛t❡ ♣♦❧✐❝②✳
❚❉❖▲❘ ♣r♦❣r❛♠ ❡q✉✐✈❛❧❡♥❝❡
❚❤❡♦r❡♠✿ ❊✈❡r② ❛❧❣♦r✐t❤♠ ✇❤✐❝❤✿ ✶✳ ❆❧✇❛②s t❡r♠✐♥❛t❡s✳ ✷✳ ❚❛❦❡s ❛s ✐♥♣✉t r❡❧❡✈❛♥t ❢❡❛t✉r❡ ✐♥❢♦r♠❛t✐♦♥ ❳✳ ✸✳ ▼❛❦❡ ✵+ ❝❛❧❧s t♦ ♣r❡❞✐❝t✳ ✹✳ ❘❡♣♦rts ❧♦ss ♦♥ t❡r♠✐♥❛t✐♦♥✳ ❞❡✜♥❡s ❛ s❡❛r❝❤ s♣❛❝❡✱ ❛♥❞ s✉❝❤ ❛♥ ❛❧❣♦r✐t❤♠ ❡①✐sts ❢♦r ❡✈❡r② s❡❛r❝❤ s♣❛❝❡✳
❖♣t✐♠✐③❛t✐♦♥
◆❛✐✈❡ tr❛✐♥✐♥❣ t✐♠❡✿ ❖(❑❚ ✷❉) ✇❤❡r❡✿ ❑ ❂ ♥✉♠❜❡r ♦❢ ❛❝t✐♦♥s ❚ ❂ ♥✉♠❜❡r ♦❢ ❞❡❝✐s✐♦♥s ❉ ❂ ❖r❛❝❧❡ ❝♦♠♣❧❡①✐t② ❚②♣✐❝❛❧ ❈❘❋ r✉♥t✐♠❡ ✐s ❖(❑ ✷❚) ❲✐t❤ ♦♣t✐♠✐③❛t✐♦♥s✱ t②♣✐❝❛❧❧② ❖ ❑❚ ✷ ❑❚❉ ✳
❖♣t✐♠✐③❛t✐♦♥
◆❛✐✈❡ tr❛✐♥✐♥❣ t✐♠❡✿ ❖(❑❚ ✷❉) ✇❤❡r❡✿ ❑ ❂ ♥✉♠❜❡r ♦❢ ❛❝t✐♦♥s ❚ ❂ ♥✉♠❜❡r ♦❢ ❞❡❝✐s✐♦♥s ❉ ❂ ❖r❛❝❧❡ ❝♦♠♣❧❡①✐t② ❚②♣✐❝❛❧ ❈❘❋ r✉♥t✐♠❡ ✐s ❖(❑ ✷❚) ❲✐t❤ ♦♣t✐♠✐③❛t✐♦♥s✱ t②♣✐❝❛❧❧② ❖(❑❚ ✷ + ❑❚❉)✳
❖♣t✐♠✐③❛t✐♦♥s
✶✳ ▼❡♠♦✐③❛t✐♦♥✳ ◆❡✈❡r ❝❛❧❧ t❤❡ ♦r❛❝❧❡ t✇✐❝❡ ✇✐t❤ t❤❡ s❛♠❡ ❢❡❛t✉r❡s ✇✐t❤✐♥ ❛♥ ❡①❛♠♣❧❡✳ ✷✳ Ps❡✉❞♦r❛♥❞♦♠ ✈❛r✐❛♥❝❡ r❡❞✉❝t✐♦♥✳ ❙t♦❝❤❛st✐❝ ✐♥t❡r♣♦❧❛t✐♦♥ ✉s❡s t❤❡ s❛♠❡ r❛♥❞♦♠ s❡❡❞ ♦♥ ❡❛❝❤ r♦❧❧♦✉t✳ ✸✳ ❙♥❛♣s❤♦t ❏✉♠♣ t♦ r♦❧❧♦✉t ♣♦✐♥t✳ ✹✳ ❘♦❧❧♦✉t ❈♦❧❧❛♣s❡✳ ◆♦ ♥❡❡❞ t♦ r♦❧❧♦✉t ❛❢t❡r t❤❡ st❛t❡ ❝♦❧❧❛♣s❡s✳
❖♣t✐♠✐③❛t✐♦♥s
✶✳ ▼❡♠♦✐③❛t✐♦♥✳ ◆❡✈❡r ❝❛❧❧ t❤❡ ♦r❛❝❧❡ t✇✐❝❡ ✇✐t❤ t❤❡ s❛♠❡ ❢❡❛t✉r❡s ✇✐t❤✐♥ ❛♥ ❡①❛♠♣❧❡✳ ✷✳ Ps❡✉❞♦r❛♥❞♦♠ ✈❛r✐❛♥❝❡ r❡❞✉❝t✐♦♥✳ ❙t♦❝❤❛st✐❝ ✐♥t❡r♣♦❧❛t✐♦♥ ✉s❡s t❤❡ s❛♠❡ r❛♥❞♦♠ s❡❡❞ ♦♥ ❡❛❝❤ r♦❧❧♦✉t✳ ✸✳ ❙♥❛♣s❤♦t ❏✉♠♣ t♦ r♦❧❧♦✉t ♣♦✐♥t✳ ✹✳ ❘♦❧❧♦✉t ❈♦❧❧❛♣s❡✳ ◆♦ ♥❡❡❞ t♦ r♦❧❧♦✉t ❛❢t❡r t❤❡ st❛t❡ ❝♦❧❧❛♣s❡s✳
❖♣t✐♠✐③❛t✐♦♥s
✶✳ ▼❡♠♦✐③❛t✐♦♥✳ ◆❡✈❡r ❝❛❧❧ t❤❡ ♦r❛❝❧❡ t✇✐❝❡ ✇✐t❤ t❤❡ s❛♠❡ ❢❡❛t✉r❡s ✇✐t❤✐♥ ❛♥ ❡①❛♠♣❧❡✳ ✷✳ Ps❡✉❞♦r❛♥❞♦♠ ✈❛r✐❛♥❝❡ r❡❞✉❝t✐♦♥✳ ❙t♦❝❤❛st✐❝ ✐♥t❡r♣♦❧❛t✐♦♥ ✉s❡s t❤❡ s❛♠❡ r❛♥❞♦♠ s❡❡❞ ♦♥ ❡❛❝❤ r♦❧❧♦✉t✳ ✸✳ ❙♥❛♣s❤♦t ❏✉♠♣ t♦ r♦❧❧♦✉t ♣♦✐♥t✳ ✹✳ ❘♦❧❧♦✉t ❈♦❧❧❛♣s❡✳ ◆♦ ♥❡❡❞ t♦ r♦❧❧♦✉t ❛❢t❡r t❤❡ st❛t❡ ❝♦❧❧❛♣s❡s✳
❖♣t✐♠✐③❛t✐♦♥s
✶✳ ▼❡♠♦✐③❛t✐♦♥✳ ◆❡✈❡r ❝❛❧❧ t❤❡ ♦r❛❝❧❡ t✇✐❝❡ ✇✐t❤ t❤❡ s❛♠❡ ❢❡❛t✉r❡s ✇✐t❤✐♥ ❛♥ ❡①❛♠♣❧❡✳ ✷✳ Ps❡✉❞♦r❛♥❞♦♠ ✈❛r✐❛♥❝❡ r❡❞✉❝t✐♦♥✳ ❙t♦❝❤❛st✐❝ ✐♥t❡r♣♦❧❛t✐♦♥ ✉s❡s t❤❡ s❛♠❡ r❛♥❞♦♠ s❡❡❞ ♦♥ ❡❛❝❤ r♦❧❧♦✉t✳ ✸✳ ❙♥❛♣s❤♦t ❏✉♠♣ t♦ r♦❧❧♦✉t ♣♦✐♥t✳ ✹✳ ❘♦❧❧♦✉t ❈♦❧❧❛♣s❡✳ ◆♦ ♥❡❡❞ t♦ r♦❧❧♦✉t ❛❢t❡r t❤❡ st❛t❡ ❝♦❧❧❛♣s❡s✳
❆♥ ♦✉t❧✐♥❡
✶✳ ❍♦✇❄
✶✳✶ Pr♦❣r❛♠♠✐♥❣ ✶✳✷ ▲❡❛r♥✐♥❣ t♦ ❙❡❛r❝❤ ✶✳✸ ❊q✉✐✈❛❧❡♥❝❡ ✶✳✹ ❖♣t✐♠✐③❛t✐♦♥s
✷✳ ❖t❤❡r ❘❡s✉❧ts
◆❛♠❡❞ ❊♥t✐t② ❘❡❝♦❣♥t✐♦♥
■s t❤✐s ✇♦r❞ ♣❛rt ♦❢ ❛♥ ♦r❣❛♥✐③❛t✐♦♥✱ ♣❡rs♦♥✱ ♦r ♥♦t❄
10-1 100 101 Training Time (minutes) 0.45 0.50 0.55 0.60 0.65 0.70 0.75 0.80 F-score (per entity)
80.0 79.2 73.3 76.5 75.9 76.5 74.6 78.3 10s 1m 10m
Named entity recognition (tuned hps)