Rigorous Approximated Determinization
- f Weighted Automata
Rigorous Approximated Determinization of Weighted Automata - - PowerPoint PPT Presentation
Rigorous Approximated Determinization of Weighted Automata Benjamin Aminof (Hebrew University) Orna Kupferman (Hebrew University) Robby Lampert (Weizmann Institute) Israel Outline Weighted automata Determinizability of weighted
Weighted automata Determinizability of weighted automata Mohri’s determinization algorithm Approximated-determinization algorithm Correctness and termination Summary Future work
w=abc w=abbd w=abb
q1/0 q0 q2/0
q3/0
a,1 c,1 b,2 a,1 b,1 d,1
A run of A on a word w=w1…wn
A run r is accepting $ rn is accepting.
L(A)={w: A has an accepting run on w}
wi
A cost of a run r=r0 r1 r2 … rn is
defined only for accepting runs
A cost of a word w=w1…wn is
If w62L(A) then cost(w)=1.
n
wi
A WFA A is trim if each of its states is
A WFA A is unambiguous (single-run) if
formal verification of quantitative
automatic speech recognition image compression pattern matching (widely used in
…
cost(abkc)=2k+2, cost(abkd)=k+2 After reading the word abk, the difference
For i≠j, a deterministic WFA must be in
A deterministic WFA must have 1 states. q1 q0 q2
q3/0
a,1 c,1 b,2 a,1 b,1 d,1
Weighted automata are not
To decide whether a given
A sufficient condition for
The twins property: In case the automaton is trim (no empty
q q0
u u v v
q’
{(q1,0), (q2,1)}
{(q0,0)} {(q1,0), (q2,1)}
a,? c,? {(q3,0)} /0 b,? d,?
q1 q0 q2
q3/0
a,3 c,5 a,4 d,4 b,2 b,3
{(q3,0)} /0 3 min {3,4} min {2,3} 2 3-2 5 0+5 5 0+0 4-3 1+4
{(q3,0)} /0
c,2 {(q1,0), (q2,2)} /3
q1 q0 q2/1
q3/0
a,1 d,2 a,3 d,1 b,4 b,1 c,2 c,2
{(q0,0)}
a,1 c,2 b,1 d,1 {(q1,0), (q2,2)} /3 {(q1,3), (q2,0)} /1
{(q3,0)} /0
c,2 {(q1,3), (q2,0)} /1 c,2 d,2
{(q3,0)} /0
b,2
q1 q0 q2
q3/0
a,1 c,1 a,1 b,1 d,1
{(q1,1), (q2,0)} {(q0,0)} {(q1,0), (q2,0)}
a,1 b,1
{(q1,2), (q2,0)}
b,1
{(q1,3), (q2,0)}
b,1
b,2
q1 q0 q2
q3/0
a,1 c,1 a,1 b,1 d,1
{(q1,1), (q2,0)} {(q0,0)} {(q1,0), (q2,0)}
a,1 b,1
{(q1,2), (q2,0)}
b,1
{(q1,3), (q2,0)}
b,1
d
Mohri’s algorithm terminates iff
For trim and unambiguous WFAs,
There are determinizable WFAs that do
When exact determinization is impossible. When the result of exact determinization is
Σ,t Σ,0 a,1 Σ,0 Σ,0
Σ,0 n-1
+\Ln
+
Based on Mohri’s algorithm. Relaxes the condition for unification of
No guarantees about the new costs. No sufficient condition for termination.
Determinization up to a factor t
The new cost of any accepted word w is
differs from Mohri’s algorithm
Weights are multiplied by t. For each state in a subset we maintain
The criterion for unification of states is
{(q1,-1,0), (q2,-1,0)}
{(q0,0,0)}
a,? b,2 b,? b,2
q1 q0 q2
q3/0
a,1 c,1 a,1 b,1 d,1
a,2 {(q1,-1,1), (q2,-2,0)} b,2 lower bound cost(w) upper bound t ¢cost(w) d,2 {(q3,-2,0)} /0 {(q3,-2,0)} /0 c,2
residual ranges contain those of
a,2
a,2 b,2 b,2
q0/0
a,2 c,2
q2/0
a,1 b,1
b,2
{(q0,-1,0)} /0
q1/0
{(q0,0,0), (q1,0,0)} /0 {(q0,-1,0), (q2,0,2)} /0 {(q1,-2,0)} /0
c,4
{(q0,-2,0)} /0
a,2
{(q0,-2,0), (q1,0,4)} /0
b,2
{(q0,-2,0)} /0
a,2 b,2
{(q2,-4,0)} /0
a,4
{(q1,-4,0)} /0
c,4
{(q1,-6,0)} /0
b,4 b,4
Thm: If the algorithm terminates on a
Thm: If a WFA has the t-twins property,
The weights and the factor t are rational.
Thm: For trim unambiguous WFAs,
Thm: Deciding the t-twins property for
Why approximate determinization?
Non-determinizable WFA Equivalent deterministic is large
t-determinization algorithm
Weights multiplied by t Use ranges rather than single residues Collapse to a state whose ranges are contained in mine
A sufficient condition
The t-twins property For unambiguous WFAs – characterizes determinizability Decidable in polynomial time
Generalize the termination proof to the
An algorithm to decide whether a WFA