Title: A Fast Finitestate Relaxation Method for Enforcing Global Constraints on Sequence Decoding
1A FastFinite-state Relaxation Methodfor
Enforcing Global Constraintson Sequence Decoding
- Roy Tromble
- Jason Eisner
- Johns Hopkins University
2We know what the labels should look like!
- Agreement
- Named Entity Recognition (Finkel et al., ACL
2005) - Seminar announcements (Finkel et al., ACL 2005)
- Label structure
- Bibliography parsing (Peng McCallum, HLT-NAACL
2004) - Semantic Role Labeling (Roth Yih, ICML 2005)
Seminar Friday, April 1 Speaker Monty
Hall Location Auditorium 1 Lets Make a
Dilemma Monty Hall will host a discussion of his
famous paradox.
3Finite-state constraint relaxation
Local models
Sequence modeling quality
Decoding runtime
Global constraints
Exploit the quality of the local models!
4Semantic Role Labeling
- Label each argument to a verb
- Six core argument types (A0-A5)
- CoNLL-2004 shared task
- Penn Treebank section 20
- 4305 propositions
- Follow Roth Yih (ICML 2005)
A1 A1 A1 O O A4 O A3 O
5Encoding constraints as finite-state automata
6Roth Yihs constraints as FSAs
A0A0A0
A1A1A1
NO DUPLICATE ARGUMENTS
Each argument type (A0, A1, ...) can label at
most one sub-sequence of the input.
7Roth Yihs constraints as FSAs
Regular expressions on any sequences grep for
sequence models
OO?
AT LEAST ONE ARGUMENT
The label sequence must contain at least one
instance that is not O.
8Roth Yihs constraints as FSAs
DISALLOW ARGUMENTS
Only allow argument types that are compatible
with the propositions verb.
9Roth Yihs constraints as FSAs
KNOWN VERB POSITION
The propositions verb must be labeled O.
10Roth Yihs constraints as FSAs
Any constraints on bounded-length sequences
ARGUMENT CANDIDATES
Certain sub-sequences must receive a single label.
11Roth Yihs local model as a lattice
Soft constraints or features
12A brute-force FSA decoder
13NO DUPLICATE A0
14NO DUPLICATE A0, A1
15(No Transcript)
16Satisfying global constraints is NP-hard.
Any approach would blow up in worst case!
NO DUPLICATE ARGUMENTS
17Handling an NP-hard problem
- Roth Yih (ICML 2005)
- Express path decoding and global constraints as
an integer linear program (ILP). - Apply ILP solver
- Relax ILP to (real-valued) LP.
- Apply polynomial-time LP solver.
- Branch and bound to find optimal integer
solution.
18The ILP solver doesnt know its labeling
sequences
Path constraints State 0 outflow 1 State 3
inflow 1 States 1 2 outflow inflow At
least one argument Arcs labeled O flow 1
19Maybe we can fixthe brute-force decoder?
20Local model usually violated no constraints
21Most constraints were rarely violated
22Finite-state constraint relaxation
- Local models already capture much structure.
- Relax the constraints instead!
- Find best path using linear decoding algorithm.
- Apply only those global constraints that path
violates.
23Brute-force algorithm
24Constraint relaxation algorithm
no
yes
25Finite-state constraint relaxation is faster than
the ILP solver
- State-of-the-art implementations
- Xpress-MP for ILP,
- FSA (Kanthak Ney, ACL 2004) for constraint
relaxation.
26No sentences required more than a few iterations
27Buy one, get one free
A1
A4
A3
A1
Sales for the quarter rose to 1.63 billion from
1.47 billion .
28Lattices remained small
29Take-home message
- Global constraints arent usually doing that much
work for you - Typical examples violate only a small number
using local models. - They shouldnt have to slow you down so much,
even though theyre NP-hard in the worst case - Figure out dynamically which ones need to be
applied.
30Future work
- General soft constraints
- (We discuss binary soft constraints in the
paper.) - Choose order to test and apply constraints, e.g.
by reinforcement learning. - k-best decoding
31Thanks
- to Scott Yih for providing both data and runtime,
and - to Stephan Kanthak for FSA.