A Fast Finitestate Relaxation Method for Enforcing Global Constraints on Sequence Decoding - PowerPoint PPT Presentation

1 / 31
About This Presentation
Title:

A Fast Finitestate Relaxation Method for Enforcing Global Constraints on Sequence Decoding

Description:

Finite-state Relaxation Method. for Enforcing Global Constraints. on Sequence Decoding ... Finite-state constraint relaxation is faster than the ILP solver ... – PowerPoint PPT presentation

Number of Views:43
Avg rating:3.0/5.0
Slides: 32
Provided by: non8119
Category:

less

Transcript and Presenter's Notes

Title: A Fast Finitestate Relaxation Method for Enforcing Global Constraints on Sequence Decoding


1
A FastFinite-state Relaxation Methodfor
Enforcing Global Constraintson Sequence Decoding
  • Roy Tromble
  • Jason Eisner
  • Johns Hopkins University

2
We know what the labels should look like!
  • Agreement
  • Named Entity Recognition (Finkel et al., ACL
    2005)
  • Seminar announcements (Finkel et al., ACL 2005)
  • Label structure
  • Bibliography parsing (Peng McCallum, HLT-NAACL
    2004)
  • Semantic Role Labeling (Roth Yih, ICML 2005)

Seminar Friday, April 1 Speaker Monty
Hall Location Auditorium 1 Lets Make a
Dilemma Monty Hall will host a discussion of his
famous paradox.
3
Finite-state constraint relaxation
Local models
Sequence modeling quality
Decoding runtime
Global constraints
Exploit the quality of the local models!
4
Semantic Role Labeling
  • Label each argument to a verb
  • Six core argument types (A0-A5)
  • CoNLL-2004 shared task
  • Penn Treebank section 20
  • 4305 propositions
  • Follow Roth Yih (ICML 2005)

A1 A1 A1 O O A4 O A3 O
5
Encoding constraints as finite-state automata
6
Roth Yihs constraints as FSAs
A0A0A0
A1A1A1
NO DUPLICATE ARGUMENTS
Each argument type (A0, A1, ...) can label at
most one sub-sequence of the input.
7
Roth Yihs constraints as FSAs
Regular expressions on any sequences grep for
sequence models
OO?
AT LEAST ONE ARGUMENT
The label sequence must contain at least one
instance that is not O.
8
Roth Yihs constraints as FSAs
DISALLOW ARGUMENTS
Only allow argument types that are compatible
with the propositions verb.
9
Roth Yihs constraints as FSAs
KNOWN VERB POSITION
The propositions verb must be labeled O.
10
Roth Yihs constraints as FSAs
Any constraints on bounded-length sequences
ARGUMENT CANDIDATES
Certain sub-sequences must receive a single label.
11
Roth Yihs local model as a lattice
Soft constraints or features
12
A brute-force FSA decoder
13
NO DUPLICATE A0
14
NO DUPLICATE A0, A1
15
(No Transcript)
16
Satisfying global constraints is NP-hard.
Any approach would blow up in worst case!
NO DUPLICATE ARGUMENTS
17
Handling an NP-hard problem
  • Roth Yih (ICML 2005)
  • Express path decoding and global constraints as
    an integer linear program (ILP).
  • Apply ILP solver
  • Relax ILP to (real-valued) LP.
  • Apply polynomial-time LP solver.
  • Branch and bound to find optimal integer
    solution.

18
The ILP solver doesnt know its labeling
sequences
Path constraints State 0 outflow 1 State 3
inflow 1 States 1 2 outflow inflow At
least one argument Arcs labeled O flow 1
19
Maybe we can fixthe brute-force decoder?
20
Local model usually violated no constraints
21
Most constraints were rarely violated
22
Finite-state constraint relaxation
  • Local models already capture much structure.
  • Relax the constraints instead!
  • Find best path using linear decoding algorithm.
  • Apply only those global constraints that path
    violates.

23
Brute-force algorithm
24
Constraint relaxation algorithm
no
yes
25
Finite-state constraint relaxation is faster than
the ILP solver
  • State-of-the-art implementations
  • Xpress-MP for ILP,
  • FSA (Kanthak Ney, ACL 2004) for constraint
    relaxation.

26
No sentences required more than a few iterations
27
Buy one, get one free
A1
A4
A3
A1
Sales for the quarter rose to 1.63 billion from
1.47 billion .
28
Lattices remained small
29
Take-home message
  • Global constraints arent usually doing that much
    work for you
  • Typical examples violate only a small number
    using local models.
  • They shouldnt have to slow you down so much,
    even though theyre NP-hard in the worst case
  • Figure out dynamically which ones need to be
    applied.

30
Future work
  • General soft constraints
  • (We discuss binary soft constraints in the
    paper.)
  • Choose order to test and apply constraints, e.g.
    by reinforcement learning.
  • k-best decoding

31
Thanks
  • to Scott Yih for providing both data and runtime,
    and
  • to Stephan Kanthak for FSA.
Write a Comment
User Comments (0)
About PowerShow.com