1
Training Tree Transducers
  • Authors: Jonathan Graehl, Kevin Knight
  • Presented by Zhengbo Zhou

2
Outline
  • Finite State Transducers (FSTs) and R
  • Trees and Regular Tree Grammars
  • xR and Derivation Tree
  • Inside-Outside algorithm and EM training
  • Turning trees into strings (xRS)
  • Example and Related Work
  • My thoughts/questions

3
Finite State Transducers (FSTs)
  • A finite-state transducer, as we have already learned →
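A minimal sketch of an FST as a transition table, assuming a toy state/word
encoding with transliterated Arabic output (not taken from the slides):

    # Toy FST: (state, input word) -> (output word, next state).
    fst = {
        ("q0", "he"):     ("huwa", "q1"),
        ("q1", "drinks"): ("yashrabu", "q2"),
        ("q2", "water"):  ("maa", "q3"),
    }

    def transduce(words, start="q0", finals={"q3"}):
        state, out = start, []
        for w in words:
            sym, state = fst[(state, w)]   # KeyError means no transition: reject
            out.append(sym)
        return out if state in finals else None

    print(transduce(["he", "drinks", "water"]))   # ['huwa', 'yashrabu', 'maa']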

4
R transducer
  • An R transducer compactly represents a potentially
    infinite set of input/output tree pairs,
  • while an FST compactly represents such a set of
    input/output string pairs.
  • R is a generalization of the FST from strings to trees.

5
Example of R
  • He drinks water

6
Example of R (cont.)
Rule 1
Rules 2, 3, 4
English order: S(PRO, VP(V, NP))
Arabic order: S(V, PRO, NP)
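A minimal sketch of the reordering rule above, assuming trees are encoded as
nested tuples (label, child, ...) with string leaves; the numbered rules in
the slide's figure are not reproduced here:

    def reorder(tree):
        """Apply S(PRO, VP(V, NP)) -> S(V, PRO, NP) top-down (English -> Arabic order)."""
        if isinstance(tree, str):              # leaf word: copy through
            return tree
        label, *kids = tree
        if (label == "S" and len(kids) == 2
                and kids[0][0] == "PRO" and kids[1][0] == "VP"):
            pro, (_, v, np) = kids             # unpack VP(V, NP)
            return ("S", reorder(v), reorder(pro), reorder(np))
        return (label,) + tuple(reorder(k) for k in kids)

    english = ("S", ("PRO", "he"), ("VP", ("V", "drinks"), ("NP", "water")))
    print(reorder(english))  # ('S', ('V', 'drinks'), ('PRO', 'he'), ('NP', 'water'))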
7
Trees
  • Definitions

8
Regular Tree Grammars (RTG)
  • Regular Tree Grammar, a common way of compactly
    representing a potentially infinite set of trees.
  • wRTG is just like WFSA.
  • wRTG G (?,N,S,P)
  • ? alphabet
  • N nonterminals
  • S start nonterminal
  • Weighted
    productions
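A minimal sketch of a wRTG in the same tuple encoding, with a toy grammar
assumed for illustration (the sample grammar on the next slide is in its figure):

    import random

    # Production: (lhs nonterminal, rhs tree, weight); nonterminal names
    # may appear as leaves of an rhs and are rewritten recursively.
    productions = [
        ("q",    ("S", "qpro", "qvp"), 1.0),
        ("qpro", ("PRO", "he"), 1.0),
        ("qvp",  ("VP", ("V", "drinks"), ("NP", "water")), 0.6),
        ("qvp",  ("VP", ("V", "runs")), 0.4),
    ]

    def expand(sym):
        """Sample one derivation: rewrite nonterminal leaves until none remain."""
        rules = [(rhs, w) for lhs, rhs, w in productions if lhs == sym]
        if rules:                          # nonterminal: pick a production by weight
            rhs = random.choices([r for r, _ in rules],
                                 weights=[w for _, w in rules])[0]
            return expand(rhs)
        if isinstance(sym, tuple):         # interior node: expand the children
            return (sym[0],) + tuple(expand(k) for k in sym[1:])
        return sym                         # terminal leaf

    print(expand("q"))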

9
Sample wRTG
10
Extended-LHS Tree Transducer (xR)
  • Different from R: xR explicitly represents the
    lookahead and movement with a more specific LHS.
  • The form of the LHS is a tree pattern.
  • The pattern is used to match an input subtree.
  • There is a finite set of tree patterns.
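A minimal sketch of matching an extended LHS against an input subtree, assuming
variables are written as the strings "x0", "x1", ... (a hypothetical encoding):

    def match(pattern, tree, binding=None):
        """Return variable bindings if pattern matches tree, else None."""
        binding = {} if binding is None else binding
        if isinstance(pattern, str) and pattern.startswith("x"):
            binding[pattern] = tree        # variable: capture the whole subtree
            return binding
        if isinstance(pattern, str) or isinstance(tree, str):
            return binding if pattern == tree else None
        if pattern[0] != tree[0] or len(pattern) != len(tree):
            return None                    # label or arity mismatch
        for p, t in zip(pattern[1:], tree[1:]):
            if match(p, t, binding) is None:
                return None
        return binding

    lhs = ("S", "x0", ("VP", "x1", "x2"))  # extended LHS: looks two levels deep
    tree = ("S", ("PRO", "he"), ("VP", ("V", "drinks"), ("NP", "water")))
    print(match(lhs, tree))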

11
Binary Relation
12
Derivation Tree
  • We have many kinds of trees now, but the derivation tree
    is a representation of the transducer's derivation (the
    rules applied), neither the input tree nor the output tree.
  • But a derivation tree deterministically produces
    a single weighted output tree.
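A minimal sketch of replaying a derivation tree into its single weighted output
tree, with a hypothetical rule table (the rule names r1..r4 and the "*" child
slots are assumptions of this encoding):

    from math import prod

    rules = {  # rule name -> (rhs template with "*" slots for children, weight)
        "r1": (("S", "*", "*"), 0.5),
        "r2": (("PRO", "he"), 1.0),
        "r3": (("VP", ("V", "drinks"), "*"), 0.8),
        "r4": (("NP", "water"), 1.0),
    }

    def replay(deriv):
        """Replay a derivation tree (rule, subderivation, ...) bottom-up."""
        name, *kids = deriv if isinstance(deriv, tuple) else (deriv,)
        rhs, w = rules[name]
        outs = [replay(k) for k in kids]           # (tree, weight) per child
        slot = iter(t for t, _ in outs)
        def fill(t):                               # substitute child trees for "*"
            if t == "*":
                return next(slot)
            if isinstance(t, tuple):
                return (t[0],) + tuple(fill(c) for c in t[1:])
            return t
        return fill(rhs), w * prod(wt for _, wt in outs)

    deriv = ("r1", "r2", ("r3", "r4"))
    print(replay(deriv))   # output tree with weight 0.5 * 1.0 * 0.8 * 1.0 = 0.4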

13
Derivation tree → derivation wRTG
14
Inside-Outside algorithm
  • Basic idea of the inside-outside algorithm:
    use the current rule probabilities to estimate the
    expected frequencies of certain types of derivation
    steps, then compute new probabilities for those rules. [1]
  • Generally:
  • the inside probability covers what happens below A,
    e.g. re-estimating p(A → a), possibly through A → B C;
  • the outside probability covers the context around A,
    e.g. through C → A B or C → B A.

15
Inside-Outside for wRTG
  • Inside weights using G are given by βG
  • Outside weights by αG
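A minimal sketch of the inside pass βG over a wRTG, reusing the toy tuple
encoding (assumed, and assuming the grammar is acyclic so the recursion
terminates; the outside pass αG mirrors it from the root down):

    productions = [
        ("q",   ("S", ("PRO", "he"), "qvp"), 1.0),
        ("qvp", ("VP", ("V", "drinks"), ("NP", "water")), 0.6),
        ("qvp", ("VP", ("V", "runs")), 0.4),
    ]

    def beta(sym):
        """Inside weight: total weight of all trees derivable from sym."""
        rules = [(rhs, w) for lhs, rhs, w in productions if lhs == sym]
        if rules:                        # nonterminal: sum over its productions
            return sum(w * beta(rhs) for rhs, w in rules)
        if isinstance(sym, tuple):       # interior node: product over children
            total = 1.0
            for kid in sym[1:]:
                total *= beta(kid)
            return total
        return 1.0                       # terminal leaf

    print(beta("q"))   # 1.0 * (0.6 + 0.4) = 1.0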

16
EM training
  • EM training maximizes the corpus likelihood by
    repeatedly estimating the expected counts of each
    decision (E-step) and maximizing by assigning those
    counts to the parameters and renormalizing (M-step).
  • Algorithm 2 implements EM xR training by
    repeatedly computing inside-outside weights.
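A minimal sketch of the EM loop only (not the paper's Algorithm 2);
expected_counts is a hypothetical stand-in for the inside-outside pass over
each pair's derivations, and rules are assumed encoded as (lhs_state, ...) tuples:

    from collections import defaultdict

    def em_train(weights, corpus, expected_counts, iterations=10):
        """weights: {rule: probability}; corpus: list of (input, output) pairs."""
        for _ in range(iterations):
            counts = defaultdict(float)
            for pair in corpus:                        # E-step (inside-outside)
                for rule, c in expected_counts(weights, pair).items():
                    counts[rule] += c
            totals = defaultdict(float)                # M-step: renormalize
            for rule, c in counts.items():             # per left-hand-side state
                totals[rule[0]] += c                   # rule[0] = lhs state
            weights = {r: c / totals[r[0]] for r, c in counts.items()}
        return weights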

17
From tree to string
  • We can use an extended-LHS tree transducer (xR)
    to get an output tree from an input tree (say, a
    parse tree), but the result is still a (parse)
    tree, not a sentence in another language (as
    machine translation needs).
  • Now we have xRS, a tree-to-string transducer.

18
Tree-to-string transducer
  • Weighted extended-LHS root-to-frontier
    tree-to-string transducer
  • X = (Σ, Δ, Q, Qi, R)
  • It is similar to xR, but the RHS consists of strings
    instead of trees.
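A minimal sketch of one xRS rule in the tuple encoding: the LHS is the tree
pattern S(PRO, VP(V, NP)) and the RHS is the string "x1 x0 x2" (toy rule,
assumed for illustration):

    def to_string(tree):
        """Apply the toy rule, then read the leaf words off each subtree."""
        if isinstance(tree, str):
            return [tree]
        label, *kids = tree
        if label == "S" and len(kids) == 2 and kids[1][0] == "VP":
            pro, (_, v, np) = kids
            order = (v, pro, np)       # RHS string "x1 x0 x2": verb first
        else:
            order = kids               # default: left-to-right read-off
        return [w for sub in order for w in to_string(sub)]

    english = ("S", ("PRO", "he"), ("VP", ("V", "drinks"), ("NP", "water")))
    print(" ".join(to_string(english)))   # drinks he water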

19
Example
  • Implemented the translation model of Yamada and
    Knight (2001).
  • There is a trainable xRS tree-to-string
    transducer that embodies this model.

20
Example
21
Related Work
  • TSG vs. RTG (equivalent)
  • xR vs. weighted synchronous TSG (similar)
  • EM training vs. the forward-backward algorithm for
    finite-state (string) transducers, and also for HMMs

22
Questions
  • Is there any future work on this tree transducer,
    especially for machine translation?
  • Precision? Recall?
  • Also a little confused by the descriptions of
    the two relations ⇒x and ⇒G.
  • Not very sure about the inside-outside algorithm.

Questions?
23
  • Thank you!!

24
Reference
  • [1] Fernando Pereira and Yves Schabes. Inside-Outside
    Reestimation from Partially Bracketed Corpora. In
    Proceedings of ACL, 1992.

25
What might be useful
  • "An Overview of Probabilistic Tree Transducers for
    Natural Language Processing", Kevin Knight and
    Jonathan Graehl

26
  • R: Top-down transducer, introduced before.
  • F: Bottom-up (frontier-to-root) transducer,
    with similar rules, but transforming the leaves
    of the input tree first and working its way up.
  • L: Linear transducer, which prohibits copying
    subtrees. Rule 4 in Figure 4 is an example of a
    copying production, so that whole transducer is R
    but not RL.
  • N: Non-deleting transducer, which requires that
    every left-hand-side variable also appear on the
    right-hand side. A deleting R transducer can
    simply delete a subtree (without inspecting it).
    The transducer in Figure 4 is the deleting kind,
    because of rules 34-39. It would also be deleting
    if it included a rule for dropping English
    determiners, e.g., q NP(x0, x1) → q x1.
  • D: Deterministic transducer, with a maximum of
    one production per <state, symbol> pair.
  • T: Total transducer, with a minimum of one
    production per <state, symbol> pair.
  • PDTT: Push-down tree transducer, the transducer
    analog of CFTG [36].
  • Subscript: Regular-lookahead transducer, which
    can check whether an input subtree is
    tree-regular, i.e., whether it belongs to a
    specified RTL. Productions only fire when their
    lookahead conditions are met.
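A minimal sketch of the copying vs. deleting distinction in the tuple encoding
(toy rules; the numbered rules of Figure 4 are not reproduced here):

    def copying_rule(tree):
        """Copying: x0 appears twice on the RHS, so the transducer is not linear (L)."""
        label, x0 = tree
        return (label, x0, x0)

    def deleting_rule(tree):
        """Deleting: x0 never appears on the RHS, like q NP(x0, x1) -> q x1."""
        label, x0, x1 = tree
        return (label, x1)

    print(deleting_rule(("NP", ("DET", "the"), ("N", "water"))))
    # ('NP', ('N', 'water'))  -- the determiner subtree is deleted unseen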

27
(No Transcript)
28
(No Transcript)