Learning Translation Templates from Bilingual Translation Examples - PowerPoint PPT Presentation

1 / 22
About This Presentation
Title:

Learning Translation Templates from Bilingual Translation Examples

Description:

... Translation Templates from Bilingual Translation ... Example-based machine translation (EBMT) Main idea ... Difference translation template learning (DTTL) ... – PowerPoint PPT presentation

Number of Views:84
Avg rating:3.0/5.0
Slides: 23
Provided by: scch7
Category:

less

Transcript and Presenter's Notes

Title: Learning Translation Templates from Bilingual Translation Examples


1
Learning Translation Templates from Bilingual
Translation Examples
  • Source Applied Intelligence, 2001
  • Authors Ilyas Cicekli and H. Altay Guvenir
  • Reporter ???
  • Professor ???

2
Outline
  • Introduction
  • Translation Template Learner
  • System Architecture
  • Conclusion

3
Outline
  • Introduction
  • Translation Template Learner
  • System Architecture
  • Conclusion

4
Introduction
  • Example-based machine translation (EBMT)
  • Main idea
  • A given input sentence in the source language is
    compared with the example translations in the
    given bilingual parallel text to find the closest
    matching examples
  • Exemplars
  • The characteristic examples are stored in the
    memory
  • Template
  • An example translation pairs

5
Introduction
  • This paper
  • Use stem and morphemes to describe pairs
  • they are running lt-gt kosuyorlar
  • they are walking lt-gt yuruyorlar
  • they are runPROG lt-gt kosPROG3PL
  • they are walkPROG lt-gt yuruPROG3PL
  • Learn translation templates from translation
    examples and store them as generalized exemplars
  • Translation Template Learner
  • Similarity translation template learning
  • they are X1PROG lt-gt X2PROG3PLif X1 lt-gt X2
  • run lt-gt koswalk lt-gt yuru
  • Difference translation template learning
  • X1 run X2 lt-gt kos X2 X3
  • they lt-gt 3PL
  • PROG lt-gt PROG

6
Outline
  • Introduction
  • Translation Template Learner
  • System Architecture
  • Conclusion

7
Translation Templates
  • A translation template is a generalized
    translation exemplar pair.
  • Replace some components with variables
  • Atomic translation templates do not contain any
    variable

8
Translation Template Learner
  • Similarity translation template learning (STTL)
  • The similar parts in sentence pairs must be
    translations
  • Difference translation template learning (DTTL)
  • The remaining differing constituents in source
    and target sentences are matched if the sentences
    have similarities

9
Translation Template Learner
  • two translation examples (Ea, Eb)
  • a translation example Ea
  • (D1, D2) a difference between two sentences of a
    language
  • (S1, S2) a similarity between two sentences of a
    language
  • Ma,b match sequence
  • a similarity (a sequence of common items)
  • at least one similarity on each side must be
    non-empty
  • Ma,bW DV a new match sequence in Ma,b which all
    differences are replaced by proper variables
  • Ma,bW SV a new match sequence in Ma,b which all
    similarities are replaced by proper variables

10
Similarity Translation Template Learning
11
Difference Translation Template Learning
12
Different Number of Similarities or Differences
in Match Sequences
  • i came lt-gt geldimyou went lt-gt gittin
  • i comePAST lt-gt gelPAST1SGyou goPAST lt-gt
    gitPAST2SG
  • Match Sequence(I come, you go) PAST
    lt-gt (gel,git) PAST (1SG,2SG)
  • try to make the number of differences to be equal
    on both sides of a match sequence by separating
    differences before STTL algorithm

13
Differences Separating
  • Match Sequence(i come, you go) PAST
    lt-gt (gel,git) PAST (1SG,2SG)
  • Divide both constituents of difference into two
    parts from morpheme boundaries (i,you) (come,go)
    PAST lt-gt (gel,git) PAST (1SG,2SG)

14
Differences with Empty Constituents
  • i seePAST the man lt-gt adamACC gorPAST1SGi
    seePAST a man lt-gt bir adam gorPAST1SG
  • Let a difference to have an empty constituenti
    seePAST (thea) manlt-gt (ebir) adam (ACCe)
    gorPAST1SG

15
Examples
  • i comePAST lt-gt gelPAST1SGyou comePAST lt-gt
    gelPAST2SG
  • X1 comePAST lt-gt gelPAST X2 if X1 lt-gt X2i lt-gt
    1SGyou lt-gt 2SGi X1 lt-gt X2 1SG if X1 lt-gt
    X2you X1 lt-gt X2 2SG if X1 lt-gt X2comePAST lt-gt
    gelPAST

16
Performance Results
  • Training set
  • 747 English and Turkish pairs
  • Manually Tagging

17
Outline
  • Introduction
  • Translation Template Learner
  • System Architecture
  • Conclusion

18
System Architecture
19
Evaluation
  • Goal
  • accomplish top results contain correct
    translation
  • Order
  • statistical method
  • specify order according to the source language
  • a higher number of terminals is more specific
    than the other

20
Statistical Method
  • Confidence of templates
  • N1 the number of training pairs where X is a
    substring of Xi and Y is a substring of Yi
  • N2 the number of training pairs where X is a
    substring of Xi and Y is not a substring of Yi
  • Confidence of translations
  • R the set of rule generates the translation

21
Confidence Method
22
Outline
  • Introduction
  • Translation Template Learner
  • System Architecture
  • Conclusion

23
Conclusion
  • The major contribution is that the proposed TTL
    algorithm eliminates the need for manually
    encoding the translation templates.
Write a Comment
User Comments (0)
About PowerShow.com