Presentation Outline - PowerPoint PPT Presentation

About This Presentation
Title:

Presentation Outline

Description:

Example-Based Machine Translation Based on the Synchronous SSTC Annotation Schema ... Example-Based Machine Translation (EBMT) EBMT is the case-based reasoning ... – PowerPoint PPT presentation

Number of Views:95
Avg rating:3.0/5.0
Slides: 45
Provided by: pusa77
Category:

less

Transcript and Presenter's Notes

Title: Presentation Outline


1
(No Transcript)
2
Presentation Outline
  • Introduction
  • Structured String-Tree Correspondence (SSTC)
  • Synchronous Structured String-Tree Correspondence
    (SSTC)
  • EBMT based on synchronous SSTC
  • The Construction of a BKB Based on the
    Synchronous SSTC
  • Bitext World-level Mapping (Word Alignment)
  • Bitext Synchronous Parsing Technique

3
The Structured String-Tree Correspondence (SSTC)
SSTC string arbitrary tree structure
correspondence
Correspondence node(X/Y)
4
(No Transcript)
5
(No Transcript)
6
(No Transcript)
7
(No Transcript)
8
Example-Based Machine Translation (EBMT)
EBMT is the case-based reasoning approach to MT
EBMT uses translated examples of similar
sentences to translate a given Source sentence
into the target sentence.
9
The general Architecture for EBMT
10
EBMT based on synchronous SSTC.
Different senses for the word bank bank 1 a
land beside the river. bank 2 a place to keep
money. E.g The1 man2 keep1 his1 money1 in1 the1
bank2.
Replacement Combination
11
Source sentence The old man picks the green lamp
up
12
Set of synchronous SSTCs represents Example-base.
English sentence The lamp is off. Malay
translation Lampu itu padam.
13
(No Transcript)
14
Source the old man picks the green lamp up
15
Sub-synchronous SSTCs for the source sentence
16
Selected closed example
Sub-synchronous SSTCs derived from the example
17
(No Transcript)
18
(No Transcript)
19
(No Transcript)
20
lelaki tua itu kutip lampu hijau itu
Generation
The translation for the source sentence is
generated from the synchronous SSTC the Malay
part, which is the String in the SSTC.

21
EBMT General Problems
  • How to utilize more than one example to translate
    one source sentence

The construction of well-formed target language
sentences from extracted fragments of a BKB.
  • lack of flexibility in representing translation
    relations between source and target substrings

The treatment of wild linguistic phenomena, which
are non-standard, e.g. crossed dependencies
22
(No Transcript)
23
(No Transcript)
24
(No Transcript)
25
  • The Construction of a BKB Based on the
    Synchronous SSTC

Based on Bitext Synchronous Parsing Technique
  • BiText Text that is available in two languages.

26
  • Schema

Parsing POS Tagging for the English source text
Build the SSTC for Malay target text based on the
SSTC for the English source text using the word
alignment
Compile the APP output into SSTC for the English
source text
27
(No Transcript)
28
Bitext World-level Mapping (Word Alignment)
Real texts are noisy - Fertility A single word
in the source sentence may correspond to zero,
one, two or more words in the target sentence and
vice versa.
- crossed dependencies (distortion) Where human
translators change and rearrange material so the
target output text will not flow well according
to the order of the source text.
29
(No Transcript)
30
(No Transcript)
31
n Context Window Word Alignment
The correspondence between the source and the
target is denoted by an interval attached to each
subtext according to its offset in the text.
32
n Context Window Word Alignment
Find the TPCs between the source and the target.
?(Bilingual dictionary)
Bilingual dictionary
33
n Context Window Word Alignment
Find out the chains for all possible TPCs for a
source word.
34
n Context Window Word Alignment
35
  • Bitext Synchronous Parsing Technique

36
(No Transcript)
37
Apple Pie Parser (APP)
  • It is a bottom-up probabilistic chart parser to
    find the parse tree for an input text (English).
  • It was developed at New York University.
  • The parser generates a syntactic tree in
    PennTreeBank bracketing.
  • It is Free, and available to download with the
    source code.
  • http//cs.nyu.edu/cs/projects/proteus/sekine

38
Apple Pie Parser (APP)
The basic idea of example-based parsing is very
simple
The representation structure and the POS for the
source English is obtained
39
(No Transcript)
40
Compile the APP output to SSTC structure
(S (NP (NPL The basic idea) (PP of (NPL
example-based parsing))) (VP is (ADJP very
simple)))
41
Lexical Transfer
42
(No Transcript)
43
The synchronous SSTC editor.
44
Discussion
Thank you..
Write a Comment
User Comments (0)
About PowerShow.com