GATE Evaluation Tools - PowerPoint PPT Presentation

Transcript and Presenter's Notes

Title: GATE Evaluation Tools


1
GATE Evaluation Tools
  • GATE Training Course
  • October 2006
  • Kalina Bontcheva

2
System development cycle
  1. Collect corpus of texts
  2. Manually annotate a gold standard
  3. Develop system
  4. Evaluate performance
  5. Go back to step 3 until the desired performance
    is reached

3
Corpora and System Development
  • Gold standard data is created by manual annotation
  • Corpora are typically divided into a training and
    a testing portion
  • Rules and/or learning algorithms are developed or
    trained on the training part
  • They are tuned on the testing portion in order to
    optimise
  • Rule priorities, rule effectiveness, etc.
  • Parameters of the learning algorithm and the
    features used (typical routine: 10-fold cross
    validation; see the sketch after this list)
  • Evaluation set: the best system configuration is
    run on this data and the system performance is
    obtained
  • No further tuning once the evaluation set is used!
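The training/testing split and the 10-fold routine above can be illustrated with a
short, self-contained sketch. This is not GATE code; the document names, fold count
and class name are invented purely for illustration.

  import java.util.ArrayList;
  import java.util.Collections;
  import java.util.List;

  // Minimal sketch of a 10-fold split over a corpus of document names.
  public class FoldSplitSketch {
      public static void main(String[] args) {
          List<String> docs = new ArrayList<>();
          for (int i = 1; i <= 100; i++) {
              docs.add("doc" + i + ".xml");      // placeholder document names
          }
          Collections.shuffle(docs);             // randomise before splitting

          int k = 10;                            // 10-fold cross-validation
          for (int fold = 0; fold < k; fold++) {
              List<String> test = new ArrayList<>();
              List<String> train = new ArrayList<>();
              for (int i = 0; i < docs.size(); i++) {
                  if (i % k == fold) {
                      test.add(docs.get(i));     // held-out portion for this fold
                  } else {
                      train.add(docs.get(i));    // used to develop rules / train the learner
                  }
              }
              System.out.printf("fold %d: %d train, %d test%n",
                                fold, train.size(), test.size());
          }
      }
  }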

4
Some NE Annotated Corpora
  • MUC-6 and MUC-7 corpora - English
  • CONLL shared task corpora
    http://cnts.uia.ac.be/conll2003/ner/ - NEs in English and German
    http://cnts.uia.ac.be/conll2002/ner/ - NEs in Spanish and Dutch
  • TIDES surprise language exercise (NEs in Cebuano
    and Hindi)
  • ACE English - http://www.ldc.upenn.edu/Projects/ACE/


6
The MUC-7 corpus
  • 100 documents in SGML
  • News domain
  • Named Entities:
  • 1880 Organizations (46%)
  • 1324 Locations (32%)
  • 887 Persons (22%)
  • Inter-annotator agreement very high (97%)
  • http://www.itl.nist.gov/iaui/894.02/related_projects/muc/proceedings/muc_7_proceedings/marsh_slides.pdf

7
The MUC-7 Corpus (2)
  • <ENAMEX TYPE="LOCATION">CAPE CANAVERAL</ENAMEX>,
    <ENAMEX TYPE="LOCATION">Fla.</ENAMEX> &MD;
    Working in chilly temperatures
    <TIMEX TYPE="DATE">Wednesday</TIMEX>
    <TIMEX TYPE="TIME">night</TIMEX>,
    <ENAMEX TYPE="ORGANIZATION">NASA</ENAMEX> ground crews
    readied the space shuttle Endeavour for launch on
    a Japanese satellite retrieval mission.
  • <p>
  • Endeavour, with an international crew of six, was
    set to blast off from the
    <ENAMEX TYPE="ORGANIZATION|LOCATION">Kennedy Space Center</ENAMEX>
    on <TIMEX TYPE="DATE">Thursday</TIMEX> at
    <TIMEX TYPE="TIME">4:18 a.m. EST</TIMEX>,
    the start of a 49-minute launching period. The
    <TIMEX TYPE="DATE">nine day</TIMEX> shuttle
    flight was to be the 12th launched in darkness.

8
ACE: Towards Semantic Tagging of Entities
  • MUC NE tags segments of text whenever that text
    represents the name of an entity
  • In ACE (Automated Content Extraction), these
    names are viewed as mentions of the underlying
    entities. The main task is to detect (or infer)
    the mentions in the text of the entities
    themselves
  • Rolls together the NE and CO tasks
  • Domain- and genre-independent approaches
  • ACE corpus contains newswire, broadcast news (ASR
    output and cleaned), and newspaper reports (OCR
    output and cleaned)

9
ACE Entities
  • Dealing with:
  • Proper names, e.g., England, Mr. Smith, IBM
  • Pronouns, e.g., he, she, it
  • Nominal mentions, e.g., the company, the spokesman
  • Identify which mentions in the text refer to
    which entities, e.g.,
  • Tony Blair, Mr. Blair, he, the prime minister, he
  • Gordon Brown, he, Mr. Brown, the chancellor

10
ACE Example
  <entity ID="ft-airlines-27-jul-2001-2"
          GENERIC="FALSE"
          entity_type="ORGANIZATION">
    <entity_mention ID="M003"
                    TYPE="NAME"
                    string="National Air Traffic Services">
    </entity_mention>
    <entity_mention ID="M004"
                    TYPE="NAME"
                    string="NATS">
    </entity_mention>
    <entity_mention ID="M005"
                    TYPE="PRO"
                    string="its">
    </entity_mention>
    <entity_mention ID="M006"
                    TYPE="NAME"
                    string="Nats">
    </entity_mention>

11
Annotate Gold Standard: Manual Annotation in GATE GUI
12
Ontology-Based Annotation (coming in GATE 4.0)
13
Two GATE evaluation tools
  • AnnotationDiff
  • Corpus Benchmark Tool

14
AnnotationDiff
  • Graphical comparison of 2 sets of annotations
  • Visual diff representation, like tkdiff
  • Compares one document at a time, one annotation
    type at a time
  • Gives scores for precision, recall, F-measure,
    etc. (see the worked sketch after this list)
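The precision, recall and F-measure figures that AnnotationDiff reports follow the
standard definitions: precision is the share of the system's annotations that match
the gold standard, recall is the share of gold-standard annotations the system found,
and F-measure is their harmonic mean. The sketch below is a stand-alone calculation
over correct/spurious/missing counts with invented numbers; it is not the
AnnotationDiff source code.

  // Stand-alone sketch of the standard precision/recall/F-measure calculation.
  public class PRFSketch {
      static double precision(int correct, int spurious) {
          return correct / (double) (correct + spurious);   // correct / all system annotations
      }
      static double recall(int correct, int missing) {
          return correct / (double) (correct + missing);    // correct / all gold-standard annotations
      }
      static double fMeasure(double p, double r) {
          return (p + r == 0) ? 0.0 : 2 * p * r / (p + r);  // harmonic mean of P and R
      }
      public static void main(String[] args) {
          int correct = 90, spurious = 10, missing = 20;    // illustrative counts only
          double p = precision(correct, spurious);
          double r = recall(correct, missing);
          System.out.printf("P=%.2f R=%.2f F1=%.2f%n", p, r, fMeasure(p, r));
      }
  }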

15
Annotation Diff
16
Corpus Benchmark Tool
  • Compares annotations at the corpus level
  • Compares all annotation types at the same time,
    i.e. gives an overall score, as well as a score
    for each annotation type
  • Enables regression testing, i.e. comparison of 2
    different versions against gold standard
  • Visual display, can be exported to HTML
  • Granularity of results: the user can decide how
    much information to display
  • Results in terms of Precision, Recall, F-measure

17
Corpus structure
  • The corpus benchmark tool requires a particular
    directory structure
  • Each corpus must have a clean and a marked
    directory
  • Clean holds the unannotated versions, while marked
    holds the annotated (gold standard) ones
  • There may also be a processed subdirectory; this
    is a datastore (unlike the other two)
  • Corresponding files in each subdirectory must have
    the same name (an example layout is sketched below)
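For example, a corpus prepared for the tool might be laid out as below. The corpus
and file names are invented; only the clean/marked/processed naming and the
matching-file-names requirement follow the slide.

  mycorpus/
    clean/        unannotated documents
      doc01.xml
      doc02.xml
    marked/       gold-standard (manually annotated) versions, same file names
      doc01.xml
      doc02.xml
    processed/    a GATE datastore holding previously stored system output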

18
How it works
  • Uses the clean, marked, and processed directories
  • corpus_tool.properties must be in the directory
    from which GATE is executed
  • It specifies configuration information about:
  • Which annotation types are to be evaluated
  • The threshold below which to print out debug info
  • The input set name and key set name
    (an illustrative file is sketched after this list)
  • Modes:
  • Default: regression testing
  • Human-marked against already stored, processed
    results
  • Human-marked against current processing results
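A configuration file covering the items above might look roughly like the sketch
below. The property names are assumed placeholders chosen to match the slide's
description, not verified keys; check the GATE documentation for the exact names
the tool expects.

  # corpus_tool.properties -- illustrative sketch only; property names are assumed
  # annotation types to be evaluated
  annotTypes=Person;Organization;Location
  # threshold below which per-document debug information is printed
  threshold=0.5
  # annotation set holding the gold-standard (key) annotations
  keySetName=Key
  # annotation set holding the system's (response) annotations
  responseSetName=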

19
Corpus Benchmark Tool