Pranav Anand, Caroline Andrews, Matthew Wagers - PowerPoint PPT Presentation

About This Presentation
Title:

Pranav Anand, Caroline Andrews, Matthew Wagers

Description:

... Mechanical Turk Workers Systematic Debriefing Survey Experiment 2 Prompt Type Effect for Prompt Experiment 2 Prompt Type Effect for Prompt (p – PowerPoint PPT presentation

Number of Views:120
Avg rating:3.0/5.0
Slides: 19
Provided by: Semant5
Category:

less

Transcript and Presenter's Notes

Title: Pranav Anand, Caroline Andrews, Matthew Wagers


1
Assessing the pragmatics of experiments with
crowdsourcing The case of scalar implicature
  • Pranav Anand, Caroline Andrews, Matthew Wagers

University of California, Santa Cruz
2
Experiments Pragmatic Processing
  • Each of the critics reviewed some of the
    movies.

Case Study (Embedded) Implicatures
but not all ?
Depending on the study
evidence for EIs, with different response
choices
- no evidence of EIs
Worry Are we adequately testing the influence
of methodologies on our data?
Worry How much do methodologies themselves
influence judgements?
Previous Limitation Lack of Subjects and Money
Crowd-sourcing addresses both problems
3
Pragmatics of Experimental Situations
Evaluation Apprehension subjects know they are
being judged

Teleological Curiosity - Subjects hypothesizing
expected behavior, matching an ideal
Worry How much do methodologies themselves
influence judgements?
The experiment itself is part of the pragmatic
context
See Rosenthal Rosnow. (1975) The Volunteer
Subject.
4
Elements of Experimental Context

Protocol Social Context / Task Specification
Response Structure Response choices available
to the subject
e.g. True / False, Yes / No, 1-7 scale
Prompt the Question
directions for the Response Structure
Immediate Linguistic/Visual Context
Our Goal Explore variations of these elements
in a systematic way
Worry How much do methodologies themselves
influence judgements?
5
Experimental Design

Is this an accurate description? Some of the
spices have red lids.
Linguistic Contexts All Relevant, All
Irrelevant, No Context
Protocol
Experimental normal experiment instructions
Annotation checking the work of unaffiliated
annotators
4 Implicature Targets, 6 Some/All Controls, 20
Fillers
6
Experiment 1Social Context
Focus on Protocol Annotation vs Experiment

All Irrelevant No Story All-Relevant
Experiment
Annotation
Accuracy Prompt - Is this an accurate
description?
Response Categories - Yes, No, Dont Know
Population Undergraduates
7
Experiment 1Social Context
Finding Social context even when linguistic
context does not.

Linguistic Context No Effect
8
Experiment 1Social Context
Finding Social context even when linguistic
context does not.

Lower SI rate for Annotation (plt0.05)
9
Experiment 2Prompt Type
Accuracy Prompt - Is this an accurate
description?

Response Categories - Yes, No, Dont Know
Informativity Prompt - How Informative is
this sentence?
Response Categories - Not Informative
Enough Informative Enough Too
Much Information False
Population Mechanical Turk Workers
Systematic Debriefing Survey
10
Experiment 2Prompt Type
Effect for Prompt
11
Experiment 2Prompt Type
Effect for Prompt (plt0.001)
Effect for Context (plt0.001)
12
Experiment 2Prompt Type
Effect for Prompt (plt0.001)
Effect for Context (plt0.001)
Weak Interaction Prompt x Context (plt0.06)
13
Experiment 2Prompt Type
No Effect for Protocol
14
Experiment 2Prompt Type
Low SI rates overall
But the debriefing survey indicates that
(roughly) 70 of participants were aware of
some/all contrast
15
Populations
Turkers More sensitive to Linguistic
Context Less sensitive to changes in changes
in social context/ evaluation apprehension

Undergraduates More sensitive to Protocol
16
Take Home Points
  • Methodological variables should be explored
    alongside conventional linguistic variables
  • Ideal models of these processes (cf. Schutze
    1996)
  • Crowdsourcing allows for cheap/fast exploration
    of parameter spaces
  • New Normal Dont guess, test.
  • Controls, norming, confounding all testable
    online

17
A potential check on exuberance
  • Undergraduates may be WEIRD, but crowdsourcing
    engenders its own weirdness
  • High evaluation apprehension
  • Uncontrolled backgrounds, skillsets, focus levels
  • Unknown motivations
  • Ignorance does not necessarily mean diversity
  • This requires study if we rely on such
    participants more

Heinrich et al. (2010) The Weirdest People in
the World? BBS
18
Acknowledgments
  • Thanks Jaye Padgett and to the attendees of two
    Semantics Lab presentations and the XPRAG
    conference for their comments, to the HUGRA
    committee for their generous award and support,
    and thanks to Rosie Wilson-Briggs for stimuli
    construction.
Write a Comment
User Comments (0)
About PowerShow.com