Confidence Based Marking in Formative and Summative Assessments Tony Gardner-Medwin, Physiology, UCL www.ucl.ac.uk/lapt - PowerPoint PPT Presentation

1 / 20

About This Presentation

Title:

Confidence Based Marking in Formative and Summative Assessments Tony Gardner-Medwin, Physiology, UCL www.ucl.ac.uk/lapt

Description:

Confidence Based Marking in Formative and Summative Assessments Tony Gardner-Medwin, Physiology, UCL www.ucl.ac.uk/lapt What words may characterise a student answer? – PowerPoint PPT presentation

Number of Views:71

Avg rating:3.0/5.0

Slides: 21

Provided by: Tony158

Category:

more less

Transcript and Presenter's Notes

Title: Confidence Based Marking in Formative and Summative Assessments Tony Gardner-Medwin, Physiology, UCL www.ucl.ac.uk/lapt

1
Confidence Based Marking in Formative and
Summative AssessmentsTony Gardner-Medwin,
Physiology, UCL www.ucl.ac.uk/lapt

What words may characterise a student answer?
How do they relate? Which deserve reward?
Which encourage learning?

2
Why CBM ? (1) Knowledge is degree of belief,
or confidence

knowledge
uncertainty
ignorance
misconception
delusion

(2) Students must be able to justify knowledge
relate it to other things, check it and argue
with rigour. Rote learning is the bane of
education.
Knowledge is justified true belief In
teaching we need to emphasise justification. In
assessment we need to measure degrees of belief.
3
What is CBM ?
The LAPT (UCL) Confidence-Based Marking scheme
applied to each answer that will be marked
right/wrong e.g. T/F, MCQ,
EMQs, Numerical, Simple text Confidence Level
1 2 3 Score if
Correct 1 2
3 Score if Incorrect 0 -2 -6
Best marks obtained if
Probability correct lt 67
67-80 gt80 Odds lt
21 gt 21 gt 41
4
With CBM you must think about justification You
gain EITHER if you find justifications for high
confidence OR if you see justifications for
reservation.
5
(No Transcript)
6
How well do students discriminate confidence?
Mean /- 95 confidence limits, 331 students
7
Personality, gender issues real or imagined?
Does confidence-based marking favour certain
personality types?

Both underconfidence and overconfidence are
undesirable
Correct calibration is well defined, desirable
and achievable

No significant gender differences are evident (at
least after practice)
Students with confidence problems this is the
way to deal with it!

In exams, we can adjust to compensate for poor
calibration, so students still benefit from
distinguishing more/less reliable answers

8
(No Transcript)
9
How should one handle students with poor
calibration? Significantly overconfident in exam
2 students (1) e.g. 50 correct _at_C1, 59
_at_C2, 73 _at_C3 Significantly underconfident in
exam 41 students (14) e.g. 83 correct _at_C1,
89 _at_C2, 99 _at_C3 Maybe one shouldnt penalise
such students
Adjusted confidence-based score Mark the set of
answers at each C level as if they were entered
at the C level that gives the highest score.
mean benefit 1.5 2.1 (median 0.6)
10
Reliability and Validity of Confidence-based exam
marks
Exam marks are determined by 1. the students
knowledge and skills in the subject area 2. the
level of difficulty of the questions 3. chance
factors - how questions relate to details of the
students knowledge and how uncertainties resolve
(luck)
(1) signal (its measurement is the
object of the exam) (3) noise
(random factors obscuring the signal) Confidence
-based marks improve the signal-to-noise ratio
A simple convincing test of this is to compare
marks on one set of questions with marks for the
same student on a different set (e.g. odd even
Q nos.). High correlation means the data are
measuring something about the student, not just
noise.
11
Marks scaled 0chance 100max
The correlation, across students, between scores
on one set of questions and another is higher for
CBM than for simple scores.
But perhaps they are just measuring ability to
handle confidence ?
12
Improvements in reliability and efficiency,
comparing CBM to conventional scores, in 6
medical student exams (each 250-300 T/F Qs, gt300
students).
13
Cronbach Alpha (standard psychometric measure of
reliability) On six exams (mean SEM,
n6) a 0.925 0.007 using CBM a
0.873 0.012 using number of items correct

The improvement (Plt0.001, paired t-test)
corresponds to a reduction of the random element
in the variance of exam scores from 14.6 of the
student variance to 8.1.

14
Arriving at a conclusion through probabilistic
inference
15
We fail if we mark a lucky guess as if it were
knowledge. We fail if we mark delusion as no
worse than ignorance.
www.ucl.ac.uk/lapt
16
(No Transcript)
17
How should one handle students with poor
calibration? Significantly overconfident in exam
2 students (1) e.g. 50 correct _at_C1, 59
_at_C2, 73 _at_C3 Significantly underconfident in
exam 41 students (14) e.g. 83 correct _at_C1,
89 _at_C2, 99 _at_C3 Maybe one shouldnt penalise
such students
Adjusted confidence-based score Mark the set of
answers at each C level as if they were entered
at the C level that gives the highest score.
mean benefit 1.5 2.1 (median 0.6)
18
Marks as a function of lack of knowledge defined
by information theory, for T/F answers
19
y x1.67
equality (only expected for a pure mix of
certain knowledge and total guesses)
scores if uncertainty is homogeneous and
correctly reported
theoretical scores for homogeneous uncertainty,
based on an information theoretic measure
20
Rational and Irrational marking schemes

Write a Comment

User Comments (0)