V Seminario TEST DE JUICIO SITUACIONAL

Transcription

V Seminario TEST DE JUICIO SITUACIONAL
V Seminario
TEST DE JUICIO SITUACIONAL
•
Introducción
2-3
•
Programación
5
•
Prof. Filip Lievens
6-29
•
Prof. Fiona Patterson
30-41
•
Mr. Stefan Meyer
42-49
•
Prof. Julio Olea
50-59
•
Resumen posters
60-67
•
Notas
El objetivo de los seminarios de la Cátedra
MAP es fomentar y contribuir a la medición
psicológica de calidad, especialmente en
el campo de las organizaciones.
En este quinto seminario se ha escogido
como tema central los test situacionales.
•
26 de junio de 2013
Facultad de Psicología
Universidad Autónoma de Madrid
4—5
PROGRAMA
8:45
Recogida del material del seminario.
9:15
Apertura y presentación.
Ángela Loeches
Decana de la facultad de Psicología de la UAM.
José Miguel Mata
Director General del Instituto de Ingeniería del
Conocimiento.
9:30
Conferencia.
Situational Judgment Tests: An introduction
to theory, practice, and research.
Prof. Filip Lievens
Profesor del departamento de Gestión de personal y
Psicología del trabajo y de las organizaciones de la
Universidad de Gante de Bélgica.
10:40
Sesión de pósters.
Café.
11:40
Conferencia.
Evaluations of situational judgments tests
for high stakes selection.
Prof. Fiona Patterson
Profesora del departamento de Psicología de la
Universidad de Cambridge y directora del Work
Psychology Group.
12:20
Conferencia.
Situational Judgment Test in EPSO open
competitions for the European Institutions
from 2010 to present.
Mr. Stefan Meyer
Responsable de evaluaciones informatizadas de la
European Personnel Selection Office (EPSO).
13:00
Conferencia.
Medición de competencias mediante
Modelos de Diagnóstico Cognitivo:
aplicación a un Test de Juicio Situacional.
Prof. Julio Olea
Catedrático de Psicología de la Universidad Autónoma
de Madrid y codirector de la cátedra UAM-IIC Modelos y
aplicaciones psicométricos (MAP).
13:40
Debate y clausura del seminario.
6—7
Situational Judgment
Tests: An introduction
to theory, practice,
and research.
— Prof. Filip Lievens
Departamento de Gestión de
Personal y Psicología del Trabajo
y de las Organizaciones, de la
Universidad de Gante, Bélgica.
Situational Judgment Tests:
An Introduction
to Theory,
Situational
Judgment
Tests:
Practice, &to
Research
An Introduction
Theory,
Practice, & Research
Madrid
June, 2013
Madrid
June, 2013
Prof. Dr. Filip Lievens
Ghent University, Belgium
Prof. Dr. Filip Lievens
Ghent
University,
Belgium
Department of Personnel Dunantlaan
2, 9000,
Ghent
filip.lievens@ugent.be
Management
Work and Dunantlaan
Organizational
Psychology
Departmentand
of Personnel
2, 9000,
Ghent
http://users.ugent.be
/~flievens/
filip.lievens@ugent.be
Management and Work and Organizational Psychology
http://users.ugent.be /~flievens/
Who am I?
! 
Professor:
Who
amGhent
I? U. (Belgium)
! 
Professor: Ghent U. (Belgium)
! 
Visiting professor:
! 
! 
U. of Minnesota (USA)
Visiting professor:
!  Bowling Green State U. (USA)
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
U. of
of Zürich
Minnesota
(USA)
U.
(Switzerland)
Bowling
Green
State
U. (USA)
U. of Guelph (Canada)
U.
of
Zürich
(Switzerland)
Nanyang Technological U. (Singapore)
U. of Guelph
(Canada) U. (Singapore)
Singapore
Management
Nanyang
Technological
U. of Valencia (Spain) U. (Singapore)
Singapore
Management
U.
of Giessen
(Germany)U. (Singapore)
U.
of
Valencia
(Spain)
U. of Stellenbosch
(South Africa)
U. of Giessen (Germany)
U. of Stellenbosch (South Africa)
Who am I?
! 
Situational judgment tests, high-stakes testing, assessment centers,
web-based assessment, & employer branding
Teaching at Ghent U. (Belgium)
! 
! 
1
Research expertise
! 
! 
1
HRM
Consultancy
! 
! 
Private, public, & military sector
Metaconsultant of international consultancy firms
3
8—9
Objectives & overview
! 
Theory
! 
! 
! 
! 
Practice
! 
! 
Basics of SJTs
History
Definition
SJT development & SJT building bocks
State-of-the-art of SJT research
4
Roles
Roles
2
SJT developer or
user or
SJTSJT
developer
SJT user
SJT
Candidate
SJT
Candidate
Trainer /
supervisor
Trainerof/ SJT
usersof SJT
supervisor
users
SJT
researcher
SJT
researcher
5
5
SJT Theory:
SJT Theory:
Basics
of SJTs
Basics of SJTs
Situational Judgment Tests: An introduction to theory, practice, and research.
SJT Basics
Written SJT item
Written
SJT
Suppose you
haveitem
to assess people on their level of
SJT Basics
customer service during selection? What do you do?
Suppose you have to assess people on their level of
customer service during selection? What do you do?
!  Personality inventory
Sign-based/
! 
! 
! 
! 
! 
! 
Agreeableness, Emotional Stability,
Personality
inventory
& Conscientiousness
Agreeableness, Emotional Stability,
& Conscientiousness
Interview
Interview
Role-play (high-fidelity simulation)
! 
Role-play (high-fidelity simulation)
SJT (low-fidelity simulation)
! 
SJT (low-fidelity simulation)
! 
dispositional
approach
Sign-based/
dispositional
approach
Sample-based/
interactional
approach
Sample-based/
interactional
approach
SJT Basics
Written SJT item
Written
SJT item
You have an excellent
long term employee who is always visiting
SJT Basics
at length with customers. Although customer orientation is one
Youofhave
an excellent
employee
is always
visiting
the key
values in long
yourterm
company,
otherwho
employees
have
at
length
with
customers.
Although
customer
orientation
is one
complained that the employee is not doing her share of the
of
the
key
values
in
your
company,
other
employees
have
administrative work. Pick the best response.
complained that the employee is not doing her share of the
•  administrative
Leave her alone
sincePick
thethe
customers
seem to enjoy the chats.
work.
best response.
•  Leave
Tell herher
you
needsince
her to
more and
talktoless
andthe
then
give
alone
thework
customers
seem
enjoy
chats.
her assignments to be completed by specific times.
•  Tell her you need her to work more and talk less and then give
•  her
Initiate
formal discipline
against her.
assignments
to be completed
by specific times.
•  Initiate
Explain formal
the difference
between
discipline
againstsocial
her. chatter and service and
coach her on how to keep her conversations shorter.
•  Explain the difference between social chatter and service and
•  coach
Discussher
theonissues
with
theher
employee
only if her
production is
how to
keep
conversations
shorter.
SJT Introduction
Prof. Dr. Filip Lievens
below standard.
•  Discuss the issues with the employee only if her production is
SJT Introduction
Prof. Dr. Filip Lievens
below standard.
8
8
SJT Basics
4
Written SJT item
4
You work as a waiter/waitress and a customer
orders a dish that is not on the menu. You
mention this politely but the customer gets very
upset. What do you do?
! 
! 
! 
! 
Suggest alternative main dishes and side orders that
might be of interest
Suggest alternative dishes that might be of interest to
the customer
Suggest only those alternative dishes that are very
similar to the dish the customer wanted
Tell the customer there is nothing you can do
9
10—11
SJT Basics
What Are SJTs?
! 
! 
An applicant is presented with a situation
& asked what he/she would do.
SJT items are typically in a multiple
choice format: items have a stem and
various item responses (response
options).
SJT Introduction
Prof. Dr. Filip Lievens
10
SJT Basics
History
of SJTs
SJT Basics
History
of Civil
SJTs
!  1873: US
Service Examinations
! 
! 
5
« A banking company asks protection for a certain
1873:
USasCivil
Service Examinations
device,
a trade-mark,
which they propose to put
« A banking
company
protection
for take
a certain
upon
their notes.
Whatasks
action
would you
in the
device,
as
a
trade-mark,
which
they
propose
to put
application? »
upon their notes. What action would you take in the
1905:
Binet »
application?
!  « When a person has offended you and comes to offer
1905:
Binet
his apologies, what would you do? »
!  « When a person has offended you and comes to offer
1941:
Ansbacher
his apologies,
what would you do? »
!  Your sports club is planning a trip to Berlin to attend
1941:
Ansbacher
the Germany-England football game, which will take
!  place
Yourinsports
clubYou
is planning
a trip
to Berlinwith
to attend
2 weeks.
have been
entrusted
the
the
Germany-England
football
game,
which
will
preparations and entire management of the trip.take
What
SJT Introduction
Prof. Dr. Filip been
Lievens
place
2 weeks.
You
entrusted with the11
have
do youinintend
to do?
preparations and entire management of the trip. What
do you intend to do?
! 
! 
! 
! 
! 
SJT Introduction
Prof. Dr. Filip Lievens
11
SJT Basics
History
of SJTs
SJT Basics
History
of SJTs
!  Early years
1926:
Judgment
!  ! 
Early
years
! 
! 
! 
! 
! 
! 
! 
scale in the George Washington
University Social Intelligence Test
1926: Judgment scale in the George Washington
Used in World
War
II
University
Social
Intelligence
Test
1948:
How
Supervise?
Used in World War II (Rosen, 1961)
1960’s:How
SJTs
used at the
U.S. Civil
Service System
1948:
Supervise?
(Rosen,
1961)
1960’s: SJTs used at the U.S. Civil Service System
Breakthrough years
Motowidlo reinvigorated
interest: low fidelity
!  ! 
Breakthrough
years
! 
! 
! 
! 
simulations.
Motowidlo reinvigorated interest: low fidelity
Sternberg reinvigorated interest: tacit knowledge
simulations.
inventories.
Sternberg reinvigorated interest: tacit knowledge
inventories.
SJT Introduction
Prof. Dr. Filip Lievens
SJT Introduction
Prof. Dr. Filip Lievens
12
12
Situational Judgment Tests: An introduction to theory, practice, and research.
SJT Basics
SJT Basics
In
2013
In 2013
! 
Popular in US & UK
Public &
!  ! 
Popular
inprivate
US & sector
UK
! 
! 
! 
! 
! 
EPSO
Public & private sector
EPSO
Emerging body of research
!  2006: first SJT book (edited by
Emerging body of research
Weekley & Ployhart)
! 
! 
! 
2006: first SJT book (edited by
Weekley & Ployhart)
Increasing demand around the
world
Increasing demand around the
world
SJT Introduction
Prof. Dr. Filip Lievens
14
SJT Introduction
Prof. Dr. Filip Lievens
14
SJT Basics
7
7
Research on SJTs
SJT Publications in Web of Science
60
Frequency
50
40
30
20
10
0
<91
91-95
96-00
01-05
06-10
Years
SJT Introduction
Prof. Dr. Filip Lievens
15
12—13
SJT Basics
In 2013
16
SJT Basics
Wat do SJTs measure?
Wat do SJTs measure?
SJT Basics
! 
! 
8
Measurement method that can be designed to
Measurement
method
that can be designed to
measure
a variety
of constructs
measure a variety of constructs
! 
! 
Procedural knowledge about costs &
Procedural
knowledge
about in
costs
&
benefits of courses
of action
a particular
benefits of courses of action in a particular
domain
domain
!  Going beyond cognitive ability
! 
! 
Mostly
interpersonal
& leadership
competencies
Going beyond
cognitive
ability
Mostly interpersonal & leadership competencies
17
SJT Introduction
Prof. Dr. Filip Lievens
SJT Introduction
Prof. Dr. Filip Lievens
17
SJT Basics
Theory of Prediction
Theory of Prediction
SJT Basics
On-the-job
On-the-job
Situation
Person
Person
Situation
Behavior
Behavior
Performance
Performance
Performance
Experience
Selection
Selection
Performance
Experience
IQ
Behavior
Behavior
Job specific
Knowledge
Job
specific
General
Knowledge
Person
Situation
Situation
Person
General
IQ
Personality
Personality
Socialization
Socialization
Situational Judgment Tests: An introduction to theory, practice, and research.
SJT Basics
Comparisons
with tests
SJT Basics
Comparisons with tests
!  Parallels
! 
! 
Standardization
Parallels
!  Automatic scoring
! 
Standardization
Performance
test
Automatic
scoring
Large group
screening
!  Selecting-outtest
Performance
Large group screening
! 
Sample-based vs. sign-based
! 
! 
! 
! 
! 
! 
!  Selecting-out
Differences
! 
Differences
!  Contextualized vs. decontextualized
! 
! 
! 
! 
! 
! 
! 
! 
Sample-based
vs.vs.
sign-based
Multidimensional
unidimensional
Contextualized
vs. decontextualized
Procedural
knowledge
vs. aptitude & declarative knowledge
Multidimensional
vs. unidimensional
Noncognitive
competencies
vs. cognitive competencies
Procedural knowledge vs. aptitude & declarative knowledge
Noncognitive competencies vs. cognitive competencies
SJT Introduction
Prof. Dr. Filip Lievens
SJT Introduction
Prof. Dr. Filip Lievens
SJT Introduction
Prof. Dr. Filip Lievens
SJT Introduction
Prof. Dr. Filip Lievens
19
19
SJT Basics
Comparisons with assessment centers
SJT Basics
Comparisons with assessment centers
! 
Parallels
! 
!  Sample-based (vs. sign-based)
Parallels
! 
Behavioral consistency logic
Psychological
fidelity logic
Behavioral consistency
Multidimensional:
which measure multiple constructs
Sample-based (vs.Methods
sign-based)
!  Psychological fidelity
!  Multidimensional: Methods which measure multiple constructs
Differences
!  Stimulus
! 
! 
! 
! 
!  Standardized situation vs. life situation
Differences
! 
! 
!  Low fidelity vs. high fidelity
Stimulus
Response
!  Standardized situation vs. life situation
! 
! 
!  MC vs. open ended
Response
Scoring
!  Procedural knowledge vs. actual behavior
! 
! 
!  Built-in model vs. trained assessors
Scoring
Use
!  A priori vs. life observation/rating
! 
!  Select out vs. select in
Use
! 
! 
Procedural
vs. actual behavior
Low fidelity knowledge
vs. high fidelity
! 
! 
A
priori
vs. lifeended
observation/rating
MC
vs. open
! 
! 
Large
smallvs.
groups
Built-invs.
model
trained assessors
! 
! 
SJT Introduction
Large vs. small groups
Select out vs. select SJT
in Introduction
Prof. Dr. Filip Lievens
20
Prof. Dr. Filip Lievens
20
10
SJT Basics
Comparisons with behavior description interviews
! 
Parallels
! 
! 
! 
! 
10
Behavioral consistency logic
Sample-based (vs. sign-based)
Multidimensional: Methods which measure multiple constructs
Differences
! 
Stimulus
! 
! 
! 
Response
! 
! 
! 
Written response vs. oral response
MC vs. open ended
Scoring
! 
! 
! 
Standardized situation vs. situation reported by interviewee
Past or future situation vs. past situation
A priori vs. life observation & rating
Built-in model vs. trained interviewers
Use
! 
Large vs. small groups
21
SJT Introduction
Prof. Dr. Filip Lievens
14—15
SJT Practice:
SJT Development
SJT Development
SJT
vendors
SJT
Development
SJT
vendors
!  General
11
SHL / Previsor
!  ! 
General
! 
! 
! 
! 
! 
! 
! 
! 
A
& DC
SHL
/ Previsor
Kronos
A
& DC
Aon
Kronos
! 
! 
…
Aon
! 
…
Specific
! 
Work Skills First
Specific
! 
! 
Ergometrics
Work Skills First
Van
der Maesen/Koch
Ergometrics
! 
Van der Maesen/Koch
! 
! 
SJT Introduction
Prof. Dr. Filip Lievens
23
SJT Introduction
Prof. Dr. Filip Lievens
23
SJT Development
Stages
in SJT Development
SJT
Development
Stages
in SJT Development
Strategic SJT decisions
! 
! 
! 
! 
! 
! 
! 
! 
! 
Determine purpose of SJT
Strategic
SJT decisions
!  Identify target population
Determine purpose of SJT
Identify target population
Generate critical incidents
Content
development of SJT
!  Sort critical incidents
!  Generate critical incidents
!  Turn selected critical incidents into item stems
!  Sort critical incidents
!  Generate item responses
!  Turn selected critical incidents into item stems
!  Edit item responses
!  Generate item responses
Instructions & scoring
!  Edit item responses
!  Determine response instructions
Instructions & scoring
!  Develop a scoring key
!  Determine response instructions
Implementation
!  Develop a scoring key
!  Continued use of SJTs
! 
Content development of SJT
! 
! 
Implementation
! 
Continued use of SJTs
SJT Introduction
Prof. Dr. Filip Lievens
24
SJT Introduction
Prof. Dr. Filip Lievens
24
Situational Judgment Tests: An introduction to theory, practice, and research.
SJT Development
SJT Development
SJT
Building Blocks
SJT Building
1.  Item lengthBlocks
2. 
1.  Item
Item complexity
length
3. 
2.  Item
Item comprehensibility
complexity
Item
Item contextualization
comprehensibility
Item
Item branching
contextualization
6. 
& stimulus fidelity
5.  Response
Item branching
7. 
6.  Response
Response instructions
& stimulus fidelity
8. 
Scoring keyinstructions
7.  Response
9. 
8.  Measurement
Scoring key level of scores
10. 
measure
9.  Constructs
Measurement
level of scores
Purpose
Purpose
4. 
3. 
5. 
4. 
10.  Constructs measure
25
25
SJT Development
SJT Purpose
SJT Purpose
SJT Development
Selection & assessment
!  Selection & assessment
!  Recruitment (realistic job preview)
!  Recruitment (realistic job preview)
!  Training & evaluation (scenario-based)
!  Training & evaluation (scenario-based)
!  Promotion
!  Promotion
! 
SJT Introduction
Prof. Dr. Filip Lievens
26
SJT Introduction
Prof. Dr. Filip Lievens
26
SJT Development
13
1. Item Length
! 
13
Stems vary in the length of the situation
presented.
! 
Very short descriptions of situation
! 
! 
E.g., Your colleague is rude towards you.
Very detailed descriptions of situation
! 
E.g., Tacit Knowledge Inventory (Wagner & Sternberg,
1991)
SJT Introduction
Prof. Dr. Filip Lievens
27
16—17
SJT Development
Tacit Knowledge Inventory (Sternberg)
You are a company commander, and your battalion commander is the type of person who
seems always to shoot the messenger– he does not like to be surprised by bad news,
and he tends to take his anger out on the person who brought him the bad news. You
want to build a positive, professional relationship with your battalion commander. What
should you do?
___ Speak to your battalion commander about his behavior and share your perception of it.
___ Attempt to keep the battalion commander over-informed by telling him what is
occurring in your unit on a regular basis (e.g., daily or every other day).
___ Speak to the sergeant major and see if she/he is willing to try to influence the battalion
commander.
___ Keep the battalion commander informed only on important issues, but dont bring up
issues you dont have to discuss with him.
___ When you bring a problem to your battalion commander, bring a solution at the same
time.
___ Disregard the battalion commanders behavior: Continue to bring him news as you
normally would.
___ Tell your battalion commander all of the good news you can, but try to shield him from
hearing the bad news.
___ Tell the battalion commander as little as possible; deal with problems on your own if at
all possible.
28
SJT Development
SJT Development
2.
Item Complexity
2. Item Complexity
14
Stems vary in the complexity of the situation
!  presented.
Stems vary in the complexity of the situation
! 
Low complexity
presented.
! 
One
has difficulty
!  ! 
Low
complexity
! 
! 
with a new assignment and needs
instructions.
One has difficulty with a new assignment and needs
instructions.
High complexity
One
has multiple supervisors who are not
High
complexity
!  ! 
! 
cooperating with each other, and who are providing
One has multiple supervisors who are not
conflicting instructions concerning which of your
cooperating with each other, and who are providing
assignments has
highest
priority.
SJT Introduction concerning
Prof. Dr. Filip Lievens
conflicting instructions
which of your
assignments has
highest
priority.
SJT Introduction
Prof. Dr. Filip Lievens
29
29
SJT Development
SJT Development
3.
Item Comprehensiblity
3. Item Comprehensiblity
Comprehensibility: It is more difficult to
the meaning
and difficult
import of
!  understand
Comprehensibility:
It is more
to some
situations
than
situations
understand
the other
meaning
and import of some
! 
cf. reading
levelother
formulas,
technical jargon
situations
than
situations
! 
! 
cf. reading level formulas, technical jargon
Length, complexity, & comprehensibility of
situations
are interrelated
and probably
!  the
Length,
complexity,
& comprehensibility
of
drive
the
cognitive
loading
of
items.
the situations are interrelated and probably
drive the cognitive loading of items.
! 
SJT Introduction
Prof. Dr. Filip Lievens
30
SJT Introduction
Prof. Dr. Filip Lievens
30
Situational Judgment Tests: An introduction to theory, practice, and research.
SJT Development
Example admission SJT item
Example
admission
item
Patient
: So, this physiotherapy
is really going SJT
to help me?
SJT Development
Physician: Absolutely, even though the first days it might still be painful.
Patient : So, this physiotherapy is really going to help me?
Patient : Yes, I suppose it will take a while before it starts working.
Physician: Absolutely, even though the first days it might still be painful.
Physician: That is why I am going to prescribe a painkiller. You should take 3
Patient : Yes, I suppose it will take a while before it starts working.
painkillers per day.
Physician: That is why I am going to prescribe a painkiller. You should take 3
Patient: Do I really have to take them? I have already tried a few things. First, they
painkillers per day.
didnt help me. And second, Im actually opposed to taking any medication. Id
reallythem.
haveThey
to take
I have
tried a few things. First, they
Patient:
rather Do
notI take
arethem?
not good
for already
my health.
didnt help me. And second, Im actually opposed to taking any medication. Id
What is the best way for you (as a physician) to react to this patients refusal to
rather not take them. They are not good for my health.
take the prescribed medication?
What is the best way for you (as a physician) to react to this patients refusal to
a. Ask her if she knows something else to relieve the pain.
take the prescribed medication?
b. Give her the scientific evidence as to why painkillers will help.
a. Ask her if she knows something else to relieve the pain.
c. Agree not to take them now but also stress the importance of the physiotherapy.
b. Give her the scientific evidence as to why painkillers will help.
31
d. Tell her that, in her own interest, she will have to start changing her attitude.
c. Agree not to take them now but also stress the importance of the physiotherapy.
31
d. Tell her that, in her own interest, she will have to start changing her attitude.
SJT Development
Example advanced level SJT item
advanced
level
SJT
item20
AExample
55 year old woman
with ischaemic heart
disease
has smoked
SJT Development
cigarettes per day for 40 years. She requests nicotine replacement
A 55
year oldShe
woman
withthese
ischaemic
heartbut
disease
has smoked
20 in
patches.
has had
previously
has been
inconsistent
cigarettes
per has
day often
for 40continued
years. She
nicotine
replacement
their use and
torequests
smoke while
using
the patches.
patches. She has had these previously but has been inconsistent in
A. Emphasize
thehas
dangers
smokingtobut
do not
prescribe.
their use and
often of
continued
smoke
while
using the patches.
B.
Enquire
about
the
difficulties
she
has
with
stopping
smoking and any
A. Emphasize the dangers of smoking but do not prescribe.
previous problems with patches.
B. Enquire about the difficulties she has with stopping smoking and any
C. Insist
on a
period ofwith
abstinence
previous
problems
patches.before prescribing any further
patches.
C. Insist on a period of abstinence before prescribing any further
D. Prescribe
patches. another supply of patches and explain how they should be
used.
D. Prescribe another supply of patches and explain how they should be
E. Suggest
used. that nicotine replacement therapy is not suitable for her but
explore alternative therapies.
E. Suggest that nicotine replacement therapy is not suitable for her but32
explore alternative therapies.
32
SJT Development
16
4. Item Contextualization
! 
Items for a job can be more specific.
! 
! 
! 
16
Mention job specific equipment, technical terms.
Items for a family of jobs need to make sense for all the
jobs to be covered by the test.
!  Entry-level vs. Supervisory
!  Armed Forces, Coast Guard, Paramilitary Forces, &
Security Forces
Items for specific competencies needed across several
jobs or job families to be covered by SJT.
! 
SJT taxonomy
SJT Introduction
Prof. Dr. Filip Lievens
33
18—19
Judgment at Work Survey for Customer Service
© Work Skills First, Inc.
! 
A customer is disrespectful and rude to you.
! 
! 
! 
! 
A customer is very angry with you because the customer believes that the
product received is not what the customer
! 
! 
! 
Ask the customer to leave until he is calm.
Treat the customer as the customer treats you.
Try to solve the problem as quickly as possible to get the customer out of the store.
Offer to replace the customer's product with the correct one.
Ask a co-worker to deal with this problem.
A customer is becoming impatient about waiting for service for a long
time. Other customers are becoming equally anxious.
! 
! 
! 
! 
! 
! 
Apologize for the slow service on the part of your colleagues.
Offer the customers a discount.
Reassure the customer that the wait is necessary and address the needs they may
have.
Try to find someone else to wait on the customer immediately.
Refer the customer to a competing organization that offers similar services.
Suggest that the customer consider leaving and coming back when you are not so
busy.
Extremely Ineffective - Ineffective -Average Effectiveness – Effective -Extremely Effective
34
SJT Development
5.
Item Branching / Integration
SJT Development
5. Item Branching / Integration
! 
Unbranched/linear items
! 
Unbranched/linear items
! 
Branched/nonlinear items
! 
17
Present overall situation
followed by subordinate
Branched/nonlinear
items
! 
! 
situations.
Present overall situation followed by subordinate
Subordinate stems are stems linked to the responses.
situations.
! 
! 
Subordinate SJT
stems
linked to the responses.35
Introductionare
Prof.stems
Dr. Filip Lievens
SJT Introduction
Prof. Dr. Filip Lievens
35
SJT Development
6.
Stimulus & Response Fidelity
SJT Development
6. Stimulus & Response Fidelity
! 
! 
Degree to which format of stimulus (item stem) and
response is consistent with how situation is
Degree to which
of stimulus (item stem) and
encountered
in a format
work setting.
response is consistent with how situation is
encountered in a work setting.
! 
! 
Stimulus
Written
Stimulus
! 
! 
! 
! 
! 
! 
! 
Oral
Written
Video
Oral
Behavior
Video
! 
Behavior
! 
Written SJT
JT
dS
Situational
aseinterview
b
eo
Vid Written SJT
SJT
Webcam
sed SJT
Situational
-ba interview
o
e
id
VAssessment
center
Webcam SJT
! 
Assessment center
SJT Introduction
Prof. Dr. Filip Lievens
SJT Introduction
Prof. Dr. Filip Lievens
Response
Written
Response
! 
! 
! 
! 
! 
! 
! 
Oral
Written
Video
Oral
Behavior
Video
! 
Behavior
36
36
Situational Judgment Tests: An introduction to theory, practice, and research.
Level of standardization
Level of standardization
Written SJT
Cartoon SJT
Video-based SJT
Written SJT
Cartoon SJT
Multimedia SJT
Video-based SJT
Multimedia SJT
Webcam SJT
Situational
interview
Situational
interview
Webcam SJT
Role-play
Role-play
Group discussion
SJT Introduction
Prof. Dr. Filip Lievens
SJT Introduction
Prof. Dr. Filip Lievens
Group discussion
Level of fidelity
37
Level of fidelity
37
SJT Development
SJT generations
Written SJT
SJT generations
SJT Development
You have an excellent long term employee who is always
visiting at length with customers. Although customer
Written
orientation is one
of the keySJT
values in your company,
other employees have complained that the employee is
You have
excellent
longofterm
who work.
is always
not an
doing
her share
the employee
administrative
Pick the
visiting
at length with customers. Although customer
best response.
orientation is one of the key values in your company,
A.  other
Leaveemployees
her alone since
the customers
to enjoy the
have complained
thatseem
the employee
is
chats.
not
doing her share of the administrative work. Pick the
response.
B.  best
Tell her
you need her to work more and talk less and
then give
assignments
to be completed
specific
A. 
Leave
her her
alone
since the customers
seem tobyenjoy
the
times.
chats.
C. 
B. 
D. 
Initiate
formal
discipline
against
her.and talk less and
Tell
her you
need
her to work
more
then
givethe
herdifference
assignments
to besocial
completed
byand
specific
Explain
between
chatter
service
times.
and coach her on how to keep her conversations shorter.
C. 
E. 
Initiate
discipline
against
her. only if her
Discussformal
the issues
with the
employee
production
is below standard.
Explain
the difference
between social chatter and service
and coach her on how to keep her conversations shorter.
D. 
E. 
Video SJT
Webcam SJT
Video SJT
3D animatedWebcam
SJT
SJT
3D animated SJT
Avatar-based
SJT
Avatar-based
SJT
Discuss the issues with the employee only if her
production is below standard.
© Previsor
Cartoon SJT
Cartoon SJT
38
© Previsor
38
19
SJT Development
7. Response Instructions
! 
Knowledge instructions
ask respondents to
display their knowledge
of the effectiveness of
behavioral responses
! 
Best action
! 
! 
Behavioral tendency
instructions ask respondents
to report how they typically
respond
! 
Pick the best & worst
answer.
Rate on effectiveness
! 
Rate each response for
effectiveness.
Most likely action
! 
! 
Best action/worst action
! 
! 
Pick the best answer.
! 
19
! 
! 
What would you most likely do?
Most likely action/least likely
action
What would you most likely do?
What would you least likely do?
Rate on the likelihood of
performing the action
! 
Rate each response on
likelihood that you would do the
behavior.
39
20—21
SJT Development
7. Response instructions
Knowledge
Behavioral consistency
Cognitive
loading
More correlated with
cognitive ability (.35)
Less correlated with
cognitive ability (.19)
Personality
loading
Less correlated with
A (.19), C (.24), ES (.12)
Mean SJT
scores
Higher
Lower
Race
differences
Larger
Smaller
Fakability
Less
Validity
ρ = .26
More correlated with
A (.37), C (.34), ES (.35)
More
SJT Introduction
Prof. Dr. Filip Lievens
ρ = .26
40
SJT Development
SJTScoring
Development Key
8.
8. Scoring Key
20
Rational (consensus among experts)
E.g., 60%
agreement among experts)
!  ! 
Rational
(consensus
! 
! 
E.g., 60% agreement
Empirical (administration of large groups)
E.g., 25%(administration
endorsement of correct
option
!  ! 
Empirical
of large
groups)
! 
! 
E.g., 25% endorsement of correct option
Theoretical model
!  ! 
Theoretical
E.g., conflictmodel
management, leadership
! 
! 
! 
E.g., conflict management, leadership
Hybrid
!  ! 
Hybrid
Combination
! 
of approaches
Combination of approaches
41
41
SJT Development
SJTScoring
Development Key
8.
8. Scoring Key
! 
! 
Most use discrete points assigned to response options.
!  From a legal point of view, it makes some uneasy to try to
Most use discrete points assigned to response options.
defend the difference between very effective and effective.
!  From a legal point of view, it makes some uneasy to try to
!  Very effective = 1
defend the difference between very effective and effective.
! 
! 
! 
! 
! 
! 
! 
! 
! 
Effective = 1
Very effective = 1
Ineffective = 0
Effective = 1
Very ineffective = 0
Ineffective = 0
Very ineffective = 0
Others use score responses as deviations from the mean
effectiveness ratings (= the correct answer)
Others use score responses as deviations from the mean
!  If the mean is 1.5, a respondent who provided a rating of 1 or
effectiveness ratings (= the correct answer)
2 would both have a -.5 as a score on the item.
!  If the mean is 1.5, a respondent who provided a rating of 1 or
!  Zero is the highest possible score
2 would both have a -.5 as a score on the item.
!  Prone to large coaching effects (Cullen et al., 2006)
!  Zero is the highest possible score
! 
SJT Introduction
Prof.(Cullen
Dr. Filip Lievens
Prone to large coaching
effects
et al., 2006)
SJT Introduction
Prof. Dr. Filip Lievens
42
42
Situational Judgment Tests: An introduction to theory, practice, and research.
SJT Development
9. Measurement
Level of Scores
SJT
Development
9. Measurement Level of Scores
! 
! 
! 
! 
! 
! 
! 
! 
One dichotomous response per item
! 
Most
likely, pick response
the best per item
One
dichotomous
!  Most likely, pick the best
Two dichotomous responses per item
! 
Most/least
likely,responses
pick the best/worst
Two
dichotomous
per item
!  Most/least likely, pick the best/worst
Ranking the responses yields ordinal level data.
Ranking the responses yields ordinal level data.
Rating the effectiveness yields interval level data
per
itemthe
response.
Rating
effectiveness yields interval level data
per item response.
43
43
SJT Development
10.
Constructs measured
SJT Development
10. Constructs measured
Construct
Homogeneous
One construct
!  ! 
Homogeneous
! 
! 
One construct
Heterogeneous
Multiple constructs
!  ! 
Heterogeneous
! 
! 
Multiple constructs
Usage
Construct
Usage
Technical knowledge
2.94%
Technical knowledge
Interpersonal
Skills
2.94%
12.50%
Interpersonal
Skills
Teamwork
Skills
12.50%
4.41%
Teamwork Skills
Leadership
4.41%
37.50%
Leadership
Basic personality
37.50%
9.56%
Basic personality
Heterogeneous
9.56%
33.09%
Heterogeneous
33.09%
Christian et al., 2010; N= 136 SJTs
SJT Introduction
Prof.
Dr. Filipet
Lievens
Christian
al., 2010;
SJT Introduction
Prof. Dr. Filip Lievens
N= 136 SJTs
44
44
22
22
SJT Research:
State-of-the-art
22—23
Research
SJT Quiz
Indica si la siguiente afirmación es “verdadera” o “falsa”:
1. 
La validez predictiva de los tests de juicio situacional (SJT) es aproximadamente 0.3.
2. 
Los SJTs mejoran la predicción del rendimiento que puede alcanzarse con las pruebas de capacidad
cognitiva y personalidad.
3. 
Los SJTs basados en video son mejores predictores del rendimiento futuro que los SJTs en papel.
4. 
Los hombres puntúan por lo general más alto que las mujeres en los SJTs.
5. 
Se han encontrado diferencias raciales de escasa importancia en los SJTs.
6. 
Las diferencias raciales en los SJTs son mas pequeñas en los SJTs basados en video.
7. 
No se han encontrado diferencias entre la administración informatizada de los SJTs y la tradicional
(administración en papel).
8. 
Cuando se realiza un análisis factorial sobre las puntuaciones factoriales, se suele encontrar una
estructura factorial interpretable.
9. 
La fiabilidad test-retest de los SJT es aproximadamente 0.70.
10.  Es posible entrenar a la gente para que mejore su rendimiento en los SJTs.
SJT Introduction
Prof. Dr. Filip Lievens
46
Research
Internal
consistency
Research
Internal consistency
23
Meta-analysis (Catano, Brochu & Lamerson, 2012)
Average alpha(Catano,
= .46Brochu
(N=56)
!  ! 
Meta-analysis
& Lamerson, 2012)
! 
! 
! 
! 
Factor
analysis:
factors & no clear structure
Average
alpha =Several
.46 (N=56)
Factor analysis: Several factors & no clear structure
Moderators (McDaniel et al., 2001)
Length of SJT
!  ! 
Moderators
(McDaniel et al., 2001)
! 
! 
! 
Type
ofof
scale
Length
SJT
Rating
scale
Type of scale (.73) > Two choices (.60) > One choice (.24)
!  ! 
! 
Rating scale (.73) > Two choices (.60) > One choice (.24)
SJT Introduction
Prof. Dr. Filip Lievens
47
SJT Introduction
Prof. Dr. Filip Lievens
47
Research
Test-retest
Reliability
Research
Test-retest Reliability
Adequate
Between .60 - .80
!  ! 
Adequate
! 
! 
Between .60 - .80
48
48
Situational Judgment Tests: An introduction to theory, practice, and research.
Research
Criterion-related
Validity
Research
Criterion-related
Validity
McDaniel, Morgeson, & Campion (2001)
! 
! 
ρ = .27 (corrected r = .34)
McDaniel, Morgeson, & Campion (2001)
McDaniel
al. (2007)r = .34)
! 
ρ = .27et(corrected
!  ρ = .20 (corrected r = .26)
McDaniel et al. (2007)
Validity
for (corrected
predicting relevant
! 
ρ = .20
r = .26) criteria (Christian et al., 2010)
! 
Equivalence
computerrelevant
& paper-and-pencil
SJTs et
(Ployhart
et al., 2003)
Validity for predicting
criteria (Christian
al., 2010)
! 
! 
! 
! 
! 
! 
Moderators:
Morgeson
& Campion
(2001)
Equivalence McDaniel,
computer &
paper-and-pencil
SJTs
(Ployhart et al., 2003)
!  Job analysis (.38 vs. .29)
! 
Less detailed
questions
(.35 vs.& 33)
Moderators:
McDaniel,
Morgeson
Campion (2001)
!  Video-based
(Christian
et al., 2010)
Job analysis (.38
vs. .29)
!  Less detailed questions (.35 vs. 33)
!  Video-based (Christian et al., 2010)
Research
Incremental
Validity (McDaniel et al., 2007)
Research
Incremental
Validity (McDaniel et al., 2007)
!  Incremental validity over
Cognitive ability
(3%-5%)
!  ! 
Incremental
validity
over
! 
! 
Personality
(6%-7%)
Cognitive ability
(3%-5%)
Cognitive
ability
& personality (1%-2%)
Personality (6%-7%)
! 
Cognitive ability & personality (1%-2%)
! 
! 
50
50
25
25
0,60
0,50
0,40
0,30
0,20
Cognitive Tests
Video-based
Interpersonal SJT
0,10
0,00
-0,10
Note. Correlations corrected for range restriction and unreliability in criterion.
24—25
Research
Gender Differences
! 
! 
Females > males (d = .10)
Explanation
!  Not related to cognitive loading & response instructions
! 
Interpersonal nature of situations?
Cf. assessment center/work sample research
! 
SJT Introduction
Prof. Dr. Filip Lievens
52
Research
Race differences
Race differences
26
Research
! 
! 
! 
! 
! 
d = .38 (N = 36.355; K = 39, Nguyen et al., 2005)
d = .38 (N = 36.355; K = 39, Nguyen et al., 2005)
Correlation of SJT with cognitive ability drives racial
differences of
found.
Correlation
SJT with cognitive ability drives racial
differences found.
Moderators
!  Reading component
!  Moderators
! 
! 
! 
! 
! 
Presentation
format (written vs. video)
Reading
component
Response
instructions
Presentation format (written vs. video)
Response instructions
SJT Introduction
Prof. Dr. Filip Lievens
53
SJT Introduction
Prof. Dr. Filip Lievens
53
Research
Faking, Coaching, & Retest effects
Faking, Coaching, & Retest effects
Research
! 
Hooper, Cullen, & Sackett (2006)
! 
Hooper, Cullen, & Sackett (2006)
! 
! 
! 
! 
! 
! 
Smaller than personality inventories
d: .08 - .89
Smaller than personality inventories
d: .08 - .89
Moderators
!  Cognitive loading of items
Moderators
! 
Transparency of items
Cognitive loading of items
Response instructions
Transparency of items
Type of study design
Response instructions
!  Type of study design
Retest effects (SJTs) < retest effects (cognitive ability tests)
! 
Retest effects (SJTs) < retest effects (cognitive ability tests)
! 
! 
! 
! 
! 
! 
(Dunlop et al., 2011; Lievens, Buyse, & Sackett, 2005)
(Dunlop et al., 2011; Lievens, Buyse, & Sackett, 2005)
54
54
Situational Judgment Tests: An introduction to theory, practice, and research.
Research
Applicant
Perceptions
Research
Applicant Perceptions
! 
! 
Meta-analysis (Hausknecht et al., 2005)
More favorable(Hausknecht
reactions et
towards
job-related selection
Meta-analysis
al., 2005)
! 
! 
! 
procedures
More favorable reactions towards job-related selection
procedures
Moderators
Video SJTs
!  ! 
Moderators
(Chan & Schmitt, 1997)
! 
! 
Non-linear
(Kanning et al., 2006)
Video SJTsSJTs
(Chan & Schmitt, 1997)
Multimedia
SJTs (Kanning
(Richman-Hirsch et al., 2000)
Non-linear SJTs
et al., 2006)
! 
Multimedia SJTs (Richman-Hirsch et al., 2000)
! 
! 
55
55
Epilogue
Epilogue
Epilogue
28
Future & Advanced Topics
! 
Technologically-enhanced SJTs
! 
Construct-oriented SJTs
! 
28
Trait Activation Theory
! 
Cross-cultural issues
! 
Norming & item banking
! 
Alternate form development
57
26—27
Epilogue
A Famous Quote
[SJTs] seem to represent psychometric alchemy
(adverse impact is down, validity is up), they
seem to assess practically important KSAOs,
and assessees like them
Landy, 2007, p. 418
29
Thank you
More info on
http://users.ugent.be/~flievens
Useful references
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
! 
Becker, T.E. (2005). Development and validation of a situational judgment test of employee integrity. International Journal of
Selection and Assessment, 13, 225-232.
Bledow, R., & Frese, M. (2009). A situational judgment test of personal initiative and its relationship to performance. Personnel
Psychology, 62, 229-258.
Bergman, M.E., Drasgow, F., Donovan, M.A., & Henning, J.B. (2006). Scoring situational judgment tests: Once you get the data, your
troubles begin. International Journal of Selection and Assessment, 14, 223-235.
Bruce, M. M., & Learner, D. B. (1958). A supervisory practices test. Personnel Psychology, 11, 207-216.
Chan, D., & Schmitt, N. (1997). Video-based versus paper-and-pencil method of assessment is situational judgment tests: Subgroup
differences in test performance and face validity perceptions. Journal of Applied Psychology, 82, 143-159.
Christian, M. S., Edwards, B. D., & Bradley, J. C. (2010). Situational judgement tests: Constructs assessed and a meta-analysis of
their criterion-related validities. Personnel Psychology, 63, 83-117.
Cullen, M. J., Sackett, P. R., & Lievens, F. (2006). Threats to the operational use of situational judgment tests in the college
admission process. International Journal of Selection and Assessment, 14, 142-155.
Chan, D., & Schmitt, N. (2002). Situational judgment and job performance. Human Performance, 15, 233-254.
Clause, C.C., Mullins, M.E., Nee, M.T., Pulakos, E.D., & Schmitt, N. (1998). Parallel test form development: A procedure for
alternative predictors and an example. Personnel Psychology, 51, 193-208.
Clevenger, J., Pereira, G.M., Wiechmann, D., Schmitt, N., & Schmidt-Harvey, V.S. (2001). Incremental validity of situational judgment
tests. Journal of Applied Psychology, 86, 410-417.
Cullen, M.J., Sackett, P.R., & Lievens, F. (2006). Threats to the operational use of situational judgment tests in the college admission
process. International Journal of Selection and Assessment, 14, 142-155.
Dalessio, A.T. (1994). Predicting insurance agent turnover using a video-based situational judgment test. Journal of Business and
Psychology, 9, 23-32.
Funke, U., & Schuler, H. (1998). Validity of stimulus and response components in a video test of social competence. International
Journal of Selection and Assessment, 6, 115-123.
Hooper, A.C., Cullen, M.J., & Sackett, P.R. (2006). Operational threats to the use of SJTs: Faking, coaching, and retesting issues. In
J.A. Weekley & R.E. Ployhart (Eds.), Situational judgment tests: Theory, measurement and application (pp. 205-232). Mahwah, New
Jersey: Lawrence Erlbaum Associates.
Hunter, D.R. (2003). Measuring general aviation pilot judgment using a situational judgment technique. International Journal of
Aviation Psychology, 13, 373-386.
Kanning, U.P., Grewe, K., Hollenberg, S., & Hadouch, M. (2006). From the subjects’ point of view – Reactions to different types of
situational judgment items. European Journal of Psychological Assessment, 22, 168-176.
Konradt, U., Hertel, G., & Joder, K. (2003). Web-based assessment of call center agents: Development and validation of a
computerized instrument. International Journal of Selection and Assessment, 11, 184-193.
Lievens, F. (2006). International situational judgment tests. In J.A. Weekley & R.E. Ployhart (Eds.) Situational judgment tests:
Introduction
Prof.
Dr. Filip
Lievens
Theory, measurement and application (pp. 279-300).SJT
SIOP
Frontier Series.
Mahwah,
NJ:
Lawrence Erlbaum Associates.
60
Situational Judgment Tests: An introduction to theory, practice, and research.
USEFUL REFERENCES
• Becker, T.E. (2005). Development and validation
process. International Journal of Selection and
of a situational judgment test of employee
Assessment, 14, 142-155.
integrity. International Journal of Selection and • Dalessio, A.T. (1994). Predicting insurance
Assessment, 13, 225-232.
agent turnover using a video-based situational
• Bledow, R., & Frese, M. (2009). A situational
judgment test. Journal of Business and
judgment test of personal initiative and its
Psychology, 9, 23-32.
relationship
to
performance.
Personnel • Funke, U., & Schuler, H. (1998). Validity of
Psychology, 62, 229-258.
stimulus and response components in a video
• Bergman, M.E., Drasgow, F., Donovan, M.A.,
test of social competence. International Journal
& Henning, J.B. (2006). Scoring situational
of Selection and Assessment, 6, 115-123.
judgment tests: Once you get the data, your • Hooper, A.C., Cullen, M.J., & Sackett, P.R. (2006).
troubles begin. International Journal of Selection
Operational threats to the use of SJTs: Faking,
and Assessment, 14, 223-235.
coaching, and retesting issues. In J.A. Weekley &
• Bruce, M. M., & Learner, D. B. (1958). A
R.E. Ployhart (Eds.), Situational judgment tests:
supervisory practices test. Personnel Psychology, Theory, measurement and application (pp. 20511, 207-216.
232). Mahwah, New Jersey: Lawrence Erlbaum
Associates.
• Chan, D., & Schmitt, N. (1997). Video-based
versus paper-and-pencil method of assessment • Hunter, D.R. (2003). Measuring general aviation
is situational judgment tests: Subgroup
pilot judgment using a situational judgment
differences in test performance and face validity
technique. International Journal of Aviation
perceptions. Journal of Applied Psychology, 82, Psychology, 13, 373-386.
143-159.
• Kanning, U.P., Grewe, K., Hollenberg, S., &
• Christian, M. S., Edwards, B. D., & Bradley, J. C.
Hadouch, M. (2006). From the subjects’ point
(2010). Situational judgement tests: Constructs
of view – Reactions to different types of
assessed and a meta-analysis of their criterion- situational judgment items. European Journal of
related validities. Personnel Psychology, 63, 83- Psychological Assessment, 22, 168-176.
117.
• Konradt, U., Hertel, G., & Joder, K. (2003).
• Cullen, M. J., Sackett, P. R., & Lievens, F. (2006).
Web-based assessment of call center agents:
Threats to the operational use of situational
Development and validation of a computerized
judgment tests in the college admission
instrument. International Journal of Selection
process. International Journal of Selection and
and Assessment, 11, 184-193.
Assessment, 14, 142-155.
• Lievens, F. (2006). International situational
• Chan, D., & Schmitt, N. (2002). Situational
judgment tests. In J.A. Weekley & R.E. Ployhart
judgment and job performance. Human
(Eds.) Situational judgment tests: Theory,
Performance, 15, 233-254.
measurement and application (pp. 279-300).
SIOP Frontier Series. Mahwah, NJ: Lawrence
• Clause, C.C., Mullins, M.E., Nee, M.T., Pulakos,
Erlbaum Associates.
E.D., & Schmitt, N. (1998). Parallel test form
development: A procedure for alternative • Lievens, F., Buyse, T., & Sackett, P.R. (2005a). The
predictors and an example. Personnel
operational validity of a video-based situational
Psychology, 51, 193-208.
judgment test for medical college admissions:
Illustrating
the importance of matching predictor
• Clevenger, J., Pereira, G.M., Wiechmann, D.,
and criterion construct domains. Journal of
Schmitt, N., & Schmidt-Harvey, V.S. (2001).
Applied
Psychology, 90, 442-452.
Incremental validity of situational judgment tests.
Journal of Applied Psychology, 86, 410-417.
• Lievens, F., Buyse, T., & Sackett, P.R. (2005b).
Retest effects in operational selection settings:
• Cullen, M.J., Sackett, P.R., & Lievens, F. (2006).
Development and test of a framework. Personnel
Threats to the operational use of situational
Psychology,
58, 981-1007.
judgment tests in the college admission
28—29
• Lievens, F., & Sackett, P.R. (2006). Video-based • Motowidlo, S., Dunnette, M.D., & Carter, G.W.
versus written situational judgment tests: A
(1990). An alternative selection procedure:
comparison in terms of predictive validity. Journal
The low-fidelity simulation. Journal of Applied
of Applied Psychology, 91, 1181-1188.
Psychology, 75, 640-647.
• Lievens, F., & Sackett, P.R. (2007). Situational • Motowidlo, S.J., Hooper, A.C., & Jackson, H.L.
judgment tests in high stakes settings: Issues
(2006). Implicit policies about relations between
and strategies with generating alternate forms.
personality traits and behavioral effectiveness
Journal of Applied Psychology, 92, 1043-1055.
in situational judgment items.Journal of Applied
Psychology,
91, 749-761.
• Lievens, F., Peeters, H., & Schollaert, E. (2008).
Situational judgment tests: A review of recent • Motowidlo, S.J., & Beier, M. E. (2010).
research. Personnel Review, 37, 426-441.
Differentiating specific job knowledge from
implicit trait policies in procedural knowledge
• Lievens, F., Sackett, P.R., & Buyse, T. (2009). The
measured by a situational judgment test. Journal
effects of response instructions on situational
of
Applied Psychology, 95, 321-333.
judgment test performance and validity in a high-
stakes context. Journal of Applied Psychology, • Motowidlo, S.J., Crook, A.E., Kell, H.J., & Naemi,
94, 1095-1101.
B. (2009). Measuring procedural knowledge
more simply with a single-response
• McClough, A.C., & Rogelberg, S.G. (2003).
Selection in teams: An exploration of the • situational judgment test. Journal of Business
Teamwork Knowledge, Skills, and Ability
and Psychology, 24, 281-288.
test. International Journal of Selection and • Nguyen, N.T., Biderman, M.D., & McDaniel, M.A.
Assessment, 11, 56-66.
(2005). Effects of response instructions on
• McDaniel, M.A., Hartman, N.S., Whetzel, D.L., &
faking a situational judgment test.
Grubb, W.L. (2007). Situational judgment tests, • International Journal of Selection and Assessment,
response instructions, and validity: A meta- 13, 250-260.
analysis. Personnel Psychology, 60, 63-91.
• Nguyen, N.T., McDaniel, M.A., & Whetzel, D.L.
• McDaniel, M.A., Morgeson, F.P., Finnegan,
(2005, April). Subgroup differences in situational
E.B., Campion, M.A., & Braverman, E.P. (2001).
judgment test performance: A metaanalysis.
Predicting job performance using situational
Paper presented at the 20th Annual Conference
judgment tests: A clarification of the literature.
of the Society for Industrial and Organizational
Journal of Applied Psychology, 86, 730-740.
Psychology, Los Angeles, CA.
• McDaniel, M.A., & Motowidlo, S. (2005). • O’Connell, M.S., Hartman, N.S., McDaniel, M.A.,
Situational
Judgment
Tests.
Workshop
Grubb, W.L., & Lawrence, A. (2007). Incremental
presented at SIOP.
validity of situational judgment tests for task and
• McDaniel, M.A., & Nguyen, N.T. (2001).
contextual job performance. International Journal
Situational judgment tests: A review of practice
of Selection and Assessment, 15, 19-29.
and constructs assessed. International Journal • Olson-Buchanan, J.B., & Drasgow, F. (2006).
of Selection and Assessment, 9, 103-113.
Multimedia situational judgment tests: The
• McDaniel, M.A., & Whetzel, D.L. (2005).
medium creates the message. In J.A. Weekley &
Situational judgment test research: Informing
R.E. Ployhart (Eds.), Situational Judgment Tests
the debate on practical intelligence theory.
(pp. 253-278). San Francisco, Jossey Bass.
Intelligence, 33, 515-525.
• Olson-Buchanan, J. B., Drasgow, F., Moberg, P.
• McHenry, J. J. & Schmitt, N. (1994). Multimedia
J., Mead, A. D., Keenan, P. A., & Donovan, M. A.
testing. In M. G. Rumsey & C. B. Walker (Eds.),
(1998). Interactive video assessment of conflict
Personnel selection and classification (pp. 193- resolution skills. Personnel Psychology, 51, 1-24.
232). Hillsdale, NJ: Erlbaum.
• Oswald, F.L., Friede, A.J., Schmitt, N., Kim, B.K.,
• Morgeson, F.P., Reider, M.H., & Campion, M.A. & Ramsay, L.J. (2005). Extending a practical
(2005). Selecting individuals in team settings:
method for developing alternate test forms
The importance of social skills, personality
using independent sets of items. Organizational
characteristics, and teamwork knowledge. Research methods, 8, 149-164.
Personnel Psychology, 58, 583-611.
•
Situational Judgment Tests: An introduction to theory, practice, and research.
• Oswald, F.L., Schmitt, N., Kim, B.H., Ramsay, L.J.,
using cognitive and noncognitive predictors and
& Gillespie, M.A. (2004). Developing a biodata
the impact of demographic status on admitted
measure and situational judgment inventory
students. Journal of Applied Psychology, 94,
as predictors of college student performance. 1479-1497.
Journal of Applied Psychology, 89, 187-208.
• Stevens, M.J., & Campion, M.A. (1999). Staffing
• Peeters, H., & Lievens, F. (2005). Situational
work teams: Development and validation
judgment tests and their predictiveness of
of a selection test for teamwork. Journal of
college students’ success: The influence of faking. Management, 25, 207-228.
Educational and Psychological Measurement, • Such, M.J., & Schmidt, D.B. (2004, April).
65, 70-89.
Examining the effectiveness of empirical keying:
• Ployhart, R.E., & Ehrhart, M.G. (2003). Be careful
A cross-cultural perspective. Paper presented
what you ask for: Effects of response instructions
at the 19th Annual Conference of the Society
on the construct validity and reliability of
for Industrial and Organizational Psychology,
situational judgment tests. International Journal
Chicago, IL.
of Selection and Assessment, 11, 1-16.
• Wagner, R.K., & Sternberg, R.J. (1985). Practical
• Ployhart, R.E., Weekley, J.A., Holtz, B.C. & Kemp,
intelligence in real world pursuits: The role of
C.F. (2003). Web-based and paper-and-pencil
tacit knowledge. Journal of Personality and
testing of applicants in a proctored setting: Are
Social Psychology, 49, 436-458.
personality, biodata, and situational judgment • Weekley, J.A., & Jones, C. (1997). Video-based
tests comparable? Personnel Psychology, 56,
situational testing. Personnel Psychology, 50,
733-752.
25-49.
• Smiderle, D., Perry, B. A., & Cronshaw, S. F. • Weekley, J.A., & Jones, C. (1999). Further studies
(1994). Evaluation of video-based assessment
of situational tests. Personnel Psychology, 52,
in transit operator selection. Journal of Business
679-700.
and Psychology, 9, 3-22.
• Weekley, J.A., & Ployhart, R.E. (2006). Situational
• Schmitt, N., & Chan, D. (2006). Situational
judgment tests: Theory, measurement and
judgment tests: Method or construct? In J.
application. San Francisco, Jossey Bass.
Weekley & R.E. Ployhart (Eds.), Situational
judgment tests (pp. 135-156). Mahwah, NJ: • Whetzel, D.L., & McDaniel, M.A. (2009).
Situational judgment tests: An overview of
Lawrence Erlbaum
current research. Human Resource Management
• Schmitt, N., Keeney, J., Oswald, F. L., Pleskac, Review, 19, 188-202.
T., Quinn, A., Sinha, R., & Zorzie, M. (2009).
Prediction of 4-year college student performance
30—31
Evaluations of
situational judgments
tests for high stakes
selection.
— Prof. Fiona Patterson
Departamento de Psicología de
la Universidad de Cambridge,
Reino Unido y directora del Work
Psychology Group.
Evaluations of situational judgement tests
for high stakes selection:
Research & practice in the healthcare
professions
Professor Fiona Patterson
Work Psychology Group & University of Cambridge
Madrid, June 2013
Overview
• Selecting doctors in the UK National Health Service
• Selection methods
– Low fidelity: Knowledge test, situational judgement test (SJT)
– High fidelity: Assessment Centre (AC)
• Predictive validity studies
• Low fidelity simulations are cost-effective to develop,
administer & score, but how do they compare to high fidelity
ACs?
• Current research & implications for policy & practice
UK Medical Training & Career Pathway
Specialty Training
Undergraduate
medical
school
training
GP Specialty
Training
FT1
FT2
Core
Specialty
Training
Selection
Gateway 1
Foundation Training
(FT)
2 Years
Selection
Gateway 2
CCT &
eligible for
Consultant
Grade
3 years
Higher
Specialty
Training
Specialty Training (ST)
8 Years
Selection
Gateway 3
32—33
Selection in medicine through the ages…
“Work for me, son – I
knew your father.”
1970
“ Fill out the application
form for HR
and the job is
yours, mate.”
1980
• 26,000
applicants for
8,000 medical
school places
“It isn’t an interview – just
an informal
chat, sweetie.
Just a
formality.”
1990
Help!
• 8,000 medical
students apply
for their first
post
• 10,000 speciality
applicants
• 27,000 +
interviews
• Weeks of
offering,
rejecting,
cascading
1000s Consultant hours
Key research questions
What non-cognitive attributes are important to be
an effective clinician?
What methods are available to test these in high
stakes selection?
Should we use personality testing?
Given the costs, beyond some basic assessment,
is a lottery the best option?
Evaluations of situational judgments tests for high stakes selection.
Why not use a lottery system?
GP Specialty selection (selection gateway 3)
N= 6,500 applicants per year for 3,000 training posts
UK Medical Training & Career Pathway
Specialty Training
Undergraduate
medical
school
training
FT1
FT2
GP Specialty
Training
Core
Specialty
Training
Selection
Gateway 1
Foundation Training
(FT)
2 Years
Selection
Gateway 2
CCT &
eligible for
Consultant
Grade
3 years
Higher
Specialty
Training
Specialty Training (ST)
8 Years
Selection
Gateway 3
34—35
Job analysis of the GP role
• GP selection system is based on
multi-source, multi-method job analysis
Patterson et al, BJGP, 2000; Patterson et al, BJGP, 2013
•
•
•
•
•
•
Empathy & sensitivity
Communication skills
Problem-solving
Professional integrity
Coping with pressure
Clinical expertise
British Journal of General Practice, May 2013
Previously…
Shortlisting
Interview
GP Selection
National
Longlisting
Electronic
Application
Process
Foundation
Competency
National
panel
Regional
Selection Tests
Assessment Centre
(AC)
Clinical Problem solving test
CPS
Situational Judgement test
SJT
Assessment Centre Using
Simulated Consultations
Ranking
Matching to Region
Evaluations of situational judgments tests for high stakes selection.
Clinical Problem Solving (CPS) Knowledge Test
• Applying clinical knowledge in relevant contexts, e.g. diagnosis, management
• Item type single best answer/ multiple choice
• Operational test: 100 items, duration 90 minutes
Reduced Vision
A.
Basilar migraine
F.
Central retinal vein occlusion
B.
Cerebral tumour
G.
Optic neuritis (demyelinating)
C.
Cranial arteritis
H.
Retinal detachment
D.
Macular degeneration
I.
Tobacco optic neuropathy
E.
Central retinal artery occlusion
For each patient below select the SINGLE most likely diagnosis from the list above. Each option
may be selected once, more than once or not at all.
1. A 75 year old man, who is a heavy smoker, with a blood pressure of 170/105, complains of
floaters in the left eye for many months and flashing lights in bright sunlight. He has now
noticed a "curtain" across his vision.
Situational Judgement Test (SJT)
• Professional dilemmas focusing on non-cognitive attributes in
complex interpersonal scenarios (empathy, integrity, coping with
pressure, teamworking)
• Item writing by subject matter experts, concordance panel to agree
scoring key
• Cognitively-oriented (‘What should you do?’)
• Item types: rank 4/5 options, choose best 2/3
• Operational tests comprise 50 items, in 90 minutes
Example SJT item
You are reviewing a routine drug chart for a patient with
rheumatoid arthritis during an overnight shift. You notice that
your consultant has inappropriately prescribed methotrexate
7.5mg daily instead of weekly.
Rank in order the following actions in response to this situation (1= Most
appropriate; 5= Least appropriate)
A Ask the nurses if the consultant has made any other drug errors
recently
B Correct the prescription to 7.5mg weekly
C Leave the prescription unchanged until the consultant ward round
the following morning
D Phone the consultant at home to ask about changing the prescription
E Inform the patient of the error
36—37
Predictive Validity Results
GP Selection: Validation studies
Study 1.
Supervisor ratings after 1 year into training
Study 2.
End-of-training outcomes (licensure exam)
after 3 years
Study 3.
Structural (theoretical) model to evaluate the
incremental validity for each selection method
Study 1: Correlations between the selection
methods & job performance after 1 year
N=196
1.
2.
Mean
SD
1. Clinical problem-solving test
78.88
9.02
2. Situational judgement test
637.87 34.31 .50
3. Assessment centre
3.32
0.39
.30
.43
4.63
0.73
.36
(.54)
.37
(.56)
3.
Selection methods (Predictors)
Outcome variable
4. Supervisor ratings
.30
(.50)
Note. Correlations between parentheses were corrected for multivariate range restriction. Correlations are
significant at p < .01
Evaluations of situational judgments tests for high stakes selection.
Study 2: Correlations between the selection
methods & end-of-training licensure assessments
N=2292
Mean
SD
1.
2.
3.
4.
Selection methods (Predictors)
1. Clinical problem-solving test
80.08 8.14
2. Situational judgement test
640.13 31.66 .40
3. Assessment centre
3.34
0.36
0.26
0.90
.24
.32
Outcome variables
4. End of training applied
knowledge test
5. End of training clinical skills
exam (Simulated surgery)
0.20
.73 .43 .24
(.85) (.69) (.41)
0.80 .38 .43 .32
(.55) (.57) (.41)
.41
Note. Correlations between parentheses were corrected for multivariate range restriction. Correlations are
significant at p < .01
Study 3. Structural/theoretical model showing
selection methods & their link to job performance
Lievens & Patterson (2011)
SJT fully mediates the effects of declarative knowledge
on job performance. The AC partially mediates the effects
of the SJT
Significant incremental validity offered by the AC
Theoretical Model Underlying
SJTs in GP Selection
General
Experience
Implicit Trait
Policies
Specific Job
Experience
Specific Job
Knowledge
Job/Training
Performance
Procedural
Knowledge Captured
by the SJT
38—39
Current Research into Medical & Dental
School Admissions
UK Clinical Aptitude Test (UKCAT)
5 subtests
• Verbal, numerical, abstract reasoning
& decision analysis
• Non-cognitive analysis using an SJT targeting empathy, integrity & team
involvement
http://www.ukcat.ac.uk/
UK Medical Training & Career Pathway
Specialty Training
Undergraduate
medical
school
training
FT1
GP Specialty
Training
FT2
Core
Specialty
Training
Selection
Gateway 1
Foundation Training
(FT)
2 Years
Selection
Gateway 2
CCT &
eligible for
Consultant
Grade
3 years
Higher
Specialty
Training
Specialty Training (ST)
8 Years
Selection
Gateway 3
SJT Test Specification
Content
• Scenarios based in either a clinical setting or during
education/training for a medical/dental career
• Third party perspective to increase breadth of available scenarios
Response Format (rating using a 4 point scale)
• Rate the appropriateness of a series of options from ‘very appropriate’ to ‘very inappropriate.
• Rate the importance of a series of options from ‘very important’ to ‘not important at all’
Evaluations of situational judgments tests for high stakes selection.
Example UKCAT SJT items
A consultation is taking place between a senior doctor and a patient; a medical
student is observing. The senior doctor tells the patient that he requires some
blood tests to rule out a terminal disease. The senior doctor is called away
urgently, leaving the medical student alone with the patient. The patient tells the
student that he is worried he is going to die and asks the student what the blood
tests will show.
How appropriate are each of the following responses by the medical student in
this situation?
Q1 Explain to the patient that he is unable to comment on what the tests will
show as he is a medical student
Q2 Acknowledge the patient’s concerns and ask whether he would like them to be raised with the senior doctor
Q3 Suggest to the patient that he poses these questions to the senior doctor
when he returns
Q4 Tell the patient that he should not worry and that it is unlikely that he will die
UKCAT SJT Evaluation
• Reliability of a 70 item test with similar quality items estimated
(α=.75 to .85)
• Candidate reactions show high face validity (significantly more
than the cognitive subsections of UKCAT)
• Content of SJT relevant for med/dental applicants = 70%
• Content of the SJT is fair to med/dental applicants = 63%
Group differences & content validity
• Gender: Females outperform males (0.2 SD)
• Ethnicity: White candidates performed better (0.3SD)
• Occupation & Employment Status: those in the higher occupational
classes (i.e. Managerial/Professional Occupations) do not always score
higher than those in lower classes - in some cases those from lowest
occupational groups, received the highest mean score.
• SJT correlates with other subtests (approx r=0.28) indicating some
shared variance between the tests. Since a large amount of variance
is not explained, the SJT is assessing different constructs to the other
tests.
40—41
Implications for policy & practice
• Strong predictive validity translates into significant gains in utility
• Focus not on ‘how much validity’ does a selection method add but more ‘valid for what’
• Closer attention to the criterion constructs targeted as high-fidelity
simulations show incremental validity over low-fidelity simulations for
predicting interpersonal job performance dimensions
• Bespoke job analysis is the cornerstone to effective selection
• Positive candidate reactions (Patterson et al, 2010)
• Political validity is an important concept in this setting (Patterson et al,
2012)
Thank you
f.patterson@workpsychologygroup.com
Some useful links
http://www.workpsychologygroup.com/
http://www.gprecruitment.org.uk/
http://www.isfp.org.uk/
http://www.ukcat.ac.uk/about-the-test/behavioural-test/
http://www.agpt.com.au/
http://www.gmc-uk.org/about/research/14400.asp
Evaluations of situational judgments tests for high stakes selection.
42—43
Situational Judgment
Test in EPSO open
competitions for the
European Institutions
from 2010 to present.
— Mr. Stefan Meyer
EPSO (European Personnel
Selection Office).
Situational Judgment Tests
in EPSO open competitions
for the European Institutions
Stefan Meyer, European Personnel Selection Office
Madrid, 26 June 2013
1
The European Union
Commission
Economic &
Social Committee
Council
Committee of
the Regions
Parliament
Ombudsman
Court of Justice
Data Protection
Supervisor
Court of Auditors
The EU Institutions
Posts in 2012
European Parliament
6655
Council of the European Union
3153
European Commission
Court of Justice
25478
1952
European Court of Auditors
887
European Economic and Social Committee
724
Committee of the Regions
531
European Ombudsman
European Data Protection Supervisor
European External Action Service
Total
European
Court
of Auditors
2%
Court of
Justice
5%
European
External
Action Service
European
4%
Others Parliament
16%
3%
66
43
1670
European
Commission
62%
41159
44—45
Council of
the
European
Union
8%
Entry Streams
•  Administrators (AD)
e.g. policy officers, lawyers, auditors, scientific
officers, translators, interpreters, communication
& press officers …
Qualification: University degree (bachelor)
•  Assistants (AST)
e.g. secretaries, HR assistants, financial
assistants, conference technicians …
Qualification: Secondary education (at least) and
relevant professional experience
Where in competition procedure?
Registration
CBT
Admission
Assessment
centre
Reserve
list
5
EPSO Development Programme
(2008 – 2011)
Redesign of the CBT pre-selection phase:
•  Keep verbal reasoning
•  Keep numerical reasoning
•  Skip EU knowledge
•  Introduce abstract reasoning tests
•  Introduce Situational Judgement Tests
•  Introduce professional skills testing,
including linguistic skills
•  New linguistic regime as of 2011: cognitive
tests in 23 languages
6
Situational Judgment Test in EPSO open competitions for the European Institutions from 2010 to present.
New EPSO admission phase
Abstract
Reasoning
Situational
Judgement Tests
(in EN, FR, DE; where
appropriate)
CBT
Verbal &
Numerical reasoning
- In 23 languages as of 2011!
Professional / Linguistic
Skills (where appropriate)
7
Situational Judgment Tests
8
EPSO Development Programme
•  EDP, action No. 8:
The introduction of situational/
behavioural testing based on a wellfounded competency framework
Why?
•  Good indicators for job performance
•  Widely used and perceived as relevant
+ fair
46—47
9
SJT competencies*
(from the EU competency framework)
Analysing &
Problem
solving
Delivering
Quality &
Results
Prioritising&
Organising
Resilience
Working
with Others
!  Identifies the critical facts in complex issues and develops
creative and practical solutions
!  Takes personal responsibility and initiative for delivering
work to a high standard of quality within set procedures
!  Prioritises the most important tasks, works flexibly and
organises own workload efficiently
!  Remains effective under a heavy workload, handles
organisational frustrations positively and adapts to a
changing work environment
!  Works co-operatively with others in teams and across
organisational boundaries and respects differences between
people
10
SJT: features
–  SJT evaluate workplace-related behaviour
–  Target groups: AD5 and AST3 annual cycle of
competitions
–  Administered @ CBT stage
–  Score report used as expert input @ AC stage
–  20 items test in 30 mins. in either EN, FR or DE
=> NOT a speed test!
–  Tests developed in cooperation with expert external
contractor
–  First launch: AD cycle 2010 (non-eliminatory)
11
Multi-stage TD process for SJT
1.  Creation of draft item-models in EN
2.  Review by experienced SME (contractor) =>
amendments/modifications (if applicable)
3.  Statistical trials => selection of item models
4.  Review by EPSO quality board in view of
–  Organisational fit
–  Degrees of ambiguety
–  Translation/localisation (and other)
5.  Translation into FR and DE
6.  CBT upload => ROLL OUT
12
Situational Judgment Test in EPSO open competitions for the European Institutions from 2010 to present.
Which are the most and least effective
options?
You work as part of a technical support team that produces
work internally for an organisation. You have noticed that
often work is not performed correctly or a step has been
omitted from a procedure.
You are aware that some
individuals are more at fault than others as they do not make
the effort to produce high quality results and they work in a
disorganised way.
a. 
Explain to your team why these procedures are important and
what the consequences are of not performing these correctly.
b. 
Try to arrange for your team to observe another team in the
organisation who produce high quality work.
c. 
Check your own work and that of everyone else in the team to
make sure any errors are found.
d. 
Suggest that the team tries many different ways to approach
their work to see if they can find a method where fewer mistakes
are made.
M
L
13
Cdts sentiment towards SJT
14
SJT cumulative score distributions
15
48—49
To sum up
•  No evidence of bias by language version
•  No evidence of bias by gender or age
•  Good psychometric properties for the tests
–  Reliabilities found to exceed 0.7 for the short forms
–  This has been confirmed for the longer 20 items tests
•  Score ranges showed that the tests would discriminate
between candidates in an effective but fair way
…. and candidates actually like SJT!
16
Thank you
for your attention!
17
Situational Judgment Test in EPSO open competitions for the European Institutions from 2010 to present.
50—51
Medición de competencias
mediante Modelos de
Diagnóstico Cognitivo:
aplicación a un Test de
Juicio Situacional.
— Prof. Julio Olea
Catedrático de Psicología de la
Universidad Autónoma de Madrid y
codirector de la cátedra UAM-IIC Modelos
y aplicaciones psicométricos (MAP).
Índice de la presentación
– Ejemplo: estimación de la competencia
matemática.
– MDC: modelos, proceso de aplicación y
aportaciones previsibles para la medición de
competencias.
– Estudio empírico: objetivos, procedimiento y
resultados al aplicar un MDC a un TJS.
Ejemplo: Competencia matemática
de un niño de 6 años
• ¿Tiene adquiridas las competencias de suma y
de resta de números naturales?
• 4 posibles estados latentes:
a)
b)
c)
d)
No suma, no resta (0, 0)
Suma, no resta (1, 0)
No suma, resta (0, 1)
Suma, resta (1, 1)
Medición de competencias mediante Modelos de
Diagnóstico Cognitivo:
aplicación a un Test de Juicio Situacional
Pablo E. García (IIC)
Julio Olea (UAM)
Jimmy de la Torre (Rutgers U.)
52—53
Test
Respuestas de un niño
Ítem 1:
3+4=
7
Ítem 2:
7-2=
9
Ítem 3:
7+6-3=
16 Test
Respuestas de un niño
Ítem 1:
3+4=
7
Ítem 2:
7-2=
9
Ítem 3:
7+6-3=
16 ¿Qué estado latente es más probable?
a) No suma, no resta (0, 0)
b) Suma, no resta (1, 0)
c) No suma, resta (0, 1)
d) Suma, resta (1, 1)
El MDC más simple: modelo DINA
• Matriz Q:
• Asume dos parámetros para cada ítem:
• s: slipping (desliz). P. ej. 0.1
• g: guessing (acierto por azar). P. ej. 0.2
• Asume que es preciso tener todas las competencias requeridas para
acertar un ítem.
Medición de competencias mediante Modelos de Diagnóstico Cognitivo: aplicación a un Test de Juicio Situacional.
Probabilidades de acierto
Probabilidades de acierto
ESTADO LATENTE (ɲ)
Ítem 1
Ítem 2
Ítem 3
(1,0)
(0,1)
(1,1)
No suma, no resta (0, 0)
0.2
0.2
0.2
Suma, no resta (1, 0)
0.9
0.2
0.2
No suma, resta (0, 1)
0.2
0.9
0.2
Suma, resta (1, 1)
0.9
0.9
0.9
Verosimilitud
Probabilidades para el niño
ESTADO
LATENTE (ɲ)
Ítem 1
Ítem 2
Ítem 3
Verosimilitud
No suma, no resta (0, 0)
0.2
0.8
0.8
0.032
Suma, no resta (1, 0)
0.9
0.8
0.8
0.144
No suma, resta (0, 1)
0.2
0.1
0.8
0.016
Suma, resta (1, 1)
0.9
0.1
0.1
0.009
Verosimilitud
Probabilidades para el niño
ESTADO
LATENTE (ɲ)
Ítem 1
Ítem 2
Ítem 3
Verosimilitud
No suma, no resta (0, 0)
0.2
0.8
0.8
0.032
Suma, no resta (1, 0)
0.9
0.8
0.8
0.144
No suma, resta (0, 1)
0.2
0.1
0.8
0.016
Suma, resta (1, 1)
0.9
0.1
0.1
0.009
54—55
Diferentes MDC
VARIEDAD DE MODELOS (Rupp, Templin & Henson, 2010):
•
No compensatorios:
•
Compensatorios:
•
Modelos generales:
– DINA: Deterministic-input, noisy-and-gate (Junker & Sijtsma, 2001).
– DINO: Deterministic-input, noisy-or-gate (Templin & Henson, 2006): s es la probabilidad de
fallo cuando “al menos” un atributo está presente.
–
G-DINA (de la Torre, 2011).
PECULIARIDADES:
•
•
•
Modelos multidimensionales.
Naturaleza confirmatoria: podemos obtener indicadores de ajuste para aplicar un
modelo u otro.
Novedosos, en relación a otros modelos psicométricos (TCT, TRI): mayor
información “diagnóstica”.
Proceso de aplicación de un MDC
Jueces
expertos
2 K
[0 1 0 1 1]
TJS para la medición de competencias
• Problemas habituales de validez (Christian, Edwards &
Bradley, 2010; Ployhart & MacKenzie, 2011).
– Evidencias sobre validez de contenido: ¿Qué miden
exactamente los TJS? ¿Una o diferentes competencias?
Normalmente se extrae una única puntuación.
– Evidencias sobre la estructura interna: ¿Cómo comprobar
que las competencias están realmente reflejadas en los
ítems o situaciones?
– Evidencias sobre relaciones con otras variables. Estudios
de validez referida al criterio inconsistentes. Estudios
prometedores de validez predictiva (Lievens & Sackett,
2012).
Medición de competencias mediante Modelos de Diagnóstico Cognitivo: aplicación a un Test de Juicio Situacional.
TJS para la medición de competencias
(Lievens & Sackett, 2012)
PREDICTORES
(examen de admisión)
Test de capacidades cognitivas y de
conocimientos científicos
Test de contenidos médicos
TJS en video (interpersonal skills)
CRITERIOS (ɴ)
Calificaciones en las
prácticas (7años)
Rendimiento en el
trabajo (9 años)
.13(*)
.12
-.02
-.17
.22(*)
.23(*)
¿Qué pueden aportar los MDC a la
medición de competencias?
Evidencias sobre el contenido
Evidencias sobre la estructura
interna y los procesos de
respuesta
Evidencias sobre relaciones
con otras variables
Matriz Q
Ajuste del modelo a los
datos
???
Habrá que
comprobarlo
Estudio empírico: objetivos
• Aplicar los MDC a la medición de competencias
profesionales a través de TJS.
• Pregunta principal: ¿qué competencias podemos juzgar que
tienen o no tienen los evaluados?
• Preguntas adicionales:
– ¿Qué competencias están incorporadas en las situaciones?
– ¿Son todas relevantes?
– ¿Se requieren todas las competencias incluidas en una situación
para emitir la respuesta apropiada?
– ¿Se relacionan las medidas categóricas que proporciona el MDC
con otras variables?
56—57
Participantes e instrumentos
Participantes: 485 empleados de una entidad bancaria española (programa de
desarrollo interno). Se aplicó el TJS y un test de personalidad (Big Five).
TJS informatizado de 30 ítems (perteneciente a la batería eValue del IIC).Cada ítem
comienza con un incidente crítico, seguido de una pregunta y tres opciones.
ÍTEM 4: Durante una reunión con su equipo de trabajo y algunos responsables de otras
áreas, tras exponer las cualidades del proyecto en desarrollo, uno de los responsables
contesta: “Encuentro que su proyecto presenta algunas carencias”.
¿Qué respondería?
a) No estoy de acuerdo, pero explíqueme exactamente qué tipo de carencias y
examinemos todos los puntos sobre los que tenga cualquier duda.
b) Probablemente es que no hemos profundizado lo suficiente en las ventajas.
Repasemos de nuevo cada uno de los puntos del proyecto.
c) Puede ser, es un tema sobre el que estamos trabajando y al que esperamos darle
pronto solución. Aún así, será rentable de igual modo.
Definición de la matriz Q
7
Librería de competencias Great-Eight (SHL Group; Kurz &
Bartram, 2002; Bartram, 2005). Tres niveles:
1. 8 factores generales.
2. 20 competencias
3. 112 componentes.
Competencias
Factores
Big Five
Interactuar y
presentar
Extraversión
Adaptarse y
aguantar
Estabilidad
emocional
Decidir e iniciar acciones
Relacionarse y establecer redes
Persuadir e influenciar
Presentar y comunicar información
Seguir instrucciones y procedimientos
Adaptarse y responder al cambio
Aguantar la presión y contratiempos
Definición de la matriz Q
• 4 expertos en Psicología organizacional especificaron la
Matriz-Q:
– Se les proporcionó varios indicadores para cada competencia.
– Valoraron, para cada ítem: ¿qué competencias son necesarias
para dar la respuesta considerada como más adecuada?
– La tarea se llevó a cabo en 3 fases:
• Fase 1: tarea individual.
• Fase 2: posible reconsideración cuando conocen los resultados de la
fase 1.
• Fase 3: grupo de discusión para intentar llegar a mayores acuerdos.
– Una competencia (Relacionarse y establecer redes) fue asignada sólo a 4
ítems.
Medición de competencias mediante Modelos de Diagnóstico Cognitivo: aplicación a un Test de Juicio Situacional.
Definición de la matriz Q
• ¿Cuál de las matrices Q?
– Se valoró estadísticamente el ajuste de diferentes posibles matrices.
– Chen, de la Torre & Zhang (en prensa): el índice BIC (Bayesian
Information Criterion; Schwarz, 1976) permite elegir la mejor matriz,
asumiendo como cierto el modelo G-DINA.
Matriz-Q (7)
࡮ࡵ࡯ = െ૛ࡸࡸ + ࡼ ࢒࢔(ࡺ)
BIC
Matriz-Q (6)
BIC
Unanimidad
16.115,88
Unanimidad
15.782,75
Al menos dos
16.169,48
Al menos dos
15.749,79
Al menos uno
16.300,34
Al menos uno
15.840,37
1 competencia
eliminada
4 ítems
eliminados
Definición de la matriz Q
1. Persuadir e influenciar 2.Presentar y comunicar información 3.Seguir instrucciones y procedimientos
4.Adaptarse y responder al cambio 5.Aguantar la presión y contratiempos 6.Decidir e iniciar acciones
Item
1
2
3
4
5
6
7
8
9
10
11
12
13
Competency
3
4
5
Item
1
2
6
0
0
1
1
1
0
0
1
0
1
1
0
1
1
0
0
0
1
1
0
0
0
1
0
0
1
0
0
0
0
0
0
1
1
1
0
1
0
0
0
0
0
0
0
1
1
1
0
0
0
1
1
1
0
1
1
0
0
0
0
0
0
0
1
0
1
1
1
0
0
0
1
0
0
1
0
1
1
14
15
16
17
18
19
20
21
22
23
24
25
26
Competency
3
4
5
1
2
0
0
1
1
0
0
1
1
0
0
0
6
1
0
1
0
0
0
0
0
0
1
0
0
0
1
0
0
0
0
0
1
1
0
0
0
0
1
1
0
0
0
1
0
0
1
1
0
0
0
0
1
1
1
0
0
0
1
1
1
0
1
1
0
0
0
1
1
1
0
0
0
1
1
1
0
0
0
1
ÍTEM 4: Durante una reunión con su equipo de trabajo y algunos responsables de otras áreas, tras exponer las cualidades del
proyecto en desarrollo, uno de los responsables contesta: “Encuentro que su proyecto presenta algunas carencias”.
¿Qué respondería?
a) No estoy de acuerdo, pero explíqueme exactamente qué tipo de carencias y examinemos todos los puntos sobre los que tenga
cualquier duda.
Selección del MDC
• Ajuste relativo:
– Una vez seleccionada la Matriz-Q, se comparó el modelo
G-DINA con modelos más restrictivos (y parsimoniosos):
DINA (no-compensatorio) y DINO (compensatorio).
– En la medida en que son modelos anidados, se empleó la
razón de verosimilitudes (Likelihood Ratio) para valorar si
la pérdida de ajuste al emplear un modelo más sencillo era
estadísticamente significativa. Así fue.
• Se seleccionó en consecuencia el modelo G-DINA (de
la Torre, 2011).
58—59
Estimación de perfiles de competencias
• Nº de perfiles posibles:
• Los más frecuentes:
2K
26
%
64
Competency
1
2
3
4
5
6
9.46
1
1
0
0
1
1
8.25
1
0
0
0
1
0
8.08
0
1
0
1
0
0
7.93
1
1
1
0
1
1
7.05
1
1
1
1
1
1
1. Persuadir e influenciar 2.Presentar y comunicar información 3.Seguir instrucciones y procedimientos
4.Adaptarse y responder al cambio 5.Aguantar la presión y contratiempos 6.Decidir e iniciar acciones
Evidencias de validez: relaciones con
otras variables
Adaptarse y aguantar
Estabilidad emocional (promedio)
Posee la competencia
69,26
No posee la competencia
64,59
T=-3,37; punilateral=0,0005
Interactuar y presentar
Extraversión (promedio)
Posee la competencia
56,63
No posee la competencia
53,58
T=-2,52; punilateral=0,006
Medición de competencias mediante Modelos de Diagnóstico Cognitivo: aplicación a un Test de Juicio Situacional.
CONCLUSIONES: ¿Qué pueden aportar los
MDC a la medición de competencias?
Evidencias sobre el contenido
Evidencias sobre la estructura
interna y los procesos de
respuesta
- Permiten especificar los
contenidos incluidos en los
ítems.
- Podemos quedarnos con las competencias
realmente válidas.
-El modelo que mejor ajusta nos
proporciona información sobre los procesos
de respuesta.
Evidencias sobre relaciones
con otras variables
- Convendría estudiar si proporcionan
mejores pronósticos del rendimiento
en el trabajo.
MUCHAS GRACIAS
THANK YOU VERY MUCH
60—61
RESUMEN POSTERS
Competencias de empleabilidad:
diseño de un test situacional.
— Laura Arnau Sabate1, Josefina Sala Roca1,
Mercè Jariot2, Teresa Eulalia Marzo3, Adrià Pagès2
RESUMEN
Las competencias de empleabilidad son competencias básicas necesarias para obtener y
mantener un trabajo independientemente del tipo y sector de ocupación. Son transversales
y transferibles a otras situaciones o contextos personales y profesionales.
Nuestro grupo ha desarrollado una propuesta de competencias de empleabililidad (autoorganización, contrucción del proyecto formativo-profesional, toma de decisiones y resolución de problemas, trabajo en equipo, comunicación, perseverancia, flexibilidad y responsabilidad) validado por un grupo de expertos en una primera fase (Arnau et al, 2013) y que
posteriormente ha sido validado con profesionales de los diferentes sectores económicos
de la población (trabajo pendiente de publicación).
Se ha diseñado un test situacional a partir de la creación de 5 historias ambientadas en
diferentes contextos de la vida cotidiana (de amigos, escolar, familiar, más formales o informales). En estas historias se plantean situaciones a las que el sujeto debe responder
como va a actuar frente a esa situación. Estas situaciones requieren de un buen nivel de
competencia para ser afrontadas adecuadamente. Se contó con un panel de expertos para
la elaboración de las historias y situaciones.
Para elaborar las respuestas, se realizaron entrevistas a 115 jóvenes de edades entre 12 y
18 años, que fueron transcritas y sometidas a análisis de contenido. El panel de expertos
seleccionó de entre las respuestas dadas 5 respuestas que podrían ser indicadoras de
diferentes niveles de competencia y que asimismo asegurarían suficiente variabilidad.
Posteriormente otro panel de 11 expertos distintos está validando la asignación de cada una
de las situaciones como indicador de las competencias, y en una segunda fase se pedirá a
estos expertos que asignen un valor a cada respuesta en función del nivel de competencia.
Una vez construido el test será validado para comprobar sus propiedades psicométricas y
revisar su estructura interna.
1
2
3
Departament de Pedagogia Sistemàtica i Social. Universitat Autònoma de Barcelona
Departament de Pedagogia Aplicada. Universitat Autònoma de Barcelona
Pere Tarrés. Universitat Ramón Llull
62—63
Análisis de la estructura cognitiva del área de
habilidades cuantitativas del exhcoba mediante
los modelos LLTM y LSDM.
— Juan Carlos Pérez Morán1, Norma Larrazolo
Reyna y Eduardo Backhoff Escudero
Instituto de Investigación y Desarrollo Educativo
Universidad Autónoma de Baja California
RESUMEN
En el presente estudio se integran aspectos de la psicología cognitiva y la psicometría (Rupp
y Mislevy, 2006) para el análisis de la estructura cognitiva del área de Habilidades Cuantitativas (HC) del Examen de Habilidades y Conocimientos Básicos (EXHCOBA) (Backhoff,
Tirado, y Larrazolo, 2001), mediante el Linear Logistic Latent Trait Model (en inglés, LLTM)
de Fischer (1973, 1997) y el Least Squares Distance Method (en inglés, LDSM) propuesto
por Dimitrov (2007). Para alcanzar dicho propósito, un grupo de expertos propuso catorce
operaciones para resolver los ítems del área HC apoyándose en los resultados de reportes
verbales concurrentes y retrospectivos realizados a un grupo de estudiantes de educación
secundaria. Además, se analizó la estructura cognitiva determinada por los expertos y se
evaluó la dimensionalidad y el ajuste de los ítems a los modelos Rasch y LLTM. Los resultados mostraron que los datos ajustan moderadamente al modelo LLTM con una matriz Q de
catorce atributos y que un refinamiento de las operaciones para resolver los ítems podría
explicar mejor sus fuentes de dificultad y el rendimiento de los examinados en la prueba.
Además, la aplicación del LSDM mostró que es necesario mejorar la validación de los atributos cognitivos requeridos en la solución de ítems del área de HC del EXHCOBA.
Palabras clave: modelo LLTM, modelo Rasch, matriz Q, EXHCOBA.
1
Presentador: Juan Carlos Pérez Morán
Correo electrónico: jucarpint@gmail.com
Instituto de Investigación y Desarrollo Educativo Universidad Autónoma de Baja California
T. celular. 044 (664) 3851253 · T. (646) 175-07-33 ext. 64508 · F. (646) 174-20-60
Midiendo la competencia Iniciativa-Innovación
mediante Test: diseño, construcción y análisis
psicométrico inicial de un Test Informatizado
de Juicio Situacional.
— Sonia Rodríguez, Beatriz Lucía, David Aguado
Instituto de Ingeniería del Conocimiento
RESUMEN
La competencia Iniciativa-Innovación es una competencia transversal definida en la mayoría
de los modelos de competencias de mayor uso tanto en nuestro país como en el contexto
internacional (Bartram, 2002; Alles, 2002, Lombardo, 2005).
En este trabajo se presenta el diseño, construcción y análisis de un test situacional orientado a la evaluación de esta competencia. En la prueba diseñada el evaluado debe imaginar
que forma parte de un equipo de trabajo e interactuar con el resto de componentes a través
del correo electrónico. Los diferentes mensajes de correo electrónico que recibe plantean
situaciones donde es posible manifestar los diferentes comportamientos asociados a la
competencia objeto de medida. Además, el evaluado tiene la posibilidad de enviar mensajes
por iniciativa propia.
Aplicada a una muestra de 261 profesionales de una entidad financiera de ámbito multinacional los resultados muestran una buena capacidad de discriminación de la prueba y una
baja consistencia interna (alpha=.52) -en línea con otros test situacionales-.
64—65
Cheating en administración de Test por
Internet. Análisis del Test de Verificación eCat.
— David Aguado1, Julio Olea2, Vicente Ponsoda2
y Francisco Abad2
RESUMEN
El uso de internet ha modificado en los últimos tiempos las prácticas de reclutamiento y selección desarrolladas por las compañías y es común que tanto el proceso de reclutamiento
como los procesos de evaluación inicial se realicen a través de internet en lo que se conoce
como Unproctored Internet Testing (UIT). A pesar de las grandes ventajas asociadas al UIT
en términos de costes y flexibilidad es una práctica cuanto menos controvertida debido a
problemas relacionados con el engaño (cheating). Para controlarlo, la recomendación de
la International Test Comission es la de realizar una evaluación en un entorno controlado y
comparar las puntuaciones en la condición UIT y la controlada para identificar a las personas que engañan. El objetivo del trabajo que se presenta es analizar los resultados que se
obtienen al implantar un procedimiento específico de verificación sobre un Test Adaptativo
Informatizado para la evaluación del nivel de inglés: eCat; aplicado en un proceso real de
selección de personal.
Se evaluó a una muestra de 417 candidatos a puestos de trabajo como ingenieros en una
multinacional española del sector energético. Los resultados iniciales muestran como el
procedimiento implementado es eficaz en la detección del cheating detectándose que 44
personas (10.55%) habían utilizado algún tipo de ayuda adicional en el entorno UIT.
1
2
Instituto de Ingeniería del Conocimiento
Departamento de Psicología Social y Metodología. Universidad Autónoma de Madrid.
Batería de aptitudes de TEA:
Fiabilidad y evidencias de validez.
— David Arribas Águila
Dpto. de I+D+i de TEA Ediciones
RESUMEN
El BAT-7 es una nueva batería que permite estimar la inteligencia y evaluar 8 aptitudes cognitivas. Se estructura en 3 niveles de dificultad creciente y está enfocada a la evaluación de
escolares, universitarios y adultos.
Método: La muestra de tipificación estuvo compuesta por 4.263 escolares y 1.507 adultos
con diferente grado de formación. Se ha utilizado un modelo logístico unidimensional de 3
parámetros y se ha estudiado el ajuste mediante los residuos estandarizados y el estadístico
chi-cuadrado. La fiabilidad se ha analizado utilizando el coeficiente alfa ordinal y las funciones de información. Para el estudio de la validez se desarrolló un modelo AFC multigrupo.
Resultados: El modelo TRI de 3 parámetros mostró un adecuado ajuste en todos los ítems.
Los valores de fiabilidad oscilaron entre 0,79 y 0,91 para los tests y entre 0,91 y 0,97 para
los índices de inteligencia. El modelo estructural basado en la teoría CHC presentó un buen
ajuste a los datos empíricos.
Discusión: El BAT-7 supone una medida informativa y altamente fiable para la evaluación
de las aptitudes cognitivas. Igualmente, mediante su aplicación puede obtenerse una estimación bastante razonable de la capacidad general (g), de la inteligencia fluida (Gf) y de la
inteligencia cristalizada (Gc).
Correspondencia:
david.arribas@teaediciones.com
c/ Fray Bernardino Sahagún, 24.
28036 Madrid
66—67
Papel del psicólogo del trabajo en el
reclutamiento y selección de personas.
— Gloria Castaño, Vicente Ponsoda, M. Ángeles Romeo,
Pedro Sanz, Ana Sanz, José Manuel Chamorro, C. Manuel
Leal, Yolanda Peinador, Jesús Martínez, Laura Alonso, M.
Ely Iglesias, Francisco Álvarez y Luis Picazo
Grupo de Trabajo de Buenas Prácticas en Reclutamiento y Selección del Colegio Oficial
de Psicólogos de Madrid
RESUMEN
En este estudio se presentan los resultados de una encuesta realizada por el Grupo de Trabajo de Buenas Prácticas en Reclutamiento y Selección del Colegio Oficial de Psicólogos
de Madrid. Su principal objetivo es el de clarificar el papel de los psicólogos en materia de
Reclutamiento y Selección, tanto en los departamentos de Recursos Humanos como en las
consultoras que prestan servicios en esta área.
Para el estudio se seleccionaron 129 empresas, entre las que se encontraban las del IBEX35 y otras representativas de los diferentes sectores de actividad, así como las consultoras
más relevantes. La tasa de respuesta fue del 29,2%
Los resultados indican que la mayoría de los responsables de reclutamiento y selección son
psicólogos (65,8%) y que sigue siendo la titulación preferida a la hora de incorporar profesionales que se dediquen a esta actividad. La participación de los psicólogos es relevante
en todas las fases del proceso de R&S: en la clarificación de la demanda (76,3%), en la
elaboración del perfil de exigencias (84,2%), en el reclutamiento y captación de candidatos
(89,5%) y en la preselección de candidatos (92,1%). En la fase de evaluación, los psicólogos son los principales responsables de evaluar inteligencia, aptitudes, personalidad y
competencias (81,6%), de la elaboración de informes (89,5%) y participan en la toma de
decisiones sobre el candidato elegido (76,3%). Su involucración es algo menor en los seguimientos de la incorporación (57,9%), desvinculación o salida (28, 9%) y en la auditoria
del proceso (44,7%). En cuanto a las técnicas utilizadas, la mayoría (73,7%) utilizan test
psicológicos. El póster ofrecerá otras informaciones recogidas en la encuesta.
¿Cómo puntuar las diferentes opciones
en un ítem de un test situacional?.
— Miguel Ángel Sorrel1, Francisco José Abad1,
Vicente Ponsoda1, Julio Olea1, Juan Francisco
Riesco2, Maria Luisa Vidania2
RESUMEN
Mientras que en los test de habilidad cognitiva está claro cuál de las opciones a un ítem es
la correcta, en los test de juicio situacional a menudo no existe una alternativa objetivamente
correcta, de tal manera que varias de las opciones pueden resultar plausibles. Esto hace
necesario el desarrollo de procedimientos de asignación de puntuaciones. En el presente
estudio se han generado dos alternativas a la propuesta original para valorar las respuestas al Test de integridad de Becker (Becker, 2005). Se tradujo la prueba al castellano y se
aplicó a una muestra de 182 personas. Para comprobar la validez convergente y discriminante del test, se aplicaron también medidas de los cinco rasgos básicos de personalidad
(NEO-FFI), la escala de deseabilidad social de Marlowe y Crowne (SDS) y un cuestionario
de Integridad construido al efecto. Los resultados indican que las dos puntuaciones alternativas dan lugar a mejores propiedades psicométricas en lo referido a fiabilidad y validez.
1
Universidad Autónoma de Madrid
2
Gabinete Psicopedagógico de la Academia de Policía Local de la Comunidad de Madrid.
68—69
NOTAS
70—71
72—73
http://www.iic.uam.es/catedras/map