Search

LO83: Effect of the transition to an entrustability scale on assessor stringency and leniency on daily encounter cards in emergency medicine
S. Dewhirst, W. Cheung, N. Dudek, T. Wood, J. Frank
Journal:

Canadian Journal of Emergency Medicine / Volume 22 / Issue S1 / May 2020

Published online by Cambridge University Press:

13 May 2020, p. S37

Print publication:

May 2020
- Article
- - You have access
- PDF
- Export citation
Introduction: Workplace based assessments (WBAs) are integral to emergency medicine residency training. However many biases undermine their validity, such as an assessor's personal inclination to rate learners leniently or stringently. Outlier assessors produce assessment data that may not reflect the learner's performance. Our emergency department introduced a new Daily Encounter Card (DEC) using entrustability scales in June 2018. Entrustability scales reflect the degree of supervision required for a given task, and are shown to improve assessment reliability and discrimination. It is unclear what effect they will have on assessor stringency/leniency – we hypothesize that they will reduce the number of outlier assessors. We propose a novel, simple method to identify outlying assessors in the setting of WBAs. We also examine the effect of transitioning from a norm-based assessment to an entrustability scale on the population of outlier assessors. Methods: This was a prospective pre-/post-implementation study, including all DECs completed between July 2017 and June 2019 at The Ottawa Hospital Emergency Department. For each phase, we identified outlier assessors as follows: 1. An assessor is a potential outlier if the mean of the scores they awarded was more than two standard deviations away from the mean score of all completed assessments. 2. For each assessor identified in step 1, their learners’ assessment scores were compared to the overall mean of all learners. This ensures that the assessor was not simply awarding outlying scores due to working with outlier learners. Results: 3927 and 3860 assessments were completed by 99 and 116 assessors in the pre- and post-implementation phases respectively. We identified 9 vs 5 outlier assessors (p = 0.16) in the pre- and post-implementation phases. Of these, 6 vs 0 (p = 0.01) were stringent, while 3 vs 5 (p = 0.67) were lenient. One assessor was identified as an outlier (lenient) in both phases. Conclusion: Our proposed method successfully identified outlier assessors, and could be used to identify assessors who might benefit from targeted coaching and feedback on their assessments. The transition to an entrustability scale resulted in a non-significant trend towards fewer outlier assessors. Further work is needed to identify ways to mitigate the effects of rater cognitive biases.

MP51: The relationship between entrustment scores in the simulated and workplace environments among emergency medicine residents
N. Prudhomme, M. O'Brien, M. McConnell, N. Dudek, W. Cheung
Journal:

Canadian Journal of Emergency Medicine / Volume 22 / Issue S1 / May 2020

Published online by Cambridge University Press:

13 May 2020, p. S61

Print publication:

May 2020
- Article
- - You have access
- PDF
- Export citation
Introduction: The Emergency Medicine Specialty Committee of the Royal College of Physicians and Surgeons of Canada (RCPSC) has specified that resuscitation Entrustable Professional Activities (EPAs) can be assessed in either the workplace or simulation environments; however, there is minimal evidence that such clinical performance correlates. We sought to determine the relationship between assessments in the workplace versus simulation environments among junior emergency medicine residents. Methods: We conducted a prospective observational study to compare workplace and simulation resuscitation performance among all first-year residents (n = 9) enrolled in the RCPSC-Emergency Medicine program at the University of Ottawa. All scores from Foundations EPA #1 (F1) were collected during the 2018-2019 academic year; this EPA focuses on initiating and assisting in the resuscitation of critically ill patients. Workplace performance was assessed by clinical supervisors by direct observation during clinical shifts. Simulation performance was assessed by trained simulation educators during regularly-scheduled sessions. We present descriptive statistics and within-subjects analyses of variance. Results: We collected a total of 104 workplace and 36 simulation assessments. Interobserver reliability of simulation assessments was high (ICC = 0.863). We observed no correlation between mean EPA scores assigned in the workplace and simulation environments (Spearman's rho=−0.092, p = 0.813). Scores in both environments improved significantly over time (F(1,8) = 18.79, p < 0.001, ηp2 = 0.70), from 2.9(SD = 1.2) in months 1-4 to 3.5(0.2) in months 9-12 (p = 0.002). Workplace scores (3.4(0.1)) were consistently higher than simulation scores (2.9(0.2)) (F(1,8) = 7.16, p = 0.028, ηp2 = 0.47). Conclusion: We observed no correlation between EPA F1 ratings of resuscitation performance between the workplace and simulation environments. Further studies should seek to clarify this relationship to inform our ongoing use of simulation to assess clinical competence.

LO75: Does the Ottawa emergency department shift observation tool give more useful information – assessing the utility of transitioning to a novel, entrustability based assessment tool in the emergency department
S. Dewhirst, W. Cheung, N. Dudek, T. Wood, J. Frank
Journal:

Canadian Journal of Emergency Medicine / Volume 22 / Issue S1 / May 2020

Published online by Cambridge University Press:

13 May 2020, p. S34

Print publication:

May 2020
- Article
- - You have access
- PDF
- Export citation
Introduction: The Ottawa Emergency Department Shift Observation Tool (O-EDShOT) was recently developed to assess a resident's ability to safely run an ED shift and is supported by multiple sources of validity evidence. The O-EDShOT uses entrustability scales, which reflect the degree of supervision required for a given task. It was found to discriminate between learners of different levels, and to differentiate between residents who were rated as able to safely run the shift and those who were not. In June 2018 we replaced norm-based daily encounter cards (DECs) with the O-EDShOT. With the ideal assessment tool, most of the score variability would be explained by variability in learners’ performances. In reality, however, much of the observed variability is explained by other factors. The purpose of this study is to determine what proportion of total score variability is accounted for by learner variability when using norm-based DECs vs the O-EDShOT. Methods: This was a prospective pre-/post-implementation study, including all daily assessments completed between July 2017 and June 2019 at The Ottawa Hospital ED. A generalizability analysis (G study) was performed to determine what proportion of total score variability is accounted for by the various factors in this study (learner, rater, form, pgy level) for both the pre- and post- implementation phases. We collected 12 months of data for each phase, because we estimated that 6-12 months would be required to observe a measurable increase in entrustment scale scores within a learner. Results: A total of 3908 and 3679 assessments were completed by 99 and 116 assessors in the pre- and post- implementation phases respectively. Our G study revealed that 21% of total score variance was explained by a combination of post-graduate year (PGY) level and the individual learner in the pre-implementation phase, compared to 59% in the post-implementation phase. An average of 51 vs 27 forms/learner are required to achieve a reliability of 0.80 in the pre- and post-implementation phases respectively. Conclusion: A significantly greater proportion of total score variability is explained by variability in learners’ performances with the O-EDShOT compared to norm-based DECs. The O-EDShOT also requires fewer assessments to generate a reliable estimate of the learner's ability. This study suggests that the O-EDShOT is a more useful assessment tool than norm-based DECs, and could be adopted in other emergency medicine training programs.

P042: Workplace-based assessment in emergency medicine: how do physicians use entrustment anchors?
T. Robinson, N. Wagner, A. Szulewski, N. Dudek, W. Cheung, A. Hall
Journal:

Canadian Journal of Emergency Medicine / Volume 22 / Issue S1 / May 2020

Published online by Cambridge University Press:

13 May 2020, p. S79

Print publication:

May 2020
- Article
- - You have access
- PDF
- Export citation
Introduction: Competency based medical education (CBME) has triggered widespread utilization of workplace-based assessment (WBA) tools in postgraduate training programs. These WBAs predominately use rating scales with entrustment anchors, such as the Ottawa Surgical Competency Operating Room Evaluation (O-SCORE). However, little is known about the factors that influence a supervising physician's decision to assign a particular rating on scales using entrustment anchors. This study aimed to identify the factors that influence supervisors’ ratings of trainees using WBA tools with entrustment anchors at the time of assessment and to explore the experiences with and challenges of using entrustment anchors in the emergency department (ED). Methods: A convenience sample of full-time emergency medicine (EM) faculty were recruited from two sites within a single academic Canadian EM hospital system. Fifty semi-structured interviews were conducted with EM physicians within two hours of completing a WBA for an EM trainee. Interviews were audio-recorded, transcribed verbatim, and independently analyzed by two members of the research team. Themes were stratified by trainee level, rating and task. Results: Interviews involved 73% (27/37) of all EM staff and captured assessments completed on 83% (37/50) of EM trainees. The mean WBA rating of studied samples was 4.34 ± 0.77 (2 to 5), which was similar to the mean rating of all WBAs completed during the study period. Overall, six major factors were identified that influenced staff WBA ratings: amount of guidance required, perceived competence through discussion and questioning, trainee experience, clinical context, past experience working with the trainee, and perceived confidence. The majority of staff denied struggling to assign ratings. However, when they did struggle, it involved the interpretation of WBA anchors and their application to the clinical context in the ED. Conclusion: Several factors appear to be taken into account by clinical supervisors when they make decisions regarding the particular rating that they will assign a trainee on a WBA that uses entrustment anchors. Not all of these factors are specific to that particular clinical encounter. The results from this study further our understanding on the use of entrustment anchors within the ED and may facilitate faculty development regarding WBA completion as we move forward in CBME.

LO84: Ready to run the show: development of a new instrument for assessing resident competence in the emergency department
W. Cheung, W. Gofton, T. Wood, M. Duffy, S. Dewhirst, N. Dudek
Journal:

Canadian Journal of Emergency Medicine / Volume 21 / Issue S1 / May 2019

Published online by Cambridge University Press:

02 May 2019, p. S38

Print publication:

May 2019
- Article
- - You have access
- PDF
- Export citation
Innovation Concept: The outcome of emergency medicine training is to produce physicians who can competently run an emergency department (ED) shift. While many workplace-based ED assessments focus on discrete tasks of the discipline, others emphasize assessment of performance across the entire shift. However, the quality of assessments is generally poor and these tools often lack validity evidence. The use of entrustment scale anchors may help to address these psychometric issues. The aim of this study was to develop and gather validity evidence for a novel tool to assess a resident's ability to independently run an ED shift. Methods: Through a nominal group technique, local and national stakeholders identified dimensions of performance reflective of a competent ED physician. These dimensions were included in a new tool that was piloted in the Department of Emergency Medicine at the University of Ottawa during a 4-month period. Psychometric characteristics of the items were calculated, and a generalizability analysis used to determine the reliability of scores. An ANOVA was conducted to determine whether scores increased as a function of training level (junior = PGY1-2, intermediate = PGY3, senior = PGY4-5), and varied by ED treatment area. Safety for independent practice was analyzed with a dichotomous score. Curriculum, Tool or Material: The developed Ottawa Emergency Department Shift Observation Tool (O-EDShOT) includes 12-items rated on a 5-point entrustment scale with a global assessment item and 2 short-answer questions. Eight hundred and thirty-three assessment were completed by 78 physicians for 45 residents. Mean scores differed significantly by training level (p < .001) with junior residents receiving lower ratings (3.48 ± 0.69) than intermediate residents who received lower ratings (3.98 ± 0.48) than senior residents (4.54 ± 0.42). Scores did not vary by ED treatment area (p > .05). Residents judged to be safe to independently run the shift had significantly higher mean scores than those judged not to be safe (4.74 ± 0.31 vs 3.75 ± 0.66; p < .001). Fourteen observations per resident, the typical number recorded during a 1-month rotation, were required to achieve a reliability of 0.80. Conclusion: The O-EDShOT successfully discriminated between junior, intermediate and senior-level residents regardless of ED treatment area. Multiple sources of evidence support the O-EDShOT producing valid scores for assessing a resident's ability to independently run an ED shift.

K-Ar dating of the Lower Palaeozoic K-bentonites from the Baltic Basin and the Baltic Shield: implications for the role of temperature and time in the illitization of smectite
J. Środoń, N. Clauer, W. Huff, T. Dudek, M. Banaś
Journal:

Clay Minerals / Volume 44 / Issue 3 / September 2009

Published online by Cambridge University Press:

09 July 2018, pp. 361-387
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Mixed-layer illite-smectite samples from the Ordovician and Silurian K-bentonites of the Baltic Basin and the Baltic Shield (Norway, Sweden, Denmark, Poland and Estonia) were dated by K-Ar on several grain fractions and were studied by X-ray diffraction (XRD), both on oriented and random preparations, in order to reveal the conditions of smectite illitization in the area. Authigenic K-feldspar was also dated. The geographic pattern of the degree of illitization (% smectite in illite-smectite measured by XRD) is consistent with other indicators of palaeotemperatures (acritarchs, conodont alteration index, vitrinite reflectance, apatite fission track ages). It reveals the highest maximum palaeotemperatures (up to at least 200ºC) along the Norwegian and the German-Polish branches of the Caledonides and the lowest palaeotemperatures (120ºC) in the central part of the studied area. The distribution of K-Ar ages is not well correlated with this pattern, revealing a zone of older ages (Lower Devonian-Lower Carboniferous) between Denmark and Estonia, and areas of younger ages (Upper Devonian to Carboniferous/Permian boundary) to the north and south of this zone. The zone of older ages is interpreted as the result of illitization induced by a thermal event in front of the Caledonian orogenic belt (migration of hot metamorphic fluids?). The areas of younger ages are considered as representing deep burial illitization under a thick Silurian-Carboniferous sedimentary cover, perhaps augmented by a tectonic load. The K-Ar dates invalidate the hypothesis of a long-lasting low-temperature illitization as the mechanism of formation of the Estonian Palaeozoic illite-smectite. The ammonium content of illite-smectite from the Baltic K-bentonites reflects the proximity of organic-rich source rocks that underwent thermal alteration at the time of illite crystallization.

LO080: Performance and proximity: exploring resident factors that impact the quality of work-based assessments
W. Cheung, N. Dudek, T.J. Wood, J.R. Frank
Journal:

Canadian Journal of Emergency Medicine / Volume 18 / Issue S1 / May 2016

Published online by Cambridge University Press:

02 June 2016, pp. S57-S58

Print publication:

May 2016
- Article
- - You have access
- PDF
- Export citation
Introduction: Much of the literature investigating the challenges associated with completing high quality work-based assessments (WBAs) have raised specific concerns over the appropriate documentation of assessments of underperforming trainees or trainees in difficulty. The purpose of this study was to examine the relationship between resident performance and the quality of assessments documented by supervisors on Daily Encounter Cards (DECs). The effect of trainee proximity (i.e. on-service versus off-service status) on this relationship was also examined. Methods: A series of DECs from the Department of Emergency Medicine at the University of Ottawa was scored by two raters using the Completed Clinical Evaluation Report Rating (CCERR). The CCERR is a 9-item instrument that has previously demonstrated reliable ratings and the ability to discriminate the quality of completed DECs. A proxy measure of resident performance was calculated by averaging the scores across performance items on the DEC to produce a “mean DEC rating”. Linear regression analysis was conducted with “mean DEC rating” as the independent measure and CCERR score as the dependent measure. Separate linear regression analyses were repeated for DECs completed for on-service versus off-service residents. Results: Linear regression analysis demonstrated a small but significant inverse relationship between mean DEC rating and CCERR score (p<0.001, r=-0.184), suggesting that when residents performed poorly, their supervisors tended to document higher quality assessments, and conversely, when residents performed well, their supervisors provided lower quality assessments. Further analysis demonstrated that this relationship was present for the on-service group (p<0.001, r=-0.24). However, no relationship was observed in the off-service group (p=0.62, r=-0.05). Conclusion: Resident performance and trainee proximity are important factors impacting the quality of documented clinical performance assessments. Greater attention needs to be given to determining ways of improving the quality of assessments reported for residents who are appropriately progressing in their clinical competence as well as for off-service trainees.

MP002: Beyond rater cognition: the impact of supervisor continuity on the quality of documented work-based assessments
W. Cheung, N. Dudek, T.J. Wood, J.R. Frank
Journal:

Canadian Journal of Emergency Medicine / Volume 18 / Issue S1 / May 2016

Published online by Cambridge University Press:

02 June 2016, pp. S66-S67

Print publication:

May 2016
- Article
- - You have access
- PDF
- Export citation
Introduction: Barriers to completing high quality work-based assessments (WBAs) include relational factors such as the episodic and fragmented interaction that often exists between clinical supervisors and trainees. In an effort to increase supervisor-trainee continuity, the Department of Emergency Medicine at the University of Ottawa created Clinical Teaching Teams (CTT) in which a resident and clinical supervisor work matched shifts together throughout the year. The aim of this study was to determine the impact of supervisor-trainee continuity on the quality of assessments documented on Daily Encounter Cards (DECs). Methods: DECs completed by 20 clinical supervisors were collected and sorted into three groups representing differing degrees of supervisor-trainee continuity (Group 1: CTT emergency resident; Group 2: non-CTT emergency resident; Group 3: non-CTT off-service resident). DECs were scored using the Completed Clinical Evaluation Report Rating (CCERR), a 9-item instrument that has been shown to have reliable ratings and the ability to discriminate the quality of completed DECs. Scores were analyzed using a univariate ANOVA with “mean CCERR score” as the dependent variable and “continuity group” and “supervisor” as between-subject variables. The relationship between CCERR scores and number of CTT encounters over time was examined using a repeated measures ANOVA with “encounter number” as the within-subject factor. Results: Mean CCERR scores for the CTT (21.0, SD=5.8), non-CTT (21.9, SD=4.2), and off-service (20.7, SD=4.0) groups differed (p=0.019). A subsequent pairwise comparison demonstrated a statistically significant difference in means between the non-CTT and off-service groups (p=0.04); however, this 1.2 difference on the 45-point CCERR scale is unlikely to be of any educational significance. The number of repeated encounters did not have a statistically significant effect on CCERR scores (p=0.43) indicating that DEC quality did not improve with greater supervisor-trainee interaction. Conclusion: DEC quality as scored by the CCERR was low for all three groups. Increasing supervisor continuity alone did not result in higher quality assessments of clinical performance. Additional research focusing on the educational alliance that develops between supervisor and trainee may hold greater promise.

MP015: Daily encounter cards: evaluating the quality of documented assessments
W. Cheung, N. Dudek, T.J. Wood, J.R. Frank
Journal:

Canadian Journal of Emergency Medicine / Volume 18 / Issue S1 / May 2016

Published online by Cambridge University Press:

02 June 2016, p. S71

Print publication:

May 2016
- Article
- - You have access
- PDF
- Export citation
Introduction: In response to concerns in the literature over the quality of completed work-based assessments (WBAs), faculty development and rater training initiatives have been developed. The Completed Clinical Evaluation Report Rating (CCERR) was designed to evaluate these interventions by providing a measure of the quality of documented assessments on In-Training Evaluation Reports (ITERs). Daily Encounter Cards (DECs) are a common form of WBA used in the Emergency Department setting. A tool to evaluate initiatives aimed at improving the quality of completion of this widely used WBA is also needed. The purpose of this study was to provide validity evidence to support using the CCERR to assess the quality of DEC completion. Methods: This study was conducted in the Department of Emergency Medicine at the University of Ottawa. Six experts in resident assessment grouped 60 DECs into three quality categories (high, average, poor) based on their perception of how informative each DEC was for reporting judgments of the resident’s performance. Eight clinical supervisors (blinded to the expert groupings) scored the 10 most representative DECs in each group using the CCERR. Mean scores were compared using a univariate ANOVA to determine if the CCERR was able to discriminate DEC quality. Reliability for the CCERR scores was determined using a generalizability analysis. Results: Mean CCERR scores for the high (37.3, SD=1.2), average (24.2, SD=3.3), and poor (14.4, SD=1.4) quality groups differed (p<0.001). A pairwise comparison demonstrated that differences between all three quality groups were statistically significant (p<0.001), indicating that the CCERR was able to discriminate DEC quality as judged by experts. A generalizability study demonstrated the majority of score variation was due to differences in DECs. The reliability with a single rater was 0.95. Conclusion: There is strong validity evidence to support the use of the CCERR to evaluate DEC quality. It can be used to provide feedback to supervisors for improving assessment reporting, and offers a quantitative measure of change in assessor behavior when utilized as a program evaluation instrument for determining the quality of completed DECs.

Search Results

Refine search

Refine search

Actions for selected content:

9 results

LO83: Effect of the transition to an entrustability scale on assessor stringency and leniency on daily encounter cards in emergency medicine

MP51: The relationship between entrustment scores in the simulated and workplace environments among emergency medicine residents

LO75: Does the Ottawa emergency department shift observation tool give more useful information – assessing the utility of transitioning to a novel, entrustability based assessment tool in the emergency department

P042: Workplace-based assessment in emergency medicine: how do physicians use entrustment anchors?

LO84: Ready to run the show: development of a new instrument for assessing resident competence in the emergency department

K-Ar dating of the Lower Palaeozoic K-bentonites from the Baltic Basin and the Baltic Shield: implications for the role of temperature and time in the illitization of smectite

LO080: Performance and proximity: exploring resident factors that impact the quality of work-based assessments

MP002: Beyond rater cognition: the impact of supervisor continuity on the quality of documented work-based assessments

MP015: Daily encounter cards: evaluating the quality of documented assessments

Search Results

Refine search

Refine search

Actions for selected content:

Save Search

9 results