eOSCE stations live versus remote evaluation and scores variability

Donia Bouzid; Jimmy Mullaert; Aiham Ghazali; Valentine Marie Ferré; France Mentré; Cédric Lemogne; Philippe Ruszniewski; Albert Faye; Alexy Tran Dinh; Tristan Mirault; Université Paris Cité Osce study group

doi:10.1186/s12909-022-03919-1

eOSCE stations live versus remote evaluation and scores variability

BMC Med Educ. 2022 Dec 13;22(1):861. doi: 10.1186/s12909-022-03919-1.

Authors

Donia Bouzid^{1

2}, Jimmy Mullaert³, Aiham Ghazali³, Valentine Marie Ferré^{3

4}, France Mentré^{3

5

6}, Cédric Lemogne^{6

7

8}, Philippe Ruszniewski^{6

9}, Albert Faye^{6

10}, Alexy Tran Dinh^{6

11}, Tristan Mirault^{6

12

13}; Université Paris Cité Osce study group

Collaborators

Université Paris Cité Osce study group:
Nathan Peiffer Smadja, Léonore Muller, Laure Falque Pierrotin, Michael Thy, Maksud Assadi, Sonia Yung, Christian de Tymowski, Quentin le Hingrat, Xavier Eyer, Paul Henri Wicky, Mehdi Oualha, Véronique Houdouin, Patricia Jabre, Dominique Vodovar, Marco Dioguardi Burgio, Noémie Zucman, Rosy Tsopra, Asmaa Tazi, Quentin Ressaire, Yann Nguyen, Muriel Girard, Adèle Frachon, François Depret, Anna Pellat, Adèle de Masson, Henri Azais, Nathalie de Castro, Caroline Jeantrelle, Nicolas Javaud, Alexandre Malmartel, Constance Jacquin de Margerie, Benjamin Chousterman, Ludovic Fournel, Mathilde Holleville, Stéphane Blanche

Affiliations

¹ Université Paris Cité and Université Sorbonne Paris Nord, Inserm IAME, F-75018, Paris, France. [email protected].
² Emergency Department, Bichat-Claude Bernard University Hospital AP-HP, Paris, France. [email protected].
³ Université Paris Cité and Université Sorbonne Paris Nord, Inserm IAME, F-75018, Paris, France.
⁴ Virology laboratory, Bichat-Claude Bernard University Hospital AP-HP, Paris, France.
⁵ Département d'Épidémiologie, Biostatistique et Recherche Clinique, Bichat-Claude Bernard University Hospital AP-HP, Paris, France.
⁶ UFR de Médecine, Université Paris Cité, Paris, France.
⁷ Université Paris Cité, INSERM U1266, Institut de Psychiatrie et Neuroscience de Paris, F-75014, Paris, France.
⁸ Service de Psychiatrie de l'adulte, AP-HP, Hôpital Hôtel-Dieu, F-75004, Paris, France.
⁹ Service de gastro-entérologie et pancréatologie, Hôpital Beaujon AP-HP, Paris, France.
¹⁰ Service de Pédiatrie Générale, Hôpital Robert Debré AP-HP, Paris, France.
¹¹ Département d'Anesthésie-Réanimation, Hôpital Bichat-Claude Bernard, AP-HP, Paris, France.
¹² Département de médecine vasculaire, Hôpital Européen Georges Pompidou AP-HP, Paris, France.
¹³ Université Paris Cité, PARCC team 5, INSERM U970, F-75015, Paris, France.

Abstract

Background: Objective structured clinical examinations (OSCEs) are known to be a fair evaluation method. These recent years, the use of online OSCEs (eOSCEs) has spread. This study aimed to compare remote versus live evaluation and assess the factors associated with score variability during eOSCEs.

Methods: We conducted large-scale eOSCEs at the medical school of the Université de Paris Cité in June 2021 and recorded all the students' performances, allowing a second evaluation. To assess the agreement in our context of multiple raters and students, we fitted a linear mixed model with student and rater as random effects and the score as an explained variable.

Results: One hundred seventy observations were analyzed for the first station after quality control. We retained 192 and 110 observations for the statistical analysis of the two other stations. The median score and interquartile range were 60 out of 100 (IQR 50-70), 60 out of 100 (IQR 54-70), and 53 out of 100 (IQR 45-62) for the three stations. The score variance proportions explained by the rater (ICC rater) were 23.0, 16.8, and 32.8%, respectively. Of the 31 raters, 18 (58%) were male. Scores did not differ significantly according to the gender of the rater (p = 0.96, 0.10, and 0.26, respectively). The two evaluations showed no systematic difference in scores (p = 0.92, 0.053, and 0.38, respectively).

Conclusion: Our study suggests that remote evaluation is as reliable as live evaluation for eOSCEs.

Keywords: Global ratings; Interrater reliability; Remote objective structured clinical examination.

MeSH terms

Clinical Competence*
Educational Measurement* / methods
Female
Humans
Male
Reproducibility of Results
Schools, Medical
Students