Page 716 - ISC PROCEEDINGS 21.4
P. 716

The four conversational tasks are summarized according to their learning objectives
                  in Table 1.
                               Table 1. Structured conversational tasks by learning objective
                         Core Learning                                                        Practice
                   No.                               Conversational Task Content
                         Objective                                                             Article
                          Distinguish  IRR,  Students query the AI to understand why the
                         aOR, RR, HR by      SCCS design uses IRR instead of aOR/RR;        Maret-Ouda
                                                                                               et
                                                                                            J
                                                                                                   al. J
                   1     statistical nature  interpret the IRR value and 95% CI in clinical  Gastroenter
                         and conditions of   language; analyze the significance of a CI
                         application         that does not include 1                        ol. 2023 [16]
                                             Students query the AI to perform step-by-      Bhatt DL et
                          Calculate    and   step calculation of ARR and NNT from two-      al.
                         interpret     ARR
                   2                         group event rates; interpret the clinical      (COGENT).
                         and NNT from
                         clinical trial data  significance of NNT; critique the limitations  NEngl J Med.
                                             of HR alone for prescribing decisions          2010 [18]
                          Read         and
                         interpret   meta-
                         analysis  results;  Students query the AI to interpret pooled RR
                         understand    the   and confidence interval; analyze why a         Wu Z et al.
                                                                                            Front
                   3     relationship        large meta-analysis may not reach p < 0.05;    Pharmacol.
                         between sample      evaluate I² and the remaining uncertainty
                         size, effect size,  regarding the PPI-pneumonia association        2025 [17]
                         and     statistical
                         significance
                          Identify     and   Students query the AI to enumerate and
                         control      for    classify potential confounders in the SCCS     Maret-Ouda
                         confounding         design and meta-analysis; evaluate the         et al. [16] &
                   4                                                                        Wu et. al.
                         variable        in  authors’ confounding control strategies;       [17]
                         observational       compare    confounding    control   strength
                         studies             between SCCS and RCT                           (combined)


                        Each conversational task comprises 3-4 cognitively oriented questions following
                  Bloom’s taxonomy levels 2-5. The instructor did not intervene in the content of student-AI
                  interactions. Guidance was provided before each session, and debriefing was conducted
                  afterward.
                        2.6. Outcome variables and indicators
                        Primary outcome: proportion of students achieving correct answers for each
                  pharmacoepidemiological indicator domain (binary variable; unit: percentage points).
                  Secondary outcomes: (1) task completion time (minutes); (2) subjective self-confidence
                  (Likert 1-5); (3) comparison of results between ChatGPT and Gemini groups at T1.
                        2.7. Statistical analysis
                        All eligible students enrolled in the Pharmacoepidemiology course were included
                  using census sampling. Because the study aimed to evaluate an educational intervention
                  within a single class, no a priori sample size calculation was performed.
                        Binary variables (pre-post correctness rates in the same individuals): McNemar test.
                  Normally distributed continuous variables (task completion time): paired t-test. Ordinal
                  variables (Likert): Wilcoxon signed-rank test. Independent comparison of the two AI

                  715
   711   712   713   714   715   716   717   718   719   720   721