flowchart LR
A[Explain] --> B[Predict]
B --> C[Reflect]
C --> D[Revise]
D --> B
Not All Chatbots Teach
Evidence for Pedagogical Design in AI-Assisted Technical Education
This is a presentation for a talk for a SIGCITE 2025 conference. As generative AI tools like ChatGPT become embedded in technical education, a critical challenge emerges: how can we ensure these tools foster learning rather than bypass it? This study provides empirical evidence that pedagogical design, not merely model access, determines the educational value of AI assistants. We developed a freely available custom conversational AI tool that embeds metacognitive scaffolding through structured prompts grounded in the Feynman Technique and learning science literature. In a quasi-experimental study within an undergraduate data structures course (\(N=36\)), students using this structured AI assistant significantly outperformed peers using the same interface configured as a minimally prompted ChatGPT wrapper (92.7 vs. 74.3, \(p<.001\)). Gains were especially strong in abstraction, technical justification, and documentation, which are skills critical across software engineering, IT, and cybersecurity. These findings underscore a key insight: AI-integrated learning environments must be intentionally designed to prompt reflection, prediction, and explanation. By aligning AI interactions with evidence-based pedagogy, our framework demonstrates how to develop conceptual understanding, reduce automation bias, and support equitable learning outcomes as AI reshapes computing education.
Why this paper/talk?
Problem & Research Question
Problem framing
Research question
Does integrating metacognitive scaffolding into an AI assistant improve student performance compared to an unstructured Generative AI wrapper with the same model access?
Pedagogical Tool Design
Grounded in The Feynman Technique: Explain → Predict → Reflect → Revise
Figure 2: Scaffold operationalizing the Feynman-inspired Explain → Predict → Reflect → Revise loop.
Pedagogical Learning Theories
Integrates proven learning theories into the system design to enhance the learning experience:
- Feynman Technique [19]: explain concepts in one’s own words to enhance retention and comprehension.
- Cognitive Load Theory [20]: convert verbose analyzer output → concise, relevant guidance (reduce extraneous load).
- Zone of Proximal Development [21]: feedback level aligned to course maturity (scaffolding).
Pedalogical Application

Pedalogical Question Nodes

Pedalogical LLM Question Generation

Prompt intents & light statefulness
- Concept Articulation (own words) → clarify mental model.
- System Reasoning (constraints) → apply principles.
- Diagnostic → surface gaps/assumptions.
- Justification → trade‑offs and rationale.
- Light statefulness → earlier answers steer follow‑ups (e.g., check prior claims).
Beyond programming
- Networking: Subnet reasoning; routing constraints; misconfiguration diagnosis; DNS failure prediction
- Cybersecurity: Risk analysis; firewall rule revision; trade-off justification
- Software Engineering: Debugging; testing rationale; cause–effect reasoning
- Data Science / AI: Model evaluation; metric justification; bias and leakage detection
- Systems Administration: Failure cascade prediction; assumption checking
- Project Management / DevOps: Design trade-offs; pipeline optimization; resource reasoning
Study Design
Context & participants
- Undergraduate Data Structures, Spring 2025, liberal arts university.
- Two sections: Structured assistant (n=19) vs. Unstructured wrapper (n=17).
- Same instructor, assignments, rubric, incentives.
Assignment & conditions
- Project: “Bistro Ordering System.”
- Requires data‑structure selection/justification, working program, and documentation.
- Treatment: gated E‑P‑R‑R prompts with reflection checkpoints.
- Control: same UI; minimal pre‑prompt; free ChatGPT queries; no scaffolds.
Treatment: Interaction Snapshot (Structured Assistant)

Conversation scaffold operationalizing the Feynman-inspired Explain → Predict → Reflect → Revise loop.
Control: Interaction Snapshot (Unstructured Wrapper)

Conversation scaffold operationalizing the Feynman-inspired Explain → Predict → Reflect → Revise loop.
Measures & analysis
- Blind TA grading; five 20‑pt rubric dimensions.
- Overall scores; category scores; logs; short survey.
- Ethics: IRB‑approved; de‑identified; participation voluntary.
Results
Overall performance
- Structured: 92.7 (SD 3.8) vs. Unstructured: 74.3 (SD 10.2).
- Independent‑samples t‑test: t = 6.93, p < .000001; Cohen’s d = 2.14.
- Lower variance in treatment → more equitable outcomes.
Where did it help most?
- Functional implementation (p < .0001)
- Data structure usage & justification (p = .0002)
- Report accuracy (p = .0005)
- Documentation quality (p < .0001)
- No sig. diff. in code modularity/style (p = .27).
Interpretation
- Pedagogical structure—not just access—drives conceptual gains.
- Scaffolding mitigates automation bias; promotes reflective practice. [11, 23]
- Mirrors prior findings on explanation‑based learning and metacognition. [1, 16, 23]
Total Project Scores by Condition

Boxplot illustrating higher average and lower variance for the structured condition. Based on synthetic samples approximating reported summary statistics (92.7±3.8 vs. 74.3±10.2; n=19/17).
Category Scores by Condition

Schematic relative visualization reflecting significant gains in four rubric categories and no sig. difference in modularity/style. Exact means not reported in paper; shown as proportional (baseline=1.0).
Validity & Limitations
Threats to validity
- Quasi‑experimental; section assignment (selection bias).
- Engagement confound (gated prompts vs. free use).
- Product‑oriented outcomes; limited metacognitive measures.
- Single institution/course; evolving LLM behavior.
Conclusion & Future Work
Takeaways
- AI’s impact is not pedagogically neutral. Design matters.
- E‑P‑R‑R scaffolding produced large gains (d = 2.14) where it counts.
- Provides a replicable blueprint aligned with IT2017/CS2023. [4, 14]
Where next?
- Randomized, cross‑institutional replications.
- Domain‑specific scaffolds (cybersecurity, systems, networking).
- Longitudinal learning & transfer; adaptive scaffolds; instructor authoring tools.
Thank You!
Lucas Cordova
Join our lightning talk!
- 11:15 AM - 11:30 AM: Room LRC-107