Title
Bandit Algorithms To Personalize Educational Chatbots
Abstract
To emulate the interactivity of in-person math instruction, we developed MathBot, a rule-based chatbot that explains math concepts, provides practice questions, and offers tailored feedback. We evaluated MathBot through three Amazon Mechanical Turk studies in which participants learned about arithmetic sequences. In the first study, we found that more than 40% of our participants indicated a preference for learning with MathBot over videos and written tutorials from Khan Academy. The second study measured learning gains, and found that MathBot produced comparable gains to Khan Academy videos and tutorials. We solicited feedback from users in those two studies to emulate a real-world development cycle, with some users finding the lesson too slow and others finding it too fast. We addressed these concerns in the third and main study by integrating a contextual bandit algorithm into MathBot to personalize the pace of the conversation, allowing the bandit to either insert extra practice problems or skip explanations. We randomized participants between two conditions in which actions were chosen uniformly at random (i.e., a randomized A/B experiment) or by the contextual bandit. We found that the bandit learned a similarly effective pedagogical policy to that learned by the randomized A/B experiment while incurring a lower cost of experimentation. Our findings suggest that personalized conversational agents are promising tools to complement existing online resources for math education, and that data-driven approaches such as contextual bandits are valuable tools for learning effective personalization.
Year
DOI
Venue
2021
10.1007/s10994-021-05983-y
MACHINE LEARNING
Keywords
DocType
Volume
Chatbot, Contextual bandit, Online education, Online experimentation, Reinforcement learning
Journal
110
Issue
ISSN
Citations 
9
0885-6125
0
PageRank 
References 
Authors
0.34
30
7
Name
Order
Citations
PageRank
William Cai100.34
Josh Grossman200.34
Zhiyuan Lin3887.40
Hao Sheng400.68
Johnny Tian-Zheng Wei500.34
Joseph Jay Williams613.09
Sharad Goel763134.81