Adaptive Training Environment without Prior Knowledge: Modeling Feedback Selection as a Multi-armed Bandit Problem. - Citegraph

Paper Info

Title
Adaptive Training Environment without Prior Knowledge: Modeling Feedback Selection as a Multi-armed Bandit Problem.

Abstract
Pedagogical Action Selection (PAS) is a major issue for intelligent tutoring and training systems. Expert knowledge provides useful insights to build strategies that relate students representation to PAS, but it can be difficult to collect. Furthermore, the influence of a specific action may vary across students, which is rarely reflected in expert knowledge. As part of an automatic gesture training system, we propose to model the co-evolution between a student and a training environment in order to provide personalized action selection. The proposed approach is based on three models representing the student, the environment, and the interactions between these two entities. The latter model sees the PAS as a multi-armed bandit problem, each arm representing a possible action. Thus, PAS personalization only relies on the interactions between the student and the learning environment, without any prior knowledge. Two experiments, one in a simulated environment and a second in a calligraphy training environment, highlight the model ability to personalize action selection, and the benefits of this ability on students skill acquisition.

Year	DOI	Venue
2016	10.1145/2930238.2930256	UMAP
Field	DocType	Citations
Intelligent tutoring system,Training system,Computer science,Dreyfus model of skill acquisition,Artificial intelligence,Learning environment,Multi-armed bandit,Action selection,Machine learning,Knowledge modeling,Personalization	Conference	2
PageRank	References	Authors
0.38	9	4

Authors (4 rows)

Cited by (2 rows)

References (9 rows)

Name	Order	Citations	PageRank
Rémy Frenoy	1	2	0.38
Yann Soullard	2	18	4.83
Indira Thouvenin	3	44	8.05
Olivier Gapenne	4	71	10.22

1