Title
Towards Interpreting and Mitigating Shortcut Learning Behavior of NLU Models
Abstract
Recent studies indicate that NLU models are prone to rely on shortcut features for prediction. As a result, these models may fail to generalize to real-world out-of-distribution (OOD) scenarios. In this work, we show that the shortcut learning behavior can be explained by the long-tailed phenomenon. There are two findings: 1) trained NLU models have a strong preference for features located at the head of the long-tailed distribution, and 2) shortcut features are picked up during the very early iterations of model training. These two observations are further employed to formulate a measurement that quantifies the shortcut degree of each training sample. Based on this shortcut measurement, we propose a shortcut mitigation framework that suppresses the model from making overconfident predictions on samples with a large shortcut degree. Experimental results on three NLU benchmarks demonstrate that our long-tailed distribution explanation accurately reflects the shortcut learning behavior of NLU models. Further analysis indicates that our method improves generalization accuracy on OOD data while preserving accuracy on in-distribution test data.
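To make the mitigation idea in the abstract concrete, below is a minimal sketch (not the authors' exact method) of suppressing overconfident predictions via per-sample label smoothing. It assumes a precomputed `shortcut_degree` score in [0, 1] for each training sample (which the paper derives from long-tailed head features and early-training behavior); the function name `shortcut_aware_loss` is hypothetical.

```python
import torch
import torch.nn.functional as F

def shortcut_aware_loss(logits, labels, shortcut_degree, num_classes):
    """Cross-entropy against targets softened in proportion to each
    sample's shortcut degree: a degree of 0 keeps the one-hot label,
    while larger degrees push the target toward the uniform distribution,
    penalizing overconfidence on likely-shortcut samples."""
    one_hot = F.one_hot(labels, num_classes).float()        # (B, C)
    uniform = torch.full_like(one_hot, 1.0 / num_classes)   # (B, C)
    alpha = shortcut_degree.unsqueeze(1)                    # (B, 1)
    targets = (1.0 - alpha) * one_hot + alpha * uniform     # soft targets
    log_probs = F.log_softmax(logits, dim=1)
    return -(targets * log_probs).sum(dim=1).mean()

# Toy usage with random values standing in for model outputs.
logits = torch.randn(4, 3)                   # batch of 4, 3 NLI classes
labels = torch.tensor([0, 2, 1, 0])
degree = torch.tensor([0.1, 0.9, 0.5, 0.0])  # hypothetical shortcut scores
loss = shortcut_aware_loss(logits, labels, degree, num_classes=3)
```

The design choice here is that smoothing strength varies per sample rather than globally, so easy shortcut-laden examples contribute softened targets while hard examples keep their full one-hot supervision.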
Year
2021
Venue
NAACL-HLT
DocType
Conference
Citations
0
PageRank
0.34
References
0
Authors
8
Name                Order  Citations  PageRank
Mengnan Du          1      941        3.54
Varun Manjunatha    2      68         6.43
Rajiv Jain          3      4          5.16
Ruchi Deshpande     4      0          0.34
Franck Dernoncourt  5      1493       5.39
Jiuxiang Gu         6      23         6.45
Tong Sun            7      0          0.34
Xia Hu              8      24111      10.07