Title
RoCBert: Robust Chinese Bert with Multimodal Contrastive Pretraining
Abstract
Large-scale pretrained language models have achieved SOTA results on NLP tasks. However, they have been shown to be vulnerable to adversarial attacks, especially for logographic languages like Chinese. In this work, we propose ROCBERT: a pretrained Chinese Bert that is robust to various forms of adversarial attacks such as word perturbation, synonyms, typos, etc. It is pretrained with a contrastive learning objective that maximizes label consistency under different synthesized adversarial examples. The model takes multimodal information as input, including semantic, phonetic and visual features. We show that all these features are important to model robustness, since the attack can be performed in all three forms. Across 5 Chinese NLU tasks, ROCBERT outperforms strong baselines under three blackbox adversarial algorithms without sacrificing performance on the clean test set. It also performs best on the toxic content detection task under human-made attacks.
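The sketch below illustrates the contrastive consistency idea the abstract describes: the representation of a clean sentence is pulled toward the representation of its synthesized adversarial variant, with other examples in the batch acting as negatives. It is a minimal, hypothetical sketch only; the function names, the additive fusion of semantic/phonetic/visual features, and the InfoNCE form of the loss are assumptions for illustration, not the authors' implementation.

```python
# Hypothetical sketch of contrastive pretraining on clean/adversarial pairs.
# Names and fusion scheme are illustrative assumptions, not ROCBERT's code.
import torch
import torch.nn.functional as F


def multimodal_embed(semantic, phonetic, visual):
    """Fuse semantic, phonetic and visual features (assumed additive fusion)."""
    return semantic + phonetic + visual


def contrastive_consistency_loss(clean_repr, adv_repr, temperature=0.05):
    """InfoNCE-style loss: each clean sentence should be closest to its own
    adversarial counterpart among all counterparts in the batch."""
    clean = F.normalize(clean_repr, dim=-1)       # (batch, hidden)
    adv = F.normalize(adv_repr, dim=-1)           # (batch, hidden)
    logits = clean @ adv.t() / temperature        # (batch, batch) similarities
    targets = torch.arange(clean.size(0), device=clean.device)
    return F.cross_entropy(logits, targets)


if __name__ == "__main__":
    batch, hidden = 8, 768
    # Stand-ins for sentence-level encodings of the three input modalities.
    sem, pho, vis = (torch.randn(batch, hidden) for _ in range(3))
    clean_repr = multimodal_embed(sem, pho, vis)
    # Toy stand-in for the encoding of a synthesized adversarial example.
    adv_repr = clean_repr + 0.1 * torch.randn(batch, hidden)
    print(contrastive_consistency_loss(clean_repr, adv_repr))
```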
Year: 2022
DOI: 10.18653/v1/2022.acl-long.65
Venue: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL 2022), Vol. 1: Long Papers
DocType: Conference
Volume: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Citations: 0
PageRank: 0.34
References: 0
Authors: 7
Name          Order   Citations   PageRank
Hui Su        1       38          5.70
Weiwei Shi    2       0           0.34
Xiaoyu Shen   3       0           0.34
Zhou Xiao     4       0           0.34
Tuo Ji        5       0           0.68
Jiarui Fang   6       0           0.34
Jie Zhou      7       2103        190.17