Title
Visualizing and Measuring the Geometry of BERT
Abstract
Transformer architectures show significant promise for natural language processing. Given that a single pretrained model can be fine-tuned to perform well on many different tasks, these networks appear to extract generally useful linguistic features. How do such networks represent this information internally? This paper describes qualitative and quantitative investigations of one particularly effective model, BERT. At a high level, linguistic features seem to be represented in separate semantic and syntactic subspaces. We find evidence of a fine-grained geometric representation of word senses. We also present empirical descriptions of syntactic representations in both attention matrices and individual word embeddings, as well as a mathematical argument to explain the geometry of these representations.
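To make the abstract's claim about a "fine-grained geometric representation of word senses" concrete, the sketch below shows one way to probe it: extract BERT's contextual embeddings for a polysemous word ("bank") in sentences with different senses and project them to 2-D to see whether the senses separate geometrically. This is a minimal illustration, not the authors' pipeline; the model name, example sentences, and use of PCA (rather than the paper's visualizations) are assumptions, and it requires the Hugging Face `transformers` library plus `scikit-learn`.

```python
# Minimal sketch: do BERT context embeddings of a polysemous word
# cluster by sense? (Illustrative only; not the paper's exact method.)
import torch
from sklearn.decomposition import PCA
from transformers import BertModel, BertTokenizerFast

sentences = [
    "He sat on the bank of the river.",     # "bank" = riverside
    "They fished from the grassy bank.",    # riverside
    "She deposited money at the bank.",     # financial institution
    "The bank approved the loan quickly.",  # financial institution
]

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased")
model.eval()

vectors = []
for text in sentences:
    enc = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**enc).last_hidden_state[0]  # (seq_len, 768)
    # Find the position of the token "bank" and keep its context embedding.
    tokens = tokenizer.convert_ids_to_tokens(enc["input_ids"][0])
    idx = tokens.index("bank")
    vectors.append(hidden[idx].numpy())

# Project the context embeddings to 2-D; sense clusters should be visible.
coords = PCA(n_components=2).fit_transform(vectors)
for text, (x, y) in zip(sentences, coords):
    print(f"({x:+.2f}, {y:+.2f})  {text}")
```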
Year
2019
Venue
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019)
Volume
32
ISSN
1049-5258
Citations
0
PageRank
0.34
References
0
Authors
7
Name                 Order  Citations  PageRank
Andy Coenen          1      1          2.37
Emily Reif           2      0          0.34
Ann Yuan             3      4          1.86
Been Kim             4      353        21.44
Adam Pearce          5      0          0.68
Fernanda B. Viégas   6      0          0.34
Martin Wattenberg    7      4695       333.69