Title
Quantification and Analysis of Scientific Language Variation Across Research Fields.
Abstract
Quantifying differences in terminologies from various academic domains has been a longstanding problem yet to be solved. We propose a computational approach for analyzing linguistic variation among scientific research fields by capturing the semantic change of terms based on a neural language model. The model is trained on a large collection of literature in five computer science research fields, for which we obtain field-specific vector representations for key terms, and global vector representations for other words. Several quantitative approaches are introduced to identify the terms whose semantics have drastically changed, or remain unchanged across different research fields. We also propose a metric to quantify the overall linguistic variation of research fields. After quantitative evaluation on human annotated data and qualitative comparison with other methods, we show that our model can improve cross-disciplinary data collaboration by identifying terms that potentially induce confusion during interdisciplinary studies.
Year
DOI
Venue
2018
10.1109/ICDMW.2018.00037
2018 IEEE International Conference on Data Mining Workshops (ICDMW)
Keywords
DocType
Volume
Semantics,Linguistics,Computer science,Computational modeling,Measurement,Data models,Vocabulary
Conference
abs/1812.01250
ISSN
Citations 
PageRank 
2375-9232
1
0.37
References 
Authors
0
4
Name
Order
Citations
PageRank
Pei Zhou111.72
Muhao Chen28320.01
Kai-Wei Chang34735276.81
Carlo Zaniolo443051447.58