SpanPredict: Extraction of Predictive Document Spans with Neural Attention - Citegraph

Paper Info

Title
SpanPredict: Extraction of Predictive Document Spans with Neural Attention

Abstract
In many natural language processing applications, identifying predictive text can be as important as the predictions themselves. When predicting medical diagnoses, for example, identifying predictive content in clinical notes not only enhances interpretability, but also allows unknown, descriptive (i.e., text-based) risk factors to be identified. We here formalize this problem as predictive extraction and address it using a simple mechanism based on linear attention. Our method preserves differentiability, allowing scalable inference via stochastic gradient descent. Further, the model decomposes predictions into a sum of contributions of distinct text spans. Importantly, we require only document labels, not ground-truth spans. Results show that our model identifies semantically-cohesive spans and assigns them scores that agree with human ratings, while preserving classification performance.

Year	Venue	DocType
2021	NAACL-HLT	Conference
Citations	PageRank	References
0	0.34	0
Authors
6

Authors (6 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Vivek Subramanian	1	0	0.68
Matthew Engelhard	2	4	2.79
Sam Berchuck	3	0	0.34
Liqun Chen	4	2082	139.89
Ricardo Henao	5	286	23.85
L. Carin	6	4603	339.36

1