Title
Similarity-Based Support for Text Reuse in Technical Writing
Abstract
Technical writing in professional environments, such as user manual authoring for new products, is a task that relies heavily on reuse of content. Therefore, technical content is typically created following a strategy where modular units of text have references to each other. One of the main challenges faced by technical authors is to avoid duplicating existing content, as this adds unnecessary effort, generates undesirable inconsistencies, and dramatically increases maintenance and translation costs. However, there are few computational tools available to support this activity. This paper investigates the use of different similarity methods for the task of identification of reuse opportunities in technical writing. We evaluated our results using existing ground truth as well as feedback from technical authors. Finally, we also propose a tool that combines text similarity algorithms with interactive visualizations to aid authors in understanding differences in a collection of topics and identifying reuse opportunities.
Year
DOI
Venue
2015
10.1145/2682571.2797068
DocEng
Field
DocType
Citations 
Technical writing,World Wide Web,Document analysis,Information retrieval,Reuse,Computer science,Ground truth,Technical communication,Modular design,Database
Conference
4
PageRank 
References 
Authors
0.49
18
8