Title
An Algorithm for Identifying Authors Using Synonyms
Abstract
An approach for identifying the human source of a text by leveraging the significance of synonyms in language is presented. While others have attempted to identify authors in the past, they have focused on purely statistical approaches such as word length distribution, number of distinct words, and language models. We claim that an author's choice of synonyms is idiosyncratic and can be used in determining the identity of an author, which we demonstrate via our algorithm for recognizing authors. This algorithm uses synonym sets from the WordNet lexical database to give more weight to words that have many common synonyms. The results of this method applied to the task of identifying the authors of classic literature show that there is a correlation between an author's synonym choice and the author's identity. With this new author recognition technology, we may now explore new avenues of intelligent and meaningful interaction with users.
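The abstract outlines the idea only at a high level, so the following is a minimal sketch under stated assumptions, not the paper's actual algorithm. It assumes a tiny hand-written synonym table (`SYNSETS`) in place of WordNet, weights each word by how many alternatives the author could have chosen, and compares texts by cosine similarity. The names `synonym_count`, `profile`, `similarity`, and `identify`, and the choice of cosine similarity, are illustrative assumptions.

```python
from collections import Counter
import math

# Hypothetical toy synonym sets standing in for WordNet synsets (assumption).
SYNSETS = [
    {"big", "large", "huge"},
    {"said", "stated", "remarked", "declared"},
    {"house", "home"},
]


def synonym_count(word):
    """Count the alternatives the author could have used instead of `word`."""
    return sum(len(s) - 1 for s in SYNSETS if word in s)


def profile(text):
    """Build a normalized word profile; words with more synonyms weigh more."""
    counts = Counter(text.lower().split())
    weights = {}
    for word, count in counts.items():
        n_syn = synonym_count(word)
        if n_syn > 0:  # words with no synonyms carry no choice information
            weights[word] = count * n_syn
    total = sum(weights.values())
    return {w: v / total for w, v in weights.items()} if total else {}


def similarity(p, q):
    """Cosine similarity between two weighted profiles."""
    dot = sum(v * q.get(w, 0.0) for w, v in p.items())
    norm_p = math.sqrt(sum(v * v for v in p.values()))
    norm_q = math.sqrt(sum(v * v for v in q.values()))
    return dot / (norm_p * norm_q) if norm_p and norm_q else 0.0


def identify(unknown_text, known_texts):
    """Return the author whose known text best matches the unknown text."""
    target = profile(unknown_text)
    return max(
        known_texts,
        key=lambda author: similarity(target, profile(known_texts[author])),
    )


known = {
    "A": "the huge house she stated was huge",
    "B": "the big home he said was big",
}
print(identify("a huge house he stated", known))
```

In this toy setup both authors describe the same things, and only their choice among synonyms separates them, which is the signal the abstract claims is idiosyncratic. A fuller version would draw the synonym sets from WordNet and handle punctuation and word senses.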
Year: 2007
DOI: 10.1109/ENC.2007.7
Venue: ENC
Keywords: classic literature show, synonym set, distinct word, synonym choice, language model, new author recognition technology, new avenue, common synonym, human source, identifying authors, wordnet lexical database, natural language processing, text analysis, synonyms
Field: Length distribution, Information retrieval, Computer science, Lexical database, Synonym, Algorithm, Artificial intelligence, Natural language processing, WordNet, Language model
DocType: Conference
ISBN: 0-7695-2899-6
Citations: 4
PageRank: 0.51
References: 2
Authors: 2
Name; Order; Citations; PageRank
Jonathan H. Clark; 1; 411; 16.42
Charles J. Hannon; 2; 21; 2.06