Abstract | ||
---|---|---|
In this paper, we propose a method for "linguistic ethnography" --- a general mechanism for characterising texts with respect to the dominance of certain classes of words. Using humour as a case study, we explore the automatic learning of salient word classes, including semantic classes (e.g., person, animal), psycholinguistic classes (e.g., tentative, cause), and affective load (e.g., anger, happiness). We measure the reliability of the derived word classes and their associated dominance scores by showing significant correlation across different corpora. |
Year | DOI | Venue |
---|---|---|
2009 | 10.1007/978-3-642-00382-0_48 | CICLing |
Keywords | Field | DocType |
identifying dominant word classes,general mechanism,automatic learning,associated dominance score,different corpus,salient word class,word class,characterising text,certain class,affective load,linguistic ethnography,case study,natural language processing | Computer science,Part of speech,Correlation,Happiness,Anger,Artificial intelligence,Natural language processing,Affect (psychology),Ethnography,Linguistics,Language technology,Salient | Conference |
Volume | ISSN | Citations |
5449 | 0302-9743 | 6 |
PageRank | References | Authors |
0.50 | 11 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Rada Mihalcea | 1 | 6460 | 445.54 |
Stephen Pulman | 2 | 450 | 38.31 |