Title
Investigating content selection for language generation using machine learning
Abstract
The content selection component of a natural language generation system decides which information should be communicated in its output. We use information from reports on the game of cricket. We first describe a simple factoid-to-text alignment algorithm then treat content selection as a collective classification problem and demonstrate that simple 'grouping' of statistics at various levels of granularity yields substantially improved results over a probabilistic baseline. We additionally show that holding back of specific types of input data, and linking database structures with commonality further increase performance.
Year
Venue
Keywords
2009
ENLG
database structure,collective classification problem,simple factoid-to-text alignment algorithm,increase performance,improved result,natural language generation system,input data,granularity yield,content selection,machine learning,content selection component
Field
DocType
Citations 
Collective classification,Natural language generation,Computer science,Natural language processing,Artificial intelligence,Granularity,Probabilistic logic,Machine learning
Conference
10
PageRank 
References 
Authors
0.68
5
3
Name
Order
Citations
PageRank
Colin Kelly1221.70
Ann Copestake286295.10
Nikiforos Karamanis327419.23