Title
French and German Corpora for Audience-based Text Type Classification.
Abstract
This paper presents some of the results of the CLASSYN project which investigated the classification of text according to audience-related text types. We describe the design principles and the properties of the French and German linguistically annotated corpora that we have created. We report on tools used to collect the data and on the quality of the syntactic annotation. The CLASSYN corpora comprise two text collections to investigate general text types difference between scientific and popular science text on the two domains of medical and computer science.
Year
Venue
Keywords
2012
LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION
audience-based text type,features for text categorization,text extraction
Field
DocType
Citations 
Design elements and principles,Annotation,Computer science,Text types,Speech recognition,Artificial intelligence,Natural language processing,Linguistics,Syntax,German
Conference
0
PageRank 
References 
Authors
0.34
5
5
Name
Order
Citations
PageRank
Amalia Todirascu1268.52
Sebastian Padó21787146.15
Jennifer Krisch300.68
Max Kisselew472.54
Ulrich Heid519040.48