Title
Extraction and Analysis of Web Interviews.
Abstract
The amount of newspaper and blog articles keeps growing and the analysis of these unstructured data gains importance as well in research and in the business environment. As special kind of articles we like to focus on interviews. In contrast to regular articles, interviews consist of two or more speakers with different viewpoints. We propose a semi-supervised approach to detect webpages containing interviews. Our experiments show a high f-measure of 77.3% for a manually annotated test set. To apply text and author analysis approaches one needs to separate the excerpts for different authors and recognize their names. We present an extraction method of speakers and their corresponding text excerpts e.g. questions and answers. Based on the extracted text structure, we introduce first measures to understand the interviewer and interviewee.
Year
DOI
Venue
2015
10.1109/WI-IAT.2015.101
WI-IAT
Keywords
Field
DocType
web interviews,interviews,metrics,machine learning
Data mining,Web page,Information retrieval,Computer science,Viewpoints,Business environment,Interview,Newspaper,Feature extraction,Unstructured data,Test set
Conference
Volume
Citations 
PageRank 
1
0
0.34
References 
Authors
8
5
Name
Order
Citations
PageRank
Philipp Berger1178.14
Patrick Hennig2147.38
Johannes Eschrig311.09
Daniel Roeder400.34
Christoph Meinel52341319.90