From Spelling Correction to Text Cleaning - Using Context Information - Citegraph

Paper Info

Title
From Spelling Correction to Text Cleaning - Using Context Information

Abstract
Spelling correction is the task of correcting words in texts. Most of the available spelling correction tools only work on isolated words and compute a list of spelling suggestions ranked by edit-distance, letter-n-gram similarity or comparable measures. Although the probability of the best ranked suggestion being correct in the current context is high, user intervention is usually necessary to choose the most appropriate suggestion (Kukich, 1992). Based on preliminary work by Sabsch (2006), we developed an efficient context sensitive spelling correction system dcClean by combining two approaches: the edit distance based ranking of an open source spelling corrector and neighbour co-occurrence statistics computed from a domain specific corpus. In combination with domain specific replacement and abbreviation lists we are able to significantly improve the correction precision compared to edit distance or context based spelling correctors applied on their own.

Year	DOI	Venue
2007	10.1007/978-3-540-78246-9_47	Studies in Classification Data Analysis and Knowledge Organization
Field	DocType	ISSN
Information retrieval,Word lists by frequency,Ranking,Computer science,Levenshtein distance,Natural language processing,Spelling,Artificial intelligence	Conference	1431-8814
Citations	PageRank	References
10	0.65	4
Authors
3

Authors (3 rows)

Cited by (10 rows)

References (4 rows)

Name	Order	Citations	PageRank
Martin Schierle	1	26	2.85
Sascha Schulz	2	15	1.14
Markus Ackermann	3	22	3.63

1