Title
A probabilistic approach to solving crossword puzzles
Abstract
We attacked the problem of solving crossword puzzles by computer: given a set of clues and a crossword grid, try to maximize the number of words correctly filled in. After an analysis of a large collection of puzzles, we decided to use an open architecture in which independent programs specialize in solving specific types of clues, drawing on ideas from information retrieval, database search, and machine learning. Each expert module generates a (possibly empty) candidate list for each clue, and the lists are merged together and placed into the grid by a centralized solver. We used a probabilistic representation as a common interchange language between subsystems and to drive the search for an optimal solution. PROVERB, the complete system, averages 95.3% words correct and 98.1% letters correct in under 15 minutes per puzzle on a sample of 370 puzzles taken from the New York Times and several other puzzle sources. This corresponds to missing roughly 3 words or 4 letters on a daily 1515 puzzle, making PROVERB a better-than-average cruciverbalist (crossword solver).
Year
DOI
Venue
2002
10.1016/S0004-3702(01)00114-X
Artif. Intell.
Keywords
Field
DocType
candidate list,crossword solver,centralized solver,database search,new york times,puzzle source,crossword puzzles,information retrieval,probabilistic constraint satisfaction,loopy belief propagation,crossword grid,crossword puzzle,common interchange language,posterior probability,probabilistic reasoning,better-than-average cruciverbalist,probabilistic approach,constraint satisfaction,open architecture,machine learning
Open architecture,Mathematical puzzle,Computer science,Theoretical computer science,Posterior probability,Artificial intelligence,Solver,Probabilistic logic,Machine learning,Grid,Belief propagation
Journal
Volume
Issue
ISSN
134
1-2
0004-3702
Citations 
PageRank 
References 
29
2.12
11
Authors
3
Name
Order
Citations
PageRank
Michael L. Littman19798961.84
Greg A. Keim28712.95
Noam Shazeer3108943.70