Title
Best Practices for Convolutional Neural Networks Applied to Visual Document Analysis
Abstract
Neural networks are a powerful technology forclassification of visual inputs arising from documents.However, there is a confusing plethora of different neuralnetwork methods that are used in the literature and inindustry. This paper describes a set of concrete bestpractices that document analysis researchers can use toget good results with neural networks. The mostimportant practice is getting a training set as large aspossible: we expand the training set by adding a newform of distorted data. The next most important practiceis that convolutional neural networks are better suited forvisual document tasks than fully connected networks. Wepropose that a simple "do-it-yourself" implementation ofconvolution with a flexible architecture is suitable formany visual document problems. This simpleconvolutional neural network does not require complexmethods, such as momentum, weight decay, structure-dependentlearning rates, averaging layers, tangent prop,or even finely-tuning the architecture. The end result is avery simple yet general architecture which can yieldstate-of-the-art performance for document analysis. Weillustrate our claims on the MNIST set of English digitimages.
Year
DOI
Venue
2003
10.1109/ICDAR.2003.1227801
ICDAR-1
Keywords
Field
DocType
forvisual document task,neural network,visual document analysis,document analysis researcher,convolutional neural network,flexible architecture,convolutional neural networks applied,avery simple,document analysis,general architecture,best practices,suitable formany visual document,simpleconvolutional neural network,best practice,information processing,concrete,text analysis,support vector machines,digital image,neural networks,convolution,handwriting recognition
Computer vision,Architecture,Information processing,MNIST database,Convolution,Computer science,Convolutional neural network,Support vector machine,Handwriting recognition,Artificial intelligence,Artificial neural network,Machine learning
Conference
ISSN
ISBN
Citations 
1520-5363
0-7695-1960-1
550
PageRank 
References 
Authors
84.14
4
5
Search Limit
100550
Name
Order
Citations
PageRank
Patrice Y. Simard11112155.00
Dave Steinkraus255084.14
John Platt366111100.14
PY Simard455084.14
JC Platt555084.14