A method for selecting constrained hand-printed character shapes for machine recognition

IEEE Trans Pattern Anal Mach Intell. 1982 Jan;4(1):74-8. doi: 10.1109/tpami.1982.4767199.

Abstract

Since handwritten characters vary in shape and writing-stroke sequence, it is desirable to develop a standard set of characters that are of high quality, so that not only are they easy to write, but they are also most suitable for machine recognition. A database of more than 100 000 alphanumeric patterns was assembled. It consisted of 174 models of the alphanumeric characters written by both left-handed and right-handed subjects. Based on frequency density and distance measurements, a nietric called the dispersion factor was computed to rank the various models. The principle of the metric is discussed, and results are given indicating the high quality models of the alphanumerics.