CA2162609C - Method for recognizing handwritten input - Google Patents

Method for recognizing handwritten input

Info

Publication number
CA2162609C
CA2162609C CA002162609A CA2162609A CA2162609C CA 2162609 C CA2162609 C CA 2162609C CA 002162609 A CA002162609 A CA 002162609A CA 2162609 A CA2162609 A CA 2162609A CA 2162609 C CA2162609 C CA 2162609C
Authority
CA
Canada
Prior art keywords
distance
discrete continuous
discrete
continuous segments
segment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CA002162609A
Other languages
French (fr)
Other versions
CA2162609A1 (en
Inventor
John L. C. Seybold
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Motorola Solutions Inc
Original Assignee
Motorola Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Motorola Inc filed Critical Motorola Inc
Publication of CA2162609A1 publication Critical patent/CA2162609A1/en
Application granted granted Critical
Publication of CA2162609C publication Critical patent/CA2162609C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/15Cutting or merging image elements, e.g. region growing, watershed or clustering-based techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Abstract

The present invention determines whether two discrete continuous segments of handwritten input S1 (210) and S2 (220) form part of the same handwritten input or are part of more than one, separate handwritten inputs. The present method calculates one or more substantially parallel distance (260) disposed substantially parallel to the writing access (230) and compares these distances to one or more predefined thresholds. The predefined thresholds specify minimum distance measures which must be exceed by the substantially parallel distances (260) for the discrete continuous segments (210, 220) to be judged as belonging to separate segments of handwritten input.

Description

METHOD FOR RECOGNIZING HANDWRITTEN INPUT
Field Of The Invention This invention relates generally to handwriting recognition, and more particularly to recognition of individual words.
Background of the Invention Machine recognition of human handwriting is a very difficult problem, and with the recent explosion of pen-based computing devices, has become an important problem to be addressed. Machine recognition of human handwriting has various present applications.
One example of the current application for machine recognition of human handwriting is found in personal digital assistants. Typically these type of products have a touch sensitive screen upon which a user can impose handwriting. These devices then function to 2 0 digitize the handwritten input, such as alphanumeric input, and thereafter process the input in an attempt to recognize the information content of the handwriting.
Pursuant to one prior art handwriting recognition technique, one makes a best determination as to the identity of 2 5 each alphanumeric character in sequence, with the resulting string of characters comprising the result of the recognition activity. There are a variety of drawbacks to this approach. It is hindered by the difficulty of identifying spatial boundaries of the candidate inputs (in this case alphanumeric characters to be 3 0 recognized. When these boundaries are not located correctly, it is impossible to recognize the character accurately, since it will either be lacking pieces or will incorporate extraneous material from adjacent characters.

One significant problem with machine recognition of human handwriting is the ability to recognize the end of one input and the beginning of the next input. For example, a significant problem. exists in locating the end of one handwritten input segment, word, or alphanumeric input, from the beginning of the subsequent handwritten input segment, word, or alphanumeric input. Poor recognition of such breaks in the handwritten 'input results in poor, inaccurate interpretation of the information content of the handwritten input.
Accordingly, a need exists for a handwriting recognition technique that can detect the end of a first handwritten input segment from the beginning of a second handwritten input segment. in the handwritten input and thereby provide a more accurate interpretation of the information content of the handwritten input.
Brief Description of the Drawings 2 0 FIG. 1 Illustrates a flow diagram of operation in accordance with a preferred embodiment of the present invention.
FIG. 2 Illustrates a graphical view of an illustrative display in accordance with a preferred embodiment of the 2 5 present invention.
FIG. 3 Illustrates a graphical view of an illustrative display in accordance with a preferred embodiment of the present invention.
" FIG. 4 Illustrates a graphical view of an illustrative 3 0 display in accordance with an alternative preferred embodiment of the invention.
FIG. 5 Illustrates a graphical view of an illustrative display in accordance with a preferred embodiment of the present invention.

Detailed Description of the Preferred Embodiments Typically, handwritten character input is collected from the user in the form of discrete continuous segments. A
discrete continuous segment consists of one or more pen strokes, where a pen stroke is the mark left by a pen during its period of contact with an input device such as a digitizing tablet or paper. A stroke is represented as a sequence of points sampled at approximately regular intervals by the input device. Each point is described at minimum by an X coordinate and a Y coordinate. Strokes may be captured electronically using a digitizing tablet, or in an alternative embodiment may be derived from a scanned or faxed image through a process of line detection in the image; such methods of capturing input 1 5 electronically are ~ understood in the art.
Generally, the present invention as disclosed determines whether two discrete continuous segments form part of the same handwritten character input or part of more than one handwritten character input. In the present invention one or 2 0 more discrete continuous segments are the units of handwritten input being recognized. Handwritten input is input which is captured electronically that includes but is not limited to the following: handwritten input; electronic input;
input captured through pressure, such as stamped input; input 2 5 that is received electronically, such as via facsimile, pager, or other device. For example - the present invention determines whether two discrete continuous segments form part of the same word or whether they form part of separate words. In a pref~'rred method, the present invention calculates one or more 3 0 substantially parallel distances disposed substantially parallel to the writing axis, and compares these distances to one or more predefined thresholds. The predefined thresholds specify minimum distance measures which must be exceeded by the substantially parallel distances for the discrete 3 5 continuous segments to be judged as belonging to separate -- 21 b2b09 handwritten input, for example separate handwritten words.
The writing axis is the line along which the handwritten input is added. The writing direction is the direction in which each subsequent handwritten input is added In English, handwritten input is added typically along a horizontal writing axis with each subsequent alphanumeric input following horizontally after the previous input in a writing direction that is left to right. Various other writing axis and writing directions alternatives are possible with implementation of the_ teachings of the present invention.
In a preferred embodiment, the handwriting axis is horizontal and the handwritten input forms a series of words. In this preferred embodiment, the substantially parallel distances are calculated horizontally, and the output tells whether discrete continuous segments belong to separate words. In an alternative preferred embodiment, the handwriting axis is horizontal and the handwritten input forms a series of separate characters, which may be alphanumeric characters, ideographic characters as found in languages such as Chinese, or other 2 0 forms of characters or symbols of written communications. In this alternative embodiment, the output tells whether the discrete continuous segments belong to separate characters. In another preferred embodiment, the handwriting axis is vertical and the handwritten input forms a series of separate 2 5 characters, which may be alphanumeric characters, ideo~ranhic characters, or other handwritten text. In this preferred embodiment, the writing axis is vertical and the substantially parallel distances are aligned vertically; the output tells whether the discrete continuous segments belong to separate 3 0 characters. In yet another preferred embodiment, the handwriting axis is vertical and the handwritten input forms a series of separate words, alphanumeric input, or other handwritten input, such as a vertical list of words, or numbers.
In this preferred embodiment, the writing axis is vertical and 3 5 the substantially parallel distances are aligned vertically; the s 21 626 09 output tells whether the discrete continuous segments belong to separate handwritten input, such as separate words. As disclosed above and as will be discussed further, the present invention demonstrates through the disclosure of several of the preferred embodiments that the writing axis may exist at any angle and the handwritten input may be interpreted more generally as corresponding to discrete elements (including but not limited to characters and words) containing one or more discrete continuous segments. The application of the methods described herein to any of various preferred embodiments requires only a change in the coordinate system used and such modifications can be made in accordance with the teachings presented.
Referring now to FIG. 1, a preferred method of the present invention is illustrated. The present invention is applicable to one or more handwritten inputs of discrete continuous segments. The preferred embodiments of the present invention, are applicable to two or more handwritten inputs of discrete continuous segments. The use of only two 2 0 discrete continuous segments S 1 and S2 is for illustrative purposes. In the preferred method illustrated in FIG. 1, handwritten input consisting of two discrete continuous segments S, and SZ (110) is accepted by a device, such as a personal digital assistant (PDA) or other device. Other devices which function to receive 2 5 handwritten input include but are not limited to the following:
computers, modems, pagers, telephones, digital or interactive or other televisions, devices having a digitizing tablet, facsimile devices, scanning devices, and any device with the ability to captttie handwritten input. Preferably, upon acceptance of the 3 0 handwritten input the substantially perpendicular boundaries b 1 and b2 between the discrete continuous segments S 1 and S2 are identified ( 120). The boundaries b 1 and b2 which are substantially perpendicular to the writing axis are determined by finding the point in the stroke sequence S 1 that has the 3 5 largest displacement along the writing direction and the point in the stroke sequence S2 that has the smallest displacement along the writing direction ( 120). The substantially perpendicular boundary bl is the displacement values of the point in the stroke sequence S1 that has the largest displacement along the writing direction. The substantially perpendicular boundary b2 is the displacement values of the point in the stroke sequence S2 that has the smallest displacement along the writing direction. By calculating the displacement in the writing direction of each point in S 1 in turn and comparing that value to a stored value that is initially a very large negative number, the substantially perpendicular boundary bl that has the largest displacement along the writing direction can be determined. If the calculated displacement value is larger than the stored value, the stored value is replaced with the calculated displacement value. Once all the points have been examined, the stored value will contain the value of the largest displacement value found in the stroke sequence. A similar procedure, starting with a large positive initializing value, can be used to discover the point in 2 0 S 2 that has the smallest displacement along the writing direction by calculating the displacement in the writing direction of each point in S2 in turn and comparing that value to a stored value that is initially a very large positive number.
If the calculated displacement value is less than the stored 2 5 value, the stored value is replaced with the calculated displacement value. The preferred method then calculates a first substantially parallel distance d', where d' = b2 - bl (130).
Referring to FIG. 1, the first substantially parallel distance d' is compared to a first predetermined threshold t'.
3 0 If d' is greater than or equal to (>_) t' the preferred method concludes that the first continuous discrete segment S 1 and the second continuous discrete segment S2 belong to different segments of handwritten input (145). By way of example if d' is greater than (>) t' the preferred method concludes S 1 and S2 3 5 are different words, characters, or other elements of _ 2162609 handwritten input. The selection of a predetermined threshold t' is a value made in accordance with the specific embodiment.
Choice of a threshold t' will be discussed in further detail elsewhere.
If the value of d' is smaller, or less than, the threshold value t' further processing occurs. To determine if the discrete continuous segments S 1 and S2 form separate discrete continuous segments of handwritten input, such as separate words, characters, or other elements, a second substantially parallel distance d" is calculated. The second substantially parallel distance d" is found by calculating several substantially parallel distances and choosing the shortest of those distances. In a preferred embodiment, this is done by first calculating the maximum extent perpendicular to the 1 5 writing axis subtended by S 1 and SZ together ( 150). The extent is then divided into a number of bands of equal height substantially parallel to the writing axis ( 160). The substantially parallel distance between S 1 and SZ for each band ( 170) is then found. The smallest or shortest 2 0 substantially parallel distance between S 1 and S2 from among the plurality of bands is selected as d" ( 180). In the preferred method represented in FIG 1, once the substantially horizontal distance d" is found, the distance d" and the first substantially horizontal distance d' are combined to aid in making a final 2 5 decision that S 1 and S2 are part of the same handwritten input or are parts of separate discrete continuous segments of handwritten input. A weighted average of d' and d" is calculated. A preferred weighted average equation is q'd' +
q"d"t(q' + q"). It has been found by empirical tests on actual 3 0 handwriting data that a weighted average of d' and d" is more accurate than either alone. In one preferred embodiment, q' q", but it may be possible to optimize this further by a judicious choice of q' and q" which are not identical.
Additionally, q' and q" can each be equal to zero, but not at the 3 5 same time.

s The weighted average is compared against a second threshold t". If the weighted average is less than or smaller than t" the preferred method concludes that S 1 and S2 are part of the same segment of handwritten input ( 197). If the weighted average is larger than or equal to t" the preferred method concludes that S 1 and S2 are different continuous discrete segments of handwritten input and the preferred method concludes with that' result (199).
The preferred method of selecting t' and t" is a detail of 1 0 the specific embodiment. In one embodiment, t' may be set by measuring the value of d' across a large set of data and choosing the value that best distinguishes real dividing points from false choices. In another preferred embodiment, t' may be set to a constant :value that corresponds to some fraction of 1 S the distance between input guides on the input device. In another preferred embodiment, t' may vary dynamically as a fraction of the measured extent of S 1 and S2 perpendicular to the writing axis. In another embodiment, t' may be set explicitly by the user before the method is invoked. In the 2 0 preferred embodiment illustrated, t' is set to be 1 /3 of the measured extent of S 1 and S2 perpendicular to the writing axis, which gives a good result for English, alphanumeric text.
Many other ways of setting t' may be considered and may be optimal for different solutions. The various ways of 2 5 determining t" in different embodiments are similar to those described for determining t'.
In FIG. 2, a graphical representation is given, for a preferred embodiment. In this embodiment, the discrete conti'tiuous segments, S 1 (210) and S2 (220), correspond to 3 0 English words, the writing axis (230) is horizontal, and the writing direction (270) is left to right. In this preferred embodiment, the displacement along the writing direction for any point is simply the value of the X coordinate for that point, so the substantially perpendicular boundary bl (240) is the 3 S value of the X coordinate of the rightmost point in S 1 (210) 2i626~9 and the substantially perpendicular boundary b2 (250) is the value of the X coordinate of the leftmost point in S2 (220). The substantially parallel distance d' (260) is the distance extending between bl and b2.
Alternatively, as illustrated in FIG. 3, the discrete continuous segments S1 (310) and S2 (320) correspond to Chinese characters, the writing axis (330) is vertical, and the writing direction (370) is top to bottom. In this embodiment, the displacement along the writing direction (370) for any 1 0 point is simply the value of the Y coordinate for that point multiplied by -1 (assuming a standard coordinate system in which Y increases as one moves from bottom to top), so the substantially perpendicular boundary bl (340) is the value of the Y coordinate times -1 of the bottommost point in S1 (310) 1 S and the substantially perpendicular boundary b2 (350) is the value of the Y coordinate times -1 of the topmost point in S2 (320).
In other embodiments, the values for bl and b2 can be calculated by applying a simple geometric rotation of the 2 0 handwritten input to line up the writing axis with one of the cardinal axes of the coordinate system, and then applying the procedure just described. This is a straightforward operation that will be understood by persons in conjunction with the teachings presented here. Alternatively, other embodiments 2 5 may calculate the displacements of the points along the writing direction, when the writing direction does not line up with one of the cardinal axes of the coordinate system, by a simple geometric projection of the discrete continuous segments onto the writing axis and then using trigonometry to calculate the 3 0 displacement of the projected points from the origin of the coordinate system. This is a well-understood mathematical procedure that can be applied to the teachings herein.
As stated previously, if d' is smaller than t', the processing continues. In many cases, segments that do in fact 3 5 belong to separate words or characters are positioned in such a to way that d' is small or even negative, but it is still possible to determine that S 1 and S2 belong to different words or characters. FIG. 4 shows an example of this situation for an embodiment in which S1 (410) and SZ (420) are English words.
In this example, d' (460) is actually negative, and bl (440) is larger than b2 (450). In this example, the writing axis (430) is horizontal and the writing direction (470) is left to right.
In one embodiment in which the writing axis is horizontal, the maximum extent perpendicular to the writing axis subtended by S 1 and S2 is calculated by finding the minimum and maximum Y coordinate values present in S 1 and S 2 . To find the minimum Y coordinate value, a stored value is initialized with a very large positive value, and then every point in S 1 and S2 is examined in turn, and if its Y coordinate value is smaller than the stored value, the stored value is assigned that Y coordinate value. A similar method is used to find the maximum Y coordinate value. By subtracting the maximum and minimum Y coordinate values, the extent perpendicular to the writing axis can be calculated. All points 2 0 in S i and S2 lie within this extent.
In a preferred embodiment, this process can be made more efficient by considering only a subset of the points in S 1 and S2 such that only a fixed number of points near the boundary in question are considered , since these are the ones 2 5 most likely to affect the measure being derived.
In an alternative embodiment, in which the writing axis is vertical, the maximum extent perpendicular to the writing axis subtended by S1 and S2 is calculated by finding the minimum and maximum X coordinate values present in S 1 and 3 0 S 2 . To find the minimum X coordinate value, a stored value is initialized with a very large positive value, and then every point in S1 and S2 is examined in turn, and if its X coordinate value is smaller than the stored value, the stored value is assigned that X coordinate value. A similar method is used to 3 5 find the maximum X coordinate value. By subtracting the maximum and minimum X coordinate values, the extent perpendicular to the writing axis can be calculated.
In an alternative embodiment, where the writing axis is neither horizontal nor vertical, the image plane can be rotated to align the writing axis with the X or Y axis of the coordinate system, as described above in the calculation of the substantially parallel distance d' when the writing axis was neither horizontal nor vertical.
Once the extent perpendicular to the writing axis has been found, a preferred embodiment divides this extent into bands substantially parallel to the writing axis, such that each band describes a narrow slice through S i and S2. Within each band, the .point in S 1 with the largest displacement in the writing direction is found, and the point in S2 with the smallest 1 5 displacement in the writing direction is found. This can be done efficiently with respect to S1 by initializing a stored value for each band to a very large negative number. Each point in S 1 is checked in turn. First, its displacement perpendicular to the writing axis is checked to see which band it lies in. Since 2 0 the bands span the entire extent of S1 and SZ perpendicular to the writing axis, it is guaranteed that a band can be found for any point in S 1 and S2. If the preferred embodiment described above for calculating the perpendicular extent by considering only a subset of points in S1 and S2, the perpendicular extent 2 5 may not contain all points in S 1 and S2, and so each point must be checked to ensure that it lies within a band. If it does not, it is not considered further. Once the band is identified, the displacement of the point in the writing direction is compared to th'~ stored value for that band, and if the displacement is 3 0 larger, it is assigned to the stored value. Once all of the points in S 1 have been checked in this way, the stored value in each band ~ will contain the largest displacement of the point with the largest displacement encountered in that band in S 1. If there were no points in S 1 lying within a given band, the stored 3 5 value will remain as the very large negative initializing value.

A similar procedure, using a second stored value in each band initialized to a large positive number, is used with respect to S2 to find the minimum displacement of the point with the minimum displacement in the writing direction within each band in S2. If there are no points for a given band in S2, the second stored value for that band remains at its large initialized value.
At the end of this operation, for each band there are two stored values corresponding to the largest displacement in the 1 0 writing direction of S 1 in the band and the smallest displacement in the writing direction of S2 in the band. If either stored value has not changed from its very large negative or positive initializing value, then the information for that band is not used in any further calculations. Otherwise, a distance is calculated for each band by subtracting the stored value for the displacement of the point in S 1 from the stored value for the displacement of the point in S2. The smallest of these distances is found by examining the distance for each band in turn and storing the smallest one found. This smallest 2 0 distance is the substantially horizontal distance d". If d" is less than 0, which can occur if S 1 and SZ touch or overlap, it is assigned the value 0.
One embodiment in which the writing axis is horizontal and the writing direction is left to right is shown in FIG. 5. In 2 5 this embodiment, the number of bands (510) is 10. The rightmost point of S 1 lying within each band is found (520) and the leftmost point of S2 lying within that band is also found (530). This is done with respect to S1 by initializing a stored value' for each band to a very large negative number. Each 3 0 point in S 1 is checked in turn. First, its Y coordinate is checked to see which band it lies in. Since the bands span the entire height, extent (540) of S 1 and S2, it is guaranteed that a band can be found for any point in S1 and S2. Once the band is identified, the X coordinate of the point is compared to the 3 5 stored value for that band, and if the X coordinate is larger, it is assigned to the stored value. Once all of the points in S1 have been checked in this way, the stored value in each band will contain the X coordinate of the rightmost point encountered in that band in S 1. If there were no points in S 1 lying within a given band, the stored value will remain as the very large negative initializing value. A similar procedure with respect to S2, using a second stored value in each band initialized to a large positive number, is used to find the X
coordinate of the leftmost point in each band in S2.
1 0 As described previously, the value of d" is computed from the stored locations for each band. The smallest, or shortest, substantially parallel distance between S1 and SZ
from amount the plurality of bands is selected as d". The distance d" is then combined with d' via a weighted average 1 5 and compared to t". Upon comparison with t" the method of the present invention concludes whether the discrete continuous segments S1 and S2 belong to the same handwritten input or separate handwritten inputs.
It will be apparent to those skilled in the art that the 2 0 disclosed invention may be modified in numerous ways and may assume many embodiments other than the preferred forms particularly set out and described above. Accordingly, it is intended by the appended claims to cover all modifications of the invention that fall within that fall within the true spirit 2 5 and scope of the invention and its equivalents.

Claims (9)

THE EMBODIMENT OF THE INVENTION IN WHICH AN EXCLUSIVE PROPERTY
OR PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS:
1. A method comprising the steps of:
receiving handwritten character input, which handwritten character input is comprised of at least a first and a second discrete continuous segment:
calculating of at least one distance between the first and second discrete continuous segments, the distance being substantially parallel to a writing axis of the first and second discrete continuous segments including the steps of:
calculating a plurality of distances between the first and second discrete continuous segments which are substantially parallel to the writing axis of the first and second discrete continuous segments:
identifying which of the plurality of distances has a smallest value; and using the distance to determine whether the first and second discrete continuous segments belong to separate handwritten character inputs.
2. The method of claim 1, wherein the step of using the parallel distance to determine whether the first and second discrete continuous segments belong to separate handwritten inputs includes the step of determining whether the first and second discrete continuous segments belong to separate handwritten inputs based upon the smallest value of the plurality of distances.
3. A method comprising the steps of:
receiving handwritten character input, which handwritten character input is comprised of at least a first and a second discrete continuous segment;
calculating at least one distance between the first and second discrete continuous segments, the distance being substantially parallel to a writing axis of the first and second discrete continuous segments;
identifying a boundary for each of the first and second discrete continuous segments, which boundary for each of the first and second discrete continuous segments is substantially perpendicular to the writing axis;
calculating a distance between the boundaries for the first and second discrete continuous segments; and using the distance between the boundaries for the first and second discrete continuous segments to determine whether the first and second discrete continuous segments belong to separate handwritten inputs.
4. A method comprising the steps of:
receiving handwritten input comprising first and second continuous segments, the handwritten input having a first axis defined along a direction of writing of the handwritten input;
determining a boundary for each of the first and second continuous segments, the boundary for each of the first and second continuous segments being defined along a second axis perpendicular to the first axis through a point along the first axis for each of the first and second continuous segments nearest the boundary of the other of the first and second continuous segments;
calculating a distance parallel to the first axis between the boundaries; and determining from the distance whether the first and second continuous segments are separate handwritten inputs, including the steps of:
comparing the distance to a first predetermined threshold;
determining that the first and second continuous segments are separate handwritten inputs when the distance is greater than the first predetermined threshold;
separating each of the first and second continuous segments into a plurality of sections when the distance is not greater than the first predetermined threshold;
analyzing each of the plurality of sections to determine which of the plurality of sections has a portion of each of the first and second continuous segments;
for each of the plurality of sections where the portion of each of the first and second continuous segments has been found, calculating a segment distance between the portions of the first and second continuous segments that are nearest to each other; and determining from the distance and the segment distance whether the first and second continuous segments are separate handwritten input.
5. A method according to claim 4 wherein the step of determining from the distance and the section distance whether at least first and second continuous segment are separate handwritten input comprises the steps of:
calculating a weighted average of the distance and the segment distance;

comparing the weighted average to a second predetermined threshold;
determining that the at least first and second continuous segment are separate handwritten input if the weighted average is greater than the second predetermined threshold; and determining that the at least first and second continuous segment are not separate handwritten input if the weighted average is not greater than the second predetermined threshold.
6. A method according to claim 4 wherein the sections are distributed in a direction perpendicular to the writing axis.
7. A method according to claim 6 wherein the sections are of equal breadth in a direction perpendicular to the writing axis.
8. A method, comprising the steps of:
processing first and second discrete continuous segments representing handwritten input entered along a writing axis to identify a first boundary for the first discrete continuous segment and a second boundary for the second discrete continuous segment, the first boundary representing a greatest displacement of the first discrete continuous segment from a reference and the second boundary representing a least displacement of the second discrete continuous segment from the reference;
calculating a first distance between the first and second boundary, the first distance being substantially parallel to the writing axis;
determining that the first and second discrete continuous segments represent a separate handwritten character when the first distance exceeds a first threshold;
determining a second distance when the first distance does not at least exceed the first threshold, the second distance being substantially parallel to the writing axis and selected as a minimum distance from a plurality of distances between left-most points of the first discrete continuous segment and right-most points of the second discrete continuous segment in a like plurality of bands being substantially parallel to the writing axis; and determining that the first and second discrete continuous segments represent a separate handwritten character when the second distance exceeds a second threshold.
9. A method comprising the steps of:
receiving handwritten character input, which handwritten character input is comprised of at least a first and a second discrete continuous segment positioned along a writing axis;
defining a first and a second boundary which are substantially perpendicular to the writing axis;
calculating the first boundary by finding the point in the first discrete continuous segment that has the largest displacement from a reference point along the writing axis;
calculating the second boundary by finding the point in the second discrete continuous segment that has the smallest displacement from the reference point along the writing axis;
determining that the largest displacement of the point in the first discrete continuous segment is greater than the smallest displacement of the point in the second discrete continuous segment;
calculating a negative value by subtracting the smallest displacement of the point of the second discrete continuous segment from the largest displacement of the point of the first discrete continuous segment; and using the negative value to determine whether the first and second discrete continuous segments belong to separate handwritten inputs.
CA002162609A 1994-05-10 1995-05-08 Method for recognizing handwritten input Expired - Fee Related CA2162609C (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US08/240,405 US5600735A (en) 1994-05-10 1994-05-10 Method of recognizing handwritten input
US08/240,405 1994-05-10
PCT/US1995/005640 WO1995030965A1 (en) 1994-05-10 1995-05-08 Method for recognizing handwritten input

Publications (2)

Publication Number Publication Date
CA2162609A1 CA2162609A1 (en) 1995-11-16
CA2162609C true CA2162609C (en) 1999-12-14

Family

ID=22906381

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002162609A Expired - Fee Related CA2162609C (en) 1994-05-10 1995-05-08 Method for recognizing handwritten input

Country Status (18)

Country Link
US (1) US5600735A (en)
EP (1) EP0708944B1 (en)
JP (1) JPH09500473A (en)
CN (1) CN1128073A (en)
AT (1) ATE221225T1 (en)
AU (1) AU672558B2 (en)
BR (1) BR9506217A (en)
CA (1) CA2162609C (en)
CZ (1) CZ285285B6 (en)
DE (1) DE69527487T2 (en)
ES (1) ES2181776T3 (en)
FI (1) FI112403B (en)
HU (1) HU219991B (en)
IL (1) IL113658A (en)
NO (1) NO309247B1 (en)
PL (1) PL312986A1 (en)
SK (1) SK3196A3 (en)
WO (1) WO1995030965A1 (en)

Families Citing this family (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0739521B1 (en) * 1994-11-14 2001-10-31 Motorola, Inc. Method of splitting handwritten input
US5801941A (en) * 1996-08-12 1998-09-01 International Business Machines Corporation Mobile client computer programmed to establish soft keyboard targeting sensitivity
US6108444A (en) * 1997-09-29 2000-08-22 Xerox Corporation Method of grouping handwritten word segments in handwritten document images
WO1999038149A1 (en) 1998-01-26 1999-07-29 Wayne Westerman Method and apparatus for integrating manual input
US7091959B1 (en) * 1999-03-31 2006-08-15 Advanced Digital Systems, Inc. System, computer program product, computing device, and associated methods for form identification and information manipulation
US6826551B1 (en) * 2000-05-10 2004-11-30 Advanced Digital Systems, Inc. System, computer software program product, and method for producing a contextual electronic message from an input to a pen-enabled computing system
US6798907B1 (en) * 2001-01-24 2004-09-28 Advanced Digital Systems, Inc. System, computer software product and method for transmitting and processing handwritten data
US20020107885A1 (en) * 2001-02-01 2002-08-08 Advanced Digital Systems, Inc. System, computer program product, and method for capturing and processing form data
US7639876B2 (en) * 2005-01-14 2009-12-29 Advanced Digital Systems, Inc. System and method for associating handwritten information with one or more objects
US7295206B2 (en) * 2005-01-31 2007-11-13 Microsoft Corporation Ink input region adjustments
US7697001B2 (en) * 2005-01-31 2010-04-13 Microsoft Corporation Personalized ink font
US7613341B1 (en) * 2005-04-19 2009-11-03 Adobe Systems Incorporated Gap detection in a drawing
US7720286B2 (en) * 2005-05-25 2010-05-18 Advanced Digital Systems, Inc. System and method for associating handwritten information with one or more objects via discontinuous regions of a printed pattern
CN100430958C (en) * 2005-08-18 2008-11-05 富士通株式会社 Identifying distance regulator and method thereof and text lines identifier and method thereof
CN101192124B (en) * 2006-11-17 2010-04-21 中兴通讯股份有限公司 System and method for automatic distinguishing and processing for touch screen input information
US8269727B2 (en) * 2007-01-03 2012-09-18 Apple Inc. Irregular input identification
US7855718B2 (en) * 2007-01-03 2010-12-21 Apple Inc. Multi-touch input discrimination
US8130203B2 (en) 2007-01-03 2012-03-06 Apple Inc. Multi-touch input discrimination
JP5343617B2 (en) * 2009-02-25 2013-11-13 富士通株式会社 Character recognition program, character recognition method, and character recognition device
CN102663388B (en) * 2012-03-27 2014-01-08 复旦大学 Method for segmenting handwritten character from background image
CN104346631A (en) * 2013-07-30 2015-02-11 夏普株式会社 Image distinguishing method, image processing device and image outputting device
JP6492894B2 (en) 2015-04-01 2019-04-03 富士通株式会社 Recognition program, recognition method, and recognition apparatus
US11762989B2 (en) 2015-06-05 2023-09-19 Bottomline Technologies Inc. Securing electronic data by automatically destroying misdirected transmissions
US20170163664A1 (en) 2015-12-04 2017-06-08 Bottomline Technologies (De) Inc. Method to secure protected content on a mobile device
US11163955B2 (en) 2016-06-03 2021-11-02 Bottomline Technologies, Inc. Identifying non-exactly matching text
US10235356B2 (en) 2016-06-03 2019-03-19 Bottomline Technologies (De), Inc. Dual authentication method for identifying non-exactly matching text
US11416713B1 (en) 2019-03-18 2022-08-16 Bottomline Technologies, Inc. Distributed predictive analytics data set
US11042555B1 (en) 2019-06-28 2021-06-22 Bottomline Technologies, Inc. Two step algorithm for non-exact matching of large datasets
US11269841B1 (en) 2019-10-17 2022-03-08 Bottomline Technologies, Inc. Method and apparatus for non-exact matching of addresses
US11763278B2 (en) 2020-03-13 2023-09-19 Bottomline Technologies, Inc. Deposit token service system, apparatus and method
US11449870B2 (en) 2020-08-05 2022-09-20 Bottomline Technologies Ltd. Fraud detection rule optimization
US11544798B1 (en) 2021-08-27 2023-01-03 Bottomline Technologies, Inc. Interactive animated user interface of a step-wise visual path of circles across a line for invoice management
US11694276B1 (en) 2021-08-27 2023-07-04 Bottomline Technologies, Inc. Process for automatically matching datasets

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4206442A (en) * 1974-07-03 1980-06-03 Nippon Electric Co., Ltd. Letter segmenting apparatus for OCR comprising multi-level segmentor operable when binary segmenting fails
US4379282A (en) * 1979-06-01 1983-04-05 Dest Corporation Apparatus and method for separation of optical character recognition data
JPS6274181A (en) * 1985-09-27 1987-04-04 Sony Corp Character recognizing device
JP2822189B2 (en) * 1988-05-19 1998-11-11 ソニー株式会社 Character recognition apparatus and method
US5062141A (en) * 1988-06-02 1991-10-29 Ricoh Company, Ltd. Method of segmenting characters in lines which may be skewed, for allowing improved optical character recognition

Also Published As

Publication number Publication date
DE69527487T2 (en) 2002-12-19
SK3196A3 (en) 1997-02-05
CZ285285B6 (en) 1999-06-16
JPH09500473A (en) 1997-01-14
IL113658A0 (en) 1995-08-31
NO309247B1 (en) 2001-01-02
HUT73908A (en) 1996-10-28
EP0708944A1 (en) 1996-05-01
NO955088D0 (en) 1995-12-15
FI112403B (en) 2003-11-28
NO955088L (en) 1995-12-15
PL312986A1 (en) 1996-05-27
US5600735A (en) 1997-02-04
IL113658A (en) 1998-10-30
FI955609A0 (en) 1995-11-22
DE69527487D1 (en) 2002-08-29
MX9600190A (en) 1998-11-30
EP0708944A4 (en) 1996-09-18
CN1128073A (en) 1996-07-31
CA2162609A1 (en) 1995-11-16
AU2472495A (en) 1995-11-29
BR9506217A (en) 1997-09-30
AU672558B2 (en) 1996-10-03
ATE221225T1 (en) 2002-08-15
FI955609A (en) 1995-11-22
HU9503881D0 (en) 1996-02-28
HU219991B (en) 2001-10-28
EP0708944B1 (en) 2002-07-24
ES2181776T3 (en) 2003-03-01
CZ6096A3 (en) 1996-07-17
WO1995030965A1 (en) 1995-11-16

Similar Documents

Publication Publication Date Title
CA2162609C (en) Method for recognizing handwritten input
US5740273A (en) Method and microprocessor for preprocessing handwriting having characters composed of a preponderance of straight line segments
US5751851A (en) Method of splitting handwritten input
EP0543590B1 (en) Method for comparing word shapes
US5557689A (en) Optical word recognition by examination of word shape
US5321770A (en) Method for determining boundaries of words in text
US5841902A (en) System and method for unconstrained on-line alpha-numerical handwriting recognition
KR100247970B1 (en) Direction correction method of document image
EP0543594B1 (en) A method for deriving wordshapes for subsequent comparison
Dave Segmentation methods for hand written character recognition
US5903666A (en) Methods of splitting and joining handwritten input
US20070140565A1 (en) Table detection in ink notes
CA1309180C (en) Method for detecting character strings
US8200043B2 (en) Page orientation detection based on selective character recognition
Yin Skew detection and block classification of printed documents
US6266445B1 (en) Classification-driven thresholding of a normalized grayscale image
US20040120555A1 (en) Slap print segmentation system and method
CA2162489A1 (en) Method of stroke segmentation for handwritten input
Chang et al. Caption analysis and recognition for building video indexing systems
US6373997B1 (en) Coarse and fine skew measurement
CN111783781A (en) Malicious clause identification method, device and equipment based on product agreement character identification
MXPA96000190A (en) Method for recognition of writing entry man
JPH05166010A (en) Character siegmenting method

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed