WO2004079526A3 - Systems and methods for source language word pattern matching - Google Patents

Systems and methods for source language word pattern matching Download PDF

Info

Publication number
WO2004079526A3
WO2004079526A3 PCT/US2004/006173 US2004006173W WO2004079526A3 WO 2004079526 A3 WO2004079526 A3 WO 2004079526A3 US 2004006173 W US2004006173 W US 2004006173W WO 2004079526 A3 WO2004079526 A3 WO 2004079526A3
Authority
WO
WIPO (PCT)
Prior art keywords
pattern matching
systems
methods
source language
word pattern
Prior art date
Application number
PCT/US2004/006173
Other languages
French (fr)
Other versions
WO2004079526A2 (en
Inventor
Mark A Walch
Original Assignee
Gannon Technologies Group
Mark A Walch
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Gannon Technologies Group, Mark A Walch filed Critical Gannon Technologies Group
Priority to AT04716134T priority Critical patent/ATE524787T1/en
Priority to EP04716134A priority patent/EP1634135B1/en
Publication of WO2004079526A2 publication Critical patent/WO2004079526A2/en
Publication of WO2004079526A3 publication Critical patent/WO2004079526A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/153Segmentation of character regions using recognition of characters or words
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/28Determining representative reference patterns, e.g. by averaging or distorting; Generating dictionaries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/19Recognition using electronic means
    • G06V30/191Design or setup of recognition systems or techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
    • G06V30/1914Determining representative reference patterns, e.g. averaging or distorting patterns; Generating dictionaries, e.g. user dictionaries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/19Recognition using electronic means
    • G06V30/196Recognition using electronic means using sequential comparisons of the image signals with a plurality of references
    • G06V30/1983Syntactic or structural pattern recognition, e.g. symbolic string recognition
    • G06V30/1988Graph matching
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Abstract

A data capture and mining method which enables identification of characters and words through their visible patterns. Specifically, a data capture and mining method involves searching a scanned image using isomorphic, graphical pattern matching techniques that eliminate both the need to convert imaged writing to electronic format through, e.g., OCR and the subsequent need to convert the electronic text into English.
PCT/US2004/006173 2003-02-28 2004-03-01 Systems and methods for source language word pattern matching WO2004079526A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
AT04716134T ATE524787T1 (en) 2003-02-28 2004-03-01 SYSTEMS AND METHODS FOR SOURCE LANGUAGE WORD PATTERN COMPARISON
EP04716134A EP1634135B1 (en) 2003-02-28 2004-03-01 Systems and methods for source language word pattern matching

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US45094803P 2003-02-28 2003-02-28
US60/450,948 2003-02-28

Publications (2)

Publication Number Publication Date
WO2004079526A2 WO2004079526A2 (en) 2004-09-16
WO2004079526A3 true WO2004079526A3 (en) 2009-02-12

Family

ID=32962548

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2004/006173 WO2004079526A2 (en) 2003-02-28 2004-03-01 Systems and methods for source language word pattern matching

Country Status (4)

Country Link
US (1) US7724956B2 (en)
EP (1) EP1634135B1 (en)
AT (1) ATE524787T1 (en)
WO (1) WO2004079526A2 (en)

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070179760A1 (en) * 2006-01-06 2007-08-02 Intel Corporation Method of determining graph isomorphism in polynomial-time
EP1974314A4 (en) * 2006-01-11 2009-09-02 Gannon Technologies Group Llc Pictographic recognition technology applied to distinctive characteristics of handwritten arabic text
US7860313B2 (en) 2006-01-11 2010-12-28 Gannon Technologies Group, Llc Methods and apparatuses for extending dynamic handwriting recognition to recognize static handwritten and machine generated text
US9147213B2 (en) * 2007-10-26 2015-09-29 Zazzle Inc. Visualizing a custom product in situ
US8452108B2 (en) * 2008-06-25 2013-05-28 Gannon Technologies Group Llc Systems and methods for image recognition using graph-based pattern matching
WO2010011180A1 (en) * 2008-07-25 2010-01-28 Resolvo Systems Pte Ltd Method and system for securing against leakage of source code
US20100189316A1 (en) * 2009-01-27 2010-07-29 Gannon Technologies Group, Llc Systems and methods for graph-based pattern recognition technology applied to the automated identification of fingerprints
US8312457B2 (en) * 2009-12-14 2012-11-13 Microsoft Corporation Maintaining a count for lock-free linked list structures
US8322384B2 (en) * 2010-03-05 2012-12-04 Whirlpool Corporation Select-fill dispensing system
US9213920B2 (en) 2010-05-28 2015-12-15 Zazzle.Com, Inc. Using infrared imaging to create digital images for use in product customization
CN102385707A (en) 2010-08-30 2012-03-21 阿里巴巴集团控股有限公司 Digital picture recognizing method and device and crawler server
CN102385700B (en) * 2010-09-01 2013-05-29 汉王科技股份有限公司 Off-line handwriting recognizing method and device
CN103827923A (en) 2011-08-31 2014-05-28 彩滋公司 Product options framework and accessories
US8712566B1 (en) 2013-03-14 2014-04-29 Zazzle Inc. Segmentation of a product markup image based on color and color differences
US10318583B2 (en) * 2013-03-15 2019-06-11 The Board Of Trustees Of The Leland Stanford Junior University Systems and methods for recommending relationships within a graph database
US9373048B1 (en) * 2014-12-24 2016-06-21 Wipro Limited Method and system for recognizing characters
US9910566B2 (en) * 2015-04-22 2018-03-06 Xerox Corporation Copy and paste operation using OCR with integrated correction application
US10121232B1 (en) * 2015-12-23 2018-11-06 Evernote Corporation Visual quality of photographs with handwritten content
US10438098B2 (en) * 2017-05-19 2019-10-08 Hand Held Products, Inc. High-speed OCR decode using depleted centerlines
US11023526B2 (en) * 2017-06-02 2021-06-01 International Business Machines Corporation System and method for graph search enhancement
CN109101971A (en) * 2017-10-23 2018-12-28 新乡市海胜网络技术有限公司 It claps to stand and turns Text region and interaction language translating method
US10621453B2 (en) * 2017-11-30 2020-04-14 Wipro Limited Method and system for determining relationship among text segments in signboards for navigating autonomous vehicles
US11216448B2 (en) * 2018-07-24 2022-01-04 Ernst & Young U.S. Llp Information storage and retrieval using an off-chain isomorphic database and a distributed ledger
US11157645B2 (en) * 2018-11-01 2021-10-26 International Business Machines Corporation Data masking with isomorphic functions

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5519786A (en) * 1994-08-09 1996-05-21 Trw Inc. Method and apparatus for implementing a weighted voting scheme for multiple optical character recognition systems
US5745600A (en) * 1992-12-17 1998-04-28 Xerox Corporation Word spotting in bitmap images using text line bounding boxes and hidden Markov models

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0610829B2 (en) 1984-06-29 1994-02-09 インタ−ナショナル ビジネス マシ−ンズ コ−ポレ−ション Handwriting recognition method
US4961231A (en) 1987-01-20 1990-10-02 Ricoh Company, Ltd. Pattern recognition method
US5267332A (en) * 1991-06-19 1993-11-30 Technibuild Inc. Image recognition system
US5559895A (en) 1991-11-08 1996-09-24 Cornell Research Foundation, Inc. Adaptive method and system for real time verification of dynamic human signatures
EP0578432A3 (en) * 1992-07-06 1994-06-22 Canon Kk Similarity determination among patterns using affine-invariant features
US5588072A (en) 1993-12-22 1996-12-24 Canon Kabushiki Kaisha Method and apparatus for selecting blocks of image data from image data having both horizontally- and vertically-oriented blocks
US5854855A (en) 1994-09-09 1998-12-29 Motorola, Inc. Method and system using meta-classes and polynomial discriminant functions for handwriting recognition
US5633957A (en) 1994-09-16 1997-05-27 Compaq Computer Corporation Method and apparatus for determining positional guidelines of handwritten data
JP2845149B2 (en) 1994-12-28 1999-01-13 日本電気株式会社 Handwritten character input device and handwritten character input method
US6556712B1 (en) * 1996-05-23 2003-04-29 Apple Computer, Inc. Methods and apparatus for handwriting recognition
WO1998035468A2 (en) 1997-01-27 1998-08-13 Benjamin Slotznick System for delivering and displaying primary and secondary information
US5930380A (en) 1997-02-11 1999-07-27 Lucent Technologies, Inc. Method and apparatus for verifying static signatures using dynamic information
US5923739A (en) * 1997-03-13 1999-07-13 Disalvo; Anthony G VCR with remote telephone programming
US5953451A (en) * 1997-06-19 1999-09-14 Xerox Corporation Method of indexing words in handwritten document images using image hash tables
US6108444A (en) 1997-09-29 2000-08-22 Xerox Corporation Method of grouping handwritten word segments in handwritten document images
US6445820B1 (en) 1998-06-29 2002-09-03 Limbic Systems, Inc. Method for conducting analysis of handwriting
CN1411586A (en) * 2000-03-06 2003-04-16 埃阿凯福斯公司 System and method for creating searchable word index of scanned document including multiple interpretations of word at given document location
EP1306775A1 (en) * 2001-10-29 2003-05-02 BRITISH TELECOMMUNICATIONS public limited company Machine translation

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5745600A (en) * 1992-12-17 1998-04-28 Xerox Corporation Word spotting in bitmap images using text line bounding boxes and hidden Markov models
US5519786A (en) * 1994-08-09 1996-05-21 Trw Inc. Method and apparatus for implementing a weighted voting scheme for multiple optical character recognition systems

Also Published As

Publication number Publication date
EP1634135A4 (en) 2009-08-19
ATE524787T1 (en) 2011-09-15
US20080247674A1 (en) 2008-10-09
WO2004079526A2 (en) 2004-09-16
US7724956B2 (en) 2010-05-25
EP1634135B1 (en) 2011-09-14
EP1634135A2 (en) 2006-03-15

Similar Documents

Publication Publication Date Title
WO2004079526A3 (en) Systems and methods for source language word pattern matching
US20030004991A1 (en) Correlating handwritten annotations to a document
AUPR824601A0 (en) Methods and system (npw004)
WO2005048188A3 (en) Method and apparatus for capturing paper-based information on a mobile computing device
WO2006124473A3 (en) System and method for capturing and processing business data
WO2007025258A3 (en) Methods and systems for biometric identification
GB2435753A (en) System and method of enabling a cellular/wireless device with imaging capabilities to decode printed alphanumeric characters
DE60217299D1 (en) HOLISTIC-ANALYTICAL DETECTION OF HAND-WRITTEN TEXT
ATE544125T1 (en) CHARACTER RECOGNITION APPARATUS AND METHOD
CN102169541A (en) Character recognition input system using optical localization and method thereof
EP1217537A3 (en) Method and apparatus for embedding translation information in text-based image data
RU2309456C2 (en) Method for recognizing text information in vector-raster image
EP1626334A3 (en) System and method for printing out image data and text data
EP2264995A3 (en) Image processing apparatus, image processing method, and computer program
WO2008137094A3 (en) Slot in housing adapted to receive at least a portion of a printed paper item for optical character recognition
EP1569139A3 (en) Method of obtaining at least a portion of a document
TW200717338A (en) Character recognition apparatus, character recognition method, and character data
EP2447854A1 (en) Method and system of automatic diacritization of Arabic
EP1296287A3 (en) Image information code processing system
WO2007133737A3 (en) Systems and methods for handwritten digital pen lexical inference
EP1296513A3 (en) Image processing apparatus
WO2003014966A3 (en) An apparatus and method for extracting information from a formatted document
US20080227062A1 (en) Phonetic teaching/correcting device for learning Mandarin
WO2004070647A3 (en) Identification card production
DE60106189D1 (en) Barcode and character recognition for a print label editor

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

DPEN Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed from 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2004716134

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2004716134

Country of ref document: EP

DPE2 Request for preliminary examination filed before expiration of 19th month from priority date (pct application filed from 20040101)