EP0567344A3 - Method and apparatus for character recognition

Method and apparatus for character recognition

Info

Publication number
EP0567344A3
Authority
EP
European Patent Office
Prior art keywords
character recognition
recognition
character
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP9393303194A
Other versions
EP0567344A2 (en)
EP0567344B1 (en)
Inventor
Shin-Ywan Wang
Mehrzad R Vaezi
Christopher A Sherrick
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Canon Information Systems Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1992-04-24
Filing date
1993-04-23
Publication date
1994-09-14
Application filed by Canon Inc, Canon Information Systems Inc
Publication of EP0567344A2
Publication of EP0567344A3
Application granted
Publication of EP0567344B1
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40 Document-oriented image-based pattern recognition
    • G06V30/41 Analysis of document content
    • G06V30/413 Classification of content, e.g. text, photographs or tables
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/14 Image acquisition
    • G06V30/148 Segmentation of character regions
    • G06V30/15 Cutting or merging image elements, e.g. region growing, watershed or clustering-based techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
    • G06V30/14 Image acquisition
    • G06V30/148 Segmentation of character regions
    • G06V30/155 Removing patterns interfering with the pattern to be recognised, such as ruled lines or underlines
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40 Document-oriented image-based pattern recognition
    • G06V30/41 Analysis of document content
    • G06V30/414 Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00 Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10 Character recognition
EP93303194A 1992-04-24 1993-04-23 Method and apparatus for character recognition Expired - Lifetime EP0567344B1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US873012 1992-04-24
US07/873,012 US5680479A (en) 1992-04-24 1992-04-24 Method and apparatus for character recognition

Publications (3)

Publication Number Publication Date
EP0567344A2 (en) 1993-10-27
EP0567344A3 (en) 1994-09-14
EP0567344B1 (en) 2002-11-06

Family

ID=25360812

Family Applications (1)

Application Number Title Priority Date Filing Date
EP93303194A Expired - Lifetime EP0567344B1 (en) 1992-04-24 1993-04-23 Method and apparatus for character recognition

Country Status (4)

Country Link
US (4) US5680479A (en)
EP (1) EP0567344B1 (en)
JP (1) JP3359095B2 (en)
DE (1) DE69332459T2 (en)

Families Citing this family (214)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5436981A (en) * 1992-06-24 1995-07-25 Canon Kabushiki Kaisha Image processing method, and apparatus therefor
US6002798A (en) * 1993-01-19 1999-12-14 Canon Kabushiki Kaisha Method and apparatus for creating, indexing and viewing abstracted documents
US6005976A (en) * 1993-02-25 1999-12-21 Fujitsu Limited Image extraction system for extracting patterns such as characters, graphics and symbols from image having frame formed by straight line portions
US5848184A (en) * 1993-03-15 1998-12-08 Unisys Corporation Document page analyzer and method
US5588072A (en) * 1993-12-22 1996-12-24 Canon Kabushiki Kaisha Method and apparatus for selecting blocks of image data from image data having both horizontally- and vertically-oriented blocks
US5987171A (en) 1994-11-10 1999-11-16 Canon Kabushiki Kaisha Page analysis system
US5754708A (en) * 1994-11-16 1998-05-19 Mita Industrial Co. Ltd. Dotted image area detecting apparatus and dotted image area detecting method
JPH08235310A (en) * 1995-02-24 1996-09-13 Nec Corp Contact character segmenting device
US6005680A (en) 1995-04-04 1999-12-21 Canon Information Systems, Inc. Method for capturing a document image, a scanner using the method and a document image management system using the scanner
US6009196A (en) * 1995-11-28 1999-12-28 Xerox Corporation Method for classifying non-running text in an image
JP3814320B2 (en) * 1995-12-14 2006-08-30 キヤノン株式会社 Image processing method and apparatus
JP3234148B2 (en) * 1996-03-07 2001-12-04 シャープ株式会社 Display control device
US5974158A (en) * 1996-03-29 1999-10-26 The Commonwealth Of Australia Commonwealth Scientific And Industrial Research Organization Aircraft detection system
US5892843A (en) * 1997-01-21 1999-04-06 Matsushita Electric Industrial Co., Ltd. Title, caption and photo extraction from scanned document images
US6023534A (en) * 1997-08-04 2000-02-08 Xerox Corporation Method of extracting image data from an area generated with a halftone pattern
JP3780103B2 (en) * 1997-09-03 2006-05-31 キヤノン株式会社 Information processing apparatus, information processing method, storage medium, and printing system
US5995659A (en) * 1997-09-09 1999-11-30 Siemens Corporate Research, Inc. Method of searching and extracting text information from drawings
US6298173B1 (en) 1997-10-03 2001-10-02 Matsushita Electric Corporation Of America Storage management system for document image database
JP3601658B2 (en) * 1997-12-19 2004-12-15 富士通株式会社 Character string extraction device and pattern extraction device
US6173073B1 (en) 1998-01-05 2001-01-09 Canon Kabushiki Kaisha System for analyzing table images
JPH11203402A (en) * 1998-01-16 1999-07-30 Canon Inc Image processor and its method
JPH11220628A (en) 1998-01-30 1999-08-10 Canon Inc Image processor and method therefor and storage medium
US6532302B2 (en) 1998-04-08 2003-03-11 Canon Kabushiki Kaisha Multiple size reductions for image segmentation
JP4035228B2 (en) 1998-05-11 2008-01-16 キヤノン株式会社 Image processing method and image processing apparatus
US6075535A (en) * 1998-06-26 2000-06-13 Hewlett-Packard Company Method and apparatus for visualizing the tile access frequencies for tiled, multi-resolution images
US6233353B1 (en) 1998-06-29 2001-05-15 Xerox Corporation System for segmenting line drawings from text within a binary digital image
US6327388B1 (en) 1998-08-14 2001-12-04 Matsushita Electric Industrial Co., Ltd. Identification of logos from document images
US6360006B1 (en) 1998-09-29 2002-03-19 Canon Kabushiki Kaisha Color block selection
US7039856B2 (en) * 1998-09-30 2006-05-02 Ricoh Co., Ltd. Automatic document classification using text and images
KR100295360B1 (en) * 1998-10-13 2001-11-26 윤종용 Image Processing Method Using Shading Algorithm
US6711292B2 (en) 1998-12-30 2004-03-23 Canon Kabushiki Kaisha Block selection of table features
AU3712600A (en) 1999-02-26 2000-09-14 Raf Technology, Inc. Method and system for identifying a reference region on an image of a dropped-out form
AU4077300A (en) 1999-04-07 2000-10-23 Raf Technology, Inc. Extracting user data from a scanned image of a pre-printed form
US7000186B1 (en) * 1999-05-03 2006-02-14 Amicas, Inc. Method and structure for electronically transmitting a text document and linked information
US6496198B1 (en) 1999-05-04 2002-12-17 Canon Kabushiki Kaisha Color editing system
JP4454789B2 (en) * 1999-05-13 2010-04-21 キヤノン株式会社 Form classification method and apparatus
JP3204259B2 (en) * 1999-10-06 2001-09-04 インターナショナル・ビジネス・マシーンズ・コーポレーション Character string extraction method, handwritten character string extraction method, character string extraction device, and image processing device
US8311946B1 (en) * 1999-10-15 2012-11-13 Ebrary Method and apparatus for improved information transactions
US20040148274A1 (en) * 1999-10-15 2004-07-29 Warnock Christopher M. Method and apparatus for improved information transactions
US7536561B2 (en) * 1999-10-15 2009-05-19 Ebrary, Inc. Method and apparatus for improved information transactions
JP2001156274A (en) * 1999-11-29 2001-06-08 Nec Corp Semiconductor storage and its manufacturing
US6718059B1 (en) 1999-12-10 2004-04-06 Canon Kabushiki Kaisha Block selection-based image processing
JP4401560B2 (en) * 1999-12-10 2010-01-20 キヤノン株式会社 Image processing apparatus, image processing method, and storage medium
US6687421B1 (en) * 2000-03-17 2004-02-03 International Business Machines Corporation Skew detection of text in a noisy digitized image
US7672022B1 (en) 2000-04-07 2010-03-02 Hewlett-Packard Development Company, L.P. Methods and apparatus for analyzing an image
JP2002032770A (en) * 2000-06-23 2002-01-31 Internatl Business Mach Corp <Ibm> Method and system for processing document and medium
JP4603658B2 (en) 2000-07-07 2010-12-22 キヤノン株式会社 Image processing apparatus, image processing method, and storage medium
US7603415B1 (en) * 2000-08-15 2009-10-13 ART Technology Group Classification of electronic messages using a hierarchy of rule sets
US7221810B2 (en) * 2000-11-13 2007-05-22 Anoto Group Ab Method and device for recording of information
US8682077B1 (en) 2000-11-28 2014-03-25 Hand Held Products, Inc. Method for omnidirectional processing of 2D images including recognizable characters
US6690826B2 (en) * 2000-12-21 2004-02-10 Micron Technology, Inc. System and method for detecting text in mixed graphics data
JP4366011B2 (en) * 2000-12-21 2009-11-18 キヤノン株式会社 Document processing apparatus and method
US6807309B1 (en) * 2000-12-27 2004-10-19 Canon Kabushiki Kaisha Linear list compression
US6909805B2 (en) * 2001-01-31 2005-06-21 Matsushita Electric Industrial Co., Ltd. Detecting and utilizing add-on information from a scanned document image
EP1237115B1 (en) * 2001-02-22 2005-05-11 Océ Print Logic Technologies S.A. Automatic table location in documents
DE60204066T2 (en) * 2001-02-22 2006-02-02 Oce Print Logic Technologies S.A. Automatic localization of tables in documents
JP2002271611A (en) * 2001-03-14 2002-09-20 Fujitsu Ltd Image processing unit
US20030042319A1 (en) * 2001-08-31 2003-03-06 Xerox Corporation Automatic and semi-automatic index generation for raster documents
US6721452B2 (en) 2001-09-12 2004-04-13 Auburn University System and method of handwritten character recognition
US6678699B2 (en) 2001-10-09 2004-01-13 International Business Machines Corporation Visual indexing of displayable digital documents
US8103104B2 (en) * 2002-01-11 2012-01-24 Hewlett-Packard Development Company, L.P. Text extraction and its application to compound document image compression
US20030210803A1 (en) 2002-03-29 2003-11-13 Canon Kabushiki Kaisha Image processing apparatus and method
JP4278918B2 (en) * 2002-04-19 2009-06-17 富士通株式会社 Image data processing apparatus and method
US7120297B2 (en) 2002-04-25 2006-10-10 Microsoft Corporation Segmented layered image system
US7164797B2 (en) 2002-04-25 2007-01-16 Microsoft Corporation Clustering
US7392472B2 (en) 2002-04-25 2008-06-24 Microsoft Corporation Layout analysis
US7110596B2 (en) 2002-04-25 2006-09-19 Microsoft Corporation System and method facilitating document image compression utilizing a mask
US7263227B2 (en) 2002-04-25 2007-08-28 Microsoft Corporation Activity detector
US7043079B2 (en) 2002-04-25 2006-05-09 Microsoft Corporation “Don't care” pixel interpolation
US7024039B2 (en) 2002-04-25 2006-04-04 Microsoft Corporation Block retouching
AU2003229313A1 (en) 2002-05-15 2003-12-02 Thomson Licensing S.A. Close captioning system in windows based graphics system
JP2004023565A (en) * 2002-06-18 2004-01-22 Canon Inc Electronic watermark burying apparatus, electronic watermark extracting apparatuses, and method thereof
US7079686B2 (en) * 2002-08-20 2006-07-18 Lexmark International, Inc. Systems and methods for content-based document image enhancement
JP4194462B2 (en) * 2002-11-12 2008-12-10 キヤノン株式会社 Digital watermark embedding method, digital watermark embedding apparatus, program for realizing them, and computer-readable storage medium
JP4538214B2 (en) * 2002-11-22 2010-09-08 オセ−テクノロジーズ・ベー・ヴエー Image segmentation by graph
JP2004193756A (en) * 2002-12-09 2004-07-08 Canon Inc Electronic watermark embedding method
JP3919656B2 (en) * 2002-12-09 2007-05-30 キヤノン株式会社 Digital watermark embedding device, digital watermark embedding method, digital watermark extraction device, digital watermark extraction method
RU2251736C2 (en) * 2002-12-17 2005-05-10 "Аби Софтвер Лтд." Method for identification of crossed symbols during recognition of hand-written text
KR100480781B1 (en) 2002-12-28 2005-04-06 삼성전자주식회사 Method of extracting teeth area from teeth image and personal identification method and apparatus using teeth image
US7283669B2 (en) * 2003-01-29 2007-10-16 Lockheed Martin Corporation Fine segmentation refinement for an optical character recognition system
US6914700B2 (en) 2003-04-17 2005-07-05 Lexmark International, Inc. Method for reducing migrating residual error in error diffusion halftoning
JP2004334339A (en) 2003-04-30 2004-11-25 Canon Inc Information processor, information processing method, and storage medium, and program
JP2004348706A (en) 2003-04-30 2004-12-09 Canon Inc Information processing device, information processing method, storage medium, and program
JP4350414B2 (en) * 2003-04-30 2009-10-21 キヤノン株式会社 Information processing apparatus, information processing method, storage medium, and program
RU2259592C2 (en) 2003-06-24 2005-08-27 "Аби Софтвер Лтд." Method for recognizing graphic objects using integrity principle
US7805307B2 (en) * 2003-09-30 2010-09-28 Sharp Laboratories Of America, Inc. Text to speech conversion system
EP1555804A3 (en) * 2004-01-19 2006-08-16 Ricoh Company, Ltd. Image processing apparatus, image processing program and storage medium
US20050281463A1 (en) * 2004-04-22 2005-12-22 Samsung Electronics Co., Ltd. Method and apparatus for processing binary image
KR100647284B1 (en) * 2004-05-21 2006-11-23 삼성전자주식회사 Apparatus and method for extracting character of image
EP1603072A1 (en) * 2004-06-02 2005-12-07 CCS Content Conversion Specialists GmbH Process and apparatus for analysing the structure of a document
TWI284288B (en) * 2004-06-04 2007-07-21 Benq Corp Text region recognition method, storage medium and system
JP2005352696A (en) * 2004-06-09 2005-12-22 Canon Inc Image processing device, control method thereof, and program
US7610274B2 (en) 2004-07-02 2009-10-27 Canon Kabushiki Kaisha Method, apparatus, and program for retrieving data
US20060045346A1 (en) 2004-08-26 2006-03-02 Hui Zhou Method and apparatus for locating and extracting captions in a digital image
JP4681870B2 (en) * 2004-12-17 2011-05-11 キヤノン株式会社 Image processing apparatus, image processing method, and computer program
JP4455357B2 (en) * 2005-01-28 2010-04-21 キヤノン株式会社 Information processing apparatus and information processing method
JP4646797B2 (en) * 2005-02-01 2011-03-09 キヤノン株式会社 Image processing apparatus, control method therefor, and program
JP4566772B2 (en) * 2005-02-14 2010-10-20 キヤノン株式会社 Image processing apparatus, image processing method, and program
US7840564B2 (en) * 2005-02-16 2010-11-23 Ebrary System and method for automatic anthology creation using document aspects
JP4443443B2 (en) * 2005-03-04 2010-03-31 富士通株式会社 Document image layout analysis program, document image layout analysis apparatus, and document image layout analysis method
JP2006253842A (en) * 2005-03-08 2006-09-21 Ricoh Co Ltd Image processor, image forming apparatus, program, storage medium and image processing method
JP2006268372A (en) * 2005-03-23 2006-10-05 Fuji Xerox Co Ltd Translation device, image processor, image forming device, translation method and program
AU2005201758B2 (en) * 2005-04-27 2008-12-18 Canon Kabushiki Kaisha Method of learning associations between documents and data sets
US7623712B2 (en) * 2005-06-09 2009-11-24 Canon Kabushiki Kaisha Image processing method and apparatus
US7555711B2 (en) * 2005-06-24 2009-06-30 Hewlett-Packard Development Company, L.P. Generating a text layout boundary from a text block in an electronic document
JP4574467B2 (en) * 2005-06-30 2010-11-04 キヤノン株式会社 Data processing apparatus, data processing method, and computer program
US7433869B2 (en) * 2005-07-01 2008-10-07 Ebrary, Inc. Method and apparatus for document clustering and document sketching
JP4708888B2 (en) * 2005-07-12 2011-06-22 キヤノン株式会社 Image processing apparatus, image processing method, and computer program
WO2007024216A1 (en) * 2005-08-23 2007-03-01 The Mazer Corporation Test scoring system and method
JP4717562B2 (en) * 2005-09-02 2011-07-06 キヤノン株式会社 Image processing apparatus and method
JP2007081482A (en) * 2005-09-09 2007-03-29 Canon Inc Terminal authentication method, apparatus and program thereof
JP4993674B2 (en) * 2005-09-09 2012-08-08 キヤノン株式会社 Information processing apparatus, verification processing apparatus, control method thereof, computer program, and storage medium
US7596270B2 (en) * 2005-09-23 2009-09-29 Dynacomware Taiwan Inc. Method of shuffling text in an Asian document image
US20100254606A1 (en) * 2005-12-08 2010-10-07 Abbyy Software Ltd Method of recognizing text information from a vector/raster image
RU2309456C2 (en) * 2005-12-08 2007-10-27 "Аби Софтвер Лтд." Method for recognizing text information in vector-raster image
JP4771804B2 (en) * 2005-12-20 2011-09-14 富士通株式会社 Layout analysis program, layout analysis apparatus, layout analysis method
US8509563B2 (en) * 2006-02-02 2013-08-13 Microsoft Corporation Generation of documents from images
US7650041B2 (en) * 2006-02-24 2010-01-19 Symbol Technologies, Inc. System and method for optical character recognition in an image
JP4799246B2 (en) * 2006-03-30 2011-10-26 キヤノン株式会社 Image processing method and image processing apparatus
JP4764231B2 (en) * 2006-03-31 2011-08-31 キヤノン株式会社 Image processing apparatus, control method, and computer program
US7734065B2 (en) * 2006-07-06 2010-06-08 Abbyy Software Ltd. Method of text information recognition from a graphical file with use of dictionaries and other supplementary data
JP4909216B2 (en) * 2006-09-13 2012-04-04 株式会社キーエンス Character segmentation device, method and program
US8631012B2 (en) * 2006-09-29 2014-01-14 A9.Com, Inc. Method and system for identifying and displaying images in response to search queries
US8971667B2 (en) * 2006-10-23 2015-03-03 Hewlett-Packard Development Company, L.P. Digital image auto-resizing
CN101276363B (en) * 2007-03-30 2011-02-16 夏普株式会社 Document image retrieval device and document image retrieval method
JP4945739B2 (en) * 2007-03-30 2012-06-06 日本電産サンキョー株式会社 Character string recognition method and character string recognition apparatus
JP4402138B2 (en) * 2007-06-29 2010-01-20 キヤノン株式会社 Image processing apparatus, image processing method, and computer program
JP4590433B2 (en) * 2007-06-29 2010-12-01 キヤノン株式会社 Image processing apparatus, image processing method, and computer program
US8238662B2 (en) * 2007-07-17 2012-08-07 Smart Technologies Ulc Method for manipulating regions of a digital image
CN101354746B (en) * 2007-07-23 2011-08-31 夏普株式会社 Device and method for extracting character image
US8731297B1 (en) * 2007-09-28 2014-05-20 Amazon Technologies, Inc. Processing a digital image of content to remove border artifacts
JP4956366B2 (en) * 2007-10-16 2012-06-20 キヤノン株式会社 Image processing device
US20090116757A1 (en) * 2007-11-06 2009-05-07 Copanion, Inc. Systems and methods for classifying electronic documents by extracting and recognizing text and image features indicative of document categories
US20090153912A1 (en) * 2007-12-18 2009-06-18 Mohamed Nooman Ahmed Scanner Calibration Strip, Scanner, and Method for Segmenting a Scanned Document Image
US8838489B2 (en) 2007-12-27 2014-09-16 Amazon Technologies, Inc. On-demand generating E-book content with advertising
JP4952627B2 (en) * 2008-03-21 2012-06-13 富士通株式会社 Image processing apparatus, image processing method, and image processing program
US7471826B1 (en) 2008-03-31 2008-12-30 International Business Machines Corporation Character segmentation by slices
US8200043B2 (en) * 2008-05-01 2012-06-12 Xerox Corporation Page orientation detection based on selective character recognition
JP5047051B2 (en) * 2008-05-02 2012-10-10 キヤノン株式会社 Image processing apparatus and image encoding method
US8023770B2 (en) * 2008-05-23 2011-09-20 Sharp Laboratories Of America, Inc. Methods and systems for identifying the orientation of a digital image
US8023741B2 (en) 2008-05-23 2011-09-20 Sharp Laboratories Of America, Inc. Methods and systems for detecting numerals in a digital image
JP5028337B2 (en) * 2008-05-30 2012-09-19 キヤノン株式会社 Image processing apparatus, image processing method, program, and storage medium
JP5171421B2 (en) * 2008-06-18 2013-03-27 キヤノン株式会社 Image processing apparatus, image processing method, and computer program
JP5132440B2 (en) * 2008-06-23 2013-01-30 キヤノン株式会社 Image processing apparatus and image processing method
JP5146190B2 (en) * 2008-08-11 2013-02-20 オムロン株式会社 Character recognition device, character recognition program, and character recognition method
US8520979B2 (en) * 2008-08-19 2013-08-27 Digimarc Corporation Methods and systems for content processing
JP5049921B2 (en) * 2008-08-26 2012-10-17 キヤノン株式会社 Image processing apparatus and image processing method
JP5049922B2 (en) * 2008-08-26 2012-10-17 キヤノン株式会社 Image processing apparatus and image processing method
JP5049920B2 (en) * 2008-08-26 2012-10-17 キヤノン株式会社 Image processing apparatus and image processing method
US8620080B2 (en) * 2008-09-26 2013-12-31 Sharp Laboratories Of America, Inc. Methods and systems for locating text in a digital image
JP2010123002A (en) * 2008-11-20 2010-06-03 Canon Inc Document image layout device
JP5350148B2 (en) * 2008-11-28 2013-11-27 キヤノン株式会社 Information processing apparatus and information processing method
JP5178490B2 (en) * 2008-12-17 2013-04-10 キヤノン株式会社 Image processing apparatus, image processing method, and computer program
US8261186B2 (en) * 2009-01-02 2012-09-04 Apple Inc. Methods for efficient cluster analysis
US8290255B2 (en) * 2009-02-06 2012-10-16 Canon Kabushiki Kaisha Image processing method, image processing apparatus, and program
US8625895B2 (en) * 2009-03-30 2014-01-07 The Neat Company, Inc. Table grid detection and separation
AU2009201252B2 (en) * 2009-03-31 2011-06-02 Canon Kabushiki Kaisha Colour correcting foreground colours for visual quality improvement
JP5312166B2 (en) * 2009-04-13 2013-10-09 キヤノン株式会社 Image processing apparatus, control method, and program
JP5208043B2 (en) * 2009-04-16 2013-06-12 キヤノン株式会社 Image processing apparatus, image processing method, and program
JP5335581B2 (en) * 2009-07-01 2013-11-06 キヤノン株式会社 Image processing apparatus, image processing method, and program
JP5361574B2 (en) 2009-07-01 2013-12-04 キヤノン株式会社 Image processing apparatus, image processing method, and program
JP5276541B2 (en) * 2009-07-27 2013-08-28 キヤノン株式会社 Image processing method, image processing apparatus, and program
JP5465015B2 (en) * 2010-01-06 2014-04-09 キヤノン株式会社 Apparatus and method for digitizing documents
US8594422B2 (en) * 2010-03-11 2013-11-26 Microsoft Corporation Page layout determination of an image undergoing optical character recognition
CN101853297A (en) * 2010-05-28 2010-10-06 英华达(南昌)科技有限公司 Method for fast obtaining expected image in electronic equipment
US8218875B2 (en) * 2010-06-12 2012-07-10 Hussein Khalid Al-Omari Method and system for preprocessing an image for optical character recognition
CN101984426B (en) * 2010-10-21 2013-04-10 优视科技有限公司 Method used for character splitting on webpage picture and device thereof
CN102479326B (en) * 2010-11-30 2013-07-24 方正国际软件(北京)有限公司 Man-operated proofreading auxiliary method of picture-text identification and system thereof
US8549399B2 (en) 2011-01-18 2013-10-01 Apple Inc. Identifying a selection of content in a structured document
US8442998B2 (en) 2011-01-18 2013-05-14 Apple Inc. Storage of a document using multiple representations
US8380753B2 (en) 2011-01-18 2013-02-19 Apple Inc. Reconstruction of lists in a document
US9002139B2 (en) 2011-02-16 2015-04-07 Adobe Systems Incorporated Methods and systems for automated image slicing
US8731296B2 (en) * 2011-04-21 2014-05-20 Seiko Epson Corporation Contact text detection in scanned images
US8818092B1 (en) * 2011-09-29 2014-08-26 Google, Inc. Multi-threaded text rendering
JP5948866B2 (en) * 2011-12-27 2016-07-06 富士ゼロックス株式会社 Image processing apparatus and program
WO2013110287A1 (en) * 2012-01-23 2013-08-01 Microsoft Corporation Vector graphics classification engine
US9990347B2 (en) 2012-01-23 2018-06-05 Microsoft Technology Licensing, Llc Borderless table detection engine
JP5950700B2 (en) 2012-06-06 2016-07-13 キヤノン株式会社 Image processing apparatus, image processing method, and program
CN103577817B (en) * 2012-07-24 2017-03-01 阿里巴巴集团控股有限公司 Form recognition method and apparatus
US9424249B1 (en) * 2012-09-18 2016-08-23 Amazon Technologies, Inc. Encoding text units
US9569679B1 (en) * 2012-12-04 2017-02-14 A9.Com, Inc. Adaptive image sampling for text detection
US9098537B2 (en) * 2012-12-20 2015-08-04 Oracle International Corporation Techniques for aligned run-length encoding
US9953008B2 (en) 2013-01-18 2018-04-24 Microsoft Technology Licensing, Llc Grouping fixed format document elements to preserve graphical data semantics after reflow by manipulating a bounding box vertically and horizontally
US9785240B2 (en) * 2013-03-18 2017-10-10 Fuji Xerox Co., Ltd. Systems and methods for content-aware selection
GB2516007B (en) * 2013-06-28 2018-05-09 Displaylink Uk Ltd Efficient encoding of display data
CN104715178B (en) * 2013-12-11 2020-04-03 深圳富泰宏精密工业有限公司 Unlocking system and method of electronic device
US20170061257A1 (en) * 2013-12-16 2017-03-02 Adobe Systems Incorporated Generation of visual pattern classes for visual pattern recognition
JP5875637B2 (en) 2013-12-19 2016-03-02 キヤノン株式会社 Image processing apparatus and image processing method
JP6494166B2 (en) * 2014-03-12 2019-04-03 キヤノン株式会社 Image processing apparatus, image processing method, and program
US11100571B1 (en) * 2014-06-10 2021-08-24 Wells Fargo Bank, N.A. Systems and methods for payee identification via camera
US9361531B2 (en) * 2014-07-21 2016-06-07 Optum, Inc. Targeted optical character recognition (OCR) for medical terminology
US20160026613A1 (en) * 2014-07-28 2016-01-28 Microsoft Corporation Processing image to identify object for insertion into document
RU2571616C1 (en) * 2014-08-12 2015-12-20 Общество с ограниченной ответственностью "Аби Девелопмент" Optical character recognition system and method, reducing processing time for images potentially not containing characters
US9384391B2 (en) * 2014-10-03 2016-07-05 Xerox Corporation Methods and systems for processing documents
US9430703B2 (en) * 2014-12-19 2016-08-30 Konica Minolta Laboratory U.S.A., Inc. Method for segmenting text words in document images using vertical projections of center zones of characters
US9984287B2 (en) * 2015-03-05 2018-05-29 Wipro Limited Method and image processing apparatus for performing optical character recognition (OCR) of an article
US10049268B2 (en) * 2015-03-06 2018-08-14 Kofax, Inc. Selective, user-mediated content recognition using mobile devices
US9811505B2 (en) * 2015-07-20 2017-11-07 Sas Institute Inc. Techniques to provide processing enhancements for a text editor in a computing environment
US9865038B2 (en) * 2015-11-25 2018-01-09 Konica Minolta Laboratory U.S.A., Inc. Offsetting rotated tables in images
CN107688788B (en) * 2017-08-31 2021-01-08 平安科技(深圳)有限公司 Document chart extraction method, electronic device and computer readable storage medium
GB201719862D0 (en) * 2017-11-29 2018-01-10 Yellow Line Parking Ltd Hierarchical image interpretation system
US10685225B2 (en) * 2017-12-29 2020-06-16 Wipro Limited Method and system for detecting text in digital engineering drawings
US10579707B2 (en) * 2017-12-29 2020-03-03 Konica Minolta Laboratory U.S.A., Inc. Method for inferring blocks of text in electronic documents
TWI671686B (en) 2018-01-24 2019-09-11 緯創資通股份有限公司 Image data retrieving method and image data retrieving device
RU2701453C1 (en) * 2018-06-25 2019-09-26 Михаил Григорьевич Блайвас Method of displaying graphic objects
JP7185451B2 (en) * 2018-09-10 2022-12-07 キヤノン株式会社 Image processing device, image processing method, and program
CN109685070B (en) * 2019-01-11 2023-01-24 上海大学(浙江·嘉兴)新兴产业研究院 Image preprocessing method
CN109871938B (en) * 2019-01-21 2023-04-25 重庆大学 Component code spraying detection method based on convolutional neural network
JP7406884B2 (en) * 2019-06-27 2023-12-28 キヤノン株式会社 Information processing device, program and control method
JPWO2022070999A1 (en) * 2020-09-30 2022-04-07
US11531454B2 (en) * 2020-12-10 2022-12-20 Microsoft Technology Licensing, Llc Selecting content in ink documents using a hierarchical data structure
US11550934B2 (en) * 2021-03-16 2023-01-10 Check Point Software Technologies, Ltd. Systems and methods for the efficient detection of improperly redacted electronic documents
US11409981B1 (en) * 2021-03-31 2022-08-09 Intuit, Inc. Document classification using signal processing
CN115082598B (en) * 2022-08-24 2023-07-11 北京百度网讯科技有限公司 Text image generation, training, text image processing method and electronic equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1980002761A1 (en) * 1979-06-01 1980-12-11 Dest Data Corp Apparatus and method for separation of optical character recognition data
US5101448A (en) * 1988-08-24 1992-03-31 Hitachi, Ltd. Method and apparatus for processing a document by utilizing an image
WO1992006448A1 (en) * 1990-09-27 1992-04-16 Cgk Computer Gesellschaft Konstanz Mbh Process for extracting individual characters from raster images of a read-in handwritten or typed series of characters in free distribution

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07107694B2 (en) * 1984-08-31 1995-11-15 株式会社日立製作所 Document processor
JPH0731714B2 (en) * 1986-05-29 1995-04-10 インタ−ナショナル ビジネス マシ−ンズ コ−ポレ−ション Character component cutting method
JPH01183784A (en) * 1988-01-19 1989-07-21 Toshiba Corp Document picture processor
US5129012A (en) * 1989-03-25 1992-07-07 Sony Corporation Detecting line segments and predetermined patterns in an optically scanned document
JP2812982B2 (en) * 1989-04-05 1998-10-22 株式会社リコー Table recognition method
JPH0816918B2 (en) * 1989-04-18 1996-02-21 シャープ株式会社 Row extraction method
JP2644041B2 (en) * 1989-05-08 1997-08-25 キヤノン株式会社 Character recognition device
JP2940936B2 (en) * 1989-06-06 1999-08-25 株式会社リコー Tablespace identification method
US5272764A (en) * 1989-12-08 1993-12-21 Xerox Corporation Detection of highlighted regions
JPH03290774A (en) * 1990-04-06 1991-12-20 Fuji Facom Corp Sentence area extracting device for document picture
JPH0490083A (en) * 1990-08-03 1992-03-24 Canon Inc Character recognizing device
US5101439A (en) * 1990-08-31 1992-03-31 At&T Bell Laboratories Segmentation process for machine reading of handwritten information
KR930002349B1 (en) * 1990-12-29 1993-03-29 주식회사 금성사 Character array divide method for press image
JPH04248687A (en) * 1991-01-23 1992-09-04 Internatl Business Mach Corp <Ibm> Layout analyzing method and system of document picture
US5307422A (en) * 1991-06-25 1994-04-26 Industrial Technology Research Institute Method and system for identifying lines of text in a document
US5351314A (en) * 1991-10-04 1994-09-27 Canon Information Systems, Inc. Method and apparatus for image enhancement using intensity dependent spread filtering
US5253304A (en) * 1991-11-27 1993-10-12 At&T Bell Laboratories Method and apparatus for image segmentation
US5335290A (en) * 1992-04-06 1994-08-02 Ricoh Corporation Segmentation of text, picture and lines of a document image

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
"LINE SEGMENTATION METHOD FOR DOCUMENTS IN EUROPEAN LANGUAGES", IBM TECHNICAL DISCLOSURE BULLETIN, vol. 33, no. 1B, June 1990 (1990-06-01), NEW YORK US, pages 207 - 210 *
FLETCHER L.A. AND KASTURI R.: "A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images", IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 10, no. 6, November 1988 (1988-11-01), NEW YORK US, pages 910 - 918, XP000112065 *
MASUDA I. ET AL.: "Approach to Smart Document Reader System", IEEE PROCEEDINGS OF THE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CAT. NO. CH2145-1/85, 23 June 1985 (1985-06-23), SAN FRANCISCO, CALIFORNIA, USA, pages 550 - 557 *
MIZUNO M. ET AL.: "Document Recognition System with Layout Structure Generator", NEC RESEARCH AND DEVELOPMENT, vol. 32, no. 3, July 1991 (1991-07-01), TOKYO JP, pages 430 - 437, XP000265886 *
PIZANO A. ET AL.: "A Business Form Recognition system", COMPSAC91 PROCEEDINGS, THE FIFTEENTH ANNUAL INTERNATIONAL COMPUTER SOFTWARE & APPLICATIONS CONFERENCE, 13 September 1991 (1991-09-13), KOGAKUIN UNIVERSITY, TOKYO, JAPAN, pages 626 - 632, XP000260573 *
YAMADA M., HASUIKE K.: "Document Image Processing Based on Enhanced Border Following Algorithm", IEEE PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, CAT. NO. CH2898-5/90, vol. 2, 21 June 1990 (1990-06-21), BALLY'S PARK PLACE HOTEL, ATLANTIC CITY, NEW JERSEY, USA, pages 231 - 236, XP000166494 *

Also Published As

Publication number Publication date
JP3359095B2 (en) 2002-12-24
US6081616A (en) 2000-06-27
US6115497A (en) 2000-09-05
JPH0668301A (en) 1994-03-11
DE69332459T2 (en) 2003-07-10
US5680478A (en) 1997-10-21
US5680479A (en) 1997-10-21
DE69332459D1 (en) 2002-12-12
EP0567344A2 (en) 1993-10-27
EP0567344B1 (en) 2002-11-06

Similar Documents

Publication Publication Date Title
EP0567344A3 (en) Method and apparatus for character recognition
HK1011437A1 (en) Character recognition method and apparatus
KR960015761B1 (en) Character generating method and apparatus
EP0584783A3 (en) Method and apparatus for improved processing
EP0542566A3 (en) Character recognition method and apparatus thereof
EP0488733A3 (en) Method and apparatus for speech recognition
EP0588074A3 (en) Method and apparatus for character recognition with supervised training
EP0551739A3 (en) Method and apparatus for connected and degraded text recognition
HK1011429A1 (en) Character input method and apparatus
EP0519714A3 (en) Apparatus and method for recognizing characters
GB9301635D0 (en) Method and apparatus
EP0690405A3 (en) Handwritten character entry method and apparatus
EP0576020A3 (en) Character recognizing method and apparatus.
EP0575135A3 (en) Information processing method and apparatus
GB2270604B (en) Scanning method and apparatus
GB9226594D0 (en) Autolevelling method and apparatus
GB9322871D0 (en) Method and apparatus
GB9325993D0 (en) Method and apparatus for connection
GB2270771B (en) Weigh-filling method and apparatus
EP0586217A3 (en) Method and apparatus for recognition template enhancement
EP0595243A3 (en) Punching method and punching apparatus
EP0585098A3 (en) Sign recognition apparatus and method and sign translation system using same.
EP0457547A3 (en) Information recognition apparatus and method
EP0583477A4 (en) Printing method and apparatus
EP0541365A3 (en) Character recognition method and apparatus

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): DE FR GB IT

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): DE FR GB IT

17P Request for examination filed

Effective date: 19950127

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: CANON KABUSHIKI KAISHA

17Q First examination report despatched

Effective date: 19990520

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): DE FR GB IT

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

REF Corresponds to:

Ref document number: 69332459

Country of ref document: DE

Date of ref document: 20021212

ET Fr: translation filed

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20030807

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20120430

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20120425

Year of fee payment: 20

Ref country code: FR

Payment date: 20120504

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: IT

Payment date: 20120417

Year of fee payment: 20

REG Reference to a national code

Ref country code: DE

Ref legal event code: R071

Ref document number: 69332459

Country of ref document: DE

REG Reference to a national code

Ref country code: DE

Ref legal event code: R071

Ref document number: 69332459

Country of ref document: DE

REG Reference to a national code

Ref country code: GB

Ref legal event code: PE20

Expiry date: 20130422

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20130422

Ref country code: DE

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20130424