EP0567344A3 - Method and apparatus for character recognition - Google Patents
Method and apparatus for character recognition
- Publication number
- EP0567344A3
- Authority
- EP
- European Patent Office
- Prior art keywords
- character recognition
- recognition
- character
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
- G06V30/41—Analysis of document content
- G06V30/413—Classification of content, e.g. text, photographs or tables
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
- G06V30/148—Segmentation of character regions
- G06V30/15—Cutting or merging image elements, e.g. region growing, watershed or clustering-based techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
- G06V30/148—Segmentation of character regions
- G06V30/155—Removing patterns interfering with the pattern to be recognised, such as ruled lines or underlines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/40—Document-oriented image-based pattern recognition
- G06V30/41—Analysis of document content
- G06V30/414—Extracting the geometrical structure, e.g. layout tree; Block segmentation, e.g. bounding boxes for graphics or text
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US873012 | 1992-04-24 | ||
US07/873,012 US5680479A (en) | 1992-04-24 | 1992-04-24 | Method and apparatus for character recognition |
Publications (3)
Publication Number | Publication Date |
---|---|
EP0567344A2 (en) | 1993-10-27 |
EP0567344A3 (en) | 1994-09-14 |
EP0567344B1 (en) | 2002-11-06 |
Family
ID=25360812
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP93303194A Expired - Lifetime EP0567344B1 (en) | 1992-04-24 | 1993-04-23 | Method and apparatus for character recognition |
Country Status (4)
Country | Link |
---|---|
US (4) | US5680479A (en) |
EP (1) | EP0567344B1 (en) |
JP (1) | JP3359095B2 (en) |
DE (1) | DE69332459T2 (en) |
Families Citing this family (214)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5436981A (en) * | 1992-06-24 | 1995-07-25 | Canon Kabushiki Kaisha | Image processing method, and apparatus therefor |
US6002798A (en) * | 1993-01-19 | 1999-12-14 | Canon Kabushiki Kaisha | Method and apparatus for creating, indexing and viewing abstracted documents |
US6005976A (en) * | 1993-02-25 | 1999-12-21 | Fujitsu Limited | Image extraction system for extracting patterns such as characters, graphics and symbols from image having frame formed by straight line portions |
US5848184A (en) * | 1993-03-15 | 1998-12-08 | Unisys Corporation | Document page analyzer and method |
US5588072A (en) * | 1993-12-22 | 1996-12-24 | Canon Kabushiki Kaisha | Method and apparatus for selecting blocks of image data from image data having both horizontally- and vertically-oriented blocks |
US5987171A (en) | 1994-11-10 | 1999-11-16 | Canon Kabushiki Kaisha | Page analysis system |
US5754708A (en) * | 1994-11-16 | 1998-05-19 | Mita Industrial Co. Ltd. | Dotted image area detecting apparatus and dotted image area detecting method |
JPH08235310A (en) * | 1995-02-24 | 1996-09-13 | Nec Corp | Contact character segmenting device |
US6005680A (en) | 1995-04-04 | 1999-12-21 | Canon Information Systems, Inc. | Method for capturing a document image, a scanner using the method and a document image management system using the scanner |
US6009196A (en) * | 1995-11-28 | 1999-12-28 | Xerox Corporation | Method for classifying non-running text in an image |
JP3814320B2 (en) * | 1995-12-14 | 2006-08-30 | キヤノン株式会社 | Image processing method and apparatus |
JP3234148B2 (en) * | 1996-03-07 | 2001-12-04 | シャープ株式会社 | Display control device |
US5974158A (en) * | 1996-03-29 | 1999-10-26 | The Commonwealth Of Australia Commonwealth Scientific And Industrial Research Organization | Aircraft detection system |
US5892843A (en) * | 1997-01-21 | 1999-04-06 | Matsushita Electric Industrial Co., Ltd. | Title, caption and photo extraction from scanned document images |
US6023534A (en) * | 1997-08-04 | 2000-02-08 | Xerox Corporation | Method of extracting image data from an area generated with a halftone pattern |
JP3780103B2 (en) * | 1997-09-03 | 2006-05-31 | キヤノン株式会社 | Information processing apparatus, information processing method, storage medium, and printing system |
US5995659A (en) * | 1997-09-09 | 1999-11-30 | Siemens Corporate Research, Inc. | Method of searching and extracting text information from drawings |
US6298173B1 (en) | 1997-10-03 | 2001-10-02 | Matsushita Electric Corporation Of America | Storage management system for document image database |
JP3601658B2 (en) * | 1997-12-19 | 2004-12-15 | 富士通株式会社 | Character string extraction device and pattern extraction device |
US6173073B1 (en) | 1998-01-05 | 2001-01-09 | Canon Kabushiki Kaisha | System for analyzing table images |
JPH11203402A (en) * | 1998-01-16 | 1999-07-30 | Canon Inc | Image processor and its method |
JPH11220628A (en) | 1998-01-30 | 1999-08-10 | Canon Inc | Image processor and method therefor and storage medium |
US6532302B2 (en) | 1998-04-08 | 2003-03-11 | Canon Kabushiki Kaisha | Multiple size reductions for image segmentation |
JP4035228B2 (en) | 1998-05-11 | 2008-01-16 | キヤノン株式会社 | Image processing method and image processing apparatus |
US6075535A (en) * | 1998-06-26 | 2000-06-13 | Hewlett-Packard Company | Method and apparatus for visualizing the tile access frequencies for tiled, multi-resolution images |
US6233353B1 (en) | 1998-06-29 | 2001-05-15 | Xerox Corporation | System for segmenting line drawings from text within a binary digital image |
US6327388B1 (en) | 1998-08-14 | 2001-12-04 | Matsushita Electric Industrial Co., Ltd. | Identification of logos from document images |
US6360006B1 (en) | 1998-09-29 | 2002-03-19 | Canon Kabushiki Kaisha | Color block selection |
US7039856B2 (en) * | 1998-09-30 | 2006-05-02 | Ricoh Co., Ltd. | Automatic document classification using text and images |
KR100295360B1 (en) * | 1998-10-13 | 2001-11-26 | 윤종용 | Image Processing Method Using Shading Algorithm |
US6711292B2 (en) | 1998-12-30 | 2004-03-23 | Canon Kabushiki Kaisha | Block selection of table features |
AU3712600A (en) | 1999-02-26 | 2000-09-14 | Raf Technology, Inc. | Method and system for identifying a reference region on an image of a dropped-out form |
AU4077300A (en) | 1999-04-07 | 2000-10-23 | Raf Technology, Inc. | Extracting user data from a scanned image of a pre-printed form |
US7000186B1 (en) * | 1999-05-03 | 2006-02-14 | Amicas, Inc. | Method and structure for electronically transmitting a text document and linked information |
US6496198B1 (en) | 1999-05-04 | 2002-12-17 | Canon Kabushiki Kaisha | Color editing system |
JP4454789B2 (en) * | 1999-05-13 | 2010-04-21 | キヤノン株式会社 | Form classification method and apparatus |
JP3204259B2 (en) * | 1999-10-06 | 2001-09-04 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Character string extraction method, handwritten character string extraction method, character string extraction device, and image processing device |
US8311946B1 (en) * | 1999-10-15 | 2012-11-13 | Ebrary | Method and apparatus for improved information transactions |
US20040148274A1 (en) * | 1999-10-15 | 2004-07-29 | Warnock Christopher M. | Method and apparatus for improved information transactions |
US7536561B2 (en) * | 1999-10-15 | 2009-05-19 | Ebrary, Inc. | Method and apparatus for improved information transactions |
JP2001156274A (en) * | 1999-11-29 | 2001-06-08 | Nec Corp | Semiconductor storage and its manufacturing |
US6718059B1 (en) | 1999-12-10 | 2004-04-06 | Canon Kabushiki Kaisha | Block selection-based image processing |
JP4401560B2 (en) * | 1999-12-10 | 2010-01-20 | キヤノン株式会社 | Image processing apparatus, image processing method, and storage medium |
US6687421B1 (en) * | 2000-03-17 | 2004-02-03 | International Business Machines Corporation | Skew detection of text in a noisy digitized image |
US7672022B1 (en) | 2000-04-07 | 2010-03-02 | Hewlett-Packard Development Company, L.P. | Methods and apparatus for analyzing an image |
JP2002032770A (en) * | 2000-06-23 | 2002-01-31 | Internatl Business Mach Corp <Ibm> | Method and system for processing document and medium |
JP4603658B2 (en) | 2000-07-07 | 2010-12-22 | キヤノン株式会社 | Image processing apparatus, image processing method, and storage medium |
US7603415B1 (en) * | 2000-08-15 | 2009-10-13 | ART Technology Group | Classification of electronic messages using a hierarchy of rule sets |
US7221810B2 (en) * | 2000-11-13 | 2007-05-22 | Anoto Group Ab | Method and device for recording of information |
US8682077B1 (en) | 2000-11-28 | 2014-03-25 | Hand Held Products, Inc. | Method for omnidirectional processing of 2D images including recognizable characters |
US6690826B2 (en) * | 2000-12-21 | 2004-02-10 | Micron Technology, Inc. | System and method for detecting text in mixed graphics data |
JP4366011B2 (en) * | 2000-12-21 | 2009-11-18 | キヤノン株式会社 | Document processing apparatus and method |
US6807309B1 (en) * | 2000-12-27 | 2004-10-19 | Canon Kabushiki Kaisha | Linear list compression |
US6909805B2 (en) * | 2001-01-31 | 2005-06-21 | Matsushita Electric Industrial Co., Ltd. | Detecting and utilizing add-on information from a scanned document image |
EP1237115B1 (en) * | 2001-02-22 | 2005-05-11 | Océ Print Logic Technologies S.A. | Automatic table location in documents |
DE60204066T2 (en) * | 2001-02-22 | 2006-02-02 | Oce Print Logic Technologies S.A. | Automatic localization of tables in documents |
JP2002271611A (en) * | 2001-03-14 | 2002-09-20 | Fujitsu Ltd | Image processing unit |
US20030042319A1 (en) * | 2001-08-31 | 2003-03-06 | Xerox Corporation | Automatic and semi-automatic index generation for raster documents |
US6721452B2 (en) | 2001-09-12 | 2004-04-13 | Auburn University | System and method of handwritten character recognition |
US6678699B2 (en) | 2001-10-09 | 2004-01-13 | International Business Machines Corporation | Visual indexing of displayable digital documents |
US8103104B2 (en) * | 2002-01-11 | 2012-01-24 | Hewlett-Packard Development Company, L.P. | Text extraction and its application to compound document image compression |
US20030210803A1 (en) | 2002-03-29 | 2003-11-13 | Canon Kabushiki Kaisha | Image processing apparatus and method |
JP4278918B2 (en) * | 2002-04-19 | 2009-06-17 | 富士通株式会社 | Image data processing apparatus and method |
US7120297B2 (en) | 2002-04-25 | 2006-10-10 | Microsoft Corporation | Segmented layered image system |
US7164797B2 (en) | 2002-04-25 | 2007-01-16 | Microsoft Corporation | Clustering |
US7392472B2 (en) | 2002-04-25 | 2008-06-24 | Microsoft Corporation | Layout analysis |
US7110596B2 (en) | 2002-04-25 | 2006-09-19 | Microsoft Corporation | System and method facilitating document image compression utilizing a mask |
US7263227B2 (en) | 2002-04-25 | 2007-08-28 | Microsoft Corporation | Activity detector |
US7043079B2 (en) | 2002-04-25 | 2006-05-09 | Microsoft Corporation | “Don't care” pixel interpolation |
US7024039B2 (en) | 2002-04-25 | 2006-04-04 | Microsoft Corporation | Block retouching |
AU2003229313A1 (en) | 2002-05-15 | 2003-12-02 | Thomson Licensing S.A. | Close captioning system in windows based graphics system |
JP2004023565A (en) * | 2002-06-18 | 2004-01-22 | Canon Inc | Electronic watermark burying apparatus, electronic watermark extracting apparatuses, and method thereof |
US7079686B2 (en) * | 2002-08-20 | 2006-07-18 | Lexmark International, Inc. | Systems and methods for content-based document image enhancement |
JP4194462B2 (en) * | 2002-11-12 | 2008-12-10 | キヤノン株式会社 | Digital watermark embedding method, digital watermark embedding apparatus, program for realizing them, and computer-readable storage medium |
JP4538214B2 (en) * | 2002-11-22 | 2010-09-08 | オセ−テクノロジーズ・ベー・ヴエー | Image segmentation by graph |
JP2004193756A (en) * | 2002-12-09 | 2004-07-08 | Canon Inc | Electronic watermark embedding method |
JP3919656B2 (en) * | 2002-12-09 | 2007-05-30 | キヤノン株式会社 | Digital watermark embedding device, digital watermark embedding method, digital watermark extraction device, digital watermark extraction method |
RU2251736C2 (en) * | 2002-12-17 | 2005-05-10 | "Аби Софтвер Лтд." | Method for identification of crossed symbols during recognition of hand-written text |
KR100480781B1 (en) | 2002-12-28 | 2005-04-06 | 삼성전자주식회사 | Method of extracting teeth area from teeth image and personal identification method and apparatus using teeth image |
US7283669B2 (en) * | 2003-01-29 | 2007-10-16 | Lockheed Martin Corporation | Fine segmentation refinement for an optical character recognition system |
US6914700B2 (en) | 2003-04-17 | 2005-07-05 | Lexmark International, Inc. | Method for reducing migrating residual error in error diffusion halftoning |
JP2004334339A (en) | 2003-04-30 | 2004-11-25 | Canon Inc | Information processor, information processing method, and storage medium, and program |
JP2004348706A (en) | 2003-04-30 | 2004-12-09 | Canon Inc | Information processing device, information processing method, storage medium, and program |
JP4350414B2 (en) * | 2003-04-30 | 2009-10-21 | キヤノン株式会社 | Information processing apparatus, information processing method, storage medium, and program |
RU2259592C2 (en) | 2003-06-24 | 2005-08-27 | "Аби Софтвер Лтд." | Method for recognizing graphic objects using integrity principle |
US7805307B2 (en) * | 2003-09-30 | 2010-09-28 | Sharp Laboratories Of America, Inc. | Text to speech conversion system |
EP1555804A3 (en) * | 2004-01-19 | 2006-08-16 | Ricoh Company, Ltd. | Image processing apparatus, image processing program and storage medium |
US20050281463A1 (en) * | 2004-04-22 | 2005-12-22 | Samsung Electronics Co., Ltd. | Method and apparatus for processing binary image |
KR100647284B1 (en) * | 2004-05-21 | 2006-11-23 | 삼성전자주식회사 | Apparatus and method for extracting character of image |
EP1603072A1 (en) * | 2004-06-02 | 2005-12-07 | CCS Content Conversion Specialists GmbH | Process and apparatus for analysing the structure of a document |
TWI284288B (en) * | 2004-06-04 | 2007-07-21 | Benq Corp | Text region recognition method, storage medium and system |
JP2005352696A (en) * | 2004-06-09 | 2005-12-22 | Canon Inc | Image processing device, control method thereof, and program |
US7610274B2 (en) | 2004-07-02 | 2009-10-27 | Canon Kabushiki Kaisha | Method, apparatus, and program for retrieving data |
US20060045346A1 (en) | 2004-08-26 | 2006-03-02 | Hui Zhou | Method and apparatus for locating and extracting captions in a digital image |
JP4681870B2 (en) * | 2004-12-17 | 2011-05-11 | キヤノン株式会社 | Image processing apparatus, image processing method, and computer program |
JP4455357B2 (en) * | 2005-01-28 | 2010-04-21 | キヤノン株式会社 | Information processing apparatus and information processing method |
JP4646797B2 (en) * | 2005-02-01 | 2011-03-09 | キヤノン株式会社 | Image processing apparatus, control method therefor, and program |
JP4566772B2 (en) * | 2005-02-14 | 2010-10-20 | キヤノン株式会社 | Image processing apparatus, image processing method, and program |
US7840564B2 (en) * | 2005-02-16 | 2010-11-23 | Ebrary | System and method for automatic anthology creation using document aspects |
JP4443443B2 (en) * | 2005-03-04 | 2010-03-31 | 富士通株式会社 | Document image layout analysis program, document image layout analysis apparatus, and document image layout analysis method |
JP2006253842A (en) * | 2005-03-08 | 2006-09-21 | Ricoh Co Ltd | Image processor, image forming apparatus, program, storage medium and image processing method |
JP2006268372A (en) * | 2005-03-23 | 2006-10-05 | Fuji Xerox Co Ltd | Translation device, image processor, image forming device, translation method and program |
AU2005201758B2 (en) * | 2005-04-27 | 2008-12-18 | Canon Kabushiki Kaisha | Method of learning associations between documents and data sets |
US7623712B2 (en) * | 2005-06-09 | 2009-11-24 | Canon Kabushiki Kaisha | Image processing method and apparatus |
US7555711B2 (en) * | 2005-06-24 | 2009-06-30 | Hewlett-Packard Development Company, L.P. | Generating a text layout boundary from a text block in an electronic document |
JP4574467B2 (en) * | 2005-06-30 | 2010-11-04 | キヤノン株式会社 | Data processing apparatus, data processing method, and computer program |
US7433869B2 (en) * | 2005-07-01 | 2008-10-07 | Ebrary, Inc. | Method and apparatus for document clustering and document sketching |
JP4708888B2 (en) * | 2005-07-12 | 2011-06-22 | キヤノン株式会社 | Image processing apparatus, image processing method, and computer program |
WO2007024216A1 (en) * | 2005-08-23 | 2007-03-01 | The Mazer Corporation | Test scoring system and method |
JP4717562B2 (en) * | 2005-09-02 | 2011-07-06 | キヤノン株式会社 | Image processing apparatus and method |
JP2007081482A (en) * | 2005-09-09 | 2007-03-29 | Canon Inc | Terminal authentication method, apparatus and program thereof |
JP4993674B2 (en) * | 2005-09-09 | 2012-08-08 | キヤノン株式会社 | Information processing apparatus, verification processing apparatus, control method thereof, computer program, and storage medium |
US7596270B2 (en) * | 2005-09-23 | 2009-09-29 | Dynacomware Taiwan Inc. | Method of shuffling text in an Asian document image |
US20100254606A1 (en) * | 2005-12-08 | 2010-10-07 | Abbyy Software Ltd | Method of recognizing text information from a vector/raster image |
RU2309456C2 (en) * | 2005-12-08 | 2007-10-27 | "Аби Софтвер Лтд." | Method for recognizing text information in vector-raster image |
JP4771804B2 (en) * | 2005-12-20 | 2011-09-14 | 富士通株式会社 | Layout analysis program, layout analysis apparatus, layout analysis method |
US8509563B2 (en) * | 2006-02-02 | 2013-08-13 | Microsoft Corporation | Generation of documents from images |
US7650041B2 (en) * | 2006-02-24 | 2010-01-19 | Symbol Technologies, Inc. | System and method for optical character recognition in an image |
JP4799246B2 (en) * | 2006-03-30 | 2011-10-26 | キヤノン株式会社 | Image processing method and image processing apparatus |
JP4764231B2 (en) * | 2006-03-31 | 2011-08-31 | キヤノン株式会社 | Image processing apparatus, control method, and computer program |
US7734065B2 (en) * | 2006-07-06 | 2010-06-08 | Abbyy Software Ltd. | Method of text information recognition from a graphical file with use of dictionaries and other supplementary data |
JP4909216B2 (en) * | 2006-09-13 | 2012-04-04 | 株式会社キーエンス | Character segmentation device, method and program |
US8631012B2 (en) * | 2006-09-29 | 2014-01-14 | A9.Com, Inc. | Method and system for identifying and displaying images in response to search queries |
US8971667B2 (en) * | 2006-10-23 | 2015-03-03 | Hewlett-Packard Development Company, L.P. | Digital image auto-resizing |
CN101276363B (en) * | 2007-03-30 | 2011-02-16 | 夏普株式会社 | Document image retrieval device and document image retrieval method |
JP4945739B2 (en) * | 2007-03-30 | 2012-06-06 | 日本電産サンキョー株式会社 | Character string recognition method and character string recognition apparatus |
JP4402138B2 (en) * | 2007-06-29 | 2010-01-20 | キヤノン株式会社 | Image processing apparatus, image processing method, and computer program |
JP4590433B2 (en) * | 2007-06-29 | 2010-12-01 | キヤノン株式会社 | Image processing apparatus, image processing method, and computer program |
US8238662B2 (en) * | 2007-07-17 | 2012-08-07 | Smart Technologies Ulc | Method for manipulating regions of a digital image |
CN101354746B (en) * | 2007-07-23 | 2011-08-31 | 夏普株式会社 | Device and method for extracting character image |
US8731297B1 (en) * | 2007-09-28 | 2014-05-20 | Amazon Technologies, Inc. | Processing a digital image of content to remove border artifacts |
JP4956366B2 (en) * | 2007-10-16 | 2012-06-20 | キヤノン株式会社 | Image processing device |
US20090116757A1 (en) * | 2007-11-06 | 2009-05-07 | Copanion, Inc. | Systems and methods for classifying electronic documents by extracting and recognizing text and image features indicative of document categories |
US20090153912A1 (en) * | 2007-12-18 | 2009-06-18 | Mohamed Nooman Ahmed | Scanner Calibration Strip, Scanner, and Method for Segmenting a Scanned Document Image |
US8838489B2 (en) | 2007-12-27 | 2014-09-16 | Amazon Technologies, Inc. | On-demand generating E-book content with advertising |
JP4952627B2 (en) * | 2008-03-21 | 2012-06-13 | 富士通株式会社 | Image processing apparatus, image processing method, and image processing program |
US7471826B1 (en) | 2008-03-31 | 2008-12-30 | International Business Machines Corporation | Character segmentation by slices |
US8200043B2 (en) * | 2008-05-01 | 2012-06-12 | Xerox Corporation | Page orientation detection based on selective character recognition |
JP5047051B2 (en) * | 2008-05-02 | 2012-10-10 | キヤノン株式会社 | Image processing apparatus and image encoding method |
US8023770B2 (en) * | 2008-05-23 | 2011-09-20 | Sharp Laboratories Of America, Inc. | Methods and systems for identifying the orientation of a digital image |
US8023741B2 (en) | 2008-05-23 | 2011-09-20 | Sharp Laboratories Of America, Inc. | Methods and systems for detecting numerals in a digital image |
JP5028337B2 (en) * | 2008-05-30 | 2012-09-19 | キヤノン株式会社 | Image processing apparatus, image processing method, program, and storage medium |
JP5171421B2 (en) * | 2008-06-18 | 2013-03-27 | キヤノン株式会社 | Image processing apparatus, image processing method, and computer program |
JP5132440B2 (en) * | 2008-06-23 | 2013-01-30 | キヤノン株式会社 | Image processing apparatus and image processing method |
JP5146190B2 (en) * | 2008-08-11 | 2013-02-20 | オムロン株式会社 | Character recognition device, character recognition program, and character recognition method |
US8520979B2 (en) * | 2008-08-19 | 2013-08-27 | Digimarc Corporation | Methods and systems for content processing |
JP5049921B2 (en) * | 2008-08-26 | 2012-10-17 | キヤノン株式会社 | Image processing apparatus and image processing method |
JP5049922B2 (en) * | 2008-08-26 | 2012-10-17 | キヤノン株式会社 | Image processing apparatus and image processing method |
JP5049920B2 (en) * | 2008-08-26 | 2012-10-17 | キヤノン株式会社 | Image processing apparatus and image processing method |
US8620080B2 (en) * | 2008-09-26 | 2013-12-31 | Sharp Laboratories Of America, Inc. | Methods and systems for locating text in a digital image |
JP2010123002A (en) * | 2008-11-20 | 2010-06-03 | Canon Inc | Document image layout device |
JP5350148B2 (en) * | 2008-11-28 | 2013-11-27 | キヤノン株式会社 | Information processing apparatus and information processing method |
JP5178490B2 (en) * | 2008-12-17 | 2013-04-10 | キヤノン株式会社 | Image processing apparatus, image processing method, and computer program |
US8261186B2 (en) * | 2009-01-02 | 2012-09-04 | Apple Inc. | Methods for efficient cluster analysis |
US8290255B2 (en) * | 2009-02-06 | 2012-10-16 | Canon Kabushiki Kaisha | Image processing method, image processing apparatus, and program |
US8625895B2 (en) * | 2009-03-30 | 2014-01-07 | The Neat Company, Inc. | Table grid detection and separation |
AU2009201252B2 (en) * | 2009-03-31 | 2011-06-02 | Canon Kabushiki Kaisha | Colour correcting foreground colours for visual quality improvement |
JP5312166B2 (en) * | 2009-04-13 | 2013-10-09 | キヤノン株式会社 | Image processing apparatus, control method, and program |
JP5208043B2 (en) * | 2009-04-16 | 2013-06-12 | キヤノン株式会社 | Image processing apparatus, image processing method, and program |
JP5335581B2 (en) * | 2009-07-01 | 2013-11-06 | キヤノン株式会社 | Image processing apparatus, image processing method, and program |
JP5361574B2 (en) | 2009-07-01 | 2013-12-04 | キヤノン株式会社 | Image processing apparatus, image processing method, and program |
JP5276541B2 (en) * | 2009-07-27 | 2013-08-28 | キヤノン株式会社 | Image processing method, image processing apparatus, and program |
JP5465015B2 (en) * | 2010-01-06 | 2014-04-09 | キヤノン株式会社 | Apparatus and method for digitizing documents |
US8594422B2 (en) * | 2010-03-11 | 2013-11-26 | Microsoft Corporation | Page layout determination of an image undergoing optical character recognition |
CN101853297A (en) * | 2010-05-28 | 2010-10-06 | 英华达(南昌)科技有限公司 | Method for fast obtaining expected image in electronic equipment |
US8218875B2 (en) * | 2010-06-12 | 2012-07-10 | Hussein Khalid Al-Omari | Method and system for preprocessing an image for optical character recognition |
CN101984426B (en) * | 2010-10-21 | 2013-04-10 | 优视科技有限公司 | Method used for character splitting on webpage picture and device thereof |
CN102479326B (en) * | 2010-11-30 | 2013-07-24 | 方正国际软件(北京)有限公司 | Man-operated proofreading auxiliary method of picture-text identification and system thereof |
US8549399B2 (en) | 2011-01-18 | 2013-10-01 | Apple Inc. | Identifying a selection of content in a structured document |
US8442998B2 (en) | 2011-01-18 | 2013-05-14 | Apple Inc. | Storage of a document using multiple representations |
US8380753B2 (en) | 2011-01-18 | 2013-02-19 | Apple Inc. | Reconstruction of lists in a document |
US9002139B2 (en) | 2011-02-16 | 2015-04-07 | Adobe Systems Incorporated | Methods and systems for automated image slicing |
US8731296B2 (en) * | 2011-04-21 | 2014-05-20 | Seiko Epson Corporation | Contact text detection in scanned images |
US8818092B1 (en) * | 2011-09-29 | 2014-08-26 | Google, Inc. | Multi-threaded text rendering |
JP5948866B2 (en) * | 2011-12-27 | 2016-07-06 | 富士ゼロックス株式会社 | Image processing apparatus and program |
WO2013110287A1 (en) * | 2012-01-23 | 2013-08-01 | Microsoft Corporation | Vector graphics classification engine |
US9990347B2 (en) | 2012-01-23 | 2018-06-05 | Microsoft Technology Licensing, Llc | Borderless table detection engine |
JP5950700B2 (en) | 2012-06-06 | 2016-07-13 | キヤノン株式会社 | Image processing apparatus, image processing method, and program |
CN103577817B (en) * | 2012-07-24 | 2017-03-01 | 阿里巴巴集团控股有限公司 | Form recognition method and apparatus |
US9424249B1 (en) * | 2012-09-18 | 2016-08-23 | Amazon Technologies, Inc. | Encoding text units |
US9569679B1 (en) * | 2012-12-04 | 2017-02-14 | A9.Com, Inc. | Adaptive image sampling for text detection |
US9098537B2 (en) * | 2012-12-20 | 2015-08-04 | Oracle International Corporation | Techniques for aligned run-length encoding |
US9953008B2 (en) | 2013-01-18 | 2018-04-24 | Microsoft Technology Licensing, Llc | Grouping fixed format document elements to preserve graphical data semantics after reflow by manipulating a bounding box vertically and horizontally |
US9785240B2 (en) * | 2013-03-18 | 2017-10-10 | Fuji Xerox Co., Ltd. | Systems and methods for content-aware selection |
GB2516007B (en) * | 2013-06-28 | 2018-05-09 | Displaylink Uk Ltd | Efficient encoding of display data |
CN104715178B (en) * | 2013-12-11 | 2020-04-03 | 深圳富泰宏精密工业有限公司 | Unlocking system and method of electronic device |
US20170061257A1 (en) * | 2013-12-16 | 2017-03-02 | Adobe Systems Incorporated | Generation of visual pattern classes for visual pattern recognition |
JP5875637B2 (en) | 2013-12-19 | 2016-03-02 | キヤノン株式会社 | Image processing apparatus and image processing method |
JP6494166B2 (en) * | 2014-03-12 | 2019-04-03 | キヤノン株式会社 | Image processing apparatus, image processing method, and program |
US11100571B1 (en) * | 2014-06-10 | 2021-08-24 | Wells Fargo Bank, N.A. | Systems and methods for payee identification via camera |
US9361531B2 (en) * | 2014-07-21 | 2016-06-07 | Optum, Inc. | Targeted optical character recognition (OCR) for medical terminology |
US20160026613A1 (en) * | 2014-07-28 | 2016-01-28 | Microsoft Corporation | Processing image to identify object for insertion into document |
RU2571616C1 (en) * | 2014-08-12 | 2015-12-20 | Общество с ограниченной ответственностью "Аби Девелопмент" | Optical character recognition system and method, reducing processing time for images potentially not containing characters |
US9384391B2 (en) * | 2014-10-03 | 2016-07-05 | Xerox Corporation | Methods and systems for processing documents |
US9430703B2 (en) * | 2014-12-19 | 2016-08-30 | Konica Minolta Laboratory U.S.A., Inc. | Method for segmenting text words in document images using vertical projections of center zones of characters |
US9984287B2 (en) * | 2015-03-05 | 2018-05-29 | Wipro Limited | Method and image processing apparatus for performing optical character recognition (OCR) of an article |
US10049268B2 (en) * | 2015-03-06 | 2018-08-14 | Kofax, Inc. | Selective, user-mediated content recognition using mobile devices |
US9811505B2 (en) * | 2015-07-20 | 2017-11-07 | Sas Institute Inc. | Techniques to provide processing enhancements for a text editor in a computing environment |
US9865038B2 (en) * | 2015-11-25 | 2018-01-09 | Konica Minolta Laboratory U.S.A., Inc. | Offsetting rotated tables in images |
CN107688788B (en) * | 2017-08-31 | 2021-01-08 | 平安科技(深圳)有限公司 | Document chart extraction method, electronic device and computer readable storage medium |
GB201719862D0 (en) * | 2017-11-29 | 2018-01-10 | Yellow Line Parking Ltd | Hierarchical image interpretation system |
US10685225B2 (en) * | 2017-12-29 | 2020-06-16 | Wipro Limited | Method and system for detecting text in digital engineering drawings |
US10579707B2 (en) * | 2017-12-29 | 2020-03-03 | Konica Minolta Laboratory U.S.A., Inc. | Method for inferring blocks of text in electronic documents |
TWI671686B (en) | 2018-01-24 | 2019-09-11 | 緯創資通股份有限公司 | Image data retrieving method and image data retrieving device |
RU2701453C1 (en) * | 2018-06-25 | 2019-09-26 | Михаил Григорьевич Блайвас | Method of displaying graphic objects |
JP7185451B2 (en) * | 2018-09-10 | 2022-12-07 | キヤノン株式会社 | Image processing device, image processing method, and program |
CN109685070B (en) * | 2019-01-11 | 2023-01-24 | 上海大学(浙江·嘉兴)新兴产业研究院 | Image preprocessing method |
CN109871938B (en) * | 2019-01-21 | 2023-04-25 | 重庆大学 | Component code spraying detection method based on convolutional neural network |
JP7406884B2 (en) * | 2019-06-27 | 2023-12-28 | キヤノン株式会社 | Information processing device, program and control method |
JPWO2022070999A1 (en) * | 2020-09-30 | 2022-04-07 | ||
US11531454B2 (en) * | 2020-12-10 | 2022-12-20 | Microsoft Technology Licensing, Llc | Selecting content in ink documents using a hierarchical data structure |
US11550934B2 (en) * | 2021-03-16 | 2023-01-10 | Check Point Software Technologies, Ltd. | Systems and methods for the efficient detection of improperly redacted electronic documents |
US11409981B1 (en) * | 2021-03-31 | 2022-08-09 | Intuit, Inc. | Document classification using signal processing |
CN115082598B (en) * | 2022-08-24 | 2023-07-11 | 北京百度网讯科技有限公司 | Text image generation, training, text image processing method and electronic equipment |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1980002761A1 (en) * | 1979-06-01 | 1980-12-11 | Dest Data Corp | Apparatus and method for separation of optical character recognition data |
US5101448A (en) * | 1988-08-24 | 1992-03-31 | Hitachi, Ltd. | Method and apparatus for processing a document by utilizing an image |
WO1992006448A1 (en) * | 1990-09-27 | 1992-04-16 | Cgk Computer Gesellschaft Konstanz Mbh | Process for extracting individual characters from raster images of a read-in handwritten or typed series of characters in free distribution |
Family Cites Families (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH07107694B2 (en) * | 1984-08-31 | 1995-11-15 | 株式会社日立製作所 | Document processor |
JPH0731714B2 (en) * | 1986-05-29 | 1995-04-10 | インタ−ナショナル ビジネス マシ−ンズ コ−ポレ−ション | Character component cutting method |
JPH01183784A (en) * | 1988-01-19 | 1989-07-21 | Toshiba Corp | Document picture processor |
US5129012A (en) * | 1989-03-25 | 1992-07-07 | Sony Corporation | Detecting line segments and predetermined patterns in an optically scanned document |
JP2812982B2 (en) * | 1989-04-05 | 1998-10-22 | 株式会社リコー | Table recognition method |
JPH0816918B2 (en) * | 1989-04-18 | 1996-02-21 | シャープ株式会社 | Row extraction method |
JP2644041B2 (en) * | 1989-05-08 | 1997-08-25 | キヤノン株式会社 | Character recognition device |
JP2940936B2 (en) * | 1989-06-06 | 1999-08-25 | 株式会社リコー | Tablespace identification method |
US5272764A (en) * | 1989-12-08 | 1993-12-21 | Xerox Corporation | Detection of highlighted regions |
JPH03290774A (en) * | 1990-04-06 | 1991-12-20 | Fuji Facom Corp | Sentence area extracting device for document picture |
JPH0490083A (en) * | 1990-08-03 | 1992-03-24 | Canon Inc | Character recognizing device |
US5101439A (en) * | 1990-08-31 | 1992-03-31 | At&T Bell Laboratories | Segmentation process for machine reading of handwritten information |
KR930002349B1 (en) * | 1990-12-29 | 1993-03-29 | 주식회사 금성사 | Character array divide method for press image |
JPH04248687A (en) * | 1991-01-23 | 1992-09-04 | Internatl Business Mach Corp <Ibm> | Layout analyzing method and system of document picture |
US5307422A (en) * | 1991-06-25 | 1994-04-26 | Industrial Technology Research Institute | Method and system for identifying lines of text in a document |
US5351314A (en) * | 1991-10-04 | 1994-09-27 | Canon Information Systems, Inc. | Method and apparatus for image enhancement using intensity dependent spread filtering |
US5253304A (en) * | 1991-11-27 | 1993-10-12 | At&T Bell Laboratories | Method and apparatus for image segmentation |
US5335290A (en) * | 1992-04-06 | 1994-08-02 | Ricoh Corporation | Segmentation of text, picture and lines of a document image |
Date | Country | Application | Publication | Status |
---|---|---|---|---|
1992-04-24 | US | US07/873,012 | US5680479A | Expired - Lifetime |
1993-04-23 | EP | EP93303194A | EP0567344B1 | Expired - Lifetime |
1993-04-23 | DE | DE69332459T | DE69332459T2 | Expired - Lifetime |
1993-04-26 | JP | JP12188393A | JP3359095B2 | Expired - Lifetime |
1994-06-27 | US | US08/265,833 | US5680478A | Expired - Lifetime |
1997-07-18 | US | US08/896,859 | US6081616A | Expired - Fee Related |
1997-07-18 | US | US08/896,547 | US6115497A | Expired - Lifetime |
Non-Patent Citations (6)
Title |
---|
"LINE SEGMENTATION METHOD FOR DOCUMENTS IN EUROPEAN LANGUAGES", IBM TECHNICAL DISCLOSURE BULLETIN., vol. 33, no. 1B, June 1990 (1990-06-01), NEW YORK US, pages 207 - 210 * |
FLETCHER L.A. AND KASTURI R.: "A Robust Algorithm for Text String Separation from Mixed Text/Graphics Images", IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 10, no. 6, November 1988 (1988-11-01), NEW YORK US, pages 910 - 918, XP000112065 * |
MASUDA I. ET AL.: "Approach to Smart Document Reader System", IEEE PROCEEDINGS OF THE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CAT. NO. CH2145-1/85, 23 June 1985 (1985-06-23), SAN FRANCISCO, CALIFORNIA, USA, pages 550 - 557 * |
MIZUNO M. ET AL.: "Document Recognition System with Layout Structure Generator", NEC RESEARCH AND DEVELOPMENT, vol. 32, no. 3, July 1991 (1991-07-01), TOKYO JP, pages 430 - 437, XP000265886 * |
PIZANO A. ET AL.: "A Business Form Recognition system", COMPSAC91 PROCEEDINGS, THE FIFTEENTH ANNUAL INTERNATIONAL COMPUTER SOFTWARE & APPLICATIONS CONFERENCE, 13 September 1991 (1991-09-13), KOGAKUIN UNIVERSITY, TOKYO, JAPAN, pages 626 - 632, XP000260573 * |
YAMADA M., HASUIKE K.: "Document Image Processing Based on Enhanced Border Following Algorithm", IEEE PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, CAT. NO. CH2898-5/90, vol. 2, 21 June 1990 (1990-06-21), BALLY'S PARK PLACE HOTEL, ATLANTIC CITY, NEW JERSEY, USA, pages 231 - 236, XP000166494 * |
Also Published As
Publication number | Publication date |
---|---|
JP3359095B2 (en) | 2002-12-24 |
US6081616A (en) | 2000-06-27 |
US6115497A (en) | 2000-09-05 |
JPH0668301A (en) | 1994-03-11 |
DE69332459T2 (en) | 2003-07-10 |
US5680478A (en) | 1997-10-21 |
US5680479A (en) | 1997-10-21 |
DE69332459D1 (en) | 2002-12-12 |
EP0567344A2 (en) | 1993-10-27 |
EP0567344B1 (en) | 2002-11-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0567344A3 (en) | Method and apparatus for character recognition | |
HK1011437A1 (en) | Character recognition method and apparatus | |
KR960015761B1 (en) | Character generating method and apparatus | |
EP0584783A3 (en) | Method and apparatus for improved processing | |
EP0542566A3 (en) | Character recognition method and apparatus thereof | |
EP0488733A3 (en) | Method and apparatus for speech recognition | |
EP0588074A3 (en) | Method and apparatus for character recognition with supervised training | |
EP0551739A3 (en) | Method and apparatus for connected and degraded text recognition | |
HK1011429A1 (en) | Character input method and apparatus | |
EP0519714A3 (en) | Apparatus and method for recognizing characters | |
GB9301635D0 (en) | Method and apparatus | |
EP0690405A3 (en) | Handwritten character entry method and apparatus | |
EP0576020A3 (en) | Character recognizing method and apparatus. | |
EP0575135A3 (en) | Information processing method and apparatus | |
GB2270604B (en) | Scanning method and apparatus | |
GB9226594D0 (en) | Autolevelling method and apparatus | |
GB9322871D0 (en) | Method and apparatus | |
GB9325993D0 (en) | Method and apparatus for connection | |
GB2270771B (en) | Weigh-filling method and apparatus | |
EP0586217A3 (en) | Method and apparatus for recognition template enhancement | |
EP0595243A3 (en) | Punching method and punching apparatus | |
EP0585098A3 (en) | Sign recognition apparatus and method and sign translation system using same. | |
EP0457547A3 (en) | Information recognition apparatus and method | |
EP0583477A4 (en) | Printing method and apparatus | |
EP0541365A3 (en) | Character recognition method and apparatus |
Legal Events
Code | Title | Description |
---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase | Free format text: ORIGINAL CODE: 0009012 |
AK | Designated contracting states | Kind code of ref document: A2 Designated state(s): DE FR GB IT |
PUAL | Search report despatched | Free format text: ORIGINAL CODE: 0009013 |
AK | Designated contracting states | Kind code of ref document: A3 Designated state(s): DE FR GB IT |
17P | Request for examination filed | Effective date: 19950127 |
RAP1 | Party data changed (applicant data changed or rights of an application transferred) | Owner name: CANON KABUSHIKI KAISHA |
17Q | First examination report despatched | Effective date: 19990520 |
GRAG | Despatch of communication of intention to grant | Free format text: ORIGINAL CODE: EPIDOS AGRA |
GRAG | Despatch of communication of intention to grant | Free format text: ORIGINAL CODE: EPIDOS AGRA |
GRAG | Despatch of communication of intention to grant | Free format text: ORIGINAL CODE: EPIDOS AGRA |
GRAH | Despatch of communication of intention to grant a patent | Free format text: ORIGINAL CODE: EPIDOS IGRA |
GRAH | Despatch of communication of intention to grant a patent | Free format text: ORIGINAL CODE: EPIDOS IGRA |
GRAA | (expected) grant | Free format text: ORIGINAL CODE: 0009210 |
AK | Designated contracting states | Kind code of ref document: B1 Designated state(s): DE FR GB IT |
REG | Reference to a national code | Ref country code: GB Ref legal event code: FG4D |
REF | Corresponds to: | Ref document number: 69332459 Country of ref document: DE Date of ref document: 20021212 |
ET | Fr: translation filed | |
PLBE | No opposition filed within time limit | Free format text: ORIGINAL CODE: 0009261 |
STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
26N | No opposition filed | Effective date: 20030807 |
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] | Ref country code: DE Payment date: 20120430 Year of fee payment: 20 |
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] | Ref country code: GB Payment date: 20120425 Year of fee payment: 20; Ref country code: FR Payment date: 20120504 Year of fee payment: 20 |
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] | Ref country code: IT Payment date: 20120417 Year of fee payment: 20 |
REG | Reference to a national code | Ref country code: DE Ref legal event code: R071 Ref document number: 69332459 Country of ref document: DE |
REG | Reference to a national code | Ref country code: DE Ref legal event code: R071 Ref document number: 69332459 Country of ref document: DE |
REG | Reference to a national code | Ref country code: GB Ref legal event code: PE20 Expiry date: 20130422 |
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] | Ref country code: GB Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION Effective date: 20130422; Ref country code: DE Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION Effective date: 20130424 |