US20100208943A1 - Perspective improvement for image and video applications - Google Patents

Perspective improvement for image and video applications Download PDF

Info

Publication number
US20100208943A1
US20100208943A1 US12/772,605 US77260510A US2010208943A1 US 20100208943 A1 US20100208943 A1 US 20100208943A1 US 77260510 A US77260510 A US 77260510A US 2010208943 A1 US2010208943 A1 US 2010208943A1
Authority
US
United States
Prior art keywords
image
distance
camera
captured image
accordance
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US12/772,605
Inventor
Boaz J. Super
Bruce A. Augustine
James E. Crenshaw
Evan A. Groat
Michael S. Thiems
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Google Technology Holdings LLC
Original Assignee
Motorola Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Motorola Inc filed Critical Motorola Inc
Priority to US12/772,605 priority Critical patent/US20100208943A1/en
Assigned to MOTOROLA, INC. reassignment MOTOROLA, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: GROAT, EVAN A., THIEMS, MICHAEL S., AUGUSTINE, BRUCE A., CRENSHAW, JAMES E., SUPER, BOAZ J.
Publication of US20100208943A1 publication Critical patent/US20100208943A1/en
Assigned to Motorola Mobility, Inc reassignment Motorola Mobility, Inc ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MOTOROLA, INC
Assigned to MOTOROLA MOBILITY LLC reassignment MOTOROLA MOBILITY LLC CHANGE OF NAME (SEE DOCUMENT FOR DETAILS). Assignors: Motorola Mobility, Inc
Assigned to Google Technology Holdings LLC reassignment Google Technology Holdings LLC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MOTOROLA MOBILITY LLC
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/161Detection; Localisation; Normalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/24Aligning, centring, orientation detection or correction of the image
    • G06V10/247Aligning, centring, orientation detection or correction of the image by affine transforms, e.g. correction due to perspective effects; Quadrilaterals, e.g. trapezoids

Definitions

  • FIG. 1 is a diagram illustrative of the distance-based perspective effect.
  • FIG. 2 is a flow chart of a method consistent with certain embodiments of the present invention.
  • FIG. 3 is a further diagram illustrative of distance-based distortion.
  • FIG. 4 and FIG. 5 are flow charts of further methods consistent with certain embodiments of the present invention.
  • FIG. 6 is a diagrammatic representation of a system consistent with certain embodiments of the invention.
  • FIG. 1 shows a simplified camera 100 .
  • Light from an object 102 at a first distance from the camera 100 passes through the lens 104 and falls on sensor array 106 to form an image of height 108 .
  • Light from a second object 110 passes through the lens 104 and falls on the sensor array 106 to form an image of height 112 .
  • objects 102 and 110 are of the same height, the image of the more distant object 102 is much smaller than the image of the closer object 110 .
  • objects (or portions of object) closer to the lens appear disproportionately larger in the image relative to more distant objects.
  • the ratio of image heights depends on the ratio of distances to the lens.
  • the amount of distortion depends on the depth of the object relative to the distance of the object to the camera lens. For many lenses in wide use, the distortion of the image of a face is therefore small when the face is several meters from the camera.
  • the user is able to monitor the image of his or her face that is being transmitted. If the image does not fill the image frame the user is likely to move closer to the camera. This is especially true if the camera uses a wide angle lens. The result is that the image of the user's face, while filling more of the image frame, becomes distorted due to the distance-based perspective effect. The resulting image is unflattering, showing an enlarged nose and pinched cheeks.
  • the object is a face.
  • the invention has application to the reduction of distance-based distortion in images of other objects and images of multiple objects.
  • FIG. 2 is a flow chart of a method consistent with certain embodiments of the present invention.
  • an image of a user is captured by a camera at block 204 .
  • an amount of camera zoom required is determined. The amount of camera zoom is determined such that when the user is sufficiently far from the camera that distance-based perspective distortion is acceptably small, the image of the user's face fills a significant portion of the image frame. If the distortion is not acceptably small, as depicted by the negative branch from decision block 208 , the zoom is increased at block 210 and a new image is displayed at block 212 .
  • the zoom may be a digital zoom, an optical zoom or a combination thereof.
  • the increased amount of zoom causes the image of the user's face to appear too large for the image frame, thus prompting the user to move away from the camera.
  • the amount of zoom may be determined by standard distance measuring systems. These include, for example, active systems that use infrared or ultrasonic waves to determine the distance to the user, or passive systems based on the size of the image of user's face (taking into account the current zoom), stereo images, or some other feature. If the distortion is acceptably small (for example, if the user is at a sufficient distance from the camera), as depicted by the positive branch from decision block 208 , the image size is checked at block 214 . If the image size is not too large or too small, no further adjustment of the zoom is required. Flow continues to block 212 and the image is displayed. If the image size is too large or too small, as depicted by the negative branch from decision block 214 , the zoom is adjusted at block 216 to provide the correct size and a new image is displayed at block 212 .
  • the image size is adjusted so the user is not prompted to move closer to the camera. If the user is too close to the camera, so that distance based distortion is unacceptably larger, the zoom is increased so that image of the user's face appears too large for the image frame, prompting the user to move farther from the camera, thereby reducing the distance-based distortion.
  • the region of the image corresponding to the user's face may be identified so that the camera can zoom in on this region.
  • Methods of identifying a user's face are well known in the art.
  • the region may be identified and tracked as the user moves, using techniques well known in the art.
  • FIG. 3 is a diagram illustrating distance-based distortion.
  • the object is a stack of elliptical discs, shown in frontal view in view 300 with no distance-based distortion.
  • the large disc 302 is farthest from the lens 104 of the camera 100 and small disc 304 is closest to the lens 104 .
  • View 306 is a view of the image of the stack formed on the image array 106 .
  • the image 308 of the smallest (closest) disc is increased in size relative to the image 310 of the largest (farthest) disc. A corresponding effect occurs with the image of a face close to the camera.
  • the image of the disc can be modified in size to compensate for the distance-based distortion. This same approach may be used with an image such as a face.
  • is a parameter dependent upon the optical characteristics (such as the magnification) of the camera.
  • the parameter ⁇ may be determined by calibration.
  • the image at position (x′,y′) on the sensor array corresponds to an object point at a distance
  • d i ⁇ ( x ′ , y ′ ) d i ⁇ ( ⁇ ⁇ ⁇ x d ⁇ ( x , y ) , ⁇ ⁇ ⁇ y d ⁇ ( x , y ) )
  • a pixel value p corrected in a corrected image may be calculated from a pixel value p image in the captured image as
  • is a scale factor.
  • the scale factor ⁇ is a constant. The constant may be proportional to the mean value of d(x,y) or the root mean square value of d(x,y), for example.
  • the scale factor ⁇ ⁇ [d 0 +d i (x′,y′)] where d 0 is a constant distance and ⁇ is a scale factor.
  • a reprojected image of the object is generated that is dependent upon the estimated distances and upon a virtual viewpoint that is more distant than the camera from the object.
  • FIG. 4 A flow chart of an exemplary method for estimating the distance d(x,y) is shown in FIG. 4 .
  • an image is captured at block 404 .
  • a face template or other model is matched to the image at block 406 .
  • the face template may comprise, for example, simple oval shapes for the periphery of the face and the eyes, and a simple nose template.
  • the matching may be performed by use of alignment matching and verification, for example.
  • the matching process results in a set of model parameters.
  • a decision is made as to whether the image includes a face.
  • the distance to one or more matched features of the face is estimated at block 410 .
  • the distance may be estimated from the size of the matched templates, the distance between the eyes, or other parameters of the templates. Alternatively, the distance could be determined by an active technique or by stereopsis.
  • the distances of various parts of the face are estimated from the template parameters at block 412 .
  • a table may be accessed that contains typical distances for all regions of a face obtained from anthropological statistics. For example, a point 50% of the distance between the nose and the side of the face and 20% of the distance from the center of the eye to the chin might be 1 inch farther away than the tip of the nose.
  • the distances may be determined for each position in the distorted image or for each position in the undistorted face. Other distance estimating techniques are well known to those of ordinary skill in the art.
  • a corrected image is formed using the captured image and the distance estimates for each point in the image.
  • the corrected image is displayed at block 416 and the process terminates at block 418 .
  • the process may be repeated for each consecutive image, or it may be performed when a change in face position is detected, or it may be repeated at periodic intervals.
  • FIG. 5 is a flow chart of a method consistent with certain embodiments of the present invention.
  • This embodiment uses pre-stored templates.
  • the templates are several views of the user photographed from different angles (such as frontal and profile views).
  • an image is captured by the camera at block 504 .
  • a number of control points or features are identified in the captured image. For example, the position of a corner of an eye may be identified using a method known in the art.
  • the captured image is then warped or morphed to match each template at block 508 . This process matches the control points in the captured image to corresponding control points in the templates.
  • the best-matching template is selected. This image is displayed, transmitted and/or stored at block 512 .
  • the process terminates at block 514 .
  • the goodness of a match between the warped image and the template may be measured, for example, as a combination of an amount of deformation and an amount of residual color differences between the warped image and the template, where less deformation, less residual difference, or both correspond to a better match.
  • the templates are warped to match the image, and the best-matching template is selected. This template may be used as the image to be displayed/stored, or the captured image can be warped to match the selected template. Alternative embodiments will be apparent to those of ordinary skill in the art.
  • the position of the image of a person's head is detected using matching to templates or other means known in the art. Then a stored, template-independent, warping transformation is applied to the image of the head.
  • the transformation may be parameterized by the size and orientation of the detected image of the head.
  • a geometric model of head such as a spherical, cylindrical or ellipsoidal model, may be used. The size of the image of the head is used to estimate the distance of the head from the camera and then the geometric model is used to estimate the variation of distance across the image of the head.
  • video cameras examples include a video camera of a videoconferencing system, a videophone, a video email product, a door camera and a security camera.
  • the approach may also be used in still cameras, such as cameras in cellular telephone and other portable electronic devices.
  • FIG. 6 is a diagrammatic representation of a system consistent with certain embodiments of the invention.
  • the system includes a camera 100 , which may be a still camera or a video camera.
  • a processor 602 is coupled to the camera 100 .
  • the processor 602 may be incorporated in the body of the camera or may be external to the camera and coupled to it by a wired or wireless link.
  • the processor also communicates with a display 604 and a communication circuit 606 .
  • the processor operates to control the digital or optical zoom of camera 100 . Images captured from the camera may be displayed on display unit 604 and/or transmitted via communication circuit 606 to a remote device.
  • the camera 100 is a video camera that captures images of the face of a user in a video conferencing or video telephony system.
  • the display unit 604 displays images of the user and also images received from a remote camera.
  • the processor 602 controls the zoom of camera 100 . If the user is so close to the camera that a large amount of distance-based distortion is present in the image, the zoom is controlled such that the image of the user on the display is too large for the display frame, thereby encouraging the user to move farther from the display.
  • the processor 602 may process images captured by the camera to reduce distance-based distortion using the methods described above. This may be done by estimating distances from the camera to the user's face or by matching templates to the image of the user's face. On one embodiment, the distance is estimated using a distance measuring system 608 , such an infrared or ultrasonic system. In another embodiment, the distance is estimated using a stereo vision system.
  • a distance measuring system 608 such an infrared or ultrasonic system.
  • the distance is estimated using a stereo vision system.
  • the system shown in FIG. 6 may be incorporated, for example, in a mobile device, such as a radio telephone, or in a fixed device, such as a personal computer.
  • the present invention may be implemented using a programmed processor executing programming instructions that are broadly described above in flow chart form that can be stored on any suitable electronic storage medium.
  • programming instructions that are broadly described above in flow chart form that can be stored on any suitable electronic storage medium.
  • the processes described above can be implemented in any number of variations and in many suitable programming languages without departing from the present invention.
  • the order of certain operations carried out can often be varied, additional operations can be added or operations can be deleted without departing from the invention.
  • Error trapping can be added and/or enhanced and variations can be made in user interface and information presentation without departing from the present invention. Such variations are contemplated and considered equivalent.
  • the invention may be implemented in hardware.
  • the programmed processor or hardware may be integral to the camera or may be a separate processor operating on images received from the camera.

Abstract

A system and method for reducing distance-based distortion in a camera image of an object, where the distanced-based distortion is due to differences in distance from the camera to different parts of the object. In one approach, the distortion is reduced by estimating distances to different parts of the object and then generating a reprojected image of the object dependent upon the estimated distances and upon a virtual viewpoint that is more distant than the camera from the object. In a further approach, the image is warped such that points in the image match corresponding points in one or more stored templates. In a still further approach, if excessive distortion is present in the image, the camera zoom is increased and a magnified image is displayed, prompting a person to move farther from the camera thereby reducing the distortion.

Description

    RELATED APPLICATIONS
  • This patent application is related to U.S. application Ser. No. 11/380,196, entitled Perspective Improvement for Image and Video Applications, priority to which is claimed.
  • BACKGROUND
  • Many camera based products, such as videophones, webcams and mobile devices, require wide angle lenses. In order to avoid having their faces look too small, users tend to move their faces close to the camera. As a result, perspective distance effects become large and their faces appear distorted in the image, with large noses and pinched cheeks. Users find this unflattering. This distance-based perspective effect is an undesirable feature of videophone and video email products.
  • There are other perspective effects such as wide-angle (fish-eye) distortion and oblique distortion (which causes keystoning). Compensation techniques have been proposed for mitigating these effects, but these techniques are not applicable to the type of distortion described above.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The novel features believed characteristic of the invention are set forth in the appended claims. The invention itself, however, as well as the preferred mode of use, and further objects and advantages thereof, will best be understood by reference to the following detailed description of an illustrative embodiment when read in conjunction with the accompanying drawing(s), wherein:
  • FIG. 1 is a diagram illustrative of the distance-based perspective effect.
  • FIG. 2 is a flow chart of a method consistent with certain embodiments of the present invention.
  • FIG. 3 is a further diagram illustrative of distance-based distortion.
  • FIG. 4 and FIG. 5 are flow charts of further methods consistent with certain embodiments of the present invention.
  • FIG. 6 is a diagrammatic representation of a system consistent with certain embodiments of the invention.
  • DETAILED DESCRIPTION
  • While this invention is susceptible of embodiment in many different forms, there is shown in the drawings and will herein be described in detail one or more specific embodiments, with the understanding that the present disclosure is to be considered as exemplary of the principles of the invention and not intended to limit the invention to the specific embodiments shown and described. In the description below, like reference numerals are used to describe the same, similar or corresponding parts in the several views of the drawings.
  • The distance-based perspective effect is illustrated in the diagram in FIG. 1. FIG. 1 shows a simplified camera 100. Light from an object 102 at a first distance from the camera 100 passes through the lens 104 and falls on sensor array 106 to form an image of height 108. Light from a second object 110 passes through the lens 104 and falls on the sensor array 106 to form an image of height 112. Although objects 102 and 110 are of the same height, the image of the more distant object 102 is much smaller than the image of the closer object 110. Thus, objects (or portions of object) closer to the lens appear disproportionately larger in the image relative to more distant objects. The ratio of image heights depends on the ratio of distances to the lens. Thus for three-dimensional objects, such as faces, the amount of distortion depends on the depth of the object relative to the distance of the object to the camera lens. For many lenses in wide use, the distortion of the image of a face is therefore small when the face is several meters from the camera.
  • In some video applications, such as video-telephony, the user is able to monitor the image of his or her face that is being transmitted. If the image does not fill the image frame the user is likely to move closer to the camera. This is especially true if the camera uses a wide angle lens. The result is that the image of the user's face, while filling more of the image frame, becomes distorted due to the distance-based perspective effect. The resulting image is unflattering, showing an enlarged nose and pinched cheeks.
  • In the description below, an embodiment is described in which the object is a face. However, the invention has application to the reduction of distance-based distortion in images of other objects and images of multiple objects.
  • FIG. 2 is a flow chart of a method consistent with certain embodiments of the present invention. Referring to FIG. 2, following start block 202, an image of a user is captured by a camera at block 204. At block 206, an amount of camera zoom required is determined. The amount of camera zoom is determined such that when the user is sufficiently far from the camera that distance-based perspective distortion is acceptably small, the image of the user's face fills a significant portion of the image frame. If the distortion is not acceptably small, as depicted by the negative branch from decision block 208, the zoom is increased at block 210 and a new image is displayed at block 212. The zoom may be a digital zoom, an optical zoom or a combination thereof. The increased amount of zoom causes the image of the user's face to appear too large for the image frame, thus prompting the user to move away from the camera. The amount of zoom may be determined by standard distance measuring systems. These include, for example, active systems that use infrared or ultrasonic waves to determine the distance to the user, or passive systems based on the size of the image of user's face (taking into account the current zoom), stereo images, or some other feature. If the distortion is acceptably small (for example, if the user is at a sufficient distance from the camera), as depicted by the positive branch from decision block 208, the image size is checked at block 214. If the image size is not too large or too small, no further adjustment of the zoom is required. Flow continues to block 212 and the image is displayed. If the image size is too large or too small, as depicted by the negative branch from decision block 214, the zoom is adjusted at block 216 to provide the correct size and a new image is displayed at block 212.
  • In this way, the image size is adjusted so the user is not prompted to move closer to the camera. If the user is too close to the camera, so that distance based distortion is unacceptably larger, the zoom is increased so that image of the user's face appears too large for the image frame, prompting the user to move farther from the camera, thereby reducing the distance-based distortion.
  • The region of the image corresponding to the user's face may be identified so that the camera can zoom in on this region. Methods of identifying a user's face are well known in the art. The region may be identified and tracked as the user moves, using techniques well known in the art.
  • FIG. 3 is a diagram illustrating distance-based distortion. The object is a stack of elliptical discs, shown in frontal view in view 300 with no distance-based distortion. The large disc 302 is farthest from the lens 104 of the camera 100 and small disc 304 is closest to the lens 104. View 306 is a view of the image of the stack formed on the image array 106. The image 308 of the smallest (closest) disc is increased in size relative to the image 310 of the largest (farthest) disc. A corresponding effect occurs with the image of a face close to the camera.
  • If the distance from the lens 104 to a disc is known, the image of the disc can be modified in size to compensate for the distance-based distortion. This same approach may be used with an image such as a face.
  • For a given object surface, the distance d from the camera lens along the optical axis is a function of the x and y coordinates in a plane perpendicular to the optical axis, so d=d(x,y). Light at position (x, y, d) meets the image sensor array at a position
  • ( x , y ) = ( α x d ( x , y ) , α y d ( x , y ) ) ,
  • where α is a parameter dependent upon the optical characteristics (such as the magnification) of the camera. The parameter α may be determined by calibration. The image at position (x′,y′) on the sensor array corresponds to an object point at a distance
  • d i ( x , y ) = d i ( α x d ( x , y ) , α y d ( x , y ) )
  • from the camera. If the distance di(x′, y′) is known, a pixel value pcorrected in a corrected image may be calculated from a pixel value pimage in the captured image as
  • p corrected ( x d i ( x , y ) β , y d i ( x , y ) β ) = p image ( x , y ) ,
  • where β is a scale factor. In one embodiment, the scale factor β is a constant. The constant may be proportional to the mean value of d(x,y) or the root mean square value of d(x,y), for example. In another embodiment, the scale factor β=γ[d0+di(x′,y′)] where d0 is a constant distance and γ is a scale factor. In this embodiment, a reprojected or virtual image in obtained for which the face appears a distance d0 farther from the camera. In other words, a reprojected image of the object is generated that is dependent upon the estimated distances and upon a virtual viewpoint that is more distant than the camera from the object. A flow chart of an exemplary method for estimating the distance d(x,y) is shown in FIG. 4. Following start block 402, an image is captured at block 404. A face template or other model is matched to the image at block 406. A variety of techniques for extracting features and for matching templates or other models are known to those of ordinary skill in the art. The face template may comprise, for example, simple oval shapes for the periphery of the face and the eyes, and a simple nose template. The matching may be performed by use of alignment matching and verification, for example. The matching process results in a set of model parameters. At decision block 408, a decision is made as to whether the image includes a face. If a face is detected, as depicted by the positive path from decision block 408, the distance to one or more matched features of the face is estimated at block 410. The distance may be estimated from the size of the matched templates, the distance between the eyes, or other parameters of the templates. Alternatively, the distance could be determined by an active technique or by stereopsis. In one embodiment, the distances of various parts of the face are estimated from the template parameters at block 412. For example, a table may be accessed that contains typical distances for all regions of a face obtained from anthropological statistics. For example, a point 50% of the distance between the nose and the side of the face and 20% of the distance from the center of the eye to the chin might be 1 inch farther away than the tip of the nose. The distances may be determined for each position in the distorted image or for each position in the undistorted face. Other distance estimating techniques are well known to those of ordinary skill in the art. At block 414, a corrected image is formed using the captured image and the distance estimates for each point in the image. The corrected image is displayed at block 416 and the process terminates at block 418. The process may be repeated for each consecutive image, or it may be performed when a change in face position is detected, or it may be repeated at periodic intervals.
  • FIG. 5 is a flow chart of a method consistent with certain embodiments of the present invention. This embodiment uses pre-stored templates. The templates are several views of the user photographed from different angles (such as frontal and profile views). Following start block 502 in FIG. 5, an image is captured by the camera at block 504. At block 506 a number of control points or features are identified in the captured image. For example, the position of a corner of an eye may be identified using a method known in the art. The captured image is then warped or morphed to match each template at block 508. This process matches the control points in the captured image to corresponding control points in the templates. At block 510, the best-matching template is selected. This image is displayed, transmitted and/or stored at block 512. The process terminates at block 514. The goodness of a match between the warped image and the template may be measured, for example, as a combination of an amount of deformation and an amount of residual color differences between the warped image and the template, where less deformation, less residual difference, or both correspond to a better match. In an alternative embodiment, the templates are warped to match the image, and the best-matching template is selected. This template may be used as the image to be displayed/stored, or the captured image can be warped to match the selected template. Alternative embodiments will be apparent to those of ordinary skill in the art.
  • In an alternative embodiment, the position of the image of a person's head is detected using matching to templates or other means known in the art. Then a stored, template-independent, warping transformation is applied to the image of the head. The transformation may be parameterized by the size and orientation of the detected image of the head. For example, a geometric model of head, such as a spherical, cylindrical or ellipsoidal model, may be used. The size of the image of the head is used to estimate the distance of the head from the camera and then the geometric model is used to estimate the variation of distance across the image of the head.
  • The methods described above with reference to FIGS. 2, 4 and 5 may be used in combination.
  • Examples of video cameras include a video camera of a videoconferencing system, a videophone, a video email product, a door camera and a security camera. The approach may also be used in still cameras, such as cameras in cellular telephone and other portable electronic devices.
  • FIG. 6 is a diagrammatic representation of a system consistent with certain embodiments of the invention. Referring to FIG. 6, the system includes a camera 100, which may be a still camera or a video camera. A processor 602 is coupled to the camera 100. The processor 602 may be incorporated in the body of the camera or may be external to the camera and coupled to it by a wired or wireless link. In this embodiment, the processor also communicates with a display 604 and a communication circuit 606. In one embodiment, the processor operates to control the digital or optical zoom of camera 100. Images captured from the camera may be displayed on display unit 604 and/or transmitted via communication circuit 606 to a remote device.
  • In one embodiment of the invention, the camera 100 is a video camera that captures images of the face of a user in a video conferencing or video telephony system. The display unit 604 displays images of the user and also images received from a remote camera. The processor 602 controls the zoom of camera 100. If the user is so close to the camera that a large amount of distance-based distortion is present in the image, the zoom is controlled such that the image of the user on the display is too large for the display frame, thereby encouraging the user to move farther from the display.
  • Additionally, the processor 602 may process images captured by the camera to reduce distance-based distortion using the methods described above. This may be done by estimating distances from the camera to the user's face or by matching templates to the image of the user's face. On one embodiment, the distance is estimated using a distance measuring system 608, such an infrared or ultrasonic system. In another embodiment, the distance is estimated using a stereo vision system.
  • The system shown in FIG. 6 may be incorporated, for example, in a mobile device, such as a radio telephone, or in a fixed device, such as a personal computer.
  • The present invention, as described in embodiments herein, may be implemented using a programmed processor executing programming instructions that are broadly described above in flow chart form that can be stored on any suitable electronic storage medium. However, those skilled in the art will appreciate that the processes described above can be implemented in any number of variations and in many suitable programming languages without departing from the present invention. For example, the order of certain operations carried out can often be varied, additional operations can be added or operations can be deleted without departing from the invention. Error trapping can be added and/or enhanced and variations can be made in user interface and information presentation without departing from the present invention. Such variations are contemplated and considered equivalent.
  • Alternatively, the invention may be implemented in hardware.
  • The programmed processor or hardware may be integral to the camera or may be a separate processor operating on images received from the camera.
  • While the invention has been described in conjunction with specific embodiments, it is evident that many alternatives, modifications, permutations and variations will become apparent to those of ordinary skill in the art in light of the foregoing description. Accordingly, it is intended that the present invention embrace all such alternatives, modifications and variations as fall within the scope of the appended claims.

Claims (17)

1. A method for reducing distance-based distortion in an image of an object, the image being captured by a camera, where the distance-based distortion is due to differences in distance from the camera to different parts of the object, the method comprising:
for each template of at least one template of the object, a camera determining a set of warping parameters that relate points in the image to corresponding points in the template of the object;
selecting by the camera from the set of warping parameters for each of the at least one template, the set of warping parameters that results in the best match between the warped image and the corresponding template; and
warping, by the camera, the image using the selected set of warping parameters.
2. A method in accordance with claim 1, wherein the at least one template of the object comprise a plurality of views of the object.
3. A method in accordance with claim 1, wherein the object is a face and wherein the image comprises an image from a sequence of video images.
4. A system for reducing distance-based distortion in an image of an object, the system comprising:
a camera operable to capture an image of the object;
an estimation element operable to estimate distance-based distortion in the captured image;
a processing element operable to process the captured image to reduce distance-based distortion in the captured image.
5. A system in accordance with claim 4, wherein the estimation element is operable to estimate the distance to at least one region of the object.
6. A system in accordance with claim 4, wherein the estimation element is operable to estimate the size, in the image, of a specified feature of the object.
7. A system in accordance with claim 4, wherein the estimation element comprises an active distance measuring means.
8. A system in accordance with claim 4, wherein the estimation element is operable to estimate the amount of warping needed to match points in the captured image to corresponding points in a previously stored image.
9. A system in accordance with claim 4, wherein the processing element is operable to warp the captured image such that points in the captured image match points in a previously stored image.
10. A system in accordance with claim 4, wherein the processing element is operable to reproject the captured image to simulate a more distant camera view point.
11. A system for reducing distance-based distortion in an image of an object, the system comprising:
a camera operable to capture an image of the object;
a means for estimating distance-based distortion in the captured image;
a means for processing the captured image to reduce distance-based distortion in the captured image.
12. A system in accordance with claim 11, wherein the means for estimating distance-based distortion in the captured image comprises a means for estimating the distance to at least one region of the object.
13. A system in accordance with claim 11, wherein the means for estimating the distance to at least one region of the object comprises a means for estimating the size, in the image, of a specified feature of the object.
14. A system in accordance with claim 11, wherein the means for estimating the distance to at least one region of the object comprises an active distance measuring means.
15. A system in accordance with claim 11, wherein the means for estimating distance-based distortion in the captured image comprises a means for estimating the amount of warping needed to match points in the captured image to corresponding points in a previously stored image.
16. A system in accordance with claim 11, wherein the means for processing the captured image to reduce distance-based distortion in the captured image comprises a means for warping the captured image such that points in the captured image match points in a previously stored image.
17. A system in accordance with claim 11, wherein the means for processing the captured image to reduce distance-based distortion in the captured image comprises a means for reprojecting the captured image to simulate a more distant camera view point.
US12/772,605 2006-04-25 2010-05-03 Perspective improvement for image and video applications Abandoned US20100208943A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US12/772,605 US20100208943A1 (en) 2006-04-25 2010-05-03 Perspective improvement for image and video applications

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/380,196 US7742624B2 (en) 2006-04-25 2006-04-25 Perspective improvement for image and video applications
US12/772,605 US20100208943A1 (en) 2006-04-25 2010-05-03 Perspective improvement for image and video applications

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US11/380,196 Division US7742624B2 (en) 2006-04-25 2006-04-25 Perspective improvement for image and video applications

Publications (1)

Publication Number Publication Date
US20100208943A1 true US20100208943A1 (en) 2010-08-19

Family

ID=38619531

Family Applications (3)

Application Number Title Priority Date Filing Date
US11/380,196 Expired - Fee Related US7742624B2 (en) 2006-04-25 2006-04-25 Perspective improvement for image and video applications
US12/772,605 Abandoned US20100208943A1 (en) 2006-04-25 2010-05-03 Perspective improvement for image and video applications
US13/041,486 Expired - Fee Related US8494224B2 (en) 2006-04-25 2011-03-07 Perspective improvement for image and video applications

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US11/380,196 Expired - Fee Related US7742624B2 (en) 2006-04-25 2006-04-25 Perspective improvement for image and video applications

Family Applications After (1)

Application Number Title Priority Date Filing Date
US13/041,486 Expired - Fee Related US8494224B2 (en) 2006-04-25 2011-03-07 Perspective improvement for image and video applications

Country Status (1)

Country Link
US (3) US7742624B2 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104811602A (en) * 2014-01-24 2015-07-29 维科技术有限公司 Self-shooting method and self-shooting device for mobile terminals
CN104994282A (en) * 2015-06-30 2015-10-21 广东欧珀移动通信有限公司 Large view angle camera control method and user terminal
CN104994280A (en) * 2015-06-30 2015-10-21 广东欧珀移动通信有限公司 Method for processing preview image and user terminal
CN105049713A (en) * 2015-06-30 2015-11-11 广东欧珀移动通信有限公司 Preview image processing method and user terminal
CN105049716A (en) * 2015-06-30 2015-11-11 广东欧珀移动通信有限公司 Preview image processing method and user terminal
CN105049714A (en) * 2015-06-30 2015-11-11 广东欧珀移动通信有限公司 Figure image preview method and device

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3007131A1 (en) * 2005-09-22 2016-04-13 3M Innovative Properties Company of 3M Center Artifact mitigation in three-dimensional imaging
US20080101713A1 (en) * 2006-10-27 2008-05-01 Edgar Albert D System and method of fisheye image planar projection
JP5164721B2 (en) * 2007-08-22 2013-03-21 キヤノン株式会社 Imaging apparatus and zoom control method
CN101409784A (en) * 2007-10-10 2009-04-15 联想(北京)有限公司 Camera device and information-prompting apparatus
US8185959B2 (en) 2008-02-26 2012-05-22 International Business Machines Corporation Digital rights management of captured content based on capture associated locations
US8095991B2 (en) * 2008-02-26 2012-01-10 International Business Machines Corporation Digital rights management of streaming captured content based on criteria regulating a sequence of elements
JP2009294728A (en) * 2008-06-02 2009-12-17 Sony Ericsson Mobilecommunications Japan Inc Display processor, display processing method, display processing program, and portable terminal device
US8805110B2 (en) * 2008-08-19 2014-08-12 Digimarc Corporation Methods and systems for content processing
KR20100081881A (en) * 2009-01-07 2010-07-15 삼성전자주식회사 Data matching device and method, and robot using these
JP2010257037A (en) * 2009-04-22 2010-11-11 Sony Corp Information processing apparatus and method, and program
US8599238B2 (en) * 2009-10-16 2013-12-03 Apple Inc. Facial pose improvement with perspective distortion correction
KR101679290B1 (en) * 2009-11-17 2016-11-24 삼성전자 주식회사 Image processing method and apparatus
US20120176495A1 (en) * 2011-01-11 2012-07-12 Honeywell International Inc. System to improve face image acquisition
US20120236105A1 (en) 2011-03-14 2012-09-20 Motorola Mobility, Inc. Method and apparatus for morphing a user during a video call
CN104363986B (en) 2014-10-31 2017-06-13 华为技术有限公司 A kind of image processing method and equipment
US9978265B2 (en) 2016-04-11 2018-05-22 Tti (Macao Commercial Offshore) Limited Modular garage door opener
CA2961090A1 (en) 2016-04-11 2017-10-11 Tti (Macao Commercial Offshore) Limited Modular garage door opener
KR102560780B1 (en) 2016-10-05 2023-07-28 삼성전자주식회사 Image processing system including plurality of image sensors and electronic device including thereof
US11922605B2 (en) 2018-11-23 2024-03-05 Hewlett-Packard Development Company, L.P. Selective distortion or deformation correction in images from a camera with a wide angle lens
JP7211266B2 (en) * 2019-05-27 2023-01-24 富士フイルムビジネスイノベーション株式会社 Information processing device and information processing program
CN111064895B (en) * 2019-12-31 2022-02-01 维沃移动通信有限公司 Virtual shooting method and electronic equipment
CN111311712B (en) * 2020-02-24 2023-06-16 北京百度网讯科技有限公司 Video frame processing method and device

Citations (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5193124A (en) * 1989-06-29 1993-03-09 The Research Foundation Of State University Of New York Computational methods and electronic camera apparatus for determining distance of objects, rapid autofocusing, and obtaining improved focus images
US5687249A (en) * 1993-09-06 1997-11-11 Nippon Telephone And Telegraph Method and apparatus for extracting features of moving objects
US6195119B1 (en) * 1994-12-28 2001-02-27 Olympus America, Inc. Digitally measuring scopes using a high resolution encoder
US6211903B1 (en) * 1997-01-14 2001-04-03 Cambridge Technology Development, Inc. Video telephone headset
US6262743B1 (en) * 1995-06-22 2001-07-17 Pierre Allio Autostereoscopic image acquisition method and system
US6400848B1 (en) * 1999-03-30 2002-06-04 Eastman Kodak Company Method for modifying the perspective of a digital image
US20020130961A1 (en) * 2001-03-15 2002-09-19 Lg Electronics Inc. Display device of focal angle and focal distance in iris recognition system
US20030117675A1 (en) * 2000-08-30 2003-06-26 Norimitu Shirato Curved image conversion method and record medium where this method for converting curved image is recorded
US6641053B1 (en) * 2002-10-16 2003-11-04 Xerox Corp. Foreground/background document processing with dataglyphs
US20040001146A1 (en) * 2002-06-28 2004-01-01 Zicheng Liu Real-time wide-angle image correction system and method for computer image viewing
US20040201768A1 (en) * 2002-01-17 2004-10-14 Eastman Kodak Company Electronic imaging system having a sensor for correcting perspective projection distortion
US20040201587A1 (en) * 2002-03-04 2004-10-14 Kazufumi Mizusawa Image combination/conversion apparatus
US20040207743A1 (en) * 2003-04-15 2004-10-21 Nikon Corporation Digital camera system
US20040233543A1 (en) * 2003-05-23 2004-11-25 Chang Matthew Tsu-Yang Optical apparatus for a line scanner system with reduced optical total track
US20050163398A1 (en) * 2003-05-13 2005-07-28 Olympus Corporation Image processing apparatus
US20050251015A1 (en) * 2004-04-23 2005-11-10 Omron Corporation Magnified display apparatus and magnified image control apparatus
US7454057B2 (en) * 2003-03-28 2008-11-18 Olympus Corporation Image processing apparatus and image processing program
US7454054B2 (en) * 2004-05-07 2008-11-18 Konica Minolta Sensing, Inc. Three-dimensional shape input device

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20010049589A1 (en) * 1993-01-21 2001-12-06 Nikon Corporation Alignment method and apparatus therefor
AU7603894A (en) * 1993-08-27 1995-03-21 Massachusetts Institute Of Technology Example-based image analysis and synthesis using pixelwise correspondence
JPH10327373A (en) * 1997-05-26 1998-12-08 Mitsubishi Electric Corp Eyepiece video display
US6330486B1 (en) * 1997-07-16 2001-12-11 Silicon Graphics, Inc. Acoustic perspective in a virtual three-dimensional environment
US7084877B1 (en) * 2000-06-06 2006-08-01 General Instrument Corporation Global motion estimation for sprite generation
CN100369487C (en) * 2002-04-25 2008-02-13 松下电器产业株式会社 Object detection device, object detection server, and object detection method
US7184609B2 (en) * 2002-06-28 2007-02-27 Microsoft Corp. System and method for head size equalization in 360 degree panoramic images
US7149367B2 (en) * 2002-06-28 2006-12-12 Microsoft Corp. User interface for a system and method for head size equalization in 360 degree panoramic images
US7042508B2 (en) * 2002-07-26 2006-05-09 Appro Technology Inc. Method for presenting fisheye-camera images
US7330556B2 (en) * 2003-04-03 2008-02-12 Gn Resound A/S Binaural signal enhancement system
US7432917B2 (en) * 2004-06-16 2008-10-07 Microsoft Corporation Calibration of an interactive display system
JPWO2006064770A1 (en) * 2004-12-16 2008-06-12 松下電器産業株式会社 Imaging device
US7612794B2 (en) * 2005-05-25 2009-11-03 Microsoft Corp. System and method for applying digital make-up in video conferencing
US7603000B2 (en) * 2005-08-31 2009-10-13 Siemens Medical Solutions Usa, Inc. System and method for learning relative distance in a shape space using image based features
WO2008139251A2 (en) * 2006-04-14 2008-11-20 Patrick Levy Rosenthal Virtual video camera device with three-dimensional tracking and virtual object insertion
US8442330B2 (en) * 2009-03-31 2013-05-14 Nbcuniversal Media, Llc System and method for automatic landmark labeling with minimal supervision

Patent Citations (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5193124A (en) * 1989-06-29 1993-03-09 The Research Foundation Of State University Of New York Computational methods and electronic camera apparatus for determining distance of objects, rapid autofocusing, and obtaining improved focus images
US5687249A (en) * 1993-09-06 1997-11-11 Nippon Telephone And Telegraph Method and apparatus for extracting features of moving objects
US6195119B1 (en) * 1994-12-28 2001-02-27 Olympus America, Inc. Digitally measuring scopes using a high resolution encoder
US6262743B1 (en) * 1995-06-22 2001-07-17 Pierre Allio Autostereoscopic image acquisition method and system
US6211903B1 (en) * 1997-01-14 2001-04-03 Cambridge Technology Development, Inc. Video telephone headset
US6400848B1 (en) * 1999-03-30 2002-06-04 Eastman Kodak Company Method for modifying the perspective of a digital image
US20030117675A1 (en) * 2000-08-30 2003-06-26 Norimitu Shirato Curved image conversion method and record medium where this method for converting curved image is recorded
US20020130961A1 (en) * 2001-03-15 2002-09-19 Lg Electronics Inc. Display device of focal angle and focal distance in iris recognition system
US20040201768A1 (en) * 2002-01-17 2004-10-14 Eastman Kodak Company Electronic imaging system having a sensor for correcting perspective projection distortion
US20040201587A1 (en) * 2002-03-04 2004-10-14 Kazufumi Mizusawa Image combination/conversion apparatus
US20040001146A1 (en) * 2002-06-28 2004-01-01 Zicheng Liu Real-time wide-angle image correction system and method for computer image viewing
US6641053B1 (en) * 2002-10-16 2003-11-04 Xerox Corp. Foreground/background document processing with dataglyphs
US7454057B2 (en) * 2003-03-28 2008-11-18 Olympus Corporation Image processing apparatus and image processing program
US20040207743A1 (en) * 2003-04-15 2004-10-21 Nikon Corporation Digital camera system
US20050163398A1 (en) * 2003-05-13 2005-07-28 Olympus Corporation Image processing apparatus
US20040233543A1 (en) * 2003-05-23 2004-11-25 Chang Matthew Tsu-Yang Optical apparatus for a line scanner system with reduced optical total track
US20050251015A1 (en) * 2004-04-23 2005-11-10 Omron Corporation Magnified display apparatus and magnified image control apparatus
US7454054B2 (en) * 2004-05-07 2008-11-18 Konica Minolta Sensing, Inc. Three-dimensional shape input device

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104811602A (en) * 2014-01-24 2015-07-29 维科技术有限公司 Self-shooting method and self-shooting device for mobile terminals
CN104994282A (en) * 2015-06-30 2015-10-21 广东欧珀移动通信有限公司 Large view angle camera control method and user terminal
CN104994280A (en) * 2015-06-30 2015-10-21 广东欧珀移动通信有限公司 Method for processing preview image and user terminal
CN105049713A (en) * 2015-06-30 2015-11-11 广东欧珀移动通信有限公司 Preview image processing method and user terminal
CN105049716A (en) * 2015-06-30 2015-11-11 广东欧珀移动通信有限公司 Preview image processing method and user terminal
CN105049714A (en) * 2015-06-30 2015-11-11 广东欧珀移动通信有限公司 Figure image preview method and device

Also Published As

Publication number Publication date
US7742624B2 (en) 2010-06-22
US20070248281A1 (en) 2007-10-25
US20110169986A1 (en) 2011-07-14
US8494224B2 (en) 2013-07-23

Similar Documents

Publication Publication Date Title
US8494224B2 (en) Perspective improvement for image and video applications
US10339386B2 (en) Unusual event detection in wide-angle video (based on moving object trajectories)
KR101091297B1 (en) Image processing apparatus, and method of the same
US7711155B1 (en) Method and system for enhancing three dimensional face modeling using demographic classification
US6198484B1 (en) Stereoscopic display system
US7330584B2 (en) Image processing apparatus and method
TWI610571B (en) Display method, system and computer-readable recording medium thereof
WO2017045326A1 (en) Photographing processing method for unmanned aerial vehicle
KR20200023651A (en) Preview photo blurring method and apparatus and storage medium
CN107800965B (en) Image processing method, device, computer readable storage medium and computer equipment
CN113518996A (en) Damage detection from multiview visual data
JP2011166305A (en) Image processing apparatus and imaging apparatus
CN110245549B (en) Real-time face and object manipulation
CN107436681A (en) Automatically adjust the mobile terminal and its method of the display size of word
CN107368817B (en) Face recognition method and device
CN116453198A (en) Sight line calibration method and device based on head posture difference
Jabar et al. Content-aware perspective projection optimization for viewport rendering of 360 images
JP2005174148A (en) Image pickup device and method, and image pickup system
JP4586447B2 (en) Image processing apparatus and method, and program
KR101971258B1 (en) Perspective distortion correcting apparatus for image and method threrof
CN117480776A (en) Video conference terminal
US20190387166A1 (en) Image processing apparatus and program
CN114531560A (en) Video call method and device
KR20220125423A (en) Server and system for auto-filming and method for auto-filming by using electronic device
CN117876296A (en) Face image normalization method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
AS Assignment

Owner name: MOTOROLA, INC., ILLINOIS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:SUPER, BOAZ J.;AUGUSTINE, BRUCE A.;CRENSHAW, JAMES E.;AND OTHERS;SIGNING DATES FROM 20060417 TO 20060420;REEL/FRAME:024325/0405

AS Assignment

Owner name: MOTOROLA MOBILITY, INC, ILLINOIS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MOTOROLA, INC;REEL/FRAME:025673/0558

Effective date: 20100731

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION

AS Assignment

Owner name: MOTOROLA MOBILITY LLC, ILLINOIS

Free format text: CHANGE OF NAME;ASSIGNOR:MOTOROLA MOBILITY, INC;REEL/FRAME:033578/0165

Effective date: 20120622

AS Assignment

Owner name: GOOGLE TECHNOLOGY HOLDINGS LLC, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MOTOROLA MOBILITY LLC;REEL/FRAME:034534/0439

Effective date: 20141028