US8648919B2 - Methods and systems for image stabilization - Google Patents

Methods and systems for image stabilization Download PDF

Info

Publication number
US8648919B2
US8648919B2 US13/154,375 US201113154375A US8648919B2 US 8648919 B2 US8648919 B2 US 8648919B2 US 201113154375 A US201113154375 A US 201113154375A US 8648919 B2 US8648919 B2 US 8648919B2
Authority
US
United States
Prior art keywords
image
motion
data
capturing device
frame
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active, expires
Application number
US13/154,375
Other versions
US20120307085A1 (en
Inventor
William E. Mantzel
Kenneth I. Greenebaum
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Apple Inc
Original Assignee
Apple Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Apple Inc filed Critical Apple Inc
Priority to US13/154,375 priority Critical patent/US8648919B2/en
Assigned to APPLE INC. reassignment APPLE INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: Mantzel, William E., GREENEBAUM, KENNETH I.
Priority to PCT/US2012/038569 priority patent/WO2012170179A1/en
Priority to TW101119998A priority patent/TWI464707B/en
Publication of US20120307085A1 publication Critical patent/US20120307085A1/en
Application granted granted Critical
Publication of US8648919B2 publication Critical patent/US8648919B2/en
Active legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/68Control of cameras or camera modules for stable pick-up of the scene, e.g. compensating for camera body vibrations
    • H04N23/689Motion occurring during a rolling shutter mode
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/68Control of cameras or camera modules for stable pick-up of the scene, e.g. compensating for camera body vibrations

Definitions

  • Embodiments of the invention are generally related to methods and systems for image stabilization using a motion-estimating device.
  • Image-capturing devices include cameras, portable handheld electronic devices, and other electronic devices.
  • the images captured by image-capturing devices may be compromised based on motion of the image-capturing devices. For example, vibration, camera shake, or rotation of the camera may blur images.
  • One prior approach uses software that compares similar portions of different frames and adjusts the output image based on the comparison. This approach typically compensates for translational motion, but fails to compensate for rotational motion.
  • Some image-capturing devices may use what could be referred to as a rolling shutter as a method of image acquisition in which each frame is recorded not from a snapshot of an entire frame at a single point in time, but rather by scanning across the frame, one line at a time, either vertically or horizontally. In other words, not all parts of the image are recorded at exactly the same time, even though the whole frame is displayed at the same time during playback.
  • At least some CMOS image sensors have a rolling shutter. Rolling shutter produces predictable distortions of fast-moving objects or when the sensor captures rapid flashes of light. This method is implemented by rolling (moving) the shutter across the exposable image area instead of exposing the image area all at the same time. Rolling shutters can cause such effects as skew and wobble.
  • Skews occur when the image bends diagonally in one direction or another as the camera or subject moves from one side to another, exposing different parts of the image at different times.
  • Wobble is most common in hand-held shots at telephoto settings and most extreme in cases when the camera is vibrating due to being attached to a moving vehicle.
  • the rolling shutter causes the image to wobble unnaturally and playfully. This is often called the jello effect.
  • an image-capturing device includes a camera and a motion-estimating device.
  • the image-capturing device utilizes camera calibration data in one embodiment to map image coordinates of an image plane of the image sensor into normalized image coordinates of a coordinate space.
  • the motion-estimating device can determine motion data (e.g., three dimensional rotation data) for the device.
  • the device matches motion data to a sequence of frames captured by the device to determine motion data for each frame.
  • the device estimates an estimated motion path of the device based on the motion data.
  • the device constructs a target motion path for the image-capturing device based on the estimated motion path.
  • the device computes a desired motion correction based on the estimated motion path and the target motion path.
  • the device utilizes camera calibration data to resample each frame to generate a corrected sequence of stabilized frames according to the desired motion correction.
  • a user may capture a sequence of images with the device.
  • the motion path is constructed based on motion data that indicates sudden movement or subtle movement (e.g., camera shake from a user, vibration, rotation of camera, etc.).
  • the stabilized frames compensate for the unintended motion of the device during image capture.
  • an image capturing system includes a rolling shutter image sensor that may cause a rolling shutter artifact (e.g., warping).
  • the system includes a motion-estimating device to detect motion data and a processing system that is configured to perform an automatic rolling shutter correction mechanism.
  • the correction mechanism utilizes calibration data based on a relationship between pixel locations in an image plane of the image sensor and their corresponding rays of light in a coordinate space (e.g., three dimensional space), determines pixel velocity components based on the calibration data, and estimates for each image an aggregate pixel velocity based on an aggregation of the pixel velocity components and corresponding rotational velocity values, which are determined from the motion data.
  • the correction mechanism resamples each image to generate a new corrected image that is based on the aggregate pixel velocity.
  • FIG. 1 illustrates a flow diagram in one embodiment of the present invention for a computer-implemented method 100 of stabilizing images (e.g., sequence of images, video) captured with an image-capturing device.
  • images e.g., sequence of images, video
  • FIG. 2 illustrates the construction of motion paths of an image-capturing device in one embodiment of the present invention.
  • FIG. 3 illustrates frame resampling to be applied to an exploded view of a subset of a frame in one embodiment of the present invention.
  • FIG. 4 illustrates an example of an image 400 in one embodiment of the present invention.
  • FIG. 5 illustrates a flow diagram in one embodiment of the present invention for a computer-implemented method 500 of a rolling shutter correction of images (e.g., sequence of images, video) captured with an image-capturing device.
  • images e.g., sequence of images, video
  • FIGS. 6-8 illustrate pre-computed velocity components (e.g., V x , V y , and V z ) in one embodiment of the present invention.
  • FIG. 9 shows in one embodiment of the present invention a wireless image-capturing device which includes the capability for wireless communication and for capturing images.
  • FIG. 10 is a block diagram of one embodiment of the present invention of a system 1000 .
  • FIG. 11 illustrates aggregating a weighted sum of each pre-computed pixel velocity with weights corresponding to the rotational velocity value for each dimension in one embodiment of the present invention.
  • FIG. 12 illustrates the generation of a displacement map based on the aggregate pixel velocity vector V 1220 and (t m ⁇ t 0 ) 1210 in one embodiment of the present invention.
  • FIGS. 13A-13D illustrate a sequence of images for showing a rolling shutter correction in one embodiment of the present invention.
  • FIGS. 14 and 15 illustrate the instantaneous rotation of the image-capturing device in one embodiment of the present invention.
  • an image-capturing device includes an image sensor (e.g., camera) and a motion-estimating device.
  • the motion-estimating device can, in one embodiment, determine motion data for the device.
  • the device matches motion data to a sequence of frames captured by the device to determine motion data for each frame.
  • the device constructs a target motion path for the image-capturing device based on the motion data for each frame.
  • the device computes a desired motion correction from an estimated motion path observed in the motion data to the target motion path. Then, the device resamples each frame to generate stabilized frames based on the desired motion correction.
  • an image capturing system includes a rolling shutter image sensor that may cause a rolling shutter artifact (e.g., warping).
  • the system includes a motion-estimating device to detect motion data and a processing system that is configured to perform an automatic rolling shutter correction mechanism.
  • FIG. 1 illustrates a flow diagram in one embodiment of the present invention for a computer-implemented method 100 of stabilizing images (e.g., sequence of images, video) captured with an image-capturing device.
  • the computer-implemented method 100 is performed by processing logic that may comprise hardware (circuitry, dedicated logic, etc.), software (such as is run on a general purpose computer system or a dedicated machine or a system), or a combination of both.
  • the processing logic sends information to and receives information from an image sensing unit having a microprocessor and image sensors.
  • the image sensing unit may send frames of metadata (e.g., focal-number, exposure time, white balance) to the processing logic. Pixel values are read from the image sensors to generate image data. Frames are sent at a certain time interval (e.g., 1/15 of a second) to the processing logic.
  • the frames are stabilized by correcting for a rotational path of the image-capturing device as discussed below.
  • processing logic e.g., one or more processing units
  • the calibration data is utilized to map image coordinates (e.g., two dimensional pixels) of an image plane of an image sensor of the camera into normalized image coordinates (e.g., light rays) of a three dimensional coordinate space at block 103 .
  • the calibration may be an offline one time process.
  • the processing logic (e.g., one or more processing units) captures a sequence of images (e.g., frames, video).
  • the processing logic determines motion data (e.g., three dimensional rotation data) for the device using the motion-estimating device.
  • the motion-estimating device can be a gyroscope or an accelerometer or a combination of both.
  • the gyroscope may provide three dimensional rotation data and the accelerometer may provide three dimensional translation data (six degrees of freedom). Determining the three dimensional motion data for each frame may include determining rotational velocity vectors in three dimensions for each frame.
  • the processing logic matches motion data to the sequence of frames captured by the image-capturing device to determine three dimensional motion data for each frame.
  • Matching motion data to the sequences of frames may include translating time stamps of the motion-estimating device into video time of the frames and also integrating rotational velocity data (e.g., rotational velocity vectors) received from the motion-estimating device to estimate inter-frame rotation (e.g., ⁇ [ ⁇ ]).
  • rotational velocity data e.g., rotational velocity vectors
  • a live bias estimate between time stamps of the motion-estimating device and video time of the frames may be available using a long-term average (e.g., 5 minutes).
  • the processing logic estimates an estimated motion path (e.g., observed motion path) of the image-capturing device based on the three dimensional motion data for each frame.
  • the estimated motion path is constructed for motion data that indicates sudden movement or subtle movement (e.g., camera shake from a user, vibration).
  • the estimated motion path may be rough or jagged depending on the movement of the camera.
  • the processing logic constructs a target motion path of the image-capturing device based upon the estimated motion path.
  • the target motion path can be a smoothed (e.g., filtered) version of the estimated motion path.
  • the processing logic computes a desired motion correction from the estimated motion path to the target motion path.
  • the processing logic utilizes camera calibration data to resample each frame to generate a correct sequence of stabilized frames according to the desired motion correction.
  • the processing logic optionally performs an adaptive crop and fill of an unknown region (e.g., dark region) of the stabilized frames if necessary.
  • the operations of the method 100 provide pre-processing that may be part of a compression algorithm of the frames or decoupled from the compression algorithm.
  • the compressed video frames may require less memory space or provide higher image quality at a lower bit rate based on the operations of the method 100 .
  • the motion-estimating device can be a gyroscope, an accelerometer, or any combination thereof in single or multi physical packages.
  • FIG. 2 illustrates the construction of motion paths of an image-capturing device in one embodiment of the present invention.
  • the processing logic constructs a rough motion path 220 of the image-capturing device based on the three dimensional motion data for each frame.
  • the processing logic can apply a filter in each dimension, such as a low pass or predictive filter, (possibly inducing a short delay to construct a smooth motion path 220 from the rough motion path 200 .
  • Smooth motion path 220 represents a desirable target path of the image-capturing device during a time of image capture.
  • FIG. 3 illustrates frame resampling to be applied to an exploded view of a subset of a frame in one embodiment of the present invention.
  • the processing logic can apply frame resampling to the exploded view 300 .
  • the frame resampling uses the smooth motion path 220 to correct pixels in the subset 330 of the frame 320 .
  • the processing logic may artificially rotate an observer's viewing direction (e.g., user's viewing direction) based on the difference between motion path 220 and motion path 200 .
  • the frame resampling uses interpolation (e.g., bilinear interpolation) to construct new frames.
  • Frame 4 illustrates in an example of an image 400 in one embodiment of the present invention.
  • a central region 402 includes an object 410 to be captured.
  • Peripheral regions 403 and 404 may be dark regions of the image.
  • the regions 403 and 404 can be cropped or eliminated from the image 404 . Pixel values may be missing for these regions. These values can be filled in based on adjacent frames or in painting techniques. Alternatively, the cropped region is constrained within the original frame.
  • the method 100 provides image stabilization to correct for rotational motion and vibration of an image-capturing device.
  • Translational vibration accounts for approximately 10% of vibration and requires depth knowledge to correct.
  • Rotational vibration generally accounts for the large majority of vibration-induced distortions and does not require depth knowledge to correct.
  • Short term rotational offsets may be accurate to within approximately 1 milliradian.
  • This method 100 can also be used for intermediate frame generation and rolling shutter correction.
  • a roller shutter artifact may occur because scan lines are read one at a time from an image sensor (e.g., CMOS image sensor) and the camera itself moves during the image capture time period.
  • FIG. 5 illustrates a flow diagram in one embodiment of the present invention for a computer-implemented method 500 of a rolling shutter correction of images (e.g., sequence of images, video) captured with an image-capturing device.
  • the computer-implemented method 500 is performed by processing logic that may comprise hardware (circuitry, dedicated logic, etc.), software (such as is run on a general purpose computer system or a dedicated machine or a system), or a combination of both.
  • the method 500 determines a value of a pixel a certain time period ago or in the future (e.g., 1 millisecond, 2 millisecond) to correct for rolling shutter effects.
  • processing logic calibrates a camera of the image-capturing device in order to generate calibration data.
  • the calibration data is utilized in order to transform image coordinates (e.g., two dimensional pixels) of an image plane of an image sensor of the camera into a three dimensional direction vector.
  • a calibration model is a parametric way of describing the connection between pixel locations in the image plane and their corresponding rays of light in the three dimensional space from the perspective of a camera observer.
  • a three dimensional rotation can be applied to the direction vector. An application of the three dimensional rotation and the direction vectors results in determining where a pixel would move under a hypothetical camera rotation in three dimensional space.
  • the calibration may be an offline one time process.
  • the calibration parameters may include numerous parameters as described herein.
  • the variable parameters include focal length f or equivalently field of view and two more parameters to describe radial distortion ⁇ .
  • the skew may be zero.
  • the focal length f x can be modeled as approximately 1.1823*(2c x ) and f y can be modeled as approximately 1.1966*(2c x ).
  • a function F is obtained that converts normalized pixel coordinates (e.g., x vector) to actual pixel coordinates (e.g., m vector) as indicated by the following equation.
  • m vector F ( x
  • the processing logic calibrates and synchronizes a motion-estimating device with the camera of the image-capturing device.
  • This synchronization may include translating time stamps of the motion-estimating device into video time of the captured images.
  • a live bias estimate between time stamps of the motion-estimating device and video time of the frames may be available using a long-term average (e.g., 5 minutes).
  • the processing logic determines pre-computed predicted pixel velocity components (e.g., V x , V y , and V z ) from the calibration data.
  • the pixel velocity components may be determined in two dimensions from the calibration data.
  • the processing logic e.g., one or more processing units
  • captures a sequence of images e.g., video, frames.
  • the processing logic determines motion data of the camera during capture of the images.
  • the motion data may include rotational velocity vector w (e.g., w x , w y , and w z ).
  • the processing logic estimates for each image a corresponding aggregate pixel velocity vector V based on an aggregation of the pixel velocity components.
  • This aggregation may include a weighted sum of the components w x V x , w y V y , and w z V z .
  • the rotational velocity weights w x , w y , and w z are scalars and the pixel velocity components (e.g., V x , V y , and V z ) are functions.
  • the processing logic resamples each image (e.g., using bilinear interpolation) to generate a new image (e.g., frame) ⁇ by assigning for each pixel a new value as indicated by the following equation.
  • new image ⁇ (vector m ) I (vector m ⁇ ( t m ⁇ t 0 )* V (vector m ))
  • the time when row m was captured is represented by t m and an initial arbitrary time (e.g., first row, middle row, etc.) for image capture is represented by t o .
  • the new image ⁇ may include new pixel locations based on the movement of the camera. A new pixel location may not be located on a grid location of the new image and may be interpolated based on a certain number of nearest neighbor pixels. An extrapolation may be necessary for calculating a new pixel's color in the case of missing pixels.
  • FIGS. 6-8 illustrate pre-computed velocity components (e.g., V x , V y , and V z ) in one embodiment of the present invention.
  • the pixels in FIG. 6 move upwards along the y axis (e.g., velocity component 601 ) due to an estimated or predicted rotational movement of the camera.
  • the pixels in FIG. 7 move to the right along the x axis (e.g., velocity component 701 ) due to an estimated rotational movement of the camera.
  • the pixels in FIG. 8 move in a clockwise direction (e.g., velocity component 801 ) due to an estimated rotational movement of the camera.
  • the path of each pixel can be determined based on these velocity components.
  • a pixel velocity field V may be computed from a rotational velocity w and the component velocity maps V x , V y , and V z .
  • the composition of transformations is described as follows.
  • the actual pixel coordinates, represented by vector m, of an image plane of the image sensor are mapped or translated into a three dimensional direction vector x.
  • An estimated three dimensional rotation can be applied to the direction vector x to generate vector x′.
  • the estimated or predicted rotation of the camera may correspond to a human model for hand vibration while the human holds the camera.
  • This model may estimate or predict camera rotation for when the user turns his hand slightly in one or more directions during a time period for capturing a row or rows of pixels.
  • this model is designed for predicted vibrations having a frequency less than or equal to 100 hertz. Vibrations for frequencies greater than 100 hertz are not likely caused by human rotational movement and these vibrations are more difficult to correct.
  • Vector x′ is translated into vector m′ with function F.
  • the velocity components e.g., V x , V y , and V z ) are approximately equal to a difference between m and m′ under incremental rotations about the x, y, and z axis respectively.
  • FIG. 11 illustrates aggregating a weighted sum of each pre-computed pixel velocity with weights corresponding to the rotational velocity value for each dimension in one embodiment of the present invention.
  • a weighted sum of the components w x V x 1110 , w y V y 1120 , and w z V z 1130 generates the aggregate pixel velocity vector V 1140 .
  • w x 0.2 radians
  • w y ⁇ 0.2 radians
  • w z 0.8 radians.
  • the displacement map for a given pixel of an image sensor is generated by multiplying the aggregate pixel velocity vector V 1220 and (t m ⁇ t 0 ) 1210 .
  • the displacement map indicates a displacement for a given pixel of the image sensor based on rotation or movement of the camera during image capture.
  • FIGS. 13A-13D illustrate a sequence of images for showing a rolling shutter correction in one embodiment of the present invention.
  • FIG. 13A illustrates a reference image 1300 , which represents an original scene with stop sign 1302 that was captured with an image-capturing device (e.g., an iPhone 4) with no rolling shutter correction.
  • the reference image 1300 may include radial distortion.
  • FIG. 13B illustrates a simulated image 1310 with stop sign 1312 .
  • the simulated image 1310 is a simulated rendering of the image 1300 using a rolling shutter exposure under the effects of an instantaneous rotation of the image-capturing device.
  • FIGS. 14 and 15 illustrate the instantaneous rotation of the image-capturing device in one embodiment of the present invention.
  • FIG. 14 illustrates the rotation in degrees on a vertical axis versus t m ⁇ t 0 in milliseconds on a horizontal axis. For example, for a 2 millisecond exposure time, the image-capturing device rotates from approximately ⁇ 25 degrees to approximately 25 degrees.
  • FIG. 15 illustrates the rotation in degrees on a vertical axis versus row number of an image sensor of the image-capturing device on a horizontal axis.
  • FIG. 13C illustrates a resampled image 1320 with stop sign 1322 .
  • the image 1320 simulates the unwarping of the image 1310 based on available information, which may be imperfect.
  • the correction is performed using imperfect rotation information with a signal to noise ratio that may be 10:1 (i.e., approximately 4 degrees of error for a rotation of 40 degrees over the exposure interval).
  • the error may be caused by noise from the motion-estimating device (e.g., gyroscope, accelerometer) measurements as well as modeling errors that result from sources of motion (e.g., motion in the scene, translation, etc.). These sources of motion are difficult to accurately model.
  • an image-capturing device such as a digital processing system (e.g., conventional, general-purpose computer system).
  • Special purpose computers which are designed or programmed to perform only one function, may also be used.
  • the methods, systems, and apparatuses of the present disclosure can be implemented in various devices including electronic devices, consumer devices, data processing systems, desktop computers, portable computers, wireless devices, cellular devices, tablet devices, handheld devices, multi touch devices, multi touch data processing systems, any combination of these devices, or other like devices.
  • FIGS. 9 and 10 illustrate examples of a few of these devices, which are capable of capturing still images and video to implement the methods of the present disclosure.
  • the methods e.g., 100, 500
  • FIG. 9 shows in one embodiment of the present invention a wireless image-capturing device which includes the capability for wireless communication and for capturing images.
  • Wireless device 900 may include an antenna system 901 .
  • Wireless device 900 may also include a digital and/or analog radio frequency (RF) transceiver 902 , coupled to the antenna system 901 , to transmit and/or receive voice, digital data and/or media signals through antenna system 901 .
  • RF radio frequency
  • Wireless device 900 may also include a digital processing system 903 to control the digital RF transceiver and to manage the voice, digital data and/or media signals.
  • Digital processing system 903 may be a general purpose processing system, such as a microprocessor or controller for example.
  • Digital processing system 903 may also be a special purpose processing system, such as an ASIC (application specific integrated circuit), FPGA (field-programmable gate array) or DSP (digital signal processor).
  • Digital processing system 903 may also include other devices, as are known in the art, to interface with other components of wireless device 900 .
  • digital processing system 903 may include analog-to-digital and digital-to-analog converters to interface with other components of wireless device 900 .
  • Digital processing system 903 may include a media processing system 909 , which may also include a general purpose or special purpose processing system to manage media, such as files of audio data.
  • Wireless device 900 may also include a storage device 904 , coupled to the digital processing system, to store data and/or operating programs for the Wireless device 900 .
  • Storage device 904 may be, for example, any type of solid-state or magnetic memory device.
  • Storage device 904 may be or include a machine-readable medium.
  • a machine-readable medium includes any mechanism for storing or transmitting information in a form readable by a machine (e.g., a computer).
  • machines store and communicate (internally and with other devices over a network) code and data using machine-readable media, such as machine storage media (e.g., magnetic disks; optical disks; random access memory; read only memory; flash memory devices; phase-change memory).
  • machine storage media e.g., magnetic disks; optical disks; random access memory; read only memory; flash memory devices; phase-change memory.
  • Wireless device 900 may also include one or more input devices 905 , coupled to the digital processing system 903 , to accept user inputs (e.g., telephone numbers, names, addresses, media selections, etc.)
  • Input device 905 may be, for example, one or more of a keypad, a touchpad, a touch screen, a pointing device in combination with a display device or similar input device.
  • Wireless device 900 may also include at least one display device 906 , coupled to the digital processing system 903 , to display information such as messages, telephone call information, contact information, pictures, movies and/or titles or other indicators of media being selected via the input device 905 .
  • Display device 906 may be, for example, an LCD display device.
  • display device 906 and input device 905 may be integrated together in the same device (e.g., a touch screen LCD such as a multi-touch input panel which is integrated with a display device, such as an LCD display device).
  • the display device 906 may include a backlight 906 A to illuminate the display device 906 under certain circumstances. It will be appreciated that the wireless device 900 may include multiple displays.
  • Wireless device 900 may also include a battery 907 to supply operating power to components of the system including digital RF transceiver 902 , digital processing system 903 , storage device 904 , input device 905 , microphone 905 A, audio transducer 908 , media processing system 909 , sensor(s) 910 , and display device 906 , an image sensor 959 (e.g., CCD (Charge Coupled Device), CMOS sensor).
  • the image sensor may be integrated with an image processing unit 960 .
  • the display device 906 may include a Liquid Crystal Display (LCD) which may be used to display images which are captured or recorded by the wireless image-capturing device 900 .
  • the LCD serves as a viewfinder of a camera (e.g., combination of lens 963 , image sensor 959 , and unit 960 ) and there may optionally be other types of image display devices on device 900 which can serve as a viewfinder.
  • LCD Liquid Crystal Display
  • the device 900 also includes an imaging lens 963 which can be optically coupled to image sensor 959 .
  • the processing system 903 controls the operation of the device 900 ; and, it may do so by executing a software program stored in ROM 957 , or in the processing system 903 , or in both ROM 957 and the processing system 903 .
  • the processing system 903 controls the image processing operation; and, it controls the storage of a captured image in storage device 904 .
  • the processing system 903 also controls the exporting of image data (which may or may not be color corrected) to an external general purpose computer or special purpose computer.
  • the processing system 903 also responds to user commands (e.g., a command to “take” a picture or video by capturing image(s) on the image sensor and storing it in memory or a command to select an option for contrast enhancement and color balance adjustment).
  • user commands e.g., a command to “take” a picture or video by capturing image(s) on the image sensor and storing it in memory or a command to select an option for contrast enhancement and color balance adjustment.
  • the ROM 957 may store software instructions for execution by the processing system 903 to perform the automatic image stabilization and rolling shutter correction mechanisms discussed in the present disclosure.
  • the storage device 904 is used to store captured/recorded images which are received from the image sensor 959 . It will be appreciated that other alternative architectures of a camera can be used with the various embodiments of the invention.
  • Battery 907 may be, for example, a rechargeable or non-rechargeable lithium or nickel metal hydride battery.
  • Wireless device 900 may also include audio transducers 908 , which may include one or more speakers, and at least one microphone 905 A, and an accelerometer 946 .
  • the device 900 also includes a motion or orientation detector 940 (e.g., accelerometer, gyroscope, or any combination thereof) for determining motion data or an orientation of the device 900 .
  • a motion or orientation detector 940 e.g., accelerometer, gyroscope, or any combination thereof
  • the image-capturing device 900 is designed to stabilize images and video.
  • the image-capturing device 900 includes the image sensor 959 with associated calibration data to sense images, the storage device 904 to store captured images, the motion-estimating device 940 to detect motion data for the device, and the processing system 903 which is coupled to the storage device and the motion-estimating device.
  • the processing system is configured to perform an automatic image stabilization mechanism by determining motion data for the image-capturing device using the motion-estimating device, matching motion data to a sequence of frames captured by the image-capturing device to determine three dimensional motion data for each frame, and estimating an estimated motion path (e.g., rough motion path) of the image-capturing device based on the three dimensional motion data for each frame. Determining the three dimensional motion data for each frame includes determining rotational velocity vectors in three dimensions for each frame.
  • the processing system is further configured to construct a target motion path (e.g., smooth motion path) of the system based on the estimated motion path of the system. Constructing the target motion path may include filtering the estimated motion path.
  • the processing system is further configured to compute a desired motion correction from the estimated motion path to the target motion path.
  • the processing system is further configured to correct image coordinates of the image plane based on the calibration data and the desired motion correction, which may be determined based on a difference between the target and estimated motion paths of the system, apply resampling for each frame from the original image coordinates to the corrected image coordinates to generate stabilized frames and adaptively crop and fill an unknown region if necessary of the stabilized frames.
  • Correcting image coordinates of the image plane based on the calibration data and the difference between the estimated and target motion paths includes utilizing the calibration data to map the image coordinates, which represent two dimensional pixels, into the normalized image coordinates of the coordinate space, which represent light rays, applying a desired rotation to all light rays, according to the difference between the estimated and target motion paths, and utilizing the calibration data to map these rotated light rays back to the image plane.
  • Matching motion data to a sequence of frames captured by the image-capturing device to determine three dimensional motion data for each frame may include translating time stamps of the motion-estimating device into video time of the frames and integrating rotational velocity data received from the motion-estimating device to estimate inter-frame rotation.
  • an image capturing system (e.g., image-capturing device 900 ) is designed to correct for rolling shutter effects (e.g., warping) and compensate for vibrations and rotational movements of the image capturing system.
  • the image capturing system includes an image sensor 959 to sense images, a storage device 904 that is coupled to the image sensor.
  • the storage device stores captured images.
  • the motion-estimating device 940 e.g., gyroscope, accelerometer
  • the processing system 903 is coupled to the storage device and the motion-estimating device.
  • the processing system is configured to perform an automatic image sensor correction mechanism to utilize calibration data based on a relationship between pixel locations in an image plane of the image sensor and their corresponding rays of light in a three dimensional space, to determine pixel velocity components based on the calibration data, and to estimate for each image an aggregate pixel velocity based on an aggregation of the pixel velocity components.
  • the pixel velocity components include pre-computed predicted velocity components that are computed prior to sensing the images. Determining the pixel velocity components may include constructing pixel velocity maps from the calibration data. Estimating for each image the aggregate pixel velocity based on the aggregation of the pixel velocity components includes aggregating a weighted sum of each pre-computed pixel velocity corresponding to the rotational velocity value (e.g., w x , w y , and w z ) for each dimension.
  • the rotational velocity value e.g., w x , w y , and w z
  • the processing system is further configured to resample each image to generate a new image to perform the rolling shutter correction mechanism. Resampling each image to generate a new image is based on a current image and the aggregate pixel velocity. The new images have compensated for rolling shutter effects, vibrations, and rotational movement of the image capturing system.
  • FIG. 10 is a block diagram of one embodiment of the present invention of a system 1000 that generally includes one or more computer-readable mediums 1001 , processing system 1004 , Input/Output (I/O) subsystem 1006 , radio frequency (RF) circuitry 1008 , audio circuitry 1010 , and an image sensor 1059 (e.g., CCD (Charge Coupled Device), CMOS sensor).
  • the image sensor may be integrated with an image processing unit 1060 .
  • the image sensor 1059 is optically coupled to receive light from a lens 1063 , which can be used for capturing images with the image sensor.
  • a motion-estimating device 1040 determines motion data in three dimensions for the system 1000 . These components may be coupled by one or more communication buses or signal lines 1003 .
  • FIG. 10 is only one example architecture of system 1000 , and that system 1000 could have more or fewer components than shown, or a different configuration of components.
  • the various components shown in FIG. 10 can be implemented in hardware, software, firmware or any combination thereof, including one or more signal processing and/or application specific integrated circuits.
  • RF circuitry 1008 is used to send and receive information over a wireless link or network to one or more other devices and includes well-known circuitry for performing this function.
  • RF circuitry 1008 and audio circuitry 1010 are coupled to processing system 1004 via peripherals interface 1016 .
  • Interface 1016 includes various known components for establishing and maintaining communication between peripherals and processing system 1004 .
  • Audio circuitry 1010 is coupled to audio speaker 1050 and microphone 1052 and includes known circuitry for processing voice signals received from interface 1016 to enable a user to communicate in real-time with other users.
  • audio circuitry 1010 includes a headphone jack (not shown).
  • Peripherals interface 1016 couples the input and output peripherals of the system to one or more processing units 1018 and computer-readable medium 1001 .
  • One or more processing units 1018 communicate with one or more computer-readable mediums 1001 via controller 1520 .
  • Computer-readable medium 1001 can be any device or medium (e.g., storage device, storage medium) that can store code and/or data for use by one or more processing units 1018 .
  • Medium 1001 can include a memory hierarchy, including but not limited to cache, main memory and secondary memory.
  • the memory hierarchy can be implemented using any combination of RAM (e.g., SRAM, DRAM, DDRAM), ROM, FLASH, magnetic and/or optical storage devices, such as disk drives, magnetic tape, CDs (compact disks) and DVDs (digital video discs).
  • Medium 1001 may also include a transmission medium for carrying information-bearing signals indicative of computer instructions or data (with or without a carrier wave upon which the signals are modulated).
  • the transmission medium may include a communications network, including but not limited to the Internet (also referred to as the World Wide Web), intranet(s), Local Area Networks (LANs), Wide Local Area Networks (WLANs), Storage Area Networks (SANs), Metropolitan Area Networks (MAN) and the like.
  • One or more processing units 1018 run various software components stored in medium 1001 to perform various functions for system 1000 .
  • the software components include operating system 1022 , communication module (or set of instructions) 1024 , touch processing module (or set of instructions) 1026 , graphics module (or set of instructions) 1028 , one or more applications (or set of instructions) 1030 , and modules [or set of instructions] 1038 and 1039 .
  • the image stabilization module 1038 and rolling shutter correction module 1039 each correspond to a set of instructions for performing one or more functions described above and the methods described in this application (e.g., the computer-implemented methods and other information processing methods described herein).
  • These modules i.e., sets of instructions
  • medium 1001 may store a subset of the modules and data structures identified above. Furthermore, medium 1001 may store additional modules and data structures not described above.
  • Operating system 1022 includes various procedures, sets of instructions, software components and/or drivers for controlling and managing general system tasks (e.g., memory management, storage device control, power management, etc.) and facilitates communication between various hardware and software components.
  • general system tasks e.g., memory management, storage device control, power management, etc.
  • Communication module 1024 facilitates communication with other devices over one or more external ports 1036 or via RF circuitry 1008 and includes various software components for handling data received from RF circuitry 1008 and/or external port 1036 .
  • Graphics module 1028 includes various known software components for rendering, animating and displaying graphical objects on a display surface.
  • touch I/O device 1012 is a touch sensitive display (e.g., touch screen)
  • graphics module 1028 includes components for rendering, displaying, and animating objects on the touch sensitive display.
  • One or more applications 1030 can include any applications installed on system 1000 , including without limitation, a browser, address book, contact list, email, instant messaging, word processing, keyboard emulation, widgets, JAVA-enabled applications, encryption, digital rights management, voice recognition, voice replication, location determination capability (such as that provided by the global positioning system (GPS), a music player, etc.
  • a browser address book, contact list, email, instant messaging, word processing, keyboard emulation, widgets, JAVA-enabled applications, encryption, digital rights management, voice recognition, voice replication, location determination capability (such as that provided by the global positioning system (GPS), a music player, etc.
  • GPS global positioning system
  • Touch processing module 1026 includes various software components for performing various tasks associated with touch I/O device 1012 including but not limited to receiving and processing touch input received from I/O device 1012 via touch I/O device controller 1032 .
  • System 1000 may image stabilization module 1038 , rolling shutter correction module 1039 , and image capturing unit 1060 for performing the method/functions as described herein in connection with FIGS. 1-10 .
  • the image capturing unit 1060 is integrated with the system 1000 and may be coupled to the peripheral interface 1016 as illustrated in FIG. 10 or integrated with one of the I/O devices 1012 or 1014 .
  • the computer readable medium 1001 contains executable computer program instructions (e.g., module 1038 ) which when executed by the data processing system 1000 cause said system to perform a method.
  • the method utilizes calibration data of a camera of the system to map image coordinates of an image plane of an image sensor of the camera into normalized image coordinates of a three dimensional coordinate space.
  • the method determines motion data for the system using the motion-estimating device 1040 (e.g., gyroscope, accelerometer).
  • the method also matches motion data to a sequence of frames captured by the system to determine three dimensional motion data for each frame, estimates estimates a motion path (e.g., rough motion path) of the system based on the three dimensional motion data for each frame, constructs a target motion path (e.g., smooth motion path) of the system based on the motion data for each frame, and computes a desired motion correction from the estimated motion path observed in the motion data to the target motion path.
  • the method corrects image coordinates of the image plane based on the desired motion correction and generates stabilized frames based on the corrected image coordinates.
  • Matching motion data to a sequence of frames may include translating time stamps of the motion-estimating device into video time of the frames.
  • Constructing the target motion path may include applying a low-pass or predictive filter in each dimension to the three dimensional motion data used to construct the estimated motion path (possibly inducing a delay), and estimating a necessary correction based on the estimated motion path.
  • the method also artificially rotates a user's viewing direction based on the desired motion correction, applies bilinear interpolation to generate stabilized frames based on the corrected image coordinates, and adaptively crops and fills an unknown region of the stabilized frames if an unknown region (e.g., dark pixels) exists.
  • an unknown region e.g., dark pixels
  • a computer readable medium contains executable computer program instructions (e.g., module 1039 ) which when executed by the data processing system 1000 cause said system to perform a method.
  • the method utilizes calibration data for the system, which has an image sensor 1061 and a motion-estimating device 1040 (e.g., gyroscope, accelerometer), by transforming two-dimensional pixel locations in an image plane of the image sensor into a three dimensional direction vector.
  • the method determines pixel velocity components based on the calibration data, captures a sequence of images with the system, determines motion data with the motion-estimating device during image capture, and estimates for each image an aggregate pixel velocity based on an aggregation of the pixel velocity components and corresponding motion data in three dimensions. Estimating may include aggregating a weighted sum of each pre-computed pixel velocity with weights corresponding to the rotational velocity value for each dimension.
  • the method also includes resampling each image to generate a new image.
  • Resampling each image may occur with a binary interpolation to generate a new image that is based on a current image and the aggregate pixel velocity.
  • Modules 1038 and 1039 may be embodied as hardware, software, firmware, or any combination thereof. Although modules 1038 and 1039 are shown to reside within medium 1001 , all or portions of modules 1038 and 1039 may be embodied within other components within system 1000 or may be wholly embodied as a separate component within system 1000 .
  • I/O subsystem 1006 is coupled to touch I/O device 1012 and one or more other I/O devices 1014 for controlling or performing various functions.
  • Touch I/O device 1012 communicates with processing system 1004 via touch I/O device controller 1032 , which includes various components for processing user touch input (e.g., scanning hardware).
  • touch I/O device controller 1032 includes various components for processing user touch input (e.g., scanning hardware).
  • One or more other input controllers 1034 receives/sends electrical signals from/to other I/O devices 1014 .
  • Other I/O devices 1014 may include physical buttons, dials, slider switches, sticks, keyboards, touch pads, additional display screens, or any combination thereof.
  • touch I/O device 1012 displays visual output to the user in a GUI.
  • the visual output may include text, graphics, video, and any combination thereof. Some or all of the visual output may correspond to user-interface objects.
  • Touch I/O device 1012 forms a touch-sensitive surface that accepts touch input from the user.
  • Touch I/O device 1012 and touch screen controller 1032 (along with any associated modules and/or sets of instructions in medium 1001 ) detects and tracks touches or near touches (and any movement or release of the touch) on touch I/O device 1012 and converts the detected touch input into interaction with graphical objects, such as one or more user-interface objects.
  • device 1012 is embodied as a touch screen
  • the user can directly interact with graphical objects that are displayed on the touch screen.
  • the user may indirectly interact with graphical objects that are displayed on a separate display screen embodied as I/O device 1014 .
  • Feedback may be provided by touch I/O device 1012 based on the user's touch input as well as a state or states of what is being displayed and/or of the computing system.
  • Feedback may be transmitted optically (e.g., light signal or displayed image), mechanically (e.g., haptic feedback, touch feedback, force feedback, or the like), electrically (e.g., electrical stimulation), olfactory, acoustically (e.g., beep or the like), or the like or any combination thereof and in a variable or non-variable manner.
  • System 1000 also includes power system 1044 for powering the various hardware components and may include a power management system, one or more power sources, a recharging system, a power failure detection circuit, a power converter or inverter, a power status indicator and any other components typically associated with the generation, management and distribution of power in portable devices.
  • a power management system for powering the various hardware components and may include a power management system, one or more power sources, a recharging system, a power failure detection circuit, a power converter or inverter, a power status indicator and any other components typically associated with the generation, management and distribution of power in portable devices.
  • peripherals interface 1016 , one or more processing units 1018 , and memory controller 1020 may be implemented on a single chip, such as processing system 1004 . In some other embodiments, they may be implemented on separate chips.
  • the present disclosure can relate to an apparatus for performing one or more of the operations described herein. This apparatus may be specially constructed for the required purposes, or it may comprise a general purpose computer selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a machine (e.g.
  • a machine-readable medium includes any mechanism for storing or transmitting information in a form readable by a machine (e.g., a computer).
  • machines store and communicate (internally and with other devices over a network) code and data using machine-readable media, such as machine storage media (e.g., magnetic disks; optical disks; random access memory; read only memory; flash memory devices; phase-change memory).
  • machine storage media e.g., magnetic disks; optical disks; random access memory; read only memory; flash memory devices; phase-change memory.

Abstract

Several methods, devices and systems for stabilizing images and correcting rolling shutter effects are described. In one embodiment, a computer implemented method for image stabilization for an image-capturing device with associated calibration data includes determining motion data for the image-capturing device using a motion-estimating device. The method also includes matching motion data to a sequence of frames captured by the image-capturing device to determine motion data for each frame, constructing a target motion path of the image-capturing device based on the motion data for each frame, and computing a desired motion correction from a motion path observed in the motion data to the target motion path.

Description

Embodiments of the invention are generally related to methods and systems for image stabilization using a motion-estimating device.
BACKGROUND
Image-capturing devices include cameras, portable handheld electronic devices, and other electronic devices. The images captured by image-capturing devices may be compromised based on motion of the image-capturing devices. For example, vibration, camera shake, or rotation of the camera may blur images.
One prior approach uses software that compares similar portions of different frames and adjusts the output image based on the comparison. This approach typically compensates for translational motion, but fails to compensate for rotational motion.
Some image-capturing devices may use what could be referred to as a rolling shutter as a method of image acquisition in which each frame is recorded not from a snapshot of an entire frame at a single point in time, but rather by scanning across the frame, one line at a time, either vertically or horizontally. In other words, not all parts of the image are recorded at exactly the same time, even though the whole frame is displayed at the same time during playback. At least some CMOS image sensors have a rolling shutter. Rolling shutter produces predictable distortions of fast-moving objects or when the sensor captures rapid flashes of light. This method is implemented by rolling (moving) the shutter across the exposable image area instead of exposing the image area all at the same time. Rolling shutters can cause such effects as skew and wobble. Skews occur when the image bends diagonally in one direction or another as the camera or subject moves from one side to another, exposing different parts of the image at different times. Wobble is most common in hand-held shots at telephoto settings and most extreme in cases when the camera is vibrating due to being attached to a moving vehicle. The rolling shutter causes the image to wobble unnaturally and bizarrely. This is often called the jello effect.
Prior approaches for stabilizing images captured with a rolling shutter may include post-processing techniques. These techniques typically compensate for translational motion, but fail to compensate for rotational motion.
SUMMARY
Several methods, devices and systems for stabilizing images and correcting rolling shutter effects are described. In one embodiment, an image-capturing device includes a camera and a motion-estimating device. The image-capturing device utilizes camera calibration data in one embodiment to map image coordinates of an image plane of the image sensor into normalized image coordinates of a coordinate space. The motion-estimating device can determine motion data (e.g., three dimensional rotation data) for the device. The device matches motion data to a sequence of frames captured by the device to determine motion data for each frame. The device estimates an estimated motion path of the device based on the motion data. The device constructs a target motion path for the image-capturing device based on the estimated motion path. The device computes a desired motion correction based on the estimated motion path and the target motion path. Then, the device utilizes camera calibration data to resample each frame to generate a corrected sequence of stabilized frames according to the desired motion correction.
For example, a user may capture a sequence of images with the device. The motion path is constructed based on motion data that indicates sudden movement or subtle movement (e.g., camera shake from a user, vibration, rotation of camera, etc.). The stabilized frames compensate for the unintended motion of the device during image capture.
In another embodiment, an image capturing system includes a rolling shutter image sensor that may cause a rolling shutter artifact (e.g., warping). The system includes a motion-estimating device to detect motion data and a processing system that is configured to perform an automatic rolling shutter correction mechanism. The correction mechanism utilizes calibration data based on a relationship between pixel locations in an image plane of the image sensor and their corresponding rays of light in a coordinate space (e.g., three dimensional space), determines pixel velocity components based on the calibration data, and estimates for each image an aggregate pixel velocity based on an aggregation of the pixel velocity components and corresponding rotational velocity values, which are determined from the motion data. The correction mechanism resamples each image to generate a new corrected image that is based on the aggregate pixel velocity.
Other embodiments are also described. Other features of the present invention will be apparent from the accompanying drawings and from the detailed description which follows.
BRIEF DESCRIPTION OF THE DRAWINGS
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawings will be provided by the Office upon request and payment of the necessary fee.
The embodiments of the invention are illustrated by way of example and not by way of limitation in the figures of the accompanying drawings in which like references indicate similar elements. It should be noted that references to “an” or “one” embodiment of the invention in this disclosure are not necessarily to the same embodiment, and they mean at least one.
FIG. 1 illustrates a flow diagram in one embodiment of the present invention for a computer-implemented method 100 of stabilizing images (e.g., sequence of images, video) captured with an image-capturing device.
FIG. 2 illustrates the construction of motion paths of an image-capturing device in one embodiment of the present invention.
FIG. 3 illustrates frame resampling to be applied to an exploded view of a subset of a frame in one embodiment of the present invention.
FIG. 4 illustrates an example of an image 400 in one embodiment of the present invention.
FIG. 5 illustrates a flow diagram in one embodiment of the present invention for a computer-implemented method 500 of a rolling shutter correction of images (e.g., sequence of images, video) captured with an image-capturing device.
FIGS. 6-8 illustrate pre-computed velocity components (e.g., Vx, Vy, and Vz) in one embodiment of the present invention.
FIG. 9 shows in one embodiment of the present invention a wireless image-capturing device which includes the capability for wireless communication and for capturing images.
FIG. 10 is a block diagram of one embodiment of the present invention of a system 1000.
FIG. 11 illustrates aggregating a weighted sum of each pre-computed pixel velocity with weights corresponding to the rotational velocity value for each dimension in one embodiment of the present invention.
FIG. 12 illustrates the generation of a displacement map based on the aggregate pixel velocity vector V 1220 and (tm−t0) 1210 in one embodiment of the present invention.
FIGS. 13A-13D illustrate a sequence of images for showing a rolling shutter correction in one embodiment of the present invention.
FIGS. 14 and 15 illustrate the instantaneous rotation of the image-capturing device in one embodiment of the present invention.
DETAILED DESCRIPTION
Several methods, devices and systems for stabilizing images and correcting rolling shutter artifacts are described. In one embodiment, an image-capturing device includes an image sensor (e.g., camera) and a motion-estimating device. The motion-estimating device can, in one embodiment, determine motion data for the device. The device matches motion data to a sequence of frames captured by the device to determine motion data for each frame. The device constructs a target motion path for the image-capturing device based on the motion data for each frame. The device computes a desired motion correction from an estimated motion path observed in the motion data to the target motion path. Then, the device resamples each frame to generate stabilized frames based on the desired motion correction.
In another embodiment, an image capturing system includes a rolling shutter image sensor that may cause a rolling shutter artifact (e.g., warping). The system includes a motion-estimating device to detect motion data and a processing system that is configured to perform an automatic rolling shutter correction mechanism.
FIG. 1 illustrates a flow diagram in one embodiment of the present invention for a computer-implemented method 100 of stabilizing images (e.g., sequence of images, video) captured with an image-capturing device. The computer-implemented method 100 is performed by processing logic that may comprise hardware (circuitry, dedicated logic, etc.), software (such as is run on a general purpose computer system or a dedicated machine or a system), or a combination of both. The processing logic sends information to and receives information from an image sensing unit having a microprocessor and image sensors. The image sensing unit may send frames of metadata (e.g., focal-number, exposure time, white balance) to the processing logic. Pixel values are read from the image sensors to generate image data. Frames are sent at a certain time interval (e.g., 1/15 of a second) to the processing logic. The frames are stabilized by correcting for a rotational path of the image-capturing device as discussed below.
At block 102, processing logic (e.g., one or more processing units) generates calibration data of a camera of the image-capturing device. The calibration data is utilized to map image coordinates (e.g., two dimensional pixels) of an image plane of an image sensor of the camera into normalized image coordinates (e.g., light rays) of a three dimensional coordinate space at block 103. The field of view and radial distortion of the camera are determined for the generation of the calibration data. In one embodiment, the field of view is 60.8×47.5 degrees and the radial distortion is approximately κ1=0.1 and κ4=−0.007. The calibration may be an offline one time process. At block 104, the processing logic (e.g., one or more processing units) captures a sequence of images (e.g., frames, video). At block 106, the processing logic determines motion data (e.g., three dimensional rotation data) for the device using the motion-estimating device. In one embodiment, the motion-estimating device can be a gyroscope or an accelerometer or a combination of both. The gyroscope may provide three dimensional rotation data and the accelerometer may provide three dimensional translation data (six degrees of freedom). Determining the three dimensional motion data for each frame may include determining rotational velocity vectors in three dimensions for each frame.
At block 108, the processing logic matches motion data to the sequence of frames captured by the image-capturing device to determine three dimensional motion data for each frame. Matching motion data to the sequences of frames may include translating time stamps of the motion-estimating device into video time of the frames and also integrating rotational velocity data (e.g., rotational velocity vectors) received from the motion-estimating device to estimate inter-frame rotation (e.g., ΔΘ[κ]). A live bias estimate between time stamps of the motion-estimating device and video time of the frames may be available using a long-term average (e.g., 5 minutes).
At block 110, the processing logic estimates an estimated motion path (e.g., observed motion path) of the image-capturing device based on the three dimensional motion data for each frame. The estimated motion path is constructed for motion data that indicates sudden movement or subtle movement (e.g., camera shake from a user, vibration). The estimated motion path may be rough or jagged depending on the movement of the camera. At block 111, the processing logic constructs a target motion path of the image-capturing device based upon the estimated motion path. The target motion path can be a smoothed (e.g., filtered) version of the estimated motion path. At block 112, the processing logic computes a desired motion correction from the estimated motion path to the target motion path. At block 113, the processing logic utilizes camera calibration data to resample each frame to generate a correct sequence of stabilized frames according to the desired motion correction. At block 114, the processing logic optionally performs an adaptive crop and fill of an unknown region (e.g., dark region) of the stabilized frames if necessary. The operations of the method 100 provide pre-processing that may be part of a compression algorithm of the frames or decoupled from the compression algorithm. The compressed video frames may require less memory space or provide higher image quality at a lower bit rate based on the operations of the method 100.
In certain embodiments, the motion-estimating device can be a gyroscope, an accelerometer, or any combination thereof in single or multi physical packages.
Additional details of the image stabilization will be explained below.
FIG. 2 illustrates the construction of motion paths of an image-capturing device in one embodiment of the present invention. The processing logic constructs a rough motion path 220 of the image-capturing device based on the three dimensional motion data for each frame. The processing logic can apply a filter in each dimension, such as a low pass or predictive filter, (possibly inducing a short delay to construct a smooth motion path 220 from the rough motion path 200. Smooth motion path 220 represents a desirable target path of the image-capturing device during a time of image capture.
FIG. 3 illustrates frame resampling to be applied to an exploded view of a subset of a frame in one embodiment of the present invention. The processing logic can apply frame resampling to the exploded view 300. The frame resampling uses the smooth motion path 220 to correct pixels in the subset 330 of the frame 320. The processing logic may artificially rotate an observer's viewing direction (e.g., user's viewing direction) based on the difference between motion path 220 and motion path 200. The frame resampling uses interpolation (e.g., bilinear interpolation) to construct new frames.
Frame 4 illustrates in an example of an image 400 in one embodiment of the present invention. A central region 402 includes an object 410 to be captured. Peripheral regions 403 and 404 may be dark regions of the image. The regions 403 and 404 can be cropped or eliminated from the image 404. Pixel values may be missing for these regions. These values can be filled in based on adjacent frames or in painting techniques. Alternatively, the cropped region is constrained within the original frame.
The method 100 provides image stabilization to correct for rotational motion and vibration of an image-capturing device. Translational vibration accounts for approximately 10% of vibration and requires depth knowledge to correct. Rotational vibration generally accounts for the large majority of vibration-induced distortions and does not require depth knowledge to correct. Short term rotational offsets may be accurate to within approximately 1 milliradian. This method 100 can also be used for intermediate frame generation and rolling shutter correction. A roller shutter artifact may occur because scan lines are read one at a time from an image sensor (e.g., CMOS image sensor) and the camera itself moves during the image capture time period.
FIG. 5 illustrates a flow diagram in one embodiment of the present invention for a computer-implemented method 500 of a rolling shutter correction of images (e.g., sequence of images, video) captured with an image-capturing device. The computer-implemented method 500 is performed by processing logic that may comprise hardware (circuitry, dedicated logic, etc.), software (such as is run on a general purpose computer system or a dedicated machine or a system), or a combination of both. The method 500 determines a value of a pixel a certain time period ago or in the future (e.g., 1 millisecond, 2 millisecond) to correct for rolling shutter effects.
At block 501, processing logic (e.g., one or more processing units) calibrates a camera of the image-capturing device in order to generate calibration data. At block 502, the calibration data is utilized in order to transform image coordinates (e.g., two dimensional pixels) of an image plane of an image sensor of the camera into a three dimensional direction vector. A calibration model is a parametric way of describing the connection between pixel locations in the image plane and their corresponding rays of light in the three dimensional space from the perspective of a camera observer. A three dimensional rotation can be applied to the direction vector. An application of the three dimensional rotation and the direction vectors results in determining where a pixel would move under a hypothetical camera rotation in three dimensional space. The calibration may be an offline one time process.
The calibration parameters may include numerous parameters as described herein. In one embodiment, the variable parameters include focal length f or equivalently field of view and two more parameters to describe radial distortion κ. A video frame may include a center point c in the middle of the video frame (e.g., c=[512, 384] for a 1024×768 video frame). The skew may be zero. The focal length fx can be modeled as approximately 1.1823*(2cx) and fy can be modeled as approximately 1.1966*(2cx). Alternatively, fx=fy=1.19*(2cx). All radial distortion terms κ can be set to zero, except κ1=0.1 and κ4=−0.007. Thus, a function F is obtained that converts normalized pixel coordinates (e.g., x vector) to actual pixel coordinates (e.g., m vector) as indicated by the following equation.
mvector=F(xvector)
The inverse of F normalizes actual pixel coordinates to the image plane (e.g., x vector=F (m vector)).
At block 504, the processing logic calibrates and synchronizes a motion-estimating device with the camera of the image-capturing device. This synchronization may include translating time stamps of the motion-estimating device into video time of the captured images. A live bias estimate between time stamps of the motion-estimating device and video time of the frames may be available using a long-term average (e.g., 5 minutes).
At block 506, the processing logic determines pre-computed predicted pixel velocity components (e.g., Vx, Vy, and Vz) from the calibration data. The pixel velocity components may be determined in two dimensions from the calibration data. At block 508, the processing logic (e.g., one or more processing units) captures a sequence of images (e.g., video, frames). At block 510, the processing logic determines motion data of the camera during capture of the images. The motion data may include rotational velocity vector w (e.g., wx, wy, and wz). At block 512, the processing logic estimates for each image a corresponding aggregate pixel velocity vector V based on an aggregation of the pixel velocity components. This aggregation may include a weighted sum of the components wxVx, wyVy, and wzVz. In one embodiment, the rotational velocity weights wx, wy, and wz are scalars and the pixel velocity components (e.g., Vx, Vy, and Vz) are functions. At block 514, the processing logic resamples each image (e.g., using bilinear interpolation) to generate a new image (e.g., frame) Ĩ by assigning for each pixel a new value as indicated by the following equation.
new imageĨ(vectorm)=I(vectorm−(t m −t 0)*V(vectorm))
The time when row m was captured is represented by tm and an initial arbitrary time (e.g., first row, middle row, etc.) for image capture is represented by to. The new image Ĩ may include new pixel locations based on the movement of the camera. A new pixel location may not be located on a grid location of the new image and may be interpolated based on a certain number of nearest neighbor pixels. An extrapolation may be necessary for calculating a new pixel's color in the case of missing pixels.
FIGS. 6-8 illustrate pre-computed velocity components (e.g., Vx, Vy, and Vz) in one embodiment of the present invention. The pixels in FIG. 6 move upwards along the y axis (e.g., velocity component 601) due to an estimated or predicted rotational movement of the camera. The pixels in FIG. 7 move to the right along the x axis (e.g., velocity component 701) due to an estimated rotational movement of the camera. The pixels in FIG. 8 move in a clockwise direction (e.g., velocity component 801) due to an estimated rotational movement of the camera. The path of each pixel can be determined based on these velocity components. A pixel velocity field V may be computed from a rotational velocity w and the component velocity maps Vx, Vy, and Vz. The composition of transformations is described as follows.
    • Vector m→vector x→vector x′→vector m′
The actual pixel coordinates, represented by vector m, of an image plane of the image sensor are mapped or translated into a three dimensional direction vector x. An estimated three dimensional rotation can be applied to the direction vector x to generate vector x′. For example, the estimated or predicted rotation of the camera may correspond to a human model for hand vibration while the human holds the camera. This model may estimate or predict camera rotation for when the user turns his hand slightly in one or more directions during a time period for capturing a row or rows of pixels. In one embodiment, this model is designed for predicted vibrations having a frequency less than or equal to 100 hertz. Vibrations for frequencies greater than 100 hertz are not likely caused by human rotational movement and these vibrations are more difficult to correct. Vector x′ is translated into vector m′ with function F. The velocity components (e.g., Vx, Vy, and Vz) are approximately equal to a difference between m and m′ under incremental rotations about the x, y, and z axis respectively.
FIG. 11 illustrates aggregating a weighted sum of each pre-computed pixel velocity with weights corresponding to the rotational velocity value for each dimension in one embodiment of the present invention. A weighted sum of the components wxVx 1110, wyVy 1120, and wzVz 1130 generates the aggregate pixel velocity vector V 1140. In one embodiment, wx=0.2 radians, wy=−0.2 radians, and wz=0.8 radians.
FIG. 12 illustrates the generation of a displacement map based on the aggregate pixel velocity vector V 1220 and (tm−t0) 1210 in one embodiment of the present invention. As discussed above, the time when row m of an image sensor was captured is represented by tm and an arbitrary time for image capture is represented by to. Each row of an image sensor is read out at a different time and this is represented by horizontal bars in FIG. 12. For an upper region of an image sensor, (tm−t0) 1210 is a negative value. For a lower region of the image sensor, (tm−t0) 1210 is a positive value. The displacement map for a given pixel of an image sensor is generated by multiplying the aggregate pixel velocity vector V 1220 and (tm−t0) 1210. The displacement map indicates a displacement for a given pixel of the image sensor based on rotation or movement of the camera during image capture.
FIGS. 13A-13D illustrate a sequence of images for showing a rolling shutter correction in one embodiment of the present invention. FIG. 13A illustrates a reference image 1300, which represents an original scene with stop sign 1302 that was captured with an image-capturing device (e.g., an iPhone 4) with no rolling shutter correction. The reference image 1300 may include radial distortion. FIG. 13B illustrates a simulated image 1310 with stop sign 1312. The simulated image 1310 is a simulated rendering of the image 1300 using a rolling shutter exposure under the effects of an instantaneous rotation of the image-capturing device.
FIGS. 14 and 15 illustrate the instantaneous rotation of the image-capturing device in one embodiment of the present invention. FIG. 14 illustrates the rotation in degrees on a vertical axis versus tm−t0 in milliseconds on a horizontal axis. For example, for a 2 millisecond exposure time, the image-capturing device rotates from approximately −25 degrees to approximately 25 degrees. FIG. 15 illustrates the rotation in degrees on a vertical axis versus row number of an image sensor of the image-capturing device on a horizontal axis.
FIG. 13C illustrates a resampled image 1320 with stop sign 1322. The image 1320 simulates the unwarping of the image 1310 based on available information, which may be imperfect. In this illustration, the correction is performed using imperfect rotation information with a signal to noise ratio that may be 10:1 (i.e., approximately 4 degrees of error for a rotation of 40 degrees over the exposure interval). In practice, the error may be caused by noise from the motion-estimating device (e.g., gyroscope, accelerometer) measurements as well as modeling errors that result from sources of motion (e.g., motion in the scene, translation, etc.). These sources of motion are difficult to accurately model.
FIG. 13D illustrates a resampled image 1330 with stop sign 1332. The image 1330 simulates the unwarping of the image 1310 based on perfect available information. The image 1330 shows what the recovered or resampled image looks like given perfect information about the instantaneous rotation and no other sources of motion (e.g., translation of the observer or movement in the environment). Radial distortion has been removed from this simulated image 1330.
Many of the methods in embodiments of the present invention may be performed with an image-capturing device such as a digital processing system (e.g., conventional, general-purpose computer system). Special purpose computers, which are designed or programmed to perform only one function, may also be used.
In some embodiments, the methods, systems, and apparatuses of the present disclosure can be implemented in various devices including electronic devices, consumer devices, data processing systems, desktop computers, portable computers, wireless devices, cellular devices, tablet devices, handheld devices, multi touch devices, multi touch data processing systems, any combination of these devices, or other like devices. FIGS. 9 and 10 illustrate examples of a few of these devices, which are capable of capturing still images and video to implement the methods of the present disclosure. The methods (e.g., 100, 500) enhance a user experience for capturing images, capturing video, video calls, etc. based on the image stabilization and rolling shutter correction.
FIG. 9 shows in one embodiment of the present invention a wireless image-capturing device which includes the capability for wireless communication and for capturing images. Wireless device 900 may include an antenna system 901. Wireless device 900 may also include a digital and/or analog radio frequency (RF) transceiver 902, coupled to the antenna system 901, to transmit and/or receive voice, digital data and/or media signals through antenna system 901.
Wireless device 900 may also include a digital processing system 903 to control the digital RF transceiver and to manage the voice, digital data and/or media signals. Digital processing system 903 may be a general purpose processing system, such as a microprocessor or controller for example. Digital processing system 903 may also be a special purpose processing system, such as an ASIC (application specific integrated circuit), FPGA (field-programmable gate array) or DSP (digital signal processor). Digital processing system 903 may also include other devices, as are known in the art, to interface with other components of wireless device 900. For example, digital processing system 903 may include analog-to-digital and digital-to-analog converters to interface with other components of wireless device 900. Digital processing system 903 may include a media processing system 909, which may also include a general purpose or special purpose processing system to manage media, such as files of audio data.
Wireless device 900 may also include a storage device 904, coupled to the digital processing system, to store data and/or operating programs for the Wireless device 900. Storage device 904 may be, for example, any type of solid-state or magnetic memory device. Storage device 904 may be or include a machine-readable medium.
A machine-readable medium includes any mechanism for storing or transmitting information in a form readable by a machine (e.g., a computer). For example, machines store and communicate (internally and with other devices over a network) code and data using machine-readable media, such as machine storage media (e.g., magnetic disks; optical disks; random access memory; read only memory; flash memory devices; phase-change memory).
Wireless device 900 may also include one or more input devices 905, coupled to the digital processing system 903, to accept user inputs (e.g., telephone numbers, names, addresses, media selections, etc.) Input device 905 may be, for example, one or more of a keypad, a touchpad, a touch screen, a pointing device in combination with a display device or similar input device.
Wireless device 900 may also include at least one display device 906, coupled to the digital processing system 903, to display information such as messages, telephone call information, contact information, pictures, movies and/or titles or other indicators of media being selected via the input device 905. Display device 906 may be, for example, an LCD display device. In one embodiment, display device 906 and input device 905 may be integrated together in the same device (e.g., a touch screen LCD such as a multi-touch input panel which is integrated with a display device, such as an LCD display device). The display device 906 may include a backlight 906A to illuminate the display device 906 under certain circumstances. It will be appreciated that the wireless device 900 may include multiple displays.
Wireless device 900 may also include a battery 907 to supply operating power to components of the system including digital RF transceiver 902, digital processing system 903, storage device 904, input device 905, microphone 905A, audio transducer 908, media processing system 909, sensor(s) 910, and display device 906, an image sensor 959 (e.g., CCD (Charge Coupled Device), CMOS sensor). The image sensor may be integrated with an image processing unit 960. The display device 906 may include a Liquid Crystal Display (LCD) which may be used to display images which are captured or recorded by the wireless image-capturing device 900. The LCD serves as a viewfinder of a camera (e.g., combination of lens 963, image sensor 959, and unit 960) and there may optionally be other types of image display devices on device 900 which can serve as a viewfinder.
The device 900 also includes an imaging lens 963 which can be optically coupled to image sensor 959. The processing system 903 controls the operation of the device 900; and, it may do so by executing a software program stored in ROM 957, or in the processing system 903, or in both ROM 957 and the processing system 903.
The processing system 903 controls the image processing operation; and, it controls the storage of a captured image in storage device 904. The processing system 903 also controls the exporting of image data (which may or may not be color corrected) to an external general purpose computer or special purpose computer.
The processing system 903 also responds to user commands (e.g., a command to “take” a picture or video by capturing image(s) on the image sensor and storing it in memory or a command to select an option for contrast enhancement and color balance adjustment).
The ROM 957 may store software instructions for execution by the processing system 903 to perform the automatic image stabilization and rolling shutter correction mechanisms discussed in the present disclosure. The storage device 904 is used to store captured/recorded images which are received from the image sensor 959. It will be appreciated that other alternative architectures of a camera can be used with the various embodiments of the invention.
Battery 907 may be, for example, a rechargeable or non-rechargeable lithium or nickel metal hydride battery. Wireless device 900 may also include audio transducers 908, which may include one or more speakers, and at least one microphone 905A, and an accelerometer 946. The device 900 also includes a motion or orientation detector 940 (e.g., accelerometer, gyroscope, or any combination thereof) for determining motion data or an orientation of the device 900.
In one embodiment, the image-capturing device 900 is designed to stabilize images and video. The image-capturing device 900 includes the image sensor 959 with associated calibration data to sense images, the storage device 904 to store captured images, the motion-estimating device 940 to detect motion data for the device, and the processing system 903 which is coupled to the storage device and the motion-estimating device. The processing system is configured to perform an automatic image stabilization mechanism by determining motion data for the image-capturing device using the motion-estimating device, matching motion data to a sequence of frames captured by the image-capturing device to determine three dimensional motion data for each frame, and estimating an estimated motion path (e.g., rough motion path) of the image-capturing device based on the three dimensional motion data for each frame. Determining the three dimensional motion data for each frame includes determining rotational velocity vectors in three dimensions for each frame.
The processing system is further configured to construct a target motion path (e.g., smooth motion path) of the system based on the estimated motion path of the system. Constructing the target motion path may include filtering the estimated motion path. The processing system is further configured to compute a desired motion correction from the estimated motion path to the target motion path.
The processing system is further configured to correct image coordinates of the image plane based on the calibration data and the desired motion correction, which may be determined based on a difference between the target and estimated motion paths of the system, apply resampling for each frame from the original image coordinates to the corrected image coordinates to generate stabilized frames and adaptively crop and fill an unknown region if necessary of the stabilized frames. Correcting image coordinates of the image plane based on the calibration data and the difference between the estimated and target motion paths includes utilizing the calibration data to map the image coordinates, which represent two dimensional pixels, into the normalized image coordinates of the coordinate space, which represent light rays, applying a desired rotation to all light rays, according to the difference between the estimated and target motion paths, and utilizing the calibration data to map these rotated light rays back to the image plane.
Matching motion data to a sequence of frames captured by the image-capturing device to determine three dimensional motion data for each frame may include translating time stamps of the motion-estimating device into video time of the frames and integrating rotational velocity data received from the motion-estimating device to estimate inter-frame rotation.
In another embodiment, an image capturing system (e.g., image-capturing device 900) is designed to correct for rolling shutter effects (e.g., warping) and compensate for vibrations and rotational movements of the image capturing system. The image capturing system includes an image sensor 959 to sense images, a storage device 904 that is coupled to the image sensor. The storage device stores captured images. The motion-estimating device 940 (e.g., gyroscope, accelerometer) detects motion data. The processing system 903 is coupled to the storage device and the motion-estimating device. The processing system is configured to perform an automatic image sensor correction mechanism to utilize calibration data based on a relationship between pixel locations in an image plane of the image sensor and their corresponding rays of light in a three dimensional space, to determine pixel velocity components based on the calibration data, and to estimate for each image an aggregate pixel velocity based on an aggregation of the pixel velocity components.
The pixel velocity components (e.g., Vx, Vy, and Vz) include pre-computed predicted velocity components that are computed prior to sensing the images. Determining the pixel velocity components may include constructing pixel velocity maps from the calibration data. Estimating for each image the aggregate pixel velocity based on the aggregation of the pixel velocity components includes aggregating a weighted sum of each pre-computed pixel velocity corresponding to the rotational velocity value (e.g., wx, wy, and wz) for each dimension.
The processing system is further configured to resample each image to generate a new image to perform the rolling shutter correction mechanism. Resampling each image to generate a new image is based on a current image and the aggregate pixel velocity. The new images have compensated for rolling shutter effects, vibrations, and rotational movement of the image capturing system.
FIG. 10 is a block diagram of one embodiment of the present invention of a system 1000 that generally includes one or more computer-readable mediums 1001, processing system 1004, Input/Output (I/O) subsystem 1006, radio frequency (RF) circuitry 1008, audio circuitry 1010, and an image sensor 1059 (e.g., CCD (Charge Coupled Device), CMOS sensor). The image sensor may be integrated with an image processing unit 1060. The image sensor 1059 is optically coupled to receive light from a lens 1063, which can be used for capturing images with the image sensor. A motion-estimating device 1040 determines motion data in three dimensions for the system 1000. These components may be coupled by one or more communication buses or signal lines 1003.
It should be apparent that the architecture shown in FIG. 10 is only one example architecture of system 1000, and that system 1000 could have more or fewer components than shown, or a different configuration of components. The various components shown in FIG. 10 can be implemented in hardware, software, firmware or any combination thereof, including one or more signal processing and/or application specific integrated circuits.
RF circuitry 1008 is used to send and receive information over a wireless link or network to one or more other devices and includes well-known circuitry for performing this function. RF circuitry 1008 and audio circuitry 1010 are coupled to processing system 1004 via peripherals interface 1016. Interface 1016 includes various known components for establishing and maintaining communication between peripherals and processing system 1004. Audio circuitry 1010 is coupled to audio speaker 1050 and microphone 1052 and includes known circuitry for processing voice signals received from interface 1016 to enable a user to communicate in real-time with other users. In some embodiments, audio circuitry 1010 includes a headphone jack (not shown).
Peripherals interface 1016 couples the input and output peripherals of the system to one or more processing units 1018 and computer-readable medium 1001. One or more processing units 1018 communicate with one or more computer-readable mediums 1001 via controller 1520. Computer-readable medium 1001 can be any device or medium (e.g., storage device, storage medium) that can store code and/or data for use by one or more processing units 1018. Medium 1001 can include a memory hierarchy, including but not limited to cache, main memory and secondary memory. The memory hierarchy can be implemented using any combination of RAM (e.g., SRAM, DRAM, DDRAM), ROM, FLASH, magnetic and/or optical storage devices, such as disk drives, magnetic tape, CDs (compact disks) and DVDs (digital video discs). Medium 1001 may also include a transmission medium for carrying information-bearing signals indicative of computer instructions or data (with or without a carrier wave upon which the signals are modulated). For example, the transmission medium may include a communications network, including but not limited to the Internet (also referred to as the World Wide Web), intranet(s), Local Area Networks (LANs), Wide Local Area Networks (WLANs), Storage Area Networks (SANs), Metropolitan Area Networks (MAN) and the like.
One or more processing units 1018 run various software components stored in medium 1001 to perform various functions for system 1000. In some embodiments, the software components include operating system 1022, communication module (or set of instructions) 1024, touch processing module (or set of instructions) 1026, graphics module (or set of instructions) 1028, one or more applications (or set of instructions) 1030, and modules [or set of instructions] 1038 and 1039. The image stabilization module 1038 and rolling shutter correction module 1039 each correspond to a set of instructions for performing one or more functions described above and the methods described in this application (e.g., the computer-implemented methods and other information processing methods described herein). These modules (i.e., sets of instructions) need not be implemented as separate software programs, procedures or modules, and thus various subsets of these modules may be combined or otherwise rearranged in various embodiments.
In some embodiments, medium 1001 may store a subset of the modules and data structures identified above. Furthermore, medium 1001 may store additional modules and data structures not described above.
Operating system 1022 includes various procedures, sets of instructions, software components and/or drivers for controlling and managing general system tasks (e.g., memory management, storage device control, power management, etc.) and facilitates communication between various hardware and software components.
Communication module 1024 facilitates communication with other devices over one or more external ports 1036 or via RF circuitry 1008 and includes various software components for handling data received from RF circuitry 1008 and/or external port 1036.
Graphics module 1028 includes various known software components for rendering, animating and displaying graphical objects on a display surface. In embodiments in which touch I/O device 1012 is a touch sensitive display (e.g., touch screen), graphics module 1028 includes components for rendering, displaying, and animating objects on the touch sensitive display.
One or more applications 1030 can include any applications installed on system 1000, including without limitation, a browser, address book, contact list, email, instant messaging, word processing, keyboard emulation, widgets, JAVA-enabled applications, encryption, digital rights management, voice recognition, voice replication, location determination capability (such as that provided by the global positioning system (GPS), a music player, etc.
Touch processing module 1026 includes various software components for performing various tasks associated with touch I/O device 1012 including but not limited to receiving and processing touch input received from I/O device 1012 via touch I/O device controller 1032.
System 1000 may image stabilization module 1038, rolling shutter correction module 1039, and image capturing unit 1060 for performing the method/functions as described herein in connection with FIGS. 1-10. The image capturing unit 1060 is integrated with the system 1000 and may be coupled to the peripheral interface 1016 as illustrated in FIG. 10 or integrated with one of the I/O devices 1012 or 1014.
In one embodiment, the computer readable medium 1001 contains executable computer program instructions (e.g., module 1038) which when executed by the data processing system 1000 cause said system to perform a method. The method utilizes calibration data of a camera of the system to map image coordinates of an image plane of an image sensor of the camera into normalized image coordinates of a three dimensional coordinate space. The method determines motion data for the system using the motion-estimating device 1040 (e.g., gyroscope, accelerometer). Utilizing the calibration data may include mapping the image coordinates, which represent two dimensional pixels, into the normalized image coordinates of the three dimensional coordinate space, which represent light rays. Determining the three dimensional motion data for each frame may include determining rotational velocity vectors in three dimensions for each frame.
The method also matches motion data to a sequence of frames captured by the system to determine three dimensional motion data for each frame, estimates estimates a motion path (e.g., rough motion path) of the system based on the three dimensional motion data for each frame, constructs a target motion path (e.g., smooth motion path) of the system based on the motion data for each frame, and computes a desired motion correction from the estimated motion path observed in the motion data to the target motion path. The method corrects image coordinates of the image plane based on the desired motion correction and generates stabilized frames based on the corrected image coordinates. Matching motion data to a sequence of frames may include translating time stamps of the motion-estimating device into video time of the frames. Constructing the target motion path may include applying a low-pass or predictive filter in each dimension to the three dimensional motion data used to construct the estimated motion path (possibly inducing a delay), and estimating a necessary correction based on the estimated motion path.
The method also artificially rotates a user's viewing direction based on the desired motion correction, applies bilinear interpolation to generate stabilized frames based on the corrected image coordinates, and adaptively crops and fills an unknown region of the stabilized frames if an unknown region (e.g., dark pixels) exists.
In another embodiment, a computer readable medium contains executable computer program instructions (e.g., module 1039) which when executed by the data processing system 1000 cause said system to perform a method. The method utilizes calibration data for the system, which has an image sensor 1061 and a motion-estimating device 1040 (e.g., gyroscope, accelerometer), by transforming two-dimensional pixel locations in an image plane of the image sensor into a three dimensional direction vector. The method determines pixel velocity components based on the calibration data, captures a sequence of images with the system, determines motion data with the motion-estimating device during image capture, and estimates for each image an aggregate pixel velocity based on an aggregation of the pixel velocity components and corresponding motion data in three dimensions. Estimating may include aggregating a weighted sum of each pre-computed pixel velocity with weights corresponding to the rotational velocity value for each dimension.
The pixel velocity components may include pre-computed velocity components that are computed prior to capturing the sequence of images. Determining the pixel velocity components may include constructing pixel velocity maps from the calibration data.
The method also includes resampling each image to generate a new image.
Resampling each image may occur with a binary interpolation to generate a new image that is based on a current image and the aggregate pixel velocity.
Modules 1038 and 1039 may be embodied as hardware, software, firmware, or any combination thereof. Although modules 1038 and 1039 are shown to reside within medium 1001, all or portions of modules 1038 and 1039 may be embodied within other components within system 1000 or may be wholly embodied as a separate component within system 1000.
I/O subsystem 1006 is coupled to touch I/O device 1012 and one or more other I/O devices 1014 for controlling or performing various functions. Touch I/O device 1012 communicates with processing system 1004 via touch I/O device controller 1032, which includes various components for processing user touch input (e.g., scanning hardware). One or more other input controllers 1034 receives/sends electrical signals from/to other I/O devices 1014. Other I/O devices 1014 may include physical buttons, dials, slider switches, sticks, keyboards, touch pads, additional display screens, or any combination thereof.
If embodied as a touch screen, touch I/O device 1012 displays visual output to the user in a GUI. The visual output may include text, graphics, video, and any combination thereof. Some or all of the visual output may correspond to user-interface objects. Touch I/O device 1012 forms a touch-sensitive surface that accepts touch input from the user. Touch I/O device 1012 and touch screen controller 1032 (along with any associated modules and/or sets of instructions in medium 1001) detects and tracks touches or near touches (and any movement or release of the touch) on touch I/O device 1012 and converts the detected touch input into interaction with graphical objects, such as one or more user-interface objects. In the case in which device 1012 is embodied as a touch screen, the user can directly interact with graphical objects that are displayed on the touch screen. Alternatively, in the case in which device 1012 is embodied as a touch device other than a touch screen (e.g., a touch pad), the user may indirectly interact with graphical objects that are displayed on a separate display screen embodied as I/O device 1014.
Embodiments in which touch I/O device 1012 is a touch screen, the touch screen may use LCD (liquid crystal display) technology, LPD (light emitting polymer display) technology, OLED (organic LED), or OEL (organic electro luminescence), although other display technologies may be used in other embodiments.
Feedback may be provided by touch I/O device 1012 based on the user's touch input as well as a state or states of what is being displayed and/or of the computing system. Feedback may be transmitted optically (e.g., light signal or displayed image), mechanically (e.g., haptic feedback, touch feedback, force feedback, or the like), electrically (e.g., electrical stimulation), olfactory, acoustically (e.g., beep or the like), or the like or any combination thereof and in a variable or non-variable manner.
System 1000 also includes power system 1044 for powering the various hardware components and may include a power management system, one or more power sources, a recharging system, a power failure detection circuit, a power converter or inverter, a power status indicator and any other components typically associated with the generation, management and distribution of power in portable devices.
In some embodiments, peripherals interface 1016, one or more processing units 1018, and memory controller 1020 may be implemented on a single chip, such as processing system 1004. In some other embodiments, they may be implemented on separate chips. The present disclosure can relate to an apparatus for performing one or more of the operations described herein. This apparatus may be specially constructed for the required purposes, or it may comprise a general purpose computer selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a machine (e.g. computer) readable storage medium, such as, but is not limited to, any type of disk including floppy disks, optical disks, CD-ROMs, and magnetic-optical disks, read-only memories (ROMs), random access memories (RAMs), erasable programmable ROMs (EPROMs), electrically erasable programmable ROMs (EEPROMs), flash memory, magnetic or optical cards, or any type of media suitable for storing electronic instructions, and each coupled to a bus.
A machine-readable medium includes any mechanism for storing or transmitting information in a form readable by a machine (e.g., a computer). For example, machines store and communicate (internally and with other devices over a network) code and data using machine-readable media, such as machine storage media (e.g., magnetic disks; optical disks; random access memory; read only memory; flash memory devices; phase-change memory).
In the foregoing specification, the disclosure has been described with reference to specific exemplary embodiments thereof. It will be evident that various modifications may be made thereto without departing from the broader spirit and scope of the disclosure as set forth in the following claims. The specification and drawings are, accordingly, to be regarded in an illustrative sense rather than a restrictive sense.

Claims (22)

What is claimed is:
1. A computer implemented method for image stabilization for an image-capturing device with associated calibration data, the method comprising:
utilizing camera calibration data of the image-capturing device to map image coordinates of an image plane of an image sensor of the image-capturing device into normalized image coordinates of a three dimensional (3D) coordinate space;
determining motion data including 3D rotation data for the image-capturing device using a motion-estimating device after utilizing camera calibration data of the image-capturing device to map image coordinates of an image plane of an image sensor of the image-capturing device into normalized image coordinates of a 3D coordinate space;
matching motion data to a sequence of frames captured by the image-capturing device to determine motion data for each frame;
constructing a target motion path of the image-capturing device based on the motion data for each frame; and
computing a desired motion correction from a motion path observed in the motion data to the target motion path.
2. The method of claim 1, further comprising:
utilizing camera calibration data of the image-capturing device to resample each frame to generate a corrected sequence of stabilized frames according to the desired motion correction; and
cropping and filling unknown regions of the stabilized frames.
3. The method of claim 1, wherein the image stabilization to correct for rotational motion and vibration of the image-capturing device.
4. The method of claim 3, wherein the calibration data comprises radial distortion, field of view of the image, and center point of the capturing device, or any combination thereof.
5. The method of claim 1, wherein the motion-estimating device comprises a gyroscope, an accelerometer, or any combination thereof.
6. The method of claim 1, wherein determining the motion data for each frame comprises determining rotational velocity vectors in three dimensions for each frame.
7. The method of claim 1, wherein matching motion data to the sequence of frames captured by the image-capturing device to determine motion data for each frame comprises translating time stamps of the motion-estimating device into video time of the frames.
8. The method of claim 7, wherein matching motion data to the sequence of frames captured by the image-capturing device to determine motion data for each frame further comprises integrating rotational velocity data received from the motion-estimating device to estimate inter-frame rotation.
9. A computer readable non-transitory medium containing executable computer program instructions which when executed by a data processing system cause said system to perform a method, the method comprising:
utilizing calibration data of an image-capturing device to map image coordinates of an image plane of an image sensor of the system into normalized image coordinates of a three dimensional (3D) coordinate space;
determining motion data including 3D rotation data for the system using a motion-estimating device after utilizing calibration data of the image-capturing device to map image coordinates of an image plane of an image sensor of the system into normalized image coordinates of a three dimensional coordinate space;
matching motion data to a sequence of frames captured by the system to determine motion data for each frame; and
constructing a target motion path of the system based on the motion data for each frame; and
computing a desired motion correction from a motion path observed in the motion data to the target motion path.
10. The computer readable non-transitory medium of claim 9, further comprising:
utilizing calibration data of the system to correct image coordinates of the image plane based on the desired motion correction; and
generating stabilized frames based on the corrected image coordinates.
11. The computer readable non-transitory medium of claim 10, wherein utilizing calibration data comprises mapping the image coordinates, which represent two dimensional pixels, into the normalized image coordinates of the three dimensional coordinate space, which represent light rays.
12. The computer readable non-transitory medium of claim 10, further comprising:
adaptively cropping and filling any unknown region of the stabilized frames.
13. The computer readable non-transitory medium of claim 9, wherein constructing the target motion path of the system comprises applying a lowpass filter in each dimension to the estimated motion path.
14. The computer readable non-transitory medium of claim 9, wherein determining the motion data for each frame comprises determining rotational velocity vectors in three dimensions for each frame.
15. The computer readable non-transitory medium of claim 9, wherein matching motion data to a sequence of frames captured by the system to determine motion data for each frame comprises translating time stamps of the motion-estimating device into video time of the frames.
16. An image-capturing device, comprising:
an image sensor with associated calibration data to sense images;
a storage device coupled to the image sensor, the storage device to store captured images;
a motion-estimating device to detect motion data including three dimensional (3D) rotation data; and
a processing system coupled to the storage device and the motion-estimating device, the processing system is configured to perform an automatic image stabilization mechanism by utilizing calibration data to map image coordinates, which represent two dimensional pixels, into normalized image coordinates of a three dimensional coordinate space, which represent light rays, subsequently determining motion data for the image-capturing device using the motion-estimating device, matching motion data to a sequence of frames captured by the image-capturing device to determine motion data for each frame, and estimating a motion path of the image-capturing device based on the motion data for each frame.
17. The image-capturing device of claim 16, wherein the processing system is further configured to:
construct a target motion path of the system based on the estimated motion path of the system;
compute a desired motion correction from the estimated motion path to the target motion path;
correct image coordinates of the image plane based on the calibration data and the desired motion correction;
apply resampling for each frame from the original image coordinates to the corrected image coordinates to generate stabilized frames; and
crop and fill an unknown region of the image plane.
18. The image-capturing device of claim 17, wherein correcting image coordinates of the image plane based on the calibration data and the desired motion correction comprises:
utilizing the calibration data to map the image coordinates, which represent two dimensional pixels, into the normalized image coordinates of the coordinate space, which represent light rays;
applying a desired rotation to all light rays, according to the desired motion correction; and
utilizing the calibration data to map these rotated light rays back to the image plane.
19. The image-capturing device of claim 17, wherein constructing the target motion path comprises filtering the estimated motion path.
20. The image-capturing device of claim 16, wherein determining the motion data for each frame comprises determining rotational velocity vectors in three dimensions for each frame.
21. The image-capturing device of claim 16, wherein matching motion data to a sequence of frames captured by the image-capturing device to determine motion data for each frame comprises translating time stamps of the motion-estimating device into video time of the frames.
22. The image-capturing device of claim 21, wherein matching motion data to the sequence of frames captured by the image-capturing device to determine motion data for each frame further comprises integrating rotational velocity data received from the motion-estimating device to estimate inter-frame rotation.
US13/154,375 2011-06-06 2011-06-06 Methods and systems for image stabilization Active 2031-12-09 US8648919B2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
US13/154,375 US8648919B2 (en) 2011-06-06 2011-06-06 Methods and systems for image stabilization
PCT/US2012/038569 WO2012170179A1 (en) 2011-06-06 2012-05-18 Methods and systems for image stabilization
TW101119998A TWI464707B (en) 2011-06-06 2012-06-04 Methods and systems for image stabilization

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US13/154,375 US8648919B2 (en) 2011-06-06 2011-06-06 Methods and systems for image stabilization

Publications (2)

Publication Number Publication Date
US20120307085A1 US20120307085A1 (en) 2012-12-06
US8648919B2 true US8648919B2 (en) 2014-02-11

Family

ID=46177559

Family Applications (1)

Application Number Title Priority Date Filing Date
US13/154,375 Active 2031-12-09 US8648919B2 (en) 2011-06-06 2011-06-06 Methods and systems for image stabilization

Country Status (3)

Country Link
US (1) US8648919B2 (en)
TW (1) TWI464707B (en)
WO (1) WO2012170179A1 (en)

Cited By (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140085493A1 (en) * 2012-09-24 2014-03-27 Motorola Mobility Llc Preventing motion artifacts by intelligently disabling video stabilization
US20140320681A1 (en) * 2011-06-06 2014-10-30 Apple Inc. Correcting rolling shutter using image stabilization
US9998663B1 (en) 2015-01-07 2018-06-12 Car360 Inc. Surround image capture and processing
US20180302554A1 (en) * 2013-03-15 2018-10-18 Fotonation Cayman Limited Systems and Methods for Synthesizing High Resolution Images Using Image Deconvolution Based on Motion and Depth Information
US10284794B1 (en) 2015-01-07 2019-05-07 Car360 Inc. Three-dimensional stabilized 360-degree composite image capture
US10547772B2 (en) 2013-03-14 2020-01-28 Fotonation Limited Systems and methods for reducing motion blur in images or video in ultra low light with array cameras
US10694114B2 (en) 2008-05-20 2020-06-23 Fotonation Limited Capturing and processing of images including occlusions focused on an image sensor by a lens stack array
US10708492B2 (en) 2013-11-26 2020-07-07 Fotonation Limited Array camera configurations incorporating constituent array cameras and constituent cameras
US10767981B2 (en) 2013-11-18 2020-09-08 Fotonation Limited Systems and methods for estimating depth from projected texture using camera arrays
US10839485B2 (en) 2010-12-14 2020-11-17 Fotonation Limited Systems and methods for synthesizing high resolution images using images captured by an array of independently controllable imagers
US10909707B2 (en) 2012-08-21 2021-02-02 Fotonation Limited System and methods for measuring depth using an array of independently controllable cameras
US10958892B2 (en) 2013-03-10 2021-03-23 Fotonation Limited System and methods for calibration of an array camera
US10984276B2 (en) 2011-09-28 2021-04-20 Fotonation Limited Systems and methods for encoding image files containing depth maps stored as metadata
US11270110B2 (en) 2019-09-17 2022-03-08 Boston Polarimetrics, Inc. Systems and methods for surface modeling using polarization cues
US11290658B1 (en) 2021-04-15 2022-03-29 Boston Polarimetrics, Inc. Systems and methods for camera exposure control
US11302012B2 (en) 2019-11-30 2022-04-12 Boston Polarimetrics, Inc. Systems and methods for transparent object segmentation using polarization cues
US11525906B2 (en) 2019-10-07 2022-12-13 Intrinsic Innovation Llc Systems and methods for augmentation of sensor systems and imaging systems with polarization
US11546576B2 (en) 2014-09-29 2023-01-03 Adeia Imaging Llc Systems and methods for dynamic calibration of array cameras
US11580667B2 (en) 2020-01-29 2023-02-14 Intrinsic Innovation Llc Systems and methods for characterizing object pose detection and measurement systems
US20230077815A1 (en) * 2015-10-26 2023-03-16 Alpinereplay, Inc. System and method for enhanced video image recognition using motion sensors
US11689813B2 (en) 2021-07-01 2023-06-27 Intrinsic Innovation Llc Systems and methods for high dynamic range imaging using crossed polarizers
US11748844B2 (en) 2020-01-08 2023-09-05 Carvana, LLC Systems and methods for generating a virtual display of an item
US11792538B2 (en) 2008-05-20 2023-10-17 Adeia Imaging Llc Capturing and processing of images including occlusions focused on an image sensor by a lens stack array
US11797863B2 (en) 2020-01-30 2023-10-24 Intrinsic Innovation Llc Systems and methods for synthesizing data for training statistical models on different imaging modalities including polarized images
US11954886B2 (en) 2021-04-15 2024-04-09 Intrinsic Innovation Llc Systems and methods for six-degree of freedom pose estimation of deformable objects

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2940986B1 (en) * 2012-12-26 2018-02-07 Sony Corporation Image processing device and method, and program
KR102003460B1 (en) * 2013-08-27 2019-07-24 한화테크윈 주식회사 Device and Method for dewobbling
WO2015038160A1 (en) * 2013-09-16 2015-03-19 Intel Corporation Camera and light source synchronization for object tracking
US9554048B2 (en) * 2013-09-26 2017-01-24 Apple Inc. In-stream rolling shutter compensation
US9219993B2 (en) * 2013-10-20 2015-12-22 Oahu Group, Llc Method and system for determining object motion by capturing motion data via radio frequency phase and direction of arrival detection
US11080890B2 (en) * 2017-07-28 2021-08-03 Qualcomm Incorporated Image sensor initialization in a robotic vehicle
US10477064B2 (en) * 2017-08-21 2019-11-12 Gopro, Inc. Image stitching with electronic rolling shutter correction
US10545215B2 (en) * 2017-09-13 2020-01-28 Google Llc 4D camera tracking and optical stabilization
US10440271B2 (en) * 2018-03-01 2019-10-08 Sony Corporation On-chip compensation of rolling shutter effect in imaging sensor for vehicles
CN108462838B (en) * 2018-03-16 2020-10-02 影石创新科技股份有限公司 Panoramic video anti-shake method and device and portable terminal
TWI675237B (en) * 2018-06-14 2019-10-21 友達光電股份有限公司 Display device
CN109729263B (en) * 2018-12-07 2021-02-23 苏州中科广视文化科技有限公司 Video jitter removal method based on fusion motion model
CN112637496B (en) * 2020-12-21 2022-05-31 维沃移动通信有限公司 Image correction method and device
TWI810813B (en) * 2022-02-11 2023-08-01 圓展科技股份有限公司 Image correction system and method

Citations (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0358196A1 (en) 1988-09-09 1990-03-14 Canon Kabushiki Kaisha Automatic image stabilization device
US5430480A (en) 1992-06-30 1995-07-04 Ricoh California Research Center Sensor driven global motion compensation
US5627905A (en) 1994-12-12 1997-05-06 Lockheed Martin Tactical Defense Systems Optical flow detection system
US5717611A (en) 1993-10-08 1998-02-10 Nikon Corporation Slight motion detecting device
US20040252200A1 (en) * 1999-07-19 2004-12-16 David Thomas Vertical compensation in a moving camera
JP2007097195A (en) 2005-09-29 2007-04-12 Seiko Epson Corp Method for determining maximum exposure time for limiting motion blur in capturing image by image capture device, method of capturing image using image capture device, and apparatus for determining maximum exposure time for limiting motion blur when capturing image by image capturing device
WO2007045714A1 (en) 2005-10-21 2007-04-26 Nokia Corporation A method and a device for reducing motion distortion in digital imaging
US20080030587A1 (en) 2006-08-07 2008-02-07 Rene Helbing Still image stabilization suitable for compact camera environments
WO2008151802A1 (en) 2007-06-14 2008-12-18 Fotonation Ireland Limited Fast motion estimation method
US20090021588A1 (en) 2007-07-20 2009-01-22 Border John N Determining and correcting for imaging device motion during an exposure
US20090052743A1 (en) * 2004-10-12 2009-02-26 Axel Techmer Motion estimation in a plurality of temporally successive digital images
US20090160957A1 (en) 2007-12-20 2009-06-25 Micron Technology, Inc. Methods and system for digitally stabilizing video captured from rolling shutter cameras
US20090174782A1 (en) 2008-01-07 2009-07-09 Philippe Kahn Method and Apparatus for Improving Photo Image Quality
US20090213234A1 (en) * 2008-02-18 2009-08-27 National Taiwan University Method of full frame video stabilization
US20090219402A1 (en) 2008-03-01 2009-09-03 Avago Technologies Ecbu Ip (Singapore) Pte. Ltd Systems and Methods for Image Stabilization
US20100053347A1 (en) * 2008-08-28 2010-03-04 Agarwala Aseem O Content-Aware Video Stabilization
US7714892B2 (en) 2006-11-08 2010-05-11 Avago Technologies Ecbu Ip (Singapore) Pte. Ltd. Systems, devices and methods for digital camera image stabilization
US20100208087A1 (en) 2009-02-19 2010-08-19 Sony Corporation Image processing device, camera motion component calculation method, image processing program, and recording medium
US20100214423A1 (en) * 2009-02-19 2010-08-26 Sony Corporation Image processing device, focal plane distortion component calculation method, image processing program, and recording medium
US7796872B2 (en) 2007-01-05 2010-09-14 Invensense, Inc. Method and apparatus for producing a sharp image from a handheld device containing a gyroscope
WO2010116366A1 (en) 2009-04-07 2010-10-14 Nextvision Stabilized Systems Ltd Video motion compensation and stabilization gimbaled imaging system
US7817187B2 (en) 2007-06-27 2010-10-19 Aptina Imaging Corporation Image blur correction using a secondary camera
US20100329582A1 (en) 2009-06-29 2010-12-30 Tessera Technologies Ireland Limited Adaptive PSF Estimation Technique Using a Sharp Preview and a Blurred Image
US20110043613A1 (en) * 2008-01-04 2011-02-24 Janos Rohaly Three-dimensional model refinement
US20110085049A1 (en) 2009-10-14 2011-04-14 Zoran Corporation Method and apparatus for image stabilization
US20110109755A1 (en) 2009-11-12 2011-05-12 Joshi Neel S Hardware assisted image deblurring
US20110176043A1 (en) 2010-01-21 2011-07-21 Microsoft Corporation Reducing Motion-Related Artifacts in Rolling Shutter Video Information
US20120105654A1 (en) * 2010-10-28 2012-05-03 Google Inc. Methods and Systems for Processing a Video for Stabilization and Retargeting
US20120120264A1 (en) * 2010-11-12 2012-05-17 Samsung Electronics Co., Ltd. Method and apparatus for video stabilization by compensating for view direction of camera

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201005673A (en) * 2008-07-18 2010-02-01 Ind Tech Res Inst Example-based two-dimensional to three-dimensional image conversion method, computer readable medium therefor, and system
US8126247B2 (en) * 2009-05-19 2012-02-28 National Tsing Hua University Image preprocessing system for 3D image database construction

Patent Citations (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0358196A1 (en) 1988-09-09 1990-03-14 Canon Kabushiki Kaisha Automatic image stabilization device
US5430480A (en) 1992-06-30 1995-07-04 Ricoh California Research Center Sensor driven global motion compensation
US5717611A (en) 1993-10-08 1998-02-10 Nikon Corporation Slight motion detecting device
US5627905A (en) 1994-12-12 1997-05-06 Lockheed Martin Tactical Defense Systems Optical flow detection system
US20040252200A1 (en) * 1999-07-19 2004-12-16 David Thomas Vertical compensation in a moving camera
US20090052743A1 (en) * 2004-10-12 2009-02-26 Axel Techmer Motion estimation in a plurality of temporally successive digital images
JP2007097195A (en) 2005-09-29 2007-04-12 Seiko Epson Corp Method for determining maximum exposure time for limiting motion blur in capturing image by image capture device, method of capturing image using image capture device, and apparatus for determining maximum exposure time for limiting motion blur when capturing image by image capturing device
US7509038B2 (en) * 2005-09-29 2009-03-24 Seiko Epson Corporation Determining maximum exposure time to limit motion blur during image capture
WO2007045714A1 (en) 2005-10-21 2007-04-26 Nokia Corporation A method and a device for reducing motion distortion in digital imaging
US20080030587A1 (en) 2006-08-07 2008-02-07 Rene Helbing Still image stabilization suitable for compact camera environments
US8068140B2 (en) 2006-08-07 2011-11-29 Avago Technologies General Ip (Singapore) Pte. Ltd. Still image stabilization suitable for compact camera environments
US7714892B2 (en) 2006-11-08 2010-05-11 Avago Technologies Ecbu Ip (Singapore) Pte. Ltd. Systems, devices and methods for digital camera image stabilization
US7796872B2 (en) 2007-01-05 2010-09-14 Invensense, Inc. Method and apparatus for producing a sharp image from a handheld device containing a gyroscope
WO2008151802A1 (en) 2007-06-14 2008-12-18 Fotonation Ireland Limited Fast motion estimation method
US7817187B2 (en) 2007-06-27 2010-10-19 Aptina Imaging Corporation Image blur correction using a secondary camera
US20090021588A1 (en) 2007-07-20 2009-01-22 Border John N Determining and correcting for imaging device motion during an exposure
US20090160957A1 (en) 2007-12-20 2009-06-25 Micron Technology, Inc. Methods and system for digitally stabilizing video captured from rolling shutter cameras
US20110043613A1 (en) * 2008-01-04 2011-02-24 Janos Rohaly Three-dimensional model refinement
US20090174782A1 (en) 2008-01-07 2009-07-09 Philippe Kahn Method and Apparatus for Improving Photo Image Quality
US20090213234A1 (en) * 2008-02-18 2009-08-27 National Taiwan University Method of full frame video stabilization
US20090219402A1 (en) 2008-03-01 2009-09-03 Avago Technologies Ecbu Ip (Singapore) Pte. Ltd Systems and Methods for Image Stabilization
US20100053347A1 (en) * 2008-08-28 2010-03-04 Agarwala Aseem O Content-Aware Video Stabilization
US20100214423A1 (en) * 2009-02-19 2010-08-26 Sony Corporation Image processing device, focal plane distortion component calculation method, image processing program, and recording medium
US20100208087A1 (en) 2009-02-19 2010-08-19 Sony Corporation Image processing device, camera motion component calculation method, image processing program, and recording medium
WO2010116366A1 (en) 2009-04-07 2010-10-14 Nextvision Stabilized Systems Ltd Video motion compensation and stabilization gimbaled imaging system
US20100329582A1 (en) 2009-06-29 2010-12-30 Tessera Technologies Ireland Limited Adaptive PSF Estimation Technique Using a Sharp Preview and a Blurred Image
US20110085049A1 (en) 2009-10-14 2011-04-14 Zoran Corporation Method and apparatus for image stabilization
US20110109755A1 (en) 2009-11-12 2011-05-12 Joshi Neel S Hardware assisted image deblurring
US20110176043A1 (en) 2010-01-21 2011-07-21 Microsoft Corporation Reducing Motion-Related Artifacts in Rolling Shutter Video Information
US20120105654A1 (en) * 2010-10-28 2012-05-03 Google Inc. Methods and Systems for Processing a Video for Stabilization and Retargeting
US20120120264A1 (en) * 2010-11-12 2012-05-17 Samsung Electronics Co., Ltd. Method and apparatus for video stabilization by compensating for view direction of camera

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
"Camera CalibrationToolbox for Matlab", http://www.vision.caltech.edu/bouguetj/calib-doc/htmls/parameters.html, Feb. 16, 2011, pp. 3 total.
Extended European Search Report for EP 12170153.6 mailed Sep. 3, 2012, 7 pages.
PCT International Search Report and Written Opinion of the International Searching Authority for PCT/US2012/038999, mailed Aug. 24, 2012.
PCT Search Report and Written Opinion of the International Searching Authority for PCT/US2012/038569, mailed Jul. 27, 2012.
Sinha, Sudipta N., et al., Piecewise Planar Stereo for Image-based Rendering, IEEE, Inproceedings, Sep. 29, 2009, 8 pages.
Wikipedia, "Rolling Shutter", http://en.wikipedia.org/wiki/Rolling-shutter, Dec. 17, 2010, pp. 2 total.

Cited By (41)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11792538B2 (en) 2008-05-20 2023-10-17 Adeia Imaging Llc Capturing and processing of images including occlusions focused on an image sensor by a lens stack array
US10694114B2 (en) 2008-05-20 2020-06-23 Fotonation Limited Capturing and processing of images including occlusions focused on an image sensor by a lens stack array
US11412158B2 (en) 2008-05-20 2022-08-09 Fotonation Limited Capturing and processing of images including occlusions focused on an image sensor by a lens stack array
US11423513B2 (en) 2010-12-14 2022-08-23 Fotonation Limited Systems and methods for synthesizing high resolution images using images captured by an array of independently controllable imagers
US11875475B2 (en) 2010-12-14 2024-01-16 Adeia Imaging Llc Systems and methods for synthesizing high resolution images using images captured by an array of independently controllable imagers
US10839485B2 (en) 2010-12-14 2020-11-17 Fotonation Limited Systems and methods for synthesizing high resolution images using images captured by an array of independently controllable imagers
US9602725B2 (en) * 2011-06-06 2017-03-21 Apple Inc. Correcting rolling shutter using image stabilization
US20140320681A1 (en) * 2011-06-06 2014-10-30 Apple Inc. Correcting rolling shutter using image stabilization
US11729365B2 (en) 2011-09-28 2023-08-15 Adela Imaging LLC Systems and methods for encoding image files containing depth maps stored as metadata
US10984276B2 (en) 2011-09-28 2021-04-20 Fotonation Limited Systems and methods for encoding image files containing depth maps stored as metadata
US10909707B2 (en) 2012-08-21 2021-02-02 Fotonation Limited System and methods for measuring depth using an array of independently controllable cameras
US9554042B2 (en) * 2012-09-24 2017-01-24 Google Technology Holdings LLC Preventing motion artifacts by intelligently disabling video stabilization
US20140085493A1 (en) * 2012-09-24 2014-03-27 Motorola Mobility Llc Preventing motion artifacts by intelligently disabling video stabilization
US10958892B2 (en) 2013-03-10 2021-03-23 Fotonation Limited System and methods for calibration of an array camera
US11272161B2 (en) 2013-03-10 2022-03-08 Fotonation Limited System and methods for calibration of an array camera
US11570423B2 (en) 2013-03-10 2023-01-31 Adeia Imaging Llc System and methods for calibration of an array camera
US10547772B2 (en) 2013-03-14 2020-01-28 Fotonation Limited Systems and methods for reducing motion blur in images or video in ultra low light with array cameras
US20180302554A1 (en) * 2013-03-15 2018-10-18 Fotonation Cayman Limited Systems and Methods for Synthesizing High Resolution Images Using Image Deconvolution Based on Motion and Depth Information
US10542208B2 (en) * 2013-03-15 2020-01-21 Fotonation Limited Systems and methods for synthesizing high resolution images using image deconvolution based on motion and depth information
US11486698B2 (en) 2013-11-18 2022-11-01 Fotonation Limited Systems and methods for estimating depth from projected texture using camera arrays
US10767981B2 (en) 2013-11-18 2020-09-08 Fotonation Limited Systems and methods for estimating depth from projected texture using camera arrays
US10708492B2 (en) 2013-11-26 2020-07-07 Fotonation Limited Array camera configurations incorporating constituent array cameras and constituent cameras
US11546576B2 (en) 2014-09-29 2023-01-03 Adeia Imaging Llc Systems and methods for dynamic calibration of array cameras
US10284794B1 (en) 2015-01-07 2019-05-07 Car360 Inc. Three-dimensional stabilized 360-degree composite image capture
US11095837B2 (en) 2015-01-07 2021-08-17 Carvana, LLC Three-dimensional stabilized 360-degree composite image capture
US11616919B2 (en) 2015-01-07 2023-03-28 Carvana, LLC Three-dimensional stabilized 360-degree composite image capture
US9998663B1 (en) 2015-01-07 2018-06-12 Car360 Inc. Surround image capture and processing
US20230077815A1 (en) * 2015-10-26 2023-03-16 Alpinereplay, Inc. System and method for enhanced video image recognition using motion sensors
US11699273B2 (en) 2019-09-17 2023-07-11 Intrinsic Innovation Llc Systems and methods for surface modeling using polarization cues
US11270110B2 (en) 2019-09-17 2022-03-08 Boston Polarimetrics, Inc. Systems and methods for surface modeling using polarization cues
US11525906B2 (en) 2019-10-07 2022-12-13 Intrinsic Innovation Llc Systems and methods for augmentation of sensor systems and imaging systems with polarization
US11302012B2 (en) 2019-11-30 2022-04-12 Boston Polarimetrics, Inc. Systems and methods for transparent object segmentation using polarization cues
US11842495B2 (en) 2019-11-30 2023-12-12 Intrinsic Innovation Llc Systems and methods for transparent object segmentation using polarization cues
US11748844B2 (en) 2020-01-08 2023-09-05 Carvana, LLC Systems and methods for generating a virtual display of an item
US11580667B2 (en) 2020-01-29 2023-02-14 Intrinsic Innovation Llc Systems and methods for characterizing object pose detection and measurement systems
US11797863B2 (en) 2020-01-30 2023-10-24 Intrinsic Innovation Llc Systems and methods for synthesizing data for training statistical models on different imaging modalities including polarized images
US11683594B2 (en) 2021-04-15 2023-06-20 Intrinsic Innovation Llc Systems and methods for camera exposure control
US11290658B1 (en) 2021-04-15 2022-03-29 Boston Polarimetrics, Inc. Systems and methods for camera exposure control
US11954886B2 (en) 2021-04-15 2024-04-09 Intrinsic Innovation Llc Systems and methods for six-degree of freedom pose estimation of deformable objects
US11953700B2 (en) 2021-05-27 2024-04-09 Intrinsic Innovation Llc Multi-aperture polarization optical systems using beam splitters
US11689813B2 (en) 2021-07-01 2023-06-27 Intrinsic Innovation Llc Systems and methods for high dynamic range imaging using crossed polarizers

Also Published As

Publication number Publication date
TW201303795A (en) 2013-01-16
TWI464707B (en) 2014-12-11
WO2012170179A1 (en) 2012-12-13
US20120307085A1 (en) 2012-12-06

Similar Documents

Publication Publication Date Title
US9602725B2 (en) Correcting rolling shutter using image stabilization
US8648919B2 (en) Methods and systems for image stabilization
US10217200B2 (en) Joint video stabilization and rolling shutter correction on a generic platform
CN103412616B (en) Device camera angle
RU2758460C1 (en) Terminal apparatus and method for video image stabilisation
US9148651B2 (en) Methods and devices for generating a stereoscopic image
KR20230044137A (en) Multi-camera video stabilization
US9532041B2 (en) Method and system for automatic focus with self-calibration
CN102207674A (en) Panorama image shooting apparatus and method
TW201404128A (en) Motion-based image stitching
CN107071277B (en) Optical drawing shooting device and method and mobile terminal
EP1542454A2 (en) Image processing apparatus
CN113542600B (en) Image generation method, device, chip, terminal and storage medium
JP2010072813A (en) Image processing device and image processing program
JP4975644B2 (en) Image processing device
CA2827531C (en) Methods and devices for generating a stereoscopic image
CN113890984B (en) Photographing method, image processing method and electronic equipment
CN115802178A (en) Motion blur processing method and device, electronic equipment and storage medium
CN117596482A (en) Video jitter removing method, electronic equipment, system and storage medium
JP2015019327A (en) Information processing apparatus, program, and method
CN112150554A (en) Picture display method, device, terminal and storage medium
JP2013078038A (en) Imaging device, image processing method, and program
JP2012120016A (en) Terminal device, and processing method and program of the same

Legal Events

Date Code Title Description
AS Assignment

Owner name: APPLE INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:MANTZEL, WILLIAM E.;GREENEBAUM, KENNETH I.;SIGNING DATES FROM 20110519 TO 20110606;REEL/FRAME:026397/0850

FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

STCF Information on status: patent grant

Free format text: PATENTED CASE

FPAY Fee payment

Year of fee payment: 4

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 8