WO1999034327A2 - System and method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs - Google Patents

System and method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs Download PDF

Info

Publication number
WO1999034327A2
Authority
WO
WIPO (PCT)
Prior art keywords
system user
user
dimensional
virtual reality
reality environment
Prior art date
Application number
PCT/IB1998/001965
Other languages
French (fr)
Other versions
WO1999034327A3 (en)
Inventor
Damian M. Lyons
Original Assignee
Koninklijke Philips Electronics N.V.
Philips Ab
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics N.V., Philips Ab filed Critical Koninklijke Philips Electronics N.V.
Priority to EP98955857A priority Critical patent/EP0976106A2/en
Priority to JP53468399A priority patent/JP2001517344A/en
Publication of WO1999034327A2 publication Critical patent/WO1999034327A2/en
Publication of WO1999034327A3 publication Critical patent/WO1999034327A3/en

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/017Gesture based interaction, e.g. based on a set of recognized hand gestures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00General purpose image data processing
    • A63F13/10
    • AHUMAN NECESSITIES
    • A63SPORTS; GAMES; AMUSEMENTS
    • A63FCARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
    • A63F13/00Video games, i.e. games using an electronically generated display having two or more dimensions
    • A63F13/20Input arrangements for video game devices
    • A63F13/21Input arrangements for video game devices characterised by their sensors, purposes or types
    • A63F13/213Input arrangements for video game devices characterised by their sensors, purposes or types comprising photodetecting means, e.g. cameras, photodiodes or infrared cells
    • AHUMAN NECESSITIES
    • A63SPORTS; GAMES; AMUSEMENTS
    • A63FCARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
    • A63F13/00Video games, i.e. games using an electronically generated display having two or more dimensions
    • A63F13/40Processing input control signals of video game devices, e.g. signals generated by the player or derived from the environment
    • AHUMAN NECESSITIES
    • A63SPORTS; GAMES; AMUSEMENTS
    • A63FCARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
    • A63F13/00Video games, i.e. games using an electronically generated display having two or more dimensions
    • A63F13/45Controlling the progress of the video game
    • AHUMAN NECESSITIES
    • A63SPORTS; GAMES; AMUSEMENTS
    • A63FCARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
    • A63F13/00Video games, i.e. games using an electronically generated display having two or more dimensions
    • A63F13/60Generating or modifying game content before or while executing the game program, e.g. authoring tools specially adapted for game development or game-integrated level editor
    • A63F13/65Generating or modifying game content before or while executing the game program, e.g. authoring tools specially adapted for game development or game-integrated level editor automatically by game devices or servers from real world data, e.g. measurement in live racing competition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/011Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
    • AHUMAN NECESSITIES
    • A63SPORTS; GAMES; AMUSEMENTS
    • A63FCARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
    • A63F2300/00Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game
    • A63F2300/10Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game characterized by input arrangements for converting player-generated signals into game device control signals
    • A63F2300/1087Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game characterized by input arrangements for converting player-generated signals into game device control signals comprising photodetecting means, e.g. a camera
    • A63F2300/1093Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game characterized by input arrangements for converting player-generated signals into game device control signals comprising photodetecting means, e.g. a camera using visible light
    • AHUMAN NECESSITIES
    • A63SPORTS; GAMES; AMUSEMENTS
    • A63FCARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
    • A63F2300/00Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game
    • A63F2300/60Methods for processing data by generating or executing the game program
    • A63F2300/69Involving elements of the real world in the game world, e.g. measurement in live races, real video
    • AHUMAN NECESSITIES
    • A63SPORTS; GAMES; AMUSEMENTS
    • A63FCARD, BOARD, OR ROULETTE GAMES; INDOOR GAMES USING SMALL MOVING PLAYING BODIES; VIDEO GAMES; GAMES NOT OTHERWISE PROVIDED FOR
    • A63F2300/00Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game
    • A63F2300/80Features of games using an electronically generated display having two or more dimensions, e.g. on a television screen, showing representations related to the game specially adapted for executing a specific type of game
    • A63F2300/8082Virtual reality

Definitions

  • the present invention relates generally to multimedia and virtual reality applications, and, more particularly to a system and method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs.
  • Multimedia and virtual reality applications permit exciting interaction between a user and a computer.
  • current computer/user interfaces present a barrier to simplistic user interactivity and thus, consumer acceptance of multimedia and virtual reality applications.
  • computer/user interfaces would combine an intuitive interaction format with a broad range of interaction capabilities. Practically, however, these two features conflict. For example, a computer keyboard offers broad interaction capabilities but is not intuitive, whereas a television remote control is more intuitive but offers limited interaction capabilities. Even more flexible interfaces, such as an instrumented body suit, can be both cumbersome and expensive.
  • the two primary computer/user interface approaches to displaying and interacting with the virtual reality environment comprise an immersive approach and a non-immersive approach.
  • in the immersive approach, the user wears a head-mounted display, as well as tracking devices attached to the head and one or more limbs.
  • a computer displays a virtual reality environment on the head-mounted display by displaying synthetic visual images to the user's eyes, and changes the images based upon the information received from the head tracking device.
  • the limb tracking devices permit user interaction with the virtual reality environment.
  • the immersive approach provides the advantage of giving the user the impression of being "in" the virtual reality environment.
  • the immersive approach provides the disadvantage of isolating the user from his/her real environment, leaving the user unaware of health and safety dangers in the real environment.
  • the computer displays visual images of the virtual environment on a stationary display, such as a computer monitor or television screen.
  • the user changes the perspective of the visual images using a computer keyboard, mouse, joystick or other similar devices as an interface with the computer.
  • the non-immersive approach provides the advantage of allowing the user to interact with the virtual reality environment without isolating the user from his/her real environment.
  • the non-immersive approach fails to give the user the impression of being "in" the virtual reality environment.
  • Such an interface system 10 comprises a blue wall 12, in front of which a user 14 stands, permitting two-dimensional silhouette extraction of user 14 and chromakeying of the silhouette.
  • System 10 further includes a video camera 16 for identifying the two-dimensional, user silhouette and for producing a video signal.
  • a microprocessor 18 of a computer identifies the two-dimensional, user silhouette seen by video camera 16, but only as a two-dimensional shape.
  • Microprocessor 18 displays an image of user 14 on a television display 20.
  • the image displayed on television 20 consists of a two-dimensional scene into which the user's image has been chromakeyed.
  • User 14 can interact with the displayed scene by adopting a specific pose, e.g., hands-over-head, or by moving so that a portion of the user's silhouette touches a designated set of image coordinates making it appear as if user 14 touched a displayed object.
  • the interface system shown in FIG. 1 provides an easy-to-use, inexpensive interface with multimedia and virtual reality applications.
  • the interface system only permits two-dimensional interaction with computer-displayed objects, restricting the capabilities of the interface to two dimensions.
  • all of the computer-displayed objects are at the same depth in the window surrounding the user's silhouette.
  • a conventional two-dimensional silhouette extraction process used by the system shown in FIG. 1 comprises both a hardware process (above the dashed line) and a software process (below the dashed line), wherein computer microprocessor 18 performs the software process steps.
  • the hardware process involves a step 22 of inputting an analog video camera signal, followed by a step 24 of digitizing the analog camera signal to produce a gray-scale binary data signal.
  • the hardware process further comprises a step 26 of adjusting the resolution (high or low) of the video camera, and a step 28 of restricting the camera view to a window of the image of interest, i.e., the user's image.
  • the hardware process next comprises a dynamic threshold step 30 where the gray-scale binary data signal is converted into digital binary data, e.g., "1" or "0."
  • at step 32, the hardware process determines the edges (silhouette) of the user's image, and, based on the edge data, adjusts the picture size (step 34) so as to adjust the resolution accordingly at step 26.
  • the software process involves a first step 36 of subtracting the background from the edge data of step 34, leaving only an image contour of the user's image.
  • the background is a picture of an empty scene as seen by the camera, and is provided at step 38.
  • the software further comprises a step 40 of joining together all of the edge data of the user's image, providing a single contour around the user's image.
  • the software process also comprises an identification step 42 for determining whether the user image contour represents a person, an animal, etc., and a silhouette feature step 44 for identifying the silhouette features (in x, y coordinates) of the user, e.g., head, hands, feet, arms, legs, etc.
  • the software process utilizes the contour identification data in order to calculate a bounding box around the user.
  • the bounding box data is provided to the window restricting step 28 for restricting the size of the camera window around the user, thus increasing the speed of the extraction process.
  • Still another approach includes a method for real-time recognition of a human image, as disclosed in Japanese Patent Abstract Publication No. 07-038873 ("JP 07-038873").
  • JP 07-038873 describes three-dimensional graphical generation of a person that detects the expression, rotation of the head, motion of the fingers, and rotation of the human body.
  • JP 07-038873 is limited to graphical model generation of the human body.
  • JP 07-038873 focuses on using three-dimensional graphical animation of a user primarily for teleconferencing purposes, wherein the user cannot control objects in a computer-generated scene.
  • the reference discloses using three-dimensional animation of a remote user for teleconferencing purposes, as opposed to a three-dimensional animation of a local user.
  • A final approach, as found in International Patent Application (PCT) WO 96/21321 ("PCT 96/21321"), consists of creating a three-dimensional simulation of an event (e.g., a football game), in real-time or storing it on a CD ROM, using cameras and microphones.
  • the system disclosed in PCT 96/21321 merely replays three-dimensional scenes of the event as they are viewed by the cameras.
  • users of the PCT 96/21321 system can only change their perspective of the three-dimensional scenes and are unable to control objects in the scenes.
  • An object of the present invention is to address the problems encountered by the two-dimensional interface systems and the alternative approaches proposed by the Media Lab at the Massachusetts Institute of Technology and the other related art discussed above. Another object is to provide a three-dimensional interface for computer users that integrates the advantages of both the immersive and non-immersive approaches to displaying and interacting with a virtual reality environment.
  • a further object is to provide a three-dimensional display of computer-generated objects so that the objects occupy the three-dimensional space in a virtual reality environment around the computer users and the computer users navigate through and control the objects in the virtual reality environment through normal body movements.
  • the invention comprises a system for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of a user of the system, including: a computer-readable memory means; means for generating video signals indicative of the gestures of the system user and an interaction area surrounding the system user; means for displaying video images, the video image display means being positioned in front of the system user; and means for processing the video signals, in accordance with a program stored in the computer-readable memory means, to determine the three-dimensional positions of the body and principle body parts of the system user, wherein the video signal processing means constructs three-dimensional images of the system user and interaction area on the video image display means based upon the three-dimensional positions of the body and principle body parts of the system user, the video image display means displays three-dimensional graphical objects within the virtual reality environment, and movement by the system user permits apparent movement of the three-dimensional objects displayed on the video image display means so that the system user appears to move throughout the virtual reality environment.
  • the present invention comprises a method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of a user of a computer system having a computer-readable memory and video image display connected to a microprocessor using a program stored in the computer-readable memory, the method comprising the steps of: generating video signals indicative of the gestures of the system user and an interaction area surrounding the system user; processing the video signals in the microprocessor to determine the three-dimensional positions of the body and principle body parts of the system user; using the microprocessor to construct three-dimensional images of the system user and interaction area on the video image display based upon the three-dimensional positions of the body and principle body parts of the system user; and utilizing the microprocessor to display on the video image display three-dimensional graphical objects within the virtual reality environment, wherein movement by the system user permits apparent movement by the three-dimensional objects displayed on the video image display so that the system user appears to move throughout the virtual reality environment.
  • the objects shown on the video image display will move to different positions on the video image display and to different positions relative to one another, by an amount dependent on the distance that the object is supposed to be away from the video image display screen.
  • This movement corresponds to a shift in the intersection of the screen with a line of sight from the head of the user to the object at its modeled position in the virtual reality environment. This shift is computed for each object from the detected change in position of the head.
  • the present invention comprises a computer-readable memory device for storing a program that permits three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of a user of a computer system having a video image display connected to a microprocessor using instructions stored in the computer-readable memory device, the computer-readable memory device comprising: instructions for processing video signals indicative of gestures of the system user to determine the three-dimensional positions of the body and principle body parts of the system user; instructions for constructing three-dimensional images of the system user and interaction area on the video image display based upon the three-dimensional positions of the body and principle body parts of the system user; and instructions for displaying, on the video image display, three-dimensional graphical objects within the virtual reality environment, wherein movement by the system user permits apparent movement by the three-dimensional objects displayed on the video image display so that the system user appears to move throughout the virtual reality environment.
  • the present invention comprises a computer program product for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of a user of a computer system having a video image display connected to a microprocessor, the computer program product comprising: means for processing video signals indicative of gestures of the system user to determine the three-dimensional positions of the body and principle body parts of the system user; means for constructing three-dimensional images of the system user and interaction area on the video image display based upon the three-dimensional positions of the body and principle body parts of the system user; and means for displaying, on the video image display, three-dimensional graphical objects within the virtual reality environment, wherein movement by the system user permits apparent movement by the three-dimensional objects displayed on the video image display so that the system user appears to move throughout the virtual reality environment.
  • FIG. 1 is a block diagram of a conventional system for constructing two-dimensional images using camera-based silhouettes of users;
  • FIG. 2 is a flowchart showing the steps involved in a conventional software process for extracting two-dimensional images using silhouettes of users;
  • FIG. 3 is a block diagram of a system for constructing three-dimensional images using camera-based gesture inputs of users in accordance with a preferred embodiment of the present invention;
  • FIG. 4 is a block diagram of a system for constructing three-dimensional images using camera-based gesture inputs of users in accordance with another preferred embodiment of the present invention;
  • FIG. 5 is a flowchart showing the steps involved in a software process for mapping two-dimensional image features of users and an interactive area onto three-dimensional locations within the interactive area in accordance with the preferred embodiments of the present invention shown in FIGs. 3 and 4;
  • FIG. 6 is a block diagram showing a system for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of the present invention using the system for constructing three-dimensional images shown in FIG. 3;
  • FIG. 7 is a flowchart showing the steps involved in a software process for use with the system for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of the present invention shown in FIG. 6; and
  • FIG. 8 is a biometric data table showing the length of body parts as a ratio of the body height (H), wherein the body height (H) is the height of a standing person.
  • the present invention includes a system and method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of system users.
  • the system comprises a computer-readable memory means, means for generating video signals indicative of the gestures of the system users and an interaction area surrounding the system users, and means for displaying video images.
  • the video image display means is positioned in front of the system users.
  • the system further comprises means for processing the video signals, in accordance with a program stored in the computer-readable memory means, to determine the three-dimensional positions of the bodies and principle body parts of the system users, wherein the video signal processing means constructs three-dimensional images of the system users and interaction area on the video image display means based upon the three-dimensional positions of the bodies and principle body parts of the system users, the video image display means displays three-dimensional graphical objects within the virtual reality environment, and movement by the system users causes apparent movement of the three-dimensional objects displayed on the video image display means so that the system users appear to move throughout the virtual reality environment.
  • the present invention is drawn to a natural and intuitive computer/user interface based upon computer vision interaction by system users.
  • computer vision is the use of a computer to interpret information received from a video imaging device in terms of what objects the imaging device sees.
  • Computer vision permits recognition of user gestures, body motions, head motions, eye motions, etc.
  • the recognized user motions are used to interact with multimedia and virtual reality applications.
  • the present invention takes the system users' silhouettes in two-dimensional image coordinates and projects them into the three-dimensional image coordinates the system users occupy in the interaction area.
  • System 50 comprises a video camera 56, a video display screen 54, and a computer 58 having a Philips® single board image processor (SBIP) 60.
  • SBIP 60 eliminates problems (1) - (3) encountered in the approach proposed by the Media Lab at the Massachusetts Institute of Technology, and thus, also encountered in the two-dimensional systems.
  • Computer 58 also comprises a computer-readable memory 66 encoded with three-dimensional imaging software. SBIP 60 utilizes the software so that system 50 may handle the three-dimensional body motions of the system user.
  • the three-dimensional imaging software of the present invention corrects problems (4) and (5) encountered in the approach proposed by the Media Lab at the Massachusetts Institute of Technology.
  • the present invention provides an interactive video environment ("IVE") capable of evaluating several IVE paradigms other than the "magic mirror" paradigm proposed by the Massachusetts Institute of Technology.
  • the present invention is capable of evaluating the following four IVE paradigms: (1) the display shows live video of a camera input of a remote site, and the video camera input of the system users is composited with the live video on the display (this is known as the "mirror" effect, as in the MIT approach); (2) the display shows live video of the remote site, and the camera input of the users is not composited with the live video on the display (this is known as the "window" effect); (3) the display shows graphical images as in virtual reality, and the camera input of the system users is composited with the graphical images on the display; and (4) the display shows graphical images, and the camera input of the system users is not composited with the graphical images on the display.
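For clarity, the four paradigms enumerated above factor into two independent choices: what the display shows (live remote video or computer-generated graphics) and whether the camera input of the users is composited into it. The sketch below merely restates that 2 x 2 structure in Python; the type and constant names are illustrative, not from the patent.

```python
from dataclasses import dataclass
from enum import Enum

class Background(Enum):
    LIVE_REMOTE_VIDEO = "live video of a camera at a remote site"
    GRAPHICS = "computer-generated graphics, as in virtual reality"

@dataclass(frozen=True)
class IVEParadigm:
    background: Background
    composite_user: bool  # is the camera input of the system users composited in?

MIRROR          = IVEParadigm(Background.LIVE_REMOTE_VIDEO, True)   # paradigm (1), the MIT "mirror" effect
WINDOW          = IVEParadigm(Background.LIVE_REMOTE_VIDEO, False)  # paradigm (2), the "window" effect
VR_WITH_USER    = IVEParadigm(Background.GRAPHICS, True)            # paradigm (3)
VR_WITHOUT_USER = IVEParadigm(Background.GRAPHICS, False)           # paradigm (4)
```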
  • system 50 comprises a means for processing video signals, such as a computer 58, electrically coupled to a means for generating video signals, such as a video camera 56.
  • Computer 58 is electrically coupled to a means for displaying video images, such as a video display screen 54.
  • video display screen 54 is located in front of an interaction area 52 where system users 62 stand.
  • Video camera 56 electronically reads the images of users 62 and interactive area 52, creates video signals corresponding to these images, and provides the video signals to computer 58.
  • computer 58 comprises a typical microprocessor-based computing device such as an IBM-compatible personal computer.
  • Computer 58 further comprises a serial port 64 connected to a microprocessor 60 for receiving the video signals from video camera 56, and a conventional computer-readable memory 66 capable of being encoded with software programs.
  • Microprocessor 60 preferably is a Philips® single board image processor (SBIP).
  • SBIP 60 uses the software (described below), encoded in computer memory 66, for mapping the two-dimensional image features of users 62 and interactive area 52 and calculating the three-dimensional position of system users 62 within interactive area 52.
  • SBIP 60 also may use an application program permanently encoded within computer-readable memory 66, or temporarily encoded therein via an external computer-readable memory, such as for example, a floppy diskette or a CD ROM.
  • Computer 58 further comprises a mode control port 68, connected to SBIP 60 for receiving data from other interactive controls such as a remote control, and a video processor port 70 for delivering video signals to video display screen 54.
  • the software encoded in computer-readable memory 66 and used by SBIP 60 isolates the contours of the system users, determines their body and limb positions in three-dimensional image space, and generates a video signal corresponding to the body and limb positions for video display screen 54.
  • Display screen 54 preferably consists of a conventional audio/visual monitor system capable of displaying three-dimensional graphical information.
  • the type of display screen 54 and video camera 56 used in the present invention is arbitrary and may be chosen based only upon the intended application of the system of the present invention.
  • video display screen 54 is a rear-projection Ikegami TPP 1000/1500® projector with a Retroscan RS125SW® screen (six feet in height in the y direction and eight feet in width in the x direction); interaction area 52 is an eleven feet (in the z direction) by twelve feet (in the x direction) area in front of video display screen 54; and video camera 56 is a Sony® NTSC video camera.
  • the alternate embodiment 80 comprises a video camera 56 and computer 58 with SBIP 60 and computer-readable memory 66 similar to those described with reference to FIG. 3.
  • the alternate embodiment further comprises a compact disc reader 84 capable of reading an application program encoded on a CD ROM and providing such application program to SBIP 60.
  • the alternate embodiment also comprises a remote controller 86 for controlling features of the application program.
  • the alternate embodiment includes a conventional television display 82 capable of receiving video signals from SBIP 60 and transmitting information to SBIP 60.
  • the software for mapping two-dimensional image features of system users and an interactive area onto three-dimensional locations within the interactive area, as well as the applications programs for use with the preferred embodiments, will now be described.
  • the software and applications programs are described with reference to a single system user.
  • the camera-based gesture recognition technology of the present invention can be used with multiple users by identifying each user individually and interacting with each user accordingly.
  • FIG. 5 is a flowchart showing the steps involved in the software process for mapping two-dimensional image features of a system user 62 onto three-dimensional locations in the room where system user 62 is located in accordance with the preferred embodiments of the present invention shown in FIGs. 3 and 4.
  • the three-dimensional imaging software may be permanently encoded within computer-readable memory 66 or may be temporarily encoded in memory 66 via a diskette, CD ROM, or similar memory storage means.
  • the three-dimensional imaging software process comprises a first step 100 of extracting the two-dimensional head, hands and feet features of a user in image space coordinates (u, v) using the two-dimensional extraction process shown in FIG. 2.
  • Coordinates (u, v) correspond to the two-dimensional x-y plane of the user in front of video camera 56.
  • the three-dimensional imaging process further comprises a step 102 of reading the orientation and location of video camera 56 in three-dimensional coordinates (x, y, z) with respect to the room. Assuming the user's feet are on the floor, at step 104 the software process projects the two-dimensional, extracted features of the user's feet to three-dimensional coordinates (x, y, z) of the user's feet, using the (x, y, z) orientation of camera 56 with respect to the room.
  • the software process projects the two-dimensional, extracted features of the user's head and hands to three-dimensional coordinates (x, y, z) of the user's head and hands, assuming that the head and hands are slightly offset from the position of the feet in the z direction and using the (x, y, z) orientation of camera 56 with respect to the room.
  • the three-dimensional imaging process further comprises a step 108 of using the measured height (h) of the user to access a biometric data table (shown at step 110) indexed by height and stored within computer-readable memory 66.
  • a biometric data table capable of use with the present invention is shown in FIG. 8. The present invention is not limited by the biometric data shown in FIG. 8.
  • the imaging process further assumes that the user's arm length is 0.44h, and utilizes the assumed arm length (0.44h) until a measured arm length greater than 0.44h is extracted by video camera 56.
  • the software process further comprises a step 112 of calculating each arm's offset in the z direction from the corresponding foot, using the assumed arm length (0.44h) calculated in step 108.
  • each arm's actual length in the z direction is calculated from the assumed arm length using the principle of foreshortening.
  • the software process comprises a final step 114 of supplying the three-dimensional positions of the user's head, hands and feet to an application program.
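The geometry of steps 100 through 114 reduces to back-projecting image features onto known planes in the room. The following Python/NumPy sketch is illustrative only: the pinhole camera model (intrinsics K, rotation R, camera position) and all function names are assumptions, while the floor-plane projection of the feet, the 0.44h arm-length ratio, and the recovery of each arm's z offset from its foreshortened projection follow the process described above (reading a Pythagorean relation into "the principle of foreshortening"). Room coordinates are taken with y up and the floor at y = 0.

```python
import numpy as np

def pixel_to_ray(u, v, K, R, cam_pos):
    """Back-project image coordinates (u, v) into a world-space ray
    (step 102 supplies the camera orientation R and location cam_pos)."""
    d_cam = np.linalg.inv(K) @ np.array([u, v, 1.0])  # direction in the camera frame
    d_world = R @ d_cam                               # rotate into room coordinates
    return cam_pos, d_world / np.linalg.norm(d_world)

def foot_to_3d(u, v, K, R, cam_pos):
    """Step 104: assuming the feet are on the floor, intersect the foot ray
    with the plane y = 0 to obtain the foot's (x, 0, z) room position."""
    origin, d = pixel_to_ray(u, v, K, R, cam_pos)
    s = -origin[1] / d[1]          # ray parameter where the ray reaches y = 0
    return origin + s * d

def arm_z_offset(measured_xy, height):
    """Steps 108-112: assume an arm length of 0.44h (from the biometric table)
    and recover the arm's z offset from its foreshortened projected extent."""
    arm = 0.44 * height            # assumed arm length until a longer one is measured
    # Foreshortening: the projection shrinks as the arm points toward the
    # camera; the unseen z component follows from Pythagoras.
    return np.sqrt(max(arm**2 - measured_xy**2, 0.0))
```

The (x, z) standing position recovered for the feet is what the navigation process of FIG. 7 samples as the baseline (bx, bz) and current (px, pz) positions.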
  • FIG. 6 is a block diagram showing the system 90 for permitting navigation through a virtual reality environment using camera-based gesture inputs and the system for constructing three-dimensional images shown in FIG. 3.
  • a user 62 of the navigation system stands in front of video display screen 54 displaying graphical images 92 of a virtual reality environment 94.
  • Video camera 56 views user 62 and SBIP 60 processes data received from camera 56 by extracting the three-dimensional image of user 62 and by identifying user body motions, such as the three-dimensional positions of the user's head, hands, legs, feet, etc., as described above.
  • the system shown in FIG. 6 incorporates the advantages of both the immersive and non-immersive approaches to displaying and interacting with a virtual reality environment.
  • the navigation system of the present invention utilizes the non-immersive concepts of displaying the virtual reality environment on a stationary display, and of not mounting special equipment on the user's body to read body motions.
  • the navigation system of the present invention also utilizes the immersive concept of employing the user's body motions to control the motion of and interaction with the virtual reality environment.
  • a large-screen display is preferably used so that the user sees a life-size rendering of the virtual environment, and the display covers a large portion of the user's field of vision.
  • the present invention permits navigation through virtual reality environment 94 as follows.
  • SBIP 60 computes the location where user 62 is standing.
  • by taking a step forward, user 62 causes SBIP 60 to move graphical images 92 on display screen 54 so that user 62 appears to be moving forward through virtual reality environment 94.
  • SBIP 60 stops the movement of graphical images 92 when user 62 takes a step backwards. If user 62 takes a second step backwards, SBIP 60 moves graphical images 92 in the reverse direction so that user 62 appears to be moving backwards through virtual reality environment 94.
  • Additional gestures may be used with the navigation system 90 of the present invention to enhance flexibility. For example, if user 62 raises his/her right hand, SBIP 60 causes graphical images 92 to rotate at a fixed rate in a clockwise direction so that the user appears to rotate clockwise in virtual reality environment 94. By lowering his/her right hand, user 62 instructs SBIP 60 to stop rotation, wherein SBIP 60 causes graphical images 92 to cease rotating. User 62 may use his/her left hand in a similar fashion to start or stop apparent counter-clockwise rotation in virtual reality environment 94. Further, if user 62 bends at the waist, SBIP 60 causes graphical images 92 to rotate so that a virtual image of the ground in front of user 62 can be seen.
  • the navigation system 90 of the present invention may also be provided with the following additional gestures: (1) graphical objects could be picked up when user 62 reaches his/her hand in the direction of the graphical objects; (2) the "picked-up" graphical objects could be manipulated by user gestures such as pushing, moving, etc.; and (3) left and right leg motions could be identified to permit user 62 to "walk" through the virtual reality environment.
  • FIG. 7 is a flowchart showing the steps involved in a software process for use with the system for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of the present invention shown in FIG. 6.
  • the process comprises a step 200 of using the extracted, three-dimensional data of the system user (provided by the process of FIG. 5) to take one-second samples of the user's head and hand positions.
  • the process further comprises a step 202 of determining whether the average height of either hand (average hand height being determined from the one-second samples) is greater than the average height of the user's head (average head height being determined from the one-second samples). If the average height of either hand is not greater than the average height of the head, the process returns to step 200. However, if the average height of either hand is greater than the average height of the head, the process proceeds to step 204 where the user's foot position (bx, bz) and current head position (bh) are measured.
  • the process further comprises a step 206 for determining whether the user is still visible. If the user is not visible, the process returns to step 200. If the user is visible, the process proceeds to step 208 where the current foot position (px, pz), current hand positions, and current height (ph) are measured, and the variables for movement velocity (vx and vz), the height variable used to determine whether the user is bending (dy), and the rotational movements (dry and drx) are set to zero.
  • (k) is a value from 0.3 to 1.0.
  • the height variable (dy) is set to a dummy variable (deltay) if both hand heights are greater than head height (bh).
  • at step 212, if only the left hand height is greater than head height (bh), then rotational movement (dry) is set to alpha, wherein alpha indicates that counterclockwise rotation of the virtual reality environment is desired. Preferably, alpha is set to +1.0 degrees. If only the right hand height is greater than head height (bh), then rotational movement (dry) is set to (-alpha), wherein (-alpha) indicates that clockwise rotation of the virtual reality environment is desired and is preferably -1.0 degrees.
  • Alpha and beta determine the speed of rotation of the virtual reality environment, since the environment may be rotated every iteration based on the loop time of the iteration. Preferably, the system operates at 10 Hz, meaning that 10 iterations occur per second.
  • the process further comprises a step 214 of setting rotational movement (drx) to a dummy variable (-beta) and a variable (bending) to "TRUE" if the current height (ph) is less than 80% of the current head height (bh), wherein (-beta) indicates rotation of the virtual reality environment to see the ground in front of the user, and (bending) indicates that the user is bending.
  • (-beta) is set to -2.0 degrees.
  • the process comprises a final step 216 of implementing displacement of the virtual reality environment by (vx), (vz), and (dy), and implementing rotation of the virtual reality environment by (drx) and (dry) on the video display screen. After step 216, the process returns to step 206 to determine if the user is still visible.
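Steps 208 through 216 can be summarized as a small control loop. In the sketch below, the velocity rule vx = k(px - bx), vz = k(pz - bz) is an inference from the surrounding steps (the text states only that k lies between 0.3 and 1.0, and no formula for step 210 survives in this excerpt); the one-degree rotation steps, the 2.0-degree bending pitch, the 80% bending threshold, and the 10 Hz loop rate are taken from the text, and the deltay value is a placeholder for the unspecified dummy variable.

```python
K_GAIN = 0.5       # the constant k; the text gives a range of 0.3 to 1.0
ALPHA = 1.0        # degrees per iteration; +alpha = counterclockwise (left hand raised)
BETA = 2.0         # degrees per iteration; -beta pitches the view toward the ground
DELTAY = 0.1       # illustrative stand-in for the dummy variable (deltay)

def navigation_step(bx, bz, bh, px, pz, ph, left_up, right_up):
    """One iteration of the FIG. 7 loop (steps 208-216), nominally run at 10 Hz.

    (bx, bz) and (bh): baseline foot position and head height from step 204;
    (px, pz, ph): current foot position and height;
    left_up / right_up: whether each hand is currently above head height (bh)."""
    # Inferred form of step 210: velocity proportional to the displacement of
    # the current foot position from the baseline (only the gain k is given).
    vx = K_GAIN * (px - bx)
    vz = K_GAIN * (pz - bz)
    # Both hands above the head drive the vertical displacement variable (dy).
    dy = DELTAY if (left_up and right_up) else 0.0
    # Step 212: a single raised hand selects the rotation direction (dry).
    if left_up and not right_up:
        dry = ALPHA        # counterclockwise rotation of the environment
    elif right_up and not left_up:
        dry = -ALPHA       # clockwise rotation
    else:
        dry = 0.0
    # Step 214: bending detected when current height falls below 80% of baseline.
    bending = ph < 0.8 * bh
    drx = -BETA if bending else 0.0   # rotate to show the ground in front of the user
    # Step 216 would apply (vx, vz, dy) as displacement and (drx, dry) as rotation.
    return vx, vz, dy, dry, drx, bending
```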
  • the invention will be further clarified by the following examples of application programs capable of use with the system and method for constructing three-dimensional images using camera-based inputs of the present invention.
  • the application programs are intended to be purely exemplary of the uses of the preferred embodiments of the present invention, and are not intended to limit the scope of the broad features of the invention.
  • the preferred embodiments of the present invention can be used with any application requiring calculation of a three-dimensional position of a user so that the user may manipulate graphical computer-generated objects in three dimensions.
  • Examples of application programs include an information wall for multiple-user interaction, a telecommunications application, a gesture- based remote control, and a home exercise and entertainment application.
  • An information wall application program may use the system and method for navigating through virtual reality of the present invention.
  • the information wall comprises a large, notice board-like display screen with which multiple users can interact, providing a highly intuitive and interactive information system.
  • Such an application is preferably deployed in shopping malls, museums, libraries, galleries, and other similar environments. For example, in a shopping mall the information wall would allow shoppers entering the mall to simply stand within a certain distance of the wall to activate it.
  • the information wall displays an overall map of the mall at the position and height of the person standing in front of it. A number of information icons are displayed around the map from which the shopper can select by pointing.
  • the information wall displays various pieces of information, such as, the location of certain stores and rest-rooms, and so forth.
  • the information wall may also support forms of advertising. For example, by pointing at a store on the map, the shopper could display a short video sequence describing the products and services offered by the store.
  • the information wall may also permit the display to follow the user as he/she walks along its length, pointing in the correct direction to enable a shopper to get where he/she wants to go.
  • the preferred embodiments of the present invention may also be used with telecommunications applications.
  • bandwidth problems prevent consumer telecommunications via video.
  • users can communicate via a shared virtual reality world, rather than via their actual environments. Only video of the user silhouette needs to be transmitted and shown in the virtual environment, wherein the preferred embodiments of the present invention extract the user silhouette. This approach could be simplified even more by showing the users with computer-generated bodies (in the correct position and pose, since the present invention can determine both) and transmitting only video of the head region.
  • Multi-user video conferencing may also be aided by the present invention.
  • a user needs to pan and zoom the camera from user to user of a teleconference.
  • the present invention could be used as part of a commercial teleconferencing system where the camera can be controlled by the gestures of the participants in the teleconference. For example, pointing at a participant causes the camera to focus on that participant, raising your hand attracts the camera to focus on you, etc.
  • the preferred embodiments of the present invention could also be used as part of the infrastructure of an integrated home entertainment and communications system, replacing the functions currently provided by a remote control unit.
  • the user's position within the room, as well as user body pose and gestures could all be accessed by the present invention.
  • Pointing at a CD player could display the controls for the CD player on the television, and pointing at menu items on the television could select those items.
  • the position of the user could be used to determine which television is employed. If there is more than one user, it is also conceivable that the present invention could enable separate commands issued by different users, or construct a hierarchy of authority for the different commands.
  • a conventional remote control could be used with the present invention, wherein the present invention simplifies the functionality of the remote control, e.g., so that it has only four buttons.
  • a user could point the remote control at the CD player (or stand adjacent thereto), and the remote control would function as a CD player remote.
  • the user could sit in front of the television and the remote control would function as a channel changer.
  • the remote control could be used to establish a hierarchy of authority wherein the preferred embodiments of the present invention will respond only to the user holding the remote control.
  • the preferred embodiments of the present invention could also be used to support home exercise CD ROM programs, wherein the user buys his/her own celebrity trainer.
  • the present invention would provide the location of the user in a room to the home exercise program so that the trainer will always look in the direction of the user.
  • the present invention could also determine when the user stops exercising in the middle of an exercise, so that the trainer can recommend an alternate exercise regimen. It is also possible for the trainer to critique the way a user is exercising and offer helpful information.
  • An additional feature of the home exercise application would be to combine video input of the user with the graphically-generated image of the trainer and display both on a television.
  • Such a feature gives the user the advantage of seeing himself/herself in action, and permits the trainer to point at or touch portions of the video image of the user so as to impart advice, e.g., "lift your leg higher."
  • Other exercise applications such as "virtual" jogging, educational applications such as learning to dance in virtual reality, and entertainment applications such as interactive games may also be used with the preferred embodiments of the present invention.

Abstract

A system and method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of a system user. The system comprises a computer-readable memory, a video camera for generating video signals indicative of the gestures of the system user and an interaction area surrounding the system user, and a video image display. The video image display is positioned in front of the system user. The system further comprises a microprocessor for processing the video signals, in accordance with a program stored in the computer-readable memory, to determine the three-dimensional positions of the body and principle body parts of the system user. The microprocessor constructs three-dimensional images of the system user and interaction area on the video image display based upon the three-dimensional positions of the body and principle body parts of the system user. The video image display shows three-dimensional graphical objects within the virtual reality environment, and movement by the system user permits apparent movement of the three-dimensional objects displayed on the video image display so that the system user appears to move throughout the virtual reality environment.

Description

System and method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs.
BACKGROUND OF THE INVENTION
A. Field of the invention
The present invention relates generally to multimedia and virtual reality applications, and, more particularly to a system and method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs.
B. Description of the Related Art
Multimedia and virtual reality applications permit exciting interaction between a user and a computer. Unfortunately, current computer/user interfaces present a barrier to simplistic user interactivity and thus, consumer acceptance of multimedia and virtual reality applications. Ideally, computer/user interfaces would combine an intuitive interaction format with a broad range of interaction capabilities. Practically, however, these two features conflict. For example, a computer keyboard offers broad interaction capabilities but is not intuitive, whereas a television remote control is more intuitive but offers limited interaction capabilities. Even more flexible interfaces, such as an instrumented body suit, can be both cumbersome and expensive.
In virtual reality applications, the two primary computer/user interface approaches to displaying and interacting with the virtual reality environment comprise an immersive approach and a non-immersive approach. In the immersive approach, the user wears a head-mounted display, as well as tracking devices attached to the head and one or more limbs. A computer displays a virtual reality environment on the head-mounted display by displaying synthetic visual images to the user's eyes, and changes the images based upon the information received from the head tracking device. The limb tracking devices permit user interaction with the virtual reality environment. The immersive approach provides the advantage of giving the user the impression of being "in" the virtual reality environment. Unfortunately, the immersive approach provides the disadvantage of isolating the user from his/her real environment, leaving the user unaware of health and safety dangers in the real environment. In the non-immersive approach, the computer displays visual images of the virtual environment on a stationary display, such as a computer monitor or television screen. The user changes the perspective of the visual images using a computer keyboard, mouse, joystick or other similar devices as an interface with the computer. The non-immersive approach provides the advantage of allowing the user to interact with the virtual reality environment without isolating the user from his/her real environment. However, the non-immersive approach fails to give the user the impression of being "in" the virtual reality environment.
A number of approaches to computer/user interface design have been suggested to address the trade-offs between the immersive and non-immersive approaches. One approach uses a video camera in a non-invasive way to measure the gestures of a system user, so to control the images displayed to the system user. As shown in FIG. 1, such an interface system 10 comprises a blue wall 12, in front of which a user 14 stands, permitting two-dimensional silhouette extraction of user 14 and chromakeying of the silhouette. System 10 further includes a video camera 16 for identifying the two-dimensional, user silhouette and for producing a video signal. A microprocessor 18 of a computer identifies the two-dimensional, user silhouette seen by video camera 16, but only as a two-dimensional shape. Thus, motions of user 14 are only understood by microprocessor 18 in terms of the changing image coordinates of the silhouette. Microprocessor 18 displays an image of user 14 on a television display 20. The image displayed on television 20 consists of a two-dimensional scene into which the user's image has been chromakeyed. User 14 can interact with the displayed scene by adopting a specific pose, e.g., hands-over-head, or by moving so that a portion of the user's silhouette touches a designated set of image coordinates making it appear as if user 14 touched a displayed object. The interface system shown in FIG. 1 provides an easy-to-use, inexpensive interface with multimedia and virtual reality applications. However, the interface system only permits two-dimensional interaction with computer-displayed objects, restricting the capabilities of the interface to two dimensions. For example, in the two-dimensional system of FIG. 1, all of the computer-displayed objects are at the same depth in the window surrounding the user's silhouette.
As seen in FIG. 2, a conventional two-dimensional silhouette extraction process used by the system shown in FIG. 1 comprises both a hardware process (above the dashed line) and a software process (below the dashed line), wherein computer microprocessor 18 performs the software process steps. The hardware process involves a step 22 of inputting an analog video camera signal, followed by a step 24 of digitizing the analog camera signal to produce a gray-scale binary data signal. The hardware process further comprises a step 26 of adjusting the resolution (high or low) of the video camera, and a step 28 of restricting the camera view to a window of the image of interest, i.e., the user's image. The hardware process next comprises a dynamic threshold step 30 where the gray-scale binary data signal is converted into digital binary data, e.g., "1" or "0." At step 32, the hardware process determines the edges (silhouette) of the user's image, and, based on the edge data, adjusts the picture size (step 34) so as to adjust the resolution accordingly at step 26.
The software process involves a first step 36 of subtracting the background from the edge data of step 34, leaving only an image contour of the user's image. The background is a picture of an empty scene as seen by the camera, and is provided at step 38. The software further comprises a step 40 of joining together all of the edge data of the user's image, providing a single contour around the user's image. The software process also comprises an identification step 42 for determining whether the user image contour represents a person, an animal, etc., and a silhouette feature step 44 for identifying the silhouette features (in x, y coordinates) of the user, e.g., head, hands, feet, arms, legs, etc. At step 46, the software process utilizes the contour identification data in order to calculate a bounding box around the user. The bounding box data is provided to the window restricting step 28 for restricting the size of the camera window around the user, thus increasing the speed of the extraction process.
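For illustration, the software half of this pipeline (steps 36 through 46) maps naturally onto a few OpenCV calls. The following is a minimal sketch, not the patent's implementation: it assumes OpenCV 4's findContours signature, 8-bit grayscale frames, a pre-captured empty-scene background for step 38, and an illustrative threshold value; taking the largest contour as the user stands in for the person/animal identification of step 42.

```python
import cv2
import numpy as np

def extract_silhouette(frame_gray, background_gray, threshold=30):
    """Sketch of the FIG. 2 software steps: background subtraction (steps 36/38),
    contour joining (step 40), and bounding-box calculation (step 46).
    Both inputs are 8-bit grayscale images of identical size."""
    # Steps 36/38: subtract the empty-scene background, keeping only the user.
    diff = cv2.absdiff(frame_gray, background_gray)
    # Analogue of the dynamic threshold of step 30: binarize the difference.
    _, mask = cv2.threshold(diff, threshold, 255, cv2.THRESH_BINARY)
    # Step 40: join edge fragments into closed contours (OpenCV 4 signature).
    contours, _ = cv2.findContours(mask, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    if not contours:
        return None, None
    user_contour = max(contours, key=cv2.contourArea)  # assume the largest blob is the user
    # Step 46: bounding box, fed back to restrict the camera window (step 28).
    bbox = cv2.boundingRect(user_contour)  # (x, y, w, h) in image coordinates
    return user_contour, bbox
```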
An alternative approach, proposed by the Media Lab at the Massachusetts Institute of Technology ("MIT"), allows a user to interact with a computer-generated graphical world by using camera-based body motions and gestures of a system user. Such a system, while being amongst the most versatile of its kind currently available, suffers from the following problems: (1) it is based on a standard graphical interface ("SGI") platform; (2) it is sensitive to lighting conditions around the system user; (3) although it tracks the user's foot position in three dimensions, it treats the remainder of the user's body as a two-dimensional object; (4) it is limited to a single user; (5) it provides too coarse of resolution to see user hand details such as fingers; and (6) it is tied to only the "magic mirror" interactive video environment ("IVE") paradigm, described below. Thus, the alternative approach suffers from the same limitations encountered by the conventional two-dimensional approach, as well as many other problems.
Still another approach includes a method for real-time recognition of a human image, as disclosed in Japanese Patent Abstract Publication No. 07-038873 ("JP 07-038873"). JP 07-038873 describes three-dimensional graphical generation of a person that detects the expression, rotation of the head, motion of the fingers, and rotation of the human body. However, JP 07-038873 is limited to graphical model generation of the human body. Furthermore, JP 07-038873 focuses on using three-dimensional graphical animation of a user primarily for teleconferencing purposes, wherein the user cannot control objects in a computer-generated scene. Finally, the reference discloses using three-dimensional animation of a remote user for teleconferencing purposes, as opposed to a three-dimensional animation of a local user.
A final approach, as found in International Patent Application (PCT) WO 96/21321 ("PCT 96/21321"), consists of creating a three-dimensional simulation of an event (e.g., a football game), in real-time or storing it on a CD ROM, using cameras and microphones. The system disclosed in PCT 96/21321, however, merely replays three-dimensional scenes of the event as they are viewed by the cameras. Furthermore, users of the PCT 96/21321 system can only change their perspective of the three-dimensional scenes and are unable to control objects in the scenes.
Unfortunately, none of these proposed approaches described above provides a computer/user interface that combines an intuitive interaction format with a broad range of interaction capabilities.
SUMMARY OF THE INVENTION
An object of the present invention is to address the problems encountered by the two-dimensional interface systems and the alternative approaches proposed by the Media Lab at the Massachusetts Institute of Technology and the other related art discussed above. Another object is to provide a three-dimensional interface for computer users that integrates the advantages of both the immersive and non-immersive approaches to displaying and interacting with a virtual reality environment.
A further object is to provide a three-dimensional display of computer-generated objects so that the objects occupy the three-dimensional space in a virtual reality environment around the computer users and the computer users navigate through and control the objects in the virtual reality environment through normal body movements.
Additional objects and advantages of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objects and advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the appended claims.
To achieve the objects and in accordance with the purpose of the invention, as embodied and broadly described herein, the invention comprises a system for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of a user of the system, including: a computer-readable memory means; means for generating video signals indicative of the gestures of the system user and an interaction area surrounding the system user; means for displaying video images, the video image display means being positioned in front of the system user; and means for processing the video signals, in accordance with a program stored in the computer-readable memory means, to determine the three-dimensional positions of the body and principal body parts of the system user, wherein the video signal processing means constructs three-dimensional images of the system user and interaction area on the video image display means based upon the three-dimensional positions of the body and principal body parts of the system user, the video image display means displays three-dimensional graphical objects within the virtual reality environment, and movement by the system user permits apparent movement of the three-dimensional objects displayed on the video image display means so that the system user appears to move throughout the virtual reality environment.
To further achieve the objects, the present invention comprises a method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of a user of a computer system having a computer-readable memory and video image display connected to a microprocessor using a program stored in the computer-readable memory, the method comprising the steps of: generating video signals indicative of the gestures of the system user and an interaction area surrounding the system user; processing the video signals in the microprocessor to determine the three-dimensional positions of the body and principal body parts of the system user; using the microprocessor to construct three-dimensional images of the system user and interaction area on the video image display based upon the three-dimensional positions of the body and principal body parts of the system user; and utilizing the microprocessor to display on the video image display three-dimensional graphical objects within the virtual reality environment, wherein movement by the system user permits apparent movement by the three-dimensional objects displayed on the video image display so that the system user appears to move throughout the virtual reality environment.
The objects shown on the video image display will move to different positions on the video image display and to different positions relative to one another, by an amount dependent on the distance that the object is supposed to be away from the video image display screen. This movement corresponds to a shift in the intersection of the screen with a line of sight from the head of the user to the object at its modeled position in the virtual reality environment. This shift is computed for each object from the detected change in position of the head.
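As a minimal sketch of this head-parallax computation, assume the screen lies in the plane z = 0, the user's head is tracked at z > 0 in front of it, and each object is modeled at z < 0 behind it; the function name and coordinate values below are hypothetical.

    def screen_intersection(head, obj):
        """Return the (x, y) point where the line of sight from the head
        to the object's modeled position crosses the screen plane z = 0."""
        hx, hy, hz = head
        ox, oy, oz = obj
        t = hz / (hz - oz)  # parameter at which the line reaches z = 0
        return (hx + t * (ox - hx), hy + t * (oy - hy))

    # When the detected head position changes, each object is redrawn at
    # its new intersection; the shift grows with the object's modeled
    # distance behind the screen.
    old = screen_intersection((0.0, 1.7, 2.0), (0.5, 1.0, -3.0))
    new = screen_intersection((0.3, 1.7, 2.0), (0.5, 1.0, -3.0))
    shift = (new[0] - old[0], new[1] - old[1])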
To still further achieve the objects, the present invention comprises a computer-readable memory device for storing a program that permits three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of a user of a computer system having a video image display connected to a microprocessor using instructions stored in the computer-readable memory device, the computer-readable memory device comprising: instructions for processing video signals indicative of gestures of the system user to determine the three-dimensional positions of the body and principal body parts of the system user; instructions for constructing three-dimensional images of the system user and interaction area on the video image display based upon the three-dimensional positions of the body and principal body parts of the system user; and instructions for displaying, on the video image display, three-dimensional graphical objects within the virtual reality environment, wherein movement by the system user permits apparent movement by the three-dimensional objects displayed on the video image display so that the system user appears to move throughout the virtual reality environment.

To even further achieve the objects, the present invention comprises a computer program product for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of a user of a computer system having a video image display connected to a microprocessor, the computer program product comprising: means for processing video signals indicative of gestures of the system user to determine the three-dimensional positions of the body and principal body parts of the system user; means for constructing three-dimensional images of the system user and interaction area on the video image display based upon the three-dimensional positions of the body and principal body parts of the system user; and means for displaying, on the video image display, three-dimensional graphical objects within the virtual reality environment, wherein movement by the system user permits apparent movement by the three-dimensional objects displayed on the video image display so that the system user appears to move throughout the virtual reality environment.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.

BRIEF DESCRIPTION OF THE DRAWINGS
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate several embodiments of the invention and, together with the description, serve to explain the principles of the invention. In the drawings:

FIG. 1 is a block diagram of a conventional system for constructing two-dimensional images using camera-based silhouettes of users;
FIG. 2 is a flowchart showing the steps involved in a conventional software process for extracting two-dimensional images using silhouettes of users;
FIG. 3 is a block diagram of a system for constructing three-dimensional images using camera-based gesture inputs of users in accordance with a preferred embodiment of the present invention;
FIG. 4 is a block diagram of a system for constructing three-dimensional images using camera-based gesture inputs of users in accordance with another preferred embodiment of the present invention;

FIG. 5 is a flowchart showing the steps involved in a software process for mapping two-dimensional image features of users and an interactive area onto three-dimensional locations within the interactive area in accordance with the preferred embodiments of the present invention shown in FIGs. 3 and 4;
FIG. 6 is a block diagram showing a system for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of the present invention using the system for constructing three-dimensional images shown in FIG. 3;
FIG. 7 is a flowchart showing the steps involved in a software process for use with the system for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of the present invention shown in FIG. 6; and

FIG. 8 is a biometric data table showing the length of body parts as a ratio of the body height (H), wherein the body height (H) is the height of a standing person.
DESCRIPTION OF THE PREFERRED EMBODIMENTS
Reference will now be made in detail to the present preferred embodiments of the invention, examples of which are illustrated in the accompanying drawings. Wherever possible, the same reference numbers will be used throughout the drawings to refer to the same or like parts.
In accordance with the preferred embodiments, the present invention includes a system and method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of system users. The system comprises a computer-readable memory means, means for generating video signals indicative of the gestures of the system users and an interaction area surrounding the system users, and means for displaying video images. The video image display means is positioned in front of the system users. The system further comprises means for processing the video signals, in accordance with a program stored in the computer-readable memory means, to determine the three-dimensional positions of the bodies and principal body parts of the system users, wherein the video signal processing means constructs three-dimensional images of the system users and interaction area on the video image display means based upon the three-dimensional positions of the bodies and principal body parts of the system users, the video image display means displays three-dimensional graphical objects within the virtual reality environment, and movement by the system users causes apparent movement of the three-dimensional objects displayed on the video image display means so that the system users appear to move throughout the virtual reality environment.

In other words, the present invention is drawn to a natural and intuitive computer/user interface based upon computer vision interaction by system users. As used herein, "computer vision" is the use of a computer to interpret information received from a video imaging device in terms of what objects the imaging device sees. Computer vision permits recognition of user gestures, body motions, head motions, eye motions, etc. The recognized user motions, in turn, are used to interact with multimedia and virtual reality applications. Specifically, the present invention takes the system users' silhouettes in two-dimensional image coordinates and projects them into the three-dimensional image coordinates the system users occupy in the interaction area.
Such an approach places few demands upon the system users and is inexpensive to implement. If the system users are interacting via a large-screen display, the computer knows where the system users are looking and pointing, etc., and manipulates the information on the display accordingly. Further, the position and pose of the system users in front of the display screen are extracted and used for interaction with a three-dimensional graphical model, i.e., a virtual reality environment. The addition of gesture interpretation to the computer vision system of the present invention adds realism to the interaction with the computer. For example, intuitive hand gestures may be used as an interface with the computer system to permit navigation through the virtual reality environment.
Rather than relying on conventional SGI-based software, the present invention utilizes a camera-based user interface system 50, as best shown in FIG. 3. System 50 comprises a video camera 56, a video display screen 54, and a computer 58 having a Philips® single board image processor (SBIP) 60. SBIP 60 eliminates problems (1)-(3) encountered in the approach proposed by the Media Lab at the Massachusetts Institute of Technology, and thus, also encountered in the two-dimensional systems. Computer 58 also comprises a computer-readable memory 66 encoded with three-dimensional imaging software. SBIP 60 utilizes the software so that system 50 may handle the three-dimensional body motions of the system user. The three-dimensional imaging software of the present invention corrects problems (4) and (5) encountered in the approach proposed by the Media Lab at the Massachusetts Institute of Technology. To address problem (6) of the Media Lab approach, the present invention provides an interactive video environment ("IVE") capable of evaluating several IVE paradigms other than the "magic mirror" paradigm proposed by the Massachusetts Institute of Technology. The present invention is capable of evaluating the following four IVE paradigms: (1) the display shows live video of a camera input of a remote site, and the video camera input of the system users is composited with the live video on the display (known as the "mirror" effect, as in the MIT approach); (2) the display shows live video of the remote site, and the camera input of the users is not composited with the live video on the display (known as the "window" effect); (3) the display shows graphical images as in virtual reality, and the camera input of the system users is composited with the graphical images on the display; and (4) the display shows graphical images, and the camera input of the system users is not composited with the graphical images on the display.
A. Detailed Description Of The System Hardware Of The Preferred Embodiments
As embodied herein, the system and method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of the preferred embodiment of the present invention is shown in FIG. 3. Specifically, system 50 comprises a means for processing video signals, such as a computer 58, electrically coupled to a means for generating video signals, such as a video camera 56. Computer 58 is electrically coupled to a means for displaying video images, such as a video display screen 54. Preferably, video display screen 54 is located in front of an interaction area 52 where system users 62 stand. Video camera 56 electronically reads the images of users 62 and interactive area 52, creates video signals corresponding to these images, and provides the video signals to computer 58. Preferably, computer 58 comprises a typical microprocessor-based computing device such as an IBM-compatible personal computer. Computer 58 further comprises a serial port 64 connected to a microprocessor 60 for receiving the video signals from video camera 56, and a conventional computer-readable memory 66 capable of being encoded with software programs. Microprocessor 60 preferably is a Philips® single board image processor (SBIP). SBIP 60 uses the software (described below), encoded in computer memory 66, for mapping the two-dimensional image features of users 62 and interactive area 52 and calculating the three-dimensional position of system users 62 within interactive area 52. SBIP 60 also may use an application program permanently encoded within computer-readable memory 66, or temporarily encoded therein via an external computer-readable memory, such as for example, a floppy diskette or a CD ROM. Computer 58 further comprises a mode control port 68, connected to SBIP 60 for receiving data from other interactive controls such as a remote control, and a video processor port 70 for delivering video signals to video display screen 54. The software encoded in computer-readable memory 66, and used by SBIP 60, isolates the contours of the system users, determines their body and limb positions in three-dimensional image space, and generates a video signal corresponding to the body and limb position to video display screen 54.
Display screen 54 preferably consists of a conventional audio/visual monitor system capable of displaying three-dimensional graphical information. The type of display screen 54 and video camera 56 used in the present invention is arbitrary and may be chosen based only upon the intended application of the system of the present invention.
In a more preferred embodiment of the system for constructing three-dimensional images using camera-based gesture inputs of the preferred embodiment, video display screen 54 is a rear-projection Ikegami TPP 1000/1500® projector with a Retroscan RS125SW® screen (six feet in height in the y direction and eight feet in width in the x direction); interaction area 52 is an eleven feet (in the z direction) by twelve feet (in the x direction) area in front of video display screen 54; and video camera 56 is a Sony® NTSC video camera.
An alternate embodiment of the system for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of the present invention is shown in FIG. 4. As shown, the alternate embodiment 80 comprises a video camera 56 and computer 58 with SBIP 60 and computer-readable memory 66 similar to those described with reference to FIG. 3. However, the alternate embodiment further comprises a compact disc reader 84 capable of reading an application program encoded on a CD ROM and providing such application program to SBIP 60. The alternate embodiment also comprises a remote controller 86 for controlling features of the application program. Furthermore, in contrast to the display screen of the embodiment shown in FIG. 3, the alternate embodiment includes a conventional television display 82 capable of receiving video signals from SBIP 60 and transmitting information to SBIP 60.
B. Description Of The System Software Of The Preferred Embodiments
In accordance with the preferred embodiments of the present invention, the software for mapping two-dimensional image features of system users and an interactive area onto three-dimensional locations within the interactive area, as well as the applications programs for use with the preferred embodiments, will now be described. For ease of reference, the software and applications programs are described with reference to a single system user. However, it is to be understood that the camera-based gesture recognition technology of the present invention can be used with multiple users by identifying each user individually and interacting with each user accordingly.
FIG. 5 is a flowchart showing the steps involved in the software process for mapping two-dimensional image features of a system user 62 onto three-dimensional locations in the room where system user 62 is located in accordance with the preferred embodiments of the present invention shown in FIGs. 3 and 4. The three-dimensional imaging software may be permanently encoded within computer-readable memory 66 or may be temporarily encoded in memory 66 via a diskette, CD ROM, or similar memory storage means. As shown, the three-dimensional imaging software process comprises a first step 100 of extracting the two-dimensional head, hands and feet features of a user in image space coordinates (u, v) using the two-dimensional extraction process shown in FIG. 2. Coordinates (u, v) correspond to the two-dimensional x-y plane of the user in front of video camera 56. The three-dimensional imaging process further comprises a step 102 of reading the orientation and location of video camera 56 in three-dimensional coordinates (x, y, z) with respect to the room. Assuming the user's feet are on the floor, at step 104 the software process projects the two-dimensional, extracted features of the user's feet to three-dimensional coordinates (x, y, z) of the user's feet, using the (x, y, z) orientation of camera 56 with respect to the room. At step 106, the software process projects the two-dimensional, extracted features of the user's head and hands to three-dimensional coordinates (x, y, z) of the user's head and hands, assuming that the head and hands are slightly offset from the position of the feet in the z direction and using the (x, y, z) orientation of camera 56 with respect to the room. The three-dimensional imaging process further comprises a step 108 of using the measured height (h) of the user to access a biometric data table (shown at step 110) indexed by height and stored within computer-readable memory 66. An example of a biometric data table capable of use with the present invention is shown in FIG. 8. The present invention is not limited by the biometric data shown in FIG. 8, since other biometric data may be utilized as set forth in D. Chaffin & G. Andersson, Occupational Biomechanics, 2d ed. (1991), L. Farkas, Anthropometry of the Head and Face, 2d ed. (1994), and N.A.S.A., Anthropometric Source Book, vols. 1-3 (1978). The three-dimensional imaging process assumes that the user's shoulders are offset from the top of the user's head to the bottom of the user's neck by a distance of 0.182h in the y direction, and that the shoulder width from the center of the user's back to the end of the shoulder blade is 0.129h in the x direction, wherein h is the user's height. The imaging process further assumes that the user's arm length is 0.44h, and utilizes the assumed arm length (0.44h) until a measured arm length greater than 0.44h is extracted by video camera 56. The software process further comprises a step 112 of calculating each arm's offset in the z direction from the corresponding foot, using the assumed arm length (0.44h) calculated in step 108. At step 112, each arm's actual length in the z direction is calculated from the assumed arm length using the principle of foreshortening. The software process comprises a final step 114 of supplying the three-dimensional positions of the user's head, hands and feet to an application program.
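The biometric portion of this process (steps 108 and 112) can be illustrated with a short, hedged sketch. The ratios follow the values quoted above; the helper names are hypothetical, and the treatment of step 112 as recovering the hand's z offset by foreshortening is one plausible reading, not the patent's literal code.

    import math

    ARM_RATIO = 0.44       # assumed arm length as a fraction of height h
    SHOULDER_DROP = 0.182  # head top to neck bottom, fraction of h (y direction)
    SHOULDER_HALF = 0.129  # back center to shoulder blade, fraction of h (x direction)

    def shoulder_position(head_xyz, h, side=+1):
        """Step 108: place a shoulder from the head position using the
        biometric ratios of FIG. 8 (side = +1 right, -1 left)."""
        x, y, z = head_xyz
        return (x + side * SHOULDER_HALF * h, y - SHOULDER_DROP * h, z)

    def hand_z_offset(h, shoulder_xyz, hand_xy):
        """Step 112: recover the hand's offset in the z direction by
        foreshortening, given the assumed arm length 0.44h and the
        observed x/y displacement of the hand from the shoulder."""
        arm = ARM_RATIO * h
        dx = hand_xy[0] - shoulder_xyz[0]
        dy = hand_xy[1] - shoulder_xyz[1]
        reach_sq = arm * arm - dx * dx - dy * dy
        return math.sqrt(reach_sq) if reach_sq > 0.0 else 0.0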
FIG. 6 is a block diagram showing the system 90 for permitting navigation through a virtual reality environment using camera-based gesture inputs and the system for constructing three-dimensional images shown in FIG. 3. As shown in FIG. 6, a user 62 of the navigation system stands in front of video display screen 54 displaying graphical images 92 of a virtual reality environment 94. Video camera 56 views user 62 and SBIP 60 processes data received from camera 56 by extracting the three-dimensional image of user 62 and by identifying user body motions, such as the three-dimensional positions of the user's head, hands, legs, feet, etc., as described above.
Thus, the system shown in FIG. 6 incorporates the advantages of both the immersive and non-immersive approaches to displaying and interacting with a virtual reality environment. The navigation system of the present invention utilizes the non-immersive concepts of displaying the virtual reality environment on a stationary display, and of not mounting special equipment on the user's body to read body motions. The navigation system of the present invention also utilizes the immersive concept of employing the user's body motions to control the motion of and interaction with the virtual reality environment. A large-screen display is preferably used so that the user sees a life-size rendering of the virtual environment, and the display covers a large portion of the user's field of vision.
The present invention permits navigation through virtual reality environment 94 as follows. When user 62 walks in front of display screen 54, SBIP 60 computes the location where user 62 is standing. By taking a step forward, user 62 causes SBIP 60 to move graphical images 92 on display screen 54 so that user 62 appears to be moving forward through virtual reality environment 94. Similarly, by stepping left or right, user 62 causes SBIP 60 to move graphical images 92 so that user 62 appears to be moving left or right through virtual reality environment 94. SBIP 60 stops the movement of graphical images 92 when user 62 takes a step backwards. If user 62 takes a second step backwards, SBIP 60 moves graphical images 92 in the reverse direction so that user 62 appears to be moving backwards through virtual reality environment 94.
Additional gestures may be used with the navigation system 90 of the present invention to enhance flexibility. For example, if user 62 raises his/her right hand, SBIP 60 causes graphical images 92 to rotate at a fixed rate in a clockwise direction so that the user appears to rotate clockwise in virtual reality environment 94. By lowering his/her right hand, user 62 instructs SBIP 60 to stop rotation, wherein SBIP 60 causes graphical images 92 to cease rotating. User 62 may use his/her left hand in a similar fashion to start or stop apparent counter-clockwise rotation in virtual reality environment 94. Further, if user 62 bends at the waist, SBIP 60 causes graphical images 92 to rotate so that a virtual image of the ground in front of user 62 can be seen. The navigation system 90 of the present invention may also be provided with the following additional gestures: (1) graphical objects could be picked up when user 62 reaches his/her hand in the direction of the graphical objects; (2) the "picked-up" graphical objects could be manipulated by user gestures such as pushing, moving, etc.; and (3) left and right leg motions could be identified to permit user 62 to "walk" through the virtual reality environment.
Finally, the navigation system of the present invention could be extended to handle a team of users, wherein majority rules could be employed to govern motion so that the system goes in the direction indicated by the majority of the users. Alternatively, multi-user interaction could be handled on an individual basis, allowing different users to simultaneously select graphical objects in the virtual reality environment and interact with them. For example, in a video game application, such an arrangement would permit different users to fight one-on-one with a team of virtual opponents.

FIG. 7 is a flowchart showing the steps involved in a software process for use with the system for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of the present invention shown in FIG. 6. The process comprises a step 200 of using the extracted, three-dimensional data of the system user (provided from the process of FIG. 5) to take one-second samples of the user's head and hand positions. The process further comprises a step 202 of determining whether the average height of either hand (average hand height being determined from the one-second samples) is greater than the average height of the user's head (average head height being determined from the one-second samples). If the average height of either hand is not greater than the average height of the head, the process returns to step 200. However, if the average height of either hand is greater than the average height of the head, the process proceeds to step 204 where the user's foot position (bx, bz) and current head position (bh) are measured.
The process further comprises a step 206 for determining whether the user is still visible. If the user is not visible, the process returns to step 200. If the user is visible, the process proceeds to step 208 where the current foot position (px, pz), current hand positions, and current height (ph) are measured, and variables for the movement velocity (vx and vz), the height variable used to determine if the user is bending (dy), and the rotational movement (dry and drx) are set to zero. The process comprises a next step 210 of setting the movement velocity according to the following formulas: (1) vx = k * (bx - px) and (2) vz = k * (bz - pz), wherein (k) is an input to the system that determines the speed with which the user moves around the virtual reality environment, e.g., system movement becomes faster as (k) increases. Preferably, (k) is a value from 0.3 to 1.0. At the next step 212 of the process, the height variable (dy) is set to a dummy variable (deltay) if both hand heights are greater than head height (bh). Otherwise, at step 212, if only the left hand height is greater than head height (bh), then rotational movement (dry) is set to (alpha), wherein (alpha) indicates that counterclockwise rotation of the virtual reality environment is desired. Preferably, (alpha) is set to +1.0 degrees. If only the right hand height is greater than head height (bh), then rotational movement (dry) is set to (-alpha), wherein (-alpha) indicates that clockwise rotation of the virtual reality environment is desired and is preferably -1.0 degrees. Alpha and beta determine the speed of rotation of the virtual reality environment, since the environment may be rotated every iteration based on the loop time of the iteration. Preferably, the system operates at 10 Hz, meaning that 10 iterations occur per second.
The process further comprises a step 214 of setting rotational movement (drx) to a dummy variable (-beta) and a variable (bending) to "TRUE" if the current height (ph) is less than 80% of the current head height (bh), wherein (-beta) indicates rotation of the virtual reality environment to see the ground in front of the user, and (bending) indicates that the user is bending. Preferably, (-beta) is set to -2.0 degrees. However, if (bending) is "TRUE" and the current height (ph) is greater than 80% of the head height (bh), then rotational movement (drx) is set to (beta) and (bending) is set to "FALSE," wherein (beta) indicates rotation opposite to (-beta). Preferably, (beta) is set to +2.0 degrees. The process comprises a final step 216 of implementing displacement of the virtual reality environment by (vx), (vz), and (dy), and implementing rotation of the virtual reality environment by (drx) and (dry) on the video display screen. After step 216, the process returns to step 206 to determine if the user is still visible.
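Condensed into code, one iteration of steps 208 through 216 might look as follows; the dictionary keys, the deltay value, and the surrounding sampling and display calls are assumptions, while the constants take the preferred values given above.

    K = 0.5        # movement gain (k), preferred range 0.3 to 1.0
    ALPHA = 1.0    # rotation step in degrees (alpha)
    BETA = 2.0     # bending rotation step in degrees (beta)
    DELTA_Y = 0.1  # vertical displacement (deltay); value is assumed

    def navigation_step(base, cur, bending):
        """One roughly 10 Hz iteration: base holds (bx, bz, bh) from
        step 204; cur holds the current sample from step 208."""
        vx = K * (base["bx"] - cur["px"])      # step 210: movement velocity
        vz = K * (base["bz"] - cur["pz"])
        dy, dry, drx = 0.0, 0.0, 0.0

        bh = base["bh"]
        if cur["left_hand"] > bh and cur["right_hand"] > bh:
            dy = DELTA_Y                       # step 212: both hands raised
        elif cur["left_hand"] > bh:
            dry = ALPHA                        # counterclockwise rotation
        elif cur["right_hand"] > bh:
            dry = -ALPHA                       # clockwise rotation

        if cur["ph"] < 0.8 * bh and not bending:
            drx, bending = -BETA, True         # step 214: user bends at waist
        elif cur["ph"] >= 0.8 * bh and bending:
            drx, bending = BETA, False         # user straightens up

        # Step 216: the caller displaces the environment by (vx, vz, dy)
        # and rotates it by (drx, dry) on the video display screen.
        return vx, vz, dy, drx, dry, bending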
C. Examples Of Application Programs For Use With The Preferred Embodiments
The invention will be further clarified by the following examples of application programs capable of use with the system and method for constructing three-dimensional images using camera-based inputs of the present invention. The application programs are intended to be purely exemplary of the uses of the preferred embodiments of the present invention, and are not intended to limit the scope of the broad features of the invention. The preferred embodiments of the present invention can be used with any application requiring calculation of a three-dimensional position of a user so that the user may manipulate graphical computer-generated objects in three dimensions. Examples of application programs include an information wall for multiple-user interaction, a telecommunications application, a gesture- based remote control, and a home exercise and entertainment application.
1. Information Wall For Multiple User Interaction

An information wall application program may use the system and method for navigating through virtual reality of the present invention. The information wall comprises a large, notice board-like display screen with which multiple users can interact, providing a highly intuitive and interactive information system. Such an application is preferably deployed in shopping malls, museums, libraries, galleries, and other similar environments. For example, in a shopping mall the information wall would allow shoppers entering the mall to simply stand within a certain distance of the wall to activate it. The information wall then displays an overall map of the mall at the position and height of the person standing in front of it. A number of information icons are displayed around the map from which the shopper can select by pointing. By pointing at the icons, the information wall displays various pieces of information, such as the location of certain stores and rest-rooms, and so forth. The information wall may also support forms of advertising. For example, by pointing at a store on the map, the shopper could display a short video sequence describing the products and services offered by the store. The information wall may also permit the display to follow the user as he/she walks along its length, pointing in the correct direction to enable a shopper to get where he/she wants to go.
2. Telecommunications Applications
The preferred embodiments of the present invention may also be used with telecommunications applications. Currently, bandwidth problems prevent consumer telecommunications via video. With the present invention, users can communicate via a shared virtual reality world, rather than via their actual environments. Only video from the user silhouette needs to be transmitted and shown in the virtual environment, wherein the preferred embodiments of the present invention extract the user silhouette. This approach could be simplified even further by showing the users with computer-generated bodies (in the correct position and pose, since the present invention can determine these) and transmitting only video of the head region.
Multi-user video conferencing may also be aided by the present invention. Currently, a user needs to pan and zoom the camera from user to user of a teleconference. The present invention could be used as part of a commercial teleconferencing system wherein the camera is controlled by the gestures of the participants in the teleconference. For example, pointing at a participant causes the camera to focus on that participant, raising a hand attracts the camera to focus on the user raising it, etc.
3. Gesture-Based Remote Control
The preferred embodiments of the present invention could also be used as part of the infrastructure of an integrated home entertainment and communications system, replacing the functions currently provided by a remote control unit. For example, the user's position within the room, as well as user body pose and gestures, could all be detected by the present invention. Pointing at a CD player could display the controls for the CD player on the television, and pointing at menu items on the television could select those items.
If more than one television (or display) is in the room, the position of the user could be used to determine which television is employed. If there is more than one user, it is also conceivable that the present invention could enable separate commands issued by different users, or construct a hierarchy of authority for the different commands.
Additionally, a conventional remote control could be used with the present invention, wherein the present invention simplifies the functionality of the remote control, e.g., so that it has only four buttons. With the present invention, a user could point the remote control at the CD player (or stand adjacent thereto), and the remote control would function as a CD player remote. Alternatively, the user could sit in front of the television and the remote control would function as a channel changer. Finally, the remote control could be used to establish a hierarchy of authority wherein the preferred embodiments of the present invention respond only to the user holding the remote control.
4. Home Exercise And Entertainment Applications
The preferred embodiments of the present invention could also be used to support home exercise CD ROM programs, wherein the user buys his/her own celebrity trainer. The present invention would provide the location of the user in a room to the home exercise program so that the trainer will always look in the direction of the user. The present invention could also determine when the user stops exercising in the middle of an exercise, so that the trainer can recommend an alternate exercise regimen. It is also possible for the trainer to critique the way a user is exercising and offer helpful information. An additional feature of the home exercise application would be to combine video input of the user with the graphically-generated image of the trainer and display both on a television. Such a feature gives users the advantage of seeing themselves in action, and permits the trainer to point to or touch portions of the video image of the user so as to impart advice, e.g., lift your leg higher. Other exercise applications such as "virtual" jogging, educational applications such as learning to dance in virtual reality, and entertainment applications such as interactive games may also be used with the preferred embodiments of the present invention.
It will be apparent to those skilled in the art that various modifications and variations can be made in the system and method for constructing three-dimensional images using camera-based gesture inputs of the present invention and in construction of this system without departing from the scope or spirit of the invention. As an example, the system and method could be used with other application programs which require three-dimensional construction of images and users, and require interaction between the users and three-dimensional images. Further, CD reader 84 and remote 86 of the system shown in FIG. 4 may be used with the system shown in FIG. 3. Finally, audio features may be incorporated into the preferred embodiments to provide voice-recognized commands from the system user and sound effects to the display screen.
Other embodiments of the invention will be apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the invention being indicated by the following claims.


CLAIMS:
1. A system for permitting three-dimensional navigation through a virtual reality environment (94) using camera-based gesture inputs of a user (62) of the system, comprising: a computer-readable memory means (66); means for generating video signals (56, 58) indicative of the gestures of the system user (62) and an interaction area (52) surrounding the system user (62); means for displaying video images (54, 82), the video image display means being positioned in front of the system user (62); and means for processing the video signals (58), in accordance with a program stored in the computer-readable memory means (66), to determine the three-dimensional positions of the body and principal body parts of the system user (62), wherein the video signal processing means (58) constructs three-dimensional images of the system user (62) and interaction area (52) on the video image display means (54, 82) based upon the three-dimensional positions of the body and principal body parts of the system user (62), the video image display means (54, 82) displays three-dimensional graphical objects within the virtual reality environment (94), and movement by the system user (62) permits apparent movement of the three-dimensional objects displayed on the video image display means (54, 82) so that the system user (62) appears to move throughout the virtual reality environment (94).
2. A system for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of a user as recited in claim 1, wherein: when the system detects that the system user steps forward, the video signal processing means moves the three-dimensional graphical objects on the video image display means so that the system user appears to be moving forward through the virtual reality environment; and when the system user steps left or right, the video signal processing means moves the three-dimensional graphical objects on the video image display means so that the system user appears to be moving left or right through the virtual reality environment.
3. A system as claimed in claim 2, wherein when the system user takes a first step backwards, the video signal processing means stops the movement of the three-dimensional graphical objects on the video image display means; and when the system user takes a second step backwards, the video signal processing means moves the three-dimensional graphical objects on the video image display means so that the system user appears to be moving backwards through the virtual reality environment.
4. A system for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of a user as recited in claim 1, wherein when the system user raises a first hand, the video signal processing means rotates the three-dimensional graphical objects on the video image display means in a first direction so that the system user appears to be rotating in the first direction through the virtual reality environment; and when the system user lowers the first hand, the rotation stops.
5. A system for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of a user as recited in claim 1, wherein when the system user bends at the waist, the video signal processing means rotates the three-dimensional graphical objects on the video image display means so that three-dimensional graphical objects in front of the system user can be seen by the system user.
6. A method for permitting three-dimensional navigation through a virtual reality environment (94) using camera-based gesture inputs of a user (62) of a computer system having a computer-readable memory (66) and video image display (54, 82) connected to a microprocessor (58) using a program stored in the computer-readable memory (66), the method comprising the steps of: generating video signals indicative of the gestures of the system user (62) and an interaction area (52) surrounding the system user (62); processing the video signals in the microprocessor (58) to determine the three-dimensional positions of the body and principal body parts of the system user (62); using the microprocessor (58) to construct three-dimensional images of the system user (62) and interaction area (52) on the video image display (54, 82) based upon the three-dimensional positions of the body and principal body parts of the system user (62); and utilizing the microprocessor (58) to display on the video image display (54, 82) three-dimensional graphical objects within the virtual reality environment, wherein movement by the system user (62) permits apparent movement by the three-dimensional objects displayed on the video image display (54, 82) so that the system user appears to move throughout the virtual reality environment (94).
7. A method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of a user of a computer system, as recited in claim 6, wherein: when the system user steps forward, the video signal processing means moves the three-dimensional graphical objects on the video image display means so that the system user appears to be moving forward through the virtual reality environment; and when the system user steps left or right, the video signal processing means moves the three-dimensional graphical objects on the video image display means so that the system user appears to be moving left or right through the virtual reality environment.
8. A method as claimed in claim 7, wherein when the system user takes a first step backwards, the video signal processing means stops the movement of the three-dimensional graphical objects on the video image display means; and when the system user takes a second step backwards, the video signal processing means moves the three-dimensional graphical objects on the video image display means in a reverse direction so that the system user appears to be moving backwards through the virtual reality environment.
9. A method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs of a user of a computer system, as recited in claim 6, wherein when the system user raises a first hand, the video signal processing means rotates the three-dimensional graphical objects on the video image display means in a first direction so that the system user appears to be rotating in said first direction through the virtual reality environment; and when the system user lowers the first hand, the rotation stops.
10. A computer readable memory containing a program for executing the method of any one of claims 6 to 9 on a computer system.
PCT/IB1998/001965 1997-12-23 1998-12-07 System and method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs WO1999034327A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP98955857A EP0976106A2 (en) 1997-12-23 1998-12-07 System and method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs
JP53468399A JP2001517344A (en) 1997-12-23 1998-12-07 System and method for admitting three-dimensional navigation through a virtual reality environment using camera-based gesture input

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US08/996,678 US6181343B1 (en) 1997-12-23 1997-12-23 System and method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs
US08/996,678 1997-12-23

Publications (2)

Publication Number Publication Date
WO1999034327A2 true WO1999034327A2 (en) 1999-07-08
WO1999034327A3 WO1999034327A3 (en) 1999-09-10

Family

ID=25543182

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB1998/001965 WO1999034327A2 (en) 1997-12-23 1998-12-07 System and method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs

Country Status (5)

Country Link
US (1) US6181343B1 (en)
EP (1) EP0976106A2 (en)
JP (1) JP2001517344A (en)
KR (1) KR20000075608A (en)
WO (1) WO1999034327A2 (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1358632A1 (en) * 2001-01-22 2003-11-05 France Telecom Sa Avatar animating method and device for communicating in a non-immersive collaborative virtual environment
GB2470072A (en) * 2009-05-08 2010-11-10 Sony Comp Entertainment Europe Virtual object movement in response to real object movement
FR2960986A1 (en) * 2010-06-04 2011-12-09 Thomson Licensing METHOD FOR SELECTING AN OBJECT IN A VIRTUAL ENVIRONMENT
CN101237915B (en) * 2005-08-12 2012-02-29 皇家飞利浦电子股份有限公司 Interactive entertainment system and method of operation thereof
CN102426480A (en) * 2011-11-03 2012-04-25 康佳集团股份有限公司 Man-machine interactive system and real-time gesture tracking processing method for same
WO2014137673A1 (en) * 2013-03-03 2014-09-12 Microsoft Corporation Enhanced presentation environments
US9330478B2 (en) 2012-02-08 2016-05-03 Intel Corporation Augmented reality creation using a real scene
US9635159B2 (en) 2012-05-08 2017-04-25 Nokia Technologies Oy Method and apparatus for providing immersive interaction via everyday devices
US9901828B2 (en) 2010-03-30 2018-02-27 Sony Interactive Entertainment America Llc Method for an augmented reality character to maintain and exhibit awareness of an observer
WO2020124253A1 (en) * 2018-12-19 2020-06-25 Cae Inc. Dynamically adjusting image characteristics in real-time
US10937240B2 (en) 2018-01-04 2021-03-02 Intel Corporation Augmented reality bindings of physical objects and virtual objects

Families Citing this family (417)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8352400B2 (en) 1991-12-23 2013-01-08 Hoffberg Steven M Adaptive pattern recognition based controller apparatus and method and human-factored interface therefore
US6176782B1 (en) * 1997-12-22 2001-01-23 Philips Electronics North America Corp. Motion-based command generation technology
US6765566B1 (en) * 1998-12-22 2004-07-20 Che-Chih Tsao Method and apparatus for displaying volumetric 3D images
US7966078B2 (en) 1999-02-01 2011-06-21 Steven Hoffberg Network media appliance system and method
US6677938B1 (en) * 1999-08-04 2004-01-13 Trimble Navigation, Ltd. Generating positional reality using RTK integrated with scanning lasers
WO2001052230A1 (en) * 2000-01-10 2001-07-19 Ic Tech, Inc. Method and system for interacting with a display
GB0012275D0 (en) * 2000-05-22 2000-07-12 Secr Defence Brit Three dimensional human computer interface
FI20001429A (en) * 2000-06-15 2001-12-16 Nokia Corp Choosing an alternative
WO2002015560A2 (en) * 2000-08-12 2002-02-21 Georgia Tech Research Corporation A system and method for capturing an image
EP1311803B8 (en) * 2000-08-24 2008-05-07 VDO Automotive AG Method and navigation device for querying target information and navigating within a map view
JP3760755B2 (en) * 2000-10-11 2006-03-29 日産自動車株式会社 Voice input device
US6728582B1 (en) * 2000-12-15 2004-04-27 Cognex Corporation System and method for determining the position of an object in three dimensions using a machine vision system with two cameras
US7925703B2 (en) * 2000-12-26 2011-04-12 Numedeon, Inc. Graphical interactive interface for immersive online communities
US6704447B2 (en) * 2001-02-21 2004-03-09 Justsystem Corporation Method and apparatus for using illumination from a display for computer vision based user interfaces and biometric authentication
US7197459B1 (en) * 2001-03-19 2007-03-27 Amazon Technologies, Inc. Hybrid machine/human computing arrangement
US6498970B2 (en) 2001-04-17 2002-12-24 Koninklijke Phillips Electronics N.V. Automatic access to an automobile via biometrics
US7007236B2 (en) * 2001-09-14 2006-02-28 Accenture Global Services Gmbh Lab window collaboration
US6990639B2 (en) * 2002-02-07 2006-01-24 Microsoft Corporation System and process for controlling electronic components in a ubiquitous computing environment using multimodal integration
US20030227453A1 (en) * 2002-04-09 2003-12-11 Klaus-Peter Beier Method, system and computer program product for automatically creating an animated 3-D scenario from human position and path data
US20040113887A1 (en) * 2002-08-27 2004-06-17 University Of Southern California partially real and partially simulated modular interactive environment
JP4185052B2 (en) * 2002-10-15 2008-11-19 ユニバーシティ オブ サザン カリフォルニア Enhanced virtual environment
US20040100484A1 (en) * 2002-11-25 2004-05-27 Barrett Peter T. Three-dimensional television viewing environment
US7511710B2 (en) 2002-11-25 2009-03-31 Microsoft Corporation Three-dimensional program guide
US7665041B2 (en) * 2003-03-25 2010-02-16 Microsoft Corporation Architecture for controlling a computer using hand gestures
US8745541B2 (en) 2003-03-25 2014-06-03 Microsoft Corporation Architecture for controlling a computer using hand gestures
US7038661B2 (en) * 2003-06-13 2006-05-02 Microsoft Corporation Pointing device and cursor for use in intelligent computing environments
ES2231035B1 (en) * 2003-10-30 2006-07-01 Frontera Azul Systems, S.L. COMMUNICATION SYSTEM AND PROCEDURE BASED ON VIRTUAL REALITY.
US7554545B2 (en) * 2003-11-04 2009-06-30 Ssd Company Limited Drawing apparatus operable to display a motion path of an operation article
JP4311190B2 (en) * 2003-12-17 2009-08-12 株式会社デンソー In-vehicle device interface
US8442331B2 (en) 2004-02-15 2013-05-14 Google Inc. Capturing text from rendered documents using supplemental information
US7707039B2 (en) * 2004-02-15 2010-04-27 Exbiblio B.V. Automatic modification of web pages
US20060122983A1 (en) * 2004-12-03 2006-06-08 King Martin T Locating electronic instances of documents based on rendered instances, document fragment digest generation, and digest based document fragment determination
US7812860B2 (en) * 2004-04-01 2010-10-12 Exbiblio B.V. Handheld device for capturing text from both a document printed on paper and a document displayed on a dynamic display device
US20060053097A1 (en) * 2004-04-01 2006-03-09 King Martin T Searching and accessing documents on private networks for use with captures from rendered documents
US20060041605A1 (en) * 2004-04-01 2006-02-23 King Martin T Determining actions involving captured information and electronic content associated with rendered documents
US10635723B2 (en) 2004-02-15 2020-04-28 Google Llc Search engines and systems with handheld document data capture devices
US20060041484A1 (en) * 2004-04-01 2006-02-23 King Martin T Methods and systems for initiating application processes by data capture from rendered documents
US20050255434A1 (en) * 2004-02-27 2005-11-17 University Of Florida Research Foundation, Inc. Interactive virtual characters for training including medical diagnosis training
US20050227217A1 (en) * 2004-03-31 2005-10-13 Wilson Andrew D Template matching on interactive surface
US7990556B2 (en) * 2004-12-03 2011-08-02 Google Inc. Association of a portable scanner with input/output and storage devices
US20070300142A1 (en) * 2005-04-01 2007-12-27 King Martin T Contextual dynamic advertising based upon captured rendered text
US20060081714A1 (en) 2004-08-23 2006-04-20 King Martin T Portable scanning device
US20060098900A1 (en) * 2004-09-27 2006-05-11 King Martin T Secure data gathering from rendered documents
US9008447B2 (en) 2004-04-01 2015-04-14 Google Inc. Method and system for character recognition
US8146156B2 (en) 2004-04-01 2012-03-27 Google Inc. Archive of text captures from rendered documents
US20080313172A1 (en) * 2004-12-03 2008-12-18 King Martin T Determining actions involving captured information and electronic content associated with rendered documents
US9116890B2 (en) 2004-04-01 2015-08-25 Google Inc. Triggering actions in response to optically or acoustically capturing keywords from a rendered document
US7894670B2 (en) 2004-04-01 2011-02-22 Exbiblio B.V. Triggering actions in response to optically or acoustically capturing keywords from a rendered document
US8081849B2 (en) * 2004-12-03 2011-12-20 Google Inc. Portable scanning and memory device
US9143638B2 (en) 2004-04-01 2015-09-22 Google Inc. Data capture from rendered documents using handheld device
US8713418B2 (en) * 2004-04-12 2014-04-29 Google Inc. Adding value to a rendered document
US8620083B2 (en) 2004-12-03 2013-12-31 Google Inc. Method and system for character recognition
US8874504B2 (en) * 2004-12-03 2014-10-28 Google Inc. Processing techniques for visual capture data from a rendered document
US8489624B2 (en) 2004-05-17 2013-07-16 Google, Inc. Processing techniques for text capture from a rendered document
US7394459B2 (en) 2004-04-29 2008-07-01 Microsoft Corporation Interaction between objects and a virtual environment display
US7787706B2 (en) * 2004-06-14 2010-08-31 Microsoft Corporation Method for controlling an intensity of an infrared source used to detect objects adjacent to an interactive display surface
US7593593B2 (en) 2004-06-16 2009-09-22 Microsoft Corporation Method and system for reducing effects of undesired signals in an infrared imaging system
US8346620B2 (en) 2004-07-19 2013-01-01 Google Inc. Automatic modification of web pages
CA2578653A1 (en) 2004-07-29 2006-02-09 Kevin Ferguson A human movement measurement system
US8560972B2 (en) 2004-08-10 2013-10-15 Microsoft Corporation Surface UI for gesture-based interaction
US7945469B2 (en) * 2004-11-16 2011-05-17 Amazon Technologies, Inc. Providing an electronic marketplace to facilitate human performance of programmatically submitted tasks
US7881957B1 (en) 2004-11-16 2011-02-01 Amazon Technologies, Inc. Identifying tasks for task performers based on task subscriptions
US8046250B1 (en) 2004-11-16 2011-10-25 Amazon Technologies, Inc. Facilitating performance by task performers of language-specific tasks
US8005697B1 (en) 2004-11-16 2011-08-23 Amazon Technologies, Inc. Performing automated price determination for tasks to be performed
US8170897B1 (en) 2004-11-16 2012-05-01 Amazon Technologies, Inc. Automated validation of results of human performance of tasks
US20060106774A1 (en) * 2004-11-16 2006-05-18 Cohen Peter D Using qualifications of users to facilitate user performance of tasks
US7885844B1 (en) 2004-11-16 2011-02-08 Amazon Technologies, Inc. Automatically generating task recommendations for human task performers
US7773074B2 (en) * 2005-06-28 2010-08-10 Siemens Medical Solutions Usa, Inc. Medical diagnostic imaging three dimensional navigation device and methods
US7911444B2 (en) * 2005-08-31 2011-03-22 Microsoft Corporation Input method for surface of interactive display
US7697827B2 (en) 2005-10-17 2010-04-13 Konicek Jeffrey C User-friendlier interfaces for a camera
US8094928B2 (en) * 2005-11-14 2012-01-10 Microsoft Corporation Stereo video for gaming
US8060840B2 (en) 2005-12-29 2011-11-15 Microsoft Corporation Orientation free user interface
US7612786B2 (en) * 2006-02-10 2009-11-03 Microsoft Corporation Variable orientation input mode
US8930834B2 (en) * 2006-03-20 2015-01-06 Microsoft Corporation Variable orientation user interface
US8139059B2 (en) * 2006-03-31 2012-03-20 Microsoft Corporation Object illumination in a virtual environment
US8024211B1 (en) 2006-03-31 2011-09-20 Amazon Technologies, Inc. Automatically generating assessments of qualification relevance and qualification issuer credibility
US20070284429A1 (en) * 2006-06-13 2007-12-13 Microsoft Corporation Computer component recognition and setup
US7552402B2 (en) * 2006-06-22 2009-06-23 Microsoft Corporation Interface orientation using shadows
US8001613B2 (en) * 2006-06-23 2011-08-16 Microsoft Corporation Security using physical objects
US20080040692A1 (en) * 2006-06-29 2008-02-14 Microsoft Corporation Gesture input
US7899694B1 (en) 2006-06-30 2011-03-01 Amazon Technologies, Inc. Generating solutions to problems via interactions with human responders
WO2008014826A1 (en) 2006-08-03 2008-02-07 Alterface S.A. Method and device for identifying and extracting images of multiple users, and for recognizing user gestures
US7907117B2 (en) * 2006-08-08 2011-03-15 Microsoft Corporation Virtual controller for visual displays
US7725547B2 (en) * 2006-09-06 2010-05-25 International Business Machines Corporation Informing a user of gestures made by others out of the user's line of sight
EP2067119A2 (en) 2006-09-08 2009-06-10 Exbiblio B.V. Optical scanners, such as hand-held optical scanners
WO2008031625A2 (en) * 2006-09-15 2008-03-20 Exbiblio B.V. Capture and display of annotations in paper and electronic documents
US9697486B2 (en) * 2006-09-29 2017-07-04 Amazon Technologies, Inc. Facilitating performance of tasks via distribution using third-party sites
US7945470B1 (en) 2006-09-29 2011-05-17 Amazon Technologies, Inc. Facilitating performance of submitted tasks by mobile task performers
US20080147488A1 (en) * 2006-10-20 2008-06-19 Tunick James A System and method for monitoring viewer attention with respect to a display and determining associated charges
US8793756B2 (en) * 2006-12-20 2014-07-29 Dst Technologies, Inc. Secure processing of secure information in a non-secure environment
US7792328B2 (en) * 2007-01-12 2010-09-07 International Business Machines Corporation Warning a vehicle operator of unsafe operation behavior based on a 3D captured image stream
US8588464B2 (en) * 2007-01-12 2013-11-19 International Business Machines Corporation Assisting a vision-impaired user with navigation based on a 3D captured image stream
US7877706B2 (en) * 2007-01-12 2011-01-25 International Business Machines Corporation Controlling a document based on user behavioral signals detected from a 3D captured image stream
US8295542B2 (en) * 2007-01-12 2012-10-23 International Business Machines Corporation Adjusting a consumer experience based on a 3D captured image stream of a consumer response
US7840031B2 (en) * 2007-01-12 2010-11-23 International Business Machines Corporation Tracking a range of body movement based on 3D captured image streams of a user
US7971156B2 (en) * 2007-01-12 2011-06-28 International Business Machines Corporation Controlling resource access based on user gesturing in a 3D captured image stream of the user
US7801332B2 (en) * 2007-01-12 2010-09-21 International Business Machines Corporation Controlling a system based on user behavioral signals detected from a 3D captured image stream
US8269834B2 (en) 2007-01-12 2012-09-18 International Business Machines Corporation Warning a user about adverse behaviors of others within an environment based on a 3D captured image stream
US8212857B2 (en) * 2007-01-26 2012-07-03 Microsoft Corporation Alternating light sources to reduce specular reflection
US8005238B2 (en) 2007-03-22 2011-08-23 Microsoft Corporation Robust adaptive beamforming with enhanced noise suppression
US8005237B2 (en) 2007-05-17 2011-08-23 Microsoft Corp. Sensor array beamformer post-processor
US20110145068A1 (en) * 2007-09-17 2011-06-16 King Martin T Associating rendered advertisements with digital content
US8629976B2 (en) * 2007-10-02 2014-01-14 Microsoft Corporation Methods and systems for hierarchical de-aliasing time-of-flight (TOF) systems
US20090119604A1 (en) * 2007-11-06 2009-05-07 Microsoft Corporation Virtual office devices
US9171454B2 (en) * 2007-11-14 2015-10-27 Microsoft Technology Licensing, Llc Magic wand
US9026483B1 (en) 2007-11-19 2015-05-05 Amazon Technologies, Inc. Automatic prediction of aspects of human task performance
US8121888B1 (en) 2007-12-14 2012-02-21 Amazon Technologies, Inc. Facilitating improvement of results of human performance of tasks
US20090166684A1 (en) * 2007-12-26 2009-07-02 3Dv Systems Ltd. Photogate cmos pixel for 3d cameras having reduced intra-pixel cross talk
US10438152B1 (en) 2008-01-25 2019-10-08 Amazon Technologies, Inc. Managing performance of human review of media data
US8113991B2 (en) * 2008-06-02 2012-02-14 Omek Interactive, Ltd. Method and system for interactive fitness training program
US8219432B1 (en) 2008-06-10 2012-07-10 Amazon Technologies, Inc. Automatically controlling availability of tasks for performance by human users
US8385557B2 (en) 2008-06-19 2013-02-26 Microsoft Corporation Multichannel acoustic echo reduction
US8325909B2 (en) 2008-06-25 2012-12-04 Microsoft Corporation Acoustic echo suppression
US8203699B2 (en) 2008-06-30 2012-06-19 Microsoft Corporation System architecture design for time-of-flight system having reduced differential pixel size, and time-of-flight systems so designed
US8847739B2 (en) * 2008-08-04 2014-09-30 Microsoft Corporation Fusing RFID and vision for surface object tracking
US20100031202A1 (en) * 2008-08-04 2010-02-04 Microsoft Corporation User-defined gesture set for surface computing
US20100105479A1 (en) 2008-10-23 2010-04-29 Microsoft Corporation Determining orientation in an external reference frame
US8681321B2 (en) * 2009-01-04 2014-03-25 Microsoft International Holdings B.V. Gated 3D camera
US8577085B2 (en) * 2009-01-30 2013-11-05 Microsoft Corporation Visual target tracking
US8565477B2 (en) 2009-01-30 2013-10-22 Microsoft Corporation Visual target tracking
US8267781B2 (en) 2009-01-30 2012-09-18 Microsoft Corporation Visual target tracking
US8565476B2 (en) 2009-01-30 2013-10-22 Microsoft Corporation Visual target tracking
US8294767B2 (en) * 2009-01-30 2012-10-23 Microsoft Corporation Body scan
US9652030B2 (en) * 2009-01-30 2017-05-16 Microsoft Technology Licensing, Llc Navigation of a virtual plane using a zone of restriction for canceling noise
US8682028B2 (en) * 2009-01-30 2014-03-25 Microsoft Corporation Visual target tracking
US8577084B2 (en) * 2009-01-30 2013-11-05 Microsoft Corporation Visual target tracking
US8487938B2 (en) * 2009-01-30 2013-07-16 Microsoft Corporation Standard gestures
US8588465B2 (en) * 2009-01-30 2013-11-19 Microsoft Corporation Visual target tracking
US8448094B2 (en) * 2009-01-30 2013-05-21 Microsoft Corporation Mapping a natural input device to a legacy system
US7996793B2 (en) 2009-01-30 2011-08-09 Microsoft Corporation Gesture recognizer system architecture
US8295546B2 (en) * 2009-01-30 2012-10-23 Microsoft Corporation Pose tracking pipeline
US20100199231A1 (en) * 2009-01-30 2010-08-05 Microsoft Corporation Predictive determination
CA2748037C (en) * 2009-02-17 2016-09-20 Omek Interactive, Ltd. Method and system for gesture recognition
DE202010018601U1 (en) * 2009-02-18 2018-04-30 Google LLC (n.d.Ges.d. Staates Delaware) Automatically collecting information, such as gathering information using a document recognizing device
US8447066B2 (en) 2009-03-12 2013-05-21 Google Inc. Performing actions based on capturing information from rendered documents, such as documents under copyright
WO2010105245A2 (en) 2009-03-12 2010-09-16 Exbiblio B.V. Automatically providing content associated with captured information, such as information captured in real-time
US8773355B2 (en) 2009-03-16 2014-07-08 Microsoft Corporation Adaptive cursor sizing
US8988437B2 (en) 2009-03-20 2015-03-24 Microsoft Technology Licensing, Llc Chaining animations
US9256282B2 (en) * 2009-03-20 2016-02-09 Microsoft Technology Licensing, Llc Virtual object manipulation
US9313376B1 (en) 2009-04-01 2016-04-12 Microsoft Technology Licensing, Llc Dynamic depth power equalization
US8660303B2 (en) * 2009-05-01 2014-02-25 Microsoft Corporation Detection of body and props
US9377857B2 (en) 2009-05-01 2016-06-28 Microsoft Technology Licensing, Llc Show body position
US8503720B2 (en) 2009-05-01 2013-08-06 Microsoft Corporation Human body pose estimation
US9498718B2 (en) 2009-05-01 2016-11-22 Microsoft Technology Licensing, Llc Altering a view perspective within a display environment
US9898675B2 (en) 2009-05-01 2018-02-20 Microsoft Technology Licensing, Llc User movement tracking feedback to improve tracking
US9015638B2 (en) 2009-05-01 2015-04-21 Microsoft Technology Licensing, Llc Binding users to a gesture based system and providing feedback to the users
US8942428B2 (en) 2009-05-01 2015-01-27 Microsoft Corporation Isolate extraneous motions
US8253746B2 (en) 2009-05-01 2012-08-28 Microsoft Corporation Determine intended motions
US8649554B2 (en) 2009-05-01 2014-02-11 Microsoft Corporation Method to control perspective for a camera-controlled computer
US8181123B2 (en) 2009-05-01 2012-05-15 Microsoft Corporation Managing virtual port associations to users in a gesture-based computing environment
US8340432B2 (en) 2009-05-01 2012-12-25 Microsoft Corporation Systems and methods for detecting a tilt angle from a depth image
US8638985B2 (en) 2009-05-01 2014-01-28 Microsoft Corporation Human body pose estimation
WO2010130245A1 (en) * 2009-05-11 2010-11-18 Universität Zu Lübeck Method for the real-time-capable, computer-assisted analysis of an image sequence containing a variable pose
US9417700B2 (en) 2009-05-21 2016-08-16 Edge3 Technologies Gesture recognition systems and related methods
US9400559B2 (en) 2009-05-29 2016-07-26 Microsoft Technology Licensing, Llc Gesture shortcuts
US8625837B2 (en) 2009-05-29 2014-01-07 Microsoft Corporation Protocol and format for communicating an image from a camera to a computing environment
US9383823B2 (en) 2009-05-29 2016-07-05 Microsoft Technology Licensing, Llc Combining gestures beyond skeletal
US8542252B2 (en) * 2009-05-29 2013-09-24 Microsoft Corporation Target digitization, extraction, and tracking
US8320619B2 (en) 2009-05-29 2012-11-27 Microsoft Corporation Systems and methods for tracking a model
US8418085B2 (en) 2009-05-29 2013-04-09 Microsoft Corporation Gesture coach
US20100302365A1 (en) * 2009-05-29 2010-12-02 Microsoft Corporation Depth Image Noise Reduction
US9182814B2 (en) 2009-05-29 2015-11-10 Microsoft Technology Licensing, Llc Systems and methods for estimating a non-visible or occluded body part
US8509479B2 (en) * 2009-05-29 2013-08-13 Microsoft Corporation Virtual object
US8744121B2 (en) 2009-05-29 2014-06-03 Microsoft Corporation Device for identifying and tracking multiple humans over time
US8856691B2 (en) 2009-05-29 2014-10-07 Microsoft Corporation Gesture tool
US8693724B2 (en) 2009-05-29 2014-04-08 Microsoft Corporation Method and system implementing user-centric gesture control
US8379101B2 (en) 2009-05-29 2013-02-19 Microsoft Corporation Environment and/or target segmentation
US8487871B2 (en) 2009-06-01 2013-07-16 Microsoft Corporation Virtual desktop coordinate transformation
US8390680B2 (en) 2009-07-09 2013-03-05 Microsoft Corporation Visual representation expression based on player expression
US9159151B2 (en) 2009-07-13 2015-10-13 Microsoft Technology Licensing, Llc Bringing a visual representation to life via learned input from the user
US8264536B2 (en) * 2009-08-25 2012-09-11 Microsoft Corporation Depth-sensitive imaging via polarization-state mapping
US9141193B2 (en) 2009-08-31 2015-09-22 Microsoft Technology Licensing, Llc Techniques for using human gestures to control gesture unaware programs
US8508919B2 (en) 2009-09-14 2013-08-13 Microsoft Corporation Separation of electrical and optical components
US8330134B2 (en) 2009-09-14 2012-12-11 Microsoft Corporation Optical fault monitoring
US8976986B2 (en) * 2009-09-21 2015-03-10 Microsoft Technology Licensing, Llc Volume adjustment based on listener position
US8428340B2 (en) * 2009-09-21 2013-04-23 Microsoft Corporation Screen space plane identification
US8760571B2 (en) * 2009-09-21 2014-06-24 Microsoft Corporation Alignment of lens and image sensor
US9014546B2 (en) 2009-09-23 2015-04-21 Rovi Guides, Inc. Systems and methods for automatically detecting users within detection regions of media devices
US8452087B2 (en) * 2009-09-30 2013-05-28 Microsoft Corporation Image selection techniques
US8723118B2 (en) * 2009-10-01 2014-05-13 Microsoft Corporation Imager for constructing color and depth images
US8963829B2 (en) 2009-10-07 2015-02-24 Microsoft Corporation Methods and systems for determining and tracking extremities of a target
US8867820B2 (en) 2009-10-07 2014-10-21 Microsoft Corporation Systems and methods for removing a background of an image
US8564534B2 (en) 2009-10-07 2013-10-22 Microsoft Corporation Human tracking system
US7961910B2 (en) 2009-10-07 2011-06-14 Microsoft Corporation Systems and methods for tracking a model
US8458010B1 (en) 2009-10-13 2013-06-04 Amazon Technologies, Inc. Monitoring and enforcing price parity
US9400548B2 (en) * 2009-10-19 2016-07-26 Microsoft Technology Licensing, Llc Gesture personalization and profile roaming
US20110099476A1 (en) * 2009-10-23 2011-04-28 Microsoft Corporation Decorating a display environment
US8988432B2 (en) * 2009-11-05 2015-03-24 Microsoft Technology Licensing, Llc Systems and methods for processing an image for target tracking
KR101647722B1 (en) * 2009-11-13 2016-08-23 엘지전자 주식회사 Image Display Device and Operating Method for the Same
US8843857B2 (en) 2009-11-19 2014-09-23 Microsoft Corporation Distance scalable no touch computing
US9081799B2 (en) * 2009-12-04 2015-07-14 Google Inc. Using gestalt information to identify locations in printed information
US9323784B2 (en) * 2009-12-09 2016-04-26 Google Inc. Image search using text-based elements within the contents of images
US9244533B2 (en) 2009-12-17 2016-01-26 Microsoft Technology Licensing, Llc Camera navigation for presentations
US20110151974A1 (en) * 2009-12-18 2011-06-23 Microsoft Corporation Gesture style recognition and reward
US20110150271A1 (en) 2009-12-18 2011-06-23 Microsoft Corporation Motion detection using depth images
US8320621B2 (en) 2009-12-21 2012-11-27 Microsoft Corporation Depth projector system with integrated VCSEL array
US9268404B2 (en) * 2010-01-08 2016-02-23 Microsoft Technology Licensing, Llc Application gesture interpretation
US9019201B2 (en) * 2010-01-08 2015-04-28 Microsoft Technology Licensing, Llc Evolving universal gesture sets
US8631355B2 (en) * 2010-01-08 2014-01-14 Microsoft Corporation Assigning gesture dictionaries
US8933884B2 (en) * 2010-01-15 2015-01-13 Microsoft Corporation Tracking groups of users in motion capture system
US8334842B2 (en) 2010-01-15 2012-12-18 Microsoft Corporation Recognizing user intent in motion capture system
US8676581B2 (en) 2010-01-22 2014-03-18 Microsoft Corporation Speech recognition analysis via identification information
US8265341B2 (en) 2010-01-25 2012-09-11 Microsoft Corporation Voice-body identity correlation
US8864581B2 (en) * 2010-01-29 2014-10-21 Microsoft Corporation Visual based identity tracking
US8891067B2 (en) 2010-02-01 2014-11-18 Microsoft Corporation Multiple synchronized optical sources for time-of-flight range finding systems
US8619122B2 (en) * 2010-02-02 2013-12-31 Microsoft Corporation Depth camera compatibility
US8687044B2 (en) * 2010-02-02 2014-04-01 Microsoft Corporation Depth camera compatibility
US8717469B2 (en) * 2010-02-03 2014-05-06 Microsoft Corporation Fast gating photosurface
US8499257B2 (en) * 2010-02-09 2013-07-30 Microsoft Corporation Handles interactions for human-computer interface
US8659658B2 (en) * 2010-02-09 2014-02-25 Microsoft Corporation Physical interaction zone for gesture-based user interfaces
US20110199302A1 (en) * 2010-02-16 2011-08-18 Microsoft Corporation Capturing screen objects using a collision volume
US8633890B2 (en) * 2010-02-16 2014-01-21 Microsoft Corporation Gesture detection based on joint skipping
US8928579B2 (en) * 2010-02-22 2015-01-06 Andrew David Wilson Interacting with an omni-directionally projected display
US8411948B2 (en) 2010-03-05 2013-04-02 Microsoft Corporation Up-sampling binary images for segmentation
US8655069B2 (en) 2010-03-05 2014-02-18 Microsoft Corporation Updating image segmentation following user input
US8422769B2 (en) 2010-03-05 2013-04-16 Microsoft Corporation Image segmentation using reduced foreground training data
US20110221755A1 (en) * 2010-03-12 2011-09-15 Kevin Geisner Bionic motion
US20110223995A1 (en) 2010-03-12 2011-09-15 Kevin Geisner Interacting with a computer based application
US8279418B2 (en) 2010-03-17 2012-10-02 Microsoft Corporation Raster scanning for depth detection
US8213680B2 (en) * 2010-03-19 2012-07-03 Microsoft Corporation Proxy training data for human body tracking
US20110234481A1 (en) * 2010-03-26 2011-09-29 Sagi Katz Enhancing presentations using depth sensing cameras
US8514269B2 (en) * 2010-03-26 2013-08-20 Microsoft Corporation De-aliasing depth images
US8523667B2 (en) * 2010-03-29 2013-09-03 Microsoft Corporation Parental control settings based on body dimensions
US8605763B2 (en) 2010-03-31 2013-12-10 Microsoft Corporation Temperature measurement and control for laser and light-emitting diodes
US9098873B2 (en) 2010-04-01 2015-08-04 Microsoft Technology Licensing, Llc Motion-based interactive shopping environment
US9646340B2 (en) 2010-04-01 2017-05-09 Microsoft Technology Licensing, Llc Avatar-based virtual dressing room
US8780115B1 (en) * 2010-04-06 2014-07-15 The Pnc Financial Services Group, Inc. Investment management marketing tool
US8791949B1 (en) * 2010-04-06 2014-07-29 The Pnc Financial Services Group, Inc. Investment management marketing tool
US8351651B2 (en) 2010-04-26 2013-01-08 Microsoft Corporation Hand-location post-process refinement in a tracking system
US8379919B2 (en) 2010-04-29 2013-02-19 Microsoft Corporation Multiple centroid condensation of probability distribution clouds
US8284847B2 (en) 2010-05-03 2012-10-09 Microsoft Corporation Detecting motion for a multifunction sensor device
US8885890B2 (en) 2010-05-07 2014-11-11 Microsoft Corporation Depth map confidence filtering
US8498481B2 (en) 2010-05-07 2013-07-30 Microsoft Corporation Image segmentation using star-convexity constraints
US8457353B2 (en) 2010-05-18 2013-06-04 Microsoft Corporation Gestures and gesture modifiers for manipulating a user-interface
US8396252B2 (en) 2010-05-20 2013-03-12 Edge 3 Technologies Systems and related methods for three dimensional gesture recognition in vehicles
US8803888B2 (en) 2010-06-02 2014-08-12 Microsoft Corporation Recognition system for sharing information
US9008355B2 (en) 2010-06-04 2015-04-14 Microsoft Technology Licensing, Llc Automatic depth camera aiming
US8751215B2 (en) 2010-06-04 2014-06-10 Microsoft Corporation Machine based sign language interpreter
US8966400B2 (en) * 2010-06-07 2015-02-24 Empire Technology Development Llc User movement interpretation in computer generated reality
US9557574B2 (en) 2010-06-08 2017-01-31 Microsoft Technology Licensing, Llc Depth illumination and detection optics
US8330822B2 (en) 2010-06-09 2012-12-11 Microsoft Corporation Thermally-tuned depth camera light source
US9384329B2 (en) 2010-06-11 2016-07-05 Microsoft Technology Licensing, Llc Caloric burn determination from body movement
US8749557B2 (en) 2010-06-11 2014-06-10 Microsoft Corporation Interacting with user interface via avatar
US8675981B2 (en) 2010-06-11 2014-03-18 Microsoft Corporation Multi-modal gender recognition including depth data
US8982151B2 (en) 2010-06-14 2015-03-17 Microsoft Technology Licensing, Llc Independently processing planes of display data
US8639020B1 (en) 2010-06-16 2014-01-28 Intel Corporation Method and system for modeling subjects from a depth map
US8558873B2 (en) 2010-06-16 2013-10-15 Microsoft Corporation Use of wavefront coding to create a depth image
US8670029B2 (en) 2010-06-16 2014-03-11 Microsoft Corporation Depth camera illuminator with superluminescent light-emitting diode
US8296151B2 (en) 2010-06-18 2012-10-23 Microsoft Corporation Compound gesture-speech commands
US8381108B2 (en) 2010-06-21 2013-02-19 Microsoft Corporation Natural user input for driving interactive stories
US8416187B2 (en) 2010-06-22 2013-04-09 Microsoft Corporation Item navigation using motion-capture data
US9262015B2 (en) * 2010-06-28 2016-02-16 Intel Corporation System for portable tangible interaction
US9075434B2 (en) 2010-08-20 2015-07-07 Microsoft Technology Licensing, Llc Translating user motion into multiple object responses
US8613666B2 (en) 2010-08-31 2013-12-24 Microsoft Corporation User selection and navigation based on looped motions
US8655093B2 (en) 2010-09-02 2014-02-18 Edge 3 Technologies, Inc. Method and apparatus for performing segmentation of an image
US8666144B2 (en) 2010-09-02 2014-03-04 Edge 3 Technologies, Inc. Method and apparatus for determining disparity of texture
WO2012030872A1 (en) 2010-09-02 2012-03-08 Edge3 Technologies Inc. Method and apparatus for confusion learning
US8582866B2 (en) 2011-02-10 2013-11-12 Edge 3 Technologies, Inc. Method and apparatus for disparity computation in stereo images
US8437506B2 (en) 2010-09-07 2013-05-07 Microsoft Corporation System for fast, probabilistic skeletal tracking
US20120058824A1 (en) 2010-09-07 2012-03-08 Microsoft Corporation Scalable real-time motion recognition
DE102010040699A1 (en) * 2010-09-14 2012-03-15 Otto-Von-Guericke-Universität Magdeburg Medizinische Fakultät Apparatus for determining the anticipation skill of athletes in sporting activities, having a projection device and a video camera connected to a data processing system to which a display screen is connected
US8988508B2 (en) 2010-09-24 2015-03-24 Microsoft Technology Licensing, Llc. Wide angle field of view active illumination imaging system
US8681255B2 (en) 2010-09-28 2014-03-25 Microsoft Corporation Integrated low power depth camera and projection device
US8548270B2 (en) 2010-10-04 2013-10-01 Microsoft Corporation Time-of-flight depth imaging
US9484065B2 (en) 2010-10-15 2016-11-01 Microsoft Technology Licensing, Llc Intelligent determination of replays based on event identification
US8592739B2 (en) 2010-11-02 2013-11-26 Microsoft Corporation Detection of configuration changes of an optical element in an illumination system
US8866889B2 (en) 2010-11-03 2014-10-21 Microsoft Corporation In-home depth camera calibration
US8667519B2 (en) 2010-11-12 2014-03-04 Microsoft Corporation Automatic passive and anonymous feedback system
US10726861B2 (en) 2010-11-15 2020-07-28 Microsoft Technology Licensing, Llc Semi-private communication in open environments
US9349040B2 (en) 2010-11-19 2016-05-24 Microsoft Technology Licensing, Llc Bi-modal depth-image analysis
US10234545B2 (en) 2010-12-01 2019-03-19 Microsoft Technology Licensing, Llc Light source module
US8553934B2 (en) 2010-12-08 2013-10-08 Microsoft Corporation Orienting the position of a sensor
US8618405B2 (en) 2010-12-09 2013-12-31 Microsoft Corp. Free-space gesture musical instrument digital interface (MIDI) controller
US8408706B2 (en) 2010-12-13 2013-04-02 Microsoft Corporation 3D gaze tracker
US8884968B2 (en) 2010-12-15 2014-11-11 Microsoft Corporation Modeling an object from image data
US8920241B2 (en) 2010-12-15 2014-12-30 Microsoft Corporation Gesture controlled persistent handles for interface guides
US9171264B2 (en) 2010-12-15 2015-10-27 Microsoft Technology Licensing, Llc Parallel processing machine learning decision tree training
US8448056B2 (en) 2010-12-17 2013-05-21 Microsoft Corporation Validation analysis of human target
US8803952B2 (en) 2010-12-20 2014-08-12 Microsoft Corporation Plural detector time-of-flight depth mapping
US9848106B2 (en) 2010-12-21 2017-12-19 Microsoft Technology Licensing, Llc Intelligent gameplay photo capture
US8994718B2 (en) 2010-12-21 2015-03-31 Microsoft Technology Licensing, Llc Skeletal control of three-dimensional virtual world
US8385596B2 (en) 2010-12-21 2013-02-26 Microsoft Corporation First person shooter control with virtual skeleton
US9821224B2 (en) 2010-12-21 2017-11-21 Microsoft Technology Licensing, Llc Driving simulator control with virtual skeleton
US9823339B2 (en) 2010-12-21 2017-11-21 Microsoft Technology Licensing, Llc Plural anode time-of-flight sensor
US9123316B2 (en) 2010-12-27 2015-09-01 Microsoft Technology Licensing, Llc Interactive content creation
US8488888B2 (en) 2010-12-28 2013-07-16 Microsoft Corporation Classification of posture states
US9247238B2 (en) 2011-01-31 2016-01-26 Microsoft Technology Licensing, Llc Reducing interference between multiple infra-red depth cameras
US8401225B2 (en) 2011-01-31 2013-03-19 Microsoft Corporation Moving object segmentation using depth images
US8401242B2 (en) 2011-01-31 2013-03-19 Microsoft Corporation Real-time camera tracking using depth maps
US8587583B2 (en) 2011-01-31 2013-11-19 Microsoft Corporation Three-dimensional environment reconstruction
US8724887B2 (en) 2011-02-03 2014-05-13 Microsoft Corporation Environmental modifications to mitigate environmental factors
US8970589B2 (en) 2011-02-10 2015-03-03 Edge 3 Technologies, Inc. Near-touch interaction with a stereo camera grid structured tessellations
US8942917B2 (en) 2011-02-14 2015-01-27 Microsoft Corporation Change invariant scene recognition by an agent
US8497838B2 (en) 2011-02-16 2013-07-30 Microsoft Corporation Push actuation of interface controls
JP2012181704A (en) * 2011-03-01 2012-09-20 Sony Computer Entertainment Inc Information processor and information processing method
US9551914B2 (en) 2011-03-07 2017-01-24 Microsoft Technology Licensing, Llc Illuminator with refractive optical element
US9067136B2 (en) 2011-03-10 2015-06-30 Microsoft Technology Licensing, Llc Push personalization of interface controls
US8571263B2 (en) 2011-03-17 2013-10-29 Microsoft Corporation Predicting joint positions
US9470778B2 (en) 2011-03-29 2016-10-18 Microsoft Technology Licensing, Llc Learning from high quality depth measurements
US9298287B2 (en) 2011-03-31 2016-03-29 Microsoft Technology Licensing, Llc Combined activation for natural user interface systems
US9760566B2 (en) 2011-03-31 2017-09-12 Microsoft Technology Licensing, Llc Augmented conversational understanding agent to identify conversation context between two humans and taking an agent action thereof
US10642934B2 (en) 2011-03-31 2020-05-05 Microsoft Technology Licensing, Llc Augmented conversational understanding architecture
US9842168B2 (en) 2011-03-31 2017-12-12 Microsoft Technology Licensing, Llc Task driven user intents
US8824749B2 (en) * 2011-04-05 2014-09-02 Microsoft Corporation Biometric recognition
US8503494B2 (en) 2011-04-05 2013-08-06 Microsoft Corporation Thermal management system
US8620113B2 (en) 2011-04-25 2013-12-31 Microsoft Corporation Laser diode modes
US9259643B2 (en) 2011-04-28 2016-02-16 Microsoft Technology Licensing, Llc Control of separate computer game elements
US8702507B2 (en) 2011-04-28 2014-04-22 Microsoft Corporation Manual and camera-based avatar control
US10671841B2 (en) 2011-05-02 2020-06-02 Microsoft Technology Licensing, Llc Attribute state classification
US8888331B2 (en) 2011-05-09 2014-11-18 Microsoft Corporation Low inductance light source module
US9137463B2 (en) 2011-05-12 2015-09-15 Microsoft Technology Licensing, Llc Adaptive high dynamic range camera
US9064006B2 (en) 2012-08-23 2015-06-23 Microsoft Technology Licensing, Llc Translating natural language utterances to keyword search queries
US8923686B2 (en) 2011-05-20 2014-12-30 Echostar Technologies L.L.C. Dynamically configurable 3D display
US8788973B2 (en) 2011-05-23 2014-07-22 Microsoft Corporation Three-dimensional gesture controlled avatar configuration interface
US8657683B2 (en) 2011-05-31 2014-02-25 Microsoft Corporation Action selection gesturing
US8760395B2 (en) 2011-05-31 2014-06-24 Microsoft Corporation Gesture recognition techniques
US8845431B2 (en) 2011-05-31 2014-09-30 Microsoft Corporation Shape trace gesturing
US8740702B2 (en) 2011-05-31 2014-06-03 Microsoft Corporation Action trigger gesturing
US9594430B2 (en) 2011-06-01 2017-03-14 Microsoft Technology Licensing, Llc Three-dimensional foreground selection for vision system
US8526734B2 (en) 2011-06-01 2013-09-03 Microsoft Corporation Three-dimensional background removal for vision system
US10796494B2 (en) 2011-06-06 2020-10-06 Microsoft Technology Licensing, Llc Adding attributes to virtual representations of real-world objects
US9208571B2 (en) 2011-06-06 2015-12-08 Microsoft Technology Licensing, Llc Object digitization
US8929612B2 (en) 2011-06-06 2015-01-06 Microsoft Corporation System for recognizing an open or closed hand
US9013489B2 (en) 2011-06-06 2015-04-21 Microsoft Technology Licensing, Llc Generation of avatar reflecting player appearance
US8897491B2 (en) 2011-06-06 2014-11-25 Microsoft Corporation System for finger recognition and tracking
US9098110B2 (en) 2011-06-06 2015-08-04 Microsoft Technology Licensing, Llc Head rotation tracking from depth-based center of mass
US8597142B2 (en) 2011-06-06 2013-12-03 Microsoft Corporation Dynamic camera based practice mode
US9724600B2 (en) 2011-06-06 2017-08-08 Microsoft Technology Licensing, Llc Controlling objects in a virtual environment
US9597587B2 (en) 2011-06-08 2017-03-21 Microsoft Technology Licensing, Llc Locational node device
US11048333B2 (en) 2011-06-23 2021-06-29 Intel Corporation System and method for close-range movement tracking
JP6074170B2 (en) 2011-06-23 2017-02-01 インテル・コーポレーション Short range motion tracking system and method
US8961315B1 (en) 2011-06-28 2015-02-24 Amazon Technologies, Inc. Providing tasks to users during electronic game play
US8786730B2 (en) 2011-08-18 2014-07-22 Microsoft Corporation Image exposure using exclusion regions
US9939888B2 (en) * 2011-09-15 2018-04-10 Microsoft Technology Licensing Llc Correlating movement information received from different sources
US9557836B2 (en) 2011-11-01 2017-01-31 Microsoft Technology Licensing, Llc Depth image compression
US9117281B2 (en) 2011-11-02 2015-08-25 Microsoft Corporation Surface segmentation from RGB and depth images
US8854426B2 (en) 2011-11-07 2014-10-07 Microsoft Corporation Time-of-flight camera with guided light
US9672609B1 (en) 2011-11-11 2017-06-06 Edge 3 Technologies, Inc. Method and apparatus for improved depth-map estimation
US8724906B2 (en) 2011-11-18 2014-05-13 Microsoft Corporation Computing pose and/or shape of modifiable entities
US8509545B2 (en) 2011-11-29 2013-08-13 Microsoft Corporation Foreground subject detection
US8803800B2 (en) 2011-12-02 2014-08-12 Microsoft Corporation User interface control based on head orientation
US8958631B2 (en) 2011-12-02 2015-02-17 Intel Corporation System and method for automatically defining and identifying a gesture
US8635637B2 (en) 2011-12-02 2014-01-21 Microsoft Corporation User interface presenting an animated avatar performing a media reaction
US9100685B2 (en) 2011-12-09 2015-08-04 Microsoft Technology Licensing, Llc Determining audience state or interest using passive sensor data
US8879831B2 (en) 2011-12-15 2014-11-04 Microsoft Corporation Using high-level attributes to guide image processing
US8971612B2 (en) 2011-12-15 2015-03-03 Microsoft Corporation Learning image processing tasks from scene reconstructions
US8630457B2 (en) 2011-12-15 2014-01-14 Microsoft Corporation Problem states for pose tracking pipeline
US8811938B2 (en) 2011-12-16 2014-08-19 Microsoft Corporation Providing a user interface experience based on inferred vehicle state
US9342139B2 (en) 2011-12-19 2016-05-17 Microsoft Technology Licensing, Llc Pairing a computing device to a user
US9501152B2 (en) 2013-01-15 2016-11-22 Leap Motion, Inc. Free-space user interface and control using virtual constructs
US8638989B2 (en) 2012-01-17 2014-01-28 Leap Motion, Inc. Systems and methods for capturing motion in three-dimensional space
US8693731B2 (en) 2012-01-17 2014-04-08 Leap Motion, Inc. Enhanced contrast for object detection and characterization by optical imaging
US11493998B2 (en) 2012-01-17 2022-11-08 Ultrahaptics IP Two Limited Systems and methods for machine control
US10691219B2 (en) 2012-01-17 2020-06-23 Ultrahaptics IP Two Limited Systems and methods for machine control
US9679215B2 (en) 2012-01-17 2017-06-13 Leap Motion, Inc. Systems and methods for machine control
US9070019B2 (en) 2012-01-17 2015-06-30 Leap Motion, Inc. Systems and methods for capturing motion in three-dimensional space
US9720089B2 (en) 2012-01-23 2017-08-01 Microsoft Technology Licensing, Llc 3D zoom imager
US8898687B2 (en) 2012-04-04 2014-11-25 Microsoft Corporation Controlling a media program based on a media reaction
US9477303B2 (en) 2012-04-09 2016-10-25 Intel Corporation System and method for combining three-dimensional tracking with a three-dimensional display for a user interface
US9210401B2 (en) 2012-05-03 2015-12-08 Microsoft Technology Licensing, Llc Projected visual cues for guiding physical movement
CA2775700C (en) 2012-05-04 2013-07-23 Microsoft Corporation Determining a future portion of a currently presented media program
CN104395929B (en) 2012-06-21 2017-10-03 微软技术许可有限责任公司 Avatar construction using depth camera
US9836590B2 (en) 2012-06-22 2017-12-05 Microsoft Technology Licensing, Llc Enhanced accuracy of user presence status determination
US8532675B1 (en) 2012-06-27 2013-09-10 Blackberry Limited Mobile communication device user interface for manipulation of data items in a physical space
US9696427B2 (en) 2012-08-14 2017-07-04 Microsoft Technology Licensing, Llc Wide angle depth detection
US10175750B1 (en) * 2012-09-21 2019-01-08 Amazon Technologies, Inc. Projected workspace
US9285893B2 (en) 2012-11-08 2016-03-15 Leap Motion, Inc. Object detection and tracking with variable-field illumination devices
US8882310B2 (en) 2012-12-10 2014-11-11 Microsoft Corporation Laser die light source module with low inductance
US9857470B2 (en) 2012-12-28 2018-01-02 Microsoft Technology Licensing, Llc Using photometric stereo for 3D environment modeling
US10609285B2 (en) 2013-01-07 2020-03-31 Ultrahaptics IP Two Limited Power consumption in motion-capture systems
US9465461B2 (en) 2013-01-08 2016-10-11 Leap Motion, Inc. Object detection and tracking with audio and optical signals
US9632658B2 (en) 2013-01-15 2017-04-25 Leap Motion, Inc. Dynamic user interactions for display control and scaling responsiveness of display objects
US9459697B2 (en) 2013-01-15 2016-10-04 Leap Motion, Inc. Dynamic, free-space user interactions for machine control
US9251590B2 (en) 2013-01-24 2016-02-02 Microsoft Technology Licensing, Llc Camera pose estimation for 3D reconstruction
US9052746B2 (en) 2013-02-15 2015-06-09 Microsoft Technology Licensing, Llc User center-of-mass and mass distribution extraction using depth images
US9940553B2 (en) 2013-02-22 2018-04-10 Microsoft Technology Licensing, Llc Camera/object pose from predicted coordinates
US9135516B2 (en) 2013-03-08 2015-09-15 Microsoft Technology Licensing, Llc User body angle, curvature and average extremity positions extraction using depth images
US9092657B2 (en) 2013-03-13 2015-07-28 Microsoft Technology Licensing, Llc Depth image processing
US9274606B2 (en) 2013-03-14 2016-03-01 Microsoft Technology Licensing, Llc NUI video conference controls
US9702977B2 (en) 2013-03-15 2017-07-11 Leap Motion, Inc. Determining positional information of an object in space
US10721448B2 (en) 2013-03-15 2020-07-21 Edge 3 Technologies, Inc. Method and apparatus for adaptive exposure bracketing, segmentation and scene organization
US9953213B2 (en) 2013-03-27 2018-04-24 Microsoft Technology Licensing, Llc Self discovery of autonomous NUI devices
US10620709B2 (en) 2013-04-05 2020-04-14 Ultrahaptics IP Two Limited Customized gesture interpretation
US9916009B2 (en) 2013-04-26 2018-03-13 Leap Motion, Inc. Non-tactile interface systems and methods
US9442186B2 (en) 2013-05-13 2016-09-13 Microsoft Technology Licensing, Llc Interference reduction for TOF systems
US9747696B2 (en) 2013-05-17 2017-08-29 Leap Motion, Inc. Systems and methods for providing normalized parameters of motions of objects in three-dimensional space
US10281987B1 (en) 2013-08-09 2019-05-07 Leap Motion, Inc. Systems and methods of free-space gestural interaction
US9721383B1 (en) 2013-08-29 2017-08-01 Leap Motion, Inc. Predictive information for free space gesture control and communication
US9462253B2 (en) 2013-09-23 2016-10-04 Microsoft Technology Licensing, Llc Optical modules that reduce speckle contrast and diffraction artifacts
US9632572B2 (en) 2013-10-03 2017-04-25 Leap Motion, Inc. Enhanced field of view to augment three-dimensional (3D) sensory space for free-space gesture interpretation
US9443310B2 (en) 2013-10-09 2016-09-13 Microsoft Technology Licensing, Llc Illumination modules that emit structured light
US9996638B1 (en) 2013-10-31 2018-06-12 Leap Motion, Inc. Predictive information for free space gesture control and communication
US9674563B2 (en) 2013-11-04 2017-06-06 Rovi Guides, Inc. Systems and methods for recommending content
US9769459B2 (en) 2013-11-12 2017-09-19 Microsoft Technology Licensing, Llc Power efficient laser diode driver circuit and method
US9508385B2 (en) 2013-11-21 2016-11-29 Microsoft Technology Licensing, Llc Audio-visual project generator
US9985943B1 (en) 2013-12-18 2018-05-29 Amazon Technologies, Inc. Automated agent detection using multiple factors
US10438225B1 (en) 2013-12-18 2019-10-08 Amazon Technologies, Inc. Game-based automated agent detection
US9971491B2 (en) 2014-01-09 2018-05-15 Microsoft Technology Licensing, Llc Gesture library for natural user input
US9613262B2 (en) 2014-01-15 2017-04-04 Leap Motion, Inc. Object detection and tracking for providing a virtual device experience
US9684369B2 (en) 2014-04-08 2017-06-20 Eon Reality, Inc. Interactive virtual reality systems and methods
US9542011B2 (en) 2014-04-08 2017-01-10 Eon Reality, Inc. Interactive virtual reality systems and methods
CN204480228U (en) 2014-08-08 2015-07-15 厉动公司 Motion sensing and imaging device
US10726625B2 (en) 2015-01-28 2020-07-28 CCP hf. Method and system for improving the transmission and processing of data regarding a multi-user virtual environment
US9852546B2 (en) 2015-01-28 2017-12-26 CCP hf. Method and system for receiving gesture input via virtual control objects
US10725297B2 (en) * 2015-01-28 2020-07-28 CCP hf. Method and system for implementing a virtual representation of a physical environment using a virtual reality environment
EP3364270A4 (en) * 2015-10-15 2018-10-31 Sony Corporation Information processing device and information processing method
US10257932B2 (en) 2016-02-16 2019-04-09 Microsoft Technology Licensing, Llc. Laser diode chip on printed circuit board
US10277836B2 (en) * 2016-03-07 2019-04-30 Ricoh Company, Ltd. Communication terminal, image management apparatus, image processing system, method for controlling display, and computer program product
US10462452B2 (en) 2016-03-16 2019-10-29 Microsoft Technology Licensing, Llc Synchronizing active illumination cameras
KR20170116437A (en) * 2016-04-11 2017-10-19 전자부품연구원 Apparatus and method for recognizing user's posture in ski simulator
US10249084B2 (en) 2016-06-10 2019-04-02 Microsoft Technology Licensing, Llc Tap event location with a selection apparatus
US10115237B2 (en) * 2016-09-28 2018-10-30 Redzone Robotics, Inc. Virtual reality display of pipe inspection data
US11875012B2 (en) 2018-05-25 2024-01-16 Ultrahaptics IP Two Limited Throwable interface for augmented reality and virtual reality environments
CN111167115A (en) * 2018-11-09 2020-05-19 致伸科技股份有限公司 Interactive game system
US11853533B1 (en) * 2019-01-31 2023-12-26 Splunk Inc. Data visualization workspace in an extended reality environment
US20210192681A1 (en) * 2019-12-18 2021-06-24 Ati Technologies Ulc Frame reprojection for virtual reality and augmented reality
CN115840507A (en) * 2022-12-20 2023-03-24 北京帮威客科技有限公司 Large-screen equipment interaction method based on 3D image control

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5495576A (en) * 1993-01-11 1996-02-27 Ritchey; Kurtis J. Panoramic image based virtual reality/telepresence audio-visual system and method
JP2534617B2 (en) 1993-07-23 1996-09-18 株式会社エイ・ティ・アール通信システム研究所 Real-time recognition and synthesis method of human image
US5594469A (en) * 1995-02-21 1997-01-14 Mitsubishi Electric Information Technology Center America Inc. Hand gesture machine control system
US5729471A (en) * 1995-03-31 1998-03-17 The Regents Of The University Of California Machine dynamic selection of one video camera/image of a scene from multiple video cameras/images of the scene in accordance with a particular perspective on the scene, an object in the scene, or an event in the scene
US5872575A (en) * 1996-02-14 1999-02-16 Digital Media Interactive Method and system for the creation of and navigation through a multidimensional space using encoded digital video

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5617312A (en) * 1993-11-19 1997-04-01 Hitachi, Ltd. Computer system that enters control information by means of video camera
US5563988A (en) * 1994-08-01 1996-10-08 Massachusetts Institute Of Technology Method and system for facilitating wireless, full-body, real-time user interaction with a digitally represented visual environment
WO1996021321A1 (en) * 1995-01-06 1996-07-11 Anderson David P Virtual reality television system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
COMMUNICATIONS OF THE ACM, Volume 35, No. 6, June 1992, CAROLINA CRUZ-NEIRA et al., "The CAVE: Audio Visual Experience Automatic Virtual Environment", pages 64-72. *

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1358632A1 (en) * 2001-01-22 2003-11-05 France Telecom Sa Avatar animating method and device for communicating in a non-immersive collaborative virtual environment
CN101237915B (en) * 2005-08-12 2012-02-29 皇家飞利浦电子股份有限公司 Interactive entertainment system and method of operation thereof
GB2470072B (en) * 2009-05-08 2014-01-01 Sony Comp Entertainment Europe Entertainment device,system and method
GB2470072A (en) * 2009-05-08 2010-11-10 Sony Comp Entertainment Europe Virtual object movement in response to real object movement
US9901828B2 (en) 2010-03-30 2018-02-27 Sony Interactive Entertainment America Llc Method for an augmented reality character to maintain and exhibit awareness of an observer
EP2410401A1 (en) * 2010-06-04 2012-01-25 Thomson Licensing Method for selection of an object in a virtual environment
US9298346B2 (en) 2010-06-04 2016-03-29 Thomson Licensing Method for selection of an object in a virtual environment
FR2960986A1 (en) * 2010-06-04 2011-12-09 Thomson Licensing Method for selecting an object in a virtual environment
CN102426480A (en) * 2011-11-03 2012-04-25 康佳集团股份有限公司 Man-machine interactive system and real-time gesture tracking processing method for same
US9330478B2 (en) 2012-02-08 2016-05-03 Intel Corporation Augmented reality creation using a real scene
US9635159B2 (en) 2012-05-08 2017-04-25 Nokia Technologies Oy Method and apparatus for providing immersive interaction via everyday devices
WO2014137673A1 (en) * 2013-03-03 2014-09-12 Microsoft Corporation Enhanced presentation environments
US10937240B2 (en) 2018-01-04 2021-03-02 Intel Corporation Augmented reality bindings of physical objects and virtual objects
WO2020124253A1 (en) * 2018-12-19 2020-06-25 Cae Inc. Dynamically adjusting image characteristics in real-time

Also Published As

Publication number Publication date
EP0976106A2 (en) 2000-02-02
WO1999034327A3 (en) 1999-09-10
KR20000075608A (en) 2000-12-26
JP2001517344A (en) 2001-10-02
US6181343B1 (en) 2001-01-30

Similar Documents

Publication Publication Date Title
US6181343B1 (en) System and method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs
US6195104B1 (en) System and method for permitting three-dimensional navigation through a virtual reality environment using camera-based gesture inputs
US9910509B2 (en) Method to control perspective for a camera-controlled computer
KR102641655B1 (en) Select virtual objects in 3D space
US9776083B2 (en) Spatially-correlated multi-display human-machine interface
US9550124B2 (en) Projection of an interactive environment
US8223147B1 (en) Method and system for vision-based interaction in a virtual environment
Leibe et al. The perceptive workbench: Toward spontaneous and natural interaction in semi-immersive virtual environments
Leibe et al. Toward spontaneous interaction with the perceptive workbench
EP0961965A2 (en) Method and system for gesture based option selection
WO2015139002A1 (en) Gaming device with volumetric sensing
JPH1118025A (en) Image display device
US11847745B1 (en) Collision avoidance system for head mounted display utilized in room scale virtual reality system
KR20140043522A (en) Apparatus and method for controlling of transparent both-sided display
WO2006108279A1 (en) Method and apparatus for virtual presence
Webel et al. Immersive experience of current and ancient reconstructed cultural attractions
Freeman et al. Computer vision for computer interaction
JP3341734B2 (en) Video display device
US5603507A (en) Method of input selection in an electronic game system
Li Development of immersive and interactive virtual reality environment for two-player table tennis
Pinhanez Creating ubiquitous interactive games using everywhere displays projectors
JP2000039949A (en) Video display device
WO2016057997A1 (en) Support based 3d navigation
YAN Tangible user interfaces for augmented reality
Singletary et al. Toward Spontaneous Interaction with the Perceptive Workbench

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): JP KR

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE

WWE Wipo information: entry into national phase

Ref document number: 1998955857

Country of ref document: EP

ENP Entry into the national phase

Ref country code: JP

Ref document number: 1999 534683

Kind code of ref document: A

Format of ref document f/p: F

WWE Wipo information: entry into national phase

Ref document number: 1019997007669

Country of ref document: KR

121 Ep: the epo has been informed by wipo that ep was designated in this application
AK Designated states

Kind code of ref document: A3

Designated state(s): JP KR

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE

WWP Wipo information: published in national office

Ref document number: 1998955857

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1019997007669

Country of ref document: KR

WWW Wipo information: withdrawn in national office

Ref document number: 1998955857

Country of ref document: EP

WWR Wipo information: refused in national office

Ref document number: 1019997007669

Country of ref document: KR