WO2008045153A2 - Interaction based on facial recognition of conference participants - Google Patents

Interaction based on facial recognition of conference participants Download PDF

Info

Publication number
WO2008045153A2
WO2008045153A2 PCT/US2007/017569 US2007017569W WO2008045153A2 WO 2008045153 A2 WO2008045153 A2 WO 2008045153A2 US 2007017569 W US2007017569 W US 2007017569W WO 2008045153 A2 WO2008045153 A2 WO 2008045153A2
Authority
WO
WIPO (PCT)
Prior art keywords
facial image
participant
person
user
information
Prior art date
Application number
PCT/US2007/017569
Other languages
French (fr)
Other versions
WO2008045153A3 (en
Inventor
Labhesh Patel
Mukul Jain
Joseph F. Khouri
Original Assignee
Cisco Technology, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Cisco Technology, Inc. filed Critical Cisco Technology, Inc.
Priority to CN2007800300803A priority Critical patent/CN101502088B/en
Priority to EP07811156A priority patent/EP2055088B1/en
Publication of WO2008045153A2 publication Critical patent/WO2008045153A2/en
Publication of WO2008045153A3 publication Critical patent/WO2008045153A3/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/15Conference systems

Definitions

  • This disclosure relates generally to the fields of biometrics and video conferencing.
  • interruptions for the purpose of identifying a participant may occur during the video conference session.
  • a participant may want to interact with another participant, for example, to privately ask a question to someone who was just speaking in the meeting, or to start an instant messaging (IM) session with that person to privately discuss the issue before sharing it with everybody in the conference.
  • IM instant messaging
  • Figure 1 illustrates an example video conferencing system.
  • Figure 2 illustrates an example display of conference participants on an endpoint device.
  • Figure 3 illustrates an example graphical user interface
  • Figure 4 illustrates an example method of operation for a video conferencing system.
  • Figure 5 illustrates an example method for obtaining identity information about a selected conference participant.
  • Figure 6 illustrates basic components of an example node.
  • a computer network is a geographically distributed collection of interconnected subnetworks for transporting data between nodes, such as intermediate nodes and end nodes (also referred to as endpoints).
  • a local area network is an example of such a subnetwork; a plurality of LANs may be further interconnected by an intermediate network node, such as a router, bridge, or switch, to extend the effective "size" of the computer network and increase the number of communicating nodes.
  • an intermediate network node such as a router, bridge, or switch
  • Examples of the endpoint devices or nodes may include servers and personal computers.
  • the nodes typically communicate by exchanging discrete frames or packets of data according to predefined protocols.
  • an endpoint represents an end user, client, or person who is capable of participating in an audio conference session via conferencing system.
  • Endpoint devices that may be used to initiate or participate in a conference session include a personal digital assistant (PDA); a personal computer (PC), such as notebook, laptop, or desktop computer; an audio/video appliance; a streaming client; a television device with built-in camera and microphone; or any other device, component, element, or object capable of initiating or participating in exchanges with a video conferencing system.
  • PDA personal digital assistant
  • PC personal computer
  • audio/video appliance such as notebook, laptop, or desktop computer
  • streaming client such as a streaming client
  • television device with built-in camera and microphone such as any other device, component, element, or object capable of initiating or participating in exchanges with a video conferencing system.
  • Figure 6 illustrates basic components of an example node
  • FIG. 60 which typically comprises a number of basic subsystems that includes a processor subsystem 61 , a main memory 62 and an input/output (I/O) subsystem 65.
  • Data is transferred between main memory (“system memory”) 62 and processor subsystem 61 over a memory bus 63, and between the processor and I/O subsystems over a system bus 66.
  • Examples of the system bus may include the conventional lightning data transport (or hyper transport) bus and the conventional peripheral component interconnect (PCI) bus.
  • Node 60 may also comprise other hardware units/modules 64 coupled to system bus 66 for performing additional functions.
  • Processor subsystem 11 may comprise one or more processors and a controller device that incorporates a set of functions including a system memory controller, support for one or more system buses and direct memory access (DMA) engines.
  • DMA direct memory access
  • Video conference server 20 includes software (or firmware) plug-ins, modules, or enhancements that implement the various features and functions described herein.
  • video conferencing server 20 is shown coupled with a database 23 that is used to store information (e.g., name, ID number, department, phone number, email address, etc.) about employees and other individuals associated with the organization for which the conference session is being held.
  • Database 23 also stores image data of the faces of persons within (and possibly outside of) the organization.
  • video conference server 20 includes a digital signal processor (DSP) or firmware/software-based system that mixes and/or switches audio / video signals received at its input ports under the control of server 20.
  • DSP digital signal processor
  • the audio / video signals received at the conference server ports originate from each of the conference or meeting participants (e.g., individual conference participants using endpoint devices 12 & 14), and possibly from an interactive voice response (IVR) system (not shown).
  • Conference server 20 may also incorporate or be associated with a natural language automatic speech recognition (ASR) module for interpreting and parsing speech of the participants, and standard speech-to-text (STT) and text-to- speech (TTS) converter modules.
  • ASR natural language automatic speech recognition
  • STT standard speech-to-text
  • TTS text-to- speech
  • the media path for the conference participants may include audio / video transmissions, e.g., Real-Time Transport Protocol (RTP) packets sent across a variety of different networks (e.g., Internet, intranet, PSTN, etc.), protocols (e.g., IP, Asynchronous Transfer Mode (ATM), Point-to-Point Protocol (PPP)), with connections that span across multiple services, systems, and devices.
  • RTP Real-Time Transport Protocol
  • networks e.g., Internet, intranet, PSTN, etc.
  • protocols e.g., IP, Asynchronous Transfer Mode (ATM), Point-to-Point Protocol (PPP)
  • System 10 also shows a user 11 participating in a videoconference via endpoint 12, which comprises a PC with a camera 13.
  • the main conference includes a group 18 connected with video conferencing server 20 via endpoint 14, which includes a camera 15, microphone 16 and display monitor 17.
  • video conferencing server 20 includes a facial recognition (FR) module 22 (e.g., software) that may be utilized to pick someone's face out of an image, extract that face from the rest of the scene, and compare it to a catalog of facial images stored in database 23.
  • FR module 22 may comprise a software program such as the FaceltTM sold by Identix, Inc., of Minnetonka, Minnesota, or facial recognition programs sold by Viisage Technology of Littleton, Massachusetts. These programs analyze images of human faces for the purpose of identifying them.
  • FR module 22 may take a captured (“snapshot") facial image of a conference participant and measure characteristics such as the distance between the eyes, the length of the nose, the outline of the face, and the angle of the jaw, and create a unique "template" of the facial image. This template may then be stored in database 23 as an object. Using templates, module 22 may compare a captured image with a stored image (e.g., a pre-existing photo from database 23) and produce a score that measures how similar the images are to each other. That is, module 22 works by comparing facial, landmarks, i.e., specific proportions and angles of defined facial features.
  • a stored image e.g., a pre-existing photo from database 23
  • different snapshots may be taken of a conference participant as the person slightly moves (or by taking a much smaller number of snapshots from a stereo camera) and comparing it against a known database of facial eigenvectors.
  • Database 23 may store facial eigenvectors (a series of facial pictures from different angles that can uniquely identify a person) along with the profile of each person.
  • the system matches each of the outlined participant faces in the video conference meeting against the stored catalogue or corporate directory in order to identify the participants. This matching may occur either at the start of a conference session as each person or group joins in, or whenever a participant selects a facial image of another participant via a user interface running on an endpoint device that connects with server 20.
  • FR module 22 is utilized in communication with a graphical user interface running on a user's endpoint device (e.g., PC 12) that allows the user to position a mouse or other cursor control device on the image of a person's face during a video conference. The user may then "click" on that image (i.e., object) to obtain identity information about that participant. The user interface then communicates the facial image information to FR module 22, which then performs a lookup to database 23 in an attempt to match the selected person's face.
  • a graphical user interface running on a user's endpoint device (e.g., PC 12) that allows the user to position a mouse or other cursor control device on the image of a person's face during a video conference. The user may then "click" on that image (i.e., object) to obtain identity information about that participant. The user interface then communicates the facial image information to FR module 22, which then performs a lookup to database 23 in an attempt to match the selected person's face.
  • the interface may be used to initiate interactions with the selected participant in a video conference session -for example, by starting a communication session (e.g., IM, email, or telephony) with the selected participant via video conferencing server 20.
  • a communication session e.g., IM, email, or telephony
  • the video conference session may be either live or recorded in order to start a communication session (such as IM or email or telephony session) with a selected person.
  • Figure 2 illustrates an example display of the conference participants in group 18 on a display screen of endpoint device 12. Group 18 appears in a window 28 of the GUI running on PC 12.
  • the outline of each participant to the video conferencing is converted into an active object that a user of the GUI can select (e.g., "click” on) in order to obtain information about that person.
  • a cursor arrow 25 is shown positioned by a user over the face of the participant sitting on the far right-hand side of the conference table in group 18. The user, for instance, may utilize a cursor device or mouse to encircle or "lasso" a person's face in order to convert the facial image into an active object.
  • server 20 executes a routine or procedure wherein each new participant to a video conference has their facial image automatically captured and converted into a "clickable" object when they join the conference session. Once a participant's facial image has been captured, module 22 of server 20 may attempt to match that image within database 23 in response to a user selecting or clicking on that facial image (i.e., when cursor arrow 25 is positioned over the person's face).
  • GUI may be generated by software (i.e., code) running the user's PC.
  • the GUI may comprise a collaborative web-based application that is accessed by the browser software running on the user's PC.
  • GUI may comprise a downloaded application, or other forms of computer-executable code that may be loaded or accessed by a participant's PC.
  • the software code for implementing the GUI may be executed on server 20 and accessed by users who want to utilize the features provided therein.
  • Figure 3 illustrates an example graphical user interface
  • GUI window 30 produced on a user's PC or other endpoint device in response to the user clicking on a conference participant's facial image.
  • GUI virtual reality
  • the system displays a photo 32 (either copied from the video window or reproduced from file storage) of that person along with various information about that person.
  • the information includes the participant's name, company division, site location, office phone number, manager's name, and email address. All of this information is associated with that person's image as stored in the corporate directory / database.
  • the information is presented in a data field 33 of window 30.
  • command button 34 returns the user to the video stream (e.g., window 28 of Figure 2) of the ongoing (or recorded) conference session.
  • Selection of command button 35 causes any attachments that participant 32 may have submitted for group viewing / discussion to be displayed to the user. For instance, each participant may have been invited to submit a report on a certain subject or topic before joining the meeting.
  • Command button 35 allows a user to quickly view the relevant document(s) submitted by a particular participant.
  • selection of command button 36 causes the system to initiate communications between the user and participant 32. Communications may be through a predetermined media channel (e.g., IM) or selectable via a pull-down menu that presents the user with various communication options (e.g., IM, email, phone, etc.).
  • the actual contact details such as
  • IM, telephone number, or email address of one or more conference participants may be suppressed or hidden from view during certain types of conferences or meetings. For example, in a large conference call or public meeting individual participants may choose to keep their contact information private to all participants. Additionally, certain participants (e.g., a CEO participating in a public briefing) may choose to disable user interface-initiated communications entirely. Both of these features may be implemented by a user preference setting that hides or masks identification information of that user during a conference session. [0026] .
  • the conference server may instead automatically generate a temporary mapping or anonymous link to a communication device associated with that person, such as ⁇ participant1@conference.server>, e.g., based on their order of joining the conference.
  • a communication device associated with that person
  • the participant who is requesting communications with another conference attendee only sees the temporary mapping details - which may still be used to contact that attendee - but without actually revealing the personal contact details of that participant.
  • communications may be initiated with a selected conference participant via a user interface without revealing personal contact information about that participant.
  • different embodiments may execute different routines / approaches for establishing communications between the user and the selected meeting participant.
  • the user may be given an option to start a telephony session with a conference participant immediately after the user clicks on the outline of that participant in the video window.
  • the user can drag and drop the outline of a meeting participant from the video window onto a soft phone running on his PC in order to start a telephony session with that person.
  • the user may start an IM session with a meeting participant by clicking on the outline of that participant in the video conference window.
  • the user can start an IM session with a meeting participant by dragging and dropping the outline of that participant from the video conference window on to the IM client running on his PC.
  • the user can add a video conference participant on to an ongoing IM conference chat conference by dragging the outline of that person from the video conference window on to the IM conference window.
  • the system opens a new email message composition for a video conference participant and automatically fills in the selected participant's email address when the user clicks on the outline of that participant in the video conference window, or drags the outline of that participant's facial image from the video conference window onto an email client such as Microsoft OutlookTM.
  • the conferencing system may schedule a new meeting between the user and a selected video conference participant when the user drags the outlined image of the participant's face onto a scheduling application such as MeetingMakerTM running on the user's PC desktop.
  • the system may generate and send an invitation to a conference session participant to join another existing meeting when a user clicks on the facial image of that participant, or drags the participant's outline from the video conference window onto a scheduled meeting in his meeting scheduling application.
  • the GUI may be configured to allow a user to drag a file (such as document containing meeting notes) onto the facial image of a participant in the video conference window in order to send (e.g., email) that file to that participant.
  • Figure 4 illustrates an example method of operation for a video conferencing system. The process begins when a video conference participant dials into a conference session from an endpoint device (step 41). Immediately upon joining the session, the system automatically captures an image of the participant's face (step 42), and then accesses a directory database in an attempt to match the person's face with a stored image (step 43).
  • the identity information of that person is attached to an object associated with that image, obviating the need for a database query when a user later clicks on or otherwise selects that person's image.
  • the participant's identity may optionally be confirmed (step 45).
  • the confirmation process may simply involve an IVR query sent to the participant (e.g., "Are you James Smith of the Video Technology Group?).
  • the system may send a query to the participant requesting that he identify himself (block 46).
  • Certain conferencing system may be configured with authentication / identification procedures to insure that only authorized persons are admitted to a particular conference session. In such cases, the system may utilize a participant's name, badge number, or other unique identifier (e.g., fingerprint) to locate further information about that person from the corporate directory database.
  • the system may create a clickable object from the captured facial image, essentially transforming the facial image into a user selectable object for purposes of user interface-based association during the conference session. Additionally, the system may update the database by adding the recently captured image to the database entry associated with the participant.
  • a new database entry in the directory may be added to include that participant's image and any available identification or contact information (block 49).
  • any available identification or contact information for example, guests, vendors, or contractors may be invited to participate in a conference session.
  • each such participant may be requested / queried to provide certain basic information to be stored along with their image.
  • Figure 4 is also applicable to groups who dial into a conference session from a single endpoint device. Rather than capture the facial image of a single participant and compare it to a directory of stored images, the system would perform the same operations on each participant in that group. In other words, each of the steps outlined in Figure 4 would be automatically applied to each individual of the group attending the meeting.
  • the video conferencing system queries the database in response to a user of the GUI clicking on the image of a participant to a conference session. That is, the system does not attempt to match the facial image of a conference participant captured in the video feed against images stored in the corporate database until a particular participant has been selected by a user.
  • this may involve the user having to draw a boundary around or using a lasso tool to frame a participant's face to cause the system to create an object for that face and then initiate a query to the database in an attempt to match the facial image with a known person.
  • this latter embodiment operates reactively to a user selection / request for information identifying and associated with a particular participant.
  • Figure 5 illustrates an example method for obtaining identity information about a selected conference participant in accordance with the latter described embodiment.
  • the conferencing system makes no attempt to identify participants to the meeting until a user outlines a particular participant's face, for example, by using a GUI to click on the person's face, lasso their face, draw a boundary around their face, or using other similar techniques / methods (block 52).
  • the system sends a query to the database that attempts to match the facial image captured in the video feed with an image stored in the directory database. If the match is successful (block 56), the user is provided with detailed information about the selected participant (block 56). This information may be presented in a variety of ways; for example, as shown in Figure 3.
  • the database query is unsuccessful, i.e., no match is found, the user may be notified of the failed result (block 55).
  • the system may alternatively present the user with a list of different communication modes available for interacting with the participant.
  • the GUI may open a window that presents the user with the option of interacting via IM, email, telephone connection, etc.
  • the user may then click on one of the available communication modes.
  • the system establishes communication with the conference participant via the selected mode. For instance, if the user selects IM from the list presented, the system automatically establishes an IM session between the conference participant and the user via an appropriate communication channel, link, or connection.
  • elements of the present invention may also be provided as a computer program product which may include a machine-readable medium having stored thereon instructions which may be used to program a computer (e.g., a processor or other electronic device) to perform a sequence of operations. Alternatively, the operations may be performed by a combination of hardware and software.
  • the machine-readable medium may include, but is not limited to, floppy diskettes, optical disks, CD-ROMs, and magneto- optical disks, ROMs, RAMs, EPROMs, EEPROMs, magnet or optical cards, propagation media or other type of media/machine-readable medium suitable for storing electronic instructions.
  • elements of the present invention may be downloaded as a computer program product, wherein the program may be transferred from a remote computer or telephonic device to a requesting process by way of data signals embodied in a carrier wave or other propagation medium via a communication link (e.g., a modem or network connection).
  • a communication link e.g., a modem or network connection

Abstract

In one embodiment, a method includes capturing a facial image (32, Fig. 3) of a participant to a video conference session (10, Fig. 1) from a video stream of the video conference session. A database (23, Fig. 1) containing a plurality a stored facial images is then accessed in an attempt to match the facial image with one of the stored facial images. Each stored facial image is associated with information in the database associated with a person, the information being retrieved in the event of a match. It is emphasized that this abstract is provided to comply with the rules requiring an abstract that will allow a searcher or other reader to quickly ascertain the subject matter of the technical disclosure.

Description

INTEERACTION BASED ON FACIAL RECOGNITION OF CONFERENCE PARTICIPANTS
TECHNICAL FIELD
[0001] This disclosure relates generally to the fields of biometrics and video conferencing.
BACKGROUND
[0002] In many cases the individual participants to a video conference session do not know or recognize each other. Typically, this requires some sort of introduction period at the start of the video conference. Alternatively, interruptions for the purpose of identifying a participant may occur during the video conference session. A participant may want to interact with another participant, for example, to privately ask a question to someone who was just speaking in the meeting, or to start an instant messaging (IM) session with that person to privately discuss the issue before sharing it with everybody in the conference. In either scenario, interruptions or introductions for the purpose of sharing identity information are time consuming, inconvenient, and disruptive to normal meeting discussions.
BRIEF DESCRIPTION OF THE DRAWINGS
[0003] The present invention will be understood more fully from the detailed description that follows and from the accompanying drawings, which however, should not be taken to limit the invention to the specific embodiments shown, but are for explanation and understanding only.
[0004] Figure 1 illustrates an example video conferencing system.
[0005] Figure 2 illustrates an example display of conference participants on an endpoint device.
[0006] Figure 3 illustrates an example graphical user interface
(GUI).
[0007] Figure 4 illustrates an example method of operation for a video conferencing system.
[0008] Figure 5 illustrates an example method for obtaining identity information about a selected conference participant.
[0009] Figure 6 illustrates basic components of an example node.
DESCRIPTION OF EXAMPLE EMBODIMENTS [0010] In the following description specific details are set forth, such as device types, system configurations, device types, communication methods, etc., in order to provide a thorough understanding of the present invention. However, persons having ordinary skill in the relevant arts will appreciate that these specific details may not be needed to practice the present invention. [0011] In the context of the present application, a computer network is a geographically distributed collection of interconnected subnetworks for transporting data between nodes, such as intermediate nodes and end nodes (also referred to as endpoints). A local area network (LAN) is an example of such a subnetwork; a plurality of LANs may be further interconnected by an intermediate network node, such as a router, bridge, or switch, to extend the effective "size" of the computer network and increase the number of communicating nodes. Examples of the endpoint devices or nodes may include servers and personal computers. The nodes typically communicate by exchanging discrete frames or packets of data according to predefined protocols. [0012] In general, an endpoint represents an end user, client, or person who is capable of participating in an audio conference session via conferencing system. Endpoint devices that may be used to initiate or participate in a conference session include a personal digital assistant (PDA); a personal computer (PC), such as notebook, laptop, or desktop computer; an audio/video appliance; a streaming client; a television device with built-in camera and microphone; or any other device, component, element, or object capable of initiating or participating in exchanges with a video conferencing system.
[0013] Figure 6 illustrates basic components of an example node
60, which typically comprises a number of basic subsystems that includes a processor subsystem 61 , a main memory 62 and an input/output (I/O) subsystem 65. Data is transferred between main memory ("system memory") 62 and processor subsystem 61 over a memory bus 63, and between the processor and I/O subsystems over a system bus 66. Examples of the system bus may include the conventional lightning data transport (or hyper transport) bus and the conventional peripheral component interconnect (PCI) bus. Node 60 may also comprise other hardware units/modules 64 coupled to system bus 66 for performing additional functions. Processor subsystem 11 may comprise one or more processors and a controller device that incorporates a set of functions including a system memory controller, support for one or more system buses and direct memory access (DMA) engines. [0014] Figure 1 illustrates an example video conferencing system
10 that includes a video conferencing server 20 located on an Internet protocol (IP) network 21. Video conference server 20 includes software (or firmware) plug-ins, modules, or enhancements that implement the various features and functions described herein. In this example, video conferencing server 20 is shown coupled with a database 23 that is used to store information (e.g., name, ID number, department, phone number, email address, etc.) about employees and other individuals associated with the organization for which the conference session is being held. Database 23 also stores image data of the faces of persons within (and possibly outside of) the organization.
[0015] In the embodiment shown, video conference server 20 includes a digital signal processor (DSP) or firmware/software-based system that mixes and/or switches audio / video signals received at its input ports under the control of server 20. The audio / video signals received at the conference server ports originate from each of the conference or meeting participants (e.g., individual conference participants using endpoint devices 12 & 14), and possibly from an interactive voice response (IVR) system (not shown). Conference server 20 may also incorporate or be associated with a natural language automatic speech recognition (ASR) module for interpreting and parsing speech of the participants, and standard speech-to-text (STT) and text-to- speech (TTS) converter modules.
[0016] It is appreciated that in different specific implementations the media path for the conference participants may include audio / video transmissions, e.g., Real-Time Transport Protocol (RTP) packets sent across a variety of different networks (e.g., Internet, intranet, PSTN, etc.), protocols (e.g., IP, Asynchronous Transfer Mode (ATM), Point-to-Point Protocol (PPP)), with connections that span across multiple services, systems, and devices.
[0017] System 10 also shows a user 11 participating in a videoconference via endpoint 12, which comprises a PC with a camera 13. In this example the main conference includes a group 18 connected with video conferencing server 20 via endpoint 14, which includes a camera 15, microphone 16 and display monitor 17. In accordance with one embodiment, video conferencing server 20 includes a facial recognition (FR) module 22 (e.g., software) that may be utilized to pick someone's face out of an image, extract that face from the rest of the scene, and compare it to a catalog of facial images stored in database 23. By way of example^ FR module 22 may comprise a software program such as the Facelt™ sold by Identix, Inc., of Minnetonka, Minnesota, or facial recognition programs sold by Viisage Technology of Littleton, Massachusetts. These programs analyze images of human faces for the purpose of identifying them.
[0018] In the embodiment of Figure 1 , FR module 22 may take a captured ("snapshot") facial image of a conference participant and measure characteristics such as the distance between the eyes, the length of the nose, the outline of the face, and the angle of the jaw, and create a unique "template" of the facial image. This template may then be stored in database 23 as an object. Using templates, module 22 may compare a captured image with a stored image (e.g., a pre-existing photo from database 23) and produce a score that measures how similar the images are to each other. That is, module 22 works by comparing facial, landmarks, i.e., specific proportions and angles of defined facial features. [0019] In certain implementations, different snapshots may be taken of a conference participant as the person slightly moves (or by taking a much smaller number of snapshots from a stereo camera) and comparing it against a known database of facial eigenvectors. Database 23 may store facial eigenvectors (a series of facial pictures from different angles that can uniquely identify a person) along with the profile of each person. In one embodiment, the system matches each of the outlined participant faces in the video conference meeting against the stored catalogue or corporate directory in order to identify the participants. This matching may occur either at the start of a conference session as each person or group joins in, or whenever a participant selects a facial image of another participant via a user interface running on an endpoint device that connects with server 20.
[0020] In one embodiment, FR module 22 is utilized in communication with a graphical user interface running on a user's endpoint device (e.g., PC 12) that allows the user to position a mouse or other cursor control device on the image of a person's face during a video conference. The user may then "click" on that image (i.e., object) to obtain identity information about that participant. The user interface then communicates the facial image information to FR module 22, which then performs a lookup to database 23 in an attempt to match the selected person's face. In another embodiment, the interface may be used to initiate interactions with the selected participant in a video conference session -for example, by starting a communication session (e.g., IM, email, or telephony) with the selected participant via video conferencing server 20. It is appreciated that the video conference session may be either live or recorded in order to start a communication session (such as IM or email or telephony session) with a selected person. [0021] Figure 2 illustrates an example display of the conference participants in group 18 on a display screen of endpoint device 12. Group 18 appears in a window 28 of the GUI running on PC 12. In one embodiment, the outline of each participant to the video conferencing is converted into an active object that a user of the GUI can select (e.g., "click" on) in order to obtain information about that person. By way of example, a cursor arrow 25 is shown positioned by a user over the face of the participant sitting on the far right-hand side of the conference table in group 18. The user, for instance, may utilize a cursor device or mouse to encircle or "lasso" a person's face in order to convert the facial image into an active object. In another embodiment, server 20 executes a routine or procedure wherein each new participant to a video conference has their facial image automatically captured and converted into a "clickable" object when they join the conference session. Once a participant's facial image has been captured, module 22 of server 20 may attempt to match that image within database 23 in response to a user selecting or clicking on that facial image (i.e., when cursor arrow 25 is positioned over the person's face).
[0022] Practitioners in the art will understand that the GUI may be generated by software (i.e., code) running the user's PC. In other cases, the GUI may comprise a collaborative web-based application that is accessed by the browser software running on the user's PC. In other instances, the GUI may comprise a downloaded application, or other forms of computer-executable code that may be loaded or accessed by a participant's PC. For instance, the software code for implementing the GUI may be executed on server 20 and accessed by users who want to utilize the features provided therein. [0023] Figure 3 illustrates an example graphical user interface
(GUI) window 30 produced on a user's PC or other endpoint device in response to the user clicking on a conference participant's facial image. In the embodiment shown, when a user clicks oh a meeting participant in the video window the system displays a photo 32 (either copied from the video window or reproduced from file storage) of that person along with various information about that person. In this case, the information includes the participant's name, company division, site location, office phone number, manager's name, and email address. All of this information is associated with that person's image as stored in the corporate directory / database. The information is presented in a data field 33 of window 30.
[0024] The bottom of window 30 is shown including three example command buttons (icons) 34-36 that may be clicked-on or otherwise selected by a user. Command button 34 returns the user to the video stream (e.g., window 28 of Figure 2) of the ongoing (or recorded) conference session. Selection of command button 35 causes any attachments that participant 32 may have submitted for group viewing / discussion to be displayed to the user. For instance, each participant may have been invited to submit a report on a certain subject or topic before joining the meeting. Command button 35 allows a user to quickly view the relevant document(s) submitted by a particular participant. Lastly, selection of command button 36 causes the system to initiate communications between the user and participant 32. Communications may be through a predetermined media channel (e.g., IM) or selectable via a pull-down menu that presents the user with various communication options (e.g., IM, email, phone, etc.).
[0025] In another embodiment, the actual contact details such as
IM, telephone number, or email address of one or more conference participants may be suppressed or hidden from view during certain types of conferences or meetings. For example, in a large conference call or public meeting individual participants may choose to keep their contact information private to all participants. Additionally, certain participants (e.g., a CEO participating in a public briefing) may choose to disable user interface-initiated communications entirely. Both of these features may be implemented by a user preference setting that hides or masks identification information of that user during a conference session. [0026] . In a slightly different embodiment, rather than revealing contact information via a user interface the conference server may instead automatically generate a temporary mapping or anonymous link to a communication device associated with that person, such as <participant1@conference.server>, e.g., based on their order of joining the conference. In this manner, the participant who is requesting communications with another conference attendee only sees the temporary mapping details - which may still be used to contact that attendee - but without actually revealing the personal contact details of that participant. In other words, communications may be initiated with a selected conference participant via a user interface without revealing personal contact information about that participant. [0027] It is appreciated that different embodiments may execute different routines / approaches for establishing communications between the user and the selected meeting participant. For instance, in one embodiment, the user may be given an option to start a telephony session with a conference participant immediately after the user clicks on the outline of that participant in the video window. In another embodiment, the user can drag and drop the outline of a meeting participant from the video window onto a soft phone running on his PC in order to start a telephony session with that person.
[0028] In another embodiment, the user may start an IM session with a meeting participant by clicking on the outline of that participant in the video conference window. In still another embodiment, the user can start an IM session with a meeting participant by dragging and dropping the outline of that participant from the video conference window on to the IM client running on his PC. Alternatively, the user can add a video conference participant on to an ongoing IM conference chat conference by dragging the outline of that person from the video conference window on to the IM conference window.
[0029] In yet another embodiment, the system opens a new email message composition for a video conference participant and automatically fills in the selected participant's email address when the user clicks on the outline of that participant in the video conference window, or drags the outline of that participant's facial image from the video conference window onto an email client such as Microsoft Outlook™. [0030] In another embodiment, the conferencing system may schedule a new meeting between the user and a selected video conference participant when the user drags the outlined image of the participant's face onto a scheduling application such as MeetingMaker™ running on the user's PC desktop. In a slight variation of this embodiment, the system may generate and send an invitation to a conference session participant to join another existing meeting when a user clicks on the facial image of that participant, or drags the participant's outline from the video conference window onto a scheduled meeting in his meeting scheduling application.
[0031] In still another embodiment, the GUI may be configured to allow a user to drag a file (such as document containing meeting notes) onto the facial image of a participant in the video conference window in order to send (e.g., email) that file to that participant. [0032] Figure 4 illustrates an example method of operation for a video conferencing system. The process begins when a video conference participant dials into a conference session from an endpoint device (step 41). Immediately upon joining the session, the system automatically captures an image of the participant's face (step 42), and then accesses a directory database in an attempt to match the person's face with a stored image (step 43). Once the system matches or recognizes a person's facial image in a meeting, the identity information of that person is attached to an object associated with that image, obviating the need for a database query when a user later clicks on or otherwise selects that person's image. In the event that a match is made, the participant's identity may optionally be confirmed (step 45). The confirmation process may simply involve an IVR query sent to the participant (e.g., "Are you James Smith of the Video Technology Group?).
[0033] In the event that a match to the captured facial image is not found, the system may send a query to the participant requesting that he identify himself (block 46). Certain conferencing system may be configured with authentication / identification procedures to insure that only authorized persons are admitted to a particular conference session. In such cases, the system may utilize a participant's name, badge number, or other unique identifier (e.g., fingerprint) to locate further information about that person from the corporate directory database. Instead of trying to match the facial image of that participant with a stored or archived facial image, the system may create a clickable object from the captured facial image, essentially transforming the facial image into a user selectable object for purposes of user interface-based association during the conference session. Additionally, the system may update the database by adding the recently captured image to the database entry associated with the participant.
[0034] Continuing with the example of Figure 4, in the event that a participant identifies himself and the system determines that that person is not already in the corporate database (block 47), a new database entry in the directory may be added to include that participant's image and any available identification or contact information (block 49). For example, guests, vendors, or contractors may be invited to participate in a conference session. In order to facilitate interaction, each such participant may be requested / queried to provide certain basic information to be stored along with their image. In cases where a person such as a contractor has previously provided information about himself to the organization (e.g., name, street address, company affiliation, email address, phone number, etc.), yet who has never attended a video conference of that organization, his captured facial image may be added to his other information entered in the corporate database. Similarly, in the case where a participant's captured image does not match the facial image stored in the database entry for that person, the directory database may be updated by replacing the older image with the newly captured facial image (block 48).
[0035] It is appreciated that the example of Figure 4 is also applicable to groups who dial into a conference session from a single endpoint device. Rather than capture the facial image of a single participant and compare it to a directory of stored images, the system would perform the same operations on each participant in that group. In other words, each of the steps outlined in Figure 4 would be automatically applied to each individual of the group attending the meeting. [0036] In another embodiment, the video conferencing system queries the database in response to a user of the GUI clicking on the image of a participant to a conference session. That is, the system does not attempt to match the facial image of a conference participant captured in the video feed against images stored in the corporate database until a particular participant has been selected by a user. In certain implementations this may involve the user having to draw a boundary around or using a lasso tool to frame a participant's face to cause the system to create an object for that face and then initiate a query to the database in an attempt to match the facial image with a known person. To put it differently, whereas the previously described embodiment operates proactively to identify everyone in the video conference session at the start, this latter embodiment operates reactively to a user selection / request for information identifying and associated with a particular participant.
[0037] Figure 5 illustrates an example method for obtaining identity information about a selected conference participant in accordance with the latter described embodiment. After the video conference starts (block 51 ) the conferencing system makes no attempt to identify participants to the meeting until a user outlines a particular participant's face, for example, by using a GUI to click on the person's face, lasso their face, draw a boundary around their face, or using other similar techniques / methods (block 52). At that point, the system sends a query to the database that attempts to match the facial image captured in the video feed with an image stored in the directory database. If the match is successful (block 56), the user is provided with detailed information about the selected participant (block 56). This information may be presented in a variety of ways; for example, as shown in Figure 3. On the other hand, if the database query is unsuccessful, i.e., no match is found, the user may be notified of the failed result (block 55).
[0038] Instead of presenting a user with detailed information about a selected participant, the system may alternatively present the user with a list of different communication modes available for interacting with the participant. For example, the GUI may open a window that presents the user with the option of interacting via IM, email, telephone connection, etc. Using a mouse or other input device, the user may then click on one of the available communication modes. In response, the system establishes communication with the conference participant via the selected mode. For instance, if the user selects IM from the list presented, the system automatically establishes an IM session between the conference participant and the user via an appropriate communication channel, link, or connection.
[0039] It should be understood that elements of the present invention may also be provided as a computer program product which may include a machine-readable medium having stored thereon instructions which may be used to program a computer (e.g., a processor or other electronic device) to perform a sequence of operations. Alternatively, the operations may be performed by a combination of hardware and software. The machine-readable medium may include, but is not limited to, floppy diskettes, optical disks, CD-ROMs, and magneto- optical disks, ROMs, RAMs, EPROMs, EEPROMs, magnet or optical cards, propagation media or other type of media/machine-readable medium suitable for storing electronic instructions. For example, elements of the present invention may be downloaded as a computer program product, wherein the program may be transferred from a remote computer or telephonic device to a requesting process by way of data signals embodied in a carrier wave or other propagation medium via a communication link (e.g., a modem or network connection). [0040] Additionally, although the present invention has been described in conjunction with specific embodiments, numerous modifications and alterations are well within the scope of the present invention. Accordingly, the specification and drawings are to be regarded in an illustrative rather than a restrictive sense.

Claims

CLAIMSWe claim:
1. A method comprising: capturing a facial image of a participant to a video conference session from a video stream of the video conference session;
" accessing a database containing a plurality a stored facial images, each stored facial image being associated with information associated with a person; matching the facial image with one of the stored facial images; and retrieving the information associated with the one of the stored facial images.
2. The method of claim 1 further comprising creating, as a user interface selectable object, the facial image of the participant to the video conference session.
3. The method of claim 1 wherein the capturing is performed in response to input received from a user interface.
4. The method of claim 1 further comprising updating the database to include the captured facial image.
5. The method of claim 3 further comprising establishing communication between an endpoint device running the user interface and the person associated with the one of the stored facial images.
6. The method of claim 1 wherein the capturing is performed in response to the participant joining the video conference session.
7. A method comprising: receiving input Identifying a facial image of a participant from a video feed to a video conference session; creating a selectable object of the facial image; in response to input selection of the selectable object during the video feed, initiating a database query in an attempt to match the facial image with one of a plurality of stored facial images, each stored facial image being associated with information associated with a person; and in the event of a match between a particular stored facial image and the facial image, retrieving the information associated with the particular stored facial image.
8. The method of claim 7 wherein the person associated with the particular stored facial image is one of a group of persons attending the video conference session from a single endpoint device.
9. The method of claim 7 further comprising capturing the facial image of the participant to the video conference session.
10. The method of claim 7 wherein the input selection is generated by a user interface.
11. The method of claim 10 further comprising establishing communications between an endpoint device running the user interface and the person associated with the one of the stored facial images.
12. The method of claim 11 wherein the communications comprises an instant message sent from the endpoint device running the user interface to an endpoint device of the person associated with the one of the stored facial images.
13. The method of claim 7 wherein the information includes a temporary mapping to a communications device of the person.
14. An apparatus comprising: a display; and a program that runs on a personal computer (PC) to produce a graphical user interface (GUI) on the display, the GUI providing a user of the PC with an ability to select a facial image of a participant from a video feed of a video conference session, selection of the facial image by the user during the video feed causing the PC to send output to a server hosting the video conference session, in response to receiving the output, the server accessing a database in an attempt to match the facial image with one of a plurality of stored facial images, each stored facial image being associated with information about a person, in the event of a match between a particular stored facial image and the facial image the server sending the information of the person associated with the particular stored facial image to the PC, the program being operable to display the information on the display.
15. The apparatus of claim 14 wherein the user interface further provides the user with the ability to request the server to establish communications between the user and the person associated with the particular stored facial image in response to a click of a button on an input cursor device.
16. The apparatus of claim 15 wherein the click occurs when a cursor positionable by the input cursor device is positioned over the facial image.
17. The apparatus of claim 15 wherein the communications comprises an instant message session.
18. The apparatus of claim 15 wherein the communications comprises an email message.
19. The apparatus of claim 15 wherein the communications comprises a telephone call.
20. Logic encoded in one or more media for execution and when executed is operable to: during a video conference session, capture a facial image of a participant to the video conference session; access a database containing a plurality a stored facial images, each stored facial image being associated with information about a person; match the facial image with one of the stored facial images; retrieve the information about the person associated with the one of the stored facial images; and send the information an endpoint device connected to the video conference session in response to input received from the endpoint device.
21. The logic of claim 20 wherein the input comprises a user interface selection of the facial image of the participant.
22. The logic of claim 20 wherein the media, when executed, is further operable to establish communications between a user of the endpoint device and the person associated with the one of the stored facial images.
23. The logic of claim 22 wherein the communications are established in response to a click of a button on an input cursor device associated with the user interface.
24. The logic of claim 20 wherein the information includes identity information.
25. The method of claim 22 wherein the communications comprises an instant message sent from the endpoint device of the user and the person associated with the one of the stored facial images.
26. An apparatus comprising: a display; and means for producing a graphical user interface (GUI) on the display, the GUI providing a user of the PC with an ability to select a facial image of a participant from a video feed of a video conference session, selection of the facial image by the user during the video feed causing the PC to send output. to a server hosting the video conference session, in response to receiving the output, the server accessing a database in an attempt to match the facial image with one of a plurality of stored facial images, each stored facial image being associated with contact information about a person, in the event of a match between a particular stored facial image and the facial image the server sending an anonymous link to the PC, the means for displaying the anonymous link on the display, the means also for initiating communications with the person using the contact information responsive to selection input of the anonymous link.
27. A system comprising: one or more processors; and a memory comprising one or more instructions executable at the processors, the processors operable when executing the instructions to: list a plurality of communication modes available for interacting with a conference participant, the list being generated in response to a first input from a user interface selecting a facial image of the conference participant from a video feed of a video conference session; and establish communications between the user and the conference participant responsive to a second input from the user interface selecting one of the plurality of communication modes.
PCT/US2007/017569 2006-10-11 2007-08-07 Interaction based on facial recognition of conference participants WO2008045153A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN2007800300803A CN101502088B (en) 2006-10-11 2007-08-07 Interaction based on facial recognition of conference participants
EP07811156A EP2055088B1 (en) 2006-10-11 2007-08-07 Interaction based on facial recognition of conference participants

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/546,264 US7847815B2 (en) 2006-10-11 2006-10-11 Interaction based on facial recognition of conference participants
US11/546,264 2006-10-11

Publications (2)

Publication Number Publication Date
WO2008045153A2 true WO2008045153A2 (en) 2008-04-17
WO2008045153A3 WO2008045153A3 (en) 2008-12-31

Family

ID=39283844

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2007/017569 WO2008045153A2 (en) 2006-10-11 2007-08-07 Interaction based on facial recognition of conference participants

Country Status (4)

Country Link
US (1) US7847815B2 (en)
EP (1) EP2055088B1 (en)
CN (1) CN101502088B (en)
WO (1) WO2008045153A2 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010071442A1 (en) * 2008-12-15 2010-06-24 Tandberg Telecom As Method for speeding up face detection
WO2011062339A1 (en) 2009-11-17 2011-05-26 Lg Electronics Inc. Method for user authentication, and video communication apparatus and display apparatus thereof
WO2012057665A1 (en) * 2010-10-28 2012-05-03 Telefonaktiebolaget L M Ericsson (Publ) A face data acquirer, end user video conference device, server, method, computer program and computer program product for extracting face data
CN103839151A (en) * 2012-11-20 2014-06-04 联想(北京)有限公司 Control equipment and control method

Families Citing this family (118)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9583141B2 (en) 2005-07-01 2017-02-28 Invention Science Fund I, Llc Implementing audio substitution options in media works
US9092928B2 (en) 2005-07-01 2015-07-28 The Invention Science Fund I, Llc Implementing group content substitution in media works
US7860342B2 (en) 2005-07-01 2010-12-28 The Invention Science Fund I, Llc Modifying restricted images
US8732087B2 (en) 2005-07-01 2014-05-20 The Invention Science Fund I, Llc Authorization for media content alteration
US9065979B2 (en) 2005-07-01 2015-06-23 The Invention Science Fund I, Llc Promotional placement in media works
US9426387B2 (en) 2005-07-01 2016-08-23 Invention Science Fund I, Llc Image anonymization
US8910033B2 (en) * 2005-07-01 2014-12-09 The Invention Science Fund I, Llc Implementing group content substitution in media works
US9230601B2 (en) 2005-07-01 2016-01-05 Invention Science Fund I, Llc Media markup system for content alteration in derivative works
US8326927B2 (en) * 2006-05-23 2012-12-04 Cisco Technology, Inc. Method and apparatus for inviting non-rich media endpoints to join a conference sidebar session
US7801971B1 (en) 2006-09-26 2010-09-21 Qurio Holdings, Inc. Systems and methods for discovering, creating, using, and managing social network circuits
US8463893B2 (en) * 2006-11-30 2013-06-11 Red Hat, Inc. Automatic playlist generation in correlation with local events
US9195996B1 (en) 2006-12-27 2015-11-24 Qurio Holdings, Inc. System and method for classification of communication sessions in a social network
US8234277B2 (en) * 2006-12-29 2012-07-31 Intel Corporation Image-based retrieval for high quality visual or acoustic rendering
KR101425428B1 (en) * 2007-01-30 2014-07-31 삼성전자주식회사 Display apparatus and control method thereof
NO20071451L (en) * 2007-03-19 2008-09-22 Tandberg Telecom As System and method for controlling conference equipment
US9215512B2 (en) 2007-04-27 2015-12-15 Invention Science Fund I, Llc Implementation of media content alteration
US20080266381A1 (en) * 2007-04-30 2008-10-30 Matthew David Smith Selectively privatizing data transmissions in a video conference
US20090027484A1 (en) * 2007-07-26 2009-01-29 Avaya Technology Llc Call Resource Management Based on Calling-Party Disengagement from a Call
US9060094B2 (en) 2007-09-30 2015-06-16 Optical Fusion, Inc. Individual adjustment of audio and video properties in network conferencing
US8954178B2 (en) 2007-09-30 2015-02-10 Optical Fusion, Inc. Synchronization and mixing of audio and video streams in network-based video conferencing call systems
US20090171690A1 (en) * 2007-12-28 2009-07-02 Humanbook, Inc. System and method for a web-based people directory
US20090180664A1 (en) * 2008-01-16 2009-07-16 Christine Efstathiades Master Key Identification System
CN103402070B (en) 2008-05-19 2017-07-07 日立麦克赛尔株式会社 Record reproducing device and method
US8688841B2 (en) * 2008-06-05 2014-04-01 Modena Enterprises, Llc System and method for content rights based on existence of a voice session
US8892553B2 (en) * 2008-06-18 2014-11-18 Microsoft Corporation Auto-generation of events with annotation and indexing
US8539031B2 (en) * 2008-06-27 2013-09-17 Microsoft Corporation Displaying images for people associated with a message item
US20100015975A1 (en) * 2008-07-17 2010-01-21 Kota Enterprises, Llc Profile service for sharing rights-enabled mobile profiles
US20100015976A1 (en) * 2008-07-17 2010-01-21 Domingo Enterprises, Llc System and method for sharing rights-enabled mobile profiles
JP5414282B2 (en) * 2009-01-07 2014-02-12 キヤノン株式会社 File management system, file management method and program
US8311292B2 (en) * 2009-02-09 2012-11-13 Cisco Technology, Inc. Context aware, multiple target image recognition
US20100217806A1 (en) * 2009-02-20 2010-08-26 Gautam Khot Email Based Remote Management of Network Connected Entities
JP2011040036A (en) 2009-03-30 2011-02-24 Avaya Inc System and method for managing communication session using graphical user interface
DE202010004430U1 (en) * 2009-05-13 2010-08-12 Hasso-Plattner-Institut für Softwaresystemtechnik GmbH Device for processing information
CN101930284B (en) * 2009-06-23 2014-04-09 腾讯科技(深圳)有限公司 Method, device and system for implementing interaction between video and virtual network scene
KR101756835B1 (en) * 2009-08-21 2017-07-13 삼성전자주식회사 Method for managing contact information, user device and storage medium thereof
US8131848B1 (en) * 2009-09-29 2012-03-06 Jason Adam Denise Image analysis and communication device control technology
US20110096135A1 (en) * 2009-10-23 2011-04-28 Microsoft Corporation Automatic labeling of a video session
DE102009060687A1 (en) * 2009-11-04 2011-05-05 Siemens Aktiengesellschaft Method and device for computer-aided annotation of multimedia data
JP5675089B2 (en) * 2009-12-17 2015-02-25 キヤノン株式会社 Video information processing apparatus and method
TWI465943B (en) * 2009-12-31 2014-12-21 Fih Hong Kong Ltd Data processing device for sharing photos and creating a communication link through the photos
EP2526507A1 (en) * 2010-01-20 2012-11-28 Telefonaktiebolaget L M Ericsson (PUBL) Meeting room participant recogniser
US20120150956A1 (en) * 2010-12-10 2012-06-14 Polycom, Inc. Extended Video Conferencing Features Through Electronic Calendaring
WO2012000826A1 (en) * 2010-06-30 2012-01-05 Alcatel Lucent Method and device for teleconferencing
US8817966B2 (en) * 2010-07-08 2014-08-26 Lisa Marie Bennett Wrench Method of collecting and employing information about parties to a televideo conference
JP5545084B2 (en) * 2010-07-08 2014-07-09 ソニー株式会社 Information processing apparatus, information processing method, and program
US20120117110A1 (en) 2010-09-29 2012-05-10 Eloy Technology, Llc Dynamic location-based media collection aggregation
JP5740972B2 (en) * 2010-09-30 2015-07-01 ソニー株式会社 Information processing apparatus and information processing method
US8558864B1 (en) 2010-10-12 2013-10-15 Sprint Communications Company L.P. Identifying video conference participants
CN101951494B (en) * 2010-10-14 2012-07-25 上海紫南信息技术有限公司 Method for fusing display images of traditional phone and video session
US8494231B2 (en) * 2010-11-01 2013-07-23 Microsoft Corporation Face recognition in video content
US8462191B2 (en) 2010-12-06 2013-06-11 Cisco Technology, Inc. Automatic suppression of images of a video feed in a video call or videoconferencing system
CN102647577A (en) * 2011-02-16 2012-08-22 鸿富锦精密工业(深圳)有限公司 Teleconference management system and management method
US8599236B2 (en) 2011-05-11 2013-12-03 Cisco Technology, Inc. Utilizing a video image from a video communication session as contact information
KR101786944B1 (en) * 2011-05-12 2017-10-18 삼성전자 주식회사 Speaker displaying method and videophone terminal therefor
US8767586B2 (en) 2011-06-20 2014-07-01 At&T Intellectual Property I, L.P. Methods, systems, and products for network topology
CN102891978B (en) * 2011-07-21 2016-03-30 联想(北京)有限公司 Image processing method and terminal
US20130083151A1 (en) * 2011-09-30 2013-04-04 Lg Electronics Inc. Electronic device and method for controlling electronic device
TWI484827B (en) * 2011-11-02 2015-05-11 Quanta Comp Inc Video conference system, video conference terminal apparatus and image capturing method for video conferences
US20130162752A1 (en) * 2011-12-22 2013-06-27 Advanced Micro Devices, Inc. Audio and Video Teleconferencing Using Voiceprints and Face Prints
TWI488503B (en) * 2012-01-03 2015-06-11 國際洋行股份有限公司 Conference photography device and the method thereof
CN102572356B (en) 2012-01-16 2014-09-03 华为技术有限公司 Conference recording method and conference system
US8826150B1 (en) 2012-01-25 2014-09-02 Google Inc. System and method for tagging images in a social network
US20130215214A1 (en) * 2012-02-22 2013-08-22 Avaya Inc. System and method for managing avatarsaddressing a remote participant in a video conference
CN103312747A (en) * 2012-03-12 2013-09-18 联想(北京)有限公司 Information processing method, communication terminal equipment and server equipment
US10430036B2 (en) * 2012-03-14 2019-10-01 Tivo Solutions Inc. Remotely configuring windows displayed on a display device
US9066129B2 (en) * 2012-04-24 2015-06-23 Comcast Cable Communications, Llc Video presentation device and method
US9031543B2 (en) 2012-06-06 2015-05-12 Qualcomm Incorporated Visualization of network members based on location and direction
WO2013191680A1 (en) * 2012-06-19 2013-12-27 Empire Technology Development, Llc Group nature verification for providing group incentives
JP5349656B1 (en) * 2012-06-29 2013-11-20 株式会社東芝 Video processing apparatus and control method thereof
US9350944B2 (en) 2012-08-24 2016-05-24 Qualcomm Incorporated Connecting to an onscreen entity
US9966075B2 (en) * 2012-09-18 2018-05-08 Qualcomm Incorporated Leveraging head mounted displays to enable person-to-person interactions
US9076028B2 (en) * 2012-10-08 2015-07-07 Citrix Systems, Inc. Facial recognition and transmission of facial images in a videoconference
US9087402B2 (en) * 2013-03-13 2015-07-21 Microsoft Technology Licensing, Llc Augmenting images with higher resolution data
US9210269B2 (en) * 2012-10-31 2015-12-08 Cisco Technology, Inc. Active speaker indicator for conference participants
CN103870794A (en) * 2012-12-11 2014-06-18 谢骅 Method for identifying human face and storing facial features
US10140354B2 (en) * 2013-03-01 2018-11-27 Nec Corporation Information processing device, data processing method therefor, and recording medium
CN103136357A (en) * 2013-03-06 2013-06-05 广东欧珀移动通信有限公司 Method, device and mobile equipment for positioning webpages by face recognition technology
EP2824913A1 (en) * 2013-07-09 2015-01-14 Alcatel Lucent A method for generating an immersive video of a plurality of persons
CN104349113A (en) * 2013-08-01 2015-02-11 波利康公司 Method for providing auxiliary information in video conference
US20150088515A1 (en) * 2013-09-25 2015-03-26 Lenovo (Singapore) Pte. Ltd. Primary speaker identification from audio and video data
WO2015054342A1 (en) 2013-10-09 2015-04-16 Mindset Systems Method of and system for automatic compilation of crowdsourced digital media productions
US9210377B2 (en) * 2013-10-30 2015-12-08 At&T Intellectual Property I, L.P. Methods, systems, and products for telepresence visualizations
US10075656B2 (en) 2013-10-30 2018-09-11 At&T Intellectual Property I, L.P. Methods, systems, and products for telepresence visualizations
US10459985B2 (en) * 2013-12-04 2019-10-29 Dell Products, L.P. Managing behavior in a virtual collaboration session
US9972324B2 (en) * 2014-01-10 2018-05-15 Verizon Patent And Licensing Inc. Personal assistant application
US9386270B2 (en) * 2014-01-15 2016-07-05 Cisco Technology, Inc. Displaying information about at least one participant in a video conference session
GB201406789D0 (en) * 2014-04-15 2014-05-28 Microsoft Corp Displaying video call data
CN104133917B (en) * 2014-08-15 2018-08-10 百度在线网络技术(北京)有限公司 The classification storage method and device of photo
CN105608754A (en) * 2014-11-12 2016-05-25 中兴通讯股份有限公司 Video conference signing method, video conference signing apparatus and video conference signing system
US20160261648A1 (en) * 2015-03-04 2016-09-08 Unify Gmbh & Co. Kg Communication system and method of using the same
US9883003B2 (en) 2015-03-09 2018-01-30 Microsoft Technology Licensing, Llc Meeting room device cache clearing
US20160269254A1 (en) * 2015-03-09 2016-09-15 Michael K. Forney Meeting Summary
CN106034215A (en) * 2015-03-16 2016-10-19 冠捷投资有限公司 Video conference method and system
US9672829B2 (en) * 2015-03-23 2017-06-06 International Business Machines Corporation Extracting and displaying key points of a video conference
CN106162042A (en) * 2015-04-13 2016-11-23 中兴通讯股份有限公司 A kind of method of video conference, server and terminal
CN106209725B (en) * 2015-04-30 2019-11-15 中国电信股份有限公司 Method, video conference central server and system for video conference certification
US10691314B1 (en) * 2015-05-05 2020-06-23 State Farm Mutual Automobile Insurance Company Connecting users to entities based on recognized objects
US10013153B1 (en) 2015-05-05 2018-07-03 State Farm Mutual Automobile Insurance Company Initiating communications based on interactions with images
US10305863B2 (en) * 2015-08-05 2019-05-28 Cronvo Llc Systems and methods for managing telecommunications
CN105488478B (en) * 2015-12-02 2020-04-07 深圳市商汤科技有限公司 Face recognition system and method
US9686510B1 (en) * 2016-03-15 2017-06-20 Microsoft Technology Licensing, Llc Selectable interaction elements in a 360-degree video stream
CN105915798A (en) * 2016-06-02 2016-08-31 北京小米移动软件有限公司 Camera control method in video conference and control device thereof
US10152636B2 (en) 2017-01-12 2018-12-11 International Business Machines Corporation Setting a personal status using augmented reality
US20180232566A1 (en) * 2017-02-15 2018-08-16 Cisco Technology, Inc. Enabling face recognition in a cognitive collaboration environment
US11417343B2 (en) * 2017-05-24 2022-08-16 Zoominfo Converse Llc Automatic speaker identification in calls using multiple speaker-identification parameters
US10923121B2 (en) * 2017-08-11 2021-02-16 SlackTechnologies, Inc. Method, apparatus, and computer program product for searchable real-time transcribed audio and visual content within a group-based communication system
US10928895B2 (en) * 2017-09-21 2021-02-23 Tobii Ab Systems and methods for interacting with a computing device using gaze information
US11100330B1 (en) * 2017-10-23 2021-08-24 Facebook, Inc. Presenting messages to a user when a client device determines the user is within a field of view of an image capture device of the client device
CN110519546B (en) * 2018-05-22 2021-05-28 视联动力信息技术股份有限公司 Method and device for pushing business card information based on video conference
CN110688510B (en) * 2018-06-20 2022-06-14 浙江宇视科技有限公司 Face background image acquisition method and system
CN109560941A (en) * 2018-12-12 2019-04-02 深圳市沃特沃德股份有限公司 Minutes method, apparatus, intelligent terminal and storage medium
US11386562B2 (en) 2018-12-28 2022-07-12 Cyberlink Corp. Systems and methods for foreground and background processing of content in a live video
US11910067B2 (en) * 2019-02-04 2024-02-20 Nec Networks & System Integration Corporation Content providing system, content providing method, and content providing program
US11068856B2 (en) * 2019-04-30 2021-07-20 International Business Machines Corporation Biometric data based scheduling
US10764535B1 (en) * 2019-10-14 2020-09-01 Facebook, Inc. Facial tracking during video calls using remote control input
US20230048703A1 (en) * 2020-02-27 2023-02-16 Nec Corporation Server device, conference assistance system, and conference assistance method
US11463499B1 (en) * 2020-12-18 2022-10-04 Vr Edu Llc Storage and retrieval of virtual reality sessions state based upon participants
WO2023018905A1 (en) * 2021-08-12 2023-02-16 Avail Medsystems, Inc. Systems and methods for enhancing audio communications

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030070072A1 (en) 2001-10-09 2003-04-10 Nick Nassiri System and method of identity and signature and document authentication using a video conference
US20050062844A1 (en) 2003-09-19 2005-03-24 Bran Ferren Systems and method for enhancing teleconferencing collaboration
EP1659518A2 (en) 2004-10-30 2006-05-24 Microsoft Corporation Automatic face extraction

Family Cites Families (69)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5689641A (en) * 1993-10-01 1997-11-18 Vicor, Inc. Multimedia collaboration system arrangement for routing compressed AV signal through a participant site without decompressing the AV signal
US5729687A (en) * 1993-12-20 1998-03-17 Intel Corporation System for sending differences between joining meeting information and public meeting information between participants in computer conference upon comparing annotations of joining and public meeting information
US5483587A (en) 1994-06-08 1996-01-09 Linkusa Corporation System and method for call conferencing
US5594732A (en) * 1995-03-03 1997-01-14 Intecom, Incorporated Bridging and signalling subsystems and methods for private and hybrid communications systems including multimedia systems
US5600366A (en) * 1995-03-22 1997-02-04 Npb Partners, Ltd. Methods and apparatus for digital advertisement insertion in video programming
US5673253A (en) * 1996-02-29 1997-09-30 Siemens Business Communication Systems Dynamic allocation of telecommunications resources
US6137834A (en) * 1996-05-29 2000-10-24 Sarnoff Corporation Method and apparatus for splicing compressed information streams
US6332153B1 (en) * 1996-07-31 2001-12-18 Vocaltec Communications Ltd. Apparatus and method for multi-station conferencing
US5917830A (en) * 1996-10-18 1999-06-29 General Instrument Corporation Splicing compressed packetized digital video streams
US5963217A (en) * 1996-11-18 1999-10-05 7Thstreet.Com, Inc. Network conference system using limited bandwidth to generate locally animated displays
KR100232164B1 (en) * 1997-02-05 1999-12-01 구자홍 Trnsport stream demultiplexer
US6600733B2 (en) * 1997-02-06 2003-07-29 Verizon Laboratories Inc. System for interconnecting packet-switched and circuit-switched voice communications
US6480667B1 (en) * 1997-12-23 2002-11-12 Intel Corporation Method of time shifting to simultaneously record and play a data stream
US6351474B1 (en) * 1998-01-14 2002-02-26 Skystream Networks Inc. Network distributed remultiplexer for video program bearing transport streams
US6643496B1 (en) * 1998-03-31 2003-11-04 Canon Kabushiki Kaisha System, method, and apparatus for adjusting packet transmission rates based on dynamic evaluation of network characteristics
US6236854B1 (en) * 1998-08-17 2001-05-22 Nortel Networks Limited Method and apparatus for controlling a conference call
US6608820B1 (en) * 1998-08-17 2003-08-19 Nortel Networks Ltd. Method and apparatus for controlling a conference call
US6141324A (en) * 1998-09-01 2000-10-31 Utah State University System and method for low latency communication
US6775247B1 (en) * 1999-03-22 2004-08-10 Siemens Information And Communication Networks, Inc. Reducing multipoint conferencing bandwidth
US6925068B1 (en) * 1999-05-21 2005-08-02 Wi-Lan, Inc. Method and apparatus for allocating bandwidth in a wireless communication system
US6483587B1 (en) * 1999-06-30 2002-11-19 John Charles Jackson Gap/edge bead detection system
US6675216B1 (en) * 1999-07-06 2004-01-06 Cisco Technolgy, Inc. Copy server for collaboration and electronic commerce
US6771644B1 (en) 1999-09-17 2004-08-03 Lucent Technologies Inc. Program insertion in real time IP multicast
US6650652B1 (en) * 1999-10-12 2003-11-18 Cisco Technology, Inc. Optimizing queuing of voice packet flows in a network
US6771657B1 (en) * 1999-12-09 2004-08-03 General Instrument Corporation Non real-time delivery of MPEG-2 programs via an MPEG-2 transport stream
US6816469B1 (en) * 1999-12-30 2004-11-09 At&T Corp. IP conference call waiting
US6671262B1 (en) * 1999-12-30 2003-12-30 At&T Corp. Conference server for automatic x-way call port expansion feature
US6505169B1 (en) * 2000-01-26 2003-01-07 At&T Corp. Method for adaptive ad insertion in streaming multimedia content
US6876734B1 (en) * 2000-02-29 2005-04-05 Emeeting.Net, Inc. Internet-enabled conferencing system and method accommodating PSTN and IP traffic
JP2001320440A (en) * 2000-05-02 2001-11-16 Sony Corp Communication apparatus and method
US6501739B1 (en) * 2000-05-25 2002-12-31 Remoteability, Inc. Participant-controlled conference calling system
US6865540B1 (en) * 2000-08-09 2005-03-08 Ingenio, Inc. Method and apparatus for providing group calls via the internet
US7007098B1 (en) * 2000-08-17 2006-02-28 Nortel Networks Limited Methods of controlling video signals in a video conference
US7844489B2 (en) * 2000-10-30 2010-11-30 Buyerleverage Buyer-driven targeting of purchasing entities
US20020087976A1 (en) * 2000-12-28 2002-07-04 Kaplan Marc P. System and method for distributing video with targeted advertising using switched communication networks
US6956828B2 (en) * 2000-12-29 2005-10-18 Nortel Networks Limited Apparatus and method for packet-based media communications
US6976055B1 (en) * 2001-01-18 2005-12-13 Cisco Technology, Inc. Apparatus and method for conducting a transfer of a conference call
US7003086B1 (en) * 2001-01-18 2006-02-21 Cisco Technology, Inc. Apparatus and method for allocating call resources during a conference call
WO2002091202A1 (en) * 2001-05-04 2002-11-14 Globespan Virata Incorporated System and method for distributed processing of packet data containing audio information
US6937569B1 (en) * 2001-05-21 2005-08-30 Cisco Technology, Inc. Method and system for determining a relative position of a device on a network
AU2002314941A1 (en) * 2001-06-06 2002-12-16 Complete Tv, Llc Centralized aggregation of broadcast television programming and multi-market digital delivery thereof over interconnected terrestrial fiber optic networks
US6947417B2 (en) * 2001-06-29 2005-09-20 Ip Unity Method and system for providing media services
US20030025786A1 (en) * 2001-07-31 2003-02-06 Vtel Corporation Method and system for saving and applying a video address from a video conference
US8218829B2 (en) * 2001-08-20 2012-07-10 Polycom, Inc. System and method for using biometrics technology in conferencing
US7127487B1 (en) 2001-10-15 2006-10-24 3Com Corporation System and method for sidebar functionality in a regular conference system
US7355971B2 (en) * 2001-10-22 2008-04-08 Intel Corporation Determining packet size in networking
US7379653B2 (en) * 2002-02-20 2008-05-27 The Directv Group, Inc. Audio-video synchronization for digital systems
US7292543B2 (en) * 2002-04-17 2007-11-06 Texas Instruments Incorporated Speaker tracking on a multi-core in a packet based conferencing system
US8411594B2 (en) 2002-09-20 2013-04-02 Qualcomm Incorporated Communication manager for providing multimedia in a group communication network
US6931113B2 (en) * 2002-11-08 2005-08-16 Verizon Services Corp. Facilitation of a conference call
EP1432196A1 (en) * 2002-12-20 2004-06-23 Matsushita Electric Industrial Co., Ltd. Control traffic compression method in media data transmission
JP3769752B2 (en) * 2002-12-24 2006-04-26 ソニー株式会社 Information processing apparatus and information processing method, data communication system, and program
US7010108B2 (en) * 2003-02-21 2006-03-07 Magicsoft Corporation Method for scheduling videoconferences
CN1531282A (en) * 2003-03-12 2004-09-22 ���µ�����ҵ��ʽ���� Packet trunk device
US6959075B2 (en) * 2003-03-24 2005-10-25 Cisco Technology, Inc. Replay of conference audio
EP1623588B1 (en) * 2003-05-13 2011-06-29 TELEFONAKTIEBOLAGET LM ERICSSON (publ) Method of reducing connection set-up delay
US7257769B2 (en) * 2003-06-05 2007-08-14 Siemens Communications, Inc. System and method for indicating an annotation for a document
US20060189337A1 (en) * 2003-07-18 2006-08-24 Farrill Craig F Premium voice services for wireless communications systems
US7460652B2 (en) * 2003-09-26 2008-12-02 At&T Intellectual Property I, L.P. VoiceXML and rule engine based switchboard for interactive voice response (IVR) services
US8659636B2 (en) * 2003-10-08 2014-02-25 Cisco Technology, Inc. System and method for performing distributed video conferencing
US7562375B2 (en) * 2003-10-10 2009-07-14 Microsoft Corporation Fast channel change
SE0302920D0 (en) * 2003-11-03 2003-11-03 Ericsson Telefon Ab L M Improvements in or relating to group calls
US7084898B1 (en) * 2003-11-18 2006-08-01 Cisco Technology, Inc. System and method for providing video conferencing synchronization
JP2005184640A (en) * 2003-12-22 2005-07-07 Fujitsu Ltd Information distribution device and information distribution method
US20050259803A1 (en) 2004-05-19 2005-11-24 Nokia Corporation Managing a conference session
US20060020995A1 (en) * 2004-07-20 2006-01-26 Comcast Cable Communications, Llc Fast channel change in digital media systems
US20060090166A1 (en) * 2004-09-30 2006-04-27 Krishna Dhara System and method for generating applications for communication devices using a markup language
US20070110029A1 (en) * 2005-11-12 2007-05-17 Motorola, Inc. Method for linking communication channels of disparate access technologies in a selective call unit
US8125509B2 (en) * 2006-01-24 2012-02-28 Lifesize Communications, Inc. Facial recognition for a videoconference

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030070072A1 (en) 2001-10-09 2003-04-10 Nick Nassiri System and method of identity and signature and document authentication using a video conference
US20050062844A1 (en) 2003-09-19 2005-03-24 Bran Ferren Systems and method for enhancing teleconferencing collaboration
EP1659518A2 (en) 2004-10-30 2006-05-24 Microsoft Corporation Automatic face extraction

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP2055088A4

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010071442A1 (en) * 2008-12-15 2010-06-24 Tandberg Telecom As Method for speeding up face detection
US8390669B2 (en) 2008-12-15 2013-03-05 Cisco Technology, Inc. Device and method for automatic participant identification in a recorded multimedia stream
WO2011062339A1 (en) 2009-11-17 2011-05-26 Lg Electronics Inc. Method for user authentication, and video communication apparatus and display apparatus thereof
EP2502435A1 (en) * 2009-11-17 2012-09-26 LG Electronics Inc. Method for user authentication, and video communication apparatus and display apparatus thereof
EP2502435A4 (en) * 2009-11-17 2014-05-07 Lg Electronics Inc Method for user authentication, and video communication apparatus and display apparatus thereof
WO2012057665A1 (en) * 2010-10-28 2012-05-03 Telefonaktiebolaget L M Ericsson (Publ) A face data acquirer, end user video conference device, server, method, computer program and computer program product for extracting face data
US8965067B2 (en) 2010-10-28 2015-02-24 Telefonaktiebolaget L M Ericsson (Publ) Face data acquirer, end user video conference device, server, method, computer program and computer program product for extracting face data
CN103839151A (en) * 2012-11-20 2014-06-04 联想(北京)有限公司 Control equipment and control method

Also Published As

Publication number Publication date
EP2055088B1 (en) 2013-01-09
CN101502088B (en) 2013-07-24
US7847815B2 (en) 2010-12-07
EP2055088A4 (en) 2010-12-22
EP2055088A2 (en) 2009-05-06
US20080088698A1 (en) 2008-04-17
CN101502088A (en) 2009-08-05
WO2008045153A3 (en) 2008-12-31

Similar Documents

Publication Publication Date Title
US7847815B2 (en) Interaction based on facial recognition of conference participants
US10757050B2 (en) System and method for topic based segregation in instant messaging
US9641804B2 (en) Audio/video conferencing system
RU2488227C2 (en) Methods for automatic identification of participants for multimedia conference event
US10057542B2 (en) System for immersive telepresence
US8791977B2 (en) Method and system for presenting metadata during a videoconference
US8264520B2 (en) Communication system, connection controller, terminal, and control method
CN111935443B (en) Method and device for sharing instant messaging tool in real-time live broadcast of video conference
US8553067B2 (en) Capturing and controlling access to muted content from a conference session
US20100085415A1 (en) Displaying dynamic caller identity during point-to-point and multipoint audio/videoconference
US20070050448A1 (en) Method and system for information collaboration over an IP network via handheld wireless communication devices
US20070140532A1 (en) Method and apparatus for providing user profiling based on facial recognition
US20170288942A1 (en) Portal for Provisioning Autonomous Software Agents
US20090123035A1 (en) Automated Video Presence Detection
US20170288943A1 (en) Supplying Context Data to a Servicing Entity
TW200939775A (en) Techniques to generate a visual composition for a multimedia conference event
US10701116B2 (en) Method, computer-readable storage device and apparatus for establishing persistent messaging sessions
CN109816722A (en) Position method, apparatus, storage medium and the computer equipment of spokesman position
US9609273B2 (en) System and method for not displaying duplicate images in a video conference
US20240040036A1 (en) Real-time user screening of messages within a communication platform
CN113225521B (en) Video conference control method and device and electronic equipment
KR102445944B1 (en) Method for involving user in video conference using qr code and method for participating in video conference using qr code
US20240089410A1 (en) Method of allowing user to participate in video conference using qr code and method of participating, by user, in video conference using qr code
TR2021020522A2 (en) A SYSTEM THAT FOCUSES ON USER FACE IN VIDEO CONFERENCE
TR2021018363A2 (en) INFORMATION SHARING SYSTEM THROUGH VIDEO CONFERENCE

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200780030080.3

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07811156

Country of ref document: EP

Kind code of ref document: A2

WWE Wipo information: entry into national phase

Ref document number: 2007811156

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE