WO2002030117A1 - Systems and methods for video and audio capture and communication - Google Patents

Systems and methods for video and audio capture and communication Download PDF

Info

Publication number
WO2002030117A1
WO2002030117A1 PCT/US2001/015822 US0115822W WO0230117A1 WO 2002030117 A1 WO2002030117 A1 WO 2002030117A1 US 0115822 W US0115822 W US 0115822W WO 0230117 A1 WO0230117 A1 WO 0230117A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
network
remote control
set top
top box
Prior art date
Application number
PCT/US2001/015822
Other languages
French (fr)
Inventor
Paul Allen
Original Assignee
Digeo, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Digeo, Inc. filed Critical Digeo, Inc.
Priority to AU2001263186A priority Critical patent/AU2001263186A1/en
Publication of WO2002030117A1 publication Critical patent/WO2002030117A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/72409User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality by interfacing with external accessories
    • H04M1/72415User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality by interfacing with external accessories for remote control of appliances
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42203Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42204User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42204User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor
    • H04N21/42206User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor characterized by hardware details
    • H04N21/42222Additional components integrated in the remote control device, e.g. timer, speaker, sensors for detecting position, direction or movement of the remote control, microphone or battery charging device
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/4223Cameras
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/426Internal components of the client ; Characteristics thereof
    • H04N21/42684Client identification by a unique number or address, e.g. serial number, MAC address, socket ID
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/433Content storage operation, e.g. storage operation in response to a pause request, caching operations
    • H04N21/4334Recording operations
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/436Interfacing a local distribution network, e.g. communicating with another STB or one or more peripheral devices inside the home
    • H04N21/4363Adapting the video or multiplex stream to a specific local network, e.g. a IEEE 1394 or Bluetooth® network
    • H04N21/43637Adapting the video or multiplex stream to a specific local network, e.g. a IEEE 1394 or Bluetooth® network involving a wireless protocol, e.g. Bluetooth, RF or wireless LAN [IEEE 802.11]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4788Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/61Network physical structure; Signal processing
    • H04N21/6106Network physical structure; Signal processing specially adapted to the downstream path of the transmission network
    • H04N21/6125Network physical structure; Signal processing specially adapted to the downstream path of the transmission network involving transmission via Internet
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/64Addressing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/142Constructional details of the terminal equipment, e.g. arrangements of the camera and the display
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/16Analogue secrecy systems; Analogue subscription systems
    • H04N7/173Analogue secrecy systems; Analogue subscription systems with two-way working, e.g. subscriber sending a programme selection signal
    • H04N7/17309Transmission or handling of upstream communications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/142Constructional details of the terminal equipment, e.g. arrangements of the camera and the display
    • H04N2007/145Handheld terminals

Definitions

  • the present invention relates generally to interactive television systems, and more particularly, to systems and methods for video and audio capture and communication.
  • Video camera unit for mounting on a computer monitor. Both of these design patents are hereby incorporated by reference.
  • a commercially-available "webcam” product that is designed to be placed near a computer monitor is the Logitech Quickcam Pro USB ® from
  • a conventional system would utilize a modem connection from a personal computer to an Internet service provider (ISP). Using such a connection, the captured video information would be transmitted from the personal computer of one user over the Internet to a personal computer of another user. Although this may achieve a rudimentary form of video conferencing between two users, such Internet-based video conferencing is typically unreliable and of uneven bandwidth due to limitations of the Internet.
  • ISP Internet service provider
  • the present invention provides systems and methods for video and audio capture and communication that overcome the above-described problems and disadvantages.
  • a remote control for an interactive television system includes an integrated camera and a wireless transmitter for transmitting video information captured by the camera to the interactive television system.
  • a set top box for the interactive television system includes a wireless receiver for receiving the video information.
  • the wireless transmitter is a high-bandwidth, radio-frequency (RF) transmitter
  • the receiver is a high-bandwidth, RF receiver.
  • the set top box includes a digital storage device for recording video information received by the wireless receiver in the set top box.
  • the set top box may include a converter for transforming the video information captured by the camera into a video stream compatible for transmission over a network.
  • the remote control includes an integrated microphone
  • the wireless transmitter is further configured to transmit audio information captured by the microphone to the interactive television system.
  • the set top box may include a digital recording device for recording audio information received from the microphone, as well as a converter for transforming the audio information into a network-compatible audio stream for transmission to the network.
  • a video signal is captured using a camera integrated with a remote control. Thereafter, the video signal is transmitted using a wireless transmitter, which is received by a wireless receiver integrated with a set top box of the interactive television system.
  • the video signal is transformed into a video stream of a format compatible for transmission over a network, after which the video stream transmitted from the set top box to the network.
  • the video stream is then transmitted from the network to a second set top box, after which it is transformed into a display-compatible video signal.
  • the video signal is displayed on a television coupled to the second set top box.
  • FIG. 1 is a schematic block diagram of a television network according to an embodiment of the invention
  • FIG. 2 is a schematic block diagram of an interactive television system according to an embodiment of the invention
  • FIG. 3 is a schematic block diagram of a set top box according to an embodiment of the invention
  • FIG. 4 is a plan view of a remote control according to an embodiment of the invention
  • FIG. 5 is a schematic block diagram of an interactive television system according to an embodiment of the invention
  • FIG. 6 is a schematic block diagram of a set top box according to an embodiment of the invention
  • FIG. 7 is a plan view of a remote control according to an embodiment of the invention.
  • FIG. 8 is a flowchart of a method for video capture and communication according to an embodiment of the invention.
  • Embodiments of systems and methods for video and audio capture and communication are described herein.
  • numerous specific details are provided, such as examples of programming, user selections, transactions, etc., to provide a thorough understanding of embodiments of the invention.
  • One skilled in the relevant art will recognize, however, that the invention can be practiced without one or more of the specific details, or with other methods, components, materials, etc.
  • well-known structures, materials, or operations are not shown or described in detail to avoid obscuring aspects of the invention.
  • FIG. 1 there is shown a television network 100, such as a cable network, according to an embodiment of the invention.
  • the network 100 includes a plurality of set top boxes 102 (hereinafter STB 102) or other client terminals located, for instance, at customer homes.
  • STB 102 set top boxes 102
  • an STB 102 is consumer electronics device that serves as a gateway between a customer's television and a broadband communication network, such as a cable network.
  • a broadband communication network such as a cable network.
  • an STB 102 is typically located on top of, or in close proximity to, a customer's television.
  • an STB 102 receives encoded video/audio signals (including television signals) from the network 100 and decodes the same for display on the television. Additionally, an STB 102 receives commands from a user (typically via a remote control) and transmits such commands back to the network 100.
  • each STB 102 is connected to a headend 104.
  • a headend 104 is a centrally-located facility where cable TV (CATV) channels are received from a local CATV satellite downlink and packaged together for transmission to customer homes.
  • CATV cable TV
  • the headend 104 also functions as a Central Office (CO) in the telephone industry, routing video and audio streams and other data to and from the various STBs 102 serviced thereby.
  • CO Central Office
  • Headends 104 may be coupled directly to one another or through a network center 106. In some cases, headends 104 may be connected via a separate network, one particular example of which is Internet 108. Of course, the illustrated network topology is provided for example purposes only, and other network topologies may be used within the scope of the invention.
  • an STB 102 may transmit video and audio streams to one or more other STBs 102 connected to the network 100.
  • the communication path for the transmission may involve one or more headends 104, network centers 106, and/or the Internet 108.
  • a first STB 102 may send a video transmission upstream to a first headend 104, then to a second headend 104, and finally downstream to a second STB 102.
  • the transmission may use various standard protocols, such as MPEG or video over IP (Internet Protocol).
  • the first and second headends 104 may be one and the same if the
  • STBs 102 are served by the same headend 104.
  • the transmission between headends 104 may occur (i) via a direct peer-to-peer connection between headends 104, (ii) upstream from the first headend 104 to a network center 106 and then downstream to the second headend 104, or (iii) via the Internet 108.
  • each STB 102 may be identified by a unique number, code or address, such as an IP (Internet Protocol) address.
  • IP Internet Protocol
  • a user of one STB 102 may indicate an STB 102 to receive an audio or video transmission by specifying the corresponding address.
  • the network 100 then routes the transmission to its destination using conventional techniques.
  • the television system 200 preferably includes a television 202, which is configured to receive and display standard analog or digital television signals or high-definition television (HDTV) signals.
  • the television system 200 also includes a STB 102, as discussed above, for sending and receiving audio/video information (including television signals) or other data to and from the network 100.
  • the functionality of the STB 102 is integrated into an advanced version of the television 202.
  • a remote control 204 is provided for convenient remote operation of the STB 102 and the television 202.
  • the remote control 204 may communicate with the STB 102 and television 202 using conventional techniques to adjust, for example, the volume of the television, the displayed channel, and the like.
  • the remote control 204 includes a camera 208, such as a color (or monochromatic) digital video camera.
  • the camera includes a progressive scan CCD (charged coupled device) array to deliver digital video up to 320 x 240 pixels in 24-bit color. Other resolutions and levels of color are also contemplated to be within the scope of the present invention. The resolution and levels of color of the camera may also be adjustable or selectable by the viewer.
  • a zoom function may be provided for the camera 208.
  • the zoom function may be lens based or preferably digitally based.
  • the camera may be provided with automatic white balance and automatic exposure features to adjust for lighting and scene content. Of course, such automatic features may be turned off by the user.
  • the frame rate of the video capture may be 30 frames per second (NTSC, VHS, MPEG), 25 frames per second (PAL), 24 frames per second (motion picture), or other rates.
  • NTSC frames per second
  • VHS video conferencing
  • MPEG frames per second
  • PAL frames per second
  • motion picture motion picture
  • a frame rate of 8 frames per second provides a somewhat jerky video.
  • video conferencing applications using the present invention is performed at a frame rate of at least 10 frames per second for smoother motion.
  • the MPEG-4 protocol may be used for video conferencing applications using the present invention.
  • the camera may be used to capture not only video, but also still- pictures.
  • Such still-pictures may be stored in JPEG, BMP, TIFF, or other formats in a digital storage device in the STB 102.
  • the remote control 204 itself may include a digital storage device to store such still-pictures.
  • the resolution of the camera when used to capture still-pictures may be greater than the resolution when used to capture video.
  • the camera is preferably disposed on a surface of the remote control 204 to provide a generally unobstructed view for the camera 208.
  • the camera may be disposed on the same surface as the majority of the buttons, or it may be disposed on a surface perpendicular to that surface.
  • the camera 208 is capable of capturing a series of images in real time and converting the same into analog or digital video signals.
  • the camera 208 is in electrical communication with a specifically- designated button, such as a camera ("cam") button 206, which toggles operation of the camera 208 in one implementation.
  • the remote control 204 may further include additional buttons to control various features of the STB 102 and the television 202.
  • the term "button” includes other types of controls, such as switches and the like.
  • more than one button or control may be provided to activate and deactivate the camera 208.
  • the remote control 204 also includes a microphone 209 for receiving sound waves and converting the same into analog or digital audio signals.
  • the microphone 209 may be further in communication with the "cam" button 206 to toggle the operation thereof.
  • the microphone 209 may be enabled through a separate button on the remote control 204.
  • the remote control 204 further includes a radio frequency (RF) transmitter 210.
  • the transmitter 210 may be configured to transmit using infrared (IR), microwave, VHF, UHF, or other frequencies along the electromagnetic spectrum.
  • the transmitter 210 is in electrical communication with the camera 208 to receive video information captured by the camera 208.
  • the transmitter 210 may further be in electrical communication with the microphone 209 to receive audio information.
  • the transmitter 210 preferably modulates the video and/or audio information with a carrier frequency to enable transmission of the information to the STB 102 using techniques well known in the art.
  • the transmitter 210 may operate according to the IEEE 802.11a or 802.11 b Wireless Networking standards, the "Bluetooth" standard, or according to other standard or proprietary wireless techniques. Modulation techniques may include spread spectrum, frequency shift keying, multiple carrier, or other techniques known in the art.
  • the transmitter 210 may include various additional components not specifically illustrated but well known in the art.
  • the transmitter 210 may include a source encoder to reduce the amount of bandwidth required, a channel encoder to modulate the video and/or audio information with a carrier wave, and a directional or non-directional transmission antenna.
  • the transmitter 210 may further include an amplifier to increase the transmission signal strength to an appropriate power level.
  • the transmitter 210 comprises an integrated RF antenna (linear or otherwise configured) etched onto the main printed circuit board of the remote 204. Integration of the antenna with the remote control's circuit board provides for compactness and efficiency in manufacture.
  • the transmitter 210 is a high-bandwidth transmitter capable of sending the video/audio information to the STB 102 in real time.
  • the transmitter 210 may use wideband frequency modulation over a frequency band to provide a one-way video/audio link from the remote control 204 to the STB 102.
  • frequency band may be within the 890- 960 MHz range (GSM), 1990-2110 MHz range or 2400-2500 MHz range or other frequency ranges as approved by FCC regulations.
  • GSM 890- 960 MHz range
  • the one-way video/audio link between remote control 204 and STB 102 also provides for efficiency in manufacture, as a two-way video/audio link is not required in accordance with this embodiment.
  • the transmitter 210 utilizes a frequency division multiplexing (FDM) technique in order to transmit several streams of data simultaneously. These streams may be reassembled at the STB 102 to derive the encoded video/audio information.
  • FDM frequency division multiplexing
  • Various other techniques for providing a high bandwidth in multimedia transmissions may also be used within the scope of the invention.
  • the transmitter 210 is configured to broadcast digital signals.
  • the transmitter 210 may include an analog-to-digital converter (ADC) to convert analog video/audio signals from an analog camera system into digital information.
  • ADC analog-to-digital converter
  • the present invention contemplates the use of analog or digital or both types of transmissions from the remote control 204.
  • the remote control 204 is also in electrical communication with a processor (not shown) that senses a user's operation of the buttons of the remote control 204 and generates appropriate command signals for transmission to the STB 102 and television 202 in order to control the operation of the same.
  • a processor not shown
  • the STB 102 includes an RF receiver 212 for receiving transmissions from the transmitter 210 in the remote control 204.
  • a receiver 212 may include an antenna integrated into a printed circuit board (either a main board or a card coupled to a main board) within the STB 102.
  • the receiver 202 may also demodulate video/audio information from the modulated band transmitted by the remote control 204.
  • the receiver 212 may be configured to receive IR, microwave, VHF, UHF, or other frequencies.
  • the receiver 212 demodulates the video/audio information contained within a carrier frequency of the transmission.
  • the receiver 212 may further include components not specifically illustrated but well known in the art.
  • the receiver 212 may include an antenna for receiving the transmission, an amplifier for increasing the strength of the received signal, and a decoder for separating and demodulating the video and/or audio information from the carrier signal.
  • the receiver 212 is in electrical communication with a converter 214, which converts the video and/or audio information into a digital video and/or audio stream compatible for transmission over the network 100.
  • the conversion process may include compressing the information to improve transmission speed.
  • the converter 214 is in electrical communication with a headend 104 in order to transmit the network-compatible video/audio stream to one or more other STBs 102 in the network 100.
  • the converter 214 is further configured to receive network-compatible video/audio streams from the network 100 and transform the same into display-compatible video/audio signals for display/playback on the television 202.
  • the transmission from the STB 102 to the network 100 must be made to be compatible with upstream transmission in the network 100.
  • one or more frequency bands (for example from 5 to 30 MHz) may be reserved for upstream transmission.
  • Digital modulation for example, quadrature amplitude modulation or vestigial sideband modulation
  • Various protocols such as MPEG or video over IP, may be used to embed the video/audio stream in the digital signals.
  • Upstream transmission will be accomplished differently for different networks 100.
  • Alternative ways to accomplish upstream transmission include an analog telephone line, ISDN, DSL, or other techniques.
  • the STB 102 may include a storage interface 302, which provides access to a digital storage device 304, such as a hard disk drive or the like.
  • the storage interface 302 receives video/audio information from the receiver 212 and delivers the same to the digital storage device 304 for storage.
  • the video/audio information may be stored in an MPEG format or other encoded file formats.
  • the video/audio information may be converted by the converter 214 into a network-compatible video/audio stream before being stored in the storage device 304.
  • the converter 214 includes conventional interface circuitry for communicating with the network 100.
  • a separate network interface (not shown) may be provided, such as a cable modem or the like. Such a cable modem may operate in accordance with the DOCSIS or DAVIC standards.
  • the STB 102 may further include a random access memory (RAM) 306 configured to store data for temporary use.
  • a read-only memory (ROM) 308 may be provided for storing more permanent data, such as fixed code and configuration information.
  • the ROM 308 may be used to store an operating system for the STB 102, such as Windows CE ® or Linux ® .
  • the STB 102 preferably includes a controller 310 that is in communication with the receiver 212, the converter 214, the storage interface 302, the RAM 306, the ROM 308, and the converter 214.
  • the controller 310 may be coupled to the other components of the STB 102 via a bus 312.
  • the controller 310 may be embodied as a microcontroller, a microprocessor, a digital signal processor (DSP) or other device known in the art.
  • DSP digital signal processor
  • the controller 310 manages the operation of the STB 102, including, for example, the conversion of the encoded video/audio information, the storage of the video/audio information, the transmission and reception of video/audio information from the network 100, and the like.
  • the controller 310 may perform these and other operations based on control signals generated by the remote control 204 and transmitted to the receiver 212.
  • the video/audio information received from the remote control 204 may be displayed directly on the television 202 coupled to the STB 102.
  • the video/audio information may also be converted, compressed and transmitted across the network 100 to one or more other STBs 102 where it is displayed on corresponding televisions 202.
  • a user may select which STB(s) 102 will receive a video/audio transmission by entering one or more addresses of the receiving STB(s) 102 using the remote control 204.
  • the address of an STB 102 uniquely identifies the STB 102 within the network 100 and is used by the headends 104, network centers 106, and/or the Internet 108 to route a network-compatible video/audio stream to the appropriate STB 102 using conventional techniques.
  • an STB 102 may simultaneously send and receive multiple video/audio streams. In this manner, video conferencing of networked interactive television systems 200 is enabled.
  • FIG. 4 provides an expanded view of the remote control 204, including the camera 208, the microphone 209, the transmitter 210, and the "cam” button 206.
  • FIG. 4 illustrates an activity indicator 402, which illuminates or otherwise signals the user when the camera 208 and/or microphone 209 is active.
  • the activity indicator 402 may be embodied as an LED (light-emitting diode) or other suitable indicator.
  • the remote control 204 may include a number of other buttons or controls, such as an "accept” button 406, a "reject” button 408, and a “switch” button 410, the functions of which are described below.
  • the various components of the remote control 204 may be positioned in different locations for ergonomics and ease-of-use.
  • the camera 208, "cam" button 206, and activity indicator 402 may be disposed at any convenient and ergonomic location within the remote control 204.
  • FIG. 5 there is shown an alternative interactive television system 500 according to an embodiment of the invention.
  • the television system 500 differs primarily from the television system 200 of FIG. 2 in that the camera 208 and microphone 209 are disposed within a STB 502 rather than a remote control 504.
  • the remote control 504 includes an infrared (IR) transmitter 506 for sending control signals to an I R receiver 508 within the STB 502 and/or the television 202.
  • IR infrared
  • the transmitter may use RF, VHF, UHF, microwave, or other frequencies.
  • the remote control 504 also includes a "cam" button 206 for enabling remote operation of the camera 208 and/or the microphone 209 disposed within the STB 502. ⁇
  • FIG. 6 there is shown an expanded block diagram of the STB 502.
  • the converter 214, the storage interface 302, the digital storage device 304, the RAM 306, the ROM 308, and the controller 310 function as previously described with reference to FIG 3.
  • the STB 502 includes a camera 208 and a microphone 209, which are depicted as being in communication with the bus 312.
  • the STB 502 is depicted as including an activity indicator 402 for visually indicating to a user when the camera 208 is active.
  • FIG. 7 provides an expanded view of the remote control 504, including the IR transmitter 506, the "cam” button 206, the "accept” button 406 and the “reject” button 408.
  • the remote control 504 may also include a separate activity indicator 402 in addition to the indicator 504 in the STB 502. Those skilled in the art will recognize that the various components of the remote control 504 may be positioned in different locations for convenience and ergonomics.
  • the remote control 504 and the STB 502 may both be configured with a camera 208 and/or a microphone 209. This would allow a user to select between a camera 208 disposed locally on the remote control 504 and a camera 208 disposed remotely on the STB 102. Thus, a user may conveniently switch between a stationary camera 208 at a fixed distance or a remote-mounted camera 208 that is highly mobile, depending on the subject to be viewed. In one embodiment, the "switch" button 410 of FIG. 4 may be used for this purpose.
  • FIG. 8 is a flowchart of a method 800 for video and audio capture and communication according to an embodiment of the invention.
  • the method 800 begins when a user of a first STB 102 selects 802 a second STB 102 (or set of STBs 102) in the network 100 to receive a video/audio transmission.
  • the selection may be performed by entering an identification of the second STB 102 or a user thereof by means the remote control 204. If a user's name is specified, for example, the first STB 102 may access a name server or directory (not shown) to retrieve a corresponding address of the second STB 102.
  • the first STB 102 may contain a local directory of addresses to which the user frequently sends video/audio transmissions. Once the first STB 102 has a valid address, it sends a request across the network 100 to the second STB 102.
  • the request should indicate to the second STB 102 that the user of the first STB 102 desires to send a video/audio transmission.
  • the second STB 102 In response to the request, the second STB 102 generates a notification, such as a text message or icon, for display on the corresponding television 202 to notify the user of the second STB 102 of the video/audio transmission.
  • the notification may take the form of an audio signal that is played on a speaker (not shown) in the STB 102 or the television 202.
  • the first STB 102 may wait until a timeout period has expired, after which it notifies the user that the audio/video transmission cannot be sent. Likewise, if the user of the second STB 102 does not respond, or refuses to receive the transmission (by means of the "reject" button 408 of FIG. 4, for example) a not-available signal may be returned to the first STB 102.
  • the user of the second STB 102 wishes to receive the video/audio transmission, she may press a suitable button the remote control 204, such as the "accept" button 406 of FIG. 4, which results in an acceptance signal being returned to the first STB 102.
  • the first STB 102 generates, in response to receiving the acceptance signal, a video or audio acceptance message to notify the user that permission for the video/audio transmission has been granted.
  • the first and second STBs 102 may then initiate 804 a handshake procedure to establish a communication protocol.
  • a handshake procedure may have some similarity with handshake procedures performed between facsimile (fax) machines.
  • the STBs 102 may negotiate a new protocol or reaffirm an existing protocol for video/audio communication.
  • the appropriate protocol may need to be determined because the two STBs have different video/audio conferencing capabilities.
  • the second STB may be capable of video conferencing at a lower resolution (or frame rate), so the communication protocol would be established as is suitable to this lower resolution (or frame rate).
  • the communication protocol used may also depend on the bandwidth and/or reliability of the connection between the two set top boxes. At this point, an active communication link is established between the STBs 102 across the network 100.
  • the first user then activates 806 the camera 208 and/or microphone 209 by pressing, for example, the "cam" button 206.
  • the remote control 204 and/or STB 102 indicates 808 activation of the camera 208 by a visual mechanism, such as an activity indicator 402 (e.g., LED).
  • the camera 208 and/or microphone 209 captures 810 a video and/or audio signal (which is transmitted to the STB 102 in the case of the remote control 204 of FIG. 2).
  • the converter 214 within the STB 102 then transforms 812 the captured video/audio signal into a network-compatible video/audio stream for transmission over the network 100.
  • the network-compatible video/audio stream is transmitted 814 upstream to the network 100.
  • the communication path for the transmission may involve one or more headends 104, network centers 106, and/or the Internet 108, using conventional routing techniques.
  • the network-compatible video/audio stream is then transmitted 816 downstream from the network 100 to the second STB 102. Thereafter, the network-compatible video/audio stream is transformed 818 into a display-compatible video/audio signal for display 820 on the television 202.
  • the second STB 102 may transmit video/audio information to the first STB 102.
  • multiple video/audio streams may be received and transmitted simultaneously by a STB 102.
  • Multiple video streams received by a STB 102 may be displayed on a television 202 at the same time using picture-in-picture (PIP) techniques.
  • PIP picture-in-picture
  • multiple audio streams may be mixed for playback on the television 202.
  • video conferencing between two or more users of networked interactive television systems 200 is enabled.
  • the first STB 102 may transmit a video/audio stream to the second STB 102 without waiting for an acceptance signal.
  • the second STB 102 may record all incoming transmissions in the digital storage device 304. Thereafter, a user of the second STB 102 may review the stored video/audio streams and select which stream, if any, to display at a convenient time.
  • the first STB 102 may be pre-configured to transmit video/audio information to a second STB 102, which has previously granted permission to receive the transmission. Accordingly, a user of the first STB 102 may simply press the "cam" button 206 to immediately capture video/audio information and transmit the same to the second STB 102 for immediate display.
  • the video/audio conferencing may occur between the first STB 102 and a client terminal more generically (not just a second STB 102).
  • the client terminal may comprise a personal computer or other device with a connection to the Internet 108.
  • Such other devices may include Internet appliances, personal digital assistants, Internet-enabled cell phones, and the like. These devices are likely to have varying videoconferencing capabilities, so a handshaking procedure as described above is likely to be quite useful in determining a proper communication protocol.
  • the present invention offers numerous advantages not available in the prior art.
  • a user may easily capture video images of events that would be difficult or impossible to capture with conventional "webcam" devices.
  • the remote control 204 is not limited by a physical cable, a user has the flexibility of carrying the remote control 204 to any desired location. Even in an embodiment in which the camera 208 is located in the STB 102, it is likely that a user will be able to capture events of primary interest, since televisions 202 and STBs 102 are normally located in areas of high use, such as family rooms and the like.

Abstract

A remote control (204) for an interactive television system includes an integrated camera (208) and a wireless transmitter (210) for transmitting video information captured by the camera to the interactive television system. A set top box (102) for the interactive television system includes a wireless receiver (212) for receiving the video information and a converter (214) for transforming the video information into a network-compatible video stream for transmission to a network.

Description

SYSTEMS AND METHODS FOR VIDEO AND AUDIO CAPTURE
AND COMMUNICATION
BACKGROUND OF THE INVENTION
RELATED APPLICATIONS
The present application is related to and claims priority from U.S.
Provisional Application No. 60/237,013, entitled "Systems, Methods, and Devices for Video and Audio Capture and Communications," filed September 29, 2000, with inventor Paul. G. Allen, which is hereby incorporated by reference in its entirety.
FIELD OF THE INVENTION
The present invention relates generally to interactive television systems, and more particularly, to systems and methods for video and audio capture and communication.
DESCRIPTION OF THE BACKGROUND ART
Prior systems, methods, and devices for capturing and communicating video and audio information have various problems and disadvantages.
Consider conventional "webcam" (web camera) devices available today. Such cameras are designed to be mounted on, or placed near, a computer monitor. Designs for monitor-mounted cameras are shown, for example, in U.S. Design Patent No. D0363502 to MacMurtrie et al., entitled
"Monitor mounted video camera," and No. D0363730 to Flohr et al., entitled
"Video camera unit for mounting on a computer monitor." Both of these design patents are hereby incorporated by reference.
A commercially-available "webcam" product that is designed to be placed near a computer monitor is the Logitech Quickcam Pro USB® from
Logitech, Inc., of Fremont, California. Such camera devices are typically connected via a cable to a port of the computer. In general, monitor-mounted cameras are advantageous when used to capture images of a user sitting in front of the computer. Nevertheless, they present several problems and disadvantages.
First, it is awkward and difficult for a user to point such cameras in different directions. The user would typically have to be near the camera and reach over to change the camera's orientation. Second, such cameras are typically not very mobile, since they are connected via a physical cable to the computer. Unfortunately, this means that in order to capture an image of a subject, the subject must be physically placed within view of the camera at its fixed location. Third, due to their immobility, such cameras often miss moments of primary interest to users, which are often transitory and do not occur in close proximity to a computer. For instance, a baby walking for the first time in the family room is very unlikely to be captured by such fixed or tethered devices.
Moreover, consider conventional systems and methods where such "webcam" devices are used for video communications. A conventional system would utilize a modem connection from a personal computer to an Internet service provider (ISP). Using such a connection, the captured video information would be transmitted from the personal computer of one user over the Internet to a personal computer of another user. Although this may achieve a rudimentary form of video conferencing between two users, such Internet-based video conferencing is typically unreliable and of uneven bandwidth due to limitations of the Internet.
SUMMARY OF THE INVENTION The present invention provides systems and methods for video and audio capture and communication that overcome the above-described problems and disadvantages.
In one aspect of the invention, a remote control for an interactive television system includes an integrated camera and a wireless transmitter for transmitting video information captured by the camera to the interactive television system. A set top box for the interactive television system includes a wireless receiver for receiving the video information. In one embodiment, the wireless transmitter is a high-bandwidth, radio-frequency (RF) transmitter, and the receiver is a high-bandwidth, RF receiver. In various embodiments, the set top box includes a digital storage device for recording video information received by the wireless receiver in the set top box. In addition, the set top box may include a converter for transforming the video information captured by the camera into a video stream compatible for transmission over a network.
In another aspect of the invention, the remote control includes an integrated microphone, and the wireless transmitter is further configured to transmit audio information captured by the microphone to the interactive television system. The set top box may include a digital recording device for recording audio information received from the microphone, as well as a converter for transforming the audio information into a network-compatible audio stream for transmission to the network.
In yet another aspect of the invention, a video signal is captured using a camera integrated with a remote control. Thereafter, the video signal is transmitted using a wireless transmitter, which is received by a wireless receiver integrated with a set top box of the interactive television system.
In still another aspect, within the set top box, the video signal is transformed into a video stream of a format compatible for transmission over a network, after which the video stream transmitted from the set top box to the network. The video stream is then transmitted from the network to a second set top box, after which it is transformed into a display-compatible video signal. Finally, the video signal is displayed on a television coupled to the second set top box.
BRIEF DESCRIPTION OF THE DRAWINGS
Non-limiting and non-exhaustive embodiments of the present invention are described in the Figures, in which
FIG. 1 is a schematic block diagram of a television network according to an embodiment of the invention; FIG. 2 is a schematic block diagram of an interactive television system according to an embodiment of the invention;
FIG. 3 is a schematic block diagram of a set top box according to an embodiment of the invention; FIG. 4 is a plan view of a remote control according to an embodiment of the invention;
FIG. 5 is a schematic block diagram of an interactive television system according to an embodiment of the invention; FIG. 6 is a schematic block diagram of a set top box according to an embodiment of the invention;
FIG. 7 is a plan view of a remote control according to an embodiment of the invention; and
FIG. 8 is a flowchart of a method for video capture and communication according to an embodiment of the invention.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
Embodiments of systems and methods for video and audio capture and communication are described herein. In the following description, numerous specific details are provided, such as examples of programming, user selections, transactions, etc., to provide a thorough understanding of embodiments of the invention. One skilled in the relevant art will recognize, however, that the invention can be practiced without one or more of the specific details, or with other methods, components, materials, etc. In other instances, well-known structures, materials, or operations are not shown or described in detail to avoid obscuring aspects of the invention.
Reference throughout this specification to "one embodiment" or "an embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, the appearances of the phrases "in one embodiment" or "in an embodiment" in various places throughout this specification are not necessarily all referring to the same embodiment. Furthermore, the particular features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. Throughout the following description, reference is made to both video and audio information. It should be understood, however, that the devices, systems, and methods of the present invention may be used to capture and communicate either video or audio information, or both, in various embodiments. Referring now to FIG. 1 , there is shown a television network 100, such as a cable network, according to an embodiment of the invention. In one implementation, the network 100 includes a plurality of set top boxes 102 (hereinafter STB 102) or other client terminals located, for instance, at customer homes. Generally, an STB 102 is consumer electronics device that serves as a gateway between a customer's television and a broadband communication network, such as a cable network. As its name implies, an STB 102 is typically located on top of, or in close proximity to, a customer's television.
In one embodiment, an STB 102 receives encoded video/audio signals (including television signals) from the network 100 and decodes the same for display on the television. Additionally, an STB 102 receives commands from a user (typically via a remote control) and transmits such commands back to the network 100.
In various embodiments, each STB 102 is connected to a headend 104. In the context of cable network, a headend 104 is a centrally-located facility where cable TV (CATV) channels are received from a local CATV satellite downlink and packaged together for transmission to customer homes. In one embodiment, the headend 104 also functions as a Central Office (CO) in the telephone industry, routing video and audio streams and other data to and from the various STBs 102 serviced thereby.
Headends 104 may be coupled directly to one another or through a network center 106. In some cases, headends 104 may be connected via a separate network, one particular example of which is Internet 108. Of course, the illustrated network topology is provided for example purposes only, and other network topologies may be used within the scope of the invention.
As described in greater detail below, an STB 102 may transmit video and audio streams to one or more other STBs 102 connected to the network 100. The communication path for the transmission may involve one or more headends 104, network centers 106, and/or the Internet 108. For example, a first STB 102 may send a video transmission upstream to a first headend 104, then to a second headend 104, and finally downstream to a second STB 102. The transmission may use various standard protocols, such as MPEG or video over IP (Internet Protocol). The first and second headends 104 may be one and the same if the
STBs 102 are served by the same headend 104. The transmission between headends 104 may occur (i) via a direct peer-to-peer connection between headends 104, (ii) upstream from the first headend 104 to a network center 106 and then downstream to the second headend 104, or (iii) via the Internet 108.
As described in detail hereafter, each STB 102 may be identified by a unique number, code or address, such as an IP (Internet Protocol) address. Thus, a user of one STB 102 may indicate an STB 102 to receive an audio or video transmission by specifying the corresponding address. The network 100 then routes the transmission to its destination using conventional techniques.
Referring now to FIG. 2, there is shown an interactive television system 200 according to an embodiment of the invention. The television system 200 preferably includes a television 202, which is configured to receive and display standard analog or digital television signals or high-definition television (HDTV) signals. In one embodiment, the television system 200 also includes a STB 102, as discussed above, for sending and receiving audio/video information (including television signals) or other data to and from the network 100. In an alternate embodiment, the functionality of the STB 102 is integrated into an advanced version of the television 202. In one embodiment, a remote control 204 is provided for convenient remote operation of the STB 102 and the television 202. As described below, the remote control 204 may communicate with the STB 102 and television 202 using conventional techniques to adjust, for example, the volume of the television, the displayed channel, and the like. In the illustrated embodiment, the remote control 204 includes a camera 208, such as a color (or monochromatic) digital video camera. In accordance with one embodiment, the camera includes a progressive scan CCD (charged coupled device) array to deliver digital video up to 320 x 240 pixels in 24-bit color. Other resolutions and levels of color are also contemplated to be within the scope of the present invention. The resolution and levels of color of the camera may also be adjustable or selectable by the viewer. Furthermore, a zoom function may be provided for the camera 208. The zoom function may be lens based or preferably digitally based. In addition, the camera may be provided with automatic white balance and automatic exposure features to adjust for lighting and scene content. Of course, such automatic features may be turned off by the user.
The frame rate of the video capture may be 30 frames per second (NTSC, VHS, MPEG), 25 frames per second (PAL), 24 frames per second (motion picture), or other rates. For video conferencing, a frame rate of 8 frames per second provides a somewhat jerky video. Preferably, video conferencing applications using the present invention is performed at a frame rate of at least 10 frames per second for smoother motion. In one embodiment, the MPEG-4 protocol may be used for video conferencing applications using the present invention.
The camera may be used to capture not only video, but also still- pictures. Such still-pictures may be stored in JPEG, BMP, TIFF, or other formats in a digital storage device in the STB 102. Alternatively, the remote control 204 itself may include a digital storage device to store such still-pictures. In accordance with one embodiment, the resolution of the camera when used to capture still-pictures may be greater than the resolution when used to capture video.
The camera is preferably disposed on a surface of the remote control 204 to provide a generally unobstructed view for the camera 208. In particular embodiments, the camera may be disposed on the same surface as the majority of the buttons, or it may be disposed on a surface perpendicular to that surface.
In one embodiment, the camera 208 is capable of capturing a series of images in real time and converting the same into analog or digital video signals. The camera 208 is in electrical communication with a specifically- designated button, such as a camera ("cam") button 206, which toggles operation of the camera 208 in one implementation. The remote control 204 may further include additional buttons to control various features of the STB 102 and the television 202. As used herein, the term "button" includes other types of controls, such as switches and the like. In addition, more than one button or control may be provided to activate and deactivate the camera 208.
In one embodiment, the remote control 204 also includes a microphone 209 for receiving sound waves and converting the same into analog or digital audio signals. The microphone 209 may be further in communication with the "cam" button 206 to toggle the operation thereof. Alternatively, the microphone 209 may be enabled through a separate button on the remote control 204.
In the illustrated embodiment, the remote control 204 further includes a radio frequency (RF) transmitter 210. In alternative embodiments, the transmitter 210 may be configured to transmit using infrared (IR), microwave, VHF, UHF, or other frequencies along the electromagnetic spectrum.
In one implementation, the transmitter 210 is in electrical communication with the camera 208 to receive video information captured by the camera 208. The transmitter 210 may further be in electrical communication with the microphone 209 to receive audio information.
The transmitter 210 preferably modulates the video and/or audio information with a carrier frequency to enable transmission of the information to the STB 102 using techniques well known in the art. For example, the transmitter 210 may operate according to the IEEE 802.11a or 802.11 b Wireless Networking standards, the "Bluetooth" standard, or according to other standard or proprietary wireless techniques. Modulation techniques may include spread spectrum, frequency shift keying, multiple carrier, or other techniques known in the art. To achieve modulation and transmission, the transmitter 210 may include various additional components not specifically illustrated but well known in the art. For example, the transmitter 210 may include a source encoder to reduce the amount of bandwidth required, a channel encoder to modulate the video and/or audio information with a carrier wave, and a directional or non-directional transmission antenna. The transmitter 210 may further include an amplifier to increase the transmission signal strength to an appropriate power level.
In accordance with one embodiment, the transmitter 210 comprises an integrated RF antenna (linear or otherwise configured) etched onto the main printed circuit board of the remote 204. Integration of the antenna with the remote control's circuit board provides for compactness and efficiency in manufacture.
Preferably, the transmitter 210 is a high-bandwidth transmitter capable of sending the video/audio information to the STB 102 in real time. In one embodiment, the transmitter 210 may use wideband frequency modulation over a frequency band to provide a one-way video/audio link from the remote control 204 to the STB 102. For example, frequency band may be within the 890- 960 MHz range (GSM), 1990-2110 MHz range or 2400-2500 MHz range or other frequency ranges as approved by FCC regulations. The one-way video/audio link between remote control 204 and STB 102 also provides for efficiency in manufacture, as a two-way video/audio link is not required in accordance with this embodiment. In another embodiment, the transmitter 210 utilizes a frequency division multiplexing (FDM) technique in order to transmit several streams of data simultaneously. These streams may be reassembled at the STB 102 to derive the encoded video/audio information. Various other techniques for providing a high bandwidth in multimedia transmissions may also be used within the scope of the invention.
In one embodiment, the transmitter 210 is configured to broadcast digital signals. As such, the transmitter 210 may include an analog-to-digital converter (ADC) to convert analog video/audio signals from an analog camera system into digital information. The present invention contemplates the use of analog or digital or both types of transmissions from the remote control 204.
In various embodiments, the remote control 204 is also in electrical communication with a processor (not shown) that senses a user's operation of the buttons of the remote control 204 and generates appropriate command signals for transmission to the STB 102 and television 202 in order to control the operation of the same.
In the illustrated embodiment, the STB 102 includes an RF receiver 212 for receiving transmissions from the transmitter 210 in the remote control 204. Such a receiver 212 may include an antenna integrated into a printed circuit board (either a main board or a card coupled to a main board) within the STB 102. The receiver 202 may also demodulate video/audio information from the modulated band transmitted by the remote control 204. In various embodiments, the receiver 212 may be configured to receive IR, microwave, VHF, UHF, or other frequencies. In one embodiment, the receiver 212 demodulates the video/audio information contained within a carrier frequency of the transmission.
The receiver 212 may further include components not specifically illustrated but well known in the art. For example, the receiver 212 may include an antenna for receiving the transmission, an amplifier for increasing the strength of the received signal, and a decoder for separating and demodulating the video and/or audio information from the carrier signal.
In one implementation, the receiver 212 is in electrical communication with a converter 214, which converts the video and/or audio information into a digital video and/or audio stream compatible for transmission over the network 100. The conversion process may include compressing the information to improve transmission speed.
As noted above, the converter 214 is in electrical communication with a headend 104 in order to transmit the network-compatible video/audio stream to one or more other STBs 102 in the network 100. The converter 214 is further configured to receive network-compatible video/audio streams from the network 100 and transform the same into display-compatible video/audio signals for display/playback on the television 202. In particular, the transmission from the STB 102 to the network 100 must be made to be compatible with upstream transmission in the network 100. For example, in a cable distribution network 100, one or more frequency bands (for example from 5 to 30 MHz) may be reserved for upstream transmission. Digital modulation (for example, quadrature amplitude modulation or vestigial sideband modulation) may be used to send digital signals in the upstream transmission. Various protocols, such as MPEG or video over IP, may be used to embed the video/audio stream in the digital signals. Upstream transmission will be accomplished differently for different networks 100. Alternative ways to accomplish upstream transmission include an analog telephone line, ISDN, DSL, or other techniques.
Referring to FIG. 3, there is shown an expanded block diagram of an STB 102 according to an embodiment of the invention. The STB 102 may include a storage interface 302, which provides access to a digital storage device 304, such as a hard disk drive or the like. In one embodiment, the storage interface 302 receives video/audio information from the receiver 212 and delivers the same to the digital storage device 304 for storage. The video/audio information may be stored in an MPEG format or other encoded file formats. Alternatively, the video/audio information may be converted by the converter 214 into a network-compatible video/audio stream before being stored in the storage device 304. In one embodiment, the converter 214 includes conventional interface circuitry for communicating with the network 100. In an alternative embodiment, a separate network interface (not shown) may be provided, such as a cable modem or the like. Such a cable modem may operate in accordance with the DOCSIS or DAVIC standards.
The STB 102 may further include a random access memory (RAM) 306 configured to store data for temporary use. Similarly, a read-only memory (ROM) 308 may be provided for storing more permanent data, such as fixed code and configuration information. In one embodiment, the ROM 308 may be used to store an operating system for the STB 102, such as Windows CE® or Linux®.
The STB 102 preferably includes a controller 310 that is in communication with the receiver 212, the converter 214, the storage interface 302, the RAM 306, the ROM 308, and the converter 214. The controller 310 may be coupled to the other components of the STB 102 via a bus 312. In various embodiments, the controller 310 may be embodied as a microcontroller, a microprocessor, a digital signal processor (DSP) or other device known in the art. The controller 310 manages the operation of the STB 102, including, for example, the conversion of the encoded video/audio information, the storage of the video/audio information, the transmission and reception of video/audio information from the network 100, and the like. As noted above, the controller 310 may perform these and other operations based on control signals generated by the remote control 204 and transmitted to the receiver 212.
In operation, the video/audio information received from the remote control 204 may be displayed directly on the television 202 coupled to the STB 102. As described in greater detail below, the video/audio information may also be converted, compressed and transmitted across the network 100 to one or more other STBs 102 where it is displayed on corresponding televisions 202.
In one embodiment, a user may select which STB(s) 102 will receive a video/audio transmission by entering one or more addresses of the receiving STB(s) 102 using the remote control 204. As noted above, the address of an STB 102 uniquely identifies the STB 102 within the network 100 and is used by the headends 104, network centers 106, and/or the Internet 108 to route a network-compatible video/audio stream to the appropriate STB 102 using conventional techniques.
In various embodiments, an STB 102 may simultaneously send and receive multiple video/audio streams. In this manner, video conferencing of networked interactive television systems 200 is enabled.
FIG. 4 provides an expanded view of the remote control 204, including the camera 208, the microphone 209, the transmitter 210, and the "cam" button 206. In addition, FIG. 4 illustrates an activity indicator 402, which illuminates or otherwise signals the user when the camera 208 and/or microphone 209 is active. The activity indicator 402 may be embodied as an LED (light-emitting diode) or other suitable indicator. As illustrated, the remote control 204 may include a number of other buttons or controls, such as an "accept" button 406, a "reject" button 408, and a "switch" button 410, the functions of which are described below. Those skilled in the art will recognize that the various components of the remote control 204 may be positioned in different locations for ergonomics and ease-of-use. For example, the camera 208, "cam" button 206, and activity indicator 402 may be disposed at any convenient and ergonomic location within the remote control 204. Referring now to FIG. 5, there is shown an alternative interactive television system 500 according to an embodiment of the invention. The television system 500 differs primarily from the television system 200 of FIG. 2 in that the camera 208 and microphone 209 are disposed within a STB 502 rather than a remote control 504. In the illustrated embodiment, the remote control 504 includes an infrared (IR) transmitter 506 for sending control signals to an I R receiver 508 within the STB 502 and/or the television 202. In alternative embodiments, however, the transmitter may use RF, VHF, UHF, microwave, or other frequencies. In one embodiment, the remote control 504 also includes a "cam" button 206 for enabling remote operation of the camera 208 and/or the microphone 209 disposed within the STB 502. <
Referring to FIG. 6, there is shown an expanded block diagram of the STB 502. The converter 214, the storage interface 302, the digital storage device 304, the RAM 306, the ROM 308, and the controller 310 function as previously described with reference to FIG 3. However, the STB 502 includes a camera 208 and a microphone 209, which are depicted as being in communication with the bus 312. In addition, the STB 502 is depicted as including an activity indicator 402 for visually indicating to a user when the camera 208 is active.
FIG. 7 provides an expanded view of the remote control 504, including the IR transmitter 506, the "cam" button 206, the "accept" button 406 and the "reject" button 408. The remote control 504 may also include a separate activity indicator 402 in addition to the indicator 504 in the STB 502. Those skilled in the art will recognize that the various components of the remote control 504 may be positioned in different locations for convenience and ergonomics.
In yet another alternative embodiment, the remote control 504 and the STB 502 may both be configured with a camera 208 and/or a microphone 209. This would allow a user to select between a camera 208 disposed locally on the remote control 504 and a camera 208 disposed remotely on the STB 102. Thus, a user may conveniently switch between a stationary camera 208 at a fixed distance or a remote-mounted camera 208 that is highly mobile, depending on the subject to be viewed. In one embodiment, the "switch" button 410 of FIG. 4 may be used for this purpose. FIG. 8 is a flowchart of a method 800 for video and audio capture and communication according to an embodiment of the invention. The method 800 begins when a user of a first STB 102 selects 802 a second STB 102 (or set of STBs 102) in the network 100 to receive a video/audio transmission. The selection may be performed by entering an identification of the second STB 102 or a user thereof by means the remote control 204. If a user's name is specified, for example, the first STB 102 may access a name server or directory (not shown) to retrieve a corresponding address of the second STB 102. In one embodiment, the first STB 102 may contain a local directory of addresses to which the user frequently sends video/audio transmissions. Once the first STB 102 has a valid address, it sends a request across the network 100 to the second STB 102. The precise format of the request is not crucial to the invention, but the request should indicate to the second STB 102 that the user of the first STB 102 desires to send a video/audio transmission. In response to the request, the second STB 102 generates a notification, such as a text message or icon, for display on the corresponding television 202 to notify the user of the second STB 102 of the video/audio transmission. Alternatively, the notification may take the form of an audio signal that is played on a speaker (not shown) in the STB 102 or the television 202.
If the second STB 102 is off-line or otherwise not available, the first STB 102 may wait until a timeout period has expired, after which it notifies the user that the audio/video transmission cannot be sent. Likewise, if the user of the second STB 102 does not respond, or refuses to receive the transmission (by means of the "reject" button 408 of FIG. 4, for example) a not-available signal may be returned to the first STB 102.
If the user of the second STB 102 wishes to receive the video/audio transmission, she may press a suitable button the remote control 204, such as the "accept" button 406 of FIG. 4, which results in an acceptance signal being returned to the first STB 102. In one embodiment, the first STB 102 generates, in response to receiving the acceptance signal, a video or audio acceptance message to notify the user that permission for the video/audio transmission has been granted.
The first and second STBs 102 may then initiate 804 a handshake procedure to establish a communication protocol. Such a handshake procedure may have some similarity with handshake procedures performed between facsimile (fax) machines. In this case, the STBs 102 may negotiate a new protocol or reaffirm an existing protocol for video/audio communication. The appropriate protocol may need to be determined because the two STBs have different video/audio conferencing capabilities. For example, the second STB may be capable of video conferencing at a lower resolution (or frame rate), so the communication protocol would be established as is suitable to this lower resolution (or frame rate). The communication protocol used may also depend on the bandwidth and/or reliability of the connection between the two set top boxes. At this point, an active communication link is established between the STBs 102 across the network 100.
In one embodiment, the first user then activates 806 the camera 208 and/or microphone 209 by pressing, for example, the "cam" button 206. In one implementation, the remote control 204 and/or STB 102 indicates 808 activation of the camera 208 by a visual mechanism, such as an activity indicator 402 (e.g., LED). Thereafter, the camera 208 and/or microphone 209 captures 810 a video and/or audio signal (which is transmitted to the STB 102 in the case of the remote control 204 of FIG. 2). The converter 214 within the STB 102 then transforms 812 the captured video/audio signal into a network-compatible video/audio stream for transmission over the network 100. Thereafter, the network-compatible video/audio stream is transmitted 814 upstream to the network 100. As noted with reference to FIG. 1 , the communication path for the transmission may involve one or more headends 104, network centers 106, and/or the Internet 108, using conventional routing techniques.
In one embodiment, the network-compatible video/audio stream is then transmitted 816 downstream from the network 100 to the second STB 102. Thereafter, the network-compatible video/audio stream is transformed 818 into a display-compatible video/audio signal for display 820 on the television 202.
In a like manner, the second STB 102 may transmit video/audio information to the first STB 102. Indeed, in one embodiment, multiple video/audio streams may be received and transmitted simultaneously by a STB 102. Multiple video streams received by a STB 102 may be displayed on a television 202 at the same time using picture-in-picture (PIP) techniques. Likewise, multiple audio streams may be mixed for playback on the television 202. Thus, video conferencing between two or more users of networked interactive television systems 200 is enabled.
Of course, the above-described method 800 is only one possible technique for video and audio capture and communication within the scope of the invention. In alternative embodiments, the first STB 102 may transmit a video/audio stream to the second STB 102 without waiting for an acceptance signal. The second STB 102 may record all incoming transmissions in the digital storage device 304. Thereafter, a user of the second STB 102 may review the stored video/audio streams and select which stream, if any, to display at a convenient time.
In yet another alternative embodiment, the first STB 102 may be pre-configured to transmit video/audio information to a second STB 102, which has previously granted permission to receive the transmission. Accordingly, a user of the first STB 102 may simply press the "cam" button 206 to immediately capture video/audio information and transmit the same to the second STB 102 for immediate display.
Alternatively, the video/audio conferencing may occur between the first STB 102 and a client terminal more generically (not just a second STB 102). The client terminal may comprise a personal computer or other device with a connection to the Internet 108. Such other devices may include Internet appliances, personal digital assistants, Internet-enabled cell phones, and the like. These devices are likely to have varying videoconferencing capabilities, so a handshaking procedure as described above is likely to be quite useful in determining a proper communication protocol.
In view of the forgoing, the present invention offers numerous advantages not available in the prior art. By integrating, in a compact manner, a camera 208 and/or microphone 209 with a remote control 204 for an interactive television system 200, a user may easily capture video images of events that would be difficult or impossible to capture with conventional "webcam" devices. Because the remote control 204 is not limited by a physical cable, a user has the flexibility of carrying the remote control 204 to any desired location. Even in an embodiment in which the camera 208 is located in the STB 102, it is likely that a user will be able to capture events of primary interest, since televisions 202 and STBs 102 are normally located in areas of high use, such as family rooms and the like.
The above description of illustrated embodiments of the invention is not intended to be exhaustive or to limit the invention to the precise forms disclosed. While specific embodiments of, and examples for, the invention are described herein for illustrative purposes, various equivalent modifications are possible within the scope of the invention, as those skilled in the relevant art will recognize.
These modifications can be made to the invention in light of the above detailed description. The terms used in the following claims should not be construed to limit the invention to the specific embodiments disclosed in the specification and the claims. Rather, the scope of the invention is to be determined by the following claims, which are to be construed in accordance with established doctrines of claim interpretation.

Claims

CLAIMS What is claimed is:
1. A system for video capture and communication comprising: a remote control for an interactive television system, the remote control comprising a camera and a wireless transmitter for transmitting video information captured by the camera to the interactive television system; and a set top box for the interactive television system, the set top box comprising a wireless receiver for receiving the video information from the wireless transmitter in the remote control.
2. The system of claim 1 , wherein the wireless transmitter comprises a high- bandwidth, radio-frequency transmitter that utilizes a radio-frequency antenna integrated into a circuit board for the remote control.
3. The system of claim 1 , wherein the wireless receiver comprises a high- bandwidth, radio-frequency receiver that utilizes a radio-frequency antenna integrated into a circuit board within the set top box.
4. The system of claim 1 , wherein the set top box further comprises a digital recording device for recording video information received from the camera in the remote control.
5. The system of claim 1 , wherein the set top box further comprises a converter for transforming the video information received from the camera in the remote control into a network-compatible video stream for transmission over a network.
6. The system of claim 1 , wherein the remote control further comprises a microphone, and wherein the wireless transmitter is configured to transmit audio information captured by the microphone to the interactive television system.
7. The system of claim 6, wherein the set top box further comprises a digital recording device for recording audio information received from the microphone in the remote control.
8. The system of claim 6, wherein the set top box further comprises a converter for transforming the audio information received from the microphone in the remote control into a network-compatible audio stream for transmission over a network.
9. A method for video capture and communication comprising: capturing a video signal using a camera integrated with a remote control for an interactive television system; transmitting the captured video signal using a wireless transmitter integrated with the remote control; and receiving the captured video signal at a wireless receiver integrated with a set top box for the interactive television system.
10. The method of claim 9, further comprising: transforming the video signal into a video stream of a format compatible for transmission over a network.
11. The method of claim 10, further comprising: recording the video stream in a digital storage device integrated with the set top box.
12. The method of claim 14, further comprising: transmitting the video stream from the set top box to the network; and transmitting the video stream from the network to a client terminal.
13. The method of claim 12, wherein the client terminal comprises a second set top box having a corresponding second remote control for control thereof.
14. The method of claim 12, wherein transmitting the video stream from the network to the client terminal comprises: transmitting the video stream from the network to an Internet; and transmitting the video stream from the Internet to the client terminal.
15. The method of claim 12, wherein the client terminal comprises a personal computer coupled to the Internet and having video conferencing software for receiving and displaying the video stream.
16. The method of claim 12, further comprising: performing a handshake procedure between the set top box and the client terminal to initiate a video conference between the set top box and the client terminal.
17. The method of claim 12, further comprising: recording the video stream received from the network in a digital storage device for the client terminal.
18. The method of claim 9, further comprising: generating an audio signal using a microphone integrated with the remote control; transmitting the audio signal using the wireless transmitter; and receiving the audio signal at the wireless receiver integrated with the set top box.
19. The method of claim 18, further comprising: transforming the audio signal into an audio stream of a format compatible for transmission over a network; transmitting the audio stream from the set top box to the network; and transmitting the audio stream from the network to a client terminal.
PCT/US2001/015822 2000-09-29 2001-05-16 Systems and methods for video and audio capture and communication WO2002030117A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2001263186A AU2001263186A1 (en) 2000-09-29 2001-05-16 Systems and methods for video and audio capture and communication

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US23701300P 2000-09-29 2000-09-29
US60/237,013 2000-09-29
US69875300A 2000-10-27 2000-10-27
US09/698,753 2000-10-27

Publications (1)

Publication Number Publication Date
WO2002030117A1 true WO2002030117A1 (en) 2002-04-11

Family

ID=26930318

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2001/015822 WO2002030117A1 (en) 2000-09-29 2001-05-16 Systems and methods for video and audio capture and communication

Country Status (2)

Country Link
AU (1) AU2001263186A1 (en)
WO (1) WO2002030117A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1463287A1 (en) * 2003-03-27 2004-09-29 Samsung Electronics Co., Ltd. Portable multifunctional device
US8170109B2 (en) 2006-03-23 2012-05-01 Nds Limited System for analysis of motion
CN105451089A (en) * 2014-08-21 2016-03-30 扬智科技股份有限公司 Multimedia processing device and multimedia communication system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5793416A (en) * 1995-12-29 1998-08-11 Lsi Logic Corporation Wireless system for the communication of audio, video and data signals over a narrow bandwidth
US5796424A (en) * 1995-05-01 1998-08-18 Bell Communications Research, Inc. System and method for providing videoconferencing services
US5936679A (en) * 1995-08-24 1999-08-10 Hitachi, Ltd. Television receiver having multiple communication capabilities
US6084638A (en) * 1996-10-08 2000-07-04 Hare; Charles S. Computer interface extension system and method
US6243129B1 (en) * 1998-01-09 2001-06-05 8×8, Inc. System and method for videoconferencing and simultaneously viewing a supplemental video source

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5796424A (en) * 1995-05-01 1998-08-18 Bell Communications Research, Inc. System and method for providing videoconferencing services
US5936679A (en) * 1995-08-24 1999-08-10 Hitachi, Ltd. Television receiver having multiple communication capabilities
US5793416A (en) * 1995-12-29 1998-08-11 Lsi Logic Corporation Wireless system for the communication of audio, video and data signals over a narrow bandwidth
US6084638A (en) * 1996-10-08 2000-07-04 Hare; Charles S. Computer interface extension system and method
US6243129B1 (en) * 1998-01-09 2001-06-05 8×8, Inc. System and method for videoconferencing and simultaneously viewing a supplemental video source

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1463287A1 (en) * 2003-03-27 2004-09-29 Samsung Electronics Co., Ltd. Portable multifunctional device
US8170109B2 (en) 2006-03-23 2012-05-01 Nds Limited System for analysis of motion
CN105451089A (en) * 2014-08-21 2016-03-30 扬智科技股份有限公司 Multimedia processing device and multimedia communication system

Also Published As

Publication number Publication date
AU2001263186A1 (en) 2002-04-15

Similar Documents

Publication Publication Date Title
US6489986B1 (en) Remote control device for video and audio capture and communication
US6529233B1 (en) Systems and methods for remote video and audio capture and communication
US6397388B1 (en) Systems and devices for audio capture and communication during television broadcasts
US6944880B1 (en) Methods for audio capture and communication during television broadcasts
US20020054206A1 (en) Systems and devices for audio and video capture and communication during television broadcasts
US7003795B2 (en) Webcam-based interface for initiating two-way video communication
US6941575B2 (en) Webcam-based interface for initiating two-way video communication and providing access to cached video
US8654262B2 (en) Content delivery to a digital TV using a low-power frequency converted RF signal
US9584757B2 (en) Apparatus and method for effectively implementing a wireless television system
US8239912B2 (en) Wireless network base stations capable of receiving video signals
US20060020995A1 (en) Fast channel change in digital media systems
US20080049118A1 (en) Self-contained wireless camera device, wireless camera system and method
US9843765B2 (en) Integrated devices for multimedia content delivery and video conferencing
WO2002056588A1 (en) Hardware decoding of media streams from multiple sources
KR20070005495A (en) Content integration platform with format and protocol conversion
US20030046705A1 (en) System and method for enabling communication between video-enabled and non-video-enabled communication devices
US20040155961A1 (en) Apparatus and method for controlling display of video camera signals received over a powerline network
WO2002047383A1 (en) Interactive companion set top box
WO2002030117A1 (en) Systems and methods for video and audio capture and communication
WO2002030121A1 (en) Systems, methods, and devices for audio capture and telecommunication
WO2003021960A1 (en) Tv system with group communication
JP2003046880A (en) Portable viewer, receiver mounting transmission function therefor and portable viewer system program
WO2003003708A2 (en) Webcam-based interface for initiating two-way video communication
JP2003304526A (en) Image distribution system, camera system, and server and client apparatus
JPH03154583A (en) Television receiver integrated type picture transmitter

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP