WO2003058403A2 - Method and apparatus for wideband conferencing - Google Patents

Method and apparatus for wideband conferencing Download PDF

Info

Publication number
WO2003058403A2
WO2003058403A2 PCT/US2002/041778 US0241778W WO03058403A2 WO 2003058403 A2 WO2003058403 A2 WO 2003058403A2 US 0241778 W US0241778 W US 0241778W WO 03058403 A2 WO03058403 A2 WO 03058403A2
Authority
WO
WIPO (PCT)
Prior art keywords
wideband
audio
communications device
connection
communications
Prior art date
Application number
PCT/US2002/041778
Other languages
French (fr)
Other versions
WO2003058403A3 (en
Inventor
Jeffrey Rodman
David Drell
Original Assignee
Polycom, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Polycom, Inc. filed Critical Polycom, Inc.
Priority to EP02806278A priority Critical patent/EP1464142A4/en
Publication of WO2003058403A2 publication Critical patent/WO2003058403A2/en
Publication of WO2003058403A3 publication Critical patent/WO2003058403A3/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M7/00Arrangements for interconnection between switching centres
    • H04M7/0024Services and arrangements where telephone services are combined with data services
    • H04M7/0057Services where the data services network provides a telephone service in addition or as an alternative, e.g. for backup purposes, to the telephone service provided by the telephone services network
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/56Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
    • H04M3/567Multimedia conference systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42127Systems providing several special services or facilities from groups H04M3/42008 - H04M3/58
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/56Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M7/00Arrangements for interconnection between switching centres
    • H04M7/12Arrangements for interconnection between switching centres for working between exchanges having different types of switching equipment, e.g. power-driven and step by step or decimal and non-decimal

Definitions

  • the present invention relates generally to the field of conferencing, and more particularly to a method and apparatus for wideband conferencing.
  • Speakerphones and telephones are telecommunications devices used for a variety of purposes, typically for telephonic communications between two or more endpoints and two or more parties.
  • Remote teleconferencing serves a valuable purpose, and often enables increased productivity between, or within, organizations without raising associated costs incurred due to travel and time constraints.
  • Telecommunications devices e.g., speakerphones, etc.
  • Telecommunications devices of the prior art are typically used to transmit analog, voice-based communications at frequencies below 3.3 kiloHertz (kHz), and typically work over two different types of telecommunications networks.
  • the first type of telecommunications network comprises the Plain Old Telephone System (POTS) and the Public Switched Telephone Network (PSTN).
  • POTS Plain Old Telephone System
  • PSTN Public Switched Telephone Network
  • PSTN Public Switched Telephone Network
  • the second type of network relies upon network information technologies such as the Internet, a Local Area Network (LAN), a Wide Area Network (WAN) or a Virtual Private Network (VPN) to transmit voice signals as data packets.
  • LAN Local Area Network
  • WAN Wide Area Network
  • VPN Virtual Private Network
  • a telecommunications device Over a PSTN /POTS network, a telecommunications device routes calls between specialized computers known as switches. A call signal is sent through a Private Branch Exchange (PBX), which addresses and connects calls to a destination PBX and, ultimately, to a receiving telecommunications device.
  • PBX Private Branch Exchange
  • a POTS/PSTN conferencing system 100 is shown.
  • An initiating telecommunications device 102 sends a calling signal to a PBX 104, which in turn routes the call to a switch 106.
  • the switch 106 subsequently, routes the call to a receiving switch 108 over a PSTN 110.
  • the call is then routed from the receiving switch 108 through a destination PBX 112 to a receiving telecommunications device 114.
  • the telecommunications devices 102 and 114 may train and synchronize, adjusting for line conditions such as amplitude response, delay distortions, timing recovery, and echo characteristics.
  • the PBX is optional.
  • the telecommunications device may be connected directly to the switch 106. This is the typical connection from a user's home.
  • a primary source of quality degradation with telecommunications devices occurs as a result of network infrastructure characteristics such as frequency handling and available bandwidth.
  • the conferencing system 100 operating over a PSTN/POTS network is typically limited by both available frequency and a narrow range of bandwidth. These network characteristics limit type, form, and amount of data shared between telecommunications devices 102 and 114. Conventional narrow bandwidth systems further limit audio quality: audio bandwidth, audio noise level, audio path gain, etc.
  • the second type of telecommunications network may also rely upon PSTN, but transmits signals over digital networks using packet switching.
  • This treatment results in a technique known as Voice over IP (VoIP) or simply IP.
  • VoIP devices forward sound and other data as packets of information over digital networks using standards including ITU H.323, MGCP, SIP, etc.
  • VoIP devices encodes voice (and /or other sounds) as data packets that are switched between network- addressed servers.
  • the network-addressed servers process, reassemble, and convert digital signals to analog signals at a receiving VoIP device.
  • an exemplary IP conferencing system 200 of the prior art is shown.
  • An IP device 202 encodes sound from analog signals into digital data packets using a codec 204.
  • the system 200 switches data packets through a home (or other type of) network 208 such as a LAN, WAN, etc.
  • a communications device is an RS-232- C modem.
  • the system 200 then transmits the data packets through a firewall 210, if one is present, to an access server 212.
  • the access server 212 enables the data packets to be switched onto an Internet backbone 214.
  • the system 200 forwards the data packets through a destination access server 216 and a destination firewall 218, if one is present, to a receiving home (or other type of) network 220.
  • the data packets are re- modulated via a modem 222 or similar device, and encoded into analog voice- based signals using a codec 224.
  • the modem 222 and codec 224 are embedded in a receiving VoIP device 226, in one example.
  • Multimedia conferencing represents a substantial improvement over voice conferencing.
  • current telecommunications devices are incapable of overcoming existing network limitations.
  • current telecommunications devices cannot effectually bridge or manage multiple calls.
  • the limited frequency range of 3.3 kHz prohibits multiple call functions from being simultaneously performed including bridging or data exchange.
  • Current telecomm devices can bridge and manage multiple calls, but their usability is degraded due to the fact that the available bandwidth is much less than the bandwidth of human speech. Further, this issue becomes more critical as more people are in the conference. Understandability, more sources of noise, increases difficulty in identifying the talker, for example.
  • the present invention provides, in various embodiments, a method for wideband voice and data conferencing over a communications network between wideband communications devices.
  • a system determines whether a wideband audio communications device is communicating with another wideband audio communications device. If both devices have wideband capability, then data is sent via wideband audio communications. Alternatively, if one of the communications devices is not wideband capable, then the communication is performed via a narrowband connection. In both situations, the communications devices may train (coordinated and connected, which may include training) prior to the connection. Further, the system may adjust the communications between the communications devices based on network conditions.
  • a wideband audio communications device such as a speakerphone, comprises an integrated modem, at least one microphone, and at least one audio codec.
  • the at least one microphone obtains audio from conference participants located in proximity of the wideband audio communications device.
  • the audio codec then encodes the obtained audio into digitized audio signals for transmission over a wideband audio connection to a receiving wideband audio communications device.
  • the wideband audio connection is over a PSTN.
  • the wideband audio communications device further comprises a speaker for outputting audio received from a remote communications device.
  • FIG. 1 illustrates a Plain Old Telephone System (POTS) and a Public-Switched Telephone Network (PSTN) of the prior art;
  • POTS Plain Old Telephone System
  • PSTN Public-Switched Telephone Network
  • FIG. 2 illustrates a Voice Over Internet Protocol (VoIP) system of the prior art
  • FIG. 3a illustrates a wideband conferencing system over a Public Switched Telephone Network (PSTN) according to one embodiment of the present invention
  • FIG. 3b illustrates a wideband audio conferencing system over a PSTN, according to an alternative embodiment of the present invention
  • FIG. 4a illustrates a wideband conferencing system according to another embodiment of the present invention
  • FIG. 4b illustrates an alternative, exemplary wideband audio conferencing system
  • FIG. 5a illustrates a wideband data conferencing system between two wideband communications devices utilizing a gateway, according to one embodiment of the present invention
  • FIG. 5b illustrates a conferencing system utilizing a gateway when one device in on POTS and a second device in on an IP network
  • FIG. 6a illustrates an exemplary wideband communications device
  • FIG. 6b illustrates an alternative exemplary wideband communications device
  • FIG. 7 is exemplary signal flow path for wideband data exchange
  • FIG. 8 is an exemplary flowchart illustrating a method of establishing a wideband telephony conference.
  • FIG. 9 is an exemplary flowchart illustrating an alternative method of establishing a wideband telephony conference. DESCRIPTION OF EXEMPLARY EMBODIMENTS
  • wideband audio communications devices eliminate numerous limitations of narrowband communications devices, such as conventional phones, and may effectively enable simultaneously combined audio and data communications.
  • video and audio conferencing at higher frequencies e.g., 7 kHz
  • increased bandwidth in which the bandwidth enhancement is based on the principle of exchanging audio as compressed digital information via a codec, permits improved capabilities such as voice and data security between endpoints.
  • Wideband communications devices also enable simultaneous audio and data communications, wideband bridging, private session key exchange for network and data security, and additional call management capabilities.
  • FIG. 3a an overview of an exemplary wideband conferencing system 300 of the present invention is depicted.
  • This embodiment allows for the transmission of both audio and data signals.
  • a computing device 302 and a video display 304 At one endpoint of the conferencing system 300 are a computing device 302 and a video display 304.
  • the computing device 302 and the video display 304 are integrated into one device.
  • other devices may be coupled instead of, or in addition to, the computing device 302 such as projectors, cameras, etc.
  • Data is input or initially stored in the computing device 302. The data is then forwarded to a codec 306, which compresses the data for transmission.
  • a wideband communications devices 308 coupled to the computing device 302 comprises at least one codec 310 and a modem 312.
  • the wideband communications device 308 further comprises a microphone for picking up audio and a speaker (not shown) for outputting audio from a remote site.
  • the codec 310 is a common device or technique for converting a signal (in this case, an analog, and especially audio, signal) to digital data. Examples of codecs include, but are not limited to, G.711, G.722, G.722.1, G.728, etc.
  • the codec 310 receives audio signals from the microphone, and encodes the audio signals for transmission over a PSTN 318 at higher frequencies then conventional audio signals over PSTN.
  • the modem 312 modulates all signals for transmission over the
  • the modem 312 is a common device or technique for sending digital data over an analog medium such as a phone line. Examples of modems include Bell212, V.34, V.90, and V.92. Thus, the encoded audio signals from the codec 310 are modulated by the modem 312 and sent as digitized audio signals over the PSTN 318. In one embodiment, the modem 312 acts as a multiplexer to combine the audio signals, control data signals, and data signals received from the codec 306. In an alternative embodiment, a multiplexer or similar device may be utilized to combine these signals for transmission.
  • compressed speech data from the codec 310 and control data may be sent through the modem, while video /graphic data from the codec 306 is routed through a second network 314 via a router or switch.
  • wideband communications devices 308 and 320 can access and retrieve data stored in a central office data conferencing server 316.
  • the central office data conferencing server 316 may be a repository for data, be a voice bridge, or generally control the conference (e.g., allow multi-endpoint communications).
  • control signals for retrieving data from the data conferencing server 316 are sent over the PSTN 318 along with the audio signals.
  • the control data is sent via a separate data channel.
  • a separate data side-channel may be utilized for sending connection information, such as URL and website information.
  • the wideband communications device 308 and receiving wideband communications device 320 both have dual connections to the PSTN 318 and the external communications network 314.
  • the present system uses the limited data exchange capabilities of the PSTN 318 to send control signals to efficiently manage data retrieval of substantial file sizes from the external communications network 314. Included with the control signals may be a private password or session key, which directs the receiving wideband communications device 320 and /or a receiving computing device 322 to a specific network address to retrieve data (e.g., the data conferencing server 316).
  • a receiving modem 324 demodulates received conference signals.
  • the modem 324 (if it is acting as a multiplexer device) or a multiplexer device then separates the data signals from the audio signals.
  • a receiving internal codec 326 decodes the received digitized audio signals.
  • the audio is then output to a user through an internal speaker (not shown), while the data signals are sent to a codec 328.
  • Control signals may either be utilized internal in the receiving communications device 320 or sent to the computing device 322.
  • the receiving codec 328 then decodes the digital data signals enabling the data to be displayed at the receiving computing device 322 or a receiving video display 330, or utilized by other coupled devices.
  • both audio and limited data signals are transmitted over the PSTN 318.
  • the transmitting communications device 308 modulates the control data and /or the data signals onto a carrier signal, or adds DTMF signals to the audio signal.
  • the carrier or DTMF signal is then added to the audio signal prior to transmission.
  • the multiplexing may occur at a specialized modem, or alternatively, at a separate multiplexing device.
  • only audio signals may be sent via the PSTN 318 while data signals are sent via the external communications network 314.
  • the receiving communications device 320 and the receiving computing device 322 may operate as a transmitting communications device and transmitting computer device, respectively.
  • the codec 328 will encode /compress data received from the computing device 322 prior to forwarding the data to the communication device 320.
  • the codec 326 will encode the audio signals received from an internal microphone (not shown), while the modem 324 modulates all signals for transmission over the PSTN 318 .
  • the communications device 308 now demodulates the signal with the modem 312, and decodes the audio signals via the codec 310. The audio is then output to a user via the internal speaker.
  • Data signals are then sent to the codec 306 for decoding/decompression, and subsequent display on, or operated on by, the computing device 302 and /or the display 304, or sent to other coupled devices.
  • more computing devices may be coupled to the communications devices.
  • FIG. 3a The conferencing system 300 of FIG. 3a is described in connection with data conferencing. However, it should be noted that a conferencing system, according to the present invention may operate without data exchange as simply a wideband audio conferencing system, as shown in FIG. 3b.
  • the communications device 308 is only coupled to the communications device 320 via the PSTN 318. Thus, digitized audio signals (and optional control data signals) are sent via the PSTN 318.
  • FIGs. 3a and 3b only show two communications devices coupled via the PSTN, any number of communications devices can be involved in a conference session.
  • FIG. 4a an exemplary wideband voice conferencing system 400 is shown whereby audio and data signals are transmitted via a PSTN 412.
  • a computing device 402 provides data that is subsequently encoded /compressed by a codec 404.
  • the encoded data is then sent to a wideband communications device 406.
  • the wideband communications device 406 (e.g., a telephone, speakerphone, etc.) comprises at least one codec 408, a microphone for picking up audio (not shown), and a speaker (not shown) for outputting audio from a remote site.
  • the codec 408 receives audio signals from the microphone, and encodes the audio signals for transmission over the PSTN 412 in a digital form.
  • a line interface 410 coupled to the communications device 406 is an exemplary means of establishing a network connection from the wideband communications device 406 to the PSTN 412.
  • the line interface 410 may be a ISDN hub utilizing protocols such as, but not limited to, SIP, TCP, and IP.
  • a receiving line interface 414 coupled to the PSTN 412 receives the encoded signals and forwards the signals to a communications device 418 which comprises a codec 416, a microphone (not shown), and a speaker (not shown). Subsequently, the receiving codec 416 decodes the audio signals and presents the audio to conference participants via the internal speakers. Roughly simultaneously, the data signals are forwarded to codec 420 for decoding. The decoded data is then transmitted to a computing device 422. Alternatively, the conferencing system may be reversed with the wideband communications device 418 transmitting data to the communications device 406. In further embodiments, more computing devices may be coupled to each of the communications devices.
  • FIG. 4b illustrates a similar conferencing system to the FIG. 4a embodiment.
  • the FIG. 4b embodiment only allows for communication of signals generated by the communications devices 406 or 418.
  • audio received by the internal microphone of the communications device 406 is digitized by the codec 408 and sent via the PSTN 412 to the communications device 418, and vice-versa.
  • Control data generated by the communications devices 406 and 418 may also be sent via the PSTN on a side-channel.
  • FIG. 5a another exemplary wideband conferencing system 500 uses bridging and media control.
  • a computing device 502 transmits encoded data to a wideband communications device 506 using a codec 504, which processes the data signals for transmission.
  • the wideband communications device 506, which is coupled to the network 508, also receives audio from an internal microphone (not shown).
  • Other devices may also be coupled to the network 508 including, but not limited to, a G.711 communications device 510, a G.722.1 communications device 512, and a video conferencing device 514 in any combination thereof.
  • audio and data can be managed from a gateway /media control unit 516.
  • the exemplary gateway/media control unit 516 controls signal routing and serves as a communications network interface for both video and audio signals.
  • Video signals may include high-resolution still graphics, moving video images, images of spreadsheets, and other visual presentations.
  • the gateway/media control unit 516 comprises a transcoder 518, a data conferencing server 520, and a bridge 522.
  • the transcoder 518 encodes data retrieved from the data conferencing server 520.
  • other embodiments of the gateway/media control unit 516 may contain more or less elements.
  • the gateway/media control unit 516 enables multiple endpoints to be simultaneously bridged using the bridge 522 regardless of the use of data retrieval.
  • the bridge 522 is capable of integrating PSTN audio, VGA-LAN (i.e., high-resolution graphics imagery) data, and IP-voice into a single conference.
  • Data and bridged calls are routed through a destination network 524 to a receiving wideband communications device 526.
  • Voice signals may be presented to a user though a speaker located in the communications device 526, while data signals are forwarded to a receiving codec 528. After decoding, the data may then be displayed at a receiving computing device 530.
  • other devices may be coupled to the network 524.
  • These devices may include, but are not limited to, a G.711 communications device 532, a G.722.1 communications device 534, and a videoconferencing device 536.
  • audio signals may be presented to the user through the computing device 530 and/or data may be displayed or presented at the communications device 526. Because communications is bilateral, the conferencing system 500 may operate in the reverse (i.e., the communications device 526 transmits signals to the communications device 506).
  • the transcoder 518 encodes digital audio signals according to ITU standards (e.g., G.711 for either PBX or ISDN channels, etc.). Additionally, the transcoder 518 encodes audio signals in accordance with ITU standards (e.g., G.722.1, etc.) for wideband transmission above 7 kHz. Further, alternative components encompassing other ITU, non-ITU standards, or other techniques are interchangeable with the transcoder 518. The transcoder 518 also can direct the receiving wideband communications device 526 (or communications device 506 in the reverse process) to a particular network address for data retrieval.
  • ITU standards e.g., G.711 for either PBX or ISDN channels, etc.
  • ITU standards e.g., G.722.1, etc.
  • alternative components encompassing other ITU, non-ITU standards, or other techniques are interchangeable with the transcoder 518.
  • the transcoder 518 also can direct the receiving wideband communications device 526 (or communications device 506 in the reverse
  • the data conferencing server 520 is integrated within the gateway media control unit 516, and can be used by all conferencing parties.
  • the storage and retrieval of data from the data conferencing server 520 is controlled by signals passed between the wideband communications devices 506 and 526.
  • the present invention allows for determination of whether participating communications devices are wideband or narrowband, and can adjust communications accordingly.
  • FIG. 5b is an exemplary conference system 550 whereby one communications device 552 is located on an IP network, while a second communications device 572 uses POTS.
  • the communications device 552 encodes audio via a codec 554.
  • the audio data packets are then switched via a modem 556 through a home network 558 or similar network (e.g., LAN, WAN, etc.).
  • the audio data packets are sent through a firewall 560 (if one exists) to an access server 562, which allows the audio data packets to be switched onto an Internet backbone 564.
  • the packets are then sent to a destination server 566, and to a VoIP/POTS gateway 568.
  • the VoIP/POTS gateway 568 translates between the VoIP's version of wideband audio to a form of wideband audio utilized by the communications device 572.
  • the packets are sent through POTS wiring 570 to the communications device 572.
  • the packets are then demodulated by an internal modem 574 and decoded by an internal codec 576.
  • the audio is then presented to a user of the communications device 572 via an internal speaker (not shown).
  • the conference system 550 is a bilateral communications system. Therefore, audio may be sent from the POTS communications device 572 to the VoIP communications device 552. In this embodiment, audio is received by an internal microphone in the communications device 572 and encoded by the codec 576. The digitized audio signals are then sent via the modem 574 to the VoIP/POTS gateway 568 via the POTS wiring 570. The VoIP/POTS gateway translates the POTS wideband signal into a VoIP wideband signal compatible with the communications device 552. Subsequently, the signal is then routed to the communications device 552.
  • FIG. 6a illustrates an exemplary block diagram of a wideband communications device 600 of the present invention.
  • the wideband communications device 600 is coupled to a PSTN 602.
  • the wideband communications device 600 is also optionally coupled to the Internet 604. For signals switched over the PSTN
  • a line interface 606 relays signals to and from either a codec 608 (e.g., G.711 codec) or a codec 610 (e.g., G.722.1 codec) which encodes audio signals received by a microphone 611 or similar audio sensor, and decodes audio signals received from a remote site.
  • the codec 610 is coupled to the line interface 606 via an optional voice channel 612 and a digital modem 614.
  • the codec 608 may operate in narrowband, while the codec 610 and the modem operate in wideband. In alternative embodiments, more or less codecs may be embodied in the communications device 600.
  • control module 616 Further coupled to the modem 614 is a control module 616.
  • the control module 616 enables users to manage calls and adjust sounds rendered audible by a speaker 617. Supervisory control of conferencing is also handled by the control module 616, which provides control data to conferencing parties in order to modify conferencing parameters. These parameters include directing the type and manner of audio and/or video display, caller intervention, secure access and retrieval of data, and a variety of other functions.
  • the modem 614 receives both the audio and control data signals, and forwards the signals to the line interface for transmission over the PSTN 602.
  • the communications device 600 may provide a digital control channel and a voice channel over a modem link to the PSTN 602.
  • the control data may be sent to a control module of the receiving communications device where the parameters may then be altered.
  • a network interface 618 e.g., Ethernet interface
  • the protocol stack 620 establishes Internet voice and protocol video conferencing sessions.
  • the data signals are passed to the control module 616.
  • conferencing video or other data are decoded by a codec 622, and transmitted to a computing device 624 and /or a video display 626.
  • the codec 622 encodes the data from the computing device 624, and forwards the encoded data to the communications device 600.
  • the communications device 600 then processes the signals before forwarding the data to the Internet 604 via the network interface
  • the communications device 600 further comprises an embedded audio echo canceller.
  • This audio echo canceller allows for full-duplex conversation between participating endpoints.
  • this audio canceller may be embodied in the voice channel 612.
  • FIG. 6b shows an alternative embodiment of the wideband communications device whereby both codecs 608 and 610 are coupled to the voice channel 612.
  • the voice channel 612 comprises the audio echo canceller.
  • the codec 608 operates in narrowband mode, while the codec 612 and the modem operate in wideband mode.
  • the wideband communications device 600 may only be coupled to the PSTN 602.
  • the protocol stack 620 and the network interface 618 are not present in the wideband communications device 600.
  • more or less elements may be present in the communications device.
  • the integrated wideband communications device 600 at a minimum, comprises at least one codec, a modem, a microphone, and a speaker. Additionally, the communications device 600 may not be coupled to the codec 622, the computing device 624, and the video display 626.
  • the first purpose is to transmit data between endpoints.
  • the second purpose is to direct an endpoint to a location such as a central office data storage repository to retrieve conferencing data such as spreadsheets or images.
  • a signal flow path 700 for wideband data exchange is shown. Control signals containing encrypted data and session keys from an initiating wideband communications device 702 pass through a network
  • the gateway media control unit 708 initiates a data exchange using Session Initial Protocols (SIPs) that establish data tunnels 710 on either end of a data conferencing server 712.
  • SIPs Session Initial Protocols
  • the data tunnels 710 permit exchange of security information such as session keys that encrypt specific network addresses for the data conferencing server 712.
  • Data retrieved or sent must pass through a destination firewall 714 (if one exists) of a destination network 716 before reaching a receiving wideband communications device 718.
  • the gateway media control unit 708 controls connections of H.263 bi-directional data streams between two endpoints. Initially, the gateway media control unit 708 receives HTTP connections from the two endpoints (e.g., wideband communications devices 702 and 718), and if the two endpoints present matching session keys, the gateway media control unit 708 connects the H.263 data streams to each endpoint. Additionally, the gateway media control unit 708 may validate that each endpoint is registered and authorized for service.
  • the two endpoints e.g., wideband communications devices 702 and 718
  • Data can be either sent directly between the transmitting wideband communications device 702 and the receiving wideband communications device 718, or directed for retrieval from the data conferencing server 712 by passing control data such as session keys.
  • control data such as session keys
  • the preferred method would be to pass control data or signals using the flow path 700 described in FIG. 7, thus reducing the amount of data passed between endpoints.
  • the control data and session keys are sent via a PSTN connection, while data is sent /retrieved via the Internet. This method leverages the use of combined analog PSTN and data network connections (e.g., the Internet) to reduce call latency and delay.
  • an exemplary flowchart 800 illustrates one method for initiating a wideband audio conference with an optional data side- channel.
  • a call is initiated, and the audio portion is linked.
  • a wideband probe signal is sent from an initiating wideband communications device in step 804.
  • the purpose of the wideband probe is to determine whether a receiving communications device, computing device, etc. is wideband telephony-capable. If at step 806 a wideband probe query does not receive a response, then the transmitting wideband communications device enters a narrowband mode at step 808 and connects the call as a narrowband call at step 810. Consequently, the behavior of the wideband communications device is typical of an analog, PSTN-switched POTS call.
  • the wideband communications device shifts to a wideband mode at step 814.
  • the wideband communications device begins initiating training at step 816 with the receiving communications device.
  • Training is a telecommunications technique used by many modems for ensuring that QoS and call quality are maximized by synchronizing modems to each other, adjusting for changing line conditions such as delay, signal distortion and amplitude response, etc. Training usually, but not always, occurs prior to establishing a connection, depending on the specifications of the modem. If two modems /communications devices fail to train, a call connection will not be established.
  • both wideband communications devices are trained and synchronized at step 818 to exchange both voice and optional data signals
  • the call is connected at step 820.
  • both endpoints i.e., the transmitting wideband communications device and the receiving communications device
  • other frequencies can be used, or that other audio enhancements, such as stereo, or enhanced dynamic range, can be transmitted.
  • Conferencing parties can also exchange data or direct another parties to retrieve data from data storage repositories. These enhancements lead to a high degree of interactivity and increased capabilities for multimedia communication.
  • the initialing wideband communications device may determine a receiving communications device is not wideband capable without using a probe.
  • the receiving communication device may respond with speech (e.g., a user of the receiving communications device says "hello").
  • the initiating communications device determines the receiving communications device is not wideband capable, and the audio only conference (i.e., narrowband connection) can start.
  • FIG. 9 illustrates an alternative embodiment whereby the receiving communications device initiates a wideband probe.
  • a call is initiated by a transmitting communications device.
  • an audio link is established between the transmitting communications device and a receiving communications device.
  • a wideband probe signal is sent from by the wideband, receiving communications device in step 904 to determine whether the transmitting communications device is wideband telephony-capable. If at step 906 a wideband probe query does not receive a response, then the receiving, wideband communications device enters a narrowband mode at step 908 and connects the call as a narrowband call at step 910. Consequently, the behavior of the wideband communications device is typical of an analog, PSTN-switched POTS call.
  • the wideband communications device shifts to a wideband mode at step 914.
  • the wideband communications device begins initiating training at step 916 with the transmitting communications device.
  • both wideband communications devices are trained and synchronized at step 918 to exchange both voice and data signals, the call is connected at step 920.
  • both audio and data may be transmitted between the wideband communications devices.
  • quality of an audio (and possibly data) connection may be adjusted based on available data bandwidth.
  • the present system may automatically adjust the quality of the connection during a call, and in some situations, will even drop the call to a conventional analog, narrowband connection when the connection quality drops too low.
  • some references to data may be construed to refer to audio data or digitized audio signals, while other references may refer to control data, or alternatively to video /graphics data.

Abstract

A method and apparatus (Fig. 3A) for wideband voice and optional data conferencing over a telecommunications network channel between at least two wideband communications devices (PC 302, 322 and Video Display 330, 304). An exemplary method comprises establishing an audio link, verifying wideband capability between the at least two wideband communications devices, training modems of the at least two wideband communications device to line conditions, and adjusting the telecommunications connection line conditions between the communications devices. Once a wideband connection has been established, audio and data may be simultaneously exchanged.

Description

METHOD AND APPARATUS FOR WIDEBAND CONFERENCING
CROSS-REFERENCES TO RELATED APPLICATIONS
[ 0001] This application claims priority from U.S. Provisional Patent Application Number 60/345,929, filed December 31, 2001, and entitled "Method and Apparatus for IP Conferencing," which is incorporated herein by reference for all purposes.. rfThis application &lsV> Halms priority from U.S. Provisional Patent Application Number 60/360,984, filed March 1, 2002, and entitled "Systems and Methods for Video Conferencing Across a Network," which is also incorporated herein by reference.
BACKGROUND OF THE INVENTION
Field of the Invention
[ 0002 ] The present invention relates generally to the field of conferencing, and more particularly to a method and apparatus for wideband conferencing.
Description of the Background Art
[0003] Speakerphones and telephones are telecommunications devices used for a variety of purposes, typically for telephonic communications between two or more endpoints and two or more parties. Remote teleconferencing serves a valuable purpose, and often enables increased productivity between, or within, organizations without raising associated costs incurred due to travel and time constraints.
[ 0004 ] Telecommunications devices (e.g., speakerphones, etc.) of the prior art are typically used to transmit analog, voice-based communications at frequencies below 3.3 kiloHertz (kHz), and typically work over two different types of telecommunications networks. The first type of telecommunications network comprises the Plain Old Telephone System (POTS) and the Public Switched Telephone Network (PSTN). The second type of network relies upon network information technologies such as the Internet, a Local Area Network (LAN), a Wide Area Network (WAN) or a Virtual Private Network (VPN) to transmit voice signals as data packets. However, each of these types of networks suffers from limitations unique to their respective type of network.
[0005] Over a PSTN /POTS network, a telecommunications device routes calls between specialized computers known as switches. A call signal is sent through a Private Branch Exchange (PBX), which addresses and connects calls to a destination PBX and, ultimately, to a receiving telecommunications device.
[0006] Referring to FIG. 1, a POTS/PSTN conferencing system 100 is shown. An initiating telecommunications device 102 sends a calling signal to a PBX 104, which in turn routes the call to a switch 106. The switch 106, subsequently, routes the call to a receiving switch 108 over a PSTN 110. The call is then routed from the receiving switch 108 through a destination PBX 112 to a receiving telecommunications device 114. In analog mode, the telecommunications devices 102 and 114 may train and synchronize, adjusting for line conditions such as amplitude response, delay distortions, timing recovery, and echo characteristics. However, conventional phones and speakerphones do very little training and synchronizing, so the amount of training and synchronization can be anywhere to none, which still yields a working link (99.9% of phones do it this way) to other telecommunications devices. It should be noted that the PBX is optional. In this embodiment, the telecommunications device may be connected directly to the switch 106. This is the typical connection from a user's home.
[ 0007 ] A primary source of quality degradation with telecommunications devices occurs as a result of network infrastructure characteristics such as frequency handling and available bandwidth. The conferencing system 100 operating over a PSTN/POTS network is typically limited by both available frequency and a narrow range of bandwidth. These network characteristics limit type, form, and amount of data shared between telecommunications devices 102 and 114. Conventional narrow bandwidth systems further limit audio quality: audio bandwidth, audio noise level, audio path gain, etc.
[0008] Conventional telecommunications devices are typically designed to filter out frequencies above 3.3 kHz. However, the filtering of frequencies between 3.3 kHz and 7 kHz significantly reduces sound quality, clarity, and distinction. The fact that conventional telecommunications devices and networks filter frequencies above 3.3kHz limits intelligibility of speech and other sounds, because much of the content that the ear depends on is carried in these higher frequencies. This results in connections that sound hollow, muddled, muffled, or distorted. The use of analog pathways also introduces significant variations in gain, thus one connection will be much quieter than another. This results in a connection that is difficult to hear. The use of analog pathways also often introduces significant noise, resulting in connections that are difficult to understand. The combination of all these degradations, which is common on conventional phone lines, results in poor and variable call quality. These problems are exacerbated when participants in a conference are in sub-optimal environments, such as reverberant rooms, when there are multiple participants who may interrupt one another, or when participants do not all share a common native language or dialect, resulting in accented speech that can be difficult to understand in the best of conditions, and impossible over a phone connection. Another problem often associated with conventional telecommunications devices includes the transmission of background noise and static over a PSTMcall. Thus, current telecommunications devices often provide poor Quality of Service (QoS) and lack enhanced features and functionality.
[0009] Alternatively, the second type of telecommunications network may also rely upon PSTN, but transmits signals over digital networks using packet switching. This treatment results in a technique known as Voice over IP (VoIP) or simply IP. VoIP devices forward sound and other data as packets of information over digital networks using standards including ITU H.323, MGCP, SIP, etc. Using an embedded modem and codec, a VoIP device encodes voice (and /or other sounds) as data packets that are switched between network- addressed servers. The network-addressed servers process, reassemble, and convert digital signals to analog signals at a receiving VoIP device.
[0010] Referring to FIG. 2, an exemplary IP conferencing system 200 of the prior art is shown. An IP device 202 encodes sound from analog signals into digital data packets using a codec 204. Using a modem 206 or similar device, the system 200 switches data packets through a home (or other type of) network 208 such as a LAN, WAN, etc. One example of a communications device is an RS-232- C modem. The system 200 then transmits the data packets through a firewall 210, if one is present, to an access server 212. The access server 212 enables the data packets to be switched onto an Internet backbone 214.
[0011 ] Next, the system 200 forwards the data packets through a destination access server 216 and a destination firewall 218, if one is present, to a receiving home (or other type of) network 220. The data packets are re- modulated via a modem 222 or similar device, and encoded into analog voice- based signals using a codec 224. The modem 222 and codec 224 are embedded in a receiving VoIP device 226, in one example.
[0012] Although the system 200 enables VoIP, there are significant problems associated with this type of conferencing over an IP network. One problem of conventional PSTN and IP-capable telecommunications devices is an inability to conduct effective simultaneous sound (e.g., voice, etc.) and data conferencing due to bandwidth limitations and lower frequency ranges. Further, significant problems with conferencing over a VoIP network are that expanded services such as wideband audio or side-channel data cannot be shared with non- VoIP devices, which constitute the great majority of endpoints in the world. [0013] Another limitation of VoIP telecommunications devices results from time delays incurred from data packet re-assembly. The delay in re- assembly results in broken and unnatural speech, greatly reducing the quality of the conference call. Still another limitation with IP conferencing is data vulnerability to external breaches of security. Although data encryption can be implemented using means such as public and private session keys, bandwidth restrictions impose a significant burden upon telecommunications devices and substantially affect QoS.
[0014] Multimedia conferencing represents a substantial improvement over voice conferencing. However, current telecommunications devices are incapable of overcoming existing network limitations. Furthermore, current telecommunications devices cannot effectually bridge or manage multiple calls. The limited frequency range of 3.3 kHz prohibits multiple call functions from being simultaneously performed including bridging or data exchange. Current telecomm devices can bridge and manage multiple calls, but their usability is degraded due to the fact that the available bandwidth is much less than the bandwidth of human speech. Further, this issue becomes more critical as more people are in the conference. Understandability, more sources of noise, increases difficulty in identifying the talker, for example. As a separate but significant issue in modern teleconferencing, there is very limited ability to communicate side- channel data (such as sending dial-additional-call commands, requesting cost-of- conference-so-far status information, and so forth) between the participants or to a bridging device. The most common technique is to use DTMF tones, which is slow and disruptive.
[ 0015] Yet another limitation of prior art narrowband conferencing systems is their ineffectiveness in handling multiple simultaneous speakers (or other sources of sound). A further problematic limitation of prior art narrowband speakerphones is call degradation due to the exclusionary filtering of signals above 3.3 kHz. Thus, conventional telecommunications devices have severe limitations related to QoS, data security, bridging, and advanced communications functions. Therefore, there is a need for a new and innovative method and apparatus for wideband audio conferencing using existing infrastructures to deliver enhanced services. SUMMARY OF THE INVENTION
[0016] The present invention provides, in various embodiments, a method for wideband voice and data conferencing over a communications network between wideband communications devices. According to one embodiment of the present invention, a system determines whether a wideband audio communications device is communicating with another wideband audio communications device. If both devices have wideband capability, then data is sent via wideband audio communications. Alternatively, if one of the communications devices is not wideband capable, then the communication is performed via a narrowband connection. In both situations, the communications devices may train (coordinated and connected, which may include training) prior to the connection. Further, the system may adjust the communications between the communications devices based on network conditions.
[0017] In an exemplary embodiment of the system, a wideband audio communications device, such as a speakerphone, comprises an integrated modem, at least one microphone, and at least one audio codec. The at least one microphone obtains audio from conference participants located in proximity of the wideband audio communications device. The audio codec then encodes the obtained audio into digitized audio signals for transmission over a wideband audio connection to a receiving wideband audio communications device. In an exemplary embodiment, the wideband audio connection is over a PSTN. The wideband audio communications device further comprises a speaker for outputting audio received from a remote communications device.
[0018] A further understanding of the nature and advantages of the present inventions herein may be realized by reference to the remaining portions of the specification and the attached drawings. BRIEF DESCRIPTION OF THE DRAWINGS
[0019] FIG. 1 illustrates a Plain Old Telephone System (POTS) and a Public-Switched Telephone Network (PSTN) of the prior art;
[0020] FIG. 2 illustrates a Voice Over Internet Protocol (VoIP) system of the prior art;
[ 0021 ] FIG. 3a illustrates a wideband conferencing system over a Public Switched Telephone Network (PSTN) according to one embodiment of the present invention;
[ 0022 ] FIG. 3b illustrates a wideband audio conferencing system over a PSTN, according to an alternative embodiment of the present invention;
[0023] FIG. 4a illustrates a wideband conferencing system according to another embodiment of the present invention;
[ 0024 ] FIG. 4b illustrates an alternative, exemplary wideband audio conferencing system; [0025] FIG. 5a illustrates a wideband data conferencing system between two wideband communications devices utilizing a gateway, according to one embodiment of the present invention;
[0026] FIG. 5b illustrates a conferencing system utilizing a gateway when one device in on POTS and a second device in on an IP network; [0027 ] FIG. 6a illustrates an exemplary wideband communications device;
[0028] FIG. 6b illustrates an alternative exemplary wideband communications device;
[0029] FIG. 7 is exemplary signal flow path for wideband data exchange;
[0030] FIG. 8 is an exemplary flowchart illustrating a method of establishing a wideband telephony conference; and
[0031] FIG. 9 is an exemplary flowchart illustrating an alternative method of establishing a wideband telephony conference. DESCRIPTION OF EXEMPLARY EMBODIMENTS
[ 0032 ] As shown in the exemplary drawings wherein like reference numerals indicate like or corresponding elements among the figures, exemplary embodiments of a system and method according to the present invention will now be described in detail. Detailed descriptions of various embodiments are provided herein. It is to be understood, however, that the present invention may be embodied in various forms. Therefore, specific details disclosed herein are not to be interpreted as limiting, but rather as a basis for the claims and as a representative basis for teaching one skilled in the art to employ the present invention in virtually any appropriately detailed system, structure, method, process, or manner.
[0033] As mentioned herein, various drawbacks to the prior art telephony approaches exist. For example, call degradation of prior art phone systems often occurs due to the exclusionary filtering of signals with frequencies above 3.3 kHz.
[0034] Advantageously, wideband audio communications devices eliminate numerous limitations of narrowband communications devices, such as conventional phones, and may effectively enable simultaneously combined audio and data communications. In one embodiment according to the present invention, video and audio conferencing at higher frequencies (e.g., 7 kHz) greatly increases call quality and clarity. Additionally, increased bandwidth, in which the bandwidth enhancement is based on the principle of exchanging audio as compressed digital information via a codec, permits improved capabilities such as voice and data security between endpoints. Wideband communications devices also enable simultaneous audio and data communications, wideband bridging, private session key exchange for network and data security, and additional call management capabilities.
[0035] Referring to FIG. 3a, an overview of an exemplary wideband conferencing system 300 of the present invention is depicted. This embodiment allows for the transmission of both audio and data signals. At one endpoint of the conferencing system 300 are a computing device 302 and a video display 304. In alternative embodiments, the computing device 302 and the video display 304 are integrated into one device. In a further embodiment, other devices may be coupled instead of, or in addition to, the computing device 302 such as projectors, cameras, etc. Data is input or initially stored in the computing device 302. The data is then forwarded to a codec 306, which compresses the data for transmission.
[0036] A wideband communications devices 308 (e.g., a telephone, speakerphone, etc.) coupled to the computing device 302 comprises at least one codec 310 and a modem 312. The wideband communications device 308 further comprises a microphone for picking up audio and a speaker (not shown) for outputting audio from a remote site. The codec 310 is a common device or technique for converting a signal (in this case, an analog, and especially audio, signal) to digital data. Examples of codecs include, but are not limited to, G.711, G.722, G.722.1, G.728, etc. Thus, the codec 310 receives audio signals from the microphone, and encodes the audio signals for transmission over a PSTN 318 at higher frequencies then conventional audio signals over PSTN. Although the codec 310 is shown embodied in the communications device 302, alternatively, the codec 310 may be embodied in a different device or be a stand-alone device. [ 0037 ] The modem 312 modulates all signals for transmission over the
PSTN 318. The modem 312 is a common device or technique for sending digital data over an analog medium such as a phone line. Examples of modems include Bell212, V.34, V.90, and V.92. Thus, the encoded audio signals from the codec 310 are modulated by the modem 312 and sent as digitized audio signals over the PSTN 318. In one embodiment, the modem 312 acts as a multiplexer to combine the audio signals, control data signals, and data signals received from the codec 306. In an alternative embodiment, a multiplexer or similar device may be utilized to combine these signals for transmission. In a further alternative embodiment, compressed speech data from the codec 310 and control data may be sent through the modem, while video /graphic data from the codec 306 is routed through a second network 314 via a router or switch. [0038] Accessing an external communications network 314, such as the Internet, wideband communications devices 308 and 320 can access and retrieve data stored in a central office data conferencing server 316. The central office data conferencing server 316 may be a repository for data, be a voice bridge, or generally control the conference (e.g., allow multi-endpoint communications). In one embodiment, control signals for retrieving data from the data conferencing server 316 are sent over the PSTN 318 along with the audio signals. The control data is sent via a separate data channel. Further, a separate data side-channel may be utilized for sending connection information, such as URL and website information. In these embodiments, the wideband communications device 308 and receiving wideband communications device 320 both have dual connections to the PSTN 318 and the external communications network 314. The present system uses the limited data exchange capabilities of the PSTN 318 to send control signals to efficiently manage data retrieval of substantial file sizes from the external communications network 314. Included with the control signals may be a private password or session key, which directs the receiving wideband communications device 320 and /or a receiving computing device 322 to a specific network address to retrieve data (e.g., the data conferencing server 316).
[0039] At a receiving endpoint, a receiving modem 324 demodulates received conference signals. The modem 324 (if it is acting as a multiplexer device) or a multiplexer device then separates the data signals from the audio signals. A receiving internal codec 326 decodes the received digitized audio signals. The audio is then output to a user through an internal speaker (not shown), while the data signals are sent to a codec 328. Control signals may either be utilized internal in the receiving communications device 320 or sent to the computing device 322. The receiving codec 328 then decodes the digital data signals enabling the data to be displayed at the receiving computing device 322 or a receiving video display 330, or utilized by other coupled devices. If the data signals contain the control signals, then the computing device 322 is directed to a specific network address to retrieve data. Both the data retrieve/display and the audio occur relatively simultaneously to an end user. [0040] In the above-described embodiment, both audio and limited data signals are transmitted over the PSTN 318. In one embodiment, the transmitting communications device 308 modulates the control data and /or the data signals onto a carrier signal, or adds DTMF signals to the audio signal. The carrier or DTMF signal is then added to the audio signal prior to transmission. The multiplexing may occur at a specialized modem, or alternatively, at a separate multiplexing device. Alternatively, only audio signals may be sent via the PSTN 318 while data signals are sent via the external communications network 314. [0041] Because communications is bilateral, the receiving communications device 320 and the receiving computing device 322 may operate as a transmitting communications device and transmitting computer device, respectively. In this situation, the codec 328 will encode /compress data received from the computing device 322 prior to forwarding the data to the communication device 320. Similarly, the codec 326 will encode the audio signals received from an internal microphone (not shown), while the modem 324 modulates all signals for transmission over the PSTN 318 . On the receiving end, the communications device 308 now demodulates the signal with the modem 312, and decodes the audio signals via the codec 310. The audio is then output to a user via the internal speaker. Data signals are then sent to the codec 306 for decoding/decompression, and subsequent display on, or operated on by, the computing device 302 and /or the display 304, or sent to other coupled devices. In further embodiments, more computing devices may be coupled to the communications devices.
[ 0042 ] The conferencing system 300 of FIG. 3a is described in connection with data conferencing. However, it should be noted that a conferencing system, according to the present invention may operate without data exchange as simply a wideband audio conferencing system, as shown in FIG. 3b. In this embodiment, the communications device 308 is only coupled to the communications device 320 via the PSTN 318. Thus, digitized audio signals (and optional control data signals) are sent via the PSTN 318. It should be noted that although FIGs. 3a and 3b only show two communications devices coupled via the PSTN, any number of communications devices can be involved in a conference session.
[0043] Referring to FIG. 4a, an exemplary wideband voice conferencing system 400 is shown whereby audio and data signals are transmitted via a PSTN 412. Initially, a computing device 402 provides data that is subsequently encoded /compressed by a codec 404. The encoded data is then sent to a wideband communications device 406.
[ 004 ] The wideband communications device 406 (e.g., a telephone, speakerphone, etc.) comprises at least one codec 408, a microphone for picking up audio (not shown), and a speaker (not shown) for outputting audio from a remote site. The codec 408 receives audio signals from the microphone, and encodes the audio signals for transmission over the PSTN 412 in a digital form. A line interface 410 coupled to the communications device 406 is an exemplary means of establishing a network connection from the wideband communications device 406 to the PSTN 412. In one embodiment, the line interface 410 may be a ISDN hub utilizing protocols such as, but not limited to, SIP, TCP, and IP.
[ 0045] A receiving line interface 414 coupled to the PSTN 412 receives the encoded signals and forwards the signals to a communications device 418 which comprises a codec 416, a microphone (not shown), and a speaker (not shown). Subsequently, the receiving codec 416 decodes the audio signals and presents the audio to conference participants via the internal speakers. Roughly simultaneously, the data signals are forwarded to codec 420 for decoding. The decoded data is then transmitted to a computing device 422. Alternatively, the conferencing system may be reversed with the wideband communications device 418 transmitting data to the communications device 406. In further embodiments, more computing devices may be coupled to each of the communications devices.
[0046] FIG. 4b illustrates a similar conferencing system to the FIG. 4a embodiment. However, the FIG. 4b embodiment only allows for communication of signals generated by the communications devices 406 or 418. Thus, audio received by the internal microphone of the communications device 406 is digitized by the codec 408 and sent via the PSTN 412 to the communications device 418, and vice-versa. Control data generated by the communications devices 406 and 418 may also be sent via the PSTN on a side-channel.
[0047] Referring to FIG. 5a, another exemplary wideband conferencing system 500 uses bridging and media control. Initially, a computing device 502 transmits encoded data to a wideband communications device 506 using a codec 504, which processes the data signals for transmission. The wideband communications device 506, which is coupled to the network 508, also receives audio from an internal microphone (not shown). Other devices may also be coupled to the network 508 including, but not limited to, a G.711 communications device 510, a G.722.1 communications device 512, and a video conferencing device 514 in any combination thereof.
[0048] Accessed through the network 508, audio and data can be managed from a gateway /media control unit 516. The exemplary gateway/media control unit 516 controls signal routing and serves as a communications network interface for both video and audio signals. Video signals may include high-resolution still graphics, moving video images, images of spreadsheets, and other visual presentations. The gateway/media control unit 516 comprises a transcoder 518, a data conferencing server 520, and a bridge 522. The transcoder 518 encodes data retrieved from the data conferencing server 520. Alternatively, other embodiments of the gateway/media control unit 516 may contain more or less elements.
[0049] Additionally, the gateway/media control unit 516 enables multiple endpoints to be simultaneously bridged using the bridge 522 regardless of the use of data retrieval. The bridge 522 is capable of integrating PSTN audio, VGA-LAN (i.e., high-resolution graphics imagery) data, and IP-voice into a single conference. Data and bridged calls are routed through a destination network 524 to a receiving wideband communications device 526. Voice signals may be presented to a user though a speaker located in the communications device 526, while data signals are forwarded to a receiving codec 528. After decoding, the data may then be displayed at a receiving computing device 530. In addition to, or instead of, the communications device 526, other devices may be coupled to the network 524. These devices may include, but are not limited to, a G.711 communications device 532, a G.722.1 communications device 534, and a videoconferencing device 536. In further embodiments of the present invention, audio signals may be presented to the user through the computing device 530 and/or data may be displayed or presented at the communications device 526. Because communications is bilateral, the conferencing system 500 may operate in the reverse (i.e., the communications device 526 transmits signals to the communications device 506).
[ 0050 ] In one embodiment, the transcoder 518 encodes digital audio signals according to ITU standards (e.g., G.711 for either PBX or ISDN channels, etc.). Additionally, the transcoder 518 encodes audio signals in accordance with ITU standards (e.g., G.722.1, etc.) for wideband transmission above 7 kHz. Further, alternative components encompassing other ITU, non-ITU standards, or other techniques are interchangeable with the transcoder 518. The transcoder 518 also can direct the receiving wideband communications device 526 (or communications device 506 in the reverse process) to a particular network address for data retrieval.
[0051] The data conferencing server 520 is integrated within the gateway media control unit 516, and can be used by all conferencing parties. The storage and retrieval of data from the data conferencing server 520 is controlled by signals passed between the wideband communications devices 506 and 526. In further embodiments, it is possible for narrowband devices to participate in a wideband speakerphone conference. As will be described further below in connection with FIGs. 8 and 9, the present invention allows for determination of whether participating communications devices are wideband or narrowband, and can adjust communications accordingly.
[ 0052 ] FIG. 5b is an exemplary conference system 550 whereby one communications device 552 is located on an IP network, while a second communications device 572 uses POTS. In this embodiment, the communications device 552 encodes audio via a codec 554. The audio data packets are then switched via a modem 556 through a home network 558 or similar network (e.g., LAN, WAN, etc.). The audio data packets are sent through a firewall 560 (if one exists) to an access server 562, which allows the audio data packets to be switched onto an Internet backbone 564.
[ 0053 ] The packets are then sent to a destination server 566, and to a VoIP/POTS gateway 568. The VoIP/POTS gateway 568 translates between the VoIP's version of wideband audio to a form of wideband audio utilized by the communications device 572. Once translated, the packets are sent through POTS wiring 570 to the communications device 572. The packets are then demodulated by an internal modem 574 and decoded by an internal codec 576. The audio is then presented to a user of the communications device 572 via an internal speaker (not shown).
[ 0054 ] The conference system 550 is a bilateral communications system. Therefore, audio may be sent from the POTS communications device 572 to the VoIP communications device 552. In this embodiment, audio is received by an internal microphone in the communications device 572 and encoded by the codec 576. The digitized audio signals are then sent via the modem 574 to the VoIP/POTS gateway 568 via the POTS wiring 570. The VoIP/POTS gateway translates the POTS wideband signal into a VoIP wideband signal compatible with the communications device 552. Subsequently, the signal is then routed to the communications device 552.
[ 0055 ] In further keeping with exemplary embodiments, FIG. 6a illustrates an exemplary block diagram of a wideband communications device 600 of the present invention. The wideband communications device 600 is coupled to a PSTN 602. In one embodiment, the wideband communications device 600 is also optionally coupled to the Internet 604. For signals switched over the PSTN
602, a line interface 606 relays signals to and from either a codec 608 (e.g., G.711 codec) or a codec 610 (e.g., G.722.1 codec) which encodes audio signals received by a microphone 611 or similar audio sensor, and decodes audio signals received from a remote site. The codec 610 is coupled to the line interface 606 via an optional voice channel 612 and a digital modem 614. In one embodiment, the codec 608 may operate in narrowband, while the codec 610 and the modem operate in wideband. In alternative embodiments, more or less codecs may be embodied in the communications device 600.
[0056] Further coupled to the modem 614 is a control module 616. The control module 616 enables users to manage calls and adjust sounds rendered audible by a speaker 617. Supervisory control of conferencing is also handled by the control module 616, which provides control data to conferencing parties in order to modify conferencing parameters. These parameters include directing the type and manner of audio and/or video display, caller intervention, secure access and retrieval of data, and a variety of other functions. The modem 614 receives both the audio and control data signals, and forwards the signals to the line interface for transmission over the PSTN 602. Thus, the communications device 600 may provide a digital control channel and a voice channel over a modem link to the PSTN 602. On a receiving end, the control data may be sent to a control module of the receiving communications device where the parameters may then be altered.
[ 0057 ] For digital data over the Internet 604, a network interface 618 (e.g., Ethernet interface) relays signals to and from a protocol stack 620. The protocol stack 620 establishes Internet voice and protocol video conferencing sessions. For data received from the Internet 604, the data signals are passed to the control module 616. Ultimately, conferencing video or other data are decoded by a codec 622, and transmitted to a computing device 624 and /or a video display 626. When sending data over the Internet 604, the codec 622 encodes the data from the computing device 624, and forwards the encoded data to the communications device 600. The communications device 600 then processes the signals before forwarding the data to the Internet 604 via the network interface
618.
[0058] In a further embodiment of the present invention, the communications device 600 further comprises an embedded audio echo canceller. This audio echo canceller allows for full-duplex conversation between participating endpoints. In one embodiment, this audio canceller may be embodied in the voice channel 612. [0059] FIG. 6b shows an alternative embodiment of the wideband communications device whereby both codecs 608 and 610 are coupled to the voice channel 612. In this embodiment, the voice channel 612 comprises the audio echo canceller. Thus all audio signals must pass through the audio echo canceller, which provides full-duplex functions in narrowband or wideband mode. Further, in an exemplary embodiment of the wideband communications device 600, the codec 608 operates in narrowband mode, while the codec 612 and the modem operate in wideband mode.
[ 0060 ] With regards to both embodiments of FIG. 6a and 6b, alternatively, the wideband communications device 600 may only be coupled to the PSTN 602. In this embodiment, the protocol stack 620 and the network interface 618 are not present in the wideband communications device 600. In further embodiments, more or less elements may be present in the communications device. However, it is preferred that the integrated wideband communications device 600, at a minimum, comprises at least one codec, a modem, a microphone, and a speaker. Additionally, the communications device 600 may not be coupled to the codec 622, the computing device 624, and the video display 626.
[0061] There are primarily two main purposes to wideband data exchange. The first purpose is to transmit data between endpoints. The second purpose is to direct an endpoint to a location such as a central office data storage repository to retrieve conferencing data such as spreadsheets or images. Referring to FIG. 7, an exemplary signal flow path 700 for wideband data exchange is shown. Control signals containing encrypted data and session keys from an initiating wideband communications device 702 pass through a network
704 and an optional security firewall 706 to a gateway media control unit 708. The gateway media control unit 708 initiates a data exchange using Session Initial Protocols (SIPs) that establish data tunnels 710 on either end of a data conferencing server 712. The data tunnels 710 permit exchange of security information such as session keys that encrypt specific network addresses for the data conferencing server 712. Data retrieved or sent must pass through a destination firewall 714 (if one exists) of a destination network 716 before reaching a receiving wideband communications device 718.
[ 0062 ] In one embodiment, the gateway media control unit 708 controls connections of H.263 bi-directional data streams between two endpoints. Initially, the gateway media control unit 708 receives HTTP connections from the two endpoints (e.g., wideband communications devices 702 and 718), and if the two endpoints present matching session keys, the gateway media control unit 708 connects the H.263 data streams to each endpoint. Additionally, the gateway media control unit 708 may validate that each endpoint is registered and authorized for service.
[0063] Data can be either sent directly between the transmitting wideband communications device 702 and the receiving wideband communications device 718, or directed for retrieval from the data conferencing server 712 by passing control data such as session keys. For larger amounts of data, the preferred method would be to pass control data or signals using the flow path 700 described in FIG. 7, thus reducing the amount of data passed between endpoints. By storing data on the data conferencing server 712, the amount of necessary data transmitted is reduced to the encrypted network address passed between endpoints. In one embodiment, the control data and session keys are sent via a PSTN connection, while data is sent /retrieved via the Internet. This method leverages the use of combined analog PSTN and data network connections (e.g., the Internet) to reduce call latency and delay. Although FIG. 7 is described in connection to data exchange, a similar signal path may be utilized for the exchange of wideband audio. [0064] In an alternative embodiment, data may be stored on an internal server. Due to security concerns of storing data on the Internet, some users may prefer to host a web server on their own internal system. In this situation, the communications devices are configured to point to the private, internal server via a web page in each communications device. This configuration may be performed once when the communications device is installed. [0065] Referring to FIG. 8, an exemplary flowchart 800 illustrates one method for initiating a wideband audio conference with an optional data side- channel. At step 802, a call is initiated, and the audio portion is linked. Subsequently, a wideband probe signal is sent from an initiating wideband communications device in step 804. The purpose of the wideband probe is to determine whether a receiving communications device, computing device, etc. is wideband telephony-capable. If at step 806 a wideband probe query does not receive a response, then the transmitting wideband communications device enters a narrowband mode at step 808 and connects the call as a narrowband call at step 810. Consequently, the behavior of the wideband communications device is typical of an analog, PSTN-switched POTS call.
[0066] Alternatively, if the wideband probe reply signal is returned positively at step 806, then the wideband communications device shifts to a wideband mode at step 814. In the wideband mode, the wideband communications device begins initiating training at step 816 with the receiving communications device. Training is a telecommunications technique used by many modems for ensuring that QoS and call quality are maximized by synchronizing modems to each other, adjusting for changing line conditions such as delay, signal distortion and amplitude response, etc. Training usually, but not always, occurs prior to establishing a connection, depending on the specifications of the modem. If two modems /communications devices fail to train, a call connection will not be established.
[0067] Once both wideband communications devices are trained and synchronized at step 818 to exchange both voice and optional data signals, the call is connected at step 820. In the wideband mode, both endpoints (i.e., the transmitting wideband communications device and the receiving communications device) will hear sounds employing audio signals up to 7 kHz, in one embodiment. It is contemplated that other frequencies can be used, or that other audio enhancements, such as stereo, or enhanced dynamic range, can be transmitted. Conferencing parties can also exchange data or direct another parties to retrieve data from data storage repositories. These enhancements lead to a high degree of interactivity and increased capabilities for multimedia communication.
[0068] In an alternative embodiment, the initialing wideband communications device may determine a receiving communications device is not wideband capable without using a probe. In this embodiment, once the audio is linked, the receiving communication device may respond with speech (e.g., a user of the receiving communications device says "hello"). Upon receiving the speech, the initiating communications device determines the receiving communications device is not wideband capable, and the audio only conference (i.e., narrowband connection) can start.
[0069] FIG. 9 illustrates an alternative embodiment whereby the receiving communications device initiates a wideband probe. At step 902, a call is initiated by a transmitting communications device. Subsequently, an audio link is established between the transmitting communications device and a receiving communications device. Next, a wideband probe signal is sent from by the wideband, receiving communications device in step 904 to determine whether the transmitting communications device is wideband telephony-capable. If at step 906 a wideband probe query does not receive a response, then the receiving, wideband communications device enters a narrowband mode at step 908 and connects the call as a narrowband call at step 910. Consequently, the behavior of the wideband communications device is typical of an analog, PSTN-switched POTS call.
[ 0070 ] Alternatively, if the wideband probe reply signal is returned positively at step 906, then the wideband communications device shifts to a wideband mode at step 914. In the wideband mode, the wideband communications device begins initiating training at step 916 with the transmitting communications device. Once both wideband communications devices are trained and synchronized at step 918 to exchange both voice and data signals, the call is connected at step 920. As described with reference to FIGs. 3-5, once the wideband connection has been established, both audio and data may be transmitted between the wideband communications devices. [ 0071 ] In embodiments of FIG. 8 and 9 where a wideband conference is established, the systems may monitor for line conditions and adjust accordingly. For example, quality of an audio (and possibly data) connection may be adjusted based on available data bandwidth. The more bits that can be sent over the PSTN, the better the quality audio codec setting can be used for encoding the audio (and optional data) signals. Further, the present system may automatically adjust the quality of the connection during a call, and in some situations, will even drop the call to a conventional analog, narrowband connection when the connection quality drops too low. [0072] For purposes of clarity, some references to data may be construed to refer to audio data or digitized audio signals, while other references may refer to control data, or alternatively to video /graphics data.
[0073] The invention has been described above with reference to exemplary embodiments. Many variations of the invention will become apparent to those of skill in the art upon review of this disclosure. For example, more devices may be coupled to the wideband communications device for providing data signals. Further, although exemplary codecs and standards have been described for use in the present invention, those skilled in the art will recognize that other codecs and standards may be utilized. Therefore, the scope of the invention should be determined not with reference to the above description, but instead should be construed in view of the full breadth and spirit of the invention as disclosed herein.

Claims

CLAIMSWhat is claimed is:
1. A method for wideband conferencing, comprising: verifying whether at least two communications devices are wideband capable; establishing a connection between the at least two communications devices; obtaining and digitizing audio signals at a wideband communications device; and transmitting the digitized audio signals over the connection from the wideband communications devices to at least one receiving wideband communications device.
2. The method of claim 1, wherein the verifying occurs at one of the at least two communications devices.
3. The method of claim 1, wherein the connection is a narrowband connection if a communications device is not wideband capable.
4. The method of claim 1, wherein the connection is a wideband connection between communications device that are wideband capable.
5. The method of claim 1, wherein the connection is over a PSTN.
6. The method of claim 1, further comprising adjusting quality of the connection based on available bandwidth.
7. The method of claim 6, wherein the adjusting further comprises modifying a setting of an audio codec in the wideband communications device.
8. The method of claim 6, wherein the connections becomes narrowband when the quality of the connection drops too low.
9. The method of claim 1, further comprising establishing a side channel in the connection for transmission of non-audio signals.
10. The method of claim 1 further comprising utilizing audio echo cancellation in the at least one receiving wideband communications device to allow for full- duplex conversation.
11. The method of claim 1 further comprising outputting audio received at the at least one receiving wideband communications device via a speaker located in the at least one receiving wideband communications device.
12. The method of claim 1 wherein the obtaining is performed by at least one microphone located in the wideband communications device.
13. The system of claim 1, wherein the at least two communications devices are simultaneously bridged using a bridge located on a network.
14. A wideband communications device for conducting a wideband conference, comprising: a microphone for obtaining audio from at least one conference participant; at least one codec coupled to the microphone for digitizing the received audio; and a modem coupled to a codec of the at least one codec for modulating the digitized audio for transmission to at least one receiving wideband communications device.
15. The system of claim 14 further comprising a speaker for outputting received audio from at least one transmitting communications device.
16. The system of claim 14 where in the at least one codec is a G.711 codec.
17. The system of claim 14 wherein the at least one codec is a G.722.1 codec.
18. The system of claim 14 further comprising a control module for rendering supervisory control of the wideband conference.
19. The system of claim 14 wherein the transmission occurs over a PSTN.
20. The system of claim 14 further comprising an audio echo canceller to allow full-duplex audio communication.
21. The system of claim 14 further comprising a multiplexing device for combining the digitized audio signals with data signals.
22. The system of claim 14 further comprising a network interface for coupling the wideband communications device to the Internet.
23. A method for establishing a conference connection between two communications devices, comprising: establishing an audio link between the two communications devices; receiving a signal at a first device of the two communications devices indicating whether a second device of the two communications device is wideband capable; entering a connection mode based on the received signal; and establishing the conference connection based on the connection mode.
24. The method of claim 23 wherein the received signal is in response to a wideband probe sent by the first device.
25. The method of claim 23 wherein the received signal is audio speech by a user of the second device, the audio speech indicating that the second device is not wideband capable.
26. The method of claim 23, wherein the connection mode is narrowband if the received signal indicates the second device is not wideband capable.
27. The method of claim 23, wherein the connection mode is wideband if the first device is wideband capable and the received signal indicates the second device is wideband capable.
28. A system for wideband conferencing, comprising: means for verifying whether at least two communications devices are wideband capable; means for establishing a connection between the at least two communications devices; means for obtaining and digitizing audio signals at a wideband communications device; and means for transmitting the digitized audio signals over the connection from the wideband communications devices to at least one receiving wideband communications device.
29. A system for establishing a conference connection between two communications devices, comprising: means for establishing an audio link between the two communications devices; means for receiving a signal at a first device of the two communications devices indicating whether a second device of the two communications device is wideband capable; means for entering a connection mode based on the received signal; and means for establishing the conference connection based on the connection mode.
30. A computer readable medium having embodied thereon a program, the program being executable by a machine to perform method steps for wideband conferencing, the method steps comprising: verifying whether at least two communications devices are wideband capable; establishing a connection between the at least two communications devices; obtaining and digitizing audio signals at a wideband communications device; and transmitting the digitized audio signals over the connection from the wideband communications devices to at least one receiving wideband communications device.
PCT/US2002/041778 2001-12-31 2002-12-31 Method and apparatus for wideband conferencing WO2003058403A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP02806278A EP1464142A4 (en) 2001-12-31 2002-12-31 Method and apparatus for wideband conferencing

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US34592901P 2001-12-31 2001-12-31
US60/345,929 2001-12-31
US36098402P 2002-03-01 2002-03-01
US60/360,984 2002-03-01

Publications (2)

Publication Number Publication Date
WO2003058403A2 true WO2003058403A2 (en) 2003-07-17
WO2003058403A3 WO2003058403A3 (en) 2003-09-04

Family

ID=26994610

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2002/041778 WO2003058403A2 (en) 2001-12-31 2002-12-31 Method and apparatus for wideband conferencing

Country Status (2)

Country Link
EP (1) EP1464142A4 (en)
WO (1) WO2003058403A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1655938A1 (en) * 2004-11-03 2006-05-10 AT&T Corp. High-quality audio transmission by bandwidth negotiation and codec selection
US7822014B2 (en) 2003-11-21 2010-10-26 Oki Electric Industry Co., Ltd. Voice communication system and a server apparatus

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3573377A (en) * 1969-02-24 1971-04-06 Bell Telephone Labor Inc Equipment to coordinate establishment of audio and video connections through switching systems
US3612767A (en) * 1969-06-11 1971-10-12 Bell Telephone Labor Inc Equipment for selectively establishing audio and wideband communication paths through two autonomous switching systems
US3649761A (en) * 1970-01-23 1972-03-14 Bell Telephone Labor Inc Dial selective wideband intercommunication system
US4257119A (en) * 1978-12-29 1981-03-17 Wescom Switching, Inc. PCM switching system for wide and narrow band signals
US4763317A (en) * 1985-12-13 1988-08-09 American Telephone And Telegraph Company, At&T Bell Laboratories Digital communication network architecture for providing universal information services
US5677728A (en) * 1982-02-24 1997-10-14 Schoolman Scientific Corporation Stereoscopic video telecommunication system
US6130880A (en) * 1998-03-20 2000-10-10 3Com Corporation Method and apparatus for adaptive prioritization of multiple information types in highly congested communication devices

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2126927C (en) * 1994-02-24 1999-01-26 Sidhartha Maitra Noncompressed voice and data communication over modem for a computer-based multifunction personal communications system

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3573377A (en) * 1969-02-24 1971-04-06 Bell Telephone Labor Inc Equipment to coordinate establishment of audio and video connections through switching systems
US3612767A (en) * 1969-06-11 1971-10-12 Bell Telephone Labor Inc Equipment for selectively establishing audio and wideband communication paths through two autonomous switching systems
US3649761A (en) * 1970-01-23 1972-03-14 Bell Telephone Labor Inc Dial selective wideband intercommunication system
US4257119A (en) * 1978-12-29 1981-03-17 Wescom Switching, Inc. PCM switching system for wide and narrow band signals
US5677728A (en) * 1982-02-24 1997-10-14 Schoolman Scientific Corporation Stereoscopic video telecommunication system
US4763317A (en) * 1985-12-13 1988-08-09 American Telephone And Telegraph Company, At&T Bell Laboratories Digital communication network architecture for providing universal information services
US6130880A (en) * 1998-03-20 2000-10-10 3Com Corporation Method and apparatus for adaptive prioritization of multiple information types in highly congested communication devices

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
HAOJUN ET AL.: 'Implementing an audio multipoint processor on DSP array' IEEE May 2001, pages 441 - 444, XP010544757 *
JUNG ET AL.: 'The multimedia desktop conference system adaptability in network traffic on LAN' IEEE 1995, pages 334 - 338, XP002106182 *
NOORE ET AL.: 'Computer-based multimedia video conference system' IEEE 1993, pages 587 - 591, XP000396336 *
SASSE ET AL.: 'Workstation-based multimedia conferencing: experiences from the MICE project' IEE 1994, pages 1 - 6, XP002965553 *
See also references of EP1464142A2 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7822014B2 (en) 2003-11-21 2010-10-26 Oki Electric Industry Co., Ltd. Voice communication system and a server apparatus
EP1655938A1 (en) * 2004-11-03 2006-05-10 AT&T Corp. High-quality audio transmission by bandwidth negotiation and codec selection

Also Published As

Publication number Publication date
WO2003058403A3 (en) 2003-09-04
EP1464142A2 (en) 2004-10-06
EP1464142A4 (en) 2005-04-27

Similar Documents

Publication Publication Date Title
US7221663B2 (en) Method and apparatus for wideband conferencing
US6750897B1 (en) Systems and methods for implementing internet video conferencing using standard phone calls
KR100773196B1 (en) Communication terminal, method for controlling the same, and storage medium storing control program
US8582520B2 (en) Method and apparatus for wideband conferencing
KR20000015926A (en) Telephone doubler arrangement
KR101498913B1 (en) Method, modem and server for bridging telephone calls into internet calls
WO2005039165A2 (en) Video relay system and method
US8116442B2 (en) Method and apparatus for audio conference bridge initiated remote device muting
JP3873048B2 (en) Ringback tone transmission method, terminal, ringback tone generation method, and system for generating ringback tone
US7804815B1 (en) System and apparatus for telecommunication
US7545802B2 (en) Use of rtp to negotiate codec encoding technique
US6754342B1 (en) Method and apparatus for concealing mute in an IP-based telephony system
US7813378B2 (en) Wideband-narrowband telecommunication
EP1521412A1 (en) Switching between communication systems with circuit and packet switching
WO2003058403A2 (en) Method and apparatus for wideband conferencing
US20080267092A1 (en) Method and Apparatus for Maintaining Continuous Connections for Audio/Visual Communications
US20230247136A1 (en) Automated attendant that specifies audio transmission characteristics for calls
KR100316312B1 (en) An Internet Phone Telecommunication System by using Personal Internet Telephone Server
GB2491872A (en) Automatic setup of audio/computer teleconference
JP2004064407A (en) Voice communication system
MX2007006912A (en) Conference layout control and control protocol.
KR20040055499A (en) Duplex appatatus and method using Subscriber Line interface of VoIP Board
MX2007006910A (en) Associating independent multimedia sources into a conference call.

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): JP

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SK TR

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2002806278

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2002806278

Country of ref document: EP

NENP Non-entry into the national phase in:

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP