WO2003094047A2 - Accelerated transmission of hypertext documents - Google Patents
Accelerated transmission of hypertext documents Download PDFInfo
- Publication number
- WO2003094047A2 WO2003094047A2 PCT/DE2003/001379 DE0301379W WO03094047A2 WO 2003094047 A2 WO2003094047 A2 WO 2003094047A2 DE 0301379 W DE0301379 W DE 0301379W WO 03094047 A2 WO03094047 A2 WO 03094047A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- archive
- file
- files
- server
- buffer memory
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L9/00—Cryptographic mechanisms or cryptographic arrangements for secret or secure communications; Network security protocols
- H04L9/40—Network security protocols
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
- H04L67/02—Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/50—Network services
- H04L67/56—Provisioning of proxy services
- H04L67/568—Storing data temporarily at an intermediate stage, e.g. caching
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/50—Network services
- H04L67/60—Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources
- H04L67/63—Routing a service request depending on the request content or context
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/2866—Architectures; Arrangements
- H04L67/289—Intermediate processing functionally located close to the data consumer application, e.g. in same machine, in same home or in same sub-network
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L69/00—Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
- H04L69/30—Definitions, standards or architectural aspects of layered protocol stacks
- H04L69/32—Architecture of open systems interconnection [OSI] 7-layer type protocol stacks, e.g. the interfaces between the data link level and the physical level
- H04L69/322—Intralayer communication protocols among peer entities or protocol data unit [PDU] definitions
- H04L69/329—Intralayer communication protocols among peer entities or protocol data unit [PDU] definitions in the application layer [OSI layer 7]
Definitions
- the invention relates to the accelerated transmission of hypertext documents, in particular HTML pages with the HTTP protocol.
- the HTTP / 1.1 protocol allows a connection to be kept open, so that the connection setup and clearing down are not necessary.
- the browser regularly has a cache ('cache.'), From which previously loaded elements can be extracted.
- the latter is devalued by the fact that the graphic designers prefer to use different graphics on each page.
- the buffer memory can only buffer elements of identical addresses; when activating a reference that leads to another computer, all graphics are reload when visiting this page for the first time.
- the problem is only barely visible to professional users due to the high computing speeds and fast network connections today. However, it is found that private users are also no longer willing to wait a long time for a page to be built and may no longer consider a commercial offer if the loading time is subjectively too slow.
- the HTML tag provides that not only the name of the JAVA class to be executed, but also the address of a JAR archive in which the class is to be contained is specified. The browser then loads the entire archive and takes the class and the objects it needs from the archive.
- the object of the present invention is to provide a solution with which all objects required by one side can be transmitted with a network transfer and which is nevertheless compatible with the previous use.
- the HTML page should remain unchanged; only the server and browser are supplemented in such a way that an improved server with an improved browser increases efficiency, but the remaining three combinations remain functional.
- the solution uses the possibility that the browser sets an indicator when the HTML page is requested, which is ignored by previous servers and is used by new servers to send an archive containing the HTML page instead of the HTML page , In contrast to a JAR archive, this archive is not set as a whole in the buffer memory, but is unpacked and loaded into the buffer memory (element). The content of the archive can contain any elements that can be addressed via URL, which are then used automatically by the buffer memory mechanism.
- the invention is as follows: When an HTML page is requested, the browser sets an indicator on the basis of which the server sends an archive instead of the HTML page, which the browser unpacks into the cache.
- the archive contains, for example, the images required by the page, which are then retrieved from the cache, so that only one network transfer is required per page.
- the invention is compatible with previous operation, in particular by using the 'content negotiation' according to RFC 2295 or 'remote variant selection' according to RFC 2296.
- the wave line 10 indicates a network to which a client below the wave line 10 and a server above the wave line 10 are connected.
- the network and the connections are preferably implemented using the Internet protocol HTTP.
- the HTTP process includes a switch 16 with which it is decided whether the requested document "testl.html” is taken from the mass storage device 18 as before and sent back to the client, or the extension to be described is used.
- the display program ('browser') in the client that requested the page comprises a controller 22 for a buffer memory 24.
- the invention changes the process as follows: An indicator in the request from "testl.html” informs the server process 14 that an archive can also be processed instead of the requested file. This indicator is presented in more detail below. If this indicator is set, the server process 14 checks whether there is a corresponding archive 20, here with the name 'testl .harc', and then sends this archive 20 back, symbolized by the switch 16. A corresponding entry in the header ('header') in accordance with the HTTP protocol, for example the (new) MIME type 'archive / harc' instead of 'text / html', indicates to the buffer memory process 22 that a instead of a single document Archive is transmitted.
- the buffer storage process 22 then breaks the archive down into its elements and stores these elements individually in the buffer memory 24, as indicated in FIG. 1. As in the example, include the file 'testl.html' in the archive. Since this is now in the buffer memory, it can be made available to the display program as before. This interprets the HTML text and determines that additional files are required. For this purpose, as always, it is first checked whether these are in the buffer memory. Since this is the case, these additional files 'imgl.jpg', 'test2.html' and 'scriptl.js' are taken from the buffer memory; communication with the server is no longer necessary.
- a new element can be defined in the HTTP header.
- content selection ('content negotiation') according to RFC 2295 or 'remote variant selection' according to RFC 2296 is preferred.
- the latter is mainly used to make different Requesting language variants of an HTML page is therefore not fully usable in order not to lose this ability.
- the formats known to the browser are listed in the header of the request; the server can then send one of these formats.
- a browser with a buffer memory according to the invention uses this header and adds an appropriate (MIME) type.
- MIME appropriate
- a server according to the invention only sends the archive format if the browser has declared it possible. This is one way of realizing the indicator listed above.
- the browser is set up in such a way that it first requests the file 'testl.harc' instead of the file 'testl.html' and, if the server shows this as not available, then requests the original 'testl.html'.
- the changed file type then represents the above-mentioned indicator.
- the indicator permitting an archive is only set if the document is not yet in the buffer memory. In the event that it is still in the buffer memory but is no longer valid, an archive is then no longer permitted, since the other elements can still be valid.
- one solution is that manually or automatically maintained lists, also by means of a data bank are implemented, which assign the file names of the archives pages. If the file name is found in one of the lists and the assigned archive exists, the archive is transferred instead of the file.
- the server can also search for existing archives and the page in these archives and automatically transmit the archive in the event of a hit.
- Another variant checks whether the subdirectory in which the requested file is or should be contains an archive. If this is the case, the archive is transferred instead of the file. Optionally, it can be checked beforehand whether the file is contained in the archive. In a further development, if no archive is available, it is created, saved and transmitted from all files of the subdirectory 'on the fly'. In this case, a positive or negative list of file types, in particular via their extensions, is preferably specified, for example, so that JAVA archives (.jar) are not included.
- the buffer memory of an intermediary referred to as a 'proxy' is used.
- a proxy server receives the request from the browser and in turn sends it to the server, receives the requested document and forwards it to the browser.
- a proxy server has a protective function in that the outbound requests are routed through a single, dedicated system.
- a proxy server such as 'SQID' or the corresponding Apache module is usually combined with a buffer memory in order to transmit requested pages as often as possible from the proxy via the expensive external connection without the external connection being loaded. The proxy server can therefore set the indicator, receive an archive, unpack it in the buffer memory and then use the fast and inexpensive internal network to make a large Efficiently supply a number of conventional browsers with parts of external HTML pages.
- the archive sent by the server contains the requested file, which is then processed further from the buffer memory.
- the controller for the buffer memory should also be able to handle the case that the archive received does not contain the requested file. Since it is not generally possible to decide whether the content of the archive should be discarded for security reasons or whether it can still be set in the buffer memory, this selection is set via user options and may depend on whether simple HTTP or secure HTTPS is used as the protocol was used. Furthermore, either the requested file can immediately be treated as not available (error 404 in the HTTP protocol). Or a second request can preferably be made, in which the indicator that an archive is welcome is not set and the page is requested again individually.
- the server can check whether the requested file is present in the archive and only then send the archive, otherwise it can send the file itself. This solution is simple, manageable and robust against errors in the assignment of files and archives.
- the archive is sent. If it is not up to date, either only the page is sent or the current page is temporarily or permanently replaced in the archive by the current version and then the archive is sent. If it does not exist, the archive can still be sent on the assumption that the control of the buffer memory, as described above, then requests the page again, but without an archive indicator. This applies to HTML pages, which are changed more often, but where the images etc. used in them are changed very rarely. In this version, an archive containing objects required by the page is still sent in front of the page, so to speak.
- the server sends two files upon request of a file; namely an archive without the file and then or before the file.
- the indicator which indicates that an archive can be delivered instead of the requested page is useful for the invention, but not essential. For example, consider a closed network, as is often the case with self-service terminals. Then it can be assumed that all browsers are able to receive an archive instead of a requested file; explicit marking is no longer necessary. The same applies to the data supplied by the server: The MIME type in the header indicates that an archive is being sent. Alternatively, each file received can first be interpreted as an archive and only treated as a simple file if it is not a valid archive.
- one of the formats known as 'are', 'zip', 'tar', 'epio', 'cab', '1ha' etc. can be used as the archive format, in which the files become a new file representing the archive be packed. Often a compression is used and the transfer is accelerated if the packed files are not already compressed, as is the case with the common image formats • gif, 'jpg' and 'png'.
- Another possibility with the HTTP (S) protocol is to encode the returned data as a multi-part message with separators in accordance with the MIME standard RFC-2046 ('multi-part message'). Here, a number of individual files are connected to a total stream by separators. The data type in the header is then 'multipart / mixed'.
- This variant has the advantage that the dynamic compilation in the server is particularly simple.
- the requested file is preferably sent as the first part. Then either a prepared file is added, which already has a
- the invention was described on the basis of HTML pages. It is equally applicable to other markup sites such as the successor XHTML, the XML extension, and other appropriate formats.
Abstract
Description
Claims
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP03747392A EP1509860A2 (en) | 2002-04-30 | 2003-04-29 | Accelerated transmission of hypertext documents |
US10/512,974 US20060122844A1 (en) | 2002-04-30 | 2003-04-29 | Accelerated transmission of hypertext documents |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE10219390A DE10219390B4 (en) | 2002-04-30 | 2002-04-30 | Server, buffer memory and browser for accelerated transmission of hypertext documents |
DE10219390.8 | 2002-04-30 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2003094047A2 true WO2003094047A2 (en) | 2003-11-13 |
WO2003094047A3 WO2003094047A3 (en) | 2004-05-27 |
Family
ID=29285049
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/DE2003/001379 WO2003094047A2 (en) | 2002-04-30 | 2003-04-29 | Accelerated transmission of hypertext documents |
Country Status (4)
Country | Link |
---|---|
US (1) | US20060122844A1 (en) |
EP (1) | EP1509860A2 (en) |
DE (1) | DE10219390B4 (en) |
WO (1) | WO2003094047A2 (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8156423B1 (en) * | 2004-11-01 | 2012-04-10 | Sprint Spectrum L.P. | Method and system for dynamically updating fixed resources on client station |
JPWO2006051906A1 (en) * | 2004-11-12 | 2008-05-29 | 株式会社ジャストシステム | Archiver device, data acquisition device, and data acquisition method |
US7953893B1 (en) | 2008-04-03 | 2011-05-31 | Sprint Spectrum L.P. | Method and system for expedited HTTP communication |
RU2475832C1 (en) * | 2009-03-30 | 2013-02-20 | Нокиа Корпорейшн | Methods and systems for processing document object models (dom) to process video content |
US20100306344A1 (en) * | 2009-04-06 | 2010-12-02 | Nokia Corporation | Methods and Systems for Using Multipart Messaging with Preset Constraints |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0868063A2 (en) * | 1997-03-26 | 1998-09-30 | International Business Machines Corporation | Method and apparatus for precaching data at a server |
US6336122B1 (en) * | 1998-10-15 | 2002-01-01 | International Business Machines Corporation | Object oriented class archive file maker and method |
US20020029228A1 (en) * | 1999-09-09 | 2002-03-07 | Herman Rodriguez | Remote access of archived compressed data files |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6185625B1 (en) * | 1996-12-20 | 2001-02-06 | Intel Corporation | Scaling proxy server sending to the client a graphical user interface for establishing object encoding preferences after receiving the client's request for the object |
US6055569A (en) * | 1998-01-27 | 2000-04-25 | Go Ahead Software Inc. | Accelerating web access by predicting user action |
US6427187B2 (en) * | 1998-07-31 | 2002-07-30 | Cache Flow, Inc. | Multiple cache communication |
US6625624B1 (en) * | 1999-02-03 | 2003-09-23 | At&T Corp. | Information access system and method for archiving web pages |
US6560618B1 (en) * | 2000-03-22 | 2003-05-06 | International Business Machines Corporation | On-demand generation, packaging, and delivery of archive files |
US20020065800A1 (en) * | 2000-11-30 | 2002-05-30 | Morlitz David M. | HTTP archive file |
-
2002
- 2002-04-30 DE DE10219390A patent/DE10219390B4/en not_active Expired - Fee Related
-
2003
- 2003-04-29 EP EP03747392A patent/EP1509860A2/en not_active Withdrawn
- 2003-04-29 US US10/512,974 patent/US20060122844A1/en not_active Abandoned
- 2003-04-29 WO PCT/DE2003/001379 patent/WO2003094047A2/en active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0868063A2 (en) * | 1997-03-26 | 1998-09-30 | International Business Machines Corporation | Method and apparatus for precaching data at a server |
US6336122B1 (en) * | 1998-10-15 | 2002-01-01 | International Business Machines Corporation | Object oriented class archive file maker and method |
US20020029228A1 (en) * | 1999-09-09 | 2002-03-07 | Herman Rodriguez | Remote access of archived compressed data files |
Non-Patent Citations (1)
Title |
---|
INNERJOIN: "mod_gunzip howto" INTERNET CITATION, [Online] 2. Februar 2002 (2002-02-02), Seiten 1-5, XP002274155 Gefunden im Internet: URL:http://web.archive.org/web/20020202171 755/http://www.innerjoin.org/apache-compre ssion/howto.html> [gefunden am 2002-02-02] * |
Also Published As
Publication number | Publication date |
---|---|
EP1509860A2 (en) | 2005-03-02 |
DE10219390B4 (en) | 2007-05-31 |
WO2003094047A3 (en) | 2004-05-27 |
DE10219390A1 (en) | 2003-12-11 |
US20060122844A1 (en) | 2006-06-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1559038B1 (en) | Method for the pre-transmission of structured data amounts between a client device and a server device | |
DE60130633T2 (en) | Secure Internet Clipboard | |
DE69833899T2 (en) | Dynamic data transmission device and method | |
DE60015423T2 (en) | Method and device for reproducing objects in a network | |
DE69736045T2 (en) | Method for transmitting and displaying data pages in a data network | |
DE69731318T2 (en) | MAKING COMMUNICATION CONNECTIONS IN A COMPUTER NETWORK | |
DE60127247T2 (en) | NETWORK DEVICE FOR THE DOCUMENT STATEMENT OF VALIDITY | |
DE10024715B4 (en) | Method and apparatus for establishing a two-way communication between a host system and a device | |
DE112013007597T5 (en) | Improved web server for storing large files | |
DE202008013034U1 (en) | System for speeding up browsing sessions | |
EP1369790A2 (en) | Method for dynamically generating structured documents | |
EP1620810B1 (en) | Method and arrangement for establishing and updating a user surface used for accessing data pages in a data network | |
DE10219390B4 (en) | Server, buffer memory and browser for accelerated transmission of hypertext documents | |
EP1845440B1 (en) | Method and assembly for printing from web applications and the corresponding computer program and the corresponding machine readable storage medium | |
DE69925435T2 (en) | A computer-implemented method and apparatus for providing a logical access point to one or more files | |
DE10296924T5 (en) | User-defined selective reloading of images | |
DE102005042068B4 (en) | Method and arrangement for handling files by means of mobile terminals and a corresponding computer program and a corresponding computer-readable storage medium | |
DE60108176T2 (en) | METHOD AND SYSTEM FOR DELIVERING INFORMATION IN A TELECOMMUNICATIONS NETWORK | |
EP3454290A1 (en) | Data exchange between different applications/clients | |
WO2002037793A2 (en) | Method for the exchange of data in a network, device for carrying out said method, computer program for carrying out the same and data store for the storage of the above | |
DE102014100225B4 (en) | System and method for delivering advertising content | |
DE10351739A1 (en) | Server system for supplying information concerning server traffic e.g. for Internet subscriber, has logic circuit held in servlet container and configured for receiving request or requirement | |
DE10105188B4 (en) | Method for feeding internet pages into digital broadcast signals | |
DE10224102A1 (en) | Internet web page transmission system loads new main page in response to answer using script in hidden frame | |
DE10059931A1 (en) | Method and device for data transmission and remote-controlled generation of output data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A2 Designated state(s): US |
|
AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2003747392 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 2003747392 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 2006122844 Country of ref document: US Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 10512974 Country of ref document: US |
|
WWP | Wipo information: published in national office |
Ref document number: 10512974 Country of ref document: US |