WO2000074294A2 - General-purpose robot agent and real-time search method - Google Patents

General-purpose robot agent and real-time search method Download PDF

Info

Publication number
WO2000074294A2
WO2000074294A2 PCT/KR1999/000356 KR9900356W WO0074294A2 WO 2000074294 A2 WO2000074294 A2 WO 2000074294A2 KR 9900356 W KR9900356 W KR 9900356W WO 0074294 A2 WO0074294 A2 WO 0074294A2
Authority
WO
WIPO (PCT)
Prior art keywords
general
real
robot agent
search method
purpose robot
Prior art date
Application number
PCT/KR1999/000356
Other languages
French (fr)
Other versions
WO2000074294A3 (en
Inventor
So Young Kim
Original Assignee
Webnara Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Webnara Co., Ltd. filed Critical Webnara Co., Ltd.
Priority to AU45325/99A priority Critical patent/AU4532599A/en
Publication of WO2000074294A2 publication Critical patent/WO2000074294A2/en
Publication of WO2000074294A3 publication Critical patent/WO2000074294A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Definitions

  • the present invention relates to a web search engine and more particularly to general-purpose robot agent and real-time search method.
  • Typical web search engines (or search sites) serviced in present are based on a robot agent system usually called a robot agent, a database, and a search step.
  • the web search system comprises a robot agent system 100, a search system (hereinafter, called a searcher) 200, and a database 300.
  • the robot agent system 100 comprises a collecting system (hereinafter, called a collector) 10 and an indexing system (hereinafter, called an indexer) 20.
  • the collector 10 collects data in predetermined web sites and the indexer 20 indexes the data by analyzing morphemes and processing stop words and particles under the linkage with electronic dictionaries including dictionaries of nouns, synonyms, and particles .
  • the indexed data is stored in the database 300, and thereafter, the searcher 200 searches the stored data.
  • To search information using the web search system usually means to search the database 300 using the searcher 200.
  • the conventional internet search engines have the following problems.
  • a web search system produced for the purpose of searching shopping malls on the web comprises a collector for collecting only shopping mall web sites and an indexer for indexing data related to the collected shopping malls and constructs a program in linkage with an electronic dictionary related to the shopping mall.
  • the conventional indexing method comprises the steps of analyzing morphemes and processing stop words and particles.
  • these steps result in difficulty in constituting the electronic dictionary.
  • the electronic dictionary itself changes according to service fields it is applied to, and interpreting and processing inevitably results in hard coding within the program, thereby decreasing flexibility and extensibility of the entire system.
  • the conventional searcher 200 searches the contents of the database 300. In this structure, if updated contents of the web sites have not been reflected in the database 300 yet, the searcher 200 retrieves old contents excluding updated new contents.
  • the conventional robot agent has a problem that it cannot search updated new data if the data on the web sites is frequently changed.
  • the present invention provides a general -purpose robot agent and realtime search method for freely operating individual modules as independent systems or an integrated system and allowing each module to be easily integrated into another system as a component by creating templates for individual modules such as collector, indexer, and searcher, which were coded within the program in the conventional techniques, to construct the modules in an independent format, thereby increasing flexibility of the system.
  • the present invention also provides a general -purpose robot agent and real-time search method which employs an indexing method depending upon a common format instead of specific terms. Moreover, the present invention provides a general- purpose robot agent and real-time search method, where a searcher not only searches a database but also directly searches predetermined web sites, thereby effectively searching contents on the web sites having short update cycles .
  • FIG. 1 is a schematic diagram of a conventional web search system
  • FIG. 2 is a schematic diagram of a general -purpose robot agent and real-time search method according to the present invention.
  • a robot agent 100 comprises a collector 10 for performing a collecting process and an indexer 20 for performing an indexing process .
  • Data indexed by the indexer 20 in the indexing process constructs a database 300.
  • the database 300 is searched by a searcher 200.
  • a processor is controlled by an initialization file provided from the outside.
  • the initialization file is a simple text file which can be easily edited according to rules.
  • the system is controlled by this file.
  • the initialization file for the collector 10 contains initial URLs of sites, the number of threads, and a collection/removal pattern.
  • the initialization file for the indexer 20 contains index rules of respective sites.
  • the initialization file for the searcher 200 contains collected data/index rule and a display format for each site.
  • control information of the collector 10, indexer 20 and searcher 200 is coded within the program.
  • the present invention creates templates for the control information of the collector 10, indexer 20 and searcher 200 in the outside of the system.
  • the functions located outside the system can be easily changed in accordance with different purposes, thus being used as a general-purpose robot agent.
  • Each sub-system is also designed to function as an independent system, so it can be freely utilized for different particular purposes.
  • the conventional indexing method uses specific phrases or words with respect to the web file.
  • the present invention indexes data of the web file based upon a predetermined pattern shown in the web file, so the conventional morpheme analysis and stop words and particle processing are not required, thereby decreasing difficulties in construction of an electronic dictionary and change of the electronic dictionary itself according to which fields the dictionary is applied to. Consequently, the present invention improves flexibility and extensibility of the entire system a lot.
  • the searcher 200 not only searches the database 300 but also directly searches specified web sites, so the web sites whose updated new data cannot be searched by the conventional methods can be searched .
  • the general -purpose robot agent and real-time search method of the present invention can be generally applied to general search systems, special search systems for shopping malls, special search systems for book information, and so on.
  • the collector 10 which is a sub system of the robot agent, basically performs a web server mirroring function as in existing other search systems.
  • the collector 10 of the present invention is different from the conventional one in that it can ensure the flexibility through a template function. Specifically, a collection skip URL pattern can be sophisticatedly defined or files to be collected can be limited in associated with the indexing to achieve efficient collection control. In addition, taking network traffic into consideration, a thread can be assigned to each site.
  • the indexer 20 is basically controlled by the template, namely, the initialization file.
  • the indexer 20 of the present invention does not use electronic dictionaries and program routines subjected to service while analyzing or parsing files and constructing a database. Since the present invention indexes all files based upon only index rules recorded in the template, thus ensuring generality and achieving a high index rate through rapid index speed.
  • Index rule mechanism is based upon the idea that a web file to be indexed has a particular pattern for record of information.
  • a proper record pattern instead of different special patterns is repeatedly applied to individual articles when recording article information (such as name, price, and manufacturer of the article) .
  • article information such as name, price, and manufacturer of the article
  • the searcher receives this index rule and files and performs the indexing. If a plurality of index rules are written in the same site, the searcher automatically chooses an index rule corresponding to the style of a file.
  • the index rule is supported by a function of indexing only files containing a particular word, a function of processing extracted data (e.g., removal and replacement of a particular character string) , 'and a function of making information into a database (e.g., embedding a monetary unit or URL of a homepage in a certain field) .
  • the searcher 200 is written in Java Servlet and divided into a meta search system and a directory (subject) search system in functional aspect. Since an idea of template is applied to the searcher 200 in the same way as to the robot agent 100, supplement of search sites or change of user interface elements can be achieved only by changing the initialization file.
  • control information for most efficiently searching, namely, collecting and indexing search sites.
  • the control information includes cache time, type of information (associated with a directory) , shortest search path, and index rule (subset of the index rule of the indexer) .
  • An internal cache is implemented to take advantages of a local database .
  • the cache comprises a memory cache and a disc cache and is embodied in the unit of keywords and the unit of subjects (directory) . Specifically, when there is a request for service related to the same keywords and directories, the content in the cache is used for response.
  • a cache manager is operated as a separate thread and automatically performs setting, removal, and conversion of cache.
  • the general-purpose robot agent and real-time search method of the present invention allows search engines to be applied to general search systems, search systems specialized in shopping malls, search systems specialized in book information, and other like systems.
  • the searcher of the present invention not only searching the database but also directly searches predetermined web sites having short update cycles in real time .

Abstract

The present invention provides a general-purpose robot agent (100) and real-time search method for freely operating individual modules as independent systems or an integrated system and allowing each module to be easily integrated into another system as a component by creating templates for individual modules such as collector (10), indexer (20), and searcher (200), which were coded within the program in the conventional techniques, to construct the modules in an independent format, thereby increasing flexibility of the system. The present invention also provides a general-purpose robot agent (100) and real-time search method which employs an indexing method depending upon a common format instead of specific terms. Moreover, the present invention provides a general-purpose robot agent (100) and real-time search method, where a searcher (200) not only searches a database (300) but also directly searches predetermined web sites, thereby effectively searching contents on the web sites having short update cycles.

Description

GENERA -PURPOSE ROBOT AGENT AND REAL-TIME SEARCH METHOD
Technical Field
The present invention relates to a web search engine and more particularly to general-purpose robot agent and real-time search method.
Background Art
Typical web search engines (or search sites) serviced in present are based on a robot agent system usually called a robot agent, a database, and a search step.
Referring to FIG. 1, a conventional web search engine program (or web search system) will be described in detail.
The web search system comprises a robot agent system 100, a search system (hereinafter, called a searcher) 200, and a database 300. The robot agent system 100 comprises a collecting system (hereinafter, called a collector) 10 and an indexing system (hereinafter, called an indexer) 20.
In the robot agent system 100, the collector 10 collects data in predetermined web sites and the indexer 20 indexes the data by analyzing morphemes and processing stop words and particles under the linkage with electronic dictionaries including dictionaries of nouns, synonyms, and particles . The indexed data is stored in the database 300, and thereafter, the searcher 200 searches the stored data.
To search information using the web search system usually means to search the database 300 using the searcher 200. However, the conventional internet search engines have the following problems.
Primarily, since existing web search systems developed with particular purposes have the collector 10 and the indexer 20 coded as parts of the entire program, the collector 10 and the indexer 20 are strictly subjected to the entire program.
This decreases flexibility of the systems in extension of their functions for other purposes or switch to systems of another use. Furthermore, administration and maintenance of the systems costs a great deal since technical knowledge of software is required.
For example, a web search system produced for the purpose of searching shopping malls on the web comprises a collector for collecting only shopping mall web sites and an indexer for indexing data related to the collected shopping malls and constructs a program in linkage with an electronic dictionary related to the shopping mall.
However, if this web search system is used for any other purpose other than the shopping mall, the collector 10 and the indexer 20 which have been coded for the shopping mall and the searcher 200 should be all altered.
Such alteration requires technical knowledge of software, thereby making free and convenient alteration of the search system difficult.
The conventional indexing method comprises the steps of analyzing morphemes and processing stop words and particles. However, these steps result in difficulty in constituting the electronic dictionary. Moreover, the electronic dictionary itself changes according to service fields it is applied to, and interpreting and processing inevitably results in hard coding within the program, thereby decreasing flexibility and extensibility of the entire system.
The conventional searcher 200 searches the contents of the database 300. In this structure, if updated contents of the web sites have not been reflected in the database 300 yet, the searcher 200 retrieves old contents excluding updated new contents.
Consequently, the conventional robot agent has a problem that it cannot search updated new data if the data on the web sites is frequently changed.
Summary of the Invention
To overcome the defects described above, the present invention provides a general -purpose robot agent and realtime search method for freely operating individual modules as independent systems or an integrated system and allowing each module to be easily integrated into another system as a component by creating templates for individual modules such as collector, indexer, and searcher, which were coded within the program in the conventional techniques, to construct the modules in an independent format, thereby increasing flexibility of the system.
The present invention also provides a general -purpose robot agent and real-time search method which employs an indexing method depending upon a common format instead of specific terms. Moreover, the present invention provides a general- purpose robot agent and real-time search method, where a searcher not only searches a database but also directly searches predetermined web sites, thereby effectively searching contents on the web sites having short update cycles .
Brief Description of Drawings
The present invention will become more fully understood from the detailed description given hereinbelow and the accompanying drawings which are given by way of illustration only, and thus are not limitative of the present invention, and wherein:
FIG. 1 is a schematic diagram of a conventional web search system; and
FIG. 2 is a schematic diagram of a general -purpose robot agent and real-time search method according to the present invention.
Best Mode for carrying Out the Invention
The present invention relates to a general-purpose robot agent and real-time search method and is developed based upon Pure Java, thus being operated in most existing platforms without a special porting process. A robot agent 100 comprises a collector 10 for performing a collecting process and an indexer 20 for performing an indexing process .
Data indexed by the indexer 20 in the indexing process constructs a database 300. In a search process, the database 300 is searched by a searcher 200.
During the actuation, or the initialization of the web search system, a processor is controlled by an initialization file provided from the outside. The initialization file is a simple text file which can be easily edited according to rules. The system is controlled by this file.
The initialization file for the collector 10 contains initial URLs of sites, the number of threads, and a collection/removal pattern.
The initialization file for the indexer 20 contains index rules of respective sites. The initialization file for the searcher 200 contains collected data/index rule and a display format for each site. In conventional techniques, control information of the collector 10, indexer 20 and searcher 200 is coded within the program. On the other hand, the present invention creates templates for the control information of the collector 10, indexer 20 and searcher 200 in the outside of the system. The functions located outside the system can be easily changed in accordance with different purposes, thus being used as a general-purpose robot agent. Each sub-system is also designed to function as an independent system, so it can be freely utilized for different particular purposes. The conventional indexing method uses specific phrases or words with respect to the web file. On the other hand, the present invention indexes data of the web file based upon a predetermined pattern shown in the web file, so the conventional morpheme analysis and stop words and particle processing are not required, thereby decreasing difficulties in construction of an electronic dictionary and change of the electronic dictionary itself according to which fields the dictionary is applied to. Consequently, the present invention improves flexibility and extensibility of the entire system a lot.
In the search process, the searcher 200 not only searches the database 300 but also directly searches specified web sites, so the web sites whose updated new data cannot be searched by the conventional methods can be searched .
Through such operations, the general -purpose robot agent and real-time search method of the present invention can be generally applied to general search systems, special search systems for shopping malls, special search systems for book information, and so on.
The operation of each module in the general -purpose robot agent having such features will be described.
The collector 10, which is a sub system of the robot agent, basically performs a web server mirroring function as in existing other search systems.
The collector 10 of the present invention is different from the conventional one in that it can ensure the flexibility through a template function. Specifically, a collection skip URL pattern can be sophisticatedly defined or files to be collected can be limited in associated with the indexing to achieve efficient collection control. In addition, taking network traffic into consideration, a thread can be assigned to each site. The indexer 20 is basically controlled by the template, namely, the initialization file.
Differently from the conventional indexing system, the indexer 20 of the present invention does not use electronic dictionaries and program routines subjected to service while analyzing or parsing files and constructing a database. Since the present invention indexes all files based upon only index rules recorded in the template, thus ensuring generality and achieving a high index rate through rapid index speed.
Index rule mechanism is based upon the idea that a web file to be indexed has a particular pattern for record of information.
For example, for the file of web shopping mall, a proper record pattern instead of different special patterns is repeatedly applied to individual articles when recording article information (such as name, price, and manufacturer of the article) . Particularly, in case of providing mass information, it is general to use a specified proper pattern. In other words, in case of indexing data of the shopping mall, once the repeated pattern is written as an index rule according to defined grammar, the searcher receives this index rule and files and performs the indexing. If a plurality of index rules are written in the same site, the searcher automatically chooses an index rule corresponding to the style of a file. The index rule is supported by a function of indexing only files containing a particular word, a function of processing extracted data (e.g., removal and replacement of a particular character string) , 'and a function of making information into a database (e.g., embedding a monetary unit or URL of a homepage in a certain field) . The searcher 200 is written in Java Servlet and divided into a meta search system and a directory (subject) search system in functional aspect. Since an idea of template is applied to the searcher 200 in the same way as to the robot agent 100, supplement of search sites or change of user interface elements can be achieved only by changing the initialization file. This makes the present invention overcome the defect of the existing searcher that should repeat modification of program, compile, and deployment even in case of trivial modification of information. To overcome stateless (basically, the HTTP protocol does not define a session) which is one of serious obstacles in web application programs and to achieve an optimal response time which is essential in a method for searching web sites having short update cycles in real time, the following elementary techniques are used.
In the initialization file is described control information for most efficiently searching, namely, collecting and indexing search sites. The control information includes cache time, type of information (associated with a directory) , shortest search path, and index rule (subset of the index rule of the indexer) .
At the initial stage, generation and initialization of spare classes, generation of threads, and assignment of shared memory are performed to minimize an overhead occurring during the service, using resource management.
An internal cache is implemented to take advantages of a local database . The cache comprises a memory cache and a disc cache and is embodied in the unit of keywords and the unit of subjects (directory) . Specifically, when there is a request for service related to the same keywords and directories, the content in the cache is used for response. A cache manager is operated as a separate thread and automatically performs setting, removal, and conversion of cache.
Through such operation, the general-purpose robot agent and real-time search method of the present invention allows search engines to be applied to general search systems, search systems specialized in shopping malls, search systems specialized in book information, and other like systems.
In addition, the searcher of the present invention not only searching the database but also directly searches predetermined web sites having short update cycles in real time . Although the preferred embodiments of the present invention have been disclosed for illustrative purposes, those skilled in the art will appreciate that various modifications, additions and substitutions are possible, without departing from the scope and spirit of the invention as recited in the accompanying claims.

Claims

WHAT IS CLAIMED IS;
1. In a web search system for searching information over the internet, a general-purpose robot agent and realtime search method comprising the steps of: collecting data from predetermined web sites using a collector; indexing the collected data using an indexer; making the indexed data into a database; and searching the data in the database using a searcher.
2. The general -purpose robot agent and real-time search method as claimed in claim 1, wherein, in said collecting, indexing, and searching steps, subordinate control information of a service is embodied in the form of templates .
3. The general-purpose robot agent and real-time search method as claimed in claim 1, wherein said indexing step is characterized by indexing the collected data based upon a form of description of a file instead of a content of the file.
4. The general-purpose robot agent and real-time search method as claimed in claim 1, wherein said searching step is characterized by not only searching the data in the database but also directly searching the web sites.
5. The general -purpose robot agent and real-time search method as claimed in claim 1, wherein said web search system is developed based upon Java.
PCT/KR1999/000356 1999-05-31 1999-07-03 General-purpose robot agent and real-time search method WO2000074294A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU45325/99A AU4532599A (en) 1999-05-31 1999-07-03 General-purpose robot agent and real-time search method

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR1999/19679 1999-05-31
KR1019990019679A KR20000075203A (en) 1999-05-31 1999-05-31 General-purpose robot agent and real-time search method

Publications (2)

Publication Number Publication Date
WO2000074294A2 true WO2000074294A2 (en) 2000-12-07
WO2000074294A3 WO2000074294A3 (en) 2002-02-14

Family

ID=19588675

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR1999/000356 WO2000074294A2 (en) 1999-05-31 1999-07-03 General-purpose robot agent and real-time search method

Country Status (3)

Country Link
KR (1) KR20000075203A (en)
AU (1) AU4532599A (en)
WO (1) WO2000074294A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002048906A1 (en) * 2000-12-14 2002-06-20 Kapow Aps Query processor, query processor elements and a method of establishing such a query processor and query processor elements and a domain processor
WO2002095617A1 (en) * 2001-05-18 2002-11-28 Superchai Pty Ltd Dynamic content management application

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100684444B1 (en) * 1999-12-17 2007-02-16 주식회사 케이티 Method for collecting information using robot and regular expression
KR20020001300A (en) * 2000-06-27 2002-01-09 전병국 Apparatus for detecting web site address
KR20020062096A (en) * 2001-01-19 2002-07-25 주식회사 윌서치 A system for providing links to multimedia contents in an Internet and a method thereof
KR101037673B1 (en) * 2004-01-06 2011-05-30 주식회사 비즈모델라인 System for Collecting and Processing Dispersing Contents and Recording Medium
KR100975510B1 (en) * 2008-07-17 2010-08-11 엔에이치엔(주) Method and System for Updating Web Page Index
KR101037675B1 (en) * 2009-01-06 2011-05-30 주식회사 비즈모델라인 System and Method for Collecting and Processing Dispersing Contents

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1998049637A1 (en) * 1997-05-01 1998-11-05 Northern Light Technology, Llc Method and apparatus for searching a database of records
US5855020A (en) * 1996-02-21 1998-12-29 Infoseek Corporation Web scan process

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5855020A (en) * 1996-02-21 1998-12-29 Infoseek Corporation Web scan process
WO1998049637A1 (en) * 1997-05-01 1998-11-05 Northern Light Technology, Llc Method and apparatus for searching a database of records

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
BURKOWSKI FORBES J.: 'Retrieval activities in a database consisting of heterogenous collection of structured text' ACM SIGIR, [Online] 21 June 1992 - 24 June 1992, COPENHAGEN, DENMARK, page 112 Retrieved from the Internet: <URL:http://www.acm.org/pubs/citations/proc eedings/ir/133160/p112-burkowski> [retrieved on 2000-03-28] *
EXTENSIBLE MARKUP LANGUAGE (XML) 1.0, W3C RECOMMENDATION, [Online] 10 February 1998, page CHAPTER 3: LOGICAL STRUCTURES Retrieved from the Internet: <URL:http://www.xml.com/axml/target.html> [retrieved on 2000-03-28] *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002048906A1 (en) * 2000-12-14 2002-06-20 Kapow Aps Query processor, query processor elements and a method of establishing such a query processor and query processor elements and a domain processor
US7698277B2 (en) 2000-12-14 2010-04-13 Kapow Aps Query processor, query processor elements and a method of establishing such a query processor and query processor elements and a domain processor
WO2002095617A1 (en) * 2001-05-18 2002-11-28 Superchai Pty Ltd Dynamic content management application

Also Published As

Publication number Publication date
WO2000074294A3 (en) 2002-02-14
KR20000075203A (en) 2000-12-15
AU4532599A (en) 2000-12-18

Similar Documents

Publication Publication Date Title
Woods Conceptual indexing: A better way to organize knowledge
JP4644420B2 (en) Method and machine-readable storage device for retrieving and presenting data over a network
US6859800B1 (en) System for fulfilling an information need
CN1955963B (en) System and method for searching dates in electronic documents
CA2230333C (en) Structured-text cataloging method, structured-text searching method, and portable medium used in the methods
JP4241934B2 (en) Text processing and retrieval system and method
US6745161B1 (en) System and method for incorporating concept-based retrieval within boolean search engines
CN100405371C (en) Method and system for abstracting new word
US7024405B2 (en) Method and apparatus for improved internet searching
Vidal et al. Structure-driven crawler generation by example
JP7168411B2 (en) Information processing system and information processing method
CN101241506A (en) Many dimensions search method and device and system
WO2000074294A2 (en) General-purpose robot agent and real-time search method
JP3617096B2 (en) Relational expression extraction apparatus, relational expression search apparatus, relational expression extraction method, relational expression search method
JPH10283366A (en) Information classifying device
KR100234271B1 (en) Real time searching method using movable search engine
Chandrasekar et al. Gleaning information from the web: Using syntax to filter out irrelevant information
JP4649036B2 (en) Category reporting method, record reporting method, search service device by search server
JP2005242416A (en) Natural language text search method and device
JP3898016B2 (en) Information search device, information search method, and information search program
JP3565117B2 (en) Access method for multiple different information sources, client device, and storage medium storing multiple different information source access program
Castellano et al. A web text mining flexible architecture
JP2000003366A (en) Document registration method, document retrieval method, execution device therefor and medium having recorded its processing program thereon
JPH117452A (en) Method and device for collecting information through network and recording medium recording program for executing the method
JPH08190571A (en) Document retrieval method

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AL AM AU BA BB BG BR CA CN CU CZ EE HR HU ID IL IN IS JP LC LK LR LT LV MG MK MN MX NO NZ PL RO SG SI SK SL TR TT UA US UZ VN YU

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW SD SL SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
AK Designated states

Kind code of ref document: A3

Designated state(s): AL AM AU BA BB BG BR CA CN CU CZ EE HR HU ID IL IN IS JP LC LK LR LT LV MG MK MN MX NO NZ PL RO SG SI SK SL TR TT UA US UZ VN YU

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW SD SL SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

122 Ep: pct application non-entry in european phase