WO2007078768A1 - Providing predicted search queries and predictive search results - Google Patents

Providing predicted search queries and predictive search results Download PDF

Info

Publication number
WO2007078768A1
WO2007078768A1 PCT/US2006/047543 US2006047543W WO2007078768A1 WO 2007078768 A1 WO2007078768 A1 WO 2007078768A1 US 2006047543 W US2006047543 W US 2006047543W WO 2007078768 A1 WO2007078768 A1 WO 2007078768A1
Authority
WO
WIPO (PCT)
Prior art keywords
query
predicted
queries
user
search results
Prior art date
Application number
PCT/US2006/047543
Other languages
French (fr)
Inventor
Richard Kasperski
Original Assignee
Yahoo! Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yahoo! Inc. filed Critical Yahoo! Inc.
Publication of WO2007078768A1 publication Critical patent/WO2007078768A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • G06F16/2425Iterative querying; Query formulation based on the results of a preceding query
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2457Query processing with adaptation to user needs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2468Fuzzy queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3322Query formulation using system suggestions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching

Definitions

  • the present invention relates generally to search engines, and more particularly to providing an interactive user interface to a search engine using user input prediction.
  • Search engines are common and useful tools for searching the Internet for any type of information that is web accessible. Search engines respond to user queries by generating a lisi: of links to documents deemed relevant to the query. Search engines are also used in proprietary websites to search for information specifically pertaining to the proprietary websites.
  • search engines perform all their work for a user only after the user has entered a query into a query field and issued the query by clicking "Search,” "Enter,” or “Go.”
  • This traditional approach is intuitive because the only time a search engine "knows" for certain what a user desires is when the user decides that the query is correct and complete by formally issuing the query.
  • search engines do not provide help to the user while the user is formulating a query. Accordingly, search engines must "wait" to provide the search results until the user has determined that the query is complete, at which time the u ⁇ ier explicitly issues the query to a search engine.
  • any additional information relating to the query and search results is provided after the user issues the query.
  • FIG . 1 illustrates a user interface that displays predicted queries based on input entered in a query field, and search results from the most likely predicted query, according to one embodiment of the invention
  • FIG. 2 is a flow diagram that illustrates how temporal relevance is factored into determining which potential queries become predicted queries sent to the user, according to one embodiment of the invention
  • FIG, 3A is a block diagram that illustrates the communication between a web browser on a client and a front end server, according to one embodiment of the invention
  • FIG. 3B is a block diagram that illustrates the communication between a web browser on a cl ⁇ .ent and a front end server, according to another embodiment of the invention.
  • FIG. 3C is a block diagram that illustrates the communication between a web browser on a client and a front end server, according to another embodiment of the invention.
  • FIG. 4 is a block diagram that illustrates a computer system upon which an embodiment of ii ⁇ e invention may be implemented.
  • An interface to a search engine assists the user 1) by predicting what the user is searching for based on the character or characters the user has entered so far in the query field of the interface, and 2) by providing search results to the user via the interface without the user having to formally issue the intended query. For each character entered into the query field, that portion of the query entered so far is automatically issued to a query predictor that determine!, a set of one or more predicted queries that correspond to the portion of the query. The set of predicted queries is determined based on the frequency of previously issued queries alone or also on when the previous queries were issued. The most likely predicted query is processed by a search engine to obtain search results.
  • Both the predicted queries and the search results are provided to the user via the interactive user interface.
  • the predicted queries when displayed to the user, may be ordered based on their popularity (frequency-based) alone or also on their temporal relevance (time-based).
  • the user may select any query in the set of predicted queries.
  • the search results are then updated to display the search results that pertain to the different predicted query.
  • other dynamic data may be provided that relate to the most likely predicted query but are not necessarily obtained by the search results, such as advertisements and other related links to websites.
  • FIG. 1 illustrates a user interface display, predicted queries, and search results, according to one embodiment of the invention.
  • a user via a web browser 100, enters characters, which will constitute the user's intended query, into a query field 102.
  • a query predictor described below (also see FIGS. 3 A-C).
  • the query predictor determines a set of one or more predicted queries based on the partial query,.
  • the predicted queries are sent back to the user and displayed, for example in a drop box 104.
  • the web browser 100 also displays a selected predicted query 108 (hereinafter referred to as the "particular predicted query").
  • a search engine processes the particular predicted query 108 from the set of predicted queries and sends the search results 112 to the user to be displayed, for example in results page 110. It is thus possible that the user only has to enter one or a few characters before the actual intended query is determined and the results of the intended query are displayed. Therefore, search button 106 may never have to be selected in order for an intended query to be issued.
  • the portion of the query entered so far by a user is sent fiom the user's web browser to a query predictor across a network. This may occur for each character, or sequence of characters, entered by the user.
  • the query predictor examines the one or more characters and makes one or more predictions on what the intended query is.
  • the one or more predictions are in the form of one or more completed queries, each of which was a previously issued query. These one or more predictions are sent to and displayed on the user's computer; effectively assisting the user in formulating a. query before the user is finished typing the entire intended query in the query field.
  • the basic .assumption behind the query predictor is that it is highly probable that a user intends to issue a query in which at least one other person has issued previously.
  • a highly interactive search engine may assist the • user in formulating a query, or perhaps aiding the user in refining a query by listing other possible variants of the query that the user would be interested in. Every previously issued query is saved and logged because, if the query was valuable to one user, it has potential value for another user.
  • the query predictor extends to other languages and is not exclusive to English.
  • the query predictor may also support other types of strings, such as product names and part numbers where the user may only know a small piece of it.
  • the query predictor thus has a searchable database of queries that the query predictor may access once the query predictor has received one or more characters from the user. Based on the partial query, the query predictor determines one or more completed queries from the database of queries that match lexically. However, instead of simply completing ⁇ the partial query lexically and returning only those queries that begin with the character or characters in the partial query, other queries are also found that contain the lexical completion anywhere in the predicted query. For example, if a user enters the string "tti”, not only may "theory of evolution” be a predicted query sent to the user, but also "string theory” or "music theory,” each of which, are not simple lexical completions of "th.”
  • the query predictor biases the resulting set of predicted queries based on their frequency (i.e., number of times the query has issued in the entire query database history), and how often they were issued within a specified time, for instance, within the past week.
  • the fact that the most recently issued queries are biased is based on the premise that a user is more likely to be interested in a subject that many other people are interested in at roughly the same time.
  • the time component is determined by searching at least two databases, one for relatively recent queries and one for relatively older queries, and then scaling the results from searching the recent database and weighting them accordingly.
  • FIG. 2 illustrates steps in which this embodiment may be implemented. It will be apparent that there are many ways this scaling and weighting may be performed, in addition to the number of "old" query databases and "new" query databases, as the invention is not limited to this particular example.
  • the query predictor has access to a small database of all queries that issued in the last week and to a large database of all queries that issued before a week ago. When searching the small database for potentially valuable predicted queries, the number of times a potential query is found in the small database is scaled based on a factor.
  • This factor is the ratio of the number of times a moderately popular query is found in the large database to the number of times that samis moderately popular query is found in the small database. For example, suppose that "Yahoo" is a moderately popular query over the last week and over the past few years. If “Yahoo” is found in the large database 1.7 million times, and 25 thousand times in the small database, then the factor would be 1.7 million/25 thousand, or 68. [0027] Query prediction would be less effective if a moderately popular query in both the small and largs databases were not used to scale. If a query was popular only in the large database and but not in the small database, then the scaling factor would be skewed.
  • the number is scaled, in step 204, by 68, which is based on the scaling factor determined above with "Yahoo" as the scaling query.
  • the resulting scaled value essentially indicates that the potential queries in the small database are equal in weight to the potential queries in the large (i.e. old) database.
  • the query predictor determines, in stsp 206, the number of times the potential query appears in the large database of "older" queries.
  • a weight is applied to the potential queries in the small database versus the potential queries in the large database. This is performed by multiplying the result of the scaled small database number by 2/3 and adding it to the result of multiplying the number of times the potential query was found in the large database by 1/3 (see steps 208-212). Steps 202-212 are performed for each potential query determined by the query predictor. When there are no more potential queries to process (214), all the potential queries are then compared with each other (step 216) based on their respective values determined for each potential query at step 212. The two or more queries (e.g., ten) with the highest values become the predicted queries, which are subsequently sent to the user.
  • the two or more queries e.g., ten
  • the search engine component processes the particular predicted query (i.e. the most likely intended predicted query) that a user would be interested in.
  • the particular predicted query is processed to obtain search results.
  • the search engine that may be used for this purpose is common in the art and requires no further description.
  • the search results obtained by the search engine are sent to and displayed on the user's computer. If the particular predicted query is the user's intended query, the search results based on the particular predicted query may appear on the user's monitor even before the user enters another character in the query field and very likely before the user finishes entering the full intended query. If the particular predicted query is not the user's intended query, then the user may select a different predicted query in the list or continue typing, at which time a new set of search results, based on the selected or new particular predicted query, will be displayed via the user interface.
  • FIG. 3A is a block diagram that illustrates one way a partial query is processed and how the results of the partial query are returned, according to one embodiment of the invention.
  • a user at a client 300 enters a partial query in a web browser 302.
  • the partial query 312 is sent to a front end server 304 over a network 350.
  • Front end server 304 is not a necessary element in any embodiment of the invention. Its main purpose is to add security to the interactive search engine system.
  • Network 350 is also not a required element in any embodiment, but is merely illustrated to show one approach in which the invention may be implemented.
  • Network 350 may be a local area network (LAN), a wide area network (WAN), or the Internet.
  • Front end server 304 forwards partial query 312 to a query predictor 306, discussed above, which processes the partial query.
  • Front end server 304, query predictor 306, and a search engine 308, or any combination thereof, may be implemented on the same device. However, for the purpose of illustration and simplification, they each reside on different devices.
  • Query predictor 306 determines a set of one or more predicted queries based on the partial query and sends them 314 back to front end server 304. Along with the set of predicted queries, query predictor 306 sends additional data indicating which of the predicted queries in the set is the particular predicted query. Either query predictor 306 determines which predicted query is the particular predicted query or web browser 302 is given sufficient iiiformation to make that determination.
  • Front end server 304 then forwards the predicted queries 314 and the data indicating the particular predicted query to client 300 over network 350 to be displayed on web browser 302.
  • web browser 302 Upon receipt of the set of predicted queries, web browser 302 sends particular predicted query 316 over network 350 to front end server 304, which forwards particular predicted query 316 to search engine 308.
  • the search results 318 are finally sent to front end server 304, which forwards them 318 to client 300 over network 350.
  • One advantage of this implementation is that the predicted queries are sent immediately to the user as soon as they are determined. However, this implementation also illustrates the possibility that for every character the user types into the query field of his web browser, there are two complete round trips that a communication has to make between client 300 and front end server 304.
  • FIG. 3B is a block diagram that illustrates a different way in which a partial query is processed and how the results are returned to the user, according to another embodiment of vhe invention.
  • a user at client 300 enters a partial query in a web browser 302. Partial query
  • Front end server 304 forwards partial query 31?. to query predictor 306, which processes the partial query.
  • Query predictor 306 determines a set of one or more predicted queries based upon the partial query and sends them 314 to front end server 304. Instead of immediately forwarding the predicted queries to client 300, front end server 304 retains the predicted queries and sends search engine 308 the particular predicted query 316. Again, along with the set of predicted queries, query predictor 306 sends additional data indicating which of the predicted queries in the set is the particular predicted query. Either query predictor
  • Search engine 308 processes the particular predicted query to obtain search results.
  • the search results 318 are sent to front end server 304, at which time front end server 304 forwards both predicted queries 314 and search results 318 to client 300 over network 350.
  • query predictor 306 sends the predicted queries 314 and to search engine 308, which subsequently sends the predicted queries 314 and search results 318 to client 300 over network 350.
  • FIG. 3C is a block diagram that illustrates a different way in which a partial query is processed and how the results are returned to the user, according to another embodiment of the invention.
  • a user at client 300 enters a partial query in a web browser 302.
  • Partial query 312 is sent to fiont end server 304 over network 350.
  • Front end server 304 forwards the partial query 312 to query predictor 306, which processes the partial query.
  • Query predictor 306 determines a set of one or more predicted queries based upon the partial query and sends them 314 to front end server 304. Again, along with the set of predicted queries, query predictor 306 sends additional data indicating which of the predicted queries in the set is the particular predicted query. Either query predictor 306 determines which predicted query is the particular predicted query or front end server 304 is given sufficient information to make that determination.
  • front end server 304 sends the predicted queries 314 to client 300 over network 350 and sends particular predicted query 316 to search engine 308 at substantially the same time. It is also possible for query predictor 306 to send the particular predicted query to search engine 308 directly.
  • Search engine 308 processes the particular predicted query to obtain search results.
  • the search results 318 are sent to front end server 304, at which time front end server 304 forwards search results 318 to client 300 over network 350.
  • query predictor 306 sends both the predicted queries 314 and the particular predicted query 316 to search engine 308, after which search engine 308 sends predicted queries 314 and search results 318 to client 300 over network 350.
  • query predictor 306 sends both predicted queries 314 and the particular predicted query 316 to search engine 308, which subsequently sends predicted queries 314 and search results 318 to client 300 over network 350.
  • the advantage of this implementation compared to the embodiment described in FIG. 3 A is thai there is less traffic between client 300 and front end server 304.
  • the advantage compared to the embodiment described in FIG. 3B is that the predicted queries do not have to "wait" for the search results to be produced and sent to front end server 304 before the predicted queries are forwarded to client 300. Thus, the predicted queries are sent immediately upon their production and less communication is required between client 300 and front end server 304.
  • the user interface includes at least 1) a query field 102 where a user enters characters that will constitute the partial query, 2) a drop down box 104 that lists the set of one or more predicted queries, 3) a search results page 110, and 4) a "Search" button 106.
  • the search button may be in the form of any mechanism that allows the user to select the query the user enters, in case the user is not satisfied with any of the predicted queries provided by the interactive search engine.
  • the set of predicted queries listed in drop down box 104 may be represented in almost any other type of user interface element, including, but not limited to, a text box, li:;t box, menu, or context menu.
  • the user interface may be viewed using a web browser, such as Internet Explorer or Mozilla Firefox.
  • the set of predicted queries are listed, beginning at the top, in order of the most likely predicted query to the least likely predicted query.
  • the interactive search engine may be modified in many ways to alter the look, feel, and responsiveness of the search experience.
  • the user interface includes tabs, such buttons or links 122 in FIG. 1, wherein the user may select a subsection of possible queries and search based on that subsection.
  • tabs or “search verticals,” such as "Web,” “Images,” “Video,” and “Shopping” a user may select different query sets.
  • the data for which the query predictor is predicting is different based on what the user is interested in, which data is narrowed by using these tabs. For example, if the user is interested in shopping for a product, the user selects the "Shopping" tab. The user then begins to enter a product name or service in query field 102.
  • the query predictor is not only sent the partial query but also the shopping selection information, indicating that the user is searching for a particular product or service, wherein the query predictor returns only those predicted queries that pertain to products and services.
  • the order of words in the query is unimportant.
  • the issued query does not have to be in English.
  • non-natural strings such as product names and part numbers where the user may only know a portion of the non-natural string. Therefore, the term "word” as used herein may include an English word, a Korean word, or a product number.
  • a user enters "solar wind water power" in the query field.
  • the user does not particularly care about the order.
  • the user is rather interested in queries that contain the words “solar,” “wind,” “water,” and “power” somewhere in the query.
  • the query predictor determines what words are important in the query and which words are not important, and then predicts queries based on the important words instead of predicting queries based simply on a matching substring. DELAY RESULTS
  • the step of displaying the predicted queries and/or the search results is delayed.
  • the query predictor "waits" until certain criteria is satisfied (such as the lapse of a specified amount of time or when a few characters are entered, or both) before the predicted queries and search results are displayed.
  • This additional step of waiting assumes that the user may not be sure what he/she wants to query on.
  • the predicted queries are delayed until the interactive user interface determines, based on the waiting criteria, that this is what the user truly intends to query on.
  • the partial query is processed by the query predictor and the search engine, as described above.
  • advertisements that appear on the interactive user interface change based on the particular predicted query returned from the query predictor.
  • new advertisements that relate to the query are posted on the user interface and advertisements that related to an older and non-relevant query are deleted from the user interface. For instance, if a user types "elli" and the query predictor determines "elliptical" as the particular predicted query, advertisements that relate to exercise equipment will appear on the user interface.
  • the query predictor determines that "theory” is the particular predicted query for the partial query "th" entered by the user.
  • the query predictor or perhaps another program, determines that "theory” is associated with "string theory,” “music theory,” and “math theory” and returns these related subjects to be displayed in the form of predicted queries or in a different form on the user interface. For short queries like "theory,” this additional information happens to be the same set as what the query predictor would produce.
  • the query predictor in addition to the predicted queries, would return queries that are not lexical completions of "international trade," but rather queries related to the topic of international trade. Such queries could be on GATT, WTO, UN, US trade policies, etc. A program separate from the query predictor could also perform this function. [0063] Clearly, this aspect of the invention is not performing query prediction, but rather is providing the user with dynamic, related, and hopefully helpful information. A principle in providing advertisements, additional queries, and other related information is to keep everything that is displayed via the user interface consistent with what the query predictor "believes" is the user's intent, which the query predictor determines from the partial query.
  • FIG. 4 is a block diagram that illustrates a computer system 400 upon which an embodiment of the invention may be implemented.
  • Computer system 400 includes a bus 402 or other communication mechanism for communicating information, and a processor 404 coupled with bus 402 for processing information.
  • Computer system 400 also includes a main memory 406, such as a random access memory (RAM) or other dynamic storage device, coupled to bus 402 for storing information and instructions to be executed by processor 404.
  • Main memory 406 also may be used for storing temporary variables or other intermediate information during execution of instructions to be executed by processor 404.
  • Computer system 400 further includes a read only memory (ROM) 408 or other static storage device coupled to bus 402 for storing static information and instructions for processor 404.
  • ROM read only memory
  • a storage device 410 such as a magnetic disk or optical disk, is provided and coupled to bus 402 for storing information and instructions.
  • Computer system 400 may be coupled via bus 402 to a display 412, such as a cathode ray tube (CRT), for displaying information to a computer user.
  • An input device 414 is coupled to bus 402 for communicating information and command selections to processor 404.
  • cursor control 416 is Another type of user input device, such as a mouse, a trackball, or cursor direction keys for communicating direction information and command selections to processor 404 and for controlling cursor movement on display 412.
  • This input device typically has two degrees of freedom in two axes, a first axis (e.g., x) and a second axis (e.g., y), that allows the device to specify positions in a plane.
  • the invention is related to the use of computer system 400 for implementing the techniques described herein. According to one embodiment of the invention, those techniques are performed by computer system 400 in response to processor 404 executing one or more sequences of one or more instructions contained in main memory 406. Such instructions may be read into main memory 406 from another machine-readable medium, such as storage device 410. Execution of the sequences of instructions contained in main memory 406 causes processor 404 to perform the process steps described herein. In alternative embodiments, hard-wired circuitry may be used in place of or in combination with software instructions to implement the invention. Thus, embodiments of the invention are not limited to any specific combination of hardware circuitry and software.
  • machine-readable medium refers to any medium that participates in providing data that causes a machine to operation in a specific fashion.
  • various machine-readable media are involved, for example, in providing instructions to processor 404 for execution.
  • Such a medium may take many forms, including but not limited to, non-volatile media, volatile media, and transmission media.
  • Non-volatile media includes, for example, optical or magnetic disks, such as storage device 410.
  • Volatile media includes dynamic memory, such as main memory 406.
  • Transmission media includes coaxial cables, copper wire and fiber optics, including the wires that comprise bus 402. Transmission media can also take the form of acoustic or light waves, such as those generated during radio-wave and infra-red data communications.
  • Machine-readable media include, for example, a floppy disk, a flexible disk, hard disk, magnetic tape, or any other magnetic medium, a CD- ROM, any other optical medium, punchcards, papertape, any other physical medium with patterns of holes, a RAM, a PROM, and EPROM, a FLASH-EPROM, any other memory chip or cartridge, a carrier wave as described hereinafter, or any other medium from which a computer can read.
  • Various forms of machine-readable media may be involved in carrying one or more sequences of one or more instructions to processor 404 for execution.
  • the instructions may initially be carried on a magnetic disk of a remote computer.
  • the remote computer can load the instructions into its dynamic memory and send the instructions over a telephone line using a modem.
  • a modem local to computer system 400 can receive the data on the telephone line and use an infra-red transmitter to convert the data to an infra-red signal.
  • An infra-red detector can receive the data carried in the infra-red signal ;and appropriate circuitry can place the data on bus 402.
  • Bus 402 carries the data to main memory 406, from which processor 404 retrieves and executes the instructions.
  • Computer system 400 also includes a communication interface 418 coupled to bus 402.
  • Communication interface 418 provides a two-way data communication coupling to a network link 420 that is connected to a local network 422.
  • communication interface 418 maybe an integrated services digital network (ISDN) card or a modem to provide a data communication connection to a corresponding type of telephone line.
  • ISDN integrated services digital network
  • communication interface 418 may be a local area network (LAN) card to provide a data communication connection to a compatible LAN.
  • LAN local area network
  • Wireless links may also be implemented.
  • communication interface 418 sends and receives electrical, electromagnetic or optical signals that carry digital data streams representing various types of information.
  • Network link 420 typically provides data communication through one or more networks to other data devices.
  • network link 420 may provide a connection through local network 422 to a host computer 424 or to data equipment operated by an Internet Service Provider (ISP) 426.
  • ISP 426 in turn provides data communication services through the world wide packet data communication network now commonly referred to as the "Internet" 428.
  • Internet 428 uses electrical, electromagnetic or optical signals that carry digital data streams.
  • the signals through the various networks ⁇ und the signals on network link 420 and through communication interface 418, which carry the digital data to and from computer system 400, are exemplary forms of carrier waves transporting the information.
  • Computer system 400 can send messages and receive data, including program code, through the network(s), network link 420 and communication interface 418.
  • a server 430 might transmit a requested code for an application program through Internet 428, ISP 426, local network 422 and communication interface 418.
  • the received code may be executed by processor 404 as it is received, and/or stored in storage device 410, or other non-volatile storage for later execution. In this manner, computer system 400 may obtain application code in the form of a carrier wave.
  • embodiments of the invention have been described with reference to numerous specific details that may vary from implementation to implementation.

Abstract

A method and apparatus for providing a highly interactive user interface to a search engine using user input prediction is disclosed. As the user enters a portion of a query in a query field, the portion of the query is sent over a network to a query predictor. The query predictor determines a set of one or more predicted queries that correspond to the portion of the query. A particular predicted query is selected from the set of predicted queries based on selection criteria. The particular predicted query is processed by a search engine to obtain search results. The set of predicted queries and the search results are sent to the user and displayed on the user interface. The user may choose a different predicted query in the set of predicted queries or continue entering characters, both of which may result in different search results being displayed.

Description

PROVIDING PREDICTED SEARCH QUERIES AND PREDICTIVE SEARCH RESULTS
FIELD OF THE INVENTION
[0001] The present invention relates generally to search engines, and more particularly to providing an interactive user interface to a search engine using user input prediction.
BACKGROUISID
[0002] The approaches described in this section are approaches that could be pursued, but not necessarily approaches that have been previously conceived or pursued. Therefore, unless otherwise indicated, it should not be assumed that any of the approaches described in this section qualify as prior art merely by virtue of their inclusion in this section.
[0003] Search engines are common and useful tools for searching the Internet for any type of information that is web accessible. Search engines respond to user queries by generating a lisi: of links to documents deemed relevant to the query. Search engines are also used in proprietary websites to search for information specifically pertaining to the proprietary websites.
[0004] However, search engines perform all their work for a user only after the user has entered a query into a query field and issued the query by clicking "Search," "Enter," or "Go." This traditional approach is intuitive because the only time a search engine "knows" for certain what a user desires is when the user decides that the query is correct and complete by formally issuing the query. Thus, search engines do not provide help to the user while the user is formulating a query. Accordingly, search engines must "wait" to provide the search results until the user has determined that the query is complete, at which time the uϊier explicitly issues the query to a search engine. As a logical extension, any additional information relating to the query and search results is provided after the user issues the query.
[0005] Furthermore, the manner in which the user issues subsequent queries is relatively time consuming. If the user is dissatisfied with the search results of a particular query, the user must reformulate a subsequent query and then issue that query. Again, the search engine does not provide any assistance or search results until after the subsequent query is issued. [0006] Based on the foregoing, there is a need for search engines to be more responsive and proactive in assisting users with the queries they formulate before the user formally issues a full and complete query.
BRIEF DESCRIPTION OF THE DRAWINGS
[0007] The present invention is illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings and in which like reference numerals refer to similar elements and in which:
[0008] FIG . 1 illustrates a user interface that displays predicted queries based on input entered in a query field, and search results from the most likely predicted query, according to one embodiment of the invention;
[0009] FIG. 2 is a flow diagram that illustrates how temporal relevance is factored into determining which potential queries become predicted queries sent to the user, according to one embodiment of the invention;
[0010] FIG, 3A is a block diagram that illustrates the communication between a web browser on a client and a front end server, according to one embodiment of the invention;
[0011] FIG. 3B is a block diagram that illustrates the communication between a web browser on a clϊ.ent and a front end server, according to another embodiment of the invention;
[0012] FIG. 3C is a block diagram that illustrates the communication between a web browser on a client and a front end server, according to another embodiment of the invention;
[0013] FIG. 4 is a block diagram that illustrates a computer system upon which an embodiment of iiιe invention may be implemented.
DETAILED DESCRIPTION
[0014] In the following description, for the purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be apparent, however, that the present invention may be practiced without these specific details. In other instances, well-known structures and devices are shown in block diagram form in order to avoid unnecessarily obscuring the present invention.
OVERVIEW
[0015] Techniques are provided for providing an interface to a search engine. An interface to a search engine assists the user 1) by predicting what the user is searching for based on the character or characters the user has entered so far in the query field of the interface, and 2) by providing search results to the user via the interface without the user having to formally issue the intended query. For each character entered into the query field, that portion of the query entered so far is automatically issued to a query predictor that determine!, a set of one or more predicted queries that correspond to the portion of the query. The set of predicted queries is determined based on the frequency of previously issued queries alone or also on when the previous queries were issued. The most likely predicted query is processed by a search engine to obtain search results. Both the predicted queries and the search results are provided to the user via the interactive user interface. The predicted queries, when displayed to the user, may be ordered based on their popularity (frequency-based) alone or also on their temporal relevance (time-based). [0016] ' If the user is not interested in the search results based on the most likely predicted query, the user may select any query in the set of predicted queries. When the user selects a different predicted query in the list, the search results are then updated to display the search results that pertain to the different predicted query. [0017] In addition to displaying the predicted queries and search results to the user via the user interface, other dynamic data may be provided that relate to the most likely predicted query but are not necessarily obtained by the search results, such as advertisements and other related links to websites.
FUNCTIONAL OVERVIEW
[0018] FIG. 1 illustrates a user interface display, predicted queries, and search results, according to one embodiment of the invention. A user, via a web browser 100, enters characters, which will constitute the user's intended query, into a query field 102. As soon as the first character is entered, and for every subsequently entered character, that portion of the intended query is sent to a query predictor, described below (also see FIGS. 3 A-C). The query predictor determines a set of one or more predicted queries based on the partial query,. The predicted queries are sent back to the user and displayed, for example in a drop box 104. The web browser 100 also displays a selected predicted query 108 (hereinafter referred to as the "particular predicted query").
[0019] A search engine processes the particular predicted query 108 from the set of predicted queries and sends the search results 112 to the user to be displayed, for example in results page 110. It is thus possible that the user only has to enter one or a few characters before the actual intended query is determined and the results of the intended query are displayed. Therefore, search button 106 may never have to be selected in order for an intended query to be issued.
-1- QUERY PREDICTOR
[0020] In one embodiment of the invention, the portion of the query entered so far by a user is sent fiom the user's web browser to a query predictor across a network. This may occur for each character, or sequence of characters, entered by the user. The query predictor examines the one or more characters and makes one or more predictions on what the intended query is. The one or more predictions are in the form of one or more completed queries, each of which was a previously issued query. These one or more predictions are sent to and displayed on the user's computer; effectively assisting the user in formulating a. query before the user is finished typing the entire intended query in the query field.
[0021] The basic .assumption behind the query predictor is that it is highly probable that a user intends to issue a query in which at least one other person has issued previously. By using that information, a highly interactive search engine may assist the • user in formulating a query, or perhaps aiding the user in refining a query by listing other possible variants of the query that the user would be interested in. Every previously issued query is saved and logged because, if the query was valuable to one user, it has potential value for another user.
[0022] In one embodiment, the query predictor extends to other languages and is not exclusive to English. The query predictor may also support other types of strings, such as product names and part numbers where the user may only know a small piece of it.
"SMART" LEXICAL COMPLETION
[0023] The query predictor thus has a searchable database of queries that the query predictor may access once the query predictor has received one or more characters from the user. Based on the partial query, the query predictor determines one or more completed queries from the database of queries that match lexically. However, instead of simply completing^ the partial query lexically and returning only those queries that begin with the character or characters in the partial query, other queries are also found that contain the lexical completion anywhere in the predicted query. For example, if a user enters the string "tti", not only may "theory of evolution" be a predicted query sent to the user, but also "string theory" or "music theory," each of which, are not simple lexical completions of "th."
FREQUENCY AND TIME
[0024] In some situations, many previously issued queries may begin with "th." It has been determined that the most useful queries are likely the ones that not only have issued most often (popularity), but also those that have been issued most recently (temporal relevance). Therefore, in one embodiment of the invention, the query predictor biases the resulting set of predicted queries based on their frequency (i.e., number of times the query has issued in the entire query database history), and how often they were issued within a specified time, for instance, within the past week. The fact that the most recently issued queries are biased is based on the premise that a user is more likely to be interested in a subject that many other people are interested in at roughly the same time. [0025] As an example, although "renewable energy sources" may have issued as a query five time.', more often than "nuclear energy," the partial query "ener" will cause the query predictor to generate "nuclear energy" as the particular predicted query because "nuclear energy" may have been issued much more frequently in the last week due to a hypothetically recent announcement by Congress that 100 nuclear reactors will be constructed.
[0026J In one embodiment, the time component is determined by searching at least two databases, one for relatively recent queries and one for relatively older queries, and then scaling the results from searching the recent database and weighting them accordingly. FIG. 2 illustrates steps in which this embodiment may be implemented. It will be apparent that there are many ways this scaling and weighting may be performed, in addition to the number of "old" query databases and "new" query databases, as the invention is not limited to this particular example. In this embodiment, the query predictor has access to a small database of all queries that issued in the last week and to a large database of all queries that issued before a week ago. When searching the small database for potentially valuable predicted queries, the number of times a potential query is found in the small database is scaled based on a factor. This factor is the ratio of the number of times a moderately popular query is found in the large database to the number of times that samis moderately popular query is found in the small database. For example, suppose that "Yahoo" is a moderately popular query over the last week and over the past few years. If "Yahoo" is found in the large database 1.7 million times, and 25 thousand times in the small database, then the factor would be 1.7 million/25 thousand, or 68. [0027] Query prediction would be less effective if a moderately popular query in both the small and largs databases were not used to scale. If a query was popular only in the large database and but not in the small database, then the scaling factor would be skewed. For example, if the query "floppy disk" were used as the scaling factor and it was queried many times in the history of the large database but was queried only a few times in the previous week, for the simple reason that no one produces or uses floppy disks anymore, then the ratio between the large and small databases would be enormous. This would skew the results of a partial query by heavily weighting relatively recent queries to the detriment of relatively older, and potentially more valuable, queries. [0028] A similar problem would exist if a new query was used as the scaling factor that was only issued in the past week but rarely issued in the history of the large database. For example, "nuclear energy" may be an infrequently issued query in the past. But, because of a hypothetically recent announcement by Congress that 100 nuclear reactors will be constructed, the query "nuclear energy" will likely be issued thousands, if not hundreds of thousands of times. In that case, the scaling factor would be quite small; and when a query in. the small database is weighted against the queries in the large database, then relatively older predicted queries, rather than relatively newer, and potentially more valuable, predicted queries, would most likely be returned to the user. [0029] Therefore, referring to FIG. 2, after the query predictor determines, in step 202, the number of times a given potentially valuable query was issued in the small (i.e. recent) database, the number is scaled, in step 204, by 68, which is based on the scaling factor determined above with "Yahoo" as the scaling query. The resulting scaled value essentially indicates that the potential queries in the small database are equal in weight to the potential queries in the large (i.e. old) database. Subsequently, the query predictor determines, in stsp 206, the number of times the potential query appears in the large database of "older" queries.
[0030] At this point, a weight is applied to the potential queries in the small database versus the potential queries in the large database. This is performed by multiplying the result of the scaled small database number by 2/3 and adding it to the result of multiplying the number of times the potential query was found in the large database by 1/3 (see steps 208-212). Steps 202-212 are performed for each potential query determined by the query predictor. When there are no more potential queries to process (214), all the potential queries are then compared with each other (step 216) based on their respective values determined for each potential query at step 212. The two or more queries (e.g., ten) with the highest values become the predicted queries, which are subsequently sent to the user.
SEARCH ENGINE
[0031] In one embodiment of the invention, the search engine component processes the particular predicted query (i.e. the most likely intended predicted query) that a user would be interested in. The particular predicted query is processed to obtain search results. The search engine that may be used for this purpose is common in the art and requires no further description.
[0032] The search results obtained by the search engine are sent to and displayed on the user's computer. If the particular predicted query is the user's intended query, the search results based on the particular predicted query may appear on the user's monitor even before the user enters another character in the query field and very likely before the user finishes entering the full intended query. If the particular predicted query is not the user's intended query, then the user may select a different predicted query in the list or continue typing, at which time a new set of search results, based on the selected or new particular predicted query, will be displayed via the user interface.
PROVIDING PREDICTED QUERIES AND SEARCH RESULTS
[0033] FIG. 3A is a block diagram that illustrates one way a partial query is processed and how the results of the partial query are returned, according to one embodiment of the invention.
[0034] A user at a client 300 enters a partial query in a web browser 302. The partial query 312 is sent to a front end server 304 over a network 350. Front end server 304 is not a necessary element in any embodiment of the invention. Its main purpose is to add security to the interactive search engine system. Network 350 is also not a required element in any embodiment, but is merely illustrated to show one approach in which the invention may be implemented. Network 350 may be a local area network (LAN), a wide area network (WAN), or the Internet. Front end server 304 forwards partial query 312 to a query predictor 306, discussed above, which processes the partial query. [0035] Front end server 304, query predictor 306, and a search engine 308, or any combination thereof, may be implemented on the same device. However, for the purpose of illustration and simplification, they each reside on different devices. [0036] Query predictor 306 determines a set of one or more predicted queries based on the partial query and sends them 314 back to front end server 304. Along with the set of predicted queries, query predictor 306 sends additional data indicating which of the predicted queries in the set is the particular predicted query. Either query predictor 306 determines which predicted query is the particular predicted query or web browser 302 is given sufficient iiiformation to make that determination. Front end server 304 then forwards the predicted queries 314 and the data indicating the particular predicted query to client 300 over network 350 to be displayed on web browser 302. [0037] Upon receipt of the set of predicted queries, web browser 302 sends particular predicted query 316 over network 350 to front end server 304, which forwards particular predicted query 316 to search engine 308. Search engine 308, described above, processes the particular predicted query to obtain search results. The search results 318 are finally sent to front end server 304, which forwards them 318 to client 300 over network 350.
[0038] One advantage of this implementation is that the predicted queries are sent immediately to the user as soon as they are determined. However, this implementation also illustrates the possibility that for every character the user types into the query field of his web browser, there are two complete round trips that a communication has to make between client 300 and front end server 304.
[0039] FIG. 3B is a block diagram that illustrates a different way in which a partial query is processed and how the results are returned to the user, according to another embodiment of vhe invention.
[0040] A user at client 300 enters a partial query in a web browser 302. Partial query
312 is sent to front end server 304 over a network 350. Front end server 304 forwards partial query 31?. to query predictor 306, which processes the partial query.
[0041] Query predictor 306 determines a set of one or more predicted queries based upon the partial query and sends them 314 to front end server 304. Instead of immediately forwarding the predicted queries to client 300, front end server 304 retains the predicted queries and sends search engine 308 the particular predicted query 316. Again, along with the set of predicted queries, query predictor 306 sends additional data indicating which of the predicted queries in the set is the particular predicted query. Either query predictor
306 determines which predicted query is the particular predicted query or front end server
304 is given sufficient information to make that determination.
[0042] Search engine 308 processes the particular predicted query to obtain search results. The search results 318 are sent to front end server 304, at which time front end server 304 forwards both predicted queries 314 and search results 318 to client 300 over network 350.
[0043] In the absence of front end server 304, query predictor 306 sends the predicted queries 314 and to search engine 308, which subsequently sends the predicted queries 314 and search results 318 to client 300 over network 350.
[0044] One advantage of this implementation is that there is less communication (i.e., traffic) between client 300 and front end server 304. However, the predicted queries may not display on the user's web browser 302 as quickly as in the previous embodiment because the predicted queries must "wait" for the search results to be produced and sent to front end server 304 before the predicted queries are forwarded to client 300. [0045] FIG. 3C is a block diagram that illustrates a different way in which a partial query is processed and how the results are returned to the user, according to another embodiment of the invention.
[0046] A user at client 300 enters a partial query in a web browser 302. Partial query 312 is sent to fiont end server 304 over network 350. Front end server 304 forwards the partial query 312 to query predictor 306, which processes the partial query. [0047] Query predictor 306 determines a set of one or more predicted queries based upon the partial query and sends them 314 to front end server 304. Again, along with the set of predicted queries, query predictor 306 sends additional data indicating which of the predicted queries in the set is the particular predicted query. Either query predictor 306 determines which predicted query is the particular predicted query or front end server 304 is given sufficient information to make that determination.
[0048] Instead of "holding on" to the predicted queries, as in the last embodiment, front end server 304 sends the predicted queries 314 to client 300 over network 350 and sends particular predicted query 316 to search engine 308 at substantially the same time. It is also possible for query predictor 306 to send the particular predicted query to search engine 308 directly.
[0049] Search engine 308 processes the particular predicted query to obtain search results. The search results 318 are sent to front end server 304, at which time front end server 304 forwards search results 318 to client 300 over network 350. In the absence of front end server 304, query predictor 306 sends both the predicted queries 314 and the particular predicted query 316 to search engine 308, after which search engine 308 sends predicted queries 314 and search results 318 to client 300 over network 350. [0050] In the absence of front end server 304, query predictor 306 sends both predicted queries 314 and the particular predicted query 316 to search engine 308, which subsequently sends predicted queries 314 and search results 318 to client 300 over network 350.
[0051] The advantage of this implementation compared to the embodiment described in FIG. 3 A is thai there is less traffic between client 300 and front end server 304. The advantage compared to the embodiment described in FIG. 3B is that the predicted queries do not have to "wait" for the search results to be produced and sent to front end server 304 before the predicted queries are forwarded to client 300. Thus, the predicted queries are sent immediately upon their production and less communication is required between client 300 and front end server 304. USER INTERFACE
[0052] In one embodiment of the invention, as illustrated in FIG. 1, the user interface includes at least 1) a query field 102 where a user enters characters that will constitute the partial query, 2) a drop down box 104 that lists the set of one or more predicted queries, 3) a search results page 110, and 4) a "Search" button 106. The search button may be in the form of any mechanism that allows the user to select the query the user enters, in case the user is not satisfied with any of the predicted queries provided by the interactive search engine. The set of predicted queries listed in drop down box 104 may be represented in almost any other type of user interface element, including, but not limited to, a text box, li:;t box, menu, or context menu. The user interface may be viewed using a web browser, such as Internet Explorer or Mozilla Firefox.
[0053] In one embodiment, the set of predicted queries are listed, beginning at the top, in order of the most likely predicted query to the least likely predicted query.
MODIFICATIONS
[0054] In addition to the user interface, query predictor, and search engine described above, the interactive search engine may be modified in many ways to alter the look, feel, and responsiveness of the search experience.
TABS
[0055] For instance, the user interface includes tabs, such buttons or links 122 in FIG. 1, wherein the user may select a subsection of possible queries and search based on that subsection. With a collection of tabs or "search verticals," such as "Web," "Images," "Video," and "Shopping," a user may select different query sets. The data for which the query predictor is predicting is different based on what the user is interested in, which data is narrowed by using these tabs. For example, if the user is interested in shopping for a product, the user selects the "Shopping" tab. The user then begins to enter a product name or service in query field 102. The query predictor is not only sent the partial query but also the shopping selection information, indicating that the user is searching for a particular product or service, wherein the query predictor returns only those predicted queries that pertain to products and services. KEYWORDS
[0056] Often when a query is issued, the order of words in the query is unimportant. As alluded to earlier, the issued query does not have to be in English. In other embodiments, not only are other natural languages supported, but also non-natural strings, such as product names and part numbers where the user may only know a portion of the non-natural string. Therefore, the term "word" as used herein may include an English word, a Korean word, or a product number.
[0057] When a user enters two or more words in the query field, the user is not necessarily concerned that the search engine returns a link to a web accessible document that contains this two or more words in the order that they were entered. Rather, the user is interested in a web accessible document that merely contains those words, in whatever order they are found.
[0058] For example, a user enters "solar wind water power" in the query field. The user does not particularly care about the order. The user is rather interested in queries that contain the words "solar," "wind," "water," and "power" somewhere in the query. The query predictor determines what words are important in the query and which words are not important, and then predicts queries based on the important words instead of predicting queries based simply on a matching substring. DELAY RESULTS
[0059] In another embodiment, the step of displaying the predicted queries and/or the search results is delayed. Instead of immediately returning predicted queries, the query predictor "waits" until certain criteria is satisfied (such as the lapse of a specified amount of time or when a few characters are entered, or both) before the predicted queries and search results are displayed. This additional step of waiting assumes that the user may not be sure what he/she wants to query on. Thus, the predicted queries are delayed until the interactive user interface determines, based on the waiting criteria, that this is what the user truly intends to query on. Once the waiting criteria are satisfied, the partial query is processed by the query predictor and the search engine, as described above. OTHER DYNAMIC DATA
[0060] There are additional ways to aid users other than to predict the intended query and return the appropriate search results. In another embodiment, advertisements that appear on the interactive user interface change based on the particular predicted query returned from the query predictor. Thus, every time the particular predicted query changes, new advertisements that relate to the query are posted on the user interface and advertisements that related to an older and non-relevant query are deleted from the user interface. For instance, if a user types "elli" and the query predictor determines "elliptical" as the particular predicted query, advertisements that relate to exercise equipment will appear on the user interface.
[0061] In addition to advertisements, other dynamic information may be useful to the user when submitting a query. In another embodiment, information relating to a particular predicted query but not found in the search results are displayed to the user via the user interface. Extending the "theory" example used above, the query predictor determines that "theory" is the particular predicted query for the partial query "th" entered by the user. The query predictor, or perhaps another program, determines that "theory" is associated with "string theory," "music theory," and "math theory" and returns these related subjects to be displayed in the form of predicted queries or in a different form on the user interface. For short queries like "theory," this additional information happens to be the same set as what the query predictor would produce.
[0062] However, if the user entered "interna" in the query field and the query predictor determined that the particular predicted query is "international trade" then the query predictor, in addition to the predicted queries, would return queries that are not lexical completions of "international trade," but rather queries related to the topic of international trade. Such queries could be on GATT, WTO, UN, US trade policies, etc. A program separate from the query predictor could also perform this function. [0063] Clearly, this aspect of the invention is not performing query prediction, but rather is providing the user with dynamic, related, and hopefully helpful information. A principle in providing advertisements, additional queries, and other related information is to keep everything that is displayed via the user interface consistent with what the query predictor "believes" is the user's intent, which the query predictor determines from the partial query.
IMPLEMENTATION MECHANISMS
[0064] Figure 4 is a block diagram that illustrates a computer system 400 upon which an embodiment of the invention may be implemented. Computer system 400 includes a bus 402 or other communication mechanism for communicating information, and a processor 404 coupled with bus 402 for processing information. Computer system 400 also includes a main memory 406, such as a random access memory (RAM) or other dynamic storage device, coupled to bus 402 for storing information and instructions to be executed by processor 404. Main memory 406 also may be used for storing temporary variables or other intermediate information during execution of instructions to be executed by processor 404. Computer system 400 further includes a read only memory (ROM) 408 or other static storage device coupled to bus 402 for storing static information and instructions for processor 404. A storage device 410, such as a magnetic disk or optical disk, is provided and coupled to bus 402 for storing information and instructions. [0065] Computer system 400 may be coupled via bus 402 to a display 412, such as a cathode ray tube (CRT), for displaying information to a computer user. An input device 414, including alphanumeric and other keys, is coupled to bus 402 for communicating information and command selections to processor 404. Another type of user input device is cursor control 416, such as a mouse, a trackball, or cursor direction keys for communicating direction information and command selections to processor 404 and for controlling cursor movement on display 412. This input device typically has two degrees of freedom in two axes, a first axis (e.g., x) and a second axis (e.g., y), that allows the device to specify positions in a plane.
[0066] The invention is related to the use of computer system 400 for implementing the techniques described herein. According to one embodiment of the invention, those techniques are performed by computer system 400 in response to processor 404 executing one or more sequences of one or more instructions contained in main memory 406. Such instructions may be read into main memory 406 from another machine-readable medium, such as storage device 410. Execution of the sequences of instructions contained in main memory 406 causes processor 404 to perform the process steps described herein. In alternative embodiments, hard-wired circuitry may be used in place of or in combination with software instructions to implement the invention. Thus, embodiments of the invention are not limited to any specific combination of hardware circuitry and software. [0067] The term "machine-readable medium" as used herein refers to any medium that participates in providing data that causes a machine to operation in a specific fashion. In an embodiment implemented using computer system 400, various machine-readable media are involved, for example, in providing instructions to processor 404 for execution. Such a medium may take many forms, including but not limited to, non-volatile media, volatile media, and transmission media. Non-volatile media includes, for example, optical or magnetic disks, such as storage device 410. Volatile media includes dynamic memory, such as main memory 406. Transmission media includes coaxial cables, copper wire and fiber optics, including the wires that comprise bus 402. Transmission media can also take the form of acoustic or light waves, such as those generated during radio-wave and infra-red data communications.
[0068] Common forms of machine-readable media include, for example, a floppy disk, a flexible disk, hard disk, magnetic tape, or any other magnetic medium, a CD- ROM, any other optical medium, punchcards, papertape, any other physical medium with patterns of holes, a RAM, a PROM, and EPROM, a FLASH-EPROM, any other memory chip or cartridge, a carrier wave as described hereinafter, or any other medium from which a computer can read.
[0069] Various forms of machine-readable media may be involved in carrying one or more sequences of one or more instructions to processor 404 for execution. For example, the instructions may initially be carried on a magnetic disk of a remote computer. The remote computer can load the instructions into its dynamic memory and send the instructions over a telephone line using a modem. A modem local to computer system 400 can receive the data on the telephone line and use an infra-red transmitter to convert the data to an infra-red signal. An infra-red detector can receive the data carried in the infra-red signal ;and appropriate circuitry can place the data on bus 402. Bus 402 carries the data to main memory 406, from which processor 404 retrieves and executes the instructions. The instructions received by main memory 406 may optionally be stored on storage device 410 either before or after execution by processor 404. [0070] Computer system 400 also includes a communication interface 418 coupled to bus 402. Communication interface 418 provides a two-way data communication coupling to a network link 420 that is connected to a local network 422. For example, communication interface 418 maybe an integrated services digital network (ISDN) card or a modem to provide a data communication connection to a corresponding type of telephone line. As another example, communication interface 418 may be a local area network (LAN) card to provide a data communication connection to a compatible LAN. Wireless links may also be implemented. In any such implementation, communication interface 418 sends and receives electrical, electromagnetic or optical signals that carry digital data streams representing various types of information.
[0071] Network link 420 typically provides data communication through one or more networks to other data devices. For example, network link 420 may provide a connection through local network 422 to a host computer 424 or to data equipment operated by an Internet Service Provider (ISP) 426. ISP 426 in turn provides data communication services through the world wide packet data communication network now commonly referred to as the "Internet" 428. Local network 422 and Internet 428 both use electrical, electromagnetic or optical signals that carry digital data streams. The signals through the various networks ∑und the signals on network link 420 and through communication interface 418, which carry the digital data to and from computer system 400, are exemplary forms of carrier waves transporting the information.
[0072] Computer system 400 can send messages and receive data, including program code, through the network(s), network link 420 and communication interface 418. In the Internet example, a server 430 might transmit a requested code for an application program through Internet 428, ISP 426, local network 422 and communication interface 418. [0073] The received code may be executed by processor 404 as it is received, and/or stored in storage device 410, or other non-volatile storage for later execution. In this manner, computer system 400 may obtain application code in the form of a carrier wave. [0074] In the foregoing specification, embodiments of the invention have been described with reference to numerous specific details that may vary from implementation to implementation. Thus, the sole and exclusive indicator of what is the invention, and is intended by the applicants to be the invention, is the set of claims that issue from this application, in the specific form in which such claims issue, including any subsequent correction. Any definitions expressly set forth herein for terms contained in such claims shall govern the meaning of such terms as used in the claims. Hence, no limitation, element, property, feature, advantage or attribute that is not expressly recited in a claim should limit the scope of such claim in any way. The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense.

Claims

CLAIMSWhat is claimed is:
1. A method for processing query data comprising: receiving from a client over a network a portion of a query; determining a set of one or more predicted queries that correspond to the portion of the query; selecting, based upon selection criteria, a particular predicted query from the set of one or more predicted queries; processing the particular predicted query to obtain search results; and providing both the set of one or more predicted queries and the search results to the client over the network.
2. The method as in Claim 1 , wherein determining the set of one or more predicted queries is based on how often the predicted queries were issued in the past.
3. The method as in Claim 2, wherein determining the set of one or more predicted queries is also based on when the predicted queries were issued.
4. The method as in Claim 1, wherein providing the set of one or more predicted queries to the client over the network further comprises: sending indication information with the one or more predicted queries based on how often the predicted queries issued in the past, wherein the indication information indicates an ordering of the set of one or more predicted queries in which the client may display the predicted queries.
5. The method as in Claim 4, wherein sending indication information with the set of one or mors predicted queries is also based on when the predicted queries were issued.
6. The method as in Claim 1, wherein one of the selection criteria is determined by a selection of a user and the selection is received with the portion of the query.
7. The method as in Claim 1, wherein determining the set of one or more predicted queries that correspond to the portion of the query comprises: identifying key words in the portion of the query; and determining one or more predicted queries that correspond to the key words.
8. The method as in Claim 1, wherein: a query predictor determines the set of one or more predicted queries; and a search engine processes the particular predicted query to obtain the search results.
9. The method as in Claim 8, wherein: the particular predicted query is received from the client after the set of one or more predicted queries are provided to the client over the network; and the search results are subsequently sent to the client.
10. The method as in Claim 8, wherein the set of one or more predicted queries are provided to the client over the network at substantially the same time the particular predicted query is sent to the search engine.
11. The method as in Claim 8, wherein: the portion of the query is received at a front end server; the front end server sends the portion of the query to the query predictor; the front end server receives the set of one or more predicted queries and sends the particular predicted query to the search engine; the front end server receives the search results; and the front end server subsequently provides both the set of one or more predicted queries and the search results to the client over the network.
12. The method as in Claim 1, further comprising providing to the client additional data, including advertisements, that relates to the search results.
13. The method of Claim 1, further comprising: receiving an indication of a selection of another predicted query in the set of one or more predicted queries; and providing new search results based on the selection of the selected predicted query.
14. . The meiϊiod of Claim 1, comprising the steps of: receiving from the client over the network the portion of the query and a subsequent portion of the query; determiiύng a second set of one or more predicted queries that corresponds to the portion of the query and the subsequent portion of the query; and selecting, based upon selection criteria, a subsequent particular predicted query from the second set of one or more predicted queries, wherein the subsequent particular predicted query is not processed to obtain search results if the subsequent particular predicted query is the same as the previous particular predicted query.
15. A machine-readable medium carrying instructions which, when processed by one or more processors, causes the one or more processes to perform the method recited in any of Claims 1-14.
PCT/US2006/047543 2005-12-20 2006-12-12 Providing predicted search queries and predictive search results WO2007078768A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/313,525 2005-12-20
US11/313,525 US7516124B2 (en) 2005-12-20 2005-12-20 Interactive search engine

Publications (1)

Publication Number Publication Date
WO2007078768A1 true WO2007078768A1 (en) 2007-07-12

Family

ID=38006972

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2006/047543 WO2007078768A1 (en) 2005-12-20 2006-12-12 Providing predicted search queries and predictive search results

Country Status (2)

Country Link
US (1) US7516124B2 (en)
WO (1) WO2007078768A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012012396A3 (en) * 2010-07-22 2012-10-04 Google Inc. Predictive query suggestion caching
AU2011312800B2 (en) * 2010-09-27 2014-10-16 Exxonmobil Upstream Research Company Simultaneous source encoding and source separation as a practical solution for full wavefield inversion

Families Citing this family (78)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7844594B1 (en) 1999-06-18 2010-11-30 Surfwax, Inc. Information search, retrieval and distillation into knowledge objects
US20090006543A1 (en) * 2001-08-20 2009-01-01 Masterobjects System and method for asynchronous retrieval of information based on incremental user input
US8112529B2 (en) 2001-08-20 2012-02-07 Masterobjects, Inc. System and method for asynchronous client server session communication
US20050097089A1 (en) * 2003-11-05 2005-05-05 Tom Nielsen Persistent user interface for providing navigational functionality
US7836044B2 (en) 2004-06-22 2010-11-16 Google Inc. Anticipated query generation and processing in a search engine
US7487145B1 (en) * 2004-06-22 2009-02-03 Google Inc. Method and system for autocompletion using ranked results
US7596571B2 (en) * 2004-06-30 2009-09-29 Technorati, Inc. Ecosystem method of aggregation and search and related techniques
US20060106769A1 (en) 2004-11-12 2006-05-18 Gibbs Kevin A Method and system for autocompletion for languages having ideographs and phonetic characters
GB0508468D0 (en) * 2005-04-26 2005-06-01 Ramakrishna Madhusudana Method and system providing data in dependence on keywords in electronic messages
US7844599B2 (en) * 2005-08-24 2010-11-30 Yahoo! Inc. Biasing queries to determine suggested queries
US7747639B2 (en) * 2005-08-24 2010-06-29 Yahoo! Inc. Alternative search query prediction
GB0517585D0 (en) * 2005-08-30 2005-10-05 Ramakrishna Madhusudana Intuitive search which delivers fast results on the mobile phone
US7680763B2 (en) * 2005-12-21 2010-03-16 International Business Machines Corporation Administration of resources in system-wide search systems
US7769804B2 (en) 2006-01-17 2010-08-03 Microsoft Corporation Server side search with multi-word word wheeling and wildcard expansion
US20070174258A1 (en) * 2006-01-23 2007-07-26 Jones Scott A Targeted mobile device advertisements
US8065286B2 (en) 2006-01-23 2011-11-22 Chacha Search, Inc. Scalable search system using human searchers
US8266130B2 (en) * 2006-01-23 2012-09-11 Chacha Search, Inc. Search tool providing optional use of human search guides
US7962466B2 (en) * 2006-01-23 2011-06-14 Chacha Search, Inc Automated tool for human assisted mining and capturing of precise results
SG136817A1 (en) * 2006-04-13 2007-11-29 Malawalaarachchige Tissa Perer Keyboard for use in computer system
US8392358B2 (en) * 2006-06-29 2013-03-05 Nice Systems Technologies Inc. Temporal extent considerations in reporting on facts organized as a dimensionally-modeled fact collection
US20090249203A1 (en) * 2006-07-20 2009-10-01 Akira Tsuruta User interface device, computer program, and its recording medium
US20080109401A1 (en) * 2006-09-12 2008-05-08 Microsoft Corporation Presenting predetermined search results with query suggestions
US20080140519A1 (en) * 2006-12-08 2008-06-12 Microsoft Corporation Advertising based on simplified input expansion
US9602880B2 (en) 2006-12-29 2017-03-21 Kip Prod P1 Lp Display inserts, overlays, and graphical user interfaces for multimedia systems
US11783925B2 (en) 2006-12-29 2023-10-10 Kip Prod P1 Lp Multi-services application gateway and system employing the same
US20170344703A1 (en) 2006-12-29 2017-11-30 Kip Prod P1 Lp Multi-services application gateway and system employing the same
WO2008085205A2 (en) 2006-12-29 2008-07-17 Prodea Systems, Inc. System and method for providing network support services and premises gateway support infrastructure
US9569587B2 (en) 2006-12-29 2017-02-14 Kip Prod Pi Lp Multi-services application gateway and system employing the same
US11316688B2 (en) 2006-12-29 2022-04-26 Kip Prod P1 Lp Multi-services application gateway and system employing the same
US20080189179A1 (en) * 2007-02-04 2008-08-07 Joseph Marouani System, Application and Method for Facilitating Online Advertising
US20080288347A1 (en) * 2007-05-18 2008-11-20 Technorati, Inc. Advertising keyword selection based on real-time data
US20080319975A1 (en) * 2007-06-22 2008-12-25 Microsoft Corporation Exploratory Search Technique
US20090006344A1 (en) * 2007-06-28 2009-01-01 Microsoft Corporation Mark-up ecosystem for searching
US8051075B2 (en) * 2007-09-24 2011-11-01 Merced Systems, Inc. Temporally-aware evaluative score
US20090089416A1 (en) * 2007-09-28 2009-04-02 Yahoo! Inc. System and method for non-letter key functionality in a search control
US7831595B2 (en) * 2007-12-31 2010-11-09 Yahoo! Inc. Predicting and ranking search query results
US8232973B2 (en) 2008-01-09 2012-07-31 Apple Inc. Method, device, and graphical user interface providing word recommendations for text input
US20090241044A1 (en) * 2008-03-18 2009-09-24 Cuill, Inc. Apparatus and method for displaying search results using stacks
US8015129B2 (en) * 2008-04-14 2011-09-06 Microsoft Corporation Parsimonious multi-resolution value-item lists
US8832135B2 (en) * 2008-05-02 2014-09-09 Verint Systems, Ltd. Method and system for database query term suggestion
US8312032B2 (en) 2008-07-10 2012-11-13 Google Inc. Dictionary suggestions for partial user entries
US20100076948A1 (en) * 2008-09-09 2010-03-25 International Business Machines Corporation System and method for interfacing search request and corresponding search result
WO2010139277A1 (en) 2009-06-03 2010-12-09 Google Inc. Autocompletion for partially entered query
US8825691B2 (en) * 2009-06-03 2014-09-02 Yahoo! Inc. Open search assist
US20100318538A1 (en) * 2009-06-12 2010-12-16 Google Inc. Predictive searching and associated cache management
US8930842B2 (en) * 2009-12-24 2015-01-06 Xelion B.V. Method for generating a search query
US8239374B2 (en) * 2010-01-18 2012-08-07 Microsoft Corporation Collection of performance information for search queries executed in a tiered architecture
US8498983B1 (en) * 2010-01-29 2013-07-30 Guangsheng Zhang Assisting search with semantic context and automated search options
US8812733B1 (en) 2010-08-19 2014-08-19 Google Inc. Transport protocol independent communications library
EP2606440A1 (en) 2010-08-19 2013-06-26 Google, Inc. Predictive query completion and predictive search results
JP5542017B2 (en) * 2010-09-15 2014-07-09 アルパイン株式会社 Name search device
WO2012037726A1 (en) * 2010-09-21 2012-03-29 Google Inc. Generating search query suggestions
US9946768B2 (en) 2010-11-02 2018-04-17 Microsoft Technology Licensing, Llc Data rendering optimization
US8712989B2 (en) 2010-12-03 2014-04-29 Microsoft Corporation Wild card auto completion
US8762356B1 (en) 2011-07-15 2014-06-24 Google Inc. Detecting change in rate of input reception
US8645825B1 (en) 2011-08-31 2014-02-04 Google Inc. Providing autocomplete suggestions
US9767144B2 (en) 2012-04-20 2017-09-19 Microsoft Technology Licensing, Llc Search system with query refinement
WO2014000143A1 (en) 2012-06-25 2014-01-03 Microsoft Corporation Input method editor application platform
CN102880685B (en) * 2012-09-13 2015-06-24 北京航空航天大学 Method for interval and paging query of time-intensive B/S (Browser/Server) with large data size
US9384279B2 (en) 2012-12-07 2016-07-05 Charles Reed Method and system for previewing search results
KR101511031B1 (en) 2013-09-12 2015-04-10 네이버 주식회사 Search system and method for connecting vertical service
US10255267B2 (en) * 2014-05-30 2019-04-09 Apple Inc. Device, method, and graphical user interface for a predictive keyboard
US11250081B1 (en) * 2014-09-24 2022-02-15 Amazon Technologies, Inc. Predictive search
US10755294B1 (en) 2015-04-28 2020-08-25 Intuit Inc. Method and system for increasing use of mobile devices to provide answer content in a question and answer based customer support system
WO2016201452A1 (en) * 2015-06-11 2016-12-15 Shuster Gary Methods of aggregating and collaborating search results
US10133821B2 (en) * 2016-01-06 2018-11-20 Google Llc Search result prefetching of voice queries
US10769182B2 (en) 2016-06-10 2020-09-08 Apple Inc. System and method of highlighting terms
US10831763B2 (en) 2016-06-10 2020-11-10 Apple Inc. System and method of generating a key list from multiple search domains
CN106909603A (en) 2016-08-31 2017-06-30 阿里巴巴集团控股有限公司 Search information processing method and device
US10445332B2 (en) * 2016-09-28 2019-10-15 Intuit Inc. Method and system for providing domain-specific incremental search results with a customer self-service system for a financial management system
US10552843B1 (en) 2016-12-05 2020-02-04 Intuit Inc. Method and system for improving search results by recency boosting customer support content for a customer self-help system associated with one or more financial management systems
US10922367B2 (en) 2017-07-14 2021-02-16 Intuit Inc. Method and system for providing real time search preview personalization in data management systems
CN107679122B (en) * 2017-09-20 2021-04-30 福建网龙计算机网络信息技术有限公司 Fuzzy search method and terminal
US11093951B1 (en) 2017-09-25 2021-08-17 Intuit Inc. System and method for responding to search queries using customer self-help systems associated with a plurality of data management systems
US11436642B1 (en) 2018-01-29 2022-09-06 Intuit Inc. Method and system for generating real-time personalized advertisements in data management self-help systems
US11269665B1 (en) 2018-03-28 2022-03-08 Intuit Inc. Method and system for user experience personalization in data management systems using machine learning
US11194467B2 (en) 2019-06-01 2021-12-07 Apple Inc. Keyboard management user interfaces
US11416136B2 (en) 2020-09-14 2022-08-16 Apple Inc. User interfaces for assigning and responding to user inputs

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999066427A1 (en) * 1998-06-15 1999-12-23 Amazon.Com, Inc. System and method for refining search queries
DE10235548A1 (en) * 2002-03-25 2003-10-09 Agere Syst Guardian Corp Predictive text input method, especially for mobile telephony, in which the text of a message already input is considered as well as characters of a current word in predicting a word or word sequence
EP1408674A2 (en) * 2002-10-09 2004-04-14 Matsushita Electric Industrial Co., Ltd. Method and device for anticipating operation
WO2004073286A2 (en) * 2003-02-05 2004-08-26 Zi Coporation Of Canada, Inc. Information entry mechanism for small keypads

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4255796A (en) * 1978-02-14 1981-03-10 Bell Telephone Laboratories, Incorporated Associative information retrieval continuously guided by search status feedback
DE3069324D1 (en) * 1980-12-19 1984-10-31 Ibm Interactive data retrieval apparatus
US5278980A (en) 1991-08-16 1994-01-11 Xerox Corporation Iterative technique for phrase query formation and an information retrieval system employing same
US5913215A (en) * 1996-04-09 1999-06-15 Seymour I. Rubinstein Browse by prompted keyword phrases with an improved method for obtaining an initial document set
US5845278A (en) * 1997-09-12 1998-12-01 Inioseek Corporation Method for automatically selecting collections to search in full text searches
US6608895B1 (en) 1998-11-04 2003-08-19 Tadiran Telecom Business Systems, Ltd. Method of anticipatory dialing
US6411950B1 (en) * 1998-11-30 2002-06-25 Compaq Information Technologies Group, Lp Dynamic query expansion
GB2347239B (en) 1999-02-22 2003-09-24 Nokia Mobile Phones Ltd A communication terminal having a predictive editor application
US7181438B1 (en) * 1999-07-21 2007-02-20 Alberti Anemometer, Llc Database access system
US6963867B2 (en) * 1999-12-08 2005-11-08 A9.Com, Inc. Search query processing to provide category-ranked presentation of search results
US6772150B1 (en) * 1999-12-10 2004-08-03 Amazon.Com, Inc. Search query refinement using related search phrases
KR20020019079A (en) 2000-04-13 2002-03-09 요트.게.아. 롤페즈 Search engine with search task model and interactive search task-refinement process
US6507837B1 (en) * 2000-06-08 2003-01-14 Hyperphrase Technologies, Llc Tiered and content based database searching
CA2323856A1 (en) * 2000-10-18 2002-04-18 602531 British Columbia Ltd. Method, system and media for entering data in a personal computing device
US7039635B1 (en) * 2002-06-11 2006-05-02 Microsoft Corporation Dynamically updated quick searches and strategies
US7249126B1 (en) * 2003-12-30 2007-07-24 Shopping.Com Systems and methods for dynamically updating relevance of a selected item
US7664734B2 (en) * 2004-03-31 2010-02-16 Google Inc. Systems and methods for generating multiple implicit search queries
US7836044B2 (en) * 2004-06-22 2010-11-16 Google Inc. Anticipated query generation and processing in a search engine
US20060064411A1 (en) * 2004-09-22 2006-03-23 William Gross Search engine using user intent
US20060173822A1 (en) * 2005-02-03 2006-08-03 Microsoft Corporation System and method for optimization of results based on monetization intent
US7461059B2 (en) * 2005-02-23 2008-12-02 Microsoft Corporation Dynamically updated search results based upon continuously-evolving search query that is based at least in part upon phrase suggestion, search engine uses previous result sets performing additional search tasks

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999066427A1 (en) * 1998-06-15 1999-12-23 Amazon.Com, Inc. System and method for refining search queries
DE10235548A1 (en) * 2002-03-25 2003-10-09 Agere Syst Guardian Corp Predictive text input method, especially for mobile telephony, in which the text of a message already input is considered as well as characters of a current word in predicting a word or word sequence
EP1408674A2 (en) * 2002-10-09 2004-04-14 Matsushita Electric Industrial Co., Ltd. Method and device for anticipating operation
WO2004073286A2 (en) * 2003-02-05 2004-08-26 Zi Coporation Of Canada, Inc. Information entry mechanism for small keypads

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
BAEZA-YATES R ET AL: "MODERN INFORMATION RETRIEVAL, Chapter 5: Query Operations", MODERN INFORMATION RETRIEVAL, HARLOW : ADDISON-WESLEY, GB, 1999, pages 117 - 139, XP002311981, ISBN: 0-201-39829-X *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012012396A3 (en) * 2010-07-22 2012-10-04 Google Inc. Predictive query suggestion caching
EP2596441A2 (en) * 2010-07-22 2013-05-29 Google, Inc. Predictive query suggestion caching
US8560562B2 (en) 2010-07-22 2013-10-15 Google Inc. Predictive query suggestion caching
AU2011282258B2 (en) * 2010-07-22 2015-04-30 Google Llc Predictive query suggestion caching
US9323808B1 (en) 2010-07-22 2016-04-26 Google Inc. Predictive query suggestion caching
AU2011312800B2 (en) * 2010-09-27 2014-10-16 Exxonmobil Upstream Research Company Simultaneous source encoding and source separation as a practical solution for full wavefield inversion

Also Published As

Publication number Publication date
US7516124B2 (en) 2009-04-07
US20070143262A1 (en) 2007-06-21

Similar Documents

Publication Publication Date Title
US7516124B2 (en) Interactive search engine
US7747639B2 (en) Alternative search query prediction
US8868539B2 (en) Search equalizer
US7844599B2 (en) Biasing queries to determine suggested queries
US10372738B2 (en) Speculative search result on a not-yet-submitted search query
JP4962967B2 (en) Web page search server and query recommendation method
US8504567B2 (en) Automatically constructing titles
US20030014398A1 (en) Query modification system for information retrieval
US20090193352A1 (en) Interface for assisting in the construction of search queries
US20100228738A1 (en) Adaptive document sampling for information extraction
WO2006014835B1 (en) Search systems and methods using in-line contextual queries
WO2012023540A1 (en) Information provision device, information provision method, programme, and information recording medium
TWI399657B (en) A provider, a method of providing information, a program, and an information recording medium
JP2010108416A (en) Index generation method, program and server

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 06847607

Country of ref document: EP

Kind code of ref document: A1