US 20030171910A1
(19) United States
(12) Patent Application Publication (io) Pub. No.: US 2003/0171910 Al
Abir (43) Pub. Date: Sep. 11,2003
Correspondence Address:
ARNOLD & PORTER
IP DOCKETING DEPARTMENT; RM 1126(b)
555 12TH STREET, N.W.
WASHINGTON, DC 20004-1206 (US)
(21) Appl. No.: 10/281,997
(22) Filed: Oct. 29, 2002
Related U.S. Application Data
(63) Continuation-in-part ol application No. 10/157,894, filed on May 31,2002, which is a continuation-in-part ol application No. 10/024,473, filed on Dec. 21,2001.
(60) Provisional application No. 60/276,107, filed on Mar. 16, 2001. Provisional application No. 60/299,472, filed on Jun. 21, 2001.
Publication Classification
(51) Int. CI.7 G06F 17 20
(52) U.S. CI 704 1
(57) ABSTRACT
A method for creating and using a cross-idea association database that includes a method for associating words and word strings in a language by analyzing word formations around a word or word string to identily owther words or word strings that are equivalents or near equivalents semantically. One method for associating words and word strings includes querying a collection of documents with a usersupplied word or word string, determining a user-defined amount of words or word strings to the left and right of the query string, determining the frequency of occurrence of words or word strings located on the left and right of the query string, and ranking the located words.