CA2302264A1 - Methods and/or systems for selecting data sets - Google Patents

Methods and/or systems for selecting data sets

Info

Publication number
CA2302264A1
Authority
CA
Canada
Prior art keywords
key words
nouns
rules
methods
words
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA002302264A
Other languages
French (fr)
Other versions
CA2302264C (en)
Inventor
Nicholas John Davies
Richard Weeks
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
British Telecommunications PLC
Original Assignee
British Telecommunications Public Limited Company
Nicholas John Davies
Richard Weeks
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by British Telecommunications Public Limited Company, Nicholas John Davies, Richard Weeks
Publication of CA2302264A1
Application granted
Publication of CA2302264C
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31 Indexing; Data structures therefor; Storage structures
    • G06F16/313 Selection or weighting of terms for indexing
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/912 Applications of a database
    • Y10S707/918 Location
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/99931 Database or file accessing
    • Y10S707/99933 Query processing, i.e. searching
    • Y10S707/99936 Pattern matching access

Abstract

Methods and apparatus for identifying associated key words (1035) in a data set (1000). A parser (1020) first extracts key words from the data set (1000), then analyses them to identify which key words, if any, are associated according to a predefined set of rules. The rules are grammatical: for example, two key words are associated when both are nouns and occur one after the other with no intervening low-value words. A similar rule applies to a noun followed by a verb, but not to a verb followed by a noun.
These rules allow terms and phrases such as "information technology" and "wide area network" to be identified as associated key words (1035) rather than as individual, unrelated key words.
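The two rules named in the abstract can be illustrated with a minimal Python sketch. It is not the patented parser: the low-value word list, the toy part-of-speech lexicon, and the function names `extract_key_words` and `associated_key_words` are all assumptions made for illustration, and a real system would presumably use a proper tagger and a fuller rule set.

```python
import re
from typing import List, Tuple

# Hypothetical low-value (stop) word list; the abstract does not enumerate one.
LOW_VALUE_WORDS = {"a", "an", "and", "for", "in", "of", "on", "the", "to"}

# Hypothetical toy part-of-speech lexicon standing in for a real tagger.
POS_LEXICON = {
    "information": "noun",
    "technology": "noun",
    "network": "noun",
    "improves": "verb",
    "improve": "verb",
}

# Tag pairs treated as associated: noun+noun and noun+verb, but not verb+noun.
ASSOCIATING_TAG_PAIRS = {("noun", "noun"), ("noun", "verb")}


def extract_key_words(text: str) -> List[Tuple[int, str]]:
    """Return (token position, word) for every word not in the low-value list."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [(i, t) for i, t in enumerate(tokens) if t not in LOW_VALUE_WORDS]


def associated_key_words(text: str) -> List[Tuple[str, str]]:
    """Pair consecutive key words that satisfy one of the two grammatical rules."""
    key_words = extract_key_words(text)
    pairs = []
    for (i, first), (j, second) in zip(key_words, key_words[1:]):
        # Positions differ by one only when no low-value word sits between them.
        adjacent = j == i + 1
        tags = (POS_LEXICON.get(first), POS_LEXICON.get(second))
        if adjacent and tags in ASSOCIATING_TAG_PAIRS:
            pairs.append((first, second))
    return pairs


print(associated_key_words("Information technology improves the network"))
```

For the sample sentence the sketch pairs "information" with "technology" (noun, noun) and "technology" with "improves" (noun, verb), while "improves ... network" is rejected both because "the" intervenes and because the order is verb then noun. Chaining pairs into longer phrases such as "wide area network" is left out of the sketch.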
CA002302264A 1997-09-04 1998-08-28 Methods and/or systems for selecting data sets Expired - Lifetime CA2302264C (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
GB97306878.6 1997-09-04
EP97306878 1997-09-04
PCT/GB1998/002611 WO1999012108A1 (en) 1997-09-04 1998-08-28 Methods and/or systems for selecting data sets

Publications (2)

Publication Number Publication Date
CA2302264A1 (en) 1999-03-11
CA2302264C (en) 2009-09-15

Family

ID=8229494

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002302264A Expired - Lifetime CA2302264C (en) 1997-09-04 1998-08-28 Methods and/or systems for selecting data sets

Country Status (9)

Country Link
US (1) US6353827B1 (en)
EP (1) EP1010105B1 (en)
JP (1) JP4274689B2 (en)
CN (1) CN1269897A (en)
AU (1) AU742831B2 (en)
CA (1) CA2302264C (en)
DE (1) DE69809263T2 (en)
NZ (1) NZ503279A (en)
WO (1) WO1999012108A1 (en)

Families Citing this family (57)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6115709A (en) * 1998-09-18 2000-09-05 Tacit Knowledge Systems, Inc. Method and system for constructing a knowledge profile of a user having unrestricted and restricted access portions according to respective levels of confidence of content of the portions
US6549897B1 (en) * 1998-10-09 2003-04-15 Microsoft Corporation Method and system for calculating phrase-document importance
JP3685938B2 (en) * 1998-12-18 2005-08-24 富士通株式会社 Communication support method and communication support system
US7213205B1 (en) * 1999-06-04 2007-05-01 Seiko Epson Corporation Document categorizing method, document categorizing apparatus, and storage medium on which a document categorization program is stored
US7213198B1 (en) * 1999-08-12 2007-05-01 Google Inc. Link based clustering of hyperlinked documents
BE1013153A3 (en) * 1999-11-25 2001-10-02 Datastat S A Method and system for information collection.
US20020059223A1 (en) * 1999-11-30 2002-05-16 Nash Paul R. Locator based assisted information browsing
US6704728B1 (en) * 2000-05-02 2004-03-09 Iphase.Com, Inc. Accessing information from a collection of data
US6711561B1 (en) 2000-05-02 2004-03-23 Iphrase.Com, Inc. Prose feedback in information access system
US8478732B1 (en) * 2000-05-02 2013-07-02 International Business Machines Corporation Database aliasing in information access system
US7383299B1 (en) * 2000-05-05 2008-06-03 International Business Machines Corporation System and method for providing service for searching web site addresses
US7822735B2 (en) 2000-05-29 2010-10-26 Saora Kabushiki Kaisha System and method for saving browsed data
US6408277B1 (en) 2000-06-21 2002-06-18 Banter Limited System and method for automatic task prioritization
US8290768B1 (en) 2000-06-21 2012-10-16 International Business Machines Corporation System and method for determining a set of attributes based on content of communications
US9699129B1 (en) 2000-06-21 2017-07-04 International Business Machines Corporation System and method for increasing email productivity
JP2002140339A (en) * 2000-10-31 2002-05-17 Tonfuu:Kk System, device and program for retrieving law and the like
GB2368670A (en) * 2000-11-03 2002-05-08 Envisional Software Solutions Data acquisition system
US6978419B1 (en) * 2000-11-15 2005-12-20 Justsystem Corporation Method and apparatus for efficient identification of duplicate and near-duplicate documents and text spans using high-discriminability text fragments
US7644057B2 (en) * 2001-01-03 2010-01-05 International Business Machines Corporation System and method for electronic communication management
US20040111386A1 (en) * 2001-01-08 2004-06-10 Goldberg Jonathan M. Knowledge neighborhoods
US7136846B2 (en) 2001-04-06 2006-11-14 2005 Keel Company, Inc. Wireless information retrieval
IES20020335A2 (en) * 2001-05-10 2002-11-13 Changing Worlds Ltd Intelligent internet website with hierarchical menu
US20040205454A1 (en) * 2001-08-28 2004-10-14 Simon Gansky System, method and computer program product for creating a description for a document of a remote network data source for later identification of the document and identifying the document utilizing a description
US8078545B1 (en) 2001-09-24 2011-12-13 Aloft Media, Llc System, method and computer program product for collecting strategic patent data associated with an identifier
US20030074409A1 (en) * 2001-10-16 2003-04-17 Xerox Corporation Method and apparatus for generating a user interest profile
US7343372B2 (en) * 2002-02-22 2008-03-11 International Business Machines Corporation Direct navigation for information retrieval
US7120641B2 (en) 2002-04-05 2006-10-10 Saora Kabushiki Kaisha Apparatus and method for extracting data
US9805373B1 (en) 2002-11-19 2017-10-31 Oracle International Corporation Expertise services platform
JP4024137B2 (en) * 2002-11-28 2007-12-19 沖電気工業株式会社 Quantity expression search device
US8495002B2 (en) * 2003-05-06 2013-07-23 International Business Machines Corporation Software tool for training and testing a knowledge base
US20050187913A1 (en) * 2003-05-06 2005-08-25 Yoram Nelken Web-based customer service interface
US7752200B2 (en) 2004-08-09 2010-07-06 Amazon Technologies, Inc. Method and system for identifying keywords for use in placing keyword-targeted advertisements
US20070061158A1 (en) * 2005-09-09 2007-03-15 Qwest Communications International Inc. Compliance management using complexity factors
US20070061157A1 (en) * 2005-09-09 2007-03-15 Qwest Communications International Inc. Obligation assignment systems and methods
US8290962B1 (en) * 2005-09-28 2012-10-16 Google Inc. Determining the relationship between source code bases
US8799512B2 (en) * 2005-10-19 2014-08-05 Qwest Communications International Inc. Cross-platform support for a variety of media types
US8170189B2 (en) 2005-11-02 2012-05-01 Qwest Communications International Inc. Cross-platform message notification
US20070143355A1 (en) * 2005-12-13 2007-06-21 Qwest Communications International Inc. Regulatory compliance advisory request system
EP1798678A1 (en) * 2005-12-15 2007-06-20 Sap Ag Method and system for automatically controlling forum posting
US8122049B2 (en) * 2006-03-20 2012-02-21 Microsoft Corporation Advertising service based on content and user log mining
US20070239895A1 (en) * 2006-04-05 2007-10-11 Qwest Communications International Inc. Cross-platform push of various media types
US9323821B2 (en) * 2006-04-05 2016-04-26 Qwest Communications International Inc. Network repository auto sync wireless handset
US20070239832A1 (en) * 2006-04-05 2007-10-11 Qwest Communications International Inc. Communication presentation in a calendar perspective
US8320535B2 (en) * 2006-04-06 2012-11-27 Qwest Communications International Inc. Selectable greeting messages
US7603351B2 (en) * 2006-04-19 2009-10-13 Apple Inc. Semantic reconstruction
US7890521B1 (en) 2007-02-07 2011-02-15 Google Inc. Document-based synonym generation
US20080208852A1 (en) * 2007-02-26 2008-08-28 Yahoo! Inc. Editable user interests profile
US8661361B2 (en) 2010-08-26 2014-02-25 Sitting Man, Llc Methods, systems, and computer program products for navigating between visual components
US8780130B2 (en) 2010-11-30 2014-07-15 Sitting Man, Llc Methods, systems, and computer program products for binding attributes between visual components
US9715332B1 (en) 2010-08-26 2017-07-25 Cypress Lake Software, Inc. Methods, systems, and computer program products for navigating between visual components
US10397639B1 (en) 2010-01-29 2019-08-27 Sitting Man, Llc Hot key systems and methods
US9760634B1 (en) * 2010-03-23 2017-09-12 Firstrain, Inc. Models for classifying documents
US9727619B1 (en) * 2013-05-02 2017-08-08 Intelligent Language, LLC Automated search
US9892723B2 (en) * 2013-11-25 2018-02-13 Rovi Guides, Inc. Systems and methods for presenting social network communications in audible form based on user engagement with a user device
WO2015165112A1 (en) * 2014-04-30 2015-11-05 Pivotal Software, Inc. Validating analytics results
US9734144B2 (en) * 2014-09-18 2017-08-15 Empire Technology Development Llc Three-dimensional latent semantic analysis
CN108205553B (en) * 2016-12-19 2021-12-28 深圳联友科技有限公司 Interface processing system and method based on text file

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU607963B2 (en) * 1986-12-04 1991-03-21 Tnet, Inc. Information retrieval system and method
US5210868A (en) * 1989-12-20 1993-05-11 Hitachi Ltd. Database system and matching method between databases
JPH04127370A (en) * 1990-09-19 1992-04-28 Toshiba Corp Information collecting system
JP2943447B2 (en) * 1991-01-30 1999-08-30 三菱電機株式会社 Text information extraction device, text similarity matching device, text search system, text information extraction method, text similarity matching method, and question analysis device
US5265065A (en) * 1991-10-08 1993-11-23 West Publishing Company Method and apparatus for information retrieval from a database by replacing domain specific stemmed phases in a natural language to create a search query
GB9220404D0 (en) * 1992-08-20 1992-11-11 Nat Security Agency Method of identifying,retrieving and sorting documents
US5598557A (en) * 1992-09-22 1997-01-28 Caere Corporation Apparatus and method for retrieving and grouping images representing text files based on the relevance of key words extracted from a selected file to the text files
US5724567A (en) * 1994-04-25 1998-03-03 Apple Computer, Inc. System for directing relevance-ranked data objects to computer users
US5758257A (en) * 1994-11-29 1998-05-26 Herz; Frederick System and method for scheduling broadcast of and access to video programs and other data using customer profiles
AU707050B2 (en) * 1995-01-23 1999-07-01 British Telecommunications Public Limited Company Methods and/or systems for accessing information
US5819260A (en) * 1996-01-22 1998-10-06 Lexis-Nexis Phrase recognition method and apparatus
US5721897A (en) * 1996-04-09 1998-02-24 Rubinstein; Seymour I. Browse by prompted keyword phrases with an improved user interface
US5794233A (en) * 1996-04-09 1998-08-11 Rubinstein; Seymour I. Browse by prompted keyword phrases
US5857184A (en) * 1996-05-03 1999-01-05 Walden Media, Inc. Language and method for creating, organizing, and retrieving data from a database
US5956711A (en) * 1997-01-16 1999-09-21 Walter J. Sullivan, III Database system with restricted keyword list and bi-directional keyword translation
US5933822A (en) * 1997-07-22 1999-08-03 Microsoft Corporation Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision
US6055528A (en) * 1997-07-25 2000-04-25 Claritech Corporation Method for cross-linguistic document retrieval

Also Published As

Publication number Publication date
CA2302264C (en) 2009-09-15
EP1010105B1 (en) 2002-11-06
AU8876298A (en) 1999-03-22
US6353827B1 (en) 2002-03-05
WO1999012108A1 (en) 1999-03-11
CN1269897A (en) 2000-10-11
NZ503279A (en) 2001-07-27
EP1010105A1 (en) 2000-06-21
DE69809263D1 (en) 2002-12-12
AU742831B2 (en) 2002-01-10
JP4274689B2 (en) 2009-06-10
JP2001515245A (en) 2001-09-18
DE69809263T2 (en) 2003-07-10

Similar Documents

Publication Publication Date Title
CA2302264A1 (en) Methods and/or systems for selecting data sets
CA2258711A1 (en) Method and system for verifying accuracy of spelling and grammatical composition of a document
WO2004070574A3 (en) System and method for semantic software analysis
WO1999046662A3 (en) System for operating on client defined rules
CA2304057A1 (en) System and method using natural language understanding for speech control application
SE9902462D0 (en) Method and apparatus in a telecommunications system
KR950001504A (en) Method and apparatus for a user of a data processing system to loosely group sliders on an interface
EP0827088A3 (en) Finding and modifying strings of a regular language in a text
WO2004095314A3 (en) System and method for navigating through websites and like information sources
CA2252091A1 (en) System and method for automated retrieval of information
CA2300495A1 (en) Technique for multi-rate coding of a signal containing information
WO1998037478A3 (en) Group action processing between users
WO2000028445A3 (en) Lender and insurer transaction processing system and method
WO1999044152A3 (en) Apparatus and data network browser for providing context sensitive web communications
WO1998049628A3 (en) Method for preventing buffer deadlock in dataflow computations
WO1995022230A3 (en) A method and a system for identifying call records
WO2002073398A3 (en) Method, system, and program for determining system configuration information
WO2002097608A8 (en) Method and system in an office application for providing content dependent help information
Neal Snape et al. Comparing Chinese, Japanese and Spanish speakers in L2 English article acquisition: Evidence against the fluctuation hypothesis?
MX9806168A (en) Database access.
WO1999033015A3 (en) Dynamic rule based market research database
Cole Switch-reference in two Quechua languages
WO2001004772A3 (en) A method of and apparatus for generating documents
CA2151370A1 (en) A speech recognition system
Hewson Person hierarchies in Algonkian and Inuktitut

Legal Events

Date Code Title Description
EEER Examination request
MKEX Expiry
Effective date: 2018-08-28