CA2302264A1 - Methods and/or systems for selecting data sets - Google Patents
Methods and/or systems for selecting data sets Download PDFInfo
- Publication number
- CA2302264A1 CA2302264A1 CA002302264A CA2302264A CA2302264A1 CA 2302264 A1 CA2302264 A1 CA 2302264A1 CA 002302264 A CA002302264 A CA 002302264A CA 2302264 A CA2302264 A CA 2302264A CA 2302264 A1 CA2302264 A1 CA 2302264A1
- Authority
- CA
- Canada
- Prior art keywords
- key words
- nouns
- rules
- methods
- words
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/31—Indexing; Data structures therefor; Storage structures
- G06F16/313—Selection or weighting of terms for indexing
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/912—Applications of a database
- Y10S707/918—Location
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99936—Pattern matching access
Abstract
Methods and apparatus for identifying associated key words (1035) in a data set (1000). Associated key words are identified by a parser (1020) which firstly operates to extract key words from a data set (1000). These key words are then analysed by the parser (1020) to identify which key words, if any, have an association as determined by a predefined set of rules. These rules are grammatical and include, for example, two keywords both being nouns that occur one after the other without intervening low value words. A similar rule applies to nouns followed by verbs but does not extend to verbs followed by nouns.
These rules allow terms and phrases such as "information technology" and "wide area network" to be identified as associated key words (1035) rather than as individual and unrelated key words.
These rules allow terms and phrases such as "information technology" and "wide area network" to be identified as associated key words (1035) rather than as individual and unrelated key words.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB97306878.6 | 1997-09-04 | ||
EP97306878 | 1997-09-04 | ||
PCT/GB1998/002611 WO1999012108A1 (en) | 1997-09-04 | 1998-08-28 | Methods and/or systems for selecting data sets |
Publications (2)
Publication Number | Publication Date |
---|---|
CA2302264A1 true CA2302264A1 (en) | 1999-03-11 |
CA2302264C CA2302264C (en) | 2009-09-15 |
Family
ID=8229494
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002302264A Expired - Lifetime CA2302264C (en) | 1997-09-04 | 1998-08-28 | Methods and/or systems for selecting data sets |
Country Status (9)
Country | Link |
---|---|
US (1) | US6353827B1 (en) |
EP (1) | EP1010105B1 (en) |
JP (1) | JP4274689B2 (en) |
CN (1) | CN1269897A (en) |
AU (1) | AU742831B2 (en) |
CA (1) | CA2302264C (en) |
DE (1) | DE69809263T2 (en) |
NZ (1) | NZ503279A (en) |
WO (1) | WO1999012108A1 (en) |
Families Citing this family (57)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6115709A (en) * | 1998-09-18 | 2000-09-05 | Tacit Knowledge Systems, Inc. | Method and system for constructing a knowledge profile of a user having unrestricted and restricted access portions according to respective levels of confidence of content of the portions |
US6549897B1 (en) * | 1998-10-09 | 2003-04-15 | Microsoft Corporation | Method and system for calculating phrase-document importance |
JP3685938B2 (en) * | 1998-12-18 | 2005-08-24 | 富士通株式会社 | Communication support method and communication support system |
US7213205B1 (en) * | 1999-06-04 | 2007-05-01 | Seiko Epson Corporation | Document categorizing method, document categorizing apparatus, and storage medium on which a document categorization program is stored |
US7213198B1 (en) * | 1999-08-12 | 2007-05-01 | Google Inc. | Link based clustering of hyperlinked documents |
BE1013153A3 (en) * | 1999-11-25 | 2001-10-02 | Datastat S A | Method and system for information collection. |
US20020059223A1 (en) * | 1999-11-30 | 2002-05-16 | Nash Paul R. | Locator based assisted information browsing |
US6704728B1 (en) * | 2000-05-02 | 2004-03-09 | Iphase.Com, Inc. | Accessing information from a collection of data |
US6711561B1 (en) | 2000-05-02 | 2004-03-23 | Iphrase.Com, Inc. | Prose feedback in information access system |
US8478732B1 (en) * | 2000-05-02 | 2013-07-02 | International Business Machines Corporation | Database aliasing in information access system |
US7383299B1 (en) * | 2000-05-05 | 2008-06-03 | International Business Machines Corporation | System and method for providing service for searching web site addresses |
US7822735B2 (en) | 2000-05-29 | 2010-10-26 | Saora Kabushiki Kaisha | System and method for saving browsed data |
US6408277B1 (en) | 2000-06-21 | 2002-06-18 | Banter Limited | System and method for automatic task prioritization |
US8290768B1 (en) | 2000-06-21 | 2012-10-16 | International Business Machines Corporation | System and method for determining a set of attributes based on content of communications |
US9699129B1 (en) | 2000-06-21 | 2017-07-04 | International Business Machines Corporation | System and method for increasing email productivity |
JP2002140339A (en) * | 2000-10-31 | 2002-05-17 | Tonfuu:Kk | System, device and program for retrieving law and the like |
GB2368670A (en) * | 2000-11-03 | 2002-05-08 | Envisional Software Solutions | Data acquisition system |
US6978419B1 (en) * | 2000-11-15 | 2005-12-20 | Justsystem Corporation | Method and apparatus for efficient identification of duplicate and near-duplicate documents and text spans using high-discriminability text fragments |
US7644057B2 (en) * | 2001-01-03 | 2010-01-05 | International Business Machines Corporation | System and method for electronic communication management |
US20040111386A1 (en) * | 2001-01-08 | 2004-06-10 | Goldberg Jonathan M. | Knowledge neighborhoods |
US7136846B2 (en) | 2001-04-06 | 2006-11-14 | 2005 Keel Company, Inc. | Wireless information retrieval |
IES20020335A2 (en) * | 2001-05-10 | 2002-11-13 | Changing Worlds Ltd | Intelligent internet website with hierarchical menu |
US20040205454A1 (en) * | 2001-08-28 | 2004-10-14 | Simon Gansky | System, method and computer program product for creating a description for a document of a remote network data source for later identification of the document and identifying the document utilizing a description |
US8078545B1 (en) | 2001-09-24 | 2011-12-13 | Aloft Media, Llc | System, method and computer program product for collecting strategic patent data associated with an identifier |
US20030074409A1 (en) * | 2001-10-16 | 2003-04-17 | Xerox Corporation | Method and apparatus for generating a user interest profile |
US7343372B2 (en) * | 2002-02-22 | 2008-03-11 | International Business Machines Corporation | Direct navigation for information retrieval |
US7120641B2 (en) | 2002-04-05 | 2006-10-10 | Saora Kabushiki Kaisha | Apparatus and method for extracting data |
US9805373B1 (en) | 2002-11-19 | 2017-10-31 | Oracle International Corporation | Expertise services platform |
JP4024137B2 (en) * | 2002-11-28 | 2007-12-19 | 沖電気工業株式会社 | Quantity expression search device |
US8495002B2 (en) * | 2003-05-06 | 2013-07-23 | International Business Machines Corporation | Software tool for training and testing a knowledge base |
US20050187913A1 (en) * | 2003-05-06 | 2005-08-25 | Yoram Nelken | Web-based customer service interface |
US7752200B2 (en) | 2004-08-09 | 2010-07-06 | Amazon Technologies, Inc. | Method and system for identifying keywords for use in placing keyword-targeted advertisements |
US20070061158A1 (en) * | 2005-09-09 | 2007-03-15 | Qwest Communications International Inc. | Compliance management using complexity factors |
US20070061157A1 (en) * | 2005-09-09 | 2007-03-15 | Qwest Communications International Inc. | Obligation assignment systems and methods |
US8290962B1 (en) * | 2005-09-28 | 2012-10-16 | Google Inc. | Determining the relationship between source code bases |
US8799512B2 (en) * | 2005-10-19 | 2014-08-05 | Qwest Communications International Inc. | Cross-platform support for a variety of media types |
US8170189B2 (en) | 2005-11-02 | 2012-05-01 | Qwest Communications International Inc. | Cross-platform message notification |
US20070143355A1 (en) * | 2005-12-13 | 2007-06-21 | Qwest Communications International Inc. | Regulatory compliance advisory request system |
EP1798678A1 (en) * | 2005-12-15 | 2007-06-20 | Sap Ag | Method and system for automatically controlling forum posting |
US8122049B2 (en) * | 2006-03-20 | 2012-02-21 | Microsoft Corporation | Advertising service based on content and user log mining |
US20070239895A1 (en) * | 2006-04-05 | 2007-10-11 | Qwest Communications International Inc. | Cross-platform push of various media types |
US9323821B2 (en) * | 2006-04-05 | 2016-04-26 | Qwest Communications International Inc. | Network repository auto sync wireless handset |
US20070239832A1 (en) * | 2006-04-05 | 2007-10-11 | Qwest Communications International Inc. | Communication presentation in a calendar perspective |
US8320535B2 (en) * | 2006-04-06 | 2012-11-27 | Qwest Communications International Inc. | Selectable greeting messages |
US7603351B2 (en) * | 2006-04-19 | 2009-10-13 | Apple Inc. | Semantic reconstruction |
US7890521B1 (en) | 2007-02-07 | 2011-02-15 | Google Inc. | Document-based synonym generation |
US20080208852A1 (en) * | 2007-02-26 | 2008-08-28 | Yahoo! Inc. | Editable user interests profile |
US8661361B2 (en) | 2010-08-26 | 2014-02-25 | Sitting Man, Llc | Methods, systems, and computer program products for navigating between visual components |
US8780130B2 (en) | 2010-11-30 | 2014-07-15 | Sitting Man, Llc | Methods, systems, and computer program products for binding attributes between visual components |
US9715332B1 (en) | 2010-08-26 | 2017-07-25 | Cypress Lake Software, Inc. | Methods, systems, and computer program products for navigating between visual components |
US10397639B1 (en) | 2010-01-29 | 2019-08-27 | Sitting Man, Llc | Hot key systems and methods |
US9760634B1 (en) * | 2010-03-23 | 2017-09-12 | Firstrain, Inc. | Models for classifying documents |
US9727619B1 (en) * | 2013-05-02 | 2017-08-08 | Intelligent Language, LLC | Automated search |
US9892723B2 (en) * | 2013-11-25 | 2018-02-13 | Rovi Guides, Inc. | Systems and methods for presenting social network communications in audible form based on user engagement with a user device |
WO2015165112A1 (en) * | 2014-04-30 | 2015-11-05 | Pivotal Software, Inc. | Validating analytics results |
US9734144B2 (en) * | 2014-09-18 | 2017-08-15 | Empire Technology Development Llc | Three-dimensional latent semantic analysis |
CN108205553B (en) * | 2016-12-19 | 2021-12-28 | 深圳联友科技有限公司 | Interface processing system and method based on text file |
Family Cites Families (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU607963B2 (en) * | 1986-12-04 | 1991-03-21 | Tnet, Inc. | Information retrieval system and method |
US5210868A (en) * | 1989-12-20 | 1993-05-11 | Hitachi Ltd. | Database system and matching method between databases |
JPH04127370A (en) * | 1990-09-19 | 1992-04-28 | Toshiba Corp | Information collecting system |
JP2943447B2 (en) * | 1991-01-30 | 1999-08-30 | 三菱電機株式会社 | Text information extraction device, text similarity matching device, text search system, text information extraction method, text similarity matching method, and question analysis device |
US5265065A (en) * | 1991-10-08 | 1993-11-23 | West Publishing Company | Method and apparatus for information retrieval from a database by replacing domain specific stemmed phases in a natural language to create a search query |
GB9220404D0 (en) * | 1992-08-20 | 1992-11-11 | Nat Security Agency | Method of identifying,retrieving and sorting documents |
US5598557A (en) * | 1992-09-22 | 1997-01-28 | Caere Corporation | Apparatus and method for retrieving and grouping images representing text files based on the relevance of key words extracted from a selected file to the text files |
US5724567A (en) * | 1994-04-25 | 1998-03-03 | Apple Computer, Inc. | System for directing relevance-ranked data objects to computer users |
US5758257A (en) * | 1994-11-29 | 1998-05-26 | Herz; Frederick | System and method for scheduling broadcast of and access to video programs and other data using customer profiles |
AU707050B2 (en) * | 1995-01-23 | 1999-07-01 | British Telecommunications Public Limited Company | Methods and/or systems for accessing information |
US5819260A (en) * | 1996-01-22 | 1998-10-06 | Lexis-Nexis | Phrase recognition method and apparatus |
US5721897A (en) * | 1996-04-09 | 1998-02-24 | Rubinstein; Seymour I. | Browse by prompted keyword phrases with an improved user interface |
US5794233A (en) * | 1996-04-09 | 1998-08-11 | Rubinstein; Seymour I. | Browse by prompted keyword phrases |
US5857184A (en) * | 1996-05-03 | 1999-01-05 | Walden Media, Inc. | Language and method for creating, organizing, and retrieving data from a database |
US5956711A (en) * | 1997-01-16 | 1999-09-21 | Walter J. Sullivan, III | Database system with restricted keyword list and bi-directional keyword translation |
US5933822A (en) * | 1997-07-22 | 1999-08-03 | Microsoft Corporation | Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision |
US6055528A (en) * | 1997-07-25 | 2000-04-25 | Claritech Corporation | Method for cross-linguistic document retrieval |
-
1998
- 1998-08-28 US US09/155,172 patent/US6353827B1/en not_active Expired - Lifetime
- 1998-08-28 EP EP98940436A patent/EP1010105B1/en not_active Expired - Lifetime
- 1998-08-28 AU AU88762/98A patent/AU742831B2/en not_active Expired
- 1998-08-28 NZ NZ503279A patent/NZ503279A/en unknown
- 1998-08-28 WO PCT/GB1998/002611 patent/WO1999012108A1/en active IP Right Grant
- 1998-08-28 DE DE69809263T patent/DE69809263T2/en not_active Expired - Lifetime
- 1998-08-28 CA CA002302264A patent/CA2302264C/en not_active Expired - Lifetime
- 1998-08-28 JP JP2000509044A patent/JP4274689B2/en not_active Expired - Lifetime
- 1998-08-28 CN CN98808771A patent/CN1269897A/en active Pending
Also Published As
Publication number | Publication date |
---|---|
CA2302264C (en) | 2009-09-15 |
EP1010105B1 (en) | 2002-11-06 |
AU8876298A (en) | 1999-03-22 |
US6353827B1 (en) | 2002-03-05 |
WO1999012108A1 (en) | 1999-03-11 |
CN1269897A (en) | 2000-10-11 |
NZ503279A (en) | 2001-07-27 |
EP1010105A1 (en) | 2000-06-21 |
DE69809263D1 (en) | 2002-12-12 |
AU742831B2 (en) | 2002-01-10 |
JP4274689B2 (en) | 2009-06-10 |
JP2001515245A (en) | 2001-09-18 |
DE69809263T2 (en) | 2003-07-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2302264A1 (en) | Methods and/or systems for selecting data sets | |
CA2258711A1 (en) | Method and system for verifying accuracy of spelling and grammatical composition of a document | |
WO2004070574A3 (en) | System and method for semantic software analysis | |
WO1999046662A3 (en) | System for operating on client defined rules | |
CA2304057A1 (en) | System and method using natural language understanding for speech control application | |
SE9902462D0 (en) | Method and apparatus in a telecommunications system | |
KR950001504A (en) | Method and apparatus for a user of a data processing system to loosely group sliders on an interface | |
EP0827088A3 (en) | Finding and modifying strings of a regular language in a text | |
WO2004095314A3 (en) | System and method for navigating through websites and like information sources | |
CA2252091A1 (en) | System and method for automated retrieval of information | |
CA2300495A1 (en) | Technique for multi-rate coding of a signal containing information | |
WO1998037478A3 (en) | Group action processing between users | |
WO2000028445A3 (en) | Lender and insurer transaction processing system and method | |
WO1999044152A3 (en) | Apparatus and data network browser for providing context sensitive web communications | |
WO1998049628A3 (en) | Method for preventing buffer deadlock in dataflow computations | |
WO1995022230A3 (en) | A method and a system for identifying call records | |
WO2002073398A3 (en) | Method, system, and program for determining system configuration information | |
WO2002097608A8 (en) | Method and system in an office application for providing content dependent help information | |
Neal Snape et al. | Comparing Chinese, Japanese and Spanish speakers in L2 English article acquisition: Evidence against the fluctuation hypothesis? | |
MX9806168A (en) | Database access. | |
WO1999033015A3 (en) | Dynamic rule based market research database | |
Cole | Switch-reference in two Quechua languages | |
WO2001004772A3 (en) | A method of and apparatus for generating documents | |
CA2151370A1 (en) | A speech recognition system | |
Hewson | Person hierarchies in Algonkian and Inuktitut |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
MKEX | Expiry |
Effective date: 20180828 |