US20060101018A1 - Method for processing new sequences being recorded into an interlocking trees datastore - Google Patents

Method for processing new sequences being recorded into an interlocking trees datastore Download PDF

Info

Publication number
US20060101018A1
US20060101018A1 US11/185,620 US18562005A US2006101018A1 US 20060101018 A1 US20060101018 A1 US 20060101018A1 US 18562005 A US18562005 A US 18562005A US 2006101018 A1 US2006101018 A1 US 2006101018A1
Authority
US
United States
Prior art keywords
node
interlocking trees
trees datastore
recording information
accordance
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US11/185,620
Inventor
Jane Mazzagatti
Jane Claar
Steven Rajcan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Unisys Corp
Original Assignee
Unisys Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Unisys Corp filed Critical Unisys Corp
Priority to US11/185,620 priority Critical patent/US20060101018A1/en
Assigned to UNISYS CORPORATION reassignment UNISYS CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: CLAAR, JANE VAN KEUREN, MAZZAGATTI, JANE CAMPBELL, RAJCAN, STEVEN L.
Priority to US11/384,777 priority patent/US8516004B2/en
Publication of US20060101018A1 publication Critical patent/US20060101018A1/en
Priority to US11/449,092 priority patent/US20110314022A9/en
Assigned to CITIBANK, N.A. reassignment CITIBANK, N.A. SECURITY AGREEMENT Assignors: UNISYS CORPORATION, UNISYS HOLDING CORPORATION
Priority to US11/985,623 priority patent/US8010572B1/en
Assigned to UNISYS CORPORATION, UNISYS HOLDING CORPORATION reassignment UNISYS CORPORATION RELEASE BY SECURED PARTY Assignors: CITIBANK, N.A.
Assigned to UNISYS HOLDING CORPORATION, UNISYS CORPORATION reassignment UNISYS HOLDING CORPORATION RELEASE BY SECURED PARTY Assignors: CITIBANK, N.A.
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • G06F16/2308Concurrency control
    • G06F16/2336Pessimistic concurrency control approaches, e.g. locking or multiple versions without time stamps
    • G06F16/2343Locking methods, e.g. distributed locking or locking implementation details
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/2228Indexing structures
    • G06F16/2246Trees, e.g. B+trees
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/90335Query processing
    • G06F16/90344Query processing by using string matching techniques
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99938Concurrency, e.g. lock management in shared database
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99942Manipulating data structure, e.g. compression, compaction, compilation
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99943Generating database or data structure, e.g. via user interface

Definitions

  • This invention relates to the field of building new structure in an interlocking trees datastore.
  • interlocking trees datastore when a new sequence is identified the rules for the construction of a particular interlocking trees datastore may require building a new node or nodes to record the new sequence.
  • care should be taken to prevent access to nodes being changed, for example, by another thread executing in the datastore. Therefore, nodes that are being changed should be locked until the changes are complete in order to prevent such an access.
  • the entire interlocking trees datastore was locked from threads adding new sequences to prevent other threads from accessing changing nodes. This was a severe restriction because it slowed the system down and essentially limited the construction of interlocking trees datastore to a single thread.
  • a method of recording information in an interlocking trees datastore having a plurality of K paths includes receiving a data stream input sequence and traversing the interlocking trees datastore in accordance with the received data stream input sequence for recording the received data stream input sequence within the interlocking trees datastore. First determining whether a K path of the plurality of K paths matches the data stream input sequence is performed and second determining that a new sequence has been encountered in accordance with the first determining is performed. New structure is built in accordance with the second determining.
  • a path of the plurality of paths has a plurality of K nodes including a current K node, an adjacent K node that is adjacent to the current K node, the adjacent K node having a non-adjacent K node that is not in the asCase list of the current K node.
  • the interlocking trees datastore is traversed by determining a match between the non-adjacent K node and an adjacent K node.
  • the interlocking trees datastore can include a plurality of levels and the non-adjacent K node is some other K node of the plurality of K nodes.
  • the interlocking trees datastore is traversed in accordance with an asCase bi-directional link of a K node of the plurality of K nodes.
  • the match is determined in accordance with a Result pointer of a K node of the plurality of K nodes. Traversal occurs from a current K node location to an adjacent K node in accordance with the non-adjacent K node.
  • the adjacent K node is determined in accordance with an asCase list of the current K node.
  • a match is determined between the adjacent K node and a further K node.
  • the KStore engine recognizes that a sequence being processing is ‘new’ when there is no existing structure to record the sequence being processed. This condition may occur at any level in the K structure. At this point it is possible to implement many different processes to deal with the new sequence at substantially low incremental cost.
  • new K structure may be created to record the event, an error statement may be issued, the event may be logged, processes may be initiated to evaluate the new sequence in accordance with some established criteria, or any combination of processes may be initiated in response to encountering new sequence.
  • FIG. 1 shows a block diagram representation of a KStore environment in which the system and method of the present invention can be implemented.
  • FIG. 2A shows a minimal KStore structure
  • FIGS. 2 B-E shows a series of datastore elements that may be formed in the process of building the minimal KStore structure of FIG. 2A .
  • FIGS. 2 F,G show interlocking trees datastores that may be used to represent data according to the system and method of the present invention.
  • FIG. 3 shows a preferred embodiment of the next node process for processing particles in an interlocking trees datastore such as the interlocking trees datastores of FIGS. 2E , F.
  • FIGS. 4A , B show exemplary nodes within an interlocking trees datastore such as the interlocking trees datastores of FIGS. 2F , G.
  • FIG. 5 shows a preferred embodiment of a process for detecting a new sequence in an interlocking trees datastore such as the interlocking trees datastores of FIGS. 2F , G.
  • FIG. 6 shows a preferred embodiment of a process for creating a new node in an interlocking trees datastore such as the interlocking trees datastores of FIGS. 2F , G.
  • FIG. 7 shows a preferred embodiment of a process for creating a new node in an interlocking trees datastore such as the interlocking trees datastores of FIGS. 2F , G
  • FIG. 1 there is shown a block diagram representation of a KStore environment in which the system and method of the present invention may be implemented.
  • KStore environment information, may flow bi-directionally between the KStore 12 and both a data source 30 and an application 34 by way of a K Engine 14 , as understood by those skilled in the art.
  • the transmission of information between the data source 30 and the K Engine 14 may be by way of a learn engine 26
  • the transmission of information between the application 34 and the K Engine 14 may be by way of an API utility 23 , as also understood by those skilled in the art.
  • the data source 30 and the application 34 may be provided with graphical user interfaces 36 , 38 to permit a user to communicate with the system.
  • the K Engine 14 can be provided with a new sequence process 18 to permit the processing of new sequences as described in detail herein below.
  • one of the processes that are related to the new sequence process 18 is the lock process for locking previously existing nodes when a new node is constructed.
  • the performance of the new sequence process 18 may be facilitated by providing specialized utilities 16 .
  • the learn engine 26 and the API utility 23 can communicate with the K Engine 14 both directly and by way of the utilities 16 .
  • the minimal KStore structure 100 is the smallest KStore possible. It includes a single K path, beginning with K node 102 and ending with K node 112 , wherein a portion of the K path ending with K node 112 is a portion of triad 114 having a subcomponent node 106 .
  • the minimal KStore structure 100 also includes an end product node 112 with its end of thought (EOT) root node 126 .
  • EOT end of thought
  • the subcomponent K node 106 can be created by establishing the Case and Result bi-directional links.
  • the Case bi-directional link 104 between the primary root node 102 and the K node 106 as best seen in the datastore element 130 of FIG. 2B .
  • a pointer to node 102 becomes the Case entry of node 106 and the asCase list of the primary root node 102 is updated to include a pointer to the subcomponent node 106 .
  • the Result bi-directional link 120 is established between the elemental root K node 122 and the K node 106 .
  • a pointer to node 122 becomes the Result entry of node 106 and the asResult list of the elemental root node 122 is updated to include the subcomponent node 106 .
  • the Case bi-directional link 110 can then be established between the end product node 112 and the subcomponent node 106 , as best seen in the datastore elements 140 , 150 of FIGS. 2C , D.
  • the datastore element 150 is completed, as best seen in the datastore elements 160 of FIGS. 2E .
  • the datastore element 160 has substantially the same structure as the minimal KStore structure 100 .
  • the datastore elements 130 - 160 illustrate possible stages in the process of building the minimal KStore structure 100 .
  • the building process shown by the datastore elements 130 - 160 can be continued indefinitely to create a triadic continuum of any size.
  • the building of interlocking trees datastores such as the minimal KStore structure 100 is taught in more detail in U.S.
  • the interlocking trees datastore 250 provides a representation of the sequence C-A-T.
  • the sequence of letter data particles required for representing the sequence C-A-T can be streamed into the interlocking trees datastore 250 using a data simulator or by using any other method of entering input into a KStore.
  • the BOT-C node 256 can be built by establishing the Case bi-directional link to the beginning of thought (BOT) node 252 and establishing the Result bi-directional link to the C root node 274 .
  • the BOT-C node 256 can be built in substantially the manner previously described with respect to the nodes 106 of the triad 114 within the minimal KStore structure 100 .
  • the BOT-C node 256 is then designated as the current K location within the structure for this particular thread.
  • the BOT-C-A node 258 can be built into the interlocking trees datastore 250 by establishing the Case bi-directional link to the BOT-C node 256 .
  • the Result bi-directional link to the A root node 270 can then be established.
  • the BOT-C-A node 258 is built in response to receiving the letter particle A.
  • the BOT-C-A node 258 is then designated as the current K location.
  • the Case bi-directional link can be established to the BOT-C-A node 258 and the Result bi-directional link can be established to the T root node 280 .
  • the end product node 266 can then be created by forming the Case bi-directional link to the BOT-C-A-T node 262 and the Result bi-directional link to the EOT node 282 .
  • the K path 265 of the interlocking trees datastore 250 is built for representing the sequence of letter particles within the string C-A-T. With the processing of an EOT node the current K location is set to BOT.
  • a count can be kept within each node of the interlocking trees datastore 250 in order to keep track of the number of times the node is traversed.
  • the counts of the nodes within a K path of an interlocking trees datastore may be incremented each time they are encountered during later traversals of the K path.
  • the counts of the individual nodes of an interlocking trees datastore such as the interlocking trees datastore 250 can be incremented either as they are built, encountered or when the building and traversal is complete.
  • the new sequence determining procedure 500 for determining when a new sequence is received by an interlocking trees datastore such as the interlocking trees datastores 250 , 290 .
  • an input particle is compared to the list of the sensors of the interlocking trees datastore 250 , 290 , as shown in block 504 . If no match is found in block 504 the particle is ignored and execution returns from the new sequence determining procedure 500 . If a match is found the root node corresponding to the particle is thereby located.
  • the particle is considered valid.
  • the asCase list of the current K location node is obtained.
  • a comparison is then performed between the Result node of each node in the asCase list and the root node of the input particle.
  • the comparison between the nodes on the asCase list of the current node and the particle root node is shown in block 508 . If there is a match in the comparison of block 508 the matched node becomes the current K location node as shown block 512 .
  • the count of the new current K location node and the input particle root node may be incremented at this time or may be incremented along with the other nodes in its K path when an end product node is encountered. If there is no match in the comparison of block 508 a new node may be created as shown in block 516 of new sequence determining procedure 500 .
  • the next node processing procedure 300 is a procedure for locating the path indicated by the next K node to be processed.
  • the next node processing in a Kstore procedure 300 can be used to process nodes representing particles or nodes representing complete thoughts within the interlocking trees datastore 250 .
  • the particles set forth herein are input letter particles, it will be understood that the system and method of the invention can apply to any type of particle processing within an interlocking trees datastore. For example, they can apply to input words, sentences, pixels, molecules, amino acids, or any other data that can be received and stored in a datastore.
  • next node particle processing procedure 300 will inherently detect the occurrence of a new sequence by the absence of structure to record an event within the interlocking trees datastore 250 .
  • This feature of the next node processing procedure 300 eliminates the need for any additional operations, beyond the normal input processing procedures, to determine when a new sequence is received.
  • new KStore structure may be built to represent the new sequence.
  • the next node processing procedure 300 and any other process set forth herein may be applied to a single level KStore such as the interlocking trees datastore 25 as well as any level of a multi-level KStore and to any other KStore.
  • next node particle processing procedure 300 may be used in building new structure when it determines that a new sequence has arrived.
  • the particle next node procedure 300 may be applied to processing the sequence C-A-T-S within the interlocking trees datastore 250 to create the interlocking trees datastore 290 .
  • next node processing procedure 300 may thus begin by traversing the existing structure starting at the BOT node and possibly incrementing the count fields in the nodes encountered during the traversal.
  • the current K location node is BOT and the asCase list of the BOT root node 252 may be followed to the BOT-C node 256 .
  • the C root node 274 is defined as a non-adjacent node of the BOT-C node 256 since it is not on the asCase list of the BOT-C node 256 . At this point the count of the BOT node 252 , the BOT-C node 256 , and the C root node 274 can be incremented.
  • next node processing procedure 300 can proceed to block 302 .
  • the next node processing procedure 300 may then make a determination whether the Result pointer of any subcomponent node in the asCase list of the BOT-C node 256 points to the A root node 270 .
  • the only node in the asCase list of the BOT-C node 256 is the BOT-C-A node 258 . Therefore, the current K location node is set to point to the BOT-C-A node 258 in block 302 . Since Node is thus not null as determined in decision 304 , a determination is made in decision 310 whether the Result pointer of the BOT-C-A node 258 points to the A root node 270 .
  • the count of the BOT-C-A node 258 and the Root Node 270 may be incremented.
  • the BOT-C-A node 258 is then made the current K location node in block 312 . In this manner the next node processing procedure 300 can process the input stream. No new structure has been built within the interlocking trees datastore 250 thus far since the BOT-C-A node 258 of the sequence C-A-T-S was already formed when the sequence C-A-T was previously received.
  • the asCase pointer of the BOT-C-A node 258 can be followed to the BOT-C-A-T node 262 and the Result list of the BOT-C-A-T node 262 can be followed to the T node 280 and another match is found.
  • no new structure is built within the interlocking trees datastore 250 .
  • the next node processing procedure 300 has traversed the interlocking trees datastore 250 incrementing the counts of the nodes encountered as the input particles C-A-T of the sequence C-A-T-S were received.
  • next node procedure 300 when the letter particle S is received it will be determined by the next node procedure 300 that the structure for representing the sequence C-A-T-S does not exist within the interlocking trees datastore 250 . Accordingly, when block 302 is executed the BOT-C-A-T-EOT node 266 is located using the asCase list of the BOT-C-A-T node 262 and the BOT-C-A-T-EOT node 266 is assigned to the variable Node in block 302 . Execution of the next node processing procedure 300 proceeds to decision 310 by way of decision 304 since the BOT-C-A-T-EOT node 266 is not null. When the determination of decision 310 is made execution returns to block 302 , since there is no match between the received letter particle S and the EOT node 282 indicated by the Result pointer of the BOT-C-A-T-EOT node 266 .
  • the determination whether Node is null is affirmative the next time decision 304 is encountered. It will be understood that the affirmative determination in decision 304 indicates that the interlocking trees datastore 250 does not include a K path representing the sequence C-A-T-S. Therefore, the sequence C-A-T-S has not been previously entered into the interlocking trees datastore 250 , and it may be determined that the sequence C-A-T-S is a new sequence at this point.
  • the system and method of the present invention may input sequences, and inherently and simultaneously determine the occurrence of a new sequence as part of the normal process of receiving the sequences. There is no need to perform any separate comparisons or any other operations in addition to the processing of the input stream in order to detect the need for new structure since the detection of need for new structure is inherent in decision 304 as part of the process.
  • the determinations of blocks 306 , 308 may be made according to predetermined rules or administrative guidelines or any other set of protocols.
  • the parameters of the guidelines or protocols can be communicated to the next node processing procedure 300 by a calling procedure, a GUI or any other source.
  • the rules or administrative guidelines can be set up by a user or administrator or any other party. For example, the user or administrator or other party can adapt the system and method of the invention to ignore the detection of new structure made in decision 304 and do nothing. Under these guidelines the occurrence of the new sequence can be treated substantially as noise.
  • Another possibility is to adapt the next node processing procedure 300 to build no new structure when a new sequence is detected or captured, but to report the occurrence to the user or administrator.
  • the reporting can be done through the KEngine 14 or it can be performed by passing information to a calling procedure such as the learn engine 26 or the API utilities 23 . Additionally, reporting can be done by way of an email.
  • the reports can be used, for example, to trigger events such as the starting of another process, a new field event or a new record event, to set flags, to log messages, to send information or reports to a graphical user interface (GUI) 36 , to send emails to a list of recipients or to perform any other operations. Additionally, the GUI 36 can report the occurrence to the user or administrator and permit the user to determine how to handle it. If a new node is built in view of the determination of block 306 the guidelines may or may not require providing a report.
  • GUI graphical user interface
  • a report When a report is made according to block 308 it may contain any information that may prove useful. For example, the value represented by the current node at the time that the new sequence was determined in decision 304 can be reported. The identity of the newly received particle can be reported. A list of possible next nodes obtained in block 302 of the particle processing procedure 300 according to the asCase list of the current node can be reported. It is also possible to anticipate the next node by making an estimate of which of the possible next nodes obtained in block 302 is the most likely next node. For example, the most likely next node may be determined by comparing the counts of the possible next nodes.
  • the determination of the mostly likely next node can also be made according to how recently the possible next nodes have been accessed or according to the context in which the determination is made. In another embodiment the determination can be determined based upon processing a further node/particle or nodes/particles.
  • the interlocking trees datastore 290 may be built as a result of the next node processing procedure 300 operating on the letter particle S of the sequence C-A-T-S in the interlocking trees datastore 250 . This may occur when: i) there is an affirmative determination in decision 304 , thereby indicating that a new sequence has been determined, and ii) the protocol of block 306 requires the building of a new node when such a new sequence is determined.
  • the Case bi-directional link may be established to the BOT-C-A-T node 262 by the routine for creating new nodes called in block 306 .
  • a Result bi-directional link may be established to the S root node 278 .
  • a branch is created in the path 265 of the interlocking trees datastore 290 at the BOT-C-A-T node 262 to provide the new K path 263 .
  • the exemplary node 400 of an interlocking trees datastore such as the interlocking trees datastores 250 , 290 .
  • the exemplary node 400 can be an exemplary elemental root node 402 or an exemplary subcomponent node 404 or end product node 404 .
  • memory is allocated for the new node in the manner shown in the exemplary node 400 .
  • a plurality of pointers can then be stored in the allocated memory.
  • the new node is defined by setting the Case pointer to point to the previous node and setting the Result pointer to point to the root node.
  • the Case pointer 406 would point to the subcomponent node 262 .
  • the exemplary node 400 can also include a Result pointer 408 .
  • the Result pointer 408 would point to the S root node 278 .
  • a pointer to asCase list 410 can also be included in the exemplary node 400 .
  • the pointer to asCase list 410 is a pointer to a list of the subcomponent nodes or end product nodes for which the node represented by the exemplary node 400 is the Case node.
  • the pointer to asResult list 412 is a pointer to a list of the subcomponents nodes or end product nodes for which the node represented by the exemplary node 400 is the Result node.
  • the nodes of the interlocking trees datastores 250 , 290 can also include one or more additional fields 414 .
  • the additional fields 414 may be used for an intensity or count associated with the node or for any number of different items associated with the structure.
  • Another example of a parameter that can be stored in an additional field 416 is the particle value for an elemental root node.
  • the exemplary node 450 is an alternate embodiment of the exemplary node 400 .
  • the exemplary node 450 includes the Case pointer 406 , the Result pointer 408 , the pointer to asCase list 410 , the pointer to asResult list 412 , the additional fields 414 and the value field 416 as previously described with respect to the exemplary node 400 .
  • the exemplary node 450 includes the lock definition field 413 .
  • a lock definition within the lock definition field 413 can be set and reset in order to indicate whether the node represented by the exemplary node 450 is locked or unlocked to prevent the node from being accessed while it is being changed.
  • the lock definition field 413 of a node should be checked prior to accessing any nodes in an interlocking trees datastore in order to prevent any attempts to access a locked node.
  • the writing of information to nodes is not permitted without checking the lock definition field 413 while the reading of nodes without checking can be permitted.
  • the lock definition may be stored at another location rather than within the exemplary node 400 .
  • node locking procedure 600 used when building a new node within the interlocking trees datastores 250 , 290 .
  • a new node is created as shown in block 604 .
  • the new node can be created as previously described with respect to FIGS. 2 A-E.
  • Memory for the new node is allocated as shown in the exemplary node 400 and the fields of the exemplary node 450 are populated.
  • Block 604 also shows that the current K location node is the Case node of the newly created node. Therefore, the location of the current K location node can be stored in the Case pointer field 406 of the new node. A pointer to the location of the root node is stored in the Result pointer field 408 . No locking is required while this new node is being constructed.
  • the asCase list of the current K location node may be updated to include the newly created node as shown in block 608 . Therefore, the current K location node may be locked while its asCase list is updated and unlocked after the asCase list is updated. In a preferred embodiment all list updating operations are performed with the node being locked and immediately updated and unlocked in this manner in order to minimize the lock time.
  • the current K location node may be locked and unlocked by setting and resetting the lock definition in the lock definition field 413 of the exemplary node 400 of the current K location node at the beginning and at the end of the operation of block 608 .
  • the asResult list of the root node may be updated by the node locking procedure 600 to include the newly created node. Therefore, as shown in block 612 the root node is locked and its asResult list is updated. The root node to be added may then be unlocked. In a preferred embodiment of the invention the root node is locked by the node locking procedure 600 only during the time it takes to update its asResult list.
  • the asCase list of the Case node and the asResult list of the root node may be updated one at a time so that only one or the other of the two are locked. Therefore, a very important feature of the system and method of the present invention is that only one node is locked at a time for this thread. All of the remaining nodes in the interlocking trees datastores 250 , 290 may remain unlocked. This reduces the scope of the node locking to the minimum amount possible and permits faster operation of the interlocking trees datastore 250 , 290 .
  • the node locking procedure 700 for preventing access to nodes which are involved in the construction of a new node in an interlocking trees datastore 250 , 290 .
  • the current node is locked by a thread as shown in block 702 by setting the lock flag in its lock definition field 413 .
  • This determination should be made in the node locking procedure 700 since it is possible for a thread other than the instant thread to begin building the new node between the time that the instant thread determines that the new node must be built and the time that the current node is actually locked according to block 702 .
  • the determination of decision 706 can be made by checking the asCase list of the current node.
  • the new node may be created as shown in block 710 . Since the current node was locked in block 702 a pointer to the new node can be added to the asCase list of the current node at this time as shown in block 714 . Regardless of whether a new node is created in block 710 the current node is unlocked as shown in block 716 .
  • the root node for the new node is locked as shown in block 720 .
  • a pointer to the new node is added to the asResult list of the root node as shown in block 722 .
  • the root node can then be unlocked as shown in block 724 .
  • only one node is locked at a time.

Abstract

A method of recording information in an interlocking trees datastore having a plurality of K paths includes receiving a data stream input sequence and traversing the interlocking trees datastore in accordance with the received input sequence for recording the received input sequence within the interlocking trees datastore. First determining whether a K path of the plurality of K paths matches the input sequence is performed and second determining that a new sequence has been encountered in accordance with the first determining is performed. New structure is built in accordance with the second determining. A path of the plurality of paths has a plurality of K nodes including a current K node, an adjacent K node that is adjacent to the current K node, the adjacent K node having a non-adjacent K node that is not in the asCase list of the current K node.

Description

    BACKGROUND OF THE INVENTION
  • 1. Field of Invention
  • This invention relates to the field of building new structure in an interlocking trees datastore.
  • 2. Description of Related Art
  • In many applications it is useful to identify when data or input sequences that have not been previously encountered, i.e. new variables or records, are received into a datastore. In known systems identifying a new sequence has required the very computationally intensive procedure of comparing the new sequence with all of the previously received to search for a match. In another known procedure for identifying new sequences a table of the distinct values was constructed for comparison. Therefore, more efficient methods for detecting a new sequence are required.
  • Additionally, when a new sequence is identified the rules for the construction of a particular interlocking trees datastore may require building a new node or nodes to record the new sequence. When a new node is being built care should be taken to prevent access to nodes being changed, for example, by another thread executing in the datastore. Therefore, nodes that are being changed should be locked until the changes are complete in order to prevent such an access. In known systems the entire interlocking trees datastore was locked from threads adding new sequences to prevent other threads from accessing changing nodes. This was a severe restriction because it slowed the system down and essentially limited the construction of interlocking trees datastore to a single thread.
  • All references cited herein are incorporated herein by reference in their entireties.
  • BRIEF SUMMARY OF THE INVENTION
  • A method of recording information in an interlocking trees datastore having a plurality of K paths includes receiving a data stream input sequence and traversing the interlocking trees datastore in accordance with the received data stream input sequence for recording the received data stream input sequence within the interlocking trees datastore. First determining whether a K path of the plurality of K paths matches the data stream input sequence is performed and second determining that a new sequence has been encountered in accordance with the first determining is performed. New structure is built in accordance with the second determining. A path of the plurality of paths has a plurality of K nodes including a current K node, an adjacent K node that is adjacent to the current K node, the adjacent K node having a non-adjacent K node that is not in the asCase list of the current K node. The interlocking trees datastore is traversed by determining a match between the non-adjacent K node and an adjacent K node. The interlocking trees datastore can include a plurality of levels and the non-adjacent K node is some other K node of the plurality of K nodes. The interlocking trees datastore is traversed in accordance with an asCase bi-directional link of a K node of the plurality of K nodes. The match is determined in accordance with a Result pointer of a K node of the plurality of K nodes. Traversal occurs from a current K node location to an adjacent K node in accordance with the non-adjacent K node. The adjacent K node is determined in accordance with an asCase list of the current K node. A match is determined between the adjacent K node and a further K node.
  • As a part of its regular processing of streams of input particles the KStore engine recognizes that a sequence being processing is ‘new’ when there is no existing structure to record the sequence being processed. This condition may occur at any level in the K structure. At this point it is possible to implement many different processes to deal with the new sequence at substantially low incremental cost.
  • For example, new K structure may be created to record the event, an error statement may be issued, the event may be logged, processes may be initiated to evaluate the new sequence in accordance with some established criteria, or any combination of processes may be initiated in response to encountering new sequence.
  • BRIEF DESCRIPTION OF SEVERAL VIEWS OF THE DRAWINGS
  • The invention will be described in conjunction with the following drawings in which like reference numerals designate like elements and wherein:
  • FIG. 1 shows a block diagram representation of a KStore environment in which the system and method of the present invention can be implemented.
  • FIG. 2A shows a minimal KStore structure.
  • FIGS. 2B-E shows a series of datastore elements that may be formed in the process of building the minimal KStore structure of FIG. 2A.
  • FIGS. 2F,G show interlocking trees datastores that may be used to represent data according to the system and method of the present invention.
  • FIG. 3 shows a preferred embodiment of the next node process for processing particles in an interlocking trees datastore such as the interlocking trees datastores of FIGS. 2E, F.
  • FIGS. 4A, B show exemplary nodes within an interlocking trees datastore such as the interlocking trees datastores of FIGS. 2F, G.
  • FIG. 5 shows a preferred embodiment of a process for detecting a new sequence in an interlocking trees datastore such as the interlocking trees datastores of FIGS. 2F, G.
  • FIG. 6 shows a preferred embodiment of a process for creating a new node in an interlocking trees datastore such as the interlocking trees datastores of FIGS. 2F, G.
  • FIG. 7 shows a preferred embodiment of a process for creating a new node in an interlocking trees datastore such as the interlocking trees datastores of FIGS. 2F, G
  • DETAILED DESCRIPTION OF THE INVENTION
  • The invention will be illustrated in more detail with reference to the following examples, but it should be understood that the present invention is not deemed to be limited thereto.
  • Referring now to FIG. 1, there is shown a block diagram representation of a KStore environment in which the system and method of the present invention may be implemented. Within the KStore environment information, may flow bi-directionally between the KStore 12 and both a data source 30 and an application 34 by way of a K Engine 14, as understood by those skilled in the art. The transmission of information between the data source 30 and the K Engine 14 may be by way of a learn engine 26, and the transmission of information between the application 34 and the K Engine 14 may be by way of an API utility 23, as also understood by those skilled in the art. The data source 30 and the application 34 may be provided with graphical user interfaces 36, 38 to permit a user to communicate with the system.
  • Additionally, according to the system and method of the invention, the K Engine 14 can be provided with a new sequence process 18 to permit the processing of new sequences as described in detail herein below. For example, one of the processes that are related to the new sequence process 18 is the lock process for locking previously existing nodes when a new node is constructed. The performance of the new sequence process 18 may be facilitated by providing specialized utilities 16. In the preferred embodiment of the invention the learn engine 26 and the API utility 23 can communicate with the K Engine 14 both directly and by way of the utilities 16.
  • Referring now to FIGS. 2A-E, there are shown the minimal KStore structure 100 and a plurality of FIGS. 130-160 which demonstrate a means of creating the minimal KStore structure. The minimal KStore structure 100 is the smallest KStore possible. It includes a single K path, beginning with K node 102 and ending with K node 112, wherein a portion of the K path ending with K node 112 is a portion of triad 114 having a subcomponent node 106. The minimal KStore structure 100 also includes an end product node 112 with its end of thought (EOT) root node 126. The nodes 106, 112 and 126 form a KStore triad.
  • In order to build the minimal KStore structure 100 the subcomponent K node 106 can be created by establishing the Case and Result bi-directional links. The Case bi-directional link 104 between the primary root node 102 and the K node 106, as best seen in the datastore element 130 of FIG. 2B. A pointer to node 102 becomes the Case entry of node 106 and the asCase list of the primary root node 102 is updated to include a pointer to the subcomponent node 106.
  • The Result bi-directional link 120 is established between the elemental root K node 122 and the K node 106. A pointer to node 122 becomes the Result entry of node 106 and the asResult list of the elemental root node 122 is updated to include the subcomponent node 106. It will be understood that the foregoing operations for building a subcomponent node such as the subcomponent node 106 can be described in any order for illustrative purpose and can be performed in any order when practicing the present invention.
  • The Case bi-directional link 110 can then be established between the end product node 112 and the subcomponent node 106, as best seen in the datastore elements 140, 150 of FIGS. 2C, D. By establishing the bi-directional link 124 between the end product node 112 and the EOT node 126, the datastore element 150 is completed, as best seen in the datastore elements 160 of FIGS. 2E. It will be understood that the datastore element 160 has substantially the same structure as the minimal KStore structure 100. Thus, the datastore elements 130-160 illustrate possible stages in the process of building the minimal KStore structure 100. The building process shown by the datastore elements 130-160 can be continued indefinitely to create a triadic continuum of any size. The building of interlocking trees datastores such as the minimal KStore structure 100 is taught in more detail in U.S. patent application Ser. Nos. 10/385,421,10/666,382 and 10/759,466.
  • Referring now to FIGS. 2F, G, there are shown the interlocking trees datastore 250 and the interlocking trees datastore 290. The interlocking trees datastore 250 provides a representation of the sequence C-A-T. The sequence of letter data particles required for representing the sequence C-A-T can be streamed into the interlocking trees datastore 250 using a data simulator or by using any other method of entering input into a KStore. When the letter particle C of the sequence C-A-T is received the BOT-C node 256 can be built by establishing the Case bi-directional link to the beginning of thought (BOT) node 252 and establishing the Result bi-directional link to the C root node 274. Thus, the BOT-C node 256 can be built in substantially the manner previously described with respect to the nodes 106 of the triad 114 within the minimal KStore structure 100. The BOT-C node 256 is then designated as the current K location within the structure for this particular thread.
  • When the letter particle A of the sequence C-A-T is received the BOT-C-A node 258 can be built into the interlocking trees datastore 250 by establishing the Case bi-directional link to the BOT-C node 256. The Result bi-directional link to the A root node 270 can then be established. Thus, the BOT-C-A node 258 is built in response to receiving the letter particle A. The BOT-C-A node 258 is then designated as the current K location.
  • In order to form the BOT-C-A-T node 262 when the T letter particle is received the Case bi-directional link can be established to the BOT-C-A node 258 and the Result bi-directional link can be established to the T root node 280. The end product node 266 can then be created by forming the Case bi-directional link to the BOT-C-A-T node 262 and the Result bi-directional link to the EOT node 282. In this manner the K path 265 of the interlocking trees datastore 250 is built for representing the sequence of letter particles within the string C-A-T. With the processing of an EOT node the current K location is set to BOT.
  • In one preferred embodiment a count can be kept within each node of the interlocking trees datastore 250 in order to keep track of the number of times the node is traversed. The counts of the nodes within a K path of an interlocking trees datastore may be incremented each time they are encountered during later traversals of the K path. Thus, it will be understood that in alternate embodiments of the system and method of the invention the counts of the individual nodes of an interlocking trees datastore such as the interlocking trees datastore 250 can be incremented either as they are built, encountered or when the building and traversal is complete.
  • Referring now to FIG. 5, there is shown the new sequence determining procedure 500 for determining when a new sequence is received by an interlocking trees datastore such as the interlocking trees datastores 250, 290. In the new sequence determining procedure 500 an input particle is compared to the list of the sensors of the interlocking trees datastore 250, 290, as shown in block 504. If no match is found in block 504 the particle is ignored and execution returns from the new sequence determining procedure 500. If a match is found the root node corresponding to the particle is thereby located.
  • If a match is found in block 504 the particle is considered valid. The asCase list of the current K location node is obtained. A comparison is then performed between the Result node of each node in the asCase list and the root node of the input particle. The comparison between the nodes on the asCase list of the current node and the particle root node is shown in block 508. If there is a match in the comparison of block 508 the matched node becomes the current K location node as shown block 512. The count of the new current K location node and the input particle root node may be incremented at this time or may be incremented along with the other nodes in its K path when an end product node is encountered. If there is no match in the comparison of block 508 a new node may be created as shown in block 516 of new sequence determining procedure 500.
  • Referring now to FIG. 3, there is shown a more detailed diagram of boxes 508, 512, and 516 from FIG. 5. The next node processing procedure 300 is a procedure for locating the path indicated by the next K node to be processed. For example, the next node processing in a Kstore procedure 300 can be used to process nodes representing particles or nodes representing complete thoughts within the interlocking trees datastore 250.
  • Although the particles set forth herein are input letter particles, it will be understood that the system and method of the invention can apply to any type of particle processing within an interlocking trees datastore. For example, they can apply to input words, sentences, pixels, molecules, amino acids, or any other data that can be received and stored in a datastore.
  • It will be understood by those skilled in the art that during its next node processing operations the next node particle processing procedure 300 will inherently detect the occurrence of a new sequence by the absence of structure to record an event within the interlocking trees datastore 250. This feature of the next node processing procedure 300 eliminates the need for any additional operations, beyond the normal input processing procedures, to determine when a new sequence is received. When a new sequence is determined within a KStore by the next node processing procedure 300 new KStore structure may be built to represent the new sequence. It will be understood that, the next node processing procedure 300 and any other process set forth herein, may be applied to a single level KStore such as the interlocking trees datastore 25 as well as any level of a multi-level KStore and to any other KStore.
  • Furthermore, the next node particle processing procedure 300 may be used in building new structure when it determines that a new sequence has arrived. For example, the particle next node procedure 300 may be applied to processing the sequence C-A-T-S within the interlocking trees datastore 250 to create the interlocking trees datastore 290.
  • When the sequence of letter data particles required for representing the sequence C-A-T-S is streamed into the KEngine for interlocking trees datastore 250 there is already structure within the interlocking trees datastore for the sequence. This structure was previously created for the C-A-T input sequence. The next node processing procedure 300 may thus begin by traversing the existing structure starting at the BOT node and possibly incrementing the count fields in the nodes encountered during the traversal. When the letter particle C of the sequence C-A-T-S is received, the current K location node is BOT and the asCase list of the BOT root node 252 may be followed to the BOT-C node 256. Since the Result node of the BOT-C node 256 (the C root node 274) matches the input letter particle C, traversal can continue. The C root node 274 is defined as a non-adjacent node of the BOT-C node 256 since it is not on the asCase list of the BOT-C node 256. At this point the count of the BOT node 252, the BOT-C node 256, and the C root node 274 can be incremented.
  • When the letter particle A of the sequence C-A-T-S is received execution of the next node processing procedure 300 can proceed to block 302. The next node processing procedure 300 may then make a determination whether the Result pointer of any subcomponent node in the asCase list of the BOT-C node 256 points to the A root node 270.
  • The only node in the asCase list of the BOT-C node 256 is the BOT-C-A node 258. Therefore, the current K location node is set to point to the BOT-C-A node 258 in block 302. Since Node is thus not null as determined in decision 304, a determination is made in decision 310 whether the Result pointer of the BOT-C-A node 258 points to the A root node 270.
  • Since a match is found in this determination the count of the BOT-C-A node 258 and the Root Node 270 may be incremented. The BOT-C-A node 258 is then made the current K location node in block 312. In this manner the next node processing procedure 300 can process the input stream. No new structure has been built within the interlocking trees datastore 250 thus far since the BOT-C-A node 258 of the sequence C-A-T-S was already formed when the sequence C-A-T was previously received.
  • When the letter particle T of the sequence C-A-T-S is received, the asCase pointer of the BOT-C-A node 258 can be followed to the BOT-C-A-T node 262 and the Result list of the BOT-C-A-T node 262 can be followed to the T node 280 and another match is found. Again, no new structure is built within the interlocking trees datastore 250. Thus, the next node processing procedure 300 has traversed the interlocking trees datastore 250 incrementing the counts of the nodes encountered as the input particles C-A-T of the sequence C-A-T-S were received.
  • However, when the letter particle S is received it will be determined by the next node procedure 300 that the structure for representing the sequence C-A-T-S does not exist within the interlocking trees datastore 250. Accordingly, when block 302 is executed the BOT-C-A-T-EOT node 266 is located using the asCase list of the BOT-C-A-T node 262 and the BOT-C-A-T-EOT node 266 is assigned to the variable Node in block 302. Execution of the next node processing procedure 300 proceeds to decision 310 by way of decision 304 since the BOT-C-A-T-EOT node 266 is not null. When the determination of decision 310 is made execution returns to block 302, since there is no match between the received letter particle S and the EOT node 282 indicated by the Result pointer of the BOT-C-A-T-EOT node 266.
  • Since there are no more nodes in the asCase list of the BOT-C-A-T node 262 the determination whether Node is null is affirmative the next time decision 304 is encountered. It will be understood that the affirmative determination in decision 304 indicates that the interlocking trees datastore 250 does not include a K path representing the sequence C-A-T-S. Therefore, the sequence C-A-T-S has not been previously entered into the interlocking trees datastore 250, and it may be determined that the sequence C-A-T-S is a new sequence at this point.
  • It will thus be appreciated by those skilled in the art that using a procedure such as the next node processing procedure 300 the system and method of the present invention may input sequences, and inherently and simultaneously determine the occurrence of a new sequence as part of the normal process of receiving the sequences. There is no need to perform any separate comparisons or any other operations in addition to the processing of the input stream in order to detect the need for new structure since the detection of need for new structure is inherent in decision 304 as part of the process.
  • In response to the determination in decision 304 that Node is null and that new sequence has been encountered or captured, a determination can be made in block 306 whether a new node should be built in the interlocking trees datastore 250. Additionally, a determination can be made whether the occurrence of the new sequence should be reported to the user, administrator, log, etc. of the system and method of the present invention in block 308.
  • The determinations of blocks 306, 308 may be made according to predetermined rules or administrative guidelines or any other set of protocols. The parameters of the guidelines or protocols can be communicated to the next node processing procedure 300 by a calling procedure, a GUI or any other source. The rules or administrative guidelines can be set up by a user or administrator or any other party. For example, the user or administrator or other party can adapt the system and method of the invention to ignore the detection of new structure made in decision 304 and do nothing. Under these guidelines the occurrence of the new sequence can be treated substantially as noise.
  • Another possibility is to adapt the next node processing procedure 300 to build no new structure when a new sequence is detected or captured, but to report the occurrence to the user or administrator. The reporting can be done through the KEngine 14 or it can be performed by passing information to a calling procedure such as the learn engine 26 or the API utilities 23. Additionally, reporting can be done by way of an email. The reports can be used, for example, to trigger events such as the starting of another process, a new field event or a new record event, to set flags, to log messages, to send information or reports to a graphical user interface (GUI) 36, to send emails to a list of recipients or to perform any other operations. Additionally, the GUI 36 can report the occurrence to the user or administrator and permit the user to determine how to handle it. If a new node is built in view of the determination of block 306 the guidelines may or may not require providing a report.
  • When a report is made according to block 308 it may contain any information that may prove useful. For example, the value represented by the current node at the time that the new sequence was determined in decision 304 can be reported. The identity of the newly received particle can be reported. A list of possible next nodes obtained in block 302 of the particle processing procedure 300 according to the asCase list of the current node can be reported. It is also possible to anticipate the next node by making an estimate of which of the possible next nodes obtained in block 302 is the most likely next node. For example, the most likely next node may be determined by comparing the counts of the possible next nodes. The determination of the mostly likely next node can also be made according to how recently the possible next nodes have been accessed or according to the context in which the determination is made. In another embodiment the determination can be determined based upon processing a further node/particle or nodes/particles.
  • Referring again to FIG. 2G, there is shown the interlocking trees datastore 290. The interlocking trees datastore 290 may be built as a result of the next node processing procedure 300 operating on the letter particle S of the sequence C-A-T-S in the interlocking trees datastore 250. This may occur when: i) there is an affirmative determination in decision 304, thereby indicating that a new sequence has been determined, and ii) the protocol of block 306 requires the building of a new node when such a new sequence is determined.
  • In order to form the BOT-C-A-T-S node 264 the Case bi-directional link may be established to the BOT-C-A-T node 262 by the routine for creating new nodes called in block 306. A Result bi-directional link may be established to the S root node 278. Thus, a branch is created in the path 265 of the interlocking trees datastore 290 at the BOT-C-A-T node 262 to provide the new K path 263.
  • Referring now to FIG. 4A, there is shown the exemplary node 400 of an interlocking trees datastore such as the interlocking trees datastores 250, 290. The exemplary node 400 can be an exemplary elemental root node 402 or an exemplary subcomponent node 404 or end product node 404. When a new node is built in an interlocking datastore memory is allocated for the new node in the manner shown in the exemplary node 400. A plurality of pointers can then be stored in the allocated memory. The new node is defined by setting the Case pointer to point to the previous node and setting the Result pointer to point to the root node. Thus, for example, if the exemplary subcomponent node 404 represents the subcomponent node 264 of the interlocking trees datastores 290, the Case pointer 406 would point to the subcomponent node 262.
  • The exemplary node 400 can also include a Result pointer 408. Thus, when the exemplary node 400 represents the subcomponent node 264 of the interlocking trees datastore 290, the Result pointer 408 would point to the S root node 278.
  • A pointer to asCase list 410 can also be included in the exemplary node 400. The pointer to asCase list 410 is a pointer to a list of the subcomponent nodes or end product nodes for which the node represented by the exemplary node 400 is the Case node. The pointer to asResult list 412 is a pointer to a list of the subcomponents nodes or end product nodes for which the node represented by the exemplary node 400 is the Result node. The nodes of the interlocking trees datastores 250, 290 can also include one or more additional fields 414. The additional fields 414 may be used for an intensity or count associated with the node or for any number of different items associated with the structure. Another example of a parameter that can be stored in an additional field 416 is the particle value for an elemental root node.
  • Referring now to FIG. 4B, there is shown the exemplary node 450. The exemplary node 450 is an alternate embodiment of the exemplary node 400. The exemplary node 450 includes the Case pointer 406, the Result pointer 408, the pointer to asCase list 410, the pointer to asResult list 412, the additional fields 414 and the value field 416 as previously described with respect to the exemplary node 400. Additionally, the exemplary node 450 includes the lock definition field 413. A lock definition within the lock definition field 413 can be set and reset in order to indicate whether the node represented by the exemplary node 450 is locked or unlocked to prevent the node from being accessed while it is being changed. Thus, the lock definition field 413 of a node should be checked prior to accessing any nodes in an interlocking trees datastore in order to prevent any attempts to access a locked node. However, in one preferred embodiment the writing of information to nodes is not permitted without checking the lock definition field 413 while the reading of nodes without checking can be permitted. In another alternate embodiment of the invention the lock definition may be stored at another location rather than within the exemplary node 400.
  • Referring now to FIG. 6, there is shown the node locking procedure 600 used when building a new node within the interlocking trees datastores 250, 290. As previously described, care should be taken to prevent access to a node which is being changed by, for example, another thread executing in a datastore. Therefore, nodes that are linked to new nodes may be locked while their asCase list, asResult list, count field or some other field in the node is updated in order to prevent such an access.
  • In the node locking procedure 600 a new node is created as shown in block 604. The new node can be created as previously described with respect to FIGS. 2A-E. Memory for the new node is allocated as shown in the exemplary node 400 and the fields of the exemplary node 450 are populated. Block 604 also shows that the current K location node is the Case node of the newly created node. Therefore, the location of the current K location node can be stored in the Case pointer field 406 of the new node. A pointer to the location of the root node is stored in the Result pointer field 408. No locking is required while this new node is being constructed.
  • Additionally, the asCase list of the current K location node may be updated to include the newly created node as shown in block 608. Therefore, the current K location node may be locked while its asCase list is updated and unlocked after the asCase list is updated. In a preferred embodiment all list updating operations are performed with the node being locked and immediately updated and unlocked in this manner in order to minimize the lock time.
  • Furthermore, no other nodes are locked by this thread during the period that the current node is locked. Thus, it is an important feature of the present invention that only one node at a time is locked by a particular thread, as described in more detail below. In a preferred embodiment, the current K location node may be locked and unlocked by setting and resetting the lock definition in the lock definition field 413 of the exemplary node 400 of the current K location node at the beginning and at the end of the operation of block 608.
  • Additionally, the asResult list of the root node may be updated by the node locking procedure 600 to include the newly created node. Therefore, as shown in block 612 the root node is locked and its asResult list is updated. The root node to be added may then be unlocked. In a preferred embodiment of the invention the root node is locked by the node locking procedure 600 only during the time it takes to update its asResult list.
  • During the time that the pointers of the exemplary node 400 for the new node are being established there are no pointers pointing to the new node. Therefore, no nodes can access the new node and there is no need to lock the new node. Furthermore, there is no need to lock either the Case node or the Result node while the new node is being established since their data is not changing during this period.
  • When the pointers of the new node are established the asCase list of the Case node and the asResult list of the root node may be updated one at a time so that only one or the other of the two are locked. Therefore, a very important feature of the system and method of the present invention is that only one node is locked at a time for this thread. All of the remaining nodes in the interlocking trees datastores 250, 290 may remain unlocked. This reduces the scope of the node locking to the minimum amount possible and permits faster operation of the interlocking trees datastore 250, 290.
  • Referring now to FIG. 7, there is shown the node locking procedure 700 for preventing access to nodes which are involved in the construction of a new node in an interlocking trees datastore 250, 290. Within the node locking procedure 700 the current node is locked by a thread as shown in block 702 by setting the lock flag in its lock definition field 413.
  • A determination is then made in decision 706 whether the new node has been added prior to the locking operations of block 702. This determination should be made in the node locking procedure 700 since it is possible for a thread other than the instant thread to begin building the new node between the time that the instant thread determines that the new node must be built and the time that the current node is actually locked according to block 702. The determination of decision 706 can be made by checking the asCase list of the current node.
  • If the new node has not been added during the foregoing time period as determined in decision 706 of the node locking procedure 700, the new node may be created as shown in block 710. Since the current node was locked in block 702 a pointer to the new node can be added to the asCase list of the current node at this time as shown in block 714. Regardless of whether a new node is created in block 710 the current node is unlocked as shown in block 716.
  • If a new node was created in block 710 as determined in decision 718, the root node for the new node is locked as shown in block 720. A pointer to the new node is added to the asResult list of the root node as shown in block 722. The root node can then be unlocked as shown in block 724. Thus, within the node locking procedure 700 only one node is locked at a time.
  • While the invention has been described in detail and with reference to specific examples thereof, it will be apparent to one skilled in the art that various changes and modifications can be made therein without departing from the spirit and scope thereof.

Claims (63)

1. A method of recording information in an interlocking trees datastore having a plurality of K paths, comprising:
receiving a data stream input sequence;
traversing said interlocking trees datastore in accordance with said received data stream input sequence for recording said received data stream input sequence within said interlocking trees datastore;
first determining whether a K path of said plurality of K paths matches said data stream input sequence; and
second determining that a new sequence has been encountered in accordance with said first determining.
2. The method of recording information in an interlocking trees datastore of claim 1, comprising building new structure in accordance with said second determining.
3. The method of recording information in an interlocking trees datastore of claim 1, wherein a path of said plurality of paths has a plurality of K nodes including a current K node, an adjacent K node that is adjacent to said current K node, said adjacent K node having a non-adjacent K node that is not in the asCase list of said current K node further comprising traversing said interlocking trees datastore by determining a match between said non-adjacent K node an adjacent K node.
4. The method of recording information in an interlocking trees datastore of claim 3, wherein the non-adjacent K node is the K node matched to an input particle
5. The method of recording information in an interlocking trees datastore of claim 3, wherein said interlocking trees datastore includes a plurality of levels and the non-adjacent K node is some other K node of said plurality of K nodes.
6. The method of recording information in an interlocking trees datastore of claim 3, further comprising traversing said interlocking trees datastore in accordance with an asCase bi-directional link of a K node of said plurality of K nodes.
7. The method of recording information in an interlocking trees datastore of claim 6, further comprising determining said match in accordance with a Result pointer of a K node of said plurality of K nodes.
8. The method of recording information in an interlocking trees datastore of claim 3, further comprising traversing from a current K node location to an adjacent K node in accordance with said non-adjacent K node.
9. The method of recording information in an interlocking trees datastore of claim 8, further comprising determining said adjacent K node in accordance with an asCase list of said current K node.
10. The method of recording information in an interlocking trees datastore of claim 8, further comprising determining a match between said adjacent K node and a further K node.
11. The method of recording information in an interlocking trees datastore of claim 10, further comprising determining said match between said adjacent K node and an end node or a K node representing a further particle.
12. The method of recording information in an interlocking trees datastore of claim 8, wherein said current K node is a beginning of thought node.
13. The method of recording information in an interlocking trees datastore of claim 8, wherein said current K node is a subcomponent K node of said path of said plurality of paths.
14. The method of recording information in an interlocking trees datastore of claim 1, further comprising processing a new sequence event in accordance with said second determining.
15. The method for recording information in an interlocking trees datastore of claim 14, wherein said interlocking trees datastore is constructed by a KEngine further comprising processing a new sequence event by way of said KEngine.
16. The method for recording information in an interlocking trees datastore of claim 14, wherein said KEngine has a procedure further comprising processing a new sequence event in accordance with established parameter settings.
17. The method of recording information in an interlocking trees datastore of claim 16, further comprising reporting said new sequence in accordance with said established parameter settings.
18. The method for recording information in an interlocking trees datastore of claim 16, further comprising creating new KStore structure in accordance with said established parameter setting.
19. The method for recording information in an interlocking trees datastore of claim 16, further comprising instantiating a process in accordance with said established parameter setting.
20. The method for recording information in an interlocking trees datastore of claim 16, wherein said established parameter setting is obtained from a graphical user interface.
21. The method for recording information in an interlocking trees datastore of claim 16, wherein said established parameter setting is obtained from a calling procedure.
22. The method for recording information in an interlocking trees datastore of claim 1, further comprising determining a possible next K node in accordance with said first determining.
23. The method for recording information in an interlocking trees datastore of claim 22, further comprising:
determining a plurality of possible next K nodes wherein at least one possible next K node of said plurality of possible next K nodes has a node count; and
selecting said possible next K node in accordance with said node count.
24. The method for recording information in an interlocking trees datastore of claim 22, further comprising determining said possible next K in accordance with a context.
25. The method for recording information in an interlocking trees datastore of claim 22, further comprising determining said possible next K in accordance with how recently a node on the asCase list of a current node has been created.
26. The method for recording information in an interlocking trees datastore of claim 22, further comprising determining said possible next K in accordance with how recently a node on the asCase list of a current node has been accessed.
27. The method for recording information in an interlocking trees datastore of claim 2, further comprising locking said matched K node when said new K node is built.
28. The method for recording information in an interlocking trees datastore of claim 1, further comprising reporting said new sequence in accordance with said first determining.
29. The method for recording information in an interlocking trees datastore of claim 28, wherein said interlocking trees datastore is provided with a KEngine further comprising reporting said new sequence by way of said KEngine.
30. The method for recording information in an interlocking trees datastore of claim 28, wherein said interlocking trees datastore is accessed by way of a calling procedure further comprising reporting said new sequence by way of returning information to said calling procedure.
31. A method for recording information in an interlocking trees datastore having at least one K node, comprising:
receiving a sequence having at least one input into said interlocking trees datastore;
first determining a mismatch between said at least one K node and said at least one input; and
second determining that said sequence is a new sequence in accordance with said determined mismatch.
32. The method for recording information in an interlocking trees datastore of claim 31, wherein said interlocking trees datastore includes a plurality of K paths having an end node and a plurality of K nodes and said sequence has a plurality of inputs associated with a plurality of sensors, further comprising:
receiving said sequence having said plurality of inputs into said interlocking trees datastore;
traversing at least one K path of said plurality of K paths in accordance with said plurality of inputs to provide a traversed K path;
determining a mismatch between a K node of said plurality of K nodes and an end node or a sensor to provide a mismatched K node and a mismatched end node or sensor;
determining that said sequence is a new sequence in accordance with said determined mismatch.
33. The method for recording information in an interlocking trees datastore of claim 32, further comprising traversing said at least one K path by determining a match between a current K node of said at least one K path and a matched sensor K node with an associated matched input.
34. The method for recording information in an interlocking trees datastore of claim 33, further comprising determining an adjacent K node of said current K node.
35. The method for recording information in an interlocking trees datastore of claim 34, further comprising determining said adjacent K node in accordance with an asCase list pointer of said current K node.
36. The method for recording information in an interlocking trees datastore of claim 34, further comprising first determining in accordance with a Result pointer of said adjacent K node.
37. The method for recording information in an interlocking trees datastore of claim 31, further comprising reporting said new sequence in accordance with said first determining.
38. The method for recording information in an interlocking trees datastore of claim 37, wherein said interlocking trees datastore is provided with a KEngine further comprising reporting said new sequence by way of said KEngine.
39. The method for recording information in an interlocking trees datastore of claim 37, wherein said KEngine has a calling procedure further comprising reporting said new sequence by having said KEngine return information to said calling procedure.
40. The method for recording information in an interlocking trees datastore of claim 39, wherein said interlocking trees datastore has a learn engine and said calling procedure comprises said learn engine.
41. The method for recording information in an interlocking trees datastore of claim 39, wherein said calling procedure comprises a query.
42. The method for recording information in an interlocking trees datastore of claim 31, further comprising reporting an event in accordance with said first determining.
43. The method for recording information in an interlocking trees datastore of claim 42, wherein said event comprises recording a best guess in said interlocking trees datastore.
44. The method for recording information in an interlocking trees datastore of claim 31, further comprising logging a message in accordance with said first determining.
45. The method for recording information in an interlocking trees datastore of claim 31, further comprising setting a lock flag in accordance with said first determining.
46. The method for recording information in an interlocking trees datastore of claim 31, further comprising determining a possible next K node in accordance with said first determining.
47. The method for recording information in an interlocking trees datastore of claim 46, further comprising:
determining a plurality of possible next K nodes, at least one possible next K node of said plurality of possible next K nodes having a node count; and
selecting said possible next K node in accordance with said node count.
48. The method for recording information in an interlocking trees datastore of claim 31, further comprising building a new K node in accordance with said first determining.
49. The method for recording information in an interlocking trees datastore of claim 48, further comprising locking said current K node when said new K node is built.
50. The method for recording information in an interlocking trees datastore of claim 49, further comprising establishing Case bi-directional links between said current K node and said new K node when said current K node is locked.
51. The method for recording information in an interlocking trees datastore of claim 49, further comprising locking said the Result node of said new K node when said new K node is built.
52. The method for recording information in an interlocking trees datastore of claim 49, further comprising updating the asResult list of said Result node of said new K node when said Result node of said new K node is locked.
53. The method for recording information in an interlocking trees datastore of claim 48, wherein said new K node has a new node count further comprising initializing said new node count.
54. A system for recording information in an interlocking trees datastore having a plurality of K paths, comprising:
a received data stream input sequence;
a traversal of said interlocking trees datastore in accordance with said received data stream input sequence for recording said received data stream input sequence within said interlocking trees datastore;
a first determination whether a K path of said plurality of K paths matches said data stream input sequence; and
a second determination that a new sequence has been encountered in accordance with said first determination.
55. The system for recording information in an interlocking trees datastore of claim 54, comprising a new structure built in accordance with said second determination.
56. The system for recording information in an interlocking trees datastore of claim 54, wherein a path of said plurality of paths has a plurality of K nodes including a current K node, an adjacent K node that is adjacent to said current K node, said adjacent K node having a non-adjacent K node that is not in the asCase list of said current K node further comprising a traversal of said interlocking trees datastore determined by a match between said non-adjacent K node an adjacent K node.
57. The system for recording information in an interlocking trees datastore of claim 56, wherein the non-adjacent K node is the K node matched to an input particle
58. The system for recording information in an interlocking trees datastore of claim 56, wherein said interlocking trees datastore includes a plurality of levels and the non-adjacent K node is some other K node of said plurality of K nodes.
59. The system for recording information in an interlocking trees datastore of claim 56, further comprising a traversal of said interlocking trees datastore in accordance with an asCase bi-directional link of a K node of said plurality of K nodes.
60. The system for recording information in an interlocking trees datastore of claim 59, further comprising a determination of said match in accordance with a Result pointer of a K node of said plurality of K nodes.
61. The system for recording information in an interlocking trees datastore of claim 56, further comprising a traversal from a current K node location to an adjacent K node in accordance with said non-adjacent K node.
62. The system for recording information in an interlocking trees datastore of claim 61, further comprising a determination of said adjacent K node in accordance with an asCase list of said current K node.
63. The system for recording information in an interlocking trees datastore of claim 61, further comprising a determination of a match between said adjacent K node and a further K node.
US11/185,620 2003-09-19 2005-07-20 Method for processing new sequences being recorded into an interlocking trees datastore Abandoned US20060101018A1 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
US11/185,620 US20060101018A1 (en) 2004-11-08 2005-07-20 Method for processing new sequences being recorded into an interlocking trees datastore
US11/384,777 US8516004B2 (en) 2003-09-19 2006-03-20 Method for processing K node count fields using an intensity variable
US11/449,092 US20110314022A9 (en) 2003-09-19 2006-06-08 K engine - process count after build in threads
US11/985,623 US8010572B1 (en) 2003-09-19 2007-11-16 Kstore scenario simulator processor and XML file

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US62592204P 2004-11-08 2004-11-08
US11/185,620 US20060101018A1 (en) 2004-11-08 2005-07-20 Method for processing new sequences being recorded into an interlocking trees datastore

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US10/666,382 Continuation-In-Part US7158975B2 (en) 2003-03-10 2003-09-19 System and method for storing and accessing data in an interlocking trees datastore

Publications (1)

Publication Number Publication Date
US20060101018A1 true US20060101018A1 (en) 2006-05-11

Family

ID=39096068

Family Applications (2)

Application Number Title Priority Date Filing Date
US11/185,627 Active 2026-09-12 US7418445B1 (en) 2004-11-08 2005-07-20 Method for reducing the scope of the K node construction lock
US11/185,620 Abandoned US20060101018A1 (en) 2003-09-19 2005-07-20 Method for processing new sequences being recorded into an interlocking trees datastore

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US11/185,627 Active 2026-09-12 US7418445B1 (en) 2004-11-08 2005-07-20 Method for reducing the scope of the K node construction lock

Country Status (2)

Country Link
US (2) US7418445B1 (en)
CN (3) CN101128848A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100169372A1 (en) * 2008-12-31 2010-07-01 Mazzagatti Jane C Adding and subtracting kstores
US20100169373A1 (en) * 2008-12-31 2010-07-01 Mazzagatti Jane C Updating values of elemental root nodes in kstore
US20150089435A1 (en) * 2013-09-25 2015-03-26 Microth, Inc. System and method for prediction and recognition of input sequences

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111813533B (en) * 2020-09-11 2020-12-11 腾讯科技(深圳)有限公司 Dynamic management method and device for model instantiation and storage medium

Citations (52)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4464650A (en) * 1981-08-10 1984-08-07 Sperry Corporation Apparatus and method for compressing data signals and restoring the compressed data signals
US4823310A (en) * 1987-08-10 1989-04-18 Wang Laboratories, Inc. Device for enabling concurrent access of indexed sequential data files
US5151697A (en) * 1990-10-15 1992-09-29 Board Of Regents Of The University Of Washington Data structure management tagging system
US5488717A (en) * 1992-07-06 1996-01-30 1St Desk Systems, Inc. MTree data structure for storage, indexing and retrieval of information
US5534861A (en) * 1993-04-16 1996-07-09 International Business Machines Corporation Method and system for adaptively building a static Ziv-Lempel dictionary for database compression
US5592667A (en) * 1991-05-29 1997-01-07 Triada, Ltd. Method of storing compressed data for accelerated interrogation
US5634133A (en) * 1992-01-17 1997-05-27 Compaq Computer Corporation Constraint based graphics system
US5640551A (en) * 1993-04-14 1997-06-17 Apple Computer, Inc. Efficient high speed trie search process
US5893102A (en) * 1996-12-06 1999-04-06 Unisys Corporation Textual database management, storage and retrieval system utilizing word-oriented, dictionary-based data compression/decompression
US5894311A (en) * 1995-08-08 1999-04-13 Jerry Jackson Associates Ltd. Computer-based visual data evaluation
US5918229A (en) * 1996-11-22 1999-06-29 Mangosoft Corporation Structured data storage using globally addressable memory
US5930805A (en) * 1995-12-01 1999-07-27 Sand Technology Systems International, Inc. Storage and retrieval of ordered sets of keys in a compact 0-complete tree
US5970490A (en) * 1996-11-05 1999-10-19 Xerox Corporation Integration platform for heterogeneous databases
US6061692A (en) * 1997-11-04 2000-05-09 Microsoft Corporation System and method for administering a meta database as an integral component of an information server
US6144962A (en) * 1996-10-15 2000-11-07 Mercury Interactive Corporation Visualization of web sites and hierarchical data structures
US6233575B1 (en) * 1997-06-24 2001-05-15 International Business Machines Corporation Multilevel taxonomy based on features derived from training documents classification using fisher values as discrimination values
US6275801B1 (en) * 1998-11-03 2001-08-14 International Business Machines Corporation Non-leaf node penalty score assignment system and method for improving acoustic fast match speed in large vocabulary systems
US6356902B1 (en) * 1998-07-28 2002-03-12 Matsushita Electric Industrial Co., Ltd. Method and system for storage and retrieval of multimedia objects
US6373484B1 (en) * 1999-01-21 2002-04-16 International Business Machines Corporation Method and system for presenting data structures graphically
US6374210B1 (en) * 1998-11-30 2002-04-16 U.S. Philips Corporation Automatic segmentation of a text
US20020120598A1 (en) * 2001-02-26 2002-08-29 Ori Software Development Ltd. Encoding semi-structured data for efficient search and browse
US20020143783A1 (en) * 2000-02-28 2002-10-03 Hyperroll Israel, Limited Method of and system for data aggregation employing dimensional hierarchy transformation
US20020143735A1 (en) * 2001-03-30 2002-10-03 Akin Ayi User scope-based data organization system
US20020147721A1 (en) * 2001-04-04 2002-10-10 Pankaj Gupta Compact data structures for pipelined message forwarding lookups
US6477683B1 (en) * 1999-02-05 2002-11-05 Tensilica, Inc. Automated processor generation system for designing a configurable processor and method for the same
US6505205B1 (en) * 1999-05-29 2003-01-07 Oracle Corporation Relational database system for storing nodes of a hierarchical index of multi-dimensional data in a first module and metadata regarding the index in a second module
US20030009443A1 (en) * 2001-06-15 2003-01-09 Oleg Yatviskiy Generic data aggregation
US6591272B1 (en) * 1999-02-25 2003-07-08 Tricoron Networks, Inc. Method and apparatus to make and transmit objects from a database on a server computer to a client computer
US6635089B1 (en) * 1999-01-13 2003-10-21 International Business Machines Corporation Method for producing composite XML document object model trees using dynamic data retrievals
US20030217335A1 (en) * 2002-05-17 2003-11-20 Verity, Inc. System and method for automatically discovering a hierarchy of concepts from a corpus of documents
US20030223456A1 (en) * 2002-06-04 2003-12-04 Dimambro Francesco R. Apparatus and method for steering a communication to an open stream
US6675169B1 (en) * 1999-09-07 2004-01-06 Microsoft Corporation Method and system for attaching information to words of a trie
US20040006563A1 (en) * 2002-06-26 2004-01-08 Arthur Zwiegincew Manipulating schematized data in a database
US6678409B1 (en) * 2000-01-14 2004-01-13 Microsoft Corporation Parameterized word segmentation of unsegmented text
US20040034525A1 (en) * 2002-08-15 2004-02-19 Pentheroudakis Joseph E. Method and apparatus for expanding dictionaries during parsing
US6704729B1 (en) * 2000-05-19 2004-03-09 Microsoft Corporation Retrieval of relevant information categories
US6711585B1 (en) * 1999-06-15 2004-03-23 Kanisa Inc. System and method for implementing a knowledge management system
US20040064304A1 (en) * 2002-07-03 2004-04-01 Word Data Corp Text representation and method
US6738762B1 (en) * 2001-11-26 2004-05-18 At&T Corp. Multidimensional substring selectivity estimation using set hashing of cross-counts
US6751622B1 (en) * 1999-01-21 2004-06-15 Oracle International Corp. Generic hierarchical structure with hard-pegging of nodes with dependencies implemented in a relational database
US6768995B2 (en) * 2002-09-30 2004-07-27 Adaytum, Inc. Real-time aggregation of data within an enterprise planning environment
US20040169654A1 (en) * 2003-02-27 2004-09-02 Teracruz, Inc. System and method for tree map visualization for database performance data
US6799184B2 (en) * 2001-06-21 2004-09-28 Sybase, Inc. Relational database system providing XML query support
US20040243557A1 (en) * 2003-05-30 2004-12-02 International Business Machines Corporation System, method and computer program product for performing unstructured information management and automatic text analysis, including a search operator functioning as a weighted and (WAND)
US20050102255A1 (en) * 2003-11-06 2005-05-12 Bultman David C. Computer-implemented system and method for handling stored data
US6920608B1 (en) * 1999-05-21 2005-07-19 E Numerate Solutions, Inc. Chart view for reusable data markup language
US20050179684A1 (en) * 2004-02-17 2005-08-18 Wallace James H. Data exploration system
US6938204B1 (en) * 2000-08-31 2005-08-30 International Business Machines Corporation Array-based extensible document storage format
US20050234709A1 (en) * 2001-09-26 2005-10-20 Klavans Judith L System and method of generating dictionary entries
US7027052B1 (en) * 2001-08-13 2006-04-11 The Hive Group Treemap display with minimum cell size
US7228296B2 (en) * 2003-03-27 2007-06-05 Fujitsu Limited Devices for interpreting and retrieving XML documents, methods of interpreting and retrieving XML documents, and computer product
US7257599B2 (en) * 2001-02-01 2007-08-14 Sap Aktiengesellschaft Data organization in a fast query system

Family Cites Families (68)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4286330A (en) 1976-04-07 1981-08-25 Isaacson Joel D Autonomic string-manipulation system
US5245337A (en) 1991-05-29 1993-09-14 Triada, Ltd. Data compression with pipeline processors having separate memories
US6339767B1 (en) 1997-06-02 2002-01-15 Aurigin Systems, Inc. Using hyperbolic trees to visualize data generated by patent-centric and group-oriented data processing
US5630125A (en) 1994-05-23 1997-05-13 Zellweger; Paul Method and apparatus for information management using an open hierarchical data structure
US6160549A (en) 1994-07-29 2000-12-12 Oracle Corporation Method and apparatus for generating reports using declarative tools
US5768564A (en) 1994-10-07 1998-06-16 Tandem Computers Incorporated Method and apparatus for translating source code from one high-level computer language to another
US6286002B1 (en) 1996-01-17 2001-09-04 @Yourcommand System and method for storing and searching buy and sell information of a marketplace
US5978794A (en) 1996-04-09 1999-11-02 International Business Machines Corporation Method and system for performing spatial similarity joins on high-dimensional points
US5870735A (en) 1996-05-01 1999-02-09 International Business Machines Corporation Method and system for generating a decision-tree classifier in parallel in a multi-processor system
US5829004A (en) 1996-05-20 1998-10-27 Au; Lawrence Device for storage and retrieval of compact contiguous tree index records
US6745194B2 (en) 2000-08-07 2004-06-01 Alta Vista Company Technique for deleting duplicate records referenced in an index of a database
US5963965A (en) 1997-02-18 1999-10-05 Semio Corporation Text processing and retrieval system and method
US6102958A (en) 1997-04-08 2000-08-15 Drexel University Multiresolutional decision support system
US6389406B1 (en) 1997-07-30 2002-05-14 Unisys Corporation Semiotic decision making system for responding to natural language queries and components thereof
US5966709A (en) 1997-09-26 1999-10-12 Triada, Ltd. Method of optimizing an N-gram memory structure
US6018734A (en) 1997-09-29 2000-01-25 Triada, Ltd. Multi-dimensional pattern analysis
US5983232A (en) 1997-09-29 1999-11-09 Triada, Ltd. Virtual structured information system
US6029170A (en) 1997-11-25 2000-02-22 International Business Machines Corporation Hybrid tree array data structure and method
US6341281B1 (en) 1998-04-14 2002-01-22 Sybase, Inc. Database system with methods for optimizing performance of correlated subqueries by reusing invariant results of operator tree
US6138117A (en) 1998-04-29 2000-10-24 International Business Machines Corporation Method and system for mining long patterns from databases
US6115715A (en) 1998-06-29 2000-09-05 Sun Microsystems, Inc. Transaction management in a configuration database
US6769124B1 (en) 1998-07-22 2004-07-27 Cisco Technology, Inc. Persistent storage of information objects
US6826556B1 (en) 1998-10-02 2004-11-30 Ncr Corporation Techniques for deploying analytic models in a parallel
EP1129417A4 (en) 1998-12-04 2004-06-30 Technology Enabling Company Ll Systems and methods for organizing data
US6360224B1 (en) 1999-04-23 2002-03-19 Microsoft Corporation Fast extraction of one-way and two-way counts from sparse data
US6470344B1 (en) 1999-05-29 2002-10-22 Oracle Corporation Buffering a hierarchical index of multi-dimensional data
US6505184B1 (en) 1999-07-30 2003-01-07 Unisys Corporation Autognomic decision making system and method
US6278987B1 (en) 1999-07-30 2001-08-21 Unisys Corporation Data processing method for a semiotic decision making system used for responding to natural language queries and other purposes
US6453314B1 (en) 1999-07-30 2002-09-17 International Business Machines Corporation System and method for selective incremental deferred constraint processing after bulk loading data
US6470277B1 (en) 1999-07-30 2002-10-22 Agy Therapeutics, Inc. Techniques for facilitating identification of candidate genes
US6394263B1 (en) 1999-07-30 2002-05-28 Unisys Corporation Autognomic decision making system and method
US6381600B1 (en) 1999-09-22 2002-04-30 International Business Machines Corporation Exporting and importing of data in object-relational databases
US6662185B1 (en) 1999-10-15 2003-12-09 Dekalb Genetics Corporation Methods and systems for plant performance analysis
US6615202B1 (en) 1999-12-01 2003-09-02 Telesector Resources Group, Inc. Method for specifying a database import/export operation through a graphical user interface
AUPQ498500A0 (en) 2000-01-07 2000-02-03 Flixco Pty Limited Information system
US6760720B1 (en) 2000-02-25 2004-07-06 Pedestrian Concepts, Inc. Search-on-the-fly/sort-on-the-fly search engine for searching databases
US6900807B1 (en) 2000-03-08 2005-05-31 Accenture Llp System for generating charts in a knowledge management tool
US6473757B1 (en) 2000-03-28 2002-10-29 Lucent Technologies Inc. System and method for constraint based sequential pattern mining
US6831668B2 (en) 2000-04-03 2004-12-14 Business Objects, S.A. Analytical reporting on top of multidimensional data model
US6952736B1 (en) 2000-04-25 2005-10-04 Microsoft Corporation Object-based locking mechanism
JP3562572B2 (en) 2000-05-02 2004-09-08 インターナショナル・ビジネス・マシーンズ・コーポレーション Detect and track new items and new classes in database documents
US20020138353A1 (en) 2000-05-03 2002-09-26 Zvi Schreiber Method and system for analysis of database records having fields with sets
US6965892B1 (en) 2000-05-31 2005-11-15 International Business Machines Corporation Method, system and program products for concurrently accessing a global data repository by multithreaded clients
US6681225B1 (en) 2000-05-31 2004-01-20 International Business Machines Corporation Method, system and program products for concurrent write access to a global data repository
US6581063B1 (en) 2000-06-15 2003-06-17 International Business Machines Corporation Method and apparatus for maintaining a linked list
US6684207B1 (en) 2000-08-01 2004-01-27 Oracle International Corp. System and method for online analytical processing
US6868414B2 (en) 2001-01-03 2005-03-15 International Business Machines Corporation Technique for serializing data structure updates and retrievals without requiring searchers to use locks
US6959303B2 (en) 2001-01-17 2005-10-25 Arcot Systems, Inc. Efficient searching techniques
US6691109B2 (en) 2001-03-22 2004-02-10 Turbo Worx, Inc. Method and apparatus for high-performance sequence comparison
US6748378B1 (en) 2001-04-20 2004-06-08 Oracle International Corporation Method for retrieving data from a database
WO2002090978A1 (en) 2001-05-04 2002-11-14 Paracel, Inc. Method and apparatus for high-speed approximate sub-string searches
US6816856B2 (en) 2001-06-04 2004-11-09 Hewlett-Packard Development Company, L.P. System for and method of data compression in a valueless digital tree representing a bitset
US7117216B2 (en) 2001-06-07 2006-10-03 Sun Microsystems, Inc. Method and apparatus for runtime merging of hierarchical trees
KR100656528B1 (en) 2001-09-10 2006-12-12 한국과학기술원 Dynamic Update Cube and Hybrid Queries Search Method for Range-Sum Queries
DK200101619A (en) 2001-11-01 2003-05-02 Syntonetic Aps Automatic for template-based sequence production, as well as automatic sequence production method
US6904432B2 (en) * 2001-11-30 2005-06-07 Intelligent Medical Objects, Inc. Adaptive data manager
US6826568B2 (en) 2001-12-20 2004-11-30 Microsoft Corporation Methods and system for model matching
US6807541B2 (en) 2002-02-28 2004-10-19 International Business Machines Corporation Weak record locks in database query read processing
US7287033B2 (en) 2002-03-06 2007-10-23 Ori Software Development, Ltd. Efficient traversals over hierarchical data and indexing semistructured data
US6624762B1 (en) 2002-04-11 2003-09-23 Unisys Corporation Hardware-based, LZW data compression co-processor
US6694323B2 (en) 2002-04-25 2004-02-17 Sybase, Inc. System and methodology for providing compact B-Tree
US20040133590A1 (en) 2002-08-08 2004-07-08 Henderson Alex E. Tree data structure with range-specifying keys and associated methods and apparatuses
US7206776B2 (en) * 2002-08-15 2007-04-17 Microsoft Corporation Priority differentiated subtree locking
US7007027B2 (en) 2002-12-02 2006-02-28 Microsoft Corporation Algorithm for tree traversals using left links
US7424671B2 (en) 2003-05-16 2008-09-09 Justsystems Canada Inc. Methods and systems for enabling collaborative authoring of hierarchical documents
US8250107B2 (en) 2003-06-03 2012-08-21 Hewlett-Packard Development Company, L.P. Techniques for graph data structure management
US20050015383A1 (en) 2003-07-15 2005-01-20 Microsoft Corporation Method and system for accessing database objects in polyarchical relationships using data path expressions
US7349913B2 (en) 2003-08-21 2008-03-25 Microsoft Corporation Storage platform for organizing, searching, and sharing data

Patent Citations (54)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4464650A (en) * 1981-08-10 1984-08-07 Sperry Corporation Apparatus and method for compressing data signals and restoring the compressed data signals
US4823310A (en) * 1987-08-10 1989-04-18 Wang Laboratories, Inc. Device for enabling concurrent access of indexed sequential data files
US5151697A (en) * 1990-10-15 1992-09-29 Board Of Regents Of The University Of Washington Data structure management tagging system
US5592667A (en) * 1991-05-29 1997-01-07 Triada, Ltd. Method of storing compressed data for accelerated interrogation
US5634133A (en) * 1992-01-17 1997-05-27 Compaq Computer Corporation Constraint based graphics system
US5488717A (en) * 1992-07-06 1996-01-30 1St Desk Systems, Inc. MTree data structure for storage, indexing and retrieval of information
US5640551A (en) * 1993-04-14 1997-06-17 Apple Computer, Inc. Efficient high speed trie search process
US5534861A (en) * 1993-04-16 1996-07-09 International Business Machines Corporation Method and system for adaptively building a static Ziv-Lempel dictionary for database compression
US5894311A (en) * 1995-08-08 1999-04-13 Jerry Jackson Associates Ltd. Computer-based visual data evaluation
US5930805A (en) * 1995-12-01 1999-07-27 Sand Technology Systems International, Inc. Storage and retrieval of ordered sets of keys in a compact 0-complete tree
US6144962A (en) * 1996-10-15 2000-11-07 Mercury Interactive Corporation Visualization of web sites and hierarchical data structures
US5970490A (en) * 1996-11-05 1999-10-19 Xerox Corporation Integration platform for heterogeneous databases
US5918229A (en) * 1996-11-22 1999-06-29 Mangosoft Corporation Structured data storage using globally addressable memory
US5893102A (en) * 1996-12-06 1999-04-06 Unisys Corporation Textual database management, storage and retrieval system utilizing word-oriented, dictionary-based data compression/decompression
US6233575B1 (en) * 1997-06-24 2001-05-15 International Business Machines Corporation Multilevel taxonomy based on features derived from training documents classification using fisher values as discrimination values
US6061692A (en) * 1997-11-04 2000-05-09 Microsoft Corporation System and method for administering a meta database as an integral component of an information server
US6356902B1 (en) * 1998-07-28 2002-03-12 Matsushita Electric Industrial Co., Ltd. Method and system for storage and retrieval of multimedia objects
US6275801B1 (en) * 1998-11-03 2001-08-14 International Business Machines Corporation Non-leaf node penalty score assignment system and method for improving acoustic fast match speed in large vocabulary systems
US6374210B1 (en) * 1998-11-30 2002-04-16 U.S. Philips Corporation Automatic segmentation of a text
US6635089B1 (en) * 1999-01-13 2003-10-21 International Business Machines Corporation Method for producing composite XML document object model trees using dynamic data retrievals
US6373484B1 (en) * 1999-01-21 2002-04-16 International Business Machines Corporation Method and system for presenting data structures graphically
US6751622B1 (en) * 1999-01-21 2004-06-15 Oracle International Corp. Generic hierarchical structure with hard-pegging of nodes with dependencies implemented in a relational database
US6477683B1 (en) * 1999-02-05 2002-11-05 Tensilica, Inc. Automated processor generation system for designing a configurable processor and method for the same
US6591272B1 (en) * 1999-02-25 2003-07-08 Tricoron Networks, Inc. Method and apparatus to make and transmit objects from a database on a server computer to a client computer
US6920608B1 (en) * 1999-05-21 2005-07-19 E Numerate Solutions, Inc. Chart view for reusable data markup language
US20050198042A1 (en) * 1999-05-21 2005-09-08 E-Numerate Solutions, Inc. Chart view for reusable data markup language
US6505205B1 (en) * 1999-05-29 2003-01-07 Oracle Corporation Relational database system for storing nodes of a hierarchical index of multi-dimensional data in a first module and metadata regarding the index in a second module
US6711585B1 (en) * 1999-06-15 2004-03-23 Kanisa Inc. System and method for implementing a knowledge management system
US6675169B1 (en) * 1999-09-07 2004-01-06 Microsoft Corporation Method and system for attaching information to words of a trie
US6678409B1 (en) * 2000-01-14 2004-01-13 Microsoft Corporation Parameterized word segmentation of unsegmented text
US20020143783A1 (en) * 2000-02-28 2002-10-03 Hyperroll Israel, Limited Method of and system for data aggregation employing dimensional hierarchy transformation
US20050060325A1 (en) * 2000-02-28 2005-03-17 Reuven Bakalash Method of and apparatus for data aggregation utilizing a multidimensional database and multi-stage data aggregation operations
US6704729B1 (en) * 2000-05-19 2004-03-09 Microsoft Corporation Retrieval of relevant information categories
US6938204B1 (en) * 2000-08-31 2005-08-30 International Business Machines Corporation Array-based extensible document storage format
US7257599B2 (en) * 2001-02-01 2007-08-14 Sap Aktiengesellschaft Data organization in a fast query system
US20020120598A1 (en) * 2001-02-26 2002-08-29 Ori Software Development Ltd. Encoding semi-structured data for efficient search and browse
US20020143735A1 (en) * 2001-03-30 2002-10-03 Akin Ayi User scope-based data organization system
US20020147721A1 (en) * 2001-04-04 2002-10-10 Pankaj Gupta Compact data structures for pipelined message forwarding lookups
US20030009443A1 (en) * 2001-06-15 2003-01-09 Oleg Yatviskiy Generic data aggregation
US6799184B2 (en) * 2001-06-21 2004-09-28 Sybase, Inc. Relational database system providing XML query support
US7027052B1 (en) * 2001-08-13 2006-04-11 The Hive Group Treemap display with minimum cell size
US20050234709A1 (en) * 2001-09-26 2005-10-20 Klavans Judith L System and method of generating dictionary entries
US6738762B1 (en) * 2001-11-26 2004-05-18 At&T Corp. Multidimensional substring selectivity estimation using set hashing of cross-counts
US20030217335A1 (en) * 2002-05-17 2003-11-20 Verity, Inc. System and method for automatically discovering a hierarchy of concepts from a corpus of documents
US20030223456A1 (en) * 2002-06-04 2003-12-04 Dimambro Francesco R. Apparatus and method for steering a communication to an open stream
US20040006563A1 (en) * 2002-06-26 2004-01-08 Arthur Zwiegincew Manipulating schematized data in a database
US20040064304A1 (en) * 2002-07-03 2004-04-01 Word Data Corp Text representation and method
US20040034525A1 (en) * 2002-08-15 2004-02-19 Pentheroudakis Joseph E. Method and apparatus for expanding dictionaries during parsing
US6768995B2 (en) * 2002-09-30 2004-07-27 Adaytum, Inc. Real-time aggregation of data within an enterprise planning environment
US20040169654A1 (en) * 2003-02-27 2004-09-02 Teracruz, Inc. System and method for tree map visualization for database performance data
US7228296B2 (en) * 2003-03-27 2007-06-05 Fujitsu Limited Devices for interpreting and retrieving XML documents, methods of interpreting and retrieving XML documents, and computer product
US20040243557A1 (en) * 2003-05-30 2004-12-02 International Business Machines Corporation System, method and computer program product for performing unstructured information management and automatic text analysis, including a search operator functioning as a weighted and (WAND)
US20050102255A1 (en) * 2003-11-06 2005-05-12 Bultman David C. Computer-implemented system and method for handling stored data
US20050179684A1 (en) * 2004-02-17 2005-08-18 Wallace James H. Data exploration system

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100169372A1 (en) * 2008-12-31 2010-07-01 Mazzagatti Jane C Adding and subtracting kstores
US20100169373A1 (en) * 2008-12-31 2010-07-01 Mazzagatti Jane C Updating values of elemental root nodes in kstore
US8010569B2 (en) * 2008-12-31 2011-08-30 Unisys Corporation Adding and subtracting KStores
US20150089435A1 (en) * 2013-09-25 2015-03-26 Microth, Inc. System and method for prediction and recognition of input sequences

Also Published As

Publication number Publication date
CN101438243A (en) 2009-05-20
CN101438271A (en) 2009-05-20
CN101128848A (en) 2008-02-20
US7418445B1 (en) 2008-08-26

Similar Documents

Publication Publication Date Title
US7072876B1 (en) System and method for mining execution traces with finite automata
US6918124B1 (en) Query trees including or nodes for event filtering
Maggi et al. Detecting intrusions through system call sequence and argument analysis
US7496592B2 (en) Systems and methods for maintaining closed frequent itemsets over a data stream sliding window
AU2005241532B2 (en) Identification of input files using reference files associated with nodes of a sparse binary tree
US10977152B2 (en) Rule-based continuous diagnosing and alerting from application logs
US20030093517A1 (en) System and method for uniform resource locator filtering
US11184368B2 (en) Systems and methods for reporting computer security incidents
US20060101018A1 (en) Method for processing new sequences being recorded into an interlocking trees datastore
US7237237B2 (en) Designating an object for destruction
CN104954345A (en) Attack recognition method based on object analysis and device thereof
CN112261052A (en) SDN data plane abnormal behavior detection method and system based on flow rule analysis
US7908240B1 (en) Facilitated use of column and field data for field record universe in a knowledge store
CN111740946A (en) Webshell message detection method and device
WO2002003242A9 (en) Method for analyzing words
US7088864B2 (en) Generating a test suite from an abstract state machine
CN108307001B (en) MAC address aging method and device and electronic equipment
CN114707145A (en) Legiong software detection method based on Fanotify mechanism
Lo et al. Bidirectional mining of non-redundant recurrent rules from a sequence database
O'Connor An unpredictability approach to finite-state randomness
KR101003502B1 (en) Signature String clustering Method Based on the Resemblance and Containment in the Sequence
CN114553580B (en) Network attack detection method and device based on rule generalization and attack reconstruction
Ning et al. TIAA: A visual toolkit for intrusion alert analysis
Parys How Many Numbers Can a Lambda-Term Contain?
Long et al. Adaptive similarity metrics in case-based reasoning

Legal Events

Date Code Title Description
AS Assignment

Owner name: UNISYS CORPORATION, PENNSYLVANIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:MAZZAGATTI, JANE CAMPBELL;CLAAR, JANE VAN KEUREN;RAJCAN, STEVEN L.;REEL/FRAME:016802/0004

Effective date: 20050718

AS Assignment

Owner name: CITIBANK, N.A.,NEW YORK

Free format text: SECURITY AGREEMENT;ASSIGNORS:UNISYS CORPORATION;UNISYS HOLDING CORPORATION;REEL/FRAME:018003/0001

Effective date: 20060531

Owner name: CITIBANK, N.A., NEW YORK

Free format text: SECURITY AGREEMENT;ASSIGNORS:UNISYS CORPORATION;UNISYS HOLDING CORPORATION;REEL/FRAME:018003/0001

Effective date: 20060531

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION

AS Assignment

Owner name: UNISYS CORPORATION, PENNSYLVANIA

Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:CITIBANK, N.A.;REEL/FRAME:023312/0044

Effective date: 20090601

Owner name: UNISYS HOLDING CORPORATION, DELAWARE

Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:CITIBANK, N.A.;REEL/FRAME:023312/0044

Effective date: 20090601

Owner name: UNISYS CORPORATION,PENNSYLVANIA

Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:CITIBANK, N.A.;REEL/FRAME:023312/0044

Effective date: 20090601

Owner name: UNISYS HOLDING CORPORATION,DELAWARE

Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:CITIBANK, N.A.;REEL/FRAME:023312/0044

Effective date: 20090601

AS Assignment

Owner name: UNISYS CORPORATION, PENNSYLVANIA

Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:CITIBANK, N.A.;REEL/FRAME:023263/0631

Effective date: 20090601

Owner name: UNISYS HOLDING CORPORATION, DELAWARE

Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:CITIBANK, N.A.;REEL/FRAME:023263/0631

Effective date: 20090601

Owner name: UNISYS CORPORATION,PENNSYLVANIA

Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:CITIBANK, N.A.;REEL/FRAME:023263/0631

Effective date: 20090601

Owner name: UNISYS HOLDING CORPORATION,DELAWARE

Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:CITIBANK, N.A.;REEL/FRAME:023263/0631

Effective date: 20090601