WO2003077151A2 - A dynamic dictionary and term repository system - Google Patents

A dynamic dictionary and term repository system

Publication number
WO2003077151A2
Authority
WO
WIPO (PCT)
Prior art keywords
term
repository
terms
term repository
data
Prior art date
Application number
PCT/US2003/006766
Other languages
French (fr)
Other versions
WO2003077151A3 (en)
Inventor
Robert Emmons Haskell
John Andrew Heil
James Cassidy
Original Assignee
Siemens Medical Solutions Health Services Corporation
Priority date
Filing date
Publication date
Application filed by Siemens Medical Solutions Health Services Corporation
Priority to EP03744193A (EP1481332A2)
Priority to JP2003575296A (JP2005519411A)
Priority to CA002477689A (CA2477689A1)
Publication of WO2003077151A2
Publication of WO2003077151A3

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/237 Lexical tools
    • G06F40/242 Dictionaries
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16H HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H40/00 ICT specially adapted for the management or administration of healthcare resources or facilities; ICT specially adapted for the management or operation of medical equipment or devices
    • G16H40/60 ICT specially adapted for the management or administration of healthcare resources or facilities; ICT specially adapted for the management or operation of medical equipment or devices for the operation of medical equipment or devices
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16H HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H70/00 ICT specially adapted for the handling or processing of medical references

Definitions

  • the present invention relates to a dictionary and term repository system which may be used to support a healthcare enterprise.
  • a healthcare enterprise is made up of one or more related healthcare facilities and may also be associated with other entities not included in the healthcare enterprise.
  • Healthcare information systems store information related to patients of the healthcare enterprise. The patient information is exchanged over a communications network interconnecting the various facilities in the healthcare enterprise. Some such information is easily handled by computing systems: for example, demographic information such as patient name, age, sex, race, address, etc.; and fiscal data such as coding, billing, collections, appeals, etc.
  • the most critical part of the patient record is the medical information: for example, clinical information such as symptoms, signs, side effects, complications, etc.; and outcomes such as performance, effectiveness, efficiency, etc. To be useful, these records need to be searchable on the medical terms in them, either for the particular patient (e.g., to apply best practice rules) or over a population of patients (e.g., to develop best practice rules).
  • each healthcare information system within the healthcare enterprise maintains a lexicon or dictionary which contains a repository of medical terms (i.e. words or phrases) which may be used in medical transactions in that facility. Searches may then be performed by looking up the regular term associated with each search term, analyzing the medical records to identify the regular terms associated with the terms used in the medical record, and performing the search using the regular terms, all in a known manner.
  • US Patent publication 2002/0082868, published June 27, 2002 for Pories et al. relates to a system for creating an electronic medical record.
  • a general illness for a patient is first identified and supplied to the system via an input device.
  • a series of screen images are presented to the doctor displaying a plurality of terms, which had been pre-entered into a lexicon, related to the illness.
  • the doctor selects from among the displayed list of terms to describe the results of his examination.
  • other screen images with other lists of terms, related to those previously selected may be displayed, and the doctor may select from among those displayed, until the required level of detail is reached. All of the selected terms are then processed to automatically generate structured medical information for the medical record.
  • a term may be manually added to the lexicon, possibly after the approval of a medical director or other person of authority. For example, if a desired term is not in the lexicon, a doctor may request that it be added to the lexicon.
  • medical texts may be scanned and parsed, and terms found in the texts automatically added to the lexicon.
  • third party sources such as hospital, insurance, and/or federal agency databases, may be scanned and parsed to extract relevant terms and the extracted terms added to the lexicon. Further automatic processes may be performed to delete terms which have fallen into disuse.
  • US Patent 6,055,494, issued April 25, 2000 to Friedman discloses a system for parsing natural language medical records.
  • a natural language medical record is parsed and its terms compared to entries in the database to assign a canonical term for any natural language expression corresponding to that canonical term.
  • a database is used to hold information necessary to perform the parse.
  • the regular terms resulting from the parsing of the natural language medical record are then further processed. For example, they may be stored in a database record so that all such medical records may be searched using the regular terms.
  • a dictionary system provides a term repository supporting the operation of an enterprise, which dynamically updates its content in response to system use.
  • An input processor acquires transaction message data in at least one of a plurality of different data formats.
  • a data processor parses the acquired transaction message data to extract a term from the message data.
  • the processor compares the extracted term to terms in a first term repository.
  • the first term repository is updated to include the extracted term if the extracted term is absent from said first term repository.
  • a communication processor intermittently processes the content of said first term repository to be suitable for communication to a second term repository.
  • This second term repository represents the operational term repository for a health information system of one of the healthcare enterprise facilities.
  • FIG. 1 is a block diagram of a portion of a healthcare enterprise data processing system including a data dictionary system according to principles of the present invention.
  • FIG. 2 is a data diagram illustrating a portion of the contents of a term repository table in the data dictionary system illustrated in Fig. 1.
  • FIG. 1 is a block diagram of a portion of a healthcare enterprise data processing system including a data dictionary system according to principles of the present invention. Only those elements necessary to describe a data dictionary according to principles of the present invention are illustrated in Fig. 1. One skilled in the art will understand what other elements are necessary for a healthcare enterprise data processing system, how to specify, design, and implement those elements, and how to interconnect those other elements with the illustrated elements to form a working data processing system.
  • a healthcare enterprise includes a plurality of healthcare information systems (HISs) 100, each providing data processing services for one or more operational facilities in the healthcare enterprise.
  • a healthcare enterprise may include a healthcare enterprise laboratory, a healthcare enterprise pharmacy, a healthcare enterprise radiology department, a healthcare enterprise modality department, a healthcare enterprise administration operation, a healthcare enterprise orders or results management operation, and other such operations.
  • Each of the HISs 100 includes an input/output (I/O) processor 104 for receiving data from and transmitting data to the network connection 200.
  • At least one of the HISs 100 also includes a term repository 102, containing data representing terms which describe the vocabulary of the users of that HIS and may be used to enter information, validate information, identify courses of treatment, and implement searches of the data records contained in that HIS 100, in a manner described above.
  • This dictionary may also contain other information associated with each term, such as a canonical or regular term, other synonyms if they exist, a count of the frequency of occurrence of the term (described in more detail below), a definition, etc. It is also possible for the dictionary to contain information representing the relationship of a term to other terms within a hierarchy of terms, such as might be used to describe the results of a battery of tests such as the complete blood count (CBC) laboratory test.
  • the network connection 200 is also coupled to an electronic data interchange (EDI) circuit 400.
  • the EDI 400 provides a bridge between the enterprise network connection 200 and other entities 300 inside or outside the enterprise.
  • these entities 300 may include doctors' offices, laboratories, pharmaceutical companies, payers, etc.
  • the network connection 200 is also coupled to a dictionary system 500 in accordance with principles of the present invention.
  • An input terminal of an input processor 502 is coupled to, and receives data from, the network connection 200.
  • An output terminal of the input processor 502 is coupled to an input terminal of a data processor 504.
  • An output terminal of the data processor 504 is coupled to an input terminal of a communications processor 508.
  • An output terminal of the communications processor 508 is coupled to and transmits data to the network connection 200.
  • a term repository 506 is bidirectionally coupled to the data processor 504.
  • a maintenance processor 510 is bidirectionally coupled to the term repository 506.
  • the various HISs 100, the EDI 400 and the data dictionary system 500 communicate by sending transaction messages over the network connection 200.
  • Such transaction messages contain data which may include: (a) a communication involving a healthcare enterprise laboratory, (b) a communication involving a healthcare enterprise pharmacy, (c) a communication involving a healthcare enterprise radiology department, (d) a communication involving a healthcare enterprise modality department, (e) a communication involving a healthcare enterprise administration operation, (f) a communication involving a healthcare enterprise orders or results management operation; and/or any other data to be communicated between one facility within the healthcare enterprise and another facility within or without the enterprise.
  • These transaction messages are sent from a source (e.g. one HIS 100) to a destination (e.g. another HIS 100).
  • the incoming transaction message is queued for further processing.
  • the transaction messages in the queue are then processed in their turn.
  • the data fields in each transaction message in the queue are analyzed and processed in a reversible manner until it is clear that the transaction message is complete and accurate. This processing may include review by a human user.
  • the transaction message is 'posted', meaning that any changes specified by the data fields in the transaction message are made at the destination, and the transaction message is removed from the queue.
  • no 'posting' is made and therefore no incorrect changes are made in the destination HIS.
  • the source HIS may be informed of the problem.
  • HISs may be supplied by different vendors and therefore may communicate transaction messages containing different data using different data formats, called protocols, for the transactions.
  • There are several known protocols for forming transaction messages: Health Level Seven (HL7), Extensible Markup Language (XML), Digital Imaging and Communications in Medicine (DICOM), or any other electronic data interchange (EDI) protocol recognized by the American National Standards Institute's (ANSI) Accredited Standards Committee (ASC) X12.
  • each transaction message includes several data fields, representing inter alia: the transaction message source location, the transaction message protocol, and the type of computer system at the transaction message source location.
  • the transaction message further includes data fields representing coded medical data and/or the text of the medical record.
  • the dictionary system 500 monitors the transaction message stream that flows through the network connection 200 connecting the multiple HISs 100 in the healthcare enterprise system 100 and Other Entities 300, and stores new terms (including medical codes) found in those transactions in a structured repository 506 of such terms.
  • the term repository 506 in the dictionary system 500 may also contain other information associated with each term, such as a canonical or regular term, other synonyms if they exist, a count of the frequency of use of the term, a definition, etc.
  • the term repository 506 may contain information representing the relationship of a term to other terms within a hierarchy of terms, such as might be used to describe the results of a battery of tests such as the complete blood count (CBC) laboratory test, as described in more detail below.
  • the respective transaction message data fields representing coded medical data and/or texts of medical records are parsed to identify and extract codes or terms. New terms are added to the repository and counts are maintained of repeated terms. Utilities are provided to manage the repository, including viewing and maintaining its content (e.g., display the most frequently occurring terms, delete terms, reset counts, generate updates for the HIS term repository 102).
  • the central term repository 506 can be turned on early to collect a baseline set of terms, which can be subsequently refined to reflect new terminology requirements of the new HIS system. No complex installation is required because the new dictionary builds itself as transaction messages are processed.
  • the input processor 502 acquires transaction messages from the network connection 200 in at least one of the protocols, described above, in which the transaction messages are formatted. Preferably, however, the input processor 502 may properly acquire transaction messages in any of the protocols used in the system. To do this, the input processor 502 needs to know what protocol is being used to format the transaction message. As described above, each transaction message includes a data field representing the protocol of that transaction message. The input processor 502 extracts the value in that data field and in response conditions its own processing to properly acquire the remainder of the transaction message data.
  • the data processor 504 further processes the acquired transaction messages to update the term repository 506, as described generally above.
  • the acquired transaction message data and in particular the data field representing the medical data, is parsed to extract the terms in it.
  • Predetermined templates of transaction messages in the different possible protocols are developed and used to provide some basic positional information to drive the message parsing function. This satisfies the need to accommodate in a general way the different standard message protocols of the source systems.
  • Each such extracted term may be a word, a code, a symbol, a label, text, a text expression, a designation, a string of characters, or any other such grouping of symbols which, taken as a whole, represents a medical concept.
  • each term which is thus extracted is compared to the terms currently in the term repository 506.
  • the term repository 506 is updated. If the term is not in the term repository 506, it is added. When the term is added to the term repository 506, other data fields in the term repository 506 are also initialized with data related to the term and/or the transaction message containing the term.
  • a data field in the term repository 506 containing a usage count is incremented by one. This count will increase indefinitely until reset at some point in time by the user of the maintenance facility 510, such as when distributing an update to the facility term repositories or when an analysis of new usage counts is desired.
  • the usage count data field is initialized to one.
  • Fig. 2 is a data diagram illustrating a portion of the contents of a term repository table 600 in the data dictionary system 500 illustrated in Fig. 1. More specifically, key data fields, used for indexing the contents of the table 600, contain a source identification (ID) code representing the transaction message source and a term identification (ID) code representing the term itself, illustrated as the leftmost two columns in Fig. 2.
  • each transaction message includes a data field representing the identity of the source of that transaction message.
  • each term in that transaction message is associated with the ID code of the source.
  • the data processor 504 associates a term identification code with that term. If this term is already in the term repository table 600, the term ID is already in the table. If the term is not already in the term repository table 600, the data processor 504 associates a unique term ID with that new term. As described above, if the newly parsed term is not already in the term repository table 600, a new record is added with the source ID and term ID in the key data fields.
  • the maintenance processor 510 provides a means for maintaining the term repository 506.
  • the maintenance processor 510 accesses the contents of the term repository 506 to allow a user to view the terms and related data, manually add a term, manually delete a term, manually edit a term, and reset or otherwise adjust usage frequency counts.
  • the maintenance processor 510 may also generate reports related to the accumulated terms, all in a known manner.
  • the maintenance processor 510 also can evaluate the date/time field in the term repository table 600 to identify terms that haven't been used in transaction messages in a predetermined time interval.
  • the maintenance processor 510 also allows a user to initiate update of a facility HIS 100 term repository 102 in a manner described in more detail below.
  • respective HISs 100 in the operational facilities in the healthcare enterprise each include their own term repository 102 to contain the vocabulary which is relevant to that facility.
  • the facility term repositories 102 are initialized and/or updated using the terms accumulated in the dictionary system 500 and possibly from other sources specific to the facility HIS.
  • the communications processor 508 conditions the data processor 504 to process the content of the dictionary 500 term repository 506 to be suitable to communicate to the term repository 102 in a facility HIS 100. More specifically, the data processor 504 is conditioned to select those terms having a source ID associated with the facility (or possibly other facilities or sources as desired and appropriate) and/or with other record attributes that match predetermined rules, such as a time period for new terms added.
  • the selected terms and other associated data fields are then mapped and arranged as required by the facility HIS 100 term repository 102. These data fields are then communicated over the network connection 200 to the facility HIS 100 via a file and/or one or more transaction messages using the protocol required by the facility HIS 100. Upon receiving these transaction messages, the facility HIS 100 updates its term repository 102. The updating of the facility HIS 100 term repository 102 may be initiated by a user command issued from the maintenance processor 510, as described above. Alternatively, an update may be initiated by a command issued automatically in response to any desired criterion.
  • an update may be initiated in response to either a predetermined interval of time since the last update, the accumulation of a predetermined number of terms since the last update, or identification of terms having usage counts exceeding a predetermined threshold.
  • the facility term repository 102 may also be updated in an incremental manner by selecting only terms in the central term repository 506 which were added after a selected date (e.g. the date of the last update), and updating the facility term repository 102 with those terms.
  • the update may be based on more complicated criteria.
  • a user may perform a query on the central term repository 506 to identify a set of terms matching desired criteria, such as only new terms acquired within a particular time period, or criteria based on attributes of the terms themselves such as only terms acquired from a particular source, or only terms of a particular status.
  • the facility term repository 102 is updated with only these terms. Identical terms from different sources having the same canonical meaning may be consolidated, as described above, and those consolidated terms sent to the facility term repository 102.
  • a selected type of term for example laboratory terms, may be extracted and only those terms sent to the facility term repository 102.
  • the source of the term is included in the database.
  • the source may be a medical text, or a government or insurance industry database. Terms from these sources may need to be supplied to the facility term repository 102 as soon as practical after they are identified.
  • the database 600 in the term repository 506 is queried to identify desired terms and the selected terms sent to the facility term repository 102.
  • the above described methods are simply examples, and any such method may be used to identify sets of terms and to initiate updating of the facility term repository 102 with these terms.
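
As an illustration only, the selection criteria and update triggers described above might be sketched in Python as follows. The names `TermRow`, `select_terms_for_update`, and `update_is_due`, together with all field and parameter names, are hypothetical and do not appear in the application; the sketch assumes each repository row carries a source ID, a usage count, and the date the term was added.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Iterable, List, Optional, Set


@dataclass(frozen=True)
class TermRow:
    """Hypothetical row of the central term repository table."""
    source_id: str
    term: str
    usage_count: int
    added_at: datetime


def select_terms_for_update(
    rows: Iterable[TermRow],
    source_ids: Optional[Set[str]] = None,
    added_after: Optional[datetime] = None,
    min_usage: int = 0,
) -> List[TermRow]:
    """Pick the rows to send to a facility term repository."""
    selected = []
    for row in rows:
        # Keep only terms from the requested sources, if any were given.
        if source_ids is not None and row.source_id not in source_ids:
            continue
        # Incremental update: keep only terms added after the given date.
        if added_after is not None and row.added_at <= added_after:
            continue
        # Optionally drop rarely used terms.
        if row.usage_count < min_usage:
            continue
        selected.append(row)
    return selected


def update_is_due(
    rows: Iterable[TermRow],
    now: datetime,
    last_update: datetime,
    max_interval: timedelta,
    max_new_terms: int,
    usage_threshold: int,
) -> bool:
    """Return True if any of the three example triggers fires."""
    rows = list(rows)
    # Trigger 1: a predetermined interval has elapsed since the last update.
    if now - last_update >= max_interval:
        return True
    # Trigger 2: enough new terms have accumulated since the last update.
    new_terms = [r for r in rows if r.added_at > last_update]
    if len(new_terms) >= max_new_terms:
        return True
    # Trigger 3: some term's usage count exceeds the threshold.
    return any(r.usage_count > usage_threshold for r in rows)
```

Passing `added_after` equal to the date of the last update gives the incremental update described above; mapping the selected rows into the format required by the facility HIS is left out of this sketch.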

Abstract

A dictionary system provides a term repository supporting the operation of an enterprise, which dynamically updates its content in response to system use. An input processor acquires transaction message data in at least one of a plurality of different data formats. A data processor parses the acquired transaction message data to extract a term from the message data. The processor then compares the extracted term to terms in a first term repository. The first term repository is updated to include the extracted term if the extracted term is absent from said first term repository. A communication processor intermittently processes the content of said first term repository to be suitable for communication to a second term repository.

Description

A Dynamic Dictionary and Term Repository System
This application is a non-provisional application of provisional application 60/361,754 by R. E. Haskell et al. filed March 5, 2002.
FIELD OF THE INVENTION
[1] The present invention relates to a dictionary and term repository system which may be used to support a healthcare enterprise.
BACKGROUND OF THE INVENTION
[2] A healthcare enterprise is made up of one or more related healthcare facilities and may also be associated with other entities not included in the healthcare enterprise. Healthcare information systems store information related to patients of the healthcare enterprise. The patient information is exchanged over a communications network interconnecting the various facilities in the healthcare enterprise. Some such information is easily handled by computing systems: for example, demographic information such as patient name, age, sex, race, address, etc.; and fiscal data such as coding, billing, collections, appeals, etc. However, the most critical part of the patient record is the medical information: for example, clinical information such as symptoms, signs, side effects, complications, etc.; and outcomes such as performance, effectiveness, efficiency, etc. To be useful, these records need to be searchable on the medical terms in them, either for the particular patient (e.g., to apply best practice rules) or over a population of patients (e.g., to develop best practice rules).
[3] However, one problem is that in medical records different terms may be used by different facilities and/or doctors for the same or similar concepts. For example, the terms: "heart is enlarged", "enlarged heart", "heart shows enlargement", and "cardiac enlargement" have the same meaning, and may be translated to a canonical or regular term: "enlarged heart". Similarly, the same term may have different meanings to different facilities within the healthcare enterprise. For example, "cystic disease" has one meaning in X-ray diagnosis and a different meaning in mammography.
[4] In order to provide complete and accurate results when searching for records related to a medical concept, all terms which may be used to represent that concept need to be found in the search. To facilitate this function, each healthcare information system within the healthcare enterprise maintains a lexicon or dictionary which contains a repository of medical terms (i.e. words or phrases) which may be used in medical transactions in that facility. Searches may then be performed by looking up the regular term associated with each search term, analyzing the medical records to identify the regular terms associated with the terms used in the medical record, and performing the search using the regular terms, all in a known manner.
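The lookup-based search described in paragraph [4] can be illustrated with a minimal Python sketch. It reuses the "enlarged heart" variants from paragraph [3]; the names `CANONICAL`, `to_canonical`, and `search_records` are hypothetical, and the sketch assumes the terms of each medical record have already been extracted into a list.

```python
from typing import Dict, List

# Hypothetical lexicon mapping each known variant to its regular (canonical) term.
CANONICAL: Dict[str, str] = {
    "heart is enlarged": "enlarged heart",
    "enlarged heart": "enlarged heart",
    "heart shows enlargement": "enlarged heart",
    "cardiac enlargement": "enlarged heart",
}


def to_canonical(term: str, lexicon: Dict[str, str]) -> str:
    """Return the regular term for a variant, or the cleaned term if unknown."""
    key = term.strip().lower()
    return lexicon.get(key, key)


def search_records(
    query: str, records: Dict[str, List[str]], lexicon: Dict[str, str]
) -> List[str]:
    """Return the IDs of records containing any variant of the query term."""
    wanted = to_canonical(query, lexicon)
    return [
        record_id
        for record_id, terms in records.items()
        if any(to_canonical(t, lexicon) == wanted for t in terms)
    ]
```

With this mapping, a search for "cardiac enlargement" also matches a record that says "heart is enlarged", because both are reduced to the same regular term before comparison.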
[5] US Patent publication 2002/0082868, published June 27, 2002 for Pories et al., relates to a system for creating an electronic medical record. A general illness for a patient is first identified and supplied to the system via an input device. In response, a series of screen images are presented to the doctor displaying a plurality of terms, which had been pre-entered into a lexicon, related to the illness. The doctor selects from among the displayed list of terms to describe the results of his examination. In response to the doctor's selections, other screen images with other lists of terms, related to those previously selected, may be displayed, and the doctor may select from among those displayed, until the required level of detail is reached. All of the selected terms are then processed to automatically generate structured medical information for the medical record. A term may be manually added to the lexicon, possibly after the approval of a medical director or other person of authority. For example, if a desired term is not in the lexicon, a doctor may request that it be added to the lexicon. In addition, medical texts may be scanned and parsed, and terms found in the texts automatically added to the lexicon. Also, third party sources, such as hospital, insurance, and/or federal agency databases, may be scanned and parsed to extract relevant terms and the extracted terms added to the lexicon. Further automatic processes may be performed to delete terms which have fallen into disuse.
[6] US Patent 6,055,494, issued April 25, 2000 to Friedman, discloses a system for parsing natural language medical records. A natural language medical record is parsed and its terms compared to entries in the database to assign a canonical term for any natural language expression corresponding to that canonical term. A database is used to hold information necessary to perform the parse. The regular terms resulting from the parsing of the natural language medical record are then further processed. For example, they may be stored in a database record so that all such medical records may be searched using the regular terms.
[7] US Patent 5,809,476, issued September 15, 1998 to Ryan, discloses a system for generating coded data from natural language medical records. Each term in the medical record, and relationships between terms, are analyzed to generate a compressed symbolic representation of the original information. The system provides for correction and/or supplement of the original information. This coded information may later be interrogated.
[8] By parsing and/or coding medical records to identify regular or canonical terms for the medical terms being recorded, the future searching of such records is facilitated, best practice rules may be defined using these terms, and more meaningful statistical analyses of the medical records may be performed.
[9] In Pories et al. only those terms in the lexicon are displayed and made available for a doctor to include in his medical record. In the other systems, only the terms in the lexicon are recognized and coded. Thus, all of the above mentioned systems require a populated database or lexicon to operate. However, the population of such a lexicon can be a time consuming and lengthy process, which may unduly restrict doctors when delivering health care and creating their own medical records. Also, even if automatically created from prior lexicons or from literature, there is no assurance that terms collected are relevant to the facility or doctor.
[10] In Pories et al., a doctor may request a term be added to the lexicon. But the update is made manually, possibly only upon approval of a medical director. This system may also automatically delete terms which have not been used for some predetermined time. Neither Friedman nor Ryan discloses dynamically updating the lexicon in response to use of the system.
[11] The requirement for a full lexicon means that such systems require an installation of such a lexicon before they may be used. Installation of a full lexicon may be accomplished in two ways: by moving a lexicon from one or more remote locations to the current health information system; and/or by manually entering the desired terms into the lexicon. The former requires that programs be written which can read the pre-existing lexicon in its format, and write the new lexicon in its, possibly different, format. The development of this program, its testing and execution require a substantial amount of time, and risk carrying over old, unused terms from the old lexicon into the new one. The latter takes even more time, and has a very high probability of the new lexicon having errors, omissions and inaccuracies.
[12] A system which does not require a full lexicon to begin operation, provides a means to rapidly and accurately build a significant portion thereof, and which provides for full and accurate searching of medical records is desirable.
BRIEF SUMMARY OF THE INVENTION
[13] In accordance with principles of the present invention a dictionary system provides a term repository supporting the operation of an enterprise, which dynamically updates its content in response to system use. An input processor acquires transaction message data in at least one of a plurality of different data formats. A data processor parses the acquired transaction message data to extract a term from the message data. The processor then compares the extracted term to terms in a first term repository. The first term repository is updated to include the extracted term if the extracted term is absent from said first term repository. A communication processor intermittently processes the content of said first term repository to be suitable for communication to a second term repository. This second term repository represents the operational term repository for a health information system of one of the healthcare enterprise facilities.
BRIEF DESCRIPTION OF THE DRAWING
[14] In the drawing:
[15] Fig. 1 is a block diagram of a portion of a healthcare enterprise data processing system including a data dictionary system according to principles of the present invention; and
[16] Fig. 2 is a data diagram illustrating a portion of the contents of a term repository table in the data dictionary system illustrated in Fig. 1.
DETAILED DESCRIPTION OF THE INVENTION
[17] Fig. 1 is a block diagram of a portion of a healthcare enterprise data processing system including a data dictionary system according to principles of the present invention. Only those elements necessary to describe a data dictionary according to principles of the present invention are illustrated in Fig. 1. One skilled in the art will understand what other elements are necessary for a healthcare enterprise data processing system, how to specify, design, and implement those elements, and how to interconnect those other elements with the illustrated elements to form a working data processing system.
[18] In Fig. 1, a healthcare enterprise includes a plurality of healthcare information systems (HISs) 100, each providing data processing services for one or more operational facilities in the healthcare enterprise. For example, a healthcare enterprise may include a healthcare enterprise laboratory, a healthcare enterprise pharmacy, healthcare enterprise radiology department, a healthcare enterprise modality department, a healthcare enterprise administration operation, a healthcare enterprise orders or results management operation, and other such operations.
[19] These HISs are interconnected using a network connection 200, including any of a wireless network, wired network (e.g. Ethernet), or any other form of computer interconnection structure. Each of the HISs 100 includes an input/output (I/O) processor 104 for receiving data from and transmitting data to the network connection 200. At least one of the HISs 100 also includes a term repository 102, containing data representing terms which describe the vocabulary of the users of that HIS and may be used to enter information, validate information, identify courses of treatment, and implement searches of the data records contained in that HIS 100, in a manner described above. This dictionary may also contain other information associated with each term, such as a canonical or regular term, other synonyms if they exist, a count of the frequency of occurrence of the term (described in more detail below), a definition, etc. It is also possible for the dictionary to contain information representing the relationship of a term to other terms within a hierarchy of terms, such as might be used to describe the results of a battery of tests such as the complete blood count (CBC) laboratory test.
[20] The network connection 200 is also coupled to an electronic data interchange (EDI) circuit 400. The EDI 400 provides a bridge between the enterprise network connection 200 and other entities 300 inside or outside the enterprise. For example, these entities 300 may include doctors' offices, laboratories, pharmaceutical companies, payers, etc.
[21] The network connection 200 is also coupled to a dictionary system 500 in accordance with principles of the present invention. An input terminal of an input processor 502 is coupled to, and receives data from the network connection 200. An output terminal of the input processor 502 is coupled to an input terminal of a data processor 504. An output terminal of the data processor 504 is coupled to an input terminal of a communications processor 508. An output terminal of the communications processor 508 is coupled to and transmits data to the network connection 200. A term repository 506 is bidirectionally coupled to the data processor 504. A maintenance processor 510 is bidirectionally coupled to the term repository 506.
[22] In operation, the various HISs 100, the EDI 400 and the data dictionary system 500 communicate by sending transaction messages over the network connection 200. Such transaction messages contain data which may include: (a) a communication involving a healthcare enterprise laboratory, (b) a communication involving a healthcare enterprise pharmacy, (c) a communication involving a healthcare enterprise radiology department, (d) a communication involving a healthcare enterprise modality department, (e) a communication involving a healthcare enterprise administration operation, (f) a communication involving a healthcare enterprise orders or results management operation; and/or any other data to be communicated between one facility within the healthcare enterprise and another facility within or without the enterprise.
[23] These transaction messages are sent from a source (e.g. one HIS 100) to a destination (e.g. another HIS 100). At the destination, the incoming transaction message is queued for further processing. The transaction messages in the queue are then processed in their turn. To enhance reliability, the data fields in each transaction message in the queue are analyzed and processed in a reversible manner until it is clear that the transaction message is complete and accurate. This processing may include review by a human user. If the transaction message is complete and accurate, then the transaction message is 'posted', meaning that any changes specified by the data fields in the transaction message are made at the destination, and the transaction message is removed from the queue. In this manner, if it is found that a transaction message is defective in any manner, e.g. by missing data, by containing corrupt data, or by including conflicting or nonsensical data, etc., no 'posting' is made and therefore no incorrect changes are made in the destination HIS. Further, the source HIS may be informed of the problem.
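The queue-and-post behavior described above may be sketched as follows. This is a minimal illustration only, assuming that a transaction message is a set of named data fields and that "complete and accurate" means every required field is present and non-empty. The names TransactionMessage, validate and process_queue are hypothetical and do not appear in the specification.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class TransactionMessage:
    source: str   # identity of the sending HIS
    fields: dict  # data fields carried by the message


def validate(message, required_fields):
    """Return a list of problems; an empty list means the message may be posted."""
    problems = []
    for name in required_fields:
        value = message.fields.get(name)
        if value is None or value == "":
            problems.append(f"missing field: {name}")
    return problems


def process_queue(queue, destination_state, required_fields):
    """Post complete messages; make no change at all for defective ones."""
    rejected = []
    while queue:
        message = queue.popleft()
        problems = validate(message, required_fields)
        if problems:
            # No posting is made, so no incorrect change reaches the destination.
            # The (source, problems) pair can be used to inform the source HIS.
            rejected.append((message.source, problems))
            continue
        # Posting: apply the changes specified by the data fields.
        destination_state.update(message.fields)
    return rejected
```

Because validation completes before any change is written, a defective message leaves the destination untouched, which is one simple way to obtain the reversible behavior described.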
[24] In a healthcare enterprise data processing system, such as is illustrated in Fig. 1, different HISs may be supplied by different vendors and therefore may communicate transaction messages containing different data using different data formats, called protocols, for the transactions. There are several known protocols for forming transaction messages: Health Level Seven (HL7), Extensible Markup Language (XML), Digital Imaging and Communications in Medicine (DICOM), or any other electronic data interchange (EDI) protocol recognized by the American National Standards Institute's (ANSI) Accredited Standards Committee (ASC) X12. Regardless of the protocol, each transaction message includes several data fields, representing inter alia: the transaction message source location, the transaction message protocol, and the type of computer system at the transaction message source location. In some cases, for example for those transmitting medical records, the transaction message further includes data fields representing coded medical data and/or the text of the medical record.
[25] In general, the dictionary system 500, according to principles of the present invention, monitors the transaction message stream that flows through the network connection 200 connecting the multiple HISs 100 in the healthcare enterprise system 100 and Other Entities 300, and stores new terms (including medical codes) found in those transactions in a structured repository 506 of such terms. In a similar manner to the facility term repository 102 discussed above, the term repository 506 in the dictionary system 500 may also contain other information associated with each term, such as a canonical or regular term, other synonyms if they exist, a count of the frequency of use of the term, a definition, etc. It is also possible for the term repository 506 to contain information representing the relationship of a term to other terms within a hierarchy of terms, such as might be used to describe the results of a battery of tests such as the complete blood count (CBC) laboratory test, as described in more detail below.
[26] The respective transaction message data fields representing coded medical data and/or texts of medical records are parsed to identify and extract codes or terms. New terms are added to the repository and counts are maintained of repeated terms. Utilities are provided to manage the repository, including viewing and maintaining its content (e.g., display the most frequently occurring terms, delete terms, reset counts, generate updates for the HIS term repository 102). As preparation for installing a new HIS system containing a new term repository 102, the central term repository 506 can be turned on early to collect a baseline set of terms, which can be subsequently refined to reflect new terminology requirements of the new HIS system. No complex installation is required because the new dictionary builds itself as transaction messages are processed.
[27] More specifically, the input processor 502 acquires transaction messages from the network connection 200 in at least one of the protocols, described above, in which the transaction messages are formatted. Preferably, however, the input processor 502 may properly acquire transaction messages in any of the protocols used in the system. To do this, the input processor 502 needs to know what protocol is being used to format the transaction message. As described above, each transaction message includes a data field representing the protocol of that transaction message. The input processor 502 extracts the value in that data field and in response conditions its own processing to properly acquire the remainder of the transaction message data.
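The manner in which the input processor 502 conditions its processing on the protocol data field may be sketched as a dispatch table. The sketch below is illustrative only. It assumes a message is a mapping with a "protocol" entry and a "payload" entry, and it uses a toy pipe-delimited format as a stand-in for a delimited protocol; it is not an HL7, DICOM or X12 parser. All function names and protocol labels are hypothetical.

```python
import xml.etree.ElementTree as ET


def acquire_pipe_delimited(payload):
    # Toy stand-in for a delimited protocol; not a real HL7 parser.
    return payload.split("|")


def acquire_xml(payload):
    # Collect the non-empty text of every element in document order.
    root = ET.fromstring(payload)
    return [
        element.text.strip()
        for element in root.iter()
        if element.text and element.text.strip()
    ]


# Hypothetical protocol labels mapped to the routine that reads that format.
ACQUIRERS = {"PIPE": acquire_pipe_delimited, "XML": acquire_xml}


def acquire(message):
    """Read the protocol field first, then acquire the rest accordingly."""
    protocol = message["protocol"]
    try:
        acquirer = ACQUIRERS[protocol]
    except KeyError:
        raise ValueError(f"unsupported protocol: {protocol}") from None
    return acquirer(message["payload"])
```

Supporting a further protocol then amounts to registering one more entry in the table, without changing the dispatch logic.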
[28] The data processor 504 further processes the acquired transaction messages to update the term repository 506, as described generally above. First, the acquired transaction message data, and in particular the data field representing the medical data, is parsed to extract the terms in it. Predetermined templates of transaction messages in the different possible protocols are developed and used to provide some basic positional information to drive the message parsing function. This satisfies the need to accommodate in a general way the different standard message protocols of the source systems. Each such extracted term may be a word, a code, a symbol, a label, text, a text expression, a designation, a string of characters, or any other such grouping of symbols which, taken as a whole, represents a medical concept. Second, each term which is thus extracted is compared to the terms currently in the term repository 506. Third, the term repository 506 is updated. If the term is not in the term repository 506, it is added. When the term is added to the term repository 506, other data fields in the term repository 506 are also initialized with data related to the term and/or the transaction message containing the term. Optionally, if the term is in the term repository, then a data field in the term repository 506 containing a usage count is incremented by one. This count will increase indefinitely until reset at some point in time by the user of the maintenance facility 510, such as when distributing an update to the facility term repositories or when an analysis of new usage counts is desired. When a term is initially added to the term repository 506, the usage count data field is initialized to one.
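The three steps performed by the data processor 504 (parse, compare, update) may be sketched as follows. This is a simplified illustration under stated assumptions: the message has already been split into positional fields, the template for each protocol is simply a tuple of field positions that carry terms, and the repository is a mapping from term to usage count. The names TEMPLATES, extract_terms and update_repository, and the "PIPE" label, are hypothetical.

```python
# Hypothetical positional templates: protocol label -> positions of term fields.
TEMPLATES = {"PIPE": (1, 3)}


def extract_terms(protocol, fields):
    """Step 1: use the protocol's template to pull terms out of the fields."""
    positions = TEMPLATES[protocol]
    return [fields[i] for i in positions if i < len(fields) and fields[i]]


def update_repository(repository, extracted_terms):
    """Steps 2 and 3: compare each term to the repository, then update it."""
    added = []
    for term in extracted_terms:
        if term in repository:
            # Known term: increment the usage count by one.
            repository[term] += 1
        else:
            # New term: add it with the usage count initialized to one.
            repository[term] = 1
            added.append(term)
    return added


def reset_counts(repository):
    """Reset every usage count, as a maintenance user might do after an update."""
    for term in repository:
        repository[term] = 0
```

For example, processing the fields ["OBX", "WBC", "5.1", "CBC"] under the "PIPE" template adds the terms WBC and CBC, and a later message containing WBC again raises only its count.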
[29] Fig. 2 is a data diagram illustrating a portion of the contents of a term repository table 600 in the data dictionary system 500 illustrated in Fig. 1. More specifically, key data fields, used for indexing the contents of the table 600, contain a source identification (ID) code representing the transaction message source and a term identification (ID) code representing the term itself, illustrated as the leftmost two columns in Fig. 2. Each facility in the healthcare enterprise 100, and potentially each facility across all healthcare enterprises and other entities from which data might be received, for example, external facilities, and government and/or insurance industry databases, has associated with it an identification code. It is also possible for other sources of terms, such as medical texts, to be assigned an identification code.
[30] Including a source ID code as a key data field is needed to identify and maintain separately those identical terms that might have different meanings in different HIS facilities, as described above. Consolidation rules may be applied later by the maintenance processor 510 to merge identical terms with the same meaning, and to separate terms with different meanings. One skilled in the art would understand how to generate consolidation rules which may be applied later within the maintenance processor 510.
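One possible consolidation rule is sketched below for illustration. The specification leaves the rules to one skilled in the art, so the rule shown is an assumption: records are merged only when both the term text and a canonical meaning match, and identical terms with different canonical meanings remain separate. The record layout and the function name consolidate are hypothetical.

```python
def consolidate(records):
    """Merge records sharing term text and canonical meaning; keep others apart."""
    merged = {}
    for record in records:
        key = (record["term"], record["canonical"])
        entry = merged.setdefault(
            key,
            {
                "term": record["term"],
                "canonical": record["canonical"],
                "sources": [],
                "usage_count": 0,
            },
        )
        # Remember which sources contributed and combine their usage counts.
        entry["sources"].append(record["source_id"])
        entry["usage_count"] += record["usage_count"]
    return list(merged.values())
```

Under this rule, a term such as "MS" reported by two facilities with the same canonical meaning yields one consolidated entry, while "MS" reported by a third facility with a different canonical meaning yields a second entry.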
[31] As described above, each transaction message includes a data field representing the identity of the source of that transaction message. When a transaction message is parsed by the data processor 504, each term in that transaction message is associated with the ID code of the source. In addition, when a term is extracted from the transaction message, the data processor 504 associates a term identification code with that term. If this term is already in the term repository table 600, the term ID is already in the table. If the term is not already in the term repository table 600, the data processor 504 associates a unique term ID with that new term. As described above, if the newly parsed term is not already in the term repository table 600, a new record is added with the source ID and term ID in the key data fields.
[32] Further data fields in the term repository table 600 are illustrated from left to right for containing the full name of the source system, the extracted term in text form, the date and time the last transaction message was received, and the status of the term. These fields are initialized when the term is first added to the table 600. Also, another data field includes the usage count, described in detail above. Other fields (not shown) may also be provided to contain supporting detail for the term, the relationship to other terms, the date and time the term entry was first created, and any other information related to the term. Each term extracted from the transaction messages has a record in the term repository table 600.
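The record layout of the term repository table 600 and its two key data fields may be sketched as follows. This is an illustrative sketch, not the disclosed implementation. It assumes that a term ID is assigned per distinct term text and is shared across sources, which the specification does not state explicitly, and the class and field names (TermRecord, TermTable, record_term) are hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime
from itertools import count


@dataclass
class TermRecord:
    source_id: str           # key field: transaction message source
    term_id: int             # key field: the term itself
    source_name: str         # full name of the source system
    term_text: str           # extracted term in text form
    last_received: datetime  # date and time of the last message with the term
    status: str = "ACTIVE"
    usage_count: int = 1     # initialized to one when the term is first added


class TermTable:
    def __init__(self):
        self.records = {}     # (source_id, term_id) -> TermRecord
        self._term_ids = {}   # term text -> term_id (assumed shared by sources)
        self._next_id = count(1)

    def record_term(self, source_id, source_name, term_text, received):
        # Reuse the existing term ID, or associate a new unique one.
        term_id = self._term_ids.get(term_text)
        if term_id is None:
            term_id = next(self._next_id)
            self._term_ids[term_text] = term_id
        key = (source_id, term_id)
        record = self.records.get(key)
        if record is None:
            record = TermRecord(source_id, term_id, source_name, term_text, received)
            self.records[key] = record
        else:
            record.usage_count += 1
            record.last_received = received
        return record
```

Keying on the pair (source ID, term ID) keeps the same term from two facilities in two separate records, so that each may later be consolidated or kept distinct.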
[33] Referring again to Fig. 1, the maintenance processor 510 provides a means for maintaining the term repository 506. The maintenance processor 510 accesses the contents of the term repository 506 to allow a user to view the terms and related data, manually add a term, manually delete a term, manually edit a term, and reset or otherwise adjust usage frequency counts. The maintenance processor 510 may also generate reports related to the accumulated terms, all in a known manner. The maintenance processor 510 also can evaluate the date/time field in the term repository table 600 to identify terms that haven't been used in transaction messages in a predetermined time interval. Any such identified term may be inactivated by deleting the record from the table 600, marking the term status field as INACTIVE, moving the record to a separate table (not shown) which holds inactive terms, and/or any other suitable technique. The maintenance processor 510 also allows a user to initiate update of a facility HIS 100 term repository 102 in a manner described in more detail below.
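The identification of unused terms by the maintenance processor 510 may be sketched as follows. The sketch shows only one of the inactivation techniques mentioned above, namely marking the term status field as INACTIVE. The record layout, the function name inactivate_stale and the parameter max_idle are hypothetical.

```python
from datetime import datetime, timedelta


def inactivate_stale(records, now, max_idle):
    """Mark as INACTIVE every active term not seen within max_idle of now."""
    inactivated = []
    for record in records:
        idle = now - record["last_received"]
        if record["status"] == "ACTIVE" and idle > max_idle:
            record["status"] = "INACTIVE"
            inactivated.append(record["term"])
    return inactivated
```

Marking the status, rather than deleting the record, preserves the usage history of a term in case it reappears in later transaction messages.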
[34] As described above, respective HISs 100 in the operational facilities in the healthcare enterprise each include their own term repository 102 to contain the vocabulary which is relevant to that facility. The facility term repositories 102 are initialized and/or updated using the terms accumulated in the dictionary system 500 and possibly from other sources specific to the facility HIS. The communications processor 508 conditions the data processor 504 to process the content of the dictionary 500 term repository 506 to be suitable to communicate to the term repository 102 in a facility HIS 100. More specifically, the data processor 504 is conditioned to select those terms having a source ID associated with the facility (or possibly other facilities or sources as desired and appropriate) and/or with other record attributes that match predetermined rules, such as a time period for new terms added. The selected terms and other associated data fields are then mapped and arranged as required by the facility HIS 100 term repository 102. These data fields are then communicated over the network connection 200 to the facility HIS 100 via a file and/or one or more transaction messages using the protocol required by the facility HIS 100. Upon receiving these transaction messages, the facility HIS 100 updates its term repository 102. The updating of the facility HIS 100 term repository 102 may be initiated by a user command issued from the maintenance processor 510, as described above. Alternatively, an update may be initiated by a command issued automatically in response to any desired criterion. For example, an update may be initiated in response to either a predetermined interval of time since the last update, the accumulation of a predetermined number of terms since the last update, or identification of terms having usage counts exceeding a predetermined threshold.
The facility term repository 102 may also be updated in an incremental manner by selecting only terms in the central term repository 506 which were added after a selected date (e.g. the date of the last update), and updating the facility term repository 102 with those terms.
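The automatic update criteria and the incremental selection described above may be sketched as follows. This is an illustration under stated assumptions: each record carries a source ID and the date and time it was added, and the three example criteria are combined so that any one of them initiates an update. The function and parameter names are hypothetical.

```python
from datetime import datetime, timedelta


def update_due(now, last_update, new_terms, highest_usage_count,
               max_interval, max_new_terms, usage_threshold):
    """Return True if any one of the three example criteria is satisfied."""
    return (
        now - last_update >= max_interval         # time since the last update
        or new_terms >= max_new_terms             # terms accumulated since then
        or highest_usage_count > usage_threshold  # a usage count over threshold
    )


def select_incremental(records, source_id, added_after):
    """Pick records for one facility that were added after the given time."""
    return [
        record for record in records
        if record["source_id"] == source_id and record["added"] > added_after
    ]
```

Passing the date of the last update as added_after yields only the terms the facility term repository 102 has not yet received.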
[35] It is also important to have some form of filtering on this upload process, especially if applying new content to an existing facility-specific term repository. Therefore, the update may be based on more complicated criteria. A user may perform a query on the central term repository 506 to identify a set of terms matching desired criteria, such as only new terms acquired within a particular time period, or criteria based on attributes of the terms themselves such as only terms acquired from a particular source, or only terms of a particular status. The facility term repository 102 is updated with only these terms. Identical terms from different sources having the same canonical meaning may be consolidated, as described above, and those consolidated terms sent to the facility term repository 102. Alternatively, a selected type of term, for example laboratory terms, may be extracted and only those terms sent to the facility term repository 102. Further, as illustrated in Fig. 2, the source of the term is included in the database. As described above, the source may be a medical text, or a government or insurance industry database. Terms from these sources may need to be supplied to the facility term repository 102 as soon as practical after they are identified. In all of these examples, the database 600 in the term repository 506 is queried to identify desired terms and the selected terms sent to the facility term repository 102. One skilled in the art will understand that the above described methods are simply examples, and that any such method may be used to identify sets of terms and to initiate updating of the facility term repository 102 with these terms.
[36] By accumulating terms in a central term repository (506) by monitoring transaction messages passing through the network connection (200), as described above, terms in a facility term repository (102) may be initially installed without requiring a full installation process. Instead, installation of terms occurs as those terms are used. Alternatively, transaction streams may be monitored by a central term repository 506 while waiting for a new system to be delivered and the facility term repository 102 may be updated from the central term repository 506 when finally installed. This obviates the time and expense inherent in writing and testing software to transfer terms from pre-existing term repositories or medical texts, and eliminates the unreliability inherent in manual entry of terms into the new term repository. Instead, terms are accumulated rapidly and accurately, as the terms are used, and therefore needed. New terms are identified and integrated on an ongoing basis.
[37] This invention has been described above in an embodiment of a healthcare enterprise including multiple HISs in corresponding healthcare facilities within the enterprise. However, one skilled in the art will understand that this invention may also serve as a generic extension to a single HIS, or as a general health industry data dictionary used, for example, for regional health surveillance or any other such purpose. In addition, there is nothing in this technique which is specific to healthcare and HISs. Any vocabulary term repository may be created and maintained by monitoring transaction streams carrying transactions in which terms are included, according to the invention, provided there is a standard within the data streams which will permit the necessary parsing and identification functions.

Claims

What is claimed is:
1. A healthcare dictionary system providing a term repository accessible for use in supporting the operation of a healthcare enterprise, comprising: an input processor for acquiring healthcare transaction message data in at least one of a plurality of different data formats; a data processor for, parsing said acquired transaction message data and extracting a term from said acquired transaction message data, comparing said extracted term to terms in a first term repository, and updating said first term repository to include said extracted term in response to a determination said extracted term is absent from said first term repository; and a communication processor for intermittently processing content of said first term repository to be suitable for communication to a second term repository.
2. The system according to claim 1 wherein the data processor further updates the first term repository to increment a usage count for the extracted term in the first term repository in response to a determination that said extracted term is present in the first term repository.
3. The system according to claims 1 or 2 further comprising a maintenance processor, coupled to the first term repository, for maintaining the contents of the first term repository.
4. The system according to claim 3 wherein the maintenance processor allows a user to reset the usage count for a term in the first term repository.
5. The system according to claim 3 wherein the maintenance processor allows a user to do one or more of: (a) view a term in the first term repository, (b) add a term to the first term repository, (c) delete a term from the first term repository, (d) edit a term in the first term repository, and (e) generate a report related to the terms in the first term repository.
6. A system according to claim 1, wherein said data processor further extracts at least one of, (a) transaction message source identification information, (b) transaction message protocol identification information and (c) transaction message source computer system identification information.
7. A system according to claim 1, wherein said transaction message data comprises at least one of, (a) a communication involving a healthcare enterprise laboratory, (b) a communication involving a healthcare enterprise pharmacy, (c) a communication involving a healthcare enterprise radiology department, (d) a communication involving a healthcare enterprise modality department, (e) a communication involving a healthcare enterprise administration operation and (f) a communication involving a healthcare enterprise orders or results management operation.
8. A system according to claim 1, wherein said first term repository is used to update a plurality of different health care information system term repositories including said second term repository and further comprising a maintenance processor, coupled to the first term repository, for initiating an update of said second term repository in response to a user command.
9. The system according to claim 10 wherein the update is initiated automatically in response to satisfaction of a predetermined criterion wherein the predetermined criterion relates to at least one of, (a) dates respective terms were included in the first term repository, (b) the number of terms included in the first term repository and (c) attributes of terms included in the first term repository.
10. A system according to claim 1, wherein said data processor processes said extracted term to be compatible with said first term repository by storing said term in said first term repository together with at least one of, (a) date or time said term is received, (b) a source system identifier, (c) a term status and (d) an updated indication of a number of times said term has been detected.
11. A system according to claim 1, wherein said data processor processes said extracted term to be compatible with said first term repository by storing said term in said first term repository together with at least one of, (a) a term identifier code created by said data processor and (b) an organization identifier code associated with a source system of said extracted term.
12. A system according to claim 1, wherein said extracted term comprises at least one of, (a) a word, (b) a code, (c) a symbol, (d) a label, (e) text, (f) a text expression, (g) a designation, and (h) a string of characters, derivable from transaction message data communicated in support of healthcare enterprise operation.
13. A system according to claim 1, wherein said data processor processes said extracted term to be compatible with said first term repository by storing said term in said first term repository together with at least one of, (a) a term identifier code created by said data processor and (b) an organization identifier code associated with a source system of said extracted term.
14. A method for providing a healthcare term repository accessible for use in supporting the operation of a health care enterprise, comprising the steps of: acquiring healthcare transaction message data in at least one of a plurality of different data formats; parsing said acquired transaction message data and extracting a term from said acquired transaction message data; comparing said extracted term to terms in a first term repository; updating said first term repository to include said extracted term in response to a determination said extracted term is absent from said first term repository; and processing content of said first term repository to be suitable for communication to a second term repository.
PCT/US2003/006766 2002-03-05 2003-03-05 A dynamic dictionary and term repository system WO2003077151A2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
EP03744193A EP1481332A2 (en) 2002-03-05 2003-03-05 A dynamic dictionary and term repository system
JP2003575296A JP2005519411A (en) 2002-03-05 2003-03-05 Dynamic dictionary and term storage system
CA002477689A CA2477689A1 (en) 2002-03-05 2003-03-05 A dynamic dictionary and term repository system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US36175402P 2002-03-05 2002-03-05
US60/361,754 2002-03-05

Publications (2)

Publication Number Publication Date
WO2003077151A2 true WO2003077151A2 (en) 2003-09-18
WO2003077151A3 WO2003077151A3 (en) 2004-03-25

Family

ID=27805072

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2003/006766 WO2003077151A2 (en) 2002-03-05 2003-03-05 A dynamic dictionary and term repository system

Country Status (5)

Country Link
US (1) US7580831B2 (en)
EP (1) EP1481332A2 (en)
JP (1) JP2005519411A (en)
CA (1) CA2477689A1 (en)
WO (1) WO2003077151A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2624143A1 (en) * 2012-01-31 2013-08-07 Tata Consultancy Services Limited Automated dictionary creation for scientific terms
WO2014138838A1 (en) * 2013-03-15 2014-09-18 Whoplusyou Inc. Method and system for candidate matching using dynamic dictionary maintenance heuristics

Families Citing this family (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070156458A1 (en) * 2005-10-04 2007-07-05 Anuthep Benja-Athon Sieve of words in health-care data
CN100517300C (en) * 2002-11-28 2009-07-22 皇家飞利浦电子股份有限公司 Method to assign word class information
US20050027564A1 (en) * 2003-06-18 2005-02-03 Yantis David Brook Term management system suitable for healthcare and other use
US7788040B2 (en) * 2003-12-19 2010-08-31 Siemens Medical Solutions Usa, Inc. System for managing healthcare data including genomic and other patient specific information
US20050273365A1 (en) * 2004-06-04 2005-12-08 Agfa Corporation Generalized approach to structured medical reporting
US20060004745A1 (en) * 2004-06-04 2006-01-05 Agfa Corporation Structured reporting report data manager
JP4816409B2 (en) * 2006-01-10 2011-11-16 日産自動車株式会社 Recognition dictionary system and updating method thereof
US8458647B2 (en) * 2006-03-07 2013-06-04 Sap Portals Israel Ltd. Method and apparatus for graphically constructing applications utilizing information from multiple sources
US20070265831A1 (en) * 2006-05-09 2007-11-15 Itai Dinur System-Level Correction Service
US8849653B2 (en) * 2006-05-09 2014-09-30 International Business Machines Corporation Updating dictionary during application installation
EP3168764A1 (en) * 2006-09-26 2017-05-17 Ralph Korpman Individual health record system and apparatus
US9864838B2 (en) * 2008-02-20 2018-01-09 Medicomp Systems, Inc. Clinically intelligent parsing
WO2009136440A1 (en) * 2008-05-09 2009-11-12 富士通株式会社 Speech recognition dictionary creating support device, processing program, and processing method
US8180630B2 (en) 2008-06-06 2012-05-15 Zi Corporation Of Canada, Inc. Systems and methods for an automated personalized dictionary generator for portable devices
US20100017232A1 (en) * 2008-07-18 2010-01-21 StevenDale Software, LLC Information Transmittal And Notification System
US20100145720A1 (en) * 2008-12-05 2010-06-10 Bruce Reiner Method of extracting real-time structured data and performing data analysis and decision support in medical reporting
US8556169B2 (en) 2009-01-12 2013-10-15 Visa U.S.A. Inc. Opt in system and method
US8423353B2 (en) * 2009-03-25 2013-04-16 Microsoft Corporation Sharable distributed dictionary for applications
KR101158860B1 (en) * 2009-12-31 2012-06-25 에이든 정일 조 Method for searching subjective analysis according to mental status and system therefor
CN102411563B (en) * 2010-09-26 2015-06-17 阿里巴巴集团控股有限公司 Method, device and system for identifying target words
US20120239381A1 (en) * 2011-03-17 2012-09-20 Sap Ag Semantic phrase suggestion engine
US8527451B2 (en) 2011-03-17 2013-09-03 Sap Ag Business semantic network build
US8725760B2 (en) 2011-05-31 2014-05-13 Sap Ag Semantic terminology importer
US9240011B2 (en) 2011-07-13 2016-01-19 Visa International Service Association Systems and methods to communicate with transaction terminals
US8935230B2 (en) 2011-08-25 2015-01-13 Sap Se Self-learning semantic search engine
US8892575B2 (en) * 2012-06-06 2014-11-18 Sap Se Dictionary entry name generator
US20140156590A1 (en) 2012-12-05 2014-06-05 Chegg, Inc. Producing automated terms listings in html document publishing with enhanced terms definitions
US20150088493A1 (en) * 2013-09-20 2015-03-26 Amazon Technologies, Inc. Providing descriptive information associated with objects
US11210669B2 (en) 2014-10-24 2021-12-28 Visa International Service Association Systems and methods to set up an operation at a computer system connected with a plurality of computer systems via a computer network using a round trip communication of an identifier of the operation
EP3023895A1 (en) * 2014-11-24 2016-05-25 Siemens Aktiengesellschaft Method for comparing medical data sets
US10490306B2 (en) 2015-02-20 2019-11-26 Cerner Innovation, Inc. Medical information translation system
US20180025121A1 (en) * 2016-07-20 2018-01-25 Baidu Usa Llc Systems and methods for finer-grained medical entity extraction
US9842297B1 (en) * 2016-09-29 2017-12-12 International Business Machines Corporation Establishing industry ground truth
US10733224B2 (en) * 2017-02-07 2020-08-04 International Business Machines Corporation Automatic corpus selection and halting condition detection for semantic asset expansion
US11941033B2 (en) * 2017-11-27 2024-03-26 Affirm, Inc. Method and system for syntactic searching
CN110349639B (en) * 2019-07-12 2022-01-04 之江实验室 Multi-center medical term standardization system based on general medical term library
US11645344B2 (en) 2019-08-26 2023-05-09 Experian Health, Inc. Entity mapping based on incongruent entity data
US11763390B2 (en) 2019-12-31 2023-09-19 Cerner Innovation, Inc. Intelligently linking payer/health plan combinations to specific employers
US11269901B2 (en) 2020-01-16 2022-03-08 International Business Machines Corporation Cognitive test advisor facility for identifying test repair actions
US20220114189A1 (en) * 2020-10-14 2022-04-14 International Business Machines Corporation Extraction of structured information from unstructured documents

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1996041288A1 (en) * 1995-06-07 1996-12-19 E-Systems, Inc. Apparatus and method for centralized storage of heterogeneous medical records in managed health care organization
US6055494A (en) * 1996-10-28 2000-04-25 The Trustees Of Columbia University In The City Of New York System and method for medical language extraction and encoding

Family Cites Families (44)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7284A (en) * 1850-04-16 Grate for cooking-stoves
US82868A (en) * 1868-10-06 parks
US51880A (en) * 1866-01-02 button
US41992A (en) * 1864-03-22 Improvement in dovetailing-machines
US5251316A (en) * 1991-06-28 1993-10-05 Digital Equipment Corporation Method and apparatus for integrating a dynamic lexicon into a full-text information retrieval system
JPH0689302A (en) * 1992-09-08 1994-03-29 Hitachi Ltd Dictionary memory
JP3220885B2 (en) * 1993-06-18 2001-10-22 株式会社日立製作所 Keyword assignment system
WO1995000914A1 (en) 1993-06-28 1995-01-05 Scott & White Memorial Hospital And Scott, Sherwood And Brindley Foundation Electronic medical record using text database
US5560005A (en) 1994-02-25 1996-09-24 Actamed Corp. Methods and systems for object-based relational distributed databases
NZ248751A (en) 1994-03-23 1997-11-24 Ryan John Kevin Text analysis and coding
CA2125300C (en) 1994-05-11 1999-10-12 Douglas J. Ballantyne Method and apparatus for the electronic distribution of medical information and patient services
US5664109A (en) * 1995-06-07 1997-09-02 E-Systems, Inc. Method for extracting pre-defined data items from medical service records generated by health care providers
US5875443A (en) * 1996-01-30 1999-02-23 Sun Microsystems, Inc. Internet-based spelling checker dictionary system with automatic updating
US5974389A (en) 1996-03-01 1999-10-26 Clark; Melanie Ann Medical record management system and process with improved workflow features
US5809471A (en) * 1996-03-07 1998-09-15 Ibm Corporation Retrieval of additional information not found in interactive TV or telephony signal by application using dynamically extracted vocabulary
US5819263A (en) 1996-07-19 1998-10-06 American Express Financial Corporation Financial planning system incorporating relationship and group management
US5924074A (en) 1996-09-27 1999-07-13 Azron Incorporated Electronic medical records system
JPH10198680A (en) * 1997-01-07 1998-07-31 Hitachi Ltd Distributed dictionary managing method and machine translating method using the method
JP3596210B2 (en) * 1997-01-16 2004-12-02 富士ゼロックス株式会社 Related word dictionary creation device
US6112183A (en) 1997-02-11 2000-08-29 United Healthcare Corporation Method and apparatus for processing health care transactions through a common interface in a distributed computing environment
JP3466857B2 (en) * 1997-03-06 2003-11-17 株式会社東芝 Dictionary updating method and dictionary updating system
JP3556425B2 (en) * 1997-03-18 2004-08-18 株式会社東芝 Shared dictionary updating method and dictionary server
US6018713A (en) 1997-04-09 2000-01-25 Coli; Robert D. Integrated system and method for ordering and cumulative results reporting of medical tests
US5991728A (en) 1997-04-30 1999-11-23 Deroyal Industries, Inc. Method and system for the tracking and profiling of supply usage in a health care environment
US5915240A (en) * 1997-06-12 1999-06-22 Karpf; Ronald S. Computer system and method for accessing medical information over a network
US6163781A (en) 1997-09-11 2000-12-19 Physician Weblink Technology Services, Inc. Object-to-relational data converter mapping attributes to object instance into relational tables
CA2233794C (en) 1998-02-24 2001-02-06 Luc Bessette Method and apparatus for the management of medical files
JP3531468B2 (en) * 1998-03-30 2004-05-31 株式会社日立製作所 Document processing apparatus and method
US6260021B1 (en) 1998-06-12 2001-07-10 Philips Electronics North America Corporation Computer-based medical image distribution system and method
US20020023067A1 (en) 1998-06-12 2002-02-21 Harry T. Garland Integrating a primary record viewing system with a different secondary record viewing system
US6311192B1 (en) 1998-09-29 2001-10-30 Electronic Data Systems Corporation Method for initiating workflows in an automated organization management system
WO2000025192A2 (en) 1998-10-26 2000-05-04 Visionary Medical, Inc. Prescription-controlled data collection system and method
US6785869B1 (en) * 1999-06-17 2004-08-31 International Business Machines Corporation Method and apparatus for providing a central dictionary and glossary server
US20010051879A1 (en) 1999-12-01 2001-12-13 Johnson Robin D. System and method for managing security for a distributed healthcare application
US20020007284A1 (en) 1999-12-01 2002-01-17 Schurenberg Kurt B. System and method for implementing a global master patient index
US20010051880A1 (en) 1999-12-01 2001-12-13 Schurenberg Kurt B. System and method for connecting a healthcare business to a plurality of laboratories
US20020007287A1 (en) 1999-12-16 2002-01-17 Dietmar Straube System and method for electronic archiving and retrieval of medical documents
WO2001069500A1 (en) 2000-03-10 2001-09-20 Medorder, Inc. Method and system for accessing healthcare information using an anatomic user interface
US20010027403A1 (en) 2000-03-31 2001-10-04 Peterson Robert B. System and method for employing targeted messaging in connection with the submitting of an insurance claim
JP2001325365A (en) * 2000-05-16 2001-11-22 Toppan Printing Co Ltd Method and system for care information processing
US20010051889A1 (en) 2000-06-08 2001-12-13 Haney Ralph C. System and method for managing contract labor activities
US8751248B2 (en) 2000-07-28 2014-06-10 Visual Telecommunications Network, Inc. Method, apparatus, and medium using a master control file for computer software interoperability between disparate operating systems
US20020082868A1 (en) 2000-12-27 2002-06-27 Pories Walter J. Systems, methods and computer program products for creating and maintaining electronic medical records
CA2477690A1 (en) 2002-03-06 2003-09-18 Siemens Medical Solutions Health Services Corporation System and method for providing a generic health care data repository

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Y. LIU AND Y. SATOMURA: "BUILDING A CONTROLLED HEALTH VOCABULARY IN JAPANESE", METHODS OF INFORMATION IN MEDICINE, vol. 40, no. 4, 2001, pages 307-314, XP008026570 Germany *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2624143A1 (en) * 2012-01-31 2013-08-07 Tata Consultancy Services Limited Automated dictionary creation for scientific terms
JP2013178757A (en) * 2012-01-31 2013-09-09 Tata Consultancy Services Ltd Automatic creation of dictionary for scientific terms
US10176188B2 (en) 2012-01-31 2019-01-08 Tata Consultancy Services Limited Automated dictionary creation for scientific terms
WO2014138838A1 (en) * 2013-03-15 2014-09-18 Whoplusyou Inc. Method and system for candidate matching using dynamic dictionary maintenance heuristics

Also Published As

Publication number Publication date
WO2003077151A3 (en) 2004-03-25
JP2005519411A (en) 2005-06-30
US20030233251A1 (en) 2003-12-18
CA2477689A1 (en) 2003-09-18
EP1481332A2 (en) 2004-12-01
US7580831B2 (en) 2009-08-25

Similar Documents

Publication Publication Date Title
US7580831B2 (en) Dynamic dictionary and term repository system
CN110335647B (en) Clinical data standardization system and standardized data acquisition method
US5832450A (en) Electronic medical record using text database
CN109584975B (en) Medical data standardization processing method and device
US7657521B2 (en) System and method for parsing medical data
US8086468B2 (en) Method for computerising and standardizing medical information
US8688476B2 (en) Interoperability tools and procedures to aggregate and consolidate lab test results
US7483924B2 (en) Methodology for mapping HL7 V2 standards to HL7 V3 standards
US20020128861A1 (en) Mapping clinical data with a health data dictionary
US20040220895A1 (en) Systems and methods for coding information
US20040122717A1 (en) Claim submission system and method
US20020147615A1 (en) Physician decision support system with rapid diagnostic code identification
CA2704637C (en) Systems and methods for interfacing with healthcare organization coding system
KR100739570B1 (en) Hospital information system and method
US20100138241A1 (en) System and Method for Computerized Medical Records Review
JP2012027926A (en) System and method for analyzing anonymous health management information
US20200013491A1 (en) Interoperable Record Matching Process
CN111081329A (en) Automatic clinical data entry method and device, electronic equipment and storage medium
CN112397159A (en) Automatic clinical test report input method and device, electronic equipment and storage medium
US20040078217A1 (en) System and method for managing prepartum medical records
EP1729235A1 (en) Structured reporting report data manager
US20050043968A1 (en) Message data processing system suitable for healthcare and other fields
WO2003085577A1 (en) A method and system for providing healthcare information
Ed Hammond How the past teaches the future: ACMI distinguished lecture
Waters et al. Automated identification of unstandardized medication data: A scalable and flexible data standardization pipeline using RxNorm on GEMINI multicenter hospital data

Legal Events

Date Code Title Description
AK Designated states. Kind code of ref document: A2; Designated state(s): CA JP
AL Designated countries for regional patents. Kind code of ref document: A2; Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR
121 EP: the EPO has been informed by WIPO that EP was designated in this application
WWE WIPO information: entry into national phase. Ref document number: 2003744193; Country of ref document: EP
WWE WIPO information: entry into national phase. Ref document number: 2477689; Country of ref document: CA
WWE WIPO information: entry into national phase. Ref document number: 2003575296; Country of ref document: JP
WWP WIPO information: published in national office. Ref document number: 2003744193; Country of ref document: EP