WO1999005324A1 - System for providing a polymorphism database - Google Patents
System for providing a polymorphism database Download PDFInfo
- Publication number
- WO1999005324A1 WO1999005324A1 PCT/US1998/015458 US9815458W WO9905324A1 WO 1999005324 A1 WO1999005324 A1 WO 1999005324A1 US 9815458 W US9815458 W US 9815458W WO 9905324 A1 WO9905324 A1 WO 9905324A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- records
- item
- computer
- polymo
- sequence
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/006—Call diverting means
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6809—Methods for determination or identification of nucleic acids involving differential detection
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
- G16B25/30—Microarray design
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B50/00—ICT programming tools or database systems specially adapted for bioinformatics
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2207/00—Type of exchange or network, i.e. telephonic medium, in which the telephonic communication takes place
- H04M2207/20—Type of exchange or network, i.e. telephonic medium, in which the telephonic communication takes place hybrid systems
- H04M2207/206—Type of exchange or network, i.e. telephonic medium, in which the telephonic communication takes place hybrid systems composed of PSTN and wireless network
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/46—Arrangements for calling a number of substations in a predetermined sequence until an answer is obtained
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/54—Arrangements for diverting calls for one subscriber to another predetermined subscriber
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S128/00—Surgery
- Y10S128/92—Computer assisted medical diagnostics
- Y10S128/922—Computer assisted medical diagnostics including image analysis
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/912—Applications of a database
- Y10S707/923—Intellectual property
- Y10S707/924—Patent procedure
- Y10S707/925—Drafting an application
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/953—Organization of data
- Y10S707/955—Object-oriented
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
- Y10S707/99934—Query formulation, input preparation, or translation
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99941—Database schema or data structure
- Y10S707/99944—Object-oriented database structure
- Y10S707/99945—Object-oriented database structure processing
Definitions
- the present invention relates to the collection and storage of information pertaining to chips for processing biological samples and thereby identifying polymorphisms.
- the genomes of all organisms undergo spontaneous mutation in the course of their continuing evolution generating variant forms of progenitor sequences (Gusella, Ann. Rev. Biochem. 55, 831-854 (1986)).
- the variant form may confer an evolutionary advantage or disadvantage relative to a progenitor form or may be neutral.
- a variant form confers a lethal disadvantage and is not transmitted to subsequent generations of the organism.
- a variant form confers an evolutionary advantage to the species and is eventually incorpofaTed'into the DNA of many or most members of the species and effectively becomes the progenitor form.
- both progenitor and variant form(s) survive and co-exist in a species population. The coexistence of multiple forms of a sequence gives rise to polymorphisms.
- an array of nucleic acid probes is fabricated at known locations on a chip or substrate.
- a fluorescently labeled nucleic acid is then brought into contact with the chip and a scanner generates an image file indicating the locations where the labeled nucleic acids bound to the chip.
- Based upon the identities of the probes at these locations it becomes possible to extract information such as the identity of polymorphic forms in of DNA or RNA.
- Such systems have been used to form, for example, arrays of DNA that may be used to study and detect mutations relevant to cystic fibrosis, the P53 gene (relevant to certain cancers), HIV, and other genetic characteristics. It would be highly useful to apply such arrays to the study of polymorphisms on a large scale.
- the present invention provides systems and methods for organizing information relating to study of polymorphisms.
- a database model is provided which interrelates information about one or more of, e.g, subjects from whom samples are extracted, primers used in extracting the DNA from the subjects, about the samples themselves, about experiments done on samples, about particular oligonucleotide probe arrays used to perform experiments, about analysis procedures performed on the samples, and about analysis results .
- the model is readily translatable into database languages such as SQL.
- the database model scales to permit storage of information about large numbers of subjects, samples, experiments, chips, etc.
- Applications include linkage studies to determine resistance to drugs, susceptibility to diseases, and study of every characteristic of humans and other organisms that is related genetic variability.
- Another application of a database constructed according to this model is quality control of the various steps of performing a polymorphism study. By preserving information about every step of a polymorphism study, one can assess the reliability of the results or use the preserved information as feedback to improve procedures.
- Fig. 1 illustrates an overall system and process for forming and analyzing arrays of biological materials such as DNA or RNA.
- Fig. 2 A illustrates a computer system suitable for use in conjunction with the overall system of Fig. 1.
- Fig. 2B illustrates a computer network suitable for use in conjunction with the overall system of Fig. 1.
- Fig. 3 illustrates a key for interpreting a database model.
- Figs. 4A-4H illustrate a database model for maintaining information for the system and process of Fig. 1 according to one embodiment of the present invention.
- Polymorphisms are detected in a target nucleic acid from an individual being analyzed.
- genomic DNA virtually any biological sample (other than pure red blood cells) is suitable.
- tissue samples include whole blood, semen, saliva, tears, urine, fecal material, sweat, buccal, skin and hair.
- tissue sample must be obtained from an organ in which the target nucleic acid is expressed.
- the target nucleic acid is a cytochrome P450
- the liver is a suitable source.
- PCR DNA Amplification
- PCR Protocols A Guide to Methods and Applications (eds. Innis, et al., Academic Press, San Diego, CA, 1990); Mattila et al. , Nucleic Acids Res. 19, 4967 (1991); Eckert et al., PCR Methods and Applications 1, 17 (1991); PCR (eds. McPherson et al., IRL Press, Oxford); and U.S. Patent 4,683,202 (each of which is incorporated by reference for all purposes).
- LCR ligase chain reaction
- NASBA nucleic acid based sequence amplification
- the latter two amplification methods involve isothermal reactions based on isothermal transcription, which produce both single stranded RNA (ssRNA) and double stranded DNA (dsDNA) as the amplification products in a ratio of about 30 or 100 to 1 , respectively.
- ssRNA single stranded RNA
- dsDNA double stranded DNA
- the first type of analysis is sometimes referred to as de novo characterization. This analysis compares target sequences in different individuals to identify points of variation, i.e., polymorphic sites. By analyzing groups of individuals representing the greatest ethnic diversity among humans and greatest breed and species variety in plants and animals, patterns characteristic of the most common alleles/haplotypes of the locus can be identified, and the frequencies of such populations in the population determined. Additional allelic frequencies can be determined for subpopulations characterized by criteria such as geography, race, or gender.
- the second type of analysis is determining which form(s) of a characterized polymo ⁇ hism are present in individuals under test. There are a variety of suitable procedures, which are discussed in turn.
- Allele-Specific Probes The design and use of allele-specific probes for analyzing polymo ⁇ hisms is described by e.g., Saiki et al., Nature 324, 163-166 (1986); Dattagupta, EP 235,726, Saiki, WO 89/11548. Allele-specific probes can be designed that hybridize to a segment of target DNA from one individual but do not hybridize to the corresponding segment from another individual due to the presence of different polymo ⁇ hic forms in the respective segments from the two individuals. Hybridization conditions should be sufficiently stringent that there is a significant difference in hybridization intensity between alleles, and preferably an essentially binary response, whereby a probe hybridizes to only one of the alleles.
- Some probes are designed to hybridize to a segment of target DNA such that the polymo ⁇ hic site aligns with a central position (e.g., in a 15 mer at the 7 position; in a 16 mer, at either the 8 or 9 position) of the probe. This design of probe achieves good discrimination in hybridization between different allelic forms.
- Allele-specific probes are often used in pairs, one member of a pair showing a perfect match to a reference form of a target sequence and the other member showing a perfect match to a variant form. Several pairs of probes can then be immobilized on the same support for simultaneous analysis of multiple polymo ⁇ hisms within the same target sequence.
- the polymo ⁇ hisms can also be identified by hybridization to nucleic acid arrays, some example of which are described by WO 95/11995 (inco ⁇ orated by reference in its entirety for all pu ⁇ oses).
- WO 95/11995 also describes subarrays that are optimized for detection of a variant forms of a precharacterized polymo ⁇ hism.
- Such a subarray contains probes designed to be complementary to a second reference sequence, which is an allelic variant of the first reference sequence.
- the second group of probes is designed by the same principles as described in the Examples except that the probes exhibit complementarity to the second reference sequence.
- a second group (or further groups) can be particular useful for analyzing short subsequences of the primary reference sequence in which multiple mutations are expected to occur within a short distance commensurate with the length of the probes (i.e. , two or more mutations within 9 to 21 bases).
- An allele-specific primer hybridizes to a site on target DNA overlapping a polymo ⁇ hism and only primes amplification of an allelic form to which the primer exhibits perfect complementarily. See Gibbs, Nucleic Acid Res. 17, 2427-2448 (1989). This primer is used in conjunction with a second primer which hybridizes at a distal site. Amplification proceeds from the two primers leading to a detectable product signifying the particular allelic form is present. A control is usually performed with a second pair of primers, one of which shows a single base mismatch at the polymo ⁇ hic site and the other of which exhibits perfect complementarily to a distal site. The single-base mismatch prevents amplification and no detectable product is formed.
- the direct analysis of the sequence of polymo ⁇ hisms of the present invention can be accomplished using either the dideoxy chain termination method or the Maxam Gilbert method (see Sambrook et al. , Molecular Cloning, A Laboratory Manual (2nd Ed., CSHP, New York 1989); Zyskind et al., Recombinant DNA Laboratory Manual, (Acad. Press, 1988)).
- Amplification products generated using the polymerase chain reaction can be analyzed by the use of denaturing gradient gel electrophoresis. Different alleles can be identified based on the different sequence-dependent melting properties and electrophoretic migration of DNA in solution. Erlich, ed. , PCR Technology, Principles and Applications for DNA Amplification, (W.H. Freeman and Co, New York, 1992), Chapter 7.
- Single-Strand Conformation Polvmo ⁇ hism Analysis Alleles of target sequences can be differentiated using single-strand conformation polymo ⁇ hism analysis, which identifies base differences by alteration in electrophoretic migration of single stranded PCR products, as described in Orita et al., Proc. Nat. Acad. Sci. 86, 2766-2770 (1989).
- Amplified PCR products can be generated as described above, and heated or otherwise denatured, to form single stranded amplification products.
- Single-stranded nucleic acids may refold or form secondary structures which are partially dependent on the base sequence.
- the different electrophoretic mobilities of single-stranded amplification products can be related to base- sequence difference between alleles of target sequences.
- Biological Material Analysis System One embodiment of the present invention operates in the context of a system for analyzing biological or other materials using arrays that themselves include probes that may be made of biological materials such as RNA or DNA.
- the VLSIPSTM and GeneChipTM technologies provide methods of making and using very large arrays of polymers, such as nucleic acids, on chips. See U.S. Patent No. 5,143,854 and PCT Patent Publication Nos. WO 90/15070 and 92/10092, each of which is hereby inco ⁇ orated by reference for all pu ⁇ oses.
- Nucleic acid probes on the chip are used to detect complementary nucleic acid sequences in a sample nucleic acid of interest (the "target" nucleic acid).
- Fig. 1 illustrates an overall system 100 for forming and analyzing arrays of biological materials such as RNA or DNA.
- a part of system 100 is a polymo ⁇ hism database 102.
- Polymo ⁇ hism database 102 includes information about, e.g., biological sources, preparation of samples, design of arrays, raw data obtained from applying experiments to chips, analysis procedures applied, and analysis results, etc.
- Polymo ⁇ hism database 102 facilitates large scale study of polymo ⁇ hisms.
- a chip design system 104 is used to design arrays of polymers such as biological polymers such as RNA or DNA.
- Chip design system 104 may be, for example, an appropriately programmed Sun Workstation or personal computer or workstation, such as an IBM PC equivalent, including appropriate memory and a CPU.
- Chip design system 104 obtains inputs from a user regarding chip design objectives including polymo ⁇ hisms of interest, and other inputs regarding the desired features of the array.
- chip design system 104 from external databases such as GenBank.
- the output of chip design system 104 is a set of chip design computer files in the form of, for example, a switch matrix, as described in PCT application WO 92/10092, and other associated computer files.
- the chip design computer files form a part of polymo ⁇ hism database 102.
- Systems for designing chips for study of polymo ⁇ hisms are disclosed in U.S. Patent No. 5,571,639 and in PCT application WO 95/11995, the contents of which are herein inco ⁇ orated by reference.
- the chip design files are input to a mask design system (not shown) that designs the lithographic masks used in the fabrication of arrays of molecules such as DNA.
- the mask design system designs the lithographic masks used in the fabrication of probe arrays.
- the mask design system generates mask design files that are then used by a mask construction system (not shown) to construct masks or other synthesis patterns such as chrome-on-glass masks for use in the fabrication of polymer arrays.
- the masks are used in a synthesis system (not shown).
- the synthesis system includes the necessary hardware and software used to fabricate arrays of polymers on a substrate or chip.
- the synthesis system includes a light source and a chemical flow cell on which the substrate or chip is placed. A mask is placed between the light source and the substrate/chip, and the two are translated relative to each other at appropriate times for deprotection of selected regions of the chip. Selected chemical reagents are directed through the flow cell for coupling to deprotected regions, as well as for washing and other operations.
- the substrates fabricated by the synthesis system are optionally diced into smaller chips.
- the output of the synthesis system is a chip ready for application of a target sample.
- a biological source 112 is, for example, tissue from a plant or animal.
- Various processing steps are applied to material from biological source 112 by a sample preparation system 114. Operation of sample preparation system 114 in the context of a polymo ⁇ hism study is discussed below in further detail.
- the prepared samples include nucleic acid sequences such as DNA.
- the nucleic acids may or may not bond to the probes.
- the nucleic acids can be tagged with fluoroscein labels to determine which probes have bonded to nucleotide sequences from the sample.
- the prepared samples will be placed in a scanning system 118.
- Scanning system 118 includes a detection device such as a confocal microscope or CCD (charge- coupled device) that is used to detect the location where labeled receptors have bound to the substrate.
- the output of scanning system 118 is an image file(s) indicating, in the case of fluorescein labeled receptor, the fluorescence intensity (photon counts or other related measurements, such as voltage) as a function of position on the substrate.
- image files may also form a part of polymo ⁇ hism database 102. Since higher photon counts will be observed where the labeled nucleic acid(s) has bound more strongly to the array of probes, and since the monomer sequence of the probes on the substrate is known as a function of position, it becomes possible to analyze the sequence(s) of the nucleic acid(s) that are complementary to the probes.
- the image files and the design of the chips are input to an analysis system 120 that, e.g., calls bases.
- analysis techniques are described in EPO Pub. No. 0717113A, the contents of which are herein inco ⁇ orated by reference.
- Chip design system 104, analysis system 120 and control portions of exposure system 116, sample preparation system 114, and scanning system 118 may be appropriately programmed computers such as a Sun workstation or IBM-compatible PC.
- An independent computer for each system may perform the computer-implemented functions of these systems or one computer may combine the computerized functions of two or more systems.
- One or more computers may maintain chip design database 102 independent of the computers operating the systems of Fig. 1 or chip design database 102 may be fully or partially maintained by these computers.
- Fig. 2 A depicts a block diagram of a host computer system 10 suitable for implementing the present invention.
- Host computer system 210 includes a bus 212 which interconnects major subsystems such as a central processor 214, a system memory 216 (typically RAM), an input/output (I/O) adapter 218, an external device such as a display screen 224 via a display adapter 226, a keyboard 232 and a mouse 234 via an I/O adapter 218, a SCSI host adapter 236, and a floppy disk drive 238 operative to receive a floppy disk 240.
- I/O input/output
- SCSI host adapter 236 may act as a storage interface to a fixed disk drive 242 or a CD-ROM player 244 operative to receive a CD-ROM 246.
- Fixed disk 244 may be a part of host computer system 210 or may be separate and accessed through other interface systems.
- a network interface 248 may provide a direct connection to a remote server via a telephone link or to the Internet.
- Network interface 248 may also connect to a local area network (LAN) or other network interconnecting many computer systems. Many other devices or subsystems (not shown) may be connected in a similar manner.
- LAN local area network
- Fig. 2 A depicts a network 260 interconnecting multiple computer systems
- Network 260 may be a local area network (LAN), wide area network (WAN), etc.
- Bioinformatics database 102 and the computer-related operations of the other elements of Fig. 2B may be divided amongst computer systems 210 in any way with network 260 being used to communicate information among the various computers.
- Portable storage media such as floppy disks may be used to carry information between computers instead of network 260.
- Polymo ⁇ hism database 102 is preferably a relational database with a complex internal structure.
- the structure and contents of chip design database 102 will be described with reference to a logical model depicted in Figs. 4A-4H that describes the contents of tables of the database as well as interrelationships among the tables.
- a visual depiction of this model will be an Entity Relationship Diagram (ERD) which includes entities, relationships, and attributes.
- ERPs Entity Relationship Diagram
- a detailed discussion of ERDs is found in "ERwin version 3.0 Methods Guide" available from Logic Works, Inc. of Princeton, NJ, the contents of which are herein inco ⁇ orated by reference.
- Those of skill in the art will appreciate that automated tools such as Developer 2000 available from Oracle will convert the ERD from Figs. 4A-4H directly into executable code such as SQL code for creating and operating the database.
- FIG. 3 is a key to the ERD that will be used to describe the contents of chip design database 102.
- a representative table 302 includes one or more key attributes 304 and one or more non-key attributes 306.
- Representative table 302 includes one or more records where each record includes fields corresponding to the listed attributes. The contents of the key fields taken together identify an individual record.
- each table is represented by a rectangle divided by a horizontal line. The fields or attributes above the line are key while the fields or attributes below the line are non-key.
- An identifying relationship 308 signifies that the key attribute of a parent table 310 is also a key attribute of a child table 312.
- a non-identifying relationship 314 signifies that the key attribute of a parent table 316 is also a non-key attribute of a child table 318. Where (FK) appears in parenthesis, it indicates that an attribute of one table is a key attribute of another table. Both the depicted non-identifying and identifying relationship are one to one-or-more relationships where one record in the parent table corresponds to one or more records in the child table.
- An alternative non-identifying relationship 324 is a one to zero-or-more relationship where one record in a parent table 320 corresponds to zero or more records in a child table 322.
- Database Model Figs. 4A-4H are entity relationship diagrams (ERDs) showing elements of polymo ⁇ hism database 102 according to one embodiment of the present invention. Each rectangle in the diagram corresponds to a table in database 102. First, the relationships and general contents of the various tables will be described.
- ERPs entity relationship diagrams
- FIG. 4 A illustrates core elements of database 102 according to one embodiment of the present invention.
- a subject table 402 lists organisms from which samples have been extracted for polymo ⁇ hism analysis or other tissue sources. Samples may also be obtained from tissue collections not associated with any one identified organism.
- Information stored within subject table 402 includes the name, gender, family, position with family, (e.g. , father , mother, etc.), and ethnic group. For human subjects, the name and family will preferably be represented in coded form to assure privacy.
- Associated with each subject is a species as listed in a species table 404. Also, a relationship may be defined among subjects a subject relationship table 406 which includes records corresponding to related subjects. These relationships may be father- mother, sibling, twins, etc.
- Subjects may be part of a group that is being studied, e.g., a group with a congenital disease, or a toxic reaction to a particular drug.
- the groups are listed in a subject group table 408. Participation of subjects in groups is defined by a subject participation table 410 which lists all group memberships. Samples and their attributes are listed in a sample table 412. Each sample has an associated sample type. The sample types are listed in a sample type table 414. Possible sample types include blood, urine, etc. Companies or institutions that provide samples are listed in a sample source table 416.
- Database 102 provides an item table 418 that includes records for items. There are various types of items that correspond to different stages of the sample preparation process. An "item derivation" transforms an item of one type into an item of another type. The following table lists various item types and item derivation types for a representative embodiment.
- Target Sequences of Extracted DNA PCR interest amplified
- Item derivations are listed in an item derivation table 420. It should be noted that derivations need not produce a change between item types. Each item derivation occurs in accordance with a protocol that characterizes the step or steps in the derivation. Protocols are listed in a protocol table 428. Each item derivation is performed by an employee listed in employee table 432.
- Hybridized chips i.e., chips that have had target applied
- hybridized chip table 424 Hybridized sample map table 426 lists the relationships between hybridized chips and the samples that have been applied to them.
- Stained hybridized chips are scanned in a process referred to here as a scan experiment.
- Scan experiments are listed in a scan experiment table 430.
- the scan experiment occurs in accordance with a protocol listed in protocol table 428.
- the scan experiment is performed by an employee listed in employee table 432.
- Fig. 4B depicts further details of the data model for items and item derivations.
- the various item types are listed in an item type table 434 and the various item derivation types are listed in an item derivation type table 436.
- the relationships between successive item types, e.g., sample and target are defined in an item type derivation table 438.
- An item has associated attributes. For example, for a target, database 102 may store the concentration, volume, location and/or remaining amount. All item attributes are stored in an item attribute table 440. Item attributes may be shared among multiple items. For example, a series of targets may all share a preparation date.
- An item attribute item map table 442 implements a many-to-many relationship between item attributes and items.
- item attribute type table 444 The various types of item attributes such as preparer, preparation date, etc. are listed in an item attribute type table 444. Each item type has corresponding attribute types. Some attribute types are, however, shared among various item types. Accordingly, there is a many-to-many relationship among item attribute types and item types that is implemented by an item type map table 446.
- Fig. 4B represent a powerfully general model of the sample preparation process. Changes in process steps that require changes in the type of information that should be stored may be implemented by changing and adding table contents rather than providing new tables or changing relationships among tables.
- Fig. 4C depicts a detailed data model for storing information about protocols according to the present invention. Protocols as stored in protocol table 428 represent information about particular processes that have been performed including item derivations, analyses, and scan experiments. Each protocol has an associated protocol template. Protocol templates identify protocol types. For example, one protocol template may be a PCR template. All protocols associated with the PCR template identify parameters for performing a PCR procedure. Protocol templates are listed in a protocol template table 448.
- a parameter table 450 lists all the parameters and their values for all the protocols listed in protocol table 428.
- a parameter template table 452 lists the various parameter types along with default values.
- An examples of a parameter template would be a PCR reaction temperature.
- the parameter template would include a default value for this parameter.
- Parameter table 450 might then list many different PCR reaction temperature values that would be used by many different protocols. If a parameter value has not been modified by the user, it inherits the standard value of the associated parameter template.
- a parameter template set is a set of parameter templates that are used for a particular pu ⁇ ose, e.g., in association with protocols according to one or more protocol templates. Parameter template sets are listed in a parameter template set table 454.
- a mapping between parameter template sets and protocol templates is defined by a protocol template set map table 458.
- Protocol templates may have associated lengthy verbal information about how to perform protocol steps.
- a protocol template document table 460 stores references to documents that include instructions for performing protocols. As with the items, the data model for protocols defined by Fig. 4C is highly general and allows significant changes in the way item derivations, analyses, and experiments are performed without changing the underlying data model.
- a fragment table 462 lists all the sequence fragments investigated in conjunction with database 102. Associated with each fragment are one or more primer pairs used to amplify the fragment in a PCR process.
- a primer pair table 464 lists all the primer pairs including information about whether the primer pair actually worked to amplify the fragment.
- PCR table 466 that lists records identifying the outcome of multiple PCR operations. The individual PCR operations are identified by reference to item derivation table 420.
- a single PCR operation may be used to amplify many different fragments and thus employ many different primer pairs.
- a single primer pair may be used in multiple PCR operations.
- Information about individual primers is stored in a primer table 470.
- each primer has an associated protocol in protocol table 428 that characterizes the primer preparation process.
- Information about primer orders is listed in a primer order table 472.
- Each primer order is to a vendor and the vendors are listed in a vendor table 474.
- Each primer order is made by an employee listed in employee table 432.
- a primer order design map table 476 implements a many-to-many relationship between primer orders and primers.
- a wafer table 478 lists wafers. When chips are produced, many chips are produced at the same time as part of a single wafer. Chip table 422 stores references to wafer table 478 for each chip and the location of each chip on its wafer at production time. Sometimes there is analytic significance associated with the location of a chip on the wafer. Each wafer is produced as part of a lot and the identify of the lot for each wafer is recorded by wafer table 478 as a reference to a lot table 480 that lists each lot.
- FIG. 4D depicts further details of tables pertaining to chip design that are preferably maintained within polymo ⁇ hism database 102 according to one embodiment of the present invention.
- a tiling design table 482 lists tiling designs. Each tiling design represents the application of a particular tiling format to a sequence to be investigated. Tiling formats indicate probe orientation, probe length, and the position within a probe of a single nucleotide polymo ⁇ hism being investigated. In a preferred embodiment, there may be very few tiling formats and they are listed in a tiling format table 484.
- a particular tiling design includes many atom designs specifying the design of a single atom.
- an atom is a group of typically four probes used to investigate a single base position with each probe hybridizing to a sequence including a different base at that position.
- Atom designs are listed in an atom design table 486. Records identifying the designs of individual probes are listed in a probe design table 488.
- a probe design role table 490 indicates the roles of probes listed in probe design table 488 in the atom designs of atom design table 486. For combinations of probe design and atom design, probe design role table 490 indicates which base the probe hybridizes to at the substitution position and whether the probe represents a match or a mismatch to the wild type.
- a probe data table 492 gives the hybridization intensity values for particular probes designs as determined in particular scan experiments. Each record of the table also gives the number of pixels used to determine the intensity value and the standard deviation of intensity as measured among the pixels.
- Figs. 4E-4G depict aspects of polymo ⁇ hism database 102 related to analysis procedures and their results according to one embodiment of the present invention.
- An analysis table 494 lists analyses performed. An analysis generally refers to a non-trivial transformation of data. Records of analysis table 494 include references to protocol table 428 to specify parameters used for each analysis. Analyses may take as their input raw data or the results of previous analyses.
- An analysis dependency table 496 lists dependencies among analyses where one analysis depends on the data developed by another analysis.
- An analysis input table 498 lists inputs for analyses listed in analysis table 494.
- a chip design sequence map table 500 correlates particular fragments with chip designs.
- a sequence position table 502 lists investigated sequence positions indicating their positions on a fragment. Records of sequence position table 502 reference a genomic sequence position table 504 which gives sequence positions in the genome rather than within individual fragments.
- a scan experiment set table 506 lists sets of scan experiments. This allows for groupings of experiments for individuals or populations to serve as the basis for polymo ⁇ hism analysis.
- a scan experiment used table 508 lists records indicating memberships of a scan experiment in a scan experiment set.
- a tiling data table 510 lists records identifying tiling designs as implemented in particular chips measured by particular scan experiments.
- An atom data table 512 lists the intensities measured for particular sequence positions as measured in scan experiments identified by the tiling data records.
- a subject sequence position data table 514 lists combinations of sequence position and scan experiment.
- a series of tables in Figs. 4E-4G correspond to different types of analysis that occur during the course of a polymo ⁇ hism investigation. The types presented here are merely representative. A parallel series of tables provide the analysis results.
- a polymo ⁇ hism analysis table 516 lists references to analysis table 494. The results of the performed polymo ⁇ hism analyses are listed in a polymo ⁇ hism position result table 518. A record of this table gives a result for a polymo ⁇ hism analysis for a particular position as determined based on a particular set of scan experiments. In one embodiment the result is whether a particular mutation is certain, likely, possible, or not possible at the position. The result may also be that the reference is wrong.
- a user polymo ⁇ hism analysis table 520 lists user inte ⁇ retations of results as listed in polymo ⁇ hism position result table 518.
- the records of user polymo ⁇ hism analysis table 520 are references to analysis table 494.
- the user inte ⁇ retations themselves are stored in a user polymo ⁇ hism analysis result table 522.
- Each result is a likelihood of a particular mutation at a position as considered by a user plus an accompanying user comment.
- a P-Hat analysis estimates the relative concentrations of wild type sequence and sequence having a particular mutation as determined in a particular scan experiment.
- a P-Hat analysis table 524 lists references to analysis table 494.
- An atom result table 526 gives estimates of the relative concentration along with upper and lower bounds and a maximum intensity. For heterozygous mutations, the estimates of relative concentration will cluster around 0.5 For homozygous mutations, the estimates should cluster around 1.0.
- Base call analyses are determinations of the base at a particular position for a particular individual that may be based on more than one experiments.
- a base call analysis table 528 lists references to analysis table 494.
- a base call result table 530 lists the called bases for particular combinations of sequence position and subject.
- a P-Hat grouping analysis determines a measure of likelihood that data in a set of scan experiments results from separate genotypes.
- P-hat grouping analyses are listed in a p-hat grouping analysis table 532 by reference to analysis table 494.
- P-hat grouping analysis results are listed in a mutation fraction result table 534.
- a group separation is given for various combinations of sequence position and scan experiment set.
- a clustering analysis determines an alternative measure of likelihood that data in a set of scan experiments results from separate genotypes.
- Clustering analyses are listed in a clustering analysis table 536 by reference to analysis table 494.
- Clustering analysis results are listed in a clustering result table 538.
- a clustering factor is given for various combinations of sequence position and scan experiment set. Fig.
- Hybridization intensity measurements made in scan experiments should be normalized over a set of scan experiments. The normalization should take into account differences in amplification level produced by different PCR processes. Normalization is done by region of sequence. A normalization region analysis determines the boundaries of a region to be normalized. The determination of boundaries takes into account that different fragments of sequence are amplified by different PCR procedures. A normalization region analysis table 540 lists normalization region analyses by reference to analysis table 494. A normalization region result table 542 lists the boundaries for each determined normalization region.
- Normalization values for identified normalization regions are themselves determined by normalization analyses. Normalization analyses are listed in a normalization analysis table 544 by reference to analysis table 494. A normalization result table 546 lists the normalization values for regions.
- a footprint analysis determines regions of sequence for which the hybridization intensity is elevated for the pu ⁇ oses of quality control. Footprint analyses are listed in a footprint analysis table 548 by reference to analysis table 494. Footprints are identified by sequence starting point and ending point in a particular scan experiment in a footprint table 550.
- Fig. 4G depicts tables pertaining to measurement quality according to one embodiment of the present invention.
- a tiling data quality analysis determines the quality of results from a scan experiment. These analyses are listed in a tiling data quality analysis table 552 by reference to analysis table 494. Tiling data quality analysis results are listed in a tiling data quality result table 554. The results include an average hybridization intensity value for perfect match or mismatch probes.
- a wild type call rate gives the fraction of atom data where the probe corresponding to the reference base has the highest hybridization intensity.
- a wild type call rate of around 1.0 indicates good quality. Where the call rate is less than 0.75, the scan experiment should be rejected.
- An accept data field indicates whether the analysis indicates rejection or acceptance.
- a difficult region result table 558 lists the regions identified as being difficult.
- Analysis dependency table 496 indicates interrelationships among the various analyses of Figs. 4E-4G.
- a footprint analysis may depend on a normalization analysis which may in turn depend on a normalization region analysis.
- a basecall analysis or PHatGrouping analysis may depend on an atom analysis.
- a polymo ⁇ hism analysis may depend on any of these analyses and/or a user polymo ⁇ hism analysis and/or a clustering analysis.
- FIG. 4H shows tables of polymo ⁇ hism database 102 related to efforts to seek patent protection according to one embodiment of the present invention.
- a polymo ⁇ hism patent sequence table 560 lists sequences for which patent protection is sought.
- a patent application table 562 lists patent applications directed toward the protection of polymo ⁇ hisms.
- a polymer patent application sequence map table 564 implements a many-to-many relationship between patent applications and sequences.
- a prior application table 566 lists relationships between patent applications and prior related patent applications.
- An attorney table 568 lists attorneys responsible for preparing patent applications listed in patent application table 562.
- a law firm table 570 lists the law firms to which the attorneys listed in attorney table 568 belong.
- An employee group table 572 lists groups of inventors for the patent applications listed in table 562. Individual inventors are listed in employee table 432.
- An employee group map table 574 implements a many-to-many relationship between inventors and groups of inventors.
- the data model of Fig. 4H greatly facilitates the process of securing patent protection for polymo ⁇ hisms and thereby increases the commercial incentive for investigation of polymo ⁇ hisms.
- TypeId INTEGER Renamed reference to parameter template set type table.
- ParamTempSetTypeId INTEGER Parameter template set type identifier.
- VARCHAR2(50) Description of parameter template set type.
- ProtocolTemplateld.TNTEGER Reference to protocol template table.
- ParamTemplateSetId INTEGER Reference to parameter template set table.
- ProtocolID INTEGER Reference to protocol table.
- DatePerformed DATE Date analysis performed.
- SequencePositionID NUMBER Reference to sequence position table.
- ScanExperimentSetID NUMBER Reference to scan experiment set table.
- NormalizationRegionID INTEGER Normalization region identifier.
- ChipDesignSequenceMapID NUMBER Reference to chip design sequence map table.
- RegionEnd INTEGER Indication of end of the normalization region.
- NormalizationResultID INTEGER Normalization result identifier.
- TilingDataID INTEGER Reference to tiling data table.
- NormalizationRegionResultlD INTEGER Reference to normalization result.
- Normalization Value NUMBER Value used for normalization.
- Avg WTIntensity NUMBER Average wild type intensity.
- AcceptData INTEGER Whether data is of acceptable quality.
- ChipDesignSequenceMapID NUMBER Reference to chip design sequence map table.
- RgnStart NUMBER Beginning of difficult region in sequence.
- RgnEnd NUMBER End of difficult region in sequence. It is understood that the examples and embodiments described herein are for illustrative pu ⁇ oses only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and scope of the appended claims. For example, tables may be deleted, contents of multiple tables may be consolidated, or contents of one or more tables may be distributed among more tables than described herein to improve query speeds and/or to aid system maintenance. Also, the database architecture and data models described herein are not limited to biological applications but may be used in any application. All publications, patents, and patent applications cited herein are hereby inco ⁇ orated by reference.
Abstract
Systems and methods for organising information relating to a study of polymorphisms. A database model (102) is provided which interralates information about one or more of, e.g, subjects (112) from whom samples (114) are extracted, primers used in extracting the DNA from the subjects, about the samples themselves, about experiments done on samples, about particular oligonucleotide probe arrays used to perform experiments, about analysis procedures performed on the samples, and about analysis results. The model is readily translatable into database languages such as SQL. The database model scales to permit storage of information about large numbers of subjects, samples, experiments, chips, etc.
Description
SYSTEM FOR PROVIDING A POLYMORPHISM DATABASE
CROSS-REFERENCE TO RELATED APPLICATIONS
The present application claims priority from U.S. Pro v. App. No. 60/053,842 filed July 25, 1997, entitled COMPREHENSIVE BIO-INFORMATICS DATABASE, from U.S. Prov. App. No. 60/069,198 filed on December 11, 1997, entitled COMPREHENSIVE DATABASE FOR BIOINFORMATICS , and from U.S. Prov. App. No. 60/069,436, entitled GENE EXPRESSION AND EVALUATION SYSTEM, filed on December 11, 1997. The contents of all three provisional applications are herein incorporated by reference.
The subject matter of the present application is related to the subject matter of the following three co-assigned applications filed on the same day as the present application. GENE EXPRESSION AND EVALUATION SYSTEM (Attorney Docket No. 018547-035010), METHOD AND APPARATUS FOR PROVIDING A BIOINFORMATICS DATABASE (Attorney Docket No. 018547-033810), METHOD AND SYSTEM FOR PROVIDING A PROBE ARRAY CHIP DESIGN DATABASE (Attorney Docket No. 018547-033830). The contents of these three applications are herein incorporated by reference.
BACKGROUND OF THE INVENTION The present invention relates to the collection and storage of information pertaining to chips for processing biological samples and thereby identifying polymorphisms. The genomes of all organisms undergo spontaneous mutation in the course of their continuing evolution generating variant forms of progenitor sequences (Gusella, Ann. Rev. Biochem. 55, 831-854 (1986)). The variant form may confer an evolutionary advantage or disadvantage relative to a progenitor form or may be neutral. In some instances, a variant form confers a lethal disadvantage and is not transmitted to subsequent generations of the organism. In other instances, a variant form confers an
evolutionary advantage to the species and is eventually incorpofaTed'into the DNA of many or most members of the species and effectively becomes the progenitor form. In many instances, both progenitor and variant form(s) survive and co-exist in a species population. The coexistence of multiple forms of a sequence gives rise to polymorphisms.
Despite the increased amount of nucleotide sequence data being generated in recent years, only a minute proportion of the total repository of polymorphisms in humans and other organisms has so far been identified. The paucity of polymorphisms hitherto identified is due to the large amount of work required for their detection by conventional methods. For example, a conventional approach to identifying polymorphisms might be to sequence the same stretch of oligonucleotides in a population of individuals by dideoxy sequencing. In this type of approach, the amount of work increases in proportion to both the length of sequence and the number of individuals in a population and becomes impractical for large stretches of DNA or large numbers of persons.
Devices and computer systems for forming and using arrays of materials on a substrate have been developed. These devices and systems have been used for identifying polymorphisms. For example, PCT application WO92/10588, incorporated herein by reference for all purposes, describes techniques for sequencing or sequence checking nucleic acids and other materials. Arrays for performing these operations may be formed in arrays according to the methods of, for example, the pioneering techniques disclosed in U.S. Patent No. 5,143,854 and U.S. Patent No. 5,571,639, both incorporated herein by reference for all purposes.
According to one aspect of the techniques described therein, an array of nucleic acid probes is fabricated at known locations on a chip or substrate. A fluorescently labeled nucleic acid is then brought into contact with the chip and a scanner generates an image file indicating the locations where the labeled nucleic acids bound to the chip. Based upon the identities of the probes at these locations, it becomes possible to extract information such as the identity of polymorphic forms in of DNA or RNA. Such systems have been used to form, for example, arrays of DNA that may be used to study and detect mutations relevant to cystic fibrosis, the P53 gene (relevant to certain cancers), HIV, and other genetic characteristics.
It would be highly useful to apply such arrays to the study of polymorphisms on a large scale. For example, it would be useful to conduct large scale studies on the correlation between certain polymorphisms and individual characteristics such as susceptibility to diseases and effectiveness of drug treatments. To achieve these benefits, it is contemplated that the operations of chip design, construction, sample preparation, and analysis will occur on a very large scale. The quantity of information related to each of these steps to store and correlate is vast. For large scale polymorphism studies, it will be necessary to store this information in a way to facilitate later advantageous querying and retrieval. What is needed is a system and method suitable for storing and organizing large quantities of information used in conjunction with polymorphism studies.
SUMMARY OF THE INVENTION
The present invention provides systems and methods for organizing information relating to study of polymorphisms. A database model is provided which interrelates information about one or more of, e.g, subjects from whom samples are extracted, primers used in extracting the DNA from the subjects, about the samples themselves, about experiments done on samples, about particular oligonucleotide probe arrays used to perform experiments, about analysis procedures performed on the samples, and about analysis results . The model is readily translatable into database languages such as SQL. The database model scales to permit storage of information about large numbers of subjects, samples, experiments, chips, etc.
Applications include linkage studies to determine resistance to drugs, susceptibility to diseases, and study of every characteristic of humans and other organisms that is related genetic variability. Another application of a database constructed according to this model is quality control of the various steps of performing a polymorphism study. By preserving information about every step of a polymorphism study, one can assess the reliability of the results or use the preserved information as feedback to improve procedures.
A further understanding of the nature and advantages of the inventions herein may be realized by reference to the remaining portions of the specification and the attached drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1 illustrates an overall system and process for forming and analyzing arrays of biological materials such as DNA or RNA.
Fig. 2 A illustrates a computer system suitable for use in conjunction with the overall system of Fig. 1.
Fig. 2B illustrates a computer network suitable for use in conjunction with the overall system of Fig. 1.
Fig. 3 illustrates a key for interpreting a database model.
Figs. 4A-4H illustrate a database model for maintaining information for the system and process of Fig. 1 according to one embodiment of the present invention.
DESCRIPTION OF SPECIFIC EMBODIMENTS
Investigation of Polymorphisms
A. Preparation of Samples
Polymorphisms are detected in a target nucleic acid from an individual being analyzed. For assay of genomic DNA, virtually any biological sample (other than pure red blood cells) is suitable. For example, convenient tissue samples include whole blood, semen, saliva, tears, urine, fecal material, sweat, buccal, skin and hair. For assay of cDNA or mRNA, the tissue sample must be obtained from an organ in which the target nucleic acid is expressed. For example, if the target nucleic acid is a cytochrome P450, the liver is a suitable source.
Many of the methods described below require amplification of DNA from target samples. This can be accomplished by e.g., PCR. See generally PCR Technology: Principles and Applications for DNA Amplification (ed. H.A. Erlich, Freeman Press, NY, NY, 1992); PCR Protocols: A Guide to Methods and Applications (eds. Innis, et al., Academic Press, San Diego, CA, 1990); Mattila et al. , Nucleic Acids Res. 19, 4967 (1991); Eckert et al., PCR Methods and Applications 1, 17 (1991); PCR (eds. McPherson et al., IRL Press, Oxford); and U.S. Patent 4,683,202 (each of which is incorporated by reference for all purposes).
Other suitable amplification methods include the ligase chain reaction (LCR) (see Wu and Wallace, Genomics 4, 560 (1989), Landegren et al., Science 241, 1077 (1988), transcription amplification (Kwoh et al., Proc. Natl. Acad. Sci. USA 86, 1173 (1989)), and self-sustained sequence replication (Guatelli et al., Proc. Nat. Acad.
Sci. USA, 87, 1874 (1990)) and nucleic acid based sequence amplification (NASBA).
The latter two amplification methods involve isothermal reactions based on isothermal transcription, which produce both single stranded RNA (ssRNA) and double stranded DNA (dsDNA) as the amplification products in a ratio of about 30 or 100 to 1 , respectively.
B. Detection of Polymorphisms in Target DNA
There are two distinct types of analysis depending whether a polymorphism in question has already been characterized. The first type of analysis is sometimes referred to as de novo characterization. This analysis compares target sequences in different individuals to identify points of variation, i.e., polymorphic sites. By analyzing groups of individuals representing the greatest ethnic diversity among humans and greatest breed and species variety in plants and animals, patterns characteristic of the most common alleles/haplotypes of the locus can be identified, and the frequencies of such populations in the population determined. Additional allelic frequencies can be determined for subpopulations characterized by criteria such as geography, race, or gender. The second type of analysis is determining which form(s) of a characterized polymoφhism are present in individuals under test. There are a variety of suitable procedures, which are discussed in turn.
1. Allele-Specific Probes The design and use of allele-specific probes for analyzing polymoφhisms is described by e.g., Saiki et al., Nature 324, 163-166 (1986); Dattagupta, EP 235,726, Saiki, WO 89/11548. Allele-specific probes can be designed that hybridize to a segment of target DNA from one individual but do not hybridize to the corresponding segment from another individual due to the presence of different polymoφhic forms in the respective segments from the two individuals. Hybridization conditions should be sufficiently stringent that there is a significant difference in hybridization intensity between alleles, and preferably an essentially binary response, whereby a probe hybridizes to only one of the alleles. Some probes are designed to hybridize to a segment of target DNA such that the polymoφhic site aligns with a central position (e.g., in a 15 mer at the 7 position; in a 16 mer, at either the 8 or 9 position) of the
probe. This design of probe achieves good discrimination in hybridization between different allelic forms.
Allele-specific probes are often used in pairs, one member of a pair showing a perfect match to a reference form of a target sequence and the other member showing a perfect match to a variant form. Several pairs of probes can then be immobilized on the same support for simultaneous analysis of multiple polymoφhisms within the same target sequence.
2. Tiling Arrays
The polymoφhisms can also be identified by hybridization to nucleic acid arrays, some example of which are described by WO 95/11995 (incoφorated by reference in its entirety for all puφoses). WO 95/11995 also describes subarrays that are optimized for detection of a variant forms of a precharacterized polymoφhism. Such a subarray contains probes designed to be complementary to a second reference sequence, which is an allelic variant of the first reference sequence. The second group of probes is designed by the same principles as described in the Examples except that the probes exhibit complementarity to the second reference sequence. The inclusion of a second group (or further groups) can be particular useful for analyzing short subsequences of the primary reference sequence in which multiple mutations are expected to occur within a short distance commensurate with the length of the probes (i.e. , two or more mutations within 9 to 21 bases).
3. Allele-Specific Primers
An allele-specific primer hybridizes to a site on target DNA overlapping a polymoφhism and only primes amplification of an allelic form to which the primer exhibits perfect complementarily. See Gibbs, Nucleic Acid Res. 17, 2427-2448 (1989). This primer is used in conjunction with a second primer which hybridizes at a distal site. Amplification proceeds from the two primers leading to a detectable product signifying the particular allelic form is present. A control is usually performed with a second pair of primers, one of which shows a single base mismatch at the polymoφhic site and the other of which exhibits perfect complementarily to a distal site. The single-base mismatch prevents amplification and no detectable product is formed. The method works best when the mismatch is included in the 3 '-most position of the oligonucleotide aligned with the polymoφhism because this position is most destabilizing to elongation from the primer. See, e.g., WO 93/22456.
4. Direct-Sequencing
The direct analysis of the sequence of polymoφhisms of the present invention can be accomplished using either the dideoxy chain termination method or the Maxam Gilbert method (see Sambrook et al. , Molecular Cloning, A Laboratory Manual (2nd Ed., CSHP, New York 1989); Zyskind et al., Recombinant DNA Laboratory Manual, (Acad. Press, 1988)).
5. Denaturing Gradient Gel Electrophoresis
Amplification products generated using the polymerase chain reaction can be analyzed by the use of denaturing gradient gel electrophoresis. Different alleles can be identified based on the different sequence-dependent melting properties and electrophoretic migration of DNA in solution. Erlich, ed. , PCR Technology, Principles and Applications for DNA Amplification, (W.H. Freeman and Co, New York, 1992), Chapter 7.
6. Single-Strand Conformation Polvmoφhism Analysis Alleles of target sequences can be differentiated using single-strand conformation polymoφhism analysis, which identifies base differences by alteration in electrophoretic migration of single stranded PCR products, as described in Orita et al., Proc. Nat. Acad. Sci. 86, 2766-2770 (1989). Amplified PCR products can be generated as described above, and heated or otherwise denatured, to form single stranded amplification products. Single-stranded nucleic acids may refold or form secondary structures which are partially dependent on the base sequence. The different electrophoretic mobilities of single-stranded amplification products can be related to base- sequence difference between alleles of target sequences.
Biological Material Analysis System One embodiment of the present invention operates in the context of a system for analyzing biological or other materials using arrays that themselves include probes that may be made of biological materials such as RNA or DNA. The VLSIPS™ and GeneChip™ technologies provide methods of making and using very large arrays of polymers, such as nucleic acids, on chips. See U.S. Patent No. 5,143,854 and PCT Patent Publication Nos. WO 90/15070 and 92/10092, each of which is hereby incoφorated by reference for all puφoses. Nucleic acid probes on the chip are used to
detect complementary nucleic acid sequences in a sample nucleic acid of interest (the "target" nucleic acid).
Fig. 1 illustrates an overall system 100 for forming and analyzing arrays of biological materials such as RNA or DNA. A part of system 100 is a polymoφhism database 102. Polymoφhism database 102 includes information about, e.g., biological sources, preparation of samples, design of arrays, raw data obtained from applying experiments to chips, analysis procedures applied, and analysis results, etc. Polymoφhism database 102 facilitates large scale study of polymoφhisms.
A chip design system 104 is used to design arrays of polymers such as biological polymers such as RNA or DNA. Chip design system 104 may be, for example, an appropriately programmed Sun Workstation or personal computer or workstation, such as an IBM PC equivalent, including appropriate memory and a CPU. Chip design system 104 obtains inputs from a user regarding chip design objectives including polymoφhisms of interest, and other inputs regarding the desired features of the array. Optionally, chip design system 104 from external databases such as GenBank. The output of chip design system 104 is a set of chip design computer files in the form of, for example, a switch matrix, as described in PCT application WO 92/10092, and other associated computer files. The chip design computer files form a part of polymoφhism database 102. Systems for designing chips for study of polymoφhisms are disclosed in U.S. Patent No. 5,571,639 and in PCT application WO 95/11995, the contents of which are herein incoφorated by reference.
The chip design files are input to a mask design system (not shown) that designs the lithographic masks used in the fabrication of arrays of molecules such as DNA. The mask design system designs the lithographic masks used in the fabrication of probe arrays. The mask design system generates mask design files that are then used by a mask construction system (not shown) to construct masks or other synthesis patterns such as chrome-on-glass masks for use in the fabrication of polymer arrays.
The masks are used in a synthesis system (not shown). The synthesis system includes the necessary hardware and software used to fabricate arrays of polymers on a substrate or chip. The synthesis system includes a light source and a chemical flow cell on which the substrate or chip is placed. A mask is placed between the light source and the substrate/chip, and the two are translated relative to each other at appropriate times for deprotection of selected regions of the chip. Selected chemical reagents are
directed through the flow cell for coupling to deprotected regions, as well as for washing and other operations. The substrates fabricated by the synthesis system are optionally diced into smaller chips. The output of the synthesis system is a chip ready for application of a target sample. Information about the mask design, mask construction, and probe array synthesis is presented by way of background. A biological source 112 is, for example, tissue from a plant or animal. Various processing steps are applied to material from biological source 112 by a sample preparation system 114. Operation of sample preparation system 114 in the context of a polymoφhism study is discussed below in further detail.
The prepared samples include nucleic acid sequences such as DNA. When the sample is applied to the chip by a sample exposure system 116, the nucleic acids may or may not bond to the probes. The nucleic acids can be tagged with fluoroscein labels to determine which probes have bonded to nucleotide sequences from the sample. The prepared samples will be placed in a scanning system 118. Scanning system 118 includes a detection device such as a confocal microscope or CCD (charge- coupled device) that is used to detect the location where labeled receptors have bound to the substrate. The output of scanning system 118 is an image file(s) indicating, in the case of fluorescein labeled receptor, the fluorescence intensity (photon counts or other related measurements, such as voltage) as a function of position on the substrate. These image files may also form a part of polymoφhism database 102. Since higher photon counts will be observed where the labeled nucleic acid(s) has bound more strongly to the array of probes, and since the monomer sequence of the probes on the substrate is known as a function of position, it becomes possible to analize the sequence(s) of the nucleic acid(s) that are complementary to the probes.
The image files and the design of the chips are input to an analysis system 120 that, e.g., calls bases. Such analysis techniques are described in EPO Pub. No. 0717113A, the contents of which are herein incoφorated by reference.
Chip design system 104, analysis system 120 and control portions of exposure system 116, sample preparation system 114, and scanning system 118 may be appropriately programmed computers such as a Sun workstation or IBM-compatible PC. An independent computer for each system may perform the computer-implemented functions of these systems or one computer may combine the computerized functions of
two or more systems. One or more computers may maintain chip design database 102 independent of the computers operating the systems of Fig. 1 or chip design database 102 may be fully or partially maintained by these computers.
Fig. 2 A depicts a block diagram of a host computer system 10 suitable for implementing the present invention. Host computer system 210 includes a bus 212 which interconnects major subsystems such as a central processor 214, a system memory 216 (typically RAM), an input/output (I/O) adapter 218, an external device such as a display screen 224 via a display adapter 226, a keyboard 232 and a mouse 234 via an I/O adapter 218, a SCSI host adapter 236, and a floppy disk drive 238 operative to receive a floppy disk 240. SCSI host adapter 236 may act as a storage interface to a fixed disk drive 242 or a CD-ROM player 244 operative to receive a CD-ROM 246. Fixed disk 244 may be a part of host computer system 210 or may be separate and accessed through other interface systems. A network interface 248 may provide a direct connection to a remote server via a telephone link or to the Internet. Network interface 248 may also connect to a local area network (LAN) or other network interconnecting many computer systems. Many other devices or subsystems (not shown) may be connected in a similar manner.
Also, it is not necessary for all of the devices shown in Fig. 2 A to be present to practice the present invention, as discussed below. The devices and subsystems may be interconnected in different ways from that shown in Fig. 2A. The operation of a computer system such as that shown in Fig. 2 A is readily known in the art and is not discussed in detail in this application. Code to implement the present invention, may be operably disposed or stored in computer-readable storage media such as system memory 216, fixed disk 242, CD-ROM 246, or floppy disk 240. Fig. 2B depicts a network 260 interconnecting multiple computer systems
210. Network 260 may be a local area network (LAN), wide area network (WAN), etc. Bioinformatics database 102 and the computer-related operations of the other elements of Fig. 2B may be divided amongst computer systems 210 in any way with network 260 being used to communicate information among the various computers. Portable storage media such as floppy disks may be used to carry information between computers instead of network 260.
Overall Description of Database
Polymoφhism database 102 is preferably a relational database with a complex internal structure. The structure and contents of chip design database 102 will be described with reference to a logical model depicted in Figs. 4A-4H that describes the contents of tables of the database as well as interrelationships among the tables. A visual depiction of this model will be an Entity Relationship Diagram (ERD) which includes entities, relationships, and attributes. A detailed discussion of ERDs is found in "ERwin version 3.0 Methods Guide" available from Logic Works, Inc. of Princeton, NJ, the contents of which are herein incoφorated by reference. Those of skill in the art will appreciate that automated tools such as Developer 2000 available from Oracle will convert the ERD from Figs. 4A-4H directly into executable code such as SQL code for creating and operating the database.
Fig. 3 is a key to the ERD that will be used to describe the contents of chip design database 102. A representative table 302 includes one or more key attributes 304 and one or more non-key attributes 306. Representative table 302 includes one or more records where each record includes fields corresponding to the listed attributes. The contents of the key fields taken together identify an individual record. In the ERD, each table is represented by a rectangle divided by a horizontal line. The fields or attributes above the line are key while the fields or attributes below the line are non-key. An identifying relationship 308 signifies that the key attribute of a parent table 310 is also a key attribute of a child table 312. A non-identifying relationship 314 signifies that the key attribute of a parent table 316 is also a non-key attribute of a child table 318. Where (FK) appears in parenthesis, it indicates that an attribute of one table is a key attribute of another table. Both the depicted non-identifying and identifying relationship are one to one-or-more relationships where one record in the parent table corresponds to one or more records in the child table. An alternative non- identifying relationship 324 is a one to zero-or-more relationship where one record in a parent table 320 corresponds to zero or more records in a child table 322.
Database Model Figs. 4A-4H are entity relationship diagrams (ERDs) showing elements of polymoφhism database 102 according to one embodiment of the present invention. Each
rectangle in the diagram corresponds to a table in database 102. First, the relationships and general contents of the various tables will be described.
The interrelationships and general contents of the tables of database 102 will be described first. Then a chart will be presented listing and describing all of the fields of the various tables.
Fig. 4 A illustrates core elements of database 102 according to one embodiment of the present invention. A subject table 402 lists organisms from which samples have been extracted for polymoφhism analysis or other tissue sources. Samples may also be obtained from tissue collections not associated with any one identified organism. Information stored within subject table 402 includes the name, gender, family, position with family, (e.g. , father , mother, etc.), and ethnic group. For human subjects, the name and family will preferably be represented in coded form to assure privacy. Associated with each subject is a species as listed in a species table 404. Also, a relationship may be defined among subjects a subject relationship table 406 which includes records corresponding to related subjects. These relationships may be father- mother, sibling, twins, etc. Subjects may be part of a group that is being studied, e.g., a group with a congenital disease, or a toxic reaction to a particular drug. The groups are listed in a subject group table 408. Participation of subjects in groups is defined by a subject participation table 410 which lists all group memberships. Samples and their attributes are listed in a sample table 412. Each sample has an associated sample type. The sample types are listed in a sample type table 414. Possible sample types include blood, urine, etc. Companies or institutions that provide samples are listed in a sample source table 416.
Database 102 provides an item table 418 that includes records for items. There are various types of items that correspond to different stages of the sample preparation process. An "item derivation" transforms an item of one type into an item of another type. The following table lists various item types and item derivation types for a representative embodiment.
Item Type Derived from bv Item Derivation Type Sample other samples pooling
Sample other sample splitting
Extracted DNA Sample DNA Extraction
Target (Sequences of Extracted DNA PCR interest amplified)
Fluorescently Labeled Target Labeling
Target
Hybridized Chip Labeled Target Hybridization (application of target to chip)
Stained Hybridized Chip Hybridized Chip Staining
Item derivations are listed in an item derivation table 420. It should be noted that derivations need not produce a change between item types. Each item derivation occurs in accordance with a protocol that characterizes the step or steps in the derivation. Protocols are listed in a protocol table 428. Each item derivation is performed by an employee listed in employee table 432.
Unused chips are listed in a chip table 422. Hybridized chips (i.e., chips that have had target applied) are listed in a hybridized chip table 424. A hybridized sample map table 426 lists the relationships between hybridized chips and the samples that have been applied to them.
Stained hybridized chips are scanned in a process referred to here as a scan experiment. Scan experiments are listed in a scan experiment table 430. The scan experiment occurs in accordance with a protocol listed in protocol table 428. The scan experiment is performed by an employee listed in employee table 432.
Fig. 4B depicts further details of the data model for items and item derivations. The various item types are listed in an item type table 434 and the various item derivation types are listed in an item derivation type table 436. The relationships between successive item types, e.g., sample and target are defined in an item type derivation table 438. An item has associated attributes. For example, for a target, database 102 may store the concentration, volume, location and/or remaining amount. All item attributes are stored in an item attribute table 440. Item attributes may be shared among multiple items. For example, a series of targets may all share a preparation date. An item attribute item map table 442 implements a many-to-many
relationship between item attributes and items. The various types of item attributes such as preparer, preparation date, etc. are listed in an item attribute type table 444. Each item type has corresponding attribute types. Some attribute types are, however, shared among various item types. Accordingly, there is a many-to-many relationship among item attribute types and item types that is implemented by an item type map table 446.
The tables of Fig. 4B represent a powerfully general model of the sample preparation process. Changes in process steps that require changes in the type of information that should be stored may be implemented by changing and adding table contents rather than providing new tables or changing relationships among tables. Fig. 4C depicts a detailed data model for storing information about protocols according to the present invention. Protocols as stored in protocol table 428 represent information about particular processes that have been performed including item derivations, analyses, and scan experiments. Each protocol has an associated protocol template. Protocol templates identify protocol types. For example, one protocol template may be a PCR template. All protocols associated with the PCR template identify parameters for performing a PCR procedure. Protocol templates are listed in a protocol template table 448. A parameter table 450 lists all the parameters and their values for all the protocols listed in protocol table 428. A parameter template table 452 lists the various parameter types along with default values. An examples of a parameter template would be a PCR reaction temperature. The parameter template would include a default value for this parameter. Parameter table 450 might then list many different PCR reaction temperature values that would be used by many different protocols. If a parameter value has not been modified by the user, it inherits the standard value of the associated parameter template. A parameter template set is a set of parameter templates that are used for a particular puφose, e.g., in association with protocols according to one or more protocol templates. Parameter template sets are listed in a parameter template set table 454. There are different types of parameter template set and these are listed in a parameter template set table 456. A mapping between parameter template sets and protocol templates is defined by a protocol template set map table 458. Protocol templates may have associated lengthy verbal information about how to perform protocol steps. A protocol template document table 460 stores references to documents that include instructions for performing protocols.
As with the items, the data model for protocols defined by Fig. 4C is highly general and allows significant changes in the way item derivations, analyses, and experiments are performed without changing the underlying data model.
Referring again to Fig. 4 A, there are tables to record information concerning the use of primers in PCR. A fragment table 462 lists all the sequence fragments investigated in conjunction with database 102. Associated with each fragment are one or more primer pairs used to amplify the fragment in a PCR process. A primer pair table 464 lists all the primer pairs including information about whether the primer pair actually worked to amplify the fragment. In order to develop the information about the effectiveness of primer pairs, there is a PCR table 466 that lists records identifying the outcome of multiple PCR operations. The individual PCR operations are identified by reference to item derivation table 420.
A single PCR operation may be used to amplify many different fragments and thus employ many different primer pairs. Of course, a single primer pair may be used in multiple PCR operations. There is therefore a many-to-many relationship between PCR operations and primer pairs that is recorded by a primer pair PCR map table 468. Information about individual primers is stored in a primer table 470. Also, each primer has an associated protocol in protocol table 428 that characterizes the primer preparation process. Information about primer orders is listed in a primer order table 472. Each primer order is to a vendor and the vendors are listed in a vendor table 474. Each primer order is made by an employee listed in employee table 432. A primer order design map table 476 implements a many-to-many relationship between primer orders and primers.
The data model described here thus preserves information about primers used in PCR reactions. One can improve results by using primers that have successfully amplified a given fragment in the past. Sometimes particular groups of primer pairs cannot be multiplexed together in the same PCR process. The information preserved here thus permits experimenters to make optimal use of expensive and time consuming PCR procedures. It is also useful to preserve information about the chip production process and the origin of individual chips. A wafer table 478 lists wafers. When chips are produced, many chips are produced at the same time as part of a single wafer. Chip
table 422 stores references to wafer table 478 for each chip and the location of each chip on its wafer at production time. Sometimes there is analytic significance associated with the location of a chip on the wafer. Each wafer is produced as part of a lot and the identify of the lot for each wafer is recorded by wafer table 478 as a reference to a lot table 480 that lists each lot.
Fig. 4D depicts further details of tables pertaining to chip design that are preferably maintained within polymoφhism database 102 according to one embodiment of the present invention. A tiling design table 482 lists tiling designs. Each tiling design represents the application of a particular tiling format to a sequence to be investigated. Tiling formats indicate probe orientation, probe length, and the position within a probe of a single nucleotide polymoφhism being investigated. In a preferred embodiment, there may be very few tiling formats and they are listed in a tiling format table 484.
A particular tiling design includes many atom designs specifying the design of a single atom. In one embodiment, an atom is a group of typically four probes used to investigate a single base position with each probe hybridizing to a sequence including a different base at that position. Atom designs are listed in an atom design table 486. Records identifying the designs of individual probes are listed in a probe design table 488. A probe design role table 490 indicates the roles of probes listed in probe design table 488 in the atom designs of atom design table 486. For combinations of probe design and atom design, probe design role table 490 indicates which base the probe hybridizes to at the substitution position and whether the probe represents a match or a mismatch to the wild type.
A probe data table 492 gives the hybridization intensity values for particular probes designs as determined in particular scan experiments. Each record of the table also gives the number of pixels used to determine the intensity value and the standard deviation of intensity as measured among the pixels.
Figs. 4E-4G depict aspects of polymoφhism database 102 related to analysis procedures and their results according to one embodiment of the present invention. An analysis table 494 lists analyses performed. An analysis generally refers to a non-trivial transformation of data. Records of analysis table 494 include references to protocol table 428 to specify parameters used for each analysis. Analyses may take as their input raw data or the results of previous analyses. An analysis dependency table
496 lists dependencies among analyses where one analysis depends on the data developed by another analysis. An analysis input table 498 lists inputs for analyses listed in analysis table 494.
On the right side of Fig. 4E are various tables used to support analyses. A chip design sequence map table 500 correlates particular fragments with chip designs. A sequence position table 502 lists investigated sequence positions indicating their positions on a fragment. Records of sequence position table 502 reference a genomic sequence position table 504 which gives sequence positions in the genome rather than within individual fragments. A scan experiment set table 506 lists sets of scan experiments. This allows for groupings of experiments for individuals or populations to serve as the basis for polymoφhism analysis. A scan experiment used table 508 lists records indicating memberships of a scan experiment in a scan experiment set.
A tiling data table 510 lists records identifying tiling designs as implemented in particular chips measured by particular scan experiments. An atom data table 512 lists the intensities measured for particular sequence positions as measured in scan experiments identified by the tiling data records. A subject sequence position data table 514 lists combinations of sequence position and scan experiment.
A series of tables in Figs. 4E-4G correspond to different types of analysis that occur during the course of a polymoφhism investigation. The types presented here are merely representative. A parallel series of tables provide the analysis results. A polymoφhism analysis table 516 lists references to analysis table 494. The results of the performed polymoφhism analyses are listed in a polymoφhism position result table 518. A record of this table gives a result for a polymoφhism analysis for a particular position as determined based on a particular set of scan experiments. In one embodiment the result is whether a particular mutation is certain, likely, possible, or not possible at the position. The result may also be that the reference is wrong.
A user polymoφhism analysis table 520 lists user inteφretations of results as listed in polymoφhism position result table 518. The records of user polymoφhism analysis table 520 are references to analysis table 494. The user inteφretations themselves are stored in a user polymoφhism analysis result table 522. Each result is a
likelihood of a particular mutation at a position as considered by a user plus an accompanying user comment.
A P-Hat analysis estimates the relative concentrations of wild type sequence and sequence having a particular mutation as determined in a particular scan experiment. A P-Hat analysis table 524 lists references to analysis table 494. An atom result table 526 gives estimates of the relative concentration along with upper and lower bounds and a maximum intensity. For heterozygous mutations, the estimates of relative concentration will cluster around 0.5 For homozygous mutations, the estimates should cluster around 1.0. Base call analyses are determinations of the base at a particular position for a particular individual that may be based on more than one experiments. A base call analysis table 528 lists references to analysis table 494. A base call result table 530 lists the called bases for particular combinations of sequence position and subject.
A P-Hat grouping analysis determines a measure of likelihood that data in a set of scan experiments results from separate genotypes. P-hat grouping analyses are listed in a p-hat grouping analysis table 532 by reference to analysis table 494. P-hat grouping analysis results are listed in a mutation fraction result table 534. A group separation is given for various combinations of sequence position and scan experiment set. A clustering analysis determines an alternative measure of likelihood that data in a set of scan experiments results from separate genotypes. Clustering analyses are listed in a clustering analysis table 536 by reference to analysis table 494. Clustering analysis results are listed in a clustering result table 538. A clustering factor is given for various combinations of sequence position and scan experiment set. Fig. 4F shows tables which support normalization and footprint finding operations that support the analyses referred to in Fig. 4E. Hybridization intensity measurements made in scan experiments should be normalized over a set of scan experiments. The normalization should take into account differences in amplification level produced by different PCR processes. Normalization is done by region of sequence. A normalization region analysis determines the boundaries of a region to be normalized. The determination of boundaries takes into account that different fragments of sequence are amplified by
different PCR procedures. A normalization region analysis table 540 lists normalization region analyses by reference to analysis table 494. A normalization region result table 542 lists the boundaries for each determined normalization region.
Normalization values for identified normalization regions are themselves determined by normalization analyses. Normalization analyses are listed in a normalization analysis table 544 by reference to analysis table 494. A normalization result table 546 lists the normalization values for regions.
A footprint analysis determines regions of sequence for which the hybridization intensity is elevated for the puφoses of quality control. Footprint analyses are listed in a footprint analysis table 548 by reference to analysis table 494. Footprints are identified by sequence starting point and ending point in a particular scan experiment in a footprint table 550.
Fig. 4G depicts tables pertaining to measurement quality according to one embodiment of the present invention. A tiling data quality analysis determines the quality of results from a scan experiment. These analyses are listed in a tiling data quality analysis table 552 by reference to analysis table 494. Tiling data quality analysis results are listed in a tiling data quality result table 554. The results include an average hybridization intensity value for perfect match or mismatch probes. A wild type call rate gives the fraction of atom data where the probe corresponding to the reference base has the highest hybridization intensity. A wild type call rate of around 1.0 indicates good quality. Where the call rate is less than 0.75, the scan experiment should be rejected. An accept data field indicates whether the analysis indicates rejection or acceptance.
Where scan experiment measurements indicate two or more non-wild type bases within a probe length, this indicates a measurement problem for the affected region of sequence. These regions are identified by difficult region analyses listed in a difficult region analysis table 556 by reference to analysis table 494. A difficult region result table 558 lists the regions identified as being difficult.
Analysis dependency table 496 indicates interrelationships among the various analyses of Figs. 4E-4G. A footprint analysis may depend on a normalization analysis which may in turn depend on a normalization region analysis. A basecall analysis or PHatGrouping analysis may depend on an atom analysis. A polymoφhism
analysis may depend on any of these analyses and/or a user polymoφhism analysis and/or a clustering analysis.
Another aspect of the investigation of polymoφhisms is seeking patent protection for identified polymoφhisms. Fig. 4H shows tables of polymoφhism database 102 related to efforts to seek patent protection according to one embodiment of the present invention. A polymoφhism patent sequence table 560 lists sequences for which patent protection is sought. A patent application table 562 lists patent applications directed toward the protection of polymoφhisms. A polymer patent application sequence map table 564 implements a many-to-many relationship between patent applications and sequences. A prior application table 566 lists relationships between patent applications and prior related patent applications. An attorney table 568 lists attorneys responsible for preparing patent applications listed in patent application table 562. A law firm table 570 lists the law firms to which the attorneys listed in attorney table 568 belong.
An employee group table 572 lists groups of inventors for the patent applications listed in table 562. Individual inventors are listed in employee table 432. An employee group map table 574 implements a many-to-many relationship between inventors and groups of inventors.
The data model of Fig. 4H greatly facilitates the process of securing patent protection for polymoφhisms and thereby increases the commercial incentive for investigation of polymoφhisms.
Database Contents
The contents of the tables introduced above will now be presented in greater detail in the following chart.
TypeId:INTEGER Renamed reference to parameter template set type table.
Name:VARCHAR2(20) Name of parameter template set.
ParamTemplateSetType
ParamTempSetTypeId:INTEGER Parameter template set type identifier.
Description: VARCHAR2(50) Description of parameter template set type.
ParameterTemplateSetMap
ProtocolTemplateld.TNTEGER Reference to protocol template table.
ParamTemplateSetId:INTEGER Reference to parameter template set table.
AnalysisId:INTEGER Analysis identifier.
Analysis VersionID : INTEGER Reference to version of analysis.
ProtocolID:INTEGER Reference to protocol table.
DatePerformed: DATE Date analysis performed.
NeedsUpdate:NUMBER Whether analysis is current.
tblAnalysisDependency
ParentAnalysisId:INTEGER Analysis providing input.
SubAnalysisId:INTEGER Analysis receiving input.
Role:VARCHAR2(20) Role of data provided by parent analysis.
TblAnalysisInput
SuggestCheck:NUMBER Used to indicate whether this sample should be used for resequencing tblClusteringAnalysis
AnalysisId:INTEGER Reference to analysis table. tblClusteringResult
AnalysisId:INTEGER Reference to analysis table.
SequencePositionID : NUMBER Reference to sequence position table.
ScanExperimentSetID : NUMBER Reference to scan experiment set table.
ClusteringFactoπFLOAT Result of clustering analysis. tblNormalizationRegionAnalysis
AnalysisId:INTEGER Reference to analysis table. tblNormaiizationRegion
NormalizationRegionID : INTEGER Normalization region identifier.
AnalysisId:INTEGER Reference to analysis table.
TABLE FIELD COMMENT
ChipDesignSequenceMapID : NUMBER Reference to chip design sequence map table.
NumberScanExpt.Set Reference to scan experiment set table.
RegionEnd:INTEGER Indication of end of the normalization region.
RegionStart: INTEGER Indication of beginning of the normalization region. tblNormalizationAnalysis
AnalysisId:INTEGER Reference to analysis table. tblNormalizationResult
NormalizationResultID : INTEGER Normalization result identifier.
AnalysisId:INTEGER Reference to analysis table.
TilingDataID:INTEGER Reference to tiling data table.
NormalizationRegionResultlD : INTEGER Reference to normalization result.
AnalysisId:INTEGER Reference to analysis table.
Avg WTIntensity : NUMBER Average wild type intensity.
WTCallRate:NUMBER Fraction of atoms where brightest of probes is one with reference space.
AcceptData: INTEGER Whether data is of acceptable quality. tblDifficult Regionanalysis
AnalysisId:INTEGER Reference to analysis table. tblDifficultRegionResult
ScanExptId:INTEGER Reference to scan experiment table.
AnalysisId:INTEGER Reference to analysis table.
ChipDesignSequenceMapID : NUMBER Reference to chip design sequence map table.
RgnStart: NUMBER Beginning of difficult region in sequence.
RgnEnd: NUMBER End of difficult region in sequence.
It is understood that the examples and embodiments described herein are for illustrative puφoses only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and scope of the appended claims. For example, tables may be deleted, contents of multiple tables may be consolidated, or contents of one or more tables may be distributed among more tables than described herein to improve query speeds and/or to aid system maintenance. Also, the database architecture and data models described herein are not limited to biological applications but may be used in any application. All publications, patents, and patent applications cited herein are hereby incoφorated by reference.
Claims
WHAT IS CLAIMED IS: 1. A computer-readable storage medium having stored thereon: an item table listing a plurality of item records identifying items; an item attribute table listing a plurality of item attribute records identifying attributes of said items; and wherein there is a many-to-many relationship between item records and item attribute records.
2. The computer-readable storage medium of claim 1 wherein an item attribute item map table implements said many-to-many relationship between item records and item attribute records, said item attribute item map table listing a plurality of map records identifying both a particular item attribute and a particular item.
3. The computer-readable storage medium of claim 1 having further stored thereon: an item derivation table listing a plurality of item derivation records identifying transformations between ones of said items used in biological analysis.
4. The computer-readable storage medium of claim 3 having further stored thereon: a protocol table listing a plurality of protocol records specifying parameters of said transformation.
5. The computer-readable storage medium wherein said items are used in a biological analysis.
6. The computer-readable storage medium of claim 1 wherein said biological analysis comprises a polymoφhism analysis.
7. A computer-readable storage medium having stored thereon: an atom result table listing a plurality of atom result records, specifying relative wild-type and mutant sequence concentrations in targets; and a subject sequence position table listing a plurality of subject sequence position records, specifying combinations of subjects from whom said targets are derived and sequence positions, each said atom result record being associated with one or more atom result records.
8. The computer-readable storage medium of claim 7 wherein said atom result records further specify upper and lower bounds for said concentrations.
9. The computer-readable storage medium of claim 7 having further stored thereon: a subject table listing subject records specifying said subjects.
10. A computer-readable storage medium having stored thereon: a polymoφhism table listing polymoφhism sequence records specifying sequences known to contain polymoφhisms; and a patent application table listing patent application records specifying one or more polymoφhisms specified by said polymoφhism sequence records.
11. The computer-readable storage medium of claim 10 wherein said polymoφhism sequence records specify for each one of said polymoφhisms a polymoφhism position, a reference allele, and a base allele.
12. The computer-readable storage medium of claim 11 wherein said polymoφhism sequence records further specify for each one of said polymoφhisms a measured heterozygocity.
13. A computer- implemented method comprising: creating n item table listing a plurality of item records identifying items used in biological analysis; and creating an item attribute table listing a plurality of item attribute records identifying attributes of said items; and wherein there is a many-to-many relationship between item records and item attribute records.
14. The computer-implemented method of claim 13 further comprising the step of: creating an item attribute item map table implements said many- to-many relationship between item records and item attribute records, said item attribute item map table listing a plurality of map records identifying both a particular item attribute and a particular item.
15. The computer-implemented method of claim 13 comprising: an item derivation table listing a plurality of item derivation records identifying transformations between ones of said items used in biological analysis.
16. The computer-implemented method of claim 15 further comprising: creating a protocol table listing a plurality of protocol records specifying parameters of said transformation.
17. The computer-implemented method of claim 13 wherein said biological analysis comprises a polymoφhism analysis.
18. A computer-implemented method comprising: creating an atom result table listing a plurality of atom result records, specifying relative wild-type and mutant sequence concentrations in targets; and creating a subject sequence position table listing a plurality of subject sequence position records, specifying combinations of subjects from whom said targets are derived and sequence positions, each said atom result record being associated with one or more atom result records.
19. The computer- implemented method of claim 18 wherein said atom result records further specify upper and lower bounds for said concentrations.
20. The computer-implemented method of claim 18 further comprising: creating a subject table listing subject records specifying said subjects.
21. A computer-implemented method comprising: creating a polymoφhism table listing polymoφhism sequence records specifying sequences known to contain polymoφhisms; and creating a patent application table listing patent application records specifying one or more polymoφhisms specified by said polymoφhism sequence records.
22. The computer-implemented method of claim 21 wherein said polymoφhism sequence records specify for each one of said polymoφhisms a polymoφhism position, a reference allele, and a base allele.
23. The computer-implemented method of claim 22 wherein said polymoφhism sequence records further specify for at least one of said polymoφhisms a measured heterozygocity.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2000504291A JP2001515234A (en) | 1997-07-25 | 1998-07-24 | System for providing a polymorphism database |
EP98937128A EP1007737A4 (en) | 1997-07-25 | 1998-07-24 | System for providing a polymorphism database |
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US5384297P | 1997-07-25 | 1997-07-25 | |
US6919897P | 1997-12-11 | 1997-12-11 | |
US6943697P | 1997-12-11 | 1997-12-11 | |
US60/069,436 | 1997-12-11 | ||
US60/069,198 | 1997-12-11 | ||
US60/053,842 | 1997-12-11 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1999005324A1 true WO1999005324A1 (en) | 1999-02-04 |
Family
ID=27368502
Family Applications (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1998/015456 WO1999005574A1 (en) | 1997-07-25 | 1998-07-24 | Method and system for providing a probe array chip design database |
PCT/US1998/015151 WO1999005323A1 (en) | 1997-07-25 | 1998-07-24 | Gene expression and evaluation system |
PCT/US1998/015469 WO1999005591A2 (en) | 1997-07-25 | 1998-07-24 | Method and apparatus for providing a bioinformatics database |
PCT/US1998/015458 WO1999005324A1 (en) | 1997-07-25 | 1998-07-24 | System for providing a polymorphism database |
Family Applications Before (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1998/015456 WO1999005574A1 (en) | 1997-07-25 | 1998-07-24 | Method and system for providing a probe array chip design database |
PCT/US1998/015151 WO1999005323A1 (en) | 1997-07-25 | 1998-07-24 | Gene expression and evaluation system |
PCT/US1998/015469 WO1999005591A2 (en) | 1997-07-25 | 1998-07-24 | Method and apparatus for providing a bioinformatics database |
Country Status (6)
Country | Link |
---|---|
US (10) | US6484183B1 (en) |
EP (4) | EP1002264B1 (en) |
JP (6) | JP2001515234A (en) |
AT (1) | ATE264523T1 (en) |
DE (1) | DE69823206T2 (en) |
WO (4) | WO1999005574A1 (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6334099B1 (en) | 1999-05-25 | 2001-12-25 | Digital Gene Technologies, Inc. | Methods for normalization of experimental data |
US6713258B2 (en) | 2000-03-29 | 2004-03-30 | Center For The Application Of Molecular Biology To International Agriculture (Cambia) | Methods for genotyping by hybridization analysis |
US6931326B1 (en) | 2000-06-26 | 2005-08-16 | Genaissance Pharmaceuticals, Inc. | Methods for obtaining and using haplotype data |
US7058517B1 (en) | 1999-06-25 | 2006-06-06 | Genaissance Pharmaceuticals, Inc. | Methods for obtaining and using haplotype data |
GB2574582A (en) * | 2018-05-28 | 2019-12-18 | Rainer Gabriel Schweiger Martin | Method for simulating a technical device |
Families Citing this family (318)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA1223831A (en) | 1982-06-23 | 1987-07-07 | Dean Engelhardt | Modified nucleotides, methods of preparing and utilizing and compositions containing the same |
US5989835A (en) | 1997-02-27 | 1999-11-23 | Cellomics, Inc. | System for cell-based screening |
US6420108B2 (en) * | 1998-02-09 | 2002-07-16 | Affymetrix, Inc. | Computer-aided display for comparative gene expression |
JP2001515234A (en) * | 1997-07-25 | 2001-09-18 | アフィメトリックス インコーポレイテッド | System for providing a polymorphism database |
US7068830B2 (en) * | 1997-07-25 | 2006-06-27 | Affymetrix, Inc. | Method and system for providing a probe array chip design database |
US6826296B2 (en) * | 1997-07-25 | 2004-11-30 | Affymetrix, Inc. | Method and system for providing a probe array chip design database |
US6990221B2 (en) * | 1998-02-07 | 2006-01-24 | Biodiscovery, Inc. | Automated DNA array image segmentation and analysis |
US6349144B1 (en) * | 1998-02-07 | 2002-02-19 | Biodiscovery, Inc. | Automated DNA array segmentation and analysis |
WO1999049403A1 (en) * | 1998-03-26 | 1999-09-30 | Incyte Pharmaceuticals, Inc. | System and methods for analyzing biomolecular sequences |
US6324479B1 (en) * | 1998-05-08 | 2001-11-27 | Rosetta Impharmatics, Inc. | Methods of determining protein activity levels using gene expression profiles |
US6606622B1 (en) * | 1998-07-13 | 2003-08-12 | James M. Sorace | Software method for the conversion, storage and querying of the data of cellular biological assays on the basis of experimental design |
US20040199544A1 (en) * | 2000-11-02 | 2004-10-07 | Affymetrix, Inc. | Method and apparatus for providing an expression data mining database |
US6185561B1 (en) * | 1998-09-17 | 2001-02-06 | Affymetrix, Inc. | Method and apparatus for providing and expression data mining database |
WO2000029984A2 (en) | 1998-11-13 | 2000-05-25 | Cellomics, Inc. | Methods and system for efficient collection and storage of experimental data |
US6453241B1 (en) * | 1998-12-23 | 2002-09-17 | Rosetta Inpharmatics, Inc. | Method and system for analyzing biological response signal data |
US6245511B1 (en) * | 1999-02-22 | 2001-06-12 | Vialogy Corp | Method and apparatus for exponentially convergent therapy effectiveness monitoring using DNA microarray based viral load measurements |
US6136541A (en) | 1999-02-22 | 2000-10-24 | Vialogy Corporation | Method and apparatus for analyzing hybridized biochip patterns using resonance interactions employing quantum expressor functions |
US20040111219A1 (en) * | 1999-02-22 | 2004-06-10 | Sandeep Gulati | Active interferometric signal analysis in software |
US6142681A (en) | 1999-02-22 | 2000-11-07 | Vialogy Corporation | Method and apparatus for interpreting hybridized bioelectronic DNA microarray patterns using self-scaling convergent reverberant dynamics |
US6507788B1 (en) * | 1999-02-25 | 2003-01-14 | Société de Conseils de Recherches et D'Applications Scientifiques (S.C.R.A.S.) | Rational selection of putative peptides from identified nucleotide, or peptide sequences, of unknown function |
US6215894B1 (en) * | 1999-02-26 | 2001-04-10 | General Scanning, Incorporated | Automatic imaging and analysis of microarray biochips |
JP2000258368A (en) * | 1999-03-12 | 2000-09-22 | Jeol Ltd | X-ray microanalyzer having sound monitor function |
EP1041514B1 (en) * | 1999-03-30 | 2006-03-01 | Fuji Photo Film Co., Ltd. | Method and apparatus for selectively displaying measurement result and corresponding images |
WO2000062205A1 (en) * | 1999-04-13 | 2000-10-19 | Schulze Michael D | Method of obtaining an electronically-stored financial document |
US7151847B2 (en) | 2001-02-20 | 2006-12-19 | Cytokinetics, Inc. | Image analysis of the golgi complex |
US6651008B1 (en) | 1999-05-14 | 2003-11-18 | Cytokinetics, Inc. | Database system including computer code for predictive cellular bioinformatics |
US20030228565A1 (en) * | 2000-04-26 | 2003-12-11 | Cytokinetics, Inc. | Method and apparatus for predictive cellular bioinformatics |
US6743576B1 (en) | 1999-05-14 | 2004-06-01 | Cytokinetics, Inc. | Database system for predictive cellular bioinformatics |
US6876760B1 (en) | 2000-12-04 | 2005-04-05 | Cytokinetics, Inc. | Classifying cells based on information contained in cell images |
JP3469504B2 (en) * | 1999-06-01 | 2003-11-25 | 日立ソフトウエアエンジニアリング株式会社 | Microarray chip and indexing method thereof |
CA2375220A1 (en) * | 1999-06-11 | 2000-12-21 | Narayan Baidya | Gene specific arrays and the use thereof |
US6716579B1 (en) | 1999-06-11 | 2004-04-06 | Narayan Baidya | Gene specific arrays, preparation and use |
US6931396B1 (en) | 1999-06-29 | 2005-08-16 | Gene Logic Inc. | Biological data processing |
US6631211B1 (en) * | 1999-07-08 | 2003-10-07 | Perkinelmer Las, Inc. | Interactive system for analyzing scatter plots |
US6470277B1 (en) | 1999-07-30 | 2002-10-22 | Agy Therapeutics, Inc. | Techniques for facilitating identification of candidate genes |
US7062076B1 (en) | 1999-08-27 | 2006-06-13 | Iris Biotechnologies, Inc. | Artificial intelligence system for genetic analysis |
WO2001016860A2 (en) * | 1999-08-27 | 2001-03-08 | Iris Bio Technologies, Inc. | Artificial intelligence system for genetic analysis |
EP1212599A2 (en) | 1999-08-30 | 2002-06-12 | Illumina, Inc. | Methods for improving signal detection from an array |
US7099502B2 (en) * | 1999-10-12 | 2006-08-29 | Biodiscovery, Inc. | System and method for automatically processing microarrays |
KR20020064298A (en) | 1999-10-13 | 2002-08-07 | 시쿼넘, 인코포레이티드 | Methods for generating databases and databases for identifying polymorphic genetic markers |
WO2001031333A1 (en) * | 1999-10-26 | 2001-05-03 | Genometrix Genomics Incorporated | Process for requesting biological experiments and for the delivery of experimental information |
CA2392753A1 (en) | 1999-12-10 | 2001-06-14 | Invitrogen Corporation | Use of multiple recombination sites with unique specificity in recombinational cloning |
CA2398382A1 (en) * | 2000-01-25 | 2001-08-02 | Affymetrix, Inc. | Method, system and computer software for providing a genomic web portal |
US20030097222A1 (en) * | 2000-01-25 | 2003-05-22 | Craford David M. | Method, system, and computer software for providing a genomic web portal |
EP1252598A2 (en) | 2000-01-25 | 2002-10-30 | Cellomics, Inc. | Method and system for automated inference of physico-chemical interaction knowledge |
US20050214825A1 (en) * | 2000-02-07 | 2005-09-29 | John Stuelpnagel | Multiplex sample analysis on universal arrays |
US7955794B2 (en) | 2000-09-21 | 2011-06-07 | Illumina, Inc. | Multiplex nucleic acid reactions |
US7582420B2 (en) | 2001-07-12 | 2009-09-01 | Illumina, Inc. | Multiplex nucleic acid reactions |
US8076063B2 (en) * | 2000-02-07 | 2011-12-13 | Illumina, Inc. | Multiplexed methylation detection methods |
US6770441B2 (en) * | 2000-02-10 | 2004-08-03 | Illumina, Inc. | Array compositions and methods of making same |
DE10015816A1 (en) * | 2000-03-30 | 2001-10-18 | Infineon Technologies Ag | Biosensor chip |
WO2001074405A1 (en) * | 2000-03-31 | 2001-10-11 | Gene Logic, Inc. | Gene expression profiles in esophageal tissue |
US20030040870A1 (en) * | 2000-04-18 | 2003-02-27 | Brooke Anderson | Automated system and process for custom-designed biological array design and analysis |
US20030171876A1 (en) * | 2002-03-05 | 2003-09-11 | Victor Markowitz | System and method for managing gene expression data |
US7020561B1 (en) | 2000-05-23 | 2006-03-28 | Gene Logic, Inc. | Methods and systems for efficient comparison, identification, processing, and importing of gene expression data |
US20030009295A1 (en) * | 2001-03-14 | 2003-01-09 | Victor Markowitz | System and method for retrieving and using gene expression data from multiple sources |
US6772160B2 (en) * | 2000-06-08 | 2004-08-03 | Ingenuity Systems, Inc. | Techniques for facilitating information acquisition and storage |
US6741986B2 (en) * | 2000-12-08 | 2004-05-25 | Ingenuity Systems, Inc. | Method and system for performing information extraction and quality control for a knowledgebase |
US7577683B2 (en) * | 2000-06-08 | 2009-08-18 | Ingenuity Systems, Inc. | Methods for the construction and maintenance of a knowledge representation system |
US20020152196A1 (en) * | 2000-07-07 | 2002-10-17 | Westbrook Carol A. | cDNA database and biochip for analysis of hematopoietic tissue |
JP3517644B2 (en) * | 2000-07-13 | 2004-04-12 | 孝 五條堀 | Method and system for displaying expression phenomenon of living organisms and program |
US20020059326A1 (en) * | 2000-07-25 | 2002-05-16 | Derek Bernhart | System, method, and computer program product for management of biological experiment information |
NL1016034C2 (en) | 2000-08-03 | 2002-02-08 | Tno | Method and system for identifying and quantifying chemical components of a mixture of materials to be investigated. |
US7198924B2 (en) | 2000-12-11 | 2007-04-03 | Invitrogen Corporation | Methods and compositions for synthesis of nucleic acid molecules using multiple recognition sites |
CA2388064A1 (en) | 2000-08-22 | 2002-05-02 | Affymetrix, Inc. | System method, and computer software product for controlling biological microarray scanner |
US6789040B2 (en) | 2000-08-22 | 2004-09-07 | Affymetrix, Inc. | System, method, and computer software product for specifying a scanning area of a substrate |
WO2002017190A1 (en) * | 2000-08-22 | 2002-02-28 | Varro Technologies, Inc. | Method and system for sharing biological information |
GB0021286D0 (en) * | 2000-08-30 | 2000-10-18 | Gemini Genomics Ab | Identification of drug metabolic capacity |
US6539102B1 (en) * | 2000-09-01 | 2003-03-25 | Large Scale Proteomics | Reference database |
US6813615B1 (en) | 2000-09-06 | 2004-11-02 | Cellomics, Inc. | Method and system for interpreting and validating experimental data with automated reasoning |
WO2002023308A2 (en) * | 2000-09-12 | 2002-03-21 | Viaken Systems, Inc. | Techniques for providing and obtaining research and development information technology on remote computing resources |
AU2001210599A1 (en) * | 2000-10-23 | 2002-05-06 | Diachip Limited | High precision and intellectual biochip arrayer having function of respotting |
US7117095B2 (en) * | 2000-11-21 | 2006-10-03 | Affymetrix, Inc. | Methods for selecting nucleic acid probes |
US8255791B2 (en) | 2000-11-29 | 2012-08-28 | Dov Koren | Collaborative, flexible, interactive real-time displays |
US7218764B2 (en) | 2000-12-04 | 2007-05-15 | Cytokinetics, Inc. | Ploidy classification method |
US6706867B1 (en) * | 2000-12-19 | 2004-03-16 | The United States Of America As Represented By The Department Of Health And Human Services | DNA array sequence selection |
US20020143768A1 (en) * | 2000-12-21 | 2002-10-03 | Berno Anthony Berno | Probe array data storage and retrieval |
US20020192671A1 (en) * | 2001-01-23 | 2002-12-19 | Castle Arthur L. | Method and system for predicting the biological activity, including toxicology and toxicity, of substances |
US20020183936A1 (en) * | 2001-01-24 | 2002-12-05 | Affymetrix, Inc. | Method, system, and computer software for providing a genomic web portal |
US20070015148A1 (en) * | 2001-01-25 | 2007-01-18 | Orr Michael S | Gene expression profiles in breast tissue |
US6949638B2 (en) * | 2001-01-29 | 2005-09-27 | Affymetrix, Inc. | Photolithographic method and system for efficient mask usage in manufacturing DNA arrays |
US20030017455A1 (en) * | 2001-01-29 | 2003-01-23 | Webb Peter G. | Chemical array fabrication with identity map |
US7315784B2 (en) | 2001-02-15 | 2008-01-01 | Siemens Aktiengesellschaft | Network for evaluating data obtained in a biochip measurement device |
US7016787B2 (en) | 2001-02-20 | 2006-03-21 | Cytokinetics, Inc. | Characterizing biological stimuli by response curves |
US6956961B2 (en) | 2001-02-20 | 2005-10-18 | Cytokinetics, Inc. | Extracting shape information contained in cell images |
JP3867046B2 (en) * | 2001-02-23 | 2007-01-10 | 株式会社日立製作所 | Analysis system |
US7110885B2 (en) | 2001-03-08 | 2006-09-19 | Dnaprint Genomics, Inc. | Efficient methods and apparatus for high-throughput processing of gene sequence data |
US20020168651A1 (en) * | 2001-03-12 | 2002-11-14 | Affymetrix, Inc. | Method and computer software product for determining orientation of sequence clusters |
US6804679B2 (en) | 2001-03-12 | 2004-10-12 | Affymetrix, Inc. | System, method, and user interfaces for managing genomic data |
JP2002269114A (en) * | 2001-03-14 | 2002-09-20 | Kousaku Ookubo | Knowledge database, and method for constructing knowledge database |
US20040033502A1 (en) * | 2001-03-28 | 2004-02-19 | Amanda Williams | Gene expression profiles in esophageal tissue |
JP2002297617A (en) * | 2001-03-29 | 2002-10-11 | Hitachi Software Eng Co Ltd | Method for displaying correlation between biopolymer and probe |
US20030087259A1 (en) * | 2001-04-18 | 2003-05-08 | Clancy Brian M. | Methods and compositions for regulating bone and cartilage formation |
US7251568B2 (en) | 2001-04-18 | 2007-07-31 | Wyeth | Methods and compositions for regulating bone and cartilage formation |
US7155453B2 (en) * | 2002-05-22 | 2006-12-26 | Agilent Technologies, Inc. | Biotechnology information naming system |
US20070015146A1 (en) * | 2001-05-22 | 2007-01-18 | Gene Logic, Inc. | Molecular nephrotoxicology modeling |
AU2002339829A1 (en) * | 2001-05-22 | 2002-12-03 | Gene Logic, Inc. | Molecular toxicology modeling |
US20030009294A1 (en) * | 2001-06-07 | 2003-01-09 | Jill Cheng | Integrated system for gene expression analysis |
WO2002103030A2 (en) * | 2001-06-14 | 2002-12-27 | Rigel Pharmaceuticals, Inc. | Multidimensional biodata integration and relationship inference |
AU2002315413A1 (en) * | 2001-06-22 | 2003-01-08 | Gene Logic, Inc. | Platform for management and mining of genomic data |
KR100794698B1 (en) * | 2001-06-28 | 2008-01-14 | (주)바이오니아 | Quality control method for biological chip |
EP1417629B1 (en) * | 2001-07-06 | 2008-02-20 | Lipomics Technologies, Inc. | Generating, viewing, interpreting, and utilizing a quantitative database of metabolites |
US7447594B2 (en) * | 2001-07-10 | 2008-11-04 | Ocimum Biosolutions, Inc. | Molecular cardiotoxicology modeling |
WO2003068908A2 (en) * | 2001-07-10 | 2003-08-21 | Gene Logic, Inc. | Cardiotoxin molecular toxicology modeling |
EP1420250A4 (en) * | 2001-07-31 | 2006-03-22 | Olympus Corp | Gene inspection apparatus and target nucleic acid extraction method using the same |
US7251642B1 (en) | 2001-08-06 | 2007-07-31 | Gene Logic Inc. | Analysis engine and work space manager for use with gene expression data |
JP3977038B2 (en) * | 2001-08-27 | 2007-09-19 | 株式会社半導体エネルギー研究所 | Laser irradiation apparatus and laser irradiation method |
US20030096248A1 (en) * | 2001-09-04 | 2003-05-22 | Vitivity, Inc. | Diagnosis and treatment of vascular disease |
DE60233743D1 (en) * | 2001-09-24 | 2009-10-29 | Lipomics Technologies Inc | METHOD OF USE OF QUANTITATIVE LIPID METABOLONDATE |
JP2003099624A (en) * | 2001-09-25 | 2003-04-04 | Toyo Kohan Co Ltd | Dna providing system |
AU2002334772A1 (en) * | 2001-10-12 | 2003-04-28 | Duke University | Methods for image analysis of high-density synthetic dna microarrays |
WO2003034064A2 (en) * | 2001-10-12 | 2003-04-24 | Duke University | Image analysis of high-density synthetic dna microarrays |
US20030224383A1 (en) * | 2002-04-23 | 2003-12-04 | Mike West | Atherosclerotic phenotype determinative genes and methods for using the same |
US20060141493A1 (en) * | 2001-11-09 | 2006-06-29 | Duke University Office Of Science And Technology | Atherosclerotic phenotype determinative genes and methods for using the same |
US20040255136A1 (en) * | 2001-11-12 | 2004-12-16 | Alexey Borisovich Fadyushin | Method and device for protecting information against unauthorised use |
KR100474840B1 (en) * | 2001-11-15 | 2005-03-08 | 삼성전자주식회사 | Method and system with directory for providing a genotyping microarray probe design |
CN1281324C (en) * | 2001-12-19 | 2006-10-25 | 阿菲梅特里克斯公司 | Manufacturing process for array plate assembly |
US20030176929A1 (en) * | 2002-01-28 | 2003-09-18 | Steve Gardner | User interface for a bioinformatics system |
US7225183B2 (en) * | 2002-01-28 | 2007-05-29 | Ipxl, Inc. | Ontology-based information management system and method |
US9418204B2 (en) * | 2002-01-28 | 2016-08-16 | Samsung Electronics Co., Ltd | Bioinformatics system architecture with data and process integration |
US8793073B2 (en) | 2002-02-04 | 2014-07-29 | Ingenuity Systems, Inc. | Drug discovery methods |
JP4594622B2 (en) * | 2002-02-04 | 2010-12-08 | インジェヌイティ システムズ インコーポレイテッド | Drug discovery method |
US20030162183A1 (en) * | 2002-02-27 | 2003-08-28 | Robert Kincaid | Array design system and method |
US8722331B2 (en) * | 2002-03-28 | 2014-05-13 | University Of Toledo | Method for selecting a treatment for non-small cell lung cancer using gene expression profiles |
EP1502098B1 (en) * | 2002-05-03 | 2008-09-17 | ViaLogy, LLC | Method for characterizing microarray output data |
US20030220844A1 (en) * | 2002-05-24 | 2003-11-27 | Marnellos Georgios E. | Method and system for purchasing genetic data |
US6763308B2 (en) | 2002-05-28 | 2004-07-13 | Sas Institute Inc. | Statistical outlier detection for gene expression microarray data |
US20030229848A1 (en) * | 2002-06-05 | 2003-12-11 | Udo Arend | Table filtering in a computer user interface |
US7504215B2 (en) | 2002-07-12 | 2009-03-17 | Affymetrix, Inc. | Nucleic acid labeling methods |
US20050112689A1 (en) * | 2003-04-04 | 2005-05-26 | Robert Kincaid | Systems and methods for statistically analyzing apparent CGH data anomalies and plotting same |
US20050216459A1 (en) * | 2002-08-08 | 2005-09-29 | Aditya Vailaya | Methods and systems, for ontological integration of disparate biological data |
US7941542B2 (en) * | 2002-09-06 | 2011-05-10 | Oracle International Corporation | Methods and apparatus for maintaining application execution over an intermittent network connection |
US7512496B2 (en) * | 2002-09-25 | 2009-03-31 | Soheil Shams | Apparatus, method, and computer program product for determining confidence measures and combined confidence measures for assessing the quality of microarrays |
US7482170B2 (en) * | 2002-09-30 | 2009-01-27 | Roche Nimblegen, Inc. | Parallel loading of arrays |
US20040101894A1 (en) * | 2002-10-01 | 2004-05-27 | Thomas Albert | Microarrays having multiple oligonucleotides in single array features |
US20040259105A1 (en) * | 2002-10-03 | 2004-12-23 | Jian-Bing Fan | Multiplex nucleic acid analysis using archived or fixed samples |
US9453251B2 (en) | 2002-10-08 | 2016-09-27 | Pfenex Inc. | Expression of mammalian proteins in Pseudomonas fluorescens |
JP2006514620A (en) * | 2002-11-06 | 2006-05-11 | マウント シナイ スクール オブ メディシン | Treatment of amyotrophic lateral sclerosis with Nimesulide |
EP2112229A3 (en) | 2002-11-25 | 2009-12-02 | Sequenom, Inc. | Methods for identifying risk of breast cancer and treatments thereof |
WO2004050840A2 (en) * | 2002-11-27 | 2004-06-17 | The Government Of The United States As Represented By The Secretary Of The Department Of Health And Human Services, Centers For Disease Control And Prevention | Integration of gene expression data and non-gene data |
JP2004191160A (en) * | 2002-12-11 | 2004-07-08 | Yokogawa Electric Corp | Biochip measuring method and device |
KR100506089B1 (en) * | 2003-02-05 | 2005-08-05 | 삼성전자주식회사 | System for designing probe array using heterogeneneous genomic information and method of the same |
US7825929B2 (en) * | 2003-04-04 | 2010-11-02 | Agilent Technologies, Inc. | Systems, tools and methods for focus and context viewing of large collections of graphs |
US7750908B2 (en) * | 2003-04-04 | 2010-07-06 | Agilent Technologies, Inc. | Focus plus context viewing and manipulation of large collections of graphs |
WO2004090100A2 (en) * | 2003-04-04 | 2004-10-21 | Agilent Technologies, Inc. | Visualizing expression data on chromosomal graphic schemes |
US7897744B2 (en) * | 2003-04-28 | 2011-03-01 | The Public Health Agency Of Canada | SARS virus nucleotide and amino acid sequences and uses thereof |
US20040249791A1 (en) * | 2003-06-03 | 2004-12-09 | Waters Michael D. | Method and system for developing and querying a sequence driven contextual knowledge base |
EP3305919A1 (en) | 2003-06-10 | 2018-04-11 | The Trustees of Boston University | Detection methods for disorders of the lung |
US20050064462A1 (en) * | 2003-06-17 | 2005-03-24 | Bernd Stein | Methods, compositions, and kits for predicting the effect of compounds on hot flash symptoms |
US20050014217A1 (en) | 2003-07-18 | 2005-01-20 | Cytokinetics, Inc. | Predicting hepatotoxicity using cell based assays |
GB2423988A (en) | 2003-07-18 | 2006-09-13 | Cytokinetics Inc | Characterizing biological stimuli by response curves |
US7235353B2 (en) | 2003-07-18 | 2007-06-26 | Cytokinetics, Inc. | Predicting hepatotoxicity using cell based assays |
US7353116B2 (en) * | 2003-07-31 | 2008-04-01 | Agilent Technologies, Inc. | Chemical array with test dependent signal reading or processing |
US20050026154A1 (en) * | 2003-07-31 | 2005-02-03 | Laurakay Bruhn | Masking chemical arrays |
US20050026306A1 (en) * | 2003-07-31 | 2005-02-03 | Robert Kincaid | Method and system for generating virtual-microarrays |
US7475087B1 (en) | 2003-08-29 | 2009-01-06 | The United States Of America As Represented By The Secretary Of Agriculture | Computer display tool for visualizing relationships between and among data |
US20050049796A1 (en) * | 2003-09-03 | 2005-03-03 | Webb Peter G. | Methods for encoding non-biological information on microarrays |
US20050048506A1 (en) * | 2003-09-03 | 2005-03-03 | Fredrick Joseph P. | Methods for encoding non-biological information on microarrays |
WO2005024068A2 (en) | 2003-09-05 | 2005-03-17 | Sequenom, Inc. | Allele-specific sequence variation analysis |
EP1697534B1 (en) | 2003-12-01 | 2010-06-02 | Life Technologies Corporation | Nucleic acid molecules containing recombination sites and methods of using the same |
CA2497324A1 (en) | 2004-02-17 | 2005-08-17 | Affymetrix, Inc. | Methods for fragmenting and labelling dna |
US7660709B2 (en) * | 2004-03-18 | 2010-02-09 | Van Andel Research Institute | Bioinformatics research and analysis system and methods associated therewith |
CA2561381C (en) | 2004-03-26 | 2015-05-12 | Sequenom, Inc. | Base specific cleavage of methylation-specific amplification products in combination with mass analysis |
KR100632973B1 (en) * | 2004-04-29 | 2006-10-12 | 주식회사 메딘텔 | Circular matching system and method for circular matching |
US20060003335A1 (en) * | 2004-06-30 | 2006-01-05 | Crispino John D | Methods for diagnosing acute megakaryoblastic leukemia |
US7323318B2 (en) | 2004-07-15 | 2008-01-29 | Cytokinetics, Inc. | Assay for distinguishing live and dead cells |
WO2006022628A1 (en) * | 2004-07-22 | 2006-03-02 | Sequenom, Inc. | Methods for identifying risk of type ii diabetes and treatments thereof |
EP1774017B1 (en) | 2004-07-26 | 2013-05-15 | Pfenex Inc. | Process for improved protein expression by strain engineering |
EP1792263A2 (en) * | 2004-09-02 | 2007-06-06 | Vialogy Corporation | Detecting events of interest using quantum resonance interferometry |
US20060073506A1 (en) | 2004-09-17 | 2006-04-06 | Affymetrix, Inc. | Methods for identifying biological samples |
US20060073511A1 (en) | 2004-10-05 | 2006-04-06 | Affymetrix, Inc. | Methods for amplifying and analyzing nucleic acids |
US20060083609A1 (en) * | 2004-10-14 | 2006-04-20 | Augspurger Murray D | Fluid cooled marine turbine housing |
US7682782B2 (en) | 2004-10-29 | 2010-03-23 | Affymetrix, Inc. | System, method, and product for multiple wavelength detection using single source excitation |
JP2006126204A (en) | 2004-10-29 | 2006-05-18 | Affymetrix Inc | Automated method for manufacturing polymer array |
WO2006088445A2 (en) * | 2005-02-11 | 2006-08-24 | Southern Illinois University | Metabolic primers for the detection of (per)chlorate-reducing bacteria and methods of use thereof |
US20060211004A1 (en) | 2005-02-15 | 2006-09-21 | Ilsley Diane D | Methods and compositions for determining non-specific cytotoxicity of a transfection agent |
WO2006089045A2 (en) | 2005-02-18 | 2006-08-24 | Monogram Biosciences, Inc. | Methods and compositions for determining hypersusceptibility of hiv-1 to non-nucleoside reverse transcriptase inhibitors |
US20090203547A1 (en) * | 2005-02-18 | 2009-08-13 | Albert Banes | Gene and Cognate Protein Profiles and Methods to Determine Connective Tissue Markers in Normal and Pathologic Conditions |
EP2522757A1 (en) | 2005-02-18 | 2012-11-14 | Monogram BioSciences, Inc. | Methods and compositions for determining anti-HIV drug susceptibility and replication capacity of HIV |
US20070118295A1 (en) * | 2005-03-02 | 2007-05-24 | Al-Murrani Samer Waleed Khedhe | Methods and Systems for Designing Animal Food Compositions |
US20060286569A1 (en) * | 2005-03-10 | 2006-12-21 | Bar-Or Yuval A | Method, apparatus, and system for authentication using labels containing nucleotide sequences |
JP2008537875A (en) | 2005-03-14 | 2008-10-02 | ザ ボード オブ トラスティーズ オブ ザ リーランド スタンフォード ジュニア ユニバーシティ | Methods and compositions for assessing graft survival in solid organ transplant recipients |
AU2006236621B2 (en) | 2005-04-14 | 2011-10-06 | The Trustees Of Boston University | Diagnostic for lung disorders using class prediction |
US20070026426A1 (en) * | 2005-04-26 | 2007-02-01 | Applera Corporation | System for genetic surveillance and analysis |
EP2458019A3 (en) | 2005-05-27 | 2012-07-25 | Monogram BioSciences, Inc. | Methods and compositions for determining resistance of HIV-1 to protease inhibitors |
US9506121B2 (en) | 2005-06-06 | 2016-11-29 | Monogram Biosciences, Inc. | Methods for determining resistance or susceptibility to HIV entry inhibitors |
US8071284B2 (en) * | 2005-06-06 | 2011-12-06 | Monogram Biosciences, Inc. | Methods and compositions for determining altered susceptibility of HIV-1 to anti-HIV drugs |
US8159959B2 (en) * | 2005-11-07 | 2012-04-17 | Vudu, Inc. | Graphic user interface for playing video data |
US7634363B2 (en) * | 2005-12-07 | 2009-12-15 | Affymetrix, Inc. | Methods for high throughput genotyping |
EP1969506A1 (en) * | 2005-12-13 | 2008-09-17 | Erasmus University Medical Center Rotterdam | Genetic brain tumor markers |
US7646450B2 (en) * | 2005-12-29 | 2010-01-12 | Lg Display Co., Ltd. | Light emitting diode array, method of manufacturing the same, backlight assembly having the same, and LCD having the same |
US20070198729A1 (en) * | 2006-02-07 | 2007-08-23 | Yechuri Sitaramarao S | SQL network gadget |
JP2009529329A (en) | 2006-03-09 | 2009-08-20 | トラスティーズ オブ ボストン ユニバーシティ | Methods for diagnosis and prognosis for lung diseases using gene expression profiles of nasal epithelial cells |
AU2007268369A1 (en) * | 2006-05-30 | 2007-12-06 | Synergenz Bioscience Limited | Methods and compositions for assessment of pulmonary function and disorders |
EP2602321B1 (en) | 2006-05-31 | 2017-08-23 | Sequenom, Inc. | Methods and compositions for the extraction and amplification of nucleic acid from a sample |
EP2236623A1 (en) | 2006-06-05 | 2010-10-06 | Cancer Care Ontario | Assessment of risk for colorectal cancer |
US20080033985A1 (en) * | 2006-06-09 | 2008-02-07 | Gulfstream Bioinformatics Corporation | Biomedical Information Modeling |
US7700756B2 (en) * | 2006-07-27 | 2010-04-20 | Southern Illinois University | Metabolic primers for the detection of perchlorate-reducing bacteria and methods of use thereof |
WO2008014495A2 (en) * | 2006-07-28 | 2008-01-31 | Ingenuity Systems, Inc. | Genomics based targeted advertising |
EP2057465A4 (en) | 2006-08-09 | 2010-04-21 | Homestead Clinical Corp | Organ-specific proteins and methods of their use |
JP2010506588A (en) * | 2006-10-17 | 2010-03-04 | シナージェンズ バイオサイエンス リミティド | Methods and compositions for assessment of lung function and disorders |
US9845494B2 (en) | 2006-10-18 | 2017-12-19 | Affymetrix, Inc. | Enzymatic methods for genotyping on arrays |
WO2008066655A2 (en) | 2006-11-02 | 2008-06-05 | Yale University | Assessment of oocyte competence |
NZ551157A (en) * | 2006-11-08 | 2008-06-30 | Rebecca Lee Roberts | Method of identifying individuals at risk of thiopurine drug resistance and intolerance - GMPS |
US8293684B2 (en) * | 2006-11-29 | 2012-10-23 | Exiqon | Locked nucleic acid reagents for labelling nucleic acids |
US7902345B2 (en) | 2006-12-05 | 2011-03-08 | Sequenom, Inc. | Detection and quantification of biomolecules using mass spectrometry |
CA2673092A1 (en) * | 2006-12-19 | 2008-06-26 | Synergenz Bioscience Limited | Methods and compositions for the assessment of cardiovascular function and disorders |
EA017985B1 (en) | 2007-01-30 | 2013-04-30 | Фармасайкликс, Инк. | Methods for determining cancer resistance to histone deacetylase inhibitors |
CA2677517C (en) | 2007-02-08 | 2015-11-03 | Sequenom, Inc. | Nucleic acid-based tests for rhd typing, gender determination and nucleic acid quantification |
US9581595B2 (en) | 2007-02-26 | 2017-02-28 | Laboratory Corporation Of America Holdings | Compositions and methods for determining whether a subject would benefit from co-receptor inhibitor therapy |
EP2079839A4 (en) | 2007-03-05 | 2009-11-18 | Cancer Care Ontario | Assessment of risk for colorectal cancer |
AU2008230813B2 (en) | 2007-03-26 | 2014-01-30 | Sequenom, Inc. | Restriction endonuclease enhanced polymorphic sequence detection |
CA2685326A1 (en) | 2007-04-27 | 2008-11-06 | Dow Global Technologies Inc. | Method for rapidly screening microbial hosts to identify certain strains with improved yield and/or quality in the expression of heterologous proteins |
US9580719B2 (en) * | 2007-04-27 | 2017-02-28 | Pfenex, Inc. | Method for rapidly screening microbial hosts to identify certain strains with improved yield and/or quality in the expression of heterologous proteins |
US8200440B2 (en) | 2007-05-18 | 2012-06-12 | Affymetrix, Inc. | System, method, and computer software product for genotype determination using probe array data |
ATE549419T1 (en) | 2007-08-29 | 2012-03-15 | Sequenom Inc | METHODS AND COMPOSITIONS FOR UNIVERSAL SIZE-SPECIFIC POLYMERASE CHAIN REACTION |
WO2009039190A1 (en) * | 2007-09-17 | 2009-03-26 | Gene Express, Inc. | Cancer risk biomarker |
CN101978073B (en) | 2008-01-25 | 2014-12-10 | 诊疗实验室 | Methods and compositions for the assessment of drug response |
CA2710807C (en) | 2008-03-11 | 2015-09-08 | Kyeong Man Hong | Method for measuring chromosome, gene or specific nucleotide sequence copy numbers using snp array |
EP2271772B1 (en) | 2008-03-11 | 2014-07-16 | Sequenom, Inc. | Nucleic acid-based tests for prenatal gender determination |
EP2276858A4 (en) | 2008-03-26 | 2011-10-05 | Sequenom Inc | Restriction endonuclease enhanced polymorphic sequence detection |
WO2009149297A1 (en) * | 2008-06-04 | 2009-12-10 | The Arizona Board Regents, On Behalf Of The University Of Arizona | Diffuse large b-cell lymphoma markers and uses therefor |
US20110201519A1 (en) | 2008-08-18 | 2011-08-18 | Sarwal Minnie M | Methods and Compositions for Determining a Graft Tolerant Phenotype in a Subject |
US8962247B2 (en) * | 2008-09-16 | 2015-02-24 | Sequenom, Inc. | Processes and compositions for methylation-based enrichment of fetal nucleic acid from a maternal sample useful for non invasive prenatal diagnoses |
US8476013B2 (en) | 2008-09-16 | 2013-07-02 | Sequenom, Inc. | Processes and compositions for methylation-based acid enrichment of fetal nucleic acid from a maternal sample useful for non-invasive prenatal diagnoses |
WO2010054189A1 (en) | 2008-11-06 | 2010-05-14 | University Of Miami | Role of soluble upar in the pathogenesis of proteinuric kidney disease |
US8039794B2 (en) | 2008-12-16 | 2011-10-18 | Quest Diagnostics Investments Incorporated | Mass spectrometry assay for thiopurine-S-methyl transferase activity and products generated thereby |
KR101025848B1 (en) * | 2008-12-30 | 2011-03-30 | 삼성전자주식회사 | The method and apparatus for integrating and managing personal genome |
AU2010203542B2 (en) | 2009-01-07 | 2017-02-23 | Myriad Genetics, Inc | Cancer biomarkers |
EP2389453B1 (en) * | 2009-01-20 | 2015-11-25 | The Board of Trustees of The Leland Stanford Junior University | Single cell gene expression for diagnosis, prognosis and identification of drug targets |
CN102439454B (en) | 2009-02-11 | 2015-05-13 | 卡里斯Mpi公司 | Molecular profiling of tumors |
US8832581B2 (en) * | 2009-03-05 | 2014-09-09 | Ming Zhang | Gene expression browser for web-based search and visualization of characteristics of gene expression |
WO2010107946A2 (en) * | 2009-03-18 | 2010-09-23 | Sequenom, Inc. | Use of thermostable endonucleases for generating reporter molecules |
EP3211095B1 (en) | 2009-04-03 | 2019-01-02 | Sequenom, Inc. | Nucleic acid preparation compositions and methods |
AU2010315400B2 (en) | 2009-10-27 | 2016-07-21 | Caris Mpi, Inc. | Molecular profiling for personalized medicine |
US20110201008A1 (en) * | 2009-12-01 | 2011-08-18 | University Of Miami | Assays, methods and kits for measuring response to therapy and predicting clinical outcome in patients with b-cell lymphoma |
US9290813B2 (en) | 2009-12-02 | 2016-03-22 | The Board Of Trustees Of The Leland Stanford Junior University | Biomarkers for determining an allograft tolerant phenotype |
US8501122B2 (en) | 2009-12-08 | 2013-08-06 | Affymetrix, Inc. | Manufacturing and processing polymer arrays |
US8835358B2 (en) | 2009-12-15 | 2014-09-16 | Cellular Research, Inc. | Digital counting of individual molecules by stochastic attachment of diverse labels |
CA2785020C (en) | 2009-12-22 | 2020-08-25 | Sequenom, Inc. | Processes and kits for identifying aneuploidy |
US9798855B2 (en) | 2010-01-07 | 2017-10-24 | Affymetrix, Inc. | Differential filtering of genetic data |
US9216189B2 (en) | 2010-01-25 | 2015-12-22 | Icahn School Of Medicine At Mount Sinai | Antibodies which bind type I cannabinoid receptor/angiotensis II receptor heteromers |
EP2803735B1 (en) | 2010-03-25 | 2020-03-25 | The Board of Trustees of the Leland Stanford Junior University | Protein and gene biomarkers for rejection of organ transplants |
WO2011139714A2 (en) | 2010-04-26 | 2011-11-10 | Atyr Pharma, Inc. | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of cysteinyl-trna synthetase |
CN103096911B (en) | 2010-04-27 | 2018-05-29 | Atyr 医药公司 | Treatment relevant with the protein fragments of Isoleucyl-tRNA synthetase, diagnosis and the innovation of antibody compositions are found |
JP6008837B2 (en) | 2010-04-28 | 2016-10-19 | エータイアー ファーマ, インコーポレイテッド | Innovative discovery of therapeutic, diagnostic and antibody compositions related to protein fragments of alanyl tRNA synthetase |
EP2563912B1 (en) | 2010-04-29 | 2018-09-05 | aTyr Pharma, Inc. | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of asparaginyl trna synthetases |
CN103118693B (en) | 2010-04-29 | 2017-05-03 | Atyr 医药公司 | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of valyl tRNA synthetases |
CA2797978C (en) | 2010-05-03 | 2019-12-03 | Atyr Pharma, Inc. | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of methionyl-trna synthetases |
US9034321B2 (en) | 2010-05-03 | 2015-05-19 | Atyr Pharma, Inc. | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of phenylalanyl-alpha-tRNA synthetases |
WO2011139986A2 (en) | 2010-05-03 | 2011-11-10 | Atyr Pharma, Inc. | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of arginyl-trna synthetases |
CA2798139C (en) | 2010-05-04 | 2019-09-24 | Atyr Pharma, Inc. | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of p38 multi-trna synthetase complex |
JP6396656B2 (en) | 2010-05-14 | 2018-09-26 | エータイアー ファーマ, インコーポレイテッド | Innovative discovery of therapeutic, diagnostic and antibody compositions related to protein fragments of phenylalanyl βtRNA synthetase |
EP2575856B1 (en) | 2010-05-27 | 2017-08-16 | aTyr Pharma, Inc. | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of glutaminyl-trna synthetases |
US8962560B2 (en) | 2010-06-01 | 2015-02-24 | Atyr Pharma Inc. | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of Lysyl-tRNA synthetases |
US20120053253A1 (en) | 2010-07-07 | 2012-03-01 | Myriad Genetics, Incorporated | Gene signatures for cancer prognosis |
JP6116479B2 (en) | 2010-07-12 | 2017-04-19 | エータイアー ファーマ, インコーポレイテッド | Innovative discovery of therapeutic, diagnostic and antibody compositions related to protein fragments of glycyl-tRNA synthetase |
CN103108650A (en) | 2010-08-25 | 2013-05-15 | Atyr医药公司 | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of tyrosyl-trna synthetases |
WO2012033961A2 (en) * | 2010-09-09 | 2012-03-15 | Abbott Laboratories | Systems and methods for displaying molecular probes and chromosomes |
EP2619574B1 (en) | 2010-09-15 | 2020-10-28 | Almac Diagnostic Services Limited | Molecular test for predicting responsiveness to dna-damage therapeutic agents in individuals having cancer |
CN103717750B (en) | 2011-04-29 | 2017-03-08 | 塞昆纳姆股份有限公司 | The quantitation of minority nucleic acid substances |
EP2807271B1 (en) | 2012-01-24 | 2018-08-22 | CD Diagnostics, Inc. | System for detecting infection in synovial fluid |
JP6170077B2 (en) | 2012-02-16 | 2017-07-26 | エータイアー ファーマ, インコーポレイテッド | Histidyl tRNA synthetase for treating autoimmune and inflammatory diseases |
EP2820174B1 (en) | 2012-02-27 | 2019-12-25 | The University of North Carolina at Chapel Hill | Methods and uses for molecular tags |
CA2865575C (en) | 2012-02-27 | 2024-01-16 | Cellular Research, Inc. | Compositions and kits for molecular counting |
EP3401399B1 (en) | 2012-03-02 | 2020-04-22 | Sequenom, Inc. | Methods and processes for non-invasive assessment of genetic variations |
US9920361B2 (en) | 2012-05-21 | 2018-03-20 | Sequenom, Inc. | Methods and compositions for analyzing nucleic acid |
AU2013290102B2 (en) | 2012-07-13 | 2018-11-15 | Sequenom, Inc. | Processes and compositions for methylation-based enrichment of fetal nucleic acid from a maternal sample useful for non-invasive prenatal diagnoses |
US8766754B2 (en) | 2012-07-18 | 2014-07-01 | The Regents Of The University Of California | Concave nanomagnets with widely tunable anisotropy |
EP4190918A1 (en) | 2012-11-16 | 2023-06-07 | Myriad Genetics, Inc. | Gene signatures for cancer prognosis |
US10260103B2 (en) | 2012-11-27 | 2019-04-16 | Pontificia Universidad Catolica De Chile | Compositions and methods for diagnosing thyroid tumors |
US9896728B2 (en) | 2013-01-29 | 2018-02-20 | Arcticrx Ltd. | Method for determining a therapeutic approach for the treatment of age-related macular degeneration (AMD) |
WO2014168711A1 (en) | 2013-03-13 | 2014-10-16 | Sequenom, Inc. | Primers for dna methylation analysis |
CA2906523A1 (en) | 2013-03-15 | 2014-09-25 | Myriad Genetics, Inc. | Genes and gene signatures for diagnosis and treatment of melanoma |
US10535420B2 (en) | 2013-03-15 | 2020-01-14 | Affymetrix, Inc. | Systems and methods for probe design to detect the presence of simple and complex indels |
KR101520615B1 (en) | 2013-03-20 | 2015-05-18 | 서울대학교산학협력단 | Markers for diagnosis of liver cancer |
US10390724B2 (en) | 2013-06-26 | 2019-08-27 | The Penn State Research Foundation | Three-dimensional bio-medical probe sensing and contacting structures with addressibility and tunability |
BR112016002845A2 (en) | 2013-08-12 | 2017-09-12 | Genentech Inc | compositions and methods for treating complement-associated conditions |
KR101527283B1 (en) | 2013-08-13 | 2015-06-10 | 서울대학교산학협력단 | Method for screening cancer marker based on de-glycosylation of glycoproteins and marker for HCC |
SG10201901076WA (en) | 2014-02-08 | 2019-03-28 | Genentech Inc | Methods of treating alzheimer's disease |
EP3117011B1 (en) | 2014-03-13 | 2020-05-06 | Sequenom, Inc. | Methods and processes for non-invasive assessment of genetic variations |
CA2946737A1 (en) | 2014-04-22 | 2015-10-29 | Envirologix, Inc. | Compositions and methods for enhancing and/or predicting dna amplification |
WO2015175692A1 (en) | 2014-05-13 | 2015-11-19 | Myriad Genetics, Inc. | Gene signatures for cancer prognosis |
EP4250299A3 (en) | 2014-07-02 | 2023-12-06 | Myriad myPath, LLC | Genes and gene signatures for diagnosis and treatment of melanoma |
US11053550B2 (en) | 2014-10-14 | 2021-07-06 | The University Of North Carolina At Chapel Hill | Gene-expression based subtyping of pancreatic ductal adenocarcinoma |
BR112017008082A2 (en) | 2014-10-20 | 2017-12-26 | Envirologix Inc | compositions and methods for detecting an rna virus |
BR112017007965A8 (en) | 2014-10-24 | 2022-11-08 | Koninklijke Philips Nv | COMPUTER-IMPLANTED METHOD FOR INFERRING THE ACTIVITY OF A TGF-B CELL SIGNALING PATHWAY IN AN INDIVIDUAL; APPARATUS FOR INFERRING THE ACTIVITY OF A TGF-B CELL SIGNALING PATHWAY IN AN INDIVIDUAL; NON-TRANSITORY STORAGE MEDIA; COMPUTER PROGRAM; KITS FOR MEASURING EXPRESSION LEVELS OF THREE OR MORE TGF-B CELL SIGNALING PATHWAY GENES IN A SAMPLE FROM AN INDIVIDUAL; TO INFER THE ACTIVITY OF A TGF-B CELL SIGNALING PATHWAY IN AN INDIVIDUAL; TO INFER THE ACTIVITY OF A TGF-B CELL SIGNALING PATHWAY IN AN INDIVIDUAL; AND USE OF THE KIT |
WO2016062892A1 (en) | 2014-10-24 | 2016-04-28 | Koninklijke Philips N.V. | Medical prognosis and prediction of treatment response using multiple cellular signaling pathway activities |
CN108064380A (en) | 2014-10-24 | 2018-05-22 | 皇家飞利浦有限公司 | Use the prediction of the medical prognosis and therapeutic response of various kinds of cell signal transduction path activity |
WO2016123029A1 (en) | 2015-01-30 | 2016-08-04 | Envirologix Inc. | Substrate molecule |
WO2016126253A1 (en) | 2015-02-05 | 2016-08-11 | The Penn State Research Foundation | Nano-pore arrays for bio-medical, environmental, and industrial sorting, filtering, monitoring, or dispensing |
EP3285224A4 (en) * | 2015-04-13 | 2018-10-10 | National Institute of Advanced Industrial Science and Technology | Experimental data recording device, computer program, experimental data, experimental data recording method, experimental data display device and experimental data display method |
GB201512869D0 (en) | 2015-07-21 | 2015-09-02 | Almac Diagnostics Ltd | Gene signature for minute therapies |
CN108136051A (en) | 2015-08-04 | 2018-06-08 | Cd诊断股份有限公司 | The method for detecting bad local organization reaction (ALTR) necrosis |
US10720227B2 (en) * | 2015-08-12 | 2020-07-21 | Samsung Electronics Co., Ltd. | Method and device for mutation prioritization for personalized therapy |
KR102618536B1 (en) * | 2015-08-12 | 2023-12-27 | 삼성전자주식회사 | Method and device for mutation prioritization for personalized therapy of one or more patients |
AU2016309659A1 (en) | 2015-08-14 | 2018-04-12 | Koninklijke Philips N.V. | Assessment of NFkB cellular signaling pathway activity using mathematical modelling of target gene expression |
IL296418A (en) | 2015-10-18 | 2022-11-01 | Affymetrix Inc | Multiallelic genotyping of single nucleotide polymorphisms and indels |
EP3377650A1 (en) | 2015-11-19 | 2018-09-26 | Susanne Wagner | Signatures for predicting cancer immune therapy response |
US9836444B2 (en) * | 2015-12-10 | 2017-12-05 | International Business Machines Corporation | Spread cell value visualization |
CA3010240A1 (en) | 2016-01-06 | 2017-07-13 | Alexander Gutin | Genes and gene signatures for diagnosis and treatment of melanoma |
JP6707181B2 (en) | 2016-04-20 | 2020-06-10 | エルディエックス・プログノスティクス・リミテッド・カンパニーLdx Prognostics Limited Co. | Kits or packages for identifying pregnant women at risk of preterm birth and use of such kits or packages |
WO2017193062A1 (en) | 2016-05-06 | 2017-11-09 | Myriad Genetics, Inc. | Gene signatures for renal cancer prognosis |
US20190346452A1 (en) | 2016-05-17 | 2019-11-14 | Ldx Prognostics Limited Co. | Methods and compositions for providing preeclampsia assessment |
CN106066948B (en) * | 2016-06-07 | 2018-09-28 | 北京大学 | A kind of gene expression amount shows method and device |
EP3520225A4 (en) * | 2016-09-27 | 2020-05-27 | BAE SYSTEMS Information and Electronic Systems Integration Inc. | Techniques for implementing a portable spectrum analyzer |
US20180165414A1 (en) * | 2016-12-14 | 2018-06-14 | FlowJo, LLC | Applied Computer Technology for Management, Synthesis, Visualization, and Exploration of Parameters in Large Multi-Parameter Data Sets |
KR102116178B1 (en) | 2017-05-10 | 2020-05-27 | 서울대학교산학협력단 | Biomarker for monitoring or detecting early onset of liver cancer from patient having high risk of liver cancer and its use |
JP7194119B2 (en) | 2017-05-25 | 2022-12-21 | フロージョー エルエルシー | Visualization, comparative analysis, and automatic difference detection for large multiparameter datasets |
EP3461915A1 (en) | 2017-10-02 | 2019-04-03 | Koninklijke Philips N.V. | Assessment of jak-stat1/2 cellular signaling pathway activity using mathematical modelling of target gene expression |
EP3502279A1 (en) | 2017-12-20 | 2019-06-26 | Koninklijke Philips N.V. | Assessment of mapk-ap 1 cellular signaling pathway activity using mathematical modelling of target gene expression |
EP3728321A1 (en) | 2017-12-22 | 2020-10-28 | F. Hoffmann-La Roche AG | Use of pilra binding agents for treatment of a disease |
WO2019246160A2 (en) | 2018-06-18 | 2019-12-26 | Igenomix, S.L. | Methods, compositions, and kits for assessing endometrial transformation |
US20210388449A1 (en) | 2018-10-09 | 2021-12-16 | Genecentric Therapeutics, Inc. | Detecting cancer cell of origin |
JP2022512080A (en) | 2018-11-30 | 2022-02-02 | カリス エムピーアイ インコーポレイテッド | Next Generation Molecular Profiling |
BR112021020779A8 (en) | 2019-04-17 | 2022-01-25 | Igenomix S L | Improved methods for early diagnosis of uterine leiomyomas and leiomyosarcomas |
EP4069865A4 (en) | 2019-12-02 | 2023-12-20 | Caris MPI, Inc. | Pan-cancer platinum response predictor |
WO2023178295A1 (en) | 2022-03-18 | 2023-09-21 | Ludwig Institute For Cancer Research Ltd | Methods and systems for analyzing chromatins |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5206137A (en) * | 1988-09-08 | 1993-04-27 | Lifecodes Corporation | Compositions and methods useful for genetic analysis |
WO1996023078A1 (en) * | 1995-01-27 | 1996-08-01 | Incyte Pharmaceuticals, Inc. | Computer system storing and analyzing microbiological data |
US5593839A (en) * | 1994-05-24 | 1997-01-14 | Affymetrix, Inc. | Computer-aided engineering system for design of sequence arrays and lithographic masks |
US5707806A (en) * | 1995-06-07 | 1998-01-13 | Genzyme Corporation | Direct sequence identification of mutations by cleavage- and ligation-associated mutation-specific sequencing |
Family Cites Families (67)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4683202A (en) | 1985-03-28 | 1987-07-28 | Cetus Corporation | Process for amplifying nucleic acid sequences |
NO870613L (en) | 1986-03-05 | 1987-09-07 | Molecular Diagnostics Inc | DETECTION OF MICROORGANISMS IN A SAMPLE CONTAINING NUCLEIC ACID. |
US4740611A (en) * | 1986-10-30 | 1988-04-26 | The Standard Oil Company | N,N'-disubstituted ureas |
US4705864A (en) * | 1986-11-10 | 1987-11-10 | The Standard Oil Company | Aryl oxime derivatives of hydantoins |
AU1042488A (en) | 1986-12-20 | 1988-07-15 | Kukita, Takeshi | Bilirubin antigen, monoclonal antibody therefor, process for their preparation, and their use |
US5525464A (en) | 1987-04-01 | 1996-06-11 | Hyseq, Inc. | Method of sequencing by hybridization of oligonucleotide probes |
US5202231A (en) | 1987-04-01 | 1993-04-13 | Drmanac Radoje T | Method of sequencing of genomes by hybridization of oligonucleotide probes |
US5700637A (en) | 1988-05-03 | 1997-12-23 | Isis Innovation Limited | Apparatus and method for analyzing polynucleotide sequences and method of generating oligonucleotide arrays |
JP2897959B2 (en) | 1988-05-20 | 1999-05-31 | エフ.ホフマン―ラ ロシュ アクチェンゲゼルシャフト | Immobilized sequence-specific probe |
US6203977B1 (en) * | 1988-11-15 | 2001-03-20 | Yale University | Delineation of individual human chromosomes in metaphase and interphase cells by in situ suppression hybridization |
EP0392546A3 (en) | 1989-04-14 | 1991-09-11 | Ro Institut Za Molekularnu Genetiku I Geneticko Inzenjerstvo | Process for determination of a complete or a partial contents of very short sequences in the samples of nucleic acids connected to the discrete particles of microscopic size by hybridization with oligonucleotide probes |
US5143854A (en) | 1989-06-07 | 1992-09-01 | Affymax Technologies N.V. | Large scale photolithographic solid phase synthesis of polypeptides and receptor binding screening thereof |
US5871928A (en) * | 1989-06-07 | 1999-02-16 | Fodor; Stephen P. A. | Methods for nucleic acid analysis |
US6040138A (en) | 1995-09-15 | 2000-03-21 | Affymetrix, Inc. | Expression monitoring by hybridization to high density oligonucleotide arrays |
US5925525A (en) * | 1989-06-07 | 1999-07-20 | Affymetrix, Inc. | Method of identifying nucleotide differences |
US5800992A (en) * | 1989-06-07 | 1998-09-01 | Fodor; Stephen P.A. | Method of detecting nucleic acids |
DE69132905T2 (en) | 1990-12-06 | 2002-08-01 | Affymetrix Inc N D Ges D Staat | Detection of nucleic acid sequences |
CA2097708A1 (en) | 1990-12-06 | 1992-06-07 | Stephen P. A. Fodor | Very large scale immobilized polymer synthesis |
GR1000797B (en) | 1991-06-10 | 1993-01-25 | Emmanouil E Petromanolakis | Wave-making energy absorber during vessel s propulsion |
ATE185596T1 (en) * | 1991-08-13 | 1999-10-15 | Wisconsin Milk Marketing Board | DNA SEQUENCE CODING FOR BOvine G(A)-LACTALBUMIN AND METHODS OF USE |
US5846717A (en) * | 1996-01-24 | 1998-12-08 | Third Wave Technologies, Inc. | Detection of nucleic acid sequences by invader-directed cleavage |
WO1993022456A1 (en) | 1992-04-27 | 1993-11-11 | Trustees Of Dartmouth College | Detection of gene sequences in biological fluids |
US6395494B1 (en) * | 1993-05-13 | 2002-05-28 | Neorx Corporation | Method to determine TGF-β |
US6251920B1 (en) * | 1993-05-13 | 2001-06-26 | Neorx Corporation | Prevention and treatment of cardiovascular pathologies |
US5524070A (en) * | 1992-10-07 | 1996-06-04 | The Research Foundation Of State University Of New York | Local adaptive contrast enhancement |
US5632282A (en) * | 1993-07-20 | 1997-05-27 | Hay; S. Hutson | Ocular disease detection apparatus |
AU8126694A (en) | 1993-10-26 | 1995-05-22 | Affymax Technologies N.V. | Arrays of nucleic acid probes on biological chips |
ATE424927T1 (en) | 1993-10-28 | 2009-03-15 | Houston Advanced Res Ct | MICROFABRICATED POROUS FLOW DEVICE FOR DISCRETE DETERMINATION OF BONDING REACTIONS |
US6096503A (en) * | 1993-11-12 | 2000-08-01 | The Scripps Research Institute | Method for simultaneous identification of differentially expresses mRNAs and measurement of relative concentrations |
US6339767B1 (en) * | 1997-06-02 | 2002-01-15 | Aurigin Systems, Inc. | Using hyperbolic trees to visualize data generated by patent-centric and group-oriented data processing |
EP0667586A3 (en) * | 1994-02-14 | 1996-08-28 | Digital Equipment Corp | Database generator. |
US5560005A (en) * | 1994-02-25 | 1996-09-24 | Actamed Corp. | Methods and systems for object-based relational distributed databases |
US5570291A (en) * | 1994-08-24 | 1996-10-29 | Wallace Computer Services, Inc. | Custom product estimating and order processing system |
WO1997027317A1 (en) | 1996-01-23 | 1997-07-31 | Affymetrix, Inc. | Nucleic acid analysis techniques |
US6600996B2 (en) | 1994-10-21 | 2003-07-29 | Affymetrix, Inc. | Computer-aided techniques for analyzing biological sequences |
US5795716A (en) * | 1994-10-21 | 1998-08-18 | Chee; Mark S. | Computer-aided visualization and analysis system for sequence evaluation |
US5961923A (en) * | 1995-04-25 | 1999-10-05 | Irori | Matrices with memories and uses thereof |
DE69622088T2 (en) * | 1995-06-06 | 2003-03-06 | Ube Industries | Aromatic polyimide and gas separation |
US5777888A (en) | 1995-08-09 | 1998-07-07 | Regents Of The University Of California | Systems for generating and analyzing stimulus-response output signal matrices |
US5871697A (en) | 1995-10-24 | 1999-02-16 | Curagen Corporation | Method and apparatus for identifying, classifying, or quantifying DNA sequences in a sample without sequencing |
GB9522615D0 (en) | 1995-11-03 | 1996-01-03 | Pharmacia Spa | 4-Phenyl-4-oxo-butanoic acid derivatives with kynurenine-3-hydroxylase inhibiting activity |
US5778200A (en) | 1995-11-21 | 1998-07-07 | Advanced Micro Devices, Inc. | Bus arbiter including aging factor counters to dynamically vary arbitration priority |
US6228575B1 (en) | 1996-02-08 | 2001-05-08 | Affymetrix, Inc. | Chip-based species identification and phenotypic characterization of microorganisms |
US6108635A (en) * | 1996-05-22 | 2000-08-22 | Interleukin Genetics, Inc. | Integrated disease information system |
US5989835A (en) * | 1997-02-27 | 1999-11-23 | Cellomics, Inc. | System for cell-based screening |
CA2270527A1 (en) * | 1996-11-04 | 1998-05-14 | 3-Dimensional Pharmaceuticals, Inc. | System, method, and computer program product for the visualization and interactive processing and analysis of chemical data |
US5968784A (en) * | 1997-01-15 | 1999-10-19 | Chugai Pharmaceutical Co., Ltd. | Method for analyzing quantitative expression of genes |
US6205447B1 (en) * | 1997-06-30 | 2001-03-20 | International Business Machines Corporation | Relational database management of multi-dimensional data |
JP2001515234A (en) * | 1997-07-25 | 2001-09-18 | アフィメトリックス インコーポレイテッド | System for providing a polymorphism database |
US6032151A (en) * | 1997-11-17 | 2000-02-29 | Sun Microsystems, Inc. | Database system employing polymorphic entry and entry matching |
US6025194A (en) * | 1997-11-19 | 2000-02-15 | Geron Corporation | Nucleic acid sequence of senescence asssociated gene |
US5991766A (en) * | 1997-12-02 | 1999-11-23 | Electronic Data Systems Corporation | Method and system for managing redundant objects in a distributed object system |
US20030036855A1 (en) * | 1998-03-16 | 2003-02-20 | Praelux Incorporated, A Corporation Of New Jersey | Method and apparatus for screening chemical compounds |
US6324533B1 (en) * | 1998-05-29 | 2001-11-27 | International Business Machines Corporation | Integrated database and data-mining system |
US6331396B1 (en) * | 1998-09-23 | 2001-12-18 | The Cleveland Clinic Foundation | Arrays for identifying agents which mimic or inhibit the activity of interferons |
US6140054A (en) * | 1998-09-30 | 2000-10-31 | University Of Utah Research Foundation | Multiplex genotyping using fluorescent hybridization probes |
US6251601B1 (en) * | 1999-02-02 | 2001-06-26 | Vysis, Inc. | Simultaneous measurement of gene expression and genomic abnormalities using nucleic acid microarrays |
US6284465B1 (en) * | 1999-04-15 | 2001-09-04 | Agilent Technologies, Inc. | Apparatus, systems and method for locating nucleic acids bound to surfaces |
US6338071B1 (en) * | 1999-08-18 | 2002-01-08 | Affymetrix, Inc. | Method and system for providing a contract management system using an action-item table |
US6754666B1 (en) * | 1999-08-19 | 2004-06-22 | A2I, Inc. | Efficient storage and access in a database management system |
AU6909300A (en) * | 1999-08-20 | 2001-03-19 | Merck & Co., Inc. | Substituted ureas as cell adhesion inhibitors |
US7072896B2 (en) * | 2000-02-16 | 2006-07-04 | Verizon Laboratories Inc. | System and method for automatic loading of an XML document defined by a document-type definition into a relational database including the generation of a relational schema therefor |
US6569615B1 (en) * | 2000-04-10 | 2003-05-27 | The United States Of America As Represented By The Department Of Veteran's Affairs | Composition and methods for tissue preservation |
US20020062258A1 (en) * | 2000-05-18 | 2002-05-23 | Bailey Steven C. | Computer-implemented procurement of items using parametric searching |
US7731904B2 (en) * | 2000-09-19 | 2010-06-08 | Canon Kabushiki Kaisha | Method for making probe support and apparatus used for the method |
AU2002322775A1 (en) * | 2001-07-27 | 2003-02-17 | The Regents Of The University Of California | Stk15 (stk6) gene polymorphism and methods of determining cancer risk |
US20030126139A1 (en) * | 2001-12-28 | 2003-07-03 | Lee Timothy A. | System and method for loading commercial web sites |
-
1998
- 1998-07-24 JP JP2000504291A patent/JP2001515234A/en active Pending
- 1998-07-24 EP EP98936012A patent/EP1002264B1/en not_active Expired - Lifetime
- 1998-07-24 US US09/122,169 patent/US6484183B1/en not_active Expired - Lifetime
- 1998-07-24 JP JP2000504502A patent/JP2001511529A/en not_active Ceased
- 1998-07-24 DE DE69823206T patent/DE69823206T2/en not_active Expired - Lifetime
- 1998-07-24 AT AT98936012T patent/ATE264523T1/en not_active IP Right Cessation
- 1998-07-24 JP JP2000504489A patent/JP2001511550A/en active Pending
- 1998-07-24 WO PCT/US1998/015456 patent/WO1999005574A1/en active Application Filing
- 1998-07-24 WO PCT/US1998/015151 patent/WO1999005323A1/en not_active Application Discontinuation
- 1998-07-24 US US09/122,167 patent/US6229911B1/en not_active Expired - Lifetime
- 1998-07-24 EP EP98937126A patent/EP0998697A4/en not_active Withdrawn
- 1998-07-24 EP EP98937128A patent/EP1007737A4/en not_active Withdrawn
- 1998-07-24 EP EP98936972A patent/EP1009861A4/en not_active Withdrawn
- 1998-07-24 WO PCT/US1998/015469 patent/WO1999005591A2/en active IP Right Grant
- 1998-07-24 US US09/122,434 patent/US6308170B1/en not_active Expired - Lifetime
- 1998-07-24 WO PCT/US1998/015458 patent/WO1999005324A1/en active Application Filing
- 1998-07-24 US US09/122,304 patent/US6188783B1/en not_active Expired - Lifetime
- 1998-07-24 JP JP2000504290A patent/JP3776728B2/en not_active Expired - Lifetime
-
2001
- 2001-04-16 US US09/836,867 patent/US6567540B2/en not_active Expired - Lifetime
- 2001-08-27 US US09/940,285 patent/US6532462B2/en not_active Expired - Lifetime
-
2002
- 2002-08-14 US US10/219,021 patent/US20030074363A1/en not_active Abandoned
-
2003
- 2003-02-25 US US10/374,170 patent/US6882742B2/en not_active Expired - Lifetime
-
2004
- 2004-09-02 JP JP2004256352A patent/JP2005100389A/en not_active Withdrawn
-
2005
- 2005-01-18 US US11/038,624 patent/US20050164270A1/en not_active Abandoned
- 2005-03-14 US US11/080,216 patent/US7215804B2/en not_active Expired - Fee Related
- 2005-05-30 JP JP2005158223A patent/JP2005353057A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5206137A (en) * | 1988-09-08 | 1993-04-27 | Lifecodes Corporation | Compositions and methods useful for genetic analysis |
US5593839A (en) * | 1994-05-24 | 1997-01-14 | Affymetrix, Inc. | Computer-aided engineering system for design of sequence arrays and lithographic masks |
WO1996023078A1 (en) * | 1995-01-27 | 1996-08-01 | Incyte Pharmaceuticals, Inc. | Computer system storing and analyzing microbiological data |
US5707806A (en) * | 1995-06-07 | 1998-01-13 | Genzyme Corporation | Direct sequence identification of mutations by cleavage- and ligation-associated mutation-specific sequencing |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6334099B1 (en) | 1999-05-25 | 2001-12-25 | Digital Gene Technologies, Inc. | Methods for normalization of experimental data |
US7058517B1 (en) | 1999-06-25 | 2006-06-06 | Genaissance Pharmaceuticals, Inc. | Methods for obtaining and using haplotype data |
US6713258B2 (en) | 2000-03-29 | 2004-03-30 | Center For The Application Of Molecular Biology To International Agriculture (Cambia) | Methods for genotyping by hybridization analysis |
US6931326B1 (en) | 2000-06-26 | 2005-08-16 | Genaissance Pharmaceuticals, Inc. | Methods for obtaining and using haplotype data |
GB2574582A (en) * | 2018-05-28 | 2019-12-18 | Rainer Gabriel Schweiger Martin | Method for simulating a technical device |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6484183B1 (en) | Method and system for providing a polymorphism database | |
US6826296B2 (en) | Method and system for providing a probe array chip design database | |
Nickerson et al. | PolyPhred: automating the detection and genotyping of single nucleotide substitutions using fluorescence-based resequencing | |
EP1124990B1 (en) | Complexity management and analysis of genomic dna | |
JP3055942B2 (en) | Oligoprobe Design Station: Computer-based design of oligonucleotide probes and primers | |
US20020049772A1 (en) | Computer program product for genetically characterizing an individual for evaluation using genetic and phenotypic variation over a wide area network | |
US20030220844A1 (en) | Method and system for purchasing genetic data | |
US20030028501A1 (en) | Computer based method for providing a laboratory information management system | |
US20060271513A1 (en) | Method and apparatus for providing an expression data mining database | |
US20040023275A1 (en) | Methods for genomic analysis | |
US20060212229A1 (en) | Method and system for providing a probe array chip design database | |
US20060259251A1 (en) | Computer software products for associating gene expression with genetic variations | |
WO2000016220A9 (en) | Method and apparatus for providing an expression data mining database and laboratory information management | |
EP1396800A2 (en) | Method and apparatus for providing a bioinformatics database | |
JP2003526133A6 (en) | Method and apparatus for providing expression data mining database and laboratory information management | |
Gilbert et al. | Strategies for genotype generation | |
JP2003526133A (en) | Method and apparatus for providing expression data mining database and laboratory information management | |
Carroll | Mobile elements: Genome-wide distribution and complexity | |
Tripathi et al. | Pharmacogenomics and computational genomics: An appraisal | |
EP1038245A1 (en) | Method and apparatus for providing an expression data mining database and laboratory information management |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): JP |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
WWE | Wipo information: entry into national phase |
Ref document number: 1998937128 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 1998937128 Country of ref document: EP |