US20050186581A1 - Computer-aided visualization and analysis system for sequence evaluation - Google Patents
Computer-aided visualization and analysis system for sequence evaluation Download PDFInfo
- Publication number
- US20050186581A1 US20050186581A1 US10/791,373 US79137304A US2005186581A1 US 20050186581 A1 US20050186581 A1 US 20050186581A1 US 79137304 A US79137304 A US 79137304A US 2005186581 A1 US2005186581 A1 US 2005186581A1
- Authority
- US
- United States
- Prior art keywords
- sequence
- base
- sample
- intensity
- sequences
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000004458 analytical method Methods 0.000 title description 41
- 238000012800 visualization Methods 0.000 title description 5
- 238000011156 evaluation Methods 0.000 title description 3
- 239000000523 sample Substances 0.000 claims abstract description 316
- 238000000034 method Methods 0.000 claims abstract description 80
- 108020004711 Nucleic Acid Probes Proteins 0.000 claims abstract description 9
- 239000002853 nucleic acid probe Substances 0.000 claims abstract description 9
- 238000009396 hybridization Methods 0.000 claims description 10
- 239000000178 monomer Substances 0.000 claims description 7
- 238000004590 computer program Methods 0.000 claims 11
- 238000002474 experimental method Methods 0.000 abstract description 49
- 150000007523 nucleic acids Chemical group 0.000 abstract description 41
- 238000012545 processing Methods 0.000 abstract description 12
- 238000010835 comparative analysis Methods 0.000 abstract description 7
- 230000035772 mutation Effects 0.000 description 101
- 238000007430 reference method Methods 0.000 description 32
- 108091028043 Nucleic acid sequence Proteins 0.000 description 27
- 230000000295 complement effect Effects 0.000 description 21
- 108020003175 receptors Proteins 0.000 description 16
- 102000005962 receptors Human genes 0.000 description 16
- 239000000758 substrate Substances 0.000 description 15
- 238000007619 statistical method Methods 0.000 description 14
- 239000012488 sample solution Substances 0.000 description 13
- 108020004414 DNA Proteins 0.000 description 12
- 238000004364 calculation method Methods 0.000 description 12
- 238000013461 design Methods 0.000 description 12
- 238000003491 array Methods 0.000 description 11
- 230000014509 gene expression Effects 0.000 description 11
- 108090000623 proteins and genes Proteins 0.000 description 9
- 230000015572 biosynthetic process Effects 0.000 description 8
- 108020004707 nucleic acids Proteins 0.000 description 8
- 102000039446 nucleic acids Human genes 0.000 description 8
- 238000003786 synthesis reaction Methods 0.000 description 8
- 229920000642 polymer Polymers 0.000 description 7
- 238000011176 pooling Methods 0.000 description 7
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 6
- 239000000463 material Substances 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 239000013074 reference sample Substances 0.000 description 5
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 4
- 230000000692 anti-sense effect Effects 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 238000004519 manufacturing process Methods 0.000 description 3
- 238000007781 pre-processing Methods 0.000 description 3
- 238000011282 treatment Methods 0.000 description 3
- 102100025230 2-amino-3-ketobutyrate coenzyme A ligase, mitochondrial Human genes 0.000 description 2
- 108010087522 Aeromonas hydrophilia lipase-acyltransferase Proteins 0.000 description 2
- 108010010369 HIV Protease Proteins 0.000 description 2
- 101000804764 Homo sapiens Lymphotactin Proteins 0.000 description 2
- 101000869690 Homo sapiens Protein S100-A8 Proteins 0.000 description 2
- 102100035304 Lymphotactin Human genes 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 239000004365 Protease Substances 0.000 description 2
- 102100032442 Protein S100-A8 Human genes 0.000 description 2
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 2
- 239000012620 biological material Substances 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 230000005284 excitation Effects 0.000 description 2
- 238000011049 filling Methods 0.000 description 2
- 239000007850 fluorescent dye Substances 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- 230000002285 radioactive effect Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- QCJPNTKIAXXSJI-UHFFFAOYSA-N 5-methyl-1h-pyrimidine-2,4-dione;1h-pyrimidine-2,4-dione Chemical compound O=C1C=CNC(=O)N1.CC1=CNC(=O)NC1=O QCJPNTKIAXXSJI-UHFFFAOYSA-N 0.000 description 1
- 208000030507 AIDS Diseases 0.000 description 1
- 229930024421 Adenine Natural products 0.000 description 1
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 1
- 201000003883 Cystic fibrosis Diseases 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 238000000018 DNA microarray Methods 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 241000725303 Human immunodeficiency virus Species 0.000 description 1
- 241000713772 Human immunodeficiency virus 1 Species 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- -1 aMino Chemical group 0.000 description 1
- 229960000643 adenine Drugs 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 238000010511 deprotection reaction Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000002651 drug therapy Methods 0.000 description 1
- 239000000975 dye Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- JEIPFZHSYJVQDO-UHFFFAOYSA-N ferric oxide Chemical compound O=[Fe]O[Fe]=O JEIPFZHSYJVQDO-UHFFFAOYSA-N 0.000 description 1
- 238000001506 fluorescence spectroscopy Methods 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 238000003205 genotyping method Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- CJWXCNXHAIFFMH-AVZHFPDBSA-N n-[(2s,3r,4s,5s,6r)-2-[(2r,3r,4s,5r)-2-acetamido-4,5,6-trihydroxy-1-oxohexan-3-yl]oxy-3,5-dihydroxy-6-methyloxan-4-yl]acetamide Chemical compound C[C@H]1O[C@@H](O[C@@H]([C@@H](O)[C@H](O)CO)[C@@H](NC(C)=O)C=O)[C@H](O)[C@@H](NC(C)=O)[C@@H]1O CJWXCNXHAIFFMH-AVZHFPDBSA-N 0.000 description 1
- 108700004029 pol Genes Proteins 0.000 description 1
- 101150088264 pol gene Proteins 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 1
- 238000003908 quality control method Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 1
- 239000001022 rhodamine dye Substances 0.000 description 1
- 238000005464 sample preparation method Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 238000007794 visualization technique Methods 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J19/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J19/0046—Sequential or parallel reactions, e.g. for the synthesis of polypeptides or polynucleotides; Apparatus and devices for combinatorial chemistry or for making molecular arrays
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B82—NANOTECHNOLOGY
- B82Y—SPECIFIC USES OR APPLICATIONS OF NANOSTRUCTURES; MEASUREMENT OR ANALYSIS OF NANOSTRUCTURES; MANUFACTURE OR TREATMENT OF NANOSTRUCTURES
- B82Y30/00—Nanotechnology for materials or surface science, e.g. nanocomposites
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6827—Hybridisation assays for detection of mutation or polymorphism
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6869—Methods for sequencing
- C12Q1/6874—Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B25/00—ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
- G16B25/20—Polymerase chain reaction [PCR]; Primer or probe design; Probe optimisation
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
- G16B30/20—Sequence assembly
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00277—Apparatus
- B01J2219/00351—Means for dispensing and evacuation of reagents
- B01J2219/00427—Means for dispensing and evacuation of reagents using masks
- B01J2219/00432—Photolithographic masks
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00277—Apparatus
- B01J2219/00497—Features relating to the solid phase supports
- B01J2219/00527—Sheets
- B01J2219/00529—DNA chips
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00583—Features relative to the processes being carried out
- B01J2219/00603—Making arrays on substantially continuous surfaces
- B01J2219/00605—Making arrays on substantially continuous surfaces the compounds being directly bound or immobilised to solid supports
- B01J2219/00608—DNA chips
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00583—Features relative to the processes being carried out
- B01J2219/00603—Making arrays on substantially continuous surfaces
- B01J2219/00605—Making arrays on substantially continuous surfaces the compounds being directly bound or immobilised to solid supports
- B01J2219/0061—The surface being organic
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00583—Features relative to the processes being carried out
- B01J2219/00603—Making arrays on substantially continuous surfaces
- B01J2219/00605—Making arrays on substantially continuous surfaces the compounds being directly bound or immobilised to solid supports
- B01J2219/00612—Making arrays on substantially continuous surfaces the compounds being directly bound or immobilised to solid supports the surface being inorganic
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00583—Features relative to the processes being carried out
- B01J2219/00603—Making arrays on substantially continuous surfaces
- B01J2219/00605—Making arrays on substantially continuous surfaces the compounds being directly bound or immobilised to solid supports
- B01J2219/00614—Delimitation of the attachment areas
- B01J2219/00617—Delimitation of the attachment areas by chemical means
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00583—Features relative to the processes being carried out
- B01J2219/00603—Making arrays on substantially continuous surfaces
- B01J2219/00605—Making arrays on substantially continuous surfaces the compounds being directly bound or immobilised to solid supports
- B01J2219/00623—Immobilisation or binding
- B01J2219/00626—Covalent
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00583—Features relative to the processes being carried out
- B01J2219/00603—Making arrays on substantially continuous surfaces
- B01J2219/00659—Two-dimensional arrays
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/0068—Means for controlling the apparatus of the process
- B01J2219/00686—Automatic
- B01J2219/00689—Automatic using computers
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/0068—Means for controlling the apparatus of the process
- B01J2219/00695—Synthesis control routines, e.g. using computer programs
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00709—Type of synthesis
- B01J2219/00711—Light-directed synthesis
-
- B—PERFORMING OPERATIONS; TRANSPORTING
- B01—PHYSICAL OR CHEMICAL PROCESSES OR APPARATUS IN GENERAL
- B01J—CHEMICAL OR PHYSICAL PROCESSES, e.g. CATALYSIS OR COLLOID CHEMISTRY; THEIR RELEVANT APPARATUS
- B01J2219/00—Chemical, physical or physico-chemical processes in general; Their relevant apparatus
- B01J2219/00274—Sequential or parallel reactions; Apparatus and devices for combinatorial chemistry or for making arrays; Chemical library technology
- B01J2219/00718—Type of compounds synthesised
- B01J2219/0072—Organic compounds
- B01J2219/00722—Nucleotides
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B40/00—Libraries per se, e.g. arrays, mixtures
- C40B40/04—Libraries containing only organic compounds
- C40B40/06—Libraries containing nucleotides or polynucleotides, or derivatives thereof
-
- C—CHEMISTRY; METALLURGY
- C40—COMBINATORIAL TECHNOLOGY
- C40B—COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
- C40B60/00—Apparatus specially adapted for use in combinatorial chemistry or with libraries
- C40B60/14—Apparatus specially adapted for use in combinatorial chemistry or with libraries for creating libraries
-
- G01N15/1433—
Definitions
- a method is disclosed of processing reference and sample nucleic acid sequences to reduce the variations between the experiments by the steps of:
- a computer system is used for comparative analysis and visualization of multiple sequences by the steps of:
- FIG. 1 illustrates an overall system for forming and analyzing arrays of biological materials such as DNA or RNA
- FIG. 3 illustrates the high level flow of the intensity ratio method
- FIG. 5A illustrates the high level flow of another implementation of the reference method
- FIG. 5B shows a data table for use with the reference method
- FIG. 5C shows a graph of the normalized sample base intensities minus the normalized reference base intensities
- FIG. 5D shows other graphs of data in the data table
- FIG. 9 illustrates an intensity graph window for a selected base
- FIG. 11 illustrates the intensity ratio method correctly calling a mutation in solutions with varying concentrations
- FIG. 12 illustrates the reference method correctly calling a mutant base where the intensity ratio method incorrectly called the mutant base
- FIG. 13 illustrates the output of the ViewSeqTM program with four pretreatment samples and four posttreatment samples.
- the present invention provides methods of analyzing hybridization intensity files for a chip containing hybridized nucleic acid probes.
- the files represent fluorescence data from a biological array, but the files may also represent other data such as radioactive intensity data.
- the present invention is described as being part of a computer system that designs a chip mask, synthesizes the probes on the chip, labels the nucleic acids, and scans the hybridized nucleic acid probes. Such a system is fully described in U.S. patent application Ser. No. 08/249,188 which has been incorporated by reference for all purposes. However, the present invention may be used separately from the overall system for analyzing data generated by such systems.
- FIG. 1 illustrates a computerized system for forming and analyzing arrays of biological materials such as RNA or DNA.
- a computer 100 is used to design arrays of biological polymers such as RNA or DNA.
- the computer 100 may be, for example, an appropriately programmed Sun Workstation or personal computer or workstation, such as an IBM PC equivalent, including appropriate memory and a CPU.
- the computer system 100 obtains inputs from a user regarding characteristics of a gene of interest, and other inputs regarding the desired features of the array.
- the computer system may obtain information regarding a specific genetic sequence of interest from an external or internal database 102 such as GenBank.
- the output of the computer system 100 is a set of chip design computer files 104 in the form of, for example, a switch matrix, as described in PCT application WO 92/10092, and other associated computer files.
- the chip design files are provided to a system 106 that designs the lithographic masks used in the fabrication of arrays of molecules such as DNA.
- the system or process 106 may include the hardware necessary to manufacture masks 110 and also the necessary computer hardware and software 108 necessary to lay the mask patterns out on the mask in an efficient manner. As with the other features in FIG. 1 , such equipment may or may not be located at the same physical site, but is shown together for ease of illustration in FIG. 1 .
- the system 106 generates masks 110 or other synthesis patterns such as chrome-on-glass masks for use in the fabrication of polymer arrays.
- the output of scanner 120 is an image file(s) 124 indicating, in the case of fluorescein labeled receptor, the fluorescence intensity (photon counts or other related measurements, such as voltage) as a function of position on the substrate. Since higher photon counts will be observed where the labeled receptor has bound more strongly to the array of polymers, and since the monomer sequence of the polymers on the substrate is known as a function of position, it becomes possible to determine the sequence(s) of polymer(s) on the substrate that are complementary to the receptor.
- FIG. 2A provides a simplified illustration of the overall software system used in the operation of one embodiment of the invention.
- the system first identifies the genetic sequence(s) or targets that would be of interest in a particular analysis at step 202 .
- the sequences of interest may, for example, be normal or mutant portions of a gene, genes that identify heredity, or provide forensic information. Sequence selection may be provided via manual input of text files or may be from external sources such as GenBank.
- the system evaluates the gene to determine or assist the user in determining which probes would be desirable on the chip, and provides an appropriate “layout” on the chip for the probes.
- Cell 220 contains a wild-type probe that is the complement of a portion of the wild-type sequence.
- Cells 222 contain “mutation” probes for the wild-type sequence. For example, if the wild-type probe is 3′-ACGT, the probes 3′-ACAT, 3′-ACCT, 3′-ACGT, and 3′-ACTT may be the “mutation” probes.
- Cell 224 is the “blank” cell because it contains no probes (also called the “blank” probe). As the blank cell contains no probes, labeled receptors should not bind to the chip in this area. Thus, the blank cell provides an area that can be used to measure the background intensity.
- Opt-tiling is the process of tiling additional probes for suspected mutations.
- the wild-type target sequence is 5′-ACGT A TGCA-3′ and it is suspected that a mutant sequence has a possible T base mutation at the underlined base position.
- the chip will be synthesized with a “4 ⁇ 3” tiling strategy, meaning that probes of four monomers are used and that the monomers in position 3, counting left to right, of the probe are varied.
- the top row of the probes (along with one probe below each of the four wild-type probes) should bind to the target DNA sequence.
- the labeled mutant sequence will not bind that strongly to the probes in the columns around column 3.
- the mutant receptor that could bind with the probes in column 2 is 5′-CGTT which may not bind that strongly to any of the probes in column 2 because there are T bases at the ends of the receptor and probes (i.e., not complementary). This often results in a relatively dark scanned area around a mutation.
- the masks for the synthesis are designed.
- the software utilizes the mask design and layout information to make the DNA or other polymer chips. This software 208 will control, among other things, relative translation of a substrate and the mask, the flow of desired reagents through a flow cell, the synthesis temperature of the flow cell, and other parameters.
- another piece of software is used in scanning a chip thus synthesized and exposed to a labeled receptor. The software controls the scanning of the chip, and stores the data thus obtained in a file that may later be utilized to extract sequence information.
- the image file will contain four fluorescence intensities, one for each probe. Each fluorescence intensity can therefore be associated with the base of each probe that is different from the other probes. Additionally, the image file will contain a “blank” cell which can be used as the fluorescence intensity of the background.
- the unknown base will be identified by evaluation of up to four mutation probes and a “blank” cell, which is a location where a labeled receptor should not bind to the chip since no probe is present.
- a DNA sequence of interest or target sequence contains the sequence 5′-AGAA C CTGC-3′ with a possible mutation at the underlined base position.
- 5-mer probes are to be synthesized for the target sequence.
- a representative wild-type probe of 5′-TTGGA is complementary to the region of the sequence around the possible mutation.
- the “mutation” probes will be the same as the wild-type probe except for a different base at the third position as follows: 3′-TTAGA, 3′-TTCGA, 3′-TTGGA, and 3′-TTTGA.
- the identity of the unknown base is preferably determined by evaluating the relative fluorescence intensities of up to four of the mutation probes, and the “blank” cell. Because each mutation probe is identifiable by the mutation base, a mutation probe's intensity will be referred to the “base intensity” of the mutation base.
- each fluorescence intensity is from a probe
- the probes may be characterized by their unique mutation base so the bases may be said to have the following intensities: A ⁇ > 45 C ⁇ > 8 G ⁇ > 32 T ⁇ > 12
- base A will be described as having an intensity of 45, which corresponds to the intensity of the mutation probe with the mutation base A.
- the base intensities are sorted by intensity.
- the above bases would be sorted as follows: A ⁇ > 43 G ⁇ > 30 T ⁇ > 10 C ⁇ > 6
- the highest intensity base is compared to the second highest intensity base.
- the ratio A:G is then compared to a predetermined ratio cutoff which is a number that specifies the ratio required to identify the unknown base. For example, if the ratio cutoff is 1.2, the ratio A:G is greater than the ratio cutoff (1.4>1.2) and the unknown base is called by the mutation probe containing the mutation A.
- the sample sequence is called as having a mutation T, resulting in a called sample sequence of 5′-ATGTGGA T AGTTGTA-3′.
- the ratio cutoff in the previous examples was equal to 1.2. However, the ratio cutoff will generally need to be adjusted to produce optimal results for the specific chip design and wild-type target. Also, although the ratio cutoff used has been the same for each ratio comparison, the ratio cutoff may vary depending on whether the ratio comparisons involve the highest, second highest, third highest, etc. intensity base.
- the base intensities are sorted by intensity. Each base is then associated with a number from 1 to 4. The base with the highest intensity is 1, second highest 2, third highest 3, and fourth highest 4. Thus, the intensity of base 1 ⁇ base 2 ⁇ base 3 ⁇ base 4.
- the unknown base is assigned the code N (insufficient intensity) as shown at step 308 . Otherwise, the ratio of the intensity of base 1 to base 2 is calculated as shown at step 310 .
- the ratio of intensity of base 1:2 is compared to the ratio cutoff. If the ratio 1:2 is greater than the ratio cutoff, the unknown base is called as the complement of the highest intensity base (base 1) as shown at step 314 . Otherwise, the ratio of the intensity of base 2 to base 3 is calculated as shown at step 316 .
- the ratio of intensity of base 2:3 is compared to the ratio cutoff. If the ratio 2:3 is greater than the ratio cutoff, the unknown base is called as being an ambiguity code specifying the complements of the highest or second highest intensity bases (base 1 or 2) as shown at step 320 . Otherwise, the ratio of the intensity of base 3 to base 4 is calculated as shown at step 322 .
- the ratio of intensity of base 3:4 is compared to the ratio cutoff. If the ratio 3:4 is greater than the ratio cutoff, the unknown base is called as being an ambiguity code specifying the complements of the highest, second highest, or third highest bases (base 1, 2 or 3) as shown at step 326 . Otherwise, the unknown base is assigned the code X (insufficient discrimination) as shown at step 328 .
- the advantage of the intensity ratio method is that it is very accurate when there is good discrimination between the fluorescence intensities of hybrid matches and hybrid mismatches.
- the method is useful for comparative assessment of hybridization quality and as an indicator of sequence-specific problem spots.
- the intensity ratio method has been used to determine that ambiguities and miscalls tend to be very different from sequence to sequence, and reflect predominantly the composition and repetitiveness of the sequence. It has also been used to assess improvements obtained by varying hybridization conditions, sample preparation, and post-hybridization treatments (e.g., RNase treatment).
- the reference method is a method of calling bases in a sample nucleic acid sequence.
- the reference method depends very little on discrimination between the fluorescence intensities of hybrid matches and hybrid mismatches, and therefore is much less sensitive to cross-hybridization.
- the method compares the probe intensities of a reference sequence to the probe intensities of a sample sequence. Any significant changes are flagged as possible mutations. There are two implementations of the reference method disclosed herein.
- a reference sequence which differs from the chip wild-type by one base mutation, has the sequence 5′-AGAC A TTGC-3′ where the mutation base is underlined.
- the “mutation” probes for the reference sequence may be as follows: 3′-TGAAA, 3′-TGCAA, 3′-TGGAA, and 3′-TGTAA, where 3′-TGTAA is the reference wild-type probe since the reference sequence is known.
- the sample and reference sequences were tiled with the same chip wild-type, this is not required, and the tiling methods do not have to be identical as shown in the example.
- the unknown base will be called by comparing the “mutation” probes of the sample sequence to the “mutation” probes of the reference sequence.
- the mutation probes' intensities will be referred to the “base intensities” of their respective mutation bases.
- the four base intensities of the reference and sample sequences are adjusted by subtracting the background or “blank” cell intensity from each base intensity.
- Each set of “mutation” probes has an associated “blank” cell. Suppose that the reference “blank” cell intensity is 1 and the sample “blank” cell intensity is 2.
- the base intensity associated with the reference wild-type (column 2 of the analysis table) is checked to see if it has sufficient intensity to call the unknown base.
- the reference wild-type is C.
- the base intensity associated with the wild-type is the G base intensity, which is 79 in this example. This is because the base intensities actually represent the complementary “mutation” probes.
- the G base intensity is checked by determining if its intensity is greater than a predetermined background difference cutoff.
- the background difference cutoff is a number that specifies the intensity the base intensities must be above the background intensity in order to correctly call the unknown base.
- the base intensity associated with the reference wild-type must be greater than the background difference cutoff or the unknown base is not callable.
- the ratio of the base intensity associated with the reference wild-type to each of the possible bases are calculated.
- the ratio of the base intensity associated with the reference wild-type to itself will be 1 and the other ratios will usually be greater than 1.
- G:C ⁇ > 79/8 9.9
- G:G ⁇ > 79/79 1.0
- G:T ⁇ > 79/14 5.6
- the highest base intensity which is G in this example, has sufficient intensity (58>5) so a P (pass) is placed in column 8 of the analysis table as shown at step 412 . Otherwise, at step 413 an F (fail) is placed in column 8 of the analysis table.
- step 416 if both the reference and sample sequence probes failed to have sufficient intensity to call the unknown base, meaning there is an ‘F’ in columns 3 and 8 of the analysis table, the unknown base is assigned the code N (insufficient intensity) as shown at step 418 .
- An ‘N’ is placed in column 17 of the analysis table.
- the unknown base is assigned the code N (insufficient intensity) as shown at step 422 .
- An ‘N’ is placed in column 17 and a confidence code of 4 is placed in column 18 of the analysis table.
- both the reference and sample sequence probes have sufficient intensity to call the unknown base.
- the ratios of the reference ratios to the sample ratios for each base type are calculated.
- the ratio A:A (column 4 to column 9) is placed in column 13 of the analysis table.
- the ratio C:C (column 5 to column 10) is placed in column 14 of the analysis table.
- the ratio G:G (column 6 to column 11) is placed in column 15 of the analysis table.
- the ratio T:T (column 7 to column 12) is placed in column 16 of the analysis table.
- the unknown base is assigned the code of the reference wild-type as shown at step 432 .
- the code for the reference wild-type (as shown in column 2) would be placed in column 17 and a confidence code of 0 is placed in column 18 of the analysis table.
- ratios of ratios are complementary to their respective base as follows: A:A ⁇ > T C:C ⁇ > G G:G ⁇ > C so the unknown base is called as being either C, G, or T, which is identified by the IUPAC code B.
- This ambiguity code is placed in column 17 and a confidence code of 3 would be placed in column 18 of the analysis table.
- At step 438 at least one of the ratios of ratios is greater than the upper ratio cutoff and the unknown base is called as the base complementary to the highest ratio of ratios.
- the code for the base complementary to the highest ratio of ratios would be placed in column 17 and a confidence code of 1 is placed in column 18 of the analysis table.
- the ratios of ratios are as follows: A:A ⁇ > 1.1 C:C ⁇ > 4.3 G:G ⁇ > 1.0 T:T ⁇ > 0.4
- the unknown base is called the base complementary to the highest ratio of ratios.
- the highest ratio of ratios is C:C, which has a complementary base G.
- the unknown base is called G which is placed in column 17 and a confidence code of 1 is placed in column 18 of the analysis table.
- the example shows how the unknown base in the sample nucleic acid sequence was correctly called as base G.
- the complementary “mutation” probe associated with the base G (3′-GCCT) did not have the highest fluorescence intensity, the unknown base was called as base G because the associated “mutation” probe had the highest ratio increase over the other “mutation” probes.
- the base intensity associated with the reference wild-type is checked to see if it has sufficient intensity to call the unknown base.
- the reference wild-type is base A at position 241.
- the base intensity associated with the reference wild-type is identified by a lower case “a” in the left hand column.
- the base intensities in the data table are not identified by their complements and the reference wild-type at the mutation position has an intensity of 385.
- the reference wild-type intensity of 385 is checked by determining if its intensity is greater than a predetermined background difference cutoff.
- the background difference cutoff is a number that specifies the intensity the base intensities must be over the background intensity in order to correctly call the unknown base.
- the base intensity associated with the reference wild-type must be greater than the background difference cutoff or the unknown base is not callable.
- step 518 calculations are performed on the background subtracted base intensities of the sample sequence in order to “normalize” the intensities.
- Each position in the sample sequence has four background subtracted base intensities associated with it.
- the ratio of the intensity of each base to the sum of the intensities of the possible bases is calculated, resulting in four ratios, one for each base as shown in the data table.
- the unknown base is assigned the code N (insufficient intensity) as shown at step 522 .
- the normalized base intensities of the reference sequence are subtracted from the normalized base intensities of the sample sequence.
- each position is checked to see if there is a base difference greater than an upper difference cutoff and a base difference lower than a lower difference cutoff.
- FIG. 5C shows a graph the normalized sample base intensities minus the normalized reference base intensities.
- the upper difference cutoff is 0.15 and the lower difference cutoff is ⁇ 0.15 as shown by the horizontal lines in FIG. 5C .
- the G difference is 0.28 which is greater than 0.15, the upper difference cutoff.
- the A difference is ⁇ 0.32 which is less than ⁇ 0.15, the lower difference cutoff.
- the base at that position is assigned the code of the reference wild-type base as shown at step 528 .
- these ratios are compared to a ratio at a neighboring position.
- the ratio at position 242 (which equals 1.02) is subtracted from the ratio at position 241 (which equals 1.48). It has been found that a mutant can be confidently detected by analyzing the difference of these neighboring ratios.
- the base at that position is assigned the code of the reference wild-type base as shown at step 538 .
- the advantage of the reference method is that the correct base can be called even in the presence of significant levels of cross-hybridization, as long as ratios of intensities are fairly consistent from experiment to experiment. In practice, the number of miscalls and ambiguities is significantly reduced, while the number of correct calls is actually increased, making the reference method very useful for identifying candidate mutations.
- the reference method has also been used to compare the reproducibility of experiments in terms of base calling.
- the statistical method is a method of calling bases in a sample nucleic acid sequence.
- the statistical method utilizes the statistical variation across experiments to call the bases. Therefore, the statistical method is good at calling bases if data from multiple experiments is available and the data is fairly consistent among the experiments.
- the method compares the probe intensities of a sample sequence to statistics of probe intensities of a reference sequence in multiple experiments.
- a base at the same position in the reference and sample sequences will be associated with up to four mutation probes and a “blank” cell.
- the mutation probes' intensities will be referred to as the “base intensities” of their respective mutation bases.
- a gene of interest has the sequence 5′-AAAACTGAAAA-3′.
- a reference sequence has the sequence 5′-AAAAC C GAAAA-3′, which differs from the target sequence by the underlined base.
- a sample sequence is suspected to have the same sequence as the target sequence except for a T base mutation at the underlined base position in 5′-AAAACTGAAAA-3′.
- the reference sequence is marked and exposed to probes on a chip.
- the sample sequence is also marked and exposed to probes on a chip.
- the “mutation” probes shown for the reference sequence may be from only one experiment, the other experiments may have different “mutation” probes, chip wild-types, tiling methods, and the like.
- each fluorescence intensity is from a probe, since the probes may be identified by their unique mutation bases, the probe intensities may be identified by their respective bases as follows: Reference Sample 3′-TGAC ⁇ A 3′-GACT ⁇ A 3′-TGCC ⁇ C 3′-GCCT ⁇ C 3′-TGGC ⁇ G 3′-GGCT ⁇ G 3′-TGTC ⁇ T 3′-GTCT ⁇ T
- base A of the reference sequence will be described as having an intensity which corresponds to the intensity of the mutation probe with the mutation base A.
- the statistical method will now be described as calling the unknown base in the sample sequence by using this example.
- FIG. 6 illustrates the high level flow of the statistical method.
- the four base intensities associated with the sample sequence and each of the multiple reference experiments are adjusted by subtracting the background or “blank” cell intensity from each base intensity.
- the base intensity is set equal to a small positive number to prevent division by zero or negative numbers.
- the intensities of the reference wild-type bases in the multiple experiments are checked to see if they all have sufficient intensity to call the unknown base.
- the intensities are checked by determining if the intensity of the reference wild-type base of an experiment is greater than a predetermined background difference cutoff.
- the wild-type probe shown earlier for the reference sequence is 3′-TGGC, and thus the G base intensity is the wild-type base intensity.
- the wild-type experiments fail to have sufficient intensity as shown at step 606 . Otherwise, at step 608 the wild-type experiments pass by having sufficient intensity.
- each reference experiment has four background subtracted base intensities associated with it: one wild-type and three for the other possible bases.
- the G base intensity is the wild-type
- the A, C, and T base intensities being the “other” intensities.
- the ratios of the intensity of each base to the sum of the intensities of the possible bases are calculated, giving one wild-type ratio and three “other” ratios.
- each reference experiment will be associated with one wild-type ratio and three “other” ratios.
- the mean and standard deviation are calculated for all the wild-type ratios.
- the mean and standard deviation are also calculated for each of the other ratios, resulting in three other means and standard deviations for each of the bases that is not the wild-type base. Therefore, the following would be calculated: Mean and standard deviation of A ratios Mean and standard deviation of C ratios Mean and standard deviation of G ratios Mean and standard deviation of T ratios where the mean and standard deviation of the G ratios are also known as the wild-type mean and the wild-type standard deviation, respectively.
- the mean and standard deviation of the A, C, and T means and standard deviations are also known collectively as the “other” means and standard deviations.
- the steps up to and including step 612 are performed in a preprocessing stage for the multiple wild-type experiments.
- the results of the preprocessing stage are stored in a file so that the reference calculations do not have to be repeatedly calculated, which results in increased performance.
- Microfiche Appendices C and D contain the programming code to perform the preprocessing stage.
- the highest base intensity associated with the sample sequence is checked to see if it has sufficient intensity to call the unknown base.
- the intensity is checked by determining if the highest intensity unknown base is greater than the background difference cutoff. If the intensity is not greater than the background difference cutoff, the sample sequence fails to have sufficient intensity as shown at step 616 . Otherwise, at step 618 the sample sequence passes by having sufficient intensity.
- step 620 calculations are performed on the four background subtracted intensities of the sample sequence.
- the ratio of the background subtracted intensity of each base to the sum of the background subtracted intensities of the possible bases (all four) is calculated, giving four ratios, one for each base.
- the ratio associated with the reference wild-type base is called the wild-type ratio, with there being three “other” ratios.
- the unknown base is assigned the code of the reference wild-type base as shown at step 630 .
- the unknown base is assigned the code X (insufficient discrimination) as shown at step 636 .
- FIG. 7 illustrates the pooling processing of a reference and sample nucleic acid sequence.
- a reference nucleic acid sequence is marked with a fluorescent dye, such as a fluorescein.
- a sample nucleic acid sequence is marked with a dye that, upon excitation, emits light that of a different wavelength than the fluorescent dye of the reference sequence.
- the sample nucleic acid sequence may be marked with rhodamine.
- the labeled reference sequence and the labeled sample sequence are combined. After this step, processing continues in the same manner as for only one labeled sequence.
- the sequences are fragmented. The fragmented nucleic acid sequences are then hybridized on a chip containing probes as shown at step 710 .
- a scanner generates image files that indicate the locations where the labeled nucleic acids bound to the chip.
- the scanner generates an image file by focusing excitation light on the hybridized chip and detecting the fluorescent light that is emitted.
- the marker emitting the fluorescent light can be identified by the wavelength of the light.
- the fluorescence peak of fluorescein is about 530 nm while that of a typical rhodamine dye is about 580 nm.
- the scanner creates an image file for,the data associated with each fluorescent marker, indicating the locations where the correspondingly labeled nucleic acid bound to the chip. Based upon an analysis of the fluorescence intensities and locations, it becomes possible to extract information such as the monomer sequence of DNA or RNA.
- pooling processing reduces variations across individual experiments because much of the test environment is common. Although pooling processing has been described as being used to improve the combined processing of reference and sample nucleic acid sequences, the process may also be used for two reference sequences, two sample sequences, or multiple sequences by utilizing multiple distinguishable markers.
- the present invention provides a method of comparative analysis and visualization of multiple experiments.
- the method allows the intensity ratio, reference, and statistical methods to be run on multiple datafiles simultaneously. This permits different experimental conditions, sample preparations, and analysis parameters to be compared in terms of their effects on sequence calling.
- the method also provides verification and editing functions, which are essential to reading sequences, as well as navigation and analysis tools.
- Sample sequence area 816 is where sample or unknown experimental sequences are displayed for comparison with the reference sequences.
- the sample sequence area is divided into a sample name subarea 824 and sample base subarea 826 .
- the sample name subarea is shown with filenames that contain the sample sequences.
- the filename extensions indicate the method used to call the sample sequence where “.cq#” denotes the intensity ratio method, “.rq#” denotes the reference method, and “.sq#” denotes the statistical method (# indicates the unit on the chip).
- the sample base subarea contains the bases of the sample sequences.
- the bases of the sample sequences are identified by the codes previously set forth which, for the most part, conform to the IUPAC standard.
- Window 802 also contains a message panel 828 .
- the base becomes highlighted and the pathname of the file containing the base is displayed in the message panel.
- the base's position in the nucleic acid sequence is also displayed in the message panel.
- the user is able to load files of experimental sequences that have been tiled and scanned on a chip. There is a chip wild-type associated with each experimental sequence. The chip wild-type associated with the first experimental sequence loaded is read and shown as the chip wild-type in reference sequence area 814 .
- the user is also able to load files of known nucleic acid sequences as reference sequences for comparison purposes. As before, these known reference sequences may or may not have associated probe intensity data.
- the user is able to save sequences that are selected on the screen into a project file that can be loaded in at a later time.
- the project file also contains any linkage of the sequences, where sequences are linked for comparison purposes. Individual sequences, both reference and sample, are selected by selecting the sequence filename with an input device in the reference or sample name subareas.
- pull down menu Edit 806 the user is able to link together sequences in the reference and sample sequence areas. After the user has selected one reference and one or more sample sequences, the sample sequences can be linked to the reference sequence by selecting an entry in the pull down menu. Once the sequences are linked, a link number 830 is displayed next to each of the linked sequences. Each group of linked sequences is associated with a unique link number, so the user can easily identify which sequences are linked together. Linking sequences permits the user to more easily compare the linked sequences. The user is also able to remove and display links in this menu.
- the user is able to display intensity graphs for selected bases. Once a base is selected in the reference or-sample base subareas, the user may request an intensity graph showing the hybridized probe intensities of the selected base and a delineated neighborhood of bases near the selected base. Intensity graphs may be displayed for one or multiple selected bases. The user is also able to prepare comment files and reports in this menu.
- FIG. 9 illustrates an intensity graph window for a selected base at position 120.
- the filename containing the sequence data is displayed at 904 .
- the graph shows the intensities for each of the hybridized probes associated with a base.
- Each grouping of four vertical bars on the graph, which are labeled as “a”, “c”, “g”, and “t” on line 906 shows the background subtracted intensities of probes having the indicated substitution base.
- the called bases are shown in red.
- the wild-type base is shown at line 908
- the called base is shown at line 910
- the base position is shown at line 912 .
- the base selected is at position 120 as shown by arrow 914 .
- the wild-type base at this position is T; however, the called base is M which means the base is either A or C (amino).
- the user is able to use intensity graphs to visually compare the intensities of each of the possible calls.
- the user is able to compare the sequences of references and samples. At least four comparisons are available to the user, including the following: sample sequences to the chip wild-type sequence, sample sequences to any reference sequences, sample sequences to any linked reference sequences, and reference sequences to the chip wild-type sequence. For example, after the user has linked a reference and sample sequence, the user can compare the bases in the linked sequences. Bases in the sample sequence that are different from the reference sequence will then be indicated on the display device to the user (e.g., base is shown in a different color). In another example, the user is able to perform a comparison that will help identify sample sequences.
- each base in-the sample sequence that does not match the wild-type sequence is checked to see if it matches one of the linked reference sequences.
- the bases that match a linked reference sequence will then be indicated on the display device to the user. The user may then more easily identify the sample sequence as being one of the reference sequences.
- FIG. 11 illustrates the intensity ratio method correctly calling a mutation in solutions with varying concentrations.
- a window 1102 is shown with a chip wild-type 1104 and a mutant sequence 1106 .
- the mutant sequence differs from the chip wild-type at the position indicated by the rectangular box 1108 .
- the chip wild-type and mutant sequences are a region of HIV Pol Gene spanning mutations occurring in AZT drug therapy.
- sample sequences There are seven sample sequences that are called using the intensity ratio method.
- the sample sequences are actually solutions of different proportions of the chip wild-type sequence and the mutant sequence.
- sample solutions 1110 , 1112 , 1114 , 1116 , 1118 , 1120 , and 1122 The solutions are 15-mer tilings across the chip wild-type with increased percentages of the mutant sequence from 0 to 100% by weight.
- the following shows the proportions of the sample solutions: Sample Solution Chip Wild-Type:Mutant 1110 100:0 1112 90:10 1114 75:25 1116 50:50 1118 25:75 1120 10:90 1122 0:100
- sample solution 1114 contains 75% chip wild-type sequence and 25% mutant sequence.
- the intensity ratio method correctly calls sample solution 1110 as having a base A as in the chip-wild type sequence. This is correct because sample solution 1110 is 100% chip wild-type sequence.
- the intensity ratio method also calls sample solution 1112 as having a base A because the sample solution is 90% chip wild-type sequence.
- the intensity ratio method calls the identified base in sample solutions 1114 and 1116 as being an R, which is an ambiguity IUPAC code denoting A or G (purine). This also a correct base call because the sample solutions have from 75% to 50% chip-wild type sequence and from 25% to 50% mutation sequence. Thus, the intensity ratio method correctly calls the base in this transition state.
- Sample solutions 1118 , 1120 , and 1122 are called by the intensity ratio method as having a mutation base G at the specified location. This is a correct base call because the sample solutions primarily consist of the mutation sequence (75%, 90%, and 100% respectively). Again, the intensity ratio method correctly called the bases.
- FIG. 12 illustrates the reference method correctly calling a mutant base where the intensity ratio method incorrectly called the mutant base.
- Window 1206 shows the sample sequence called using the reference method.
- the reference method correctly calls the specified base as being base A.
- the reference method is preferable to the intensity ratio method because it compares probe intensities of a sample sequence to probe intensities of a reference sequence.
- the intensity ratio method was used in sequence analysis of various polymorphic HIV-1 clones using a protease chip. Single stranded DNA of a 382 nt region was used with 4 different clones (HXB2, SF2, NY5, pPol4mut18). Results were compared to results from an ABI sequencer. The results are illustrated below: ABI Protease Chip Sense Antisense Sense Antisense No call 0 4 9 4 Ambiguous 6 14 17 8 Wrong call 2 3 3 1 TOTAL 8 21 29 13 SUMMARY
- FIG. 13 illustrates the output of the ViewSeqTM program with four pretreatment samples and four posttreatment samples. Note the mutation at position 207 where a mutation has arisen. Even adjacent two additional mutations (gt), the “a” mutation has been properly detected.
Abstract
A computer system for analyzing nucleic acid sequences is provided. The computer system is used to perform multiple methods for determining unknown bases by analyzing the fluorescence intensities of hybridized nucleic acid probes. The results of individual experiments are improved by processing nucleic acid sequences together. Comparative analysis of multiple experiments is also provided by displaying reference sequences in one area and sample sequences in another area on a display device.
Description
- The present invention is a continuation of U.S. Ser. No. 09/049,805, filed Mar. 27, 1998, which is a Continuation of U.S. Ser. No. 08/327,525, filed Oct. 21, 2004, now U.S. Pat. No. 5,795,716, all of which are each incorporated herein by reference in their entireties.
- Portions of the material in this specification arose in the course of or under contract nos. 92ER81275 (SBIR) between Affymetrix, Inc. and the Department of Energy and/or H600813-1, -2 between Affymetrix, Inc. and the National Institutes of Health.
- A portion of the disclosure of this patent document contain material which is subject to copyright protection. The copyright owner has no objection to the xeroxographic reproduction by anyone of the patent document or the patent disclosure in exactly the form it appears in the Patent and Trademark office patent file or records, but otherwise reserves all copyright rights whatsoever.
- The present invention relates to the field of computer systems. More specifically, the present invention relates to computer system for visualizing biological sequences, as well as for evaluating and comparing biological sequences.
- Devices and computer systems for forming and using arrays of materials on a substrate are known. For example,
PCT application WO 92/10588, incorporated herein by reference for all purposes, describes techniques for sequencing or sequence checking nucleic acids and other materials. Arrays for performing these operations may be formed in arrays according to the methods of, for example, the pioneering techniques disclosed in U.S. Pat. No. 5,143,854 and U.S. patent application Ser. No. 08/249,188, both incorporated herein by reference for all purposes. - According to one aspect of the techniques described therein, an array of nucleic acid probes is fabricated at known locations on a chip or substrate. A fluorescently labeled nucleic acid is then brought into contact with the chip and a scanner generates an image file indicating the locations where the labeled nucleic acids bound to the chip. Based upon the identities of the probes at these locations, it becomes possible to extract information such as the monomer sequence of DNA or RNA. Such systems have been used to form, for example, arrays of DNA that may be used to study and detect mutations relevant to cystic fibrosis, the P53 gene (relevant to certain cancers), HIV, and other genetic characteristics.
- Improved computer systems and methods are needed to evaluate, analyze, and process the vast amount of information now used and made available by these pioneering technologies.
- An improved computer-aided system for visualizing and determining the sequence of nucleic acids is disclosed. The computer system provides, among other things, improved methods of analyzing fluorescent image files of a chip containing hybridized nucleic acid probes in order to call bases in sample nucleic acid sequences.
- According to one aspect of the invention, a computer system is used to identify an unknown base in a sample nucleic acid sequence by the steps of:
-
- inputting multiple probe intensities, each of the probe intensities being associated with a probe;
- the computer system comparing the multiple probe intensities where each of the probe intensities is substantially proportional to a probe hybridizing with at least one sequence; and
calling the unknown base according to the comparison of the multiple probe intensities.
- According to one specific aspect of the invention, a higher probe intensity is compared to a lower probe intensity to call the unknown base. According to another specific aspect of the invention, probe intensities of a sample sequence are compared to probe intensities of a reference sequence. According to yet another specific aspect of the invention, probe intensities of a sample sequence are compared to statistics about probe intensities of a reference sequence from multiple experiments.
- According to another aspect of the invention, a method is disclosed of processing reference and sample nucleic acid sequences to reduce the variations between the experiments by the steps of:
-
- providing a plurality of nucleic acid probes;
- labeling the reference nucleic acid sequence with a first marker;
- labeling the sample nucleic acid sequence with a second marker; and
hybridizing the labeled reference and sample nucleic acid sequences at the same time.
- According to yet another aspect of the invention, a computer system is used for comparative analysis and visualization of multiple sequences by the steps of:
-
- displaying at least one reference sequence in a first area on a display device; and
- displaying at least one sample sequence in a second area on said display device;
whereby a user is capable of visually comparing the multiple sequences.
- A further understanding of the nature and advantages of the inventions herein may be realized by reference to the remaining portions of the specification and the attached drawings.
-
FIG. 1 illustrates an overall system for forming and analyzing arrays of biological materials such as DNA or RNA; -
FIG. 2A is an illustration of the software for the overall system;FIG. 2B illustrates the global layout of a chip formed in the overall system; andFIG. 2C illustrates conceptually the binding of probes on chips; -
FIG. 3 illustrates the high level flow of the intensity ratio method; -
FIG. 4A illustrates the high level flow of one implementation of the reference method andFIG. 4B shows an analysis table for use with the reference method; -
FIG. 5A illustrates the high level flow of another implementation of the reference method;FIG. 5B shows a data table for use with the reference method;FIG. 5C shows a graph of the normalized sample base intensities minus the normalized reference base intensities; andFIG. 5D shows other graphs of data in the data table; -
FIG. 6 illustrates the high level flow of the statistical method; -
FIG. 7 illustrates the pooling processing of a reference and sample nucleic acid sequence; -
FIG. 8 illustrates the main screen and the associated pull down menus for comparative analysis and visualization of multiple experiments; -
FIG. 9 illustrates an intensity graph window for a selected base; -
FIG. 10 illustrates multiple intensity graph windows for selected bases; -
FIG. 11 illustrates the intensity ratio method correctly calling a mutation in solutions with varying concentrations; -
FIG. 12 illustrates the reference method correctly calling a mutant base where the intensity ratio method incorrectly called the mutant base; and -
FIG. 13 illustrates the output of the ViewSeq™ program with four pretreatment samples and four posttreatment samples. -
CONTENTS I. General II. Intensity Ratio Method III. Reference Method IV. Statistical Method V. Pooling Processing VI. Comparative Analysis VII. Examples VIII. Appendices - The present invention provides methods of analyzing hybridization intensity files for a chip containing hybridized nucleic acid probes. In a representative embodiment, the files represent fluorescence data from a biological array, but the files may also represent other data such as radioactive intensity data. For purposes of illustration, the present invention is described as being part of a computer system that designs a chip mask, synthesizes the probes on the chip, labels the nucleic acids, and scans the hybridized nucleic acid probes. Such a system is fully described in U.S. patent application Ser. No. 08/249,188 which has been incorporated by reference for all purposes. However, the present invention may be used separately from the overall system for analyzing data generated by such systems.
-
FIG. 1 illustrates a computerized system for forming and analyzing arrays of biological materials such as RNA or DNA. Acomputer 100 is used to design arrays of biological polymers such as RNA or DNA. Thecomputer 100 may be, for example, an appropriately programmed Sun Workstation or personal computer or workstation, such as an IBM PC equivalent, including appropriate memory and a CPU. Thecomputer system 100 obtains inputs from a user regarding characteristics of a gene of interest, and other inputs regarding the desired features of the array. Optionally, the computer system may obtain information regarding a specific genetic sequence of interest from an external orinternal database 102 such as GenBank. The output of thecomputer system 100 is a set of chip design computer files 104 in the form of, for example, a switch matrix, as described in PCT application WO 92/10092, and other associated computer files. - The chip design files are provided to a
system 106 that designs the lithographic masks used in the fabrication of arrays of molecules such as DNA. The system orprocess 106 may include the hardware necessary to manufacturemasks 110 and also the necessary computer hardware andsoftware 108 necessary to lay the mask patterns out on the mask in an efficient manner. As with the other features inFIG. 1 , such equipment may or may not be located at the same physical site, but is shown together for ease of illustration inFIG. 1 . Thesystem 106 generatesmasks 110 or other synthesis patterns such as chrome-on-glass masks for use in the fabrication of polymer arrays. - The
masks 110, as well as selected information relating to the design of the chips fromsystem 100, are used in asynthesis system 112.Synthesis system 112 includes the necessary hardware and software used to fabricate arrays of polymers on a substrate orchip 114. For example,synthesizer 112 includes alight source 116 and achemical flow cell 118 on which the substrate orchip 114 is placed.Mask 110 is placed between the light source and the substrate/chip, and the two are translated relative to each other at appropriate times for deprotection of selected regions of the chip. Selected chemical reagents are directed throughflow cell 118 for coupling to deprotected regions, as well as for washing and other operations. All operations are preferably directed by an appropriately programmedcomputer 119, which may or may not be the same computer as the computer(s) used in mask design and mask making. - The substrates fabricated by
synthesis system 112 are optionally diced into smaller chips and exposed to marked receptors. The receptors may or may not be complementary to one or more of the molecules on the substrate. The receptors are marked with a label such as a fluorescein label (indicated by an asterisk inFIG. 1 ) and placed inscanning system 120.Scanning system 120 again operates under the direction of an appropriately programmeddigital computer 122, which also may or may not be the same computer as the computers used in synthesis, mask making, and mask design. Thescanner 120 includes adetection device 124 such as a confocal microscope or CCD (charge-coupled device) that is used to detect the location where labeled receptor (*) has bound to the substrate. The output ofscanner 120 is an image file(s) 124 indicating, in the case of fluorescein labeled receptor, the fluorescence intensity (photon counts or other related measurements, such as voltage) as a function of position on the substrate. Since higher photon counts will be observed where the labeled receptor has bound more strongly to the array of polymers, and since the monomer sequence of the polymers on the substrate is known as a function of position, it becomes possible to determine the sequence(s) of polymer(s) on the substrate that are complementary to the receptor. - The
image file 124 is provided as input to ananalysis system 126 that incorporates the visualization and analysis methods of the present invention. Again, the analysis system may be any one of a wide variety of computer system(s), but in a preferred embodiment the analysis system is based on a Sun Workstation or equivalent. The present invention provides various methods of analyzing the chip design files and the image files, providingappropriate output 128. The present invention may further be used to identify specific mutations in a receptor such as DNA or RNA. -
FIG. 2A provides a simplified illustration of the overall software system used in the operation of one embodiment of the invention. As shown inFIG. 2A , the system first identifies the genetic sequence(s) or targets that would be of interest in a particular analysis atstep 202. The sequences of interest may, for example, be normal or mutant portions of a gene, genes that identify heredity, or provide forensic information. Sequence selection may be provided via manual input of text files or may be from external sources such as GenBank. Atstep 204 the system evaluates the gene to determine or assist the user in determining which probes would be desirable on the chip, and provides an appropriate “layout” on the chip for the probes. A wild-type probe is a probe that will ideally hybridize with the gene of interest and thus a wild-type gene (also called the chip wild-type) would ideally hybridize with all the wild-type probes on the chip. The layout implements desired characteristics such as arrangement on the chip that permits “reading” of genetic sequence and/or minimization of edge effects, ease of synthesis, and the like. -
FIG. 2B illustrates the global layout of a chip.Chip 114 is composed of multiple units where each unit may contain different tilings for the chip wild-type sequence.Unit 1 is shown in greater detail and shows that each unit is composed of multiple cells which are areas on the chip that may contain probes. Conceptually, each unit is composed of multiple sets of related cells. As used herein, the term cell refers to a region on a substrate that contains many copies of a molecule or molecules of interest. Each unit is composed of multiple cells that may be placed in rows and columns. In one embodiment, a set of five related cells includes the following: a wild-type cell 220, “mutation”cells 222, and a “blank”cell 224.Cell 220 contains a wild-type probe that is the complement of a portion of the wild-type sequence.Cells 222 contain “mutation” probes for the wild-type sequence. For example, if the wild-type probe is 3′-ACGT, theprobes 3′-ACAT, 3′-ACCT, 3′-ACGT, and 3′-ACTT may be the “mutation” probes.Cell 224 is the “blank” cell because it contains no probes (also called the “blank” probe). As the blank cell contains no probes, labeled receptors should not bind to the chip in this area. Thus, the blank cell provides an area that can be used to measure the background intensity. - In one embodiment, numerous tiling processes are available including sequence tiling, block tiling, and opt-tiling. Of course a wide range of layout strategies may be used according to the invention herein without departing from the scope of the invention. For example, the probes may be tiled on a substrate in an apparently random fashion where a computer system is utilized to keep track of the probe locations and correlate the data obtained from the substrate.
- Opt-tiling is the process of tiling additional probes for suspected mutations. As a simple example of opt-tiling, suppose the wild-type target sequence is 5′-ACGTATGCA-3′ and it is suspected that a mutant sequence has a possible T base mutation at the underlined base position. Suppose further that the chip will be synthesized with a “4×3” tiling strategy, meaning that probes of four monomers are used and that the monomers in
position 3, counting left to right, of the probe are varied. - In opt-tiling, extra probes are tiled for each suspected mutation. The extra probes are tiled as if the mutation base is a wild-type base. The following shows the probes that may be generated for this example:
TABLE 1 Probe Sequences (From 3′-end) 4 × 3 Opt-Tiling Wild TGCA GCAT CATA ATAC TACG A sub. TGAA GCAT CAAA ATAC TAAG C sub. TGCA GCCT CACA ATCC TACG G sub. TGGA GCGT CAGA ATGC TAGG T sub. TGTA GCTT CATA ATTC TATG Wild TGCA GCAA CAAA AAAC AACG A sub. TGAA GCAA CAAA AAAC AAAG C sub. TGCA GCCA CACA AACC AACG G sub. TGGA GCGA CAGA AAGC AAGG T sub. TGTA GCTA CATA AATC AATG
In the first “chip” above, the top row of the probes (along with one probe below each of the four wild-type probes) should bind to the target DNA sequence. However, if the target sequence has a T base mutation as suspected, the labeled mutant sequence will not bind that strongly to the probes in the columns aroundcolumn 3. For example, the mutant receptor that could bind with the probes incolumn 2 is 5′-CGTT which may not bind that strongly to any of the probes incolumn 2 because there are T bases at the ends of the receptor and probes (i.e., not complementary). This often results in a relatively dark scanned area around a mutation. - Opt-tiling provides the second “chip” above which treats the suspected mutation as the wild-type base. Thus, the
mutant receptor 5′-CGTT should bind strongly to the wild-type probe of column 2 (along with one probe below) and the mutation can be further detected. - Again referring to
FIG. 2A , atstep 206 the masks for the synthesis are designed. Atstep 208 the software utilizes the mask design and layout information to make the DNA or other polymer chips. Thissoftware 208 will control, among other things, relative translation of a substrate and the mask, the flow of desired reagents through a flow cell, the synthesis temperature of the flow cell, and other parameters. Atstep 210, another piece of software is used in scanning a chip thus synthesized and exposed to a labeled receptor. The software controls the scanning of the chip, and stores the data thus obtained in a file that may later be utilized to extract sequence information. - At step 212 a computer system according to the present invention utilizes the layout information and the fluorescence information to evaluate the hybridized nucleic acid probes on the chip. Among the important pieces of information obtained from DNA chips are the identification of mutant receptors and determination of genetic sequence of a particular receptor.
-
FIG. 2C illustrates the binding of a particular target DNA to an array of DNA probes 114. As shown in this simple example, the following probes are formed in the array (only one probe is shown for the wild-type probe):3′-AGAACGT AGACCGT AGAGCGT AGATCGT . . .
As shown, the set of probes differ by only one base so the probes are designed to determine the identity of the base at that location in the nucleic acid sequence. - When a fluorescein-labeled (or other marked) target with the
sequence 5′-TCTTGCA is exposed to the array, it is complementary only to theprobe 3′-AGAACGT, and fluorescein will be primarily found on the surface of the chip where 3′-AGAACGT is located. Thus, for each set of probes that differ by only one base, the image file will contain four fluorescence intensities, one for each probe. Each fluorescence intensity can therefore be associated with the base of each probe that is different from the other probes. Additionally, the image file will contain a “blank” cell which can be used as the fluorescence intensity of the background. By analyzing the five fluorescence intensities associated with a specific base location, it becomes possible to extract sequence information from such arrays using the methods of the invention disclosed herein. - The present invention calls bases by assigning the bases the following codes:
Code Group Meaning A A Adenine C C Cytosine G G Guanine T T(U) Thymine (Uracil) M A or C aMino R A or G puRine W A or T(U) Weak interaction (2 H bonds) Y C or T(U) pYrimidine S C or G Strong interaction (3 H bonds) K G or T(U) Keto V A, C or G not T(U) H A, C or T(U) not G D A, G or T(U) not C B C, G or T(U) not A N A, C, G, or T(U) Insufficient intensity to call X A, C, G, or T(U) Insufficient discrimination to call
Most of the codes conform to the IUPAC standard. However, code N has been redefined and code X has been added. - The intensity ratio method is a method of calling bases in a sample nucleic acid sequence. The intensity ratio method is most accurate when there is good discrimination between the fluorescence intensities of hybrid matches and hybrid mismatches. If there is insufficient discrimination, the intensity ratio method assigns a corresponding ambiguity code to the unknown base.
- For simplicity, the intensity ratio method will be described as being used to identify one unknown base in a sample nucleic acid sequence. In practice, the method is used to identify many or all the bases in a nucleic acid sequence.
- The unknown base will be identified by evaluation of up to four mutation probes and a “blank” cell, which is a location where a labeled receptor should not bind to the chip since no probe is present. For example, suppose a DNA sequence of interest or target sequence contains the
sequence 5′-AGAACCTGC-3′ with a possible mutation at the underlined base position. Suppose that 5-mer probes are to be synthesized for the target sequence. A representative wild-type probe of 5′-TTGGA is complementary to the region of the sequence around the possible mutation. The “mutation” probes will be the same as the wild-type probe except for a different base at the third position as follows: 3′-TTAGA, 3′-TTCGA, 3′-TTGGA, and 3′-TTTGA. - If the fluorescently marked sample sequence is exposed to the above four mutation probes, the intensity should be highest for the probe that binds most strongly to the sample sequence. Therefore, if the
probe 3′-TTTGA shows the highest intensity, the unknown base in the sample will generally be called an A mutation because the probes are complementary to the sample sequence. - The mutation probes are identical to the wild-type probes except that they each contain one of the four A, C, G, or T “mutations” for the unknown base. Although one of the “mutation” probes will optimally be identical to the wild-type probe, such redundant probes are intentionally synthesized for quality control and design consistency.
- The identity of the unknown base is preferably determined by evaluating the relative fluorescence intensities of up to four of the mutation probes, and the “blank” cell. Because each mutation probe is identifiable by the mutation base, a mutation probe's intensity will be referred to the “base intensity” of the mutation base.
- As a simple example of the intensity ratio method, suppose a gene of interest (target) is an HIV protease gene with the
sequence 5′-ATGTGGACAGTTGTA-3′. Suppose further that a sample sequence is suspected to have the same sequence as the target sequence except for a mutation of base C to base T at the underlined base position. Although hundreds of probes may be synthesized on the chip, the complementary mutation probes synthesized to detect a mutation in the sample sequence at the suspected mutation position may be as follows:3′- TATC 3′- TCTC 3′-TGTC (wild-type) 3′-TTTC
Themutation probe 3′-TGTC is also the wild-type probe as it should bind most strongly with the target sequence. - After the sample sequence is labeled, hybridized on the chip, and scanned, suppose the following fluorescence intensities were obtained:
3′-TATC → 45 3′- TCTC → 8 3′-TGTC → 32 3′- TTTC → 12
where the intensity is measured by the photon count detected by the scanner. The “blank” cell had a fluorescence intensity of 2. The photon counts in the examples herein are representative (not actual data) and provided for illustration purposes. In practice, the actual photon counts will vary greatly depending on the experiment parameters and the scanner utilized. - Although each fluorescence intensity is from a probe, the probes may be characterized by their unique mutation base so the bases may be said to have the following intensities:
A −> 45 C −> 8 G −> 32 T −> 12
Thus, base A will be described as having an intensity of 45, which corresponds to the intensity of the mutation probe with the mutation base A. - Initially, each mutation base intensity is reduced by the background or “blank” cell intensity. This is done as follows:
A −> 45 − 2 = 43 C −> 8 − 2 = 6 G −> 32 − 2 = 30 T −> 12 − 2 = 10 - Then, the base intensities are sorted by intensity. The above bases would be sorted as follows:
A −> 43 G −> 30 T −> 10 C −> 6
Next, the highest intensity base is compared to the second highest intensity base. Thus, the ratio of the intensity of base A to the intensity of base G is calculated as follows: A:G=43/30=1.4. The ratio A:G is then compared to a predetermined ratio cutoff which is a number that specifies the ratio required to identify the unknown base. For example, if the ratio cutoff is 1.2, the ratio A:G is greater than the ratio cutoff (1.4>1.2) and the unknown base is called by the mutation probe containing the mutation A. As probes are complementary to the sample sequence, the sample sequence is called as having a mutation T, resulting in a called sample sequence of 5′-ATGTGGATAGTTGTA-3′. - As another example, suppose everything else is the same as in the previous example except that the sorted background adjusted intensities were as follows:
C −> 42 A −> 40 G −> 10 T −> 8
The ratio of the highest intensity base to the second highest intensity base (C:A) is 1.05. Because this ratio is not greater than the ratio cutoff of 1.2, the unknown base will be called as being ambiguously one of two or more bases as follows. - The second highest intensity base is then compared to the third highest base. The ratio of A:G is 4. The ratio of A:G is then compared to the ratio cutoff of 1.2. As the ratio A:G is greater than the ratio cutoff (4>1.2), the unknown base is called by the mutation probes containing the mutations C or A. As probes are complementary to the sample sequence, the sample sequence is called as having either a mutation G or T, resulting in a sample sequence of 5′-ATGTGGAKAGTTGTA-3′ where K is the IUPAC code for G or T(U).
- The ratio cutoff in the previous examples was equal to 1.2. However, the ratio cutoff will generally need to be adjusted to produce optimal results for the specific chip design and wild-type target. Also, although the ratio cutoff used has been the same for each ratio comparison, the ratio cutoff may vary depending on whether the ratio comparisons involve the highest, second highest, third highest, etc. intensity base.
-
FIG. 3 illustrates the high level flow of the intensity ratio method. Atstep 302 the four base intensities are adjusted by subtracting the background or “blank” cell intensity from each base intensity. Preferably, if a base intensity is then less than or equal to zero, the base intensity is set equal to a small positive number to prevent division by zero or negative numbers in future calculations. - At
step 304 the base intensities are sorted by intensity. Each base is then associated with a number from 1 to 4. The base with the highest intensity is 1, second highest 2, third highest 3, and fourth highest 4. Thus, the intensity ofbase 1≧base 2≧base 3≧base 4. - At
step 306 the highest intensity base (base 1) is checked to see if it has sufficient intensity to call the unknown base. The intensity is checked by determining if the intensity ofbase 1 is greater than a predetermined background difference cutoff. The background difference cutoff is a number that specifies the intensity a base intensity must be over the background intensity in order to correctly call the unknown base. Thus, the background adjusted base intensity must be greater than the background difference cutoff or the unknown is not callable. - If the intensity of
base 1 is not greater than the background difference cutoff, the unknown base is assigned the code N (insufficient intensity) as shown at step 308. Otherwise, the ratio of the intensity ofbase 1 tobase 2 is calculated as shown atstep 310. - At
step 312 the ratio of intensity of base 1:2 is compared to the ratio cutoff. If the ratio 1:2 is greater than the ratio cutoff, the unknown base is called as the complement of the highest intensity base (base 1) as shown atstep 314. Otherwise, the ratio of the intensity ofbase 2 tobase 3 is calculated as shown atstep 316. - At
step 318 the ratio of intensity of base 2:3 is compared to the ratio cutoff. If the ratio 2:3 is greater than the ratio cutoff, the unknown base is called as being an ambiguity code specifying the complements of the highest or second highest intensity bases (base 1 or 2) as shown atstep 320. Otherwise, the ratio of the intensity ofbase 3 tobase 4 is calculated as shown atstep 322. - At
step 324 the ratio of intensity of base 3:4 is compared to the ratio cutoff. If the ratio 3:4 is greater than the ratio cutoff, the unknown base is called as being an ambiguity code specifying the complements of the highest, second highest, or third highest bases (base step 326. Otherwise, the unknown base is assigned the code X (insufficient discrimination) as shown atstep 328. - The advantage of the intensity ratio method is that it is very accurate when there is good discrimination between the fluorescence intensities of hybrid matches and hybrid mismatches. However, if the base corresponding to a correct hybrid gives a lower intensity than a mismatch (e.g., as a result of cross-hybridization), incorrect identification of the base will result. For this reason, however, the method is useful for comparative assessment of hybridization quality and as an indicator of sequence-specific problem spots. For example, the intensity ratio method has been used to determine that ambiguities and miscalls tend to be very different from sequence to sequence, and reflect predominantly the composition and repetitiveness of the sequence. It has also been used to assess improvements obtained by varying hybridization conditions, sample preparation, and post-hybridization treatments (e.g., RNase treatment).
- The reference method is a method of calling bases in a sample nucleic acid sequence. The reference method depends very little on discrimination between the fluorescence intensities of hybrid matches and hybrid mismatches, and therefore is much less sensitive to cross-hybridization. The method compares the probe intensities of a reference sequence to the probe intensities of a sample sequence. Any significant changes are flagged as possible mutations. There are two implementations of the reference method disclosed herein.
- For simplicity, the reference method will be described as being used to identify one unknown base in a sample nucleic acid sequence. In practice, the method is used to identify many or all the bases in a nucleic acid sequence.
- The unknown base will be called by comparing the probe intensities of a reference sequence to the probe intensities of a sample sequence. Preferably, the probe intensities of the reference sequence and the sample sequence are from chips having the same chip wild-type. However, the reference sequence may or may not be exactly the same as the chip wild-type as it may have mutations.
- The bases at the same position in the reference and sample sequences will each be associated with up to four mutation probes and a “blank” cell. The unknown base in the sample sequence is called by comparing probe intensities of the sample sequence to probe intensities of the reference sequence. For example, suppose the chip wild-type contains the
sequence 5′-AGACCTTGC-3′ and it is suspected that the sample has a possible mutation at the underlined base position, which is the unknown base that will be called by the reference method. The “mutation” probes for the sample sequence may be as follows: 3′-GAAA, 3′-GCAA, 3′-GGAA, and 3′-GTAA, where 3′-GGAA is the wild-type probe. - Suppose further that a reference sequence, which differs from the chip wild-type by one base mutation, has the
sequence 5′-AGACATTGC-3′ where the mutation base is underlined. The “mutation” probes for the reference sequence may be as follows: 3′-TGAAA, 3′-TGCAA, 3′-TGGAA, and 3′-TGTAA, where 3′-TGTAA is the reference wild-type probe since the reference sequence is known. Although generally the sample and reference sequences were tiled with the same chip wild-type, this is not required, and the tiling methods do not have to be identical as shown in the example. Thus, the unknown base will be called by comparing the “mutation” probes of the sample sequence to the “mutation” probes of the reference sequence. As before, because each mutation probe is identifiable by the mutation base, the mutation probes' intensities will be referred to the “base intensities” of their respective mutation bases. - As a simple example of one implementation of the reference method, suppose a gene of interest (target) has the
sequence 5′-AAAACTGAAAA-3′. Suppose a reference sequence has thesequence 5′-AAAACCGAAAA-3′, which differs from the target sequence by the underlined base. The reference sequence is marked and exposed to probes on a chip with the target sequence being the chip wild-type. Suppose further that a sample sequence is suspected to have the same sequence as the target sequence except for a mutation at the underlined base position in 5′-AAAACTGAAAA-3′. The sample sequence is also marked and exposed to probes on a chip with the target sequence being the chip wild-type. After hybridization and scanning, the following probe intensities (not actual data) were found for the respective complementary probes:Reference Sample 3′- TGAC → 12 3′- GACT → 11 3′- TGCC → 9 3′- GCCT → 30 3′-TGGC → 80 3′-GGCT → 60 3′- TGTC → 15 3′- GTCT → 6 - Although each fluorescence intensity is from a probe, the probes may be identified by their unique mutation base so the bases may be said to have the following intensities:
Reference Sample A −> 12 A −> 11 C −> 9 C −> 30 G −> 80 G −> 60 T −> 15 T −> 6
Thus, base A of the reference sequence will be described as having an intensity of 12, which corresponds to the intensity of the mutation probe with the mutation base A. The reference method will now be described as calling the unknown base in the sample sequence by using these intensities. -
FIG. 4A illustrates the high level flow of one implementation of the reference method. For illustration purposes, the reference method is described as filling in the columns (identified by the numbers along the bottom) of the analysis table shown inFIG. 4B . However, the generation of an analysis table is not necessary to practice the method. The analysis table is shown to aid the reader in understanding the method. - At
step 402 the four base intensities of the reference and sample sequences are adjusted by subtracting the background or “blank” cell intensity from each base intensity. Each set of “mutation” probes has an associated “blank” cell. Suppose that the reference “blank” cell intensity is 1 and the sample “blank” cell intensity is 2. The base intensities are then background subtracted as follows:Reference Sample A −> 12 − 1 = 11 A −> 11 − 2 = 9 C −> 9 − 1 = 8 C −> 30 − 2 = 28 G −> 80 − 1 = 79 G −> 60 − 2 = 58 T −> 15 − 1 = 14 T −> 6 − 2 = 4
Preferably, if a base intensity is then less than or equal to zero, the base intensity is set equal to a small positive number to prevent division by zero or negative numbers in future calculations. - For identification, the position of the bases of interest in the reference and sample sequences is placed in
column 1 of the analysis table. Also, since the reference sequence is a known sequence, the base at this position is known and is referred to as the reference wild-type. The reference wild-type is placed incolumn 2 of the analysis table, which is C for this example. - At
step 404 the base intensity associated with the reference wild-type (column 2 of the analysis table) is checked to see if it has sufficient intensity to call the unknown base. In this example, the reference wild-type is C. However, the base intensity associated with the wild-type is the G base intensity, which is 79 in this example. This is because the base intensities actually represent the complementary “mutation” probes. The G base intensity is checked by determining if its intensity is greater than a predetermined background difference cutoff. The background difference cutoff is a number that specifies the intensity the base intensities must be above the background intensity in order to correctly call the unknown base. Thus, the base intensity associated with the reference wild-type must be greater than the background difference cutoff or the unknown base is not callable. - If the background difference cutoff is 5, the base intensity associated with the reference wild-type has sufficient intensity (79>5) so a P (pass) is placed in
column 3 of the analysis table as shown atstep 406. Otherwise, atstep 407 an F (fail) is placed incolumn 3 of the analysis table. - At
step 408 the ratio of the base intensity associated with the reference wild-type to each of the possible bases are calculated. The ratio of the base intensity associated with the reference wild-type to itself will be 1 and the other ratios will usually be greater than 1. The base intensity associated with the reference wild-type is G so the following ratios are calculated:G:A −> 79/11 = 7.2 G:C −> 79/8 = 9.9 G:G −> 79/79 = 1.0 G:T −> 79/14 = 5.6
These ratios are placed incolumns 4 through 7 of the analysis table, respectively. - At
step 410 the highest base intensity associated with the sample sequence is checked to see if it has sufficient intensity to call the unknown base. The highest base intensity is checked by determining if the intensity is greater than the background difference cutoff. Thus, the highest base intensity must be greater than the background difference cutoff or the unknown base is not callable. - Again, if the background difference cutoff is 5, the highest base intensity, which is G in this example, has sufficient intensity (58>5) so a P (pass) is placed in
column 8 of the analysis table as shown atstep 412. Otherwise, atstep 413 an F (fail) is placed incolumn 8 of the analysis table. - At
step 414 the ratios of the highest base intensity of the sample to each of the possible bases are calculated. The ratio of the highest base intensity to itself will be 1 and the other ratios will usually be greater than 1. Thus, the highest base intensity is G so the following ratios are calculated:G:A −> 58/9 = 6.4 G:C −> 58/28 = 2.3 G:G −> 58/58 = 1.0 G:T −> 58/4 = 14.5
These ratios are placed incolumns 9 through 12 of the analysis table, respectively. - At
step 416 if both the reference and sample sequence probes failed to have sufficient intensity to call the unknown base, meaning there is an ‘F’ incolumns step 418. An ‘N’ is placed incolumn 17 of the analysis table. Additionally, a confidence code of 9 is placed incolumn 18 of the analysis table where the confidence codes have the following meanings:Code Meaning 0 Probable reference wild- type 1 Probable mutation 2 Reference sufficient intensity, insufficient intensity in sample suggests possible mutation 3 Borderline differences, unknown base ambiguous 4 Sample sufficient intensity, insufficient intensity in reference to allow comparison 5-8 Currently unassigned 9 Insufficient intensity in reference and sample, no interpretation possible
The confidence codes are useful for indicating to the user the resulting analysis of the reference method. - At
step 420 if only the reference sequence probes failed to have sufficient intensity to call the unknown base, meaning there is an ‘F’ incolumn 3 and a ‘P’ incolumn 8 of the analysis table, the unknown base is assigned the code N (insufficient intensity) as shown atstep 422. An ‘N’ is placed incolumn 17 and a confidence code of 4 is placed incolumn 18 of the analysis table. - At
step 424 if only the sample sequence probes failed to have sufficient intensity to call the unknown base, meaning there is a ‘P’ incolumn 3 and a ‘F’ incolumn 8 of the analysis table, the unknown base is assigned the code N (insufficient intensity) as shown atstep 426. An ‘N’ is placed incolumn 17 and a confidence code of 2 is placed incolumn 18 of the analysis table. - In this example, both the reference and sample sequence probes have sufficient intensity to call the unknown base. At
step 428 the ratios of the reference ratios to the sample ratios for each base type are calculated. Thus, the ratio A:A (column 4 to column 9) is placed incolumn 13 of the analysis table. The ratio C:C (column 5 to column 10) is placed incolumn 14 of the analysis table. The ratio G:G (column 6 to column 11) is placed incolumn 15 of the analysis table. Lastly, the ratio T:T (column 7 to column 12) is placed incolumn 16 of the analysis table. These ratios are calculated as follows:A:A −> 7.2/6.4 = 1.1 C:C −> 9.9/2.3 = 4.3 G:G −> 1.0/1.0 = 1.0 T:T −> 5.6/14.5 = 0.4
The unknown base is called by comparing these ratios of ratios to two predetermined values as follows. - At
step 430 if all the ratios of ratios (columns 13 to 16 of the analysis table) are less than a predetermined lower ratio cutoff, the unknown base is assigned the code of the reference wild-type as shown atstep 432. Thus, the code for the reference wild-type (as shown in column 2) would be placed incolumn 17 and a confidence code of 0 is placed incolumn 18 of the analysis table. - At
step 434 if all the ratios of ratios are less than a predetermined upper ratio cutoff, the unknown base is assigned an ambiguity code that indicates the unknown base may be any one of the bases that has a complementary ratio of ratios greater than the lower ratio cutoff and less than the upper ratio cutoff as shown atstep 436. Thus, if the ratio of ratios for A:A, C:C and G:G are all greater than the lower ratio cutoff and less than the upper ratio cutoff, the unknown base would be assigned the code B (meaning “not A”). This is because the ratios of ratios are complementary to their respective base as follows:A:A −> T C:C −> G G:G −> C
so the unknown base is called as being either C, G, or T, which is identified by the IUPAC code B. This ambiguity code is placed incolumn 17 and a confidence code of 3 would be placed incolumn 18 of the analysis table. - At
step 438 at least one of the ratios of ratios is greater than the upper ratio cutoff and the unknown base is called as the base complementary to the highest ratio of ratios. The code for the base complementary to the highest ratio of ratios would be placed incolumn 17 and a confidence code of 1 is placed incolumn 18 of the analysis table. - Assume for the purposes of this example that the lower ratio cutoff is 1.5 and the upper ratio cutoff is 3. Again, the ratios of ratios are as follows:
A:A −> 1.1 C:C −> 4.3 G:G −> 1.0 T:T −> 0.4
As all the ratios of ratios are not less than the upper ratio cutoff, the unknown base is called the base complementary to the highest ratio of ratios. The highest ratio of ratios is C:C, which has a complementary base G. Thus, the unknown base is called G which is placed incolumn 17 and a confidence code of 1 is placed incolumn 18 of the analysis table. - The example shows how the unknown base in the sample nucleic acid sequence was correctly called as base G. Although the complementary “mutation” probe associated with the base G (3′-GCCT) did not have the highest fluorescence intensity, the unknown base was called as base G because the associated “mutation” probe had the highest ratio increase over the other “mutation” probes.
-
FIG. 5A illustrates the high level flow of another implementation of the reference method. As in the previous implementation, this implementation also compares the probe intensities of a reference sequence to the probe intensities of a sample sequence. However, this implementation differs conceptually from the previous implementation in that neighboring probe intensities are also analyzed, resulting in more accurate base calling. - As a simple example of this implementation of the reference method, suppose a reference sequence has a sequence of 5′-AAACCCAATCCACATCA-3′ and a sample sequence has a sequence of 5′-AAACCCAGTCCACATCA-3′, where the mutant base is underlined. Thus, there is a mutation of A to G. Suppose further that the reference and sample sequences are tiled on chips with the reference sequence being the chip wild-type. This implementation of the reference method will be described as identifying this mutation base.
- For illustration purposes, this implementation of the reference method is described as filling in a data table shown in
FIG. 5B . Although the data table contains more data than is required for this implementation, the portions of the data table that are produced by steps inFIG. 5A are shown with the same reference numerals. The generation of a data table is not necessary, however, and is shown to aid the reader in understanding the method. The mutant base position is atposition 241 in the reference and sample sequences, which is shown in bold in the data table. - At
step 502 the base intensities of the reference and sample sequences are adjusted by subtracting the background or “blank” cell intensity from each base intensity. Preferably, if a base intensity is then less than or equal to zero, the base intensity is set equal to a small positive umber to prevent division by zero or negative numbers. In the data table,data 502A is the background subtracted base intensities for the reference sequence anddata 502B is the background subtracted base intensities for the sample sequence (also called the “mutant” sequence in the data table). - At
step 504 the base intensity associated with the reference wild-type is checked to see if it has sufficient intensity to call the unknown base. In this example, the reference wild-type is base A atposition 241. The base intensity associated with the reference wild-type is identified by a lower case “a” in the left hand column. Thus, the base intensities in the data table are not identified by their complements and the reference wild-type at the mutation position has an intensity of 385. The reference wild-type intensity of 385 is checked by determining if its intensity is greater than a predetermined background difference cutoff. The background difference cutoff is a number that specifies the intensity the base intensities must be over the background intensity in order to correctly call the unknown base. Thus, the base intensity associated with the reference wild-type must be greater than the background difference cutoff or the unknown base is not callable. - If the base intensity associated with the reference wild-type is not greater than the background difference cutoff, the wild-type sequence would fail to have sufficient intensity as shown at
step 506. Otherwise, atstep 508 the wild-type sequence would pass by having sufficient intensity. - At
step 510 calculations are performed on the background subtracted base intensities of the reference sequence in order to “normalize” the intensities. Each position in the reference sequence has four background subtracted base intensities associated with it. The ratio of the intensity of each base to the sum of the intensities of the possible bases (all four) is calculated, resulting in four ratios, one for each base as shown in the data table. Thus, the following ratios would be calculated at each position in the reference sequence:
A ratio=A/(A+C+G+T)
C ratio=C/(A+C+G+T)
G ratio=G/(A+C+G+T)
T ratio=T/(A+C+G+T)
Atposition 241, A ratio would be the wild-type ratio. These ratios are generally calculated in order to “normalize” the intensity data as the photon counts may vary widely from experiment to experiment. Thus, the ratios provide a way of reconciling the intensity variations between experiments. Preferably, if the photon counts do not vary widely from experiment to experiment, the probe intensities do not need to be “normalized.” - At
step 512 the highest base intensity associated with the sample sequence is checked to see if it has sufficient intensity to call the unknown base. The intensity is checked by determining if the highest intensity sample base is greater than the background difference cutoff. If the intensity is not greater than the background difference cutoff, the sample sequence fails to have sufficient intensity as shown atstep 514. Otherwise, atstep 516 the sample sequence passes by having sufficient intensity. - At
step 518 calculations are performed on the background subtracted base intensities of the sample sequence in order to “normalize” the intensities. Each position in the sample sequence has four background subtracted base intensities associated with it. The ratio of the intensity of each base to the sum of the intensities of the possible bases (all four) is calculated, resulting in four ratios, one for each base as shown in the data table. - At
step 520 if either the reference or sample sequences failed to have sufficient intensity, the unknown base is assigned the code N (insufficient intensity) as shown atstep 522. - At
step 524 the normalized base intensities of the reference sequence are subtracted from the normalized base intensities of the sample sequence. Thus, at each position the following calculations are performed:
A Difference=Sample A Ratio−Reference A Ratio
C Difference=Sample C Ratio—Reference C Ratio
G Difference=Sample G Ratio—Reference G Ratio
T Difference=Sample T Ratio—Reference T Ratio
where the reference and sample ratios are calculated atsteps - At
step 526 each position is checked to see if there is a base difference greater than an upper difference cutoff and a base difference lower than a lower difference cutoff. For example,FIG. 5C shows a graph the normalized sample base intensities minus the normalized reference base intensities. Suppose that the upper difference cutoff is 0.15 and the lower difference cutoff is −0.15 as shown by the horizontal lines inFIG. 5C . At the mutation position (labeled with a reference 0), the G difference is 0.28 which is greater than 0.15, the upper difference cutoff. Similarly, the A difference is −0.32 which is less than −0.15, the lower difference cutoff. As there is a base difference above the upper difference cutoff and a base difference below the lower difference cutoff, there may be mutation at this position. - If there is not a base difference above the upper difference cutoff and a base difference below the lower difference cutoff, the base at that position is assigned the code of the reference wild-type base as shown at
step 528. - At
step 530 the ratio of the highest background subtracted base intensity in the sample to the background subtracted reference wild-type base intensity is calculated. For example, at themutation position 241 in the data table, the highest background subtracted base intensity in the sample is 571 (base G). The background subtracted reference wild-type base intensity is 385 (base A). Thus, the ratio of 571:385 is calculated and results in 1.48 as shown in the data table. - At
step 532 these ratios are compared to a ratio at a neighboring position. The ratio for the nth position is subtracted from the ratio for the rth position, where r=n+1. For example, at themutation position 241 in the data table, the ratio at position 242 (which equals 1.02) is subtracted from the ratio at position 241 (which equals 1.48). It has been found that a mutant can be confidently detected by analyzing the difference of these neighboring ratios. -
FIG. 5D shows other graphs of data in the data table. Of particular importance is the graph identified as 532 because this is a graph of the calculations atstep 532. The pattern shown in a box ingraph 532 has been found to be characteristic of a mutation. Thus, if this pattern is detected, the base is called as the base (or bases) with a normalized difference greater than the upper difference cutoff as shown atstep 536. For example, the pattern was detected and atstep 526 it was shown that base G had a normalized difference of 0.28, which is greater than the upper difference cutoff of 0.15. Therefore, the base atposition 241 in the sample sequence is called a base G, which is a mutation from the reference sequence (A to G). - If the pattern is not detected at
step 534, the base at that position is assigned the code of the reference wild-type base as shown atstep 538. - This second implementation of the reference method is preferable in some instances as it takes inot account probe intensities of neighboring probes. Thus, the first implementation may not have detected the A to G mutation in this example.
- The advantage of the reference method is that the correct base can be called even in the presence of significant levels of cross-hybridization, as long as ratios of intensities are fairly consistent from experiment to experiment. In practice, the number of miscalls and ambiguities is significantly reduced, while the number of correct calls is actually increased, making the reference method very useful for identifying candidate mutations. The reference method has also been used to compare the reproducibility of experiments in terms of base calling.
- The statistical method is a method of calling bases in a sample nucleic acid sequence. The statistical method utilizes the statistical variation across experiments to call the bases. Therefore, the statistical method is good at calling bases if data from multiple experiments is available and the data is fairly consistent among the experiments. The method compares the probe intensities of a sample sequence to statistics of probe intensities of a reference sequence in multiple experiments.
- For simplicity, the statistical method will be described as being used to identify one unknown base in a sample nucleic acid sequence. In practice, the method is used to identify many or all the bases in a nucleic acid sequence.
- The unknown base will be called by comparing the probe intensities of a sample sequence to statistics on probe intensities of a reference sequence in multiple experiments. Generally, the probe intensities of the sample sequence and the reference sequence experiments are from chips having the same chip wild-type. However, the reference sequence may or may not be equal to the chip wild-type, as it may have mutations.
- A base at the same position in the reference and sample sequences will be associated with up to four mutation probes and a “blank” cell. As before, because each mutation probe is identifiable by the mutation base, the mutation probes' intensities will be referred to as the “base intensities” of their respective mutation bases.
- As a simple example of the statistical method, suppose a gene of interest (target) has the
sequence 5′-AAAACTGAAAA-3′. Suppose a reference sequence has thesequence 5′-AAAACCGAAAA-3′, which differs from the target sequence by the underlined base. Suppose further that a sample sequence is suspected to have the same sequence as the target sequence except for a T base mutation at the underlined base position in 5′-AAAACTGAAAA-3′. Suppose that in multiple experiments the reference sequence is marked and exposed to probes on a chip. Suppose further the sample sequence is also marked and exposed to probes on a chip. - The following are complementary “mutation” probes that could be used for a reference experiment and the sample sequence:
Reference Sample 3′- TGAC 3′- GACT 3′- TGCC 3′- GCCT 3′- TGGC 3′- GGCT 3′- TGTC 3′-GTCT - The “mutation” probes shown for the reference sequence may be from only one experiment, the other experiments may have different “mutation” probes, chip wild-types, tiling methods, and the like. Although each fluorescence intensity is from a probe, since the probes may be identified by their unique mutation bases, the probe intensities may be identified by their respective bases as follows:
Reference Sample 3′- TGAC → A 3′- GACT → A 3′- TGCC → C 3′- GCCT → C 3′- TGGC → G 3′- GGCT → G 3′- TGTC → T 3′-GTCT → T
Thus, base A of the reference sequence will be described as having an intensity which corresponds to the intensity of the mutation probe with the mutation base A. The statistical method will now be described as calling the unknown base in the sample sequence by using this example. -
FIG. 6 illustrates the high level flow of the statistical method. Atstep 602 the four base intensities associated with the sample sequence and each of the multiple reference experiments are adjusted by subtracting the background or “blank” cell intensity from each base intensity. Preferably, if a base intensity is then less than or equal to zero, the base intensity is set equal to a small positive number to prevent division by zero or negative numbers. - At
step 604 the intensities of the reference wild-type bases in the multiple experiments are checked to see if they all have sufficient intensity to call the unknown base. The intensities are checked by determining if the intensity of the reference wild-type base of an experiment is greater than a predetermined background difference cutoff. The wild-type probe shown earlier for the reference sequence is 3′-TGGC, and thus the G base intensity is the wild-type base intensity. These steps are analogous to steps in the other two methods described herein. - If the intensity of any one of the reference wild-type bases is not greater than the background difference cutoff, the wild-type experiments fail to have sufficient intensity as shown at
step 606. Otherwise, atstep 608 the wild-type experiments pass by having sufficient intensity. - At
step 610 calculations are performed on the background subtracted base intensities of each of the reference experiments in order to “normalize” the intensities. Each reference experiment has four background subtracted base intensities associated with it: one wild-type and three for the other possible bases. In this example, the G base intensity is the wild-type, the A, C, and T base intensities being the “other” intensities. The ratios of the intensity of each base to the sum of the intensities of the possible bases (all four) are calculated, giving one wild-type ratio and three “other” ratios. Thus, the following ratios would be calculated:
A ratio=A/(A+C+G+T)
C ratio=C/(A+C+G+T)
G ratio=G/(A+C+G+T)
T ratio=T/(A+C+G+T)
where G ratio is the wild-type ratio and A, C, and T ratios are the “other” ratios. These four ratios are calculated for each reference experiment. Thus if the number of reference experiments is n, there would be 4n ratios calculated. These ratios are generally calculated in order to “normalize” the intensity data as the photon counts may vary widely from experiment to experiment. However, if the probe intensities do not vary widely from experiment to experiment, the probe intensities do not need to be “normalized.” - At
step 612 statistics are prepared for the ratios calculated for each of the reference experiments. As stated before, each reference experiment will be associated with one wild-type ratio and three “other” ratios. The mean and standard deviation are calculated for all the wild-type ratios. The mean and standard deviation are also calculated for each of the other ratios, resulting in three other means and standard deviations for each of the bases that is not the wild-type base. Therefore, the following would be calculated:
Mean and standard deviation of A ratios
Mean and standard deviation of C ratios
Mean and standard deviation of G ratios
Mean and standard deviation of T ratios
where the mean and standard deviation of the G ratios are also known as the wild-type mean and the wild-type standard deviation, respectively. The mean and standard deviation of the A, C, and T means and standard deviations are also known collectively as the “other” means and standard deviations. - Suppose that the preceding calculations produced the following data:
A ratios −> mean = 0.16 std. dev. = 0.003 C ratios −> mean = 0.03 std. dev. = 0.002 G ratios −> mean = 0.71 std. dev. = 0.050 T ratios −> mean = 0.11 std. dev. = 0.004
In one embodiment, the steps up to and includingstep 612 are performed in a preprocessing stage for the multiple wild-type experiments. The results of the preprocessing stage are stored in a file so that the reference calculations do not have to be repeatedly calculated, which results in increased performance. Microfiche Appendices C and D contain the programming code to perform the preprocessing stage. - At
step 614 the highest base intensity associated with the sample sequence is checked to see if it has sufficient intensity to call the unknown base. The intensity is checked by determining if the highest intensity unknown base is greater than the background difference cutoff. If the intensity is not greater than the background difference cutoff, the sample sequence fails to have sufficient intensity as shown atstep 616. Otherwise, atstep 618 the sample sequence passes by having sufficient intensity. - At
step 620 calculations are performed on the four background subtracted intensities of the sample sequence. The ratio of the background subtracted intensity of each base to the sum of the background subtracted intensities of the possible bases (all four) is calculated, giving four ratios, one for each base. For consistency, the ratio associated with the reference wild-type base is called the wild-type ratio, with there being three “other” ratios. Thus, the following ratios would be calculated:
A ratio=A/(A+C+G+T)
C ratio=C/(A+C+G+T)
G ratio=G/(A+C+G+T)
T ratio=T/(A+C+G+T)
where ratio G is the wild-type ratio and ratios A, C, and T are the “other” ratios. - Suppose the background subtracted intensities associated with the sample are as follows:
A −> 310 C −> 50 G −> 26 T −> 100
Then, the corresponding ratios would be as follows:
A ratio=310/(310+50+26+100)=0.64
C ratio=50/(310+50+26+100)=0.10
G ratio=26/(310+50+26+100)=0.05
T ratio=100/(310+50+26+100)=0.21 - At
step 622 if either the reference experiments or the sample sequence failed to have sufficient intensity, the unknown base is assigned the code N (insufficient intensity) as shown atstep 624. - At
step 626 the wild-type and “other” ratios associated with the sample sequence are compared to statistical expressions. The statistical expressions include four predetermined standard deviation cutoffs, one associated with each base. Thus, there is a standard deviation cutoff for each of the bases A, C, G, and T. The standard deviation cutoffs allow the unknown base to be called with higher precision because each standard deviation cutoff can be set to a different value. Suppose the standard deviation cutoffs are set as follows:A standard deviation cutoff −> 4 C standard deviation cutoff −> 2 G standard deviation cutoff −> 8 T standard deviation cutoff −> 4
The wild-type base ratio associated with the sample is compared to a corresponding statistical expression:
WT ratio≧WT mean−(WT std. dev.*WT base std. dev. cutoff)
where the WT base std. dev. cutoff is the standard deviation cutoff for the wild-type base. As the wild-type base is G, the above comparison solves to the following:
0.05≧0.71−(0.050*8)
0.05≧0.31
which is not a true expression (0.05 is not greater than 0.31). - Each of the “other” ratios associated with the sample is compared to a corresponding statistical expression:
Other ratio>Other mean+(Other std. dev.*Other base std. dev. cutoff) - where the Other base std. dev. cutoff is the standard deviation cutoff for the particular “other” base. Thus, the above comparison solves to the following three expressions:
A −> 0.64 > 0.16 + (0.003 * 4) 0.64 > 0.17 C −> 0.10 > 0.03 + (0.002 * 2) 0.10 > 0.03 T −> 0.21 > 0.11 + (0.004 * 4) 0.21 > 0.13
which are all true expressions. - At
step 628 if only the wild-type ratio of the sample sequence was greater than the statistical expression, the unknown base is assigned the code of the reference wild-type base as shown atstep 630. - At
step 632 if one or more of the “other” ratios of the sample sequence were greater than their respective statistical expressions, the unknown base is assigned an ambiguity code that indicates the unknown base may be any one of the complements of these bases, including the reference wild-type. In this example, the “other” ratios for A, C, and T were all greater than their corresponding statistical expression. Thus, the unknown base would be called the complements of these bases, represented by the subset T, G, and A. Thus, the unknown base would be assigned the code D (meaning “not C”). - If none of the ratios are greater than their respective statistical expressions, the unknown base is assigned the code X (insufficient discrimination) as shown at
step 636. - The statistical method provides accurate base calling because it utilizes statistical data from multiple reference experiments to call the unknown base. The statistical method has also been used to implement confidence estimates and calling of mixed sequences.
- The present invention provides pooling processing which is a method of processing reference and sample nucleic acid sequences together to reduce variations across individual experiments. In the representative embodiment discussed herein, the reference and sample nucleic acid sequences are labeled with fluorescent markers emitting light at different wavelengths. However, the nucleic acids may be labeled with other types of markers including distinguishable radioactive markers.
- After the reference and sample nucleic acid sequences are labeled with different color fluorescent markers, the labeled reference and sample nucleic acid sequences are then combined and processed together. An apparatus for detecting targets labeled with different markers is provided in U.S. Application Ser. No. 08/195,889 and is hereby incorporated by reference for all purposes.
-
FIG. 7 illustrates the pooling processing of a reference and sample nucleic acid sequence. At step 702 a reference nucleic acid sequence is marked with a fluorescent dye, such as a fluorescein. At step 704 a sample nucleic acid sequence is marked with a dye that, upon excitation, emits light that of a different wavelength than the fluorescent dye of the reference sequence. For example, the sample nucleic acid sequence may be marked with rhodamine. - At
step 706 the labeled reference sequence and the labeled sample sequence are combined. After this step, processing continues in the same manner as for only one labeled sequence. Atstep 708 the sequences are fragmented. The fragmented nucleic acid sequences are then hybridized on a chip containing probes as shown atstep 710. - At step 712 a scanner generates image files that indicate the locations where the labeled nucleic acids bound to the chip. In general, the scanner generates an image file by focusing excitation light on the hybridized chip and detecting the fluorescent light that is emitted. The marker emitting the fluorescent light can be identified by the wavelength of the light. For example, the fluorescence peak of fluorescein is about 530 nm while that of a typical rhodamine dye is about 580 nm.
- The scanner creates an image file for,the data associated with each fluorescent marker, indicating the locations where the correspondingly labeled nucleic acid bound to the chip. Based upon an analysis of the fluorescence intensities and locations, it becomes possible to extract information such as the monomer sequence of DNA or RNA.
- Pooling processing reduces variations across individual experiments because much of the test environment is common. Although pooling processing has been described as being used to improve the combined processing of reference and sample nucleic acid sequences, the process may also be used for two reference sequences, two sample sequences, or multiple sequences by utilizing multiple distinguishable markers.
- The present invention provides a method of comparative analysis and visualization of multiple experiments. The method allows the intensity ratio, reference, and statistical methods to be run on multiple datafiles simultaneously. This permits different experimental conditions, sample preparations, and analysis parameters to be compared in terms of their effects on sequence calling. The method also provides verification and editing functions, which are essential to reading sequences, as well as navigation and analysis tools.
-
FIG. 8 illustrates the main screen and the associated pull down menus for comparative analysis and visualization of multiple experiments. The windows shown are from an appropriately programmed Sun Workstation. However, the comparative analysis software may also be implemented on or ported to a personal computer, including IBM PCs and compatibles, or other workstation environments. Awindow 802 is shown having pull down menus for the following functions:File 804,Edit 806,View 808,Highlight 810, andHelp 812. - The main section of the window is divided into a
reference sequence area 814 and asample sequence area 816. The reference sequence-area is where known sequences are displayed and is divided into a reference name subarea 818 andreference base subarea 820. The reference name subarea is shown with filenames that contain the reference sequences. The chip wild-type is identified by the filename with the extension “.wt#” where the # indicates a unit on the chip. The reference base subarea contains the bases of the reference sequences. Acapital C 822 is displayed to the right of the reference sequence that is the chip wild-type for the current analysis. Although the chip wild-type sequence has associated fluorescence intensities, the other reference sequences shown below the chip wild-type may be known sequences that have not been tiled on the chip. These may or may not have associated fluorescence intensities. The reference sequences other than the chip wild-type are used for sequence comparisons and may be in the form of simple ASCII text files. -
Sample sequence area 816 is where sample or unknown experimental sequences are displayed for comparison with the reference sequences. The sample sequence area is divided into a sample name subarea 824 andsample base subarea 826. The sample name subarea is shown with filenames that contain the sample sequences. The filename extensions indicate the method used to call the sample sequence where “.cq#” denotes the intensity ratio method, “.rq#” denotes the reference method, and “.sq#” denotes the statistical method (# indicates the unit on the chip). The sample base subarea contains the bases of the sample sequences. The bases of the sample sequences are identified by the codes previously set forth which, for the most part, conform to the IUPAC standard. -
Window 802 also contains amessage panel 828. When the user selects a base with an input device in the reference or sample base subarea, the base becomes highlighted and the pathname of the file containing the base is displayed in the message panel. The base's position in the nucleic acid sequence is also displayed in the message panel. - In pull down
menu File 804, the user is able to load files of experimental sequences that have been tiled and scanned on a chip. There is a chip wild-type associated with each experimental sequence. The chip wild-type associated with the first experimental sequence loaded is read and shown as the chip wild-type inreference sequence area 814. The user is also able to load files of known nucleic acid sequences as reference sequences for comparison purposes. As before, these known reference sequences may or may not have associated probe intensity data. Additionally, in this menu the user is able to save sequences that are selected on the screen into a project file that can be loaded in at a later time. The project file also contains any linkage of the sequences, where sequences are linked for comparison purposes. Individual sequences, both reference and sample, are selected by selecting the sequence filename with an input device in the reference or sample name subareas. - In pull down
menu Edit 806, the user is able to link together sequences in the reference and sample sequence areas. After the user has selected one reference and one or more sample sequences, the sample sequences can be linked to the reference sequence by selecting an entry in the pull down menu. Once the sequences are linked, alink number 830 is displayed next to each of the linked sequences. Each group of linked sequences is associated with a unique link number, so the user can easily identify which sequences are linked together. Linking sequences permits the user to more easily compare the linked sequences. The user is also able to remove and display links in this menu. - In pull down
menu View 808, the user is able to display intensity graphs for selected bases. Once a base is selected in the reference or-sample base subareas, the user may request an intensity graph showing the hybridized probe intensities of the selected base and a delineated neighborhood of bases near the selected base. Intensity graphs may be displayed for one or multiple selected bases. The user is also able to prepare comment files and reports in this menu. -
FIG. 9 illustrates an intensity graph window for a selected base atposition 120. The filename containing the sequence data is displayed at 904. The graph shows the intensities for each of the hybridized probes associated with a base. Each grouping of four vertical bars on the graph, which are labeled as “a”, “c”, “g”, and “t” online 906, shows the background subtracted intensities of probes having the indicated substitution base. In one embodiment, the called bases are shown in red. The wild-type base is shown at line 908, the called base is shown at line 910, and the base position is shown at line 912. InFIG. 9 , the base selected is atposition 120 as shown by arrow 914. The wild-type base at this position is T; however, the called base is M which means the base is either A or C (amino). The user is able to use intensity graphs to visually compare the intensities of each of the possible calls. -
FIG. 10 illustrates multiple intensity graph windows for selected bases. There are threeintensity graph windows rectangular box 1008 shows the selected bases' position in each of the sequences (position 162 inFIG. 10 ). The rectangular box aids the user in identifying the selected bases. - Referring again to
FIG. 8 , in pull downmenu Highlight 810, the user is able to compare the sequences of references and samples. At least four comparisons are available to the user, including the following: sample sequences to the chip wild-type sequence, sample sequences to any reference sequences, sample sequences to any linked reference sequences, and reference sequences to the chip wild-type sequence. For example, after the user has linked a reference and sample sequence, the user can compare the bases in the linked sequences. Bases in the sample sequence that are different from the reference sequence will then be indicated on the display device to the user (e.g., base is shown in a different color). In another example, the user is able to perform a comparison that will help identify sample sequences. After a sample is linked to multiple reference sequences, each base in-the sample sequence that does not match the wild-type sequence is checked to see if it matches one of the linked reference sequences. The bases that match a linked reference sequence will then be indicated on the display device to the user. The user may then more easily identify the sample sequence as being one of the reference sequences. - In pull down
menu Help 812, the user is able to get information and instructions regarding the comparative analysis program, the calling methods, and the IUPAC definitions used in the program. -
FIG. 11 illustrates the intensity ratio method correctly calling a mutation in solutions with varying concentrations. A window 1102 is shown with a chip wild-type 1104 and amutant sequence 1106. The mutant sequence differs from the chip wild-type at the position indicated by therectangular box 1108. The chip wild-type and mutant sequences are a region of HIV Pol Gene spanning mutations occurring in AZT drug therapy. - There are seven sample sequences that are called using the intensity ratio method. The sample sequences are actually solutions of different proportions of the chip wild-type sequence and the mutant sequence. Thus, there are
sample solutions Sample Solution Chip Wild-Type:Mutant 1110 100:0 1112 90:10 1114 75:25 1116 50:50 1118 25:75 1120 10:90 1122 0:100
For example,sample solution 1114 contains 75% chip wild-type sequence and 25% mutant sequence. - Now referring to the bases called in
rectangular box 1108 for the sample solutions, the intensity ratio method correctly callssample solution 1110 as having a base A as in the chip-wild type sequence. This is correct becausesample solution 1110 is 100% chip wild-type sequence. The intensity ratio method also callssample solution 1112 as having a base A because the sample solution is 90% chip wild-type sequence. - The intensity ratio method calls the identified base in
sample solutions -
Sample solutions - These experiments also show that the base calling methods of the present invention may also be used for solutions of more than one nucleic acid sequence.
-
FIG. 12 illustrates the reference method correctly calling a mutant base where the intensity ratio method incorrectly called the mutant base. There are threeintensity graph windows rectangular box 1208 outlines the bases of interest.Window 1202 shows a sample sequence called using the intensity ratio method. However, the base in therectangular box 1208 was incorrectly called base C because there is actually a base A at that position. The intensity ratio method incorrectly called the base as C because the probe intensity associated with base C is much higher than the other probe intensities. -
Window 1204 shows a reference sequence called using the intensity ratio method. As the reference sequence is known, it is not necessary to know the method used to call the reference sequence. However, it is important to have probe intensities for a reference sequence to use the reference method. The reference sequence has a base C at the position indicated by the rectangular box. -
Window 1206 shows the sample sequence called using the reference method. The reference method correctly calls the specified base as being base A. Thus, for some cases the reference method is preferable to the intensity ratio method because it compares probe intensities of a sample sequence to probe intensities of a reference sequence. - The intensity ratio method was used in sequence analysis of various polymorphic HIV-1 clones using a protease chip. Single stranded DNA of a 382 nt region was used with 4 different clones (HXB2, SF2, NY5, pPol4mut18). Results were compared to results from an ABI sequencer. The results are illustrated below:
ABI Protease Chip Sense Antisense Sense Antisense No call 0 4 9 4 Ambiguous 6 14 17 8 Wrong call 2 3 3 1 TOTAL 8 21 29 13
SUMMARY - ABI (sense)—99.5%
- Chip (sense)—98.1%
- ABI (antisense)—98.6%
- Chip (antisense)—99.1%
- HIV protease genotyping was performed using the described chips and CallSeq™ intensity ratio calculations. Samples were evaluated from AIDS patients before and after ddI treatment. Results were confirmed with ABI sequencing.
-
FIG. 13 illustrates the output of the ViewSeq™ program with four pretreatment samples and four posttreatment samples. Note the mutation atposition 207 where a mutation has arisen. Even adjacent two additional mutations (gt), the “a” mutation has been properly detected. - The above description is illustrative and not restrictive. Many variations of the invention will become apparent to those of skill in the art upon review of this disclosure. Merely by way of example, while the invention is illustrated with particular reference to the evaluation of DNA (natural or unnatural), the methods can be used in the analysis from chips with other materials synthesized thereon, such as RNA. The scope of the invention should, therefore, be determined not with reference to the above description, but instead should be determined with reference to the appended claims along with their full scope of equivalents.
Claims (23)
1-44. (canceled)
45. A computer implemented method of displaying sequences of bases, the method comprising:
displaying at least one reference sequence on a display device;
evaluating hybridization between at least one sample sequence and nucleic acid probes in order to call bases of said at least one sample sequence;
separately displaying said at least one sample sequence on said display device, wherein said displayed reference and sample sequences are displayed so that bases at a same position in said displayed reference and sample sequences are aligned on said display device;
receiving user input to select a plurality of displayed sequences for comparison by the computer system;
comparing said selected sequences; and
indicating on said display device bases that differ between or among the selected sequences at the same position including indicating bases in a selected sample sequence that differ from bases at corresponding positions in a selected reference sequence.
46. The method of claim 45 , further comprising indicating on said display device said selected sequences.
47. The method of claim 45 , further comprising displaying a common symbol on said display device next to said selected sequences.
48. The method of claim 45 , wherein said common symbol is a number.
49. The method of claim 45 , wherein said at least one reference sequence and said at least one sample sequence are monomer strands of DNA or RNA.
50. The method of claim 45 , wherein said bases are A, C, G, or T(U).
51. The method of claim 45 , wherein said at least one reference sequence includes a chip wild-type sequence.
52. The method of claim 51 , wherein said chip wild-type sequence is displayed as the first sequence.
53. The method of claim 51 , further comprising displaying a label to identify said chip wild-type sequence.
54. The method of claim 53 , wherein said label is a capital C.
55. The method of claim 45 , further comprising:
displaying a name associated with each of said at least one reference sequence; and
displaying a name associated with each of said at least one sample sequence.
56. A computer program product that displays sequences of bases, comprising:
computer code that displays at least one reference sequence on a display device;
computer code that evaluates hybridization between at least one sample sequence and nucleic acid probes in order call bases of said at least one sample sequence;
computer code that separately displays said at least one sample sequence on said display device, wherein said displayed reference and sample sequences are displayed so that bases at a same position in said displayed reference and sample sequences are aligned on said display device;
computer code that receives user input to select a plurality of displayed sequences for comparison by a computer system;
computer code that compares said selected sequences;
computer code that indicates on said display device bases that differ between or among the selected sequences at corresponding positions including indicating bases in a selected sample sequence that differ from bases at corresponding positions in a selected reference sequence; and
a computer readable medium that stores said computer codes.
57. The computer program product of claim 56 , further comprising computer code that indicates on said display device said selected sequences.
58. The computer program product of claim 56 , further comprising computer code that displays a common symbol on said display device next to said selected sequences.
59. The computer program product of claim 58 , wherein said common symbol is a number.
60. The computer program product of claim 56 , wherein said at least one reference sequence and said at least one sample sequence are monomer strands of DNA or RNA.
61. The computer program product of claim 56 , wherein said bases are A, C, G, or T(U).
62. The computer program product of claim 56 , wherein said at least one reference sequence includes a chip wild-type sequence.
63. The computer program product of claim 62 , wherein said chip wild-type sequence is displayed as the first sequence.
64. The computer program product of claim 62 , further comprising computer code that displays a label to identify said chip wild-type sequence.
65. The computer program product of claim 64 , wherein said label is a capital C.
66. The computer program product of claim 56 , further comprising:
computer code that displays a name associated with each of said at least one reference sequence; and
computer code that displays a name associated with each of said at least one sample sequence.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/791,373 US20050186581A1 (en) | 1994-10-21 | 2004-03-01 | Computer-aided visualization and analysis system for sequence evaluation |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/327,525 US5795716A (en) | 1994-10-21 | 1994-10-21 | Computer-aided visualization and analysis system for sequence evaluation |
US09/049,805 US6733964B1 (en) | 1994-10-21 | 1998-03-27 | Computer-aided visualization and analysis system for sequence evaluation |
US10/791,373 US20050186581A1 (en) | 1994-10-21 | 2004-03-01 | Computer-aided visualization and analysis system for sequence evaluation |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/049,805 Continuation US6733964B1 (en) | 1994-10-21 | 1998-03-27 | Computer-aided visualization and analysis system for sequence evaluation |
Publications (1)
Publication Number | Publication Date |
---|---|
US20050186581A1 true US20050186581A1 (en) | 2005-08-25 |
Family
ID=23276902
Family Applications (6)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US08/327,525 Expired - Lifetime US5795716A (en) | 1994-10-21 | 1994-10-21 | Computer-aided visualization and analysis system for sequence evaluation |
US08/531,137 Expired - Lifetime US5974164A (en) | 1994-10-21 | 1995-10-16 | Computer-aided visualization and analysis system for sequence evaluation |
US09/049,805 Expired - Fee Related US6733964B1 (en) | 1994-10-21 | 1998-03-27 | Computer-aided visualization and analysis system for sequence evaluation |
US09/158,765 Expired - Lifetime US6242180B1 (en) | 1994-10-21 | 1998-09-23 | Computer-aided visualization and analysis system for sequence evaluation |
US09/796,071 Expired - Fee Related US6607887B2 (en) | 1994-10-21 | 2001-02-27 | Computer-aided visualization and analysis system for sequence evaluation |
US10/791,373 Abandoned US20050186581A1 (en) | 1994-10-21 | 2004-03-01 | Computer-aided visualization and analysis system for sequence evaluation |
Family Applications Before (5)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US08/327,525 Expired - Lifetime US5795716A (en) | 1994-10-21 | 1994-10-21 | Computer-aided visualization and analysis system for sequence evaluation |
US08/531,137 Expired - Lifetime US5974164A (en) | 1994-10-21 | 1995-10-16 | Computer-aided visualization and analysis system for sequence evaluation |
US09/049,805 Expired - Fee Related US6733964B1 (en) | 1994-10-21 | 1998-03-27 | Computer-aided visualization and analysis system for sequence evaluation |
US09/158,765 Expired - Lifetime US6242180B1 (en) | 1994-10-21 | 1998-09-23 | Computer-aided visualization and analysis system for sequence evaluation |
US09/796,071 Expired - Fee Related US6607887B2 (en) | 1994-10-21 | 2001-02-27 | Computer-aided visualization and analysis system for sequence evaluation |
Country Status (2)
Country | Link |
---|---|
US (6) | US5795716A (en) |
EP (1) | EP0717113A2 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040175718A1 (en) * | 1995-10-16 | 2004-09-09 | Affymetrix, Inc. | Computer-aided visualization and analysis system for sequence evaluation |
Families Citing this family (329)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5202231A (en) | 1987-04-01 | 1993-04-13 | Drmanac Radoje T | Method of sequencing of genomes by hybridization of oligonucleotide probes |
US6040138A (en) * | 1995-09-15 | 2000-03-21 | Affymetrix, Inc. | Expression monitoring by hybridization to high density oligonucleotide arrays |
US20020048749A1 (en) * | 1998-04-15 | 2002-04-25 | Robert J. Lipshutz | Methods for polymorphism identifcation and profiling |
US6225059B1 (en) | 1993-11-01 | 2001-05-01 | Nanogen, Inc. | Advanced active electronic devices including collection electrodes for molecular biological analysis and diagnostics |
US6207373B1 (en) | 1998-02-25 | 2001-03-27 | Nanogen, Inc. | Methods for determining nature of repeat units in DNA |
US6331274B1 (en) | 1993-11-01 | 2001-12-18 | Nanogen, Inc. | Advanced active circuits and devices for molecular biological analysis and diagnostics |
US5807522A (en) * | 1994-06-17 | 1998-09-15 | The Board Of Trustees Of The Leland Stanford Junior University | Methods for fabricating microarrays of biological samples |
US7323298B1 (en) | 1994-06-17 | 2008-01-29 | The Board Of Trustees Of The Leland Stanford Junior University | Microarray for determining the relative abundances of polynuceotide sequences |
US7625697B2 (en) * | 1994-06-17 | 2009-12-01 | The Board Of Trustees Of The Leland Stanford Junior University | Methods for constructing subarrays and subarrays made thereby |
US7378236B1 (en) | 1994-06-17 | 2008-05-27 | The Board Of Trustees Of The Leland Stanford Junior University | Method for analyzing gene expression patterns |
US5795716A (en) * | 1994-10-21 | 1998-08-18 | Chee; Mark S. | Computer-aided visualization and analysis system for sequence evaluation |
US6600996B2 (en) * | 1994-10-21 | 2003-07-29 | Affymetrix, Inc. | Computer-aided techniques for analyzing biological sequences |
US20030220748A1 (en) * | 1994-10-21 | 2003-11-27 | Affymetrix, Inc., A California Corporation | Computer-aided techniques for analyzing biological sequences |
US5830645A (en) * | 1994-12-09 | 1998-11-03 | The Regents Of The University Of California | Comparative fluorescence hybridization to nucleic acid arrays |
US6300063B1 (en) * | 1995-11-29 | 2001-10-09 | Affymetrix, Inc. | Polymorphism detection |
US6953663B1 (en) | 1995-11-29 | 2005-10-11 | Affymetrix, Inc. | Polymorphism detection |
US20010018514A1 (en) | 1998-07-31 | 2001-08-30 | Mcgall Glenn H. | Nucleic acid labeling compounds |
EP0880598A4 (en) | 1996-01-23 | 2005-02-23 | Affymetrix Inc | Nucleic acid analysis techniques |
US7282327B2 (en) | 1996-01-23 | 2007-10-16 | Affymetrix, Inc. | Nucleic acid labeling compounds |
US7291463B2 (en) | 1996-01-23 | 2007-11-06 | Affymetrix, Inc. | Nucleic acid labeling compounds |
US6864059B2 (en) | 1996-01-23 | 2005-03-08 | Affymetrix, Inc. | Biotin containing C-glycoside nucleic acid labeling compounds |
US7423143B2 (en) | 1996-01-23 | 2008-09-09 | Affymetrix. Inc. | Nucleic acid labeling compounds |
US6965020B2 (en) | 1996-01-23 | 2005-11-15 | Affymetrix, Inc. | Nucleic acid labeling compounds |
EP1728875A3 (en) | 1996-02-08 | 2007-05-09 | Affymetrix, Inc. | Chip-based speciation and phenotypic characterization of microorganisms |
US6924094B1 (en) | 1996-02-08 | 2005-08-02 | Affymetrix, Inc. | Chip-based species identification and phenotypic characterization of microorganisms |
US6228575B1 (en) * | 1996-02-08 | 2001-05-08 | Affymetrix, Inc. | Chip-based species identification and phenotypic characterization of microorganisms |
US6013440A (en) * | 1996-03-11 | 2000-01-11 | Affymetrix, Inc. | Nucleic acid affinity columns |
US6329140B1 (en) * | 1996-09-19 | 2001-12-11 | Affymetrix, Inc. | Identification of molecular sequence signatures and methods involving the same |
US6391550B1 (en) | 1996-09-19 | 2002-05-21 | Affymetrix, Inc. | Identification of molecular sequence signatures and methods involving the same |
AU4428497A (en) | 1996-09-20 | 1998-04-14 | James P. Demers | Spatially addressable combinatorial chemical arrays in cd-rom format |
US6536944B1 (en) | 1996-10-09 | 2003-03-25 | Symyx Technologies, Inc. | Parallel screen for rapid thermal characterization of materials |
US6738529B1 (en) * | 1996-10-09 | 2004-05-18 | Symyx Technologies, Inc. | Analysis of chemical data from images |
US6826296B2 (en) | 1997-07-25 | 2004-11-30 | Affymetrix, Inc. | Method and system for providing a probe array chip design database |
US7068830B2 (en) * | 1997-07-25 | 2006-06-27 | Affymetrix, Inc. | Method and system for providing a probe array chip design database |
EP1007737A4 (en) | 1997-07-25 | 2002-07-03 | Affymetrix Inc | System for providing a polymorphism database |
US6420108B2 (en) | 1998-02-09 | 2002-07-16 | Affymetrix, Inc. | Computer-aided display for comparative gene expression |
US6607878B2 (en) | 1997-10-06 | 2003-08-19 | Stratagene | Collections of uniquely tagged molecules |
US6013449A (en) * | 1997-11-26 | 2000-01-11 | The United States Of America As Represented By The Department Of Health And Human Services | Probe-based analysis of heterozygous mutations using two-color labelling |
US6408308B1 (en) * | 1998-01-29 | 2002-06-18 | Incyte Pharmaceuticals, Inc. | System and method for generating, analyzing and storing normalized expression datasets from raw expression datasets derived from microarray includes nucleic acid probe sequences |
WO1999049774A2 (en) | 1998-03-31 | 1999-10-07 | Genzyme Corporation | Methods for the diagnosis and treatment of lung cancer |
US7108968B2 (en) * | 1998-04-03 | 2006-09-19 | Affymetrix, Inc. | Mycobacterial rpoB sequences |
US7875440B2 (en) | 1998-05-01 | 2011-01-25 | Arizona Board Of Regents | Method of determining the nucleotide sequence of oligonucleotides and DNA molecules |
US6780591B2 (en) | 1998-05-01 | 2004-08-24 | Arizona Board Of Regents | Method of determining the nucleotide sequence of oligonucleotides and DNA molecules |
US6525185B1 (en) | 1998-05-07 | 2003-02-25 | Affymetrix, Inc. | Polymorphisms associated with hypertension |
US6699659B2 (en) | 1998-06-01 | 2004-03-02 | Affymetrix, Inc. | Products and methods for analyzing nucleic acids including identification of substitutions, insertions and deletions |
EP2045334A1 (en) | 1998-06-24 | 2009-04-08 | Illumina, Inc. | Decoding of array sensors with microspheres |
US6593133B1 (en) | 1998-07-06 | 2003-07-15 | Nsgene A/S | Neurotrophic factors |
AU5677699A (en) * | 1998-08-21 | 2000-03-14 | Affymetrix, Inc. | Expression monitoring for human cytomegalovirus (hcmv) infection |
US6306643B1 (en) | 1998-08-24 | 2001-10-23 | Affymetrix, Inc. | Methods of using an array of pooled probes in genetic analysis |
US20040199544A1 (en) * | 2000-11-02 | 2004-10-07 | Affymetrix, Inc. | Method and apparatus for providing an expression data mining database |
US6185561B1 (en) | 1998-09-17 | 2001-02-06 | Affymetrix, Inc. | Method and apparatus for providing and expression data mining database |
US7034143B1 (en) | 1998-10-13 | 2006-04-25 | Brown University Research Foundation | Systems and methods for sequencing by hybridization |
US6489096B1 (en) * | 1998-10-15 | 2002-12-03 | Princeton University | Quantitative analysis of hybridization patterns and intensities in oligonucleotide arrays |
US7199809B1 (en) | 1998-10-19 | 2007-04-03 | Symyx Technologies, Inc. | Graphic design of combinatorial material libraries |
US6187540B1 (en) | 1998-11-09 | 2001-02-13 | Identigene, Inc. | Method of newborn identification and tracking |
US6670464B1 (en) | 1998-11-17 | 2003-12-30 | Curagen Corporation | Nucleic acids containing single nucleotide polymorphisms and methods of use thereof |
EP1179028B1 (en) | 1998-12-01 | 2005-03-02 | Syntrix Biochip, Inc. | Solvent resistant photosensitive compositions |
US6881570B1 (en) * | 1998-12-15 | 2005-04-19 | Fuji Photo Film Co., Ltd. | Test piece and quantitative method and apparatus for an organism-oriented substance |
US6142681A (en) | 1999-02-22 | 2000-11-07 | Vialogy Corporation | Method and apparatus for interpreting hybridized bioelectronic DNA microarray patterns using self-scaling convergent reverberant dynamics |
US6245511B1 (en) * | 1999-02-22 | 2001-06-12 | Vialogy Corp | Method and apparatus for exponentially convergent therapy effectiveness monitoring using DNA microarray based viral load measurements |
US20040111219A1 (en) * | 1999-02-22 | 2004-06-10 | Sandeep Gulati | Active interferometric signal analysis in software |
US6136541A (en) | 1999-02-22 | 2000-10-24 | Vialogy Corporation | Method and apparatus for analyzing hybridized biochip patterns using resonance interactions employing quantum expressor functions |
US6824866B1 (en) * | 1999-04-08 | 2004-11-30 | Affymetrix, Inc. | Porous silica substrates for polymer synthesis and assays |
US6171793B1 (en) | 1999-04-19 | 2001-01-09 | Affymetrix, Inc. | Method for scanning gene probe array to produce data having dynamic range that exceeds that of scanner |
US20060275782A1 (en) | 1999-04-20 | 2006-12-07 | Illumina, Inc. | Detection of nucleic acid reactions on bead arrays |
US20030215821A1 (en) * | 1999-04-20 | 2003-11-20 | Kevin Gunderson | Detection of nucleic acid reactions on bead arrays |
US7276336B1 (en) | 1999-07-22 | 2007-10-02 | Agilent Technologies, Inc. | Methods of fabricating an addressable array of biopolymer probes |
GB2388601B (en) * | 1999-04-30 | 2004-04-28 | Agilent Technologies Inc | Polynucleotide array fabrication |
US6507945B1 (en) | 1999-05-05 | 2003-01-14 | Symyx Technologies, Inc. | Synthesizing combinatorial libraries of materials |
AU5870200A (en) | 1999-06-03 | 2000-12-28 | Jonathan Hibbs | Non-cognate hybridization system (nchs) |
US6516276B1 (en) | 1999-06-18 | 2003-02-04 | Eos Biotechnology, Inc. | Method and apparatus for analysis of data from biomolecular arrays |
WO2000079007A1 (en) * | 1999-06-19 | 2000-12-28 | Hyseq Inc. | Improved methods of sequence assembly in sequencing by hybridization |
US7209836B1 (en) * | 1999-07-16 | 2007-04-24 | Perkinelmer Las, Inc. | Method and system for automatically creating crosstalk-corrected data of a microarray |
US7179638B2 (en) | 1999-07-30 | 2007-02-20 | Large Scale Biology Corporation | Microarrays and their manufacture by slicing |
US6653151B2 (en) | 1999-07-30 | 2003-11-25 | Large Scale Proteomics Corporation | Dry deposition of materials for microarrays using matrix displacement |
US6713309B1 (en) | 1999-07-30 | 2004-03-30 | Large Scale Proteomics Corporation | Microarrays and their manufacture |
AU6778600A (en) * | 1999-08-17 | 2001-03-13 | Affymetrix, Inc. | Methods, systems and computer software for designing and synthesizing sequence arrays |
EP1078993A1 (en) * | 1999-08-24 | 2001-02-28 | Biomerieux | Method for biochip data analysis |
EP1212599A2 (en) | 1999-08-30 | 2002-06-12 | Illumina, Inc. | Methods for improving signal detection from an array |
US6535624B1 (en) | 1999-09-01 | 2003-03-18 | Large Scale Proteomics Corporation | Gel electrophoresis image combining for improved dynamic range |
WO2001018232A2 (en) * | 1999-09-08 | 2001-03-15 | Genaissance Pharmaceuticals, Inc. | Drug target isogenes: polymorphisms in the uncoupling protein 3 (mitochondrial, proton carrier) gene |
US6941317B1 (en) | 1999-09-14 | 2005-09-06 | Eragen Biosciences, Inc. | Graphical user interface for display and analysis of biological sequence data |
US7211390B2 (en) | 1999-09-16 | 2007-05-01 | 454 Life Sciences Corporation | Method of sequencing a nucleic acid |
US7244559B2 (en) * | 1999-09-16 | 2007-07-17 | 454 Life Sciences Corporation | Method of sequencing a nucleic acid |
AU1075701A (en) | 1999-10-08 | 2001-04-23 | Protogene Laboratories, Inc. | Method and apparatus for performing large numbers of reactions using array assembly |
US6958225B2 (en) | 1999-10-27 | 2005-10-25 | Affymetrix, Inc. | Complexity management of genomic DNA |
US6867851B2 (en) * | 1999-11-04 | 2005-03-15 | Regents Of The University Of Minnesota | Scanning of biological samples |
US6784982B1 (en) * | 1999-11-04 | 2004-08-31 | Regents Of The University Of Minnesota | Direct mapping of DNA chips to detector arrays |
GB2363874B (en) * | 1999-11-06 | 2004-08-04 | Dennis Sunga Fernandez | Bioinformatic transaction scheme |
US20030097222A1 (en) * | 2000-01-25 | 2003-05-22 | Craford David M. | Method, system, and computer software for providing a genomic web portal |
US20020006617A1 (en) * | 2000-02-07 | 2002-01-17 | Jian-Bing Fan | Nucleic acid detection methods using universal priming |
US7955794B2 (en) | 2000-09-21 | 2011-06-07 | Illumina, Inc. | Multiplex nucleic acid reactions |
US8076063B2 (en) | 2000-02-07 | 2011-12-13 | Illumina, Inc. | Multiplexed methylation detection methods |
US20050214825A1 (en) * | 2000-02-07 | 2005-09-29 | John Stuelpnagel | Multiplex sample analysis on universal arrays |
US7582420B2 (en) | 2001-07-12 | 2009-09-01 | Illumina, Inc. | Multiplex nucleic acid reactions |
US6770441B2 (en) * | 2000-02-10 | 2004-08-03 | Illumina, Inc. | Array compositions and methods of making same |
AU2001257239A1 (en) * | 2000-04-25 | 2001-11-07 | Affymetrix, Inc. | Methods for monitoring the expression of alternatively spliced genes |
US7363165B2 (en) | 2000-05-04 | 2008-04-22 | The Board Of Trustees Of The Leland Stanford Junior University | Significance analysis of microarrays |
US6492115B1 (en) * | 2000-06-02 | 2002-12-10 | Dna Sciences Laboratories, Inc. | Genetic typing of the human cytochrome P450 2A6 gene and related materials and methods |
WO2002012468A2 (en) * | 2000-08-04 | 2002-02-14 | Microgenomics, Inc. | 2,5-dkg permeases |
JP2002071687A (en) * | 2000-08-31 | 2002-03-12 | Canon Inc | Screening method for variant gene |
JP3467004B2 (en) | 2000-08-31 | 2003-11-17 | キヤノン株式会社 | Analysis method of nucleic acid base sequence |
US6905816B2 (en) * | 2000-11-27 | 2005-06-14 | Intelligent Medical Devices, Inc. | Clinically intelligent diagnostic devices and methods |
GB0102357D0 (en) | 2001-01-30 | 2001-03-14 | Randox Lab Ltd | Imaging method |
US7442370B2 (en) * | 2001-02-01 | 2008-10-28 | Biogen Idec Ma Inc. | Polymer conjugates of mutated neublastin |
EP1373573B1 (en) * | 2001-03-09 | 2013-10-02 | TrovaGene, Inc. | Conjugate probes and optical detection of analytes |
US7276580B2 (en) * | 2001-03-12 | 2007-10-02 | Biogen Idec Ma Inc. | Neurotrophic factors |
AU2002258502A1 (en) | 2001-03-12 | 2002-09-24 | Affymetrix, Inc. | Nucleic acid labeling compounds |
AU785425B2 (en) | 2001-03-30 | 2007-05-17 | Genetic Technologies Limited | Methods of genomic analysis |
US7115726B2 (en) | 2001-03-30 | 2006-10-03 | Perlegen Sciences, Inc. | Haplotype structures of chromosome 21 |
US20030009294A1 (en) * | 2001-06-07 | 2003-01-09 | Jill Cheng | Integrated system for gene expression analysis |
WO2003004690A2 (en) * | 2001-07-06 | 2003-01-16 | 454$m(3) CORPORATION | Method for isolation of independent, parallel chemical micro-reactions using a porous filter |
CN1575475A (en) * | 2001-09-12 | 2005-02-02 | 长冈实业株式会社 | Methods for differential cell counts including related apparatus and software for performing same |
US20030068621A1 (en) * | 2001-10-04 | 2003-04-10 | Jonathan Briggs | Method and device for producing oligonucleotide arrays |
WO2003034064A2 (en) * | 2001-10-12 | 2003-04-24 | Duke University | Image analysis of high-density synthetic dna microarrays |
WO2003033128A2 (en) * | 2001-10-12 | 2003-04-24 | Duke University | Methods for image analysis of high-density synthetic dna microarrays |
US20050124022A1 (en) * | 2001-10-30 | 2005-06-09 | Maithreyan Srinivasan | Novel sulfurylase-luciferase fusion proteins and thermostable sulfurylase |
US6956114B2 (en) | 2001-10-30 | 2005-10-18 | '454 Corporation | Sulfurylase-luciferase fusion proteins and thermostable sulfurylase |
US6902921B2 (en) * | 2001-10-30 | 2005-06-07 | 454 Corporation | Sulfurylase-luciferase fusion proteins and thermostable sulfurylase |
AU2002357368A1 (en) * | 2001-12-19 | 2003-07-09 | Affymetrix, Inc. | Array plates and method for constructing array plates |
US20030149933A1 (en) * | 2002-02-01 | 2003-08-07 | Marco Falcioni | Graphical design of chemical discovery processes |
AU2003215240A1 (en) | 2002-02-14 | 2003-09-04 | Illumina, Inc. | Automated information processing in randomly ordered arrays |
EP1502098B1 (en) * | 2002-05-03 | 2008-09-17 | ViaLogy, LLC | Method for characterizing microarray output data |
US7504215B2 (en) | 2002-07-12 | 2009-03-17 | Affymetrix, Inc. | Nucleic acid labeling methods |
US7745203B2 (en) * | 2002-07-31 | 2010-06-29 | Kabushiki Kaisha Toshiba | Base sequence detection apparatus and base sequence automatic analyzing apparatus |
EP1535242A1 (en) | 2002-08-20 | 2005-06-01 | Cyvera Corporation | Diffraction grating-based encoded micro-particles for multiplexed experiments |
US20050032074A1 (en) * | 2002-09-09 | 2005-02-10 | Affymetrix, Inc. | Custom design method for resequencing arrays |
US7595883B1 (en) * | 2002-09-16 | 2009-09-29 | The Board Of Trustees Of The Leland Stanford Junior University | Biological analysis arrangement and approach therefor |
US7498176B2 (en) * | 2002-09-27 | 2009-03-03 | Roche Nimblegen, Inc. | Microarray with hydrophobic barriers |
US20040110212A1 (en) * | 2002-09-30 | 2004-06-10 | Mccormick Mark | Microarrays with visual alignment marks |
JP4571776B2 (en) * | 2002-11-05 | 2010-10-27 | Jx日鉱日石エネルギー株式会社 | Lubricating oil composition |
US7887752B2 (en) | 2003-01-21 | 2011-02-15 | Illumina, Inc. | Chemical reaction monitor |
CN101128601B (en) * | 2003-01-29 | 2011-06-08 | 454生命科学公司 | Methods of amplifying and sequencing nucleic acids |
US7575865B2 (en) * | 2003-01-29 | 2009-08-18 | 454 Life Sciences Corporation | Methods of amplifying and sequencing nucleic acids |
US20040157220A1 (en) | 2003-02-10 | 2004-08-12 | Purnima Kurnool | Methods and apparatus for sample tracking |
US7833714B1 (en) | 2003-04-11 | 2010-11-16 | Affymetrix, Inc. | Combinatorial affinity selection |
EP1613652A2 (en) * | 2003-04-16 | 2006-01-11 | Wyeth | A novel method of modulating bone-related activity |
ATE433759T1 (en) | 2003-04-18 | 2009-07-15 | Biogen Idec Inc | POLYMER CONJUGATED GLYCOSILATED NEUBLASTIN |
US20050136395A1 (en) * | 2003-05-08 | 2005-06-23 | Affymetrix, Inc | Methods for genetic analysis of SARS virus |
EP1636727B1 (en) * | 2003-06-10 | 2012-10-31 | Janssen Diagnostics BVBA | Computational method for predicting the contribution of mutations to the drug resistance phenotype exhibited by hiv based on a linear regression analysis of the log fold resistance |
EP1633892A4 (en) | 2003-06-10 | 2007-10-24 | Univ Boston | Detection methods for disorders of the lung |
US20040259100A1 (en) | 2003-06-20 | 2004-12-23 | Illumina, Inc. | Methods and compositions for whole genome amplification and genotyping |
EP1644184A2 (en) * | 2003-06-27 | 2006-04-12 | Nagaoka & Co., Ltd. | Fluidic circuits, methods and apparatus for use of whole blood samples in colorimetric assays |
US8131475B2 (en) | 2003-09-03 | 2012-03-06 | The United States Of America As Represented By The Secretary, Department Of Health And Human Services | Methods for identifying, diagnosing, and predicting survival of lymphomas |
AU2004271192B2 (en) | 2003-09-03 | 2011-11-17 | Government Of The United States Of America, As Represented By Secretary, Department Of Health And Human Services | Methods for identifying, diagnosing, and predicting survival of lymphomas |
US20050123971A1 (en) * | 2003-09-08 | 2005-06-09 | Affymetrix, Inc. | System, method, and computer software product for generating genotype calls |
US20050287575A1 (en) * | 2003-09-08 | 2005-12-29 | Affymetrix, Inc. | System and method for improved genotype calls using microarrays |
WO2005035788A2 (en) | 2003-10-10 | 2005-04-21 | Yissum Research Development Company Of The Hebrew University Of Jerusalem | Method and kit for assessing anxiety or disposition thereto in a subject |
US7169560B2 (en) | 2003-11-12 | 2007-01-30 | Helicos Biosciences Corporation | Short cycle methods for sequencing polynucleotides |
WO2005054516A2 (en) | 2003-11-26 | 2005-06-16 | Advandx, Inc. | Peptide nucleic acid probes for analysis of certain staphylococcus species |
US7881875B2 (en) * | 2004-01-16 | 2011-02-01 | Affymetrix, Inc. | Methods for selecting a collection of single nucleotide polymorphisms |
JP2007520222A (en) * | 2004-01-20 | 2007-07-26 | アイシス ファーマシューティカルズ インコーポレイテッド | Regulation of glucocorticoid receptor expression |
CA2497324A1 (en) | 2004-02-17 | 2005-08-17 | Affymetrix, Inc. | Methods for fragmenting and labelling dna |
EP1716254B1 (en) | 2004-02-19 | 2010-04-07 | Helicos Biosciences Corporation | Methods for analyzing polynucleotide sequences |
WO2005082110A2 (en) * | 2004-02-26 | 2005-09-09 | Illumina Inc. | Haplotype markers for diagnosing susceptibility to immunological conditions |
US7660709B2 (en) * | 2004-03-18 | 2010-02-09 | Van Andel Research Institute | Bioinformatics research and analysis system and methods associated therewith |
US7622281B2 (en) * | 2004-05-20 | 2009-11-24 | The Board Of Trustees Of The Leland Stanford Junior University | Methods and compositions for clonal amplification of nucleic acid |
BRPI0514534A (en) * | 2004-08-19 | 2008-06-17 | Biogen Idec Inc | neublastin variants |
US8722862B2 (en) | 2004-08-19 | 2014-05-13 | Biogen Idec Ma Inc. | Refolding transforming growth factor beta family proteins |
US8484000B2 (en) * | 2004-09-02 | 2013-07-09 | Vialogy Llc | Detecting events of interest using quantum resonance interferometry |
FI20041204A0 (en) | 2004-09-16 | 2004-09-16 | Riikka Lund | Methods for the utilization of new target genes associated with immune-mediated diseases |
US20060073506A1 (en) | 2004-09-17 | 2006-04-06 | Affymetrix, Inc. | Methods for identifying biological samples |
US20060073511A1 (en) | 2004-10-05 | 2006-04-06 | Affymetrix, Inc. | Methods for amplifying and analyzing nucleic acids |
US7682782B2 (en) | 2004-10-29 | 2010-03-23 | Affymetrix, Inc. | System, method, and product for multiple wavelength detection using single source excitation |
JP2006126204A (en) | 2004-10-29 | 2006-05-18 | Affymetrix Inc | Automated method for manufacturing polymer array |
US7647186B2 (en) * | 2004-12-07 | 2010-01-12 | Illumina, Inc. | Oligonucleotide ordering system |
US20090203547A1 (en) * | 2005-02-18 | 2009-08-13 | Albert Banes | Gene and Cognate Protein Profiles and Methods to Determine Connective Tissue Markers in Normal and Pathologic Conditions |
EP1910825B1 (en) | 2005-03-11 | 2012-08-08 | Firalis SAS | Biomarkers for cardiovascular side-effects induced by cox-2 inhibitory compounds |
EP2390347A1 (en) | 2005-04-14 | 2011-11-30 | Trustees Of Boston University | Diagnostic for lung disorders using class prediction |
DK1907583T4 (en) | 2005-06-15 | 2020-01-27 | Complete Genomics Inc | SINGLE MOLECULE ARRAYS FOR GENETIC AND CHEMICAL ANALYSIS |
US7666593B2 (en) | 2005-08-26 | 2010-02-23 | Helicos Biosciences Corporation | Single molecule sequencing of captured nucleic acids |
US7608395B2 (en) * | 2005-09-15 | 2009-10-27 | Baylor Research Institute | Systemic lupus erythematosus diagnostic assay |
US7962291B2 (en) * | 2005-09-30 | 2011-06-14 | Affymetrix, Inc. | Methods and computer software for detecting splice variants |
EP1952147A2 (en) | 2005-11-16 | 2008-08-06 | Novartis AG | Biomarkers for anti-nogo-a antibody treatment in spinal cord injury |
US7329860B2 (en) | 2005-11-23 | 2008-02-12 | Illumina, Inc. | Confocal imaging methods and apparatus |
US7634363B2 (en) | 2005-12-07 | 2009-12-15 | Affymetrix, Inc. | Methods for high throughput genotyping |
EP1960547A2 (en) | 2005-12-08 | 2008-08-27 | Novartis AG | Effects of inhibitors of fgfr3 on gene transcription |
US20070161031A1 (en) * | 2005-12-16 | 2007-07-12 | The Board Of Trustees Of The Leland Stanford Junior University | Functional arrays for high throughput characterization of gene expression regulatory elements |
DK1986661T3 (en) | 2006-02-08 | 2019-01-02 | Genzyme Corp | GENTERAPY AGAINST NIEMANN-PICK'S DISEASE TYPE A |
TWI501774B (en) | 2006-02-27 | 2015-10-01 | Biogen Idec Inc | Treatments for neurological disorders |
US20100056440A1 (en) * | 2006-03-01 | 2010-03-04 | Biogen Idec Ma Inc. | Compositions and methods for administering gdnf ligand family proteins |
US7397546B2 (en) | 2006-03-08 | 2008-07-08 | Helicos Biosciences Corporation | Systems and methods for reducing detected intensity non-uniformity in a laser beam |
US20090061454A1 (en) | 2006-03-09 | 2009-03-05 | Brody Jerome S | Diagnostic and prognostic methods for lung disorders using gene expression profiles from nose epithelial cells |
US7914988B1 (en) | 2006-03-31 | 2011-03-29 | Illumina, Inc. | Gene expression profiles to predict relapse of prostate cancer |
WO2007143669A2 (en) * | 2006-06-05 | 2007-12-13 | California Institute Of Technology | Real time micro arrays |
US11001881B2 (en) | 2006-08-24 | 2021-05-11 | California Institute Of Technology | Methods for detecting analytes |
US11525156B2 (en) | 2006-07-28 | 2022-12-13 | California Institute Of Technology | Multiplex Q-PCR arrays |
WO2008014485A2 (en) | 2006-07-28 | 2008-01-31 | California Institute Of Technology | Multiplex q-pcr arrays |
US8586006B2 (en) | 2006-08-09 | 2013-11-19 | Institute For Systems Biology | Organ-specific proteins and methods of their use |
US11560588B2 (en) | 2006-08-24 | 2023-01-24 | California Institute Of Technology | Multiplex Q-PCR arrays |
US9845494B2 (en) | 2006-10-18 | 2017-12-19 | Affymetrix, Inc. | Enzymatic methods for genotyping on arrays |
JP2010508815A (en) | 2006-11-02 | 2010-03-25 | イエール・ユニバーシテイ | Evaluation of oocyte receptivity |
US20080242560A1 (en) * | 2006-11-21 | 2008-10-02 | Gunderson Kevin L | Methods for generating amplified nucleic acid arrays |
US8293684B2 (en) * | 2006-11-29 | 2012-10-23 | Exiqon | Locked nucleic acid reagents for labelling nucleic acids |
DK2099935T3 (en) * | 2006-11-30 | 2014-04-07 | Univ California | ARRAY FOR DETECTION OF MICROBES |
WO2008084405A2 (en) | 2007-01-11 | 2008-07-17 | Erasmus University Medical Center | Circular chromosome conformation capture (4c) |
US8481259B2 (en) | 2007-02-05 | 2013-07-09 | Intelligent Bio-Systems, Inc. | Methods and devices for sequencing nucleic acids in smaller batches |
US11035823B2 (en) | 2009-03-17 | 2021-06-15 | Qiagen Sciences, Llc | Methods and devices for sequencing nucleic acids in smaller batches |
US11940413B2 (en) | 2007-02-05 | 2024-03-26 | IsoPlexis Corporation | Methods and devices for sequencing nucleic acids in smaller batches |
EP2135080A4 (en) * | 2007-03-08 | 2010-12-01 | Switchgear Genomics | Functional arrays for high throughput characterization of regulatory elements in untranslated regions of genes |
US20080241831A1 (en) * | 2007-03-28 | 2008-10-02 | Jian-Bing Fan | Methods for detecting small RNA species |
EP2851091B1 (en) | 2007-04-13 | 2017-12-27 | Dana-Farber Cancer Institute, Inc. | Methods for treating cancer resistant to ERBB therapeutics |
NZ580947A (en) | 2007-05-01 | 2012-05-25 | Biogen Idec Inc | Compositions and methods for increasing vascularization |
US8200440B2 (en) * | 2007-05-18 | 2012-06-12 | Affymetrix, Inc. | System, method, and computer software product for genotype determination using probe array data |
US9542394B2 (en) * | 2007-06-14 | 2017-01-10 | Excalibur Ip, Llc | Method and system for media-based event generation |
US20110135648A1 (en) * | 2007-08-08 | 2011-06-09 | Biogen Idec Ma Inc. | Anti-neublastin antibodies and uses thereof |
WO2009021684A2 (en) * | 2007-08-14 | 2009-02-19 | F. Hoffmann-La Roche Ag | Predictive marker for egfr inhibitor treatment |
CN101778954A (en) | 2007-08-14 | 2010-07-14 | 霍夫曼-拉罗奇有限公司 | Predictive markers for egfr inhibitor treatment |
WO2009021673A1 (en) | 2007-08-14 | 2009-02-19 | F. Hoffmann-La Roche Ag | Predictive markers for egfr inhibitors treatment |
CN101784676A (en) | 2007-08-14 | 2010-07-21 | 霍夫曼-拉罗奇有限公司 | Egfr inhibitor treatment marker |
US20090062138A1 (en) * | 2007-08-31 | 2009-03-05 | Curry Bo U | Array-based method for performing SNP analysis |
US7811810B2 (en) | 2007-10-25 | 2010-10-12 | Industrial Technology Research Institute | Bioassay system including optical detection apparatuses, and method for detecting biomolecules |
US7767441B2 (en) * | 2007-10-25 | 2010-08-03 | Industrial Technology Research Institute | Bioassay system including optical detection apparatuses, and method for detecting biomolecules |
US20090233809A1 (en) * | 2008-03-04 | 2009-09-17 | Affymetrix, Inc. | Resequencing methods for identification of sequence variants |
CA2710807C (en) * | 2008-03-11 | 2015-09-08 | Kyeong Man Hong | Method for measuring chromosome, gene or specific nucleotide sequence copy numbers using snp array |
US8623598B2 (en) * | 2008-03-19 | 2014-01-07 | Intelligent Bio Systems, Inc. | Methods and compositions for inhibiting undesired cleaving of labels |
US9017973B2 (en) | 2008-03-19 | 2015-04-28 | Intelligent Biosystems, Inc. | Methods and compositions for incorporating nucleotides |
US10745740B2 (en) | 2008-03-19 | 2020-08-18 | Qiagen Sciences, Llc | Sample preparation |
US8039817B2 (en) | 2008-05-05 | 2011-10-18 | Illumina, Inc. | Compensator for multiple surface imaging |
WO2009137521A2 (en) * | 2008-05-07 | 2009-11-12 | Illumina, Inc. | Compositions and methods for providing substances to and from an array |
WO2009149297A1 (en) * | 2008-06-04 | 2009-12-10 | The Arizona Board Regents, On Behalf Of The University Of Arizona | Diffuse large b-cell lymphoma markers and uses therefor |
US20110195064A1 (en) | 2008-06-06 | 2011-08-11 | The United States Of America, As Represented By The Secretary, Department Of Health And Human Serv | Survival predictor for diffuse large b cell lymphoma |
US20100087325A1 (en) * | 2008-10-07 | 2010-04-08 | Illumina, Inc. | Biological sample temperature control system and method |
US8541207B2 (en) | 2008-10-22 | 2013-09-24 | Illumina, Inc. | Preservation of information related to genomic DNA methylation |
ES2642793T3 (en) | 2008-11-06 | 2017-11-20 | University Of Miami | Role of soluble uPAR in the pathogenesis of proteinuric renal disease |
WO2010068702A2 (en) * | 2008-12-10 | 2010-06-17 | Illumina, Inc. | Methods and compositions for hybridizing nucleic acids |
ES2595410T3 (en) | 2009-01-07 | 2016-12-29 | Myriad Genetics, Inc. | Cancer biomarkers |
CN104878086A (en) | 2009-02-11 | 2015-09-02 | 卡里斯Mpi公司 | Molecular Profiling For Personalized Medicine |
US20100330569A1 (en) | 2009-04-23 | 2010-12-30 | Intelligent Bio-Systems, Inc. | Hydroxymethyl Linkers For Labeling Nucleotides |
WO2011006169A1 (en) | 2009-07-10 | 2011-01-13 | Portola Pharmaceuticals, Inc. | Methods for diagnosis and treatment of thrombotic disorders mediated by cyp2c19*2 |
US8815779B2 (en) | 2009-09-16 | 2014-08-26 | SwitchGear Genomics, Inc. | Transcription biomarkers of biological responses and methods |
EP2308892A1 (en) | 2009-10-01 | 2011-04-13 | Sanofi-Aventis | Use of siRNA targetting Sipa1l1 for the reduction of adipogenesis |
WO2011056688A2 (en) | 2009-10-27 | 2011-05-12 | Caris Life Sciences, Inc. | Molecular profiling for personalized medicine |
US20110201008A1 (en) * | 2009-12-01 | 2011-08-18 | University Of Miami | Assays, methods and kits for measuring response to therapy and predicting clinical outcome in patients with b-cell lymphoma |
US8501122B2 (en) | 2009-12-08 | 2013-08-06 | Affymetrix, Inc. | Manufacturing and processing polymer arrays |
US8835358B2 (en) | 2009-12-15 | 2014-09-16 | Cellular Research, Inc. | Digital counting of individual molecules by stochastic attachment of diverse labels |
US9798855B2 (en) | 2010-01-07 | 2017-10-24 | Affymetrix, Inc. | Differential filtering of genetic data |
US9216189B2 (en) | 2010-01-25 | 2015-12-22 | Icahn School Of Medicine At Mount Sinai | Antibodies which bind type I cannabinoid receptor/angiotensis II receptor heteromers |
DK2531880T3 (en) | 2010-02-01 | 2016-11-07 | Illumina Inc | FOCUSING METHODS AND OPTICAL SYSTEMS AND DEVICES THEREOF |
WO2011112465A1 (en) | 2010-03-06 | 2011-09-15 | Illumina, Inc. | Systems, methods, and apparatuses for detecting optical signals from a sample |
EP2563380B1 (en) | 2010-04-26 | 2018-05-30 | aTyr Pharma, Inc. | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of cysteinyl-trna synthetase |
WO2011139799A2 (en) | 2010-04-27 | 2011-11-10 | Atyr Pharma, Inc. | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of isoleucyl trna synthetases |
US8993723B2 (en) | 2010-04-28 | 2015-03-31 | Atyr Pharma, Inc. | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of alanyl-tRNA synthetases |
AU2011248457B2 (en) | 2010-04-29 | 2017-02-16 | Pangu Biopharma Limited | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of valyl tRNA synthetases |
US8986680B2 (en) | 2010-04-29 | 2015-03-24 | Atyr Pharma, Inc. | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of Asparaginyl tRNA synthetases |
US9068177B2 (en) | 2010-04-29 | 2015-06-30 | Atyr Pharma, Inc | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of glutaminyl-tRNA synthetases |
JP6008840B2 (en) | 2010-05-03 | 2016-10-19 | エータイアー ファーマ, インコーポレイテッド | Innovative discovery of therapeutic, diagnostic and antibody compositions related to protein fragments of phenylalanyl αtRNA synthetase |
US8981045B2 (en) | 2010-05-03 | 2015-03-17 | Atyr Pharma, Inc. | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of methionyl-tRNA synthetases |
EP2566515B1 (en) | 2010-05-03 | 2017-08-02 | aTyr Pharma, Inc. | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of arginyl-trna synthetases |
CA2798139C (en) | 2010-05-04 | 2019-09-24 | Atyr Pharma, Inc. | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of p38 multi-trna synthetase complex |
EP2568996B1 (en) | 2010-05-14 | 2017-10-04 | aTyr Pharma, Inc. | Therapeutic, diagnostic, and antibody compositions related to protein fragments of phenylalanyl-beta-trna synthetases |
US8962560B2 (en) | 2010-06-01 | 2015-02-24 | Atyr Pharma Inc. | Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of Lysyl-tRNA synthetases |
US9353412B2 (en) | 2010-06-18 | 2016-05-31 | Illumina, Inc. | Conformational probes and methods for sequencing nucleic acids |
EP2591126B1 (en) | 2010-07-07 | 2020-12-30 | Myriad Genetics, Inc. | Gene signatures for cancer prognosis |
JP6116479B2 (en) | 2010-07-12 | 2017-04-19 | エータイアー ファーマ, インコーポレイテッド | Innovative discovery of therapeutic, diagnostic and antibody compositions related to protein fragments of glycyl-tRNA synthetase |
EP2608801B1 (en) | 2010-08-25 | 2019-08-21 | aTyr Pharma, Inc. | INNOVATIVE DISCOVERY OF THERAPEUTIC, DIAGNOSTIC, AND ANTIBODY COMPOSITIONS RELATED TO PROTEIN FRAGMENTS OF TYROSYL-tRNA SYNTHETASES |
EP2472470A3 (en) * | 2010-09-15 | 2014-01-15 | MBT Technology LLC | System, method & computer program for presenting an automated assessment of a sample's response to external influences |
US8759038B2 (en) | 2010-09-29 | 2014-06-24 | Illumina Cambridge Limited | Compositions and methods for sequencing nucleic acids |
US20120095029A1 (en) | 2010-10-15 | 2012-04-19 | Hoffmann-La Roche Inc. | Ipp complex as marker for erlotinib treatment |
WO2012055929A1 (en) | 2010-10-26 | 2012-05-03 | Illumina, Inc. | Sequencing methods |
WO2012090073A2 (en) | 2010-12-30 | 2012-07-05 | The Netherlands Cancer Institute | Methods and compositions for predicting chemotherapy sensitivity |
US8951781B2 (en) | 2011-01-10 | 2015-02-10 | Illumina, Inc. | Systems, methods, and apparatuses to image a sample for biological or chemical analysis |
WO2012138789A2 (en) | 2011-04-04 | 2012-10-11 | Netherlands Cancer Institute | Methods and compositions for predicting resistance to anticancer treatment |
EP2694677A2 (en) | 2011-04-04 | 2014-02-12 | Netherland Cancer Institute | Methods and compositions for predicting resistance to anticancer treatment with protein kinase inhibitors |
WO2012149038A1 (en) | 2011-04-25 | 2012-11-01 | Advanced Bioscience Laboratories, Inc. | Truncated hiv envelope proteins (env), methods and compositions related thereto |
EP2718465B1 (en) | 2011-06-09 | 2022-04-13 | Illumina, Inc. | Method of making an analyte array |
AU2012328662B2 (en) | 2011-10-28 | 2015-12-17 | Illumina, Inc. | Microarray fabrication system and method |
AU2012367247B2 (en) | 2012-01-24 | 2018-04-05 | Cd Diagnostics, Inc. | System for detecting infection in synovial fluid |
EP3311847A1 (en) | 2012-02-16 | 2018-04-25 | Atyr Pharma, Inc. | Histidyl-trna synthetases for treating autoimmune and inflammatory diseases |
WO2013123125A1 (en) | 2012-02-17 | 2013-08-22 | President And Fellows Of Harvard College | Assembly of nucleic acid sequences in emulsions |
EP2820174B1 (en) | 2012-02-27 | 2019-12-25 | The University of North Carolina at Chapel Hill | Methods and uses for molecular tags |
WO2013130674A1 (en) | 2012-02-27 | 2013-09-06 | Cellular Research, Inc. | Compositions and kits for molecular counting |
EP3699577B1 (en) | 2012-08-20 | 2023-11-08 | Illumina, Inc. | System for fluorescence lifetime based sequencing |
EP2920322B1 (en) | 2012-11-16 | 2023-01-11 | Myriad Genetics, Inc. | Gene signatures for cancer prognosis |
AU2013352339B2 (en) | 2012-11-27 | 2019-07-04 | Pontificia Universidad Catolica De Chile | Compositions and methods for diagnosing thyroid tumors |
US9411930B2 (en) | 2013-02-01 | 2016-08-09 | The Regents Of The University Of California | Methods for genome assembly and haplotype phasing |
CN108624668B (en) | 2013-02-01 | 2022-12-02 | 加利福尼亚大学董事会 | Methods for genome assembly and haplotype phasing |
US11021752B2 (en) | 2013-03-15 | 2021-06-01 | Myriad Genetics, Inc. | Genes and gene signatures for diagnosis and treatment of melanoma |
GB2525568B (en) | 2013-03-15 | 2020-10-14 | Abvitro Llc | Single cell barcoding for antibody discovery |
US10535420B2 (en) | 2013-03-15 | 2020-01-14 | Affymetrix, Inc. | Systems and methods for probe design to detect the presence of simple and complex indels |
EP3633048B1 (en) | 2013-03-27 | 2022-10-12 | Alan Handyside | Assessment of risk of aneuploidy |
JP6612232B2 (en) | 2013-08-28 | 2019-11-27 | クラウン バイオサイエンス インコーポレイテッド(タイカン) | Gene expression signatures for predicting responsiveness to multikinase inhibitors of interest, and uses thereof |
WO2015031691A1 (en) | 2013-08-28 | 2015-03-05 | Cellular Research, Inc. | Massively parallel single cell analysis |
US9352315B2 (en) | 2013-09-27 | 2016-05-31 | Taiwan Semiconductor Manufacturing Company, Ltd. | Method to produce chemical pattern in micro-fluidic structure |
US20150103181A1 (en) * | 2013-10-16 | 2015-04-16 | Checkpoint Technologies Llc | Auto-flat field for image acquisition |
US10607717B2 (en) | 2013-11-06 | 2020-03-31 | The United States Of America, As Represented By The Secretary, Department Of Health And Human Services | Method for subtyping lymphoma types by means of expression profiling |
EP3094747B1 (en) | 2014-01-16 | 2018-11-07 | Illumina, Inc. | Gene expression panel for prognosis of prostate cancer recurrence |
UA123387C2 (en) | 2014-04-22 | 2021-03-31 | Інвайролоджикс, Інк. | Compositions and methods for enhancing and/or predicting dna amplification |
EP3623482A1 (en) | 2014-05-13 | 2020-03-18 | Myriad Genetics, Inc. | Gene signatures for cancer prognosis |
EP4250299A3 (en) | 2014-07-02 | 2023-12-06 | Myriad myPath, LLC | Genes and gene signatures for diagnosis and treatment of melanoma |
US10526641B2 (en) | 2014-08-01 | 2020-01-07 | Dovetail Genomics, Llc | Tagging nucleic acids for sequence assembly |
EP3536786B1 (en) | 2014-09-15 | 2021-06-30 | AbVitro LLC | High-throughput nucleotide library sequencing |
US11053550B2 (en) | 2014-10-14 | 2021-07-06 | The University Of North Carolina At Chapel Hill | Gene-expression based subtyping of pancreatic ductal adenocarcinoma |
AU2015336086C1 (en) | 2014-10-20 | 2021-07-29 | Envirologix Inc. | Compositions and methods for detecting an RNA virus |
WO2016090174A1 (en) | 2014-12-03 | 2016-06-09 | Auckland Uniservices, Ltd. | Kinase inhibitor prodrug for the treatment of cancer |
MX2017009833A (en) | 2015-01-30 | 2017-11-02 | Envirologix Inc | Substrate molecule. |
US10801065B2 (en) | 2015-02-10 | 2020-10-13 | Dana-Farber Cancer Institute, Inc. | Methods of determining levels of exposure to radiation and uses thereof |
JP6777966B2 (en) | 2015-02-17 | 2020-10-28 | ダブテイル ゲノミクス エルエルシー | Nucleic acid sequence assembly |
US9708647B2 (en) | 2015-03-23 | 2017-07-18 | Insilixa, Inc. | Multiplexed analysis of nucleic acid hybridization thermodynamics using integrated arrays |
CA3225867A1 (en) | 2015-03-24 | 2016-09-29 | Illumina, Inc. | Methods, carrier assemblies, and systems for imaging samples for biological or chemical analysis |
GB2554572B (en) | 2015-03-26 | 2021-06-23 | Dovetail Genomics Llc | Physical linkage preservation in DNA storage |
ES2927121T3 (en) | 2015-05-29 | 2022-11-02 | Illumina Inc | Sample holder and assay system for performing designated reactions |
AU2016301231A1 (en) | 2015-08-04 | 2018-02-22 | Cd Diagnostics, Inc. | Methods for detecting adverse local tissue reaction (ALTR) necrosis |
EP3338096B1 (en) | 2015-08-24 | 2024-03-06 | Illumina, Inc. | In-line pressure accumulator and flow-control system for biological or chemical assays |
US9499861B1 (en) | 2015-09-10 | 2016-11-22 | Insilixa, Inc. | Methods and systems for multiplex quantitative nucleic acid amplification |
CA2999888A1 (en) | 2015-09-24 | 2017-03-30 | Abvitro Llc | Affinity-oligonucleotide conjugates and uses thereof |
EP3353550A1 (en) | 2015-09-25 | 2018-08-01 | AbVitro LLC | High throughput process for t cell receptor target identification of natively-paired t cell receptor sequences |
KR20180071299A (en) | 2015-10-18 | 2018-06-27 | 아피메트릭스, 인코포레이티드 | Analysis of allelic genotypes of single nucleotide polymorphisms and Indel |
JP7300831B2 (en) | 2015-10-19 | 2023-06-30 | ダブテイル ゲノミクス エルエルシー | Methods for Genome Assembly, Haplotype Phasing, and Target-Independent Nucleic Acid Detection |
WO2017087819A1 (en) | 2015-11-19 | 2017-05-26 | Myriad Genetics, Inc. | Signatures for predicting cancer immune therapy response |
WO2017120456A1 (en) | 2016-01-06 | 2017-07-13 | Myriad Genetics, Inc. | Genes and gene signatures for diagnosis and treatment of melanoma |
JP7441003B2 (en) | 2016-02-23 | 2024-02-29 | ダブテイル ゲノミクス エルエルシー | Generation of phased read sets and haplotype phasing for genome assembly |
WO2017155858A1 (en) | 2016-03-07 | 2017-09-14 | Insilixa, Inc. | Nucleic acid sequence identification using solid-phase cyclic single base extension |
JP7075896B2 (en) | 2016-04-20 | 2022-05-26 | アメリカ合衆国 | Evaluation of mantle cell lymphoma and related methods |
WO2017193062A1 (en) | 2016-05-06 | 2017-11-09 | Myriad Genetics, Inc. | Gene signatures for renal cancer prognosis |
SG11201810088SA (en) | 2016-05-13 | 2018-12-28 | Dovetail Genomics Llc | Recovering long-range linkage information from preserved samples |
AU2017332495A1 (en) | 2016-09-24 | 2019-04-11 | Abvitro Llc | Affinity-oligonucleotide conjugates and uses thereof |
WO2018064116A1 (en) | 2016-09-28 | 2018-04-05 | Illumina, Inc. | Methods and systems for data compression |
WO2018213803A1 (en) | 2017-05-19 | 2018-11-22 | Neon Therapeutics, Inc. | Immunogenic neoantigen identification |
EP3638814B8 (en) | 2017-06-14 | 2023-06-21 | The United States of America, as Represented by The Secretary, Department of Health and Human Services | Method for determining lymphoma type |
WO2019246160A2 (en) | 2018-06-18 | 2019-12-26 | Igenomix, S.L. | Methods, compositions, and kits for assessing endometrial transformation |
US20210388449A1 (en) | 2018-10-09 | 2021-12-16 | Genecentric Therapeutics, Inc. | Detecting cancer cell of origin |
WO2020113237A1 (en) | 2018-11-30 | 2020-06-04 | Caris Mpi, Inc. | Next-generation molecular profiling |
JP2022525322A (en) | 2019-03-14 | 2022-05-12 | インシリクサ, インコーポレイテッド | Methods and systems for time-gate fluorescence-based detection |
US20220220564A1 (en) | 2019-04-17 | 2022-07-14 | Igenomix, S.L. | Improved methods for the early diagnosis of uterine leiomyomas and leiomyosarcomas |
AU2020397802A1 (en) | 2019-12-02 | 2022-06-16 | Caris Mpi, Inc. | Pan-cancer platinum response predictor |
WO2023122363A1 (en) | 2021-12-23 | 2023-06-29 | Illumina Software, Inc. | Dynamic graphical status summaries for nucelotide sequencing |
US20230215515A1 (en) | 2021-12-23 | 2023-07-06 | Illumina Software, Inc. | Facilitating secure execution of external workflows for genomic sequencing diagnostics |
WO2023129764A1 (en) | 2021-12-29 | 2023-07-06 | Illumina Software, Inc. | Automatically switching variant analysis model versions for genomic analysis applications |
Citations (42)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4720786A (en) * | 1985-04-19 | 1988-01-19 | Fuji Photo Film Co., Ltd. | Method of compensating for offset distortion in rows of electrophoretic patterns |
US4741043A (en) * | 1985-11-04 | 1988-04-26 | Cell Analysis Systems, Inc. | Method of and an apparatus for image analyses of biological specimens |
US4777597A (en) * | 1983-01-08 | 1988-10-11 | Fuji Photo Film Co., Ltd. | Signal processing method in autoradiography |
US4802101A (en) * | 1985-08-19 | 1989-01-31 | Fuji Photo Film Co., Ltd. | Signal processing method for determining base sequence of nucleic acid |
US4811218A (en) * | 1986-06-02 | 1989-03-07 | Applied Biosystems, Inc. | Real time scanning electrophoresis apparatus for DNA sequencing |
US4837733A (en) * | 1983-01-08 | 1989-06-06 | Fuji Photo Film Co., Ltd. | Signal processing method in autoradiography |
US4885696A (en) * | 1986-03-26 | 1989-12-05 | Fuji Photo Film Co., Ltd. | Signal processing method for determining base sequence of nucleic acid |
US4888695A (en) * | 1983-01-08 | 1989-12-19 | Fuji Photo Film Co., Ltd. | Signal processing method in autoradiography |
US4894786A (en) * | 1987-01-06 | 1990-01-16 | Fuji Photo Film Co., Ltd. | Signal processing method for analyzing autoradiograph |
US4939667A (en) * | 1986-12-27 | 1990-07-03 | Fuji Photo Film Co., Ltd. | Signal processing method for analyzing autoradiograph |
US4941092A (en) * | 1985-05-23 | 1990-07-10 | Fuji Photo Film Co., Ltd. | Signal processing method for determining base sequence of nucleic acid |
US4958281A (en) * | 1985-10-11 | 1990-09-18 | Fuji Photo Film Co., Ltd. | Signal processing method for determining base sequence of nucleic acid |
US4965725A (en) * | 1988-04-08 | 1990-10-23 | Nueromedical Systems, Inc. | Neural network based automated cytological specimen classification system and method |
US4972325A (en) * | 1986-03-29 | 1990-11-20 | Fuji Photo Film Co., Ltd. | Signal processing method for determining base sequence of nucleic acid |
US4982326A (en) * | 1986-03-05 | 1991-01-01 | Fuji Photo Film Co., Ltd. | Method for analyzing autoradiograph for determining base sequence of nucleic acid |
US5002867A (en) * | 1988-04-25 | 1991-03-26 | Macevicz Stephen C | Nucleic acid sequence determination by multiple mixed oligonucleotide probes |
US5143854A (en) * | 1989-06-07 | 1992-09-01 | Affymax Technologies N.V. | Large scale photolithographic solid phase synthesis of polypeptides and receptor binding screening thereof |
US5200313A (en) * | 1983-08-05 | 1993-04-06 | Miles Inc. | Nucleic acid hybridization assay employing detectable anti-hybrid antibodies |
US5202231A (en) * | 1987-04-01 | 1993-04-13 | Drmanac Radoje T | Method of sequencing of genomes by hybridization of oligonucleotide probes |
US5235626A (en) * | 1991-10-22 | 1993-08-10 | International Business Machines Corporation | Segmented mask and exposure system for x-ray lithography |
US5260190A (en) * | 1982-11-05 | 1993-11-09 | Fuji Photo Film Co., Ltd. | Autoradiographic process |
US5270162A (en) * | 1983-09-19 | 1993-12-14 | Fuji Photo Film Co., Ltd. | Autoradiographic gene-screening method |
US5273632A (en) * | 1992-11-19 | 1993-12-28 | University Of Utah Research Foundation | Methods and apparatus for analysis of chromatographic migration patterns |
US5288514A (en) * | 1992-09-14 | 1994-02-22 | The Regents Of The University Of California | Solid phase and combinatorial synthesis of benzodiazepine compounds on a solid support |
US5297288A (en) * | 1989-11-28 | 1994-03-22 | United States Biochemical Corporation | System for use with a high resolution scanner for scheduling a sequence of software tools for determining the presence of bands in DNA sequencing samples |
US5306618A (en) * | 1986-07-02 | 1994-04-26 | E. I. Du Pont De Nemours And Company | Method systems and reagents for DNA sequencing |
US5384261A (en) * | 1991-11-22 | 1995-01-24 | Affymax Technologies N.V. | Very large scale immobilized polymer synthesis using mechanically directed flow paths |
US5470710A (en) * | 1993-10-22 | 1995-11-28 | University Of Utah | Automated hybridization/imaging device for fluorescent multiplex DNA sequencing |
US5503985A (en) * | 1993-02-18 | 1996-04-02 | Cathey; Cheryl A. | Disposable device for diagnostic assays |
US5525464A (en) * | 1987-04-01 | 1996-06-11 | Hyseq, Inc. | Method of sequencing by hybridization of oligonucleotide probes |
US5527681A (en) * | 1989-06-07 | 1996-06-18 | Affymax Technologies N.V. | Immobilized molecular synthesis of systematically substituted compounds |
US5556749A (en) * | 1992-11-12 | 1996-09-17 | Hitachi Chemical Research Center, Inc. | Oligoprobe designstation: a computerized method for designing optimal DNA probes |
US5665549A (en) * | 1992-03-04 | 1997-09-09 | The Regents Of The University Of California | Comparative genomic hybridization (CGH) |
US5700637A (en) * | 1988-05-03 | 1997-12-23 | Isis Innovation Limited | Apparatus and method for analyzing polynucleotide sequences and method of generating oligonucleotide arrays |
US5727098A (en) * | 1994-09-07 | 1998-03-10 | Jacobson; Joseph M. | Oscillating fiber optic display and imager |
US5795716A (en) * | 1994-10-21 | 1998-08-18 | Chee; Mark S. | Computer-aided visualization and analysis system for sequence evaluation |
US5834758A (en) * | 1994-09-02 | 1998-11-10 | Affymetrix, Inc. | Method and apparatus for imaging a sample on a device |
US6025136A (en) * | 1994-12-09 | 2000-02-15 | Hyseq, Inc. | Methods and apparatus for DNA sequencing and DNA identification |
US6355432B1 (en) * | 1989-06-07 | 2002-03-12 | Affymetrix Lnc. | Products for detecting nucleic acids |
US6600996B2 (en) * | 1994-10-21 | 2003-07-29 | Affymetrix, Inc. | Computer-aided techniques for analyzing biological sequences |
US20030220748A1 (en) * | 1994-10-21 | 2003-11-27 | Affymetrix, Inc., A California Corporation | Computer-aided techniques for analyzing biological sequences |
US20040175718A1 (en) * | 1995-10-16 | 2004-09-09 | Affymetrix, Inc. | Computer-aided visualization and analysis system for sequence evaluation |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS6285862A (en) * | 1985-10-11 | 1987-04-20 | Fuji Photo Film Co Ltd | Signal processing method for determining base sequence of nucleic acid |
GB8810400D0 (en) * | 1988-05-03 | 1988-06-08 | Southern E | Analysing polynucleotide sequences |
WO1990001564A1 (en) | 1988-08-09 | 1990-02-22 | Microprobe Corporation | Methods for multiple target analyses through nucleic acid hybridization |
EP0834576B1 (en) * | 1990-12-06 | 2002-01-16 | Affymetrix, Inc. (a Delaware Corporation) | Detection of nucleic acid sequences |
ES2155822T3 (en) * | 1990-12-06 | 2001-06-01 | Affymetrix Inc | COMPOUNDS AND ITS USE IN A BINARY SYNTHESIS STRATEGY. |
EP0514927A1 (en) | 1991-05-24 | 1992-11-25 | Walter Gilbert | Method and apparatus for rapid nucleic acid sequencing |
FR2684688B1 (en) | 1991-12-04 | 1994-03-18 | Bertin Et Cie | METHOD FOR SELECTING AT LEAST ONE MUTATION SCREEN, ITS APPLICATION TO A PROCESS FOR QUICK IDENTIFICATION OF POLYMORPHIC ALLELES AND DEVICE FOR ITS IMPLEMENTATION. |
US5503980A (en) * | 1992-11-06 | 1996-04-02 | Trustees Of Boston University | Positional sequencing by hybridization |
WO1995011995A1 (en) * | 1993-10-26 | 1995-05-04 | Affymax Technologies N.V. | Arrays of nucleic acid probes on biological chips |
EP0705271B1 (en) * | 1993-06-25 | 2002-11-13 | Affymetrix, Inc. (a Delaware Corporation) | Hybridization and sequencing of nucleic acids |
US5807522A (en) * | 1994-06-17 | 1998-09-15 | The Board Of Trustees Of The Leland Stanford Junior University | Methods for fabricating microarrays of biological samples |
-
1994
- 1994-10-21 US US08/327,525 patent/US5795716A/en not_active Expired - Lifetime
-
1995
- 1995-10-16 US US08/531,137 patent/US5974164A/en not_active Expired - Lifetime
- 1995-10-20 EP EP95307476A patent/EP0717113A2/en not_active Withdrawn
-
1998
- 1998-03-27 US US09/049,805 patent/US6733964B1/en not_active Expired - Fee Related
- 1998-09-23 US US09/158,765 patent/US6242180B1/en not_active Expired - Lifetime
-
2001
- 2001-02-27 US US09/796,071 patent/US6607887B2/en not_active Expired - Fee Related
-
2004
- 2004-03-01 US US10/791,373 patent/US20050186581A1/en not_active Abandoned
Patent Citations (56)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5260190A (en) * | 1982-11-05 | 1993-11-09 | Fuji Photo Film Co., Ltd. | Autoradiographic process |
US4837733A (en) * | 1983-01-08 | 1989-06-06 | Fuji Photo Film Co., Ltd. | Signal processing method in autoradiography |
US4777597A (en) * | 1983-01-08 | 1988-10-11 | Fuji Photo Film Co., Ltd. | Signal processing method in autoradiography |
US4888695A (en) * | 1983-01-08 | 1989-12-19 | Fuji Photo Film Co., Ltd. | Signal processing method in autoradiography |
US5200313A (en) * | 1983-08-05 | 1993-04-06 | Miles Inc. | Nucleic acid hybridization assay employing detectable anti-hybrid antibodies |
US5270162A (en) * | 1983-09-19 | 1993-12-14 | Fuji Photo Film Co., Ltd. | Autoradiographic gene-screening method |
US4720786A (en) * | 1985-04-19 | 1988-01-19 | Fuji Photo Film Co., Ltd. | Method of compensating for offset distortion in rows of electrophoretic patterns |
US4941092A (en) * | 1985-05-23 | 1990-07-10 | Fuji Photo Film Co., Ltd. | Signal processing method for determining base sequence of nucleic acid |
US4802101A (en) * | 1985-08-19 | 1989-01-31 | Fuji Photo Film Co., Ltd. | Signal processing method for determining base sequence of nucleic acid |
US4958281A (en) * | 1985-10-11 | 1990-09-18 | Fuji Photo Film Co., Ltd. | Signal processing method for determining base sequence of nucleic acid |
US4741043B1 (en) * | 1985-11-04 | 1994-08-09 | Cell Analysis Systems Inc | Method of and apparatus for image analyses of biological specimens |
US4741043A (en) * | 1985-11-04 | 1988-04-26 | Cell Analysis Systems, Inc. | Method of and an apparatus for image analyses of biological specimens |
US4982326A (en) * | 1986-03-05 | 1991-01-01 | Fuji Photo Film Co., Ltd. | Method for analyzing autoradiograph for determining base sequence of nucleic acid |
US4885696A (en) * | 1986-03-26 | 1989-12-05 | Fuji Photo Film Co., Ltd. | Signal processing method for determining base sequence of nucleic acid |
US4972325A (en) * | 1986-03-29 | 1990-11-20 | Fuji Photo Film Co., Ltd. | Signal processing method for determining base sequence of nucleic acid |
US4811218A (en) * | 1986-06-02 | 1989-03-07 | Applied Biosystems, Inc. | Real time scanning electrophoresis apparatus for DNA sequencing |
US5332666A (en) * | 1986-07-02 | 1994-07-26 | E. I. Du Pont De Nemours And Company | Method, system and reagents for DNA sequencing |
US5306618A (en) * | 1986-07-02 | 1994-04-26 | E. I. Du Pont De Nemours And Company | Method systems and reagents for DNA sequencing |
US4939667A (en) * | 1986-12-27 | 1990-07-03 | Fuji Photo Film Co., Ltd. | Signal processing method for analyzing autoradiograph |
US4894786A (en) * | 1987-01-06 | 1990-01-16 | Fuji Photo Film Co., Ltd. | Signal processing method for analyzing autoradiograph |
US5667972A (en) * | 1987-04-01 | 1997-09-16 | Hyseg, Inc. | Method of sequencing of genoms by hybridization of oligonucleotide probes |
US5492806A (en) * | 1987-04-01 | 1996-02-20 | Hyseq, Inc. | Method of determining an ordered sequence of subfragments of a nucleic acid fragment by hybridization of oligonucleotide probes |
US5695940A (en) * | 1987-04-01 | 1997-12-09 | Hyseq, Inc. | Method of sequencing by hybridization of oligonucleotide probes |
US5972619A (en) * | 1987-04-01 | 1999-10-26 | Hyseq, Inc. | Computer-aided analysis system for sequencing by hybridization |
US5202231A (en) * | 1987-04-01 | 1993-04-13 | Drmanac Radoje T | Method of sequencing of genomes by hybridization of oligonucleotide probes |
US6018041A (en) * | 1987-04-01 | 2000-01-25 | Hyseq, Inc. | Method of sequencing genomes by hybridization of oligonucleotide probes |
US5525464A (en) * | 1987-04-01 | 1996-06-11 | Hyseq, Inc. | Method of sequencing by hybridization of oligonucleotide probes |
US4965725A (en) * | 1988-04-08 | 1990-10-23 | Nueromedical Systems, Inc. | Neural network based automated cytological specimen classification system and method |
US4965725B1 (en) * | 1988-04-08 | 1996-05-07 | Neuromedical Systems Inc | Neural network based automated cytological specimen classification system and method |
US5002867A (en) * | 1988-04-25 | 1991-03-26 | Macevicz Stephen C | Nucleic acid sequence determination by multiple mixed oligonucleotide probes |
US5700637A (en) * | 1988-05-03 | 1997-12-23 | Isis Innovation Limited | Apparatus and method for analyzing polynucleotide sequences and method of generating oligonucleotide arrays |
US5445934A (en) * | 1989-06-07 | 1995-08-29 | Affymax Technologies N.V. | Array of oligonucleotides on a solid substrate |
US6355432B1 (en) * | 1989-06-07 | 2002-03-12 | Affymetrix Lnc. | Products for detecting nucleic acids |
US5143854A (en) * | 1989-06-07 | 1992-09-01 | Affymax Technologies N.V. | Large scale photolithographic solid phase synthesis of polypeptides and receptor binding screening thereof |
US6646243B2 (en) * | 1989-06-07 | 2003-11-11 | Affymetrix, Inc. | Nucleic acid reading and analysis system |
US5527681A (en) * | 1989-06-07 | 1996-06-18 | Affymax Technologies N.V. | Immobilized molecular synthesis of systematically substituted compounds |
US5297288A (en) * | 1989-11-28 | 1994-03-22 | United States Biochemical Corporation | System for use with a high resolution scanner for scheduling a sequence of software tools for determining the presence of bands in DNA sequencing samples |
US5235626A (en) * | 1991-10-22 | 1993-08-10 | International Business Machines Corporation | Segmented mask and exposure system for x-ray lithography |
US5384261A (en) * | 1991-11-22 | 1995-01-24 | Affymax Technologies N.V. | Very large scale immobilized polymer synthesis using mechanically directed flow paths |
US5665549A (en) * | 1992-03-04 | 1997-09-09 | The Regents Of The University Of California | Comparative genomic hybridization (CGH) |
US5288514A (en) * | 1992-09-14 | 1994-02-22 | The Regents Of The University Of California | Solid phase and combinatorial synthesis of benzodiazepine compounds on a solid support |
US5556749A (en) * | 1992-11-12 | 1996-09-17 | Hitachi Chemical Research Center, Inc. | Oligoprobe designstation: a computerized method for designing optimal DNA probes |
US5273632A (en) * | 1992-11-19 | 1993-12-28 | University Of Utah Research Foundation | Methods and apparatus for analysis of chromatographic migration patterns |
US5503985A (en) * | 1993-02-18 | 1996-04-02 | Cathey; Cheryl A. | Disposable device for diagnostic assays |
US5470710A (en) * | 1993-10-22 | 1995-11-28 | University Of Utah | Automated hybridization/imaging device for fluorescent multiplex DNA sequencing |
US5834758A (en) * | 1994-09-02 | 1998-11-10 | Affymetrix, Inc. | Method and apparatus for imaging a sample on a device |
US5727098A (en) * | 1994-09-07 | 1998-03-10 | Jacobson; Joseph M. | Oscillating fiber optic display and imager |
US5974164A (en) * | 1994-10-21 | 1999-10-26 | Affymetrix, Inc. | Computer-aided visualization and analysis system for sequence evaluation |
US6242180B1 (en) * | 1994-10-21 | 2001-06-05 | Affymetrix, Inc. | Computer-aided visualization and analysis system for sequence evaluation |
US6600996B2 (en) * | 1994-10-21 | 2003-07-29 | Affymetrix, Inc. | Computer-aided techniques for analyzing biological sequences |
US6607887B2 (en) * | 1994-10-21 | 2003-08-19 | Affymetrix, Inc. | Computer-aided visualization and analysis system for sequence evaluation |
US5795716A (en) * | 1994-10-21 | 1998-08-18 | Chee; Mark S. | Computer-aided visualization and analysis system for sequence evaluation |
US20030220748A1 (en) * | 1994-10-21 | 2003-11-27 | Affymetrix, Inc., A California Corporation | Computer-aided techniques for analyzing biological sequences |
US6733964B1 (en) * | 1994-10-21 | 2004-05-11 | Affymetrix Inc. | Computer-aided visualization and analysis system for sequence evaluation |
US6025136A (en) * | 1994-12-09 | 2000-02-15 | Hyseq, Inc. | Methods and apparatus for DNA sequencing and DNA identification |
US20040175718A1 (en) * | 1995-10-16 | 2004-09-09 | Affymetrix, Inc. | Computer-aided visualization and analysis system for sequence evaluation |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040175718A1 (en) * | 1995-10-16 | 2004-09-09 | Affymetrix, Inc. | Computer-aided visualization and analysis system for sequence evaluation |
Also Published As
Publication number | Publication date |
---|---|
US20020012925A1 (en) | 2002-01-31 |
EP0717113A2 (en) | 1996-06-19 |
US5795716A (en) | 1998-08-18 |
US5974164A (en) | 1999-10-26 |
US6242180B1 (en) | 2001-06-05 |
EP0717113A3 (en) | 1996-07-17 |
US6607887B2 (en) | 2003-08-19 |
US6733964B1 (en) | 2004-05-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6733964B1 (en) | Computer-aided visualization and analysis system for sequence evaluation | |
US6546340B2 (en) | Computer-aided probability base calling for arrays of nucleic acid probes on chips | |
US6600996B2 (en) | Computer-aided techniques for analyzing biological sequences | |
US6361937B1 (en) | Computer-aided nucleic acid sequencing | |
EP0923050B1 (en) | Scanned image alignment systems and methods | |
US6584410B2 (en) | Polymorphism detection utilizing clustering analysis | |
US6130046A (en) | Techniques for synthesis integrity evaluation utilizing cycle fidelity probes | |
US20070067111A1 (en) | Computer-aided visualization of expression comparison | |
US8204695B2 (en) | Techniques for identifying, confirming, mapping, and categorizing polymers | |
US6699659B2 (en) | Products and methods for analyzing nucleic acids including identification of substitutions, insertions and deletions | |
US20040175718A1 (en) | Computer-aided visualization and analysis system for sequence evaluation | |
US8934689B2 (en) | Feature intensity reconstruction of biological probe array | |
US20030220748A1 (en) | Computer-aided techniques for analyzing biological sequences | |
EP1632579A2 (en) | Computer-aided techniques for analyzing biological sequences | |
CA2302193A1 (en) | Techniques for identifying, confirming, mapping and categorizing polymers |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |