WO2012040387A1 - Direct capture, amplification and sequencing of target DNA using immobilized primers


Publication number
WO2012040387A1
WO2012040387A1 PCT/US2011/052645 US2011052645W WO2012040387A1 WO 2012040387 A1 WO2012040387 A1 WO 2012040387A1 US 2011052645 W US2011052645 W US 2011052645W WO 2012040387 A1 WO2012040387 A1 WO 2012040387A1
Authority
WO
WIPO (PCT)
Prior art keywords
sequence
population
adaptor
primer
seq
Application number
PCT/US2011/052645
Other languages
French (fr)
Inventor
Samuel Myllykangas
Jason Buenrostro
Hanlee P. Ji
Original Assignee
The Board Of Trustees Of The Leland Stanford Junior University
Priority to CA2810931A (CA2810931C)
Application filed by The Board Of Trustees Of The Leland Stanford Junior University
Priority to EP11827484.4A (EP2619329B1)
Priority to NZ608313A (NZ608313A)
Priority to EP19172328.7A (EP3572528A1)
Priority to CN201180056177.8A (CN103228798B)
Priority to IN522MUN2013 (IN2013MN00522A)
Priority to RU2013118722/10A (RU2565550C2)
Priority to MX2013003349A (MX346956B)
Priority to JP2013530291A (JP5986572B2)
Priority to KR1020137008317A (KR20130113447A)
Priority to AU2011305445A (AU2011305445B2)
Publication of WO2012040387A1
Priority to IL225109A (IL225109A)


Classifications

    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6806 Preparing nucleic acids for analysis, e.g. for polymerase chain reaction [PCR] assay
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12N MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00 Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09 Recombinant DNA-technology
    • C12N15/11 DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813 Hybridisation assays
    • C12Q1/6834 Enzymatic or biochemical coupling of nucleic acids to a solid phase
    • C12Q1/6837 Enzymatic or biochemical coupling of nucleic acids to a solid phase using probe arrays or probe chips
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844 Nucleic acid amplification reactions
    • C12Q1/6853 Nucleic acid amplification reactions using modified primers or templates
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869 Methods for sequencing
    • C CHEMISTRY; METALLURGY
    • C12 BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12Q MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68 Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6869 Methods for sequencing
    • C12Q1/6874 Methods for sequencing involving nucleic acid arrays, e.g. sequencing by hybridisation
    • C CHEMISTRY; METALLURGY
    • C40 COMBINATORIAL TECHNOLOGY
    • C40B COMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B40/00 Libraries per se, e.g. arrays, mixtures
    • C40B40/04 Libraries containing only organic compounds
    • C40B40/06 Libraries containing nucleotides or polynucleotides, or derivatives thereof

Definitions

  • a target is first captured and then sequenced.
  • target capture methodologies have been developed and integrated with high throughput sequencing systems. Specifically, hybridization-based assays using beads or microarrays and in-solution based techniques using molecular inversion probes or genomic circularization oligonucleotides can be applied to capture target DNA. Captured DNA is then prepared for sequencing. Complicated molecular biology protocols are often employed to prepare the enriched DNA sample, and in certain cases production of the sequencing library involves many enzymatic reactions, purification steps and size selection by gel electrophoresis. The sample preparation process for target capture DNA sequencing can be labor intensive, and subsequent sample manipulations can cause bias in the DNA content and increase the sequencing error rate.
  • kits for practicing the method comprise: a) obtaining a substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides, wherein the members of the first and second populations of surface-bound oligonucleotides are not spatially addressed on the substrate; b) hybridizing a first member of the first population of surface-bound oligonucleotides to a selection oligonucleotide comprising a region that hybridizes with the first member and a region that contains a genomic sequence; c) extending the first member of the first population of surface-bound oligonucleotides to produce a support-bound selection primer that comprises a sequence that is complementary to the genomic sequence; d) hybridizing the support-bound selection primer to a
  • oligonucleotides to produce a PCR product.
  • the method comprises: a) obtaining a substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides, wherein the first and second populations of surface-bound oligonucleotides are not spatially addressed on the substrate; b) hybridizing a first member of the first population of surface-bound oligonucleotides to a selection oligonucleotide comprising a region that hybridizes with the first member and a region that contains a genomic sequence; c) extending the first member of the first population of surface-bound oligonucleotides to produce a support-bound selection primer that comprises a sequence that is complementary to the genomic sequence; d) hybridizing the support-bound selection primer to a nucleic acid fragment comprising the genomic sequence; e) extending the support-bound selection primer to produce an extension product that contains a sequence that flanks the genomic sequence, e.g., in a genome; and f
  • an adaptor may be either ligated to the genomic fragment prior to hybridization, or to the extension product after the support bound selection primer is extended.
  • the distal adaptor may hybridize to a surface bound oligonucleotide (which may itself be an extension product produced by a templated extension of the second population of surface-bound oligonucleotides), thereby allowing bridge PCR to occur.
  • the selection primer may also contain a sequencing primer binding site that can be employed to sequence the PCR product.
  • the method described above generally finds use in resequencing methods in which the sequence of a reference locus is available and the same locus is to be resequenced in a plurality of test samples.
  • a selection oligonucleotide is designed to hybridize to an oligonucleotide on the substrate and a region that flanks the locus to be resequenced.
  • the locus is captured on the substrate and then amplified prior to sequencing.
  • a single locus or multiple different loci e.g., up to 10, 50, 100, 200 or 1,000 or more loci
  • the method comprises: a) obtaining a substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides, wherein the first and second populations of surface-bound oligonucleotides are randomly interspersed on the substrate and not spatially addressed; b) hybridizing a first member of the first population of surface-bound oligonucleotides to a selection oligonucleotide comprising a region that hybridizes with the first member and a region that contains a genomic sequence; c) extending the first member of the first population of surface-bound oligonucleotides to produce a support-bound selection primer that comprises a sequence that is complementary to the genomic sequence; d) hybridizing the support-bound selection primer to an adaptor-ligated fragment (e.g., an adaptor-ligated genomic fragment) comprising the genomic sequence; e) extending the support-bound selection primer to produce a product that contains a sequence that flanks the genomic sequence (e.g., in a genome) and the sequence of the adaptor of the adaptor-ligated genomic fragment; and f) amplifying the product using bridge PCR to produce a PCR product.
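
Steps b) through e) above can be pictured at the level of plain strings. The following is a minimal sketch, not the protocol itself: every sequence, the function names `make_selection_primer` and `capture_and_extend`, the assumed layout of the selection oligonucleotide, and the exact-match rule standing in for hybridization are illustrative assumptions.

```python
# Toy string-level model of steps b) to e); every sequence below is made up.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Return the reverse complement of a DNA string written 5'->3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def make_selection_primer(surface_oligo, selection_oligo):
    """Steps b) and c): extend a surface-bound oligo on a selection oligo.

    Assumes the selection oligo (5'->3') is laid out as
    genomic sequence + site complementary to the surface oligo.
    """
    site = revcomp(surface_oligo)
    if not selection_oligo.endswith(site):
        raise ValueError("selection oligo does not hybridize to the surface oligo")
    genomic_part = selection_oligo[: -len(site)]
    # The newly synthesized 3' tail is complementary to the genomic sequence.
    return surface_oligo + revcomp(genomic_part)


def capture_and_extend(selection_primer, probe_len, fragment):
    """Steps d) and e): anneal the primer's 3' tail to a fragment and extend.

    Returns None if the fragment does not contain the target site.
    """
    target_site = revcomp(selection_primer[-probe_len:])
    pos = fragment.find(target_site)
    if pos < 0:
        return None
    # Extension copies the template from the annealing site to its 5' end,
    # i.e. the flanking sequence followed by the 5' adaptor.
    return selection_primer + revcomp(fragment[:pos])


first_pop = "GATTACAG"     # hypothetical first-population surface oligo
second_pop = "CCATGGTA"    # hypothetical second-population surface oligo
target = "TTGACCGTAA"      # genomic sequence carried by the selection oligo
flank = "GGCATC"           # sequence flanking the target in the genome

selection_oligo = target + revcomp(first_pop)
primer = make_selection_primer(first_pop, selection_oligo)

# Adaptor-ligated fragment: the 5' adaptor has the same sequence as second_pop.
fragment = second_pop + flank + target + "AAAT"
product = capture_and_extend(primer, len(target), fragment)

# The 3' end of the product is complementary to second_pop, so it could
# hybridize to a second-population oligo and form a bridge.
can_bridge = product.endswith(revcomp(second_pop))
```

In this toy model the product carries the flanking sequence and the adaptor complement at its 3' end, which is what would allow bridge PCR in step f); real hybridization tolerates mismatches and depends on temperature, which the exact string match ignores.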
  • the method may comprise: a) obtaining a substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides, wherein the first and second populations of surface-bound oligonucleotides are randomly interspersed on the substrate and not spatially addressed; b) hybridizing a first member of the first population of surface-bound oligonucleotides to a selection oligonucleotide comprising a region that hybridizes with the first member and a region that contains a genomic sequence; c) extending the first member of the first population of surface-bound oligonucleotides to produce a support-bound selection primer that comprises a sequence that is complementary to the genomic sequence; e) extending the support-bound selection primer to produce a product that contains a sequence that flanks the genomic sequence; f) ligating a double stranded adapter onto the product to produce an adaptor modified product; and g) amplifying
  • the method may further comprise: i. ligating the genomic fragments to an adaptor that contains a site for a sequencing primer and a nucleotide sequence that is the same as the second surface bound oligonucleotides, ii. hybridizing the adaptor-ligated genomic fragments to a first member of the first population of surface-bound oligonucleotides, iii. extending the first member of the first population of surface-bound oligonucleotides to which the adaptor ligated fragment is hybridized; and iv. hybridizing the adaptor-containing end of the extension product to a second support bound polynucleotide, thereby producing a bridge and facilitating bridge PCR.
  • FIG. 1 An overview of one embodiment of the subject method called "OS-Seq".
  • OS-Seq is a targeted resequencing method that is seamlessly integrated with the Illumina NGS platform.
  • Target-specific oligonucleotides, a sequencing library and an Illumina cluster generation kit are needed for this method. Capture of targets, processing and sequencing are performed on the NGS system. Data originating from each primer-probe is targeted and strand-specific. Shown here is the median coverage profile for OS-Seq-366.
  • Step 1 Target-specific oligonucleotides are used to modify flow cell primers to primer-probes.
  • two types of primers (named C and D) are immobilized on a paired-end flow cell.
  • In OS-Seq, a subset of D primers is modified to primer-probes using a complex library of oligonucleotides.
  • Oligonucleotides have sequences that hybridize to type D flow cell primers. Hybridized oligonucleotides are then used as a template for DNA polymerase and D primers are extended. After denaturation, target-specific primer-probes are randomly immobilized on the flow cell.
  • Step 2 Genomic targets in a single-adaptor library are captured using primer-probes. Sample preparation for Illumina sequencing involves the addition of specific DNA adapters to the genomic DNA fragments. These adapters incorporate sites for sequencing primers and immobilized flow cell primers. In OS-Seq, we use a modified adapter to prepare single-adapter libraries from genomic DNA. Targets in the single-adaptor library are captured during high heat hybridization to their complementary primer-probes.
  • Step 3 Immobilized targets are rendered compatible with Illumina sequencing. In Illumina sequencing, solid-phase amplification of the immobilized sequencing library fragments using C and D primers is required. In OS-Seq, during low heat hybridization the single-adapter tails of the immobilized targets hybridize to type C primers on the flow cell surface, which stabilizes a bridge structure. The 3' ends of immobilized targets and C primers are extended using DNA polymerase. After denaturation, two complementary, immobilized sequencing library fragments are formed that contain complete C and D priming sites and are compatible with solid-phase amplification.
  • immobilized targets are structurally identical to a standard paired-end Illumina library and are amplified and processed using Illumina's standard kits and protocols.
  • the principles of this method may be employed on other sequencing platforms.
  • oligonucleotides are sorted by sequence capture yield; the y-axis shows the normalized primer-probe yield. To calculate the normalized yield, each oligonucleotide's yield was divided by the median yield of all oligonucleotides.
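
The normalization described above is a single division by the median. A minimal sketch follows; the oligonucleotide names and yield values are invented for illustration, and `normalize_yields` is a hypothetical helper name.

```python
from statistics import median


def normalize_yields(yields):
    """Divide each oligonucleotide's yield by the median yield of all oligonucleotides."""
    med = median(yields.values())
    if med == 0:
        raise ValueError("median yield is zero; normalized yield is undefined")
    return {name: y / med for name, y in yields.items()}


# Made-up capture yields (e.g. read counts per primer-probe).
raw = {"oligo_a": 50, "oligo_b": 100, "oligo_c": 400}
normalized = normalize_yields(raw)

# Sort by yield, highest first, as one might before plotting.
ranked = sorted(normalized.items(), key=lambda item: item[1], reverse=True)
```

A normalized yield of 1.0 marks the median oligonucleotide, so values above and below 1.0 read directly as over- and under-performing primer-probes.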
  • Fig. 2 Sequencing library preparation for OS-Seq. A general scheme of genomic DNA fragmentation, end repair, A-tailing, adaptor ligation and PCR was used in the preparation of OS-Seq libraries.
  • Fig. 4 Generation of OS-Seq oligonucleotides.
  • Column synthesis yielded a large amount of mature 101-mer OS-Seq oligonucleotides that were readily usable in the assay.
  • Microarray synthesis was applied to generate high-content oligonucleotide pools.
  • Precursor oligonucleotides were amplified using primers that incorporated additional sequences into the oligonucleotides. Uracil excision was applied to cleave the amplification primer site from the coding strands of the OS-Seq oligonucleotides.
  • Fig. 5 Structures of oligonucleotide components in OS-Seq.
  • The adapter for OS-Seq contained a T-overhang for sticky-end ligation to the A-tailed genomic fragments. In addition, indexing sequences as well as the flow cell primer 'C' site were present in the dsDNA adapter.
  • Fragmentation of genomic DNA produces fragments between 200 bp and 2 kb. Sequencing library preparation adds a common adapter to the ends of the fragments. PCR amplification distorts the fragment size distribution further. Target sites are randomly distributed within the single-adapter library fragments. Library fragments were immobilized on the flow cell, and the distance between primer-probe and adapter defined the size of a genomic DNA insert. Bridge PCR is applied to amplify immobilized target DNA (generally, solid-phase PCR preferentially amplifies shorter fragments). After cluster amplification and processing, immobilized fragments are sequenced using two sites. Read 1 originates from the genomic DNA and Read 2 is derived from the synthetic primer-probes. Read 1 is used for assessing the genomic DNA sequence from OS-Seq data.
  • Fig. 8A-B Effect of GC content on targeting yield.
  • GC content of each target-specific primer-probe sequence.
  • primer-probes that were failing (captured 0 targets).
  • Proportions of failing primer-probes were compared between different % GC content categories.
  • The x-axis presents the percentages of the sorted GC categories and the y-axis reports the proportion of failed primer-probes within each GC content category.
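
The comparison described above amounts to binning primer-probes by GC percentage and computing the fraction with zero captured targets in each bin. The sketch below assumes a 10-point bin width and made-up probe sequences and counts; neither the bin width nor the helper names come from the source.

```python
def gc_percent(seq):
    """Percentage of G and C bases in a DNA sequence."""
    seq = seq.upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def failure_rate_by_gc(probes, bin_width=10):
    """Proportion of failing primer-probes per GC-content category.

    probes: iterable of (sequence, captured_target_count) pairs.
    A probe is counted as failing when it captured 0 targets.
    Returns {lower bound of GC bin in percent: proportion failed}.
    """
    totals, failed = {}, {}
    for seq, captured in probes:
        # Clamp so that 100% GC falls into the last bin instead of a new one.
        lower = min(int(gc_percent(seq) // bin_width) * bin_width, 100 - bin_width)
        totals[lower] = totals.get(lower, 0) + 1
        if captured == 0:
            failed[lower] = failed.get(lower, 0) + 1
    return {lower: failed.get(lower, 0) / totals[lower] for lower in sorted(totals)}


# Made-up primer-probe sequences with made-up capture counts.
example = [
    ("ATATATATAT", 0),   # 0% GC, failed
    ("ATGCATGCAT", 12),  # 40% GC, captured targets
    ("GCGCATATAT", 0),   # 40% GC, failed
    ("GGGGCCCCGG", 3),   # 100% GC, captured targets
]
rates = failure_rate_by_gc(example)
```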
  • Fig. 9A-B Comparison of the processing workflow for OS-Seq and shotgun library creation methods.
  • nucleic acids are written left to right in 5' to 3' orientation; amino acid sequences are written left to right in amino to carboxy orientation, respectively.
  • sample as used herein relates to a material or mixture of materials, typically, although not necessarily, in liquid form, containing one or more analytes of interest.
  • the nucleic acid samples used herein may be complex in that they contain multiple different molecules that contain sequences. Fragmented genomic DNA and cDNA made from mRNA from a mammal (e.g., mouse or human) are types of complex samples.
  • Complex samples may have more than 10^4, 10^5, 10^6 or 10^7 different nucleic acid molecules.
  • a DNA target may originate from any source such as genomic DNA, cDNA (from RNA) or artificial DNA constructs. Any sample containing nucleic acid, e.g., genomic DNA made from tissue culture cells, a sample of tissue, or an FPET sample, may be employed herein.
  • nucleotide is intended to include those moieties that contain not only the known purine and pyrimidine bases, but also other heterocyclic bases that have been modified. Such modifications include methylated purines or pyrimidines, acylated purines or pyrimidines, alkylated riboses or other heterocycles.
  • nucleotide includes those moieties that contain hapten or fluorescent labels and may contain not only conventional ribose and deoxyribose sugars, but other sugars as well.
  • Modified nucleosides or nucleotides also include modifications on the sugar moiety, e.g., wherein one or more of the hydroxyl groups are replaced with halogen atoms or aliphatic groups, or are functionalized as ethers, amines, or the like.
  • nucleic acid and “polynucleotide” are used interchangeably herein to describe a polymer of any length, e.g., greater than about 2 bases, greater than about 10 bases, greater than about 100 bases, greater than about 500 bases, greater than 1000 bases, up to about 10,000 or more bases composed of nucleotides, e.g., deoxyribonucleotides or ribonucleotides, and may be produced enzymatically or synthetically (e.g., PNA as described in U.S. Patent No.
  • Naturally-occurring nucleotides include guanine, cytosine, adenine and thymine (G, C, A and T, respectively).
  • nucleic acid sample denotes a sample containing nucleic acids.
  • target polynucleotide refers to a polynucleotide of interest under study.
  • a target polynucleotide contains one or more sequences that are of interest and under study.
  • oligonucleotide denotes a single-stranded multimer of nucleotides of from about 2 to 200 nucleotides, up to 500 nucleotides in length.
  • Oligonucleotides may be synthetic or may be made enzymatically, and, in some embodiments, are 30 to 150 nucleotides in length. Oligonucleotides may contain ribonucleotide monomers (i.e., may be oligoribonucleotides) or deoxyribonucleotide monomers (i.e., may be oligodeoxyribonucleotides).
  • An oligonucleotide may be 10 to 20, 11 to 30, 31 to 40, 41 to 50, 51 to 60, 61 to 70, 71 to 80, 80 to 100, 100 to 150 or 150 to 200 nucleotides in length, for example.
  • hybridization refers to the process by which a strand of nucleic acid joins with a complementary strand through base pairing as known in the art.
  • a nucleic acid is considered to be "selectively hybridizable" to a reference nucleic acid sequence if the two sequences specifically hybridize to one another under moderate to high stringency hybridization and wash conditions. Moderate and high stringency hybridization conditions are known (see, e.g., Ausubel, et al., Short Protocols in Molecular Biology, 3rd ed., Wiley & Sons 1995 and Sambrook et al., Molecular Cloning: A Laboratory Manual, Third Edition, 2001 Cold Spring Harbor, N.Y.).
  • high stringency conditions include hybridization at about 42 °C in 50% formamide, 5X SSC, 5X Denhardt's solution, 0.5% SDS and 100 µg/ml denatured carrier DNA followed by washing two times in 2X SSC and 0.5% SDS at room temperature and two additional times in 0.1X SSC and 0.5% SDS at 42 °C.
  • duplex or “duplexed,” as used herein, describes two complementary polynucleotides that are base-paired, i.e., hybridized together.
  • amplifying refers to generating one or more copies of a target nucleic acid, using the target nucleic acid as a template.
  • determining means determining if an element is present or not. These terms include both quantitative and/or qualitative determinations. Assessing may be relative or absolute. "Assessing the presence of" includes determining the amount of something present, as well as determining whether it is present or absent.
  • the term "using” has its conventional meaning, and, as such, means employing, e.g., putting into service, a method or composition to attain an end.
  • if a program is used to create a file, the program is executed to make the file, the file usually being the output of the program.
  • if a computer file is used, it is usually accessed and read, and the information stored in the file is employed to attain an end.
  • if a unique identifier, e.g., a barcode, is used, the unique identifier is usually read to identify, for example, an object or file associated with the unique identifier.
  • Tm refers to the melting temperature of an oligonucleotide duplex, at which half of the duplexes remain hybridized and half of the duplexes dissociate into single strands.
  • polynucleotide that is not bound or tethered to another molecule.
  • denaturing refers to the separation of a nucleic acid duplex into two single strands.
  • genomic sequence refers to a sequence that occurs in a genome. Because RNAs are transcribed from a genome, this term encompasses sequences that exist in the nuclear genome of an organism, as well as sequences that are present in a cDNA copy of an RNA (e.g., an mRNA) transcribed from such a genome.
  • genomic fragment refers to a region of a genome, e.g., an animal or plant genome such as the genome of a human, monkey, rat, fish or insect or plant.
  • a genomic fragment may or may not be adaptor ligated.
  • a genomic fragment may be adaptor ligated (in which case it has an adaptor ligated to one or both ends of the fragment, to at least the 5' end of a molecule), or non-adaptor ligated.
  • an oligonucleotide used in the method described herein may be designed using a reference genomic region, i.e., a genomic region of known nucleotide sequence, e.g., a chromosomal region whose sequence is deposited at NCBI's Genbank database or other database, for example.
  • Such an oligonucleotide may be employed in an assay that uses a sample containing a test genome, where the test genome contains a binding site for the oligonucleotide.
  • ligating refers to the enzymatically catalyzed joining of the terminal nucleotide at the 5' end of a first DNA molecule to the terminal nucleotide at the 3' end of a second DNA molecule.
  • adaptor refers to double stranded as well as single stranded molecules.
  • a "plurality" contains at least 2 members. In certain cases, a plurality may have at least 10, at least 100, at least 1,000, at least 10,000, at least 100,000, at least 10^6, at least 10^7, at least 10^8 or at least 10^9 or more members.
  • if two nucleic acids are "complementary", each base of one of the nucleic acids base pairs with corresponding nucleotides in the other nucleic acid.
  • complementary and perfectly complementary are used synonymously herein.
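
Since "complementary" is used here to mean perfectly complementary, the definition can be checked base by base. The following is a small sketch under two assumptions not stated in the source: both strands are given 5' to 3' and contain only A, C, G and T. The function names are hypothetical.

```python
_PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Reverse complement of a DNA string written 5'->3'."""
    return "".join(_PAIR[base] for base in reversed(seq.upper()))


def are_complementary(strand_a, strand_b):
    """True if every base of strand_a pairs with the corresponding base of strand_b.

    Both strands are written 5'->3', so they pair in antiparallel orientation.
    """
    return (
        len(strand_a) == len(strand_b)
        and strand_b.upper() == reverse_complement(strand_a)
    )
```

For example, `are_complementary("ATGC", "GCAT")` holds, while a single mismatch as in `are_complementary("ATGC", "GCAA")` does not.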
  • a "primer binding site" refers to a site to which a primer hybridizes in an oligonucleotide or a complementary strand thereof.
  • separating refers to physical separation of two elements (e.g., by size or affinity, etc.) as well as degradation of one element, leaving the other intact.
  • sequencing refers to a method by which the identity of at least 10 consecutive nucleotides (e.g., the identity of at least 20, at least 50, at least 100 or at least 200 or more consecutive nucleotides) of a polynucleotide is obtained.
  • not spatially addressed, in the context of a substrate containing surface-bound populations of oligonucleotides that are not spatially addressed, refers to a substrate that contains a surface containing different oligonucleotide molecules that are in no particular order or position relative to one another, i.e., at random positions or randomly interspersed with one another. Such a substrate need not be planar and in certain cases may be in the form of a bead. Substrates that contain spatially or optically addressed populations of a single oligonucleotide (e.g., microarrays and encoded beads etc.) are excluded by this definition.
  • a substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides, wherein the first and second populations of surface-bound oligonucleotides are not spatially addressed, refers to a substrate containing at least two populations of different oligonucleotides that are randomly distributed across the substrate.
  • a substrate may be planar or in the form of beads, for example.
  • adaptor-ligated refers to a nucleic acid that has been ligated to an adaptor.
  • the adaptor can be ligated to a 5' end or a 3' end of a nucleic acid molecule.
  • extending refers to the extension of a primer by the addition of nucleotides using a polymerase. If a primer that is annealed to a nucleic acid is extended, the nucleic acid acts as a template for the extension reaction.
  • bridge PCR refers to a solid-phase polymerase chain reaction in which the primers that are extended in the reaction are tethered to a substrate by their 5' ends.
  • Bridge PCR (which may also be referred to as "cluster PCR") is used in Illumina's Solexa platform. Bridge PCR and Illumina's Solexa platform are generally described in a variety of publications, e.g., Gudmundsson et al (Nat. Genet. 2009 41:1122-6), Out et al (Hum. Mutat. 2009 30:1703-12) and Turner (Nat. Methods 2009 6:315-6), US patent 7,115,400, and patent application publication nos. US20080160580 and US20080286795.
  • barcode sequence refers to a unique sequence of nucleotides that is used to identify and/or track the source of a polynucleotide in a reaction.
  • a barcode sequence may be at the 5'-end or 3'-end of an oligonucleotide. Barcode sequences may vary widely in size and composition; the following references provide guidance for selecting sets of barcode sequences appropriate for particular embodiments: Brenner, U.S. Pat. No. 5,635,400; Brenner et al, Proc. Natl. Acad. Sci., 97: 1665-1670 (2000); Shoemaker et al, Nature Genetics, 14: 450-456 (1996); Morris et al, European patent publication
  • a barcode sequence may have a length in the range of from 4 to 36 nucleotides, or from 6 to 30 nucleotides, or from 8 to 20 nucleotides.
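
To illustrate how a barcode sequence can track the source of a polynucleotide, the sketch below assigns reads to samples by an exact match on a 5'-end barcode. The barcodes, reads, sample names and the `demultiplex` helper are all invented for the example; exact matching and equal-length barcodes are simplifying assumptions.

```python
def demultiplex(reads, barcode_to_sample):
    """Assign reads to samples by the barcode at their 5' end.

    Assumes all barcodes share one length within the 4 to 36 nucleotide
    range mentioned above, and that barcodes must match exactly.
    Returns (assigned, unassigned): a dict of sample -> trimmed reads,
    and a list of reads whose prefix matched no barcode.
    """
    lengths = {len(barcode) for barcode in barcode_to_sample}
    if len(lengths) != 1:
        raise ValueError("this sketch expects barcodes of a single length")
    (size,) = lengths
    if not 4 <= size <= 36:
        raise ValueError("barcode length outside the 4 to 36 nucleotide range")

    assigned = {sample: [] for sample in barcode_to_sample.values()}
    unassigned = []
    for read in reads:
        sample = barcode_to_sample.get(read[:size])
        if sample is None:
            unassigned.append(read)
        else:
            # Strip the barcode so only the insert sequence remains.
            assigned[sample].append(read[size:])
    return assigned, unassigned


barcodes = {"ACGTAC": "sample_1", "TGCATG": "sample_2"}  # made-up barcodes
reads = ["ACGTACGGGTTT", "TGCATGAAACCC", "NNNNNNGGGG"]   # made-up reads
assigned, unassigned = demultiplex(reads, barcodes)
```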
  • the method generally comprises obtaining a substrate that contains at least two surface bound oligonucleotides of differing sequence that are spatially interspersed with one another.
  • substrates are currently employed in Illumina's Solexa sequencing technology and are described in a variety of references, e.g., US patent no. 7,115,400 and publication nos. US20080160580 and US20080286795, which are incorporated by reference for such disclosure.
  • Some of the embodiments set forth below may describe the use of the method to isolate fragments of a genome. These embodiments may be readily adapted to other types of sequences, e.g., cDNA or synthetic DNA.
  • a first member of the first population of surface-bound oligonucleotides is hybridized to a selection oligonucleotide that contains a) a region that hybridizes with the first member and a sequencing primer site, and b) a region that contains a target genomic sequence.
  • the amount of selection oligonucleotide used in this step may be optimized such that a sufficient number of oligonucleotides of the first population remain unhybridized to the selection oligonucleotide and available to be used in the bridge PCR step that occurs later in the protocol.
  • the first member of the first population of surface-bound oligonucleotides is extended to produce a duplex that contains a support-bound selection primer that contains a sequence that is complementary to the target genomic sequence.
  • the selection oligonucleotide is removed by denaturation to leave the extended support-bound selection primer.
  • the extended support-bound selection primer is then hybridized with an adapter-ligated genomic fragment (which may be made by fragmenting genomic DNA chemically, physically or using an enzyme, and then ligating adaptors to the ends of the resultant fragments) containing the target genomic sequence, sequence that flanks the target genomic sequence, and an adaptor sequence at the 5' end of one or both of the strands.
  • the support-bound selection primer is extended to produce a product that contains a sequence that flanks the genomic sequence in the genome and the sequence of the adaptor of the adaptor-ligated genomic fragment.
  • the adaptor of the adaptor-ligated genomic fragment may hybridize to the second population of surface-bound oligonucleotides.
  • a second member of the second population of surface-bound oligonucleotides may be hybridized to a modifying oligonucleotide that contains a) a region that hybridizes with the second member and b) a region that contains an adaptor sequence.
  • the amount of modifying oligonucleotide used in this step may be optimized such that a sufficient number of product molecules hybridize.
  • the second member of the second population of surface-bound oligonucleotides may be extended to produce a duplex that contains support-bound adapter primer that contains a sequence that is complementary to the adapter sequence.
  • the modifying oligonucleotide is removed by denaturation to leave support-bound adapter primer.
  • the product may be then amplified by bridge PCR.
  • the product is amplified using unextended first surface-bound oligonucleotides as well as second surface-bound oligonucleotides to produce a PCR product.
  • the genomic fragment is an adaptor-ligated genomic fragment comprising a 5' end adaptor.
  • members of the second population of the surface-bound oligonucleotides hybridize to the complement of the adaptor.
  • an adaptor may be ligated onto the extension product, thereby placing an adaptor that hybridizes to the second population of the surface-bound oligonucleotides onto the 3' end of the extension product.
  • the amplifying is done using: a) unextended members of the first population of surface-bound oligonucleotides; and b) support-bound primers that are made by: i. hybridizing members of the second population of surface-bound oligonucleotides to an oligonucleotide comprising a region that hybridizes with the members of the second population of surface-bound oligonucleotides and a region that is complementary to an adaptor; and ii. extending the members of the second population of surface-bound oligonucleotides to produce support-bound primers that hybridize to the 5' end of the extension product.
  • the genomic fragment is an adaptor-ligated genomic fragment comprising a 5' end adaptor, wherein the extending produces an extension product that comprises, on its 3' end, a sequence that is complementary to the adaptor, and wherein members of the second population of the surface-bound oligonucleotides hybridize to the sequence that is complementary to the adaptor during the bridge PCR.
  • the 5' end adaptor comprises a binding site for a sequencing primer at the end that is ligated to the genomic fragment.
  • the method comprises, between steps e) and f), ligating an adaptor onto the 3' end of the extension product, and wherein members of the second population of the surface-bound oligonucleotides hybridize to the adaptor during the bridge PCR.
  • the adaptor comprises a binding site for a sequencing primer at the end that is ligated to the genomic fragment.
  • the second population of surface-bound oligonucleotides are made by: i. hybridizing members of an initial second population of surface-bound oligonucleotides to an oligonucleotide comprising a region that hybridizes with the members of the second population of surface-bound oligonucleotides and a region that is
  • the second population of surface-bound oligonucleotides may be made by ligating an oligonucleotide comprising a region that is complementary to a sequence of said nucleic acid fragment to an initial second population of surface-bound oligonucleotides to produce said second population of surface-bound oligonucleotides.
  • This ligation may be facilitated by a splint oligonucleotide that forms a bridge between the two oligonucleotides being ligated.
  • a modifying oligonucleotide may be introduced by a ligation-based process in which a bridging oligonucleotide is used to guide the modification of the original solid support oligonucleotide to create the support-bound adapter primer.
  • the support-bound adapter primer can be created using a similar bridging oligonucleotide to create the primer extension necessary for the target modification.
  • the selection oligonucleotide comprises a binding site for a sequencing primer between said a region that hybridizes with said first member and said region that contains said genomic sequence.
  • the method may further comprise sequencing a first strand of the PCR product to obtain at least part of the nucleotide sequence of the sequence that flanks the genomic sequence.
  • This method may further comprise sequencing the second strand of the PCR product to obtain at least part of the nucleotide sequence of the sequence that flanks the genomic sequence.
  • the method may comprise fragmenting a mammalian genome to produce a fragmented genome, optionally adding adaptors to the fragmented genome, and applying the fragmented genome to the substrate.
  • the fragmenting is done physically, chemically or using a restriction enzyme.
  • the fragmenting is done by sonication or shearing, for example.
  • the hybridizing may be done by preparing a plurality of fragmented genomes from a plurality of different individuals, pooling the plurality of fragmented genomes to produce a pool, applying the pool of fragmented genomes to the substrate, and obtaining PCR products that comprise a sequence that flanks the genomic sequence in the different individuals.
  • These embodiments may further comprise sequencing at least the first strand of the PCR products to obtain at least part of the nucleotide sequence of the sequence that flanks the genomic sequence in the different individuals.
  • the adaptor comprises a barcode sequence that allows the source of the adaptor-ligated genomic fragment to be identified after the PCR products are sequenced.
  • the method comprises: adaptor-ligating fragmented genomic DNA from a first subject using a first adaptor that comprises a first barcode sequence to produce a first product; adaptor-ligating fragmented genomic DNA from a second subject using a second adaptor that comprises a second barcode sequence to produce a second product; combining the first and second products to produce a mixed template; and performing the method of claim 1 using the mixed template to provide first and second PCR products, each containing its respective barcode sequence.
  • the mixed template in some cases may comprise fragmented genomic DNA from at least 1,000 subjects.
  • the method may involve: i. ligating the genomic fragments to an adaptor that contains a site for a sequencing primer and a nucleotide sequence that is the same as the second surface-bound oligonucleotides; ii. hybridizing the adaptor-ligated genomic fragments to a first member of the first population of surface-bound oligonucleotides; iii. extending the first member of the first population of surface-bound oligonucleotides to which the adaptor-ligated fragment is hybridized; and iv. hybridizing the adaptor-containing end of the extension product to a second support-bound polynucleotide, thereby producing a bridge and facilitating bridge PCR.
  • the system may comprise: a) a substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides, wherein the first and second populations of surface-bound oligonucleotides are not spatially addressed on the substrate; b) a selection oligonucleotide that contains a region that hybridizes with a first member of the first population and a region that contains a genomic sequence; c) an adaptor; and e) instructions for performing the method of claim 1.
  • the PCR product may be sequenced, e.g., using Illumina's Solexa platform, or another solid-phase sequencing method, to obtain at least part of the nucleotide sequence of the sequence that flanks the target genomic sequence.
  • the method may employ barcode sequences that allow the source of the sequence that flanks the target genomic sequence to be identified.
  • the adaptor of the adaptor-ligated genomic fragment may contain a barcode sequence that allows the source of the adaptor-ligated genomic fragment to be identified after PCR product is sequenced.
  • this method comprises adaptor-ligating fragmented genomic DNA from a first subject (which subject may be included in a pool of first subjects) using a first adaptor that comprises a first barcode sequence to produce a first product;
  • the adaptors used have a portion that has the same sequence and that hybridizes to a surface-bound oligonucleotide, and a portion that has a different nucleotide sequence that contains the barcode sequence.
  • a second method of amplifying a selected sequence is provided.
  • the principle of this method is similar to that of the method described above, except that a) the genomic fragment that is hybridized to the support-bound selection primer is not adaptor-ligated; and b) adaptors are added after the support-bound selection primer is extended.
  • After adaptor ligation, the product may be employed in a bridge PCR reaction, as discussed above.
  • the amplifying is done using: a) unextended members of the first population of surface-bound oligonucleotides; and b) support-bound primers that are made by: i.
  • the PCR product may be sequenced to obtain at least part of the nucleotide sequence of the sequence that flanks the genomic sequence.
  • the genomic fragments may be ligated to an adaptor that contains not only a sequencing primer binding site, but also a sequence that is the same as that of the second population of surface-bound oligonucleotides.
  • the extension product contains a sequence that hybridizes to the second population of surface-bound oligonucleotides (which is usually done at a lower temperature, e.g., lower than 60 °C, e.g., lower than 55 °C), thereby facilitating amplification of the genomic fragments using the first and second surface-bound oligonucleotides.
  • the oligonucleotides of the first population are present at a molar excess of at least 5X, 10X, 20X, 50X, 100X, 500X, 1,000X, 2,000X, 10,000X, or 50,000X relative to the amount of selection oligonucleotide applied to the substrate.
  • the molar excess may be in the range of a 5X to 50,000X molar excess, e.g., a 100X to 5,000X molar excess.
  • a substrate may be contacted with a plurality of different selection oligonucleotides, each comprising a region that hybridizes with members of the first population of surface-bound oligonucleotides (which region has the same nucleotide sequence in the different selection oligonucleotides) and a region that contains a genomic sequence.
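As a small worked illustration of the molar-excess ranges above, the following sketch computes the excess of lawn primers over selection oligonucleotide and checks it against a range. The amounts are hypothetical placeholders and the function names are illustrative; the default bounds are the 5X to 50,000X range stated in the text.

```python
def molar_excess(lawn_primer_amount: float, selection_oligo_amount: float) -> float:
    """Ratio of first-population primers to selection oligonucleotide.

    Both amounts must be in the same molar unit (e.g. fmol).
    """
    if selection_oligo_amount <= 0:
        raise ValueError("selection oligonucleotide amount must be positive")
    return lawn_primer_amount / selection_oligo_amount


def within_range(excess: float, low: float = 5.0, high: float = 50_000.0) -> bool:
    """Check the excess against a range; defaults are the broad range in the text."""
    return low <= excess <= high


# Hypothetical amounts, chosen only to illustrate the arithmetic.
excess = molar_excess(lawn_primer_amount=1000.0, selection_oligo_amount=1.0)
print(excess, within_range(excess), within_range(excess, low=100.0, high=5_000.0))
```

Keeping the excess high leaves most first-population primers unextended, so that they remain available for the later bridge PCR step.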
  • the genomic sequence of each of the selection oligonucleotides is different, thereby allowing several genomic regions to be captured, amplified and sequenced on the substrate.
  • kits for practicing the subject method as described above may contain a) a substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides, wherein the first and second populations of surface-bound oligonucleotides are not spatially addressed on the substrate and b) a selection oligonucleotide that contains a region that hybridizes with a first member of the first population and a region that contains a genomic sequence.
  • the kit may also contain other reagents described above and below that may be employed in the method, e.g., adaptors, ligase, hybridization buffers, etc.
  • the subject kit typically further includes instructions for using the components of the kit to practice the subject method.
  • the instructions for practicing the subject method are generally recorded on a suitable recording medium.
  • the instructions may be printed on a substrate, such as paper or plastic, etc.
  • the instructions may be present in the kits as a package insert, in the labeling of the container of the kit or components thereof (i.e., associated with the packaging or subpackaging) etc.
  • the instructions are present as an electronic storage data file present on a suitable computer readable storage medium, e.g. CD-ROM, diskette, etc.
  • the actual instructions are not present in the kit, but means for obtaining the instructions from a remote source, e.g.
  • kits that includes a web address where the instructions can be viewed and/or from which the instructions can be downloaded. As with the instructions, this means for obtaining the instructions is recorded on a suitable substrate.
  • Other required components will include related computer programs and/or computer scripts to implement a modification to prior programs already installed on a sequencer.
  • kits may also include one or more control analyte mixtures, e.g., two or more control analytes for use in testing the kit.
  • the method is based on modifying a generic primer lawn (i.e. a lawn containing at least two primers that are randomly distributed) on a solid phase support to serve as a target DNA capture device, enabling direct sequencing of the captured DNA and without significant manipulation of the sample.
  • the method enables seamless integration of target DNA capture and sequencing experiments with a related fluidics platform.
  • This approach uses a universal primer lawn on a solid-phase support to serve as a DNA capture substrate while maintaining its sequencing potential.
  • the method can use non-processed, natural DNA as a template for sequencing. Sequencing using this method is not necessarily dependent on laboratory facilities.
  • the method can be used to analyze single and double stranded templates.
  • the ability to analyze single-strand DNA templates can be important for some sequencing applications that use formalin-fixed paraffin-embedded samples from
  • the straightforward capture sequencing assay is not restricted to human genomic DNA; other nucleic acid substrates, such as bacterial and viral DNA and RNA, can also be analyzed. Transcriptomes, noncoding RNAs and miRNAs can also be captured and sequenced. In addition to nucleotide sequence capture and sequencing, other genetic and epigenetic properties can be studied, such as DNA methylation, large genomic DNA
  • the method may also be employed to select synthetic DNA from a population.
  • sequencing has been regarded as a process in which the DNA sample is structurally modified to facilitate the analysis on a sequencing system.
  • the method described below modifies the sequencing system and therefore there is no need to modify and extensively prepare the sample.
  • By functionalizing a generic primer lawn using a synthetic DNA oligonucleotide library, target genes of non-processed samples may be directly assayed.
  • specific DNA components that provide sequences that are employed in the formation of the bridge structure are brought in sequentially, and the primer lawn is itself modified. Sequencing library preparation for all types of sequencers relies on adding specific double-strand adaptor sequences to the DNA template.
  • the library preparation for the assay only required the addition of a single adaptor. This substantially shortens the sample processing and does not require clonal amplification or gel electrophoresis-based size separation.
  • a second adapter may be added to the captured template on a solid support. Certain embodiments of the method allow for the use of raw DNA as a sequencing template.
  • A genomic DNA sample can be prepared for sequencing by a simple heat fragmentation step, and the entire assay can be fully automated and performed on the solid support.
  • the capture and subsequent reactions can be mediated by a fluidics system.
  • An additional embodiment provides a method that allows the preparation of DNA fragments for sequencing on the solid support by using fragmented DNA as a template and adding sequencing adapters to the captured DNA fragments using a fluidics system.
  • an Illumina next-generation DNA sequencer was used to develop these approaches. The results from an integrated capture and sequencing preparation reaction using primer lawn modification and 366 target sites in the human genome are presented. With the exception of 25-minute heat fragmentation, all steps can be done on the solid-phase support of the Illumina flow cell.
  • the data described below demonstrate the robustness of the assay and the applicability of a universal primer lawn and a fluidics system as a capture substrate. Unique parameters of the modification of primer lawns have been identified, which enable the method to work robustly. In addition to complex eukaryotic genomes, the method can be applied to capture microbial and other organisms' genomes, viral DNA and RNA, transcriptomes of different sources as well as synthetic DNA. Furthermore, the concept of "programming" a native primer lawn immobilized on a solid support of a fluidics system and executing specific applications is being introduced and validated.
  • Genomic DNA samples. Genomic DNA for NA18507 was obtained from the Coriell Institute. Fresh frozen tissue samples were obtained from a colorectal cancer patient. Patient material was obtained with informed consent from the Stanford Cancer Center and the study was approved by the institutional review board (IRB) at Stanford University School of Medicine. Frozen tissue sections were prepared, hematoxylin-eosin staining was performed and the tumor composition of each sample was determined via pathological examination. Samples representing tumor and normal tissues were dissected from areas where cellular composition was 90% tumor or purely normal, respectively. Genomic DNA was extracted using E.Z.N.A SQ DNA/RNA Protein Kit (Omega Bio-Tek, Norcross, GA).
  • SNP 6.0 arrays (Affymetrix, Santa Clara, CA). Data analysis was performed using the Genotyping Console software and Birdseed V2 algorithm (Affymetrix). Thirteen additional microarray data sets were analyzed in concert with the studied samples in order to assess the quality of the SNP calls. SNP 6.0 array data was filtered using a P-value threshold of 0.01.
  • CCDS build release 20090902, human genome build NCBI 37 (hg19) and dbSNP Build ID 131 were used as the polymorphism reference data set.
  • GeneRanker annotation database was used to choose 344 cancer genes prioritized by importance.
  • the exon definitions for the candidate genes were taken from CCDS.
  • the 40-mer target-specific sequences were 10 bases outside of the 5' end of the exon boundary (Fig. 3a). Both strands of the exons were targeted using individual primer-probes.
  • OS-Seq-366 only covered the flanks of exons.
  • exons larger than 500 bp were treated by tiling target-specific sequences until the entire exonic region was covered (Fig. 3b).
  • To improve the on-target specificity of OS-Seq-11k, we used Repbase to identify and eliminate oligonucleotide sequences that targeted highly repetitive sequences.
  • Oligonucleotide synthesis. Two strategies were applied for oligonucleotide synthesis.
  • For OS-Seq-366, we designed 366 101-mer oligonucleotides (Fig. 5a), which were then column-synthesized (Stanford Genome Technology Center, Stanford, CA) (Fig. 4a). Oligonucleotides were quantified and pooled in equimolar concentration.
  • For OS-Seq-11k, an in-situ microarray synthesis (LC Sciences, Houston) approach was used to synthesize the 11,742 precursor oligonucleotides (Fig. 5b).
  • the sequences of target-specific oligonucleotides are in Table 2 below.
  • Amplification of microarray-synthesized oligonucleotides. Three 25 µl subpools of precursor 80-mer oligonucleotides were used (587, 638 and 415 nM) (Fig. 5b). A PCR approach was employed to amplify the precursor, low-concentration oligonucleotides (Fig. 4b). The array-synthesized oligonucleotide subpools were diluted to 10 fM/oligo and used as a template for PCR amplification.
  • PCR was performed using Taq DNA polymerase (NEB) and dNTPs (1 mM dATP, 1 mM dCTP, 1 mM dGTP, 500 nM dTTP and 500 nM dUTP) in standard reaction conditions. After denaturation at 95°C for 30 s, 20 amplification cycles (95°C, 30 s; 55°C, 30 s; 68°C, 30 s) were performed. Amplification Primer 1 contained uracil at the 3' end, while Amplification Primer 2 incorporated additional functional sequences (Fig. 5b).
  • oligonucleotides were purified to remove excess primer (Fermentas), then processed using 0.1 U/µl Uracil DNA-excision Mix (Epicentre, Madison, WI) at 37°C for 45 min to detach the universal amplification primer site and cleave the mature 101-mer coding strands of the oligonucleotides.
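The probe placement rules above (40-mer probes placed 10 bases outside the 5' exon boundary, with tiling for exons larger than 500 bp) can be sketched as follows. This is a minimal illustration, not the design pipeline used for OS-Seq-366 or OS-Seq-11k: it handles the forward strand only, uses 0-based half-open coordinates, reads "10 bases outside" as a 10-base gap between the probe's 3' end and its anchor, and assumes a 500-base tiling step, which the text does not specify. The function name is hypothetical.

```python
from typing import List, Tuple


def forward_probe_windows(
    exon_start: int,
    exon_end: int,
    probe_len: int = 40,   # 40-mer target-specific sequence (from the text)
    offset: int = 10,      # gap between probe 3' end and anchor (interpretation)
    tile_step: int = 500,  # assumed tiling spacing; not stated in the text
) -> List[Tuple[int, int]]:
    """Return (start, end) windows of forward-strand probes for one exon.

    Coordinates are 0-based, half-open. Exons longer than `tile_step` get
    additional anchors every `tile_step` bases until the exon is covered.
    """
    if exon_end <= exon_start:
        raise ValueError("exon_end must be greater than exon_start")
    if exon_end - exon_start > tile_step:
        anchors = list(range(exon_start, exon_end, tile_step))
    else:
        anchors = [exon_start]
    return [(a - offset - probe_len, a - offset) for a in anchors]


# A short exon gets a single flanking probe; a long exon is tiled.
print(forward_probe_windows(1000, 1300))
print(forward_probe_windows(1000, 2200))
```

Reverse-strand probes would mirror this logic around the other exon boundary, since both strands were targeted using individual primer-probes.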
  • the oligonucleotides require the 5' ends to be functional and free in order to have accurate extension of the target-specific site during primer-probe immobilization.
  • After heat shock inactivation of the enzymes (65°C, 10 min), the oligonucleotide preparations were purified (Fermentas). Finally, we quantified the three oligonucleotide subpools and created a single pool with equimolar concentration of each subpool.
  • polyacrylamide layer at extremely high density a subset of the 'D' primers was specifically modified using the Illumina Cluster Station. Prior to the NGS primer modification, 133 nM oligonucleotide pools were heat denatured at 95°C for 5 min. We used heat shock (95°C for 5 min) to free the coding strand of the OS-Seq oligonucleotides. Additional strand purification was not required as the second strand is inactive on the flow cell and is washed away after hybridization. Denatured oligonucleotides were diluted with 4x Hybridization buffer (20x SSC, 0.2% Tween-20). The resulting 100 nM oligonucleotides were used in the flow cell modification experiments. 30 µl of oligonucleotide mixture was dispensed into each lane of the flow cell. During a temperature ramp (from 96°C to 40°C in 18 minutes) oligonucleotides annealed specifically to the immobilized primer 'D'. Then, DNA polymerase was used to extend the 'D' primer with the annealed oligonucleotide as a template. After extension, the original oligonucleotide template was denatured from the extended 'D' primer and washed from the solid phase support. Standard Illumina v4 reagents were used for extension, wash and denaturation steps. The modification of primer 'D' caused immobilization of the primer-probes.
  • Genomic DNA was fragmented using Covaris E210R (Covaris, Woburn, MA) to obtain a mean fragment size of 500 bp (duty cycle 5%, intensity 3, 200 cycles per burst and 80 seconds).
  • the randomly fragmented DNA was end-repaired using 0.25 U of Klenow large fragment (New England Biolabs, Ipswich, MA), 7.5 U of T4 DNA polymerase (NEB), 400 µM of each dNTP (NEB), 25 U of T4 Polynucleotide kinase (NEB) and T4 DNA ligase buffer with ATP (NEB) in 50 µl reaction volume at room
  • adenines were added to the 3' ends of the template DNA using 3.2 U of Taq DNA polymerase (NEB), 100 µM dATP (Invitrogen) and Taq buffer with 1.5 mM MgCl2 in an 80 µl reaction at 72°C for 15 min. Before adapter ligation, reactions were purified using PCR purification kit (Fermentas).
  • the sequencing library adapters contain an optional 6-base indexing sequence, a sequencing primer 1 site and a 12-mer sequence for primer 'C' hybridization (Table 2 above, Fig. 5c). Sixteen indexing adapters were designed. Adapter oligonucleotides were synthesized at the Stanford Genome Technology Center. Prior to ligation, adapter oligonucleotides were annealed during temperature ramp down. For the targeted resequencing of NA18507, we used both a singleplex adapter as well as a multiplex adapter with 'AACCTG' tag.
  • Targets were captured on the flow cell using OS-Seq primer-probes (Fig. 1b and oligonucleotide sequences below).
  • Target DNA was hybridized to the primer-probes by incubating the sequencing libraries in the flow cell at 65°C for 20 hours. During genomic DNA library hybridization and subsequent extension, the flow cell was kept at a constant 65°C.
  • An Illumina Cluster Station was used to carry out the primer-probe hybridization and extension steps. Prior to hybridization to primer-probes, 22.5 µl of sequencing libraries (40 - 56.6 ng/µl) was denatured at 95°C for 5 min.
  • the genomic DNA libraries were diluted to a total volume of 30 µl using 4x Hybridization buffer.
  • the final DNA concentrations of sequencing libraries ranged from 30 to 41.7 ng/µl. Due to the high concentration of the sequencing libraries, the hybridization volume was kept at minimum. Therefore, a custom Cluster Station program was developed to allow reproducible low-volume hybridization. The following extension, wash and denaturation steps were performed using Illumina v4 reagents.
  • Samples were sequenced using 40 by 40 (OS-Seq-366) or 60 by 60 (OS-Seq-11k) paired-end cycles on an Illumina Genome Analyzer IIx using regular version 4 sequencing reagents and recipes (Illumina). Image analysis and base calling were performed using the SCS 2.8 and RTA 2.8 software (Illumina).
  • Sequence analysis and variant detection. Sequence reads were aligned to the human genome (build NCBI 37, hg19) using Burrows-Wheeler Aligner (BWA) 19.
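The dilution described above follows the usual C1·V1 = C2·V2 relation. As a minimal sketch, the helper below (a hypothetical name) reproduces the lower end of the stated range: 22.5 µl of a 40 ng/µl library brought to a total of 30 µl.

```python
def diluted_concentration(stock_conc: float, stock_volume: float, final_volume: float) -> float:
    """Concentration after dilution, from C1 * V1 = C2 * V2.

    Concentration units are preserved; both volumes must share one unit.
    """
    if final_volume < stock_volume:
        raise ValueError("final volume cannot be smaller than the stock volume")
    return stock_conc * stock_volume / final_volume


# 22.5 ul of a 40 ng/ul library diluted to 30 ul total with hybridization buffer.
print(diluted_concentration(stock_conc=40.0, stock_volume=22.5, final_volume=30.0))
```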
  • on-target reads (Read 1) were defined as being within 1 kb of the 5' end of the primer-probe.
  • Off-target reads were defined as aligning outside 1 kb of the 5' end of the primer-probe or mapping on a different chromosome from the location of the associated primer-probe.
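The on-target and off-target definitions above amount to a simple predicate on the Read 1 alignment. The sketch below is one possible reading, with assumptions stated: distance is taken as an absolute difference of coordinates, the 1 kb boundary is treated as inclusive, and strand orientation is ignored. The function name and example coordinates are hypothetical.

```python
def is_on_target(
    read_chrom: str,
    read_pos: int,
    probe_chrom: str,
    probe_5prime_pos: int,
    max_distance: int = 1000,  # 1 kb window from the text
) -> bool:
    """True if Read 1 maps on the probe's chromosome within 1 kb of its 5' end."""
    same_chromosome = read_chrom == probe_chrom
    return same_chromosome and abs(read_pos - probe_5prime_pos) <= max_distance


# Hypothetical coordinates for illustration.
print(is_on_target("chr7", 55_000_283, "chr7", 55_000_000))  # close to the probe
print(is_on_target("chr7", 55_001_700, "chr7", 55_000_000))  # beyond 1 kb
print(is_on_target("chr8", 55_000_283, "chr7", 55_000_000))  # other chromosome
```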
  • For the de-multiplexing of indexed lanes, we used a perl script to generate an index of the 7-base tags using the base-call files. This index file and another perl script were used to demultiplex either the combined base-call file (so that separate fastq files can be generated for further processing) or the aligned file.
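The perl scripts mentioned above are not reproduced in this document. As a rough illustration of the idea only, the following Python sketch bins reads by tag. It assumes the tag occupies the first 7 bases of each read and that only exact tag matches are assigned; the tag sequences, sample names and function name are made up.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def demultiplex(
    reads: Iterable[str],
    tag_to_sample: Dict[str, str],
    tag_len: int = 7,  # 7-base tags, as stated in the text
) -> Dict[str, List[str]]:
    """Group reads by their leading tag; the tag is trimmed from each read.

    Reads whose tag is not in `tag_to_sample` go to the "unassigned" bin.
    """
    bins: Dict[str, List[str]] = defaultdict(list)
    for read in reads:
        tag, insert = read[:tag_len], read[tag_len:]
        bins[tag_to_sample.get(tag, "unassigned")].append(insert)
    return dict(bins)


# Hypothetical tags and reads, for illustration only.
tags = {"AACCTGT": "sample_A", "GGTTACT": "sample_B"}
reads = ["AACCTGTACGTAC", "GGTTACTTTGGCC", "NNNNNNNAAAA"]
print(demultiplex(reads, tags))
```

Each per-sample bin could then be written to its own fastq file for further processing.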
  • insert size filtering on the mate pairs was applied. The insert size was determined by comparing alignment of paired sequence reads. For variant calling, extracted sequences were required to have an insert size greater than [40 + the length of Read 1]. After insert size filtering, variant calling was performed using SAMtools and BCFtools. A sequence pileup was performed against the human genome (hg19) using SAMtools mpileup with a mapping quality threshold of 50.
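The insert size rule above can be written as a one-line predicate. The sketch below only restates the stated threshold (insert size greater than 40 plus the length of Read 1); how the insert size itself is derived from the paired alignments is not modeled, and the function name is hypothetical.

```python
def passes_insert_size_filter(insert_size: int, read1_length: int, margin: int = 40) -> bool:
    """Keep a read pair only if its insert size exceeds margin + Read 1 length."""
    return insert_size > margin + read1_length


# With 40-base reads the threshold is 80, so 81 passes and 80 does not.
print(passes_insert_size_filter(81, 40), passes_insert_size_filter(80, 40))
```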
  • BCFtools view was used to genotype base positions and data was filtered using vcfutils.pl, a variant filter perl script provided in the SAMtools package.
  • the vcfutils varFilter conditions were: i) coverage of 10 or greater, ii) removal of the strand bias filter (since OS-Seq is a strand-specific capture method), iii) forcing the script to output both reference and non-reference positions.
  • Reference and non-reference calls were used for comparisons with the Affymetrix SNP 6.0 array data. Genotyped positions were filtered to have a Phred-like quality score above 50.
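The two numeric thresholds described above (coverage of 10 or greater, Phred-like quality above 50) can be restated as a plain filter. This sketch does not reproduce the behavior of vcfutils.pl or BCFtools; the `Call` record and its field names are hypothetical, and the example calls are invented.

```python
from typing import List, NamedTuple


class Call(NamedTuple):
    """Minimal genotyped position; field names are illustrative."""
    chrom: str
    pos: int
    depth: int
    qual: float


def filter_calls(calls: List[Call], min_depth: int = 10, min_qual: float = 50.0) -> List[Call]:
    """Keep calls with depth >= min_depth and quality strictly above min_qual."""
    return [c for c in calls if c.depth >= min_depth and c.qual > min_qual]


# Invented example calls.
calls = [
    Call("chr1", 100, depth=10, qual=60.0),  # kept
    Call("chr1", 200, depth=9, qual=99.0),   # dropped: coverage below 10
    Call("chr1", 300, depth=50, qual=50.0),  # dropped: quality not above 50
]
print(filter_calls(calls))
```

Both reference and non-reference positions would pass through the same filter, since the comparison with array data uses both kinds of calls.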
  • variant calls of the NA18507 data were compared to calls from variants identified from a complete genome sequence analysis and Hapmap genotyping data (www.hapmap.org). Comparisons of OS-Seq data and Affymetrix SNP 6.0 array data were made using perl scripts. dbSNP131 was used for SNP annotation.
  • Ad_top_FC_capture_A_tail:
  • Ad_bot_FC_capture_A_tail 5' - GATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCG - 3' (SEQ ID NO: 39)
  • Flow cell primer 'C' 5' - GATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCG - 3' (SEQ ID NO: 39)
  • OS-Seq adaptor library amplification (Ad_top_FC_capture_A_tail, single primer PCR is used to amplify the adaptor library)
  • AAAGAGTGTAGATCTCG - 3' (captured DNA) (SEQ ID NO:64)
  • AAAGAGTGTAGATCTCG - 3' (captured DNA) (SEQ ID NO:65)
  • OS-Seq_Library (there is 12-mer homology between the OS-Seq adaptor and Oligo-C)
  • AAAGAGTGTAGATCTCG - 3' (captured DNA) (SEQ ID NO:66)
  • OS-Seq is an integrated approach in which both capture and sequencing of genomic targets are performed on the NGS solid phase support, such as the Illumina flow cell (Fig. 1a).
  • a single-adapter sequencing library is prepared from genomic DNA and target-specific oligonucleotides are synthesized and used to construct primer-probes on the flow cell. Then, immobilized primer-probes on the flow cell are used to capture single molecule targets from a single-adapter genomic DNA library.
  • Processing of OS-Seq involves three steps: the Illumina sequencing system is modified to contain target-specific primer-probes, targets are captured from a single-adapter library, and immobilized fragments are finalized for sequencing (Fig. 1b).
  • To prepare the capture substrate we molecularly re-engineer the Illumina flow cell by modifying a subset of the existing primer lawn to become target-specific primer-probes.
  • To create these primer-probes we hybridize the 3' universal sequence of a complex pool of oligonucleotides to its complement on the flow cell and extend the immobilized primer using a DNA polymerase extension reaction. The result is a set of randomly placed, target-specific primer-probes, which are fixed onto the flow cell surface.
  • the primer-probes specifically hybridize to target complementary sequences within the single-adapter genomic DNA library; after hybridization, the primer-probes then function as primers for another DNA polymerase extension reaction.
  • the extension step effectively captures the target sequence.
  • a denaturation step is performed followed by low-heat hybridization at 40°C to stabilize the sequencing library adapter to its complement on the flow cell, which creates a bridge structure.
  • a third DNA polymerase extension reaction incorporates additional sequence to the 3' ends, creating two molecules capable of solid phase amplification.
  • captured molecules are bridge amplified, processed and sequenced using the standard sequencing protocol from the Illumina NGS system. A detailed description of the molecular biology steps in OS-Seq is given above, and the Illumina Cluster Station programs for OS-Seq are modified accordingly.
  • For high-throughput production of OS-Seq-11k, we synthesized the oligonucleotides on a programmable microarray. These array-synthesized oligonucleotides require amplification for processing and for obtaining sufficient material for OS-Seq (Fig. 4). Post-processed, OS-Seq oligonucleotides contain a target-specific 40-mer complementary to the 5' end of the targeted region (Fig. 5). These oligonucleotides also contain sequence required for annealing the paired-end sequencing primer and for hybridization to the immobilized primer lawn on the flow cell.
  • OS-Seq SNVs called from captured on-target region 105 985 871 727
  • OS-Seq primer-probes are strand-specific and only capture the 5' ends of the DNA targets (Fig. 6).
  • the median coverage profile of all primer-probes in OS-Seq-366 (Fig. 1a) illustrates how sequence is captured up to 1 kb downstream from the primer-probe.
  • a bias towards smaller insert sizes was detected: for OS-Seq-366, 50% of targeted reads mapped within 283 bases from the primer-probes. In both assays, additional reads beyond the 1 kb interval and as far distant as 1.7 kb were identified.
  • sequence reads beyond 1 kb represent the tail end of the capture distribution from any given primer-probe and were less than 0.15% of the overall sequence data for both OS-Seq-366 and OS-Seq-11k. It was also observed that the characteristics of the coverage distribution are correlated with the fragment size introduced during library creation and with size constraints inherent to bridge formation and solid-phase PCR (Fig. 6). Also, introducing a higher molar concentration of the single-adapter library, sequencing additional lanes or using longer reads can increase coverage along the target.
  • On-target reads were defined as Read 1 sequences mapping within 1 kb of a primer-probe. Using these on-target coverage criteria, 86.9% of 40 base reads in OS-Seq-366 and 93.3% of 53 base reads in OS-Seq-11k were on-target (Table 1). OS-Seq-11k showed improved specificity given efforts to refine the in-silico design of the primer-probes. Specifically, for OS-Seq-11k in-silico primer-probe selection, a repeat masking filter was used, which resulted in fewer off-target reads.
  • the OS-Seq-11k assay showed increased sequence coverage on exons due to an improvement of the primer-probe design over the OS-Seq-366 design; specifically, the OS-Seq-11k design tiled primer-probes across exons larger than 500 bases.
  • OS-Seq primer-probes were sorted based on the observed capture yields, and the distributions within OS-Seq-366 and OS-Seq-11k are presented in an overlay fashion in Fig. 1d. In OS-Seq-366, it was observed that 100% of the primer-probes had a yield minimum of one sequence read and the yields of 89.6% of the primer-probes were within a 10-fold range.
  • In OS-Seq-11k, 95.7% of primer-probes had a capture yield minimum of one sequence read and 54% of the primer-probes had a yield within a 10-fold range.
  • OS-Seq-366 oligonucleotides were column-synthesized and quantified separately prior to pooling, which ensured that each target-specific sequence was in equimolar concentration in the primer-probe construction step. Higher variance in primer-probe yields for OS-Seq-11k is most likely attributable to amplification bias introduced during PCR of the microarray-synthesized oligonucleotides used for primer-probe creation.
  • OS-Seq-11k analysis was also applied to genomic DNA derived from a matched normal
  • The capture efficiency of individual primer-probes within the OS-Seq-366 and OS-Seq-11k assays was investigated, and the performance of each primer-probe was assessed.
  • a unique feature of OS-Seq is that captured genomic sequences can be matched to their corresponding primer-probes when sequenced with paired-ends.
  • Read 1 originates from the 3' end of the captured target and Read 2 begins at the OS-Seq primer-probe synthetic sequence.
  • Read 1 always represents the captured genomic DNA sequence while Read 2 functionally serves as a molecular barcode for a distinct primer-probe. This enables the identification of the exact OS-Seq primer-probe that mediated the targeting, and facilitates the assessment of the performance of individual primer-probes.
  • OS-Seq The OS-Seq technology was developed for streamlined and highly scalable targeted resequencing.
  • the OS-Seq technology enables one to create custom targeted resequencing assays.
  • the design and production of the primer-probe oligonucleotides is relatively straightforward and target regions can be selected simply by using balanced GC and non-repetitive sequence.
  • Programmable microarray synthesis resources can be used to generate customized and complex oligonucleotide libraries en masse.
  • traditional oligonucleotide synthesis methods can be used to create customized assays for smaller target gene sets. While our largest targeting assay covered the exons and adjacent sequence of 344 genes, we believe that OS-Seq can be significantly scaled up to larger target contents. From the OS-Seq-366 data we estimated that there was over 2,000-fold excess of primer-probes compared to target fragments in the hybridization mix inside the flow cell. During 20-hour hybridization, we estimate that 4.9% of all potential targets within the library were captured for sequencing. We have also tested that the concentration of oligonucleotides can be increased at least 10-fold and the concentration of the sequencing library can be increased 5-fold (data not shown) without compromising cluster formation.
  • OS-Seq sample preparation is straightforward: it can be completed in one day and is readily automated (Fig. 9). In regard to labor, using OS-Seq compares favorably to executing a shotgun sequencing experiment. Because residual adapters are not hybridizing to the flow cell during capture, OS-Seq libraries can use DNA fragments of varying sizes without the necessity of narrow size purification by physical separation methods. Only a single adapter needs to be added to the 5' ends of a genomic DNA fragment. The single-adapter design also readily lends itself to indexing with introduction of a molecular barcode. This feature allows straightforward sample multiplexing of sequencing assays and has many potential applications. For example, matched normal tumor analysis occurs in the same capture reaction, which may reduce biases.
  • OS-Seq is particularly useful for translational studies and clinical diagnostics by enabling high-throughput analysis of candidate genes and identification of clinically actionable target regions.
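By way of illustration only, the on-target criterion quoted in the excerpts above (a Read 1 sequence mapping within 1 kb of a primer-probe) may be sketched in Python as follows. The data layout of (chromosome, position) pairs, the function names and the use of an unstranded distance are assumptions made for this sketch and are not taken from the disclosure.

```python
from bisect import bisect_left

WINDOW = 1000  # "within 1 kb of a primer-probe", per the text


def build_index(probes):
    """probes: iterable of (chrom, position). Returns chrom -> sorted positions."""
    index = {}
    for chrom, pos in probes:
        index.setdefault(chrom, []).append(pos)
    for positions in index.values():
        positions.sort()
    return index


def is_on_target(index, chrom, read_pos, window=WINDOW):
    """True if any primer-probe on chrom lies within `window` bases of read_pos."""
    positions = index.get(chrom, [])
    # First probe at or after (read_pos - window); it must not exceed read_pos + window.
    i = bisect_left(positions, read_pos - window)
    return i < len(positions) and positions[i] <= read_pos + window


def on_target_fraction(index, reads):
    """reads: iterable of (chrom, mapped position of Read 1)."""
    reads = list(reads)
    if not reads:
        return 0.0
    hits = sum(is_on_target(index, chrom, pos) for chrom, pos in reads)
    return hits / len(reads)
```

A strand-aware variant would restrict the window to positions downstream of each primer-probe; that refinement is omitted here for brevity.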

Abstract

Certain embodiments provide a method for capturing a genomic fragment. The method may comprise: obtaining a substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides; hybridizing a first member of the first population of surface-bound oligonucleotides to a selection oligonucleotide comprising a region that hybridizes with the first member and a region that contains a genomic sequence; extending the first member of the first population of surface-bound oligonucleotides to produce a support-bound selection primer that comprises a sequence that is complementary to the genomic sequence; hybridizing the support-bound selection primer to a nucleic acid fragment comprising the genomic sequence; extending the support-bound selection primer to produce an extension product that contains a sequence that flanks the genomic sequence, e.g., in a genome; and amplifying the extension product on the substrate.

Description

DIRECT CAPTURE, AMPLIFICATION AND SEQUENCING OF TARGET DNA USING IMMOBILIZED PRIMERS
CROSS-REFERENCING
This application claims the benefit of U.S. provisional patent application serial nos. 61/386,390, filed on September 24, 2010, and 61/485,062 filed on May 11, 2011, which applications are incorporated herein in their entirety.
GOVERNMENT RIGHTS
This invention was made with Government support under contract HG000205 awarded by the National Institutes of Health. The Government has certain rights in this invention.
BACKGROUND
In many sequencing methods, particularly re-sequencing methods (i.e., methods in which a locus is re-sequenced), a target is first captured and then sequenced. Several target capture methodologies have been developed and integrated with high throughput sequencing systems. Specifically, hybridization-based assays using beads or microarrays and in-solution based techniques using molecular inversion probes or genomic circularization oligonucleotides can be applied to capture target DNA. Captured DNA is then prepared for sequencing. Complicated molecular biology protocols are often employed to prepare the enriched DNA sample and in certain cases production of the sequencing library involves many enzymatic reactions, purification steps and size selection by gel electrophoresis. The sample preparation process for target capture DNA sequencing can be labor intensive and subsequent sample manipulations can cause bias in the DNA content and increase the sequencing error rate.
SUMMARY
Provided herein are methods for capturing and amplifying a nucleic acid fragment, e.g., a genomic fragment or cDNA made from RNA. Kits for practicing the method are also provided. In certain embodiments, the method comprises: a) obtaining a substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides, wherein the members of the first and second populations of surface-bound oligonucleotides are not spatially addressed on the substrate; b) hybridizing a first member of the first population of surface-bound oligonucleotides to a selection oligonucleotide comprising a region that hybridizes with the first member and a region that contains a genomic sequence; c) extending the first member of the first population of surface-bound oligonucleotides to produce a support-bound selection primer that comprises a sequence that is complementary to the genomic sequence; d) hybridizing the support-bound selection primer to a nucleic acid fragment (e.g., a genomic fragment or cDNA) comprising the genomic sequence; e) extending the support-bound selection primer to produce an extension product that contains a sequence that flanks the genomic sequence, e.g., in the genome; and f) amplifying the extension product on the substrate, e.g., by bridge PCR using unextended members of the first and second populations of surface-bound oligonucleotides, to produce a PCR product.
In certain embodiments, the method comprises: a) obtaining a substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides, wherein the first and second populations of surface-bound oligonucleotides are not spatially addressed on the substrate; b) hybridizing a first member of the first population of surface-bound oligonucleotides to a selection oligonucleotide comprising a region that hybridizes with the first member and a region that contains a genomic sequence; c) extending the first member of the first population of surface-bound oligonucleotides to produce a support-bound selection primer that comprises a sequence that is complementary to the genomic sequence; d) hybridizing the support-bound selection primer to a nucleic acid fragment comprising the genomic sequence; e) extending the support-bound selection primer to produce an extension product that contains a sequence that flanks the genomic sequence, e.g., in a genome; and f) amplifying the extension product, e.g., using bridge PCR on the substrate to produce a PCR product.
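Steps b) through e) above may be pictured with a toy string model, offered only as a hedged sketch. All sequences are written 5' to 3'; the helper names, the example sequences and the exact-match annealing rule are hypothetical and ignore hybridization conditions, adaptors and the bridge PCR of step f).

```python
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return "".join(PAIRS[base] for base in reversed(seq.upper()))


def extend_on_template(primer, template, anneal_len):
    """Anneal the last `anneal_len` bases of `primer` to `template` and extend.

    The primer's 3' end pairs with its complementary site on the template; a
    polymerase then copies the template from that site toward the template's
    5' end. Returns the extended primer, or None if no annealing site exists.
    """
    site = reverse_complement(primer[-anneal_len:])
    idx = template.find(site)
    if idx == -1:
        return None
    return primer + reverse_complement(template[:idx])


# Hypothetical example sequences (not from the disclosure).
surface_primer = "AAAC"
genomic_seq = "GATTACA"
selection_oligo = genomic_seq + reverse_complement(surface_primer)

# Steps b) and c): the surface-bound oligonucleotide anneals to the selection
# oligonucleotide and is extended into a support-bound selection primer.
selection_primer = extend_on_template(surface_primer, selection_oligo, len(surface_primer))

# Steps d) and e): the selection primer anneals to a fragment containing the
# genomic sequence and is extended into the flanking sequence.
fragment = "CCGT" + genomic_seq + "TTTT"
extension_product = extend_on_template(selection_primer, fragment, len(genomic_seq))
```

In this model the selection primer ends in the complement of the genomic sequence, and the extension product additionally carries the complement of the flank that lies 5' of the genomic sequence on the captured strand.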
Depending on how the method is implemented, an adaptor may be either ligated to the genomic fragment prior to hybridization, or to the extension product after the support-bound selection primer is extended. The distal adaptor may hybridize to a surface-bound oligonucleotide (which may itself be an extension product produced by a templated extension of the second population of surface-bound oligonucleotides), thereby allowing bridge PCR to occur. The selection primer may also contain a sequencing primer binding site that can be employed to sequence the PCR product.
The method described above generally finds use in resequencing methods in which the sequence of a reference locus is available and the same locus is to be resequenced in a plurality of test samples. In this utility, a selection oligonucleotide is designed to hybridize to an oligonucleotide on the substrate and a region that flanks the locus to be resequenced. The locus is captured on the substrate and then amplified prior to sequencing. For example, a single locus or multiple different loci (e.g., up to 10, 50, 100, 200 or 1,000 or more loci) may be captured from a sample that is made from one individual or multiple individuals (e.g., up to 10, 50, 100, 200 or 1,000 or more individuals).
In certain embodiments, the method comprises: a) obtaining a substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides, wherein the first and second populations of surface-bound oligonucleotides are randomly interspersed on the substrate and not spatially addressed; b) hybridizing a first member of the first population of surface-bound oligonucleotides to a selection oligonucleotide comprising a region that hybridizes with the first member and a region that contains a genomic sequence; c) extending the first member of the first population of surface-bound oligonucleotides to produce a support-bound selection primer that comprises a sequence that is complementary to the genomic sequence; d) hybridizing the support-bound selection primer to an adaptor-ligated fragment (e.g., an adaptor-ligated genomic fragment) comprising the genomic sequence; e) extending the support-bound selection primer to produce a product that contains a sequence that flanks the genomic sequence (e.g., in a genome) and the sequence of the adaptor of the adaptor-ligated genomic fragment; and f) amplifying the product using bridge PCR to produce a PCR product.
In alternative embodiments, the method may comprise: a) obtaining a substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides, wherein the first and second populations of surface-bound oligonucleotides are randomly interspersed on the substrate and not spatially addressed; b) hybridizing a first member of the first population of surface-bound oligonucleotides to a selection oligonucleotide comprising a region that hybridizes with the first member and a region that contains a genomic sequence; c) extending the first member of the first population of surface-bound oligonucleotides to produce a support-bound selection primer that comprises a sequence that is complementary to the genomic sequence; e) extending the support-bound selection primer to produce a product that contains a sequence that flanks the genomic sequence; f) ligating a double stranded adapter onto the product to produce an adaptor-modified product; and g) amplifying the adaptor-modified product using bridge PCR to produce a PCR product. In particular cases, the method may further comprise: i. ligating the genomic fragments to an adaptor that contains a site for a sequencing primer and a nucleotide sequence that is the same as the second surface-bound oligonucleotides; ii. hybridizing the adaptor-ligated genomic fragments to a first member of the first population of surface-bound oligonucleotides; iii. extending the first member of the first population of surface-bound oligonucleotides to which the adaptor-ligated fragment is hybridized; and iv. hybridizing the adaptor-containing end of the extension product to a second support-bound polynucleotide, thereby producing a bridge and facilitating bridge PCR.
BRIEF DESCRIPTION OF THE FIGURES
Certain aspects of the following detailed description are best understood when read in conjunction with the accompanying drawings. It is emphasized that, according to common practice, the various features of the drawings are not to scale. On the contrary, the dimensions of the various features are arbitrarily expanded or reduced for clarity. Included in the drawings are the following figures:
Fig. 1. An overview of one embodiment of the subject method called "OS-Seq".
(a) OS-Seq is a targeted resequencing method that is seamlessly integrated with the Illumina NGS platform. Target-specific oligonucleotides, a sequencing library and an Illumina cluster generation kit are needed for this method. Capture of targets, processing and sequencing are performed on the NGS system. Data originating from each primer-probe is targeted and strand-specific. Shown here is the median coverage profile for OS-Seq-366.
(b) Processing of OS-Seq involves three steps of hybridization, DNA polymerase-mediated extension and DNA denaturation. Step 1: Target-specific oligonucleotides are used to modify flow cell primers to primer-probes. In the Illumina sequencing system two types of primers (named C and D) are immobilized on a paired-end flow cell. In OS-Seq a subset of D primers are modified to primer-probes using a complex library of oligonucleotides. Oligonucleotides have sequences that hybridize to type D flow cell primers. Hybridized oligonucleotides are then used as a template for DNA polymerase and D primers are extended. After denaturation, target-specific primer-probes are randomly immobilized on the flow cell. Step 2: Genomic targets in a single-adaptor library are captured using primer-probes. Sample preparation for Illumina sequencing involves the addition of specific DNA adapters to the genomic DNA fragments. These adapters incorporate sites for sequencing primers and immobilized flow cell primers. In OS-Seq, we use a modified adapter to prepare single-adapter libraries from genomic DNA. Targets in the single-adaptor library are captured during high heat hybridization to their complementary primer-probes. Captured single-adapter library fragments are used as a template for DNA polymerase and primer-probes are extended. Denaturation releases template DNA from immobilized targets. Step 3: Immobilized targets are rendered to be compatible with Illumina sequencing. In Illumina sequencing, solid-phase amplification of the immobilized sequencing library fragments using C and D primers is required. In OS-Seq, during low heat hybridization the single-adapter tails of the immobilized targets hybridize to type C primers on the flow cell surface, which stabilizes a bridge structure. The 3' ends of immobilized targets and C primers are extended using DNA polymerase. After denaturation, two complementary, immobilized sequencing library fragments are formed that contain complete C and D priming sites and are compatible with solid-phase amplification. After the three steps of OS-Seq, immobilized targets are structurally identical to a standard paired-end Illumina library and are amplified and processed using Illumina's standard kits and protocols. The principles of this method may be employed on other sequencing platforms.
(c) Shown is the coverage profile along the KRAS gene from the OS-Seq-366 assay. Base positions relative to the start of exon 1 are presented on the x-axis and KRAS exons are indicated.
(d) Uniformity assessment of primer-probe yields within column- and array-synthesized oligonucleotides. Uniformity of capture was compared between column-synthesized (blue, n=366) and array-synthesized (red, n=11,742) oligonucleotides. On the x-axis, oligonucleotides are sorted by sequence capture yields; on the y-axis is the normalized primer-probe yield. To calculate normalized yield, each oligonucleotide's yield was divided by the median yield from all oligonucleotides.
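The normalization described for Fig. 1(d), in which each oligonucleotide's yield is divided by the median yield from all oligonucleotides, may be sketched as follows. The dictionary input and the function names are assumptions chosen for illustration.

```python
from statistics import median


def normalized_yields(yields):
    """yields: dict of oligonucleotide id -> read count. Returns id -> yield / median."""
    med = median(yields.values())
    if med == 0:
        raise ValueError("median yield is zero; normalization is undefined")
    return {oligo: count / med for oligo, count in yields.items()}


def sorted_curve(yields):
    """Normalized yields in ascending order, as would be plotted along the x-axis."""
    return sorted(normalized_yields(yields).values())
```

For example, yields of 10, 20 and 40 reads have a median of 20 and normalize to 0.5, 1.0 and 2.0.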
Fig. 2. Sequencing library preparation for OS-Seq. A general scheme of genomic DNA fragmentation, end repair, A-tailing, adaptor ligation and PCR was used in the preparation of OS-Seq libraries.
Fig. 3. Design strategies for OS-Seq. (a) Primer-probes were placed 10 bases from the exon or (b) tiled every 500 bases inside large exons.
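The two placement strategies of Fig. 3 may be sketched as a small coordinate calculation. The sketch assumes a half-open exon interval on a single strand, treats each returned coordinate as the anchor position of one primer-probe, and places the flanking primer-probe upstream of the exon start; these conventions and the function name are hypothetical, since the figure legend states only the 10-base offset and the 500-base tiling interval.

```python
OFFSET = 10      # bases between the primer-probe and the exon (Fig. 3a)
TILE_STEP = 500  # spacing of tiled primer-probes inside large exons (Fig. 3b)


def probe_anchors(exon_start, exon_end, offset=OFFSET, step=TILE_STEP):
    """Return anchor coordinates for one strand of one exon [exon_start, exon_end)."""
    if exon_end <= exon_start:
        raise ValueError("exon_end must be greater than exon_start")
    anchors = [exon_start - offset]       # flanking primer-probe
    if exon_end - exon_start > step:      # only exons larger than `step` are tiled
        anchors.extend(range(exon_start + step, exon_end, step))
    return anchors
```

Under these assumptions a 200-base exon starting at position 1000 receives a single anchor at 990, while a 1,200-base exon starting at the same position receives anchors at 990, 1500 and 2000.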
Fig. 4. Generation of OS-Seq oligonucleotides. Column-synthesis yielded a large amount of mature 101-mer OS-Seq oligonucleotides that were readily usable in the assay. Microarray-synthesis was applied to generate high-content oligonucleotide pools. Precursor oligonucleotides were amplified using primers that incorporated additional sequences into oligonucleotides. Uracil-excision was applied to cleave the amplification primer site from the coding strands of the OS-Seq oligonucleotides.
Fig. 5. Structures of oligonucleotide components in OS-Seq. (a) Mature 101-mer OS-Seq oligonucleotides contained a target-specific site and sequences encoding for sequencing primer 2 and flow cell primer 'D'. (b) Microarray-synthesized oligonucleotides were amplified using primers that incorporated uracil at the 5' end of the OS-Seq oligonucleotide and additional active sites for sequencing. (c) Adapter for OS-Seq contained a T-overhang for sticky-end ligation to the A-tailed genomic fragments. In addition, indexing sequences as well as a flow cell primer 'C' site were present in the dsDNA adapter.
Fig. 6. Description of insert size distributions encountered in OS-Seq data. Fragmentation of genomic DNA produces fragments between 200 bases and 2 kb. Sequencing library preparation adds a common adapter to the ends of the fragments. PCR amplification distorts the fragment size distribution further. Target sites are randomly distributed within the single-adapter library fragments. Library fragments were immobilized on the flow cell and the distance between primer-probe and adapter defined the size of a genomic DNA insert. Bridge-PCR is applied to amplify immobilized target DNA (generally, solid-phase PCR preferentially amplifies shorter fragments). After cluster amplification and processing, immobilized fragments are sequenced using two sites. Read 1 originates from the genomic DNA and Read 2 is derived from the synthetic primer-probes. Read 1 is used for assessing the genomic DNA sequence from OS-Seq data.
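Because Read 2 is derived from the synthetic primer-probe, each read pair can in principle be attributed to the primer-probe that captured it. The following is a minimal sketch of such an assignment by exact prefix lookup. The default key length of 40 follows the target-specific 40-mer mentioned in the description; the assumption that Read 2 begins with the same string that is stored for each primer-probe, as well as all function names, are hypothetical.

```python
from collections import Counter


def build_lookup(probe_seqs, k=40):
    """probe_seqs: dict of probe id -> expected Read 2 sequence. Returns prefix -> id."""
    lookup = {}
    for probe_id, seq in probe_seqs.items():
        key = seq[:k].upper()
        if key in lookup:
            raise ValueError(f"probes {lookup[key]} and {probe_id} share a prefix")
        lookup[key] = probe_id
    return lookup


def assign_pairs(lookup, read_pairs, k=40):
    """read_pairs: iterable of (read1, read2).

    Returns a list of (probe id, read1) for pairs whose Read 2 prefix matches a
    known primer-probe, together with a per-probe count of assigned pairs.
    """
    assigned = []
    counts = Counter()
    for read1, read2 in read_pairs:
        probe_id = lookup.get(read2[:k].upper())
        if probe_id is not None:
            assigned.append((probe_id, read1))
            counts[probe_id] += 1
    return assigned, counts
```

The per-probe counts produced this way correspond to the primer-probe capture yields discussed elsewhere in the description; tolerance for sequencing errors in Read 2 is deliberately left out of the sketch.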
Fig. 7. Reproducibility of OS-Seq. (a) Technical reproducibility of OS-Seq. Two identical libraries were analyzed using OS-Seq. Sequencing yields of individual primer-probes were compared between technical replicates. (b) Biological reproducibility of OS-Seq. Two different genomic DNA libraries were prepared using indexed adapters. Libraries were analyzed in the same OS-Seq experiment. In the figure, primer-probe specific capture yields are compared between two independent biological replicates.
Fig. 8A-B. Effect of GC content on targeting yield. To analyze the effect of GC content on the efficiency of primer-probes, we determined the GC content of each target-specific primer-probe sequence. We classified primer-probes that captured 0 targets as failing. Proportions of failing primer-probes were compared between different % GC content categories. The x-axis presents the percentages of the sorted GC categories and the y-axis reports the proportion of failed primer-probes within each GC content category.
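The analysis described for Fig. 8 may be sketched as follows: the GC content of each primer-probe sequence is computed, primer-probes are grouped into GC categories, and the proportion with zero captured targets is reported per category. The 10-percentage-point bin width, the dictionary inputs and the function names are assumptions; the legend does not state how the categories were drawn.

```python
def gc_percent(seq):
    """Percentage of G and C bases in a sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def failure_by_gc(probes, yields, bin_width=10):
    """probes: id -> sequence; yields: id -> captured read count.

    Returns {bin lower bound (% GC): proportion of primer-probes with zero yield}.
    A primer-probe missing from `yields` is treated as having captured nothing.
    """
    totals, failed = {}, {}
    for probe_id, seq in probes.items():
        # 100% GC is folded into the top bin rather than opening a bin of its own.
        lower = min(int(gc_percent(seq) // bin_width) * bin_width, 100 - bin_width)
        totals[lower] = totals.get(lower, 0) + 1
        if yields.get(probe_id, 0) == 0:
            failed[lower] = failed.get(lower, 0) + 1
    return {b: failed.get(b, 0) / totals[b] for b in sorted(totals)}
```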
Fig. 9A-B. Comparison of the processing workflow for OS-Seq and shotgun library creation methods.
DEFINITIONS
Unless defined otherwise herein, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention, the preferred methods and materials are described.
All patents and publications, including all sequences disclosed within such patents and publications, referred to herein are expressly incorporated by reference.
Numeric ranges are inclusive of the numbers defining the range. Unless otherwise indicated, nucleic acids are written left to right in 5' to 3' orientation; amino acid sequences are written left to right in amino to carboxy orientation, respectively.
The headings provided herein are not limitations of the various aspects or embodiments of the invention. Accordingly, the terms defined immediately below are more fully defined by reference to the specification as a whole.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Singleton, et al., DICTIONARY OF MICROBIOLOGY AND MOLECULAR BIOLOGY, 2D ED., John Wiley and Sons, New York (1994), and Hale & Markham, THE HARPER COLLINS DICTIONARY OF BIOLOGY, Harper Perennial, N.Y. (1991) provide one of skill with the general meaning of many of the terms used herein. Still, certain terms are defined below for the sake of clarity and ease of reference.
The term "sample" as used herein relates to a material or mixture of materials, typically, although not necessarily, in liquid form, containing one or more analytes of interest. The nucleic acid samples used herein may be complex in that they contain multiple different molecules that contain sequences. Fragmented genomic DNA and cDNA made from mRNA from a mammal (e.g., mouse or human) are types of complex samples.
Complex samples may have more than 10^4, 10^5, 10^6 or 10^7 different nucleic acid molecules. A DNA target may originate from any source such as genomic DNA, cDNA (from RNA) or artificial DNA constructs. Any sample containing nucleic acid, e.g., genomic DNA made from tissue culture cells, a sample of tissue, or an FPET sample, may be employed herein.
The term "nucleotide" is intended to include those moieties that contain not only the known purine and pyrimidine bases, but also other heterocyclic bases that have been modified. Such modifications include methylated purines or pyrimidines, acylated purines or pyrimidines, alkylated riboses or other heterocycles. In addition, the term "nucleotide" includes those moieties that contain hapten or fluorescent labels and may contain not only conventional ribose and deoxyribose sugars, but other sugars as well. Modified nucleosides or nucleotides also include modifications on the sugar moiety, e.g., wherein one or more of the hydroxyl groups are replaced with halogen atoms or aliphatic groups, are functionalized as ethers, amines, or the like.
The terms "nucleic acid" and "polynucleotide" are used interchangeably herein to describe a polymer of any length, e.g., greater than about 2 bases, greater than about 10 bases, greater than about 100 bases, greater than about 500 bases, greater than 1000 bases, up to about 10,000 or more bases composed of nucleotides, e.g., deoxyribonucleotides or ribonucleotides, and may be produced enzymatically or synthetically (e.g., PNA as described in U.S. Patent No. 5,948,902 and the references cited therein) which can hybridize with naturally occurring nucleic acids in a sequence specific manner analogous to that of two naturally occurring nucleic acids, e.g., can participate in Watson-Crick base pairing interactions. Naturally-occurring nucleotides include guanine, cytosine, adenine and thymine (G, C, A and T, respectively).
The term "nucleic acid sample," as used herein denotes a sample containing nucleic acids.
The term "target polynucleotide," as used herein, refers to a polynucleotide of interest under study. In certain embodiments, a target polynucleotide contains one or more sequences that are of interest and under study.
The term "oligonucleotide" as used herein denotes a single-stranded multimer of nucleotides of from about 2 to 200 nucleotides, up to 500 nucleotides in length. Oligonucleotides may be synthetic or may be made enzymatically, and, in some embodiments, are 30 to 150 nucleotides in length. Oligonucleotides may contain ribonucleotide monomers (i.e., may be oligoribonucleotides) or deoxyribonucleotide monomers. An oligonucleotide may be 10 to 20, 11 to 30, 31 to 40, 41 to 50, 51 to 60, 61 to 70, 71 to 80, 80 to 100, 100 to 150 or 150 to 200 nucleotides in length, for example.
The term "hybridization" refers to the process by which a strand of nucleic acid joins with a complementary strand through base pairing as known in the art. A nucleic acid is considered to be "selectively hybridizable" to a reference nucleic acid sequence if the two sequences specifically hybridize to one another under moderate to high stringency hybridization and wash conditions. Moderate and high stringency hybridization conditions are known (see, e.g., Ausubel, et al., Short Protocols in Molecular Biology, 3rd ed., Wiley & Sons 1995 and Sambrook et al., Molecular Cloning: A Laboratory Manual, Third Edition, 2001 Cold Spring Harbor, N.Y.). One example of high stringency conditions includes hybridization at about 42 °C in 50% formamide, 5X SSC, 5X Denhardt's solution, 0.5% SDS and 100 µg/ml denatured carrier DNA followed by washing two times in 2X SSC and 0.5% SDS at room temperature and two additional times in 0.1X SSC and 0.5% SDS at 42 °C.
The term "duplex," or "duplexed," as used herein, describes two complementary polynucleotides that are base-paired, i.e., hybridized together.
The term "amplifying" as used herein refers to generating one or more copies of a target nucleic acid, using the target nucleic acid as a template.
The terms "determining", "measuring", "evaluating", "assessing," "assaying," and "analyzing" are used interchangeably herein to refer to any form of measurement, and include determining if an element is present or not. These terms include both quantitative and/or qualitative determinations. Assessing may be relative or absolute. "Assessing the presence of" includes determining the amount of something present, as well as determining whether it is present or absent.
The term "using" has its conventional meaning, and, as such, means employing, e.g., putting into service, a method or composition to attain an end. For example, if a program is used to create a file, a program is executed to make a file, the file usually being the output of the program. In another example, if a computer file is used, it is usually accessed, read, and the information stored in the file employed to attain an end. Similarly if a unique identifier, e.g., a barcode is used, the unique identifier is usually read to identify, for example, an object or file associated with the unique identifier.
As used herein, the term "Tm" refers to the melting temperature of an oligonucleotide duplex at which half of the duplexes remain hybridized and half of the duplexes dissociate into single strands. The Tm of an oligonucleotide duplex may be experimentally determined or predicted using the following formula: Tm = 81.5 + 16.6(log10[Na+]) + 0.41(fraction G+C) - (60/N), where N is the chain length and [Na+] is less than 1 M. See Sambrook and Russell (2001; Molecular Cloning: A Laboratory Manual, 3rd ed., Cold Spring Harbor Press, Cold Spring Harbor N.Y., ch. 10). Other formulas for predicting Tm of oligonucleotide duplexes exist and one formula may be more or less appropriate for a given condition or set of conditions.
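The Tm formula above may be transcribed directly into code, as in the hedged sketch below. The coefficients and the G+C term are applied exactly as written in the preceding paragraph; the function name and the input checks are assumptions. As the paragraph itself notes, other published formulas use different terms, and no claim is made here that this transcription is the most appropriate one for a given condition.

```python
import math


def predict_tm(seq, na_molar):
    """Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(fraction G+C) - 60/N, as written in the text.

    seq: oligonucleotide sequence (N is its length); na_molar: [Na+] in mol/L.
    """
    n = len(seq)
    if n == 0:
        raise ValueError("empty sequence")
    if not 0 < na_molar < 1:
        raise ValueError("the formula is stated for [Na+] below 1 M")
    gc_fraction = sum(base in "GC" for base in seq.upper()) / n
    return 81.5 + 16.6 * math.log10(na_molar) + 0.41 * gc_fraction - 60 / n
```

A call such as `predict_tm("ACGT" * 10, 0.05)` evaluates the expression for a 40-mer with a G+C fraction of 0.5 at 50 mM sodium.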
The term "free in solution," as used here, describes a molecule, such as a polynucleotide, that is not bound or tethered to another molecule.
The term "denaturing," as used herein, refers to the separation of a nucleic acid duplex into two single strands.
The term "genomic sequence", as used herein, refers to a sequence that occurs in a genome. Because RNAs are transcribed from a genome, this term encompasses sequences that exist in the nuclear genome of an organism, as well as sequences that are present in a cDNA copy of an RNA (e.g., an mRNA) transcribed from such a genome.
The term "genomic fragment", as used herein, refers to a region of a genome, e.g., an animal or plant genome such as the genome of a human, monkey, rat, fish or insect or plant. A genomic fragment may or may not be adaptor ligated. A genomic fragment may be adaptor ligated (in which case it has an adaptor ligated to one or both ends of the fragment, to at least the 5' end of a molecule), or non-adaptor ligated.
In certain cases, an oligonucleotide used in the method described herein may be designed using a reference genomic region, i.e., a genomic region of known nucleotide sequence, e.g., a chromosomal region whose sequence is deposited at NCBI's Genbank database or other database, for example. Such an oligonucleotide may be employed in an assay that uses a sample containing a test genome, where the test genome contains a binding site for the oligonucleotide.
The term "ligating", as used herein, refers to the enzymatically catalyzed joining of the terminal nucleotide at the 5' end of a first DNA molecule to the terminal nucleotide at the 3' end of a second DNA molecule.
The term "adaptor" refers to double stranded as well as single stranded molecules.
A "plurality" contains at least 2 members. In certain cases, a plurality may have at least 10, at least 100, at least 1,000, at least 10,000, at least 100,000, at least 10^6, at least 10^7, at least 10^8 or at least 10^9 or more members.
If two nucleic acids are "complementary", each base of one of the nucleic acids base pairs with corresponding nucleotides in the other nucleic acid. The terms "complementary" and "perfectly complementary" are used synonymously herein.
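The definition of "complementary" above may be illustrated with a short check for two DNA strings written 5' to 3'. The sketch assumes antiparallel Watson-Crick pairing of the four standard DNA bases only; the function names are hypothetical.

```python
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA string written 5' to 3'."""
    return "".join(PAIRS[base] for base in reversed(seq.upper()))


def is_perfectly_complementary(a, b):
    """True if every base of `a` pairs with the corresponding base of `b`
    when the two strands are aligned antiparallel."""
    return len(a) == len(b) and reverse_complement(a) == b.upper()
```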
A "primer binding site" refers to a site to which a primer hybridizes in an oligonucleotide or a complementary strand thereof.
The term "separating", as used herein, refers to physical separation of two elements (e.g., by size or affinity, etc.) as well as degradation of one element, leaving the other intact.
The term "sequencing", as used herein, refers to a method by which the identity of at least 10 consecutive nucleotides (e.g., the identity of at least 20, at least 50, at least 100 or at least 200 or more consecutive nucleotides) of a polynucleotide is obtained.
The term "not spatially addressed", in the context of a substrate containing surface-bound populations of oligonucleotides that are not spatially addressed, refers to a substrate that contains a surface containing different oligonucleotide molecules that are in no particular order or position relative to one another, i.e., at random positions or randomly interspersed with one another. Such a substrate need not be planar and in certain cases may be in the form of a bead. Substrates that contain spatially or optically addressed populations of a single oligonucleotide (e.g., microarrays and encoded beads etc.) are excluded by this definition. A substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides, wherein the first and second populations of surface-bound oligonucleotides are not spatially addressed, refers to a substrate containing at least two populations of different oligonucleotides that are randomly distributed across the substrate. A substrate may be planar or in the form of beads, for example.
The term "adaptor-ligated", as used herein, refers to a nucleic acid that has been ligated to an adaptor. The adaptor can be ligated to a 5' end or a 3' end of a nucleic acid molecule.
The term "extending", as used herein, refers to the extension of a primer by the addition of nucleotides using a polymerase. If a primer that is annealed to a nucleic acid is extended, the nucleic acid acts as a template for the extension reaction.
The term "bridge PCR" refers to a solid-phase polymerase chain reaction in which the primers that are extended in the reaction are tethered to a substrate by their 5' ends. During amplification, the amplicons form a bridge between the tethered primers. Bridge PCR (which may also be referred to as "cluster PCR") is used in Illumina's Solexa platform. Bridge PCR and Illumina's Solexa platform are generally described in a variety of publications, e.g., Gudmundsson et al (Nat. Genet. 2009 41:1122-6), Out et al (Hum. Mutat. 2009 30:1703-12) and Turner (Nat. Methods 2009 6:315-6), US patent 7,115,400, and patent application publication nos. US20080160580 and US20080286795.
The term "barcode sequence", as used herein, refers to a unique sequence of nucleotides that is used to identify and/or track the source of a polynucleotide in a reaction. A barcode sequence may be at the 5'-end or 3'-end of an oligonucleotide. Barcode sequences may vary widely in size and composition; the following references provide guidance for selecting sets of barcode sequences appropriate for particular embodiments: Brenner, U.S. Pat. No. 5,635,400; Brenner et al, Proc. Natl. Acad. Sci., 97: 1665-1670 (2000); Shoemaker et al, Nature Genetics, 14: 450-456 (1996); Morris et al, European patent publication 0799897A1; Wallace, U.S. Pat. No. 5,981,179; and the like. In particular embodiments, a barcode sequence may have a length in the range of from 4 to 36 nucleotides, or from 6 to 30 nucleotides, or from 8 to 20 nucleotides.
Other definitions of terms may appear throughout the specification.
DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENTS
Certain features of the subject method are described with reference to Fig. 1, which illustrates an embodiment in which adaptors are ligated to a fragment prior to hybridization of the fragment to the substrate. In alternative embodiments, an adaptor may be added later in the protocol. The method generally comprises obtaining a substrate that contains at least two surface bound oligonucleotides of differing sequence that are spatially interspersed with one another. Such substrates are currently employed in Illumina's Solexa sequencing technology and are described in a variety of references, e.g., US patent no. 7,115,400 and publication nos. US20080160580 and US20080286795, which are incorporated by reference for such disclosure. Some of the embodiments set forth below may describe the use of the method to isolate fragments of a genome. These embodiments may be readily adapted to other types of sequences, e.g., cDNA or synthetic DNA.
In certain embodiments, a first member of the first population of surface-bound oligonucleotides is hybridized to a selection oligonucleotide that contains a) a region that hybridizes with the first member and a sequencing primer site, and b) a region that contains a target genomic sequence. The amount of selection oligonucleotide used in this step may be optimized such that a sufficient number of oligonucleotides of the first population remain unhybridized to the selection oligonucleotide and available to be used in the bridge PCR step that occurs later in the protocol. The first member of the first population of surface-bound oligonucleotides is extended to produce a duplex that contains a support-bound selection primer that contains a sequence that is complementary to the target genomic sequence. The selection oligonucleotide is removed by denaturation to leave the extended support-bound selection primer. The extended support-bound selection primer is then hybridized with an adapter-ligated genomic fragment (which may be made by fragmenting genomic DNA chemically, physically or using an enzyme and then ligating adaptors to the ends of the resultant fragments) containing the target genomic sequence, sequence that flanks the target genomic sequence, and an adaptor sequence at the 5' end of one or both of the strands. The support-bound selection primer is extended to produce a product that contains a sequence that flanks the genomic sequence in the genome and the sequence of the adaptor of the adaptor-ligated genomic fragment.
In some embodiments, the adaptor of the adaptor-ligated genomic fragment may hybridize to the second population of surface-bound oligonucleotides. However, in certain cases, before amplification, the second population of surface-bound oligonucleotides may be hybridized to a modifying oligonucleotide that contains a) a region that hybridizes with a second member of the second population and b) a region that contains adaptor sequence. The amount of modifying oligonucleotide used in this step may be optimized such that a sufficient number of product molecules hybridize. The second member of the second population of surface-bound oligonucleotides may be extended to produce a duplex that contains a support-bound adapter primer that contains a sequence that is complementary to the adapter sequence. The modifying oligonucleotide is removed by denaturation to leave the support-bound adapter primer. The product may then be amplified by bridge PCR.
As illustrated in Fig. 1b, the product is amplified by a first unextended surface-bound oligonucleotide as well as a second surface-bound oligonucleotide to produce a PCR product. In certain cases, the genomic fragment is an adaptor-ligated genomic fragment comprising a 5' end adaptor. In these cases, members of the second population of the surface-bound oligonucleotides hybridize to the complement of the adaptor. In alternative embodiments, an adaptor may be ligated onto the extension product, thereby placing an adaptor that hybridizes to the second population of the surface-bound oligonucleotides onto the 3' end of the extension product. In other embodiments, the amplifying is done using: a) unextended members of the first population of surface-bound oligonucleotides; and b) support-bound primers that are made by: i. hybridizing members of the second population of surface-bound oligonucleotides to an oligonucleotide comprising a region that hybridizes with the members of the second population of surface-bound oligonucleotides and a region that is complementary to an adaptor; and ii. extending the members of the second population of surface-bound oligonucleotides to produce support-bound primers that hybridize to the 5' end of the extension product.
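The modification of a surface-bound oligonucleotide by a selection oligonucleotide may be sketched in silico as follows. This is an illustrative model only: the sequences are arbitrary toy strings, the function names are hypothetical, and the 5' to 3' layout of the selection oligonucleotide (target region, then sequencing primer site, then the region that anneals to the surface-bound oligonucleotide) is an assumption chosen so that a 5'-tethered oligonucleotide can be extended from its free 3' end.

```python
_PAIR = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(_PAIR[base] for base in reversed(seq.upper()))


def build_selection_oligo(target: str, seq_primer_site: str,
                          surface_oligo: str) -> str:
    """Assemble a selection oligo (5'->3'): target region, sequencing
    primer site, then the region that anneals to the surface oligo."""
    return target + seq_primer_site + reverse_complement(surface_oligo)


def extend_surface_oligo(surface_oligo: str, template: str) -> str:
    """Model polymerase extension of a 5'-tethered surface oligo that is
    annealed to the 3' end of `template`; returns the extended strand."""
    anneal_site = reverse_complement(surface_oligo)
    if not template.endswith(anneal_site):
        raise ValueError("template 3' end does not anneal to the surface oligo")
    copied = template[: len(template) - len(anneal_site)]
    return surface_oligo + reverse_complement(copied)


# Toy sequences (hypothetical, for illustration only).
surface = "AATGATACGG"
primer_site = "ACACTCTTTC"
target = "GGCTTAACCG"

selection = build_selection_oligo(target, primer_site, surface)
selection_primer = extend_surface_oligo(surface, selection)
```

Under these assumptions the extended strand is the reverse complement of the whole selection oligonucleotide, so its free 3' end carries the complement of the target genomic sequence and is available to capture a fragment containing that sequence.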
In some embodiments, the genomic fragment is an adaptor-ligated genomic fragment comprising a 5' end adaptor, wherein the extending produces an extension product that comprises, on its 3' end, a sequence that is complementary to the adaptor, and wherein members of the second population of the surface-bound oligonucleotides hybridize to the sequence that is complementary to the adaptor during the bridge PCR. In this embodiment, the 5' end adaptor comprises a binding site for a sequencing primer at the end that is ligated to the genomic fragment.
In other embodiments, the method comprises, between steps e) and f), ligating an adaptor onto the 3' end of the extension product, and wherein members of the second population of the surface-bound oligonucleotides hybridize to the adaptor during the bridge PCR. In these embodiments, the adaptor comprises a binding site for a sequencing primer at the end that is ligated to the genomic fragment. In some embodiments, the second population of surface-bound oligonucleotides are made by: i. hybridizing members of an initial second population of surface-bound oligonucleotides to an oligonucleotide comprising a region that hybridizes with the members of the second population of surface-bound oligonucleotides and a region that is complementary to a sequence of the genomic fragment; and ii. extending the members of the initial second population of surface-bound oligonucleotides to produce the second population of surface-bound oligonucleotides.
In some embodiments, the second population of surface-bound oligonucleotides may be made by ligating an oligonucleotide comprising a region that is complementary to a sequence of said nucleic acid fragment to an initial second population of surface-bound oligonucleotides to produce said second population of surface-bound oligonucleotides. This ligation may be facilitated by a splint oligonucleotide that forms a bridge between the two oligonucleotides being ligated. In other words, a modifying oligonucleotide may be introduced by a ligation-based process in which a bridging oligonucleotide is used to guide the modification of the original solid support oligonucleotide to create the support-bound adapter primer. Similarly, the support-bound adapter primer can be created using a similar bridging oligonucleotide to create the primer extension necessary for the target modification.
In some cases the selection oligonucleotide comprises a binding site for a sequencing primer between said region that hybridizes with said first member and said region that contains said genomic sequence.
In some embodiments, the method may further comprise sequencing a first strand of the PCR product to obtain at least part of the nucleotide sequence of the sequence that flanks the genomic sequence. This method may further comprise sequencing the second strand of the PCR product to obtain at least part of the nucleotide sequence of the sequence that flanks the genomic sequence.
In particular embodiments, the method may comprise fragmenting a mammalian genome to produce a fragmented genome, optionally adding adaptors to the fragmented genome, and applying the fragmented genome to the substrate. The fragmenting may be done physically, chemically or using a restriction enzyme. The fragmenting may be done by sonication or shearing, for example.
In particular cases, the hybridizing may be done by preparing a plurality of fragmented genomes from a plurality of different individuals, pooling the plurality of fragmented genomes to produce a pool, applying the pool of fragmented genomes to the substrate, and obtaining PCR products that comprise a sequence that flanks the genomic sequence in the different individuals. These embodiments may further comprise sequencing at least the first strand of the PCR products to obtain at least part of the nucleotide sequence of the sequence that flanks the genomic sequence in the different individuals. In particular cases, prior to pooling, different adaptors are ligated to the fragmented genomes from the different individuals, wherein the adaptor comprises a barcode sequence that allows the source of the adaptor-ligated genomic fragment to be identified after the PCR products are sequenced.
In some embodiments, the method comprises: adaptor-ligating fragmented genomic DNA from a first subject using a first adaptor that comprises a first barcode sequence to produce a first product; adaptor-ligating fragmented genomic DNA from a second subject using a second adaptor that comprises a second barcode sequence to produce a second product; combining the first and second products to produce a mixed template; and performing the method of claim 1 using the mixed template to provide first and second PCR products each containing the barcode sequence. The mixed template in some cases may comprise fragmented genomic DNA from at least 1,000 subjects.
In some embodiments, the method may involve: i. ligating the genomic fragments to an adaptor that contains a site for a sequencing primer and a nucleotide sequence that is the same as the second surface-bound oligonucleotides, ii. hybridizing the adaptor-ligated genomic fragments to a first member of the first population of surface-bound oligonucleotides, iii. extending the first member of the first population of surface-bound oligonucleotides to which the adaptor-ligated fragment is hybridized; and iv. hybridizing the adaptor-containing end of the extension product to a second support-bound polynucleotide, thereby producing a bridge and facilitating bridge PCR.
Also provided is a system. In certain cases the system may comprise: a) a substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides, wherein the first and second populations of surface-bound oligonucleotides are not spatially addressed on the substrate; b) a selection oligonucleotide that contains a region that hybridizes with a first member of the first population and a region that contains a genomic sequence; c) an adaptor; and d) instructions for performing the method of claim 1. The PCR product may be sequenced, e.g., using Illumina's Solexa platform, or another solid-phase sequencing method, to obtain at least part of the nucleotide sequence of the sequence that flanks the target genomic sequence.
In particular embodiments, the method may employ barcode sequences that allow the source of the sequence that flanks the target genomic sequence to be identified. In these embodiments, the adaptor of the adaptor-ligated genomic fragment may contain a barcode sequence that allows the source of the adaptor-ligated genomic fragment to be identified after the PCR product is sequenced. In particular embodiments, this method comprises adaptor-ligating fragmented genomic DNA from a first subject (which subject may be included in a pool of first subjects) using a first adaptor that comprises a first barcode sequence to produce a first product; adaptor-ligating fragmented genomic DNA from a second subject (which subject may be included in a pool of second subjects) using a second adaptor that comprises a second barcode sequence to produce a second product; combining the first and second products to produce a mixed template; and performing the above-described method using the mixed template to provide first and second PCR products each containing the barcode sequence. In the above method, the adaptors used have a portion that has the same sequence and that hybridizes to a surface-bound oligonucleotide, and a portion that has a different nucleotide sequence that contains the barcode sequence.
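The way a barcode sequence allows the source of a fragment to be identified after sequencing may be illustrated by a simple demultiplexing sketch. It assumes, for illustration only, that each read begins with the barcode, that all barcodes have the same length, and that matching is exact; the barcodes, reads and function name are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def demultiplex(reads: Iterable[str],
                barcode_to_subject: Dict[str, str]) -> Dict[str, List[str]]:
    """Group reads by subject using an exact-match barcode at the read start.

    The barcode is trimmed from each assigned read; reads whose prefix
    matches no known barcode are collected under "unassigned".
    """
    lengths = {len(barcode) for barcode in barcode_to_subject}
    if len(lengths) != 1:
        raise ValueError("expected a non-empty set of equal-length barcodes")
    n = lengths.pop()

    bins: Dict[str, List[str]] = defaultdict(list)
    for read in reads:
        subject = barcode_to_subject.get(read[:n], "unassigned")
        bins[subject].append(read[n:])
    return dict(bins)


# Hypothetical 6-nucleotide barcodes for two subjects.
barcodes = {"ACGTAC": "subject_1", "TGCATG": "subject_2"}
reads = ["ACGTACGGGTTT", "TGCATGCCCAAA", "NNNNNNAAAA"]
by_subject = demultiplex(reads, barcodes)
```

A practical implementation would typically tolerate sequencing errors in the barcode, which is one reason the cited references are consulted when selecting barcode sets.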
A second method of amplifying a selected sequence is provided. The principle of this method is similar to that of the method described above, except that a) the genomic fragment that is hybridized to the support-bound selection primer is not adaptor ligated; and b) adaptors are ligated after the support-bound selection primer is extended. After adaptor ligation, the product may be employed in a bridge PCR reaction, as discussed above. As in the alternative embodiment described above, the amplifying is done using: a) unextended members of the first population of surface-bound oligonucleotides; and b) support-bound primers that are made by: i. hybridizing members of the second population of surface-bound oligonucleotides to an oligonucleotide comprising a region that hybridizes with the members of the second population of surface-bound oligonucleotides and a region that is complementary to the sequence of the adaptor; and ii. extending the members of the second population of surface-bound oligonucleotides to produce the support-bound primers. As with the method described above, the PCR product may be sequenced to obtain at least part of the nucleotide sequence of the sequence that flanks the genomic sequence.
In an alternative embodiment, the genomic fragments may be ligated to an adaptor that not only contains a sequencing primer binding site, but also a sequence that is the same as the second population of surface-bound oligonucleotides. As shown, when the extended first population of surface-bound oligonucleotides (which is usually done at high temperature, i.e., at least 90 °C) are hybridized to the adaptor-ligated fragments and extended, the extension product contains a sequence that hybridizes to the second population of surface-bound oligonucleotides (which is usually done at a lower temperature, e.g., lower than 60 °C, e.g., lower than 55 °C), thereby facilitating amplification of the genomic fragments using the first and second surface-bound oligonucleotides. This method is illustrated in Fig. 14.
In particular embodiments, the oligonucleotides of the first population are present at a molar excess of at least 5X, 10X, 20X, 50X, 100X, 500X, 1,000X, 2,000X, 10,000X or 50,000X relative to the amount of selection oligonucleotide applied to the substrate. In one embodiment, the molar excess may be in the range of a 5X to 50,000X molar excess, e.g., a 100X to 5,000X molar excess.
In certain embodiments, a substrate may be contacted with a plurality of different selection oligonucleotides, each comprising a region that hybridizes with members of the first population of surface-bound oligonucleotides (which region has the same nucleotide sequence in the different selection oligonucleotides) and a region that contains a genomic sequence. The genomic sequence of each of the selection oligonucleotides is different, thereby allowing several genomic regions to be captured, amplified and sequenced on the substrate.
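As a worked illustration of the molar excess described above, the ratio may be computed and checked against the stated ranges as follows. The molar amounts in the example are hypothetical and are not taken from this disclosure; only the 5X to 50,000X and 100X to 5,000X ranges come from the text.

```python
def molar_excess(surface_oligo_fmol: float, selection_oligo_fmol: float) -> float:
    """Fold molar excess of surface-bound oligonucleotides over the
    selection oligonucleotide applied to the substrate."""
    if selection_oligo_fmol <= 0:
        raise ValueError("selection oligonucleotide amount must be positive")
    return surface_oligo_fmol / selection_oligo_fmol


def within_range(excess: float, low: float = 5.0, high: float = 50_000.0) -> bool:
    """True if the fold excess lies in the inclusive range [low, high]."""
    return low <= excess <= high


# Hypothetical amounts: 3,000 fmol of first-population oligonucleotides
# on the substrate and 3 fmol of applied selection oligonucleotide.
excess = molar_excess(3_000, 3)
```

With these assumed amounts the excess is 1,000X, which falls within both the broad range and the narrower 100X to 5,000X range.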
Kits
Also provided by the present disclosure are kits for practicing the subject method as described above. In certain embodiments, a subject kit may contain a) a substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides, wherein the first and second populations of surface-bound oligonucleotides are not spatially addressed on the substrate and b) a selection oligonucleotide that contains a region that hybridizes with a first member of the first population and a region that contains a genomic sequence. The kit may also contain other reagents described above and below that may be employed in the method, e.g., adaptors, ligase, hybridization buffers, etc.
In addition to above-mentioned components, the subject kit typically further includes instructions for using the components of the kit to practice the subject method. The instructions for practicing the subject method are generally recorded on a suitable recording medium. For example, the instructions may be printed on a substrate, such as paper or plastic, etc. As such, the instructions may be present in the kits as a package insert, in the labeling of the container of the kit or components thereof (i.e., associated with the packaging or subpackaging) etc. In other embodiments, the instructions are present as an electronic storage data file present on a suitable computer readable storage medium, e.g. CD-ROM, diskette, etc. In yet other embodiments, the actual instructions are not present in the kit, but means for obtaining the instructions from a remote source, e.g. via the internet, are provided. An example of this embodiment is a kit that includes a web address where the instructions can be viewed and/or from which the instructions can be downloaded. As with the instructions, this means for obtaining the instructions is recorded on a suitable substrate. Other required components will include related computer programs and/or computer scripts to implement a modification to prior programs already installed on a sequencer.
In addition to the instructions, the kits may also include one or more control analyte mixtures, e.g., two or more control analytes for use in testing the kit.
In order to further illustrate the present invention, the following specific examples are given with the understanding that they are being offered to illustrate the present invention and should not be construed in any way as limiting its scope.
The disclosures of U.S. provisional patent application serial nos. 61/386,390, filed on September 24, 2010, and 61/485,062, filed on May 11, 2011, including all figures, examples, detailed description, and oligonucleotide sequences, are incorporated herein in their entirety.
EXAMPLES
Presented below is a new approach to perform targeted DNA sequencing. The method is based on modifying a generic primer lawn (i.e. a lawn containing at least two primers that are randomly distributed) on a solid phase support to serve as a target DNA capture device, enabling direct sequencing of the captured DNA without significant manipulation of the sample. The method enables seamless integration of target DNA capture and sequencing experiments with a related fluidics platform. This approach uses a universal primer lawn on a solid-phase support to serve as a DNA capture substrate while maintaining its sequencing potential. The method can use non-processed, natural DNA as a template for sequencing. Sequencing using this method is not necessarily dependent on laboratory facilities. Moreover, many of the biases introduced during sample processing are avoided and substantially smaller samples can be analyzed in less time and with reduced cost relative to other methods. The method can be used to analyze single and double stranded templates. The ability to analyze single-strand DNA templates can be important for some sequencing applications that use formalin-fixed paraffin-embedded samples from pathological archives. Similarly, by allowing single-strand DNA template sequencing, the method does not require complicated nucleic acid extraction steps and expensive fragmentation instrumentation that are designed to preserve the double-strand formation of the DNA. Rather, the sample may be prepared by lysis and heat fragmentation, which is inexpensive and effective. The straightforward capture sequencing assay is not restricted to human genomic DNA; other nucleic acid substrates, such as bacterial and viral DNA and RNA, can also be analyzed. Transcriptomes, noncoding RNAs and miRNAs can also be captured and sequenced. In addition to nucleotide sequence capture and sequencing, other genetic and epigenetic properties can be studied, such as DNA methylation, large genomic rearrangements, and gene expression. The method may also be employed to select synthetic DNA from a population.
Generally, sequencing has been regarded as a process in which the DNA sample is structurally modified to facilitate the analysis on a sequencing system. The method described below modifies the sequencing system and therefore there is no need to modify and extensively prepare the sample. By functionalizing a generic primer lawn using a synthetic DNA oligonucleotide library, target genes of non-processed samples may be directly assayed. To reduce non-specific capture, specific DNA components that provide sequences that are employed in the formation of the bridge structure are brought in sequentially, and the primer lawn is itself modified. Sequencing library preparation for all types of sequencers relies on adding specific double-strand adaptor sequences to the DNA template. Since the capture oligonucleotides served as adaptors immobilized on a solid support, the library preparation for the assay only required the addition of a single adaptor. This substantially shortens the sample processing and requires neither clonal amplification nor gel electrophoresis based size separation. In certain cases a second adapter may be added to the captured template on a solid support. Certain embodiments of the method allow for the use of raw DNA as a sequencing template.
Several current methods for performing high throughput re-sequencing involve capturing the target DNA and sequencing as separate methods. This can in certain cases lead to multiple problems including i) significant labor and time intensive manipulations of DNA material, ii) errors secondary to complex experimental protocols, iii) bias created by the selection and molecular amplification process and iv) requirements for large quantities of starting material. The method described below is believed to eliminate the source of many of these problems since it involves little or no up-front sample manipulation and is totally automatable and highly scalable.
As a proof-of-concept, all exons of 10 cancer genes in the human genome were sequenced to show that the assay is reproducible and can be used to capture and sequence specific regions of the genome. This assay technology was demonstrated with an Illumina Genome Analyzer but note that this approach is broadly applicable to any sequencer that uses a solid-phase support.
The methods described below, some of the principles of which are illustrated in Figure 1, can be used to effectively capture any target DNA sequence and allow direct sequencing of the captured genomic fragments. A genomic DNA sample can be prepared for sequencing by a simple heat fragmentation step and the entire assay can be fully automated and performed on the solid support. The capture and subsequent reactions can be mediated by a fluidics system.
An additional embodiment provides a method that allows the preparation of DNA fragments for sequencing on the solid support by using fragmented DNA as a template and adding sequencing adapters to the captured DNA fragments using a fluidics system. As a proof-of-concept an Illumina next-generation DNA sequencer was used to develop these approaches. The results from an integrated capture and sequencing preparation reaction using primer lawn modification and 366 target sites in the human genome are presented. With the exception of a 25-minute heat fragmentation, all steps can be done on the solid-phase support of the Illumina flow cell.
The data described below demonstrate the robustness of the assay and the applicability of a universal primer lawn and a fluidics system as a capture substrate. Unique parameters of the modification of primer lawns have been identified, which enable the method to work robustly. In addition to complex eukaryotic genomes, the method can be applied to capture microbial and other organisms' genomes, viral DNA and RNA, transcriptomes of different sources as well as synthetic DNA. Furthermore, the concept of "programming" a native primer lawn immobilized on a solid support of a fluidics system and executing specific applications is being introduced and validated.
Materials and Methods
Genomic DNA samples. Genomic DNA for NA18507 was obtained from the Coriell Institute. Fresh frozen tissue samples were obtained from a colorectal cancer patient. Patient material was obtained with informed consent from the Stanford Cancer Center and the study was approved by the institutional review board (IRB) at Stanford University School of Medicine. Frozen tissue sections were prepared, hematoxylin-eosin staining was performed and the tumor composition of each sample was determined via pathological examination. Samples representing tumor and normal tissues were dissected from areas where cellular composition was 90% tumor or purely normal, respectively. Genomic DNA was extracted using E.Z.N.A SQ DNA/RNA Protein Kit (Omega Bio-Tek, Norcross, GA). Standard protocols for DNA preparation, array hybridization and scanning were used to analyze samples using SNP 6.0 arrays (Affymetrix, Santa Clara, CA). Data analysis was performed using the Genotyping Console software and Birdseed V2 algorithm (Affymetrix). Thirteen additional microarray data sets were analyzed in concert with the studied samples in order to assess the quality of the SNP calls. SNP 6.0 array data was filtered using a P-value threshold of 0.01.
Target selection and in silico OS-Seq oligonucleotide design. CCDS build release 20090902, human genome build NCBI 37 - hg19 and dbSNP Build ID 131 were used as the polymorphism reference data set. For gene selection, the GeneRanker annotation database was used to choose 344 cancer genes prioritized by importance. In order to find target-specific sequences of oligonucleotides, the exon definitions for the candidate genes were taken from CCDS. For most targeted exons (less than 500 bp), the 40-mer target-specific sequences were 10 bases outside of the 5' end of the exon boundary (Fig. 3a). Both strands of the exons were targeted using individual primer-probes. OS-Seq-366 only covered the flanks of exons. In the OS-Seq-11k assay, exons larger than 500 bp were treated by tiling target-specific sequences until the entire exonic region was covered (Fig. 3b). To improve the on-target specificity of OS-Seq-11k, we used Repbase to identify and eliminate oligonucleotide sequences that targeted highly repetitive sequences.
Oligonucleotide synthesis. Two strategies were applied for oligonucleotide synthesis. For OS-Seq-366, we designed 366 101-mer oligonucleotides (Fig. 5a) which were then column-synthesized (Stanford Genome Technology Center, Stanford, CA) (Fig. 4a). Oligonucleotides were quantified and pooled in equimolar concentration. For OS-Seq-11k, an in-situ microarray synthesis (LC Sciences, Houston) approach was used to synthesize the 11,742 precursor oligonucleotides (Fig. 5b). The sequences of target-specific oligonucleotides are in Table 2 below.
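The placement of target-specific sequences described above may be sketched as coordinate arithmetic. This is one reading of the text rather than the actual design software: it assumes 0-based, half-open intervals on the forward strand, interprets "10 bases outside of the 5' end of the exon boundary" as a 10-base gap between the exon and the nearest edge of a 40-mer on each strand, and uses a hypothetical tiling step, since the spacing between tiled sequences is not stated here.

```python
from typing import Dict, List, Tuple

# 0-based, half-open [start, end) interval on the forward strand (assumption).
Interval = Tuple[int, int]


def flank_probes(exon_start: int, exon_end: int,
                 probe_len: int = 40, offset: int = 10) -> Dict[str, Interval]:
    """Place one target-specific site outside each exon boundary.

    "+" lies upstream of the exon start; "-" lies beyond the exon end,
    which is the 5' side of the exon when read on the reverse strand.
    """
    return {
        "+": (exon_start - offset - probe_len, exon_start - offset),
        "-": (exon_end + offset, exon_end + offset + probe_len),
    }


def tile_exon(exon_start: int, exon_end: int,
              probe_len: int = 40, step: int = 40) -> List[Interval]:
    """Tile sites across an exon until the whole exon is covered.

    `step` is a hypothetical parameter; the last site may run past the
    exon end so that no exonic base is left uncovered.
    """
    if not 0 < step <= probe_len:
        raise ValueError("step must be positive and no larger than probe_len")
    return [(pos, pos + probe_len) for pos in range(exon_start, exon_end, step)]


# Hypothetical exons: a 200 bp exon (flanks only) and a 600 bp exon (tiled).
short_exon_sites = flank_probes(1000, 1200)
long_exon_sites = tile_exon(1000, 1600)
```

In this sketch a design step would call `flank_probes` for exons shorter than 500 bp and `tile_exon` for larger exons; how the two are combined for a given exon in OS-Seq-11k is not specified in the passage above.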
[Table 2: sequences of target-specific oligonucleotides; provided as images imgf000023_0001 and imgf000024_0001, not reproduced here]
Amplification of microarray-synthesized oligonucleotides. Three 25 μl subpools of precursor 80-mer oligonucleotides were used (587, 638 and 415 nM) (Fig. 5b). A PCR approach was employed to amplify the precursor, low-concentration oligonucleotides (Fig. 4b). The array-synthesized oligonucleotide subpools were diluted to 10 fM/oligo and used as a template for PCR amplification. PCR was performed using Taq DNA polymerase (NEB), and dNTPs (1 mM dATP, 1 mM dCTP, 1 mM dGTP, 500 nM dTTP and 500 nM dUTP) under standard reaction conditions. After denaturation at 95°C for 30 s, 20 amplification cycles (95°C, 30 s; 55°C, 30 s; 68°C, 30 s) were performed. Amplification Primer 1 contained uracil at the 3' end, while Amplification Primer 2 incorporated additional functional sequences (Fig. 5b). Amplified oligonucleotides were purified to remove excess primer (Fermentas), then processed using 0.1 U/μl Uracil DNA-excision Mix (Epicentre, Madison, WI) at 37°C for 45 min to detach the universal amplification primer site and cleave the mature 101-mer coding strands of the oligonucleotides. The oligonucleotides require the 5' ends to be functional and free in order to have accurate extension of the target-specific site during primer-probe immobilization. After heat shock inactivation of the enzymes (65°C, 10 min), the oligonucleotide preparations were purified (Fermentas). Finally, we quantified the three oligonucleotide subpools and created a single pool with equimolar concentration of each subpool.
Preparation of OS-Seq primer-probes by modification of the flow cell primer lawn. In the Illumina Genome Analyzer IIx (Illumina, San Diego) system, the solid phase support (i.e. the flow cell) has two primers ('C' and 'D'), which are randomly immobilized on a polyacrylamide layer at extremely high density. For OS-Seq experiments, a subset of the 'D' primers was specifically modified using the Illumina Cluster Station. Prior to the NGS primer modification, 133 nM oligonucleotide pools were heat denatured at 95°C for 5 min; this heat shock frees the coding strand of the OS-Seq oligonucleotides. Additional strand purification was not required, as the second strand is inactive on the flow cell and is washed away after hybridization. Denatured oligonucleotides were diluted with 4x Hybridization buffer (20x SSC, 0.2% Tween-20). The resulting 100 nM oligonucleotides were used in the flow cell modification experiments. 30 μl of oligonucleotide mixture was dispensed into each lane of the flow cell. During a temperature ramp (from 96°C to 40°C in 18 minutes), oligonucleotides annealed specifically to the immobilized primer 'D'. Then, DNA polymerase was used to extend the 'D' primer with the annealed oligonucleotide as a template. After extension, the original oligonucleotide template was denatured from the extended 'D' primer and washed from the solid phase support. Standard Illumina v4 reagents were used for the extension, wash and denaturation steps. The modification of primer 'D' caused immobilization of the primer-probes.
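The dilution arithmetic in the paragraph above (133 nM pools brought to roughly 100 nM with 4x Hybridization buffer) can be sketched as follows. This is only an illustration: it assumes the 4x buffer is brought to 1x, so the buffer makes up one quarter of the final volume, and it assumes a 30 μl final volume to match the amount dispensed per lane. The function name is hypothetical.

```python
def dilute_with_4x_buffer(stock_conc, final_volume_ul):
    """Return (sample_ul, buffer_ul, final_conc) for a 4x buffer brought to 1x.

    Assumption: the 4x buffer occupies one quarter of the final volume,
    so the sample is diluted by a factor of 3/4.
    """
    buffer_ul = final_volume_ul / 4.0
    sample_ul = final_volume_ul - buffer_ul
    final_conc = stock_conc * sample_ul / final_volume_ul
    return sample_ul, buffer_ul, final_conc


# 133 nM denatured oligonucleotide pool, 30 ul final volume (assumed).
sample_ul, buffer_ul, final_nm = dilute_with_4x_buffer(133.0, 30.0)
print(sample_ul, buffer_ul, final_nm)
```

Under these assumptions the result is 22.5 μl of pool plus 7.5 μl of buffer at 99.75 nM, consistent with the 100 nM working concentration stated above.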
Sequencing library preparation. We outline the general scheme of genomic DNA fragmentation, end repair, A-tailing, adapter ligation and PCR used in the preparation of the OS-Seq sequencing library in Fig. 2. We used 1 μg of genomic DNA from NA18507 and a flash frozen colorectal cancer sample as starting material. Genomic DNA was fragmented using Covaris E210R (Covaris, Woburn, MA) to obtain a mean fragment size of 500 bp (duty cycle 5%, intensity 3, 200 cycles per burst and 80 seconds). The randomly fragmented DNA was end-repaired using 0.25 U of Klenow large fragment (New England Biolabs, Ipswich, MA), 7.5 U of T4 DNA polymerase (NEB), 400 μM of each dNTP (NEB), 25 U of T4 Polynucleotide kinase (NEB) and T4 DNA ligase buffer with ATP (NEB) in a 50 μl reaction volume at room temperature for 45 minutes. After end repair, adenines were added to the 3' ends of the template DNA using 3.2 U of Taq DNA polymerase (NEB), 100 μM dATP (Invitrogen) and Taq buffer with 1.5 mM MgCl2 in an 80 μl reaction at 72°C for 15 min. Before adapter ligation, reactions were purified using a PCR purification kit (Fermentas).
An indexing system for OS-Seq was developed. The sequencing library adapters contain an optional 6-base indexing sequence, a sequencing primer 1 site and a 12-mer sequence for primer 'C' hybridization (Table 2 above, Fig. 5c). Sixteen indexing adapters were designed. Adapter oligonucleotides were synthesized at the Stanford Genome Technology Center. Prior to ligation, adapter oligonucleotides were annealed during a temperature ramp-down. For the targeted resequencing of NA18507, we used both a singleplex adapter and a multiplex adapter with the 'AACCTG' tag. For indexing of the matched normal-tumor pair, we used a 'TGCTAA' barcode for the normal tissue, while the tumor sample was tagged with 'AGGTCA'. Double-stranded DNA adapters with a T-overhang were ligated to the A-tailed templates using 2,000 U of T4 DNA ligase (NEB) and T4 DNA ligase buffer at room temperature for 1 hour. After adaptor ligation, reactions were purified using a PCR purification kit (Fermentas) and libraries were amplified using PCR. 50 μl reactions containing 1 U of Phusion Hot Start DNA polymerase (Finnzymes, Finland), 1 μM library amplification primer (Supplemental Table 1), Phusion HF buffer and 200 μM of each dNTP (NEB) were prepared. Reactions were denatured at 98°C for 30 s. After that, 22 PCR cycles were performed (98°C for 10 s, 65°C for 30 s and 72°C for 30 s), followed by 72°C for 7 min and a hold at 4°C. Thereafter, PCR reactions were purified using a PCR purification kit (Fermentas) and quantified. Multiplexed libraries were pooled in equal concentrations.
Capture of targets using primer-probes. Targets were captured on the flow cell using OS-Seq primer-probes (Fig. 1b and oligonucleotide sequences below). We injected 30 μl of the genomic sequencing libraries (30 - 42 ng/μl) into the flow cell. Target DNA was hybridized to the primer-probes by incubating the sequencing libraries in the flow cell at 65°C for 20 hours. During genomic DNA library hybridization and subsequent extension, the flow cell was kept at a constant 65°C. An Illumina Cluster Station was used to carry out the primer-probe hybridization and extension steps. Prior to hybridization to primer-probes, 22.5 μl of sequencing libraries (40 - 56.6 ng/μl) was denatured at 95°C for 5 min. After heat shock, the genomic DNA libraries were diluted to a total volume of 30 μl using 4x Hybridization buffer. The final DNA concentrations of sequencing libraries ranged from 30 to 41.7 ng/μl. Due to the high concentration of the sequencing libraries, the hybridization volume was kept to a minimum. Therefore, a custom Cluster Station program was developed to allow reproducible low-volume hybridization. The subsequent extension, wash and denaturation steps were performed using Illumina v4 reagents.
Flow cell processing and sequencing. After capture of the targets, the temperature of the flow cell was lowered to 40°C for 30 min to allow the 12 bases at the 3' end of the captured genomic DNA library fragments to hybridize to primer 'C' (Fig. 1b and oligonucleotide sequences below). In the bridge formation, the library fragment and primer 'C' were extended using DNA polymerase to finalize and replicate the captured DNA fragment. Afterwards, bridge-PCR was carried out to generate the clonally amplified sequencing clusters. Samples were sequenced using 40 by 40 (OS-Seq-366) or 60 by 60 (OS-Seq-11k) paired-end cycles on an Illumina Genome Analyzer IIx using regular version 4 sequencing reagents and recipes (Illumina). Image analysis and base calling were performed using the SCS 2.8 and RTA 2.8 software (Illumina).
Sequence analysis and variant detection. Sequence reads were aligned to the human genome build NCBI 37 (hg19) using Burrows-Wheeler Aligner (BWA) 19. After alignment, on-target reads (Read 1) were defined as being within 1 kb of the 5' end of the primer-probe. Off-target reads were defined as aligning outside 1 kb of the 5' end of the primer-probe or mapping on a different chromosome from the location of the associated primer-probe. For the de-multiplexing of indexed lanes, we used a Perl script to generate an index of the 7-base tags using the base-call files. This index file and another Perl script were used to de-multiplex either the combined base-call file (so that separate fastq files can be generated for further processing) or the aligned file.
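The de-multiplexing step described above can be sketched as follows. The original work used Perl scripts that are not reproduced in this document; the Python below is a stand-in, not the authors' code. The 7-base tags are assumed here to be the 6-base index followed by a T (from the adapter T-overhang), which the text does not state explicitly, and all function and variable names are hypothetical.

```python
from collections import defaultdict

TAG_LENGTH = 7  # the text reports 7-base tags taken from the start of Read 1

# Assumption: 7-base tag = 6-base index from the text + "T" from the T-overhang.
TAG_TO_SAMPLE = {
    "TGCTAAT": "normal",
    "AGGTCAT": "tumor",
}


def demultiplex(reads, tag_to_sample=TAG_TO_SAMPLE, tag_length=TAG_LENGTH):
    """Group (name, sequence) reads by their leading tag.

    The tag is trimmed from each sequence; reads with an unknown tag are
    collected under "undetermined".
    """
    bins = defaultdict(list)
    for name, seq in reads:
        sample = tag_to_sample.get(seq[:tag_length], "undetermined")
        bins[sample].append((name, seq[tag_length:]))
    return dict(bins)


# Toy reads, invented for illustration only.
example = demultiplex([
    ("r1", "TGCTAATACGTACGT"),
    ("r2", "AGGTCATGGGGCCCC"),
    ("r3", "NNNNNNNACGTACGT"),
])
print(example)
```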
To eliminate any synthetic primer-probe sequences for variant calling, insert size filtering on the mate pairs was applied. The insert size was determined by comparing alignment of paired sequence reads. For variant calling, extracted sequences were required to have an insert size greater than [40 + the length of Read 1]. After insert size filtering, variant calling was performed using SAMtools and BCFtools. A sequence pileup was performed against the human genome (hg19) using SAMtools mpileup with a mapping quality threshold of 50. BCFtools view was used to genotype base positions and data was filtered using vcfutils.pl, a variant filter Perl script provided in the SAMtools package. The vcfutils varFilter conditions were: i) coverage of 10 or greater, ii) removal of the strand bias filter (since OS-Seq is a strand-specific capture method), iii) forcing the script to output both reference and non-reference positions. Reference and non-reference calls were used for comparisons with the Affymetrix SNP 6.0 array data. Genotyped positions were filtered to have a Phred-like quality score above 50. We used BEDtools intersectBed to define target regions for each primer-probe and combinations where probes overlap in their targets.
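The insert-size rule stated above (insert size greater than 40 plus the length of Read 1) can be written as a small predicate. This is a sketch rather than the pipeline actually used; the function name is hypothetical, and the absolute value is taken on the assumption that the insert size comes from a signed field such as the SAM template length.

```python
PROBE_LENGTH = 40  # the "40" in the rule: insert size > 40 + length of Read 1


def passes_insert_filter(insert_size, read1_length, probe_length=PROBE_LENGTH):
    """Return True if a read pair is kept for variant calling.

    Pairs with short inserts are dropped so that Read 1 cannot extend into
    the synthetic primer-probe sequence.
    """
    return abs(insert_size) > probe_length + read1_length


# With 40-base reads, the threshold is 80 bases.
print(passes_insert_filter(81, 40), passes_insert_filter(80, 40))
```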
Variant comparison. For quality assessment of extracted variants, variant calls from the NA18507 data were compared to variants identified in a complete genome sequence analysis and to Hapmap genotyping data (www.hapmap.org). Comparisons of OS-Seq data and Affymetrix SNP 6.0 array data were made using Perl scripts. dbSNP131 was used for SNP annotation.
Further Oligonucleotide sequences
0) Oligonucleotides
OS-Seq oligonucleotide:
5' -
NNNNNNl^
ATGCCGAGACCGATCTCGTATGCCGTCTTCTGCTTG - 3' (Generic capture oligonucleotide, N = unique 40- mer sequence; SEQ ID NO: 37)
Ad_top_FC_capture_A_tail:
5' - CGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT - 3' (SEQ ID NO: 38)
Ad_bot_FC_capture_A_tail:
5' - GATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCG - 3' (SEQ ID NO: 39)
Flow cell primer 'C':
5' - PS-TTTTTTTTTTAATGATACGGCGACCACCGAGAUCTACAC - 3' (U = 2-deoxyuridine) (SEQ ID NO: 40)
Flow cell primer 'D' :
5' - PS-TTTTTTTTTTCAAGCAGAAGACGGCATACGAGoxoAT - 3', (Goxo = 8-oxoguanine) (SEQ ID NO: 41)
Sequencing primer 1 :
5' - ACACTCTTTCCCTACACGACGCTCTTCCGATCT - 3' (SEQ ID NO:42)
Sequencing primer 2:
5' - CGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATCT - 3' (SEQ ID NO:43)
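The relationships among the oligonucleotides listed above can be checked with a few lines of code. The sketch below uses only sequences given in this section (SEQ ID NOs: 38 to 40, with the 5' PS-T10 linker of primer 'C' omitted) and tests two properties described in the text: that the annealed adapter carries a single T-overhang, and that the adapter shares a 12-mer with the 3' end of flow cell primer 'C'. The helper and variable names are illustrative.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


AD_TOP = "CGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT"  # SEQ ID NO: 38
AD_BOT = "GATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCG"   # SEQ ID NO: 39
PRIMER_C = "AATGATACGGCGACCACCGAGAUCTACAC"            # SEQ ID NO: 40, linker omitted

# The top strand is the reverse complement of the bottom strand plus one extra
# 3' T, which is the overhang that pairs with the A-tailed genomic fragment.
has_t_overhang = AD_TOP == reverse_complement(AD_BOT) + "T"

# The first 12 bases of the adapter equal the last 12 bases of primer 'C'
# (reading the 2-deoxyuridine as T), so the complement of the adapter on a
# captured fragment can anneal to primer 'C'.
shares_12mer = AD_TOP[:12] == PRIMER_C[-12:].replace("U", "T")

print(has_t_overhang, shares_12mer)
```

Both checks hold for the sequences as printed above.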
1) Flow cell modification
Anneal
3' -
GTTCGTCTTCTGCCGTATGCTCTAGCCAGAGCCGTAAGGACGACTTGGCGAGAAGGCTAGANNNNNN NNNNNNNNNNN^ - 5' (OS-Seq oligonucleotide) (SEQ ID NO:44)
FC - CAAGCAGAAGACGGCATACGAGAT - 3' (Flow cell primer 'D') (SEQ ID NO:45)
Extension
3' -
GTTCGTCTTCTGCCGTATGCTCTAGCCAGAGCCGTAAGGACGACTTGGCGAGAAGGCTAGANNNNNN NNNNNNNNNNN^ - 5' (OS-Seq oligonucleotide) (SEQ ID NO:46)
FC -
CAAGCAGAAGACGGCATACGAGATCGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATCTNNNNNN ΝΝΝΝΝΝΝΝΝΝΝΝΝΝΝΝΊ^ - 3' (primer-probe) (SEQ ID NO:47)
Denature
FC -
CAAGCAGAAGACGGCATACGAGATCGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATCTNNNNNN ΝΝΝΝΝΝΝΝΝΝΝΝΝΝΝΝΊ^ - 3' (primer-probe) (SEQ ID NO:48)
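The anneal, extension and denature steps above amount to taking the complement of the OS-Seq oligonucleotide. The sketch below rebuilds a primer-probe from the constant region of the oligonucleotide as it is printed above (written 3' to 5') and a capture 40-mer, then checks that the result starts with flow cell primer 'D' and contains the Sequencing primer 2 sequence. The 40-mer used is a made-up placeholder, and the function name is hypothetical.

```python
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")

# Constant region of the OS-Seq oligonucleotide, written 3'->5' as in the text.
OLIGO_CONST_3TO5 = "GTTCGTCTTCTGCCGTATGCTCTAGCCAGAGCCGTAAGGACGACTTGGCGAGAAGGCTAGA"
PRIMER_D = "CAAGCAGAAGACGGCATACGAGAT"                  # as drawn in the anneal step
SEQ_PRIMER_2 = "CGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATCT"  # SEQ ID NO: 43


def extend_primer_d(capture_40mer):
    """Return the primer-probe (5'->3') templated by an OS-Seq oligonucleotide.

    capture_40mer is the target-specific part of the oligonucleotide, 5'->3'.
    The constant part is stored 3'->5', so its plain complement already reads
    5'->3'; the 40-mer needs a full reverse complement.
    """
    constant = OLIGO_CONST_3TO5.translate(COMPLEMENT)
    return constant + capture_40mer.translate(COMPLEMENT)[::-1]


probe = extend_primer_d("A" * 40)  # placeholder 40-mer, not a real target
print(len(probe), probe.startswith(PRIMER_D), probe[24:61] == SEQ_PRIMER_2)
```

The lengths are consistent with the text: a 61-base constant region plus a 40-base target-specific region gives the 101-mer.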
2) Library prep
Fragmentation, end repair
5' - NNNNNNIS^ - 3' (genomic DNA)
3' - NNNNNNNI^^ - 5' (genomic DNA)
A-tailing
5' - NNNNNN1W - 3' (genomic DNA after A-tailing)
3' - ANNNNNN1W - 5' (genomic DNA after A-tailing)
Adaptor ligation
OS-Seq dsAdapter
5' - GATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCG - 3' (Ad_bot_FC_capture_A_tail) (SEQ ID NO:49)
3' - TCTAGCCTTCTCGCAGCACATCCCTTTCTCACATCTAGAGC - 5' (Ad_top_FC_capture_A_tail) (SEQ ID NO:50)
OS-Seq dsAd library (This is the structure of the OS-Seq-adaptor library, N = random genomic DNA sequence defined by fragmentation)
5' -
CGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGAT^
CGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCG - 3' (SEQ ID NO:51)
3' -
GCTCTAGATGTGAGAAAGGGATGTGCTGCGAGAAGGCTAGANNlSnSlNlSnsnS^
AGCCTTCTCGCAGCACATCCCTTTCTCACATCTAGAGC - 5' (SEQ ID NO:52)
Library PCR
OS-Seq adaptor library amplification (Ad_top_FC_capture_A_tail, single primer PCR is used to amplify the adaptor library)
5' - CGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT - 3'
(Ad_top_FC_capture_A_tail) (SEQ ID NO:53)
3' -
GCTCTAGATGTGAGAAAGGGATGTGCTGCGAGAAGGCTAGANNl^^
AGCCTTCTCGCAGCACATCCCTTTCTCACATCTAGAGC - 5' (OS-Seq library fragment) (SEQ ID NO:54) 5' -
CGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT^
CGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCG - 3' (OS-Seq library fragment) (SEQ ID NO:55) 3' -TCTAGCCTTCTCGCAGCACATCCCTTTCTCACATCTAGAGC - 5' (Ad_top_FC_capture_A_tail) (SEQ ID NO:56)
5' -
CGAGATCTACACTCTTTCCCTACACGACGCTCTTC^
CGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCG - 3' (OS-Seq library fragment, amplified) (SEQ ID
NO:57)
3' -
GCTCTAGATGTGAGAAAGGGATGTGCTGCGAGAAGGCTAGANNlSnSlNlSnsnS^
AGCCTTCTCGCAGCACATCCCTTTCTCACATCTAGAGC - 5' (OS-Seq library fragment, amplified) (SEQ ID NO:58)
3) Capture
Anneal
OS-Seq adaptor library annealing (N = 40-mer specific capture site)
3' - GCTCTAGATGTGAGAAAGGGATGTGCTGCGAGAAGGCTAGAgenomicdna (SEQ ID NO:59) NNNNNNNN1W
CATCCCTTTCTCACATCTAGAGC - 5' (OS-Seq library fragment, amplified) (SEQ ID NO:60)
FC -
CAAGCAGAAGACGGCATACGAGATCGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATCTNNNNNN NNNNNNNN1W - 3' (primer-probe)
(SEQ ID NO:61)
Extension
OS-Seq capture
3' - GCTCTAGATGTGAGAAAGGGATGTGCTGCGAGAAGGCTAGAgenomicdna (SEQ ID NO:62) NNNNNNl^
CATCCCTTTCTCACATCTAGAGC - 5' (OS-Seq library fragment, amplified) (SEQ ID NO:63)
FC -
CAAGCAGAAGACGGCATACGAGATCGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATCTNNNNNN NNNNNNNN1W
AAAGAGTGTAGATCTCG - 3' (captured DNA) (SEQ ID NO:64)
Denature
OS-Seq library
FC -
CAAGCAGAAGACGGCATACGAGATCGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATCTNNNNNN NNNNNNNN1W
AAAGAGTGTAGATCTCG - 3' (captured DNA) (SEQ ID NO:65)
4) Adapter finalizing
Hybridization at 40°C
OS-Seq_Library (there is 12-mer homology between the OS-Seq adaptor and Oligo-C)
FC -
CAAGCAGAAGACGGCATACGAGATCGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATCTNNNNNN NNNNNNNN1W
AAAGAGTGTAGATCTCG - 3' (captured DNA) (SEQ ID NO:66)
3' -
CACATCTAGAGCCACCAGCGGCATAGTAA - FC (Oligo'C) (SEQ ID NO:67)
Extend
FC -
CAAGCAGAAGACGGCATACGAGATCGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATCTNNNNNN ΝΝΝΝΝΝΝΝΝΝΝΝΝΝΝΝΊ^
AAAGAGTGTAGATCTCGGTGGTCGCCGTATCATT - 3' (finalized DNA) (SEQ ID NO:68)
3' -
GTTCGTCTTCTGCCGTATGCTCTAGCCAGAGCCGTAAGGACGACTTGGCGAGAAGGCTAGANNNNNN NNNNNNNNNNN^
TTCTCACATCTAGAGCCACCAGCGGCATAGTAA - FC (finalized DNA) (SEQ ID NO:69)
Denature
FC -
CAAGCAGAAGACGGCATACGAGATCGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATCTNNNNNN ΝΝΝΝΝΝΝΝΝΝΝΝΝΝΝΝΊ^
AAAGAGTGTAGATCTCGGTGGTCGCCGTATCATT - 3' (finalized DNA) (SEQ ID NO:70)
3' -
GTTCGTCTTCTGCCGTATGCTCTAGCCAGAGCCGTAAGGACGACTTGGCGAGAAGGCTAGANNNNNN NNNNNNNNNNN^
TTCTCACATCTAGAGCCACCAGCGGCATAGTAA - FC (finalized DNA) (SEQ ID NO:71)
5) Cluster generation
Anneal
FC -
CAAGCAGAAGACGGCATACGAGATCGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATCTNNNNNN NNNNNNNNNNNNNNI ^ AAAGAGTGTAGATCTCGGTGGTCGCCGTATCATT - 3' (finalized DNA) (SEQ ID NO:72)
3' -
CACATCTAGAGCCACCAGCGGCATAGTAA - FC (Oligo'C) (SEQ ID NO:73)
FC - CAAGCAGAAGACGGCATACGAGAT - 3' (SEQ ID NO:74)
(Oligo'D')
3' -
GTTCGTCTTCTGCCGTATGCTCTAGCCAGAGCCGTAAGGACGACTTGGCGAGAAGGCTAGANNNNNN NNlSnSINlS^
TTCTCACATCTAGAGCCACCAGCGGCATAGTAA - FC (finalized DNA) (SEQ ID NO:75)
Extend
FC -
CAAGCAGAAGACGGCATACGAGATCGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATCTNNNNNN NNNNNNNNNN
AAAGAGTGTAGATCTCGGTGGTCGCCGTATCATT - 3' (finalized DNA) (SEQ ID NO:76)
3' -
GTTCGTCTTCTGCCGTATGCTCTAGCCAGAGCCGTAAGGACGACTTGGCGAGAAGGCTAGANNNNNN NNNNNNNNNNN^
TTCTCACATCTAGAGCCACCAGCGGCATAGTAA - FC (finalized DNA) (SEQ ID NO:77)
Denature
FC -
CAAGCAGAAGACGGCATACGAGATCGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATCTNNNNNN NNNNNNNNNNNNNN^
AAAGAGTGTAGATCTCGGTGGTCGCCGTATCATT - 3' (Clustered DNA) (SEQ ID NO:78)
3' -
GTTCGTCTTCTGCCGTATGCTCTAGCCAGAGCCGTAAGGACGACTTGGCGAGAAGGCTAGANNNNNN NNNNNNNNNNN^
TTCTCACATCTAGAGCCACCAGCGGCATAGTAA - FC (Clustered DNA) (SEQ ID NO:79)
6) Sequencing
FC -
CAAGCAGAAGACGGCATACGAGATCGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATCTNNNNNN NNNNNNNNNNNNNN^
AAAGAGTGTAGATCTCGGTGGTCGCCGTATCATT - 3' (Clustered DNA) (SEQ ID NO:80)
3' - <
TCTAGCCTTCTCGCAGCACATCCCTTTCTCACA - 5' (Sequencing Primer 1) (SEQ ID NO:81)
5' - CGGTCTCGGCATTCCTGCTGAACCGCTCTTCCGATCT >
(Sequencing Primer 2) (SEQ ID NO:82)
3' -
GTTCGTCTTCTGCCGTATGCTCTAGCCAGAGCCGTAAGGACGACTTGGCGAGAAGGCTAGANNNNNN NNNNNN^^
TTCTCACATCTAGAGCCACCAGCGGCATAGTAA - FC (Clustered DNA) (SEQ ID NO:83)
Results
This section describes a new approach for targeted resequencing called Oligonucleotide-Selective Sequencing (OS-Seq) that addresses many of the limitations seen in targeted resequencing approaches. Conceptually different from other methods, OS-Seq is an integrated approach in which both capture and sequencing of genomic targets are performed on the NGS solid phase support, such as the Illumina flow cell (Fig. 1a). For preparation of OS-Seq, a single-adapter sequencing library is prepared from genomic DNA, and target-specific oligonucleotides are synthesized and used to construct primer-probes on the flow cell. Then, immobilized primer-probes on the flow cell are used to capture single molecule targets from the single-adapter genomic DNA library.
Processing of OS-Seq involves three steps, in which the Illumina sequencing system is modified to contain target-specific primer-probes, targets are captured from a single-adapter library, and immobilized fragments are finalized for sequencing (Fig. 1b). To prepare the capture substrate, we molecularly re-engineer the Illumina flow cell by modifying a subset of the existing primer lawn to become target-specific primer-probes. To create these primer-probes, we hybridize the 3' universal sequence of a complex pool of oligonucleotides to its complement on the flow cell and extend the immobilized primer using a DNA polymerase extension reaction. The result is a set of randomly placed, target-specific primer-probes, which are fixed onto the flow cell surface. During high-heat incubation at 65°C, the primer-probes specifically hybridize to target complementary sequences within the single-adapter genomic DNA library; after hybridization, the primer-probes function as primers for another DNA polymerase extension reaction. The extension step effectively captures the target sequence. After extension, a denaturation step is performed, followed by low-heat hybridization at 40°C to stabilize the sequencing library adapter to its complement on the flow cell, which creates a bridge structure. A third DNA polymerase extension reaction incorporates additional sequence to the 3' ends, creating two molecules capable of solid phase amplification. After these three steps specific to OS-Seq, captured molecules are bridge amplified, processed and sequenced using the standard sequencing protocol of the Illumina NGS system. A detailed description of the molecular biology steps in OS-Seq is given above, and the Illumina Cluster Station programs for OS-Seq are modified accordingly.
As a proof-of-principle demonstration, two capture assays were developed. First, 366 OS-Seq primer-probes to flank the exons of 10 cancer genes (OS-Seq-366) were designed (Fig. 3). This assay was intended to test the OS-Seq method and not for definitive exon coverage. We synthesized OS-Seq-366 oligonucleotides using column-based methods. Second, to demonstrate scalability, we designed and synthesized 11,742 primer-probes to capture the exons of 344 cancer genes (OS-Seq-11k). These primer-probes avoided repeats and were tiled across large exons for improved exon coverage. For high-throughput production of OS-Seq-11k, we synthesized the oligonucleotides on a programmable microarray. These array-synthesized oligonucleotides require amplification for processing and for obtaining sufficient material for OS-Seq (Fig. 4). After processing, OS-Seq oligonucleotides contain a target-specific 40-mer complementary to the 5' end of the targeted region (Fig. 5). These oligonucleotides also contain the sequence required for annealing the paired-end sequencing primer and for hybridization to the immobilized primer lawn on the flow cell.
To assess capture performance of the OS-Seq-366 and OS-Seq-11k assays, DNA from a previously sequenced Yoruban individual was prepared (NA18507). Paired-end sequencing was conducted on all targeting assays. The first read (Read 1) is derived from targeted genomic DNA while the second read (Read 2) comes from the synthetic target-specific primer-probes (Fig. 1a). OS-Seq-366 was run on a single lane of a GAIIx run. Each sample of OS-Seq-11k was run on the equivalent of 1.3 lanes, based on our indexing scheme. We developed an indexing scheme using adapters with a unique barcode sequence (Fig. 5c) to tag samples. Barcodes were derived from the first seven bases of Read 1. Overall, 87.6% of OS-Seq-366 reads and 91.3% of OS-Seq-11k reads, containing proper barcodes, mapped to the human genome reference (Table 1). In comparison, 58% of reads derived using a previously reported hybrid selection method could be mapped to the human genome reference.
Table 1.
Sample | NA18507 | NA18507 | Normal | Tumor
Number of primer-probes | 366 | 11,742 | 11,742 | 11,742
Total reads | 1,969,091 | 1,602,825 | 2,038,270 | 1,551,279
Mapped reads (percentage of total reads) | 1,725,215 (87.6%) | 1,463,782 (91.3%) | 1,897,967 (93.1%) | 1,415,388 (91.2%)
Captured on-target reads (a) (percentage of mapped reads) | 1,499,052 (86.9%) | 1,365,305 (93.3%) | 1,747,192 (92.1%) | 1,316,563 (93.0%)
Captured on-target exon reads (b) (percentage of mapped reads) | 518,318 (30.0%) | 624,937 (42.7%) | 725,072 (38.2%) | 608,458 (43.0%)
Captured off-target reads (percentage of mapped reads) | 226,163 (13.1%) | 98,477 (6.7%) | 150,775 (7.9%) | 98,825 (7.0%)
On-target region (a) | 233 kb | 7,296 kb | 7,296 kb | 7,296 kb
Captured on-target region used for SNV calling (a, c) (percentage of on-target region) | 191 kb (82.0%) | 1,541 kb (21.1%) | 1,754 kb (24.0%) | 1,476 kb (20.2%)
OS-Seq SNVs called from captured on-target region | 105 | 985 | 871 | 727
OS-Seq SNPs which are reported | 97% (d) | 95.7% (d) | |
OS-Seq SNPs which are concordant with array genotype | | | 99.8% (e) | 99.5% (e)
Exon regions (b) | 31 kb | 959 kb | 959 kb | 959 kb
Captured exon regions (b, f) (percentage of exon regions) | 26 kb (83.9%) | 917 kb (95.6%) | 901 kb (94.0%) | 909 kb (94.8%)
Average fold-coverage on captured exons (b, f) | 729 | 31 | 38 | 31
(a) Within 1 kb from primer-probes. (b) Within exons. (c) Filtered insert size >40 + read 1 length. Fold-coverage >10. Phred-like quality score >50. (d) Merged variant bases from Bentley et al. (2008) and dbSNP131. (e) Positions genotyped using Affymetrix SNP 6.0 arrays. (f) Fold-coverage >1.
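The percentages in Table 1 follow directly from the read counts, which the short sketch below recomputes as a consistency check. The counts are copied from the table; the dictionary layout and function names are illustrative only.

```python
# Read counts copied from Table 1 (total, mapped, captured on-target).
TABLE1 = {
    "NA18507 (366 probes)": {"total": 1_969_091, "mapped": 1_725_215, "on_target": 1_499_052},
    "NA18507 (11,742 probes)": {"total": 1_602_825, "mapped": 1_463_782, "on_target": 1_365_305},
    "Normal": {"total": 2_038_270, "mapped": 1_897_967, "on_target": 1_747_192},
    "Tumor": {"total": 1_551_279, "mapped": 1_415_388, "on_target": 1_316_563},
}


def pct(part, whole):
    """Percentage rounded to one decimal place, as reported in the table."""
    return round(100.0 * part / whole, 1)


def summarize(row):
    """Mapped, on-target and off-target percentages for one sample."""
    off_target = row["mapped"] - row["on_target"]
    return {
        "mapped_pct": pct(row["mapped"], row["total"]),
        "on_target_pct": pct(row["on_target"], row["mapped"]),
        "off_target_pct": pct(off_target, row["mapped"]),
    }


for sample, row in TABLE1.items():
    print(sample, summarize(row))
```

Off-target reads are taken as mapped reads minus on-target reads, which reproduces the off-target counts listed in the table.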
To assess overall coverage of each primer-probe, we determined the number of reads originating from the Read 1 data that fell within 1 kb from the 3' end of the primer-probe. OS-Seq primer-probes are strand-specific and only capture the 5' ends of the DNA targets (Fig. 6). As an example, the median coverage profile of all primer-probes in OS-Seq-366 (Fig. 1a) illustrates how sequence is captured up to 1 kb downstream from the primer-probe. Generally, a bias towards smaller insert sizes was detected: for OS-Seq-366, 50% of targeted reads mapped within 283 bases from the primer-probes. In both assays, additional reads beyond the 1 kb interval and as far distant as 1.7 kb were identified. The sequence reads beyond 1 kb represent the tail end of the capture distribution from any given primer-probe and accounted for less than 0.15% of the overall sequence data for both OS-Seq-366 and OS-Seq-11k. It was also observed that the characteristics of the coverage distribution are correlated with the fragment size introduced during library creation and with size constraints inherent to bridge formation and solid-phase PCR (Fig. 6). In addition, introducing a higher molar concentration of the single-adapter library, sequencing additional lanes or using longer reads can increase coverage along the target.
On-target reads were defined as Read 1 sequences mapping within 1 kb of a primer-probe. Using these on-target coverage criteria, 86.9% of 40 base reads in OS-Seq-366 and 93.3% of 53 base reads in OS-Seq-11k were on-target (Table 1). OS-Seq-11k showed improved specificity given efforts to refine the in-silico design of the primer-probes. Specifically, for OS-Seq-11k in-silico primer-probe selection, a repeat masking filter was used, which resulted in fewer off-target reads. In comparison, 89% of 76 base reads and 50% of 36 base reads mapped in proximity of a probe in a published hybrid selection method, suggesting similar on-target specificity between the methods and indicating that moving towards longer reads may improve the on-target specificity of OS-Seq. On-exon specificity of OS-Seq was also similar to the published hybrid selection method. Using OS-Seq-11k, we observed that 42.7% of reads mapped within exons (Table 1), while a hybrid selection capture technology reported 42% of reads mapped to exons.
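The on-target criterion described above can be expressed as a simple test on alignment coordinates. The sketch below is a simplification and not the analysis code used in the study: it treats the primer-probe as a single reference coordinate on a named chromosome, ignores strand, and uses invented coordinates. The function name is hypothetical.

```python
ON_TARGET_WINDOW = 1000  # 1 kb, as in the definition of on-target reads


def is_on_target(read_chrom, read_pos, probe_chrom, probe_pos,
                 window=ON_TARGET_WINDOW):
    """Return True if Read 1 maps within `window` bases of its primer-probe.

    Reads on a different chromosome from the primer-probe are off-target.
    Assumption: probe_pos is the reference coordinate of the primer-probe end
    used for the distance measurement; strand is not modeled here.
    """
    return read_chrom == probe_chrom and abs(read_pos - probe_pos) <= window


# Invented coordinates, for illustration only.
print(is_on_target("chr1", 10_283, "chr1", 10_000))   # within 1 kb
print(is_on_target("chr1", 11_700, "chr1", 10_000))   # 1.7 kb away
print(is_on_target("chr2", 10_283, "chr1", 10_000))   # different chromosome
```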
As an example of a typical gene coverage profile, we show the captured sequence data for the KRAS gene in Fig. 1c. The exon targets are sequenced at high fold-coverage relative to the off-target adjacent regions. As noted previously, OS-Seq-366 was designed to flank exons and did not tile across large regions. The average fold coverage for exons is given in Table 1 and detailed breakdowns of coverage classes (i.e. 10X, 20X) in Table 2. Overall, 83.9% of exon bases in OS-Seq-366 were covered with at least one read, with a portion of the remainder not having been intentionally targeted in this pilot assay. Similarly, among the three samples analyzed with OS-Seq-11k, 94.0 to 95.6% of exon bases were covered with at least one read. Compared to OS-Seq-366, the OS-Seq-11k assay showed increased sequence coverage on exons due to an improved primer-probe design. Specifically, the OS-Seq-11k design tiled primer-probes across exons larger than 500 bases.
The assay's target selection uniformity was also evaluated by binning Read 1 data by its associated primer-probe and counting reads aligning to its target. OS-Seq primer-probes were sorted based on the observed capture yields, and the distributions within OS-Seq-366 and OS-Seq-11k are presented in an overlay fashion in Fig. 1d. In OS-Seq-366, it was observed that 100% of the primer-probes had a yield of at least one sequence read and that the yields of 89.6% of the primer-probes were within a 10-fold range. Similarly, for OS-Seq-11k, 95.7% of primer-probes had a capture yield of at least one sequence read and 54% of the primer-probes had a yield within a 10-fold range. OS-Seq-366 oligonucleotides were column-synthesized and quantified separately prior to pooling, which ensured that each target-specific sequence was in equimolar concentration in the primer-probe construction step. The higher variance in primer-probe yields for OS-Seq-11k is most likely attributable to amplification bias introduced during PCR of the microarray-synthesized oligonucleotides used for primer-probe creation.
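The two uniformity measures quoted above (the fraction of primer-probes with at least one read, and the fraction whose yields fall within a 10-fold range) can be computed from per-probe read counts as sketched below. The text does not say how the 10-fold window was positioned; this sketch assumes it is the window that contains the most primer-probes. The function name and the example counts are invented.

```python
def uniformity(yields, fold=10):
    """Return (fraction with >= 1 read, fraction within the best `fold`-fold range).

    Assumption: the fold range is the window [x, fold * x] over non-zero
    yields that contains the largest number of primer-probes.
    """
    n = len(yields)
    captured = sum(1 for y in yields if y >= 1) / n
    positive = sorted(y for y in yields if y > 0)
    best = 0
    lo = 0
    for hi in range(len(positive)):
        # Shrink the window from the left until it spans at most `fold`-fold.
        while positive[hi] > fold * positive[lo]:
            lo += 1
        best = max(best, hi - lo + 1)
    return captured, best / n


# Invented per-probe read counts.
captured_frac, within_10fold = uniformity([0, 1, 5, 8, 10, 200])
print(captured_frac, within_10fold)
```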
The technical reproducibility of OS-Seq was evaluated by comparing the sequence yields of individual primer-probes from the OS-Seq-11k assay (Fig. 7). Multiplexed libraries (NA18507, normal and tumor) were pooled, and the capture and sequencing were performed on two independent Illumina GAIIx lanes. The sequence yields of each individual primer-probe were compared between the technical replicates, giving a correlation coefficient of R = 0.986. For evaluation of biological reproducibility, two different multiplexed sequencing libraries were run in the same lane. The correlation coefficient of the biological replicates was R = 0.90. The high reproducibility of OS-Seq is likely related to the inherent automation of the NGS system, the ability to perform the capture and sequencing steps in a single reaction volume, and the absence of post-capture PCR.
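The replicate comparison above reduces to a correlation between two vectors of per-probe yields. The text does not state which correlation coefficient was used or whether yields were transformed first; the sketch below assumes a plain Pearson coefficient on raw counts, with invented example data and a hypothetical function name.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


# Invented per-probe yields for two replicate lanes.
lane_1 = [10, 200, 35, 0, 80]
lane_2 = [20, 400, 70, 0, 160]
print(pearson_r(lane_1, lane_2))
```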
To assess the variant calling performance of the OS-Seq-366 and OS-Seq-11k assays, a targeted sequencing analysis was conducted on NA18507, a Yoruban individual who has undergone complete genome sequencing analysis. For SNV calling with either OS-Seq assay, we analyzed only on-target positions with genotype quality scores greater than 50 and a minimum of 10X coverage (Table 1). For OS-Seq-366 and OS-Seq-11k data, a total of 191 kb and 1,541 kb fulfilled these criteria, respectively. From these high quality, targeted positions, we called 105 SNVs from OS-Seq-366 and 985 SNVs from OS-Seq-11k (Table 1). We extracted the published NA18507 SNVs and other reported SNPs that occurred in these same high quality regions. In comparison, 97% of the OS-Seq-366 SNVs and 95.7% of the OS-Seq-11k SNVs had previously been reported (Table 1). For OS-Seq-366 and OS-Seq-11k, the sensitivity of variant detection was 0.97 and 0.95, respectively, based on the reported SNPs (Table 3 below).
Table 3
Figure imgf000038_0001
OS-Seq-11k analysis was also applied to genomic DNA derived from a matched normal-colorectal carcinoma tumor pair. Using the same quality and coverage criteria as for the analysis of NA18507, 871 SNVs were identified from the normal sample and 727 from the tumor (Table 4). For comparison, the two samples were genotyped with the Affymetrix SNP 6.0 array. According to previous analyses, genotyping accuracy using Affymetrix SNP 6.0 arrays and the Birdseed algorithm is high, as the average successful call rate for SNPs is 99.47% and called SNPs have a 99.74% concordance with HapMap genotypes from other platforms. In comparing the OS-Seq SNVs to Affymetrix SNPs, a high concordance of 99.8% for the normal and 99.5% for the tumor was observed. By filtering normal tissue variants and considering novel cancer-specific variants where coverage was greater than 40, a clear pathogenic nonsense mutation of SMAD4 (S144*) was identified and validated. This gene is frequently mutated in colorectal cancer and is a colon cancer driver gene.
Table 4
Figure imgf000039_0001
The capture efficiency of individual primer-probes within the OS-Seq-366 and OS-Seq-11k assays was investigated, and the performance of each primer-probe was assessed. A unique feature of OS-Seq is that captured genomic sequences can be matched to their corresponding primer-probes when sequenced with paired ends. Read 1 originates from the 3' end of the captured target and Read 2 begins at the OS-Seq primer-probe synthetic sequence. Thus, Read 1 always represents the captured genomic DNA sequence, while Read 2 functionally serves as a molecular barcode for a distinct primer-probe. This enables the identification of the exact OS-Seq primer-probe that mediated the targeting, and facilitates the assessment of the performance of individual primer-probes. For example, we observed a strong relationship between primer-probe GC content and target sequence yield (data not shown). Extremely low GC content (less than 20%) or high GC content (>70%) was associated with increasing failure of a primer-probe to capture its target sequence (Fig. 8). It is believed that the ability to directly evaluate the capture performance will be a useful primer-probe quality control measure.
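The GC-content thresholds mentioned above suggest a simple screen for candidate 40-mer capture sequences. The sketch below flags sequences below 20% or above 70% GC; it illustrates the quality-control idea only and is not the design procedure used for the assays. The function names and example sequences are invented.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a DNA sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def gc_flag(seq, low=0.20, high=0.70):
    """Classify a capture sequence by the GC thresholds quoted in the text."""
    gc = gc_fraction(seq)
    if gc < low:
        return "low_gc"
    if gc > high:
        return "high_gc"
    return "ok"


# Invented 40-mers: 10% GC, 75% GC and 50% GC.
print(gc_flag("A" * 36 + "GCGC"))
print(gc_flag("GC" * 15 + "AT" * 5))
print(gc_flag("ACGT" * 10))
```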
The OS-Seq technology was developed for streamlined and highly scalable targeted resequencing. A departure from the traditional capture methods of pre-sequencing target enrichment, OS-Seq integrates capture and sequencing of the target DNA via hybridization and selection on the solid phase support of an NGS system. This proof-of-principle study shows that the OS-Seq assay effectively and reproducibly captures target genomic regions with good uniformity and high specificity. Variant analysis of the NA18507 reference genome demonstrated high specificity and low false discovery rate for SNV determination. Targeted resequencing of matched colorectal tumor and normal samples demonstrated the applicability of OS-Seq to high-throughput genetic analysis of cancer genomes.
The OS-Seq technology enables one to create custom targeted resequencing assays. The design and production of the primer-probe oligonucleotides is relatively straightforward, and target regions can be selected simply by using balanced GC and non-repetitive sequence. Programmable microarray synthesis resources can be used to generate customized and complex oligonucleotide libraries en masse. Likewise, traditional oligonucleotide synthesis methods can be used to create customized assays for smaller target gene sets. While our largest targeting assay covered the exons and adjacent sequence of 344 genes, we believe that OS-Seq can be significantly scaled up to larger target contents. From the OS-Seq-366 data we estimated that there was over 2,000-fold excess of primer-probes compared to target fragments in the hybridization mix inside the flow cell. During the 20-hour hybridization, we estimate that 4.9% of all potential targets within the library were captured for sequencing. We have also found that the concentration of oligonucleotides can be increased at least 10-fold and the concentration of the sequencing library can be increased 5-fold (data not shown) without compromising cluster formation.
The OS-Seq sample preparation is straightforward: it can be completed in one day and is readily automated (Fig. 9). In regard to labor, using OS-Seq compares favorably to executing a shotgun sequencing experiment. Because residual adapters do not hybridize to the flow cell during capture, OS-Seq libraries can use DNA fragments of varying sizes without the necessity of narrow size purification by physical separation methods. Only a single adapter needs to be added to the 5' ends of a genomic DNA fragment. The single-adapter design also readily lends itself to indexing with the introduction of a molecular barcode. This feature allows straightforward sample multiplexing of sequencing assays and has many potential applications. For example, matched normal-tumor analysis occurs in the same capture reaction, which may reduce biases.
Given the increasing interest in "personalized medicine" there is a clear need to develop rapid and simple approaches to human genome resequencing. This includes the analysis of germline variants and the somatic mutations found in cancer genomes. As a practical and efficient approach for targeted resequencing, OS-Seq is particularly useful for translational studies and clinical diagnostics by enabling high-throughput analysis of candidate genes and identification of clinically actionable target regions.
For the method described above, an Illumina Genome Analyzer was used. However, it is anticipated that this system will be broadly applicable to any parallel sequencing platform.

Claims

What is claimed is:
1. A method of capturing and amplifying a selected sequence comprising:
a) obtaining a substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides, wherein the members of said first and second populations of surface-bound oligonucleotides are not spatially addressed on said substrate;
b) hybridizing a first member of said first population of surface-bound oligonucleotides to a selection oligonucleotide comprising a region that hybridizes with said first member and a region that contains a genomic sequence,
c) extending said first member of said first population of surface-bound oligonucleotides to produce a support-bound selection primer that comprises a sequence that is complementary to said genomic sequence;
d) hybridizing said support-bound selection primer to a nucleic acid fragment comprising said genomic sequence;
e) extending said support-bound selection primer to produce an extension product that contains a sequence that flanks said genomic sequence;
f) amplifying said extension product on said substrate using unextended members of said first and second populations of surface-bound oligonucleotides, to produce a PCR product.
2. The method of claim 1, wherein said nucleic acid fragment is an adaptor-ligated genomic fragment comprising a 5' end adaptor, wherein said extending produces an extension product that comprises, on its 3' end, a sequence that is complementary to said adaptor, and wherein members of said second population of said surface-bound oligonucleotides hybridize to said sequence that is complementary to said adaptor during said bridge PCR.
3. The method of claim 2, wherein said 5' end adaptor comprises a binding site for a sequencing primer at the end that is ligated to said genomic fragment.
4. The method of claim 1, wherein said method comprises, between steps e) and f), ligating an adaptor onto the 3' end of said extension product, and wherein members of said second population of said surface-bound oligonucleotides hybridize to said adaptor during said amplifying.
5. The method of claim 4, wherein said adaptor comprises a binding site for a sequencing primer at the end that is ligated to said nucleic acid fragment.
6. The method of claim 1, wherein said second population of surface-bound oligonucleotides are made by:
i. hybridizing members of an initial second population of surface-bound oligonucleotides to an oligonucleotide comprising a region that hybridizes with the members of said second population of surface-bound oligonucleotides and a region that is complementary to a sequence of said nucleic acid fragment; and
ii. extending said members of said initial second population of surface-bound oligonucleotides to produce said second population of surface-bound oligonucleotides.
7. The method of claim 1, wherein said selection oligonucleotide comprises a binding site for a sequencing primer between said region that hybridizes with said first member and said region that contains said genomic sequence.
8. The method of claim 1, further comprising sequencing a first strand of said PCR product to obtain at least part of the nucleotide sequence of said sequence that flanks said genomic sequence.
9. The method of claim 1, further comprising sequencing the second strand of said PCR product to obtain at least part of the nucleotide sequence of said sequence that flanks said genomic sequence.
10. The method of claim 1, wherein said method comprises fragmenting a mammalian genome to produce a fragmented genome, optionally adding adaptors to said fragmented genome, and applying said fragmented genome to said substrate.
11. The method of claim 10, wherein said fragmenting is done physically, chemically or using a restriction enzyme.
12. The method of claim 11, wherein said fragmenting is done by sonication or shearing.
13. The method of claim 10, wherein said hybridizing is done by preparing a plurality of fragmented genomes from a plurality of different individuals, pooling said plurality of fragmented genomes to produce a pool, applying said pool of fragmented genomes to said substrate, and obtaining PCR products that comprise a sequence that flanks said genomic sequence in said different individuals.
14. The method of claim 13, further comprising sequencing at least the first strand of said PCR products to obtain at least part of the nucleotide sequence of said sequence that flanks said genomic sequence in said different individuals.
15. The method of claim 13, wherein, prior to pooling, different adaptors are ligated to said fragmented genomes from said different individuals, wherein said adaptor comprises a barcode sequence that allows the source of said adaptor-ligated genomic fragment to be identified after said PCR products are sequenced.
16. The method of claim 15, wherein said method comprises:
adaptor-ligating fragmented genomic DNA from a first subject using a first adaptor that comprises a first barcode sequence to produce a first product;
adaptor-ligating fragmented genomic DNA from a second subject using a second adaptor that comprises a second barcode sequence to produce a second product;
combining said first and second products to produce a mixed template; and
performing the method of claim 1 using said mixed template to provide first and second PCR product each containing said barcode sequence.
17. The method of claim 16, wherein said mixed template comprises fragmented genomic DNA from at least 1,000 subjects.
18. The method of claim 1, comprising:
i. ligating the nucleic acid fragments to an adaptor that contains a site for a sequencing primer and a nucleotide sequence that is the same as the second surface bound oligonucleotides,
ii. hybridizing the adaptor-ligated fragments to a first member of the first population of surface-bound oligonucleotides,
iii. extending the first member of the first population of surface-bound oligonucleotides to which the adaptor ligated fragment is hybridized; and
iv. hybridizing the adaptor-containing end of the extension product to a second support bound polynucleotide, thereby producing a bridge and facilitating bridge PCR.
19. The method of claim 1, wherein said second population of surface-bound oligonucleotides are made by ligating an oligonucleotide comprising a region that is complementary to a sequence of said nucleic acid fragment to an initial second population of surface-bound oligonucleotides to produce said second population of surface-bound oligonucleotides.
20. The method of claim 1, wherein said amplification is by bridge PCR.
21. The method of claim 1, wherein said nucleic acid fragment is a genomic fragment or cDNA.
22. A system comprising:
a) a substrate comprising a first population of surface-bound oligonucleotides and a second population of surface-bound oligonucleotides, wherein the first and second populations of surface-bound oligonucleotides are not spatially addressed on the substrate;
b) a selection oligonucleotide that contains a region that hybridizes with a first member of the first population and a region that contains a genomic sequence;
c) an adaptor; and
d) instructions for performing the method of claim 1.

Priority Applications (12)

Application Number Priority Date Filing Date Title
IN522MUN2013 IN2013MN00522A (en) 2010-09-24 2011-09-21
EP11827484.4A EP2619329B1 (en) 2010-09-24 2011-09-21 Direct capture, amplification and sequencing of target dna using immobilized primers
NZ608313A NZ608313A (en) 2010-09-24 2011-09-21 Direct capture, amplification and sequencing of target dna using immobilized primers
EP19172328.7A EP3572528A1 (en) 2010-09-24 2011-09-21 Direct capture, amplification and sequencing of target dna using immobilized primers
CN201180056177.8A CN103228798B (en) 2010-09-24 2011-09-21 Use fixing primer Direct Acquisition, amplification and order-checking target DNA
CA2810931A CA2810931C (en) 2010-09-24 2011-09-21 Direct capture, amplification and sequencing of target dna using immobilized primers
RU2013118722/10A RU2565550C2 (en) 2010-09-24 2011-09-21 Direct capture, amplification and sequencing of target dna using immobilised primers
KR1020137008317A KR20130113447A (en) 2010-09-24 2011-09-21 Direct capture, amplification and sequencing of target dna using immobilized primers
JP2013530291A JP5986572B2 (en) 2010-09-24 2011-09-21 Direct capture, amplification, and sequencing of target DNA using immobilized primers
MX2013003349A MX346956B (en) 2010-09-24 2011-09-21 Direct capture, amplification and sequencing of target dna using immobilized primers.
AU2011305445A AU2011305445B2 (en) 2010-09-24 2011-09-21 Direct capture, amplification and sequencing of target DNA using immobilized primers
IL225109A IL225109A (en) 2010-09-24 2013-03-07 Direct capture, amplification and sequencing of target dna using immobilized primers

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US38639010P 2010-09-24 2010-09-24
US61/386,390 2010-09-24
US201161485062P 2011-05-11 2011-05-11
US61/485,062 2011-05-11

Publications (1)

Publication Number Publication Date
WO2012040387A1 (en) 2012-03-29

Family

ID=45874160

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2011/052645 WO2012040387A1 (en) 2010-09-24 2011-09-21 Direct capture, amplification and sequencing of target dna using immobilized primers

Country Status (13)

Country Link
US (3) US9309556B2 (en)
EP (2) EP2619329B1 (en)
JP (1) JP5986572B2 (en)
KR (1) KR20130113447A (en)
CN (1) CN103228798B (en)
AU (1) AU2011305445B2 (en)
CA (1) CA2810931C (en)
IL (1) IL225109A (en)
IN (1) IN2013MN00522A (en)
MX (1) MX346956B (en)
NZ (1) NZ608313A (en)
RU (1) RU2565550C2 (en)
WO (1) WO2012040387A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030224439A1 (en) * 2002-05-31 2003-12-04 Mike Lafferty Multiplexed systems for nucleic acid sequencing
US20050191656A1 (en) * 1999-01-06 2005-09-01 Callida Genomics, Inc. Enhanced sequencing by hybridization using pools of probes
US20060008824A1 (en) * 2004-05-20 2006-01-12 Leland Stanford Junior University Methods and compositions for clonal amplification of nucleic acid
US20080286795A1 (en) * 1997-04-01 2008-11-20 Solexa Limited Method of nucleic acid amplification
US20090117621A1 (en) * 2005-07-20 2009-05-07 Jonathan Mark Boutell Methods of nucleic acid amplification and sequencing
US20090124514A1 (en) * 2003-02-26 2009-05-14 Perlegen Sciences, Inc. Selection probe amplification

Family Cites Families (265)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3575220A (en) 1968-08-12 1971-04-20 Scientific Industries Apparatus for dispensing liquid sample
US3996345A (en) 1974-08-12 1976-12-07 Syva Company Fluorescence quenching with immunological pairs in immunoassays
US4351760A (en) 1979-09-07 1982-09-28 Syva Company Novel alkyl substituted fluorescent compounds and polyamino acid conjugates
US4458066A (en) 1980-02-29 1984-07-03 University Patents, Inc. Process for preparing polynucleotides
US4683202A (en) 1985-03-28 1987-07-28 Cetus Corporation Process for amplifying nucleic acid sequences
US4739044A (en) 1985-06-13 1988-04-19 Amgen Method for derivitization of polynucleotides
US4757141A (en) 1985-08-26 1988-07-12 Applied Biosystems, Incorporated Amino-derivatized phosphite and phosphate linking agents, phosphoramidite precursors, and useful conjugates thereof
US5618711A (en) 1986-08-22 1997-04-08 Hoffmann-La Roche Inc. Recombinant expression vectors and purification methods for Thermus thermophilus DNA polymerase
US6127155A (en) 1986-08-22 2000-10-03 Roche Molecular Systems, Inc. Stabilized thermostable nucleic acid polymerase compositions containing non-ionic polymeric detergents
US4889818A (en) 1986-08-22 1989-12-26 Cetus Corporation Purified thermostable enzyme
US6090591A (en) 1987-07-31 2000-07-18 The Board Of Trustees Of The Leland Stanford Junior University Selective amplification of target polynucleotide sequences
US5231191A (en) 1987-12-24 1993-07-27 Applied Biosystems, Inc. Rhodamine phosphoramidite compounds
AU4128089A (en) 1988-09-15 1990-03-22 Rorer International (Overseas) Inc. Monoclonal antibodies specific to human epidermal growth factor receptor and therapeutic methods employing same
US4997928A (en) 1988-09-15 1991-03-05 E. I. Du Pont De Nemours And Company Fluorescent reagents for the preparation of 5'-tagged oligonucleotides
US5210015A (en) 1990-08-06 1993-05-11 Hoffman-La Roche Inc. Homogeneous assay system using the nuclease activity of a nucleic acid polymerase
US5994056A (en) 1991-05-02 1999-11-30 Roche Molecular Systems, Inc. Homogeneous methods for nucleic acid amplification and detection
US7223833B1 (en) 1991-05-24 2007-05-29 Isis Pharmaceuticals, Inc. Peptide nucleic acid conjugates
US5981179A (en) 1991-11-14 1999-11-09 Digene Diagnostics, Inc. Continuous amplification reaction
CA2134552A1 (en) 1992-04-27 1993-11-11 George D. Sorenson Detection of gene sequences in biological fluids
DE69429038T2 (en) 1993-07-28 2002-03-21 Pe Corp Ny Norwalk Device and method for nucleic acid amplification
US5925517A (en) 1993-11-12 1999-07-20 The Public Health Research Institute Of The City Of New York, Inc. Detectably labeled dual conformation oligonucleotide probes, assays and kits
US5538848A (en) 1994-11-16 1996-07-23 Applied Biosystems Division, Perkin-Elmer Corp. Method for detecting nucleic acid amplification using self-quenching fluorescence probe
CH686982A5 (en) 1993-12-16 1996-08-15 Maurice Stroun Method for diagnosis of cancers.
US5539083A (en) 1994-02-23 1996-07-23 Isis Pharmaceuticals, Inc. Peptide nucleic acid combinatorial libraries and improved methods of synthesis
US5604097A (en) 1994-10-13 1997-02-18 Spectragen, Inc. Methods for sorting polynucleotides using oligonucleotide tags
US5585069A (en) 1994-11-10 1996-12-17 David Sarnoff Research Center, Inc. Partitioned microelectronic and fluidic device array for clinical diagnostics and chemical synthesis
US6312894B1 (en) 1995-04-03 2001-11-06 Epoch Pharmaceuticals, Inc. Hybridization and mismatch discrimination using oligonucleotides conjugated to minor groove binders
US5750341A (en) 1995-04-17 1998-05-12 Lynx Therapeutics, Inc. DNA sequencing by parallel oligonucleotide extensions
US5789206A (en) 1995-07-07 1998-08-04 Myriad Genetics, Inc. Method for ligating adaptors to nucleic acids which methods are useful for obtaining the ends of genes
US5773258A (en) 1995-08-25 1998-06-30 Roche Molecular Systems, Inc. Nucleic acid amplification using a reversibly inactivated thermostable enzyme
NO954667D0 (en) 1995-11-17 1995-11-17 Dagfinn Oegreid Method for detecting Ki-ras mutations
US6156504A (en) 1996-03-15 2000-12-05 The Penn State Research Foundation Detection of extracellular tumor-associated nucleic acid in blood plasma or serum using nucleic acid amplification assays
EP0914462A4 (en) 1996-03-18 2002-05-22 Molecular Biology Resources Target nucleic acid sequence amplification
US6759217B2 (en) 1996-03-26 2004-07-06 Oncomedx, Inc. Method enabling use of extracellular RNA extracted from plasma or serum to detect, monitor or evaluate cancer
US6458530B1 (en) 1996-04-04 2002-10-01 Affymetrix Inc. Selecting tag nucleic acids
SE9602829D0 (en) 1996-07-19 1996-07-19 Yngve Johnsson Procedure and apparatus for payment processing and means of payment
US5998140A (en) 1996-07-31 1999-12-07 The Scripps Research Institute Complex formation between dsDNA and oligomer of cyclic heterocycles
US6482795B1 (en) 1997-01-30 2002-11-19 Myriad Genetics, Inc. Tumor suppressor designated TS10q23.3
US6262242B1 (en) 1997-01-30 2001-07-17 Board Of Regents, The University Of Texas System Tumor suppressor designated TS10Q23.3
US6143496A (en) 1997-04-17 2000-11-07 Cytonix Corporation Method of sampling, amplifying and quantifying segment of nucleic acid, polymerase chain reaction assembly having nanoliter-sized sample chambers, and method of filling assembly
JPH10341047A (en) 1997-06-06 1998-12-22 Sony Corp Magnetic tunnel device
EP1019496B1 (en) 1997-07-07 2004-09-29 Medical Research Council In vitro sorting method
US6008002A (en) 1997-09-29 1999-12-28 Bodey; Bela Immunomagnetic detection and isolation of cancer cells
US5948902A (en) 1997-11-20 1999-09-07 South Alabama Medical Science Foundation Antisense oligonucleotides to human serine/threonine protein phosphatase genes
BR9907852A (en) 1998-02-12 2000-10-24 Immunivest Corp Processes to detect and enumerate rare and cancerous cells in a mixed cell population, to diagnose early stage cancer in a test patient, to determine the likelihood of cancer recurrence in a human patient previously treated for cancer, to distinguish a carcinoma confined to the organ from a carcinoma with metastatic properties, to monitor the remission status of a human cancer patient undergoing cancer therapy treatment and to increase amounts of circulating epithelial cells in a blood sample, coated magnetic particle, composition, test kits to assess a patient sample for the presence of rare circulating cells, for the presence of circulating tumor cells, for the presence of circulating breast cancer cells, for the presence of circulating prostate cancer cells, for the presence of circulating colon cancer cells, for the presence of circulating bladder cancer cells and to monitor a patient for cancer recurrence, and peripheral blood fraction enriched for circulating neoplastic cells
JPH11355695A (en) 1998-06-10 1999-12-24 Sony Corp Video signal processor
DE19833738A1 (en) 1998-07-27 2000-02-03 Michael Giesing Process for isolating cancer cells from cell-containing body fluids and kits for performing this process
US6787308B2 (en) 1998-07-30 2004-09-07 Solexa Ltd. Arrayed biomolecules and their use in sequencing
US6204375B1 (en) 1998-07-31 2001-03-20 Ambion, Inc. Methods and reagents for preserving RNA in cell and tissue samples
AR021833A1 (en) 1998-09-30 2002-08-07 Applied Research Systems Methods of amplification and sequencing of nucleic acid
US6086740A (en) 1998-10-29 2000-07-11 Caliper Technologies Corp. Multiplexed microfluidic devices and systems
US6429027B1 (en) 1998-12-28 2002-08-06 Illumina, Inc. Composite arrays utilizing microspheres
US6565727B1 (en) 1999-01-25 2003-05-20 Nanolytics, Inc. Actuators for microfluidics without moving parts
US9534254B1 (en) 1999-02-02 2017-01-03 Abbott Molecular Inc. Patient stratification for cancer therapy based on genomic DNA microarray analysis
US6395481B1 (en) 1999-02-16 2002-05-28 Arch Development Corp. Methods for detection of promoter polymorphism in a UGT gene promoter
US6410231B1 (en) 1999-02-26 2002-06-25 Incyte Genomics, Inc. SNP detection
US7324926B2 (en) 1999-04-09 2008-01-29 Whitehead Institute For Biomedical Research Methods for predicting chemosensitivity or chemoresistance
US20010034023A1 (en) 1999-04-26 2001-10-25 Stanton Vincent P. Gene sequence variations with utility in determining the treatment of disease, in genes relating to drug processing
US6492161B1 (en) 1999-06-02 2002-12-10 Prokaria Ltd. Bacteriophage RM 378 of a thermophilic host organism
US6300070B1 (en) * 1999-06-04 2001-10-09 Mosaic Technologies, Inc. Solid phase methods for amplifying multiple nucleic acids
US6524456B1 (en) 1999-08-12 2003-02-25 Ut-Battelle, Llc Microfluidic devices for the controlled manipulation of small volumes
CA2383928A1 (en) 1999-09-01 2001-03-08 Whitehead Institute For Biomedical Research Chromosome-wide analysis of protein-dna interactions
US6586177B1 (en) 1999-09-08 2003-07-01 Exact Sciences Corporation Methods for disease detection
US6849403B1 (en) 1999-09-08 2005-02-01 Exact Sciences Corporation Apparatus and method for drug screening
US7211390B2 (en) 1999-09-16 2007-05-01 454 Life Sciences Corporation Method of sequencing a nucleic acid
US7244559B2 (en) 1999-09-16 2007-07-17 454 Life Sciences Corporation Method of sequencing a nucleic acid
US7205105B2 (en) 1999-12-08 2007-04-17 Epoch Biosciences, Inc. Real-time linear detection probes: sensitive 5′-minor groove binder-containing probes for PCR analysis
AU2001231091A1 (en) 2000-01-21 2001-07-31 Vincent P. Stanton Jr. Identification of genetic components of drug response
MXPA02007317A (en) 2000-01-26 2004-07-30 Ventana Med Syst Inc A system for developing assays for personalized medicine.
AU2001241939A1 (en) 2000-02-28 2001-09-12 Maxygen, Inc. Single-stranded nucleic acid template-mediated recombination and nucleic acid fragment isolation
WO2001064958A2 (en) 2000-03-01 2001-09-07 Epoch Biosciences, Inc. Modified oligonucleotides for mismatch discrimination
US20020120409A1 (en) 2000-05-19 2002-08-29 Affymetrix, Inc. Methods for gene expression analysis
WO2001096382A2 (en) 2000-06-15 2001-12-20 Prokaria Ehf. Thermostable cellulase
US6773566B2 (en) 2000-08-31 2004-08-10 Nanolytics, Inc. Electrostatic actuators for microfluidics and methods for using same
WO2002023163A1 (en) 2000-09-15 2002-03-21 California Institute Of Technology Microfabricated crossflow devices and methods
EP1339872A2 (en) 2000-09-19 2003-09-03 Whitehead Institute For Biomedical Research Genetic markers for tumors
EP1366192B8 (en) 2000-10-24 2008-10-29 The Board of Trustees of the Leland Stanford Junior University Direct multiplex characterization of genomic dna
US20050021240A1 (en) 2000-11-02 2005-01-27 Epigenomics Ag Systems, methods and computer program products for guiding selection of a therapeutic treatment regimen based on the methylation status of the DNA
EP1349924A2 (en) 2000-11-03 2003-10-08 Dana Farber Cancer Institute Methods and compositions for the diagnosis of cancer susceptibilites and defective dna repair mechanisms and treatment thereof
EP1207209A3 (en) 2000-11-09 2002-09-11 Agilent Technologies, Inc. (a Delaware corporation) Methods using arrays for detection of single nucleotide polymorphisms
WO2002044413A2 (en) 2000-12-01 2002-06-06 Response Genetics, Inc. Method of determining epidermal growth factor receptor and her2-neu gene expression and correlation of levels thereof with survival rates
US6582919B2 (en) 2001-06-11 2003-06-24 Response Genetics, Inc. Method of determining epidermal growth factor receptor and HER2-neu gene expression and correlation of levels thereof with survival rates
US6958217B2 (en) 2001-01-24 2005-10-25 Genomic Expression Aps Single-stranded polynucleotide tags
US20020115073A1 (en) 2001-02-16 2002-08-22 Nickolas Papadopoulos Genome-based personalized medicine
WO2002068104A1 (en) 2001-02-23 2002-09-06 Japan Science And Technology Corporation Process for producing emulsion and microcapsules and apparatus therefor
US6632611B2 (en) 2001-07-20 2003-10-14 Affymetrix, Inc. Method of target enrichment and amplification
US20040110193A1 (en) 2001-07-31 2004-06-10 Gene Logic, Inc. Methods for classification of biological data
US20030165940A1 (en) 2001-12-06 2003-09-04 The Johns Hopkins University Disease detection by digital protein truncation assays
WO2003054149A2 (en) 2001-12-07 2003-07-03 University Of Massachusetts Targeted genetic risk-stratification using microarrays
JP4643909B2 (en) 2001-12-21 2011-03-02 The Wellcome Trust Limited Gene
US6949342B2 (en) 2001-12-21 2005-09-27 Whitehead Institute For Biomedical Research Prostate cancer diagnosis and outcome prediction by expression analysis
US7730063B2 (en) 2002-12-10 2010-06-01 Asset Trust, Inc. Personalized medicine service
ATE441107T1 (en) 2002-03-01 2009-09-15 Siemens Healthcare Diagnostics Assay for monitoring cancer patients based on levels of the extracellular domain (ECD) analyte of the epidermal growth factor receptor (EGFR), alone or in combination with other analytes, in body fluid samples
WO2003078450A2 (en) 2002-03-11 2003-09-25 Epoch Biosciences, Inc. Negatively charged minor groove binders
US7427480B2 (en) 2002-03-26 2008-09-23 Perlegen Sciences, Inc. Life sciences business systems and methods
US7138226B2 (en) 2002-05-10 2006-11-21 The University Of Miami Preservation of RNA and morphology in cells and tissues
US9388459B2 (en) * 2002-06-17 2016-07-12 Affymetrix, Inc. Methods for genotyping
TWI266907B (en) 2002-07-24 2006-11-21 Nitto Denko Corp Polarizing device, optical thin film using the same, and image display device using the same
WO2004027082A2 (en) 2002-09-19 2004-04-01 Applera Corporation Methods and compositions for detecting targets
AU2003260946A1 (en) 2002-09-20 2004-04-08 Prokaria Ehf. Thermostable rna ligase from thermus phage
US6911132B2 (en) 2002-09-24 2005-06-28 Duke University Apparatus for manipulating droplets by electrowetting-based techniques
WO2004046386A1 (en) 2002-11-15 2004-06-03 Genomic Health, Inc. Gene expression profiling of egfr positive cancer
US8275554B2 (en) 2002-12-20 2012-09-25 Caliper Life Sciences, Inc. System for differentiating the lengths of nucleic acids of interest in a sample
EP1587940A4 (en) 2002-12-20 2006-06-07 Caliper Life Sciences Inc Single molecule amplification and detection of dna
US20040137539A1 (en) 2003-01-10 2004-07-15 Bradford Sherry A. Cancer comprehensive method for identifying cancer protein patterns and determination of cancer treatment strategies
ES2396245T3 (en) 2003-01-29 2013-02-20 454 Life Sciences Corporation Nucleic Acid Amplification and Sequencing Method
US7041481B2 (en) 2003-03-14 2006-05-09 The Regents Of The University Of California Chemical amplification based on fluid partitioning
WO2004083460A1 (en) 2003-03-20 2004-09-30 Csi Biotech Oy Amp ligation assay (ala)
US8417459B2 (en) 2003-04-09 2013-04-09 Omicia, Inc. Methods of selection, reporting and analysis of genetic markers using broad-based genetic profiling applications
EP1623010A4 (en) 2003-04-25 2007-12-26 Janssen Pharmaceutica Nv Preservation of rna in a biological sample
JP2007507222A (en) 2003-05-28 2007-03-29 Genomic Health, Inc. Gene expression markers for predicting response to chemotherapy
RU2392324C2 (en) * 2003-09-18 2010-06-20 Symphogen A/S Method for linking sequences of interest
US20050153317A1 (en) 2003-10-24 2005-07-14 Metamorphix, Inc. Methods and systems for inferring traits to breed and manage non-beef livestock
EP1687609B1 (en) 2003-10-28 2014-12-10 Epoch Biosciences, Inc. Fluorescent probes for dna detection by hybridization with improved sensitivity and low background
EP1756137A4 (en) 2003-11-05 2007-10-31 Univ Texas Diagnostic and therapeutic methods and compositions involving pten and breast cancer
WO2005049849A2 (en) 2003-11-14 2005-06-02 Integrated Dna Technologies, Inc. Fluorescence quenching azo dyes, their methods of preparation and use
US20050181377A1 (en) 2004-02-13 2005-08-18 Markovic Svetomir N. Targeted cancer therapy
US20060195266A1 (en) 2005-02-25 2006-08-31 Yeatman Timothy J Methods for predicting cancer outcome and gene signatures for use therein
US20100216153A1 (en) 2004-02-27 2010-08-26 Helicos Biosciences Corporation Methods for detecting fetal nucleic acids and diagnosing fetal abnormalities
US20060046258A1 (en) 2004-02-27 2006-03-02 Lapidus Stanley N Applications of single molecule sequencing
JP2007527241A (en) 2004-03-01 2007-09-27 University of Chicago Polymorphisms in the epidermal growth factor receptor gene promoter
EP1730312B1 (en) 2004-03-24 2008-07-02 Applera Corporation Encoding and decoding reactions for determining target polynucleotides
EP3611273A1 (en) 2004-03-31 2020-02-19 The General Hospital Corporation Method to determine responsiveness of cancer to epidermal growth factor receptor targeting treatments
US20080176209A1 (en) 2004-04-08 2008-07-24 Biomatrica, Inc. Integration of sample storage and sample management for life science
WO2005100606A2 (en) 2004-04-09 2005-10-27 Genomic Health, Inc. Gene expression markers for predicting response to chemotherapy
US20090246756A1 (en) 2004-05-06 2009-10-01 Prokaria Ehf Thermostable polypeptide having polynucleotide kinase activity and/or phosphatase activity
JP2008504809A (en) 2004-06-04 2008-02-21 Genentech, Inc. EGFR mutation
US7329495B2 (en) 2004-06-09 2008-02-12 Board Of Regents, The University Of Texas System Mutations in KIT confer imatinib resistance in gastrointestinal stromal tumors
CA2572384A1 (en) 2004-07-01 2006-02-02 University Of Southern California Genetic markers for predicting disease and treatment outcome
EP1784646B1 (en) 2004-08-11 2012-06-13 Albert Einstein College Of Medicine Of Yeshiva University Method for identifying metastasis in motile cells
EP1805199B1 (en) 2004-10-18 2011-01-05 Brandeis University Methods for nucleic acid amplification
KR20070073917A (en) 2004-10-21 2007-07-10 New England Biolabs, Inc. Repair of nucleic acids for improved amplification
WO2006047787A2 (en) 2004-10-27 2006-05-04 Exact Sciences Corporation Method for monitoring disease progression or recurrence
US20060278241A1 (en) 2004-12-14 2006-12-14 Gualberto Ruano Physiogenomic method for predicting clinical outcomes of treatments in patients
US20060184489A1 (en) 2004-12-17 2006-08-17 General Electric Company Genetic knowledgebase creation for personalized analysis of medical conditions
US20060188909A1 (en) 2005-01-21 2006-08-24 Medical College Of Ohio Business methods for assessing nucleic acids
US7442507B2 (en) 2005-01-24 2008-10-28 New York University School Of Medicine Methods for detecting circulating mutant BRAF DNA
PL1859330T3 (en) 2005-01-28 2013-01-31 Univ Duke Apparatuses and methods for manipulating droplets on a printed circuit board
JP2008536493A (en) 2005-04-01 2008-09-11 Amgen Inc. Copy number of epidermal growth factor receptor gene
EP1712639B1 (en) 2005-04-06 2008-08-27 Maurice Stroun Method for the diagnosis of cancer by detecting circulating DNA and RNA
JP2008535508A (en) 2005-04-14 2008-09-04 Merck Patent GmbH Anti-EGFR antibody therapy based on increased copy number of the EGFR gene in tumor tissue
JP4547301B2 (en) 2005-05-13 2010-09-22 Hitachi High-Technologies Corporation Liquid transport device and analysis system
US20070020657A1 (en) 2005-05-20 2007-01-25 Grebe Stefan K Methods for detecting circulating tumor cells
EP1885878B1 (en) 2005-05-31 2010-08-11 Dako Denmark A/S Compositions and methods for predicting outcome of treatment
DK1913157T4 (en) 2005-06-28 2017-01-23 Genentech Inc EGFR and KRAS mutations for predicting patient response to EGFR inhibitor therapy.
US7993821B2 (en) 2005-08-11 2011-08-09 University Of Washington Methods and apparatus for the isolation and enrichment of circulating tumor cells
US7666593B2 (en) 2005-08-26 2010-02-23 Helicos Biosciences Corporation Single molecule sequencing of captured nucleic acids
US20070117121A1 (en) 2005-09-16 2007-05-24 Hutchison Stephen K cDNA library preparation
EP1946114A4 (en) 2005-09-21 2010-05-26 Ccc Diagnostics Llc Comprehensive diagnostic testing procedures for personalized anticancer chemotherapy (pac)
US20070172844A1 (en) 2005-09-28 2007-07-26 University Of South Florida Individualized cancer treatments
SI1948816T1 (en) 2005-10-24 2012-04-30 Johns Hopkins University Johns Hopkins Technology Transfer Improved methods for beaming
JP5198284B2 (en) 2005-12-22 2013-05-15 Keygene N.V. An improved strategy for transcript characterization using high-throughput sequencing techniques
WO2007091228A1 (en) 2006-02-07 2007-08-16 Stokes Bio Limited A liquid bridge and system
US20100304446A1 (en) 2006-02-07 2010-12-02 Stokes Bio Limited Devices, systems, and methods for amplifying nucleic acids
WO2007091230A1 (en) 2006-02-07 2007-08-16 Stokes Bio Limited A microfluidic analysis system
JP5143026B2 (en) 2006-02-16 2013-02-13 Ventana Medical Systems, Inc. Reagents and methods for cancer prognosis and pathological staging
DK1991698T3 (en) 2006-03-01 2014-03-10 Keygene Nv High-throughput sequence-based detection of SNPs using ligation assays
US8165367B2 (en) 2006-03-08 2012-04-24 Olympus Medical Systems Corp. Medical image processing apparatus and medical image processing method having three-dimensional model estimating
WO2007106432A2 (en) 2006-03-10 2007-09-20 George Mason Intellectual Properties, Inc. Egf receptor phosphorylation status for disease treatment
WO2007109571A2 (en) 2006-03-17 2007-09-27 Prometheus Laboratories, Inc. Methods of predicting and monitoring tyrosine kinase inhibitor therapy
WO2007111937A1 (en) 2006-03-23 2007-10-04 Applera Corporation Directed enrichment of genomic dna for high-throughput sequencing
SG170763A1 (en) 2006-03-27 2011-05-30 Globeimmune Inc Ras mutation and compositions and methods related thereto
TWI395754B (en) 2006-04-24 2013-05-11 Amgen Inc Humanized c-kit antibody
US8380539B2 (en) 2006-05-09 2013-02-19 University Of Louisville Research Foundation, Inc. Personalized medicine management software
EP2047910B1 (en) 2006-05-11 2012-01-11 Raindance Technologies, Inc. Microfluidic device and method
MX2008014608A (en) 2006-05-18 2009-03-31 Molecular Profiling Inst Inc System and method for determining individualized medical intervention for a disease state.
US20100113299A1 (en) 2008-10-14 2010-05-06 Von Hoff Daniel D Gene and gene expressed protein targets depicting biomarker patterns and signature sets by tumor type
US8768629B2 (en) 2009-02-11 2014-07-01 Caris Mpi, Inc. Molecular profiling of tumors
US20080124721A1 (en) 2006-06-14 2008-05-29 Martin Fuchs Analysis of rare cell-enriched samples
WO2008015396A2 (en) 2006-07-31 2008-02-07 Solexa Limited Method of library preparation avoiding the formation of adaptor dimers
US20080065411A1 (en) 2006-09-08 2008-03-13 Diaceutics Method and system for developing a personalized medicine business plan
SG174826A1 (en) 2006-09-19 2011-10-28 Novartis Ag Biomarkers of target modulation, efficacy, diagnosis and/or prognosis for raf inhibitors
EP2069070B1 (en) 2006-09-28 2013-11-27 Stokes Bio Limited A qpcr analysis apparatus
US8568979B2 (en) 2006-10-10 2013-10-29 Illumina, Inc. Compositions and methods for representational selection of nucleic acids from complex mixtures using hybridization
SG175680A1 (en) 2006-10-27 2011-11-28 Decode Genetics Ehf Cancer susceptibility variants on chr8q24.21
EP2089055A4 (en) 2006-11-01 2011-02-02 George Mason Intellectual Prop Method for detecting and controlling cancer
US20100196889A1 (en) 2006-11-13 2010-08-05 Bankaitis-Davis Danute M Gene Expression Profiling for Identification, Monitoring and Treatment of Colorectal Cancer
US20080131887A1 (en) 2006-11-30 2008-06-05 Stephan Dietrich A Genetic Analysis Systems and Methods
US20100143935A1 (en) 2006-12-01 2010-06-10 Apocell, Inc. c-KIT Phosphorylation in Cancer
US8262900B2 (en) 2006-12-14 2012-09-11 Life Technologies Corporation Methods and apparatus for measuring analytes using large scale FET arrays
EP2677309B9 (en) 2006-12-14 2014-11-19 Life Technologies Corporation Methods for sequencing a nucleic acid using large scale FET arrays, configured to measure a limited pH range
US20080177608A1 (en) 2007-01-19 2008-07-24 Diaceutics Business method for enabling personalized medicine
AU2008221468A1 (en) 2007-02-26 2008-09-04 John Wayne Cancer Institute Utility of B-RAF DNA mutation in diagnosis and treatment of cancer
US20100166747A1 (en) 2007-03-02 2010-07-01 Beltran Pedro J Methods and compositions for treating tumor diseases
SI2412828T1 (en) 2007-03-13 2013-10-30 Amgen Inc. K-ras and B-raf mutations and anti-EGFr antibody therapy
HUE033695T2 (en) 2007-03-13 2017-12-28 Amgen Inc K-ras mutations and anti-egfr antibody therapy
US20090258795A1 (en) 2007-03-15 2009-10-15 Genomic Health, Inc. Gene expression markers for prediction of patient response to chemotherapy
CA2680593A1 (en) 2007-03-19 2008-09-25 Cold Spring Harbor Laboratory Identification of genetic alterations that modulate drug sensitivity in cancer treatments
EP2137535B1 (en) 2007-04-13 2015-06-03 Dana-Farber Cancer Institute, Inc. Receptor tyrosine kinase profiling
US20080255243A1 (en) 2007-04-13 2008-10-16 Petricoin Emanuel F Stat3 as a theranostic indicator
WO2008147879A1 (en) 2007-05-22 2008-12-04 Ryan Golhar Automated method and device for dna isolation, sequence determination, and identification
WO2008148072A2 (en) 2007-05-24 2008-12-04 The Brigham And Women's Hospital, Inc. Disease-associated genetic variations and methods for obtaining and using same
ES2559313T3 (en) 2007-06-19 2016-02-11 Stratos Genomics Inc. High performance nucleic acid sequencing by expansion
GB0712882D0 (en) * 2007-07-03 2007-08-15 Leicester University Of Nucleic acid amplification
BRPI0813583A2 (en) 2007-07-13 2014-12-30 Prometheus Lab Inc Methods for selecting an anticancer drug, identifying a lung tumor response, and predicting a patient's response, and array
EP2527471B1 (en) 2007-07-23 2020-03-04 The Chinese University of Hong Kong Diagnosing cancer using genomic sequencing
US8043815B2 (en) 2007-08-06 2011-10-25 Health Research, Inc. Methods for analysis of PDEF and survivin as interconnected cancer biomarkers and targets for personalized medicine
US20090062138A1 (en) 2007-08-31 2009-03-05 Curry Bo U Array-based method for performing SNP analysis
US20110212991A1 (en) 2007-09-11 2011-09-01 Roche Molecular Systems, Inc. Diagnostic Test for Susceptibility to B-RAF Kinase Inhibitors
US20100173294A1 (en) 2007-09-11 2010-07-08 Roche Molecular Systems, Inc. Diagnostic test for susceptibility to b-raf kinase inhibitors
US9388457B2 (en) * 2007-09-14 2016-07-12 Affymetrix, Inc. Locus specific amplification using array probes
US20090209436A1 (en) * 2007-09-17 2009-08-20 Twof, Inc. Hydrogel labeled primer extension method for microarrays
CN101889074A (en) 2007-10-04 2010-11-17 Halcyon Molecular, Inc. Sequencing nucleic acid polymers with electron microscopy
EP2053132A1 (en) 2007-10-23 2009-04-29 Roche Diagnostics GmbH Enrichment and sequence analysis of genomic regions
US20090264298A1 (en) 2007-11-06 2009-10-22 Ambergen, Inc. Methods for enriching subpopulations
EP2063132A1 (en) * 2007-11-24 2009-05-27 Festo AG & Co. KG Linear drive device
JP2011509095A (en) 2008-01-09 2011-03-24 Life Technologies Corporation Method for producing a library of paired tags for nucleic acid sequencing
EP3699291A1 (en) 2008-01-17 2020-08-26 Sequenom, Inc. Single molecule nucleic acid sequence analysis processes and compositions
US11051733B2 (en) 2008-01-18 2021-07-06 Wake Forest University Health Sciences Isolating and purifying cells for therapy
US20110053157A1 (en) 2008-02-01 2011-03-03 The General Hospital Corporation Use of microvesicles in diagnosis, prognosis and treatment of medical diseases and conditions
EP2245198A1 (en) 2008-02-04 2010-11-03 Massachusetts Institute of Technology Selection of nucleic acids by solution hybridization to oligonucleotide baits
AU2009213689B2 (en) 2008-02-14 2014-11-20 Decode Genetics Ehf. Susceptibility variants for lung cancer
WO2009102957A2 (en) 2008-02-14 2009-08-20 The Johns Hopkins University Methods to connect gene set expression profiles to drug sensitivity
DK2250498T3 (en) 2008-02-25 2013-02-04 Nestec Sa Pharmaceutical choices for breast cancer therapy using antibody-based arrays
US8999642B2 (en) 2008-03-10 2015-04-07 Illumina, Inc. Methods for selecting and amplifying polynucleotides
US20090226975A1 (en) * 2008-03-10 2009-09-10 Illumina, Inc. Constant cluster seeding
WO2009114836A1 (en) 2008-03-14 2009-09-17 Genomic Health, Inc. Gene expression markers for prediction of patient response to chemotherapy
WO2009117122A2 (en) 2008-03-19 2009-09-24 Existence Genetics Llc Genetic analysis
US7842248B2 (en) 2008-06-20 2010-11-30 Silverbrook Research Pty Ltd Microfluidic system comprising microfluidic pump, mixer or valve
US8198028B2 (en) 2008-07-02 2012-06-12 Illumina Cambridge Limited Using populations of beads for the fabrication of arrays on surfaces
US20100041048A1 (en) 2008-07-31 2010-02-18 The Johns Hopkins University Circulating Mutant DNA to Assess Tumor Dynamics
WO2010021936A1 (en) 2008-08-16 2010-02-25 The Board Of Trustees Of The Leland Stanford Junior University Digital pcr calibration for high throughput sequencing
JP2010041985A (en) * 2008-08-18 2010-02-25 Hitachi Plant Technologies Ltd Method for amplifying nucleic acid sequence, method for detecting nucleic acid sequence and substrate for nucleic acid amplification and detection
EP2318552B1 (en) 2008-09-05 2016-11-23 TOMA Biosciences, Inc. Methods for stratifying and annotating cancer drug treatment options
EP3964821A1 (en) 2008-09-23 2022-03-09 Bio-Rad Laboratories, Inc. Droplet-based assay system
US20100137143A1 (en) * 2008-10-22 2010-06-03 Ion Torrent Systems Incorporated Methods and apparatus for measuring analytes
WO2010059323A2 (en) 2008-11-18 2010-05-27 Life Technologies Corporation Sequence amplification with target primers
US10262103B2 (en) 2008-11-18 2019-04-16 Raphael LEHRER Individualized cancer treatment
EP2376659B1 (en) 2008-12-17 2015-12-02 Life Technologies Corporation Methods, compositions, and kits for detecting allelic variants
US8790873B2 (en) 2009-01-16 2014-07-29 Affymetrix, Inc. DNA ligation on RNA template
CA2751470C (en) 2009-02-16 2016-07-26 Epicentre Technologies Corporation Template-independent ligation of single-stranded dna
US20100286143A1 (en) 2009-04-24 2010-11-11 Dora Dias-Santagata Methods and materials for genetic analysis of tumors
EP2430441B1 (en) 2009-04-29 2018-06-13 Complete Genomics, Inc. Method and system for calling variations in a sample polynucleotide sequence with respect to a reference polynucleotide sequence
WO2012083189A2 (en) 2010-12-17 2012-06-21 Life Technologies Corporation Methods, compositions, systems, apparatuses and kits for nucleic acid amplification
WO2011032040A1 (en) 2009-09-10 2011-03-17 Centrillion Technology Holding Corporation Methods of targeted sequencing
WO2011050981A2 (en) * 2009-10-30 2011-05-05 Roche Diagnostics Gmbh Method for detecting balanced chromosomal aberrations in a genome
US20110117559A1 (en) 2009-11-13 2011-05-19 Integrated Dna Technologies, Inc. Small rna detection assays
EP2504448B1 (en) 2009-11-25 2016-10-19 Bio-Rad Laboratories, Inc. Methods and compositions for detecting genetic material
US20110157322A1 (en) 2009-12-31 2011-06-30 Broadcom Corporation Controlling a pixel array to support an adaptable light manipulator
EP2534267B1 (en) 2010-02-12 2018-04-11 Raindance Technologies, Inc. Digital analyte analysis
ES2565563T3 (en) 2010-02-25 2016-04-05 Advanced Liquid Logic, Inc. Method for preparing nucleic acid libraries
US20130143276A1 (en) 2010-04-01 2013-06-06 New England Biolabs, Inc. Compositions and Methods for Adenylating Oligonucleotides
US9255291B2 (en) 2010-05-06 2016-02-09 Bioo Scientific Corporation Oligonucleotide ligation methods for improving data quality and throughput using massively parallel sequencing
US20110299645A1 (en) 2010-06-07 2011-12-08 Korea Hydro & Nuclear Power Co., Ltd. Breeding Nuclear Fuel Mixture Using Metallic Thorium
US20120003657A1 (en) 2010-07-02 2012-01-05 Samuel Myllykangas Targeted sequencing library preparation by genomic dna circularization
CA2806670A1 (en) 2010-07-26 2012-02-09 Biomatrica, Inc. Compositions for stabilizing dna, rna and proteins in blood and other biological samples during shipping and storage at ambient temperatures
CA2810931C (en) 2010-09-24 2018-04-17 The Board Of Trustees Of The Leland Stanford Junior University Direct capture, amplification and sequencing of target dna using immobilized primers
US9353406B2 (en) 2010-10-22 2016-05-31 Fluidigm Corporation Universal probe assay methods
SG191818A1 (en) 2010-12-30 2013-08-30 Foundation Medicine Inc Optimization of multigene analysis of tumor samples
WO2012103154A1 (en) 2011-01-24 2012-08-02 Nugen Technologies, Inc. Stem-loop composite rna-dna adaptor-primers: compositions and methods for library generation, amplification and other downstream manipulations
ES2625288T3 (en) 2011-04-15 2017-07-19 The Johns Hopkins University Secure Sequencing System
BR112014005205A2 (en) 2011-09-07 2017-03-21 X-Chem Inc methods for tagging dna encoded libraries
EP2768985B1 (en) 2011-10-21 2019-03-20 Chronix Biomedical Colorectal cancer associated circulating nucleic acid biomarkers
EP3578697B1 (en) 2012-01-26 2024-03-06 Tecan Genomics, Inc. Compositions and methods for targeted nucleic acid sequence enrichment and high efficiency library generation
US20130252835A1 (en) 2012-01-27 2013-09-26 Lian Chye Winston Koh Methods for profiling and quantitating cell-free rna
WO2013119690A1 (en) 2012-02-06 2013-08-15 Wisconsin Alumni Research Foundation Nucleic acid ligation method
WO2013173472A1 (en) 2012-05-15 2013-11-21 Predictive Biosciences, Inc. Methods of assessing chromosomal instabilities
US11261494B2 (en) 2012-06-21 2022-03-01 The Chinese University Of Hong Kong Method of measuring a fractional concentration of tumor DNA
US9683230B2 (en) 2013-01-09 2017-06-20 Illumina Cambridge Limited Sample preparation on a solid support
WO2014110272A1 (en) 2013-01-09 2014-07-17 The Penn State Research Foundation Low sequence bias single-stranded dna ligation
US20140287937A1 (en) 2013-02-21 2014-09-25 Toma Biosciences, Inc. Methods for assessing cancer
WO2014130890A1 (en) 2013-02-21 2014-08-28 Toma Biosciences, Inc. Methods, compositions, and kits for nucleic acid analysis
US9217167B2 (en) 2013-07-26 2015-12-22 General Electric Company Ligase-assisted nucleic acid circularization and amplification
AU2014362227B2 (en) 2013-12-11 2021-05-13 Accuragen Holdings Limited Compositions and methods for detecting rare sequence variants
CN108885648A (en) 2016-02-09 2018-11-23 Toma Biosciences, Inc. System and method for analyzing nucleic acid

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080286795A1 (en) * 1997-04-01 2008-11-20 Solexa Limited Method of nucleic acid amplification
US20050191656A1 (en) * 1999-01-06 2005-09-01 Callida Genomics, Inc. Enhanced sequencing by hybridization using pools of probes
US20030224439A1 (en) * 2002-05-31 2003-12-04 Mike Lafferty Multiplexed systems for nucleic acid sequencing
US20090124514A1 (en) * 2003-02-26 2009-05-14 Perlegen Sciences, Inc. Selection probe amplification
US20060008824A1 (en) * 2004-05-20 2006-01-12 Leland Stanford Junior University Methods and compositions for clonal amplification of nucleic acid
US20090117621A1 (en) * 2005-07-20 2009-05-07 Jonathan Mark Boutell Methods of nucleic acid amplification and sequencing

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP2619329A1 *

Cited By (82)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11840730B1 (en) 2009-04-30 2023-12-12 Molecular Loop Biosciences, Inc. Methods and compositions for evaluating genetic markers
US11768200B2 (en) 2010-12-23 2023-09-26 Molecular Loop Biosciences, Inc. Methods for maintaining the integrity and identification of a nucleic acid template in a multiplex sequencing reaction
US11041852B2 (en) 2010-12-23 2021-06-22 Molecular Loop Biosciences, Inc. Methods for maintaining the integrity and identification of a nucleic acid template in a multiplex sequencing reaction
US11041851B2 (en) 2010-12-23 2021-06-22 Molecular Loop Biosciences, Inc. Methods for maintaining the integrity and identification of a nucleic acid template in a multiplex sequencing reaction
US9228233B2 (en) 2011-10-17 2016-01-05 Good Start Genetics, Inc. Analysis methods
US9822409B2 (en) 2011-10-17 2017-11-21 Good Start Genetics, Inc. Analysis methods
US10370710B2 (en) 2011-10-17 2019-08-06 Good Start Genetics, Inc. Analysis methods
WO2013117595A3 (en) * 2012-02-07 2013-10-03 Illumina Cambridge Limited Targeted enrichment and amplification of nucleic acids on a support
US10604799B2 (en) 2012-04-04 2020-03-31 Molecular Loop Biosolutions, Llc Sequence assembly
US11149308B2 (en) 2012-04-04 2021-10-19 Invitae Corporation Sequence assembly
US11155863B2 (en) 2012-04-04 2021-10-26 Invitae Corporation Sequence assembly
US11667965B2 (en) 2012-04-04 2023-06-06 Invitae Corporation Sequence assembly
US9298804B2 (en) 2012-04-09 2016-03-29 Good Start Genetics, Inc. Variant database
US8812422B2 (en) 2012-04-09 2014-08-19 Good Start Genetics, Inc. Variant database
US10683533B2 (en) 2012-04-16 2020-06-16 Molecular Loop Biosolutions, Llc Capture reactions
US10227635B2 (en) 2012-04-16 2019-03-12 Molecular Loop Biosolutions, Llc Capture reactions
WO2013158540A1 (en) * 2012-04-16 2013-10-24 Good Start Genetics, Inc. Capture reactions
US9487828B2 (en) 2012-05-10 2016-11-08 The General Hospital Corporation Methods for determining a nucleotide sequence contiguous to a known target nucleotide sequence
US10017810B2 (en) 2012-05-10 2018-07-10 The General Hospital Corporation Methods for determining a nucleotide sequence contiguous to a known target nucleotide sequence
US11781179B2 (en) 2012-05-10 2023-10-10 The General Hospital Corporation Methods for determining a nucleotide sequence contiguous to a known target nucleotide sequence
US10718009B2 (en) 2012-05-10 2020-07-21 The General Hospital Corporation Methods for determining a nucleotide sequence contiguous to a known target nucleotide sequence
JP6234629B1 (en) * 2012-07-17 2017-11-22 カウンシル,インコーポレーテッド System and method for detecting genetic variation
JP2018038417A (en) * 2012-07-17 2018-03-15 カウンシル,インコーポレーテッド Systems and methods for detecting genetic variations
EP3243937A1 (en) * 2012-07-17 2017-11-15 Counsyl, Inc. System and methods for detecting genetic variation
JP2015531588A (en) * 2012-07-17 2015-11-05 カウンシル,インコーポレーテッド System and method for detecting genetic variation
JP2018019701A (en) * 2012-07-17 2018-02-08 カウンシル,インコーポレーテッド Systems and methods for detecting genetic variations
EP2875173A4 (en) * 2012-07-17 2015-12-30 Counsyl Inc System and methods for detecting genetic variation
CN103571822B (en) * 2012-07-20 2016-03-30 中国科学院植物研究所 Multipurpose DNA segment enrichment method used for next generation sequencing
CN103571822A (en) * 2012-07-20 2014-02-12 中国科学院植物研究所 Multipurpose DNA segment enrichment method used for next generation sequencing
JP7119014B2 (en) 2012-09-04 2022-08-16 ガーダント ヘルス, インコーポレイテッド Systems and methods for detecting rare mutations and copy number variations
JP2020103298A (en) * 2012-09-04 2020-07-09 ガーダント ヘルス, インコーポレイテッド Systems and methods to detect rare mutations and copy number variation
US10907149B2 (en) 2012-12-10 2021-02-02 Resolution Bioscience, Inc. Methods for targeted genomic analysis
EP2929056A4 (en) * 2012-12-10 2016-11-09 Resolution Bioscience Inc Methods for targeted genomic analysis
CN105531375B (en) * 2012-12-10 2020-03-03 分析生物科学有限公司 Method for targeted genomic analysis
CN111254500A (en) * 2012-12-10 2020-06-09 分析生物科学有限公司 Method for targeted genomic analysis
CN111254500B (en) * 2012-12-10 2024-01-23 分析生物科学有限公司 Methods of targeted genomic analysis
US9932576B2 (en) 2012-12-10 2018-04-03 Resolution Bioscience, Inc. Methods for targeted genomic analysis
CN105531375A (en) * 2012-12-10 2016-04-27 分析生物科学有限公司 Methods for targeted genomic analysis
JP2019170390A (en) * 2013-02-21 2019-10-10 トマ バイオサイエンシーズ, インコーポレイテッド Method, composition and kit for nucleic acid analysis
JP2016513959A (en) * 2013-02-21 2016-05-19 トマ バイオサイエンシーズ, インコーポレイテッド Methods, compositions and kits for nucleic acid analysis
US10202637B2 (en) 2013-03-14 2019-02-12 Molecular Loop Biosolutions, Llc Methods for analyzing nucleic acid
US9677124B2 (en) 2013-03-14 2017-06-13 Good Start Genetics, Inc. Methods for analyzing nucleic acids
US9115387B2 (en) 2013-03-14 2015-08-25 Good Start Genetics, Inc. Methods for analyzing nucleic acids
US10706017B2 (en) 2013-06-03 2020-07-07 Good Start Genetics, Inc. Methods and systems for storing sequence read data
US9535920B2 (en) 2013-06-03 2017-01-03 Good Start Genetics, Inc. Methods and systems for storing sequence read data
US10851414B2 (en) 2013-10-18 2020-12-01 Good Start Genetics, Inc. Methods for determining carrier status
US11041203B2 (en) 2013-10-18 2021-06-22 Molecular Loop Biosolutions, Inc. Methods for assessing a genomic region of a subject
US9677132B2 (en) 2014-01-16 2017-06-13 Illumina, Inc. Polynucleotide modification on solid support
US10865444B2 (en) 2014-01-16 2020-12-15 Illumina, Inc. Amplicon preparation and sequencing on solid supports
US9944924B2 (en) 2014-01-16 2018-04-17 Illumina, Inc. Polynucleotide modification on solid support
WO2015106941A1 (en) * 2014-01-16 2015-07-23 Illumina Cambridge Limited Polynucleotide modification on solid support
US11807897B2 (en) 2014-01-27 2023-11-07 The General Hospital Corporation Methods of preparing nucleic acids for sequencing
US10450597B2 (en) 2014-01-27 2019-10-22 The General Hospital Corporation Methods of preparing nucleic acids for sequencing
US11053548B2 (en) 2014-05-12 2021-07-06 Good Start Genetics, Inc. Methods for detecting aneuploidy
US11408024B2 (en) 2014-09-10 2022-08-09 Molecular Loop Biosciences, Inc. Methods for selectively suppressing non-target sequences
US10429399B2 (en) 2014-09-24 2019-10-01 Good Start Genetics, Inc. Process control for increased robustness of genetic assays
US11680284B2 (en) 2015-01-06 2023-06-20 Molecular Loop Biosciences, Inc. Screening for structural variants
US10066259B2 (en) 2015-01-06 2018-09-04 Good Start Genetics, Inc. Screening for structural variants
US10329616B2 (en) * 2015-10-28 2019-06-25 Republic Of Korea (National Forensic Service Director, Ministry Of Public Administration & Security) Primer set for preparation of NGS library and method and kit for making NGS library using the same
US11339391B2 (en) 2015-11-11 2022-05-24 Resolution Bioscience, Inc. High efficiency construction of DNA libraries
US10961573B2 (en) 2016-03-28 2021-03-30 Boreal Genomics, Inc. Linked duplex target capture
EP3436607A4 (en) * 2016-03-28 2019-10-30 Boreal Genomics, Inc. Linked duplex target capture
US11021742B2 (en) 2016-03-28 2021-06-01 Boreal Genomics, Inc. Linked-fragment sequencing
US10961568B2 (en) 2016-03-28 2021-03-30 Boreal Genomics, Inc. Linked target capture
US11905556B2 (en) 2016-03-28 2024-02-20 Ncan Genomics, Inc. Linked target capture
EP4282974A3 (en) * 2016-03-28 2024-03-13 Ncan Genomics, Inc. Linked duplex target capture
US11708574B2 (en) 2016-06-10 2023-07-25 Myriad Women's Health, Inc. Nucleic acid sequencing adapters and uses thereof
US11725206B2 (en) 2016-08-05 2023-08-15 Bio-Rad Laboratories, Inc. Second strand direct
US10876112B2 (en) 2016-08-05 2020-12-29 Bio-Rad Laboratories, Inc. Second strand direct
US10676736B2 (en) 2016-08-05 2020-06-09 Bio-Rad Laboratories, Inc. Second strand direct
WO2018027048A1 (en) * 2016-08-05 2018-02-08 Bio-Rad Laboratories, Inc. Second strand direct
US11319594B2 (en) 2016-08-25 2022-05-03 Resolution Bioscience, Inc. Methods for the detection of genomic copy changes in DNA samples
US11795492B2 (en) 2016-09-15 2023-10-24 ArcherDX, LLC. Methods of nucleic acid sample preparation
US11390905B2 (en) 2016-09-15 2022-07-19 Archerdx, Llc Methods of nucleic acid sample preparation for analysis of DNA
US11854666B2 (en) 2016-09-29 2023-12-26 Myriad Women's Health, Inc. Noninvasive prenatal screening using dynamic iterative depth optimization
US11268137B2 (en) 2016-12-09 2022-03-08 Boreal Genomics, Inc. Linked ligation
US11879151B2 (en) 2016-12-09 2024-01-23 Ncan Genomics, Inc. Linked ligation
US10968447B2 (en) 2017-01-31 2021-04-06 Myriad Women's Health, Inc. Methods and compositions for enrichment of target polynucleotides
US10752946B2 (en) 2017-01-31 2020-08-25 Myriad Women's Health, Inc. Methods and compositions for enrichment of target polynucleotides
US11339431B2 (en) 2017-01-31 2022-05-24 Myriad Women's Health, Inc. Methods and compositions for enrichment of target polynucleotides
US11232850B2 (en) 2017-03-24 2022-01-25 Myriad Genetics, Inc. Copy number variant caller
US11473136B2 (en) 2019-01-03 2022-10-18 Ncan Genomics, Inc. Linked target capture

Also Published As

Publication number Publication date
AU2011305445B2 (en) 2017-03-16
MX346956B (en) 2017-04-06
US20190024141A1 (en) 2019-01-24
US20120157322A1 (en) 2012-06-21
EP2619329B1 (en) 2019-05-22
CN103228798B (en) 2015-12-09
US20150017635A1 (en) 2015-01-15
US10072283B2 (en) 2018-09-11
EP3572528A1 (en) 2019-11-27
EP2619329A4 (en) 2014-03-05
JP2013544498A (en) 2013-12-19
JP5986572B2 (en) 2016-09-06
AU2011305445A1 (en) 2013-04-04
US9309556B2 (en) 2016-04-12
CA2810931C (en) 2018-04-17
EP2619329A1 (en) 2013-07-31
KR20130113447A (en) 2013-10-15
NZ608313A (en) 2013-12-20
RU2013118722A (en) 2014-10-27
IN2013MN00522A (en) 2015-05-29
IL225109A (en) 2017-05-29
CN103228798A (en) 2013-07-31
RU2565550C2 (en) 2015-10-20
MX2013003349A (en) 2013-09-13
CA2810931A1 (en) 2012-03-29

Similar Documents

Publication Publication Date Title
US20190024141A1 (en) Direct Capture, Amplification and Sequencing of Target DNA Using Immobilized Primers
US20220042090A1 (en) PROGRAMMABLE RNA-TEMPLATED SEQUENCING BY LIGATION (rSBL)
KR102592367B1 (en) Systems and methods for clonal replication and amplification of nucleic acid molecules for genomic and therapeutic applications
WO2018195217A1 (en) Compositions and methods for library construction and sequence analysis
US10465241B2 (en) High resolution STR analysis using next generation sequencing
CA3011342A1 (en) Deep sequencing profiling of tumors
US20220267848A1 (en) Detection and quantification of rare variants with low-depth sequencing via selective allele enrichment or depletion
CN110869515A (en) Sequencing method for genome rearrangement detection
US11898202B2 (en) Methods for accurate parallel quantification of nucleic acids in dilute or non-purified samples
EP4060053A1 (en) Highly sensitive methods for accurate parallel quantification of nucleic acids
EP4332235A1 (en) Highly sensitive methods for accurate parallel quantification of variant nucleic acids
EP4332238A1 (en) Methods for accurate parallel detection and quantification of nucleic acids

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application. Ref document number: 11827484; Country of ref document: EP; Kind code of ref document: A1
ENP Entry into the national phase. Ref document number: 2810931; Country of ref document: CA
WWE WIPO information: entry into national phase. Ref document number: 225109; Country of ref document: IL
ENP Entry into the national phase. Ref document number: 2013530291; Country of ref document: JP; Kind code of ref document: A
WWE WIPO information: entry into national phase. Ref document number: 2011827484; Country of ref document: EP
WWE WIPO information: entry into national phase. Ref document number: MX/A/2013/003349; Country of ref document: MX
NENP Non-entry into the national phase. Ref country code: DE
ENP Entry into the national phase. Ref document number: 20137008317; Country of ref document: KR; Kind code of ref document: A
ENP Entry into the national phase. Ref document number: 2011305445; Country of ref document: AU; Date of ref document: 20110921; Kind code of ref document: A
ENP Entry into the national phase. Ref document number: 2013118722; Country of ref document: RU; Kind code of ref document: A