WO2008045136A2 - Nucleic acid size detection method - Google Patents

Nucleic acid size detection method Download PDF

Info

Publication number
WO2008045136A2
WO2008045136A2 PCT/US2007/008985 US2007008985W WO2008045136A2 WO 2008045136 A2 WO2008045136 A2 WO 2008045136A2 US 2007008985 W US2007008985 W US 2007008985W WO 2008045136 A2 WO2008045136 A2 WO 2008045136A2
Authority
WO
WIPO (PCT)
Prior art keywords
tandem repeat
segment
nucleic acid
fraction
repeats
Prior art date
Application number
PCT/US2007/008985
Other languages
French (fr)
Other versions
WO2008045136A3 (en
Inventor
Donghui Huang
Charles M. Strom
Steven J. Potts
Jenny Ellen Rooke
Original Assignee
Quest Diagnostics Investments Incorporated
U.S. Genomics
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Quest Diagnostics Investments Incorporated, U.S. Genomics filed Critical Quest Diagnostics Investments Incorporated
Priority to AT07755301T priority Critical patent/ATE515573T1/en
Priority to US12/444,361 priority patent/US8697399B2/en
Priority to BRPI0720549-0A priority patent/BRPI0720549A2/en
Priority to JP2009531370A priority patent/JP5386357B2/en
Priority to AU2007307320A priority patent/AU2007307320B2/en
Priority to EP07755301A priority patent/EP2082055B1/en
Priority to CA002665723A priority patent/CA2665723A1/en
Priority to MX2009003736A priority patent/MX2009003736A/en
Publication of WO2008045136A2 publication Critical patent/WO2008045136A2/en
Publication of WO2008045136A3 publication Critical patent/WO2008045136A3/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6813Hybridisation assays
    • C12Q1/6827Hybridisation assays for detection of mutation or polymorphism
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • C12Q1/6844Nucleic acid amplification reactions
    • C12Q1/6858Allele-specific amplification
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10TTECHNICAL SUBJECTS COVERED BY FORMER US CLASSIFICATION
    • Y10T436/00Chemistry: analytical and immunological testing
    • Y10T436/17Nitrogen containing

Definitions

  • the present invention relates generally to the field of medical diagnostics.
  • the present invention relates to methods of detecting genetic mutations characterized by an expansion of tandem repeats.
  • a tandem repeat in DNA represents two or more contiguous approximate copies of a pattern of nucleotides. Tandem repeats have been shown to be associated cause a variety of human diseases. Dramatic expansion of trinucleotide repeats has been associated with such diseases as fragile-X mental retardation (see Verkerk, et al., (1991) Cell, 65, 905-914), Huntington's disease (see Huntington's Disease Collaborative Research Group.
  • Fragile X syndrome is one of the most common causes of inherited mental retardation, occurring in approximately one in 1,250 males and approximately one in 2,500 females.
  • Males with fragile X syndrome typically exhibit some degree of mental impairment, ranging from learning disabilities to mental retardation to autism.
  • Characteristic physical features e.g., enlarged ears, elongated face with prominent chin
  • connective tissue problems e.g., mitral valve prolapse, and double-jointed fingers
  • characteristic behaviors e.g., attention deficit disorders, speech disturbances, and unusual responses to various touch, auditory, or visual stimuli
  • Affected females present with similar but milder mental impairment, physical characteristics, and behavioral characteristics as those of affected males.
  • the mutation responsible for fragile X syndrome involves expansion of a trinucleotide (CGG) tandem repeat sequence located in the 5' untranslated region of the FMRl gene on the X chromosome.
  • CGG trinucleotide
  • the number of CGG repeats in the FMRl gene determines whether an individual is normal or has one of the two categories of mutation: premutation and full mutation.
  • the number of repeats ranges from less than 55 repeats in normal, non-carrier individuals, whereas a premutation consists of 55 to 200 repeats and full mutation consists of more than 200 repeats (Chen et al. Hum. MoI. Genetics 12(23):3067-74, 2003).
  • Both males having a premutation and females having a premutation in one FMRl gene are carriers but are unaffected.
  • Male carriers are referred to as "normal transmitting" males, and pass on the mutation, relatively unchanged in size to each daughter. Although such daughters are unaffected, they are at risk of having affected offspring because a premutation is susceptible to expansion after passage through a female meiosis. Furthermore, the larger the premutation, the higher the risk of expansion to a full mutation in any offspring.
  • determining the size of a particular nucleic acid segment of interest in a sample of nucleic acids is accomplished by separating fragments of a nucleic acid, wherein the fragments are prepared from a nucleic acid-containing sample, wherein the fragments include some which contain the segment and a marker sequence, wherein separating is into fractions according to size under conditions in which a fragment containing the segment will be located in the fractions according to the size of the segment, and identifying those fraction(s) containing the segment by detecting the marker sequence, wherein the size of the segment is determined by the fraction in which it is identified.
  • This method is applicable to essentially any nucleic acid segment of interest, however the method is particularly amenable to determining the size of nucleic acid segments which are, for example, difficult to size by methods utilizing amplification (e.g. PCR) across the nucleic acid segment.
  • nucleic acid segments which are difficult to size by methods utilizing amplification across the nucleic acid segment include nucleic acid segments which have high content of the bases guanine and cytosine, large nucleic acid segments, and/or segments having large numbers of tandem repeats.
  • the particular nucleic acid segment of interest is a tandem repeat nucleic acid sequence.
  • a length of nucleic acid which is difficult to amplify by PCR is generally greater than 50,000 bases, more typically greater than 100,000 bases, more typically greater than 150,000 bases, or more than 200,000 bases, or even more than 250,000 bases.
  • invention methods are used to determine the size of a particular nucleic acid segment in a sample from an individual, thereby determining if that individual has an abnormality in size of that particular nucleic acid segment, wherein the abnormality is due to a duplication, addition, or deletion in the particular nucleic acid segment.
  • the invention provides a method of detecting a mutation in a tandem repeat segment of a gene in a nucleic acid sample, wherein the mutation is characterized by an increase in the number of repeats compared to the number of repeats in the wild type allele.
  • the method is accomplished separating fragments of a nucleic acid, wherein the fragments are prepared from a nucleic acid-containing sample, wherein the fragments include some which contain the tandem repeat segment and a marker sequence, wherein the separating is into fractions according to size under conditions in which a fragment containing the tandem repeat segment will be located in the fractions according to the number of repeats in the tandem repeat segment; and identifying those fraction(s) containing the segment by detecting the marker sequence.
  • the number of repeats in the tandem repeat segment is determined by the fraction in which it is identified.
  • the number of repeats is compared to the number in the corresponding wild type allele, wherein a number of repeats greater that the number in wild type allele is indicative of a mutation.
  • the above aspect of the invention further includes determining if a mutation is a premutation or a full mutation. This determination is accomplished by comparing the number of repeats in the tandem repeat segment from the nucleic acid sample to the number in the corresponding full mutation allele, wherein a number of repeats greater than the wild type allele but less than the full mutation is indicative of a premutation allele, and a number of repeats greater than or equal to the full mutation is indicative of a full mutation allele. In other embodiments the number of repeats in the tandem repeat region of the nucleic acid sample can be compared to the number of repeats found in each of a wild type allele, a premutation allele, and a full mutation allele.
  • the method includes, fragmenting the nucleic acid in the sample from the individual into fragments, wherein the tandem repeat segment of the FMRl gene is associated with a marker sequence in the fragment, separating fragments of a nucleic acid, wherein the fragments are prepared from a nucleic acid containing sample of the individual, wherein the fragments include some which contain a tandem repeat segment of the FMRl gene and a marker sequence, the separating into fractions according to size under conditions in which a fragment containing the tandem repeat segment having a normal number of repeats will be located in a first fraction; and a fragment containing a tandem repeat segment having a premutation will be located in a second fraction; and a fragment having a tandem repeat region having a full mutation will be located in a third fraction, identifying
  • detecting carriers of genetic mutations characterized by the expansion or reduction of a tandem repeat segment of a gene and diagnosing individuals afflicted with diseases caused by such an expansion.
  • the method involves the detection of wild type alleles, premutation alleles and/or full mutation alleles for a particular gene as described above. A genotype may then be determined based on the allele(s) present in an individual, allowing the designation of normal, carrier, or affected status.
  • a “carrier” is an individual who carries an mutated or altered allele of a gene but is not affected by the disorder or disease associated with mutation. Carriers can pass the mutation to a child or offspring in future generations, who may be affected with the disease or disorder.
  • both males and females may be carriers.
  • the term “carrier” is used interchangeably with “premutation carrier” and refers to males having a premutation allele.
  • the term carrier encompasses females having a premutation allele or a full mutation allele.
  • Such female carriers may also be referred to herein as "premutation carrier” (i.e., having a premuation FMRl allele) or a "full mutation carrier” (i.e., having a full mutation FMRl allele).
  • affected refers to individuals who possess one or more mutated alleles of a particular gene and exhibit the disease or disorder (i.e., phenotype) associated that mutation.
  • disease or disorder i.e., phenotype
  • Fragile X syndrome males having a full mutation FMRl allele are affected, whereas females having a single full mutation allele may be affected or may be a full mutation carrier.
  • male individuals afflicted with Fragile X syndrome can be distinguished from individuals that are carriers (premutation) or from those that are normal.
  • a nucleic acid sample from the individual is fragmented to produce nucleic acid fragments in which the tandem repeat segment of the FMRl gene is associated with a marker sequence in a fragment.
  • the fragments are separated into fractions according to size under conditions in which the fragment(s) containing the tandem repeat segment will be located in the fractions according to the number of repeats in the tandem repeat segment, and identifying those fraction(s) containing the segment by detecting the marker sequence.
  • the fractions are chosen so that the first fraction captures fragments having a number of repeats within the range of repeats for a normal allele; the second fraction captures fragments having a number of repeats within the range of repeats for a premutation allele; and the third fraction captures fragments having a number of repeats within the range of repeats for a full mutation allele.
  • the range of repeats for a normal normal allele of the FMRl gene is less than 55 repeats; the range of repeats for a premutation allele is 55-200; and the range of repeats for a full mutation allele is greater than 200 repeats. These ranges may be +/- 10%.
  • males Since males generally have a single X-chromosome (which is where the FMRl gene resides), only one fraction should be positive for the marker sequence. Therefore, males can be assigned a phenotype, based on the genotype according to the following: if the first fraction is positive for the marker sequence, the individual is normal; if the second fraction is positive for the marker sequence, the individual is a carrier; if only the third fraction is positive for the marker sequence, the individual is affected with fragile X.
  • the method includes assaying nucleic acids from an individual to determine gender; and assaying the nucleic acid to determine the length of the tandem repeat region of the FMRl gene wherein the determining comprises amplifying tandem repeat region, detecting an amplification product, and determining the number of tandem repeats in the amplification product, wherein, in male individuals: the presence of an amplification product having less than 55 tandem repeats indicates the individual is not a carrier, the presence of an amplification product having 55 or more tandem repeats indicates the individual is a carrier, or in the absence of an amplification product, the carrier status is undetermined; and in female individuals: the presence of an amplification product having more than 55 tandem repeats indicates the individual is a carrier; or the presence of a single amplification product having less than 55 tandem repeats, the carrier status is undetermined.
  • the above method further includes, analyzing undetermined individuals to determine carrier status, wherein the analyzing includes, separating fragments of a nucleic acid, wherein the fragments are prepared from a nucleic acid containing sample of the individual, wherein the fragments include some which contain a tandem repeat segment of the FMRl gene and a marker sequence, the separating into fractions according to size under conditions in which a fragment containing the tandem repeat segment having a normal number of repeats will be located in a first fraction; and a fragment containing a tandem repeat segment having a premutation will be located in a second fraction; and a fragment having a tandem repeat region having a full mutation will be located in a third fraction, identifying those fraction(s) containing the segment by detecting the marker sequence, wherein the number of repeats in the tandem repeat segment is determined by the fraction in which it is identified, wherein in male individuals: a positive result in the first fraction indicates the individual is not a carrier, a positive result in the second fraction indicates the individual
  • the assaying of nucleic acids to determine gender includes amplification of a region of the nucleic acid, preferably by PCR.
  • sequences specific to the Y chromosome, such as the SRY locus may be targeted for amplification.
  • amplification only occurs in the presence of a Y chromosome.
  • certain genes which occur on both the X chromosome and the Y chromosome may be detected for gender determination, if the lengths of the corresponding genes are different on each chromosome.
  • amplification results in different sized amplicons having lengths specific to either the X or the Y chromosome.
  • amplification of nucleic acids from males would result in both amplicons, whereas samples from females would have only one amplicon.
  • examples of such genes include DXZl and DYZl and the amelogenin gene.
  • the assaying to determine gender includes amplifying a region of the amelogenin gene which produces different sizes of amplification products from the amelogenin gene on the X chromosome and the amelogenin gene on the Y chromosome, determining the size of the amplification product or products, wherein the presence of one product of a single size indicates the gender is female and the presence of two products of different sizes indicates the gender is male.
  • the assay to determine gender is performed in multiplex with the amplification of the tandem repeat region; preferably in multiplex PCR; preferably one or more internal controls are include in the multiplex reaction.
  • a region of the androgen insensitivity gene is amplified as an internal control.
  • a sample containing genomic DNA is assayed for an expansion in the tandem repeat region of the FMRl gene.
  • genomic samples are subjected to nucleic acid fragmentation and the resulting nucleic acid fragments are separated by size into fractions.
  • a marker sequence upstream or downstream of the tandem repeat region of the FMRl gene, which is associated with the tandem repeat region in the fragmented nucleic acid, is amplified by polymerase chain reaction.
  • the amplification of the marker sequence and detection of the amplicon is done using the TaqMan system.
  • the marker sequence is amplified and using a labeled primer and the resulting labeled amplicon is detected using capillary electrophoresis.
  • the separation of fragments into fractions by size can be modified so that the fractions correspond to either a normal number of tandem repeats or an abnormal number of tandem repeats.
  • the fragmented nucleic acid may be separated by size into two fractions, an upper fraction of larger size fragments and a lower fraction of smaller size fragments.
  • the fractions are designed such that fragments from nucleic acid containing a normal number of tandem repeats will be found in the lower fraction while fragments from nucleic acid containing an abnormally increased number of tandem repeats will be found in the upper fraction.
  • the fragmented DNA may be separated into any number of fractions.
  • the fragmented DNA may be separated into a number of fractions selected from the group consisting of 2-16, preferably 3 fractions, or 4 fractions, or 5 fractions, or 6 fractions, or 8 fractions, or even 16 fractions.
  • the fragmented DNA is separated into lower and upper fractions, wherein the lower fraction corresponds to a tandem repeat region containing less than 55 repeats (normal number of repeats) and an upper fraction containing 55 or more tandem repeats (premutation and full mutation).
  • a normal allele can therefore be distinguished from a premutation or a full mutation.
  • the fragmented DNA is separated into three fractions, wherein a first fraction corresponds to a tandem repeat region of a normal allele (i.e., less than 55 repeats), a second fraction corresponds to a tandem repeat region of a premutation allele (i.e. 55-200 repeats), a third fraction corresponds to a tandem repeat region of a full mutation allele (i.e., greater than 200 repeats).
  • the fragmented DNA is separated into four fractions, wherein a first fraction corresponds to a tandem repeat region of a normal allele, a second fraction corresponds to a tandem repeat region of a small premutation allele, a third fraction corresponds to a tandem repeat region of a large premutation allele, and a fourth fraction corresponds to a tandem repeat region of a full mutation allele.
  • the first fraction corresponds to 0-60 repeats; preferably the second fraction corresponds to 60-200 repeats; preferably the third fraction corresponds to 200-2000 repeats; and preferably the fourth fraction corresponds to 2000+ repeats.
  • the DNA is fragmented with BIpI and MIyI and fractionated such that the first fraction corresponds to 6-62 repeats; preferably the second fraction corresponds to 63-140 repeats; preferably the third fraction corresponds to 141-220 repeats; and preferably the fourth fraction corresponds to 221-2000+ repeats.
  • the DNA is fragmented with AIuI and fractionated such that the first fraction corresponds to 6-68 repeats; preferably the second fraction corresponds to 69-102 repeats; preferably the third fraction corresponds to 102-202 repeats; and preferably the fourth fraction corresponds to 203+ repeats.
  • the DNA is fragmented with Sphl and Bmtl and fractionated such that the first fraction corresponds to 6-62 repeats; preferably the second fraction corresponds to 63-163 repeats; preferably the third fraction corresponds to 164-196 repeats; and preferably the fourth fraction corresponds to 197+ repeats.
  • test samples containing genomic DNA are subjected to nucleic acid fragmentation.
  • the resulting nucleic acid fragments are separated by size into three or more size range fractions, and fragments containing tandem repeat segments in the various fractions are identified by detecting a marker sequence flanking the tandem repeat segment, which is associated with the tandem repeat region in the fragmented nucleic acid.
  • the size of the tandem repeat region detected is then determined by relating the fraction size containing the repeat to the size of a tandem repeat segment present in such nucleic acid fragments. Separation into three or more size range fractions allows a finer estimation of the number of tandem repeats. In the case of fragile X, the extent of the expansion of a premutation or full mutation can be assessed.
  • the method includes a second nucleic acid fragmentation.
  • the second fragmentation occurs after the size separation, which follows the first nucleic acid fragmentation; preferably the second fragmentation is by restriction enzyme digestion.
  • the second fragmentation cleaves the particular nucleic acid segment of interest (e.g., a tandem repeat segment) from a marker sequence flanking the particular nucleic acid segment.
  • the second nucleic acid fragmentation does not cleave within the marker sequence.
  • the marker sequence is detected by amplification of all or a portion of the marker sequence and detection of the amplicon.
  • the marker sequence is amplified by PCR and the amplicon is detected by electrophoresis.
  • a primer used in the PCR amplification reaction comprises a label, thereby labeling the resulting amplicon.
  • the so-labeled amplicon can then be detected by methods such as capillary electrophoresis.
  • the marker sequence is detected using real time PCR methods such as the TaqMan system. In this approach a probe is used to detect the amplified region of the marker sequence.
  • the marker sequence need not be amplified and can be detected directly by hybridization to two differentially labeled oligonucleotide probes.
  • the two probes which hybridize to distinct segments of a marker sequence, such that both probes can bind simultaneously, are contacted with the fragmented nucleic acids under hybridization conditions.
  • the simultaneous detection of differentially labeled probes hybridized to a single nucleic acid fragment in the fractions indicates the presence of a tandem repeat region in a fragment contained in that fraction.
  • the two oligonucleotide probes of this embodiment may be designed to hybridize to segments of a marker sequence upstream or downstream of the tandem repeat.
  • segments of the marker sequence may be adjoining the tandem repeat region or may be a distance upstream or downstream.
  • the segments of the marker sequence are within 500 bases upstream or downstream of the tandem repeat region; in more preferred embodiments the segments of the marker sequence are within 250 bases upstream or downstream of the tandem repeat region; in most preferred embodiments the segments of the marker sequence are within 100 bases upstream or downstream of the tandem repeat region.
  • the probes may be designed to hybridize to the same or to opposite strands of a double-stranded marker sequence.
  • the probes may both hybridize upstream or both downstream of the tandem repeat. Alternatively, one probe may hybridize upstream of the tandem repeats whereas the other probe hybridizes downstream.
  • the probes may hybridize to segments of the marker sequence that are separated by zero bases to several hundred thousand bases provided both segments are located on the same contiguous nucleic acid molecule after the fragmentation step or steps.
  • the probes are separated by less than 1 kb, or preferably less than 500 bases, or less than 300 bases, or less than 200 bases, or less than 100 bases, or less than 50 bases, or less than 20 bases, or less than 10 bases, or less than 5 bases, or 1 base, or 0 bases.
  • the method comprises measuring the size of a tandem repeat segment by a first method, measuring the size of the tandem repeat segment by a second method, and using the information obtained by the first and second methods to determine the size of the tandem repeat region.
  • the first method includes an amplification of the tandem repeat region, preferably the amplification is by PCR.
  • the PCR amplification includes a labeled primer.
  • the amplicon is subjected to electrophoresis, preferably capillary electrophoresis, and the size of the amplicon is determined by comparison to a standard run in parallel.
  • the first method includes Southern blotting.
  • the second method comprises, fragmenting the nucleic acids of the sample, separating the fragmented nucleic acids into fractions according to size, and detecting a marker sequence upstream or downstream of the particular nucleic acid segment of interest, wherein the marker sequence is associated with the particular nucleic acid segment of interest in the fragmented nucleic acid.
  • the size of the particular nucleic acid segment of interest is then determined by relating the fraction size containing the particular nucleic acid segment to the size of the particular nucleic acid segment in the sample of nucleic acids.
  • the first method is used as an initial screen and samples for which the size of the particular nucleic acid segment is unable to be determined by this method are further analyzed by the second method.
  • sizing by one of the above two methods is used to confirm the results of the sizing by the other of the above two methods.
  • sizing by the first method is used for finely determining the size of the particular nucleic acid fragment.
  • primers for amplification of marker sequences flanking the tandem repeat region of the FMRl gene are provided.
  • the primers are selected from the group consisting of SEQ ID NOs:4-9
  • kits for detecting the size of a particular nucleic acid segment in a sample comprising a primer pair for amplifying a marker nucleotide sequence upstream or downstream of the particular nucleic acid segment, and one or more restriction endonucleases for cleaving the nucleic acid sample to generate a fragment of the nucleic acid sample which contains the particular nucleic acid segment and the upstream or downstream marker sequence.
  • the kit further comprises one or more restriction endonucleases for cleaving the particular nucleic acid segment from the marker nucleotide sequence.
  • the kit may further contain one or more controls for verifying proper size separation of fragments; preferably the control consists of one or more primer pairs that are used to amplify one or more control fragments from the size-separated nucleic acid sample.
  • the kit may further contain one or more controls for verifying the completion of the one or more enzyme digests; preferably the control consists of on or more primer pairs designed to amplify a control fragment that includes a recognition site for the enzyme used.
  • the kit may further contain any necessary buffers or other reagents.
  • the particular nucleic acid segment contains a tandem repeat segment.
  • the kit is for the detection of the tandem repeat segment of the FMRl gene; preferably the enzyme for generating fragments of the nucleic acid sample is AIuI; preferably the enzyme for cleaving the marker sequence from the tandem repeat is BstNI; preferably the kit contains one or more control primers pairs to determine if enzyme digestions are completed; preferably the kit contains one or more primer pairs to detect the presence of one or more control fragments in the size-separated nucleic acid sample.
  • the kit is for the detection of the tandem repeat segment of the FMRl gene; preferably the enzymes for generating fragments of the nucleic acid sample are BIpI and MIyI; preferably the enzyme for cleaving the marker sequence from the tandem repeat is Bmtl; preferably the kit contains one or more control primers pairs to determine if enzyme digestions are completed; preferably the kit contains one or more primer pairs to detect the presence of one or more control fragments in the size-separated nucleic acid sample.
  • the kit is for the detection of the tandem repeat segment of the FMRl gene; preferably the enzymes for generating fragments of the nucleic acid sample are Sphl and Bmtl; preferably the enzyme for cleaving the marker sequence from the tandem repeat is BstNI; preferably the kit contains one or more control primers pairs to determine if enzyme digestions are completed; preferably the kit contains one or more primer pairs to detect the presence of one or more control fragments in the size-separated nucleic acid sample.
  • segment refers to a piece of contiguous nucleic acid.
  • Particular nucleic acid segment of interest refers to a specific "segment” or piece of nucleic acid having a known sequence, preferred segments are those segments that are difficult to amplify by PCR. Examples include nucleic acid segments having high content of the bases guanine and cytidine, large segments of nucleic acid or segments having large numbers of tandem repeats, generally more than 100 tri-nucleotide repeats.
  • the segment comprises a deletion, a duplication or an insertion.
  • the particular nucleic acid segment of interest comprises a tandem repeat region.
  • Nucleic acid segments which have high content of the bases guanine and cytosi ⁇ e or “GC-rich” refer to those nucleic acid segments of a genome are more than the average for that genome. Generally, GC-rich is more than 40% guanine and cytosine bases, or more than 50%, or more than 60%, or more than 75%.
  • Size as used in reference to a particular nucleic acid segment of interest refers to quantity or amount that describes the magnitude of that segment and can be represented by, for example, molecular weight, number of base pairs, or number of copies of a tandem repeat.
  • “Fragment” as used herein refers to a portion of nucleic acid resulting from a process in which longer lengths of nucleic acid are broken up into shorter lengths of nucleic acid.
  • Nucleic acids may be broken up or fragmented by chemical or biochemical means, preferably nucleic acids are fragmented in a manner that is reproducible, preferably nucleic acids are fragmented by one or more restriction endonucleases.
  • nucleic acids are fragmented so that the particular nucleic acid segment of interest and its associated marker sequence are located on the same fragment. The length of a fragment containing the nucleic acid segment of interest will depend on the length of the nucleic acid segment of interest as well as the restriction enzyme chosen to fragment the DNA.
  • the length of the of the fragment includes the nucleic acid segment of interest plus the region upstream of the segment to the 5' restriction enzyme recognition site (i.e., the 5' end of the fragment) and the region downstream of the segment region to the 3' restriction enzyme recognition site (i.e., the 3' end of the fragment).
  • Fractionation refers to a process whereby a single mixture of individual components is processed so that at least some of the individual components in the mixture become separated from each other.
  • chromatography is a fractionation method that separates a mixture of components based on some physical/chemical principle. The components may be separated in a gel or on a membrane so that the individual components may be separately identified. The individual components of a mixture may be fractionated by separating the mixture into the different components which are captured in separate aliquots of liquid (i.e. fractions).
  • fraction in the context of the invention refers to a collection of fragments having a certain size or range of sizes that differs from the size or range of sizes of the starting non-fractionated mixture of fragments.
  • Identifying those fractions containing the segment means that the fraction of size-separated fragments that contains the segment of interest, is determined by the detection of a marker sequence associated with that segment on a fragment of nucleic acid.
  • tandem repeat region refers to a region of DNA that contains a multiple copies of a short sequence of DNA.
  • tandem repeat sequences or “tandem repeats” or simply “repeats” are used interchangeably herein and refers to the short sequence of DNA that is repeated in the tandem repeat region.
  • tandem repeats can lie adjacent to each other in the same orientation (i.e., direct tandem repeats) or in the opposite direction to each other (i.e., inverted tandem repeats).
  • the repeated sequences may be di-, tri-, tetra-, or more nucleotides in length. Expansion the number of copies of the tandem repeat sequences within the coding or noncoding regions of some human genes is associated with repeat expansion disease.
  • sample refers to any liquid or solid material containing genomic DNA.
  • a test sample is obtained from a biological source (i.e., a "biological sample”), such as cells in culture or a tissue sample from an animal, most preferably, a human.
  • biological sample such as cells in culture or a tissue sample from an animal, most preferably, a human.
  • sample tissues include, but are not limited to, blood, bone marrow, body fluids, cerebrospinal fluid, plasma, serum, or tissue (e.g. biopsy material).
  • nucleic acid refers broadly to segments of a chromosome, segments or portions of DNA, cDNA, and/or RNA. Nucleic acid may be derived or obtained from an originally isolated nucleic acid sample from any source (e.g., isolated from, purified from, amplified from, cloned from, reverse transcribed from sample DNA or RNA).
  • Target nucleic acid refers to segments of a chromosome, a complete gene with or without intergenic sequence, segments or portions a gene with our without intergenic sequence, or sequence of nucleic acids to which probes or primers are designed.
  • Target nucleic acids may include wild type sequences, nucleic acid sequences containing mutations, deletions or duplications, tandem repeat regions, a gene of interest, a region of a gene of interest or any upstream or downstream region thereof.
  • Target nucleic acids may represent alternative sequences or alleles of a particular gene.
  • Target nucleic acids may be derived from genomic DNA, cDNA, or RNA.
  • target nucleic acid may be native DNA or a PCR amplified product.
  • the term "marker sequence” as used herein refers to a segment of nucleic acid which is associated with a nucleic acid segment of interest so that detection of the marker sequence in a sample is indicative of the presence of the nucleic acid segment of interest.
  • the marker sequence for detecting a particular nucleic acid segment of interest should be selected on the basis that the marker is uniquely or substantially associated with the nucleic acid segment of interest in fragments present in a particular size fraction.
  • Marker sequences can be detected by nucleic acid amplification using primer based hybridization methods. Marker sequences can also be detected by hybridization to one or more nucleic acid probe(s).
  • a fragment containing a tandem repeat segment is identified in size fractioned nucleic acid fragments by detecting a marker sequence that is either upstream or downstream of the tandem repeat.
  • Marker sequences may be within the nucleic acid segment of interest or may be flanking the nucleic acid of interest.
  • the term "flanking” as used herein refers to a region of DNA either adjoining or a distance from 1 a region of interest.
  • the flanking region may be "upstream” (i.e., 5 1 ) or "downstream” (i.e., 3') of the region of interest.
  • the marker sequence may be adjoining the tandem repeat region or may be located a distance upstream or downstream.
  • the marker sequence is within 500 bases upstream or downstream of the tandem repeat region; in more preferred embodiments the marker sequence is within 250 bases upstream or downstream of the tandem repeat region; in most preferred embodiments the marker sequence is within 100 bases upstream or downstream of the tandem repeat region.
  • the flanking region may be coding or non-coding sequence and may be the same or a different gene as the gene comprising the region of interest. In preferred embodiments, the marker sequence is flanking the nucleic acid segment of interest.
  • the size of the particular nucleic acid segment of interest is determined by relating the fraction size containing the particular nucleic acid segment to the size of the particular nucleic acid segment in the sample of nucleic acids.
  • This step of relating the fraction size containing the particular nucleic acid segment to the size of the particular nucleic acid segment in the sample of nucleic acids can be accomplished using a look-up table as disclosed herein. In other embodiments this step is accomplished with a computer program.
  • the phrase "relating the fraction size containing the particular nucleic acid segment of interest to the size of the particular nucleic acid segment of interest that would be present in that fraction under the conditions which generated the fragment" as used herein refers to the means by which the size of the particular nucleic acid segment of interest is determined from its location in a particular fraction size.
  • a look-up table is established for each combination of particular nucleic acid segment of interest and fragmentation approach (e.g. particular restriction endonuclease(s) used).
  • the look-up table links each fraction (containing a range of fragment sizes) to the length of the segment of interest that is present in such fragments. In any particular fraction, there are fragments that contain the segment of interest and other sequence.
  • any fraction one can calculate from sequence data the number of bases in the fragments that represent the segment of interest. This correlation may be established experimentally or by using known DNA sequence for the fragments generated. For any particular unknown sample, the size of a segment of interest can be determined by relating the fragment size that contains the segment of interest to the appropriate look-up table reflecting the same conditions for fragment generation. By using this process one relates the fraction size containing the particular nucleic acid segment of interest to the size of the particular nucleic acid segment of interest that would be present in the fraction under the conditions which generated the fragment. It is not essential that one prepare a look-up table to perform the method. For example, one could generate a computer program to perform the relating step.
  • Genomic nucleic acid refers to some or all of the DNA from the nucleus of a cell. Genomic DNA may be intact or fragmented (e.g., digested with restriction endonucleases by methods known in the art). In some embodiments, genomic DNA may include sequence from all or a portion of a single gene or from multiple genes, sequence from one or more chromosomes, or sequence from all chromosomes of a cell. In contrast, the term “total genomic nucleic acid” is used herein to refer to the full complement of DNA contained in the genome of a cell.
  • genomic nucleic acid includes gene coding regions, introns, 5' and 3' untranslated regions, 5' and 3 1 flanking DNA and structural segments such as telomeric and centromeric DNA, replication origins, and intergenic DNA.
  • Genomic nucleic acid may be obtained from the nucleus of a cell, or recombinantly produced. Genomic DNA also may be transcribed from DNA or RNA isolated directly from a cell nucleus. PCR amplification also may be used. Methods of purifying DNA and/or RNA from a variety of samples are well-known in the art.
  • allele and "allelic variant” are used interchangeably herein.
  • An allele is any one of a number of alternative forms or sequences of the same gene occupying a given locus or position on a chromosome. A single allele for each locus is inherited separately from each parent, resulting in two alleles for each gene. An individual having two copies of the same allele of a particular gene is homozygous at that locus whereas an individual having two different alleles of a particular gene is heterozygous.
  • Repeat expansion disease refers to any of about two dozen human diseases displaying Mendelian inheritance patterns shown to be caused by expansions of intrinsically polymorphic tandem repeats, mainly involving different trinucleotide motifs but also longer repetitive sequences up to 12-mers (Table 1).
  • a characteristic of an allele containing an expanded tandem repeat is an excessive instability in successive generations (dynamic mutations). Furthermore, these alleles can differ in lengths among cell populations of the same organism (mosaicism).
  • One type of repeat expansion disease is the trinucleotide repeat disorders (e.g., fragile X syndrome, myotonic dystrophy 1, etc.), the most abundant form of repeat expansion diseases. These diseases exhibit intergenerational repeat instability with a tendency towards further expansion of the tandem repeat. Increased repeat lengths in successive generations can lead to an earlier age of onset in affected individuals and/or an accentuation of clinical symptoms.
  • the methods of measuring tandem repeat length as described herein can be applied to measures tandem repeat length for any of the diseases/genes in Table 3.
  • oligonucleotide refers to a short polymer composed of deoxyribonucleotides, ribonucleotides or any combination thereof. Oligonucleotides of the invention are generally between about 10 and about 100 nucleotides in length. Oligonucleotides are preferably 15 to 70 nucleotides long, with 20 to 26 nucleotides being the most common. The single letter code for nucleotides is as described in the U.S. Patent Office Manual of Patent Examining Procedure, section 2422, table 1.
  • nucleotide designation "R” means guanine or adenine
  • Y means thymine (uracil if RNA) or cytosine
  • M means adenine or cytosine.
  • An oligonucleotide may be used as a primer or as a probe.
  • oligonucleotides does not require absolute purity. Instead, it represents an indication that the sequence is relatively more pure than in the natural environment. Such oligonucleotides may be obtained by a number of methods including, for example, laboratory synthesis, restriction enzyme digestion or PCR. A "substantially purified” oligonucleotide is preferably greater than 50% pure, more preferably at least 75% pure, and most preferably at least 95% pure.
  • an oligonucleotide is "specific" for a nucleic acid if the oligonucleotide has at least 50% sequence identity with a portion of the nucleic acid when the oligonucleotide and the nucleic acid are aligned.
  • An oligonucleotide that is specific for a nucleic acid is one that, under the appropriate hybridization or washing conditions, is capable of hybridizing to the target of interest and not substantially hybridizing to nucleic acids which are not of interest. Higher levels of sequence identity are preferred and include at least 75%, at least 80%, at least 85%, at least 90%, at least 95% and more preferably at least 98% sequence identity.
  • hybridize or “specifically hybridize” refers to a process where two complementary nucleic acid strands anneal to each other under appropriately stringent conditions. Hybridizations are typically and preferably conducted with probe- length nucleic acid molecules, preferably 20-100 nucleotides in length. Nucleic acid hybridization techniques are well known in the art. See, e.g., Sambrook, et al., 1989, Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Press, Plainview, N. Y.
  • substantially complementary means that two sequences hybridize under stringent hybridization conditions. The skilled artisan will understand that substantially complementary sequences need not hybridize along their entire length. In particular, substantially complementary sequences comprise a contiguous sequence of bases that do not hybridize to a target or marker sequence, positioned 3' or 5 1 to a contiguous sequence of bases that hybridize under stringent hybridization conditions to a target or marker sequence.
  • complement means the complementary sequence to a nucleic acid according to standard Watson/Crick pairing rules. A complement sequence can also be a sequence of RNA complementary to the DNA sequence or its complement sequence, and can also be a cDNA.
  • coding sequence means a sequence of a nucleic acid or its complement, or a part thereof, that can be transcribed and/or translated to produce the mRNA for and/or the polypeptide or a fragment thereof. Coding sequences include exons in a genomic DNA or immature primary RNA transcripts, which are joined together by the cell's biochemical machinery to provide a mature mRNA. The anti-sense strand is the complement of such a nucleic acid, and the encoding sequence can be deduced therefrom.
  • non-coding sequence means a sequence of a nucleic acid or its complement, or a part thereof, that is not transcribed into amino acid in vivo, or where tRNA does not interact to place or attempt to place an amino acid.
  • Non-coding sequences include both intron sequences in genomic DNA or immature primary RNA transcripts, and gene-associated sequences such as promoters, enhancers, silencers, etc.
  • amplification means one or more methods known in the art for copying a target nucleic acid, thereby increasing the number of copies of a selected nucleic acid sequence. Amplification may be exponential or linear. A target nucleic acid may be either DNA or RNA. The sequences amplified in this manner form an "amplicon.” While the exemplary methods described hereinafter relate to amplification using the polymerase chain reaction (“PCR"), numerous other methods are known in the art for amplification of nucleic acids (e.g., isothermal methods, rolling circle methods, etc.). The skilled artisan will understand that these other methods may be used either in place of, or together with, PCR methods.
  • PCR polymerase chain reaction
  • a "primer” for amplification is an oligonucleotide that specifically anneals to a target or marker nucleotide sequence.
  • the 3' nucleotide of the primer should be identical to the target or marker sequence at a corresponding nucleotide position for optimal amplification.
  • Sense strand means the strand of double-stranded DNA (dsDNA) that includes at least a portion of a coding sequence of a functional protein.
  • Anti-sense strand means the strand of dsDNA that is the reverse complement of the sense strand.
  • a "forward primer” is a primer that anneals to the anti-sense strand of dsDNA.
  • a “reverse primer” anneals to the sense-strand of dsDNA.
  • sequences that have "high sequence identity” have identical nucleotides at least at about 50% of aligned nucleotide positions, preferably at least at about 75% of aligned nucleotide positions, more preferably at least at about 90% of aligned nucleotide positions, and most preferably at least at about 95% of aligned nucleotide positions.
  • TaqMan PCR detection system refers to a method for real time PCR.
  • a TaqMan probe which hybridizes to the nucleic acid segment amplified is included in the PCR reaction mix.
  • the TaqMan probe comprises a donor and a quencher fluorophore on either end of the probe and in close enough proximity to each other so that the fluorescence of the donor is taken up by the quencher.
  • the 5'-exonuclease activity of the Taq polymerase cleaves the probe thereby allowing the donor fluorophore to emit fluorescence which can be detected.
  • Figure 1 A schematic showing one embodiment of the method of detecting the tandem repeats. The presence of FMRl fragments with a particular length of tandem repeat is shown schematically at the bottom for each size fraction indicated (i.e. small, medium and large). The designation of + or — is shown below PCR to indicate whether PCR amplification of the flanking marker sequence occurs when the particular fragment is present in the fraction.
  • Figure 3 Exemplary sequence (SEQ ID NO:2) of the downstream 3' untranslated region of the DM-I gene showing the CTG tandem repeat region (single underlining), preferred locations for hybridizing PCR primers (shaded regions), and a preferred location for a hybridizing probe (double-underlining).
  • Figure 4 Exemplary sequence (SEQ ID NO:3) of the first intronic region of the FRDA gene showing the CAA tandem repeat region (single underlining), preferred locations for hybridizing PCR primers (shaded regions), and a preferred location for a hybridizing probe (double-underlining).
  • FIG. 5 Restriction enzyme map of a region of the FMRl gene.
  • FXCEF3, FXCER3, FMR1F4, FMR1R4, FXCEF2, and FXCER2 show the location of hybridization of preferred oligonucleotide primers.
  • FXCEF3/FXCER3, FMR1F4/FMR1R4, and FXCEF2/FXCER2 are preferred primer pairs for amplification of marker sequences when the nucleic acid is fragmented with Sphl/Bmtl, AIuI, and Blpl/Mlyl, respectively.
  • the particular nucleic acid segment of interest is a tandem repeat and the method is used to determine information about the size of such tandem repeat. This information may be used to determine if an individual carries a genetic mutation characterized by an increase (i.e., expansion) or a decrease (i.e., reduction) in the number of tandem repeats associated with a particular gene.
  • a method of measuring the size of a tandem repeat segment in a sample of nucleic acids comprising identifying the tandem repeat in fractions of size-separated nucleic acid fragments by detecting a marker sequence flanking the tandem repeat segment and then relating the fraction size containing the repeat to the size of a tandem repeat segment present in such nucleic acid fragments.
  • Figure 1 shows one embodiment of this method in schematic form. As will be discussed, a variation of this method is to perform a second fragmentation after the size separation and prior to the "analysis" step.
  • the methods of the present invention can be used to detect mutations characterized by an expansion or reduction of tandem repeat region of a gene in the genomic DNA of a test sample. Therefore, the method may be performed using any biological sample containing genomic DNA. Examples include tissue samples or any cell-containing bodily fluid. Blood is the preferred biological sample. Biological samples may be obtained by standard procedures and may be used immediately or stored, under conditions appropriate for the type of biological sample, for later use.
  • test samples are well known to those of skill in the art and include, but are not limited to, aspirations, tissue sections, drawing of blood or other fluids, surgical or needle biopsies, and the like.
  • the test sample may be obtained from individual or patient.
  • the test sample may contain cells, tissues or fluid obtained from a patient suspected being afflicted with or a carrier for a disorder caused by an expansion of tandem repeat sequences.
  • the test sample may be a cell-containing liquid or a tissue.
  • Samples may include, but are not limited to, amniotic fluid, biopsies, blood, blood cells, bone marrow, fine needle biopsy samples, peritoneal fluid, plasma, pleural fluid, saliva, semen, serum, tissue or tissue homogenates, frozen or paraffin sections of tissue. Samples may also be processed, such as sectioning of tissues, fractionation, purification, or cellular organelle separation.
  • the invention methods can be used to perform prenatal diagnosis using any type of embryonic or fetal cell or nucleic acid containing body fluid.
  • Fetal cells can be obtained through the pregnant female, or from a sample of an embryo.
  • fetal cells are present in amniotic fluid obtained by amniocentesis, chorionic villi aspirated by syringe, percutaneous umbilical blood, a fetal skin biopsy, a blastomere from a four-cell to eight-cell stage embryo (pre-implantation), or a trophectoderm sample from a blastocyst (pre-implantation or by uterine lavage).
  • genomic DNA may be used.
  • Genomic DNA may be isolated from cells or tissues using standard methods, see, e.g., Sambrook, et al., 1989, Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Press, Plainview, NY.
  • Genomic DNA may be fragmented by various methods well-known in the art. Preferably, a restriction endonuclease digestion is used to fragment the DNA.
  • restriction endonuclease or “restriction enzyme” as used herein refers to an enzyme that cuts double-stranded DNA at a specific sequence (i.e., the recognition sequence or site).
  • the frequency with which a given restriction endonuclease cuts DNA depends on the length of the recognition site of the enzyme. For example, some enzymes recognize sites that are four nucleotides long (referred to as “four cutters"). In general one can estimate how frequently an enzyme should cut a piece of DNA based the length of the recognition site and the assumption that the probability of any one nucleotide occurring at a given location is 1 A. In the case of a "four cutter” a specific sequence of four nucleotides must be present.
  • restriction endonuclease fragmentation method a restriction endonuclease is combined with a sample of genomic DNA and buffer appropriate for optimal activity of the endonuclease.
  • 1 unit of endonuclease will digest l ⁇ g of DNA in 1 hour at 37°C.
  • this fragmentation method can be modified by using a restriction enzyme that cuts at a particular frequency or a particular site, or by using multiple restriction enzymes.
  • the choice of enzyme or enzyme combinations is chosen to suit the gene of interest in an assay. In general, one would choose an enzyme or enzyme combination to generate a fragment containing the entire tandem repeat region and the upstream or downstream marker sequence.
  • Enzymes for fragmentation can be chosen by using a restriction enzyme map of the region surrounding the tandem repeat. Such maps can be readily generated by software programs well-known to those of skill in the art.
  • an enzyme or a combination of enzymes to obtain an appropriately sized fragment to distinguish a normal length tandem repeat region from an abnormal length tandem repeat region.
  • determining an appropriate size for a fragment one would consider the difference in the range of lengths of a normal tandem repeat region as compared to that for an abnormal tandem repeat region. For example, if the difference between a normal tandem repeat region and an abnormal tandem repeat region is small, one would choose a shorter length fragment, whereas if the difference is large one would choose a longer length fragment.
  • AIuI is used to fragment the nucleic acids.
  • AIuI is a restriction enzyme that recognizes a 4-nucleotide sequence of double-stranded DNA (i.e., - AGCT-).
  • isoschizomers i.e., restriction enzymes with the same recognition sequence and cut site
  • examples of AIuI isoschizomers include, but are not limited to, BsaLI, Marl, MItI, and Oxal.
  • a neoschizomer i.e., a restriction enzyme with the same recognition sequence as another enzyme but with a different cut site
  • AIuI could also be substituted for AIuI.
  • AIuI which recognizes a 4-nucleotide sequence, cuts DNA at approximately every 256 bases.
  • Other enzymes with different 4-nucleotide recognition sequences e.g., Dpnl, Rsal, Mbol, and NIaI
  • Dpnl, Rsal, Mbol, and NIaI would be expected to cut at a similar frequency to AIuI and would therefore produce fragments of a size similar to those of AIuI.
  • BIpI and Mlyl are used in combination to fragment the nucleic acids.
  • Sphl and Bmtl are used in combination to fragment the nucleic acids.
  • Size Separation of DNA Fragments may be accomplished by various methods known to those of skill in the art. For example, various methods of gel electrophoresis or column chromatography (e.g., size-exclusion high performance liquid chromatography (SEC-HPLC) and denaturing HPLC (DHPLC)) may be used.
  • SEC-HPLC size-exclusion high performance liquid chromatography
  • DPLC denaturing HPLC
  • gel electrophoresis a gel matrix, to which an electric field is applied, is utilized to separate nucleic acid molecules or fragments thereof based on size. In general, smaller nucleic acid fragments will move faster through the gel matrix than larger fragments.
  • Preferred gel matrices include agarose and polyacrylamide.
  • capillary electrophoresis is used to separate the fragmented nucleic acids.
  • Capillary electrophoresis is a separation method based on the differential electrophoretic migration rate of sample components in a capillary when a voltage is applied. Separated fragments or molecules are generally detected "on-column" using UV spectrometric or fluorescence analysis through a window in the capillary.
  • one or more standards i.e., a segment of nucleic acid having known length
  • the elution times of the standards are used to determine the length of time over which a fraction will be collected in order to achieve a desired size range for that fraction.
  • the size ranges for the fractions used in an assay may be chosen based on the length in base pairs of commercially-available standards.
  • a number of standards are available containing mixtures of lengths of nucleic acids.
  • a standard containing nucleic acid fragments having the following lengths is used: 200 bp, 300 bp, 400 bp, 500 bp, 600 bp, 700 bp, 800 bp, 900 bp, and 1000 bp.
  • fractions would then be chosen based on one or more of the sizes present in the standard.
  • fractions 1) less than or equal to 300 bp, 2) 301-500 bp, and 3) greater than or equal to 501.
  • fractions can be chosen based on a desired number of tandem repeats for each fraction. In this case, standards representing the upper and lower limits in size of each fraction can be synthesized if not commercially available.
  • the fragments are separated into the lowest number of fractions in order to distinguish normal from abnormal length tandem repeat regions. In particular embodiments, the fragments may be separated into two fractions, one corresponding to a normal tandem repeat region and one corresponding to an abnormal tandem repeat region.
  • the fragments are separated into a larger number of fractions in order to determine the size of the tandem repeat region.
  • more fractions will allow a more precise determination of the length of the tandem repeat region.
  • the number of fractions may be chosen in order to achieve a desired level of precision in determining the length of the tandem repeat region.
  • AIuI fragmented DNA is separated into two fractions corresponding to sizes of approximately 211 - 358 bp (lower fraction) and 359 bp - 9 kb (upper fraction). Fragments from samples containing a normal FMRl gene (i.e., less than 55 tandem repeats), will separate into the lower fraction, whereas fragments from FMRl genes containing premutations (i.e., 55-200 tandem repeats) or full mutations (i.e., greater than 200 tandem repeats) will separate into the upper fraction.
  • AIuI fragmented DNA is separated into two fractions corresponding to sizes of approximately 21 1 - 400 bp (lower fraction) and 401 bp - 9 kb (upper fraction). Fragments from samples containing a normal FMRl gene (i.e., less than 55 tandem repeats) and premutations having 56-69 tandem repeats, will separate into the lower fraction, whereas fragments from FMRl genes containing premutations having 70-200 tandem repeats or full mutations (i.e., greater than 200 tandem repeats) will separate into the upper fraction.
  • AIuI fragmented DNA is separated into four fractions corresponding to sizes of approximately less than or equal to 400 bp (first/lowest fraction), 401-500 bp (second fraction), 501-800 bp (third fraction) and 800 bp - 9 kb (fourth/highest fraction).
  • Fragments from samples containing a normal FMRl gene (i.e., less than 55 tandem repeats) and those containing premutations having 56-68 tandem repeats will separate into the first/lowest fraction, whereas fragments from FMRl genes containing small premutations (i.e., 69-102 tandem repeats) will separate into the second fraction, whereas fragments from FMRl genes containing large premutations (i.e., 103-200 tandem repeats) and full mutations having 201-202 tandem repeats will separate into the third fraction, and whereas full mutations having greater than 202 tandem repeats will separate into the fourth/highest fraction.
  • DNA fragmented with a combination of Sphl and Bmtl is separated into four fractions corresponding to sizes of approximately less than or equal to 500 bp (first/lowest fraction), 501-800 bp (second fraction), 801-900 bp (third fraction) and 901 bp — 9 kb (fourth/highest fraction).
  • Fragments from samples containing a normal FMRl gene (i.e., less than 55 tandem repeats) and those containing premutations having 56-62 tandem repeats will separate into the first/lowest fraction, whereas fragments from FMRl genes containing small premutations (i.e., 63-163 tandem repeats) will separate into the second fraction, whereas fragments from FMRl genes containing large premutations (i.e., 164-196 tandem repeats) will separate into the third fraction, and whereas large premutations having 197-200 tandem repeats and full mutations (i.e., greater than 200 tandem repeats) will separate into- the fourth/highest fraction.
  • a normal FMRl gene i.e., less than 55 tandem repeats
  • fragments from FMRl genes containing small premutations i.e., 63-163 tandem repeats
  • fragments from FMRl genes containing large premutations i.e., 164-196 tandem repeats
  • DNA fragmented with a combination of BIpI an dMlyl is separated into four fractions corresponding to sizes of approximately less than or equal to 603 bp (first/lowest fraction), 604-840 bp (second fraction), 841-1078 bp (third fraction) and 1079 bp — 9 kb (fourth/highest fraction).
  • fragmented DNA is separated into a multiplicity of fractions, according to size. Automated fraction collection is accomplished using, for example, a preset fraction time window, beginning at approximately 200 bp and ending at 9 kb. This method allows for a finer estimation of fragment size and thereby, an estimation of the number of repeats.
  • the methods of the invention include a second nucleic acid fragmentation following the size separation of the first nucleic acid fragmentation.
  • a restriction endonuclease digestion is used to further fragment the DNA.
  • the second fragmentation separates the tandem repeat segment from an associated marker sequence.
  • restriction enzyme BsaWI, Hpyl88I, Hphl or BstNI is used for the second nucleic acid fragmentation when the marker sequence is upstream of the tandem repeat segment.
  • restriction enzyme SmII, Bbvl, or Bmtl is used for the second nucleic acid fragmentation when the marker sequence is downstream of the tandem repeat segment.
  • isoschizomers and neoschizomers of the listed restriction enzymes could also be used.
  • One of skill in the art would be able to identify a suitable restriction enzyme for the second nucleic acid fragmentation by analyzing factors which include, but are not limited to, the location of the marker, the sequence of the marker and the sequence between the marker sequence and the tandem repeat segment.
  • Size-separated DNA may be amplified by various methods known to the skilled artisan.
  • Amplification methods suitable for use with the present methods include, for example, polymerase chain reaction (PCR), ligase chain reaction (LCR), transcription-based amplification system (TAS), nucleic acid sequence based amplification (NASBA) reaction, self-sustained sequence replication (3SR), strand displacement amplification (SDA) reaction, boomerang DNA amplification (BDA), Q-beta replication, or isothermal nucleic acid sequence based amplification.
  • PCR polymerase chain reaction
  • LCR transcription-based amplification system
  • TAS transcription-based amplification system
  • NASBA nucleic acid sequence based amplification
  • SDA self-sustained sequence replication
  • BDA boomerang DNA amplification
  • Q-beta replication or isothermal nucleic acid sequence based amplification.
  • PCR is a technique for making many copies of a specific template DNA sequence.
  • the reaction consists of multiple amplification cycles and is initiated using a pair of primer sequences that hybridize to the 5' and 3' ends of the sequence to be copied.
  • the amplification cycle includes an initial denaturation, and up to 50 cycles of annealing, strand elongation and strand separation (denaturation).
  • the DNA sequence between the primers is copied.
  • Primers can bind to the copied DNA as well as the original template sequence, so the total number of copies increases exponentially with time.
  • PCR can be performed as according to Whelan, et al, Journal of Clinical Microbiology, 3_3(3):556- 561(1995).
  • a PCR reaction mixture includes two specific primers, dNTPs, approximately 0.25 U of Taq polymerase, and Ix PCR Buffer. For every 25 ⁇ l PCR reaction, 2 ⁇ l sample (e.g., isolated DNA from target organism) is added and amplified using a thermal cycler.
  • LCR is a method of DNA amplification similar to PCR, except that it uses four primers instead of two and uses the enzyme ligase to ligate or join two segments of DNA.
  • LCR can be performed as according to Moore et al. , Journal of Clinical Microbiology 36(4 ⁇ 1028-103 I (1998). Briefly, an LCR reaction mixture contains two pair of primers, dNTP, DNA ligase and DNA polymerase representing about 90 ⁇ l, to which is added 100 ⁇ l of isolated nucleic acid from the target organism. Amplification is performed in a thermal cycler (e.g., LCx of Abbott Labs, North Chicago, IL).
  • a thermal cycler e.g., LCx of Abbott Labs, North Chicago, IL.
  • TAS is a system of nucleic acid amplification in which each cycle is comprised of a cDNA synthesis step and an RNA transcription step.
  • a sequence recognized by a DNA-dependent RNA polymerase i.e., a polymerase-binding sequence or PBS
  • PBS polymerase-binding sequence
  • an RNA polymerase is used to synthesize multiple copies of RNA from the cDNA template.
  • Amplification using TAS requires only a few cycles because DNA-dependent RNA transcription can result in 10-1000 copies for each copy of cDNA template.
  • TAS can be performed according to Kwoh et al., PNAS 86: 1173-7 (1989). Briefly, extracted RNA is combined with TAS amplification buffer and bovine serum albumin, dNTPs, NTPs, and two oligonucleotide primers, one of which contains a PBS. The sample is heated to denature the RNA template and cooled to the primer annealing temperature. Reverse transcriptase (RT) is added the sample incubated at the appropriate temperature to allow cDNA elongation. Subsequently T7 RNA polymerase is added and the sample is incubated at 37°C for approximately 25 minutes for the synthesis of RNA. The above steps are then repeated.
  • RT Reverse transcriptase
  • both RT and RNA polymerase are added following a 1 minute 100 0 C denaturation followed by an RNA elongation of approximately 30 minutes at 37°C.
  • TAS can be also be performed on solid phase as according to Wylie et al, Journal of Clinical Microbiology, 36(12):3488-3491 (1998).
  • nucleic acid targets are captured with magnetic beads containing specific capture primers.
  • the beads with captured targets are washed and pelleted before adding amplification reagents which contains amplification primers, dNTP, NTP, 2500 U of reverse transcriptase and 2500 U of T7 RNA polymerase.
  • a 100 ⁇ l TMA reaction mixture is placed in a tube, 200 ⁇ l oil reagent is added and amplification is accomplished by incubation at 42 0 C in a waterbath for one hour.
  • NASBA is a transcription-based amplification method which amplifies RNA from either an RNA or DNA target.
  • NASBA is a method used for the continuous amplification of nucleic acids in a single mixture at one temperature.
  • AMV avian myeloblastosis virus
  • RNase H RNase H
  • T7 RNA polymerase T7 RNA polymerase
  • an NASBA reaction mixture contains two specific primers, dNTP, NTP, 6.4 U of AMV reverse transcriptase, 0.08 U of Escherichia coli Rnase H 5 and 32 U of T7 RNA polymerase.
  • the amplification is carried out for 120 min at 41 0 C in a total volume of 20 ⁇ l.
  • SDA is an isothermal nucleic acid amplification method.
  • a primer containing a restriction site is annealed to the template.
  • Amplification primers are then annealed to 5' adjacent sequences (forming a nick) and amplification is started at a fixed temperature.
  • Newly synthesized DNA strands are nicked by a restriction enzyme and the polymerase amplification begins again, displacing the newly synthesized strands.
  • SDA can be performed as according to Walker, et al, PNAS, 89:392-6 (1992).
  • an SDA reaction mixture contains four SDA primers, dGTP, dCTP, TTP, dATP, 150 U of Hinc II, and 5 U of exonuclease-deficient of the large fragment of E. coli DNA polymerase I (exo " Kl enow polymerase).
  • the sample mixture is heated 95°C for 4 minutes to denature target DNA prior to addition of the enzymes.
  • amplification is carried out for 120 min. at 37°C in a total volume of 50 ⁇ l. Then, the reaction is terminated by heating for 2 minutes at 95°C.
  • BDA Boomerang DNA amplification
  • This method involves an endonuclease digestion of a sample DNA, producing discrete DNA fragments with sticky ends, ligating the fragments to "adapter" polynucleotides (comprised of a ligatable end and first and second self- complementary sequences separated by a spacer sequence) thereby forming ligated duplexes.
  • the ligated duplexes are denatured to form templates to which an oligonucleotide primer anneals at a specific sequence within the target or marker sequence of interest.
  • the primer is extended with a DNA polymerase to form duplex products followed by denaturation of the duplex products. Subsequent multiple cycles of annealing, extending, and denaturing are performed to achieve the desired degree of amplification (U.S. Patent No. 5,470,724).
  • the Q-beta replication system uses RNA as a template.
  • Q-beta replicase synthesizes the single-stranded RNA genome of the coliphage QjS. Cleaving the RNA and ligating in a nucleic acid of interest allows the replication of that sequence when the RNA is replicated by Q-beta replicase (Kramer & Lizardi Trends Biotechnol. 1991 9£2):53-8, 1991).
  • a variety of amplification enzymes are well known in the art and include, for example, DNA polymerase, RNA polymerase, reverse transcriptase, Q-beta replicase, thermostable DNA and RNA polymerases.
  • Amplification methods suitable for use with the present methods include, for example, strand displacement amplification, rolling circle amplification, primer extension preamplification, or degenerate oligonucleotide PCR (DOP). These methods of amplification are well known in the art and each described briefly below.
  • PCR is used to amplify a target or marker sequence flanking the tandem repeat segment of interest.
  • two or more oligonucleotide primers that anneal to opposite strands of a target or marker sequence are repetitively annealed to their complementary sequences, extended by a DNA polymerase (e.g., AmpliTaq Gold polymerase), and heat denatured, resulting in exponential amplification of the target nucleic acid sequences. Cycling parameters can be varied, depending on the length of nucleic acids to be extended. The skilled artisan is capable of designing and preparing primers that are appropriate for amplifying a target or marker sequence.
  • the length of the amplification primers for use in the present invention depends on several factors including the nucleotide sequence identity and the temperature at which these nucleic acids are hybridized or used during in vitro nucleic acid amplification.
  • the considerations necessary to determine a preferred length for an amplification primer of a particular sequence identity are well-known to a person of ordinary skill and include considerations described herein.
  • the length of a short nucleic acid or oligonucleotide can relate to its hybridization specificity or selectivity.
  • the amplification may include a labeled primer, thereby allowing detection of the amplification product of that primer.
  • the amplification may include a multiplicity of labeled primers, preferably such primers are distinguishably labeled, allowing the simultaneous detection of multiple amplification products.
  • Oligonucleotide primers can be designed which are between about 10 and about 100 nucleotides in length and hybridize to the marker sequence. Oligonucleotide primers are preferably 12 to 70 nucleotides; more preferably 15-60 nucleotides in length; and most preferably 15-25 nucleotides in length.
  • a primer pair is designed to amplify a marker sequence upstream of the tandem repeat region of the FMRl gene following size separation of nucleic acids fragmented by AIuI.
  • An exemplary marker sequence upstream of the FMRl tandem repeat region for designing hybridization primers is depicted in Figure 2 (SEQ ID NO:!).
  • a forward primer can hybridize to SEQ ID NO:1 between nucleotides 1 and 45, more preferably between positions 22 and 39 while a reverse primer can hybridize to SEQ ID NO:1 between positions 70 and 115, more preferably between 97 and 113.
  • oligonucleotide primers which may be used as amplification primers include SEQ ID NO:4 (5'-GGTGGAGGGCCGCCTCTG-S') and SEQ ID NO:5 (5'- AGCGGCGCCTCCGTCACC -3')- Other preferred oligonucleotide primers are approximately 15-100 nucleotides in length and comprise SEQ ID NO:4 or SEQ ID NO:5.
  • oligonucleotide primers include an oligonucleotide sequence that hybridizes to the complement of a 15-100 nucleotide sequence including SEQ ID NO:4 or SEQ ID NO:5. Such oligonucleotides may be substantially purified.
  • a primer pair is designed to amplify a marker sequence flanking the tandem repeat region of the FMRl gene following size separation of nucleic acids fragmented by BIpI and MIyI.
  • a primer pair is used to amplify a region of the flanking sequence downstream of the tandem repeat region; more specifically using a forward primer, FXCEF2 (SEQ ID NO:6), and a reverse primer, FXCER2 (SEQ ID NO:7), to amplify an 86 bp region of the marker sequence.
  • preferred oligonucleotides which may be used as amplification primers include SEQ ID NO:6 (5'- GATGGAGGAGCTGGTGGTGG -3') and SEQ ID NO:7 (5'- GGAAGGGCGAAGATGGGG -3')-
  • SEQ ID NO:6 5'- GATGGAGGAGCTGGTGGTGG -3'
  • SEQ ID NO:7 5'- GGAAGGGCGAAGATGGGG -3'
  • Other preferred oligonucleotide primers are approximately 15-100 nucleotides in length and comprise SEQ ID NO:6 or SEQ ID NO:7.
  • Still other preferred oligonucleotide primers include an oligonucleotide sequence that hybridizes to the complement of a 15-100 nucleotide sequence including SEQ ID NO:6 or SEQ ID NO:7. Such oligonucleotides may be substantially purified.
  • a primer pair is designed to amplify a marker sequence flanking the tandem repeat region of the FMRl gene following size separation of nucleic acids fragmented by SpM and Bmtl.
  • a primer pair is used to amplify a region of the flanking sequence downstream of the tandem repeat region; more specifically using a forward primer, FXCEF3 (SEQ ID NO:8), and a reverse primer, FXCER3 (SEQ ID NO.9), to amplify an 86 bp region of the marker sequence.
  • preferred oligonucleotides which may be used as amplification primers include SEQ ID NO:8 (5'- CGTGACGTGGTTTCAGTGTTTACA -3') and SEQ ID NO:9 (5'- GGAAGTGAAACCGAAACGGAG -3')-
  • SEQ ID NO:8 5'- CGTGACGTGGTTTCAGTGTTTACA -3'
  • SEQ ID NO:9 5'- GGAAGTGAAACCGAAACGGAG -3'
  • Other preferred oligonucleotide primers are approximately 15-100 nucleotides in length and comprise SEQ ID NO:8 or SEQ ID NO:9.
  • Still other preferred oligonucleotide primers include an oligonucleotide sequence that hybridizes to the complement of a 15-100 nucleotide sequence including SEQ ID NO:8 or SEQ ID NO:9. Such oligonucleotides may be substantially purified.
  • Assay controls may be used in the assay for detecting carriers and individuals afflicted with fragile X syndrome. Positive controls for normal or wild type FMRl gene (i.e., less than 55 tandem repeats), the premutation (55-200 tandem repeats), and the full mutation (greater than 200 tandem repeats) may be used.
  • Additional controls may be included in the assay to determine if the restriction enzyme digestion of the genomic DNA is complete.
  • One approach to evaluate the completeness of digestion by a particular restriction enzyme is to determine if the digested DNA can support a PCR amplification using a test primer pair that spans the restriction enzyme site used for digestion. Thus, if the nucleic acid has been fully digested by the restriction enzyme, there should be no amplification from the test primer pair, however, if digestion is incomplete, leaving some intact nucleic acid, the test primer pair should amplify the target. This test digestion PCR amplification can be conducted anytime after digestion including during amplification of the marker sequence.
  • the genomic DNA is digested with AIuI.
  • AIuI the completion of the digestion of genomic DNA by AIuI can be determined by amplifying a region containing an AIuI recognition site.
  • a pair of primers, AIuIF and AIuIR are used to amplify a 103 bp target segment of a nucleic acid fragment containing an AIuI recognition site. Amplification of this segment will only occur when AIuI digestion is incomplete.
  • the genomic DNA is digested with BIpI and MIyI.
  • the completion of the digestion of genomic DNA by BIpI and MIyI can be determined by amplifying a region containing a BIpI or an MIyI recognition site.
  • a pair of primers, BIpIF and BIpIR are used to amplify a 138 bp segment of target segment of nucleic acid containing a BIpI recognition site. Amplification of this segment will only occur when BIpI digestion is incomplete.
  • Additional controls may be included in the assay to verify proper size-separation of fragmented DNA.
  • sequence analysis tools known to those of skill in the art, one could determine the size distribution of fragments obtained using a particular restriction enzyme. One could then identify a specific control fragment having a size that corresponds to the size range of a particular fraction. Thus one could verify proper size separation of the fragmented DNA by detecting the control fragment in the appropriate fraction for its size.
  • a control amplification is also included to determine if the largest tandem repeat-containing fragments (obtained through digestion with AIuI) are collected into the size-appropriate fraction.
  • a pair of primers, LargeF and LargeR SEQ ID NO:14 and SEQ ID NO:15, respectively
  • controls are included to detect proper size separation of fragments of 675 bp, 905 bp, and 7,031 bp.
  • a pair of primers, FIIctrllF and FIIctrllR (SEQ ID NO: 16 and SEQ ID NO: 17, respectively) will amplify a 102 bp segment of a 675 bp BIpI I MIyI fragment of the CFTR gene when present in a fraction.
  • a pair of primers, FIIIctrl3F and FIIIctrBR (SEQ ID NO: 18 and SEQ ID NO: 19, respectively) will amplify a 113 bp segment of a 905 bp BIpI I MIyI fragment of the CFTR gene when present in a fraction.
  • a pair of primers, LgctrlF and LgctrlR (SEQ ID NO:20 and SEQ ID NO:21, respectively) will amplify a 156 bp segment of a 7,031 bp BIpI I MIyI fragment from chromosome 21 (21 :20912766-20919795) when present in a fraction.
  • Marker sequences may be amplified prior to detection or may be detected directly after size separation without an amplification step.
  • the marker sequence is amplified and the resulting amplicon is detected by electrophoresis, preferably capillary electrophoresis.
  • the marker sequence is amplified using a labeled primer such that the resulting amplicon is detectably labeled.
  • the primer is fluorescently labeled, however, the primers may be labeled according to the methods described below for oligonucleotide probes.
  • the fragmented DNA is detected directly, without an amplification step, using two distinguishably-labeled nucleic acid probes which hybridize to two separate segments of a marker sequence upstream or downstream of the tandem repeats.
  • the simultaneous detection of both labels in one hybridization complex indicates the presence of the marker sequence (and thus the associated tandem repeat).
  • detection is accomplished using a Trilogy 2020 Analyzer (US Genomics Woburn, MA).
  • two probes with distinguishable fluorescent labels are contacted with the size-separated DNA fragments.
  • the resulting mixture of hybridization complexes is directed into a capillary tube where it is exposed to multiple lasers of differing wavelengths.
  • the fluorescent labels of the probes are excited such that photons are emitted and detected.
  • the simultaneous detection of fluorescent labels of different colors indicates the presence of the target region.
  • Probe oligonucleotides may be detectably labeled by methods known in the art.
  • Useful labels include, e.g., fluorescent dyes (e.g., Cy5®, Cy3®, FITC, rhodamine, lanthamide phosphors, Texas red), 32 P, 35 S, 3 H, 14 C, 125 I, 131 I, electron-dense reagents (e.g., gold), enzymes, e.g., as commonly used in an ELISA (e.g., horseradish peroxidase, beta- gal actosidase, luciferase, alkaline phosphatase), colorimetric labels (e.g., colloidal gold), magnetic labels (e.g., DynabeadsTM), biotin, dioxigenin, or haptens and proteins for which antisera or monoclonal antibodies are available.
  • fluorescent dyes e.g., Cy5®, Cy3®, FITC, r
  • labels include ligands or oligonucleotides capable of forming a complex with the corresponding receptor or oligonucleotide complement, respectively.
  • the label can be directly incorporated into the nucleic acid to be detected, or it can be attached to a probe (e.g., an oligonucleotide) or antibody that hybridizes or binds to the nucleic acid to be detected.
  • the detectable label is a fluorophore.
  • fluorophore refers to a molecule that absorbs light at a particular wavelength (excitation frequency), and subsequently emits light of a different, typically longer, wavelength (emission frequency) in response.
  • Suitable fluorescent moieties include the following fluorophores known in the art:
  • Alexa Fluor® 350, Alexa Fluor® 488, Alexa Fluor® 546, Alexa Fluor® 555, Alexa Fluor® 568, Alexa Fluor® 594, Alexa Fluor® 647 (Molecular Probes) 5-(2'-aminoethyl)aminonaphthalene-l-sulfonic acid (EDANS)
  • BHQTM Black Hole QuencherTM dyes (biosearch Technologies) BODIPY® R-6G, BOPIPY® 530/550, BODIPY® FL Brilliant Yellow coumarin and derivatives: coumarin
  • DAS 5-[dimethylamino]naphthalene-l-sulfonyl chloride
  • DBCYL 4-(4'-dimethylaminophenylazo)benzoic acid
  • DBITC 4-dimethylaminophenylazophenyl-4'-isothiocyanate
  • EclipseTM EclipseTM (Epoch Biosciences Inc.) eosin and derivatives: eosin eosin isothiocyanate erythrosin and derivatives: erythrosin B erythrosin isothiocyanate ethidium fluorescein and derivatives:
  • TAMRA ⁇ tetramethyl-6-carboxyrhodarnine
  • TRITC tetramethyl rhodamine tetramethyl rhodamine isothiocyanate
  • the detectable label can be incorporated into, associated with or conjugated to a nucleic acid.
  • Label can be attached by spacer arms of various lengths to reduce potential steric hindrance or impact on other useful or desired properties. See, e.g., Mansfield, MoI. Cell. Probes 9:145-156, 1995.
  • Detectable labels can be incorporated into nucleic acids by covalent or non- covalent means, e.g., by transcription, such as by random-primer labeling using Klenow polymerase, or nick translation, or, amplification, or equivalent as is known in the art.
  • a nucleotide base is conjugated to a detectable moiety, such as a fluorescent dye, e.g., Cy3® or Cy5® and then incorporated into genomic nucleic acids during nucleic acid synthesis or amplification.
  • Nucleic acids can thereby be labeled when synthesized using Cy3®- or Cy5®-dCTP conjugates mixed with unlabeled dCTP.
  • Nucleic acid probes can be labeled by using PCR or nick translation in the presence of labeled precursor nucleotides, for example, modified nucleotides synthesized by coupling allylamine-dUTP to the succinimidyl-ester derivatives of the fluorescent dyes or haptens (such as biotin or digoxigenin) can be used; this method allows custom preparation of most common fluorescent nucleotides, see, e.g., Henegariu, Nat. Biotechnol. 18:345-348, 2000.
  • Nucleic acid probes may be labeled by non-covalent means known in the art.
  • Kreatech Biotechnology's Universal Linkage System® ULS®
  • ULS® Kreatech Biotechnology's Universal Linkage System®
  • This technology may also be used to label proteins by binding to nitrogen and sulphur containing side chains of amino acids. See, e.g., U.S. Patent Nos. 5,580,990; 5,714,327; and 5,985,566; and European Patent No. 0539466.
  • the binding of a probe to the marker sequence flanking the tandem repeat region may be determined by hybridization as is well known in the art. Hybridization may be detected in real time or in non-real time.
  • One general method for real time PCR uses fluorescent probes such as the TaqMan® probes, molecular beacons and scorpions.
  • the probes employed in TaqMan® and molecular beacon technologies are based on the principle of fluorescence quenching and involve a donor fluorophore and a quenching moiety.
  • the term "donor fluorophore” as used herein means a fl ⁇ orophore that, when in close proximity to a quencher moiety, donates or transfers emission energy to the quencher. As a result of donating energy to the quencher moiety, the donor fluorophore will itself emit less light at a particular emission frequency that it would have in the absence of a closely positioned quencher moiety.
  • quencher moiety means a molecule that, in close proximity to a donor fluorophore, takes up emission energy generated by the donor and either dissipates the energy as heat or emits light of a longer wavelength than the emission wavelength of the donor. In the latter case, the quencher is considered to be an acceptor fluorophore.
  • the quenching moiety can act via proximal (i.e. collisional) quenching or by F ⁇ rster or fluorescence resonance energy transfer (“FRET"). Quenching by FRET is generally used in TaqMan® probes while proximal quenching is used in molecular beacon and scorpion type probes.
  • proximal quenching a.k.a. "contact” or “collisional” quenching
  • the donor is in close proximity to the quencher moiety such that energy of the donor is transferred to the quencher, which dissipates the energy as heat as opposed to a fluorescence emission.
  • FRET quenching the donor fluorophore transfers its energy to a quencher which releases the energy as fluorescence at a higher wavelength.
  • Proximal quenching requires very close positioning of the donor and quencher moiety, while FRET quenching, also distance related, occurs over a greater distance (generally 1 —10 nm, the energy transfer depending on R "6 , where R is the distance between the donor and the acceptor).
  • the quenching moiety is an acceptor fluorophore that has an excitation frequency spectrum that overlaps with the donor emission frequency spectrum.
  • the assay may detect an increase in donor fluorophore fluorescence resulting from increased distance between the donor and the quencher (acceptor fluorophore) or a decrease in acceptor fluorophore emission resulting from increased distance between the donor and the quencher (acceptor fluorophore).
  • TaqMan® probes use the fiuorogenic 5' exonuclease activity of Taq polymerase to measure the amount of target or marker sequences in DNA samples.
  • TaqMan® probes are oligonucleotides that contain a donor fluorophore usually at or near the 5' base, and a quenching moiety typically at or near the 3' base.
  • the quencher moiety may be a dye such as TAMRA or may be a non-fluorescent molecule such as 4-(4 - dimethylaminophenylazo)benzoic acid (DABCYL). See Tyagi et al., Nature Biotechnology 16:49-53 (1998).
  • FRET fluorescing
  • TaqMan® probes are designed to anneal to an internal region of a PCR product.
  • the polymerase replicates a template on which a TaqMan® probe is bound, its 5' exonuclease activity cleaves the probe. This ends the activity of quencher (no FRET) and the donor fluorophore starts to emit fluorescence which increases in each cycle proportional to the rate of probe cleavage. Accumulation of PCR product is detected by monitoring the increase in fluorescence of the reporter dye (note that primers are not labeled). If the quencher is an acceptor fluorophore, then accumulation of PCR product can be detected by monitoring the decrease in fluorescence of the acceptor fluorophore.
  • TaqMan® assay uses universal thermal cycling parameters and PCR reaction conditions. Because the cleavage occurs only if the probe hybridizes to the target, the fluorescence detected originates from specific amplification. The process of hybridization and cleavage does not interfere with the exponential accumulation of the product.
  • One specific requirement for fluorogenic probes is that there be no G at the 5' end. A 1 G' adjacent to the reporter dye quenches reporter fluorescence even after cleavage.
  • MGB EclipseTM probes Epoch Biosciences
  • MGB EclipseTM probes work by a hybridization-triggered fluorescence mechanism.
  • MGB EclipseTM probes have the EclipseTM Dark Quencher and the MGB positioned at the 5'-end of the probe. The fluorophore is located on the 3'-end of the probe. When the probe is in solution and not hybridized, the three dimensional conformation brings the quencher into close proximity of the fluorophore, and the fluorescence is quenched.
  • Suitable donor fluorophores include 6-carboxyfluorescein (FAM), tetrachloro-6- carboxyfluorescein (TET), 2'-chloro-7'-phenyl-l,4-dichloro-6-carboxyfIuorescein (VIC), and the like.
  • Suitable quenchers include tetra-methylcarboxyrhodamine (TAMRA) 4-(4 - dimethylaminophenylazo)benzoic acid (“DABCYL” or a DABCYL analog) and the like. Tetramethylrhodamine (TMR) or 5-carboxyrhodamine 6G (RHD) may be combined as donor fluorophores with DABCYL as quencher. Multiplex TaqMan assays can be performed using multiple detectable labels each comprising a different donor and quencher combination. Probes for detecting amplified sequence in real time may be stored frozen (-10° to -3O 0 C) as 100 ⁇ M stocks. TaqMan probes are available from Applied BioSystems (4316032).
  • real time PCR is performed using TaqMan® probes in combination with a suitable amplification/analyzer such as the ABI Prism 7900HT Sequence Detection System.
  • the ABI PRISM® 7900HT Sequence Detection System is a high- throughput real-time PCR system that detects and quantitates nucleic acid sequences.
  • TaqManTM probes specific for the amplified target or marker sequence are included in the PCR amplification reaction. These probes contain a reporter dye at the 5' end and a quencher dye at the 3' end. Probes hybridizing to different target or marker sequences are conjugated with a different fluorescent reporter dye.
  • the fluorescently labeled probes bind specifically to their respective target or marker sequences; the 5' nuclease activity of Taq polymerase cleaves the reporter dye from the probe and a fluorescent signal is generated.
  • the increase in fluorescence signal is detected only if the target or marker sequence is complementary to the probe and is amplified during PCR.
  • a mismatch between probe and target greatly reduces the efficiency of probe hybridization and cleavage.
  • the ABI Prism 7700HT or 7900HT Sequence detection System measures the increase in fluorescence during PCR thermal cycling, providing "real time" detection of PCR product accumulation.
  • Real Time detection on the ABI Prism 7900HT or 7900HT Sequence Detector monitors fluorescence and calculates Rn during each PCR cycle.
  • the threshold cycle, or Ct value is the cycle at which fluorescence intersects the threshold value.
  • the threshold value is determined by the sequence detection system software or manually.
  • Oligonucleotide probes can be designed which are between about 10 and about 100 nucleotides in length and hybridize to the amplified region. Oligonucleotides probes are preferably 12 to 70 nucleotides; more preferably 15-60 nucleotides in length; and most preferably 15-25 nucleotides in length. The probe may be labeled.
  • SEQ ID NO:26 can be used as an oligonucleotide probe to detect a marker sequence associated with the tandem repeat region of the FMRl gene (following genomic fragmentation by AIuI), when the marker sequence is amplified by forward and reverse primers as set forth in SEQ ID NO:4 and SEQ ID NO:5, respectively.
  • SEQ ID NO:27 can be used to detect an AIuI control fragment amplicons amplified by AluIFtaq and AluIRtaq (SEQ ID NOs:22 and 23, respectively) and .
  • SEQ ID NO:28 can be used to detect the 8,479 bp AIuI control fragment amplicon, as amplified by LargeFtaq and LargeRtaq (SEQ ID NOs: 24 and 25, respectively).
  • Amplified fragments may be detected using standard gel electrophoresis methods. For example, in preferred embodiments, amplified fractions are separated on an agarose gel and stained with ethidium bromide by methods known in the art to detect amplified fragments.
  • methods involving amplification of the tandem repeat region are used to measure the size of that region.
  • such methods are used as a screen prior to the use of a second method for sizing the tandem repeat region.
  • the amplification is preferably done by PCR.
  • the entire tandem repeat region is amplified.
  • the resulting amplicons are sized using electrophoresis, preferably capillary electrophoresis.
  • forward primer FX-5F (SEQ ID NO:29; 5'GCT CAG CTC CGT TTC GGT TTC ACT TCC GGT 3') is used in an amplification reaction with reverse primer FX-3F(SEQ ID NO:30; 5'-AGC CCC GCA CTT CCA CCA CCA GCT CCT CCA-3') to amplify the tandem repeat region of the FMRl gene.
  • one of the primers of this primer pair is labeled, preferably the label is a fluorescent label.
  • Amplification product may be detected and sized by electrophoresis, preferably capillary electrophoresis. Alternatively, amplification products can be detected and sized using Southern blot.
  • the fraction in which the marker sequence upstream or downstream of the tandem repeat region is detected corresponds to the size of the fragment containing the tandem repeat. This correlation enables an estimation of the number of tandem repeats and thus, whether an individual is normal or carries an allele having an expansion in the tandem repeat region.
  • individuals that are afflicted with a disease associated with a mutation in the tandem repeat region of a gene can be distinguished from those that are normal.
  • a nucleic acid sample from the individual is fragmented to produce nucleic acid fragments in which the tandem repeat segment of the gene is associated with a marker sequence in a fragment.
  • the fragments are separated into fractions according to size under conditions in which the fragment(s) containing the tandem repeat segment will be located in the fractions according to the number of repeats in the tandem repeat segment, and identifying those fraction(s) containing the segment by detecting the marker sequence.
  • the fractions are chosen so that one fraction corresponds to tandem repeat regions having a normal number of repeats (i.e., a normal allele) and another fraction that corresponds an abnormal number of repeats (i.e., a mutated allele). If only the former fraction is positive, the individual is normal; if only the latter fraction is positive, the individual may be a carrier or may be afflicted with the disease. A positive result in both fraction indicates a heterozygote and the individual may or may not be affected, depending on whether the disease is dominant or recessive. If the disease is dominant, heterozygotes will be affected; if the disease is recessive, the heterozygote will not be affected but will be a carrier of the disease.
  • individuals that have a normal allele can be distinguished from those that have a premutation or a full mutation in the tandem repeat region of a gene.
  • a nucleic acid sample from the individual is fragmented to produce nucleic acid fragments in which the tandem repeat segment of the gene is associated with a marker sequence in a fragment.
  • the fragments are separated into fractions according to size under conditions in which the fragment(s) containing the tandem repeat segment will be located in the fractions according to the number of repeats in the tandem repeat segment, and identifying those fraction(s) containing the segment by detecting the marker sequence.
  • the fractions are chosen so that a first fraction corresponds to tandem repeat regions having a normal number of repeats (i.e., a normal allele), a second fraction corresponds to tandem repeat regions having a number of repeats in a premutation (i.e., a premutation allele), and a third fraction that corresponds to tandem repeat regions having a number of repeats in a full mutation (i.e., a full mutation allele).
  • a first fraction corresponds to tandem repeat regions having a normal number of repeats (i.e., a normal allele)
  • a premutation i.e., a premutation allele
  • a third fraction that corresponds to tandem repeat regions having a number of repeats in a full mutation.
  • a positive result in more than one fraction indicates a heterozygote.
  • a heterozygote may be a carrier or an affected individual depending on the gene involved and the dominance of the disease.
  • individuals having a mutation associated with fragile X syndrome can be distinguished from individuals having a premutation or a normal allele.
  • a nucleic acid sample from the individual is fragmented to produce nucleic acid fragments in which the tandem repeat segment of the FMRl gene is associated with a marker sequence in a fragment.
  • the fragments are separated into fractions according to size under conditions in which the fragment(s) containing the tandem repeat segment will be located in the fractions according to the number of repeats in the tandem repeat segment, and identifying those fraction(s) containing the segment by detecting the marker sequence.
  • the fractions are chosen so that a first fraction corresponds to tandem repeat regions having a normal number of repeats (i.e., a normal allele), a second fraction corresponds to tandem repeat regions having a number of repeats in a premutation (i.e., a premutation allele), and a third fraction that corresponds to tandem repeat regions having a number of repeats in a full mutation (i.e., a full mutation allele).
  • the fractions are chosen so that the first fraction corresponds to tandem repeat regions having less than 55 repeats, the second corresponds to tandem repeat regions having 55-200 repeats, and the third corresponds to tandem repeat regions having greater than 200 repeats.
  • the first fraction is positive (i.e., the marker sequence was detected in this fraction)
  • it indicates that the tandem repeat region contains less than 55 repeats i.e., a normal allele
  • the second fraction is positive
  • it indicates that the tandem repeat region contains 55-200 repeats i.e., a premutation allele
  • the third fraction is positive, it indicates that the tandem repeat contains greater than 200 repeats (i.e., a full mutation allele).
  • a phenotype or disease status may be assigned based on these results.
  • Females heterozygous for a normal allele and a full mutation allele may or may not be affected, depending on other factors such as methylation status of the gene.
  • an assay to distinguish individuals having a normal tandem repeat region of the FMRl gene from those having a premutation or full mutation two fractions of AIuI fragmented genomic DNA are separated by capillary electrophoresis and collected by automatic fraction collector, one between 211 — 400 bp, one between 401 bp - 9 kb.
  • AIuI fragments having 6-68 repeats will be present in the lower fraction, thus normal alleles and those with small premutations (i.e., 55-68 repeats) will be separated into this fraction.
  • Normal alleles and small premutions can be further distinguished by amplification of the tandem repeat region and sizing with electrophoresis.
  • the premutation encompassing a range of 69-200 repeats, and the full mutation, encompassing 201-2000+ repeats will be present in the upper fraction. Therefore, if the lower fraction is positive (i.e., the marker sequence was detected in this fraction), it indicates that there is a CGG tandem repeat region that contains 6-68 repeats; if the upper fraction is positive, it indicates that there is CGG tandem repeat region that contains 68-2000+ repeats. A positive result in both fractions would indicate a heterozygote in which one allele is normal and the other allele contains the premuation or the full mutation.
  • DNA fragmented with a combination of BIpI an dMlyl is separated into four fractions corresponding to sizes of approximately less than or equal to 603 bp (first/lowest fraction), 604-840 bp (second fraction), 841 - 1078 bp (third fraction) and 1079 bp - 9 kb (fourth/highest fraction). Fragments from samples containing a normal FMRl gene (i.e., less than 55 tandem repeats) and those containing premutations having 56-62 tandem repeats will separate into the first/lowest fraction.
  • Normal alleles and small premutions can be further distinguished by amplification of the tandem repeat region and sizing with electrophoresis. Fragments from FMRl genes containing small premutations (i.e., 63-140 tandem repeats) will separate into the second fraction, whereas fragments from FMRl genes containing large premutations (i.e., 141-200 tandem repeats) and full mutations having 201-220 tandem repeats will separate into the third fraction, and whereas large premutations having greater than 220 tandem repeats will separate into the fourth/highest fraction. Large premuations 141-200 repeats can be distinguished from full mutations of 201-220 using, for example, standard Southern blot methods.
  • the AIuI digested genomic DNA is size fractionated using capillary electrophoresis into a multiplicity of fractions. Automated fraction collection is accomplished using a preset fraction time window of approximately 30 seconds per fraction, beginning at 200 bp and ending at 9 Kb. Approximately 16 fractions are collected. The fraction or fractions that are positive for detection of the marker sequence upstream or downstream of the tandem repeat region correspond to a size range and thus, the number tandem repeats can be estimated.
  • a nucleic acid assay to determine gender is combined with an assay to determine the length of the tandem repeat region of the FMRl gene.
  • the nucleic acid assay includes DNA amplification.
  • DNA amplification assays may target amplification of sequences specific to the Y chromosome (e.g., the SRY locus (Sinclair, et al., Nature 346:240 244, 1990)). In this case, amplification only occurs in the presence of a Y chromosome, indicating the nucleic acids are from a male. The absence of amplification suggests the nucleic acids are from a female.
  • a positive control is preferably included to detect false negatives.
  • certain genes which occur on both the X chromosome and the Y chromosome but having different lengths depending on whether the gene occurs on the X chromosome or the Y chromosome may be targeted for amplification.
  • a region encompassing the segment of the gene which differs between the X and Y chromosomes would be amplified. This results in amplification products having different sizes, corresponding to the template nucleic acid (i.e., the X chromosome or the Y chromosome).
  • the template nucleic acid i.e., the X chromosome or the Y chromosome
  • the amelogenin gene is targeted for gender determination. Sequence differences between the X and Y homologs of the amelogenin gene have been used to differentiate males from females. For example, two primer sets primer sets spanning a 6 base pair (bp) deletion of the amelogenin gene on the X chromosome have been used to generate fragments of 106/112 bp or 212/218 bp for XfY products, respectively (Sullivan et al., BioTechniques 15:636-9, 1993). In preferred embodiments, the following primers are used to amplify a region of the amelogenin gene:
  • AMLF2 primer S'-AGTACTTGACCACCACCTCCTGATCTACAAGG 3' (SEQ ID NO:40) and
  • AMLR2 primer 5'-TTTTTAACAGTTTACTTGCTGATAAAACTCAYCCC 3' (SEQ ID NO:41).
  • This primer pair results in a 134 bp amplicon corresponding to the X chromosome homolog and a 140 bp amplicon corresponding to the Y chromosome.
  • both amplicons would be generated by amplification of nucleic acids from males, whereas only one amplicon would be generated by amplification of nucleic acids from females.
  • Genomic DNA test samples and control samples of DNA were restriction endonuclease digested with AIuI. 1.0 ⁇ g of test or control DNA was used for each digest. Genomic DNA was purified and diluted to a concentration of 50 ng/ ⁇ L.
  • the reaction mix for the digest was prepared according to the following table. Table 5. AhA reaction mix
  • the digested samples were fractionated using the same conditions as the 1 kb DNA ladder.
  • the lower fraction (211-396 bp) and the upper fraction (396 bp -9 kb) were collected based on the sizing cutoff times as determined using the 1 kb ladder for each size range.
  • Fractions were collected using the P/ACE MDQ auto-collector and stored in a 96-well plate in 30 ⁇ L 0.1 X TBE buffer per well.
  • the digested samples were fractionated based on the same condition as the 1-kb DNA ladder running condition. A total 16 fractions were collected using P/ACE MDQ's auto collector and stored in a 96-well plate in 5 ⁇ L dH2 ⁇ per well.
  • a PCR Master Mix for amplifying fragments and for size separation analysis of PCR product was prepared as shown in Table 6. Table 6. Preparation of PCR primer (non-Taqman) master mix for CCG 5' flanking sequence
  • PCR primer master mix was stored in 1.5 mL aliquots at -20 0 C prior to use. When ready for use, PCR reactions were prepared as shown in Table 7.
  • the final amplification mixtures were sealed tightly in plates with Microseal A film.
  • the plates were vortexed briefly (approximately 5 sec) and spun down for approximately 30 sec in a plate centrifuge at 2,000-6,00Og (1,600 rpm in a Sorvall T6000D centrifuge).
  • the plate was transferred to the ABI 9700 thermal cycler.
  • the thermal cycler conditions for amplification were as follows:
  • Step 1 95°C for 15 minutes.
  • Step 3 64°C for 60 seconds.
  • Step 5 Steps 2-4 repeated, 34 times.
  • Step 6 72°C for 5 minutes.
  • Step 7 4°C indefinitely.
  • PCR products were then loaded onto the ABI 3100 genetic analyzer for detection.
  • a Taqman PCR master mix for detecting CCG 5 ' flanking sequence was prepared as shown in Table 8.
  • Taqman PCR master mix was stored in 1.5 mL aliquots at -20 0 C prior to use. When ready for use, Taqman PCR reactions were prepared as shown in Table 9.
  • thermocycler conditions for TaqMan were as follows:
  • Step 1 95 0 C for 15 minutes.
  • Step 3 64°C for 60 seconds.
  • Step 4 72 0 C for 30 seconds
  • Step 5 Steps 2-4 repeated, 40 times.
  • Step 6 72 0 C for 5 minutes.
  • Step 7 4°C indefinitely.
  • genomic DNA test samples are restriction endonuclease digested with AIuI. Approximately 1.0 ⁇ g of test genomic DNA is used for each digest. The reaction mix for the digest is prepared according to the enzyme supplier's protocol. The samples are mixed and incubated at 37°C to complete digestion.
  • the restriction enzyme digested DNA is separated according to size using capillary electrophoresis and two fractions are collected, such that the first fraction (250-360 bp) corresponds to the normal repeat range (e.g., 5-37 repeats) and the second fraction (400 bp - 9 kb) corresponds to an expanded repeat mutation (e.g., greater than 50 repeats).
  • 1 kb DNA ladder is first injected into the capillary to determine the correct sizing cutoff time. Automated fraction collection is accomplished using a preset fractionation time window corresponding to a lower fraction and an upper fraction. The separations are monitored on— column by UV detection. The digested samples are then fractionated using the same conditions as the 1 kb DNA ladder. The lower fraction and the upper fraction are collected based on the sizing cutoff times as determined using the 1 kb ladder for each size range.
  • Each fraction is analyzed using the TaqMan real time PCR method for the presence of a fragment containing the DM-I tandem repeat region.
  • a segment of the 3 '-untranslated region of DM-I gene is amplified using a forward primer (e.g., 5'- CCATTTCTTTCTTTCGGCCA-3'; SEQ ID NO:31) and a reverse primer (e.g., 5'- AGGCCTGC AGTTTGCCC-3'; SEQ ID NO:32).
  • the amplified fragment is detected with a TaqMan labeled probe, 5'-TGAGGCCCTGACGTGG-3 f (SEQ ID NO:33).
  • the presence of the amplified segment in only the lower fraction is indicative of an individual homozygous for the normal DM-I allele.
  • the presence of the amplified segment in only the upper fraction is indicative of an individual homozygous for a mutant allele(s).
  • the presence of the amplified segment in both fractions is indicative of a heterozygote.
  • genomic DNA test samples are restriction endonuclease digested with AIuI and Rsal. Approximately 1.0 ⁇ g of test genomic DNA is used for each digest. The reaction mix for the digest is prepared according to the enzyme supplier's protocol. The samples are mixed and incubated at 37°C to complete digestion.
  • the restriction enzyme digested DNA is separated according to size using capillary electrophoresis and two fractions are collected, such that the first fraction (300-405 bp) corresponds to the normal repeat range (e.g., 7-34 repeats) and the second fraction (600 bp - 9 kb) corresponds to an expanded repeat mutation ⁇ e.g., greater than 100 repeats).
  • 1 kb DNA ladder is first injected into the capillary to determine the correct sizing cutoff time. Automated fraction collection is accomplished using a preset fractionation time window corresponding to a lower fraction and an upper fraction. The separations are monitored on— column by UV detection. The digested samples are then fractionated using the same conditions as the 1 kb DNA ladder. The lower fraction and the upper fraction are collected based on the sizing cutoff times as determined using the 1 kb ladder for each size range.
  • Each fraction is analyzed using the TaqMan real time PCR method for the presence of a fragment containing the FRDA tandem repeat region.
  • a segment of the first intronic region of the FRDA gene is amplified using a forward primer (e.g., 5'- AGGCCT AGG A AGGTGGATCAC-3'; SEQ ID NO:34) and a reverse primer (e.g., 5'- ACCATGTTGGCCAGGTTAGTCT-3'; SEQ ID NO:35).
  • the amplified fragment is detected with a TaqMan labeled probe, 5'-TGAGGTCCGGAGTTC-S' (SEQ ID NO:36).
  • the presence of the amplified segment in only the lower fraction is indicative of an individual homozygous for the normal FRDA allele.
  • the presence of the amplified segment in only the upper fraction is indicative of an individual homozygous for a mutant allele(s).
  • the presence of the amplified segment in both fractions is indicative of a heterozygote.
  • each of the four fractions was digested with a second restriction endonuclease, BstNI, that cleaved the marker from the CCG tandem repeat region.
  • BstNI a second restriction endonuclease
  • each of the four fractions were subjected to PCR as described in Example 3B (i.e., TaqMan PCR Amplification).
  • expansion mutations of the tandem repeat region of the FMRl gene are detected by fragmentation with BIpI and MIyI, size fractionation, followed by a second restriction enzyme digestion with Bmtl.
  • Genomic DNA test samples are restriction endonuclease digested with BIpI and MIyI using approximately 1.5 ⁇ g of test or control DNA (purified and diluted to a concentration of 50 ng/ ⁇ L) for each digest.
  • the reaction mix for the digest is prepared according to the following table.
  • Fraction 1 (less than 603 bp) contains nucleic acids with 6-62 CCG tandem repeats
  • fraction 2 (603-840 bp) contains 63-140 CCG tandem repeats
  • fraction 3 (841-1078b ⁇ ) contains 141-220 CCG tandem repeats
  • fraction 4 (1079 bp - 9 kb) contains 221-2000+ CCG tandem repeats.
  • Bmtl restriction endonuclease
  • each of the four fractions are subjected to PCR using the following primers:
  • a PCR master mix for amplifying fragments for size analysis is prepared according to Table 15.
  • a final PCR amplification mixture is made by adding 0.5 ⁇ L HotStarTaq (Qiagen) is added to each individual PCR reaction followed by 5 ⁇ L digested fractionated DNA. The final amplification mixtures are sealed tightly in plates with Microseal A film. The plates are vortexed briefly (approximately 5 sec) and spun down for approximately 30 sec in a plate centrifuge at 2,000-6,00Og (1 ,600 rpm in a Sorvall T6000D centrifuge). The plate is transferred to the ABI 9700 thermal cycler.
  • thermal cycler conditions for amplification are as follows:
  • Step 1 95°C for 15 minutes.
  • Step 3 55°C for 30 seconds.
  • Step 5 Steps 2-4 repeated, 33 times.
  • Step 6 72°C for 10 minutes.
  • Step 7 4°C indefinitely.
  • expansion mutations of the tandem repeat region of the FMRl gene are detected by fragmentation with Sphl and Bmtl, size fractionation, followed by a second restriction enzyme digestion with BstNI.
  • genomic DNA test samples are restriction endonuclease digested with Sphl and Bmtl using approximately 1.5 ⁇ g of test or control DNA (purified and diluted to a concentration of 50 ng/ ⁇ L) for each digest.
  • the reaction mix for the digest is prepared according to the following table. Table 16.
  • Fraction 1 contains nucleic acids with 6-62 tandem repeats
  • fraction 2 contains 63-163 tandem repeats
  • fraction 3 contains 164-196 tandem repeats
  • fraction 4 contains greater than tandem repeats.
  • an aliquot of each of the four fractions is digested with a second restriction endonuclease, Bmtl, that cleaved the marker from the CCG tandem repeat region.
  • Bmtl second restriction endonuclease
  • Step 1 95°C for 15 minutes.
  • Step 3 55°C for 30 seconds.
  • Step 5 Steps 2-4 repeated, 33 times.
  • Step 6 72°C for 10 minutes.
  • Step 7 4°C indefinitely.
  • An AIuI fragment from a region of the genome distinct from FMRl, that was larger than 6000 bases, containing trinucleotide repeats, and having a high GC content and/or CpG islands was identified for use as a control fragment in the FMRl assay.
  • This fragment was identified as follows. All AIuI sites in the human genome were identified using the EMBOSS Restrict program (Rice et al., "EMBOSS: The European Molecular Biology Open Software Suite.” Trends in Genetics 16(6):276-7 (2000)), resulting in a predicted 11.5 million fragments produced by a digestion with A IuI. From these fragments, 20 fragments having a length longer than 6000 bases were identified using the TACG program (Mangalam, HJ.
  • the region is amplified by the polymerase chain reaction in the presence of a fluorescently-labeled primer (e.g., 6-FAM) and the sizes of the resulting labeled amplicons are determined by capillary electrophoresis.
  • a fluorescently-labeled primer e.g., 6-FAM
  • a second trinucleotide repeat (CAG in the X-linked androgen receptor gene) is co-amplified using a primer pair in which one of the primers is fluorescently-labeled and co-analyzed to provide an internal amplification control.
  • tandem repeat region of the FMRl gene is amplified using FX-5F (SEQ ID NO:29) as the forward primer and FX-3F (SEQ ID NO:30) as the reverse primer, and trinucleotide repeat of the X-linked androgen receptor gene using AR-5F (SEQ ID NO:37) as the forward primer and AR-R2 (SEQ ID NO.38) as the reverse primer, as set forth in the table below.
  • a PCR master mix for amplifying fragments for size analysis is prepared according to Table 19.
  • the PCR master mix is aliquoted into 1,100 ⁇ L aliquots to which 5.5 ⁇ L of Taq polymerase (5 U/ ⁇ L) from Qiagen and 22 ⁇ L of Pfu DNA polymerase (2.5 U/ ⁇ L) from Stratagene are added to make the polymerase/master mix solution.
  • Genomic DNA is diluted to 20 ng/ ⁇ L in TE buffer. Samples are heated to 93-97°C for 4-6 minutes and cooled on ice. 10 ⁇ L of polymerase/master mix solution is added to 2 ⁇ L (40 ng) diluted genomic DNA. [0205] The samples are transferred to the ABI 9700 thermal cycler once the thermal cycler has reached 85°C +/- 2°C.
  • the PCR conditions for amplification are as follows:
  • Step 3 60 0 C for 2 minutes
  • Step 5 Steps 2-4 repeated, 31 times
  • Step 7 4°C indefinitely.
  • PCR products are then loaded onto the ABI 3100 genetic analyzer for detection.
  • EXAMPLE 12 Carrier screening for Fragile X Syndrome
  • samples from male and female individuals were screened for carrier status of fragile X syndrome by a two-step method, in which samples were initially screened with multiplex PCR to establish gender and size of the FMRl region and were subjected to further sizing analysis based on the results of the multiplex PCR. All samples that were determined to be female and heterozygous for two normal alleles were not subjected to further analysis. All samples determined to be female and apparently homozygous at the FMRl locus (24% of all analysis) were subjected to further analysis to determine the size of the FMRl tandem repeat region.
  • Genomic DNA was extracted from 150 ⁇ L whole blood collected in EDTA anticoagulated blood collection vacuum tubes using an Xtractor GeneTM (Corbett Life Science, Mortlake, NSW, Australia) according to manufacturer's Whole Blood DNA Extraction Protocol. The final elution was carried out in 100 ⁇ L buffer to consistently yield concentrations of 50 - 100 ng/ ⁇ L.
  • Genomic DNA samples were then analyzed by a multiplex PCR consisting of an amplification of the FMRl tandem repeat region using FX-5F primer (FAM-labeled; SEQ ID NO:29) and FX-3F primer (SEQ ID NO:30); amplification of a region of the amelogenin gene to establish gender using AMLF2 primer, 5'-
  • AR-5F HEX-labeled SEQ ID NO:37
  • AR-R2 primer SEQ ID NO:38
  • a PCR mastermix for the multiplex PCR was prepared consisting of 3.3 ⁇ M of each of the above primers, IX Qiagen Standard PCR buffer, 0.4 mM MgCl 2 , 2% DMSO, 1 X Qiagen Q Solution, 0.2 mM dNTP, and 0.25 unit Qiagen Taq DNA polymerase (Qiagen, Valencia, CA), 0.5 unit Pfu DNA Polymerase (Strategene, La Jolla, CA). One ⁇ L of isolated DNA solution was added to 10 ⁇ L of the multiplex primer mix to a final volume of 11 ⁇ L.
  • the PCR conditions were as follows: 95 0 C for 6 min followed by 32 cycles of 95 0 C for 1 min, 60 0 C for 2 min, 75 0 C for 5 min, and finally the amplified products were extended at 75 0 C for 15 min.
  • the PCR fragments were analyzed on an ABI 3100 automated DNA sequencer (Applied Biosystems, Foster City, CA, USA) and fragment analysis accomplished with ABI GeneScanTM V3.7 and GenotyperTM V3.7 software (Applied Biosystems).
  • the amelogenin primer pair results in a 134 bp amplicon corresponding to the X chromosome homolog and a 140 bp amplicon corresponding to the Y chromosome.
  • genomic DNA was digested for 16 hours at 37°C with restriction enzymes BIpI and MIyI (New England BioLabs, Ipswich, MA 3 USA). Following incubation the restriction fragments were either pressure injected or vacuum injected onto a P/ ACETM MDQ capillary electrophoresis system with an UV/Vis Detector (Beckman Coulter, Fullerton, CA, USA). Undenatured double stranded DNA was separated at an electric field strength of 100 V/cm, in IX TBE buffer (90 mM Tris-Borate, 2 mM EDTA, pH 8.3). Capillary temperature was maintained at 25 0 C.
  • restriction enzymes BIpI and MIyI New England BioLabs, Ipswich, MA 3 USA. Following incubation the restriction fragments were either pressure injected or vacuum injected onto a P/ ACETM MDQ capillary electrophoresis system with an UV/Vis Detector (Beckman Coulter, Fullerton, CA
  • AU collected fractions were then subjected to restriction enzyme digestion with Bmtl (New England BioLabs) according to manufacturer's procedure in order to cleave the marker sequence from the tandem repeat region. Five ⁇ L of each digested fraction was transferred to 96-well plates containing 20 ⁇ L PCR mix in each well.
  • This PCR mix consists of IX Qiagen Standard PCR buffer, 1.5 mM MgCl 2 , 5% DMSO, 100 mM KCl, 0.2 mM dNTP, 2.5 units HotStart Taq DNA polymerase (Qiagen Inc), and l ⁇ M each of following primers: FXCEF2 primer (FAM-labeled; SEQ ID NO:6), FXCER2 primer (SEQ ID NO:7), and 0.01 ⁇ M each of following primers: BIpIF primer (HEX-labeled, SEQ ID NO:12), BIpIR primer (SEQ ID NO: 13), lgctrlF primer (FAM-labeled, SEQ ID NO:20), lgctrlR primer (SEQ ID NO:21), F2ctrllF primer (HEX-labeled, SEQ ID NO:16), F2ctrllR primer (SEQ ID NO: 17), F3ctrl3F primer (FAM
  • the PCR conditions were as follows: 95 0 C for 15 min following by 33 cycles of 95 0 C for 30 sec, 55 0 C for 30 sec, 72 0 C for 1 min, and finally the amplified products were extended at 72 0 C for 10 min.
  • the final PCR products were then analyzed on a 3100 Prism Genetics Analyzer (Applied Biosystems) using GenescanTM-350 ROX size standard (Applied Biosystems).

Abstract

The present invention provides methods of determining the size of a particular nucleic acid segment of interest in a sample of nucleic acids through fragmentation of DNA, size fractionation, an optional second fragmentation, and identification using a marker sequence. In particular aspects, an expansion or reduction of tandem repeat sequences can be detected. In further aspects, carriers and individuals afflicted with fragile X syndrome or other diseases associated with tandem repeats can be distinguished from normal individuals.

Description

NUCLEIC ACID SIZE DETECTION METHOD
FIELD OFTHE INVENTION
[0001] The present invention relates generally to the field of medical diagnostics. In particular, the present invention relates to methods of detecting genetic mutations characterized by an expansion of tandem repeats.
BACKGROUND
[0002] A tandem repeat in DNA represents two or more contiguous approximate copies of a pattern of nucleotides. Tandem repeats have been shown to be associated cause a variety of human diseases. Dramatic expansion of trinucleotide repeats has been associated with such diseases as fragile-X mental retardation (see Verkerk, et al., (1991) Cell, 65, 905-914), Huntington's disease (see Huntington's Disease Collaborative Research Group. (1993) Cell, 72, 971-983), myotonic dystrophy (see Fu, et al., (1992) Science, 255, 1256-1258), spinal and bulbar muscular atrophy (see La Spada, et al., (1991) Nature, 352, 77-79) and Friedreich's ataxia (see Campuzano, et al., (1996) Science, 271, 1423-1427).
[0003] Fragile X syndrome is one of the most common causes of inherited mental retardation, occurring in approximately one in 1,250 males and approximately one in 2,500 females. Males with fragile X syndrome typically exhibit some degree of mental impairment, ranging from learning disabilities to mental retardation to autism. Characteristic physical features (e.g., enlarged ears, elongated face with prominent chin), connective tissue problems (e.g., mitral valve prolapse, and double-jointed fingers), and characteristic behaviors (e.g., attention deficit disorders, speech disturbances, and unusual responses to various touch, auditory, or visual stimuli) may also be exhibited. Affected females present with similar but milder mental impairment, physical characteristics, and behavioral characteristics as those of affected males.
[0004] The mutation responsible for fragile X syndrome involves expansion of a trinucleotide (CGG) tandem repeat sequence located in the 5' untranslated region of the FMRl gene on the X chromosome. The number of CGG repeats in the FMRl gene determines whether an individual is normal or has one of the two categories of mutation: premutation and full mutation. The number of repeats ranges from less than 55 repeats in normal, non-carrier individuals, whereas a premutation consists of 55 to 200 repeats and full mutation consists of more than 200 repeats (Chen et al. Hum. MoI. Genetics 12(23):3067-74, 2003).
[0005] Both males having a premutation and females having a premutation in one FMRl gene are carriers but are unaffected. Male carriers are referred to as "normal transmitting" males, and pass on the mutation, relatively unchanged in size to each daughter. Although such daughters are unaffected, they are at risk of having affected offspring because a premutation is susceptible to expansion after passage through a female meiosis. Furthermore, the larger the premutation, the higher the risk of expansion to a full mutation in any offspring.
[0006] Most males with a full mutation exhibit mental retardation and stereotypical physical and behavioral characteristics. For females with a full mutation in one FMRl gene, about one-third exhibit normal intelligence, about one-third exhibit borderline intelligence, and about one-third exhibit mental retardation.
[0007] Currently, the industry standard for screening for carriers or affected individuals with expansion of tandem repeat regions such as Fragile X is a combination of PCR amplification of the tandem repeat region and analysis by Southern blotting.
SUMMARY OF THE INVENTION
[0008] In one aspect of the invention, there are provided methods of determining the size of a particular nucleic acid segment of interest in a sample of nucleic acids. This method is accomplished by separating fragments of a nucleic acid, wherein the fragments are prepared from a nucleic acid-containing sample, wherein the fragments include some which contain the segment and a marker sequence, wherein separating is into fractions according to size under conditions in which a fragment containing the segment will be located in the fractions according to the size of the segment, and identifying those fraction(s) containing the segment by detecting the marker sequence, wherein the size of the segment is determined by the fraction in which it is identified.
[0009] This method is applicable to essentially any nucleic acid segment of interest, however the method is particularly amenable to determining the size of nucleic acid segments which are, for example, difficult to size by methods utilizing amplification (e.g. PCR) across the nucleic acid segment. Examples of nucleic acid segments which are difficult to size by methods utilizing amplification across the nucleic acid segment include nucleic acid segments which have high content of the bases guanine and cytosine, large nucleic acid segments, and/or segments having large numbers of tandem repeats. In a preferred embodiment, the particular nucleic acid segment of interest is a tandem repeat nucleic acid sequence. A length of nucleic acid which is difficult to amplify by PCR is generally greater than 50,000 bases, more typically greater than 100,000 bases, more typically greater than 150,000 bases, or more than 200,000 bases, or even more than 250,000 bases.
[0010] In another aspects, invention methods are used to determine the size of a particular nucleic acid segment in a sample from an individual, thereby determining if that individual has an abnormality in size of that particular nucleic acid segment, wherein the abnormality is due to a duplication, addition, or deletion in the particular nucleic acid segment.
[0011] In still another aspect, the invention provides a method of detecting a mutation in a tandem repeat segment of a gene in a nucleic acid sample, wherein the mutation is characterized by an increase in the number of repeats compared to the number of repeats in the wild type allele. The method is accomplished separating fragments of a nucleic acid, wherein the fragments are prepared from a nucleic acid-containing sample, wherein the fragments include some which contain the tandem repeat segment and a marker sequence, wherein the separating is into fractions according to size under conditions in which a fragment containing the tandem repeat segment will be located in the fractions according to the number of repeats in the tandem repeat segment; and identifying those fraction(s) containing the segment by detecting the marker sequence. The number of repeats in the tandem repeat segment is determined by the fraction in which it is identified. The number of repeats is compared to the number in the corresponding wild type allele, wherein a number of repeats greater that the number in wild type allele is indicative of a mutation.
[0012] In certain embodiments, the above aspect of the invention further includes determining if a mutation is a premutation or a full mutation. This determination is accomplished by comparing the number of repeats in the tandem repeat segment from the nucleic acid sample to the number in the corresponding full mutation allele, wherein a number of repeats greater than the wild type allele but less than the full mutation is indicative of a premutation allele, and a number of repeats greater than or equal to the full mutation is indicative of a full mutation allele. In other embodiments the number of repeats in the tandem repeat region of the nucleic acid sample can be compared to the number of repeats found in each of a wild type allele, a premutation allele, and a full mutation allele.
[0013] In particular embodiments of the above aspect of the invention there are provided methods of identifying FMRl alleles having a normal number of tandem repeats, a premutation, or a full mutation in the nucleic acid of an individual, in which the method includes, fragmenting the nucleic acid in the sample from the individual into fragments, wherein the tandem repeat segment of the FMRl gene is associated with a marker sequence in the fragment, separating fragments of a nucleic acid, wherein the fragments are prepared from a nucleic acid containing sample of the individual, wherein the fragments include some which contain a tandem repeat segment of the FMRl gene and a marker sequence, the separating into fractions according to size under conditions in which a fragment containing the tandem repeat segment having a normal number of repeats will be located in a first fraction; and a fragment containing a tandem repeat segment having a premutation will be located in a second fraction; and a fragment having a tandem repeat region having a full mutation will be located in a third fraction, identifying those fraction(s) containing the segment by detecting the marker sequence, wherein the number of repeats in the tandem repeat segment is determined by the fraction in which it is identified, wherein a positive result in the first fraction indicates the individual has an FMRl allele with a normal number of tandem repeats; a positive result in the second fraction indicates the individual has a premutation FMRl allele; and a positive result in the third fraction indicates the individual has a full mutation FMRl allele.
[0014] In other aspects, there are provided methods for detecting carriers of genetic mutations characterized by the expansion or reduction of a tandem repeat segment of a gene and diagnosing individuals afflicted with diseases caused by such an expansion. The method involves the detection of wild type alleles, premutation alleles and/or full mutation alleles for a particular gene as described above. A genotype may then be determined based on the allele(s) present in an individual, allowing the designation of normal, carrier, or affected status.
[0015] As used herein a "carrier" is an individual who carries an mutated or altered allele of a gene but is not affected by the disorder or disease associated with mutation. Carriers can pass the mutation to a child or offspring in future generations, who may be affected with the disease or disorder. With respect to Fragile X syndrome, both males and females may be carriers. As used herein in reference to males, the term "carrier" is used interchangeably with "premutation carrier" and refers to males having a premutation allele. As used herein in reference to females, the term carrier encompasses females having a premutation allele or a full mutation allele. Such female carriers may also be referred to herein as "premutation carrier" (i.e., having a premuation FMRl allele) or a "full mutation carrier" (i.e., having a full mutation FMRl allele).
(0016] As used herein "affected" refers to individuals who possess one or more mutated alleles of a particular gene and exhibit the disease or disorder (i.e., phenotype) associated that mutation. With respect to Fragile X syndrome, males having a full mutation FMRl allele are affected, whereas females having a single full mutation allele may be affected or may be a full mutation carrier.
[0017] In some embodiments of the above aspect of the invention, male individuals afflicted with Fragile X syndrome (full mutation) can be distinguished from individuals that are carriers (premutation) or from those that are normal. A nucleic acid sample from the individual is fragmented to produce nucleic acid fragments in which the tandem repeat segment of the FMRl gene is associated with a marker sequence in a fragment. The fragments are separated into fractions according to size under conditions in which the fragment(s) containing the tandem repeat segment will be located in the fractions according to the number of repeats in the tandem repeat segment, and identifying those fraction(s) containing the segment by detecting the marker sequence. In some embodiments, the fractions are chosen so that the first fraction captures fragments having a number of repeats within the range of repeats for a normal allele; the second fraction captures fragments having a number of repeats within the range of repeats for a premutation allele; and the third fraction captures fragments having a number of repeats within the range of repeats for a full mutation allele. In accordance with current research in the field of fragile X, the range of repeats for a normal normal allele of the FMRl gene is less than 55 repeats; the range of repeats for a premutation allele is 55-200; and the range of repeats for a full mutation allele is greater than 200 repeats. These ranges may be +/- 10%. These ranges may change over time as new studies are conducted and more is learned about the correlation of number of repeats and disease status. Since males generally have a single X-chromosome (which is where the FMRl gene resides), only one fraction should be positive for the marker sequence. Therefore, males can be assigned a phenotype, based on the genotype according to the following: if the first fraction is positive for the marker sequence, the individual is normal; if the second fraction is positive for the marker sequence, the individual is a carrier; if only the third fraction is positive for the marker sequence, the individual is affected with fragile X.
[0018] In another aspect, there are provided methods for screening male and female individuals for carrier status of mutations in the tandem repeat region of the FMRl gene, the method includes assaying nucleic acids from an individual to determine gender; and assaying the nucleic acid to determine the length of the tandem repeat region of the FMRl gene wherein the determining comprises amplifying tandem repeat region, detecting an amplification product, and determining the number of tandem repeats in the amplification product, wherein, in male individuals: the presence of an amplification product having less than 55 tandem repeats indicates the individual is not a carrier, the presence of an amplification product having 55 or more tandem repeats indicates the individual is a carrier, or in the absence of an amplification product, the carrier status is undetermined; and in female individuals: the presence of an amplification product having more than 55 tandem repeats indicates the individual is a carrier; or the presence of a single amplification product having less than 55 tandem repeats, the carrier status is undetermined.
[0019] In certain embodiments, the above method further includes, analyzing undetermined individuals to determine carrier status, wherein the analyzing includes, separating fragments of a nucleic acid, wherein the fragments are prepared from a nucleic acid containing sample of the individual, wherein the fragments include some which contain a tandem repeat segment of the FMRl gene and a marker sequence, the separating into fractions according to size under conditions in which a fragment containing the tandem repeat segment having a normal number of repeats will be located in a first fraction; and a fragment containing a tandem repeat segment having a premutation will be located in a second fraction; and a fragment having a tandem repeat region having a full mutation will be located in a third fraction, identifying those fraction(s) containing the segment by detecting the marker sequence, wherein the number of repeats in the tandem repeat segment is determined by the fraction in which it is identified, wherein in male individuals: a positive result in the first fraction indicates the individual is not a carrier, a positive result in the second fraction indicates the individual is a premutation carrier, a positive result in the third fraction indicates the individual is affected; and in female individuals: a positive result in only the first fraction indicates the individual is homozygous for the a normal allele; a positive result in the second fraction indicates the individual is a premutation carrier; and a positive result in the third fraction indicates the individual is a full mutation carrier.
[0020] In preferred embodiments of the above method, the assaying of nucleic acids to determine gender includes amplification of a region of the nucleic acid, preferably by PCR. For example, sequences specific to the Y chromosome, such as the SRY locus may be targeted for amplification. In this case, amplification only occurs in the presence of a Y chromosome. (See Sinclair A. H., et al., Nature 346:240 244 (1990)). In other examples, certain genes which occur on both the X chromosome and the Y chromosome may be detected for gender determination, if the lengths of the corresponding genes are different on each chromosome. Thus, amplification results in different sized amplicons having lengths specific to either the X or the Y chromosome. In this case, amplification of nucleic acids from males would result in both amplicons, whereas samples from females would have only one amplicon. Examples of such genes include DXZl and DYZl and the amelogenin gene. In more preferred embodiments, the assaying to determine gender includes amplifying a region of the amelogenin gene which produces different sizes of amplification products from the amelogenin gene on the X chromosome and the amelogenin gene on the Y chromosome, determining the size of the amplification product or products, wherein the presence of one product of a single size indicates the gender is female and the presence of two products of different sizes indicates the gender is male.
[0021] In still further embodiments, the assay to determine gender is performed in multiplex with the amplification of the tandem repeat region; preferably in multiplex PCR; preferably one or more internal controls are include in the multiplex reaction. In preferred embodiments, a region of the androgen insensitivity gene is amplified as an internal control.
[0022] In a preferred embodiment of the above aspect of the invention, a sample containing genomic DNA is assayed for an expansion in the tandem repeat region of the FMRl gene. Thus, genomic samples are subjected to nucleic acid fragmentation and the resulting nucleic acid fragments are separated by size into fractions. A marker sequence upstream or downstream of the tandem repeat region of the FMRl gene, which is associated with the tandem repeat region in the fragmented nucleic acid, is amplified by polymerase chain reaction. In some embodiments the amplification of the marker sequence and detection of the amplicon is done using the TaqMan system. In other embodiments the marker sequence is amplified and using a labeled primer and the resulting labeled amplicon is detected using capillary electrophoresis.
|0023] One of skill in the art would readily recognize that the separation of fragments into fractions by size can be modified so that the fractions correspond to either a normal number of tandem repeats or an abnormal number of tandem repeats. In some embodiments, the fragmented nucleic acid may be separated by size into two fractions, an upper fraction of larger size fragments and a lower fraction of smaller size fragments. In this approach, the fractions are designed such that fragments from nucleic acid containing a normal number of tandem repeats will be found in the lower fraction while fragments from nucleic acid containing an abnormally increased number of tandem repeats will be found in the upper fraction. In other embodiments, the fragmented DNA may be separated into any number of fractions. In some embodiment the fragmented DNA may be separated into a number of fractions selected from the group consisting of 2-16, preferably 3 fractions, or 4 fractions, or 5 fractions, or 6 fractions, or 8 fractions, or even 16 fractions.
[0024] In a preferred embodiment of the above method, the fragmented DNA is separated into lower and upper fractions, wherein the lower fraction corresponds to a tandem repeat region containing less than 55 repeats (normal number of repeats) and an upper fraction containing 55 or more tandem repeats (premutation and full mutation). A normal allele can therefore be distinguished from a premutation or a full mutation.
[0025] In another preferred embodiment of the above method, the fragmented DNA is separated into three fractions, wherein a first fraction corresponds to a tandem repeat region of a normal allele (i.e., less than 55 repeats), a second fraction corresponds to a tandem repeat region of a premutation allele (i.e. 55-200 repeats), a third fraction corresponds to a tandem repeat region of a full mutation allele (i.e., greater than 200 repeats).
[0026] In yet another preferred embodiment of the above method, the fragmented DNA is separated into four fractions, wherein a first fraction corresponds to a tandem repeat region of a normal allele, a second fraction corresponds to a tandem repeat region of a small premutation allele, a third fraction corresponds to a tandem repeat region of a large premutation allele, and a fourth fraction corresponds to a tandem repeat region of a full mutation allele. In particular embodiments, the first fraction corresponds to 0-60 repeats; preferably the second fraction corresponds to 60-200 repeats; preferably the third fraction corresponds to 200-2000 repeats; and preferably the fourth fraction corresponds to 2000+ repeats. In another embodiment, the DNA is fragmented with BIpI and MIyI and fractionated such that the first fraction corresponds to 6-62 repeats; preferably the second fraction corresponds to 63-140 repeats; preferably the third fraction corresponds to 141-220 repeats; and preferably the fourth fraction corresponds to 221-2000+ repeats. In another embodiment, the DNA is fragmented with AIuI and fractionated such that the first fraction corresponds to 6-68 repeats; preferably the second fraction corresponds to 69-102 repeats; preferably the third fraction corresponds to 102-202 repeats; and preferably the fourth fraction corresponds to 203+ repeats. In yet another embodiment, the DNA is fragmented with Sphl and Bmtl and fractionated such that the first fraction corresponds to 6-62 repeats; preferably the second fraction corresponds to 63-163 repeats; preferably the third fraction corresponds to 164-196 repeats; and preferably the fourth fraction corresponds to 197+ repeats.
[0027] Also provided are methods of estimating the number of tandem repeats in a sample of genomic DNA. In this method, test samples containing genomic DNA are subjected to nucleic acid fragmentation. The resulting nucleic acid fragments are separated by size into three or more size range fractions, and fragments containing tandem repeat segments in the various fractions are identified by detecting a marker sequence flanking the tandem repeat segment, which is associated with the tandem repeat region in the fragmented nucleic acid. The size of the tandem repeat region detected is then determined by relating the fraction size containing the repeat to the size of a tandem repeat segment present in such nucleic acid fragments. Separation into three or more size range fractions allows a finer estimation of the number of tandem repeats. In the case of fragile X, the extent of the expansion of a premutation or full mutation can be assessed.
[0028] In preferred embodiments of the above aspects of the invention, the method includes a second nucleic acid fragmentation. Preferably the second fragmentation occurs after the size separation, which follows the first nucleic acid fragmentation; preferably the second fragmentation is by restriction enzyme digestion. In more preferred embodiments, the second fragmentation cleaves the particular nucleic acid segment of interest (e.g., a tandem repeat segment) from a marker sequence flanking the particular nucleic acid segment. Preferably the second nucleic acid fragmentation does not cleave within the marker sequence.
[0029] In some embodiments of the above aspects of invention, the marker sequence is detected by amplification of all or a portion of the marker sequence and detection of the amplicon. In preferred embodiments, the marker sequence is amplified by PCR and the amplicon is detected by electrophoresis. In more preferred embodiments, a primer used in the PCR amplification reaction comprises a label, thereby labeling the resulting amplicon. The so-labeled amplicon can then be detected by methods such as capillary electrophoresis. [00301 In other embodiments of the above aspects of the invention, the marker sequence is detected using real time PCR methods such as the TaqMan system. In this approach a probe is used to detect the amplified region of the marker sequence.
[0031] In still other embodiments of the above aspects of the invention, the marker sequence need not be amplified and can be detected directly by hybridization to two differentially labeled oligonucleotide probes. The two probes, which hybridize to distinct segments of a marker sequence, such that both probes can bind simultaneously, are contacted with the fragmented nucleic acids under hybridization conditions. The simultaneous detection of differentially labeled probes hybridized to a single nucleic acid fragment in the fractions indicates the presence of a tandem repeat region in a fragment contained in that fraction. The two oligonucleotide probes of this embodiment may be designed to hybridize to segments of a marker sequence upstream or downstream of the tandem repeat. These segments of the marker sequence may be adjoining the tandem repeat region or may be a distance upstream or downstream. In preferred embodiments, the segments of the marker sequence are within 500 bases upstream or downstream of the tandem repeat region; in more preferred embodiments the segments of the marker sequence are within 250 bases upstream or downstream of the tandem repeat region; in most preferred embodiments the segments of the marker sequence are within 100 bases upstream or downstream of the tandem repeat region. The probes may be designed to hybridize to the same or to opposite strands of a double-stranded marker sequence. The probes may both hybridize upstream or both downstream of the tandem repeat. Alternatively, one probe may hybridize upstream of the tandem repeats whereas the other probe hybridizes downstream. The probes may hybridize to segments of the marker sequence that are separated by zero bases to several hundred thousand bases provided both segments are located on the same contiguous nucleic acid molecule after the fragmentation step or steps. Preferably the probes are separated by less than 1 kb, or preferably less than 500 bases, or less than 300 bases, or less than 200 bases, or less than 100 bases, or less than 50 bases, or less than 20 bases, or less than 10 bases, or less than 5 bases, or 1 base, or 0 bases.
[0032] In another aspect of the invention, there are provided methods of determining the size of a particular nucleic acid segment in a sample of nucleic acids, wherein size is determined using information obtained using a first method and a second method. Thus, the method comprises measuring the size of a tandem repeat segment by a first method, measuring the size of the tandem repeat segment by a second method, and using the information obtained by the first and second methods to determine the size of the tandem repeat region. In some embodiments, the first method includes an amplification of the tandem repeat region, preferably the amplification is by PCR. In preferred embodiments, the PCR amplification includes a labeled primer. In other preferred embodiments the amplicon is subjected to electrophoresis, preferably capillary electrophoresis, and the size of the amplicon is determined by comparison to a standard run in parallel. In other embodiments the first method includes Southern blotting. In preferred embodiments the second method comprises, fragmenting the nucleic acids of the sample, separating the fragmented nucleic acids into fractions according to size, and detecting a marker sequence upstream or downstream of the particular nucleic acid segment of interest, wherein the marker sequence is associated with the particular nucleic acid segment of interest in the fragmented nucleic acid. The size of the particular nucleic acid segment of interest is then determined by relating the fraction size containing the particular nucleic acid segment to the size of the particular nucleic acid segment in the sample of nucleic acids. In certain embodiments, the first method is used as an initial screen and samples for which the size of the particular nucleic acid segment is unable to be determined by this method are further analyzed by the second method. In other embodiments, sizing by one of the above two methods is used to confirm the results of the sizing by the other of the above two methods. In still other embodiments, sizing by the first method is used for finely determining the size of the particular nucleic acid fragment.
[0033] In another aspect of the invention there are provided primers for amplification of marker sequences flanking the tandem repeat region of the FMRl gene. In particular embodiments, the primers are selected from the group consisting of SEQ ID NOs:4-9
|0034] In yet another aspect of the invention, there are provided kits for detecting the size of a particular nucleic acid segment in a sample comprising a primer pair for amplifying a marker nucleotide sequence upstream or downstream of the particular nucleic acid segment, and one or more restriction endonucleases for cleaving the nucleic acid sample to generate a fragment of the nucleic acid sample which contains the particular nucleic acid segment and the upstream or downstream marker sequence. In certain embodiments the kit further comprises one or more restriction endonucleases for cleaving the particular nucleic acid segment from the marker nucleotide sequence. In still other embodiments, the kit may further contain one or more controls for verifying proper size separation of fragments; preferably the control consists of one or more primer pairs that are used to amplify one or more control fragments from the size-separated nucleic acid sample. In further embodiments, the kit may further contain one or more controls for verifying the completion of the one or more enzyme digests; preferably the control consists of on or more primer pairs designed to amplify a control fragment that includes a recognition site for the enzyme used. The kit may further contain any necessary buffers or other reagents.
[0035] In preferred embodiments of the above aspect of the invention, the particular nucleic acid segment contains a tandem repeat segment. In more preferred embodiments, the kit is for the detection of the tandem repeat segment of the FMRl gene; preferably the enzyme for generating fragments of the nucleic acid sample is AIuI; preferably the enzyme for cleaving the marker sequence from the tandem repeat is BstNI; preferably the kit contains one or more control primers pairs to determine if enzyme digestions are completed; preferably the kit contains one or more primer pairs to detect the presence of one or more control fragments in the size-separated nucleic acid sample. In other preferred embodiments, the kit is for the detection of the tandem repeat segment of the FMRl gene; preferably the enzymes for generating fragments of the nucleic acid sample are BIpI and MIyI; preferably the enzyme for cleaving the marker sequence from the tandem repeat is Bmtl; preferably the kit contains one or more control primers pairs to determine if enzyme digestions are completed; preferably the kit contains one or more primer pairs to detect the presence of one or more control fragments in the size-separated nucleic acid sample. In still other preferred embodiments, the kit is for the detection of the tandem repeat segment of the FMRl gene; preferably the enzymes for generating fragments of the nucleic acid sample are Sphl and Bmtl; preferably the enzyme for cleaving the marker sequence from the tandem repeat is BstNI; preferably the kit contains one or more control primers pairs to determine if enzyme digestions are completed; preferably the kit contains one or more primer pairs to detect the presence of one or more control fragments in the size-separated nucleic acid sample.
[00361 "Segment" as used herein in reference to nucleic acid, refers to a piece of contiguous nucleic acid. [0037] "Particular nucleic acid segment of interest" as used herein refers to a specific "segment" or piece of nucleic acid having a known sequence, preferred segments are those segments that are difficult to amplify by PCR. Examples include nucleic acid segments having high content of the bases guanine and cytidine, large segments of nucleic acid or segments having large numbers of tandem repeats, generally more than 100 tri-nucleotide repeats. In some embodiments the segment comprises a deletion, a duplication or an insertion. In a preferred embodiment, the particular nucleic acid segment of interest comprises a tandem repeat region.
[0038] "Nucleic acid segments which have high content of the bases guanine and cytosiήe" or "GC-rich" refer to those nucleic acid segments of a genome are more than the average for that genome. Generally, GC-rich is more than 40% guanine and cytosine bases, or more than 50%, or more than 60%, or more than 75%.
[0039] "Size" as used in reference to a particular nucleic acid segment of interest refers to quantity or amount that describes the magnitude of that segment and can be represented by, for example, molecular weight, number of base pairs, or number of copies of a tandem repeat.
[0040] "Fragment" as used herein refers to a portion of nucleic acid resulting from a process in which longer lengths of nucleic acid are broken up into shorter lengths of nucleic acid. Nucleic acids may be broken up or fragmented by chemical or biochemical means, preferably nucleic acids are fragmented in a manner that is reproducible, preferably nucleic acids are fragmented by one or more restriction endonucleases. Preferably nucleic acids are fragmented so that the particular nucleic acid segment of interest and its associated marker sequence are located on the same fragment. The length of a fragment containing the nucleic acid segment of interest will depend on the length of the nucleic acid segment of interest as well as the restriction enzyme chosen to fragment the DNA. Thus, the length of the of the fragment includes the nucleic acid segment of interest plus the region upstream of the segment to the 5' restriction enzyme recognition site (i.e., the 5' end of the fragment) and the region downstream of the segment region to the 3' restriction enzyme recognition site (i.e., the 3' end of the fragment).
|0041] "Separating fragments" as used herein refers to the process whereby the fragments contained in a mixture of different fragments are physically separated from one another. 10042] "Fractionation" as used herein refers to a process whereby a single mixture of individual components is processed so that at least some of the individual components in the mixture become separated from each other. For example, chromatography is a fractionation method that separates a mixture of components based on some physical/chemical principle. The components may be separated in a gel or on a membrane so that the individual components may be separately identified. The individual components of a mixture may be fractionated by separating the mixture into the different components which are captured in separate aliquots of liquid (i.e. fractions). As used herein "fraction" in the context of the invention refers to a collection of fragments having a certain size or range of sizes that differs from the size or range of sizes of the starting non-fractionated mixture of fragments.
[0043] "Identifying those fractions containing the segment" as used herein means that the fraction of size-separated fragments that contains the segment of interest, is determined by the detection of a marker sequence associated with that segment on a fragment of nucleic acid.
[0044] The phrase "tandem repeat region" or "tandem repeat segment" as used herein refers to a region of DNA that contains a multiple copies of a short sequence of DNA. "Tandem repeat sequences" or "tandem repeats" or simply "repeats" are used interchangeably herein and refers to the short sequence of DNA that is repeated in the tandem repeat region. Such tandem repeats can lie adjacent to each other in the same orientation (i.e., direct tandem repeats) or in the opposite direction to each other (i.e., inverted tandem repeats). The repeated sequences may be di-, tri-, tetra-, or more nucleotides in length. Expansion the number of copies of the tandem repeat sequences within the coding or noncoding regions of some human genes is associated with repeat expansion disease.
[0045] As used herein, the term "sample" or "test sample" refers to any liquid or solid material containing genomic DNA. In preferred embodiments, a test sample is obtained from a biological source (i.e., a "biological sample"), such as cells in culture or a tissue sample from an animal, most preferably, a human. Preferred sample tissues include, but are not limited to, blood, bone marrow, body fluids, cerebrospinal fluid, plasma, serum, or tissue (e.g. biopsy material). [0046] As used herein, "nucleic acid" refers broadly to segments of a chromosome, segments or portions of DNA, cDNA, and/or RNA. Nucleic acid may be derived or obtained from an originally isolated nucleic acid sample from any source (e.g., isolated from, purified from, amplified from, cloned from, reverse transcribed from sample DNA or RNA).
[0047] "Target nucleic acid" as used herein refers to segments of a chromosome, a complete gene with or without intergenic sequence, segments or portions a gene with our without intergenic sequence, or sequence of nucleic acids to which probes or primers are designed. Target nucleic acids may include wild type sequences, nucleic acid sequences containing mutations, deletions or duplications, tandem repeat regions, a gene of interest, a region of a gene of interest or any upstream or downstream region thereof. Target nucleic acids may represent alternative sequences or alleles of a particular gene. Target nucleic acids may be derived from genomic DNA, cDNA, or RNA. As used herein target nucleic acid may be native DNA or a PCR amplified product.
[0048] The term "marker sequence" as used herein refers to a segment of nucleic acid which is associated with a nucleic acid segment of interest so that detection of the marker sequence in a sample is indicative of the presence of the nucleic acid segment of interest. The marker sequence for detecting a particular nucleic acid segment of interest should be selected on the basis that the marker is uniquely or substantially associated with the nucleic acid segment of interest in fragments present in a particular size fraction. Marker sequences can be detected by nucleic acid amplification using primer based hybridization methods. Marker sequences can also be detected by hybridization to one or more nucleic acid probe(s). In accordance with the methods disclosed herein, a fragment containing a tandem repeat segment is identified in size fractioned nucleic acid fragments by detecting a marker sequence that is either upstream or downstream of the tandem repeat.
[0049] Marker sequences may be within the nucleic acid segment of interest or may be flanking the nucleic acid of interest. The term "flanking" as used herein refers to a region of DNA either adjoining or a distance from1 a region of interest. The flanking region may be "upstream" (i.e., 51) or "downstream" (i.e., 3') of the region of interest. The marker sequence may be adjoining the tandem repeat region or may be located a distance upstream or downstream. In preferred embodiments, the marker sequence is within 500 bases upstream or downstream of the tandem repeat region; in more preferred embodiments the marker sequence is within 250 bases upstream or downstream of the tandem repeat region; in most preferred embodiments the marker sequence is within 100 bases upstream or downstream of the tandem repeat region. The flanking region may be coding or non-coding sequence and may be the same or a different gene as the gene comprising the region of interest. In preferred embodiments, the marker sequence is flanking the nucleic acid segment of interest.
[0050] In certain embodiments, the size of the particular nucleic acid segment of interest is determined by relating the fraction size containing the particular nucleic acid segment to the size of the particular nucleic acid segment in the sample of nucleic acids. This step of relating the fraction size containing the particular nucleic acid segment to the size of the particular nucleic acid segment in the sample of nucleic acids can be accomplished using a look-up table as disclosed herein. In other embodiments this step is accomplished with a computer program.
(0051] The phrase "relating the fraction size containing the particular nucleic acid segment of interest to the size of the particular nucleic acid segment of interest that would be present in that fraction under the conditions which generated the fragment" as used herein refers to the means by which the size of the particular nucleic acid segment of interest is determined from its location in a particular fraction size. In one approach, a look-up table is established for each combination of particular nucleic acid segment of interest and fragmentation approach (e.g. particular restriction endonuclease(s) used). The look-up table links each fraction (containing a range of fragment sizes) to the length of the segment of interest that is present in such fragments. In any particular fraction, there are fragments that contain the segment of interest and other sequence. Thus, in any fraction, one can calculate from sequence data the number of bases in the fragments that represent the segment of interest. This correlation may be established experimentally or by using known DNA sequence for the fragments generated. For any particular unknown sample, the size of a segment of interest can be determined by relating the fragment size that contains the segment of interest to the appropriate look-up table reflecting the same conditions for fragment generation. By using this process one relates the fraction size containing the particular nucleic acid segment of interest to the size of the particular nucleic acid segment of interest that would be present in the fraction under the conditions which generated the fragment. It is not essential that one prepare a look-up table to perform the method. For example, one could generate a computer program to perform the relating step.
[0052] "Genomic nucleic acid" or "genomic DNA" refers to some or all of the DNA from the nucleus of a cell. Genomic DNA may be intact or fragmented (e.g., digested with restriction endonucleases by methods known in the art). In some embodiments, genomic DNA may include sequence from all or a portion of a single gene or from multiple genes, sequence from one or more chromosomes, or sequence from all chromosomes of a cell. In contrast, the term "total genomic nucleic acid" is used herein to refer to the full complement of DNA contained in the genome of a cell. As is well known, genomic nucleic acid includes gene coding regions, introns, 5' and 3' untranslated regions, 5' and 31 flanking DNA and structural segments such as telomeric and centromeric DNA, replication origins, and intergenic DNA. Genomic nucleic acid may be obtained from the nucleus of a cell, or recombinantly produced. Genomic DNA also may be transcribed from DNA or RNA isolated directly from a cell nucleus. PCR amplification also may be used. Methods of purifying DNA and/or RNA from a variety of samples are well-known in the art.
[0053] The terms "allele" and "allelic variant" are used interchangeably herein. An allele is any one of a number of alternative forms or sequences of the same gene occupying a given locus or position on a chromosome. A single allele for each locus is inherited separately from each parent, resulting in two alleles for each gene. An individual having two copies of the same allele of a particular gene is homozygous at that locus whereas an individual having two different alleles of a particular gene is heterozygous.
[0054] "Repeat expansion disease" refers to any of about two dozen human diseases displaying Mendelian inheritance patterns shown to be caused by expansions of intrinsically polymorphic tandem repeats, mainly involving different trinucleotide motifs but also longer repetitive sequences up to 12-mers (Table 1). A characteristic of an allele containing an expanded tandem repeat is an excessive instability in successive generations (dynamic mutations). Furthermore, these alleles can differ in lengths among cell populations of the same organism (mosaicism). One type of repeat expansion disease is the trinucleotide repeat disorders (e.g., fragile X syndrome, myotonic dystrophy 1, etc.), the most abundant form of repeat expansion diseases. These diseases exhibit intergenerational repeat instability with a tendency towards further expansion of the tandem repeat. Increased repeat lengths in successive generations can lead to an earlier age of onset in affected individuals and/or an accentuation of clinical symptoms. The methods of measuring tandem repeat length as described herein can be applied to measures tandem repeat length for any of the diseases/genes in Table 3.
Table 1. Exemplary tandem repeat expansion diseases
Figure imgf000020_0001
Figure imgf000021_0001
* = X chromosome; # = autosomal recessive; undesignated = autosomal dominant
[0055] As used herein, the term "oligonucleotide" refers to a short polymer composed of deoxyribonucleotides, ribonucleotides or any combination thereof. Oligonucleotides of the invention are generally between about 10 and about 100 nucleotides in length. Oligonucleotides are preferably 15 to 70 nucleotides long, with 20 to 26 nucleotides being the most common. The single letter code for nucleotides is as described in the U.S. Patent Office Manual of Patent Examining Procedure, section 2422, table 1. In this regard, the nucleotide designation "R" means guanine or adenine, "Y" means thymine (uracil if RNA) or cytosine; and "M" means adenine or cytosine. An oligonucleotide may be used as a primer or as a probe.
[0056] As used herein, the term "substantially purified" in reference to oligonucleotides does not require absolute purity. Instead, it represents an indication that the sequence is relatively more pure than in the natural environment. Such oligonucleotides may be obtained by a number of methods including, for example, laboratory synthesis, restriction enzyme digestion or PCR. A "substantially purified" oligonucleotide is preferably greater than 50% pure, more preferably at least 75% pure, and most preferably at least 95% pure.
[0057] As used herein, an oligonucleotide is "specific" for a nucleic acid if the oligonucleotide has at least 50% sequence identity with a portion of the nucleic acid when the oligonucleotide and the nucleic acid are aligned. An oligonucleotide that is specific for a nucleic acid is one that, under the appropriate hybridization or washing conditions, is capable of hybridizing to the target of interest and not substantially hybridizing to nucleic acids which are not of interest. Higher levels of sequence identity are preferred and include at least 75%, at least 80%, at least 85%, at least 90%, at least 95% and more preferably at least 98% sequence identity.
[0058] As used herein, the term "hybridize" or "specifically hybridize" refers to a process where two complementary nucleic acid strands anneal to each other under appropriately stringent conditions. Hybridizations are typically and preferably conducted with probe- length nucleic acid molecules, preferably 20-100 nucleotides in length. Nucleic acid hybridization techniques are well known in the art. See, e.g., Sambrook, et al., 1989, Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Press, Plainview, N. Y. Those skilled in the art understand how to estimate and adjust the stringency of hybridization conditions such that sequences having at least a desired level of complementarity will stably hybridize, while those having lower complementarity will not. For examples of hybridization conditions and parameters, see, e.g., Sambrook, et al., 1989, Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Press, Plainview, N. Y.; Ausubel, F. M. et al. 1994, Current Protocols in Molecular Biology. John Wiley & Sons, Secaucus, NJ.
[0059] The term "substantially complementary" as used herein means that two sequences hybridize under stringent hybridization conditions. The skilled artisan will understand that substantially complementary sequences need not hybridize along their entire length. In particular, substantially complementary sequences comprise a contiguous sequence of bases that do not hybridize to a target or marker sequence, positioned 3' or 51 to a contiguous sequence of bases that hybridize under stringent hybridization conditions to a target or marker sequence. [0060] The term "complement" as used herein means the complementary sequence to a nucleic acid according to standard Watson/Crick pairing rules. A complement sequence can also be a sequence of RNA complementary to the DNA sequence or its complement sequence, and can also be a cDNA.
[0061] The term "coding sequence" as used herein means a sequence of a nucleic acid or its complement, or a part thereof, that can be transcribed and/or translated to produce the mRNA for and/or the polypeptide or a fragment thereof. Coding sequences include exons in a genomic DNA or immature primary RNA transcripts, which are joined together by the cell's biochemical machinery to provide a mature mRNA. The anti-sense strand is the complement of such a nucleic acid, and the encoding sequence can be deduced therefrom.
[0062] The term "non-coding sequence" as used herein means a sequence of a nucleic acid or its complement, or a part thereof, that is not transcribed into amino acid in vivo, or where tRNA does not interact to place or attempt to place an amino acid. Non-coding sequences include both intron sequences in genomic DNA or immature primary RNA transcripts, and gene-associated sequences such as promoters, enhancers, silencers, etc.
[0063] The term "amplification" or "amplify" as used herein means one or more methods known in the art for copying a target nucleic acid, thereby increasing the number of copies of a selected nucleic acid sequence. Amplification may be exponential or linear. A target nucleic acid may be either DNA or RNA. The sequences amplified in this manner form an "amplicon." While the exemplary methods described hereinafter relate to amplification using the polymerase chain reaction ("PCR"), numerous other methods are known in the art for amplification of nucleic acids (e.g., isothermal methods, rolling circle methods, etc.). The skilled artisan will understand that these other methods may be used either in place of, or together with, PCR methods. See, e.g., Saiki, "Amplification of Genomic DNA" in PCR Protocols, Innis et al., Eds., Academic Press, San Diego, CA 1990, pp 13-20; Wharam et al.s Nucleic Acids Res. 2001 Jun 1 ;29(1 1):E54-E54; Hafner et al., Biotechniques 2001 Apr;30(4):852-6, 858, 860 passim; Zhong et al., Biotechniques 2001 Apr;30(4):852-6, 858, 860 passim.
[0064] As used herein, a "primer" for amplification is an oligonucleotide that specifically anneals to a target or marker nucleotide sequence. The 3' nucleotide of the primer should be identical to the target or marker sequence at a corresponding nucleotide position for optimal amplification.
[0065] "Sense strand" means the strand of double-stranded DNA (dsDNA) that includes at least a portion of a coding sequence of a functional protein. "Anti-sense strand" means the strand of dsDNA that is the reverse complement of the sense strand.
(0066] As used herein, a "forward primer" is a primer that anneals to the anti-sense strand of dsDNA. A "reverse primer" anneals to the sense-strand of dsDNA.
[0067] As used herein, sequences that have "high sequence identity" have identical nucleotides at least at about 50% of aligned nucleotide positions, preferably at least at about 75% of aligned nucleotide positions, more preferably at least at about 90% of aligned nucleotide positions, and most preferably at least at about 95% of aligned nucleotide positions.
[0068] As used herein "TaqMan PCR detection system" refers to a method for real time PCR. In this method, a TaqMan probe which hybridizes to the nucleic acid segment amplified is included in the PCR reaction mix. The TaqMan probe comprises a donor and a quencher fluorophore on either end of the probe and in close enough proximity to each other so that the fluorescence of the donor is taken up by the quencher. However, when the probe hybridizes to the amplified segment, the 5'-exonuclease activity of the Taq polymerase cleaves the probe thereby allowing the donor fluorophore to emit fluorescence which can be detected.
BRIEF DESCRIPTION OF THE FIGURES
[0069] Figure 1. A schematic showing one embodiment of the method of detecting the tandem repeats. The presence of FMRl fragments with a particular length of tandem repeat is shown schematically at the bottom for each size fraction indicated (i.e. small, medium and large). The designation of + or — is shown below PCR to indicate whether PCR amplification of the flanking marker sequence occurs when the particular fragment is present in the fraction. [0070] Figure 2. Exemplary sequence (SEQ ID NO: 1) of a region of the FMRl gene showing the CGG tandem repeat region (single underlining), preferred locations for hybridizing PCR primers (shaded regions), and a preferred location for a hybridizing probe (double-underlining).
J0071] Figure 3. Exemplary sequence (SEQ ID NO:2) of the downstream 3' untranslated region of the DM-I gene showing the CTG tandem repeat region (single underlining), preferred locations for hybridizing PCR primers (shaded regions), and a preferred location for a hybridizing probe (double-underlining).
[0072] Figure 4. Exemplary sequence (SEQ ID NO:3) of the first intronic region of the FRDA gene showing the CAA tandem repeat region (single underlining), preferred locations for hybridizing PCR primers (shaded regions), and a preferred location for a hybridizing probe (double-underlining).
[0073] Figure 5. Restriction enzyme map of a region of the FMRl gene. FXCEF3, FXCER3, FMR1F4, FMR1R4, FXCEF2, and FXCER2 show the location of hybridization of preferred oligonucleotide primers. FXCEF3/FXCER3, FMR1F4/FMR1R4, and FXCEF2/FXCER2 are preferred primer pairs for amplification of marker sequences when the nucleic acid is fragmented with Sphl/Bmtl, AIuI, and Blpl/Mlyl, respectively.
DETAILED DESCRIPTION OF THE INVENTION
|0074] In accordance with the present invention, there are provided methods of detecting a particular nucleic acid segment of interest in a sample of nucleic acids. In particular embodiments, the particular nucleic acid segment of interest is a tandem repeat and the method is used to determine information about the size of such tandem repeat. This information may be used to determine if an individual carries a genetic mutation characterized by an increase (i.e., expansion) or a decrease (i.e., reduction) in the number of tandem repeats associated with a particular gene. Thus, described is a method of measuring the size of a tandem repeat segment in a sample of nucleic acids, the method comprising identifying the tandem repeat in fractions of size-separated nucleic acid fragments by detecting a marker sequence flanking the tandem repeat segment and then relating the fraction size containing the repeat to the size of a tandem repeat segment present in such nucleic acid fragments. Figure 1 shows one embodiment of this method in schematic form. As will be discussed, a variation of this method is to perform a second fragmentation after the size separation and prior to the "analysis" step.
Sample Preparation
[0075] The methods of the present invention can be used to detect mutations characterized by an expansion or reduction of tandem repeat region of a gene in the genomic DNA of a test sample. Therefore, the method may be performed using any biological sample containing genomic DNA. Examples include tissue samples or any cell-containing bodily fluid. Blood is the preferred biological sample. Biological samples may be obtained by standard procedures and may be used immediately or stored, under conditions appropriate for the type of biological sample, for later use.
[0076] Methods of obtaining test samples are well known to those of skill in the art and include, but are not limited to, aspirations, tissue sections, drawing of blood or other fluids, surgical or needle biopsies, and the like. The test sample may be obtained from individual or patient. The test sample may contain cells, tissues or fluid obtained from a patient suspected being afflicted with or a carrier for a disorder caused by an expansion of tandem repeat sequences. The test sample may be a cell-containing liquid or a tissue. Samples may include, but are not limited to, amniotic fluid, biopsies, blood, blood cells, bone marrow, fine needle biopsy samples, peritoneal fluid, plasma, pleural fluid, saliva, semen, serum, tissue or tissue homogenates, frozen or paraffin sections of tissue. Samples may also be processed, such as sectioning of tissues, fractionation, purification, or cellular organelle separation.
[0077] The invention methods can be used to perform prenatal diagnosis using any type of embryonic or fetal cell or nucleic acid containing body fluid. Fetal cells can be obtained through the pregnant female, or from a sample of an embryo. Thus, fetal cells are present in amniotic fluid obtained by amniocentesis, chorionic villi aspirated by syringe, percutaneous umbilical blood, a fetal skin biopsy, a blastomere from a four-cell to eight-cell stage embryo (pre-implantation), or a trophectoderm sample from a blastocyst (pre-implantation or by uterine lavage).
[0078] In particular embodiments, genomic DNA may be used. Genomic DNA may be isolated from cells or tissues using standard methods, see, e.g., Sambrook, et al., 1989, Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor Press, Plainview, NY.
Fragmentation of Genomic DNA
[0079] Genomic DNA may be fragmented by various methods well-known in the art. Preferably, a restriction endonuclease digestion is used to fragment the DNA.
[0080] A "restriction endonuclease" or "restriction enzyme" as used herein refers to an enzyme that cuts double-stranded DNA at a specific sequence (i.e., the recognition sequence or site). The frequency with which a given restriction endonuclease cuts DNA depends on the length of the recognition site of the enzyme. For example, some enzymes recognize sites that are four nucleotides long (referred to as "four cutters"). In general one can estimate how frequently an enzyme should cut a piece of DNA based the length of the recognition site and the assumption that the probability of any one nucleotide occurring at a given location is 1A. In the case of a "four cutter" a specific sequence of four nucleotides must be present. Assuming that each nucleotide has an equal chance {i.e., VA) of occurring at any particular site within the four nucleotide sequence, then a four-cutter should on average cut once every 256 base pairs {i.e., 1A x 1A x 1A x 1A = 1/256). A similar calculation can be applied to any restriction enzyme as long as the length of its recognition site is known, making it possible to predict the size and number of a DNA fragments that would be obtained by cutting a DNA molecule of known size. This allows one of skill in the art to produce DNA fragments of known size. Restriction endonucleases are obtained from bacteria or are produced through recombinant technology and are readily available through numerous commercial sources.
[0081] In the restriction endonuclease fragmentation method, a restriction endonuclease is combined with a sample of genomic DNA and buffer appropriate for optimal activity of the endonuclease. In general, 1 unit of endonuclease will digest lμg of DNA in 1 hour at 37°C.
[0082] One of skill in the art would recognize that this fragmentation method can be modified by using a restriction enzyme that cuts at a particular frequency or a particular site, or by using multiple restriction enzymes. The choice of enzyme or enzyme combinations is chosen to suit the gene of interest in an assay. In general, one would choose an enzyme or enzyme combination to generate a fragment containing the entire tandem repeat region and the upstream or downstream marker sequence. Enzymes for fragmentation can be chosen by using a restriction enzyme map of the region surrounding the tandem repeat. Such maps can be readily generated by software programs well-known to those of skill in the art.
[0083] In particular, one would choose an enzyme or a combination of enzymes to obtain an appropriately sized fragment to distinguish a normal length tandem repeat region from an abnormal length tandem repeat region. In determining an appropriate size for a fragment, one would consider the difference in the range of lengths of a normal tandem repeat region as compared to that for an abnormal tandem repeat region. For example, if the difference between a normal tandem repeat region and an abnormal tandem repeat region is small, one would choose a shorter length fragment, whereas if the difference is large one would choose a longer length fragment.
[0084] In preferred embodiments, AIuI is used to fragment the nucleic acids. AIuI is a restriction enzyme that recognizes a 4-nucleotide sequence of double-stranded DNA (i.e., - AGCT-). One of skill in the art would recognize that isoschizomers (i.e., restriction enzymes with the same recognition sequence and cut site) of AIuI can be readily substituted for AIuI. Examples of AIuI isoschizomers include, but are not limited to, BsaLI, Marl, MItI, and Oxal. One of skill in the art would further recognize that a neoschizomer (i.e., a restriction enzyme with the same recognition sequence as another enzyme but with a different cut site) of AIuI could also be substituted for AIuI.
[0085] One of skill in the art would recognize that other restriction enzymes with the same cutting frequency as AIuI could be substituted for AIuIm this method. For example, AIuI, which recognizes a 4-nucleotide sequence, cuts DNA at approximately every 256 bases. Other enzymes with different 4-nucleotide recognition sequences (e.g., Dpnl, Rsal, Mbol, and NIaI) would be expected to cut at a similar frequency to AIuI and would therefore produce fragments of a size similar to those of AIuI.
[0086] In other preferred embodiments, BIpI and Mlyl are used in combination to fragment the nucleic acids. In still other preferred embodiments, Sphl and Bmtl are used in combination to fragment the nucleic acids.
Size Separation of DNA Fragments [0087] Separation of DNA fragments according to size may be accomplished by various methods known to those of skill in the art. For example, various methods of gel electrophoresis or column chromatography (e.g., size-exclusion high performance liquid chromatography (SEC-HPLC) and denaturing HPLC (DHPLC)) may be used.
[0088] In gel electrophoresis, a gel matrix, to which an electric field is applied, is utilized to separate nucleic acid molecules or fragments thereof based on size. In general, smaller nucleic acid fragments will move faster through the gel matrix than larger fragments. Preferred gel matrices include agarose and polyacrylamide.
[0089] In preferred embodiments, capillary electrophoresis is used to separate the fragmented nucleic acids. Capillary electrophoresis is a separation method based on the differential electrophoretic migration rate of sample components in a capillary when a voltage is applied. Separated fragments or molecules are generally detected "on-column" using UV spectrometric or fluorescence analysis through a window in the capillary. In general, one or more standards (i.e., a segment of nucleic acid having known length) are first injected into the capillary to determine the elution time for each standard using on-column detection. Then, the elution times of the standards are used to determine the length of time over which a fraction will be collected in order to achieve a desired size range for that fraction.
[0090] In some embodiments, the size ranges for the fractions used in an assay may be chosen based on the length in base pairs of commercially-available standards. A number of standards are available containing mixtures of lengths of nucleic acids. In one example, a standard containing nucleic acid fragments having the following lengths is used: 200 bp, 300 bp, 400 bp, 500 bp, 600 bp, 700 bp, 800 bp, 900 bp, and 1000 bp. Thus, fractions would then be chosen based on one or more of the sizes present in the standard. For example, one might choose the following three fractions: 1) less than or equal to 300 bp, 2) 301-500 bp, and 3) greater than or equal to 501. One would then collect fractions of the fragmented nucleic acid sample based on the elution times of the 300 bp and 500 bp standards. The range in number of tandem repeats present in each fraction can be calculated.
[0091] Alternatively, fractions can be chosen based on a desired number of tandem repeats for each fraction. In this case, standards representing the upper and lower limits in size of each fraction can be synthesized if not commercially available. [0092] In some embodiments, the fragments are separated into the lowest number of fractions in order to distinguish normal from abnormal length tandem repeat regions. In particular embodiments, the fragments may be separated into two fractions, one corresponding to a normal tandem repeat region and one corresponding to an abnormal tandem repeat region.
[0093] In other embodiments, the fragments are separated into a larger number of fractions in order to determine the size of the tandem repeat region. In general, more fractions will allow a more precise determination of the length of the tandem repeat region. The number of fractions may be chosen in order to achieve a desired level of precision in determining the length of the tandem repeat region.
[0094] In a preferred embodiment of the assay to determine the number of tandem repeats of the FMRl gene, AIuI fragmented DNA is separated into two fractions corresponding to sizes of approximately 211 - 358 bp (lower fraction) and 359 bp - 9 kb (upper fraction). Fragments from samples containing a normal FMRl gene (i.e., less than 55 tandem repeats), will separate into the lower fraction, whereas fragments from FMRl genes containing premutations (i.e., 55-200 tandem repeats) or full mutations (i.e., greater than 200 tandem repeats) will separate into the upper fraction.
[0095] In another preferred embodiment of the assay to determine the number of tandem repeats of the FMRl gene, AIuI fragmented DNA is separated into two fractions corresponding to sizes of approximately 21 1 - 400 bp (lower fraction) and 401 bp - 9 kb (upper fraction). Fragments from samples containing a normal FMRl gene (i.e., less than 55 tandem repeats) and premutations having 56-69 tandem repeats, will separate into the lower fraction, whereas fragments from FMRl genes containing premutations having 70-200 tandem repeats or full mutations (i.e., greater than 200 tandem repeats) will separate into the upper fraction.
[0096] In another preferred embodiment of the assay to determine the number of tandem repeats of the FMRl gene, AIuI fragmented DNA is separated into four fractions corresponding to sizes of approximately less than or equal to 400 bp (first/lowest fraction), 401-500 bp (second fraction), 501-800 bp (third fraction) and 800 bp - 9 kb (fourth/highest fraction). Fragments from samples containing a normal FMRl gene (i.e., less than 55 tandem repeats) and those containing premutations having 56-68 tandem repeats will separate into the first/lowest fraction, whereas fragments from FMRl genes containing small premutations (i.e., 69-102 tandem repeats) will separate into the second fraction, whereas fragments from FMRl genes containing large premutations (i.e., 103-200 tandem repeats) and full mutations having 201-202 tandem repeats will separate into the third fraction, and whereas full mutations having greater than 202 tandem repeats will separate into the fourth/highest fraction.
[0097] In yet another preferred embodiment of the assay to determine the number of tandem repeats of the FMRl gene, DNA fragmented with a combination of Sphl and Bmtl is separated into four fractions corresponding to sizes of approximately less than or equal to 500 bp (first/lowest fraction), 501-800 bp (second fraction), 801-900 bp (third fraction) and 901 bp — 9 kb (fourth/highest fraction). Fragments from samples containing a normal FMRl gene (i.e., less than 55 tandem repeats) and those containing premutations having 56-62 tandem repeats will separate into the first/lowest fraction, whereas fragments from FMRl genes containing small premutations (i.e., 63-163 tandem repeats) will separate into the second fraction, whereas fragments from FMRl genes containing large premutations (i.e., 164-196 tandem repeats) will separate into the third fraction, and whereas large premutations having 197-200 tandem repeats and full mutations (i.e., greater than 200 tandem repeats) will separate into- the fourth/highest fraction.
[0098] In still another preferred embodiment of the assay to determine the number of tandem repeats of the FMRl gene, DNA fragmented with a combination of BIpI an dMlyl is separated into four fractions corresponding to sizes of approximately less than or equal to 603 bp (first/lowest fraction), 604-840 bp (second fraction), 841-1078 bp (third fraction) and 1079 bp — 9 kb (fourth/highest fraction). Fragments from samples containing a normal FMRl gene (i.e., less than 55 tandem repeats) and those containing premutations having 56-62 tandem repeats will separate into the first/lowest fraction, whereas fragments from FMRl genes containing small premutations (i.e., 63-140 tandem repeats) will separate into the second fraction, whereas fragments from FMRl genes containing large premutations (i.e., 141-200 tandem repeats) and full mutations having 201-220 tandem repeats will separate into the third fraction, and whereas large premutations having greater than 220 tandem repeats will separate into the fourth/highest fraction. [0099] In another preferred embodiment, fragmented DNA is separated into a multiplicity of fractions, according to size. Automated fraction collection is accomplished using, for example, a preset fraction time window, beginning at approximately 200 bp and ending at 9 kb. This method allows for a finer estimation of fragment size and thereby, an estimation of the number of repeats.
Second Fragmentation of DNA
[00100] In yet other preferred embodiments, the methods of the invention include a second nucleic acid fragmentation following the size separation of the first nucleic acid fragmentation. Preferably, a restriction endonuclease digestion is used to further fragment the DNA. Preferably, the second fragmentation separates the tandem repeat segment from an associated marker sequence.
[00101] In preferred embodiments, restriction enzyme BsaWI, Hpyl88I, Hphl or BstNI is used for the second nucleic acid fragmentation when the marker sequence is upstream of the tandem repeat segment. In other preferred embodiments, restriction enzyme SmII, Bbvl, or Bmtl is used for the second nucleic acid fragmentation when the marker sequence is downstream of the tandem repeat segment One of skill in the art would recognize that isoschizomers and neoschizomers of the listed restriction enzymes could also be used. One of skill in the art would be able to identify a suitable restriction enzyme for the second nucleic acid fragmentation by analyzing factors which include, but are not limited to, the location of the marker, the sequence of the marker and the sequence between the marker sequence and the tandem repeat segment.
Amplification of Size-separated DNA Fragments or DNA Fragments following Second Fragmentation
[00102] Size-separated DNA may be amplified by various methods known to the skilled artisan. Amplification methods suitable for use with the present methods include, for example, polymerase chain reaction (PCR), ligase chain reaction (LCR), transcription-based amplification system (TAS), nucleic acid sequence based amplification (NASBA) reaction, self-sustained sequence replication (3SR), strand displacement amplification (SDA) reaction, boomerang DNA amplification (BDA), Q-beta replication, or isothermal nucleic acid sequence based amplification. These methods of amplification each described briefly below and are well-known in the art.
[00103] PCR is a technique for making many copies of a specific template DNA sequence. The reaction consists of multiple amplification cycles and is initiated using a pair of primer sequences that hybridize to the 5' and 3' ends of the sequence to be copied. The amplification cycle includes an initial denaturation, and up to 50 cycles of annealing, strand elongation and strand separation (denaturation). In each cycle of the reaction, the DNA sequence between the primers is copied. Primers can bind to the copied DNA as well as the original template sequence, so the total number of copies increases exponentially with time. PCR can be performed as according to Whelan, et al, Journal of Clinical Microbiology, 3_3(3):556- 561(1995). Briefly, a PCR reaction mixture includes two specific primers, dNTPs, approximately 0.25 U of Taq polymerase, and Ix PCR Buffer. For every 25 μl PCR reaction, 2 μl sample (e.g., isolated DNA from target organism) is added and amplified using a thermal cycler.
[00104) LCR is a method of DNA amplification similar to PCR, except that it uses four primers instead of two and uses the enzyme ligase to ligate or join two segments of DNA. LCR can be performed as according to Moore et al. , Journal of Clinical Microbiology 36(4^ 1028-103 I (1998). Briefly, an LCR reaction mixture contains two pair of primers, dNTP, DNA ligase and DNA polymerase representing about 90 μl, to which is added 100 μl of isolated nucleic acid from the target organism. Amplification is performed in a thermal cycler (e.g., LCx of Abbott Labs, North Chicago, IL).
[00105] TAS is a system of nucleic acid amplification in which each cycle is comprised of a cDNA synthesis step and an RNA transcription step. In the cDNA synthesis step, a sequence recognized by a DNA-dependent RNA polymerase (i.e., a polymerase-binding sequence or PBS) is inserted into the cDNA copy downstream of the target or marker sequence to be amplified using a two-domain oligonucleotide primer. In the second step, an RNA polymerase is used to synthesize multiple copies of RNA from the cDNA template. Amplification using TAS requires only a few cycles because DNA-dependent RNA transcription can result in 10-1000 copies for each copy of cDNA template. TAS can be performed according to Kwoh et al., PNAS 86: 1173-7 (1989). Briefly, extracted RNA is combined with TAS amplification buffer and bovine serum albumin, dNTPs, NTPs, and two oligonucleotide primers, one of which contains a PBS. The sample is heated to denature the RNA template and cooled to the primer annealing temperature. Reverse transcriptase (RT) is added the sample incubated at the appropriate temperature to allow cDNA elongation. Subsequently T7 RNA polymerase is added and the sample is incubated at 37°C for approximately 25 minutes for the synthesis of RNA. The above steps are then repeated. Alternatively, after the initial cDNA synthesis, both RT and RNA polymerase are added following a 1 minute 1000C denaturation followed by an RNA elongation of approximately 30 minutes at 37°C. TAS can be also be performed on solid phase as according to Wylie et al, Journal of Clinical Microbiology, 36(12):3488-3491 (1998). In this method, nucleic acid targets are captured with magnetic beads containing specific capture primers. The beads with captured targets are washed and pelleted before adding amplification reagents which contains amplification primers, dNTP, NTP, 2500 U of reverse transcriptase and 2500 U of T7 RNA polymerase. A 100 μl TMA reaction mixture is placed in a tube, 200 μl oil reagent is added and amplification is accomplished by incubation at 420C in a waterbath for one hour.
[00106] NASBA is a transcription-based amplification method which amplifies RNA from either an RNA or DNA target. NASBA is a method used for the continuous amplification of nucleic acids in a single mixture at one temperature. For example, for RNA amplification, avian myeloblastosis virus (AMV) reverse transcriptase, RNase H and T7 RNA polymerase are used. This method can be performed as according to Heim, et al, Nucleic Acids Res., 26(9):2250-2251 (1998). Briefly, an NASBA reaction mixture contains two specific primers, dNTP, NTP, 6.4 U of AMV reverse transcriptase, 0.08 U of Escherichia coli Rnase H5 and 32 U of T7 RNA polymerase. The amplification is carried out for 120 min at 410C in a total volume of 20 μl.
J00107] In a related method, self-sustained sequence-replication (3SR) reaction, isothermal amplification of target DNA or RNA sequences in vitro using three enzymatic activities: reverse transcriptase, DNA-dependent RNA polymerase and Escherichia coli ribonuclease H. This method may be modified from a 3-enzyme system to a 2-enzyme system by using human immunodeficiency virus (HIV)-I reverse transcriptase instead of avian myeloblastosis virus (AMV) reverse transcriptase to allow amplification with T7 RNA polymerase but without E. coli ribonuclease H. In the 2-enzyme 3SR, the amplified RNA is obtained in a purer form compared with the 3-enzyme 3SR (Gebinoga & Oehlenschlager European Journal of Biochemistry, 235:256-261 , 1996).
[0100] SDA is an isothermal nucleic acid amplification method. A primer containing a restriction site is annealed to the template. Amplification primers are then annealed to 5' adjacent sequences (forming a nick) and amplification is started at a fixed temperature. Newly synthesized DNA strands are nicked by a restriction enzyme and the polymerase amplification begins again, displacing the newly synthesized strands. SDA can be performed as according to Walker, et al, PNAS, 89:392-6 (1992). Briefly, an SDA reaction mixture contains four SDA primers, dGTP, dCTP, TTP, dATP, 150 U of Hinc II, and 5 U of exonuclease-deficient of the large fragment of E. coli DNA polymerase I (exo" Kl enow polymerase). The sample mixture is heated 95°C for 4 minutes to denature target DNA prior to addition of the enzymes. After addition of the two enzymes, amplification is carried out for 120 min. at 37°C in a total volume of 50 μl. Then, the reaction is terminated by heating for 2 minutes at 95°C.
[0101] Boomerang DNA amplification (BDA) is a method in which the polymerase begins extension from a single primer-binding site and then makes a loop around to the other strand, eventually returning to the original priming site on the DNA. BDA is differs from PCR through its use of a single primer. This method involves an endonuclease digestion of a sample DNA, producing discrete DNA fragments with sticky ends, ligating the fragments to "adapter" polynucleotides (comprised of a ligatable end and first and second self- complementary sequences separated by a spacer sequence) thereby forming ligated duplexes. The ligated duplexes are denatured to form templates to which an oligonucleotide primer anneals at a specific sequence within the target or marker sequence of interest. The primer is extended with a DNA polymerase to form duplex products followed by denaturation of the duplex products. Subsequent multiple cycles of annealing, extending, and denaturing are performed to achieve the desired degree of amplification (U.S. Patent No. 5,470,724).
[0102] The Q-beta replication system uses RNA as a template. Q-beta replicase synthesizes the single-stranded RNA genome of the coliphage QjS. Cleaving the RNA and ligating in a nucleic acid of interest allows the replication of that sequence when the RNA is replicated by Q-beta replicase (Kramer & Lizardi Trends Biotechnol. 1991 9£2):53-8, 1991). [0103] A variety of amplification enzymes are well known in the art and include, for example, DNA polymerase, RNA polymerase, reverse transcriptase, Q-beta replicase, thermostable DNA and RNA polymerases. Because these and other amplification reactions are catalyzed by enzymes, in a single step assay the nucleic acid releasing reagents and the detection reagents should not be potential inhibitors of amplification enzymes if the ultimate detection is to be amplification based. Amplification methods suitable for use with the present methods include, for example, strand displacement amplification, rolling circle amplification, primer extension preamplification, or degenerate oligonucleotide PCR (DOP). These methods of amplification are well known in the art and each described briefly below.
[0104] Preferably, PCR is used to amplify a target or marker sequence flanking the tandem repeat segment of interest. In this method, two or more oligonucleotide primers that anneal to opposite strands of a target or marker sequence are repetitively annealed to their complementary sequences, extended by a DNA polymerase (e.g., AmpliTaq Gold polymerase), and heat denatured, resulting in exponential amplification of the target nucleic acid sequences. Cycling parameters can be varied, depending on the length of nucleic acids to be extended. The skilled artisan is capable of designing and preparing primers that are appropriate for amplifying a target or marker sequence. The length of the amplification primers for use in the present invention depends on several factors including the nucleotide sequence identity and the temperature at which these nucleic acids are hybridized or used during in vitro nucleic acid amplification. The considerations necessary to determine a preferred length for an amplification primer of a particular sequence identity are well-known to a person of ordinary skill and include considerations described herein. For example, the length of a short nucleic acid or oligonucleotide can relate to its hybridization specificity or selectivity.
[0105] In some embodiments, the amplification may include a labeled primer, thereby allowing detection of the amplification product of that primer. In particular embodiments the amplification may include a multiplicity of labeled primers, preferably such primers are distinguishably labeled, allowing the simultaneous detection of multiple amplification products.
[0106] Oligonucleotide primers can be designed which are between about 10 and about 100 nucleotides in length and hybridize to the marker sequence. Oligonucleotide primers are preferably 12 to 70 nucleotides; more preferably 15-60 nucleotides in length; and most preferably 15-25 nucleotides in length.
[0107] In one embodiment, a primer pair is designed to amplify a marker sequence upstream of the tandem repeat region of the FMRl gene following size separation of nucleic acids fragmented by AIuI. An exemplary marker sequence upstream of the FMRl tandem repeat region for designing hybridization primers is depicted in Figure 2 (SEQ ID NO:!). A forward primer can hybridize to SEQ ID NO:1 between nucleotides 1 and 45, more preferably between positions 22 and 39 while a reverse primer can hybridize to SEQ ID NO:1 between positions 70 and 115, more preferably between 97 and 113. One example is to use a primer pair to amplify a region of the flanking sequence corresponding to approximately 95 bp upstream of the tandem repeat region; more specifically using a forward primer, SEQ ID NO:4 and a reverse primer, SEQ ID NO: 5 to amplify a 93 bp region of the marker sequence. Thus, preferred oligonucleotides which may be used as amplification primers include SEQ ID NO:4 (5'-GGTGGAGGGCCGCCTCTG-S') and SEQ ID NO:5 (5'- AGCGGCGCCTCCGTCACC -3')- Other preferred oligonucleotide primers are approximately 15-100 nucleotides in length and comprise SEQ ID NO:4 or SEQ ID NO:5. Still other preferred oligonucleotide primers include an oligonucleotide sequence that hybridizes to the complement of a 15-100 nucleotide sequence including SEQ ID NO:4 or SEQ ID NO:5. Such oligonucleotides may be substantially purified.
Table 2. Primers for amplifying marker sequences flanking FMRl
Figure imgf000037_0001
[0108] In another embodiment, a primer pair is designed to amplify a marker sequence flanking the tandem repeat region of the FMRl gene following size separation of nucleic acids fragmented by BIpI and MIyI. In one example, a primer pair is used to amplify a region of the flanking sequence downstream of the tandem repeat region; more specifically using a forward primer, FXCEF2 (SEQ ID NO:6), and a reverse primer, FXCER2 (SEQ ID NO:7), to amplify an 86 bp region of the marker sequence. Thus, preferred oligonucleotides which may be used as amplification primers include SEQ ID NO:6 (5'- GATGGAGGAGCTGGTGGTGG -3') and SEQ ID NO:7 (5'- GGAAGGGCGAAGATGGGG -3')- Other preferred oligonucleotide primers are approximately 15-100 nucleotides in length and comprise SEQ ID NO:6 or SEQ ID NO:7. Still other preferred oligonucleotide primers include an oligonucleotide sequence that hybridizes to the complement of a 15-100 nucleotide sequence including SEQ ID NO:6 or SEQ ID NO:7. Such oligonucleotides may be substantially purified.
[0109] In another embodiment, a primer pair is designed to amplify a marker sequence flanking the tandem repeat region of the FMRl gene following size separation of nucleic acids fragmented by SpM and Bmtl. In one example, a primer pair is used to amplify a region of the flanking sequence downstream of the tandem repeat region; more specifically using a forward primer, FXCEF3 (SEQ ID NO:8), and a reverse primer, FXCER3 (SEQ ID NO.9), to amplify an 86 bp region of the marker sequence. Thus, preferred oligonucleotides which may be used as amplification primers include SEQ ID NO:8 (5'- CGTGACGTGGTTTCAGTGTTTACA -3') and SEQ ID NO:9 (5'- GGAAGTGAAACCGAAACGGAG -3')- Other preferred oligonucleotide primers are approximately 15-100 nucleotides in length and comprise SEQ ID NO:8 or SEQ ID NO:9. Still other preferred oligonucleotide primers include an oligonucleotide sequence that hybridizes to the complement of a 15-100 nucleotide sequence including SEQ ID NO:8 or SEQ ID NO:9. Such oligonucleotides may be substantially purified.
[0110] Assay controls may be used in the assay for detecting carriers and individuals afflicted with fragile X syndrome. Positive controls for normal or wild type FMRl gene (i.e., less than 55 tandem repeats), the premutation (55-200 tandem repeats), and the full mutation (greater than 200 tandem repeats) may be used.
[0111] Additional controls may be included in the assay to determine if the restriction enzyme digestion of the genomic DNA is complete. One approach to evaluate the completeness of digestion by a particular restriction enzyme is to determine if the digested DNA can support a PCR amplification using a test primer pair that spans the restriction enzyme site used for digestion. Thus, if the nucleic acid has been fully digested by the restriction enzyme, there should be no amplification from the test primer pair, however, if digestion is incomplete, leaving some intact nucleic acid, the test primer pair should amplify the target. This test digestion PCR amplification can be conducted anytime after digestion including during amplification of the marker sequence.
Table 3. Exemplary control primers
Figure imgf000039_0001
[0112] In a particular embodiment of an assay to distinguish normal individuals from carriers and affected individuals of fragile X syndrome, the genomic DNA is digested with AIuI. Thus, the completion of the digestion of genomic DNA by AIuI can be determined by amplifying a region containing an AIuI recognition site. In one example of such a control, a pair of primers, AIuIF and AIuIR (SEQ ID NO: 10 and SEQ ID NO:11, respectively) are used to amplify a 103 bp target segment of a nucleic acid fragment containing an AIuI recognition site. Amplification of this segment will only occur when AIuI digestion is incomplete. In another embodiment of this assay, the genomic DNA is digested with BIpI and MIyI. Thus, the completion of the digestion of genomic DNA by BIpI and MIyI can be determined by amplifying a region containing a BIpI or an MIyI recognition site. In one example of such a control, a pair of primers, BIpIF and BIpIR (SEQ ID NO:12 and SEQ ID NO:13, respectively) are used to amplify a 138 bp segment of target segment of nucleic acid containing a BIpI recognition site. Amplification of this segment will only occur when BIpI digestion is incomplete.
(0113] Additional controls may be included in the assay to verify proper size-separation of fragmented DNA. Using sequence analysis tools known to those of skill in the art, one could determine the size distribution of fragments obtained using a particular restriction enzyme. One could then identify a specific control fragment having a size that corresponds to the size range of a particular fraction. Thus one could verify proper size separation of the fragmented DNA by detecting the control fragment in the appropriate fraction for its size. One could use a single control fragment in one fraction, a control in each fraction relevant to the determination of a normal versus abnormal tandem repeat, or a control fraction in all fractions.
[0114] In a particular embodiment of an assay to distinguish individuals having a normal FMRl allele from those having a premutation FMRl allele and those having a full mutation FMRl allele, a control amplification is also included to determine if the largest tandem repeat-containing fragments (obtained through digestion with AIuI) are collected into the size-appropriate fraction. In one example of such a control, a pair of primers, LargeF and LargeR (SEQ ID NO:14 and SEQ ID NO:15, respectively) will amplify a 121 bp segment of an 8,479 bp AIuI fragment of the USP41 gene when present in a fraction. In another embodiment of the assay, in which the genomic DNA is digested with BIpI and MIyI, controls are included to detect proper size separation of fragments of 675 bp, 905 bp, and 7,031 bp. In one example, a pair of primers, FIIctrllF and FIIctrllR (SEQ ID NO: 16 and SEQ ID NO: 17, respectively) will amplify a 102 bp segment of a 675 bp BIpI I MIyI fragment of the CFTR gene when present in a fraction. In another example, a pair of primers, FIIIctrl3F and FIIIctrBR (SEQ ID NO: 18 and SEQ ID NO: 19, respectively) will amplify a 113 bp segment of a 905 bp BIpI I MIyI fragment of the CFTR gene when present in a fraction. In another example, a pair of primers, LgctrlF and LgctrlR (SEQ ID NO:20 and SEQ ID NO:21, respectively) will amplify a 156 bp segment of a 7,031 bp BIpI I MIyI fragment from chromosome 21 (21 :20912766-20919795) when present in a fraction. Detection of Marker Sequences
[0115] Marker sequences may be amplified prior to detection or may be detected directly after size separation without an amplification step. In some embodiments, the marker sequence is amplified and the resulting amplicon is detected by electrophoresis, preferably capillary electrophoresis. In preferred embodiments, the marker sequence is amplified using a labeled primer such that the resulting amplicon is detectably labeled. In preferred embodiments, the primer is fluorescently labeled, however, the primers may be labeled according to the methods described below for oligonucleotide probes.
[0116] In preferred embodiments, the fragmented DNA is detected directly, without an amplification step, using two distinguishably-labeled nucleic acid probes which hybridize to two separate segments of a marker sequence upstream or downstream of the tandem repeats. The simultaneous detection of both labels in one hybridization complex indicates the presence of the marker sequence (and thus the associated tandem repeat). In one embodiment, detection is accomplished using a Trilogy 2020 Analyzer (US Genomics Woburn, MA). In this embodiment, two probes with distinguishable fluorescent labels are contacted with the size-separated DNA fragments. The resulting mixture of hybridization complexes is directed into a capillary tube where it is exposed to multiple lasers of differing wavelengths. The fluorescent labels of the probes are excited such that photons are emitted and detected. The simultaneous detection of fluorescent labels of different colors indicates the presence of the target region.
[0117] Probe oligonucleotides may be detectably labeled by methods known in the art. Useful labels include, e.g., fluorescent dyes (e.g., Cy5®, Cy3®, FITC, rhodamine, lanthamide phosphors, Texas red), 32P, 35S, 3H, 14C, 125I, 131I, electron-dense reagents (e.g., gold), enzymes, e.g., as commonly used in an ELISA (e.g., horseradish peroxidase, beta- gal actosidase, luciferase, alkaline phosphatase), colorimetric labels (e.g., colloidal gold), magnetic labels (e.g., DynabeadsTM), biotin, dioxigenin, or haptens and proteins for which antisera or monoclonal antibodies are available. Other labels include ligands or oligonucleotides capable of forming a complex with the corresponding receptor or oligonucleotide complement, respectively. The label can be directly incorporated into the nucleic acid to be detected, or it can be attached to a probe (e.g., an oligonucleotide) or antibody that hybridizes or binds to the nucleic acid to be detected. (0118] In preferred embodiment the detectable label is a fluorophore. The term "fluorophore" as used herein refers to a molecule that absorbs light at a particular wavelength (excitation frequency), and subsequently emits light of a different, typically longer, wavelength (emission frequency) in response. Suitable fluorescent moieties include the following fluorophores known in the art:
4-acetamido-4'-isothiocyanatostilbene-2,2'disulfonic acid acridine and derivatives: acridine acridine isothiocyanate
Alexa Fluor® 350, Alexa Fluor® 488, Alexa Fluor® 546, Alexa Fluor® 555, Alexa Fluor® 568, Alexa Fluor® 594, Alexa Fluor® 647 (Molecular Probes) 5-(2'-aminoethyl)aminonaphthalene-l-sulfonic acid (EDANS)
4-amino-N-[3-vinylsulfonyl)phenyl]naphthalimide-3,5 disulfonate (Lucifer Yellow VS) N-(4-anilino- 1 -naphthyl)maleimide anthranilamide
Black Hole QuencherTM (BHQTM) dyes (biosearch Technologies) BODIPY® R-6G, BOPIPY® 530/550, BODIPY® FL Brilliant Yellow coumarin and derivatives: coumarin
7-amino-4-methylcoumarin (AMC, Coumarin 120) 7-amino-4-trifluoromethylcouluarin (Coumarin 151) Cy2®, Cy3®, Cy3.5®, Cy5®, Cy5.5® cyanosine
4',6-diaminidino-2-phenylindole (DAPI)
5', 5"-dibromopyrogallol-sulfonephthalein (Bromopyrogallol Red) 7-diethylamino-3-(4'-isothiocyanatophenyl)-4-methylcoumarin diethylenetri amine pentaacetate
4,4'-diisothiocyanatodihydro-stilbene-2,2'-disulfonic acid 4,4'-diisothiocyanatostiϊbene-2,2'-disulfonic acid
5-[dimethylamino]naphthalene-l-sulfonyl chloride (DNS, dansyl chloride) 4-(4'-dimethylaminophenylazo)benzoic acid (DABCYL) 4-dimethylaminophenylazophenyl-4'-isothiocyanate (DABITC) EclipseTM (Epoch Biosciences Inc.) eosin and derivatives: eosin eosin isothiocyanate erythrosin and derivatives: erythrosin B erythrosin isothiocyanate ethidium fluorescein and derivatives:
5-carboxyfluorescein (FAM)
5-(4,6-dichlorotriazin-2-yl)aminofluorescein (DTAF)
2',7'-dimethoxy-4'5'-dichloro-6-carboxyfluorescein (JOE) fluorescein fluorescein isothiocyanate (FITC) hexachloro-β-carboxyfluorescein (HEX)
QFITC (XRITC) tetrachlorofluorescein (TET) fluorescamine IR 144 IR 1446
Malachite Green isothiocyanate 4-methylumbelliferone ortho cresolphthalein nitrotyrosine pararosaniline Phenol Red
B-phycoerythrin, R-phycoerythrin o-phthaldialdehyde Oregon Green® propidium iodide pyrene and derivatives: pyrene pyrene butyrate succinimidyl 1 -pyrene butyrate
QSY® 7, QSY® 9, QSY® 21, QSY® 35 (Molecular Probes) Reactive Red 4 (Cibacron® Brilliant Red 3B-A) rhodamine and derivatives:
6-carboxy-X-rhodamine (ROX)
6-carboxyrhodamine (R6G) lissamine rhodamine B sulfonyl chloride rhodamine (Rhod) rhodamine B rhodamine 123 rhodamine green rhodamine X isothiocyanate sulforhodamine B sulforhodamine 101 sulfonyl chloride derivative of sulforhodamine 101 (Texas Red) N ,N ,N\N!~tetramethyl-6-carboxyrhodarnine (TAMRA) tetramethyl rhodamine tetramethyl rhodamine isothiocyanate (TRITC) riboflavin rosolic acid terbium chelate derivatives
[0119] Other fluorescent nucleotide analogs can be used, see, e.g., Jameson, Meth. Enzymol. 278:363-390, 1997; Zhu, Nucl. Acids Res. 22:3418-3422, 1994. U.S. Patent Nos. 5,652,099 and 6,268,132 also describe nucleoside analogs for incorporation into nucleic acids, e.g., DNA and/or RNA, or oligonucleotides, via either enzymatic or chemical synthesis to produce fluorescent oligonucleotides. U.S. Patent No. 5,135,717 describes phthalocyanine and tetrabenztriazaporphyrin reagents for use as fluorescent labels.
[0120] The detectable label can be incorporated into, associated with or conjugated to a nucleic acid. Label can be attached by spacer arms of various lengths to reduce potential steric hindrance or impact on other useful or desired properties. See, e.g., Mansfield, MoI. Cell. Probes 9:145-156, 1995.
[0121] Detectable labels can be incorporated into nucleic acids by covalent or non- covalent means, e.g., by transcription, such as by random-primer labeling using Klenow polymerase, or nick translation, or, amplification, or equivalent as is known in the art. For example, a nucleotide base is conjugated to a detectable moiety, such as a fluorescent dye, e.g., Cy3® or Cy5® and then incorporated into genomic nucleic acids during nucleic acid synthesis or amplification. Nucleic acids can thereby be labeled when synthesized using Cy3®- or Cy5®-dCTP conjugates mixed with unlabeled dCTP.
[0122] Nucleic acid probes can be labeled by using PCR or nick translation in the presence of labeled precursor nucleotides, for example, modified nucleotides synthesized by coupling allylamine-dUTP to the succinimidyl-ester derivatives of the fluorescent dyes or haptens (such as biotin or digoxigenin) can be used; this method allows custom preparation of most common fluorescent nucleotides, see, e.g., Henegariu, Nat. Biotechnol. 18:345-348, 2000.
[0123] Nucleic acid probes may be labeled by non-covalent means known in the art. For example, Kreatech Biotechnology's Universal Linkage System® (ULS®) provides a non- enzymatic labeling technology, wherein a platinum group forms a co-ordinative bond with DNA, RNA or nucleotides by binding to the N7 position of guanosine. This technology may also be used to label proteins by binding to nitrogen and sulphur containing side chains of amino acids. See, e.g., U.S. Patent Nos. 5,580,990; 5,714,327; and 5,985,566; and European Patent No. 0539466.
[0124] The binding of a probe to the marker sequence flanking the tandem repeat region may be determined by hybridization as is well known in the art. Hybridization may be detected in real time or in non-real time.
[0125] One general method for real time PCR uses fluorescent probes such as the TaqMan® probes, molecular beacons and scorpions. The probes employed in TaqMan® and molecular beacon technologies are based on the principle of fluorescence quenching and involve a donor fluorophore and a quenching moiety. [0126] The term "donor fluorophore" as used herein means a flυorophore that, when in close proximity to a quencher moiety, donates or transfers emission energy to the quencher. As a result of donating energy to the quencher moiety, the donor fluorophore will itself emit less light at a particular emission frequency that it would have in the absence of a closely positioned quencher moiety.
(0127] The term "quencher moiety" as used herein means a molecule that, in close proximity to a donor fluorophore, takes up emission energy generated by the donor and either dissipates the energy as heat or emits light of a longer wavelength than the emission wavelength of the donor. In the latter case, the quencher is considered to be an acceptor fluorophore. The quenching moiety can act via proximal (i.e. collisional) quenching or by Fδrster or fluorescence resonance energy transfer ("FRET"). Quenching by FRET is generally used in TaqMan® probes while proximal quenching is used in molecular beacon and scorpion type probes.
[0128] In proximal quenching (a.k.a. "contact" or "collisional" quenching), the donor is in close proximity to the quencher moiety such that energy of the donor is transferred to the quencher, which dissipates the energy as heat as opposed to a fluorescence emission. In FRET quenching, the donor fluorophore transfers its energy to a quencher which releases the energy as fluorescence at a higher wavelength. Proximal quenching requires very close positioning of the donor and quencher moiety, while FRET quenching, also distance related, occurs over a greater distance (generally 1 —10 nm, the energy transfer depending on R"6, where R is the distance between the donor and the acceptor). Thus, when FRET quenching is involved, the quenching moiety is an acceptor fluorophore that has an excitation frequency spectrum that overlaps with the donor emission frequency spectrum. When quenching by FRET is employed, the assay may detect an increase in donor fluorophore fluorescence resulting from increased distance between the donor and the quencher (acceptor fluorophore) or a decrease in acceptor fluorophore emission resulting from increased distance between the donor and the quencher (acceptor fluorophore).
[0129] TaqMan® probes (Heid et al., 1996) use the fiuorogenic 5' exonuclease activity of Taq polymerase to measure the amount of target or marker sequences in DNA samples. TaqMan® probes are oligonucleotides that contain a donor fluorophore usually at or near the 5' base, and a quenching moiety typically at or near the 3' base. The quencher moiety may be a dye such as TAMRA or may be a non-fluorescent molecule such as 4-(4 - dimethylaminophenylazo)benzoic acid (DABCYL). See Tyagi et al., Nature Biotechnology 16:49-53 (1998). When irradiated, the excited fluorescent donor transfers energy to the nearby quenching moiety by FRET rather than fluorescing. Thus, the close proximity of the donor and quencher prevents emission of donor fluorescence while the probe is intact.
[0130] TaqMan® probes are designed to anneal to an internal region of a PCR product. When the polymerase replicates a template on which a TaqMan® probe is bound, its 5' exonuclease activity cleaves the probe. This ends the activity of quencher (no FRET) and the donor fluorophore starts to emit fluorescence which increases in each cycle proportional to the rate of probe cleavage. Accumulation of PCR product is detected by monitoring the increase in fluorescence of the reporter dye (note that primers are not labeled). If the quencher is an acceptor fluorophore, then accumulation of PCR product can be detected by monitoring the decrease in fluorescence of the acceptor fluorophore.
[0131] TaqMan® assay uses universal thermal cycling parameters and PCR reaction conditions. Because the cleavage occurs only if the probe hybridizes to the target, the fluorescence detected originates from specific amplification. The process of hybridization and cleavage does not interfere with the exponential accumulation of the product. One specific requirement for fluorogenic probes is that there be no G at the 5' end. A 1G' adjacent to the reporter dye quenches reporter fluorescence even after cleavage.
[0132] Other methods of probe hybridization detected in real time can be used for detecting amplification a target or marker sequence flanking a tandem repeat region. For example, the commercially available MGB Eclipse™ probes (Epoch Biosciences), which do not rely on a probe degradation can be used. MGB Eclipse™ probes work by a hybridization-triggered fluorescence mechanism. MGB Eclipse™ probes have the Eclipse™ Dark Quencher and the MGB positioned at the 5'-end of the probe. The fluorophore is located on the 3'-end of the probe. When the probe is in solution and not hybridized, the three dimensional conformation brings the quencher into close proximity of the fluorophore, and the fluorescence is quenched. However, when the probe anneals to a target or marker sequence, the probe is unfolded, the quencher is moved from the fluorophore, and the resultant fluorescence can be detected. [0133] Suitable donor fluorophores include 6-carboxyfluorescein (FAM), tetrachloro-6- carboxyfluorescein (TET), 2'-chloro-7'-phenyl-l,4-dichloro-6-carboxyfIuorescein (VIC), and the like. Suitable quenchers include tetra-methylcarboxyrhodamine (TAMRA) 4-(4 - dimethylaminophenylazo)benzoic acid ("DABCYL" or a DABCYL analog) and the like. Tetramethylrhodamine (TMR) or 5-carboxyrhodamine 6G (RHD) may be combined as donor fluorophores with DABCYL as quencher. Multiplex TaqMan assays can be performed using multiple detectable labels each comprising a different donor and quencher combination. Probes for detecting amplified sequence in real time may be stored frozen (-10° to -3O0C) as 100 μM stocks. TaqMan probes are available from Applied BioSystems (4316032).
[0134] In a preferred embodiment, real time PCR is performed using TaqMan® probes in combination with a suitable amplification/analyzer such as the ABI Prism 7900HT Sequence Detection System. The ABI PRISM® 7900HT Sequence Detection System is a high- throughput real-time PCR system that detects and quantitates nucleic acid sequences. Briefly, TaqMan™ probes specific for the amplified target or marker sequence are included in the PCR amplification reaction. These probes contain a reporter dye at the 5' end and a quencher dye at the 3' end. Probes hybridizing to different target or marker sequences are conjugated with a different fluorescent reporter dye. During PCR, the fluorescently labeled probes bind specifically to their respective target or marker sequences; the 5' nuclease activity of Taq polymerase cleaves the reporter dye from the probe and a fluorescent signal is generated. The increase in fluorescence signal is detected only if the target or marker sequence is complementary to the probe and is amplified during PCR. A mismatch between probe and target greatly reduces the efficiency of probe hybridization and cleavage. The ABI Prism 7700HT or 7900HT Sequence detection System measures the increase in fluorescence during PCR thermal cycling, providing "real time" detection of PCR product accumulation.
[0135] Real Time detection on the ABI Prism 7900HT or 7900HT Sequence Detector monitors fluorescence and calculates Rn during each PCR cycle. The threshold cycle, or Ct value, is the cycle at which fluorescence intersects the threshold value. The threshold value is determined by the sequence detection system software or manually.
[0136] Oligonucleotide probes can be designed which are between about 10 and about 100 nucleotides in length and hybridize to the amplified region. Oligonucleotides probes are preferably 12 to 70 nucleotides; more preferably 15-60 nucleotides in length; and most preferably 15-25 nucleotides in length. The probe may be labeled. In one example, SEQ ID NO:26 can be used as an oligonucleotide probe to detect a marker sequence associated with the tandem repeat region of the FMRl gene (following genomic fragmentation by AIuI), when the marker sequence is amplified by forward and reverse primers as set forth in SEQ ID NO:4 and SEQ ID NO:5, respectively. SEQ ID NO:27 can be used to detect an AIuI control fragment amplicons amplified by AluIFtaq and AluIRtaq (SEQ ID NOs:22 and 23, respectively) and . SEQ ID NO:28 can be used to detect the 8,479 bp AIuI control fragment amplicon, as amplified by LargeFtaq and LargeRtaq (SEQ ID NOs: 24 and 25, respectively).
Table 4. TaqMan Probes
Figure imgf000049_0001
[0137] Amplified fragments may be detected using standard gel electrophoresis methods. For example, in preferred embodiments, amplified fractions are separated on an agarose gel and stained with ethidium bromide by methods known in the art to detect amplified fragments.
Sizing by PCR Amplification and Electrophoresis
[0138] In certain embodiments, methods involving amplification of the tandem repeat region are used to measure the size of that region. In some embodiments, such methods are used as a screen prior to the use of a second method for sizing the tandem repeat region. In preferred embodiments, the amplification is preferably done by PCR. In this method, the entire tandem repeat region is amplified. The resulting amplicons are sized using electrophoresis, preferably capillary electrophoresis.
[0139] In one example, forward primer FX-5F (SEQ ID NO:29; 5'GCT CAG CTC CGT TTC GGT TTC ACT TCC GGT 3') is used in an amplification reaction with reverse primer FX-3F(SEQ ID NO:30; 5'-AGC CCC GCA CTT CCA CCA CCA GCT CCT CCA-3') to amplify the tandem repeat region of the FMRl gene. Preferably, one of the primers of this primer pair is labeled, preferably the label is a fluorescent label. Amplification product may be detected and sized by electrophoresis, preferably capillary electrophoresis. Alternatively, amplification products can be detected and sized using Southern blot.
Correlation of Fragment Size to Normal or Carrier/Affected Status
[0140] The fraction in which the marker sequence upstream or downstream of the tandem repeat region is detected corresponds to the size of the fragment containing the tandem repeat. This correlation enables an estimation of the number of tandem repeats and thus, whether an individual is normal or carries an allele having an expansion in the tandem repeat region.
[0141] In some embodiments, individuals that are afflicted with a disease associated with a mutation in the tandem repeat region of a gene can be distinguished from those that are normal. A nucleic acid sample from the individual is fragmented to produce nucleic acid fragments in which the tandem repeat segment of the gene is associated with a marker sequence in a fragment. The fragments are separated into fractions according to size under conditions in which the fragment(s) containing the tandem repeat segment will be located in the fractions according to the number of repeats in the tandem repeat segment, and identifying those fraction(s) containing the segment by detecting the marker sequence. The fractions are chosen so that one fraction corresponds to tandem repeat regions having a normal number of repeats (i.e., a normal allele) and another fraction that corresponds an abnormal number of repeats (i.e., a mutated allele). If only the former fraction is positive, the individual is normal; if only the latter fraction is positive, the individual may be a carrier or may be afflicted with the disease. A positive result in both fraction indicates a heterozygote and the individual may or may not be affected, depending on whether the disease is dominant or recessive. If the disease is dominant, heterozygotes will be affected; if the disease is recessive, the heterozygote will not be affected but will be a carrier of the disease.
[0142] In other embodiments, individuals that have a normal allele can be distinguished from those that have a premutation or a full mutation in the tandem repeat region of a gene. A nucleic acid sample from the individual is fragmented to produce nucleic acid fragments in which the tandem repeat segment of the gene is associated with a marker sequence in a fragment. The fragments are separated into fractions according to size under conditions in which the fragment(s) containing the tandem repeat segment will be located in the fractions according to the number of repeats in the tandem repeat segment, and identifying those fraction(s) containing the segment by detecting the marker sequence. The fractions are chosen so that a first fraction corresponds to tandem repeat regions having a normal number of repeats (i.e., a normal allele), a second fraction corresponds to tandem repeat regions having a number of repeats in a premutation (i.e., a premutation allele), and a third fraction that corresponds to tandem repeat regions having a number of repeats in a full mutation (i.e., a full mutation allele). Generally, if only the first fraction is positive, the individual is normal; if only the second fraction is positive, the individual carries the premutation, and if only the third fraction is positive, the individual is affected with the disease. A positive result in more than one fraction indicates a heterozygote. A heterozygote may be a carrier or an affected individual depending on the gene involved and the dominance of the disease.
[0143] In some embodiments, individuals having a mutation associated with fragile X syndrome (i.e., a full mutation in the FMRl gene) can be distinguished from individuals having a premutation or a normal allele. A nucleic acid sample from the individual is fragmented to produce nucleic acid fragments in which the tandem repeat segment of the FMRl gene is associated with a marker sequence in a fragment. The fragments are separated into fractions according to size under conditions in which the fragment(s) containing the tandem repeat segment will be located in the fractions according to the number of repeats in the tandem repeat segment, and identifying those fraction(s) containing the segment by detecting the marker sequence. The fractions are chosen so that a first fraction corresponds to tandem repeat regions having a normal number of repeats (i.e., a normal allele), a second fraction corresponds to tandem repeat regions having a number of repeats in a premutation (i.e., a premutation allele), and a third fraction that corresponds to tandem repeat regions having a number of repeats in a full mutation (i.e., a full mutation allele). In preferred embodiments, the fractions are chosen so that the first fraction corresponds to tandem repeat regions having less than 55 repeats, the second corresponds to tandem repeat regions having 55-200 repeats, and the third corresponds to tandem repeat regions having greater than 200 repeats. Therefore, if the first fraction is positive (i.e., the marker sequence was detected in this fraction), it indicates that the tandem repeat region contains less than 55 repeats (i.e., a normal allele), if the second fraction is positive, it indicates that the tandem repeat region contains 55-200 repeats (i.e., a premutation allele), and if the third fraction is positive, it indicates that the tandem repeat contains greater than 200 repeats (i.e., a full mutation allele). In male individuals, a phenotype or disease status may be assigned based on these results. Males generally possess only one X chromosome (the chromosome which contains the FMRl gene), therefore, if the first fraction is positive, the individual is normal; if the second fraction is positive, the individual is a carrier of the premutation; if the third fraction is positive, the individual is afflicted with fragile X. Female individuals possess two X chromosomes, therefore, females possessing two normal alleles are normal and females possessing a premutation allele or a full mutation allele are carriers. Females heterozygous for a normal allele and a full mutation allele may or may not be affected, depending on other factors such as methylation status of the gene.
[0144] In certain embodiments of an assay to distinguish individuals having a normal tandem repeat region of the FMRl gene from those having a premutation or full mutation, two fractions of AIuI fragmented genomic DNA are separated by capillary electrophoresis and collected by automatic fraction collector, one between 211 — 400 bp, one between 401 bp - 9 kb. AIuI fragments having 6-68 repeats will be present in the lower fraction, thus normal alleles and those with small premutations (i.e., 55-68 repeats) will be separated into this fraction. Normal alleles and small premutions can be further distinguished by amplification of the tandem repeat region and sizing with electrophoresis. The premutation, encompassing a range of 69-200 repeats, and the full mutation, encompassing 201-2000+ repeats will be present in the upper fraction. Therefore, if the lower fraction is positive (i.e., the marker sequence was detected in this fraction), it indicates that there is a CGG tandem repeat region that contains 6-68 repeats; if the upper fraction is positive, it indicates that there is CGG tandem repeat region that contains 68-2000+ repeats. A positive result in both fractions would indicate a heterozygote in which one allele is normal and the other allele contains the premuation or the full mutation.
[0145J In other embodiments of the assay to determine size of the tandem repeat region of the FMRl gene, DNA fragmented with a combination of BIpI an dMlyl is separated into four fractions corresponding to sizes of approximately less than or equal to 603 bp (first/lowest fraction), 604-840 bp (second fraction), 841 - 1078 bp (third fraction) and 1079 bp - 9 kb (fourth/highest fraction). Fragments from samples containing a normal FMRl gene (i.e., less than 55 tandem repeats) and those containing premutations having 56-62 tandem repeats will separate into the first/lowest fraction. Normal alleles and small premutions can be further distinguished by amplification of the tandem repeat region and sizing with electrophoresis. Fragments from FMRl genes containing small premutations (i.e., 63-140 tandem repeats) will separate into the second fraction, whereas fragments from FMRl genes containing large premutations (i.e., 141-200 tandem repeats) and full mutations having 201-220 tandem repeats will separate into the third fraction, and whereas large premutations having greater than 220 tandem repeats will separate into the fourth/highest fraction. Large premuations 141-200 repeats can be distinguished from full mutations of 201-220 using, for example, standard Southern blot methods.
[0146] In another embodiment, the AIuI digested genomic DNA is size fractionated using capillary electrophoresis into a multiplicity of fractions. Automated fraction collection is accomplished using a preset fraction time window of approximately 30 seconds per fraction, beginning at 200 bp and ending at 9 Kb. Approximately 16 fractions are collected. The fraction or fractions that are positive for detection of the marker sequence upstream or downstream of the tandem repeat region correspond to a size range and thus, the number tandem repeats can be estimated.
Gender Determination
[0147] In some embodiments, a nucleic acid assay to determine gender is combined with an assay to determine the length of the tandem repeat region of the FMRl gene. In preferred embodiments, the nucleic acid assay includes DNA amplification. Such DNA amplification assays may target amplification of sequences specific to the Y chromosome (e.g., the SRY locus (Sinclair, et al., Nature 346:240 244, 1990)). In this case, amplification only occurs in the presence of a Y chromosome, indicating the nucleic acids are from a male. The absence of amplification suggests the nucleic acids are from a female. However, in these assays, a positive control is preferably included to detect false negatives.
[0148] In other examples, certain genes which occur on both the X chromosome and the Y chromosome but having different lengths depending on whether the gene occurs on the X chromosome or the Y chromosome, may be targeted for amplification. In this example, a region encompassing the segment of the gene which differs between the X and Y chromosomes would be amplified. This results in amplification products having different sizes, corresponding to the template nucleic acid (i.e., the X chromosome or the Y chromosome). Thus, amplification of nucleic acids from males would result in amplicons of both sizes, whereas samples from females would result in amplicons having only one size.
[0149] In a preferred embodiment, the amelogenin gene is targeted for gender determination. Sequence differences between the X and Y homologs of the amelogenin gene have been used to differentiate males from females. For example, two primer sets primer sets spanning a 6 base pair (bp) deletion of the amelogenin gene on the X chromosome have been used to generate fragments of 106/112 bp or 212/218 bp for XfY products, respectively (Sullivan et al., BioTechniques 15:636-9, 1993). In preferred embodiments, the following primers are used to amplify a region of the amelogenin gene:
AMLF2 primer, S'-AGTACTTGACCACCTCCTGATCTACAAGG 3' (SEQ ID NO:40) and
AMLR2 primer, 5'-TTTTTAACAGTTTACTTGCTGATAAAACTCAYCCC 3' (SEQ ID NO:41).
This primer pair results in a 134 bp amplicon corresponding to the X chromosome homolog and a 140 bp amplicon corresponding to the Y chromosome. Thus, both amplicons would be generated by amplification of nucleic acids from males, whereas only one amplicon would be generated by amplification of nucleic acids from females.
10150] The following examples serve to illustrate the present invention. These examples are in no way intended to limit the scope of the invention.
EXAMPLE 1
Restriction Enzyme Digestion
[0151] This example describes methods to detect expansion of the tandem repeat region of the FMRl gene. Genomic DNA test samples and control samples of DNA were restriction endonuclease digested with AIuI. 1.0 μg of test or control DNA was used for each digest. Genomic DNA was purified and diluted to a concentration of 50 ng/μL. The reaction mix for the digest was prepared according to the following table. Table 5. AhA reaction mix
Figure imgf000055_0001
[0152] 10 μL of the AIuI reaction mix was added to each sample of DNA. The samples were mixed by vortexing and spinning in a microfuge and are then incubated overnight at
37°C.
EXAMPLE 2
Size Separation of Digested DNA
[0153] A. Two fraction approach
[0154] Restriction enzyme digested DNA as described in Example 1 was separated into fractions according to size. 1 kb DNA ladder and the digested test genomic DNA samples were placed into a 96- well plate. The plate was subjected to auto-sampling and automatic fractionation using a Beckman Coulter P/ ACE MDQ Series Capillary Electrophoresis System in reversed polarity separation mode. The ladder was first injected into the capillary with 6kv/60sec, then run with 200V/cm to determine the correct sizing cutoff time. Automated fraction collection was accomplished using a preset fractionation time window corresponding to 211-400 bp (lower fraction) and 401 bp - 9 kb (upper fraction). The separations are monitored on-column by UV detection. The data were acquired and evaluated by the P/ACE MDQ 32 Karat software package.
[0155] The digested samples were fractionated using the same conditions as the 1 kb DNA ladder. The lower fraction (211-396 bp) and the upper fraction (396 bp -9 kb) were collected based on the sizing cutoff times as determined using the 1 kb ladder for each size range. Fractions were collected using the P/ACE MDQ auto-collector and stored in a 96-well plate in 30 μL 0.1 X TBE buffer per well. [0156] A. 16 fraction approach
[0157] Automated fraction collection was accomplished using a preset fractionation time window 30 seconds per fraction, starting roughly form 200bps and ending at 9Kbs. The separations were monitored on —column by UV detection. The data were acquired and evaluated by the P/ACE MDQ 32 Karat software package.
[0158] 1-kb DNA ladder and the digested genomic DNA samples were placed into a 96 well plate. The plate was subjected to autosampling, automatic fractionation on the Beckman Coulter P/ACE MDQ machine with cooling control. The ladder was first electronkinetic injected into the capillary with 6kv/60sec, then run with 200V/cm to determine the correct sizing window (preferably 200bps-9Kbs).
]0159] The digested samples were fractionated based on the same condition as the 1-kb DNA ladder running condition. A total 16 fractions were collected using P/ACE MDQ's auto collector and stored in a 96-well plate in 5 μL dH2θ per well.
EXAMPLE 3
PCR Amplification and Detection of Fragments with Tandem Repeats
[0160] In an assay to detect mutations characterized by an expansion of the tandem repeat region of the FMRl gene, working PCR master mixes were made as described.
[0161] A. PCR amplification (non-TaqMan)
[0162] A PCR Master Mix for amplifying fragments and for size separation analysis of PCR product was prepared as shown in Table 6. Table 6. Preparation of PCR primer (non-Taqman) master mix for CCG 5' flanking sequence
Figure imgf000057_0001
[0163] The PCR primer master mix was stored in 1.5 mL aliquots at -20 0C prior to use. When ready for use, PCR reactions were prepared as shown in Table 7.
Table 7. Final. PCR amplification mixture for CCG 5' flanking sequence
Figure imgf000057_0002
[0164] The final amplification mixtures were sealed tightly in plates with Microseal A film. The plates were vortexed briefly (approximately 5 sec) and spun down for approximately 30 sec in a plate centrifuge at 2,000-6,00Og (1,600 rpm in a Sorvall T6000D centrifuge). The plate was transferred to the ABI 9700 thermal cycler. [0165] The thermal cycler conditions for amplification were as follows:
Step 1 95°C for 15 minutes.
Step 2 95°C for 60 seconds
Step 3 64°C for 60 seconds.
Step 4 72°C for 30 seconds
Step 5 Steps 2-4 repeated, 34 times.
Step 6 72°C for 5 minutes.
Step 7 4°C indefinitely.
[0166] PCR products were then loaded onto the ABI 3100 genetic analyzer for detection.
[0167] B. TaqMan PCR amplification
[0168] A Taqman PCR master mix for detecting CCG 5 ' flanking sequence was prepared as shown in Table 8.
Table 8. Preparation of Taqman PCR master mix for CCG 5' flanking sequence
Figure imgf000059_0001
10169] The Taqman PCR master mix was stored in 1.5 mL aliquots at -200C prior to use. When ready for use, Taqman PCR reactions were prepared as shown in Table 9.
Table 9. TaqMan PCR Master Mix
Figure imgf000059_0002
[01701 45 μL of the TaqMan PCR master mix was added to each well of the 96-well plate containing the fractions of size separated DNA. The wells are sealed tightly with Microseal A film. The plates were vortexed briefly (approximately 5 sec) and spun down for approximately 30 sec in a plate centrifuge at 2,000-6,00Og (1,600 rpm in a Sorvall T6000D centrifuge). The plate was transferred to the ABI 7700 (or 7900HT) sequence detector.
[0171] The thermocycler conditions for TaqMan were as follows:
Step 1 950C for 15 minutes.
Step 2 95°C for 60 seconds
Step 3 64°C for 60 seconds.
Step 4 720C for 30 seconds
Step 5 Steps 2-4 repeated, 40 times.
Step 6 720C for 5 minutes.
Step 7 4°C indefinitely.
EXAMPLE 4
Detection of Amplified Fractions by Gel Electrophoresis
[0172] Gel electrophoresis was used to identify the size of PCR amplified fragments prepared as described in Example 3 A. 6 μL of 6X FEB (ficoll, EDTA bromphenol blue loading dye) was added to 6 μL PCR products and the resulting mixture was loaded into the gel. 50 bp DNA ladder was loaded into the first and last well of the gel. Samples were electrophoresed in 0.8% agarose at 200V for 1.5 hours. Completed gel was photographed with UV photodocumentation apparatus (Alpha Innotech Image Analysis System).
EXAMPLE 5
Detection of Expansion Mutations of the Tandem Repeat Region of the DM-I Gene
|0173] In an assay to detect mutations characterized by an expansion of the tandem repeat region of the DM-I gene, genomic DNA test samples are restriction endonuclease digested with AIuI. Approximately 1.0 μg of test genomic DNA is used for each digest. The reaction mix for the digest is prepared according to the enzyme supplier's protocol. The samples are mixed and incubated at 37°C to complete digestion. [0174] The restriction enzyme digested DNA is separated according to size using capillary electrophoresis and two fractions are collected, such that the first fraction (250-360 bp) corresponds to the normal repeat range (e.g., 5-37 repeats) and the second fraction (400 bp - 9 kb) corresponds to an expanded repeat mutation (e.g., greater than 50 repeats). 1 kb DNA ladder is first injected into the capillary to determine the correct sizing cutoff time. Automated fraction collection is accomplished using a preset fractionation time window corresponding to a lower fraction and an upper fraction. The separations are monitored on— column by UV detection. The digested samples are then fractionated using the same conditions as the 1 kb DNA ladder. The lower fraction and the upper fraction are collected based on the sizing cutoff times as determined using the 1 kb ladder for each size range.
[0175] Each fraction is analyzed using the TaqMan real time PCR method for the presence of a fragment containing the DM-I tandem repeat region. In this method, a segment of the 3 '-untranslated region of DM-I gene is amplified using a forward primer (e.g., 5'- CCATTTCTTTCTTTCGGCCA-3'; SEQ ID NO:31) and a reverse primer (e.g., 5'- AGGCCTGC AGTTTGCCC-3'; SEQ ID NO:32). The amplified fragment is detected with a TaqMan labeled probe, 5'-TGAGGCCCTGACGTGG-3f (SEQ ID NO:33).
[0176] The presence of the amplified segment in only the lower fraction is indicative of an individual homozygous for the normal DM-I allele. The presence of the amplified segment in only the upper fraction is indicative of an individual homozygous for a mutant allele(s). The presence of the amplified segment in both fractions is indicative of a heterozygote.
EXAMPLE 6
Detection of Expansion Mutations of the Tandem Repeat Region of the FRDA Gene
[0177] In an assay to detect mutations characterized by an expansion of the tandem repeat region of the FRDA gene, genomic DNA test samples are restriction endonuclease digested with AIuI and Rsal. Approximately 1.0 μg of test genomic DNA is used for each digest. The reaction mix for the digest is prepared according to the enzyme supplier's protocol. The samples are mixed and incubated at 37°C to complete digestion.
[0178] The restriction enzyme digested DNA is separated according to size using capillary electrophoresis and two fractions are collected, such that the first fraction (300-405 bp) corresponds to the normal repeat range (e.g., 7-34 repeats) and the second fraction (600 bp - 9 kb) corresponds to an expanded repeat mutation {e.g., greater than 100 repeats). 1 kb DNA ladder is first injected into the capillary to determine the correct sizing cutoff time. Automated fraction collection is accomplished using a preset fractionation time window corresponding to a lower fraction and an upper fraction. The separations are monitored on— column by UV detection. The digested samples are then fractionated using the same conditions as the 1 kb DNA ladder. The lower fraction and the upper fraction are collected based on the sizing cutoff times as determined using the 1 kb ladder for each size range.
[0179] Each fraction is analyzed using the TaqMan real time PCR method for the presence of a fragment containing the FRDA tandem repeat region. In this method, a segment of the first intronic region of the FRDA gene is amplified using a forward primer (e.g., 5'- AGGCCT AGG A AGGTGGATCAC-3'; SEQ ID NO:34) and a reverse primer (e.g., 5'- ACCATGTTGGCCAGGTTAGTCT-3'; SEQ ID NO:35). The amplified fragment is detected with a TaqMan labeled probe, 5'-TGAGGTCCGGAGTTC-S' (SEQ ID NO:36).
[0180] The presence of the amplified segment in only the lower fraction is indicative of an individual homozygous for the normal FRDA allele. The presence of the amplified segment in only the upper fraction is indicative of an individual homozygous for a mutant allele(s). The presence of the amplified segment in both fractions is indicative of a heterozygote.
EXAMPLE 7
Detection of Expansion Mutations of the Tandem Repeat Region of the FMRl Gene
[0181] In this example, expansion mutations of the tandem repeat region of the FMRl gene were detected by fragmentation with AIuI, size fractionation, followed by a second restriction enzyme digestion with BstNI. Thus, genomic DNA test samples were restriction endonuclease digested with AIuI as described above in Example 1. The digested DNA was then fractionated into four fractions using capillary electrophoresis. Fraction 1 contains nucleic acids with 0-60 CCG tandem repeats, fraction 2 contains 60-200 CCG tandem repeats, fraction 3 contains 200-2000 CCG tandem repeats and fraction 4 contains 2000+ CCG tandem repeats. Following fractionation, an aliquot of each of the four fractions was digested with a second restriction endonuclease, BstNI, that cleaved the marker from the CCG tandem repeat region. Following the second nucleic acid fractionation, each of the four fractions were subjected to PCR as described in Example 3B (i.e., TaqMan PCR Amplification).
[0182] A. Second Restriction Enzyme Digestion of Samples from Affected Individuals
[0183] Affected samples were tested with and without a second restriction enzyme digestion in which the marker sequence was cleaved from the tandem repeat region. Samples having the second digestion had much stronger signals (i.e., higher relative fluorescence unit (RFU) signals) as compared to samples that did not undergo a second enzyme digestion. Each sample was run in duplicate; data are shown below in table 9. These data suggest that there is increased amplification of the marker sequence when the marker sequence is cleaved from the tandem repeat region.
Table 10. Relative Fluorescence Unit (RFU) Signals of Affected Samples With ("with 2°") and Without a Second Restriction Enzyme Digestion ("without 2°").
Figure imgf000063_0001
[0184] B. Second Restriction Enzyme Digestion of Samples from Normal Individuals
[0185] Normal samples were tested with and without a second restriction enzyme digestion in which the marker sequence was cleaved from the tandem repeat region. Samples having less starting material that underwent the second digestion, had comparable or higher signals (i.e., higher relative fluorescence unit (RFU) signals) as compared to samples having more starting material that did not undergo a second enzyme digestion. Data are shown below in table 10. These data suggest that there is increased amplification of the marker sequence when the marker sequence is cleaved from the tandem repeat region.
Table 11. Relative Fluorescence Unit (RFU) Signals of Normal Samples With ("with 2°") and Without a Second Restriction Enzyme Digestion ("without 2°").
Figure imgf000064_0001
EXAMPLE 8
Detection of Expansion Mutations of the Tandem Repeat Region of the FMRl Gene
[0186] In this example, expansion mutations of the tandem repeat region of the FMRl gene are detected by fragmentation with BIpI and MIyI, size fractionation, followed by a second restriction enzyme digestion with Bmtl. [0187] Genomic DNA test samples are restriction endonuclease digested with BIpI and MIyI using approximately 1.5 μg of test or control DNA (purified and diluted to a concentration of 50 ng/μL) for each digest. The reaction mix for the digest is prepared according to the following table.
Table 12. BIpI I MIyI reaction mix
Figure imgf000065_0001
[0188] The samples are mixed by vortexing and spinning in a microfuge and are then incubated at 370C for 16 hours and stored at 4°C. The digested DNA is then fractionated into four fractions using capillary electrophoresis using P/ACE MDQ Capillary Electrophoresis System. Fraction 1 (less than 603 bp) contains nucleic acids with 6-62 CCG tandem repeats, fraction 2 (603-840 bp) contains 63-140 CCG tandem repeats, fraction 3 (841-1078bρ) contains 141-220 CCG tandem repeats and fraction 4 (1079 bp - 9 kb) contains 221-2000+ CCG tandem repeats. Following fractionation, an aliquot of each of the four fractions is digested with the restriction endonuclease, Bmtl, which cleaves the marker from the CCG tandem repeat region. The Bmtl reaction mix is prepared according to the following table.
Table 13. Bmtl reaction mix
Figure imgf000065_0002
[0189] The digestion reaction mixes are incubated at 37°C for 16 hours and stored at 4°C. [0190] Following the second nucleic acid digestion, each of the four fractions are subjected to PCR using the following primers:
Table 14. Primers for PCR amplification of BIpI I MIyI fragments
Figure imgf000066_0001
[0191) A PCR master mix for amplifying fragments for size analysis is prepared according to Table 15.
Table 15. PCR Master Mix
Figure imgf000066_0002
*, ** 2.5 μL 1OX PCR buffer contains 15 mM MgCl2, 50 mM KCl.. Total reaction MgCl2 is 2.25 mM, KCl is 105mM. [0192] A final PCR amplification mixture is made by adding 0.5 μL HotStarTaq (Qiagen) is added to each individual PCR reaction followed by 5 μL digested fractionated DNA. The final amplification mixtures are sealed tightly in plates with Microseal A film. The plates are vortexed briefly (approximately 5 sec) and spun down for approximately 30 sec in a plate centrifuge at 2,000-6,00Og (1 ,600 rpm in a Sorvall T6000D centrifuge). The plate is transferred to the ABI 9700 thermal cycler.
[0193] The thermal cycler conditions for amplification are as follows:
Step 1 95°C for 15 minutes.
Step 2 95°C for 30 seconds
Step 3 55°C for 30 seconds.
Step 4 72°C for 60 seconds
Step 5 Steps 2-4 repeated, 33 times.
Step 6 72°C for 10 minutes.
Step 7 4°C indefinitely.
[0194] 2 μL PCR product is combined with 10.5 μL Hi-Di formamide (Applied Biosystems) with ROX 350 size standard (Applied Biosystems), heated at 95 0C for 5 minutes followed by 5 minutes on ice. Samples are then loaded onto the ABI 3100 genetic analyzer for detection.
EXAMPLE 9
Detection of Expansion Mutations of the Tandem Repeat Region of the FMRl Gene
[0195] In this example, expansion mutations of the tandem repeat region of the FMRl gene are detected by fragmentation with Sphl and Bmtl, size fractionation, followed by a second restriction enzyme digestion with BstNI. Thus, genomic DNA test samples are restriction endonuclease digested with Sphl and Bmtl using approximately 1.5 μg of test or control DNA (purified and diluted to a concentration of 50 ng/μL) for each digest. The reaction mix for the digest is prepared according to the following table. Table 16. Sphl I Bmtl reaction mix
Figure imgf000068_0001
[0196J The samples are mixed by vortexing and spinning in a microfuge and are then incubated overnight at 37°C. The digested DNA is then fractionated into four fractions using capillary electrophoresis. Fraction 1 contains nucleic acids with 6-62 tandem repeats, fraction 2 contains 63-163 tandem repeats, fraction 3 contains 164-196 tandem repeats and fraction 4 contains greater than tandem repeats. Following fractionation, an aliquot of each of the four fractions is digested with a second restriction endonuclease, Bmtl, that cleaved the marker from the CCG tandem repeat region. Following the second nucleic acid fractionation, each of the four fractions are subjected to PCR using the following primers:
Table 17. Primers for PCR amplification of Sphl I Bmtl fragments
Figure imgf000068_0002
[01971 PCR is conducted using the following conditions:
Step 1 95°C for 15 minutes.
Step 2 95°C for 30 seconds
Step 3 55°C for 30 seconds.
Step 4 72°C for 60 seconds
Step 5 Steps 2-4 repeated, 33 times.
Step 6 72°C for 10 minutes.
Step 7 4°C indefinitely.
(0198] PCR products are then loaded onto the ABI 3100 genetic analyzer for detection. EXAMPLE 10
Identification of a Control Fragment for Use in the FMRl Assay
[0199] An AIuI fragment from a region of the genome distinct from FMRl, that was larger than 6000 bases, containing trinucleotide repeats, and having a high GC content and/or CpG islands was identified for use as a control fragment in the FMRl assay. This fragment was identified as follows. All AIuI sites in the human genome were identified using the EMBOSS Restrict program (Rice et al., "EMBOSS: The European Molecular Biology Open Software Suite." Trends in Genetics 16(6):276-7 (2000)), resulting in a predicted 11.5 million fragments produced by a digestion with A IuI. From these fragments, 20 fragments having a length longer than 6000 bases were identified using the TACG program (Mangalam, HJ. "tacg - a grep for DNA." BMC Bioinformatics 3:8 (2002)). The sequences of these 20 fragments were obtained using EMBOSS ExtractSeq. Of these 20 fragments, one fragment corresponding to a region of the USP41 gene on chromosome 22 (i.e., chromosome 22:19033185-19041663) was found to have a large trinucleotide repeat region. The GC content was analyzed using EMBOSS geecee and the CpG island analysis was performed using EMBOSS cpgseek and newcpgreport.
EXAMPLE 11
Determination of the Size of the Tandem Repeat Region of FMRl using PCR
[0200] To determine the size of the tandem repeat region of the FMRl region, the region is amplified by the polymerase chain reaction in the presence of a fluorescently-labeled primer (e.g., 6-FAM) and the sizes of the resulting labeled amplicons are determined by capillary electrophoresis. A second trinucleotide repeat (CAG in the X-linked androgen receptor gene) is co-amplified using a primer pair in which one of the primers is fluorescently-labeled and co-analyzed to provide an internal amplification control.
[0201] Thus, the tandem repeat region of the FMRl gene is amplified using FX-5F (SEQ ID NO:29) as the forward primer and FX-3F (SEQ ID NO:30) as the reverse primer, and trinucleotide repeat of the X-linked androgen receptor gene using AR-5F (SEQ ID NO:37) as the forward primer and AR-R2 (SEQ ID NO.38) as the reverse primer, as set forth in the table below.
Table 18. Primers for PCR amplification
Figure imgf000070_0001
[0202] A PCR master mix for amplifying fragments for size analysis is prepared according to Table 19.
Table 19. PCR Master Mix
Figure imgf000070_0002
[0203] The PCR master mix is aliquoted into 1,100 μL aliquots to which 5.5 μL of Taq polymerase (5 U/μL) from Qiagen and 22 μL of Pfu DNA polymerase (2.5 U/μL) from Stratagene are added to make the polymerase/master mix solution.
[0204] Genomic DNA is diluted to 20 ng/μL in TE buffer. Samples are heated to 93-97°C for 4-6 minutes and cooled on ice. 10 μL of polymerase/master mix solution is added to 2 μL (40 ng) diluted genomic DNA. [0205] The samples are transferred to the ABI 9700 thermal cycler once the thermal cycler has reached 85°C +/- 2°C. The PCR conditions for amplification are as follows:
Step 1 95°C for 6 minutes
Step 2 95°C for 1 minute
Step 3 600C for 2 minutes
Step 4 75°C for 5 minutes
Step 5 Steps 2-4 repeated, 31 times
Step 6 75°C for 13 minutes
Step 7 4°C indefinitely.
[0206] PCR products are then loaded onto the ABI 3100 genetic analyzer for detection.
EXAMPLE 12 Carrier screening for Fragile X Syndrome
[0207] In this example, samples from male and female individuals were screened for carrier status of fragile X syndrome by a two-step method, in which samples were initially screened with multiplex PCR to establish gender and size of the FMRl region and were subjected to further sizing analysis based on the results of the multiplex PCR. All samples that were determined to be female and heterozygous for two normal alleles were not subjected to further analysis. All samples determined to be female and apparently homozygous at the FMRl locus (24% of all analysis) were subjected to further analysis to determine the size of the FMRl tandem repeat region. Samples identified as male and hemizygous for a normal FMRl allele were not subjected to further analysis, whereas samples identified as male but exhibiting no FMRl amplification are subjected to further analysis to determine the size of the FMRl tandem repeat region.
[0208] Genomic DNA was extracted from 150 μL whole blood collected in EDTA anticoagulated blood collection vacuum tubes using an Xtractor GeneTM (Corbett Life Science, Mortlake, NSW, Australia) according to manufacturer's Whole Blood DNA Extraction Protocol. The final elution was carried out in 100 μL buffer to consistently yield concentrations of 50 - 100 ng/μL.
[0209] Genomic DNA samples were then analyzed by a multiplex PCR consisting of an amplification of the FMRl tandem repeat region using FX-5F primer (FAM-labeled; SEQ ID NO:29) and FX-3F primer (SEQ ID NO:30); amplification of a region of the amelogenin gene to establish gender using AMLF2 primer, 5'-
AGTACTTGACCACCTCCTGATCTACAAGG 3' (FAM-labeled; SEQ ID NO:40) and AMLR2 primer, 5'-TTTTTAACAGTTTACTTGCTGATAAAACTCAYCCC 3' (SEQ ID NO:41), and amplification the trinucleotide repeat of the X-linked androgen receptor using AR-5F (HEX-labeled SEQ ID NO:37) and AR-R2 primer (SEQ ID NO:38) as a positive internal control.
[0210] A PCR mastermix for the multiplex PCR was prepared consisting of 3.3 μM of each of the above primers, IX Qiagen Standard PCR buffer, 0.4 mM MgCl2, 2% DMSO, 1 X Qiagen Q Solution, 0.2 mM dNTP, and 0.25 unit Qiagen Taq DNA polymerase (Qiagen, Valencia, CA), 0.5 unit Pfu DNA Polymerase (Strategene, La Jolla, CA). One μL of isolated DNA solution was added to 10 μL of the multiplex primer mix to a final volume of 11 μL. The PCR conditions were as follows: 95 0C for 6 min followed by 32 cycles of 950C for 1 min, 60 0C for 2 min, 75 0C for 5 min, and finally the amplified products were extended at 75 0C for 15 min. The PCR fragments were analyzed on an ABI 3100 automated DNA sequencer (Applied Biosystems, Foster City, CA, USA) and fragment analysis accomplished with ABI GeneScanTM V3.7 and GenotyperTM V3.7 software (Applied Biosystems). The amelogenin primer pair results in a 134 bp amplicon corresponding to the X chromosome homolog and a 140 bp amplicon corresponding to the Y chromosome.
[0211] For the further analysis to determine size of the tandem repeat region of the FMRl gene, genomic DNA was digested for 16 hours at 37°C with restriction enzymes BIpI and MIyI (New England BioLabs, Ipswich, MA3 USA). Following incubation the restriction fragments were either pressure injected or vacuum injected onto a P/ ACE™ MDQ capillary electrophoresis system with an UV/Vis Detector (Beckman Coulter, Fullerton, CA, USA). Undenatured double stranded DNA was separated at an electric field strength of 100 V/cm, in IX TBE buffer (90 mM Tris-Borate, 2 mM EDTA, pH 8.3). Capillary temperature was maintained at 25 0C. Four fractions were collected into 0.1X TBE buffer (9 mM Tris-borate, 0.2 mM EDTA, pH 10) The initial fraction consisted of molecular weights between 400 bps to 600 bps; the second fraction included molecular weights of between 600 bps to 800 bps; the third fraction included molecular weights of 800 bps to 1,000 bps; and the fourth fraction included molecular weights of 1,000 bps to 8,000 bps. No internal control for incomplete restriction digestion was used because failure of either enzyme to cleave in FMRl would result in a fragment too large to be collected in any fraction.
[0212] AU collected fractions were then subjected to restriction enzyme digestion with Bmtl (New England BioLabs) according to manufacturer's procedure in order to cleave the marker sequence from the tandem repeat region. Five μL of each digested fraction was transferred to 96-well plates containing 20 μL PCR mix in each well. This PCR mix consists of IX Qiagen Standard PCR buffer, 1.5 mM MgCl2, 5% DMSO, 100 mM KCl, 0.2 mM dNTP, 2.5 units HotStart Taq DNA polymerase (Qiagen Inc), and lμM each of following primers: FXCEF2 primer (FAM-labeled; SEQ ID NO:6), FXCER2 primer (SEQ ID NO:7), and 0.01 μM each of following primers: BIpIF primer (HEX-labeled, SEQ ID NO:12), BIpIR primer (SEQ ID NO: 13), lgctrlF primer (FAM-labeled, SEQ ID NO:20), lgctrlR primer (SEQ ID NO:21), F2ctrllF primer (HEX-labeled, SEQ ID NO:16), F2ctrllR primer (SEQ ID NO: 17), F3ctrl3F primer (FAM-labeled SEQ ID NO: 18), F3ctrl3R primer (SEQ ID NO: 19). The PCR conditions were as follows: 95 0C for 15 min following by 33 cycles of 95 0C for 30 sec, 55 0C for 30 sec, 72 0C for 1 min, and finally the amplified products were extended at 72 0C for 10 min. The final PCR products were then analyzed on a 3100 Prism Genetics Analyzer (Applied Biosystems) using GenescanTM-350 ROX size standard (Applied Biosystems).
[0213] 1 ,662 blood samples, having been stripped of identifying data were submitted to analysis by the above described method. In this population of samples, there were 995 females and 557 males. Of the female individuals, 6 were determined to be premutation carriers (0.6%) and 7 were determined to be full mutation carriers (0.7%). All 13 carriers were correctly identified as verified by standard PCR/Southem blot analysis, leading to a sensitivity of 100%. A single patient interpreted as a noncarrier by the standard PCR / Southern blot assay, appeared to be a premutation carrier by the above method. This patient may be mosaic for a premutation allele or this represents a false positive result. Due to the anonymous nature of the samples, it was not possible to review the Southern blot data or to retest the sample. Assuming this result is a false positive, the specificity of the above method is 99.5%. Of the 557 males, there was one premutation carrier and 5 affected individuals. These determinations were confirmed by Southern blot analysis. Thus, the above method detected all premutation carrier males and affected males with no false positive results on the 551 unaffected males.
[0214] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. All nucleotide sequences provided herein are presented in the 5' to 3' direction.
[0215] The inventions illustratively described herein may suitably be practiced in the absence of any element or elements, limitation or limitations, not specifically disclosed herein. Thus, for example, the terms "comprising", "including," containing", etc. shall be read expansively and without limitation. Additionally, the terms and expressions employed herein have been used as terms of description and not of limitation, and there is no intention in the use of such terms and expressions of excluding any equivalents of the features shown and described or portions thereof, but it is recognized that various modifications are possible within the scope of the invention claimed.
[0216] Thus, it should be understood that although the present invention has been specifically disclosed by preferred embodiments and optional features, modification, improvement and variation of the inventions embodied therein herein disclosed may be resorted to by those skilled in the art, and that such modifications, improvements and variations are considered to be within the scope of this invention. The materials, methods, and examples provided here are representative of preferred embodiments, are exemplary, and are not intended as limitations on the scope of the invention.
[0217] The invention has been described broadly and generically herein. Each of the narrower species and sύbgeneric groupings falling within the generic disclosure also form part of the invention. This includes the generic description of the invention with a proviso or negative limitation removing any subject matter from the genus, regardless of whether or not the excised material is specifically recited herein.
[0218] In addition, where features or aspects of the invention are described in terms of Markush groups, those skilled in the art will recognize that the invention is also thereby described in terms of any individual member or subgroup of members of the Markush group. [0219] All publications, patent applications, patents, and other references mentioned herein are expressly incorporated by reference in their entirety, to the same extent as if each were incorporated by reference individually. In case of conflict, the present specification, including definitions, will control.
[0220] Other embodiments are set forth within the following claims.

Claims

That which is claimed is:
1. A method for determining the size of a nucleic acid segment in a nucleic acid sample, comprising, separating fragments of a nucleic acid, said fragments prepared from a nucleic acid-containing sample, wherein said fragments include some which contain said segment and a marker sequence, said separating into fractions according to size under conditions in which a fragment containing said segment will be located in said fractions according to the size of the segment, and identifying those fraction(s) containing the segment by detecting the marker sequence, wherein the size of the segment is determined by the fraction in which it is identified.
2. The method of claim 1 wherein the nucleic acid segment is difficult to amplify by polymerase chain reaction.
3. The method of claim 2 wherein said nucleic acid segment is rich in guanine and cytosine bases.
4. A method according to claim 1 , wherein said fragments are prepared using one or more restriction endonucleases.
5. A method according to claim 1, wherein said fractions are chosen to separate fragments containing segments having a normal size from fragments containing segments having an abnormal size.
6. A method according to claim 1, wherein said separating is into a number of fractions selected from the group consisting of 2-16.
7. A method according to claim 1 , wherein said separating is by electrophoresis.
8. A method according to claim 7, wherein said electrophoresis is capillary electrophoresis.
9. A method according to claim 1 , wherein said separating is by chromatography.
10. A method according to claim 1 , further comprising after said separating step, cleaving all or a portion of the segment from said marker sequence in fragments which contain said segment and said marker sequence.
1 1. A method according to claim 10, wherein said cleaving is accomplished using a restriction endonuclease.
12. A method according to claim 1, wherein said detecting further comprises amplifying said marker sequence.
13. A method according to claim 12, wherein said amplifying is accomplished with polymerase chain reaction (PCR).
14. A method according to claim 13, wherein said PCR includes a labeled primer.
15. The method of claim 1 , wherein said detecting is accomplished using the Taqman PCR detection system.
16. A method according to claim 1 , wherein said marker sequence is detected by hybridizing to two probes that hybridize simultaneously to the marker sequence.
17. A method according to claim 1 , wherein said segment comprises a tandem repeat region.
18. A method according to claim 17, wherein the size determined is the number of repeats in said tandem repeat region.
19. A method according to claim 19, wherein said fractions are chosen to separate fragments with a normal number of tandem repeats from fragments containing an abnormal number of tandem repeats.
20. A method according to claim 17, wherein said tandem repeats are trinucleotide tandem repeats.
21. A method according to claim 17, wherein said tandem repeat is associated with a gene selected from the group consisting of DRPLA, EPMl, FRAXA, FMRl, FRDA, huntingtin, JPH-3, AR, DMl , DM2, SCAl , SCA2, SCA3, SCA6, SCA7, SCA8, SCAl 0, SCA12, SCA17, PABPNl and HOXD13.
22. A method according to claim 21 wherein said tandem repeat is associated with a gene selected from the group consisting of FMRl, FRDA, and DMl.
23. A method according to claim 22, wherein said gene is FMRl .
24. A method of detecting a mutation in a tandem repeat segment of a gene in a nucleic acid sample, wherein said mutation is characterized by a change in the number of repeats compared to the number of repeats in the wild type allele, said method comprising separating fragments of a nucleic acid, said fragments prepared from a nucleic acid- containing sample, wherein said fragments include some which contain said tandem repeat segment and a marker sequence, said separating into fractions according to size under conditions in which a fragment containing the tandem repeat segment will be located in said fractions according to the number of repeats in the tandem repeat segment; identifying those fraction(s) containing the segment by detecting the marker sequence, wherein the number of repeats in the tandem repeat segment is determined by the fraction in which it is identified, and comparing the number of repeats in the tandem repeat segment from the nucleic acid sample to the number in the corresponding wild type allele, wherein a number of repeats differing from the number in the wild type allele is indicative of a mutation.
25. A method according to claim 24, further comprising, determining if a mutation is a premutation or a full mutation by comparing the number of repeats in the tandem repeat segment from the nucleic acid sample to the number in the corresponding full mutation allele, wherein a number of repeats greater than the wild type allele but less than the full mutation is indicative of a premutation allele, whereas a number of repeats greater than or equal to the full mutation is indicative of a full mutation allele.
26. A method according to claim 24, wherein said fragments are obtained using one or more restriction endonucleases.
27. A method according to claim 26, wherein said one or more restriction endonucleases are selected from the group consisting of Alul, Sphl, Bmtl, BIpI, MIyI, and BstNI.
28. A method according to claim 27, wherein said one or more restriction endonuclease is AIuI.
29. A method according to claim 27, wherein said one or more restriction endonucleases are Sphl and Bmtl.
30. A method according to claim 27, wherein said one or more restriction endonucleases are BIpI and MyII.
31. A method according to claim 24, wherein said separating is by electrophoresis.
32. A method according to claim 33, wherein said electrophoresis is capillary electrophoresis.
33. A method according to claim 24, wherein said separating is by chromatography.
34. A method according to claim 24, wherein said fractions are chosen to separate fragments containing tandem repeat segments having a normal number of tandem repeats from fragments containing tandem repeat segments having an abnormal number of repeats.
35. A method according to claim 24, wherein said separating is into a number of fractions selected from the group consisting of 2-16.
36. A method according to claim 24, further comprising after said separating step, cleaving all or a portion of the tandem repeat segment from said marker sequence in fragments which contain said tandem repeat segment and said marker sequence.
37. A method according to claim 36, wherein said cleaving is accomplished using a restriction endonuclease.
38. A method according to claim 37, wherein said restriction endonuclease is selected from the group consisting of BsaWI, Hphl, Bbvl, BstNI, Hpyl881, SmII, and Bmtl.
39. A method according to claim 24, wherein said detecting further comprises amplifying said marker.
40. A method according to claim 39, wherein said amplifying is accomplished with polymerase chain reaction (PCR).
41. A method according to claim 40 wherein said amplifying employs a detectably labeled primer.
42. A method according to claim 41 said detecting is accomplished with electrophoresis.
43. The method of claim 24, wherein said detecting is accomplished using the Taqman PCR detection system.
44. A method according to claim 24, wherein said marker sequence is detected by hybridizing to two probes that hybridize simultaneously to the marker sequence.
45. A method according to claim 24, wherein said tandem repeats are trinucleotide tandem repeats.
46. A method of identifying FMRl alleles having a normal number of tandem repeats, a premutation, or a full mutation in the nucleic acid of an individual, said method comprising, separating fragments of a nucleic acid, said fragments prepared from a nucleic acid containing sample of the individual, wherein said fragments include some which contain a tandem repeat segment of the FMRl gene and a marker sequence, said separating into fractions according to size under conditions in which a fragment containing the tandem repeat segment having a normal number of repeats will be located in a first fraction; and a fragment containing a tandem repeat segment having a premutation will be located in a second fraction; and a fragment having a tandem repeat region having a full mutation will be located in a third fraction, identifying those fraction(s) containing the tandem repeat segment by detecting the marker sequence, wherein the number of repeats in the tandem repeat segment is determined by the fraction in which it is identified, wherein a positive result in the first fraction indicates the individual has an FMRl allele with a normal number of tandem repeats; a positive result in the second fraction indicates the individual has a premutation FMRl allele; and a positive result in the third fraction indicates the individual has a full mutation FMRl allele.
47. A method according to claim 46, wherein said fragments are obtained using one or more restriction endonucleases.
48. A method according to claim 47, wherein said one or more restriction endonucleases are selected from the group consisting of AIuI, Sphl, Bmtl, BIpI, MIyI, and BstNI.
49. A method according to claim 48, wherein said one or more restriction endonuclease is AIuI.
50. A method according to claim 48, wherein said one or more restriction endonucleases are Sphl and Bmtl.
51. A method according to claim 48, wherein said one or more restriction endonucleases are BIpI and MyII.
52. A method according to claim 46, wherein said separating is by electrophoresis.
53. A method according to claim 46, wherein said separating is by chromatography.
54. A method according to claim 46, further comprising after said separating step cleaving all or a portion of the tandem repeat segment from said marker sequence fragments which contain said tandem repeat segment and said marker sequence.
55. A method according to claim 54, wherein said cleaving is accomplished using a restriction endonuclease.
56. A method according to claim 55, wherein said restriction endonuclease is selected from the group consisting of BsaWI, Hphl, Bbvl, BstNI, Hpyl881, SmII, and Bmtl.
57. A method according to claim 46, wherein said detecting further comprises amplifying said marker.
58. A method according to claim 57, wherein said amplifying is accomplished with polymerase chain reaction (PCR).
59. A method according to claim 58 wherein said amplifying employs a detectably labeled primer.
60. A method according to claim 59 said detecting is accomplished with electrophoresis.
61. The method of claim 46, wherein said detecting is accomplished using the Taqman PCR detection system.
62. A method of determining the size of a tandem repeat segment in a sample of nucleic acids, said method comprising a) measuring the size of said tandem repeat segment by a first method which comprises amplification of the tandem repeat segment, b) measuring the size of said tandem repeat segment by a second method according to claim 17, and c) using the information obtained in steps a) and b) to determine the size of the tandem repeat segment.
63. A method according to claim 62, wherein the amplification of step a) is accomplished with polymerase chain reaction (PCR).
64. A method according to claim 63, further comprising sizing of the amplified tandem repeat region using electrophoresis.
65. A method for screening male and female individuals for carrier status of mutations in the tandem repeat region of the FMRl gene, said method comprising assaying nucleic acids from an individual to determine gender; and assaying said nucleic acid to determine the length of the tandem repeat region of the FMRl gene wherein said determining comprises amplifying tandem repeat region, detecting an amplification product, and determining the number of tandem repeats in the amplification product, wherein in male individuals: the presence of an amplification product having less than 55 tandem repeats indicates the individual is not a carrier, the presence of an amplification product having 55 or more tandem repeats indicates the individual is a carrier, or in the absence of an amplification product, the carrier status is undetermined; and in female individuals: the presence of an amplification product having more than 55 tandem repeats indicates the individual is a carrier; or the presence of a single amplification product having less than 55 tandem repeats, the carrier status is undetermined.
66. A method according to claim 65, further comprising, analyzing undetermined individuals to determine carrier status, wherein said analyzing comprises, separating fragments of a nucleic acid, said fragments prepared from a nucleic acid containing sample of the individual, wherein said fragments include some which contain a tandem repeat segment of the FMRl gene and a marker sequence, said separating into fractions according to size under conditions in which a fragment containing the tandem repeat segment having a normal number of repeats will be located in a first fraction; and a fragment containing a tandem repeat segment having a premutation will be located in a second fraction; and a fragment having a tandem repeat region having a full mutation will be located in a third fraction, identifying those fraction(s) containing the segment by detecting the marker sequence, wherein the number of repeats in the tandem repeat segment is determined by the fraction in which it is identified, wherein in male individuals: a positive result in the first fraction indicates the individual is not a carrier, a positive result in the second fraction indicates the individual is a premutation carrier, a positive result in the third fraction indicates the individual is affected; and in female individuals: a positive result in only the first fraction indicates the individual is homozygous for the a normal allele; a positive result in the second fraction indicates the individual is a premutation carrier; and a positive result in the third fraction indicates the individual is a full mutation carrier.
67. A method according to claim 65, wherein said assaying nucleic acids to determine gender comprises nucleic acid amplification.
68. A method according to claim 67, wherein said nucleic acid amplification comprises amplifying a region of the amelogenin gene which produces different sizes of amplification products from the amelogenin gene on the X chromosome and the amelogenin gene on the Y chromosome, determining the size of the amplification product or products, wherein the presence of one product of a single size indicates the gender is female and the presence of two products of different sizes indicates the gender is male.
69. A method according to claim 67, wherein said amplifying of a region of the amelogenin gene is performed in multiplex with the amplifying of the tandem repeat region of the FMRl gene.
70. A method according to claim 69, wherein said multiplex amplification further comprises amplifying one or more control sequences.
71. A method according to claim 66, further comprising after said separating step cleaving all or a portion tandem repeat segment from said marker sequence fragments in which contain said tandem repeat segment and said marker sequence.
72. A kit for detecting the size of a particular nucleic acid segment in a sample comprising a primer pair for amplifying a marker nucleotide sequence upstream or downstream of the particular nucleic acid segment, and one or more restriction endonucleases for cleaving the nucleic acid sample to generate a fragment of the nucleic acid sample which contains the particular nucleic acid segment and the upstream or downstream marker sequence, wherein said particular nucleic acid segment is a tandem repeat sequence.
73. A kit according to claim 72, further comprising one or more restriction enzymes for separating the particular nucleic acid segment from the marker sequence.
74. A kit according to claim 72, further comprising one or more controls that verify proper size separation of nucleic acids digested with the restriction endonuclease.
75. A kit according to claim 72 wherein the tandem repeat segment is from the FMRl gene.
PCT/US2007/008985 2006-10-05 2007-04-11 Nucleic acid size detection method WO2008045136A2 (en)

Priority Applications (8)

Application Number Priority Date Filing Date Title
AT07755301T ATE515573T1 (en) 2006-10-05 2007-04-11 METHOD FOR DETECTING THE SIZE OF NUCLEIC ACIDS
US12/444,361 US8697399B2 (en) 2006-10-05 2007-04-11 Nucleic acid size detection method
BRPI0720549-0A BRPI0720549A2 (en) 2006-10-05 2007-04-11 NUCLEIC ACID SIZE DETECTION METHOD
JP2009531370A JP5386357B2 (en) 2006-10-05 2007-04-11 Nucleic acid size detection method
AU2007307320A AU2007307320B2 (en) 2006-10-05 2007-04-11 Nucleic acid size detection method
EP07755301A EP2082055B1 (en) 2006-10-05 2007-04-11 Nucleic acid size detection method
CA002665723A CA2665723A1 (en) 2006-10-05 2007-04-11 Nucleic acid size detection method
MX2009003736A MX2009003736A (en) 2006-10-05 2007-04-11 Nucleic acid size detection method.

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/544,293 2006-10-05
US11/544,293 US8163480B2 (en) 2006-10-05 2006-10-05 Nucleic acid size detection method

Publications (2)

Publication Number Publication Date
WO2008045136A2 true WO2008045136A2 (en) 2008-04-17
WO2008045136A3 WO2008045136A3 (en) 2008-12-11

Family

ID=39283328

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2007/008985 WO2008045136A2 (en) 2006-10-05 2007-04-11 Nucleic acid size detection method

Country Status (10)

Country Link
US (2) US8163480B2 (en)
EP (1) EP2082055B1 (en)
JP (1) JP5386357B2 (en)
CN (1) CN101595227A (en)
AT (1) ATE515573T1 (en)
AU (1) AU2007307320B2 (en)
BR (1) BRPI0720549A2 (en)
CA (1) CA2665723A1 (en)
MX (1) MX2009003736A (en)
WO (1) WO2008045136A2 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010094061A1 (en) * 2009-02-17 2010-08-26 Murdoch Childrens Research Institute Assay for determining epigenetic profiles of markers of fragile x alleles
WO2014114922A1 (en) * 2013-01-23 2014-07-31 Medical Research Council Methods for estimating the size of disease-associated polynucleotide repeat expansions in genes
WO2016177020A1 (en) * 2015-05-05 2016-11-10 华南师范大学 Dna fragmentation method and device for implementing same
US10138521B2 (en) 2010-08-11 2018-11-27 Murdoch Childrens Research Institute Treatment and diagnosis of epigenetic disorders and conditions
US11237130B2 (en) 2015-07-01 2022-02-01 Cytiva Sweden Ab Method for determining a size of biomolecules

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8163480B2 (en) * 2006-10-05 2012-04-24 Quest Diagnostics Investments Incorporated Nucleic acid size detection method
WO2008070862A2 (en) * 2006-12-07 2008-06-12 Biocept, Inc. Non-invasive prenatal genetic screen
US8409805B2 (en) * 2009-02-13 2013-04-02 Asuragen, Inc. Method of amplification of GC-rich DNA templates
CN105296474B (en) 2009-03-24 2020-05-12 奥斯瑞根公司 PCR method for characterizing 5' untranslated region of FMR1 and FMR2 genes
CN101871001B (en) * 2009-04-22 2013-09-25 中山大学达安基因股份有限公司 Kit for detecting fragile X syndrome
ES2573092T3 (en) * 2009-07-10 2016-06-06 Perkinelmer Health Sciences, Inc. Multinucleotide Repeat Detection
US9382586B2 (en) 2010-02-05 2016-07-05 Quest Diagnostics Investments Incorporated Method to detect repeat sequence motifs in nucleic acid
US20130316339A1 (en) * 2010-09-01 2013-11-28 Orion Genomics Llc Detection of nucleic acid sequences adjacent to repeated sequences
WO2012138289A1 (en) * 2011-04-08 2012-10-11 Zain-Luqman Rula Diagnosis and treatment of friedreich's ataxia
AU2012318290B2 (en) 2011-11-04 2015-07-30 Gen-Probe Incorporated Molecular assay reagents and methods
US9371560B2 (en) 2012-07-20 2016-06-21 Asuragen, Inc. Comprehensive FMR1 genotyping
EP2732815A1 (en) * 2012-11-16 2014-05-21 Neurochlore Modulators of intracellular chloride concentration for treating fragile X syndrome
US9273349B2 (en) * 2013-03-14 2016-03-01 Affymetrix, Inc. Detection of nucleic acids
WO2014152822A2 (en) * 2013-03-14 2014-09-25 Quest Diagnostics Investments Incorporated Method for detecting cystic fibrosis
GB201418144D0 (en) * 2014-10-14 2014-11-26 Univ Cardiff High throughput sequencing
CN104816447B (en) * 2015-04-30 2017-03-08 青岛科技大学 A kind of planetary gear type kneading device
JP2019013206A (en) * 2017-07-11 2019-01-31 国立大学法人帯広畜産大学 Methods and kits for detecting azole-resistant aspergillus fumigatus
CN110184344A (en) * 2019-06-28 2019-08-30 北京和合医学诊断技术股份有限公司 Detect the method and primer pair of HTT gene C AG trinucleotide repeats sequence
CN110923305B (en) * 2019-11-25 2023-12-29 广州市达瑞生物技术股份有限公司 DNA molecular weight standard suitable for southern blot hybridization detection of fragile X syndrome
CA3192359A1 (en) 2020-08-18 2022-02-24 Enviro Metals, LLC Metal refinement

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB8606719D0 (en) 1986-03-19 1986-04-23 Lister Preventive Med Genetic probes
US5135717A (en) 1986-12-24 1992-08-04 British Technology Group Usa Inc. Tetrabenztriazaporphyrin reagents and kits containing the same
US5206137A (en) 1988-09-08 1993-04-27 Lifecodes Corporation Compositions and methods useful for genetic analysis
NL9001639A (en) 1990-07-19 1992-02-17 Amc Amsterdam PT-CONTAINING COMPOUND, METHOD FOR PREPARING IT, AND USE OF SUCH COMPOUNDS.
US5714327A (en) 1990-07-19 1998-02-03 Kreatech Diagnostics Platinum-containing compounds, methods for their preparation and applications thereof
US5364759B2 (en) 1991-01-31 1999-07-20 Baylor College Medicine Dna typing with short tandem repeat polymorphisms and identification of polymorphic short tandem repeats
US5652099A (en) 1992-02-12 1997-07-29 Conrad; Michael J. Probes comprising fluorescent nucleosides and uses thereof
US5695933A (en) * 1993-05-28 1997-12-09 Massachusetts Institute Of Technology Direct detection of expanded nucleotide repeats in the human genome
US5599666A (en) 1994-03-28 1997-02-04 Promega Corporation Allelic ladders for short tandem repeat loci
US6013444A (en) 1997-09-18 2000-01-11 Oligotrail, Llc DNA bracketing locus compatible standards for electrophoresis
US6238863B1 (en) 1998-02-04 2001-05-29 Promega Corporation Materials and methods for indentifying and analyzing intermediate tandem repeat DNA markers
US6074831A (en) 1998-07-09 2000-06-13 Agilent Technologies, Inc. Partitioning of polymorphic DNAs
US6960437B2 (en) 2001-04-06 2005-11-01 California Institute Of Technology Nucleic acid amplification utilizing microfluidic devices
US7855053B2 (en) 2006-07-19 2010-12-21 The Regents Of The University Of California Methods for detecting the presence of expanded CGG repeats in the FMR1 gene 5′ untranslated region
US8163480B2 (en) * 2006-10-05 2012-04-24 Quest Diagnostics Investments Incorporated Nucleic acid size detection method

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
"Cell", vol. 72, 1993, HUNTINGTON'S DISEASE COLLABORATIVE RESEARCH GROUP., pages: 971 - 983
CAMPUZANO ET AL., SCIENCE, vol. 271, 1996, pages 1423 - 1427
CHEN ET AL., HUM. MOL. GENETICS, vol. 12, no. 23, 2003, pages 3067 - 74
FU ET AL., SCIENCE, vol. 255, 1992, pages 1256 - 1258
HUNTER JM ET AL., J NEUROSCI METH, vol. 144, 2005, pages 11 - 17
LA SPADA ET AL., NATURE, vol. 352, 1991, pages 77 - 79
VERKERK ET AL., CELL, vol. 65, 1991, pages 905 - 914

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010094061A1 (en) * 2009-02-17 2010-08-26 Murdoch Childrens Research Institute Assay for determining epigenetic profiles of markers of fragile x alleles
US10138521B2 (en) 2010-08-11 2018-11-27 Murdoch Childrens Research Institute Treatment and diagnosis of epigenetic disorders and conditions
WO2014114922A1 (en) * 2013-01-23 2014-07-31 Medical Research Council Methods for estimating the size of disease-associated polynucleotide repeat expansions in genes
WO2016177020A1 (en) * 2015-05-05 2016-11-10 华南师范大学 Dna fragmentation method and device for implementing same
US11237130B2 (en) 2015-07-01 2022-02-01 Cytiva Sweden Ab Method for determining a size of biomolecules

Also Published As

Publication number Publication date
MX2009003736A (en) 2009-07-09
JP2010505413A (en) 2010-02-25
US8697399B2 (en) 2014-04-15
EP2082055B1 (en) 2011-07-06
AU2007307320A1 (en) 2008-04-17
EP2082055A2 (en) 2009-07-29
US20080124709A1 (en) 2008-05-29
CA2665723A1 (en) 2008-04-17
US8163480B2 (en) 2012-04-24
WO2008045136A3 (en) 2008-12-11
BRPI0720549A2 (en) 2014-07-29
JP5386357B2 (en) 2014-01-15
EP2082055A4 (en) 2010-02-17
CN101595227A (en) 2009-12-02
ATE515573T1 (en) 2011-07-15
AU2007307320B2 (en) 2013-07-11
US20100167284A1 (en) 2010-07-01

Similar Documents

Publication Publication Date Title
US8697399B2 (en) Nucleic acid size detection method
US20220098679A1 (en) Nucleic acid detection combining amplification with fragmentation
US10655179B2 (en) Cystic fibrosis transmembrane conductance regulator gene mutations
US20230265510A1 (en) Method to detect repeat sequence motifs in nucleic acid
US9765390B2 (en) Methods, compositions, and kits for rare allele detection
CA2826748A1 (en) Method of detecting variations in copy number of a target nucleic acid
US9365892B2 (en) Screening method for trinucleotide repeat sequences
WO2010135917A1 (en) Method for detecting variations in nucleic acid sequences
US20070184457A1 (en) Method for long range allele-specific PCR
EP1780292A1 (en) Gene methylation assay controls
US20050053957A1 (en) Polynucleotide sequence detection assays
US20070141559A1 (en) Methods for detecting and typing herpes simplex virus
US20090181366A1 (en) Internal positive control for nucleic acid assays
Handyside et al. Pre-implantation genetic diagnosis using whole genome amplification
JP2002223761A (en) Method for detecting target nucleic acid and reagent therefor

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200780045172.9

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07755301

Country of ref document: EP

Kind code of ref document: A2

ENP Entry into the national phase

Ref document number: 2009531370

Country of ref document: JP

Kind code of ref document: A

Ref document number: 2665723

Country of ref document: CA

WWE Wipo information: entry into national phase

Ref document number: MX/A/2009/003736

Country of ref document: MX

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 1382/KOLNP/2009

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 2007307320

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 2007755301

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2007307320

Country of ref document: AU

Date of ref document: 20070411

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 12444361

Country of ref document: US

ENP Entry into the national phase

Ref document number: PI0720549

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20090406