METHOD FOR SEQUENCING NUCLEIC ACIDS USING MATRIX-ASSISTED LASER DESORPTION FONIZATION TIME-OF-FLIGHT MASS SPECTROMETRY
BACKGROUND OF THE INVENTION
Field of the Invention
The invention relates to the determination of nucleotide sequences for nucleic acids and their analogs. Brief Summary of the Related Art
Deterirύnation of nucleotide sequences for various nucleic acids has become a fundamentally important analytical step for numerous molecular biology and biomedical applications. Consequently, a variety of methods have been developed to facilitate such nucleotide sequence determinations. Maxam and Gilbert, Proc. Natl. Acad. Sci. USA 74: 5460 (1977) discloses a chemical degradation approach for DNA sequence determination. Sanger et al, Proc. Natl. Acad. Sci. USA 74: 5463 (1977), discloses a chain termination method using complementary strand primer extension to determine DNA sequences. Each of these methods utilizes four separate reaction mixtures to create a nested set of fragments differing by a single nucleotide in length and representing a complete nucleotide sequence, followed by resolution of the fragments based on their size to determine the order of the fragments and hence the nucleotide sequence. Both of these procedures take from numerous hours to days to perform, and neither is effective for determining the nucleotide sequence for certain analogs of DNA. These popular approaches to nucleotide sequence determination are limited by their time consuming methodologies and by their inapplicability to certain types of nucleic acid analytes. For example, small synthetic oligonucleotides have recently become of interest as tools in molecular biology experiments, as well as for use in the antisense therapeutic approach to disease treatment. Correct sequences are necessary to the efficacy and safety of such oligonucleotides, and effective and rapid
analytical approaches are needed for quality control. This class of compounds presents three special problems for traditional sequence determination approaches. First, quality control procedures are needed which are more rapid than the traditional approaches. Second, the oligonucleotides are generally short, often in the range of from about 15 to about 35 nucleotides in length. As a consequence, traditional sequencing approaches result in the loss of much sequence information and thus provide only incomplete information about the nucleotide sequence of the oligonudeotides. Finally, many of these oligonucleotides have either modified internucleoside linkages or substitution at the 2' position of the ribose to improve their properties as potential therapeutic agents. U.S. Patent No. 5,220,007 discloses chimeric oligonucleotides having regions of oligonudeoside phosphodiester or phosphorothioate alongside regions of oligonudeoside alkylphosphonate or phosphoramidate. PCT publication WO94/02498 discloses hybrid oligonudeotides having DNA regions alongside 2' -substituted RNA regions. Uhlman and Peyman, Chemical Reviews 90: 544 (1990), discloses oligonudeotides having a variety of modifications along the internucleoside linkages, sugar residues or nucleoside bases.
Recently, some approaches have been developed to address these special problems. U.S. Patent No. 5,403,709 disdoses a method for sequencing oligonudeotides using another oligonudeotide as an extension and a third, bridging oligonudeotide to hold the first two together for ligation. Conventional primer extension is then used to create a complement for sequencing. This approach requires some advance knowledge of a portion of the sequence of the analyte oligonudeotide. VS. Patent No. 5,525,470 discloses a similar approach which avoids the need for such advance knowledge by utilizing RNA ligase to couple the analyte and extension oligonucleotides. Both of these approaches can be used with some modified oligonudeotides, espedally with those having phosphodiester or phosphorothioate internudeotide linkages. Both, however, are tme^onsuming and limited to analyte oligonucleotides which can act as templates for the polymerase enzyme used to synthesize their complement.
Recently, Nordhoff et al, J. Mass Spec. 30: 99 (1995) disclosed a direct sequendng method for oligonudeotides using infrared matrix-assisted laser desorption/ionization mass spectrometry. This method utilized time-of-flight analysis of prompt fragment ions generated by infrared laser desorption/ionization. Fragments were reported to be generated by consecutive cleavage of the deoxyribose phosphate backbone at apurinic sites. Unfortunately, such deavage does not occur at T residues, due to failure to remove the thymidine base, and may not occur in RNA due to stabilization of the glycosidic bond. In addition, there is a problem with the prompt fragment ions overlapping with "post-source decay" fragment ions (PSD ions) from metastable decays occurring in the field-free region of the time-of-flight analyzer.
Brown and Lennon, Anal. Chem. 6_7_: 3990 (1995) disdoses sequence-spedfic fragmentation of matrix-assisted laser-desorbed protein/peptide ions and detection of the fragments using time-of-flight mass spectrometry with delayed pulsed ion extraction. The delayed pulsed ion extraction is used to reduce the generation of PSD ions by expanding the desorbed neutral plume during the extraction delay period, thereby avoiding energetic collisions believed to play a role in the generation of PSD. The technique was found to be applicable to small peptides and one spedal case larger protein. There remains a need for more rapid approaches for deterrnining the nudeotide sequence of nudeic add analytes. In addition, there is a need for new sequendng approaches that address the special sequence determination problems presented by synthetic oligonudeotides. Ideally, sudi a new approach should be rapid and universal, should act diredly on the nudeic acid analyte, and should be applicable to any chemically modified oligonucleotides.
BRIEF SUMMARY OF THE INVENTION
The invention provides a universal analytical method for determining the nucleotide sequence of nucleic acid analytes, including any chemically modified oligonucleotides. This new method utilizes matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOFMS) with delayed pulsed ion extraction. This method is extremely rapid and acts directly on the nucleic acid analyte, creating in-source fragmentation (ISF) products which are used to determine the sequence of the nucleic add analyte. Such ISF products have earlier also been known as prompt fragmentation ions, and such terms can be used interchangeably although in-source fragmentation product is the currently preferred term and will be used throughout herein. It is effective for a variety of nucleic add analytes, including any chemically modified oligonudeotides which have not previously been successfully sequenced.
The invention provides a method for determining the nucleotide sequence of nucleic add analytes comprising providing a suitable target for matrix-assisted laser desorption/ionization, such target comprising a nucleic add analyte in a matrix suitable for such matrix-assisted laser desorption/ionization; irradiating the target with laser irradiation suitable for creating ISF products from the nudeic acid analyte; delaying pulsed ion extraction for a time suffident to reduce formation of PSD ion fragments; ordering the ISF products according to molecular weight by time-of-flight mass spectrometry; and deterrnining the nudeotide sequence of the nudeic add analyte by comparing the molecular weights of the ordered ISF products.
Preferably, the suitable target will indude a solid support, sudi as the commerdally available solid supports provided by Bruker Daltonics, formerly
Bruker Analytical Systems (Billerica, MA) or PerSeptive Biosystems (Cambridge, MA). Generally, the target is prepared by applying to the solid support a solution comprising the nudeic acid analyte and a matrix suitable for matrix-assisted laser desorption/ionization (MALDI), then allowing the solution on the solid support to evaporate to dryness. The nucleic add analyte can be a naturally occurring or
synthetic polynucleotide or oligonudeotide, including oligonucleotides having chemically modified internudeoside linkages, sugar bad<bones or nudeoside bases. The suitable matrix can be selected from the matrices known in the art to be suitable for MALDI, and includes a cation exchange resin, preferably in ammonium form. Preferred matrices indude, without limitation, 3-hydroxypicolinic acid, 3- hydroxypicolinic acid/picolinic acid, 3-hydroxypicolinic acid/sucrose, 2,5- dihydroxybenzoic acid, sinapinic acid, alpha-cyano-4-hydroxy-cinnamic acid, anthranilic acid, 6-aza-2-thiothymine, nicotinic acid, 2,4,6-trihydroxybenzoic acids, 2,4,6-trihydroxyacetophenone and combinations of the same. Other suitable matrices are identified in Fitzgerald et al, Anal. Chem. 65: 3204 (1993). In one particularly preferred embodiment, the matrix is 3-hydroxypicolinic acid/N-(3- indolylacetyl)-l-leucine (3-HPA/IAL).
The target can be irradiated with any of a variety of laser irradiation sources. Preferred laser irradiation sources include, without limitation, UV lasers, visible lasers and IR lasers. Particularly preferred UV lasers include a 337 run N2 laser, a
266 nm laser, a 193 run laser and a 355 nm laser.
The pulsed ion extraction is delayed for a time sufficient to increase mass resolution of the ISF products and to reduce PSD ion formation sufficiently to improve the profile of the time-of-flight mass spec (TOFMS) by providing a voltage pulse to defied low mass ions and avoid suppression of signals of the larger ions, thus resulting in discrete peaks for each spedes of ISF product. The appropriate time for the voltage pulse depends on the size of the largest ion spedes of interest and can conveniently be adjusted to maximize the mass resolution for such ion spedes. The ISF products are readily separated according to molecular weight by
TOFMS, which orders the spedes according to their molecular weight. Since the mass of each nudeotide is known, this allows determination of the nudeotide sequence of the nucleic acid analyte by simple comparison of the molecular weights of the consecutive species in the ordered array.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 shows TOFMS spectrum for nucleotides N to N-18 for a 20-mer oligonudeotide sequenced using a preferred embodiment of the method according to the invention. The "w" ions are marked and the molecular weight assignments are shown over each peak.
Figure 2 shows TOFMS spectrum for nucleotide N-19 for the same oligonudeotide shown in Figure 1.
Figure 3 shows TOFMS spectrum for a 15-mer PS all DNA oligonudeotide. The "w" ions are marked and the molecular weight assignments are shown over each peak.
Figure 4. MALDI-TOF mass spectrum of a 17-mer PS/PO oligodeoxynudeotide (ODN) d mera. In addition to "a-B" (marked by *), "w" and "d" ions, the "a" ions are also present.
Figure 5. MALDI-TOF mass spectrum of a 25-mer PS ODN.
Figure 6. MALDI-TOF mass spectrum of a 20-mer PS ODN with 2'-0- methylation on the sugar units of the first two and last four nucleosides.
Figure 7. MALDI-TOF mass spectrum of a 18-mer PS/MP chimera.
Figure 8. RP-HPLC chromatograms of a neat 25-mer PS ODN (upper trace) and a mixture of the 25-mer PS ODN and a 17-mer PS ODN (lower trace). The mixture was separated into 5 fractions as indicated.
Figure 9. MALDI-TOF mass spectra of the 4 of the 5 fractions separated by RP- HPLC.
Figure 10. Proposed fragmentation pathway for modified oligonucleotides (MONs) containing negatively charged backbone linkage.
Figure 11. Proposed fragmentation pathway for modified oligonucleotides containing neutral backbone linkage.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
The invention relates to the determination of nudeotide sequences for nucleic acids and their analogs. The patents and publications cited herein are known to those skilled in this field and are hereby incorporated by reference in their entirety. The invention provides an analytical method for determining the nucleotide sequence of nucleic acid analytes, including any chemically modified oligonucleotides. This new method utilizes matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOFMS) with delayed pulsed ion extraction. This method is extremely rapid and acts directly on the nucleic add analyte. It is effective for a variety of nudeic acid analytes, induding any chemically modified oligonudeotides which have not previously been successfully sequenced.
The invention provides a method for determining the nucleotide sequence of nucleic acid analytes comprising providing a suitable target for matrix-assisted laser desorption/ionization, such target comprising a nudeic acid analyte in a matrix suitable for such matrix-assisted laser desorption/ionization; irradiating the target with laser irradiation suitable for creating ISF products from the nudeic acid analyte; delaying pulsed ion extraction for a time suffident to reduce formation of PSD ion fragments; ordering the ISF products according to molecular weight by time-of-flight mass spectrometry; and determining the nudeotide sequence of the nucleic add analyte by comparing the molecular weights of the ordered ISF products.
Preferably, the suitable target will indude a solid support, such as the commerdally available solid supports provided by Bruker Analytical Systems
(Billerica, MA) or PerSeptive Biosystems (Cambridge, MA). Preferred solid supports include stainless steel and polymer coated metals. Generally, the target is prepared by applying to the solid support a solution comprising the nudeic acid analyte and a matrix suitable for matrix-assisted laser desorption/ionization (MALDI), then allowing the solution on the solid support to evaporate to dryness.
The nucleic acid analyte can be a naturally occurring or synthetic polynucleotide or oligonudeotide, including oligonudeotides having diemically modified internucleoside linkages, sugar backbones or nucleoside bases. For purposes of the invention, the term "oligonudeotide" includes polymers of two or more deoxyribonucleoside, ribonucleoside or 2'-0-substituted ribonucleoside monomers, or any combination thereof. Preferably, such oligonucleotides will have from about 2 to about 100 monomers, and most preferably from about 13 to about 40. Such monomers may be coupled to each other by any of the numerous known internucleoside linkages. In certain preferred embodiments, these internucleoside linkages may be phosphodiester (PO), phosphotriester, phosphorothioate (PS), or phosphoramidate linkages, or combinations thereof. The term oligonudeotide also encompasses such polymers having chemically modified bases or sugars and/ or having additional substituents, induding without limitation lipophilic groups, intercalating agents, diamines and adamantane. For purposes of the invention the term "2'-0-substituted" means substitution of the 2' position of the pentose moiety with an -O-lower alkyl group containing 1-6 saturated or unsaturated carbon atoms, or with an -O-aryl or allyl group having 2-6 carbon atoms, wherein such alkyl, aryl or allyl group may be unsubstituted or may be substituted, e.g., with halo, hydroxy, trifluoromethyl, cyano, nitro, acyl, acyloxy, alkoxy, carboxyl, carbalkoxyl, or amino groups; or such 2' substitution may be with a hydroxy group (to produce a ribonucleoside), an amino or a halo group, but not with a 2'-H group.
The suitable matrix can be selected from the matrices known in the art to be suitable for MALDI, and indudes a cation exdiange resin, preferably in ammonium form. In a preferred embodiment, the cation exchange resin is a 200-400 mesh in ammonium form. Preferred matrices include, without limitation, 3-hydroxypicolinic add, 3-hydroxypicolinic add/picolinic add, 3-hydroxypicolinic add/sucrose, 2,5- dihydroxybenzoic add, sinapinic add, alpria-cyancH-hydroxy-c namic add, anthranilic acid, 6-aza-2-thiothymine, nicotinic add, 2,4,6-trihydroxybenzoic adds, 2,4,6-trihydroxyacetophenone and combinations of the same. Other suitable matrices are identified in Fitzgerald et al, Anal. Chem. 65: 3204 (1993). In one
particularly preferred embodiment, the matrix is 3-hydroxypicolinic acid/N-(3- indolylacetyl)-l-leucine (3-HPA/IAL).
The target can be irradiated with any of a variety of laser irradiation sources. Preferred laser irradiation sources include, without limitation, UV lasers, visible lasers and IR lasers. Particularly preferred UV lasers include a 337 nm N2 laser, a 266 nm laser and a 193 nm laser. One particularly preferred laser irradiation source is the 337 nm N2 laser provided with the MALDI unit produced by Bruker Analytical Systems. The laser power is sufficient to efficiently produce ISF products without saturating the detector system. To achieve this power level with the Bruker MALDI unit, a laser attenuation level of from about 40 to about 50 is used to give an irradiance of 1-2 107W/cm2. For other types of attenuation systems, attenuation levels of from about 10 to about 110 may be advantageously used. The laser attenuation level can be measured using a power meter (Laser Probe, Inc., Utica, NY, USA). The ion source of the mass spectrometer is opened so that the energy detector could be placed in the laser beam path in the ion source region. The energy output of the laser is recorded at a series of the neutral density attenuator settings.
Several parameters may be automatically set on the MALDI-TOFMS unit, such as on the unit produced by Bruker Daltonics. Preferably, the voltage on the target (acceleration voltage) will be from about +/- 3-30 kV (for positive or negative polarity). In one particularly preferred embodiment, such acceleration voltage will be about +25 kV. In another particularly preferred embodiment, such acceleration voltage will be about +20 kV. The voltage on the auxiliary plate after switching (P2) is preferably from about +/- 2-27 kV. In one particularly preferred embodiment, such P2 voltage is about +22.5 kV. The lens voltage is preferably from about +/- 0- 15 kV. In one particularly preferred embodiment, such lens voltage is about +9.2 kV. The voltage on the dual-microchannel-plates detector is preferably from about +/- 1.3-2.0 kV. In one particularly preferred embodiment, such detector voltage is about +1.5 kV. The time elapsed between laser firing and voltage switching on the auxiliary plate is preferably from about 100 to about 400 ns. In one particularly
preferred embodiment, such elapsed time is about 250 ns. The number of spectra summed may preferably range from about 1 to about 1000. In one particularly preferred embodiment, such number will be about 80. The vacuum in the sample chamber region is preferably at or below 5 x 10"6 torr. In one preferred embodiment, such vacuum is 5 x 10"7 torr.
The pulsed ion extraction is delayed for a time sufficient to increase mass resolution of the ISF products, and to reduce PSD ion formation sufficiently to improve the profile of the time-of-flight mass spec (TOFMS) by providing a voltage pulse to deflect low mass ions and avoid suppression of signals of the larger ions, thus resulting in discrete peaks for each of the ISF products. The MALDI-TOFMS unit produced by Bruker is available with the MS modified as to allow delay of the pulsed ion extraction following laser irradiation. The appropriate voltage and time for the ion deflection voltage pulse depends on the size of the largest ion species of interest and can conveniently be adjusted to maximize the mass resolution for such ion species. Preferably, such ion deflection voltage will range from about 0 to about 10 kV. In one particularly preferred embodiment, such ion deflection pulse will be about +2 kV. Preferably, such time will be from about 0-10 microseconds. In one particularly preferred embodiment, such time is about 1-2 microseconds.
The ISF products are readily separated according to molecular weight by TOFMS, which orders the products according to their molecular weight. The observed ISF products include the "w" and "d" ions, and also the "a" and "z" ions. As is understood in the art, the w and z ions are generated by fragmentation of the sequence from the 3' end and the a and d ions are generated by fragmentation of the sequence from the 5' end. Since the mass of each nudeotide is known, this allows determination of the nudeotide sequence of the nucleic add analyte by simple comparison of the molecular weights of the consecutive spedes in the ordered array.
The w ions together with the protonated molecular ions often allows the determination of the complete nudeotide sequence. However, when a sequence contains modified nudeotides, at times the w ions alone are insufficient to
determine the entire sequence, and in these cases the d, a and z ions provide supplemental information which allows the construction of the complete nucleotide sequence. Further the presence of the d, a, and z ions also fadlitates the determination of the backbone linkage. Thus, for the first time, the sequence and backbone linkage of an unknown sequence which includes modified nucleotides can be determined using a single technique.
Results of such fragment ordering and molecular weight assignment are shown in Figures 1, 2, 3, 4, 5, 6 and 7. The results correctly identified the nucleotide sequence of a 20-mer phosphorothioate oligonudeotide having 2 2'-0-methyl nudeosides at its 5' end and 4 2'-0-methyl nucleosides at its 3' end, Figures 1 and 2, and the nudeotide sequence of a 15-mer phosphorothioate deoxyribonudeotide, Figure 3. The results correctly identified the nudeotide sequence of a 17-mer phosphorothioate /phosphodiester (PS/PO) chimeric oligonudeotide, Figure 4. The results correctly identified the nucleotide sequence of a 25-mer phosphorothioate oligonudeotide, Figure 5. The results correctly identified the nucleotide sequence of a 20-mer phosphorothioate oligonudeotide having 2'-0-methylation on the sugar units of the first two and last four nucleosides, Figure 6. The results correctly identified the nucleotide sequence of a 18-mer phosphorothioate/ methylphosphosphonate (PS/MP) chimeric oligonudeotide, Figure 7. The generality of this method is illustrated by the successful sequendng of four distinct modified oligonudeotides as representative compounds.
The following examples illustrate certain preferred embodiments of novel methods for sequencing of oligonucleotides, in particular modified oligonudeotides. Using the method according to the invention, the ISF occurring in the MALDI process leads to the generation of sequence ions of modified oligonucleotides allowing a rapid determination of their sequences. While the fragmentation pattern appears to be little affected by the modification on the ribose ring of a nudeotide, a greater effert has been observed when modification occurs on the backbone linkages. In general, "w" ions provide the most extensive sequence information. The utility of the accompanying "a", "d" and "z" ions varies according to the type of
backbone modification. For PO ODN and MP ODN, useful confirmation or supplemental information comes from "d" ions. In contrast, "a" ions give the most useful supplemental information for PS ODN. Because the sequencing method of the invention involves no chemical or enzymatic reaction, it is by far the most general sequencing method for small antisense ODNs. Its other advantages include the ability to obtain information on the modification groups and speed; a sequence can be determined in less than 1 minute rather than hours.
The following examples are intended to further illustrate certain preferred embodiments of the invention and are not intended to limit the scope of the invention.
Example 1 Preparation of a matrix suitable for MALDI 3-hydroxypicolinic acid (3-HPA; Aldrid Chemical Co., Milwaukee, WI; 10.0 +/- 0.5 mg) was weighed and put into a 1.5 ml Eppendorf tube. To the Eppendorf tube was added 75 microliters HPLC grade water (J.T. Baker) and 75 microliters HPLC grade acetonitrile (EM Sdence, Gibbstown, NJ). The tube was capped and fixed onto the platform of a Vortex Gene 2 (Sdentific Industries, Inc. Bohemia, NY) vortexer, which was operated at vibration level 6 for 5 minutes. N-(3-indolylacetyl)- 1-leudne (IAL; Aldrich; 2.0 +/- 0.4 mg) was weighed and put into a 1.5 ml
Eppendorf tube, to which was added the freshly prepared 3-HPA solution at such volume as to result in a final IAL concentration of 16 g/1. The Eppendorf tube containing the 3-HPA/IAL was then vortexed for 1 minute. Cation exchange resin in ammonium form (Bio-Rad Laboratories, Richmond, CA; 200-400 mesh; 25 +/- 1 mg) was weighed and added to the Eppendorf tube containing the 3-HPA/IAL solution. The tube was then placed in a heating bath at 50*C for 10 minutes. The tube was then removed from the bath and allowed to cool at room temperature for 2 minutes prior to use. Storage for future use is in darkness.
Example 2
Preparation of an exemplary MALDI target An 20-mer oligonudeotide phosphorothioate having two 5' terminal 2'-0- methyl nucleosides and four 3* terminal 2'-0-methyl nucleosides was prepared as a solution in HPLC grade water (Baker) at a concentration of 5000 ppm. Four microliters of the matrix prepared according to Example 1 was transferred to an
Eppendorf tube, to which two microliters of the oligonudeotide solution was added. To the tube was then added 3 mg of cation exchange resin in ammonium form (200- 400 mesh), and the tube was then vortexed for 10 seconds. The Eppendorf tube was allowed to stand briefly, then 1 microliter of the clear solution was withdrawn and deposited on a stainless steel target (Bruker). The solvent was then allowed to evaporate at ambient conditions.
Example 3 Sequence determination of a 20-mer phosphorothioate oligonudeotide nucleic acid analyte using MALDI-TOFMS with delayed pulsed ion extraction
The MALDI target prepared according to Example 2 was placed in the sample chamber of a Bruker Matrix-assisted Laser Desorption/ionization Time-of- Flight Mass Spedrometer equipped to allow delay of pulsed ion extraction following laser irradiation. The instrument settings were as follows: (1) acceleration voltage = +25 kV; (2) P2 voltage = +22.5 kV; (3) lens voltage = +9.2 kV; (4) detector voltage = +1.5 kV; (5) time elapsed between laser firing and voltage switching on the auxiliary plate = 250 ns; (6) laser attenuation level = 40-50; (7) number of spectra summed = 80; (8) ion deflection voltage and deflection pulse duration: a +2 kV pulse was maintained for 2 microseconds to obtain a spectrum for "w" ions comprising N-18 to N for the oligonudeotide (see Figure 1); the same voltage pulse was maintained for 1 microsecond to obtain a spectrum for the "w" ion N-19 (see Figure 2). The MS was externally calibrated with a set of calibration constants generated separately from a known compound under the same conditions. Molecular weight assignments were made for each peak, from which the predicted oligonudeotide sequence was deduced.
Further, the modification on the 2' position of the ribose ring by O-methylation appears to have little effect on backbone fragmentation. When fragmented by MALDI, the 20-mer PS ODN with six 2'-0-methylated nucleotides in the sequence, two on the 5'-end and four on the 3'-end, gave four ion series (Figure 6). As in the case of the 25-mer PS ODN described below in Example 5, "a", and "w" ions are the dominant sequence ions. Additional fragment ion series include "d" and "z" (see Tables 1 and 2). Because the modification occurs to the nucleosides, each residue mass used to assign terminal nudeosides is increased by 30 μ into 238 (C), 239 (T), 263 (A), and 278 (G), respectively. Similarly, the residue mass for each nucleotide containing a 2"-0-methyl group is 335 (C), 350 (T), 359 (A), and 375 (G). The terminal nudeotides containing a 2'-0-methyl group should have the following m/z in the positive ion mode, 354 (C), 369 (T), 378 (A), and 394 (G).
Table / . Measured and theoretical m/z of the V ions of
the 20-mer all PS ON
on t e obsd. m/z theo /z ass b s s
Table 1. Measured and theoretical m/z of the "a", αd*% and "zM ions of the 20-mer all PS ON e obsd /z theor /z a s dif ase assi n.
Example 4 Sequence determination of a 17-mer PS/PO chimeric nucleic acid analyte using MALDI-TOFMS with delayed pulsed ion extraction A 17-mer PS/PO chimeric oligodeoxynucleotide MALDI target was placed in the sample chamber of a Bruker Matrix-assisted Laser Desorption/ionization Time- of-Flight Mass Spectrometer equipped to allow delay of pulsed ion extraction following laser irradiation. The flight path is 1.6 m. The desorption and ionization was effected with a 337 nm N
2 laser. The irradiance was set at an estimated value of 1-2 x 10
7W/cm
2 to achieve a laser attenuance level of 40-50, by measuring the laser energy using a power meter. To improve the signal-to-noise, typically 100 laser shots were summed. Other instrument settings were as follows: (1) acceleration voltage = +20 kV; (2) P2 voltage = +22.5 kV; (3) lens voltage = +9.2 kV; (4) detector voltage = +1.5 kV; and (5) time elapsed between laser firing and voltage switching on the auxiliary plate = 250 ns. Matrix ions were deflected to avoid saturation of the dual-microchannel-plated detector. For fragment ions with a mass to charge ration (m/z) less than 1500, the deflection duration was set at 1000 ns, while for those with m/z above 1500 the deflection duration was set at 2000 ns. To reduce the systematic error introduced by the use of delay-extraction, a mass spectrum was calibrated in two segments. In the lower mass region, a matrix ion peak and the doubly protonated molecular ion were used as the calibrants. In the higher mass region, the singly and doubly protonated molecular ions were used as the calibrants.
Figure 4 shows a mass spectrum of a 17-mer oligodeoxynudeotide consisting of both PS and PO nucleotides. Prominent ion series observable in the mass spectrum include "d" and "w" ion series. Also prominently present in the lower m/z region of the mass spectrum are the "a" ions (Tabled- Because the measurement was taken in positive ion mode, all ions should have two extra protons compared to their negative ion analogs, assuming they are all even electron species. For simplidty the same notation for the same ion type is used regardless the ion polarity. The presence of a complete set of w ions (w,-w16) provides for
ready sequence determination. Confirmation of the 17-mer sequence can be made by a simple comparison between the expected and experimentally determined m/z of "w" ions (Tabled ).
The presence of prominent "a" ions in a MALDI-TOF mass spectrum is a feature that has not been reported for mixed base ODNs before. Equally unusual is the presence of some of the an-B(T) (the type and sequence position of the nucleobase cleaved is denoted by the letter in the bracket and an integer number n). Although an-B ions are commonly observed in a MALDI-TOF mass spectrum of ODNs, the loss of thy mine has been generally considered a unfavored fragmentation pathway (Hillenkamp, Smith). Noticeably missing are the "y" ions, except for yl4, yl5, and yl6. These three "y" ions are likely the remaining failure sequences because the limited efficiency of RP-HPLC in ODN purification.
A sequence can be constructed from the "w" ions together with (M+H)+ ions. The sequence is then compared with segments of sequences constructed from other ion series for verification. In general, this is a more useful strategy because it can be used to determine an unknown sequence. In such an approach, the first step of the sequence determination is to find the wk-1 peak (k being the total number of the nucleotides in a given sequence). Because the difference in m/z between (M+H)+ and wk_, corresponds to the mass of a 5'-terminal nucleoside residue, it should match one of the five possible values for unmodified deoxynudeoside, 209 (C), 210 (U), 224 (T), 223 (A), and 249 (G). The m/z difference between two adjacent "w" ions, on the other hand, should correspond to the mass of a nucleotide residue. Thus the wk-2 is located by finding the peak having a m/z smaller by one nudeotides the residues mass are 289 (C), 304 (T), 313 (A), 329 (G). These become 305 (C), 320 (T), 329 (A), and 345 (G) for phosphorothioate. The remaining "w" ions except Wj can be determined successively this way. Being an terminal ion, w, has a mass larger by that of HzO than its corresponding residue mass of a repeating unit. Additional correction due to the ion polarity requires the addition of one proton in the positive ion mode. Therefore, PS w, can have the following masses: 324 (C), 339 (T), 348 (A), and 364 (G); PO wα has smaller masses by 16 μ.
For the 17-mer in Figure 4, although sequence determination could be achieved with the "w" ion series alone, other type of ions provide useful confirmation. Because the use of high laser irradiance caused peak broadening even with the use of delayed-extraction, the w13 ion peak at m/z 4212 is not well separated from the peak at m/z 4205. The w12 peak at m/z 3910 is also weak. This affects the assignment of nucleotides at position 4, 5, and 6 (counting from the 5' end). However, information can be unambiguously obtained from d3 (m/z 979), d4 (m/z 1308), d5 (m/z 1613), and d6 (m/z 1942). As shown in Table3^ rιe partial sequence constructed from these ions match the corresponding sequence constructed from "w" ions. Because w, appears in the region where matrix ions are very abundant, there may be cases when the former is overwhelmed by the latter. In such a case, the "dk-1" could give the identity of the 3' terminal nudeoside, as does the "wk-1" the 5' terminal nudeoside. Alternative to the "d" ions, the "a" ions could also give unambiguous sequence assignment for position 4-6 when "w" ions could not. The presence of both "a" and "d" ions could also be very useful in determining the structure of backbone linkage. When a backbone linkage in question is flanked by both "a" and "d" ion peaks, its identity can be revealed from the m/z difference of the two type of ions. In such a case, a m/z difference of 98 indicates a PO linkage, while that of 114 indicates PS linkage.
Table 3. Measured and theoretical m/z of the sequence ions from the PS-PO chimera
Sequence determination of a 25-mer PS oligodeoxynucleotide using MALDI- TOFMS with delayed pulsed ion extraction.
A 25-mer PS oligodeoxynucleotide was sequenced by MALDI-TOFMS using the instrument settings described in Example 4 above. Figure 5 shows a mass spectrum of a 25-mer PS ODN. A complete set of "w" ions can be identified (Table 4). Accompanying the "w" ions are the "a" ions giving a distinct pattern of paired peaks in the mass spectrum (Table 5). Other ion series with generally lower abundances, such as "b", "c", "d", and "z" ions have also been found (Table 6). As shown, the replacement of the non-bridging oxygen atom by a sulfur atom enhances the generation of "a" ions. While only partial "a" ion series is observable in the PS/PO chimera described in Example 4, an almost complete "a" ions sequence (a2-a24) is observed here. The a, ions could no be identified because of the presence of an abundant matrix ion peak at the same m/z position in the spectrum. Besides the prominence of the "a" ions, another feature that appears to be general for PS ODN is that the peaks of "w" ions are larger than those of "a" ions of the same length. Also noteworthy is the presence of "z" ion series hitherto unreported in the MALDI TOFMS literature. Interestingly, "a-B" ions are missing. Following the strategy discussed in the previous section, one compete and three partial sequences can be constructed based on the four types of ion series. The sequences are consistent with one another.
Table • Measured and theoretical m/z of the " " ions of the 25-mer all PS ODN
Table -5- Measured and theoretical m/z of the "a" ions from the 25-mer all PS ODN
Table (o. Measured and theoretical m/z of the "b", uc'\ αd", and "z" ions of the 25-mer all PS ODN
Sequence determination of a 18-mer PS/MP chimeric oligodeoxynucleotide using MALDI-TOFMS with delayed pulsed ion extraction.
A 18-mer PS/MP chimeric oligodeoxynucleotide was sequenced by MALDI- TOFMS using the instrument settings described in Example 4 above. Besides phosphorothioate, another popular modified backbone structure is methylphosphonate. The substitution of the non-bridging oxygen atom by a methyl, as opposed to a sulfur atom for ODNs, reduces the fragmentation efficiency (Figure 7). A gap appears in the methylphosphonate region. This proves to be extremely valuable as they fill the sequence gap. The residue masses for methylphosphonate deoxynucleotide are 287 (C), 302 (T), 311 (A), and 327 (G).
Example ' To investigate the origin of the ISF Products the 25-mer PS ODN was mixed with a 17-mer PS-ODN of the same sequence nested on the 3'-end in a molar ratio of 100:1. When analyzed by RP-HPLC, the two PS ODN appear as two separate components (Figure 8). Fractionation of the mixture at time points indicated in the chromatogram gave 5 fractions. Analysis by MALDI-TOFMS revealed that Fraction A consisted mostly of the 17-mer. Due to low the concentration, the signal of the 17-mer is low (data not shown). The MALDI-TOFMS mass spectra of the other fractions are shown in Figure 9. In Fraction B besides the 17-mer no other accompanying fragments are present. As expected, the latter eluting fractions contain decreasing amount of the 17-mer but increasing amount of the 25-mer. Importantly, none of these fractions showed the presence of the fragment ions. In contrast, Fraction E, having the highest concentration of the 25-mer, gave all the fragment ions (Fig. 10), Note the 17-mer is not present in the spectrum. If the fragments were present prior to MALDI, Fraction B is expected to contain not only the 17-mer, but also some of the fragments. Conversely, if the 17-mer could be removed from Fraction E, the fragments should also have been removed from the 25-mer if they had been present. We can therefore conclude that the fragment ions did not originate from species that are present prior to MALDI; rather, it is the result of MALDI .
Example 2> The influence of experimental conditions No qualitative difference has been observed when the 25-mer PS ODN, shown in Figure 5, was analyzed in either the positive or negative polarity, using 3- HPA/IAL as the matrix. As in the positive polarity, the dominant ion series are "w" and "a" ions. Additional ion series include "d" and "z" ion, the latter appear only in the lower m/z region. The tendency to arc in the ion source at high laser irradiance makes the positive polarity a better choice for routine analysis.
Although the signal to noise ratio (S/N) of the fast fragment ions can be improved as the irradiance increases from the threshold irradiance, which is defined as the irradiance necessary to give a S/N of 2 for a given fragment ion, the abundance of the fragment ions relative to that of the (M+H)+ ion of the 17-mer PS ODN did not scale linearly with the irradiance applied at the sample. At 6 x 106
W/cm2, only "w3" - "w12" were observed. Increasing the irradiance by 30% made all "w" ions and most of the "a" ions observable. However, from this point on further increase in irradiance actually decreased the relative abundance of smaller "w" ions (smaller than w10), while little affecting that of the larger "w" ions. The delay time affects the absolute intensity of all ions. However, the abundance of fragment ions relative to that of (M+H)+ remained little changed within 200 - 400 ns.
Example ^ Proposed fragmentation pathway for modified oligonucleotides
Because "a-B" ions of ODNs are usually present in a MALDI-TOF mass spectrum, most of the ion fragmentation mechanisms which have been proposed involve the cleavage of the base, followed by the cleavage of the 3' glycosidic bond- However, using the method according to the invention, a PS ODN could give prominent depurinated products while showing little sign of "a-B" ions. The
observation suggests that there is another fragmentation pathway that is independent of the base cleavage. The m/z values of the "a" and "w" ions suggest that the cleavage involves at least one hydrogen transfer. The transfer could occur directly from the "a" precursor to the "w" precursor or involve the matrix species, as the cleavage happens in an area where matrix ions and radicals are present.
However, the observation that PS ODNs have greater tendency than PO or MP ODNs to form "w" and "a" ions is not in favor of a cleavage mechanism involving matrix species. Instead, the observation is better explained by a mechanism involving El type syn elimination (Lowry, T, H., Richardson, K. S. Mechanism and theory in organic chemistry, Harper & Row: New York, 1981, pp 530-556). In this mechanism, the 4' hydrogen is transferred to the negatively charged non-bridging oxygen or sulfur atom resulting in the formation of a double bond between 3' and 4' positions and the cleavage of the glycosidic bond (Figure 10). The replacement of the non-bridging oxygen with a neutral methyl group would impede the process. In such a case, the 5' PM group could orient itself to participate in the elimination of the 4' hydrogen leading to the formation of a double bond between the 4' and 5' positions or a rearranged sugar ring (Figure 11).