WO2000022139A2 - Dna sequences for enzymatic synthesis of polyketide or heteropolyketide compounds - Google Patents

Dna sequences for enzymatic synthesis of polyketide or heteropolyketide compounds Download PDF

Info

Publication number
WO2000022139A2
WO2000022139A2 PCT/US1999/023535 US9923535W WO0022139A2 WO 2000022139 A2 WO2000022139 A2 WO 2000022139A2 US 9923535 W US9923535 W US 9923535W WO 0022139 A2 WO0022139 A2 WO 0022139A2
Authority
WO
WIPO (PCT)
Prior art keywords
seq
dna
sequences
pepocos6
sequence seq
Prior art date
Application number
PCT/US1999/023535
Other languages
French (fr)
Other versions
WO2000022139A3 (en
WO2000022139A9 (en
Inventor
Stefan Beyer
Helmut Bloecker
Petra Brandt
Paul M. Cino
Brian A. Dougherty
Steven L. Goldberg
Gerhard Hofle
Rolf-Joachim Mueller
Hans Reichenbach
Original Assignee
Bristol-Myers Squibb Company
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Bristol-Myers Squibb Company filed Critical Bristol-Myers Squibb Company
Priority to JP2000576029A priority Critical patent/JP2002527067A/en
Priority to CA002346499A priority patent/CA2346499A1/en
Priority to AU65126/99A priority patent/AU6512699A/en
Priority to EP99953117A priority patent/EP1119628A2/en
Publication of WO2000022139A2 publication Critical patent/WO2000022139A2/en
Publication of WO2000022139A9 publication Critical patent/WO2000022139A9/en
Publication of WO2000022139A3 publication Critical patent/WO2000022139A3/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/11DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
    • C12N15/52Genes encoding for enzymes or proenzymes
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P17/00Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
    • C12P17/18Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms containing at least two hetero rings condensed among themselves or condensed with a common carbocyclic ring system, e.g. rifamycin
    • C12P17/181Heterocyclic compounds containing oxygen atoms as the only ring heteroatoms in the condensed system, e.g. Salinomycin, Septamycin
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12PFERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
    • C12P7/00Preparation of oxygen-containing organic compounds
    • C12P7/62Carboxylic acid esters

Definitions

  • the present invention relates to DNA sequences for enzymatic synthesis of polyketide or heteropolyketide compounds produced by the bacterium Sorangi um cellulosum.
  • PKS Polyketide synthases
  • NRPS non-ribosomal peptide syn- thetases
  • PKS condenses activated carbonic acids (usually acetate and propi- onate) and reduce the resulting 2-keto acid intermediates step- wise in a fatty acid biosynthesis-like fashion.
  • responsible for each reaction step is a specific domain that recognizes, acti- vates, condenses and reduces the carbonic acid.
  • every reduction stage can occur in the final product (Rawlings, Nat. Prod. Reports 14, 523-556 [1997]; for a review, see Chem. Rev. 97, 2463-2760 [1997]).
  • a typical example for the biosyn- thesis of a polyketide is the macrolide antibiotic erythromycin
  • NRPSs are also modular enzymes and condense via peptide bonds amino acids to low molecular weight bioactive substances like bacitracin or tyrocidin. Typical domains of these systems acti- vate the amino acid and condense it with the growing peptide chain. Methylations, epimerisations and modifications via additional protein domains are possible (Stachelhaus and Marahiel, FEMS Microbiol Lett. 125, 3-14 [1995]). Both types of enzymes (NRPS and PKS) share the modular organization of the proteins in which specific catalytic domains are responsible for recognition, activation, condensation and modification of the single elongation units. The growing chain of amino acids and/or carbonic acids is extended through the action of one module adding one unit.
  • each module carry the active centers responsible for the enzymatic steps of the biosynthesis. Little is known about the biosynthesis of biologically active polyketides and polypeptides from myxobacte ⁇ a . Fragments of the biosynthetic gene clusters of soraphen and saframycm have been described from Sorangium cellulosum So ce26 and Myxo- coccus xanthus, respectively (Schupp et al . , J. Bacte ⁇ ol . 177 , 3673-3679 [1995] and Pospiech et al . , Microbiology 141, 1793- 1803 [1995] ) .
  • a subject of the present invention is therefore to provide DNA sequences according to claim 1 the expression products of which perform or are involved m the enzymatic biosynthesis, mutasynthesis or partial synthesis of polyketide or heteropolyketide compounds.
  • the DNA sequences may be inserted into well known and optimized expression vectors by commmon tech- niques of molecular biology, thus allowing transformation, selection and cloning of cells, which cells are then capable of synthezis g polyketide or heteropolyketide compounds by fermentation. Using an overproducing clone allows the desired polyketide or heteropolyketide compounds be easily produced and recovered m high amounts.
  • the invention thus further relates to a recomb ant expression vector according to claim 16, cells transformed therewith according to claim 17 and to a process for enzymatic biosynthesis, mutasynthesis or partial synthesis of polyketide or heteropolyketide compounds according to claim 23.
  • the invention consists of (1) cloned Sorangium cellulosum polyketide synthase (PKS) and/or peptide synthetase (PS) biosynthetic cluster DNA and (2) the nucleotide sequence and predicted protein coding sequences of the cloned DNA.
  • PKS Sorangium cellulosum polyketide synthase
  • PS peptide synthetase
  • the invention can be used for, but not limited to, (a) increasing yields of PKS product Sorangium cellulosum (e.g., by amplification or genetic modification of the epothilone gene cluster or its component parts) , (b) increasing yields of polyketide and/or peptide synthetase product a heterologous system by transfer of the corresponding gene cluster or its component parts, which may De followed by amplification or genetic modification of the PKS and/or PS gene cluster or its component parts, (c) modification of the polyketide and/or peptide synthetase product chemical structure either Sorangium cellulosum or a heterologous host (e.g., by genetic modification of the corresponding gene cluster or its component parts) and (d) for the detection of genes and gene products involved making polyketides or related molecules m other organisms (e.g., by hybridization or complementation assays) .
  • Figure 1 is a restriction map of one of the DNA sequences of the present invention (cosmid A2 insert) indicating also the localization of regulatory DNA segments and the individual structural genes ("open reading frames" or ORFs) 1 to 16.
  • Figure 2 shows the open reading frames found on pEPOcos ⁇ region
  • DNA sequence data from A2 cosmid are as defined in claim 6.
  • Table 1 correlates ORFs 1 to 16 found on A2 cosmid with the respective biological function (Regulators, Enzymes) .
  • CA-2 agar is prepared by autoclaving 18 g Bacto-agar (Difco Laboratories, Detroit, MI) m 800 ml dH 2 0 for 20 min at 121°C and cooling to 50-55°C in a water bath. The following filter-sterilized solutions are added to the agar: 20% (w/v) glucose, 50 ml; Solution A (7.5%
  • G51t consists of 0.5% starch (Cerestar) , 0.2% tryptone, 0.1% yeast extract, 0.05% CaCl 2 , 0.05% MgSO 4 -7H 2 0, 1.2% 4 - (2 -hydroxyethyl ) - 1-piperazine- ethanesulfonic acid (HEPES) , 0.2% glucose, pH 7.6.
  • the flasks were shaken at 30°C, 160 rpm until a dense orange bacterial growth was obtained (ca. 5-7 d.) .
  • the cells were pelleted by centrifugation at 6,000 x g and used immediately or stored fro- zen at -20°C.
  • genomic DNA was isolated from S . cellulosum cells cultured as described m section A.l using the Midi Qiagen Blood & Cell Culture DNA purification Kit (Qiagen, Hilden, Germany) following the Qiagen Genomic DNA Handbook protocol for bacterial DNA isolation (1997, Qiagen, Hilden, Ger- many, p. 29 ff.) .
  • the precipitated DNA was recovered with a bent pasteur pipette as described m section A.l.
  • pFD666 is a bifunctional E. coli -Streptomyces cosmid cloning vector (see Denis and Brzezmski, Gene 111 , 115-118 [1992]) . To maintain stability of large inserts, it is present low-medium copy number when replicated E . coli . For this reason, isolation of sufficient pure DNA to carry out cloning experiments was difficult using commercial kits with standard protocols. A modified procedure was therefore used to obtain pFD666 DNA.
  • a 10 ml culture of DH10B (pFD666) was grown for 16- 20 hr at 37°C m LB (1% tryptone, 0.5% yeast extract, 0.5% NaCl , pH 7.0) medium containing 50 ⁇ g/ml kanamycm sulfate.
  • Fifty ml of LB + kanamycm was inoculated to a starting OD 60 o of ca 0.25 and shaken at 300 rpm, 37°C, until the OD 60 o reached ca . 0.6.
  • Five hundred ml of LB + kanamycm medium a 2 1 flask was inoculated with 25 ml of this culture and incubated under the same conditions for 2.5 hr .
  • Chloramphenicol ( 2.5 ml of a 34 mg/ml solution 100% EtOH) was added and the incubation continued for an additional 16-20 hr .
  • the previous steps were performed according to Maniatis et al . Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989.
  • Cells were pelleted for 10 mm, 16,000 x g . They were resuspended in 9 ml of 50 mM glucose/25 mM Tris-HCl (pH 8.0) /10 mM EDTA and transferred to a 50 ml disposable centrifuge tube.
  • the tube was centrifuged at 6,000 x g for 10 mm at 4°C and the supernatant poured though 2 layers of sterile cheese cloth into a fresh 50 ml disposable tube. Isopropanol to a final concentration of 0.6% was added and the contents of the tube mixed several times. The precipitated nucleic acid was centrifuged at 6,000 x g for 10 mm at 4°C. The pellet was washed with 70% EtOH and any excess EtOH was aspirated from the pellet, which was allowed to air dry for 5 mm.
  • S . cellulosum chromosomal DNA prepared as described m section A.l. a was partially cleaved with restriction endonuclease Sau3Al m a 1000 ⁇ l reaction volume consisting of 50 ⁇ g chromo- so al DNA, 5 units enzyme (Pro ega, Madison , WI), 0.006 M Tris-HCl, 0.006 M MgCl 2/ 3.10 M NaCl , and 0.001 M dithiothrei- tol (pH 7.5) for 5 mm at 37°C.
  • the reaction mixture was extracted once with an equal volume of 1:1 phenol : chloroform. After centrifugation, tr.e upper aqueous phase was saved, to which 0.1 vol.
  • DNA was pelleted by centrifugation for 5 mm at 16,000 x g m a microfuge and washed once with 0.5 ml 70% EtOH. After drying m a SpeeaVac (Savant Instruments, Farmmgdale, NY) for 5 mm, the pellet was resuspended m 0.1 ml TE buffer.
  • the DNA was layered ontcp of a 12 ml 10-40% sucrose gradient prepared m TE buffer ana centrifuged at 113,600 x g for 16 hr, 10°C using a Beckman SW40T ⁇ rotor (Beckman Instruments, Palo Alto, CA) .
  • Sucrose was diluted from the corresponding 0.5 ml fraction by addition of 0.5 vol. TE . Subsequently, DNA was precipitated by addition of 0.1 vol. 3 M sodium acetate and 0.6 vol. isopropanol . DNA was pelleted by centrifugation at 16,000 x g for 10 mm m a microfuge. DNA was washed with 0.5 ml 70% EtOH and dried m a SpeedVac with moderate heat for 10 mm. Finally, the DNA was resuspended m distilled H 2 0 at a concentration of 0.5 mg/ml.
  • Vector pFD666 was cleaved with restriction endonuclease BamHI m a 0.02 ml reaction volume consisting of 2 ⁇ g plasmid DNA, 10 units of BamHI (Promega), 0.006 M Tris-HCl, 0.006 M MgCl 2 , 0.05 M NaCl , and 0.001 M dithiothreitol (pH 7.5) for 90 mm at 37°C.
  • 0.3 ml of stop buffer (0.01 M Tris-HCl [pH 7.5], 0.001 M EDTA, 0.2 M NaCl , 0.5% SDS) and 0.35 ml of 1 : 1 phenol; CHCl 3 was added to the reaction.
  • the sample was mixed gently several times by inversion and centrifuged at 16,000 x g for 3 mm to separate the phases.
  • the aqueous layer was removed to a new microfuge tube.
  • 0.1 vol. 3 M sodium acetate and 2 vol. 100% EtOH were added and the precipitated DNA pelleted by centrifugation at 16,000 x g for 10 mm. Liquid was removed by aspiration and the pellet washed once with 0.5 ml 70% EtOH.
  • the DNA was dried m a SpeedVac and resuspended m TE buffer to 0.5 mg/ml .
  • the entire packaging reaction (0.5 ml) was diluted with 4.5 ml SM buffer (per liter: 5.8 g NaCl, 2 g MgSO 4 .7H 2 0, 1 M T ⁇ s- HCl [pH 7.5] , 5 ml 2% gelatin solution) .
  • Transfection was performed by adding 10 ml cf an overnight culture of E. coli DH5 ⁇ that had been grown m LB medium with 0.01 M MgS0 4 and 0.2% maltose to the diluted pnage and incubating at 37°C for 20 mm.
  • 0.8 ml of LB was added and the cells shaken at 225 rpm for 1 hr at 37°C.
  • DNA was digested with restriction endonuclease Pstl and samples electrophoresed on a 0.8% TAE agarose gel for 1.5 hr at 100 V.
  • a unique restriction pattern was noted m eac sample and the total size of the insert was calculated to be between 40 and 45 kilobases.
  • digeste ⁇ vector DNA was ligated with 10 ⁇ g partially hydrolyzed genomic DNA from S . cellulosum (as described m section A.3.b) m a final volume of 20 ⁇ l using 2 U T4 DNA ligase and the appropriate reaction buffer (Gibco BRL, Eggen- stem, Germany) . The reaction was carried out at 16°C overnight. The reaction mixture was packaged into phage particles using the Gigapack III XL packaging extract kit (Stratagene) according to the manufacture's protocol. Treatment of packaging reaction mixture and transfection of E . coli SURE (Stratagene) was performed as described m 4. a.
  • Transfected cells were concentrated by centrifugation, resuspended m fresh LB medium and distributed on LB agar plates containing 50 ⁇ g/ml 1 kanamycm. The plates were incubated overnight at 30°C. 1600 recombmant clones were transferred into 96 well microtiter plates filled with 80 ⁇ l LB medium containing 50 ⁇ g/ml kanamycm per well and propagated overnight at 30°C. The following day the microtiter plates were used to inoculate a second set of microtiter plates m order to obtain a duplicate of the recombmant clones.
  • Plasmid pZero2.1 (Invitrogen, Carlsbad, CA) was cleaved with restriction endonuclease BamHI m a 0.02 ml reaction volume consisting of 1 ⁇ g plasmid DNA, 10 units of BamHI (Promega) , 0.006 M Tris-HCl, 0.006 M MgCl 2 , 0.05 M NaCl, and 0.001 M dl- thiothreitol ( pH 7.5) for 20 mm at 37°C. 0.08 ml of dH 2 0 and 0.1 ml of 1:1 phenol :CHC13 was added. The sample was briefly vortexed and centrifuged at 16,000 x g for 2 mm.
  • the aqueous layer was removed to a new microfuge tube. 0.1 vol. 3 M sodium acetate and 2 vol. 100% EtOH were added and the precipitated DNA pelleted by centrifugation at 16,000 x g for 10 mm. Liquid was removed by aspiration and the pellet washed once with 0.5 ml 70% EtOH. The DNA was dried m a SpeedVac and resuspended m TE buffer to 0.004 ⁇ g/ml. Digested pZero2.1 was ligated to the partially-cleaved chromosomal DNA m a 0.01 ml reaction consisting of 0.004 ⁇ g pZero2.1 , 0.05 ⁇ g S .
  • the ligated DNA was resuspended m 0.005 ml dH 2 0 and mixed with 0.04 ml of electro- competent Escherichia coli DH10B cells (GIBCO/BRL, Gaithers- burg, MD) .
  • the sample was placed into a pre-chilled 0.2 mm-gap electroporation cuvette and transformed into the bacteria by electroporation using a BioRad Gene Pulser II unit (BioRad, Hercules, CA) at 25 ⁇ F and 200 ⁇ .
  • a 20 x 20 cm sheet of Duralon UN membrane (Stratagene) was placed on top of a 24.5 x 24.5 cm square bioassay dish containing 250 ml LB agar - kanamycm. An aliquot of the frozen cosmid library m 1 ml L ⁇ medium was spread on the filter. The plate was incubated at 37"C for 24 hr . Colonies were replicated onto two fresh filters which were placed onto LB + kanamycm agar medium and incubated at 28°C for 18 hr. Lysis of cells and neutralization of released D ⁇ A was performed according to di- rections that were provided with the filters.
  • the D ⁇ A was crosslmked to the filters using a UV Stratalmker 2400 unit (Stratagene) m the autc crosslink mode.
  • Cell debris was removed by placing the filters m a container with a solution of 3 X SSC (20 X SSC contains, per liter, 173.5 g ⁇ aCl , 88.2 g so- dium citrate, pH adjuste ⁇ to 7.0 with 10 ⁇ ⁇ aOH) , 0.1% SDS and rubbing the lysed colonies with a Kimwipe .
  • the filters were then incubated at least 3 hr with the same wash solution for at least 3 hr at 65°C.
  • the plasmid library was treated similarly except cells were sprea ⁇ onto a 137 mm circular Duralon UN mem- brane placed on top of a 150 mm petri dish containing 80 ml LB agar + kanamycm.
  • a probe consisting of a 650-base pair (bp) polymerase chain C?CR) fragment representing a portion of a S . cell ulosum polyketide synthase gene was used.
  • the fragment was amplified using primers to consensus regions of Type I (macrolide) polyketide synthase (PKS) genes (Swan et al . , Mol . Gen. Genetics 242 , 358-362 [1994]).
  • PKS polyketide synthase
  • Reactions were carried out m a Perkm-Elmer Model 480 Thermocycler (Perkm- Elmer Corporation, Foster City, CA) under the following conditions: 94°C, 1 mm; 50°C, 1 mm, 72°C, 1.5 mm for a total of 30 cycles. Each possible combination of sense and anti-sense p ⁇ m- ers were tried.
  • a 650-bp and 350-bp fragment was amplified using oligos 120 + 124 and 123 +124, respectively.
  • the sequence of the fragments were determined using the ALFexpress AutoRead kit to fluorescently label the DNA, which was analyzed on an ALFexpress sequencing apparatus (Pharmacia) .
  • the 650-bp fragment was chosen for hybridization experiments.
  • the fragment was labeled with 32 P-dCTP using the NEBlot kit (New England Biolabs, Beverly, MA) and purified on a Bio-Spm 6 column (BioRad, Hercules, CA. ) .
  • Duplicate blots were pre- hybndized 3 X SSC (1 X SSC contains 0.15 M sodium chloride and 0.015 M sodium citrate, pH 7.0) , 4 X Denhardt ' s solution
  • Plasmid DNA was prepared using the standard alkaline lysis method and digested with restriction endonuclease Pstl. The digested DNA was electrophoresed on a 0.8% aga- rose gel m TAE for 3 hr at 100 V. Fragments were transferred to Duralon UV using the VacuGene XL vacuum blotting unit (Pharmacia) and the recommended alkaline denaturation protocol . Hybridization with radioactively-labeled PCR fragment and washing were carried out as described above.
  • a 137 -mm circle of Duralon UN membrane was placed on top of a 150-mm containing 75 ml LB agar + kanamycm.
  • An aliquot of the plasmid library (representing ca . 2,000 recombmant colonies) m 0.5 ml LB medium was spread on the filter.
  • the plate was incubated at 37°C for 20 hr .
  • Colonies were replicated onto two fresh filters which were placed onto LB + kanamycm agar medium and incubated at 37°C for 6 hr .
  • the filters were proc- essed for hybridization as described Section C.l.
  • This plasmid, called Sau4 was characterized in more detail.
  • the recombmant E . coli clones from the microtiter plates were used to produce two identical sets of hybridization filters order to identify cosmids carrying PKS and PS genes .
  • the recomc ant clones were spotted onto 2 sets of 22 x 22 cm LB agar plates containing 50 ⁇ g/ml kanamycm. Each plate contained 384 clones therefore representing 4 microtiter plates.
  • the clones were incubated at 30°C overnight. After pre-coolmg for approximately 3 h at 4°C, 20 x 20 cm Hybond ⁇ + Nylon membranes (A ⁇ ersham, Braunschweig, Germany) were placed onto the agar sur aces. After 2 mm.
  • the membranes were removed and placed for 15 mm. on Whatman 3 MM paper (Whatman paper Ltd., Maidstone, England) soaked with denaturation solution (0.5 N NaOH, 1,5 V NaCl) before they were transfered onto Whatman 3 MM paper saturated with neutralization solution (1 M Tris-HCl, pH 7.5, 1.5 iv NaCl) . Subsequently the membranes were placed onto Whatman 3 MM paper soaked with 2 X SSC (0.3 M NaCl, 0.03 M sodium citrate, pH 7.2) for 10 min. The membranes were baked for 40 min at 85°C.
  • each membrane was overlayed with 5 ml Proteinase K solution (2 mg/ml Proteinase K in 2 x SSC) and incubated at 37 °C for 90 min. Finally, cell debris was removed by wiping the membranes with a Kimwipe pre-wetted with 2 X SSC.
  • PCR fragments were used as gene probes to detect recombinant cosmids of the S . cellulosum cosmid library.
  • Oligonucleotides based on conserved amino acid sequences of ketosynthase domains from various type I PKS were optimized for myxobacterial DNA by comparison to a known myxobacterial biosynthetic gene cluster (Schupp et al . , J. Bacteriol . 177, 3673-3679 [1995]) resulting in primer KSlUp (5 ' -
  • KSD1 (5'-GG(A/G)TCICCIA(A/G) I (G/C) (T/A) IGTICCIGTICC (A/G) TG-3 ' ) .
  • PCR-primers TGD (5'- T(A/T) (C/T) CGIACIGGIGA(C/T) (C/T) (G/T) IG(G/T) ICG-3') and LGG ( 5 ' -
  • A(A/T) IGA(A/G) (G/T) (G/C) ICCICCI (A/G) (A/G) (G/C) I (A/C) (A/G) AA (A/G )AA-3') directed to genes encoding adenylation modules have been de- scribed by Turgay et al . (Pept. Res. 7, 238-241 [1994]).
  • PCR reaction mixtures with a final volume of 25 ⁇ l contained 0.1 ⁇ g template DNA, 0.2 U Taq DNA-polymerase (Gibco BRL, Eggenstem, Germany), 5 ⁇ mol dNTP, 5% dimethyl sulfoxide (Sigma), 1.5 mM MgCl 2 , 25 pmol of each primer and the appropriate reaction buffer supplied by Gibco BRL. Chromosomal DNA of S . cellulosum was used as template. Additionally, chromosomal DNA of Myxococ - cus fulvus was used with PS primers.
  • KS fragments Three different KS fragments, designated pM008.4, pM008.6, pM008.7, one PS fragment (pAPsl) corresponding to S . cellulosum and one PS fragment (pDPsl) obtained with chromosomal DNA of M. fulvus .
  • the PCR fragments were re- isolated by digestion with EcoRI from the plasmids pM008.4, pM008.6, and pM008.7, laoeled, pooled and used as gene probes m hybridization experiments as described below. The same procedure was performed with the PS fragments of pAPsl and pDPsl.
  • Hybridization with PKS and PS specific DNA probes was carried out using the DIG nonradioactive labeling and detection kit (Boehrmger Mannheim, Germany) and performed according to the supplier's manual using buffer containing 50% formamide .
  • the membranes were hybridized m plastic bags containing approx. 10 ml of hybridization solution at 39°C overnight. Unspecific binding of probes was removed by 2 wash steps with 2 x SSC, 0.1% SDS at room temperature for 20 mm. and one stringent wash step with 0.5 x SSC, 0.1% SDS at 60°C for 20 mm. Detection of hybridizing DNA fragments was performed with the above mentioned system according to the manufacturer's protocol using CSPD as chemilummescent substrate. The signals were recorded by exposure of the treated membrane to Hyperfilm ECL (Amersham Life Science, Little Chalfont, England) which was developed m appropriate time intervals.
  • Randomly sheared l oraries were constructed for cosmids and plasmids of interest using a protocol similar to that of of Fleischmann et al . , 19S ⁇ (Science 269, 496) and modified m Fraser et al . , 1995 (Science 370 , 397). Briefly, Qiagen-column purified cosmid DNA ( ⁇ 1C ⁇ g) was sheared to a size of approximately 2 kb and the DNA end-repaired using BAL31 nuclease.
  • the DNA was gel -purified after electrophoresis through a 0.75% low- melting temperature agarose gel containing 0.5 ⁇ g/ml ethidium bromide m IX TAE buffer run at ⁇ O V for 2 hours.
  • the volume of the low-melt agarose gel slice was estimated by adding the gel slice to a microfuge tune and weighing, then 0.1 vol. of 3 M sodium acetate (pH 7) was added and the agarose incubated at 60°C. Tne temperature .--as equilibrated to 37°C, and DNA ex- tracted twice using an equal volume of buffered phenol (Life Technologies) .
  • aqueous phase was transferred and extracted once with an equal volume of chloroform, then ethanol preci- pated by the addition of 2 vol. cold 100% ethanol.
  • DNA was con- centrated by spinning at 16,000 x g m a microcentrifuge. The DNA pellet was washed with 1 ml 70% ethanol and resuspended m 100 ⁇ l of 0. IX TE.
  • the DNA was ligated to Smal -digested, phos- phatase-treated pUCl ⁇ vector (Pharmacia) , and single insert re- combmants isolated by gel-pu ⁇ fication of the band containing vector plus a single insert, followed by T4 polymerase polishing, and a final intramolecular ligation of the vector-plus- s gle-msert DNA.
  • This final ligation represents a library of highly random ca . 2 kb fragments that was used for shotgun sequencing of the ca . 40 kb cosmids or ca. 10 kb plasmids.
  • Cosmid A2 Cosmid DNA with inserts of S . cellulosum was isolated by an alkaline lysis procedure and purified with Ma- cherey Nagel columns (Machery und Nagel GmbH und CoKG, D ⁇ ren, Germany) using manufacturer's recommendation. Purified Cosmid DNA was sonicated, end-repaired using T4 DNA Polymerase (Boe- hrmger Mannheim, Germany) . After gel -purification fragments of a size of approximately 2 kb were ligated into Smal -digested, phcsphatase-treated pTZl ⁇ R vector (Pharmacia) . The ligation represents a library of highly random ca. 2 kb fragments that was used for shotgun sequencing of the ca . 40 kb cosmid.
  • DNA (1 ⁇ l of 100 ⁇ l total m the library) was transformed into E . coli by electro- poration (20 ⁇ l of Electromax DH10B cells from Life Technolo- gies) and cells spread onto LB plates containing 50 ⁇ g/ml ampi- cillin. After growth overnight at 37 °C, transformants (ca. 300- 3000 CFU total) were tranfered to 96-well growth blocks and shaken overnight at 37°C in 1.3 ml LB medium with 50 ⁇ g/ml am- picillin.
  • coli DH10B by electroporation and cells were spread onto LB agar medium containing 50 mg/ml ampicillm. After growth for l ⁇ hr at 37°C, transformants were transferred to 96-well growth blocks and shaken overnight at 37°C m 1.3 ml 2x YT medium with 50 mg/ml ampicillm. Templates were prepared from these cells by an alkaline lysis procedure (Qiagen Qia- quick Turbo Prep) to yield purified, double-stranded plasmid DNA.
  • Cycle-sequencing of the plasmid templates was performed using universal forward and reverse primers and Big Dye Termi- nator sequencing kits (PEBiosystems) or Thermo Sequenase fluorescent labelled primer cycle sequencing kit (Amersham Pharmacia Biotech) using the manufacturer's protocols.
  • PBiosystems Universal Forward and reverse primers and Big Dye Termi- nator sequencing kits
  • Thermo Sequenase fluorescent labelled primer cycle sequencing kit Analogenase fluorescent labelled primer cycle sequencing kit
  • data were collected using Licor and ABI 377 automated sequencers and assembled with the GAP4 program (Bon- field, Smith, Staden, Nucl . Acids Res. 23, 4992-4999 [1995]). Gaps were closed using custom made primers (MWG-Biotech) on plasmid templates or PCR products m combination with dye- terminators .
  • ORF Open reading frame identification ORFs were identified the pEPOcos ⁇ region using the OMIGA 1.1.2 (GCG 0.4D) program from Oxford Molecular Limited. Default values were used (Stan ⁇ ard genetic code, all ORFs over 50 bases) to generate ORFs; analysis of these results lead to the list of 14 highest quality ORFs as defined m claim 9. Other ORFs, genes, or genetic elements may be found m the pEPOcos ⁇ insert that have not yet been annotated.
  • the MAGPIE automated genome analysis tool : (htt : //genomes . rockefeller . edu/magp e/magpie . html )
  • ORFs have been identified within the DNA sequences of A5 (contigs 10, 11, 12) and of A2 using the FramePlot analysis program from Ishikawa and Hotta (FEMS
  • BLASTP2 similarity searches were performed using the peptide files from the above ORF identification strategy as query sequences. Searches were performed using the m-house Biomfor- matics BLASTP2 (Version: BLASTP 2.0al9MP-WashU) web page at the Bristol-Myers Squibb Pharmaceutical Research Institute (allows BlastN2, BlastP2, BlastX2 , TblastN, and TBlastX searches). In addition, peptide files generated by the MAGPIE analysis were automatically searched using a FASTA algorithm.
  • a higher probability score indicates a higher liklihood that the probable function corresponds to that of the best match; e.g., the polyketide synthase matches are all above e-100, and given the very high significance scores are presumed to function as polyketide synthases (as are the high scoring peptide synthetases) .
  • IS-Scl An IS element
  • DNA sequence data are as defined in claim 7.
  • pEP0cos6_0RF7 sequences (cf. claim 9): the predicted N-terminus of 0RF7 shows 145 nucleotide overlap with ORF6.
  • Table 3 shows ORF data summary. Note: pEPOcos6_ORFi . seq is truncated at its 5' end; correspondingly pEPOcos6_ORFl .pep is truncated at its N-termmus.
  • DNA sequence data from contigs are as defmded claim 10. Table 4 shows more data.
  • pEPOcos ⁇ protein data are as defined m claim 11, i.e. for selected ORFs (polyketide synthase, peptide synthetases, or ORFs with high similarity to known genes) .
  • ORFs polyketide synthase, peptide synthetases, or ORFs with high similarity to known genes.
  • DNA sequence data from contigs are as defined in claim 12. Table 5 shows more data.
  • Protein sequence data from selected A5 ORFs are as defined in claim 13.
  • Insert contains PKS genes on two large contigs - most similar to the soraphen PKS gene from Sorangium.
  • DNA sequence data from plasmid Sau4 contigs are as defined m claim 14. Table 6 shows more data.
  • Protein sequence data from selected plasmid Sau4 ORFs are as defined m claim 15.
  • Heterologous expression of the ORFs shown m Figure 1 is performed by using a derivative of plasmid pSUP102 (Simon, R., Priefer, U. , P ⁇ hler, A., Methods m Enzymology (1986), vol. 118, pp. 643-659) .
  • pSUP102 plasmid pSUP102
  • the gene for chloramphenicol resistance is changed for a cassette comprising the gene for streptomycin resistance and the promoter element of the Tn5 transposon.
  • Short homologous genomic DNA segments from the host organism are ligated with the DNA sequences of Figure 1 and witr efficient regulatory elements into, for example, the EcoRI restriction site of the vector.
  • Heterologous expression of the ORFs shown m Figure 1 is performed by using bifunctional Strepomyces-Escherichia coli cosmids pKU206 and pOJ466.
  • Heterologous expression of the ORFs shown m Figure 1 is performed by using "bacterial artificial chromosomes", cosmids (for example Supercos, Stratagene GmbH, Heidelberg) and T7 ex- pression systems (Stratagene GmbH, Heidelberg; New England Bio- labs Schwalbach, FRG) .
  • Expression of recombinant enzymes occurs m Escherichia coli cells constitutively expressing phosphopan- tethemyl transferase required for the formation of holoenzyme polyketide synthetases and polypeptide synthetases .
  • the predicted ORFl gene and gene product is truncated due to cloning of the DNA into the cosmid vector.
  • 731 is the last nucleotide of the last amino-acid-encoding codon; 732-735
  • Termination codon is TGA (termination codon) . Termination codons have been excluded in the present annotation.
  • 11 bp terminal inverted repeat that is similar to 1S1131 from Agrobacteriura tume aciens (TS-66 like element, 2773 bp, 4 ORFs, 11 bp inverted repeat)

Abstract

The invention consists of: (1) cloned Sorangium cellulosum polyketide synthase (PKS) biosynthetic cluster DNA; and (2) the nucleotide sequence and predicted protein coding sequences of the cloned DNA. The invention can be used for, but not limited to: (a) increasing yields of PKS product in Sorangium cellulosum (e.g., by amplification or genetic modification of the epothilone gene cluster or its component parts); (b) increasing yields of polyketide product in a heterologous system by transfer of the epothilone gene cluster or its component parts, which may be followed by amplification or genetic modification of the PKS gene cluster or its component parts; (c) modification of the polyketide product chemical structure in either Sorangium cellulosum or a heterologous host (e.g., by genetic modification of the epothilone gene cluster or its component parts; and (d) for the detection of genes and gene products involved in making polyketides or related molecules in other organisms (e.g., by hybridization or complementation assays). DNA sequence and analysis is presented for the following cosmids and plasmids: A2 cosmid; the pEPOcos6 region (overlapping of pEPOcos6 and pEPOcos7); pEPOcos8 cosmid; A5 cosmid; Sau4 (10 kb plasmid).

Description

DNA sequences for enzymatic synthesis of polyketide or heteropolyketide compounds
The present invention relates to DNA sequences for enzymatic synthesis of polyketide or heteropolyketide compounds produced by the bacterium Sorangi um cellulosum.
Background and introduction
This patent application describes DNA sequences for the enzymatic synthesis of polyketide and/or heteropolyketide structures synthesized by the myxobacterium Sorangium cellulo- sum . Several of these compounds have known cytotoxic, immuno- suppressive, antibiotic and fungicidal biological activity, with the epothilones having been most studied and characterized. The fermentation of large quantities of secondary metabolites from microorganisms, especially from myxobacteria, is a time consuming and difficult process that often involves complications (e.g. contamination, low product yield, difficult isolation and purification) . Therefore it would be advantageous to use a well -characterized organism for such fermentations. After cloning of the desired biosynthetic genes one could create such an organism via genetic engineering and manipulate the biosynthesis of the compound. Identified sequences can be cloned into optimized expression vectors and generate recombinant cell lines that overproduce polyketide structures.
Polyketide synthases (PKS) and non-ribosomal peptide syn- thetases (NRPS) represent macromolecular and multifunctional enzymes which are characterized by a modular architecture. PKS condenses activated carbonic acids (usually acetate and propi- onate) and reduce the resulting 2-keto acid intermediates step- wise in a fatty acid biosynthesis-like fashion. Responsible for each reaction step is a specific domain that recognizes, acti- vates, condenses and reduces the carbonic acid. Depending on the presence of these domains in the corresponding modules, every reduction stage can occur in the final product (Rawlings, Nat. Prod. Reports 14, 523-556 [1997]; for a review, see Chem. Rev. 97, 2463-2760 [1997]). A typical example for the biosyn- thesis of a polyketide is the macrolide antibiotic erythromycin
(Staunton and Wilkinson, Chem. Rev. 97, 2611-2630 [1997] ) .
NRPSs are also modular enzymes and condense via peptide bonds amino acids to low molecular weight bioactive substances like bacitracin or tyrocidin. Typical domains of these systems acti- vate the amino acid and condense it with the growing peptide chain. Methylations, epimerisations and modifications via additional protein domains are possible (Stachelhaus and Marahiel, FEMS Microbiol Lett. 125, 3-14 [1995]). Both types of enzymes (NRPS and PKS) share the modular organization of the proteins in which specific catalytic domains are responsible for recognition, activation, condensation and modification of the single elongation units. The growing chain of amino acids and/or carbonic acids is extended through the action of one module adding one unit. The domains of each module carry the active centers responsible for the enzymatic steps of the biosynthesis. Little is known about the biosynthesis of biologically active polyketides and polypeptides from myxobacteπa . Fragments of the biosynthetic gene clusters of soraphen and saframycm have been described from Sorangium cellulosum So ce26 and Myxo- coccus xanthus, respectively (Schupp et al . , J. Bacteπol . 177 , 3673-3679 [1995] and Pospiech et al . , Microbiology 141, 1793- 1803 [1995] ) . We have constructed genomic libraries of the epothilone producer Sorangium cellulosum So ce90. Gene probes based on PKS and PS genes were used to isolate recomb ant cos- mids, which were then sequenced and characterized. Several unique pathways containing PKS, PS, or a combination of both types of genes were identified, demonstrating that this organism is potentially a rich source of novel bioactive compounds.
A subject of the present invention is therefore to provide DNA sequences according to claim 1 the expression products of which perform or are involved m the enzymatic biosynthesis, mutasynthesis or partial synthesis of polyketide or heteropolyketide compounds. The DNA sequences may be inserted into well known and optimized expression vectors by commmon tech- niques of molecular biology, thus allowing transformation, selection and cloning of cells, which cells are then capable of synthezis g polyketide or heteropolyketide compounds by fermentation. Using an overproducing clone allows the desired polyketide or heteropolyketide compounds be easily produced and recovered m high amounts. Further, knowledge of the localization of regulatory DNA segments and individual structural genes allows "site-directed mutagenesis" using common techniques for genetic engineering, and thus construction of optimized enzymes ("protein engineering"; for fermentative synthesis of polyketi- de or heteropolyketide compounds. The invention thus further relates to a recomb ant expression vector according to claim 16, cells transformed therewith according to claim 17 and to a process for enzymatic biosynthesis, mutasynthesis or partial synthesis of polyketide or heteropolyketide compounds according to claim 23.
Preferred and/or advantageous embodiments of the present invention are subject-matter of the subclaims.
In brief, the invention consists of (1) cloned Sorangium cellulosum polyketide synthase (PKS) and/or peptide synthetase (PS) biosynthetic cluster DNA and (2) the nucleotide sequence and predicted protein coding sequences of the cloned DNA. The invention can be used for, but not limited to, (a) increasing yields of PKS product Sorangium cellulosum (e.g., by amplification or genetic modification of the epothilone gene cluster or its component parts) , (b) increasing yields of polyketide and/or peptide synthetase product a heterologous system by transfer of the corresponding gene cluster or its component parts, which may De followed by amplification or genetic modification of the PKS and/or PS gene cluster or its component parts, (c) modification of the polyketide and/or peptide synthetase product chemical structure either Sorangium cellulosum or a heterologous host (e.g., by genetic modification of the corresponding gene cluster or its component parts) and (d) for the detection of genes and gene products involved making polyketides or related molecules m other organisms (e.g., by hybridization or complementation assays) . DNA sequence and analysis is presented for the following cosmids and plasmids : A2 cosmid as defined claim 6 the pEPOcos6 region (overlapping of pEPOcosβ and pEPOcos7) as defined m claim 7 pEPOcosδ cosmid as defined in claim 10
A5 cosmid as defined in claim 12
Sau4 (10 kb plasmid) as defined in claim 14
The invention is now described in more detail by examples and for illustration only. The examples are not to be construed as any limitation of the scope.
Figure 1 is a restriction map of one of the DNA sequences of the present invention (cosmid A2 insert) indicating also the localization of regulatory DNA segments and the individual structural genes ("open reading frames" or ORFs) 1 to 16.
Figure 2 shows the open reading frames found on pEPOcosδ region
DNA sequence data from A2 cosmid are as defined in claim 6.
Table 1 correlates ORFs 1 to 16 found on A2 cosmid with the respective biological function (Regulators, Enzymes) .
Table 1
Figure imgf000008_0001
Working Examples
A. Construction of a Sorangium cellulosum cosmid library
1. Isolation of genomic DNA from S . cellulosum So ce90
a. Sorangium cell ulosum So ce90 was spread onto solid CA-2 agar and incubated at 30°C for 5-7 days. CA-2 agar is prepared by autoclaving 18 g Bacto-agar (Difco Laboratories, Detroit, MI) m 800 ml dH20 for 20 min at 121°C and cooling to 50-55°C in a water bath. The following filter-sterilized solutions are added to the agar: 20% (w/v) glucose, 50 ml; Solution A (7.5%
[w/v] KN03,7.5% K2HP04) , 10 ml; Solution B (1.5% [w/v]
MgS04'7H20) , 10 ml; Solution C (0.2% [w/v] CaCl2'2H20, 0.15% [w/v] FeCl3) , 10 ml; 1 M HCl, 1 ml; autoclaved 4-day old Sorangium cellulosum broth, 100 ml. A sample of cells was removed from the plates with a sterile loop and inoculated into 50 ml of G51t medium in a 250 ml Erlenmeyer flask. G51t consists of 0.5% starch (Cerestar) , 0.2% tryptone, 0.1% yeast extract, 0.05% CaCl2, 0.05% MgSO4-7H20, 1.2% 4 - (2 -hydroxyethyl ) - 1-piperazine- ethanesulfonic acid (HEPES) , 0.2% glucose, pH 7.6. The flasks were shaken at 30°C, 160 rpm until a dense orange bacterial growth was obtained (ca. 5-7 d.) . The cells were pelleted by centrifugation at 6,000 x g and used immediately or stored fro- zen at -20°C.
The protocol used for isolating chromosomal DNA from bacteria using hexadecyltrimethylammmonium bromide (CTAB) has been described previously (A subel et al . , Current Protocols in Molecular Biology, John Wiley and Sons, New York, 1990) . The pre- cip tated DNA was recovered with a bent Pasteur pipette, washed with 70% and 95% ethanol , air-dried, and resuspended 0.5 ml TE buffer (0.01 M Tris-HCl, 0.001 M ethylenediammotetraacetic
Figure imgf000010_0001
b. Alternatively, genomic DNA was isolated from S . cellulosum cells cultured as described m section A.l using the Midi Qiagen Blood & Cell Culture DNA purification Kit (Qiagen, Hilden, Germany) following the Qiagen Genomic DNA Handbook protocol for bacterial DNA isolation (1997, Qiagen, Hilden, Ger- many, p. 29 ff.) . In order to obtain high molecular weight chromosomal DNA the precipitated DNA was recovered with a bent pasteur pipette as described m section A.l.
2. Isolation of plasmid DNA
a. pFD666: pFD666 is a bifunctional E. coli -Streptomyces cosmid cloning vector (see Denis and Brzezmski, Gene 111 , 115-118 [1992]) . To maintain stability of large inserts, it is present low-medium copy number when replicated E . coli . For this reason, isolation of sufficient pure DNA to carry out cloning experiments was difficult using commercial kits with standard protocols. A modified procedure was therefore used to obtain pFD666 DNA. A 10 ml culture of DH10B (pFD666) was grown for 16- 20 hr at 37°C m LB (1% tryptone, 0.5% yeast extract, 0.5% NaCl , pH 7.0) medium containing 50 μg/ml kanamycm sulfate. Fifty ml of LB + kanamycm was inoculated to a starting OD60o of ca 0.25 and shaken at 300 rpm, 37°C, until the OD60o reached ca . 0.6. Five hundred ml of LB + kanamycm medium a 2 1 flask was inoculated with 25 ml of this culture and incubated under the same conditions for 2.5 hr . Chloramphenicol ( 2.5 ml of a 34 mg/ml solution 100% EtOH) was added and the incubation continued for an additional 16-20 hr . (The previous steps were performed according to Maniatis et al . Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989.) Cells were pelleted for 10 mm, 16,000 x g . They were resuspended in 9 ml of 50 mM glucose/25 mM Tris-HCl (pH 8.0) /10 mM EDTA and transferred to a 50 ml disposable centrifuge tube. One ml of a freshly-prepared 10 mg/ml lysozyme solution m 10 mM Tris-HCl, pH 8.0 was added and the cell suspension incubated m a 37°C water bath for 10 mm. Twenty ml of a freshly-prepared 0.2 NaOH/1% sodium dodecyl sulfate (SDS) solution was added and the tube inverted gently 5-7 times to mix the contents. After 5 mm at room temperature, 15 ml of 5 M po- tassium actate (pH 4.8) was added and the tube inverted sharply 3-4 times. The tube was centrifuged at 6,000 x g for 10 mm at 4°C and the supernatant poured though 2 layers of sterile cheese cloth into a fresh 50 ml disposable tube. Isopropanol to a final concentration of 0.6% was added and the contents of the tube mixed several times. The precipitated nucleic acid was centrifuged at 6,000 x g for 10 mm at 4°C. The pellet was washed with 70% EtOH and any excess EtOH was aspirated from the pellet, which was allowed to air dry for 5 mm. It was resuspended m 5 ml of 50 mM 3 - (N-Morpholmo) propanesulfonic acid (MOPS)/750 mM NaCl , pH 7.0 and added to an equilibrated to QIAfilter Midi column (Qiagen, Chatsworth, CA) . The manufacturer's protocol for washing and eluting the plasmid DNA was followed. b. SuperCos : SuperCos plasmid DNA was purchased from Strata- gene (La Jolla, CA) .
3. Preparation of ca. 38-47 kb Sau3Al fragments of S . cellulo- sum chromosomal DNA
a. S . cellulosum chromosomal DNA prepared as described m section A.l. a was partially cleaved with restriction endonuclease Sau3Al m a 1000 μl reaction volume consisting of 50 μg chromo- so al DNA, 5 units enzyme (Pro ega, Madison , WI), 0.006 M Tris-HCl, 0.006 M MgCl2/ 3.10 M NaCl , and 0.001 M dithiothrei- tol (pH 7.5) for 5 mm at 37°C. The reaction mixture was extracted once with an equal volume of 1:1 phenol : chloroform. After centrifugation, tr.e upper aqueous phase was saved, to which 0.1 vol. of 3 M sodium acetate and 0.6 vol. isopropanol was added. DNA was pelleted by centrifugation for 5 mm at 16,000 x g m a microfuge and washed once with 0.5 ml 70% EtOH. After drying m a SpeeaVac (Savant Instruments, Farmmgdale, NY) for 5 mm, the pellet was resuspended m 0.1 ml TE buffer. The DNA was layered ontcp of a 12 ml 10-40% sucrose gradient prepared m TE buffer ana centrifuged at 113,600 x g for 16 hr, 10°C using a Beckman SW40Tι rotor (Beckman Instruments, Palo Alto, CA) . Five hundred μl aliquots of the gradient were removed using a pipetor beginning at the top of the tube. Samples (5 μl) of the fractions were analyzed by electrophoresis through a 0.5% agarose gel m TAE buffer (0.04 M Tπzma base, 0.02 M acetic acid, ana 0.001 M EDTA, pH 8.3) containing 0.5 μg/ml ethidium bromide far 6 hr at 100 V. Fractions containing DNA fragments of ca. 4C--5 kb were identified by comparison to a nigh molecular wei nt DNA standard (Life Technologies, Gaithersburg, MD) . Sucrose was diluted from the corresponding 0.5 ml fraction by addition of 0.5 vol. TE . Subsequently, DNA was precipitated by addition of 0.1 vol. 3 M sodium acetate and 0.6 vol. isopropanol . DNA was pelleted by centrifugation at 16,000 x g for 10 mm m a microfuge. DNA was washed with 0.5 ml 70% EtOH and dried m a SpeedVac with moderate heat for 10 mm. Finally, the DNA was resuspended m distilled H20 at a concentration of 0.5 mg/ml.
b. Alternatively, 10 μg of S. cellulosum chromosomal DNA prepared as described m A.l.b was treated with 0.3 U Sau3Al (New England Biolabs, Beverly, MA) for 1 h at 37°C 400 μl of the supplier's recommended reaction buffer. Formation of DNA fragments of about 40 kb m size was checked by comparison of the motility behavior with high molecular weight DNA standards after a 0.3% agarose gel electrophoresis . An equal volume of phenol : chloroform (1:1) was added, mixed and centrifuged. The upper aqueous phase was recovered and 0.1 vol. of 3 M sodium acetate and 0.6 vol. of isopropanol were added. After centrifuga- tion, the precipitated DNA was washed twice with 0.5 ml 70% ice cold ethanol and finally air-dried. The DNA fragments were re- suspended m 100 μl shrimp alkaline phosphatase reaction buffer and dephosphorylated for 150 min. at 37°C using 2 U shrimp alkaline phosphatase (Amersham Life Science, Cleveland, OH) . A phenol : chloroform extraction followed as described above. Finally, the DNA was precipitated by addition of 0.1 vol . 3 M so- dιu~ acetate and 0.6 vol. isopropanol, dried, and dissolved m TE suffer. 4. Preparation of cosmid libraries
a. Using pFD666: Vector pFD666 was cleaved with restriction endonuclease BamHI m a 0.02 ml reaction volume consisting of 2 μg plasmid DNA, 10 units of BamHI (Promega), 0.006 M Tris-HCl, 0.006 M MgCl2, 0.05 M NaCl , and 0.001 M dithiothreitol (pH 7.5) for 90 mm at 37°C. Five μl of lOx alkaline phosphatase buffer (0.5 M Tris-HCl [pH 9.3], 0.01 M MgCl2, 0.001 M ZnCl2, 0.01 M spermidme) was added to the reaction followed by alkaline phosphatase (0.01 units/pmol ends; Promega) and distilled H20 to a final volume of 0.05 ml. The sample was incubated for 30 mm at 37°C and a second aliquot of phosphatase was added. After a further 30 mm at 37°C, 0.3 ml of stop buffer (0.01 M Tris-HCl [pH 7.5], 0.001 M EDTA, 0.2 M NaCl , 0.5% SDS) and 0.35 ml of 1 : 1 phenol; CHCl3 was added to the reaction. The sample was mixed gently several times by inversion and centrifuged at 16,000 x g for 3 mm to separate the phases. The aqueous layer was removed to a new microfuge tube. 0.1 vol. 3 M sodium acetate and 2 vol. 100% EtOH were added and the precipitated DNA pelleted by centrifugation at 16,000 x g for 10 mm. Liquid was removed by aspiration and the pellet washed once with 0.5 ml 70% EtOH. The DNA was dried m a SpeedVac and resuspended m TE buffer to 0.5 mg/ml .
Digested, phosphatase-treated pFD666 was ligated to the partially-cleaved chromosomal DNA (see sections A.3. a and B.l.a) m a 0.005 ml reaction consisting of 1 μg pFD666, 1 μg
5. cellulosum DNA, 0.03 M Tris-HCl (pH 7.8), 0.01 M MgCl2, 0.01 M dithiothreitol, and 0.0005 M adenosme-5 ' -tπphosphate and 1.5 Weiss units of T4 DNA ligase (Promega) . The reaction was carried out at room temperature for 2 hr . The entire reaction mix was packaged into bacteriophage λ in vi tro using Packagene extracts (Promega) according to the manufacturer's directions. The entire packaging reaction (0.5 ml) was diluted with 4.5 ml SM buffer (per liter: 5.8 g NaCl, 2 g MgSO4.7H20, 1 M Tπs- HCl [pH 7.5] , 5 ml 2% gelatin solution) . Transfection was performed by adding 10 ml cf an overnight culture of E. coli DH5α that had been grown m LB medium with 0.01 M MgS04 and 0.2% maltose to the diluted pnage and incubating at 37°C for 20 mm. 0.8 ml of LB was added and the cells shaken at 225 rpm for 1 hr at 37°C. Cells were pelleted, resuspended m LB, and spread onto a 150 mm LB + kanamycm agar plate. After 3 d. at 30°C, the colonies were harvested by picking ca . 800 colonies into 2.0 ml LB + kanamycm medium containing 20% glycerol , freezing on dry ice, and storing at -70°C. In addition, six kanamycm- resistant colonies were inoculated into 2 ml LB + kanamycm liquid medium and incubated at 37°C, 250 rpm, for 18-24 hr . Cosmid DNA was prepared using a standard alkaline lysis procedure starting with 1.5 ml of the culture. DNA was digested with restriction endonuclease Pstl and samples electrophoresed on a 0.8% TAE agarose gel for 1.5 hr at 100 V. A unique restriction pattern was noted m eac sample and the total size of the insert was calculated to be between 40 and 45 kilobases.
b. Using SuperCos: 30 μg of vector SuperCos was digested with Xbal (New England Biolacs, Beverly, MA) for 210 mm at 37°C m 100 μl of the recommenαed reaction buffer. Ten μl sodium acetate and 60 μl isopropanol was added before the solution was centrifuged for 30 mm at 16,000 x g. The precipitated DNA was washed twice with 500 μ_ ice cold 70% ethanol . The vector DNA was precipitated and air-dried, dissolved m 135 μl shrimp al- kalme phosphatase reaction buffer and treated with 2.5 U shrimp alkaline phosphatase for 150 mm. After heat mactiva- tion of the enzyme at 75°C for 20 mm, a phenol : chloroform extraction was performed as described m section 1. c. The DNA, resuspended m 100 μl BamHI restriction buffer was hydrolyzed with 15 U BamHI (New England Biolabs, Beverly, MA) for 180 mm. A phenol : chloroform extraction followed (see section A.3) . The SuperCos DNA was precipitated by additon of 0.1 vol 3 M sodium acetate and 0.6 vol isopropanol, centrifuged at 16,000 x g, and resuspended m 50 μl TE Differ.
Four μg of digesteα vector DNA was ligated with 10 μg partially hydrolyzed genomic DNA from S . cellulosum (as described m section A.3.b) m a final volume of 20 μl using 2 U T4 DNA ligase and the appropriate reaction buffer (Gibco BRL, Eggen- stem, Germany) . The reaction was carried out at 16°C overnight. The reaction mixture was packaged into phage particles using the Gigapack III XL packaging extract kit (Stratagene) according to the manufacture's protocol. Treatment of packaging reaction mixture and transfection of E . coli SURE (Stratagene) was performed as described m 4. a. Transfected cells were concentrated by centrifugation, resuspended m fresh LB medium and distributed on LB agar plates containing 50 μg/ml 1kanamycm. The plates were incubated overnight at 30°C. 1600 recombmant clones were transferred into 96 well microtiter plates filled with 80 μl LB medium containing 50 μg/ml kanamycm per well and propagated overnight at 30°C. The following day the microtiter plates were used to inoculate a second set of microtiter plates m order to obtain a duplicate of the recombmant clones. Each well of the original set of microtiter plates was supplemented with 80 μl 50 % glycero_ and the entire plate stored at -70°C. 20 randomly chosen transformants were inoculated into 3 ml LB medium with 50 μg/ml"1 kanamycm and incubated over night at 37°C m order to isolate plasmid DNA using the Qiagen plasmid extraction kit (Qiagen, Hilden, Germany) . Restriction fragment analysis of the recombmant cosmids using the restriction endo- nucleases Pstl and Bglll indicated that the cosmids contained inserts of approximately 35 to 42 kb m size.
B. Construction of a S . cellulosum plasmid library
1. Preparation of 8-12 kb fragments of S . cellulosum chromosomal DNA.
S . cellulosum chromosomal DNA prepared as described m sec- tion A.l. a was partially cleaved with restriction endonuclease Sau3Al m a 100 μL reaction volume consisting of 5 μg chromosomal DNA, 5 units enzyme (Promega, Madison , WI) , 0.006 M Tris- HCl, 0.006 M MgCl2/ 0.10 M NaCl, and 0.001 M dithiothreitol (pH 7.5) for 4 mm at 37°C. The digested DNA was electrophoresed through a 11 x 14 cm 0.8% TAE-agarose gel for 18 hr at 17 V. Fragments of 8-12 kb were cut from the gel and purified using the QIAquick Gel Extraction Kit using the manufacturer's protocol (Qiagen) .
2. Preparation of the plasmid library
Plasmid pZero2.1 (Invitrogen, Carlsbad, CA) was cleaved with restriction endonuclease BamHI m a 0.02 ml reaction volume consisting of 1 μg plasmid DNA, 10 units of BamHI (Promega) , 0.006 M Tris-HCl, 0.006 M MgCl2, 0.05 M NaCl, and 0.001 M dl- thiothreitol ( pH 7.5) for 20 mm at 37°C. 0.08 ml of dH20 and 0.1 ml of 1:1 phenol :CHC13 was added. The sample was briefly vortexed and centrifuged at 16,000 x g for 2 mm. The aqueous layer was removed to a new microfuge tube. 0.1 vol. 3 M sodium acetate and 2 vol. 100% EtOH were added and the precipitated DNA pelleted by centrifugation at 16,000 x g for 10 mm. Liquid was removed by aspiration and the pellet washed once with 0.5 ml 70% EtOH. The DNA was dried m a SpeedVac and resuspended m TE buffer to 0.004 μg/ml. Digested pZero2.1 was ligated to the partially-cleaved chromosomal DNA m a 0.01 ml reaction consisting of 0.004 μg pZero2.1 , 0.05 μg S . cellulosum DNA, 0.03 M Tris-HCl (pH 7.8), 0.01 M MgCl2, 0.01 M dithiothreitol, and 0.0005 M adenosme-5 ' -triphosphate and 1.5 Weiss units of T4 DNA ligase (Promega) . The reaction was carried out at room tem- perature for 2 hr. 0.015 ml dH20 and 0.25 ml of 1-butanol were added, the sample vortexed briefly, and centrifuged at 16,000 x g for 10 mm. Liquid was aspirated away from the pellet and the sample dried m a SpeedVac for 5 mm. The ligated DNA was resuspended m 0.005 ml dH20 and mixed with 0.04 ml of electro- competent Escherichia coli DH10B cells (GIBCO/BRL, Gaithers- burg, MD) . The sample was placed into a pre-chilled 0.2 mm-gap electroporation cuvette and transformed into the bacteria by electroporation using a BioRad Gene Pulser II unit (BioRad, Hercules, CA) at 25 μF and 200 Ω. 0.96 ml SOC medium (0.5% yeast extract, 2% tryptone, 10 mM NaCl, 2.5 mM KCl , 10 mM MgCl2, 20 mM MgS04, 20 mM glucose) was mixed with the cells and transferred to a 1.5 ml microfuge tube. The sample was incubated at 37°C, 225 rpm, for 1 hr . Aliquots of the cells were spread onto an LB agar - kanamycm and incubated at 37°C for 20 hr to estimate the number of transformants obtained. Six kana- mycm resistant colonies were confirmed to contain an insert of the expected size as described m section A.4. a.
C. Identification of cosmids possessing polyketide synthase genes
1. Colony blot hybridizations using cosmid library in pFD666:
A 20 x 20 cm sheet of Duralon UN membrane (Stratagene) was placed on top of a 24.5 x 24.5 cm square bioassay dish containing 250 ml LB agar - kanamycm. An aliquot of the frozen cosmid library m 1 ml LΞ medium was spread on the filter. The plate was incubated at 37"C for 24 hr . Colonies were replicated onto two fresh filters which were placed onto LB + kanamycm agar medium and incubated at 28°C for 18 hr. Lysis of cells and neutralization of released DΝA was performed according to di- rections that were provided with the filters. The DΝA was crosslmked to the filters using a UV Stratalmker 2400 unit (Stratagene) m the autc crosslink mode. Cell debris was removed by placing the filters m a container with a solution of 3 X SSC (20 X SSC contains, per liter, 173.5 g ΝaCl , 88.2 g so- dium citrate, pH adjusteα to 7.0 with 10 Ν ΝaOH) , 0.1% SDS and rubbing the lysed colonies with a Kimwipe . The filters were then incubated at least 3 hr with the same wash solution for at least 3 hr at 65°C. The plasmid library was treated similarly except cells were spreaα onto a 137 mm circular Duralon UN mem- brane placed on top of a 150 mm petri dish containing 80 ml LB agar + kanamycm.
For hybridizations, a probe consisting of a 650-base pair (bp) polymerase chain C?CR) fragment representing a portion of a S . cell ulosum polyketide synthase gene was used. The fragment was amplified using primers to consensus regions of Type I (macrolide) polyketide synthase (PKS) genes (Swan et al . , Mol . Gen. Genetics 242 , 358-362 [1994]). A series of sense and anti- sense oligonucleotides were prepared for PCR studies as indicated in the following table 2 :
Table 2
Figure imgf000020_0001
The selection of C or G where necessary in the third position of a codon reflects the very high overall G + C content of S . cellulosum (ca. 70%) . Conditions for PCR were as follows: 0.01 M Tris-HCl (pH 9.0), 0.C5 M KCl , 0.003 M MgCl2, 0.1% Triton X- 10C, 200 μM of each primer, 2.5 U Taq DNA polymerase (Promega), 5.0% dimethyl sulfoxide 'Sigma), and 0.01 μg of S . cellulosum chrc osomal DNA in a 0.05 ml reaction volume. Reactions were carried out m a Perkm-Elmer Model 480 Thermocycler (Perkm- Elmer Corporation, Foster City, CA) under the following conditions: 94°C, 1 mm; 50°C, 1 mm, 72°C, 1.5 mm for a total of 30 cycles. Each possible combination of sense and anti-sense pπm- ers were tried. A 650-bp and 350-bp fragment was amplified using oligos 120 + 124 and 123 +124, respectively. The sequence of the fragments were determined using the ALFexpress AutoRead kit to fluorescently label the DNA, which was analyzed on an ALFexpress sequencing apparatus (Pharmacia) . The data indicated both PCR fragments possessed significant homology to polyketide synthase genes of Type I antibiotics. The 650-bp fragment was chosen for hybridization experiments.
The fragment was labeled with 32P-dCTP using the NEBlot kit (New England Biolabs, Beverly, MA) and purified on a Bio-Spm 6 column (BioRad, Hercules, CA. ) . Duplicate blots were pre- hybndized 3 X SSC (1 X SSC contains 0.15 M sodium chloride and 0.015 M sodium citrate, pH 7.0) , 4 X Denhardt ' s solution
(100 X is 2% Ficoll [Type 400] , 2% polyvmylpyrrolidone , and 2% bovine serum albumin [Fraction V] ) , and 100 μg/ml sheared, de- natαred salmon sperm DNA; all reagents purchased from Sigma Chemicals, St. Louis. The labeled DNA was heated m a boiling water bath for 5 mm to denature the strands, cooled on ice, ana added to the pre-hybπdization solution. The filters were incubated for at least 18 hr m a roller bottle hybridization oven. They were transferred to new bottle, then washed two times m 2 X SSC, 0.1% SDS at 70°C for 30 mm (moderate stringency) . The membranes were placed on Whatman 3MM paper to remove excess liquid, covered with Saran Wrap, and exposed to autoradiography film (Kodak X-OMAT LS) with two intensifying screens. The cassette was placed at -70°C and developed at appropriate intervals.
Approximately 100 colonies were seen to have hybridized on the duplicate filters. Fourteen of these were isolated from the master plate and grown 4 ml LB + kanamycin medium for 20-24 hr, 37°C, 250 rpm. Plasmid DNA was prepared using the standard alkaline lysis method and digested with restriction endonuclease Pstl. The digested DNA was electrophoresed on a 0.8% aga- rose gel m TAE for 3 hr at 100 V. Fragments were transferred to Duralon UV using the VacuGene XL vacuum blotting unit (Pharmacia) and the recommended alkaline denaturation protocol . Hybridization with radioactively-labeled PCR fragment and washing were carried out as described above. Two prominent types of cosmids were observed; one contained Pstl fragments of ca. 7.0, 5.0, and 1.1 kb (pEPOcosδ and pEP0cos7) that hybridized to the probe; the other type had fragments of ca. 6.0 and 3.6 kb
(pEPOcosδ and pEPOcosl3/ which were homologous to the probe.
Restriction analysis confirmed that cosmids showing identical hybridization patterns nad identical or overlapping inserts. PCR reactions using primers representing consensus sequences of Type I PKS genes were performed using the isolated cosmid DNA as template under conditions described above, except ca. 0.01 μg of cosmid DNA was included as template. Cosmids pEPOcos6 and pEPOcosδ amplified the 650-bp fragment seen when oligonucleo- tides 120 + 124 were used, while pEPOcosδ and pEPOcosl3 supported amplification of an 1100-bp PCR fragment with oligos 122 and 124. The latter fragment was sequenced and confirmed to possess strong similarity to Type I PKS genes. These data confirm that the recombmant cosmids are related to each other and that all contain PKS-like genes. 2. Colony blot hybridizations of plasmid library in pZero2.1:
A 137 -mm circle of Duralon UN membrane was placed on top of a 150-mm containing 75 ml LB agar + kanamycm. An aliquot of the plasmid library (representing ca . 2,000 recombmant colonies) m 0.5 ml LB medium was spread on the filter. The plate was incubated at 37°C for 20 hr . Colonies were replicated onto two fresh filters which were placed onto LB + kanamycm agar medium and incubated at 37°C for 6 hr . The filters were proc- essed for hybridization as described Section C.l. Out of 8 positive colonies detected, one contained a plasmid with a DΝA region not encoded by either pEPOcosδ or pEPOcosδ . This plasmid, called Sau4, was characterized in more detail.
3. Colony blot hybridizations of cosmid library in SuperCos:
The recombmant E . coli clones from the microtiter plates (see section 4. b) were used to produce two identical sets of hybridization filters order to identify cosmids carrying PKS and PS genes . The recomc ant clones were spotted onto 2 sets of 22 x 22 cm LB agar plates containing 50 μg/ml kanamycm. Each plate contained 384 clones therefore representing 4 microtiter plates. The clones were incubated at 30°C overnight. After pre-coolmg for approximately 3 h at 4°C, 20 x 20 cm Hybond Ν+ Nylon membranes (A~ersham, Braunschweig, Germany) were placed onto the agar sur aces. After 2 mm. the membranes were removed and placed for 15 mm. on Whatman 3 MM paper (Whatman paper Ltd., Maidstone, England) soaked with denaturation solution (0.5 N NaOH, 1,5 V NaCl) before they were transfered onto Whatman 3 MM paper saturated with neutralization solution (1 M Tris-HCl, pH 7.5, 1.5 iv NaCl) . Subsequently the membranes were placed onto Whatman 3 MM paper soaked with 2 X SSC (0.3 M NaCl, 0.03 M sodium citrate, pH 7.2) for 10 min. The membranes were baked for 40 min at 85°C. Then, each membrane was overlayed with 5 ml Proteinase K solution (2 mg/ml Proteinase K in 2 x SSC) and incubated at 37 °C for 90 min. Finally, cell debris was removed by wiping the membranes with a Kimwipe pre-wetted with 2 X SSC.
As we were seeking in particular to identify biosynthetic pathways containing both PKS and PS genes, the following hy- bridization strategy was taken: The screening was initially focused on ketosynthase domains from type I PKSs and on the ade- nylation domain from PSs . Target-specific primers were used to amplify DNA fragments of the corresponding genes from chromosomal DNA of S . cellulosum by PCR. The fragments obtained were then cloned, sequenced and the deduced amino acid sequence compared to known ketosynthase and adenylation domains of PKS and PS, respectively. In a second step these PCR fragments were used as gene probes to detect recombinant cosmids of the S . cellulosum cosmid library. Oligonucleotides based on conserved amino acid sequences of ketosynthase domains from various type I PKS were optimized for myxobacterial DNA by comparison to a known myxobacterial biosynthetic gene cluster (Schupp et al . , J. Bacteriol . 177, 3673-3679 [1995]) resulting in primer KSlUp (5'-
C/A) GIGA (A/G) GCI (A/C/T) (A/T) I (C/G) (C/A) IATGGA (C/T) CCICA (A/G) CAI (A/OG-3') and
KSD1 (5'-GG(A/G)TCICCIA(A/G) I (G/C) (T/A) IGTICCIGTICC (A/G) TG-3 ' ) . PCR-primers TGD (5'- T(A/T) (C/T) CGIACIGGIGA(C/T) (C/T) (G/T) IG(G/T) ICG-3') and LGG ( 5 ' -
A(A/T) IGA(A/G) (G/T) (G/C) ICCICCI (A/G) (A/G) (G/C) I (A/C) (A/G) AA (A/G )AA-3') directed to genes encoding adenylation modules have been de- scribed by Turgay et al . (Pept. Res. 7, 238-241 [1994]). PCR reaction mixtures with a final volume of 25 μl contained 0.1 μg template DNA, 0.2 U Taq DNA-polymerase (Gibco BRL, Eggenstem, Germany), 5 μmol dNTP, 5% dimethyl sulfoxide (Sigma), 1.5 mM MgCl2, 25 pmol of each primer and the appropriate reaction buffer supplied by Gibco BRL. Chromosomal DNA of S . cellulosum was used as template. Additionally, chromosomal DNA of Myxococ - cus fulvus was used with PS primers. Reactions were carried out m an Eppendorf Mastercyler Gradient (Eppendorf, Germany) using the following conditions: denaturation 30 s at 97°C, annealing 30 s at 55°C, extension 60 s at 72°C for a total of 30 cycles. The formation of ca . 700 bp fragments using the KS primers and of ca. 350 bp fragments with the PS primers were confirmed by 0.8% agarose gel electrophoresis . Fragments of independent PCR reactions were ligated into vector pCR2.1T0P0 using the TOPO TA Cloning kit (Invitrogen, Leek, The Netherlands) according to the manufacturer's protocol and transformed into E . coli XL1- Blue . Sequencing of the resulting plasmids and analysis of the deduced ammo acid sequence revealed three different KS fragments, designated pM008.4, pM008.6, pM008.7, one PS fragment (pAPsl) corresponding to S . cellulosum and one PS fragment (pDPsl) obtained with chromosomal DNA of M. fulvus . The PCR fragments were re- isolated by digestion with EcoRI from the plasmids pM008.4, pM008.6, and pM008.7, laoeled, pooled and used as gene probes m hybridization experiments as described below. The same procedure was performed with the PS fragments of pAPsl and pDPsl.
Hybridization with PKS and PS specific DNA probes (see above) was carried out using the DIG nonradioactive labeling and detection kit (Boehrmger Mannheim, Germany) and performed according to the supplier's manual using buffer containing 50% formamide . The membranes were hybridized m plastic bags containing approx. 10 ml of hybridization solution at 39°C overnight. Unspecific binding of probes was removed by 2 wash steps with 2 x SSC, 0.1% SDS at room temperature for 20 mm. and one stringent wash step with 0.5 x SSC, 0.1% SDS at 60°C for 20 mm. Detection of hybridizing DNA fragments was performed with the above mentioned system according to the manufacturer's protocol using CSPD as chemilummescent substrate. The signals were recorded by exposure of the treated membrane to Hyperfilm ECL (Amersham Life Science, Little Chalfont, England) which was developed m appropriate time intervals.
71 signals were detected with the PKS specific gene probe. On the duplicate filters 35 signals were obtained with the PS specific gene probe of which 7 were already known from the PKS hybridization experiment. These recombmant cosmids harbored PKS- and PS-encodmg genes. In order to corroborate these results PCR experiments were performed with DNA of the 7 recombi- nant cosmids as template and PKS (KSlUp, KSD1) and PS specific primers (TGD, LGG) generating fragments of the expected size of approx. 700 bp and 350 bp, respectively (primers and reaction conditions see above) .
A comparison of the restriction fragment patterns of the DNA from the 7 recombmant cosmids carrying PKS and PS genes digested by BamHI facilitated an arrangement of the cosmids m 3 groups. They were represented by cosmids designated A2 and A5. The remaining group was represented by pEPOcosδ . Therefore, A2 and A5 represented good candidates for further DNA sequence analysis because they carry both PKS and PS genes.
D. Random "shotgun" sequencing of recombinant cosmids and plasmids
1. Library construction a. pEPOcosδ, pEPOcosδ, A5 , and Sau4 : pEPOcoso and pEPOcos7 were sequenced to completion, and contiguous sequence data and analysis for these overlapping cosmids is presented below for the "cos6 region" (cf . claims 7 and 9) . Sequencing of cosmid A5 , pEPOcosδ and plasmid Sau4 was taken to the point of large contiguous sequences (ccntigs) representing the S . cellulosum insert; sequence and analysis presented below (cf. claims 10 to 15) .
Randomly sheared l oraries were constructed for cosmids and plasmids of interest using a protocol similar to that of of Fleischmann et al . , 19SΞ (Science 269, 496) and modified m Fraser et al . , 1995 (Science 370 , 397). Briefly, Qiagen-column purified cosmid DNA (~1C μg) was sheared to a size of approximately 2 kb and the DNA end-repaired using BAL31 nuclease. The DNA was gel -purified after electrophoresis through a 0.75% low- melting temperature agarose gel containing 0.5 μg/ml ethidium bromide m IX TAE buffer run at δO V for 2 hours. The volume of the low-melt agarose gel slice was estimated by adding the gel slice to a microfuge tune and weighing, then 0.1 vol. of 3 M sodium acetate (pH 7) was added and the agarose incubated at 60°C. Tne temperature .--as equilibrated to 37°C, and DNA ex- tracted twice using an equal volume of buffered phenol (Life Technologies) . The aqueous phase was transferred and extracted once with an equal volume of chloroform, then ethanol preci- pated by the addition of 2 vol. cold 100% ethanol. DNA was con- centrated by spinning at 16,000 x g m a microcentrifuge. The DNA pellet was washed with 1 ml 70% ethanol and resuspended m 100 μl of 0. IX TE. The DNA was ligated to Smal -digested, phos- phatase-treated pUClδ vector (Pharmacia) , and single insert re- combmants isolated by gel-puπfication of the band containing vector plus a single insert, followed by T4 polymerase polishing, and a final intramolecular ligation of the vector-plus- s gle-msert DNA. This final ligation represents a library of highly random ca . 2 kb fragments that was used for shotgun sequencing of the ca . 40 kb cosmids or ca. 10 kb plasmids.
b. Cosmid A2 : Cosmid DNA with inserts of S . cellulosum was isolated by an alkaline lysis procedure and purified with Ma- cherey Nagel columns (Machery und Nagel GmbH und CoKG, Dύren, Germany) using manufacturer's recommendation. Purified Cosmid DNA was sonicated, end-repaired using T4 DNA Polymerase (Boe- hrmger Mannheim, Germany) . After gel -purification fragments of a size of approximately 2 kb were ligated into Smal -digested, phcsphatase-treated pTZlδR vector (Pharmacia) . The ligation represents a library of highly random ca. 2 kb fragments that was used for shotgun sequencing of the ca . 40 kb cosmid.
2. Sequencing and assembly a. pEPOcosβ, pEPOcosδ, Sau4, and A5 : DNA (1 μl of 100 μl total m the library) was transformed into E . coli by electro- poration (20 μl of Electromax DH10B cells from Life Technolo- gies) and cells spread onto LB plates containing 50 μg/ml ampi- cillin. After growth overnight at 37 °C, transformants (ca. 300- 3000 CFU total) were tranfered to 96-well growth blocks and shaken overnight at 37°C in 1.3 ml LB medium with 50 μg/ml am- picillin. Templates were prepared from these cells by an alkaline lysis procedure (Qiagen QiaQuick Turbo Prep) to yield purified, double-stranded plasmid DNA. Cycle-sequencing of the plasmid templates was performed using universal forward and reverse primers and BigDye Terminator sequencing kits (Applied Biosystems) , using the manufacturer's recommendations, then resolved using an ABI377 automated sequencer. Sequences were edited using Phred, then assembled into larger contiguous sequences using Phrap (Phil Green, University of Washington, St. Louis, MO) . b. Cosmid A2 : DNA (1 μl of 20 μl total m the ligation) was transformed into E. coli DH10B by electroporation and cells were spread onto LB agar medium containing 50 mg/ml ampicillm. After growth for lδ hr at 37°C, transformants were transferred to 96-well growth blocks and shaken overnight at 37°C m 1.3 ml 2x YT medium with 50 mg/ml ampicillm. Templates were prepared from these cells by an alkaline lysis procedure (Qiagen Qia- quick Turbo Prep) to yield purified, double-stranded plasmid DNA. Cycle-sequencing of the plasmid templates was performed using universal forward and reverse primers and Big Dye Termi- nator sequencing kits (PEBiosystems) or Thermo Sequenase fluorescent labelled primer cycle sequencing kit (Amersham Pharmacia Biotech) using the manufacturer's protocols. In the shotgun phase of a cosmid, identical amounts of samples were sequenced either by dye-primer or dye-terminator chemistries (Pharmacia, PE Biosystems) . Data were collected using Licor and ABI 377 automated sequencers and assembled with the GAP4 program (Bon- field, Smith, Staden, Nucl . Acids Res. 23, 4992-4999 [1995]). Gaps were closed using custom made primers (MWG-Biotech) on plasmid templates or PCR products m combination with dye- terminators .
E. Bioinformatic Methods
1. Open reading frame (ORF) identification ORFs were identified the pEPOcosδ region using the OMIGA 1.1.2 (GCG 0.4D) program from Oxford Molecular Limited. Default values were used (Stanαard genetic code, all ORFs over 50 bases) to generate ORFs; analysis of these results lead to the list of 14 highest quality ORFs as defined m claim 9. Other ORFs, genes, or genetic elements may be found m the pEPOcosδ insert that have not yet been annotated. In addition to hand- editing of the OMIGA-generated data, the MAGPIE automated genome analysis tool : (htt : //genomes . rockefeller . edu/magp e/magpie . html )
was used to identify genes for all the sequenced cosmids and plasmids. ORFs identified m this manner are presented as both nucleotide and peptide files below.
For cosmids A2 and A5 , ORFs have been identified within the DNA sequences of A5 (contigs 10, 11, 12) and of A2 using the FramePlot analysis program from Ishikawa and Hotta (FEMS
Microbiol. Lett., 174, 251-253 [1999] public available under
[http://www.nih.go.jp/~ un/cgi-bm/frameplot.pl] which is based on positional base preference codons typical for organisms having genomes with a high G + C content (Bibb et al . , Gene 30, 157-166 [1984] ) . Default parameters using ATG and GTG as start codons were used. The deduced amino acid sequence of predicted ORFs were compared with protein databases (GenBank, CDS translations, PDB, SwissProt, PIR, PRF) using BLASTP (Altschul et al., Nucleic Acids Res., 25, 3389-3402 [1997]). Additionally, high scoring amino acid sequences were analyzed using the Pfam program [http://www.sanger.ac.uk/Software/Pfam/], which identified specific domain structures of the submitted proteins (Bateman et al . Nucleic Acids Res., 27, 260-262 [1999]).
2. BLAST searches
BLASTP2 similarity searches were performed using the peptide files from the above ORF identification strategy as query sequences. Searches were performed using the m-house Biomfor- matics BLASTP2 (Version: BLASTP 2.0al9MP-WashU) web page at the Bristol-Myers Squibb Pharmaceutical Research Institute (allows BlastN2, BlastP2, BlastX2 , TblastN, and TBlastX searches). In addition, peptide files generated by the MAGPIE analysis were automatically searched using a FASTA algorithm.
3. Best match and probable identification
Analysis of the BLAS P2 and FASTA output led to an assignment of a best match and probable function. The best match was usually the top scoring match, although sometimes another match was given because it was a more relevant homolog, or no match was found with a significance greater than >e-4. Probable func¬ tion represents the best estimate of function given the initial analysis of the BLAST data and the published literature regarding the best match, and nay not necessarily represent the true function of the gene product (hypothetical proteins are of un- known function) . A higher probability score indicates a higher liklihood that the probable function corresponds to that of the best match; e.g., the polyketide synthase matches are all above e-100, and given the very high significance scores are presumed to function as polyketide synthases (as are the high scoring peptide synthetases) .
The following is a summary of the sequence data from the pEP0cos6 region, pEPOcosδ, A5 , Sau4 and A2.
a. Data from pEPOcosβ region:
Summary: A large PKS/PS cluster spanning multiple cosmids. An IS element (designated IS-Scl here) is found in the cluster - this may be a potential tool for genetic analysis of Soran - gi um .
Statistics: Sequence was assembled from over 2000 random sequences (forward and reverse reads of the ca . 2 kb cloned fragments derived) .
47,713 nucleotides of contiguous sequence (no pFD666 vector included)
DNA sequence data are as defined in claim 7.
Note: pEP0cos6_0RF7 sequences (cf. claim 9): the predicted N-terminus of 0RF7 shows 145 nucleotide overlap with ORF6.
Note: pEPOcos6_ORF8 sequences (cf. claim 9): >pEP0- cos6_ORF8. seq ("ORF9_up" in Fig.2) 67 . 3 % G+C
Table 3 shows ORF data summary. Note: pEPOcos6_ORFi . seq is truncated at its 5' end; correspondingly pEPOcos6_ORFl .pep is truncated at its N-termmus.
b. Data from pEPOcosδ region:
Summary: Two PKS genes found on a cosmid. A TnlOOO insertion is also found (occurred during E. coli propagation) . No peptide synthetase genes were found; one P450 hydroxylase was identified.
Statistics: 1952 random sequence reads from the pEPOcosδ library were assembled using phrap, with 1024 of the sequences assembling into 57 contigs. 12 of these contigs were chosen (totaling 56,537 bp) which each contained >6 reads and consisted of about 1000 bp or more. The sequences of these 12 contigs and the associated ORFs are given below.
DNA sequence data from contigs are as defmded claim 10. Table 4 shows more data.
pEPOcosδ protein data are as defined m claim 11, i.e. for selected ORFs (polyketide synthase, peptide synthetases, or ORFs with high similarity to known genes) . c. Data from cosmid A5 insert:
Summary: A cluster cf PKS and PS genes found on the cosmid. Other genes possibly involved in this secondary metabolite production include a downstream lipoxygenase gene higly similar to eukaryotic orthologs .
Statistics: 880 random sequence reads from the A5 library were assembled using phrap, with 530 of the sequences assem- bling into 12 contigs. 3 of these contigs were chosen (totaling 41,556 bp) which each contained >100 reads and consisted of about 9000 bp or more. The sequences of these 3 contigs and the associated ORFs are given below.
DNA sequence data from contigs are as defined in claim 12. Table 5 shows more data.
Protein sequence data from selected A5 ORFs are as defined in claim 13.
d. Data from plasmid Sau4 insert:
Summary: Insert contains PKS genes on two large contigs - most similar to the soraphen PKS gene from Sorangium.
Statistics: 565 random sequence reads from the Sau4 library were assembled using phrap, with 84 of the sequences assembling into 18 contigs. 2 of these contigs were chosen (totaling 6596 bp) which each contained >10 reads and consisted of about 1000 bp or more. The sequences of these 2 contigs and the associated ORFs are given below.
DNA sequence data from plasmid Sau4 contigs are as defined m claim 14. Table 6 shows more data.
Protein sequence data from selected plasmid Sau4 ORFs are as defined m claim 15.
e. Data from cosmid A2
Table 7 shows ORF data summary
F. Construction of suitable recombinant expression vectors
1. Expression in Myxobacteria
Heterologous expression of the ORFs shown m Figure 1 is performed by using a derivative of plasmid pSUP102 (Simon, R., Priefer, U. , Pϋhler, A., Methods m Enzymology (1986), vol. 118, pp. 643-659) . In this plasmid the gene for chloramphenicol resistance is changed for a cassette comprising the gene for streptomycin resistance and the promoter element of the Tn5 transposon. Short homologous genomic DNA segments from the host organism are ligated with the DNA sequences of Figure 1 and witr efficient regulatory elements into, for example, the EcoRI restriction site of the vector. Following amplifiction of the vectors m Escherichia coli the DNA is transfereα by electroporation of the host cells or by conjugation with Escherichia coli S17-I (Simon, R., Priefer, U. , Pύhler, A., Biotechnology (19S3), vol. 1, pp. 784-791). By means of the tetracyclme or streptomycin resistance, respectively, mediated by the vector the host cells are checked for integration of recombinant plasmid DNA into the chromosome by homologous recombination.
2. Expression in Streptomyces cells
Heterologous expression of the ORFs shown m Figure 1 is performed by using bifunctional Strepomyces-Escherichia coli cosmids pKU206 and pOJ466.
3. Expression in Escherichia coli cells
Heterologous expression of the ORFs shown m Figure 1 is performed by using "bacterial artificial chromosomes", cosmids (for example Supercos, Stratagene GmbH, Heidelberg) and T7 ex- pression systems (Stratagene GmbH, Heidelberg; New England Bio- labs Schwalbach, FRG) . Expression of recombinant enzymes occurs m Escherichia coli cells constitutively expressing phosphopan- tethemyl transferase required for the formation of holoenzyme polyketide synthetases and polypeptide synthetases .
Table 3. pEPOcosfi region gene annotation summary (continued).
GENE 5' 3' # BP #t-A M (DA) BEST MATCH (SCORE] PROBABLE IDENTIFICATION
0RF1 1 a 731 732 " 244 26218 ACDA_BACSU te-22] acyl-CoA dehydrogenase
0RF2 731 2275 1545 515 56317 AF047828 [e-94) peptide synthetase
0RF3 2301 3356 1058 352 39092 U57065 ZmaR (e-291 antibiotic resistance gene
0RF4 3412 4050 639 213 24819 {no match) hypothetical protein
0RF5 4248 12101 7854 261Θ 283282 PKSK_BACSU [e-190] polyketide synthase
0RF6 12097 14322 2226 742 80545 Z83Θ57 ppsE le-111) polyketide synthase
0RF7 1 ΠΘ"C 16451 2274 758 82007 (no match) hypothetical protein
0R 7.1'1 15866 14196 1671 557 61320 Y4HPJIHISN (e-2θ] hypothetical protein
0RF7.2d 16507 16154 354 118 13102 Y4H0 RHISN (e-13) hypothetical protein
0RF7.3d 16815 16507 309 103 11079 Y4RG_RRISN [e-13] hypothetical protein
Figure imgf000037_0001
ORFΘ 16830 17630 801 123 29633 PKSL_BACSU le-2] polyketide synthase
0RF9 17628 25244 7617 2539 279286 pir[S73015 [e-160] polyketide synthase
ORF10 25235 27877 2643 881 97101 ΛF04782B [e-111] peptide synthetase
ORF11 27867 32498 4632 1544 165084 AF091251 (e-167] polyketide synthase
0RF12 32498 39922 7425 2475 267116 AF040570 (e-223) polyketide synthase
0RF13 40031 45559 5529 1843 71258 AF091251 [e-119) polyketide synthase
ORFl3.1 45599 46018 420 140 14590 (no match) hypothetical protein
0RF1 46015 47577 1563 521 55671 PKN1 MYXXA [e-34] Serine/Threonine-Protein kinase
The predicted ORFl gene and gene product is truncated due to cloning of the DNA into the cosmid vector.
731 is the last nucleotide of the last amino-acid-encoding codon; 732-735
is TGA (termination codon) . Termination codons have been excluded in the present annotation.
c note: the predicted N-terminus of 0RF7 shows 145 nucleotide overlap with 0RF6
Figure imgf000038_0001
d note: ORFs on opposite strand - make up a transposable element from bp 16863-14130 (2733 bp,
11 bp terminal inverted repeat) that is similar to 1S1131 from Agrobacteriura tume aciens (TS-66 like element, 2773 bp, 4 ORFs, 11 bp inverted repeat)
Figure imgf000039_0001
Contig57_026 7424 6975 450 149
Contig57_027 7116 8375 1260 419 CPXK SftCER (e- Cytochrome P450 aonooxygenase
Contig57_030 9074 8337 738 245
Contig57 )31 Θ412 9080 669 222
Contig57_032 8432 9667 1236 411
Contig57_033 9724 8600 1125 374
Contig57_037 10621 9755 867 2Θ8
Contig57 ) 9 11085 10618 468 155
Contig57_040 10687 11091 405 134
Contig57_0 1 11887 10904 984 327
Contig57_043 11520 12998 1479 492 AF072709 (e-42} unknown ORF (S. lividans)
Contig57_044 13730 12018 1713 570
Contig57_047 13093 13797 705 234
Contig57_048 14064 13136 927 308 oo
Contig57_0 14371 13535 837 278
Contig57 )51 13900 14394 495 164
Contig57_052 14036 14440 405 134
Contig57_053 14678 15715 1038 345
Contig57 )56 15211 15900 690 229
Figure imgf000040_0001
u>
Figure imgf000041_0001
Table 5.
b. selected ORFs front cosmid A5
gene 3 ' ft bp fjaa best match [score ) probable identification
Contigl0_001 2861 582 2280 759 024657 saframycin (e-155) polyketide synthase
Contigl0 )06 2680 3323 444 147
Contigl0_007 3876 2902 975 324
ContiglOJJOβ 2953 3957 1005 334
Contigl0_009 4357 3299 1059 352
Cόntigl0_013 4180 6585 2406 801
ContiglO Olδ 7033 5600 1434 477
Contigl0_017 7043 6138 906 301
ContiglO Olθ 6500 7219 720 239 α
Contigl0_019 6954 7295 342 113
Contigl0_020 6982 7797 816 271
ContiglOJ21 7919 7040 780 259
ContiglO_023 7415 8029 615 204
ContiglO_024 7794 9435 1642 548
Contigl0_027 9435 Θ806 630 209
Contigll )01 770 417 354 117
Contigll_002 2526 751 1776 591 AL035640 (e-113] peptide synthetase
Contigl.l_004 1033 2787 1755 534
Contigll )05 3500 1977 1524 507
Contigll_007 3586 2501 1086 361
Cαntigll_008 2507 3814 1308 435
Contigll_011 5213 3579 1635 544 A.F047 Θ 2B syringoiaycin (e-86) peptide synthetase
Figure imgf000042_0001
Contigll_012 6459 3811 2649 882 AJ223012 [e-52] rifamycin polyketide synthase
Contigll Olδ 6511 5210 1302 433
Contigll Olθ 5551 6735 1185 394
Contigll_021 9899 6258 3642 1213 Z83857 ppsE M. tuberc [e-129] polyketide synthase
Contigll_026 17329 8090 9240 3079 AF047828 syringoiaycin [e-300+] peptide synthetase
Contigll_048 16733 17365 633 210
Contigll_049 17397 17723 327 108
Contigl2_001 1 1545 1545 514 AF082100 [e-63] (Streptomyces) peptide synthetase
Contigl2_002 1368 1 1368 456
Contigl2_003 3 1655 1653 550
Contigl2_005 2317 1361 957 318
Contigl2_006 2883 1549 1335 444
Contigl2_007 1777 4926 3150 1049
Contigl2_009 4659 2911 1749 582 OX5 MOUSE (e-54) ARACHIDONATE 5-LIP0XYGENASE
Contigl2_011 4523 5065 543 180
Contigl2_012 4638 5231 594 197
Contigl2_013 4942 5520 579 192
Contigl2_014 6056 5541 516 171
Contigl2_015 5765 6373 609 202
Contigl2_016 5896 7066 1191 396
Contigl2_017 7095 5899 1197 398
Contigl2_018 5955 7331 1377 458
Contigl2_020 7549 7010 540 179
Contigl2_021 7094 7822 729 242
Contigl2_022 7995 7318 678 225
Contigl2_023 8408 7716 693 230
Contigl2J324 7916 9550 1635 544
Contigl2_029 10751 9732 1020 339
Contigl2_030 9795 11132 1338 445
Contigl2 033 10Θ80 12124 1245 414
Contigl2 035 121B4 11252 933 310
Contigl2_036 12079 14098 2020 674
Contigl2_039 14098 12248 1851 616
to
Figure imgf000044_0001
Table 6. Sa 4 assembly analysis summary.
a. plasmid Sau4 assemblies
Assembly size(bp^ Contigl7 2581
Contiglθ 4015
b. selected ORFs from cosmid A5
gene 5' 3' 8 bp- Daa best match [score] probable identification
Contigl7_001 2485 1 2485 829 U24241 Sorangium [e-213] polyketide synthase Contiglβ_002 2 1510 1509 502 U24241 Sorangium [e-105] polyketide synthase Contiglθ_010 1494 4015 2522 841 U24241 Sorangium (e-245) polyketide synthase b >
Figure imgf000045_0001
Figure imgf000046_0001

Claims

Claims
1. DNA sequence, the expression products of which cause an enzymatic biosynthesis, a mutasynthesis or a partial synthesis of polyketide or heteropolyketide compounds or are involved therein.
2. DNA sequence according to claim 1, wherein the polyketide or heteropolyketide compounds are epothilones.
3. DNA sequence according to any of the preceding claims, wherein the DNA is derived from myxobactena .
4. DNA sequence according to any of the preceding claims, wherein the DNA is derived from Sorangium strains.
5. DNA sequence according to any of the preceding claims, wherein the DNA is derived from Sorangium cellulosum.
6. DNA sequence according to any of the preceding claims, wherein the DNA is selected from the group consisting of:
(a) the following DNA sequence:
Seq ID No 1 (A2 cosmid)
GGATCGCGGCGCCCTCGCGCTGCTCCTCGAGCGTGCGGAGGAACTCCCACGCCAGGCGCGACT TGCCGAGGCCAGGCGCGCCCACCACCACCACCGCGTTCGCGGAGGGCTCGTCGACGCAATGGC GCCACTCGGTCGCGAGCTGCGAGAGCTCGCGCTCCCGCCCCACGCAGGGCGTCGGCTTGCCGA GGAGCCGTGGGACGGCATCCGGCTCCTCCTTCGGGCCGCGAAGCCAGCACCCTCCGGGCCCCT GTACCGTCTCGAAGCGGCTCGCGAGCAGGCTGGCCGTCGCGTCGTCGAGCCGGATCTCCGGCG GCGACAGGCCATCTCGCCCGGCGATGAGCTGCGCGACCCGATCGACCAGCTCGCCGACCGGCA GCCTCGCCTCGACCTCGGCCAGCCCTGTCGCGACGGACACGGGCACGCCTCCGAGCGCCGCCC GCAGCGCGAGGGCGCAGTGGGCCGCCCGTGTGGCGAGATCCGTGGGCGACTCGGCGCCGGACA GCGCGACGAGCCACCAGCGCGCTTGCAGCCGATCGAGGCGCCCGCCGTGGCGCGCCGCGATGT CCCGCAGCGCCTCGGCCCGCGCGGCGCCGTCGTCCTCCGAGAGCGTGGCGCCGGCCTCGGCGC CGCCGTCTTCGGCCAGGATGACGCACATCACCTTGCGCTCGGCCGTCGTGATCGCCTCGCCCG GCGCGGCCGGCGCCGCGACCGCGCTCGCCCCGATCGAGAGCCCCTCGCCGGCCACGGCGGCGA GCTCCGCCGCGGCGGCGGCGCCGTCGCGCGGCCGCTCTCCCGCGTTCTTCGCCAGCATCCGCG CCACCAGGCGCTCGAGCGGCTCCGGGATACCGTCGCGGAGCTCCCCGAGCCGCGGCGGCTCTT CCAGGACGACCCGCATCAGGAGCGCGAGCGCGCTGTTGCCGAGGAACGGCGGGCGCCCCGCGA GGCACTGGAACAGCACGCACCCGAGCGCGAACACGTCGGCCCGGGCGTCGACCGGCGCGTCGC CGCGCACCTGCTCGGGCGCTATGTACCCGGGCGTGCCGAGCACGGCCCCGGGCGACGTGAGGG TCGGCGCGAGCCGGAGGTGGCGCGCGATGCCGAAGTCGAGCAGCGTGACGCGCTCGACCGCGC CGCCCACGAGCATCAGGTTGCTCGGCTTGAGGTCGCGGTGAACGACGCCGAGCCAGTGGATCG CGCCGAGCGTCGTGGCCACGCGCGCGGCCAGCGCCACGCTCTCGGCCAGCGTGAGCGGCGCCC CGGCGAGCCGCTCCTCCAGGGTCACGCCGTCGAGCCACTCCATGGCCAGGTACGGCCGCCCTG CGCCGGTCACCCCGTGCGCCACGTACTGCACCACGCCGGGCAGCCGGAGCGTCACGAGCGCCT CCGCCTCCCGCGCGAACCGGCGCAGGTCGTTGGCGCTCGCGCCCTGCAAGACCTTGAGCGCGA CCGCCTGCCCGGACACCCGGTCGCGCGCCCGGTACACGTCCCCCATCCCGCCGGAGACGGCGA GCCGCTCGATCTCGAAACGATCCTCGATCACATCCGCTGCGCGCATGGCGGTGCCAATGTACT CCGCGCGAGCCTCGGGCCCCCGCGCGTAAGTGCGGCCCTGCGCCCGGTTGAACGCCAGCCCGA GCGTGACCGCCTCGCGCTCGGGATCCACGGCCGCCGGATCGGTCCACGCCTCGACGAGCGCCT GCGTTGAACAACCCGCCACCGGGCGCACGCAGCCGGCATCGCCGCGCTGGCCACCCGGCGCTG CCGCCCTTAGGCTCACCTCCGCGATGCCCCGCTGGTTCAACACGGCAGGTCCCTGCAACCCGG
CCGATCACTACATGCTCCCGGCCGAGGAGCGCTTGCCCGCAGTGCGCGATCTGGTCGATCGCA AGGCCTACTTCGTCCTGCACGCCCCGCGGCAGATCGGCAAGACGACCTCGCTGCGCACGCTCG CCCAGGATCTCACGGCCGAAGGGCGCTACGTGGCCGTCCTCGTCTCGGCGGAGGTCGGCGCCC CCTTCTCTGACGATCCCGGCGCGGCCGAGCTCGCGATGCTCGCAGAATGGCGCGGCACCGCCG GCGCGCAGCTCCCCGCCGATCTGCGGCCGCCACCGTTCCCCGATGCGCCCGCCGGTCAGCGCA TCGGGGCCGCCCTGCGCGCCTGGGCTCAGGCCGCGCCGCGCCCGCTCGTCGTCTTCCTCGACG AGGCCGACGCCCTGCGCGACGCGACGCTCGTCTCCCTATTGCGCCAGATCCGCAGCGGCTATC CCGACCGCCCGCGTGACTTCCCGCACGCGCTCGCCCTCGTCGGCCTGCGCGACGTGCGCGACT ACAAGGTCGCGTCGGTCGACAGCGGCAGGCTCGGGACGTCGAGCCCCTTCAACATCAAGGTCG AGTCGCTCACGCTGCGCAACTTCACCCGCGACGAGGTCGCAACACTCTACGCTCAGCACACGG CCGAGACCGGTCAGGTCTTCCGGCCGGACGCCGTGGACCGCGCCTTCGAGCTCACCCAGGGCC AGCCGTGGCTCGCCAACGCGCTCGCCCGCCAGCTCGTCGAGGTCCTCGTCAAGGACCGCGCGC AACCCATCACGTCTGCGAACGTCGATCGCGCCAAGGAAATCCTCATCGAGCGGCAGGACACAC ACCTCGACAGCCTGGTGGATCGGCTGCGCGAGCCGCGCATCCGCGCGGTGATCGAGCCGATGC TCGCCGGCACCGCGTTGCCGAGCGTGCCCCCCGACGACCTTCGTTTCGCGATCGACCTCGGCC TCGTGCGCATGACCGCGGAGGGCGGGCTCGACGTCGCCAACCCCATCTACCGCGAGATCATCG TCCGCGAGCTCGCGTTCCCGATCC3CGCCTCACTCCCCCAGATCAAGGCCACGTGGCTCACGC AGGACGGCCGCCTCGACGCGGACCGCCTGCTCGACGCCTTCCTCTCCTTCTGGCGCCAGCACG GCGAGCCGCTCCTCGGCGCCGCGCCCTACCATGAGATCGCCCCGCACCTCGTGGTGATGGCCT TCCTCCACCGCGTGGTGAACGGCGGTGGCACCGTCGAGCGCGAGTACGCCATCGGCCGGGGCA GGATGGATCTCTGCGTTCGTTACGCGGGCGAGACGCTCGCGATCGAGCTCAAGGTCTGGCGAG ACGGCCGCCCCGATCCCGTCGCCGAGGGGCTCGCCCAGCTCGACGAGTACCTGGCCGGCCTGG GCCTCGATCGCGGATGGCTCATCCTCTTCGACCAGCGCTCCGGACAGCCCCCCATCGCCGAGC GCACGCGCCGCGAGCGCGCGCTCTCCCCCGCCGGCCGCGAGGTCGCCGTCATTCGCGCCTGAG GGAGCTCGCCGCGCGGCGAGCGCCCTCCACGAGGGCCGGGCCACCTCGGACAGCGTCTCTACT CCTCCGAGGCCGCCGCGCCCCCCGCCCCGGCCGCCGCCGCCGCCGCCGGCTCCAGCTCGCAGC GCACCACCAGGACCTCGCCATCCGCGAGCTCCGGCCGCTCCACGAGCGCGTGCGCGCCCGCGC GCACCGCCGTGAGCACGTCTCCCAGCGCCGGCTTCAGCCGCGCCAGCGTCGCGGCGTTCGCCC CGAGCGCGAGGTCGGTCACGACGCGCCCCACGCTCGCGCCGAGCTCGCTCTTGCGCTTGTTGA CCGCCGCCATCGCCGCCGCCGCCAGATCCAGGAGCCCCGGATCCGAAGGCGCCGCGACCGCCG
CGAAATCCGCCGCTGAAGGCCACT CGCCCGGTGGATCGAGGTATCGCCCGTCTCCTCCGCGT ACACCCAGCGCCAGACCTCGTCGGTGATGTACGGCAGGACCGGCGCGAACAGCCGCAGCAGCA CCGACAGCCCGAGCCGCAGCGCCGGCACCGCCGAGCCGCGCGCCGCCTCCCCGGCGCCGCCCT CGCCGCGCGCCCGCGCCTTCGCGAGCTCCAGGTAGGCGTCGGTGAACCAGCGCCAGAAGAAAT CCTCGGTCCGCTCGAGCGCCGCCGCGAACTCGTGCTCGTCGAACGAGCGCGTCGCGTCGTCCA CCACGGCCGACAGCTTGTGCAAGAGCGCCCGGTCGAGCTCCTCGGAGATCGGGTGGACCTCCG CCGACTGGCTGAGCACGTACTTGCTCGCGTTCCAGATCTTCGTGACGAGCCGCTTGCCGATCT TCAGCACCTTCTCGTCGAACGCCGTGTCCGTGCCGAGCCGCGCGCTCGCCGACCAGTAGCGGA CCGCGTCCGAAGAATACGTGTCGAGCAGGTGCATCGGCGTGACGACGTTGCCCTTGCTCTTCG ACATCTTCTTGCGATCCGGATCGAGGATCCACCCGGAGATCGCGACGTGGTGCCACGGGACCG ACGACTCGTGCAGCATCGCCTTCGCGATCGTGTAGAACGCCCACGTCCTGATGATGTCGTGGG CCTGCGGCCGCAGATCGGCCGGGAAGAGCCGCGCGTGGCGCGCCGGATCGTCCCCCCAGTGAG AGCTGATCTGCGGCGTGAGCGAGCTCGTGAACCACGTGTCGAAGACGTCGGACTCGGCGGTGA AGCCGCCGGGCTGGTCCCGCTGCGACGCCTCGTACCCGGGCGGCACGTCGACCGTCGGGTCGA CCGGGAGCATCTCGCGCGTCGCGAGCAGCGGCCGGCTGTGATCCGGGTTGCCCTCGGCGTCGA GCGGATACCAGACCGGGAACTGCACGCCGAAATACCGCTGGCGGCTGATGCACCAGTCACCCT GGAGCCCCTCGGTCCAGTTGCGGTACCGGAGGCGCATGAAATCCGGGTGCCACTTGATCTTGT CGCCGTATTCGAGGAGCTCGGCCTTCTTGTCGGCGAGCCGGACGAACCACTGCCGCGTGGGCA CGAACTCGAGCGGCTGGTCGCCCCGCTCGTAGAACTTCACCGCGCGCTCGATCGGCCTCGGCT CGCCCCGCAGCGCCGGCCCCCGGCCGGGCGCCGCCGCGTGCTCCTCGCGGCGGAGCAGCTCGA CCACCGCCGCGCGCGCCTGCTTCACCCCCCTGCCCTGGAGCGGCGCATACGCGGCGTTGGCCG CGGCCGGGTCGCGGCTCTCCCACGCGCCCTCGCCGAACGTCACCGGCAGGACACGGCCGTTCT TGCCGAGCATCTGCCGGAGCGGGAGCTTCTGCTCCCGCCACCAGATCACGTCGGTCGCGTCGC CGAAGGTACAGACCATCAGGATGCCCGTGCCCTTCTCGCGATCCACGAGCGGGCTCGGGAAGA TCGGCACCGGCGCGCGGAAGATCGGGGTGAGCGCCGTCTTGCCGAAGAGGTGCTGATACCGCG GGTCCTCCGGGTGCGCCGTGACGCCGACGCAGGCCGCGAGCAGCTCCGGGCGCGTCGTGGCGA TGACGAGCTCCTCGGCCGTCCCCTCCACCGCGAACGCGATGTCGTGGAACGCGCCCGATTGCG GGCGATCCTCGACCTCCGCCTGGGCGACCGCGGTCTGGAAATCGACGTCCCACATCGTCGGCG CGAAGACCGAGTAGAGGTGGCCCTTCTCGTGGAGATCCAGGAACGACAGCTGCGCCGTCCTGC GGCAGTGATCATCGATGGTGGCGTACTCGTTCCGCCAGTCGACCGAGAGGCCCACCCGGCGGA AGAGCGCCTTGAAGACCTGCTCGTCCTCGCGCGTGACCTTGTGGCAGAGCTCGATGAAGTTGG GCCGCGACACGATGCGCGGCGGCTCCTTCTTGATCGTCTCCGGCGCGGCCTGCGGCAAGGTCA GGCCGCGCTCGTACGGCGTGCGCACGTCGGTGCGGACGTGGAAGTAGTTCTGCACGCGCCGCT CGGΓGGGCAGGCCGTTGTCGTCCCAGCCCATCGGGTAGAAGATGTTGAAGCCGCGCATCCGGC GCTGGCGGACGACGACGTCCGTGTGCGTGTAGCTGAAGACGTGGCCGATGTGCAGCGAGCCCG AGGCGGTCGGCGGCGGGGTGTCGACGACGAAGGTCTCCTCGCGGGGGCGCGACGGGTCGTATC GGTACGTCCCGTCGGCCTCCCACAGGTCGGCCAGGCGCAGCTCGGCGACGGGCGAGTCGAAGT GCTTCGGGAGCGTCGCGGGATCGATGGAGCGGAACGTCTTCTTGATCGTCACGTGGTCACCTG CAGAACAGACCCCGCAGGAACCGCCCGCGGGGCCGGCATCCTACGTCGTCCCCCGGGTGCCGC TCAAGGCGCGCCGCGCCCGCGCGGCGGCGATCCGCGATCGCATCCGCGCATCCGCCAGAGCCC GGCGGCTCCGCCGGCGCGCGCGCGCCGTCCGTGGAGCCGAGAGGAGAGGCCGGCGCCCAGGTC GTGGAGGACGCCGGCGGCGCCGCCGCGGAGATCGCGGAGAGGCGGGCGCATCGATCGCGGCGA GGCCGGGGGCTCAGTCGTAGCGCTCGACGTGGACGTGCTTGCGGTGGACGCCGAGCTCGCCGC GGGCGAGCTCGCGGACGGACGAGACCATCCGATCCAGGCCGCAGATGAAGACGTGCGGCGCCG GATCTCCGCTCTTCTCCGCGAGCTCCCGGTAGAGCTCGGGCACGTGCGCCTGCACGTAGCCGC GGCGGCCGGCCCACGACGGGCCGCCGCGCGAGAGCGTGATCTCGTAGCGGATCCGGTCGGATC CGCGCGCGAGCGCCTCGAGCTCGTCGCGGTAGATGACGTCCTCCTCGAAGCGCGCGCCGAACA GGATCCACAGGTGGGGCGCGGCCAGCCCCGCGCGCAGGGAGGCGCGCAGCATGCTCCGGAGCG GCGTGATGCCGGTGCCGGTCGCGACGAACAAGGAGGGCGCGGAATCCCCGGGATCGCGGGTGA AGAGCCCGTGCGGGCCGATGGCGCGGAGCGTGGCGCCGGGCTCGAGCCGGTGCAGGTGCTCCG AGCCCGCCCCGCCCTGCACGAGCGTGACCGCGAGATCGAAGCGGGGCGAGCCGTCGGGCGCGG ATGCGATGGAGTAGGCGCGCTTCACCTCGCCGCCCGGGAGCGGGAGGACGAGGTTGACCCACT GGCCCGCCTCGAACAGAAACGACCTCCCGTCGGCGCGCTCGAACGAGAGCTCGCGCACGAAAG GGCTGAGGGGCCGGGCGGCGACGAGGCGGGCTTCGAACGGTTCGGCGTGGATCATGGTCGGGG CCCGGCGGGGCTCGGCTGCGAGGCCGCGCGGGTGGCGAGGTCTTACCGCAGCCTGCGCCCCGG CCCAATCGCGATCGCCGCGGGAAGGGCGCCGCCGGAGGGCGCGCAATCGCGGGAATCACGGGC TTCCGCCCCGTGCGCCGCCGGAGCGCGCGGCCGGGCCGCCGGCCCGCGCTCCGGCGGGGAGCC GTCGCGGGCTCTACCGCACGCCCATGCGGCGGCGCTGCGGGATGTTCACCGCCGGCCGGGAGC GATCCTGGTTGGGGAGCGCGCGCGGCGGGCGGGGATCCCGGTGCGCGGGCTTCTGCGCGGGGA GCTGCCCTCGCTGAGCCGGGCGCTGGTCGGGCGACTTGGCCGAGCCCAGCGCGAGATCGGAGA
CGGGGAGATGCGCGCGTCGCTGCATAGAATCCTCCATGGAATCGGTCATCAACACATCGGGAA GAGCACCCAGGCTGAAAGAAACCTTCGAAGAACCGGCTCTCATACACCCTCCATTCATCGTGC GACCCCGGATTCAGGACGGATCGAACCCGCGAGGGACGCTGGCTCTCTGGGCCTCTCCCTGCT CGCTCGACCGGCGCCCTCTCGACGCAACTCCGCCGTTCGTCGGGACGGGACGGTCCGCCTCGC CGCACGCTCCCCGTCGAGACGACTCAGCGTCTCGACGTCAGGAGAGATGACGACTCGGCCCGT CGCGCCACGACCCTTCCGGCTCGGTGCTTCGAGCGCGCGGCCAGCGAGCGAGGGGCGATCGCC AGGAGATCACGAATCTCCCGGCCATCGGCCTCCAGCGCCTCGGGCTCGTTCGCTCGTCGCCCC GCTCCGTCCCCGCGCGCGCACGACGCGAGCTCGCGCGGGGAACCGCGGGCCGCTGTCGTGGCT GCTGATGCGCGACGATACAGGGGGGACGCCGTGCCTACCTGGGCAACAGGCGCTCATCTTCTA CCACGGCGAGCACTACGGTGAGTGCTGCCATGAGTAGGCCCCTGAGGGTCCGCGCGACGGAGC GTGGTGTCAGCGAGAGATGCGCATGGTGGACGCGGGCTACGCGTCGAGAGGGACACTAGCACT CGACCTCGATCCTGCCCAGCACTTTTTGTCGGGGAGGGCTGCCCTCCCGCTGGCCGCTGGCCG CTGGCCGCTCGCCGCTGGCCGCTCGCCGCTGGCCGCTGGCCGCTCGCCGCTGGCCGCTGGCCA TGTGCGACGTGAGCTCGAGCAGCCCGCGGCTGACGGACAGACCCCGGAGTTCATCGAGCCGGT GATGCCGAACCCGCCAAGCGAAAAAACGTATCCGTTCGGCAGGTCGTGGCCTATCATGCAAGC TGCTCGATGCGCTGACAGGCTTCT CGAGATCCTCGTCGGTCTTTGCGAAGCAAAACCGCATG AAGCGACTCCCCTGCGTCCCTTCAAAGAAGGCGTCGCCTGGCACGCCCGCCACCCCGGTCTCG TCGAGCAAGTAGATGGCTCGCTCTCGACCTGTCCTCCCGGGTAGGCGAGACACATCCGCCAGC ACGTAGTACGTCCCCTGCGGCACGCAGGGTGGCAAGCCCGCTTTCTCCAGCGCCCGACAGAAC CGGTCTCGCTTCCGTTCATATCCCTGGGCAAGCCCCGTGTAAAACGAGCGAGGAAGGCCGCGG ATCCCGGCAGCGACTCCATGCTGCAGCGGCGTCGGCGCGCAGACATACAGCAGGTCGCTCATG GCTCCAATGGCCTTCGCCCACCTGGCATCGGCCACGCTGTAGCCGATCCGCCATCCTGTGATG CTGAAGGTCTTCGAGTAGCCGCCTATCGTGATCGTACGCTCGGACATGCGCGGAAGGGAGGCG ACGCTGACGTGCTCACGGCCGTCGAAGATAAAGTACTCGTAAATTTCGTCCGTGATCACCATG AGGTCATGGTGGCAGGCGAGATCGGCGATCTGTTCCAGCTCCATTCGGCCGAACACCTTCCCG GAAGGATTTCCAGGAGAGTTCACCACGATCGCCTTGGTCTTCGGGGTGATCGCGCGCTCCAGC TCGTCGCCGTCGACATTCCAGCTCAGGGATCGCGCCGTCACATACCGCGGAACAGCCTCGACG GCGAGGATAGCCTGGGCGTGATAGGCATAAAACGGCTCGAAGAGCAGCACTTCGTCCCCAGGA TTGAGCAAGGCCATGCAAGTGGCCTGAAAGGCCCCTGTCGCTCCGGCGCTCACCGTGATGTCA GTCTCCGGATCCGCCGCGATGCCATTATGGCGAGCCAGCTTCGCCGCGATCGCATGGCGCAGC
TCCACGATGCCGTCGAAGCGCGAATATGTATTGCACCCCCGATCCATCGCCTCCTTCACCGCT TGAAGGATCACCGAAGGAACTGGGGTATCACAGACGCCCTGGGACATATTGATCCCATGGACC TTGGCGCACGCCAGGGTCATGGTACGGATATCGGACTGGGCGAGGCGAGCCGCACGATCACTC GGTAGACTCTTCATCAGCGTGCTCCΓGCTTCTGTTCTGCGGCTCTGCATGGTGTCTTCGGGTG GGCTTGTCAGCTCGACGCGCCCATGCAGCGGCGCAGCCCTAGCGGCCGCAGGTCTGTCCACAC TTCTTTGATGAAAGCGAGACATTCGGCTTTCGTGCCCTGTTTGCCCGCAGCCCTCCAGCCCCC AGGTACGGGCTTGTCGGCGGGCCAGATCGAGTACTGCTCTTCGCCGTTCACCACGACCTGGCA ACGCGTCTTGCTTTCGTCGTCCCGATTCATGATTTTCCTCGCCCTTCGTCAGCGCTGCGCGAG CATGAAACGAATCGCTCATCGGCGCACAGGCGCGCGCCGGCTGCCCGGAGGCACTCCCACGCC TCCCTCACGGCAACCTCATCGCTCCGGATGTTCCCGATGGCGACTCGGATCGTGTACCTGCCG TGGAGACGGGTATGGGACAAAAATACCCTGCCCGACTTGTTGACCTCGTCCAGCAGCGCCTCG TTGAGGCGATCGAGCTCGCGTTCGATCGACTCTCTCTCTGCCTCGTCCGCCGACCGCATGATG CAAGCGAGCGCGGAGGGCCTCATGCGAAAGCAGACCGTACTGAACGGCGTCGGCGCGAGGCGC TCCCAATCGGGATCGGCGTCCACCCACTGGGCGAGCTGCTGCCCCAATCGGAGGTGCTCCCGG ATCCGGGCCGCCAGCCCTTCATGCCCGAAGTAGCGCACGATCATCCAGAGCTTCAGCGCTCGG AAGCGCCGACCGAGCTGGATACCCCAGTCCATGTAATTCGTGACGTCGCCCTCGGTGCGGAGG TATTCGGGCACCAGACTGAACGCGCGCTTCAGTCGGTCGGCGTCACGCACGTAGAGCACGCTG CAATCCATGGGGGTGAACAGCCACTTGTGAGGGTTCACTACCAGCGAGTCCGCCCCCTCGCAG CCCGCGAGCACGTCCCTGTGCTCGGGGACGATCGCGGCCATCCCCGCGTAGGCCGCGTCCACG TGAAGCCATAGCCCGTGCTCCCGGCAAACGCTGACGATGGCGGGGATGGGGTCGACGCTCGTC GTGGACGTCGTGCCCACCGTCGCCGCGACGCAGAAGGGTCGGAGGCCGGCCCCGAGGTCCTCC ACGACGGCGGCGCGCAGCGCCTCG3GGACCATGCGGAAGGCCGGATCCGTGGGGATCTTCCGC ACCCCCTCCTGCCCGATGCCGAGGGTGATGGCTGCCTTCTCGATGGATGAGTGCGCCTGCTCC GACGCGTAGAGTCGCATGCGCCGC GTCCCGCCATGCCCCGGAGCCGGATGGTCGGCTCGGCC GAGTCGCGCGCGGCCGCGATCGCGACCATGCTGGCGGTCGACGCGGTGTCCATGATCGCGCCG TGCAAGCCGGCGTCGAGATCCAGCATCTGACGCAGCCAGGAGAGGACGAGCTCCTCGAGCTCG GTGGCCGCCGGCGACGTGCGCCATAGCATCACGTTGACGTTGAGGCACGCCGCGAGCAGCTCG CCGAGGATCCCAGGACCAGACGCCGTGTTCGCGAAATACGCGAAGAATCGCGGATGATTCCAG TGCGTGATCCCCGGCAGAATGATCTGCTCGAAATCGGTGAGCACGGCGTCCATCGGCTCCGGC TCGACGGGCGGGGTCGGGGCCAGCCTGCCCTTCACGTCGCCGGGGCGGATCGCGGGAAAGACG
GGGTATCGATCCGGGTGGCCGAGGTAATCGGCCGCCCAATCGATGATTCTCATACCGATCCGG CGGAACTCCTCCAGATCCATGTCCCCGAGCCGTTCTTTCCGCGGGTCGCTCACGTCAACCTCC TCGCGCTGCCAGGACAGGATCCTCGAGGTCCCCTGGCTCCGGCGGTGGAAAGCGCTCCTTGAA CGTGAAGGCCCACGGGGTCGGTCCGTAGCGCCGCAGGTGCTCGAGCCGATCCTGCCCCTCGCG GACC-GACGGGATGTGCCCGGCCGGGACCCACCACAGCACGAGGTAATGCGGCTCGAGATGCTC GAACCACCGAGCGCGCTGTCGCAGGAACGCGGCATGATCCGCGGTGTAGGTGAAGGCGAACAG GTGCTCGATGGAGGTCCATACCGACAGGGTCACGAGGAGCCGCTGGTCCGGGTACGGACGGAT GGACACAGAGTTCCCCTCGGCCGTCTGCAGGCGCCACACGAACCCCTCGCTCCGATCGGCCAG ATGGTTGATATGGTCGAGCCCCTGGACGAAGCCCTCCATGATCGGATCCTCCAGCGGAGCGCG AATACATGCGAAGTTGTATTGCGCGATGTGGTGCCGATGCTCCGACATGTCGCTTTCCATCTC CAGCTCCCGCTCACCAATCCCAGCGCTGCTCCGGGGAGCTCATCAGGGCAGACGCGACATCGA TCCCGAAGCTCCGCCGCATCCCCTCGACGAAGGCGGCCTGGACCGCTTCGGCGACGGATCGGC CTGCCTCCGGCAAGACCTCGGAGACAAAGAAGAACCGCCTCGTGGAAGGGACAATCTTGCCCC GCTCCGCCTGGCGCCATACGAAGTGCCTCGTCACCAGTCCCTCCGCGTCGGCATACCCGACCT CGCCGGCGCCGACCGCCACGCTGCCGCCTGAGCCGAGCTCCACGAACGCCTCACCGCCTCGCG AGATCTCGAGGCGAACGTCCGGGCCAGCCAGATCGCCGAGGTCCCAAGCGCCGACGGGGACGG CGAACCGCAGCGACAGGAGGTTGTAAAAATCGACGAATGCGTTGATGTGCGGCAGCTCTCCAC CACCGAGGACCCGCTTCGCCAGCGCCTCGATCGAGCTCGGAAATTTCTTGCCAGAGACCCCCA CTCGCTTCATCGCCTCGCGCCAGGCAGCCACGTGCGGATGCGACTGGGCGTTTTCGTGGCCCC AGCTCCGTCGCAGCTCCTCCTCGACCTTCCGGAGCTCCTCCAGCACGGCCGGCCGCTCTGCGG CGTTGTCCAGGCCTTCCCCGTACCCGGTGACCAAGATCATCCCAGGAAACGACTCCCAGATTC GCGGATCGACGATGAATGCCATGTGCCTCCTGCCCCTCGAGAGCGATCGCCTCGATCGACACC AGGCTGTGGATGCATGAGCCGGGCCGTGCGGACGCAGGACCCCGCTACTCATGGCTCTTCGTG GCCGATGAACAGGTCCTCCACCCGTCGATCGTGCTCGGTGCCCCGATCCGTCCAGTCCCACCC GCCGGCGACCGCGATGTTTGCACCCGAGACGTACGAGGCGCGGTCGGAGGCGAGGAACGCCAC AGCATCTGCGACCTCGCTGGCGCGCCCCAGGCGGCCCATGGGGACGCGCCGCTCCATCCACTC CTTCTGCGCGGGCGGAAGGTATCCGTTGTCGATGAGCCCTGGAGACACACAGTTGACCAGGAT TCCATGAGGCGCCTCCTCCGTGGCCAGGCTGCGCGTGAGGATGAGCACGCCGGTCTTCGCGAT CGAGTACGCCGCCACGTTCGGCGCGCCGCGGATCGCGTACGTGGGGCTCAACCCGATATTGAT GATCCGGCCGCTCTTTCGCTGGCGCATGCGCGCCACGGCCGCGCGACAGAGGTAATGAACGCT
GCTCAGGTTGCTGTCCATGACGTTGCGCCATTCGTCGTCGGTCATCGCCGCAAGCGGCTTGAA GAAGAAGTCGCCCACGTTATTGACGAGGATGTCGATGGGGCCCAGCTGCGCCTCGACGCTGGA GAAGAGCTCCGCGGCCGCGTTGGGGCGGGTGACGTCGGCCTGCACCACCATGGTTCGTCGCCC GAGCGCGCGGATCTCGGCCGCCGTCTGCTCGGCCGCATCCTTGTTCGAATGGTAATTGACGGC GACGTCCGCGCCTTGCTCCGCGAGGCGCAGCGCGATCGCCTTGCCAATTCCGCGCGAGCTACC GGTGACCAGGGCGACGCGCCCGGCGAGCTCCAGCGATCGCGCCTGTGGCAGGGCCGGAGCAGC CTCCTGGTGGAGCTCGACGTCGACGGGGAGCTCCACGTGGTAGCTCGTCTCTCGCGGAGCCGC GCAGTACCTCTCGTAGAACGCCTCGAGGACGGGCTCTTCGCGTCGCATGATGTCCGCGTGGGA TTCGGCGCTGCGCCACGGATAGAGCACCAGGATCTCGTCGGGGCGCACGGTGCTCTGAAAGAA TCGCGCATGACCGCACCACCCCGGGTGTTCGTGGGCTCCTGGGCCGAGCAGATCATCCATTTT TTGCATGATCTGCGTGGCCTCGCCCTCCATGCCGGGTTTGATGCGCCATCGCTCCATAACGAG GATCATGTCTTGGCTCCTGTTCGTCATCGCCGTTTCGATCTGGGGGGGCTGCCCGCGCTCTCG AGGGCGCGCCCCTTGTATTGGCCGCGGATGGTCTGGGTAGCGCTCGCGAGCTTTCGCTTGTGG GCGGCGTTCAGGCTTGCGCCTTGATTGACGAACCGCTCGCAGACGAATGCGTGCGATTCATAT GCGGTCGCGAGCGCCCACAGGTAGAGACGGTGGCTACGAGCGAGCTTGGGCGGAAGCGCCCTC AAGGCGGGTCGGATGGATTTGAGGGTGCGAATCAAGCGGGCGTGCTCTTCGAAGAAGAGACCG CTGAATCCATCCTGGACGAACGGCGGGGCCATGGTAGGTCGCACGATGTTGTTGTAATCATCA TTCGAGAAATCGCCGGTAAACCGGAAAGATGCCGCCGTTGCCCACCAGAGCTGCGTGGAGATC GCGAGCGCCTCTCGCGCCCTCGGAACGTCACGGTTGGCGAGCGCCTCCGACAGCCACCTGTGC GCCATGAGCAGCGCCTGGACCAGCACGAAGAACGCGTGATGCCCCAATACCCAGCGGCCCAGC GCGCCGTCCGGAGCGCCCGCCTCCGCCGGAGGCGCAGCCAGCTGCGGGATGCCATTCCATGAT TTTGGCCTTCGCTTGCCGGAGAACTGGTGGAGGATGTCCTCGATGGACGCACAGAGATGCCCC ATTTCCATGGGTTGCAGGGAAGTACCTTTCAGGCTTTCGCGGATCATTCGGTAATATGCGACG ATCACCGCTTCGCAGTACGTCTCGAGCGGGTCTGGGGCGGAGACCCGGTGCACATGGAAATAG GCGTCGTACTCCGCCTCCCGGTCCGACAGGCTCCCCGGGATGTCCGGATCTGCCGCTTGCGCC TGCCATCGCTCGAGGATGGGGACGGCGAGGGGGGCGAGATGCTCGGCGAGCACCGCCAGGTCG CCCTCCGGCGCATGCGCCAACAAGACCTCGAAGGCCTCGGCTGCGACCTTGGAGGTCTCGCCC ATTCCGACGGCCTCGATGGCTTGGGGCAGCGGTAGATGGATGGTATATTTAGCCATGATTTGC CCGAAGATTGCCGCTGCGTCGACAGATCTTTCGCGAGCCGGAACGCCATTTCCACTGCTCTGG CTCTCAATATTGAATTGAGCCCTGGCGACTGCCATAGGCCCAGTCGCTCGACACAGTGTACGG
AGCGGCCCGATGCTTTCTCCTTTTTTAGTCCTGCACCGAATACTTCTGTTGGGCGCCAAAGAT CCCTTGCCGAGACTGTCCGGCGAGATGTCGTGTGCGAAGCGTCCGCACGTCCAGCGGGCCCAT GCGTTGCTAGAGCATAAAACGGTTCGATGCCTGGTCGAGAGGGAGACGCGAGGAGCCTCCCTT TGGGACGGATGAGGAATTTCGTGACCGAAATGTCGGCAGGAACAGCGGCGCAGAAGCGGCGCA TCGATGGGGAACCATGGGTTACGAAGACATTGATGATAATGTCGACGCAATCGCAATCGTCGC GATGAGCGGCCGCTTCCCCGGCGCGAGAAACGTCGAGGAGCTGTGGCAGAAGCTCCGCGCTGG CGTGGAATGCGTCGTCACCTTCACAGAGGCCGAGGCGCTCGCCGCGGGGGTGAGCCGCGAGAT GCTCGCGAATCCCAGCTACGTGCGCAGAGGCGCGCCGCTCGACGGCGTGGAGCTCTTCGACGC CTCGTTCTTCGGGTTCAGCCCGAGG3AGGCAGAGAGCATGGATCCGCAGCAGCGCATCTTCCT GGAGGTCGCCTGGGAGGCCCTCGAGCGCGCCGGTTACGACCCCGATGCCCATTCCGGGCCTAT CGGCGTCTTCGCGGGCAGCGCCCCGAGCGGCTACCACTCCCTGGCGCAGTCCGACCCGGAGAT CCTAGGCGCCCTCGGCCACTACCAACTGACGCTGAACAACGACAAGGATTATCTCACCACACA CGCCTCGTACAAGCTCAATCTGCGGGGCCCGAGCGTGTGCGTGCAGACGTCCTGCTCGACCTC GCTCGTGGCCGTGGTCATGGCCTGGCAGAGCCTGCTCAACCACGAGTGCGACATGGCGCTCGC GGGTGGCGTGGGGATCCATGCGCATCAGCGGAGGGGCTATCTGTATCAGGAGAACGGCATCTC TTCGCCCGATGGGCATTGCCGCGC3TTCGATGTGGCCGCCAAGGGCACCGTGGGCGGCAGTGG CATAGGCATCGTCGTCCTGAAGCGGCTCGCCGACGCGCTCGCCGACGGCGACCACGTGCACGC GGTGATTCGAGGAGCGGCGATCAACAACGACGGCTCGAGCAAGATCGGTTACACCGCGCCGAG CGTGCAGGGGCAGGCCGAGGTGATCGGCATGGCCCAGGCGCTCGCCGGCGTGGAGCCGGATGA CATCAGCTACATCGAGGCGCACGGCACGGGGACGCCGCTCGGCGATCCCATCGAGATCGCAGC CCTCACGCGCGTGTTCCGGGCGAAGACCGCACGAAGGCAGTTCTGCGCCATCGGCTCGCTCAA GACCAACCTCGGCCACCTCGATGCCGCCGCGGGCGTCGCCTCGCTGATCAAAACGGTCATGGC CCTCGAGCACCGCGAGCTGCCCCCGAGCCTGCACTTCGAGCGTCCGAATCCGAAGCTCGAGCT GGAGAGCAGCCCTTTCTACGTCAACACCCGCCTCACTCCGTGGCACGCGGCACGAGGTCCGCG CCGCGCTGGCGTCAGCTCGTTCGGCATCGGCGGCACCAACGCGCACGTGGTCCTCGAAGAAGC TCCGGCCCCGCCTCCGAGCGGCCCCTCGCGGCGTTGGCAGCTCCTCACCCTCGCGGCTCGCTC CGAGGCCGGGCTCGCGCGGGCCACGGGCGACATGATCGAGCACCTCGATCGCCACTCCGGCAC ATCGATCGCCGATGTCACGTACACGAGCCACGTGGGGCGCCGGGCCTGGCCCTTCCGGCGAGC GGTCGTCGGCGAGAGCGCCGCGGATCTCCGCGCCGCGCTCGCGAGCGAGGGCTCGCCGCGCTC GATCTCGTCATGCCAGGCGGCGAGGGAGAGGCCCGTCGTCTTCCTGTTCCCCGGTCAGGGAGC
GCAGCACCTCTTCATGGCGCGGGAGCTGTACGAGGTCGAGCCGATCTTCCGGCAGTCCCTCGA CCGCTGCGCCGAGCTCCTGCGCGGCGCGCTCGGCCTCGATCTGCGGCAGGTCCTCTACCCCGC CGAGGGGCAGCGCGACGACGCCGAGGAGGAGCTCGGTAGGACCGCGATCGCCCAGCCCGCGCT GTTCGCCATCGAGCTCTCGCTCGCCAAGCTGTGGATGGCCTGGGGGATCGTCCCCCAGGCGAT GATCGGCCACAGCGTCGGCGAGTTCGCCGCGGCTTGTCTGGCGGGCATCTTCCGCGAAGAGGA CGCGCTCCGCCTCGTCGCCGAGCGGGGCCGCCTGATGCAACAGATGCCGCCCGGCGCGATGCT GGCGGTGCCCCTCGCGGAGCCCGAGCTCGCCCCCTACCTCAGCGACGACATCTCGCTCGCGGC GATCAACGGTCCGGCTCTCTCGGTGGTCGCTGGGCCGATCGAGGCCATCGACGCGCTCGCGGC CGAGCTCTTGGACCACGGGCTCTCGTGCCGGCGACTCCACACGCGGCACGCCTTCCACTCGAA GATGATGGCCCCCGTCGTTGACGCCTTTACCCGATGCGTGTCCGCGGTCGAGCGCCGCCCGCC GTCAGGCCACTTCCTCTCGACCCTGACGGGCGGCTGGATCTCCCCCGAAGCAGCGACCATCCC CGCATACTGGGCCCGGCAGCTCGTGGAGCCGGTGCGCTTCGCCCAGGCCGTGAGGCAGCTGCT GTCCGAGTCGACGTGGCTCTGGCTCGAGCTGGGTCCGGGCCAGACCCTGAGCCCGCTCGTACG GCAGCAGGCCCGCGCGGATGGCGGCCAGGTGGTCGTCGCCTCGCTGCCGCGCGCGAAGGACGC GGGCGCCGACCACCTCGCGGTCATCGAGGCGCTCGGCCGTGTCTGGAGCGCTGGTGGGACGGT CGACTGGAAGCGCTTTCACGAGGGCGAGGCGCGGCGGCGGGTGCTGCTACCGACCTACCCCTT CGAGCGGCAACGATACTGGGCCTCTCCGCGCCACACGAGCGCTCCGCCGGAAGCGATAATCAA GCCGCTCCTCGCGAAGAACCCAAACGTCGCCGATTGGTTCTTCCTCCCTGCCTGGCGGCGCTC GGATCCTCCGGTCTCGTTCGACGCGCAGGCGGTGACCACGCGGCGCTCTACGTGGCTCGTCTT CATCGGGGACGAGGGCCTCGGCGCGGCGCTGGTGGAGGGCCTCGCGCGGCGGGGGCACGAGGT CGTCGCGGTGGTCACGGGTGAGAGGTTCGAGCAGACGGGCACGCAGCGCTACACGATCGATCC CGCCGCGAATGGCGATGTTGCGTCCCTCTTCGCGCGGCTCGAAATCGAAGGGCGCATGCCGGA CCGGATCGTCCATGCCTTCTGCACGTCGCCTGCGGACGGCGCGCGCATCGAGCGCGGAGCCGC GCTGGAGATCGAGCGCAGGCTGGGCTTCGATAGCCTCCTCCTCCTCGCCCAGGTGATCGCCGC ACAAAGGCATCCGAAGCCGCTGATGCTCGGCGTGATCACGACCCGGGCGCACTCCGTCATCGG AACCGAGATCATCGAGCCCCTGCGCGCTCTGGTGCTCGGCCCCTGCCGCGTCATCCCGCAAGA AATACCCCATGTCTCGTGCCGGAACATCGATATCGATCTCCCGGGCGAAGGCGGGCGCGCGGA GATCGCGGCGCGCCTGATCGCCGATCTGGAGCGAGAGTCGCCCGACTCGGTGGTGGCCTACCG CGGCGGCCGGCGCTGGGTCGAGAGCATAGAGCTCACCGATGTCGGCCGGCGGTCAGCTGGCGC CGCCCCGCGCCTCCGCCAGCGCGGGGCGTACCTCATTACCGGCGGCCTGGGGGGCATCGGCCT
CGTGGCTGCAGAGCTCTTGGCCCGAGAGGCGCACGCACGGCTGATCCTGGTTGGGCGGACAGG CCTC-CCAGCGCGGCAGGGGTGGGACGACTGGCTCGCGGCGCACGGCGCGGGCGACGCGACGAG CCGAAAGATCCTCCGGATCCGCGCGGTCGAGGAGGCCGGCGCCGAGGTGAAGATCGCCGCGGC CGACGTCTCCGATTTCAATGCGATGCGGAGCGTCATCGAGGAGGCCCGGACGCGCTTCGGCCG CATCGACGGCGTCATTCACTCCGCCGGCATCGCGAGTGGAGGCATGATCCAGCTCAGGACGCC GATGGCGGCTTGGCGCGTGATGGCGCCGAAGGTCGGCGGCACGCTCGTGCTCGATGCGCTCCT CCGGGACGAGCGTCCCGACTTCCTCCTGATCTGCTCGTCGTTGGCCTCGCTGGTCGGCGGCGC CACCCAGATCGATTACTGCGCCGCCAACGCCTTCCTCGACGCCTACGCGCAGAGCCGCGAGGG CGAGGAGGGATGCCGCGTCATCTCGGTGCAATGGGACACGTGGAGTGACGTCGGGATGGCGGT GGACTTCAAGCTCCCGGCCGATCTCCAAGAGGGGCGCCGCGAGAGCCTGAAGCGGGGCATCAG CTCGAGCGAGGGCGCCGAGGTGCTCGGCCGCATCTTGAGCGCAGGCATGAGCGGCCCGCTGGC GATTTGCACGTCGGATCTACCAGCGTACAAGCAGTCTGTCACGACACGCCGATCGCAGCACGA GCAAACTCCCGCCGCCCGGCCGATGCACTCGCGCCCAACGACCACGGGAGCCTATGTCGCTCC CGAGACCGAGACCGAACGGCGCATCGCCGCGATCTGGCAGGATCTCCTCGGCCTCGAGCAGGT AGGCGCAAACGACGATTTCCTCCAGCTGGGCGGCCATTCGCTGTTGGCCACGCAGGTCCTGTC TCGCGTCCTGCAGACCCTCAAGGTGGGGATCTCGTTGCCGCAGTTCTTCGATGCGCCGACGGT CGCAGGGCTTTCGCGCCTGGTCGACGCAGCACGGGCCGAAGGCGCCGGACCCGTCGCGCCGGC AATCGGCCGTGTCGAGCGAGACGCCTACCGAATCAAGCCGCCCGCGGCCGAACAGGCCGCCCG CACCAAGCCGTAACAAGAAGGGGATCGAGTCATGGAACCCGTCGGCGGCGTGGACATGAATCA GCCCGCAAAGCAGCAGGAGACCTGCGTCTTCCCGACCTCCTTCGCGCAGCGGCGGCTCTGGTT CCTCGACCAGCTCGAGCCGGGGAGCGCCGTCTACAACATGCCCGCCTCCTTCCGGACGCGCGG GCCGTACGACGTCGACTCGCTCGTGCGCAGCGTGAACGAGATCGTGCGGCGCCACGAGTCGCT GCGCACGACCGTCGATGTCATCGATGGCGAACCCGTGCAGGTGATCGCCCCCTCGCTGCGCAT CGAGGTGCCCGTCGTGGACCTGAGCGAGATCGACGAGCCGGAGCGAGAGGCGGAGGCCCGGCG GCTCATGGCGGAGGAGAGCCGCCGCCCCTTCGATCTCACGCGAGGGCCGCTGCTCCGAGCCAA GCTGCTCCGGCTCGGCGAGGCCGATCACGTGCTGATCTTGACGATGCATCATATCGTCTCCGA CGGCTGGTCGATGGACGTGCTGTTCAAGGAGCTTTCCACGCTCTACGCCGCCTTCCACGAGGG CCGCCCGTCGCCGCTCCCGGAGCTGCCGATTCAATACGCCGACTTCGCGGTGTGGCAGCGGGA GCTGCTCCAGGGCGAAGTTCTGGAATCGCACCTCGGGTACTGGAGAGAGCACCTCCGCGGCGC CCCCACGCTGCTGGAGCTTCCGATGGACCGGCCCCGGCCGCCGGCGCAGACGTTCCGGGGCTC
CCAGCGCGCGTTCCGACTCCCACTCTCCCTGCAACAGGCGGTGCAGGCGCTCAGCCGGCAGGA AGGCGCGACCCCCTTCATGACGCTGCTGACGGCGTTCAGCGTGCTGCTCTCGCGTTATGCGCG GCAGAGCGATCTGGTGGTTGGCACGCCCATCGCGAATCGCACCCGAGCAGAGCTGGAGGGGCT GATCGGCTTCTTCGTCAACATGCTGGCGCTGCGCATCGACCTCGGGGGCGACCCGAGCTTCCG CGAGCTGCTCGGGCGGGTGCGGGAGGTGACGTTGGGCGCCTACGCGCACCAGGACCTGCCCTT CGAACGGCTGGTGGAGGAGCTGTCACCAGGGCGGAGCCCCAGCCACAGCCCCTTGTTCCAGGT GTCCTTCACGTTGCAGAACACCCCGATGGATGCGACGAACAGAGCAGACATTGCATCGGGTGG CGCGCCGCTGGTGGAAATGAAGGCGGCGAAATTCGATCTGATCCTGGAGCTCTCGGAATCGCC GCAAGGGTTGCTCGGCACGTTCGAGTACAACACCGACCTGTTCGACGCCGGCACCATCGAGCG GATGGCCGGCCACCTGGAGGTGCTGCTCTCCAGCGCCGTCGCGGCGCCGGATCGACCCATTGC GGAGCTGCCGCTCATGGGGGCCGAGGAGCGCAGTCGGGTATTGGTGGAGTGGAACTCCACTGC CGCGCTGTATCCCGAGGACCATTGCATGCACGAGCTGTTCGAGCAGCAAGTGGAGCGGTCGCC CGAGGCGACCGCGGTGCTCCTCCAGCAGCAGACGTTGACGTATCGAGAGCTGAACATGCGCGC CAATCAGCTCGCGCATCACCTGCGGAGCCTGGGCGTGGGCCCAGAGGTGCGCGTCGGGTTGTA TCTCGAACGGTCAATCGAGACGGTCGTGGCGATCCTCGGCGTGCTCAAGGCTGGCGGGGCCTA CGTGCCGCTCGATCCGACGTACCCCAGCGAGCGCCTCGGGCTCATGATGGCGGACGCAGCGCC CTCGGTGCTGCTCACGCAGGCGTCGCTCCTCTCGAAGCTGCCGCCCCACGGGGATGCAACGCT GGTACAGCTCGACGCGCTGCACGAAGCGCTCTCCAGGCTGCCACACCATACCCCGCGGAGCGG CGTCACCGCCCAGAACCTCGCATACGTCATGTACACTTCCGGCTCGACCGGGCGGCCCAAGGG CGTGCTCGTCGAGCACCGCGGCCTCTGCAACCTGCCCACCGTGCAGGCCAAGCTCTATGGAAT CGCGCCGGGCGACAGGCTCCTCCAGTTCGCGCCGCTCTGCTTCGACACATCGTTCTGCGAGAT CGCGCTCGCGTTGCTCTCGGGAGCGACGCTGGTCATGGGCACGGCGGACGAGCTTCTCCCGGG ACCTCCGCTGGTCGAGCTGCTGAAGAAGCACGCGGTCACGGCGATGCTCCTGGCCCCTACCGT GCTCGCAGCGCTGCCAGAACAACAGAGCGCGGCGTTGCCGCTGCGCGTGCTCACGATGGCCGG TGAGGCGTGCCCGGCGGAGCTCGTCAAGCGCTGGAAGGCACCCGGACGGCGCCTGTTCAACTC CTATGGCCCGACCGAGACGACCATTTGGGCAAGCTCCGCAGCGGACCTGTCCGACGAACGGAT CCCGCCCATCGGCCGTCCGATTGCCAATACGCAAATCTACGTGCTCGACGAAGCGCTCGAGCC GGTGCCCATCGGCGTGCCGGGCGAGATCTTCATCGGCGGCGTGGGCGTCGCCCGGGGATATCA CGGGCGTCCGGACCTGACGGCCGAGCGATTCGTACCCGACCCCTTCGGGCAAACCAAAGGGGC GCGCCTGTATCGGACCGGCGATCGGGCGCGCTGGCTGCCGGACGGAAACCTCGAGTTTCTCGG
TCGAAACGACGAGCAGGTGAAGGTCCGCGGTGTCCGCATCGAGCTGGAGGAGATCCGCGCGGC GTTGCTCAAGCACCCGGCGGTCGCTCAAGCCGTGGCCGTGGTGCGCGAGGACACGCCGGGGGA CAAGCGGCTCGTCGCGTATGTCGTCGGACGCGGAGGAGCGCGCGTGACCGCCGCGGAGCTGCG CCAGTCCGTGAGCGAGCGATTGCC GCGACCATGGTGCCATCGTCCTTCGTGGCGCTCGACGC CTTGCCCCTGACGCCGAATGGCAAGGTGGACCGCCGCGCGCTGCCGGAGCCCGAGCAGAGCGC CGGCGGCGAGGACCACGTCGCGCCGCGCAACGCCGTCGAGGAGGAGCTCGCCAGGATCTGGGC GAGCGTCCTCCGGCTCGAAAGGGTCGGCGTCCACGACAACTTCTTCGAGATCGGCGGCGACTC GATCCTGAGCATCCAGATCGTGGTGCGCGCGCAGCAGGCAGGGCTGCGCCTCACCCCGCGTCA GATGTTCCAGCACCAGACCATCGCCGAGCTTTCGACCGTGGCTAGAGCCGTCGAGGCGGTCCA CGTCGAGCAGGACCCGGTGACCGGTCCCGCGCCGCTCACGCCGGTGCAGCGCTGGTGGCTGGA GCAGGAGGCGGCCGAGCCGCACCACTTCAACCAGTCGATCTTCCTCGAGGTACGCGAGCGGCT CGACGAGAGCGCGCTGGAGCAGGCCATCGCGCATCTGATCGACCACCACGACGCGCTCCGGTT GCGCCTCGCGCGCGACGAACGCGGCGCCCACCAGGTCTTCGCCGCGCCGGGAGGCTCGACCCC ATTTCAGCGCGTCGACCTCGGGGCGCTGCCCAGCGCCGAGCAGATCTCCGCCATGGAGAAGGC CGCGAGCGAGGCGCAGGCGAGCCTCGATCTGGCCGCGGGCCCGGTCGTCCGCGCCGTGCTCTT CGACCTCGGCGAGGTCGCCCCGCAACGGCTGCTCGTCATCGCCCACCATATTGCGGTCGACAG CGTCTCCTGGCGGATCCTGCTCGACGATCTCTTTGGGGCCTATGAGCAGGCGCGCCGCGGCGA GGCCGTACGCCTGCCGCCCAAGACCACGTCGGTCAAGCGCTGGGCCGAGCTGCTCACCGAGCA CGCCGGCTCCGAGGCCGTCAAGGCGGAGCTCGGCTACTGGCTCGACTCATCGCGACGAACGGT AGCTCCGCTGCCCGTGGATCGACGGGCCGGCGAGGACGTGTGGGGCTCGGCGCGCCACATCGT CGTCTCGCTCACGCCGGAGCAGACGGAGCAGCTCCTGCGCGAGGTGCCGCAGGCGTACCGCAC ACGGATCGACGACGCGCTCCTCACTGCGTTCGCGCAGGCCATCGCTCGGTGGACGGGCTCGCC GGCGGTGCTCCTCGACCTCGAGGGTCACGGGCGCGAGGAGCTCGCCGGCGTAGACCTCACGCG CACGGTCGGCTGGTTTACGGCCATGTACCCGATCCTACTCCGCGTCGACGCGGCGGATCCGGG TGAGGCGCTCAAATCGATCAAGGAGCAGCTCCGCGCCGTGCCAGGCCGCGGGCTCGGCTACGG CTTGTTGCGTTACCTTCGGTCCGATACCATCGCCGAGGTCCGCGCGTTGCCGCAGGCCGAGCT CTGCTTCAACTACCTCGGCCAGCTCGATCAGGCGATCCCCGAGGCTGCACCGTTCCGGCCGGC GCGCGAGTATCAAGGCTCGGAGCGCAGCCCCGGCGCCCATCGCGCCCACCTCATCGAGGTGAA CGCGAGCATCGCCAATGGGCGCCTGTACGCCACGTGGACGTACAGCGAGCGCCGCCACGAGCC CGAAACCATCGAGCGCGTCGCGGCGAGCTTCGTCACGGCGCTCCGCGCGCTCATCGCGCACTG
CACCTTGCCCGAGGTCGGCGGCAACACGCCTTCCGACTTCGACAAGGTGCGCCTGCGCCAGGA GACCATCGATGCTCTCGACGCAATCGACGCGGGCCCCGGGCCGTCTGCGAGGGGGAGCCGAAT CGAAGACGTCTACCCGCTCTCGCCGCTCCAGGAGGGCATCCTGTTCCACACGCTCTACGCCAC CGATTACACGGCGTATGTCGAGCAGTTCCACTGGACGCTGGAGGGCGATTTCGACGCCGAGGC GTTCACCCGCGCCCTCCAGGACGTGGTCGCTCGGCATGCCGCCCTGCGCACGTCGTTCGCCTG GGAGCGCCTCGATGCTCCACTTCAGATCGTCCGCACGGGCGCGGTCCTCCCCGTCGAGCACCA GGACCTACGCGGCCTCGCCGCGGAGGAGCAGACCGCGCACATCTCCCGTTACGTCGAGGCAGA GCGCCAGCGCCGGTTCGATCTGCGAAAGGCGCCCCTCATGCGCGCCGGGCTGCTCCGGCTCCG CAAGGACGCCTGGTGCCTCGTCGAGACCATCCACCACCTGATCCTGGACGGCTGGTCGACACA AATCTTGCTCAAAGAAGTGTTCACGCTCTACGAGGCGCACCGCGGACACCGTGGGCATCTCGC GCTGGAGCTCGAGCAGCCGCGGCCCTACGGCGATTACATCGGCTGGCTCGCGAAGCAGGACCA GGTGCGCACCGCGGCCTTCTGGCGGCGCGAGCTCGAGGGCTTCTCCGCGCCGACGCCGCTCGG CGTCGACCGCGCTGTGCCGCACGACGACGGCGGCCCGCGGTTTGGTTGGCGCCGCATCGCCCT CTCGGGCGACGACGCGGCCCGGCTCGCCGCCTTCGCGCGTCAGCATCAGCTCACGATGAGCAC GCTGGTGCAAGGCGCGTGGGCGCTGCTCTTGTCACGCTACAGCGGCGATCCCGACGTGCTCTT CGG ATGACCGTCTCGGGCCGCTCGGCGCCGATTCCCGGTATCGAGCGCATGACCGGCCTCTT CATCAACACCATTCCGGTGCGCGTGCGCGAGCCTGCCGACGCGTCGGTGCTCGCGTGGCTCAA GGCGCTCCAGGAGCACGAGGCAGAGCTGCTCGAGCACGAGCACAGCCCGCTGGTCGAGGTCCA GGCCCATAGCGACGTGCCGCGCGGGACCCCGCTCTTCGAGAGCCTCGTCGTGTTCGAGAACTA CCCGGTGCAGGTCATCTTCGAGGCCCCTCCGGTCGAGGGGCCGACGCGCGCGGAGGAGGGCCT CCGCATGATCGATGCGCAGTATATCAGTGATCCACCGTATCCGCTGACGGTCGTCGCGGCCTT CCA GGGACGCTTTATCTCAATATTGGCTACGAGCGCCGCCGGTTCGACGACCAGGCCGTCGA ACGGATGATCGGGCACGTCACGACGCTGCTCCGGGGCTTCGTGCAGAGGCCCGAGACGTCGGT CCGCGATCTGCCGTTGCTGACGGCCGAGGAGGAGCGCACCCAGCTCCACGCGTGGAATGCCAC GGCCGCGCCGTATCCCGAGGGCCATTGCATGCACGAGCTGTTCGAGCAGCAAGTGGAGCGGTC GCCGGAGGCGACCGCGGTGCTCCTCCAGCAGCAGACGTTGACGTATCGAGAGCTGAACATACG CGCCAATCAGCTCGCGCATCACCTGCGGAGCCTCGGCGTGGGCCCAGAAGTGCGCGTGGGCTT GTGTCTCGAACGGTCGATCGAGACGGTCGTGGCGATCCTCGGCGTGCTCAAGGCAGGCGGGGT CTAGGTGCCGCTCGACCCGACGTACCCCAGCGAGCGCCTCGGGCTCATGATGGAGGACGCGGC GCCCTCGGTGCTGCTCACGCAGACGTCGCTCCTCTCGAAGCTGCCGCCCCACGGGGATGCAAC
GCT GTACAGCTCGACGCGCTGCACGAAGCGCTCTCCAGGCTGCCACACCATACCCCGCGGAG CGGCGTCACGGCCCAGAACCTCGCATACGTCATGTACACTTCCGGCTCGACCGGGCGGCCCAA GGGCGTGCTCGTCGAGCACCGCGGCCTGTGCAATCTGCCCACCGTGCAGGCCAAGCTCTATGC AATCGCGCCGAGCGACCGGCTCCTCCAGTTCGCGCCGCTCTGCTTCGACACATCGTTCTGCGA GAT GCGCTCGCGTTGCTCTCGGGAGCGACGCTGGTGATGGGCACGGCGGACGAGCTCCTCCC GGGACCTCCGCTGGTCGAGCTGCTGAAAAAGCACGCGGTCACGGCGATGCTCCTGGCCCCTTC GGTGCTCGCAGCGCTGCCAGAACAACAGAGCGCGGCGTTGCCGCTGCGCGTGCTCGCGATGGC CGGCGAGGCGTGCCCGGCGGAGCTCGTCAAGCGCTGGAAGGCACCCGGACGGCGCCTGTTCAA CTCCTATGGCCCGACCGAGACCACCATTTGGGCAAGCTCCGCAGCGGACCTGTCCGACGAACG GATCCCGCCCATCGGCCGTCCGATTGCCAATACGCAAATCTACGTGCTCGACGAAGCGCTCGA GCCGGTGCCCATCGGCGTGCCGGGCGAGATCTTCATCGGCGGCGTGGGCGTCGCCCGGGGATA TCACGGGCGGCCGGACCTGACGGCCGAGCGATTCGTACCCGACCCCTTCGGGCAAACCAAAGG GGCGCGCCTGTATCGGACCGGCGATCGGGCGCGCTGGCTGCCGGACGGCAACCTCGAGTTTCT CGGTCGAAACGACGAGCAGGTGAAGGTCCGCGGTATCCGCATCGAGCTGGAGGAGATCCGCGC GGCGTTGCTGAAGCACCCGGCGGTCGCTCAAGCCGTGGCCGTGGTGCGCGAGGACGCGCCGGG GGACAAGCGGCTCGTCGCGTATGTCGTCGGACGCGGAGGAGCGCGCCTGACCGCCGCGGAGCT GCGCCAGTCCGTGAGCGAGCGATTGCCCGCGACCATGGTGCCGTCGTCCTTCGTGGCGCTCGA CGCCCTGCCCCTCACGCCGAACGGCAAGGTGGACCGCCGCGCGCTGCCGGAGCCCGAGCGGAG CGCCGGCGGCGAGGACCACGTCGCACCGCGCAACGCCATCGAGGAGGAGCTCACACGAATCTG GGCCGACGTACTTGGGGCAAAGCGGGTCGGTGTGCACGACAATTTCTTCGATCTCGGCGGCCA TTCCCTGCTGCTCGTCCGGGTGCATGATCGGCTCGGCCAGCGGTTCGATCGGCCGCCCTCGAT GGTCGACCTCTTCACCTATCCGACCGTGGCGTCGCTCGCGCGGTTCCTTGGCGAACGGGCGAA CGGCAAGCAATCGCCGAGGGAGGCCGCGGCGGACGTCACGGAGCGCGGCCGGCGCCGCCTGGA GGCGCGGGCGCGGCGGGCGAAGGCCATCCGTGGCCCGACCTGACCCGGGCACCCTTCCAAGCC CCGCCGTTCCTCGCACATCCGCCGCCTCGAGCGCCGCGTCCAGCGCCGCCGTTCGCCGACGAG GAGGCGCGAGACGACGGTCCAAGGCCTTCGTGGGCTCTTTGCCCCGCAATCCGGAAGCTGCGC GGCAGTTCGTCGCCCCTGCAATGCTGCCATTGTAGAGCTCCTCCGCTCGCCGCGGCCTCTTTT CTTGCGGCCCGTCCGCGATTGACCTCACATCCTGATCCCTTCTTGCGTCGTCCAGAAAGTGAT TGACGGCCAGCGCCGCGCTTGAGATCTTCCGGCGCGCGGCGATTTCATCGCTCCGGCGCGCCG TGACTGTCACCTGCGAAGGGATTATAATGAAACATAACATTGGGTGGCTTCTACCCGCCGCCC
TCGCGACGCTTGCCTTCGTCCCGGCCTGCAGCCCGAATCACGGTGAGGATGCGCCCTCCGTGA CGTCAGCAGAGAGCGGCGCGGCGCCGAGCGCTGACTGCGTCGCGCTCGGGGCGAAGCTCCAGG CGGCGCTGGACGGCGCCGCCGCCGCGCAAAAGGCTCCGGGAGCCGCAGCGGCGGTCCAGAGCG GGGACTGTGTCTGGCGGGGCGCCACGGGCGTCTCGGACCTGGTCGCGAGCACGCCGACGAAGC CTGGAGATCTCTTTCGGATCGGCAGCATCACCAAGACCTTCGTCTCTACGCTGATACTCATGC TCCGGGCAGAAGGCCGGTTGTCGCTCGACGACGCGGTGTCGAAGTATGTGAAGGGCATCCCCG CCGGCGACCAGATGACGCTGCGCCAGATCCTCGGTCACACGAGCGGGCTCTTCGATTACACGT ACAGCCCGGCGCTCGGCCAAATGATCGAGGTGGATCCGACCCGCGCCTTCGCGCCGGCAGAGC TCATCGCCCTCGCCACGGCCGAGGCGCCGTATTTCGCGCCGGGCGCGGGTTTTCGCTATTCGA ACACCAATTACATCGTGGCCGGCCTGGTGGCCGAGGCGGTGTCGGGCGGGACGCTCGCCGGGC TGCTCCGCACGCGCATCCTAGACCCTGTGGGCCTCGCGCACACGTATCTGGACGGCGCCGAGC CGCCGGTCCAAGGGCTCATCCGCGGCTACGGCGACTACGGCGCGGGCTTGGTCGACATCACCG ACCAGCTGTCGCCCACCGAGGCGTGGGCCGCCGGCGCCCTGGTGTCGAACGTCGATGACCTCA ATCGCTTCTTTGCCCTGCTCATCAGCCACGAGCTGCTCTCGTCGGACGAGCTTCAGGACATGA CCACCTGGACCCCGACGATGTGGCCCCACGAGCCCGGATATGGCCTCGGCCTCATCGAGCGCG ATTCTGCGCTCGGCTCCCTCAACGGGCACTGCGGAATCATCTGGGGCTTTCAATCGGCGTCGT ACGGGGTGCCCGGCCGCGGCGACGGGATCACCGCGCTCATCAACCGGAGCGACGGCGACGCAG CGCGGCTCGTCGACGAGCTCGCGAAGGTCGTGAAAGAGCGCTGATCGAGGCGGAATGGGAGCG CTTCGGCGGGTGGTGATGGCGCCCGGCGCTCAGAACGCGACGCGCAGCCCCGCGCTCAGCGGG CCTGCGCCGGGCGACGCGGCCACGGCGCCCGGACCGACGAGGAGCCGCGCGACGGCGGGCGCG CTCGGCGCGTCGTCTCGCCGCACCCGCCGCTTGCCGAACACGTAGAGCGGCAGGCCGACGGCG ACCCCGGCCACCCCGCCGAGCGCGGTGGCGATCGCCACCTCGGACGCCTCGGCGCGCGCGGCG CTGCTGTCGTCGTGGCTCGCGAAGACCAGCACCGCGCCGCTGAGGATGGCGGCGCCGCCCAGG GTCGTGAGGACGAGCCCCGAGATCACCATGACCGGGCTGTTCCACTCCGTCGTCCGCTCCTCG AAGTCGCGGAACGCCGCCCTCGCCGCCGCGAGCTCCAGCTCGATCCGGCGCTGCTCGGCGCGC CGCTCGTCCGCGGAGCCGATCTCGTGGACGCGGCGGGCCTGGCCCTCCAGCGCCGCGATGCGC GCCTCGTGCGCGGCGGCGGTCTCCTCCCACGTGGCCCCTGGCGGGACCCCCGCCACGGCCGGC GCGACAGAGGGCGCCGACGCCGGGGTCGAGGCGGGCGCCGCGGGCGGCTCCGCGGCCACGGAA GGCGCCGCCGCCGCGGGAGGCGCGGGCGGCTCCGCGGCCACGGAAGGCGCCGCCGCCGCGGAA GGCGCGGGCGACTCCGCGGCCACGGAAGGCGCGGCCGCCGCGGGAGGCGCGGGCGGCTCCGCT
GCAACGGAGAGCGCGGGCGCCGCGGCAGCCAGGCCCAGCGCCCACGCGACGACACGACGGCGC GCCGCAACCGCGCGCGGGCGCGCGAAGCGGAGGTGGACCTGCTCCATGCGCGCAGCGTCGCCC CTCGACAGGGCCGGGTCAAGGCGCGGGAGTCCGAGAGCACGAGACCTCCGCGCCGCAGGAAAC AGGCGCGCCGGCGGCCCGCGCGGCGGCTCGCCGCTCACCCCTCGCGCGGCCGGCCGCGGCGCC GCCTCCCCTCCCCGGCGGGCCGCGCGTCGGCGGCCACGCGGAGCAGCTCCTGGAAGTGCCGCT CCACCGGGCCGAGGTCGATGCCGTCCATGAACGACGTGAACGCGAAGTACGGCAGCAGCGTCT GCCAGGCGCGCAGCCAGCCCGGGAGGTAGCGCGGCCGCTCCAGCACGCCGGCGGCGCGCAGCT CCGCCAGCACCGCGATGTCGTCGAGCGCGATCCGGCCGCACACGAGCAGCTCCACCTCGTCGC GCAGCCCGCCGCGAAGGACCGCGGCCAGGAAGGCGTAGACCTCGAGCAGCCCGGCCAGGTAGC AGGCGTCCTTGGTGAACGGCGCGCCGCCCTCGACGAGCCCGCCGCGGCACACGCGCTGGGCGT CGAAGTAGGCGTCGCGGCGCTCGGCGCCGCGCTCGCGCAGGTGCCGGTACAGGTCGAGGAAGC TCGCGCCCTGCTCGGCCATGTCCACGAGCCGCACCCGCTCGGCGAGCCGGGTGAGGCGGCCGA TGGAGAGCGAGCGGCTGTAGAGCTCGGCGAAGATGGCGAGCCCCTCCTGCGTGCGCGTGGTGC GGGGGCCGCCCGAGCGCAGGAACGCGCACCGCGGCTGCGCCGCGCCGTTGTGCGCGGTGAGCG CGTGCGTCTCGACCTCGTGGTGCCAGAGCCCCTCGGCCTCCCACGCCGCGAAGGTCGCCTCCG GCCGGATGCGGACCCGGCTCATGCCGGCGACCACCTTGGCCGTGACGCGCGGGTCGACGGTGA TCTCGAGGTCGAGCCGCGGCGCCCGGCCGGCCACGCGGGCGGCGAGCATGTCCCGGAGCGCGC CGGCGTCGAGCGGCTCCTCCTCGGGATCGCTGGCCTCGTCCCAGCCGTGGACGCGCAGGCGCT CGGTGAGGTGCTCGGCGAGGTCGA GTTCCTGAGCGAGCCGCCGAAGAACCGCGAGCGCGCGC CGCCGTAGAGCTCCTGCGACCGCGCGGAGAACGCGCGGGTGCCCGCGGCCTCGAGCAGCTCCG CGGCCTGGATCTGCGCGCGGACGTTGTCCCGCAGCCAGCCGAGCGCCGGCGCGTCCCCGTCGA TGGCCCCGAGGAGCTCGCGCAGCTCGGCGACGCGCCGCGCGAGGCCGTCGCGATCGACGCGGT ACTCGACCTCGGGGAGGCGGTCCTCGCCGGCGGCGAAGAAGCGCTCCTCCACCTCGCGCGGCC AGGCGATGTCCTCGAGCAGCTTGAGGGCCTTGCCCTCCGCCAGGCGGCCGCCCACCCGATCGA GCTGCTCCAGCACGGCGCGGTCGAT3CTCATCGAGCGCAGGATCGCCGAAACCGCGAGACGCC GGAACCGTCATTCCCTCGACGAGGCAGCGATTGCCATGTTCCGTCGCTTTTTGGAGCGCCGTC GTCGCGCTCGCCTGCGGGCTCCGGCGATCCAGCGCGGTTGCATGCAGCGAGGGTGTTCCGGGG CTGGCTCGAGAGCGTCCTTTGGCCCACACCCGAGACACGAATGCTCCGCGCCGAGCGCGGTTG ACCGTGGACCCGCCGGAGAGCCGA 3ATACGGTCCGGCCGATGTCGGAGAGTGTAGCTCAACT CGAAGAACACCGCGCGGCGCTCACCGGACACTGCTACCGGATGCTGGGTTCGGTGGTCGACGC
CGACGACGCCGTCCAGGAGACGATGGTGCGCGCCTGGCGGAGCCTGGATAAGTTCGACGGGCG CTCGTCGCTGCGCACCTGGCTGTACCGCATCGCGACGAACGTCTGCATCGACCTGCGGGCCGA CCGCGCGCGCCGGGCGCGCCCCATCGAGGAAGGCCCGGTCGGCACGGTGGACGACGCGCTCGA GACGCGCCCGCGCACCCACTGGCTGGAGCCCGTCCCCGACGCGCACGCCCTGCCGGCGGACAT CGACGCCGCGGAGCGGGCGATGC CCGCCAGAGCATCCGCCTCGCGTTCGTCGCGGCGCTCCA GCACCTGCCGCCGAAGCAGCGCGCCGCGCTGCTGCTCACGGAGGTGCTCGGCTGGTCCGCCGC GGAGGTCGCCGACAGCCTCAACACCTCGGTCGCCGCGATCAACAGCGCGCTCCAGCGCGCGCG GGCGACGCTCGCGAGCCGCGATCTCGGCGACGCGCGCCCCTCGCTGCCGGAGCCGCAGTCCGC GCTGCTCGACCGCTACGTCAACGCCTTCGAGCGGTACGACGTCGACGCGCTCACGGCGCTGCT GCACCAGGACGCGACCCTGTCGATGCCGCCGTTCACCCTGTGGCTCCGCGGCCACGAGTCGAT CCGCGCCTGGCTCGTGGGCCCGGGAGCGGGCTGCCGCGGGTCGCGGCTCATCCCGACGGCGGC GAGCGGCTCGCCCGCGTTCGCGCAGTATCGCCCGGCGCCGGAGGGCGGCCACCGGGCCTGGGC GCTCATCGTCCTCGACGTCGCGGGGGACCGCATCGTCAGCATGACGTCCTTCCTCGACACCGA GACGCTCTTCCCGCGGTTCGGCCTGCCGCTCGATCTACCGGCGTAGCCGCGGGCGCCCTGCCT GCCTCGCCGCGGGTGCCCTGCCTGCCTAGCCGCGGGCGCCCGGCCTGGCCACGGGCGCCCGGC CTGGCCACGGGCGCCCGGCCAGCGACGGGGCGACGATTTTTTTCTGAGCGACCGATGAGTCCT GACGGGGCCGGGGGTCTACGGGGGTGAATCCAACACGGAGGCACCCATGACCGTGACCATCGC CAGGATCGATCATCGTGACCAGGACCTCATGACCGGGCCCCAGGCCAAGGCGCCGGCCCGCGC GGCGGCGCCCGACGCGGCGCCGTCCAGGCGAGCCGTGTGGGCGGGCCGCGTCCTGAGCGGGCT GGCCACGCTGTTCCTGACGTTCGACGCCGCGGTGAAGGTGCTGAAGCTGTTCCCCGCGGAGGC GTCGACCGCCGAGCTCGGGTTCCCGGCGCACCTCGTCCCCACCCTCGGCTACCTCCAGATCGC TTGCCTCGTGGCCTACCTGATCCCGCGCACCGCGGTGCTCGGCGCGATCCTGTGGACCGGCTA CCTGGGCGGCGCGATCGCGATCCACGTGCGGGTCGAGAACCCGCTCTTCAGCCACACGCTCTT CCCCATCTACGTCGCCGCGTTCCTCTGGGCGGGGCTCTGGCTGCGCGACCGCCGCGTGCGCGC GCTGACCGCGAGCCCGTCGTCGCAGGGCCGATGAGCTTCACGTTTCACGAGAGTCCATCACGG TAAAAGGAGAAGCGAGCCATGACCACAAAGAACCCCCGCAAGCTCTTCGTCAACCTGTCCGTC CGCGACCTGAAGCGATCGATGGAGTTCTTCAGCAAGCTCGGGTTCGAGTTCAACCCGCAGTTC ACGGACGAGAAGGCCGCCTGCATGGTCGTCAGCGAGGAGGCCTATGTCATGCTCCTCGTGGAG TCGTTCTTCAAGACGTTCATGAAGAAGGAGATCTGCAGCACGAGCACGCACACGGAAGGGCTC TTCGCGCTCTCGTGCAGCAGCCGGGCCGAGGTCGACGACATGGTGAAGAAGGCGGTCGCGGCG
GGCGGGTCGCACGCGATGGATCCGCAGGATCACGGCTTCATGTACGGGTGGAGCTTCTACGAC GTGGATGGCCACCACTGGGAGGTCATGTGGATGGATCCCAAGGCGATCCAGCCGTAGCCGACG GGGCTGGGCGCGCCGCCTGGAAGAGCCCCCGTGAGGCGGGGAGGCGGGAGGATCACCGTCTTC GTAGCCCACAGCGATGCAGTATCCGTCGCGCTTCGTATCGAAGCACGGCTGTTACGGGCGCGT CAGAGCGCGTCGCAGGTGATGCCGAGCCGCAGCAGCGACACGGGCACGAGCGTGGCTCCGATG GAGATGAGCCGAGTCTCGCCCATGGTCTCGGGGTCATGAATGGATGAGTAGGGGACTCGCTCC TTCGTCACGTCGTGCTCGACGGCGACGGCGAGGCCGAGCTCGAAGTGCACGGGGCCTGGACCG AAGATCCAGCTCGCCCCGGCGCGAGCCCCGACGAAAAGCGTGTCGCCGTCGACGCCAGGGCCG TCGTCCCAGCCGGGCGATCCCACCGCGGTGTAGGTGTGTTTCCCGAAGGAACCCGCGAGCGAG AGTCGAAGTCCGACCGGCGCTCGCCACGCGACGCCCGCTGTCGCGCCGACGCCGCCGAAGCTC TCCCCGAAAGGCTTATCCCCTGTCTCGATGAAGCCACCCACCTCGATGACGCTGATGCGGTAC GTGAGCGCGAGATTGAGGTGCACCCCAGCGCTGTCCGAGCCCGAGTAGAGGCCGGCGCCCACC TGCACGCTGAAATCCATGCTCGGCGCGGATCCGCGCGCAGGAGCGACGCCAGGGGCGCTGCCC TCCTGCGCGCGGGCCGTCCCGACGCAAAGAAAGAGGGCTGTCGCGAAGAATCCAAGCGAGATC GATCGAAGTGAGCGCATGTCGGGCCCTGGAGCATCCGCTGTACCAGGTGCGTCGTATTCATGC GGCGCGCCGCCGGGCGCCGCCGCGCTGGCCTGTCCGACGCGAGATCACGAATCCGCCATCGCT CCCCTGGGCCGCCGGCCGCTCTGGTTCGCCTGCGGGCGTGCGCCGGCGCTCGTGTGGCCCATG GCAACCTTGTCGCGGTGTCGCTCGAACAGCACAGAGAGTATCGCGTCCGCAACAACCGCGCGA CCCGGCGAGACGCTCGTGGGGCCCCCTGCCTCCCCACTTCATCATAACGCCATCAGGAGCACT CGACATTTCATTTCTTCACCTCCACTGGCTGAGGGCGACGGTGCTCGTCATCGGCCGGTTGCT CTGGCGGTTGCTCTGGCGGGGTTTCTGACGCCCGGAACTAACGCTTCGAGCGCTCCCCCTTGC TCTCCCGTTCCTTCAGCTCCTCCAGCAGGTCGTCGAGGCGCTCGTAGCTGCCTTCCCAGAAGC GGCGGTAGTTGTCGAGCCAGCCGCTGGCGTCCTCGAGCGGCTTGGCCTCGATCCGACAAGGCC TCCGCTGCGCGTCGCGGCCGCGCGAGATCAGGCCCGCTCGCTCCAGCACCTTGAGGTGCTTGG AGATCGCGGGCTGGCTCATCGCGAACGGCTTCGCCAGCTCGGTCACCGACGCCTCGCCGGACG CGAGGCGCGCGAGGATCGCTCGCCGTGTCGGATCGGCGAGCGCAGCGAACGTTGCGTCGAGGC GCTCGGACGGGGTCATTGCATAACTCCTTGGTATAAAAACCAGTTAGTTATACAACCTGGGGC CCGGGCGGTCAAGCCTCCAGGCGATGGCGGTTCGGCCCGGGGGCTCCGCTCGCGGCACGCGCG CCGCGCGGCTACGTGCGCGGCGCGGTGAGCACGTCCTGCAGCGTGGCGCCGACCACGGGCTTG GTCAGGTGCAGGTCGAAGCCGGCCCGCCTGGACCTGGCCTGATCGTCGGGCCCGCCGTAGCCC
GAGAGCGCCACCAGGTAGAGCGCTTCGCCGCCGGGCGCGGCCCGCGCCCGGCGCGCGACCTCA TAACCGTCGATGCCGGGCAAGCCGATGTCCACGAAGGCCACCTCGGGGCGCAGCTCCAGAAGC TTCTTCACGCCCTCCAGCCCGTCCACCGCCACCGTCACCTCGTGCCCCAGCGCCTCGATGTAC GCCCGCATCACCCGGCGCACGTCCTCCGCGTCCTCCACGACGAGCACCCGGCGCCGGTCAGCC GCCGCCTCGGGCGCCTCGGCGCGCTGCGCCGGAGGCGGCGGCGGCTCGTCGCGCTGCGCCGGA GGCGGCCCCTCGCGCGGCGGGGGCGGCCCGGCGCTCGGGGCAGGCTGCGGCGCCGCCCCGGGG CCGAGCGGCAGGCGCACGGTGAACTCGCTGCCCTGGCCCGGCCCGGCGCTCGCCGCGGCCACG CTGCCGCCGTGCAGTTCCAGGAGCCGCCGCACCAGCGTGAGCCCGAGCCCCAGCCCGCCCGTG CTCCGGTCGATGGTCTGGTCGACC73CGTGAACAGATCGAACACCTTCTCGAGCATCGCCGCC GGGATGCCGCGGCCCGTGTCGCGCACCCGCAGCACGGCCTCGGGCGCGCCGACCGCCGCCTCG CGCGTGAGGCGCACCGAGATCGAGCCCCCCGGCGGGGTGTACTTCGCGGCGTTGGTCAGGAGG TTCGTCACCACCTGCTCCAGCCGCGTCGCGTCGGCCCGCATGCCGAAGTCCCCGGGCCCCACC GACAGCGACACGTCATGGCGCCGGC-CCTCGACGGCCGGCCTCACCGCGGCGGCGGCGCTCTGC ACCACCGCCGCGAGATCGACGTCCTCGAGGCGCAGCTCCACCGTGCCCCGCGTGATGCGCGAC ACGTCGAGCAGATCGTCGACCAGCCGCACGAGGTGGCCCATCTGCCGCCGCGCGATCTCCCGG TAGCGCGCCGACGCGGGCCCGTCGCGGTCCGCGTCGTCGAGCAGCGTCAGCGACAGGCTGATC GAGGCCATCGGGTTCCGGAGCTCGTGCGCGAGCATCGCGAGGAACTCGTCCTTGCGCTGATCG GCGAGCTTCAGCGCCTCGACGAGCGCCTCCACGCGCCTCCGGGCGCGCACCTGGTCGGTCACG TCGAACGCGAACACGAAGACGCCCTGGACCGCCCCGTCGCGATCGCGCATCGGCTGGTAGACG AAGTTGAAGAACACCTCCTCCGTCGTGCCGTCGCCCCGGCGATCGAGCCGCACCGGGAGCTCC TTGCCGACGATGGGCTCGCCGGTGCGGACCACCGCGTCGAGGAGCTCCCAGATGCCCTGTCCC TCGAGCTCGGGGAGGGCGGCCCGGATGGGCTCGCCCACGAGCGATCGACCGCCGACGAGCCGC TGGTAGAGCGGGTTGACCACCTCGAAGACGTGCTCCGGCCCGCGGAGGATGGCGATGGGCCCC GGGGCCTGCATGAAGAGGTCGTTCAGGTACTGGCGCTGCCCCTCGGCCTCGCGCCGGCGGCGC GCGAGCTCGACGTGGATGCGGACCCGCGCGAGGAGCTCCTTCGCGGAGAACGGCTTCACGAGG AAGTCGTCGGCGCCGGCCTCGAGGCTGTCGACGCGCGCCTCCTCGCCCGCGCGCGCGGAGAGC ATCACCACGGCGACGCCGCGGGTGCGATCGTCGGCGCGCAGCGCCCTGAGCAGGCCGAAGCCG TCGAGCCGCGGCATCATCACGTCGGTGAGCACGAGATCCGGCGGGTGGGCGCGGGCGCGCTCC AGGGCGGCCCGACCGTCGGCCACGGCCTCCACCGTCCACCCCTCCGCCACGAGCAGCCGCAGC GCGTACTCGCGCATGTCCGCGTTG7GGTCGGCGACGAGGACGCGCCCCGGCAGCCTCCCGGCC GGCCCCTCGCCCGCCGGCCGGGACGGCGGCGCCTGCTCGCCGCGGAGCCACTGCGCGGCCTCG TCGAGGAAGGGCGCGGCGTCCCGCCCCCCCGCGGCCGGCGCCGAGGCCGGCGCGAC
or its complementary strand, (b) DNA-sequences which hybridise under stringent conditions to regions of DNA-sequences according to (a) encoding proteins or to fragments of said DNA-sequences,
(c) DNA-sequences which hybridise to the DNA-sequences according to (a) and (b) because of a degeneration of the genetic code,
(d) allele variations and mutants resulting by substitution, insertion or deletion of nucleotides or inversion of nucleotide segments of DNA-sequences according to (a) to (c) , wherein the variations and mutants offer isofunctional expression products.
7. DNA sequence according to claims 1 to 5, wherein the DNA is selected from the group consisting of
(a) the following DNA Sequence:
Seq ID No 2 (>pEPOcos6 region) GGATCACCTGCGGCGCGATCGCCGACCTCGTGCTGGTGTTCGGCTCGCTGGATGAGAAGCCGG CGGCGCTACTGATAGAGACGGCGACGCCCGGGCTGCGGGTGGAGCGGTTGCGGGAGATGCTCG GCTTTCGGGCGGCCCACCTGGCGAAGCTGTCCTTCGACGGTTGCGAGGTCCCCGAGGCTCAGC TGATTGGCCGGCCCGGCTTTGCGCTGATGTATCTGGCCCCCTACGCCCTGGATTTCGGTCGGG TCAGCGTCGCCTGGGCCTGCCTGGGCATGATCCGCGCTTGCCTGGAGACCTGCGCACAGCACA TCCTCACCCGCCGCACCTTCGGCCACCTGCTAGCCGATCACGGCATGATCCAAACCCTGATCA
CCAACCTGGGGATTCACCACCAGGCGACGCTGCTCCACACGCTGCAGGCCTGCCGCGCCAGGG ATCGCGGCGACGTGACCGCCTCCGAGGCCACCCTCGCCGCCAAATACCTCGCGTCGCGGACGG CGGTCCAGGAGACGACCAACGCGGTCCAGATCATGGGCGCGCTGGGCTGCGACGAGGAGGGCG CGATCGCCCGCCACTTCCGCGACGCCAAGACGACCGAAATCATCGAAGGCAGCAACCAGATCA TCGAGGCGCTGCTGGCCAAGAACA CGCCCGCGCCGGTCGCGACAACTATCGCCGCTTCCTCG ATGCGGAAGTCGAGCCCGGTCGGGCCGGAGGCGCACCATGACGAGCGCGGTCCCGACGCGTCA AACCAGCCTGCTCGACGACTTCGAGCGCGTCGCCGACGTCGATCCAGAGCGGATCGCCGTCCA CGCGAGCGAGACGAGCCTGCGCTATGGCGACATGAATGCGCGCGCCAACCGCATTGCCCACGG GCTACGGGCGCGCGGGATCGGGCCCAATCAAATCGTGGCGGTGGCGATGGCCCGCACGCCCGA GCTGATGATCGTGCTGTACGGCATCCTCAAGGCCGGCGCGGCCTACATGCCCATCGCCCGCGA CGCGCCGCCGCTGCGCCGCGATCATATGCTGCGCGAGAGCCAGGCTGCTCTGATGATCGCCGA CGAAGAGATCGCGGGACTCGCGGCCCGGGTGCTGACGCCGGCCGACCCGTTCTTCGCGGCCAT GCCGGACCACAACCCCGAGCCGCGTCACGACCCGACCGACCTGATTTACGTCATCTACACCTC GGGCTCGACCGGCCAGCCCAAGGGCGTGGCCATGGAGCACCGCGCCGTGTGGAATCGCCTGAC TTGGATGCAGGCCCAGTATCCAATCGACACGCAGGACGTGATCCTCCAAAAGACGCCGATCGT CTTCGACGTGTCGGTCTGGGAGCTGTTCTGGTGGCCGCTGGCCGGCGCCTCGGTGGCCCTGCT GCCGCAATCCATGGAGAAGTTCCCCTGGGCGATATCGGCGACGGTGGCGCGGTGCGGGGTGAC GGTGATGCATTTCGTACCATCGATGCTGATGGCCTTCCTTCAGGTGGTGGCGGGCCGGCCCGA GATGGCGGACCAGATGAAGGGCCTGCGCTACGTCTTCTGCAGCGGCGAGGCCCTGGCGCCGGC CCACGTGTCAGCCTTTCAGGAGCACATCAACCGAGCGGGCAGCATCAGCTTGACCAACCTCTA TGGACCCACCGAGGCGGCGGTCGACGTCAGCTACTTCGACTGCCCGCCCGGCGCGTCACTCGC GCGGGTGCCGATCGGACGAGCGATCACCGGCATCCAGCTGCTGGTCATGCGCGACGGCGTGCC TCAGCCGCCCGGCGTCGAGGGTGAGCTCGCCATCGGCGGCGTTGGTTTGGCGCGCGGCTACAT CTCACGGCCAGACCTGACCGCCGACCGGTTCGTGCCGCATCCAGGCGGCGACGGCCAGCGGCT CTACCGCACCGGCGATCTGGTGCGCAGGGACGCGGACGGCGAGCTGGTCTTCCTGGGGCGCAT CGACCATCAGGTGAAAATTCGCGGTCTGCGCATCGAGCCCGGGGAAATCGAGGCCCAGATCAG CGCCCATCCCGATGTGGCCGACTGCGCGCTGATTATCGAGCAGGACTCGGAAACCCTGCCCAA GCTGACCGCCTACATTGTCGTGGCGCGACCGGGCTTGACCCGGAAGGCGCTGCTACAGTTCCT GGGCGCGCGGCTGCCCGACTACATGCTCCCGAACCGCTTCCTGACCCTCACGGAGCTGCCCGT GACCGCCAACGGTAAGCGCGACTGGCGCGCGCTGCTCGGCCCGCTCGAGACCCTGCCTCTCCC
TTTCTCCTGAATCCAACCAATACGAGGGATTCATGTTACACCCGATTCCCACCGACCGTTTCG CCCTGAGCCGACCGCTCTTTCGCGGGTACCTCGCGCACGATCCGATCGTGCAGGGCGTGCTGG CGGGCGACCATCCAGGCTGGGTCCTGGTGGACCGCGAGCCCGAGCCGCGCACGGCGCTGCTGT GGGCCTTTTCCGATCGGCTCTTCTGCGTGGGCGCAGCTGACACGCTGACCCCGCACGCGCTGG CCGAGCTGTTCCACGACCGACTGATCCCCCAGGCCCGTAAGATCGGGCAGCCGTTTTTCCAGG TTCAGGGCGAGACGGTCGACACCTGGTCGGACCACCTGCATCAGGTGTCGCCGCACGCGACAG TCTCCTTCCGCCAGGCATTCCGCTTCGACCGCGACCTCTTCGAGCGGCTGCCAACCAAGCCGG AGCTGGCAGAGGCGCGGCTCGTGCCAATCGACGCGCGGCTGCTGGCCGAACAGGCTGATCTGC GCGAGCGGATACTGGCCTCCTGGTCCAGCGAAGCTGCCTTCCATGCGCGCGGTTTCGGCTTCT GCTACCGCGTAGGTGACCAGCTGCCGAGCGTGTGCCTGGCATCGCACGTAGGCGGCGGCGCGG CCGAGCTGAGCATCAACACCGAGCTCGAAGCGCGCAATCGAGGTATGGCAACGCGGCTGTGCC GGCGTTTCATCGCCGAATCGCTGCAGCGCGGCCTGACGCCTTGCTGGGGCACCGAGACCTTTC GCCTGCCGTCAATCGCGCTGGCCCAGAAGCTCGGTTTCATCCCGACCTTCACCTTCCCCACCT ACTGCTTCGCGACCGGCACCGAACAGCCGGACGACAACTTCCTAGGCGAGCTGTACTACAGGG AATCGCGCATCGCCGGAAGTGGGACCGATGAGCCGCAAGCGGTTCGGCTGGCGCGGGGTTGGA GCCTGGCCGGCGACACCGAGCGTGCCGCGAGCTTCGCCGCACGCGCCCTGGCCGAAGGGTGGG CCGGCCACTCGACTCTGGCCACCGATCCGGATTTCGCCCGATTGCGCGCCAGCGCCGCCTGGC CCCGCCTCAATGTCCCTTGAAAGGTCACGTGGACTCATGATGTCCCCTTGAAAGGTCACACTC CGAGTCATGATGATTTGTCACTCCCACCGCTTCATTTTCCTCCACGTTCCCAAGGTCGCCGGC ACAAGCGTCAAGGACGTCCTCGGCCAAGAGCTATTCCAGGAGGACCAGGTCACGTTCCAGATC GCTCCCAATCCCCACTACCCACCTGAATGGACTGCGCCTTACGAGGAGCACATTATTGCCGCT GAATTGAAGAGCCAGTTGGCGCCGGAAATTTGGGACGATTACTTCAAGTTCGCCTTCGTGCGC CATCCGCTCGACTGGGCGGTCTCCAATTACTTCTTCTTCCTGCGCGACCGCAAAGGCCATCCG GCCCACGAATTCCTGGAGCGGAAGGGCTTCGCCGGTACCATGGACATGTTTTTCGGAGCGGCC GGGCGCCATCCGCTGGTCGCCGGCATGCGCTTCAGCCAATGGGAGTTCTTGTGCGACAGCGAG GGCCGGACGCTGGTGGACTTCGTTGGCAAGTACGAGCGGCTCGAGCAGGACTTCGCCGCCGTG TGTATCCGCATCGGGCTGACCCCGCCCGACTTGCCGTGCCTCAACCAGACTCGCCACCAATCC TTTACCAGTTACTACGACGAGGCTTTGATGCGCCAAGTCAGCCGCGCGTTAGCTCGCGATTTC GAAATTTTTGATTATGCCTGAGGCGGACCCGTTGCTTCGCCACCGGTGGATTATTCGATAAGT TATTATATTTTCAGTTGATCATGTGAATGTCGATCCAGCCAACGAGGAGGATACCTCCGCGTG
CGGCTATGGGGGCGCAGAGGTCACGACTACGTGTAGAAATTTGTCGAACACACCACTAGCTGC CACCGATTGGGAGCTTTGACTTGAAGATGAAAGTGGACAAGCGGAATGTCGACGACATTCTCG GACTCACTCCGACACAGACAGGCATCTTGTACCACTACCTGCTGGACCCGCAGGCCGACGCCT ATTTCGAACAATTGACGCTGCACCTGGAGGGGCCGCTCGACGTAGCGCGCTTCCGCCGCGCCT GGGAGCGCGTGGTGGCGGCTCACGACCAGCTGCGCGCCGTGTTTCGCTGGCAAGGGATCGAAC ACCCGGTGCAGATCATCCTCAAGCAGCACGTGCCGGACCTGGAGTTGGCGGAGGTCCCGCGCG ACGCCGATCCGGCAGCCTTCCTGGCGCAATGGGTCGCGGCCGACCGGGCGCGCAAGTTCGACT TCGAGACGGTGCCCTTTCGCATCGGGCTCTGCCGGACTGATACCCAACATCACGTGATGCTGC TCAGCAATCACCATATCCTGATGGAGGGTTGGAGTACGGGCCTGATTCTGCGGGACTTCCTCG CCTGCTACGGCGACTCCGAAAACTGGCGGCCACGCACCCGAACGCACTTCAAGGCGTTCATCA AGTGGCACCAGAACCGGCCACGCCGGGGCGAGGAGCGATTTTGGCGCGACCTGTTGCGCGATG CGCCCGACGGCGGCTTTCCCCGCCTGGGCGTCGAAGAAGGCACCCGCCACTCGCTTGACTTCG GCGCCCGCAGCCGCGCTCTCGACGACCGCTTGACCCAAGGCTTGCGCGACATGGCTCGCGACC TCGACGTCACCCTCGCCGCGATGC GCATACCGCTTGGGGCCTTCTACTCCAGCGCTACCAGA ACAGCTGCGAAGTGATATTCGGGACGACCGTTTCCGGCCGCAACGTCGAGCTCGCCGGCCTCG ACGAGGTGGTCGGCTTGTTCATCAACACGATTCCGTTCCGCTTCTCGGCCGCGGCCGCGACGA CGCCCGTCGAGGCCTTCCGTGCGGTACAGCGCAATCTGCTGGCGAGAAGCGAGTTCGAAGCCA CCCCGCTGGTGGACATCAAGGGCTGGAGTGGTCTCGGTCCGGGCGCGGAACTGTTCGACACCA TCCTGGTCATCGAGAACTATCCCT GGACCGCGCTATCTTCGAGAGTGATTCCAGCCTGCGGT TGACCGACCACCAAATCTTCGAGCGCACCAATTACGGGCTGACCCTGACCATCGAGACCTTCA GCCGGTTGCACGTGACGCTAGCCCATCGCCGTGACCTGCTGGGCGACGCGGCCGCTGAGCGAA TGCTAGATCATTTCACCGGCCTGCTCCAAGCCATGCTGCGCTTCCCTCACCAGCCGTTCGCGC GCCTCGAGATGAAAAGCGAACACGAGGCCCACCGCGTCCTGCACCAACTCAACCAAACGCGTC AGCCGCTGCCGTCCCAATCGGCTTTCCACCAGTTGTTCTTCGAGCAGGCCCAGGCCGATGGGG CACGACCGGCGCTGTGGTGCGGCGCCACGCGCTGGACCTACGGCCAGCTGCTGGAACGTGCCC TGCGTCTGGCGGGACGGCTGCAGGAAGCCGGCTTCGCCCGAGGCGATGTCGCCGCCGTCAGCC TCGGCCCGGTTCCGGATCTGATTCCCGGTTTGCTGGGCCCGCTGTTCGCCGGCGGCGCCTACC TGCCGCTCGATCCCACCCTGCCGGCGCAGCGCTCGCGGTTCATCCTCGACGATGCCGGTTGCC GCTTCCTGATCAGCGACGCGCCAC GGCGGGGCCCACGCCGATCCATCCGGACCCTGCCGGCG CCAGCCCCGTTGACGTCATTTTTGCCTGTCAGGACGGCGCCGCGCAGCCCGCCTACCTGATCT
ACACCTCGGGCTCCACCGGCCAGCCGAAAGGCGTCTGGGTTAGCCACCGCAACCTGATCAACT TCCTGACGGGCATGAGCGCAATCCTGCCGGTCGCGGCCGACGACGTGTTCCTCTCGCTGACTA CCGTGTCGTTCGACATTTTCGGGCTGGAGACGTGGTTCCCGCTCAGCCGCGGCTGCACGATCG TCTTGGGCACGCGCGCCGAGCAGT GGACCCGGCCGCGGCTGCCAAGGCCATCTCCTGCCATG GCGTCACGGTTTACCAGGCGACGCGATCGCGACTCCAACTTCAACTGGAGCACCCCACATTTG TCCGCGCCATCGGCTCCCTGACGACCCTGCTGGTAGGCGGCGAACCCCTCCCAGCCGAGCTGC TGCGGCGCGTACGCGAAGTGACCGATGCGCGTATCTTCAACCTCTACGGTCCCACCGAAACCA CCATCTGGTCCACAGCCGGGGAGGTCACCGCGGCGGACGTCCCGGATATCGGCCGCCCGATCG CAAATACCGGCGTTTTCCTTCTGGCGCGAGACGGCTCGATCCAGCCGCCGGGCCTGGTGGGCG AGTTGTGCATCGCCGGCGAGGGCGTGGCGTTGGGCTACCACCGACGGCCGGACCTGAACCGAG AACGGTTTCGCGAGATTCCGCCGGGCCGCCTGCCCTTTGCCGGCAAGCTCTACCACACCGGCG ACCTGGCCCGCTGGACCGAAGACGGACGGCTCCTCTGCCTGGGCCGTCTGGACGACCAGCTCA AAGTGCGCGGCCATCGCGTCGAGCCGGGCGAGATCGAGGCAGTGATGGCGCGCCACCCGGCGG TCACGCAGGCGGTGGTCGTCACGCGGCCGCGCAACGGCGAGCCGGTCTTGGTCGGGTTCTGGA CTGCGGAAGGTGAGCCGATGCCAGAGGAAGCGCTGAGCGCTTACCTGGCCGACCGACTGCCGA GCTACATGGTACCCGAACGGTGCAΓCCTCATGAAGGCCATGCCGCTAACCGGCAACGGCAAGA TCGACCGGCGCGCCCTACCCAATCCGTTCGCCTTGACCGAGTCGACCCGGCAGGCGGCGCCGC GCACCTTGGCCCGCACCGCCGGCGAGCATCGGGTTGCCGAGCTGTGGCAGGCCTTGTTGCGAC GCGAGGCGATCGGCTTGGACGAACCCTTTTTTCAGGCCGGCGGGAACTCATTCGGCTTGATTC GGCTTCACGCCAAGCTGGAATCCGCCTTCGGGAAGTCGTTCCCGATCACCGATTTGTTCCAGC ATACCAGTATTCGCAGCCAGGCAGAAATGCTGAGCGGCTCGTCCGTCGAGGCGCCGCTCGCGG GAGCCGTGCCGCAACCCCCGGCCGCCGCCGCCCAAGTTGCCTCCTCGGCAGCTAAATCCCCAG GGGAGCGCGGCGCGGCAGCGACGTCGAGCGGCCTGACCGCGCAACCGCCCCAACCCCACTTCC GGCCCATCGCCGTTATCGGCCTCGCCGGCCGATTCCCCGCCGCACCCGACCTCGACGCCTTCC TTGAACTGCTCACGGAGGGTCGCTGCGGCATTCGCTTCTTCAGCCAAGCCGAGCTGCGCGACG AGGGTCTCGACGCGAATCGAATCGGGTGTCATAACTATGTCCCGGCCAAAGGTTTCCTCGACC GGGCCGACCACTTTGATGCCGACTTCTTCGGCATCCCGCCGCGCGACGCAGAAATCACCGATC CGCAAATTCGGCTTCTGCTTGAGTGCTGCTGGAACGCGCTGGAGCATGCCGGCTACCCGCCCG GCGGCGGCGAGATCGGGCTCTTCGCCGGCTCCTCGGCCAACTATCACTGGCTCGAATACGTGG GCATTTCCGAGGAGAGCAGCAATCGATTCGCCGTCATGATTCAAAACGAAAAGGACTACCTGG
CCACGCGGATCGCCTACCAGCTCGATTTGAAGGGCATTGCCGTCACCGTGCAAACGGCCTGCT CGTCGTCGCTGACCGCGGTCGAGCTGGCCTGCGATGCGTTACACGCCGGCCGCGTGACCATGG CTTTGGCTGGTGGCGTTGGTCTGACGTATCCGTTGCGCGCCGGATACCTGCACGAGGATGGAA TGATCTTCTCCCCCGACGGTCGG GCCGGGCCTTCGACGCCCAGGCGGCCGGCACGGTCTGCG GCAACGGTCTGGGCATGGTGGTGCTGAAACAGCTCGACGCGGCGCTGGCCGACGGCGATGCCA TCCACGCTGTGATTAAGGGCATCGCGGCCAACAACGACGGCGCGGCCAAGATCGGCTACACGG CGCCCTCGCAGAACGGTCAGGCGCGGGTGATCCGCGCCGCCCATAGGCTCGCCCAAGTCGCGC CGGAGACCATCGGCTATGTAGAAGCCCACGGTTCGGGCACGCCGCTGGGCGATCCGATCGAGG TGGCGGGCCTGACCGAGGCCTTTGACAGCCCGCGTCGCGGCTTCTGCGCCTTGGGTTCGGTCA AGTCGAATGTGGGTCATTTGGATGCGGCAGCGGGCATCGCGGGTTTCATCAAGGCGGTGCTCT CGCTGTCCCATCGGACCCTGTTCGCCAGCCTCCACGTCGACACGCCCAACCCGCAGATCCCGT TCGCCGACGGTCCGTTCCAGGTCAACACGGAGACCCGGCCCTGGCCAGCTGCCGACCATCCCC GCCGCGCCGGCGTCAGCTCCTTCGGCATCGGCGGCACCAACGTGCACGCCGTCCTGGAAGAGG CGCCGCAGTTGGCCGAGCACGCGGGGCGGCGGCGCGAGCGGCAGCTGTTCCTGGTCTCGGCGC GGACTGCAGCCGATCTGGAGCGACGCACCGCGGCGCTGGTCCGCCACCTGGCCGCGCATCCGG ACCTCGCACCAGATGACGTTGCCTTTACCTTGCACGCGGGCCGCAAACCGATGACCCACCGTC GTTTCCTGGTCGCCGCCGACCTCGCGGAAGCCGCCGCGCGTCTGGCCGAGCCCGATCCAGTCA AATCCGCCGCGGCGCGCGCCGACCGCTGCCAGGTCTGGATGTTCGCCGGTCTCGGCTCTCAAT ACCCCGGCATGTGTGGCGGCCTCTATCGCACCGAGCCGGCCTTTCGCGAGCAAGTCGACCGCT GTTTCGACCTCCTCGCGCCGCGTTGCGATTTGAAGCCCTCGCTCTTCCCCGAGCCCGATCAGG CCATCGACGCATCAGCCCTCGCGGCCATCGACACCGCCCAGATCGCCGTCTTCGTCTGCGAAT ACGCGCTCGCACGGATGCTGGAAGGCTGGGGGCTGCGTCCGGATCGGCTGATCGGTTACAGTT TCGGCGAATACGTGGCCGCCTGCCTGGCCGGCGTCTTCTCCCTGCCCGACGCCTTGGCAATCG TCCGCGAGCGTGGCCGGATCCTGGCGGCGGCCGAGCCGGGCGCGATGGTCAGCGTGCCCCTTC CGGCCGAGCGCGTCGCGTCGCTGCTGGAGCCGCCGCTTGCCTTGGCCATTGACAACGGCCCCT CATGCGTGGTGTCCGGGCCGGTCGAACCGGTGCGCACCTTCACCGCTCGCATGAAGCGGGACC GGGTCTGGGTGACGCCGCTCCAGGCCGAGCGCCCGATGCATTCGCCGCTGATGGCCGAGGCCG GCGGCTCACTGCGCGCCATGTTGGCCGGGTTCCGCCTGAATGCGCCGCGAATCCCGATCTTAA GCAATGTTACAGGAACCTACCTAACCGACGAGCAGGCCCGAGACCCCGATTACTGGGCCCGTC ACCTGTGCGGCAACGTTCGCTTCGCCGACGGTGTGCGAACCTTGTTGGCCGAGCGCGATCCGG
TGTTCCTTGAATTCGGGCCGGGCCGCGATCTGAGCTCCTTGGTGCGCCACCAGATGCCGGAAG GCGCCGACGAGCCGATCGCACTGATCCGTCATCGCGAAGATCCGGTGCGCGACGAAGACCTCC TGCTCGATGGCTTGGGCCGCTGCTTCCTGCGTGGGGCGACCCTCCACGGGCAGGCCTTGTACG CCGGCCGAGGCTGCCGCCGCGTGCCGCTGCCCGGTTACCCGTTCCAGGGTCCACGCTGCATGC CGGCCCGCGCCGGACTGCCCGGCCTGGCGCGACCGACCGTGGGAGCGACCACCATCAGCTACC GACCAGCCTGGAAGCGGGCGCCGCGCTTGGCGGCTGTCGAATCGCTCGCGCCGCAATCCTGGT TGGTATTCAGCGACGGCAGCGAATTGGCGGGCGAGCTGGTGGCCGGCCTGCGCGCTTCCGGTT GCGCGACCACCCTCGTCGAAGGTGGGCTGGCGTTCGCGCGCTTCGCGGGCGGCTTCCGCGCGA ATCCCCGCGAGGAACAAGATCTCGCACAGCTGTTCGCGACCCTGTCGGCCGAAGCGATGCTGC CCACCCACATCCTGCACCTGCTCAGCCTGCCGTCGCCGGAGCGCGACTCGCCGCTGGCGCGCC TGGAGCACCTCACCGAGCTGGGCTTCCACCATCTGCTGGCCCTGGCCCGCCAACTGGAGGCGG TCGGCGCCCCCGAGGTCCGCCTCGCCGTGGTGACAACCGGCCTGGCGGCGATTGGCGGCGAGT CCGAGCTGCGGCCCGAGGTCGGGCTGTTGCGGGGACCTGTCCGCGTGATTCCCTTTGAATTCC CGAACTTGCGGCTGCGCCTGATCGACCTCGACTCGGCCGATCCCATCTGGCGTAGCGGTTGTG AGCCGTTGCTGCGCGAAATGGGCGCTGCCCCGGGACCTGAAGAAATCGCGCTGCGCGGCACCA GCCGTTGGGAGTTGGGCTACGAGCCGGTCGAGGGGGGCACCGTGAGCACCATCTCCTCGCGAC TGCGCGAGGGCGGCGTCTATCTGATCACCGGTGGCCTCGGCGGCCTGGGTCTGGCCTTGGCCC GTCACCTCGCCCGGAAGTACCGCGCCACCCTGATCCTCGCTGGCCGGCGAGGCGCGCCGGCGC GCGAGCTCTGGCACCAGGCGCCAGCGGAGTTCGTACCGGTCGCAGCTGCGATCGCACAGATGG AGGAGTGTGGCGCCCGCGTGATTCCCGTCGCGCTCGACGTCACCGACGCCGACCAAGTGAACG CGTTGTTCGCCACCATAGAAGCTACGGTCGGCAAGATTGAAGGCGTTTTCCACATGGCTGGCA TCGTTGACGGCGGCATCATTCGAACGCGCACGCGCGCTGCCAGCGACGCCGTGCTGGCGCCCA AAACGGTCGGAACCTGGATTCTCGATCGGGCTCTCCGCGGCGCCGGTGGCCGCTTCCTGGTGC TGTACTCCTCGATCAACGCGGTCGTCGCGCCCTTCGGCCAGGTTGCCTACGCCGCCGCCAACG CCTTCCTCGACGCCTTCGCCAGCGCCCACGAACACGACGAGCGTCTTTTCCGCGTCAGCATCG GTTGGGACACCTGGCGCGAGGCCGGCATGGCCGTCGATGCCGCCCGCGCCCGCGGCGACCAGG CCCCGCTCGAAGGGCTTAGCGACGAGCAGGGCTTGCGCCTGCTCGAAAGCGCCTTGGTCGGTT GCGAACCGCGACTCCTCGTCTCCATCAGCGAACTGCGCGCTCGACTAGCCGAGCATCATCGCA ACGGCGGCATTCCCCGGTTGCTCGGGCCCCGCGCCAACGAGGCGGGTGCAGCTGATTCCGGCG AGGAGGGCGCCACGCAAGACGCGTCGCCGGCCCGTCGCGCCCGTCCCGATCTGGTCGTGGCCT
TCGCGCCGGCCGGCAACGAGCTGGAGCGCCGGATCGTGGCCATCATCGGCGCCTACCTGCGGC TCGGTCAGGTGGGCGTCGACGACAACTTCAACGATTTGGGCGCCACCTCGCTCGACCTCATCC AGATCGCCCAACGCCTCGGTCGCGAGTTGGGCCGCGATGTCCCTGTCGTCTCGCTCTACCAAC ACCGCACCGTACGCGGGCTGAGCC3CTTCCTCGGCGGCGCGCTCCAATCCGCGCGGTCCGGCG TCCCGACGGGCGCTGCCGCACCGGGCGCCGCCACGCCGGGGGTTGCCACCCCGCCGCGGCCAC AACCGTCGCGCCAGCACCTGGAAAAACGCCGTCAATTGAGGAAAAAAGGGGGGCCTTCCCATC ATGAGTGAAGTATCCATTCGCCCCGGCTTGGACATCGCGGTCATCGGCATGGCCTGCCGCTTT CCCGGTGCCCGCAACCTCGCCGAGTATTGGGCCAACCTGATCGAAGGCCTCGAAACGCTCAGC TTCTTCAGCGAAGAGGAGCTGCGTGAGGCCGGCTGCGATCCGGTCCAACTGGCCCAGCACAAC TACGTGCGCACCAAGGGCCTGCTCCCTGACGCAGACCGTTTCGACGCCGATTTTTTTGGTTAT TCCCCGCGCGAAGCCCAGGTGATGGACCCCCAGATCCGCGTCTTCCACGAGGTCTGTTGGCAG GCGCTGGAGCACGCGGGCTACAACCGGCATCGCCACACCGGCACGATCGGCCTGTTCGCCGGC GCCGCGCCCAACGTTTTTTGGGAGTTTCTCTCCTATCGGTCCGATGCCGCCAATTTAGGCAAC TTCACGCTGGGCCTGCACAACAACAAGGACTACCTGAGCTCGCGCATCGCCTACAACTTCAAC CTGACAGGGCCCAGCTACACCCTGTTCACCGCCTGCTCGACCTCGATGGTCGCCATCCACCAG GCCGTCCAGGCGCTGCTCAACGGCGAATGCGACCTGTGCATGGCCGGCTCGGTCTCCATTACG CTGCCACTGGTTGCCGGCTACACCTACACGCCGGGCATGATCGTCTCGCCCGACGGCCATTGC CGCACCTTCGACGCAGGCGCCAATGGCACTGTCTACGGCGACGGGGCCGGCGTGGTCGTTCTC AAGCGGGCCGAGGATGCGTTGGCCGACGGCGACCACATATTTGCGCTCATCAAGGGCTCGGCG CTCAACAACGATGGCAGTCGCAAGACCGGCTACACCGCGCCCAGCGTGCAGGGGCAGGTGGAG GTGATCCGCGCGGCGATGAACCTGGCGGAGGTCGAGCCGGAGGCGATCAGCTACGTGGAAACC CACGGGACGGGCACCACGGTGGGCGATCCGCTGGAGTTCGAGGCGCTAAAGGAGGCCTTCGGA GGTGGCTGCAAGGCCTTCTGTGGATTGGGTTCGGTCAAGCCGAACATCGGCCATCTGGACGTG ACGTCGGGGATCGCGAGCTTCATCAAGCTGGTCCTGGCGCTGGAGCACCGCATCCTACCGCCC ACGCTCCACTTCCAACTGCCCAACCGGAAGATGGATGTGGTCGATAGCCCCTTCTACATCGTG GCTGAGCGCGAACCCTGGCGCGAA3ATCTGCTGCCGCGTCGGGCCGGTGTCAGCGCGTTCGGT CTGGGTGGCACCAACGTCCACATGATTTTGGAGGAGTTTCAGCGCGAACCGGCGGCGAACAGC GCGCGCACGCGCCACCTGACGGTGCTGACGGCGCGGTCGCCGCAAGCCCTGGCGCAGCTGGCG GCCAACCTCGCCGAACACCTGCGCGAACACCCCGAGTTGGCGCTGGCCGATGTGGCCCATACG CTGCTGCACGGCCGCAAGCCACATCGATTCGCGCGCATCCTGGTGGCGACCGATACGACGGCG
GCGATCGACGCCTTGATGAACGACCGCGATCCGCGAACGCGTTTCTTCGAAGCGACCGGGCGC GGCGAGTCGGTGATCCTGTGTTTTGACGAAACGCCGCCGGAGCCGCGAAGCGCCCGCTACCTC TGGGATCACGAGCCGCTTTATCGCGCGGCGGCGACGTCGTGCTTGGCTGGTGAGGTCGCCGAC CCGGATCTGGAAGGCTGCTTTACTGGCCTGATCGCCGAGCAGGGCGCGGCAGCCGCCTTTTGC CACCAATACGCGCTGGCCGGATGGCTGCTGGCCATGGGGTTGACCCCGTCGGCGTTGATCGGC GTGGGCCAGGGCGAGTGGGTAGCAGCGGCGCTCGCGGAGGTGTTCCCGCCATCGGCCTGCTTG CGCTGGATTAGGTTCGGCGAACGGCTCCCGCAGCCGCGCGATCAACGGATTCCGTTTCTCTCC AATTTCTCTGGAAACTGGATCGTTGGGCGTGAGTTGGCCGACCCGGATTACCCCAGAAAGCAG AAGGGTAAGCGCTGCATGAAGCGCCGTCGGTCCCAACCTCGGTCAGCTGGTGCAGGATGGGGG CGATGGAACCGGCTCGGTCAGCTCGTCGCGCGCTGCTCTTCCGCGGGAAGCGGAGGCGGGACG GTGATCGGCCCGAGGGCGAGGTTCATCTCGTCGTCGACGAGCCGGGCGCGGGTGCGCGCCCAG TACCTGGGGGCGAGCTCGAGGTAGCGGTCCCGCGGCCAGTAGGGCATCGCGCGAATGACGTCG GCCAGGTAGGCCTCCGGGTCGAGCCCGTGCAGCTTGCAGCTCGCCACGAGCGAGAAGAGGTTG GCCGCGGCGGAGGCGTGGTCGTCGCTGCCGAAGAAGAGCCAGGACTTTCTCGCAACCGCAATG GATCGCAGCGCTCGCTCGCTGGCGTTGTTCTCCAGGCGCAGCCGACCGTCGTCGAGGAAGCGC CGCAACGGCTGCTCTTGGTTGAGGGCGTAGCCGAGCGCGGTGGAGACCAGGCCGCGCTCGCGG GGACGAGCGTGCTCGGCCCTGGCCCAGGCAAAGAACGCGTCGACCAGAGGGCGGACGACGACA TCGCGACGCACCTTGCGCTGCGCGGGCGGCAGGTCCGCCAGCGCGCGATCGGCGGCAAAGAGG GCGTTGATGCGCCGCAGCCCCTCGACACCGAGCTCGTGCTTGCAGACCGCCGCCTCCCAGAAG TTGGTACGGCAATGCGACCAGCATCCGACTTCGGTCGGGGGCGGACCGCGCTTCTCGTCGGCA GCAGCGCCTCTTGGTGGTGTGCCGCGGAAGAGGGCGTCATAGATGGCGTGAGCGTCAGCTTGA ATATACCGAGAGAAGCCGCGGAACATCTCGCAGACCGCGGCGCTGGTATGCTTGGGCTGGTAC TCGAAGAAGACGTGATCCTTGTCCGCGAGGACGACGAAGAAGTGTCCCTTGCGGCACGGCCCG GGCTTCTTGTCCTTGCGCTCCTGGATGGGCCCAGGCTGGACGGAGACCCCGGTGGCGTCCGTG GACAGGCAGAAGGCGGTCTCGAAGGCCTCTTTGCGCGCGGCCTCGACGATGGCGCCCAGGGTC GCACCGACGTCTTCGGCGTAGCGGCACATCGTGCCGCGATCGAGCGACGCGCCCTGAAGCTCC AGCTGCTGCTCCAGTCGATAGAACGGGACGCCGAGCAGGTACTTGCTGGTGAGGATGTGCGCA ATCATCGACGGCGCGAGGAACGACCGCCGGAACAACTCCTTCGGAAGCGGCGTCGTGATGAAG ACCGTGCAGGTCTCGCCCTTCGGCGCCGGCGGCGGCGCGTCGAGCGAGGGCGCGTCGAGCGCT GTGGAGGAAGCGCTCGGCTCGCCGGCCGCTGCCGTGTCCTCCGGGCTGACGCTCGCAGCGGGC
GTCGGCGTCGGGGCTTCTCTCGCGACGACCTGGAGCGGGGCCGCTTCCTCCTCGCCCGAACTG CTCGCATCCGTGACGGACCGCTCGGCCTTGTACACGACGCGTGCGAGCACGATGCGGCGCATT CCGCCGCGCTCGTAGCCGAGTCGCGAGGTCTCCTCGACCCCGATGCGCGTCGCCGTCGCATCG AGCTCGGGGCAGGAGAGCTCGATGCGGACGACGGGCAGGTCGGACTCGGACAGGTCGCGACGG CCCTTGCCGCCGGACCTTCGTTTCGGCCCCTTGGGGTCGTCGTGCTGCCGCTCGTCGCCTGTA TTGCGCTCGGCGGCGTCGAGTGCCTTCGCGAGGCGCTGGACCTCGAGGAACATCGAGTCGAAC GCCAGCTGCTCCGCGCTCACCTCGGCGCGCTCCGCCTTGGCCACGAACAGTCGACGTCGCAGA AGCTGCAGCTGCTCGAGCGCACGGGTGTAGGCGCGCCGAAGCTGCGCGAGCGCATCGCGCGCT CCCACGAGCTCGCTCTTTGCCGCGGCGAGCTCCGCTTCGAGCTGCGCGATGCGCTGCTGCTCG GCCGAGAGCGTCGGCTTGGCGGCGGCGTCGTGCACGACGCCGCTCTACGTAAGCCGCGCGTAC TTGTCGAGCGAATTCGTGCGGCTCAGTGGACGCGGCGCGGTGCGCGCCTTCGCGGTTTGGACG TGGGCGCGATCTCGATGCCGTCGAGCAGCGTCTCGAGCGTGGCGTCGTCCACCTCGACGTGCG TGGCGCCCTCGGTCGGGGGGTCGGGAAGTGCGAACGCTCCGCGATCAAGGCGTTTTGAAAACA GGCAGATTCCACTGCCATCGAAGAAGAGAATCTTGATCGTGGTCCGCCGCTTGCCGACGAACG CGAACAGCGCTCCGCAGCGAGCCTCGTACCCCACACGCTCACGGATGAGACCCGAAAGCCGCT CGAAGCCGTAGCGCATGTCCACCGGCTCCAGCGCGACGAACACCTGCACGCCCGCCGGAATCA TCGCCCCGCTCCGCCGAGGGCACGGACCACCTCCGCCAGCAGCGCGGGGTCGAACCCCGCGGC GACGCGCACCCGCGCGCCGCCGACCTCGACGACGAGCTCCGCAGCGCTGCTCGTCACGGCGGG CGCCTTCGGCACCAGGCGCAGAAAGCGCGGTGGCTCGGCCCGCGACAGCCGGCTCGACCAGCC GTGCAGCGTCGAGGCCGCAAATCCGCGGCTCCGAGCGAACTCCTCCGCCGTTTCACCACTCTC GCGCCACGCCCGAACGCGCTCGGACCACATCACTTCGGTCGCCTTCGTCCTTGTCATGCACGC CATCATGAACTGGACAGCGCAGCCGGGGTGAGACGGCGCTTCGCGCAGCGCTTACGCAGAAGG CGCGCCGCGCGCCATTGTCGGATGCGGTGCGCGACTTCGCCGCCGATCGGCTGTTGCTGGAAC TGGGACAACCACTGGACGTAACGGCTGAAGCGAGCCAACGGCTCCAGCTCGCGCGGGGCGACC TGTTCGGCGCCTACCAAGCGTTGGCCCAGCTCTGGATCTGCGGCGCCCTGGCCGAACCGCCGC GACTGTATCCCGACGAACACCGCCGGCGCGTGCCGCTGCCGAGCTACCCCTTCGAGGGAAAGC GGTTCTGGATCGAGGGCTCGCCGTTCGAAACCGCGCCCGCCGCCGGCGCCTCACCCCAACCCG CCGATTCGGGGGACATTCTCAAGGGCGACCCGGCGGACTGGTACTATCGGCCGCGTTTCGAAG CGGCGCCGCTCTTGCCCAGCCCGTTCGAGAGCGAACCCGGCGATTGGCTGGTGTTCGAAGATG AGCTGGGGCTCGGCGCCTGGCTGAGCGAGACCTTGCGCGACAAGGGCGCGCGGGTCGCGACAG
TCGTTCGAGGCACCGAGTTCCGACGCCTGGCGTCACAGCGCTTCCAGCTTCGTCCCGATCGAC GGGACGATTACCGGACCCTGCTGCACGAGTTGAAGGCGCAGGGCATCGCGCCGGTCCACCTGT GCCACCTATGGAGCGTGACCGCCGCACCGGATGCCGAGCAGTTGCTCGACGTCAGCTTTCACA GCCTGGTCCATTTGGCGGCCGCTTTGGGTTCGGTTGGCTACTTCCACGCCATGAAGTTGAACG TGGTCGCCAACCGGCTATTCGACCCCGAGTCGCCCGAGCGCACCGAGCCCGCCAAGAGTCTGT TGCTCGCGGTGACCAAAGTCCTGCCGCAAGAGGTGCCCAACGTTCGAACCCGCGCCATCAGCG TGGACCTGGATCGCTCGTTCGACGCGGCGGCGCCCGCCTGGGCCGCCAGTTTGTTGGTTGAAT GCGGCGCGCCCGTCGAGGAAACGGTGGTGACCTACCATGGCGCAGCCCGATGGCTGCGCCGCT TCGATCGCGTTGCGGTGAATGGTC CGGCCCGTTCCACCCCGATCAACCTGCGCCGCTGCTGC GCGAGCGCGGCGTGTACCTGATCAGCGGCGGCCTGGGCGGCGTGGCTGGCCAGTTGGCGCGCT ACCTGGCGCGGGCCTGCCGGGCGCGGTTGGTGCTCACCGCGCGCCGGCCCCTGCCCGAGCGCG ACCAGTGGGATCGGGAGTCGGCCGTGCTGTCATGGGACGACAAGACGCGCCAGCGCATCGAGC TGGTGCGCGAGCTGGAGCGGCTGGGGGCCGAAGTATTGGTGGTGGCTGCCGATGTCGCCGACG AAGCGGCCATGGCGCAGGCGATCGAGGCCTCACTGGCGCGATTCGACGCTTTGGACGGCTTGA TCCACGGCGCCGGGATCGTGCGGGTGGCGTCGGGCCGCACGCCGATCGGGAGTATGACGCGGG CCATGTGCGAGGAGCAGCTCCGCCCCAAGATGTTGGGCCTCGACGTCGTCGACCGCCTCCTGC GCGATCGCCGGTTGGACTTCCGCATTGCCATCTCGTCGCTCGCCCCGATTCTCGGCGGCCTCG GCCACGTCGCCTACGCCGCCGCCAACCTCTACATGGACGCGTTCGCGACGCGCGCCGCCGCCG GCAACGCGCCTTGGATCGCGCTGAACCTGGCCGAGTGGGAATACGAGGGCCCGGCTACCTACG ACGAGCGGGTGGGCCGTTCGCTCAAGCAGCTCGAGCTCACCAACGAGGAGGGTATCCGCGTCT TCCAGACGGTGTTGGCCTTGGCCGCGCGCGGCCCGCTACAGCAGATCATTATTTCCACCGGCG ACCTCCAGGCCCGCCTCGACAAATGGATTCACATCAAATCCCTGCATCGCCGACCGGGGCCGG TCCAGCTCAGTCGCCGGACCGCGGCACCCCAGGGCGGTTTCGGCTCGGAGCGCGCCGCCTTCG AGGCCGCCTTCGCTGACGCCTGGTGCGACTTCTTCGGGGTTGAAGAGGTCGACCCGAACAAAA ACTTCTTCGATCTGGGCGCCAGCTCGCTCGACTTCATCCACCTCGTCAGTCGCTTCAGCAAGG CCATCGAACAGCATGTACCGCTCGAGGCCCTGCTCGAACACTCCACCCTGCACGACCTCGCCG CCCACCTCGCGGGCGACGCGAACACCGACGCCAGCGACGAAGCGCGCATTCGCCAACGGCTGC AAGGCGCCAAGTCCGGCGACATCGCGATCATCGGCATGGCCGGCCGCTTCCCGCTCGCGCCCG ACCTGGACACCTATTGGCGCAACCTGGTCGGAGGCATCGACGCGGTCAGCTTCTTCAGCGCCG AGGAGTTGCGTGCTGCTGGCGTCACCGCGGCCGAGATCCACCACACCAACTACGTGCCGGCCA
AGGGGCGCTGCGCCGACCAGGACTTGTTCGATGCGGCCTTCTTCGAATACACTGCCAGCGACG CCGAGCTGATGGACCCGCAAAATCGGGTGTTACACGAGGTCGTGTGGCACGCGCTGGAAGACG CCTGTTTCGACTTCAACGGCGATCAGGGCCAGGTCGGCCTGTTCGCGGGCGCCTCGCCGAACC TGTGGTGGCAGTTCGTGGCCAGCT7TTCCGAGGCCGCCAAGACGCAGGGCATGTTCACCACCA CCCTGCTCAACGACAAGGACTCGATGGCGACCCAGATTTCATACAAGCTCGGTCTAAAGGGCC CCGCGGTCACCTTGTTCACCGGCTGTTCCACCTCGCTGGTAGCCGTTGACGCCGCCTGCCGCT CGATCTGGTCCGGTCAATCGGACATGGCCGTGGCCGGCGCGGTCTCGCTGACTCTCCCCGATA AGGCCGGCTACATCTACGAAAAGGGGATGCTCTTCTCGGCCGACGGCCATTGCCGGGCTTTCG ACGCCAACGCCACCGGCATGGTCT7CGGCGACGGCGCCGGCGCGATCGTGCTCAAGCCGTTGG ACGCGGCCCTGCGCGACGGCGACCGGATCCATGCGGTGATCAAGGGCTGCGCCACCAACAACG ACGGCGACCGCAAAGCCGGCTACACGAGCGTCAGCGCCCAAGGCCAGGCCGAGGTGATCCGCT CGGCCCAGATCCTGGCCGACGTGGCGCCCGAATCCATCAGCTACGTGGAAGCCCACGGTACCG GCACCAAGTTGGGCGACTCGATCGA3ATCAAGGCGTTGAAGCAAGCCTTCGCCAGCGACAAGA ACGGATTTTGCGGCATCGGGTCGGTCAAGACCAACCTCGGTCACCTGATGGCGGCGGCGGGGA TGGCCGGCCTGATCAAGACGGTTCΓGGCGATGAAGCACCGCCAATTGCCGCCATCGCTGCACT GCGAGGAAGTGAACCCCGACCTGGAGTTGGAGCGCAGTCCGTTCTACATCAACACCCGCCTGC GCGACTGGGTTGCACCGGGCGGGCC3CTGCGGGCCGGCGTGAGTTCGTTCGGGATCGGCGGAA CCAACGCTCACGTCATCCTGGAGGAGCCGCCGACGCGCGAGAGCGGCACGCGCATGCGCCACT GGAAATTATTGATGCTGTCGGCGGGCAGCGAGGCGGCGCTCGACCGCCAGGCCGATAACCTGG CCGACTACCTGGAGCGCCATCCCGAGGCCCACCTCAGCGACGTGGCCTATTCCCTCCAGACCG GCCGGCGCGTTCTGGCCTGGCGGCGGACGGTCCTATGCGAGTACCGCGAGGACGCGGTGACCA GTCTGCGCGAGCGACAGGCCAAGCGCGTCCAGACAAGTCGCGTCCGCTGGGACCACAAGGACG TGGTCTTCATGTTTCCCGGTCAGGGGGCCCAGTACCTCAACATGGGCCGCGACTTATACGTCA TGGAGCCGGTCTTCCGCGAGGTCA7GGACCGCTGCTTCGAGTTGCTGGCCCCTTTGTGGTCCG AGCATCCGCGCCAGATCCTTTATCCGGAGGGCGGGGTGTCGACCCTGCTCCACCGGACTGATT ACACCCAGCCGATCGTGTTCTGCTΓGGAGTACGCCCTCGCCCATTTGCTGCTCTCCTGGGGAT TGAAGCCGGCCGCGACCATCGGCTACAGCTTCGGCGAGTACGTTTCTGCCTGCCTCGCCGGCG TCTTGTCCCTGGAAGATGCGATCCGTCTGGTGACCGAGCGCGGTCGGCTGATGGCGGCTTTGC CCGCGGGCGCCATGCTCAGCGTCCCGGTTCCCGAATGCGAGCTGCTGCGGCTGCTGGACGGCT TCCACGCCCAATCGGCGGCCCATC7GGCGCTGGCCGTCGACAATGGCGCCTCCTGCATTGTGG
CCGGGGAGCAGGCCGCCATCTCGGCGTTCGAATCGATGCTTCGCAAGAAGCGTCTGTTGACCA TGCG3GTCGCGGTCAGCCACGCCGGTCATTCGCAGGTCATGACCGGCGCGACCGACGCCCTGC GCAGGATCCTGCGGAAGATCCCCCTGTCCGCGCCGACAATTCCCTTCATTTCCTGCGTCACCG GCACCTGGATCACTGCACAGCAGC-CΓACGGATCGCGAGTATTGGGTGAACCACATGTGCGGGA CGG7GCGGTTCGCGGCGGGTCTGAG3GAGCTGGGTCAAAACCGCGAGGCGGTGTTCCTGGAAG TAGGTCCGGGCCGCGACTTGACGTTGCTGGCCCACCGCATCCTGGCCGACAGCGCGGCCGTGT TCGAGCTGGTCAAGGCGCCCGACGGCGGCGACGACGATGGGTTCCTCCTGCTGGATCGATTGG CCAAGCTCTGGAGGCTGGGGATTTCGATTGACTGGGCCGGCTTCTACGCGGATGAGCGGCGGC GGAAACTCTCGCTGCCGGGATATCCGTTCGAGCGGCGGCGCTTCTGGATCGAGGGCAACCCGC TGGAGATCGCCGCCGGCAGGCCCAATGTCCAGGGGCCGCTGGTCAAGGCGTCGGACATCGGCG CTTGGTTCTACGTGCCGCAATGGCGGCGGTCGGTGCTCGCCGAGCCGGGTACAACGGCGGCGG GCGCCGCCGTCACGGCGGAGCAGGCACGCGTCGTGACCGAGCTACGGGCGGGATGCGCGTCGG CCGGCTTGGGCAGCGGGGCCTGCGGACTGAATGGCGGTGCCCCGTCCGAGCGTCCGAAAGAAA GTGTAGCGCCAGCCGGGTCGACCAGCGCAGCGGCGCAGACCGGCGCGGACTGCCCGACACCGA CTGGGGAGCCAGCGGCTGTGCCAAAGGACGGGGCCGAGCCGCGGCCGACCTGGCTTATTTTCG CCGACGCCGGCGGATTGGCCGAATCTTTCGCCAAGCGGGTTCAGGCCCGCGGCGAGAAGCTTT ACCTGGTGGCTTCCGGCTCGCGCTTCGAGCGCCTGGCCGAGACCCGCTTCCGCCTCGATCCCG GGGCCAAGTCCGATCACCGCCTGCTTTTCAAGGCGCTCGACGAGGCCGACATCCTGCCGACCC ACCTCCTCGACTTCCGCTCGCTTGACTGCGGCGGGCCCGACGCCGACCCCATGGACCAGGCCG GCTTCTTCGGGCTGTTGCACCTGGTCCAGGCGATGGCAGAGGCCGGCTACAGCCATCCCATTC GGCTGCTGATCGTCAGTTGCGGCGTCTACGATGTCACCGGTGCCGAACCGCTGCAGCCGGCGC GGGCCACGATGATCGGACCGGCTCTGTGCATCCCGCAACAGTATCCGCACCTCGAAACGAGCC ATGTGGATTTGGGCGTGGTCCATGCCGACGAGCTCCACGCCGCGCGCCAGCTCGACAGCCTAC TTGCCGAATGCCTAAGTGCAACGGCCGAGCGCCAATTGGCGCTGCGCGGCCGACACCGCTGGC TGCTGGACTACGAGCCAGTCCGCTTGCCGCCGCTCGACCCGGGCCGTCTGCCCTGGCGCCAGC GCGGGGTCTACTTGATCACCGGCGGTTTGGGCGGGATCGGCCGCATCCTGGCCGAACACCTGG CCCGCACGACCTCGGCTCGCCTGGTCCTAATCGGCCGCGAAACCCTGCCCGACCGCGACGACT GGGACGCCTGGCTGAACCGCCCGCAACCGGTCGACGCCACCCACGAACGGCTGCTGCACAAGA TCCGCGCGATTCGCGATCTGGAAGCGCTAGGCGCCGAAGTCCTGGTCCTCGCCGCCGACGTCG CCAACGAAGCCGCCATGCGCGAGGCCTACGATCGCGCCGAATCCCACTTCGGCACAATCCACG
GGGTGATTCACGGCGCCGGCCTGATGGACGCGCAAAGCTTCTCACTGATCGACGCCCTCGACC ACGACCTCTGCGCCCGCCAGTTCGAAGCAAAAATCCGCGGCGTCTGCGTGCTCGACCGCGTTC TGGCCGACCGCACGCTCGACTTCTGTCTGCTGATGTCTTCCATCTCCACCGTGCTCGGCGGCC TGGGCTATTTCGGTTACGCCGCGGCCAACGCCTTCCTCGACGCCTTCGCCCAGGCGCGCAGCC GCGACGCCGCTTTCCCCTGGCTTAGCGTGGCCTGGAGCGATTGGAAGTACTGGACCGAGCGCA AGATGGACAACGAGGTCGGCGCCGTCATCGACAGCCTCTCGATGGAACCCGCCGAGGGCTTCG AAGCCGTCACCCGCGTCTTGGCTTGGGGCAAGGCGCCCCACATCGCCAACTCGCCCGGTGACC TCGGTCGCCGCCGGGATCAATGGGTCAAACTGGCCAGCCTGAAATCGGCGCACTCCAGCGAGC CCGAGCCGGCTAGGCATGGACGTCCGGCGCTCTCCAGCGAATGGGTCGCGCCGCGCAACGTGG TCGAAGAGAAGCTGGTCGCCATTTTCGAGCAGGTGTTCGGCACTGCGGCACTGGGCATCGAGG ACAACTTCTTTGAGTTGCGCGGCGACTCGCTCAAGGCGGTCATGACCGCGGCCCGTATTCAAA AGGAGCTGAACGTGGAAGTGCCGCTGCCGACCTTCTTCCAGATGCCCACGGTCGCTGGCCTGG CCCAGTTCGTGACGCAAGCCAAGCGCAGCGGCCGGGAGACGATTCGGCGCACCGCGCCGCGCC CACATTACCCGCTCTCGGCTGCCCAGGGCCGCCATTACCTGCACTACCGCATGGACCCGCGTT GTACCGCATACAACGATCCCTTCGGCAACCTGATCGAGGGTCCGCTGGACGTGGATCGCGTGG AGCGCATCCTGCACACCCTCATCC ACGCCACGACTGCTTCCGCACCTCGTTCCACTTCCGCG AGGGCGAGCCGGTCCAGGTGATTCACGATCGGGTGGACTTCAACCTGGCGCGGATTACCTGCG CGCCCGAGGATTTGCCCGAACGGATGCGCGATTTCATCCGCTCCTTCGATCTGGAGCGACCGC CCGCCATGCGCGCCGGCCTCTTCGTCACGGGGCCCGAGCGCCACGTGCTGCTAATCGATTTTC ACCACATTATCACCGATGGCGTGTCGTTCGAGAACTTCGTCGGCGAGTTCGCGGCGCTCTACC GCGGCGAGATCCTGCCCGAGCTGGAACTCGAGTACAAGGATTTCGCGGTGTGGCAGCATGAGA ACCGGGGCCGCCGCGCCAACAGCGACCAGGCCCGCTACTGGACCGAGCAGTTGGCCAATGCGC CCGGGCCGATCGAGCTAACCACCGATTTCCCCCGTCCCAGTCGACGCAGCTTCCGCGGCGACC GCGTGCGGACCGTGCTTGATGCGGAGCTCGTTGCTCGACTCAAAGAGCACGCGGCGCGCCTCG GCATCACCCTCTATAGCCTGCTGCTGGGCGGATTCTCGTTATTGCAGCACAAGCTCTCCGACT CGCACGACATCGTCATCGGTTCGCCCGTCGCGGGCCGCACCCGGAGCGAACTCCAGGATCTGC TGGGCGCGTTCGTCAACACCCTGCCGATGCGCCACCGCATCGACCCGACCCATACCGCACGGG TCTTCTTGGAGCAGGTCCACCAGACAACCTTGGCGGCCCTCAGCTACCAGGAGCACCCTTTTG ACGAAATGGTGGCGACGCTCGGGTTCGCCGCCGATCCGGCTCGCAACCCGATCTTCGACACGA TGTTCTTGCTGCAGAACATGGCCATGGGTGCAACCACCATTCCCGGTCTGCGGCTCTCGCCTC
ACGACACTTTTCACCGCAAGGCATTGTGCGACCTGATGCTACAGGCGACCGAGTATGACTGCC ACCTGGAGCTGGTGCTCGAGTTCGCCACCGACCTGTTCCGGCTGGAAACCGCGCAAGTCTTGC TCGACCGCTACCGCCAAGTCTTGGAGTGGCTGTTGGCGTACCCCCATGAATCGATAGACGATT TGACGCTCGCCGGCCACTTTCGCGAAGTCGAAGTGACGATGTCGGACGAGGGCGACTTTGATT TCTCAGATTTCGAACCCCGCAACGTGAGAAACCTATGGCGCGCCTGAGCCGCACAGATCTCCA ACTCGCCATTCACCAGCGCACCGTGGAGCGCGAATATTGGCGCGCTCTGTTCGAGCGCCATCC GCAACGGTCCAGTTTGCCGGGGGTGCTCACCGCCCCGATCGGCGACGAGTCGACCCGCGAGAC CTTGTCATTCGTCCTCGACGAAGATCCCCTTCGGCTGAGTAATCGTTCGCCGCAACGCCTGCT CACGGTGTTGGCGGCTGGCCTCGCGGCTTTCCTCCACCGCTGCGACGGCGCTGAGCGCTTCAC CCTGGGGTTGGCCCTACCGCGCCAAGCCGATGACCATCACCCGATCCTCAACAGCTTGATCGC GCTGGGGGTCGCGGTCGACTCGAG ACGACCTTCCGCGATCTGCTCTATGCGCTTCGATCCGA ATACCACGAGGCGATGCGCCACGCCAACTTTCCGCTGGCGACCTGGTGGCGCGGCCTACCCGG CGGAACGGCGCCGTTCGACGTCGCCCTCAGCCTGGACCCCTTCACAGACGGCGATTCGCTGGA AGACCACGCGATCGGCGCGTTGTTCCGGTTCGCATTGGAGGGTGAGCGCCTCACCTGCCGATT GCGATTCGACCCTGCGCGCTATGACCGTCCCGCGATCGAAAACCTCGCCGATCGTTTCGCCCG CTTCCTCACGCGCCTGTGCCGGGAC3CCTCCACCGTCATCCAGGCGCTGGACCTTTCGCTGCC AAGCGATGAATCGGTGTGGCGCGTCACTGAAGGCGTGCGGCGCGGCTATTCGCAAGACCTGAC GCTAGACCGCGCGTTCCGCCGCCAGGCCGCGCAAACGCCCGATCAGCCGGCGATCACGTTGAA CGGGGACGTCCAGAGCTACGCCGAGGTCGACCGCCGCAGCGACGCGCTGGCCCGCCACCTCCG TCGCCACGGCGTCGGTCCGGAAACGATTGTGGCCGTCAACGCCCGGCGCGGGCCTAATCAGCT GACGGCCCTGCTCGCGGTCCATAAGGCCGGCGGCGCCTACCTGCCGATCGATGCCGAGGAGCC GGCTGCCCGCCAGCAATTCAAGGTGCGCGACAGCGGGGCGCGGTTGGCACTGGAGCCGTCGCC GGACCAGGCGCTGACCGTCACCGACCTGCCGCGGCTCTTCCTGGACGATGCCTCGCTCTTCGC TGACGGCGGGCTCGATGTGCCGCGCGGCGCCGACTCGCTCAATCCGGCCTATGTGATGTACAC GTCCGGCTCGACCGGACAGCCCAAGGGTGTGGTGGTTCCCCACCGCGGCGTGGTCAATCGTTT GAATTGGGGGCAGTCCCGTTTCCCC-CTGGACGAACGCGACCGAATCCTCCAAAAGACGCCGCT GCTGTTCGACGTGTCGGTCTACGAGCTGTTCTGGGGCGCATGGAGCGGGGCCACCCTGGACAT CCTCGAGCCCGGCGCCGAGCGCGACCCCGACGCAGTGGCCAGGGCCCTGGCCGAGCGCGCCAT TACCGTATGCCATTTCGTGCCTTCGATGCTGCTCGTCTACTTGGAAGTCATGCGGCGGCACCA TGCGCCGCCCGTGCCGGACCGGCTCGGTTACGTCTTCGTCAGTGGCGAGGCCCTCGAACCGGA
CCACCTCGCCGGGCTCCAGCAGATTGGTCGGCGCCTCGGCCGCACGATTCCCCTCGTTAATCT GTATGGACCAACCGAGGCCTCGATCGAAGTCTCCTGCTTCGCCTGTCCCGCCGACCATGTGCC GCGCCGGATCCCCATCGGGCAGCC3ATCGACAACGTCGCACTGCACGTTCTCGACCGGCGCGG CCGTCGCCAGCCGCCCTATCTTCC GGCGAGCTGTTCCTGGCCGGCGACTGCCTGGCGCGCGG CTACCTCAACCGTCCCGACCTGACCGCGCTCCACTTCGTGCCCAATCCCTTCGGCAACGGCGA GCGCATGTACCACAGCGGCGACTTGGCGCTCGTGCGCGGCGACGGCCAAGTGGCGTTTCTCGG CCGCCGTGACCACCAAATCAAAATCCGTGGTCAACGGGTCGAACTGGGCGAAATCGAGAGTCA TTTGCGCGGGCTCGAAGGCATCGCCGCCGCCGTCGTCCAGGCCGAGTCGCAGCACCATGAAAC CCTGCTGCACGCCTACGTCGTCACCAACGACGCGGGCCTCAATGCGGCCCGGCTGCGCGCCGC CCTCGCTCAACATCTGCCCGAGTACATGATTCCCCAGCGCTTCTCGCGGCTGGCCGAGTTGCC GCTGCTGGCGGCAGGCAAGATCGAGCGCGCCGCCCTCGCGCAACGTGCAACGCCGCTCGCCAG CGGCGCGCCCTTCGTGGAACCCAGCGGGCCCACCCAGCAGCGTATCGCAGAACTGTGGCGCCA GGTCTTAGCGGTCGCCGAAGTCGGCGCCGAGGATCCCTTCTTCAGCATCGGCGGCAACTCGCT CAATGTGCTCAAGCTCAGCGCCGCGCTGAGCGACGCCTTCGCGCGTGACATTCCCATGCCGGC CCTGTTCCAATACGACACCATCGCCGCCCAGGCCTCCTGGCTCGACGGGCAGGTTGACGAACG GGCCCAATCCGCCGCGCTCGACCGGCAGGCCGCCGAGGCGGCGCTGACCCTTCAAGAGACCGT GGCCATTTTTGAGGGATTCGATGACGAACCATGACCATCACGAGGAGAGCAGCGGCCTGGAGA TCGCCGTCATCAGCATGGCCTGCCGATTCCCGGGTGCTGCCGATTGCGACGCATTCTGGGAAA ACCTGATCAACGGGACCTCCTCGATCACCCATTTCAGCGACGACGAGCTGATCGCGGCCGGCG TTGACGCGCGCGACCTGACGCCGCAGTACGTGCGCGCGGCCGGCCAGATCGATGACGCCGAAC GGTTCGACGCGGCCTTCTTTGGGTACTCCCAGCGTGAGGCCGAGCTGATGGACCCCCAGTTCC GCCTGCTCCATGAATGCGCCTGGTCCTGTCTGGAACAGGCCGGCATCGATCCGCGCGTCGAAG CCGCGCCGATCGGGCTGTATGCCGGGGCAGCCGACAACACCTACTGGAACGCGCTCTCGTCGC TCGACCGGGGCTCGGCCGAATCGGAGCAATTCGCCGCCGAACAACTTTGCAACCGCGATTTTC TGTGCACGCTGGTCGCCGCCGCGCTCAACCTGAAAGGCCCCGCGGTGGTGGTTCAAAGCGCCT GTTCGACCTCGCTGTTGGCGGTCCACTCGGCCTGTCGTGCGCTCCTGACCGGCGAATGCCGAG TGGCCTTGGCCGGTGGGGTGGCGCTGCGCTTCCCACGCCCGAGCGGTTATCGCTACGAACCTG GCATGATCTTCTCGCCCGACGGGGTGTGCCGGCCGTTCGACGCGGGCGCTAACGGGACGGTGC CCGGCGAAGGCGCGGGGCTGGTAGCGTTGAAGACGCTGAAACGTGCCCTCCAGGACGGCGACA CGATCCACGCCGTGATTCGCGCGACCGCGGCAAACAACGATGGTGCCCGCAAGACCGGGTTCA
CCGCGCCCAGCGCCCACGGCCAAGCGGAAGTCATTCGCACGGCGCTGCGCCTGGCCCGGGTGC CGGCCGAATCGATCGACTACGTCGAGGCCCACGGAACCGGCACGCCGCTAGGCGACCCGATCG AGGTAGCCGGCTTGGTGGAGGCCT7CGCCAGCGAGAAGCGCGGCTATTGCCGGCTGGGCTCGG TCAAATCCAACCTTGGTCATCTGGAGACTGCTGCCGGCATCGCCGGCCTGATCAAGACCGTGC TGG2GCTCGAGCACGCGCACATCCCCAAGTCCTGCCACGTCGCCACGCCCAACCCCGCGGCGC GCCTACACAAGACGCCTTTCCGCATTGCCGCCGACGGGATGGCCTGGCCGCGGCGTATGGCGA CGCCGCGGCGGGCGGCGGTGAGTTCGTTCGGCATCGGCGGCACCAACGTCCACGCGATTTTGG AGGAGGCGCCGCCCCGCGCGCCCGAGCTGGCGGACGGGCGCAGTCAGGTGTTCGTCTTCTCCG CCAAGGACGAGGCGGCGCTGGACCGTGCCCTTGCCAACTATGGTGCGGCCTTGGAGAAGCGCG GCGACCTCGCGGCGGGCGCGGTGGCCTGGACGCTCCAAAACGGCCGGGCCGCATTCGAATGGC GAGCCAGCGCGGTGGCATCCGACCTCGACGAATTGGCGGGCGCATTGCGCGGCGAGCGGCCCG GCGCCGTCAAGAAAAACCGAATGGCGCGCGAGGATAAGCCGGTGGCGTTCTTATGTTCGGGGC AGGGGAGCCAGTACCGTGGCATGGGCCACGACCTGTACCGCGAAGAGCCGCGTTTCCGGCACC ACCTCGACGCCTGCCTCGCCATCCTCGCCGAACACAAGCCCGAGATCGACTGGCTGGCGTTGC TGGGCTACCGCGACGAGGACGAGCCAACCGACCAGATCGGGACGTCCTCGCAGGGCCCGAGCC GGTCAGCCGCATCGAACCCAGCGGAGCTCCTCGACAGCACCGAATTCGCCCAACCTTTGCTTT TCTCCATGTCCTACGCGCTCGGTCGGCTGTGGCTCGACTGGGGCGTGCGACCCACGGCGATGA TCGGGCACAGCCTGGGCGAGTACAGTGCTGCATGTATTGCAGATTTCTATGCACTCGATCAGG TGCTGCCCTTCATTCTGACCCGCGGTCGAGTCATGGCGCAATTGCGGCGCGGCTCGATGTTGG CCGTCAGCGGTGACAGCGTTCTGATGCGCGAGCTGATCGCCGATGCGCTCGATTTGGCGGCGA TCAACGGCGCTGACCAATTTGTCTGGAGCGGGCCGAGCGAGGCTGTCCAAGCCGCGGGGGTCC GACTGCGCGGCGCCGGCCTGCGTGCCACCGAGCTGAACACCTCACACGCGTTCCATTCAGCCA TGATGGATCCCATTCTGGAGGAGCTAACGGTTGCCGGTTCGCGACTTCAGGTCGGTGTCGGGA CGATTCCGGTCGTTTCATGCGTTACCGGAACCTGGTTGACGGCGAAGCAGCTGGCCGATCCGC GCTACCACGCGCGTCACGCGCGCGAACCGGTGCGGTTCGCGGCGGGCCTAGCGACGCTGACAG GGGAGGAGCCGCCGCTGATGCTCGAAGTGGGGCCGGGCTCGACCCTGGCGGCTTTGGCCCGCG AGCATTCGAATGCCCGCCTCCCGGTCGTCACCAGCCTGCGCCACGCTCGCCAGGCGACGCCCG ATCGCCAATACCTGCTCGAAACGCTCGGCTGCCTTTGGCGACACGGGGTTTCCGTCGATTGGG GGGCCCATGCCGGACGTTCGCGACGCTTGGTTTCGCTGCCCGGCTATCCCTTTTCCGGCGCGG TGCGCCGCTTAGCCGGCGACCCCCTCCGCCTGCTGGCCGGAGCCCGCGCCGTCGCCGCCCCGT
CGGGAACGCGCCAACTCAGCGCCGACGCGCGCGACCTCCCGAACACTCCGGAGCCGACATCCG GCGCCGTGTCGGCGATCAAAGCGCCAATCGCCGCCGCCGATCCCGGCCTCTATCGCCTCTCCT GGCGCCAGGCCGGAACGGCGCCGCTCGGTCCGCCCGATCTCGGTCCGCCCCGCGACTGGATCG TCTTCGCCTCTGATTCTCACCTGCTCCAGGCGCTCAGGGCCAATCTCGGGACGCGCGCTCAGC GGGTGACGCTGGTGACGCCGGGCCAGGAGTACGCAGCCGAGCCGTCCGGGTTTCGGCTGCGGC CGGACCAGATCGACGATTACCGCGCCCTGTGGGCGGACTTGGCGCAAACCGGTATTGTGCCAC GATACATCGCGTTCCTCGCCCCGTTCATGTACCGGGCGCGCATGGCGGGCGATGCCTCGACCC TGGACGAAGTGCGCGAGGGCGGCTTCCTGCCCCTGACCCGCTTGATCCAGACTCGCCCGCCAG GCGGACCGAGCGGACTTCTAAGCCTCACGATCGTCACCCCGGCCGCCCTGGCGCTGGGCGACG AAGCGACGCGCCCGGAATGGGCAATCCTGCACGGGATGGTCGCCGGCTTAAGCCGCGATTATC CCGAATGGCGCTTCGTCTCGATCGACGGCGGCGACCCATCCCCGCATCGGTGCGAAGGTCTGG CCCGCTTGATCGCGCTTCATGCGGTCGACGAGGCTGGCCCGACCCGCTTGGCGCTGCGCGGCC TTCACGCTTGGGTTCCACAGTGCGAGCACGTTCAGCCGGCCACCATCCCTGGGGCGGGTATGT GGCGCGAGGGTGGTGTGTACATGATAACGGGCGGATTCGGCGGGATCGGTCTGGCGCTGGCCC GCGCCCTGGCTCGAGAAGCTCGCGCCAAGCTGATCCTGGTCGGCCGAAACCTGCCCACCGCGC CGATCGATCTCGAGGCTTGGGACGCGCCGCCGTTGATTCTCACCGCCGACGTCGCCGACGAAG AGGCCATGCGCCGCGTCTTCGATGCCGCGCACGCCCGGTTCGGCGCCATCGACGGCATTCTTC ACGCGGCCGGTGTCCCCGGTGGCAGCCTGTTCGCCAACCAATCGGACGCGGCCTTCGAAGACG TGCTGCACGCCAAGGTTCGCGGTACCCTCGTGCTGCAAGGCCTGAGGGCAATCGATGCGCCGC TGTTGCTGATGTCCTCGCTGGACGCCTGGCTTCCCGGTCCCGGTCAGACCGCCTATGCCGCCG CCAACGCCTTCCTCGACGCCTTCGCCAGTCTGCGCCGGCGAGAGGGAGAGCCGGTGTACAGCG TTGGCTGGGACAGTTGGTGCGAGGTGGGCATGGCTGCTCGGGTCGCTGCCCGATCGGCCGACG AACGCGGCCGCCTGGCGCGCGAGGGGATCAGCCCTCGCCAGGGTTGGCAGGCTTTGAGCCGGG CGCTCGCCCTCGACCCCCCCCACCTGATGATCTCGCGCACCGACCTGACCTCGCGCTGGCACA GTCGATCCAGCCCTACGCCGGTCGGCTCGAGCGAACCCGAGGTGGCGCTGCCGCGCTGGACCG CATCCGCCTGCCAAGCCGTCATCGAGCGTGTTTGGTGCGAGCACTTCGCCACCGCCGCCGTGC CTCCCGATGGCAACTTTTTCGAGCTCGGCGCCAGTTCCTTCGACATCGTCCAGCTCAGCGCTC GACTTCAACAACAGTTCGGCCGAGATGTCAGCCACACCGTGCTCTACAGTCATCCCACCGTCG CCTTGCTGGCCGGCTACTTCGCCAATGACCCGACGCCGTCCGGTGCTGCTGCCGACGAACGCG ACGAAGCGGTGCGTCGCGGCCGCGACCTCTTGAAGAGCCGCCGGCGAGGAGTATGACCGTGGA
GCACGAAACCGGATTCGAAATCGCGGTCATCGGGCTGGCTTGCCGCGTTCCCGGCGCTGCCGA CGTGGCCGCCTTCTGGCGCAACCTGGTCGAGGCCAAGGAGAGCGTGCGCTTCTTCGAGGACCA CGAGCTGCGGGCCGCCGGCGTGCCCGAGGAGATCTTGCGCCTGCCCAACTACGTGAAGGCCAA GCCACTGCTCGCTGATGGCGAAGCTTTCGACGCGGACTTCTTCGGGTTCCATCCGCGCGAGGC CGCCTACCTGGACCCGCAAGTTCGGCTCCTGCACGAATGTTGTTGGACCGCGCTGGAGGATGC CGGCTACGATCCCGCGCAGTACGCC7ACCCGATCGGGTTGTTCGCGGGCGTCTCCAGCAATCT CTCGTTCCTGTTCGACCGCATCGATCCGCGCGACTCCCCCCTGCAGAAGCGCTATGTGGCCGA GCTGAACGCGGCCTCCTTCGCCACCCAGATCGCCTACCGGCTCGATCTGAAGGGGCCGGCCAT TTCGATTCAAACCGCCTGTTCGACGTCACTGGTGGCGATTCACCTGGCGGCGCAAAGCCTGAT CGGCGGCGAGTGCCACATGGCCTTGGCCGGCGGAGCGACCTTGGAGGTCCCCAAAAAGCCCGG CTATCTCTACCGCGAAGGCTACATCAACTCGCCGGACGGCCACTGCCGGGCCTTCGACGCCGA CGCGGCCGGCACCATCTTCGGCGACGGCGTCGGCATCGTCCTGCTCAAACGCTACCGCGACGC CCTACGCGACGGCGATCACGTGTACGCAGTGATCAAAGGCTCGGCGATCAACAGTGACGGCCA TCGCAAGGTGTCCTACACGGCGCCG3GCAAGAGCGGTCAAGTGGCGGTGATCCGCGCTGCGCT GGCGGCGGCCCAGGTAGAGCCGCAAAGCATTCGCTTCGTCGAGGCCCACGGGACCGGCACACT CGCCGGCGATCCGATCGAGGTAGAGGCGTTGACGGAGGTCTTTGCCGAAGCGGGTCGCGGTAC CTGCGCCCTGGGTTCGGTGAAGACCAACATCGGCCACTTGGATGTGGCGGCGGGCGTGGCCGG TTTCATCAAGGCGGTCTTGGCGCTCGAGCGGCGCGTCCTCCCGCCCAGCCTTCACTTCGTCCG GCCCAACCCGGCCATCGATTTCAACGGGCCCTTCTACGTTTGTCGCCAAATCGAGCGGTTGAC GGAGAACGGGCGGTTGCGGGCCGGGGTGAGTTCCTTTGGCATTGGCGGCACCAATGCCCACGT GATTCTGGAGGAAGCGCCGGCGCCGGAGGCGAGACTGCCGGCCGGGAGCCCGCCAGGCGCGAG TCCGTTCCTGTTCCCGCTATCGGCCAAGACGCCGGATGCGCTGGCAGGCCGTTGCCACGACCT TGCCGACCACCTGCGGGCGCACCCCGAGCTCCTCCTGGCCGATGTGGCCCTCACTCTGCAGAT GGGGCGGGCGTCGTTCGCCTACCGCCATGTGGTCCAGGCTGCGACGGCGGAGGAGCTGATTCG CGGTCTGGGAGCGTTCCGACAGGAGTCCATCCGCAAGAGGCGGAATCGAGTACAATGGGTGTT GGCAGGCGAGGCGATGTCGCTTGACGCCGGTTTGCGGCTGTACGCCGATTGGCCGGTCTATCG GGAGCGGGTCGACGTCTGTCTGGCGATCGTCGCCAAGCTGCGCCAAATCGACGGCCGGTCATT CCTACATGAGTGGATCGAGCGACCGCGCGAGGTTCCTGCCGAATGGTCGACGGCGCTGGCGTT CATGTTCCACTGCGCGCTGGCGCAAGCCCTGAGCCAGGCCGGCCTGCACCCGCAGCGCATGTG GAGCCGTGGGCTGGGCGGACAGGTCGGCGTGGTTTTGGCCGAATCCCTGTCGTTGGAACAAGC
GCTGGCGCTGGTGTTGTGCCAGACACCGGTTCCCGGCGATGCCACACCTCAGCGCGAACGCTT GGTTCGGACACTGGAAGGCTGCCGGTTTCGTCCACCACGATTTTTGATTTCGGCAGACAGCTC GGGTCGACCCCTGGACCTCGCCGAATTCGCTCATGTCGATTTTTGGTGCGGTGGCCAAAGCGC CTCGCCCAATGAGGCGGAGCTGCGCTCATGGAGCGACGCCGCGCCCGAGCTGGTGACCTTGGC GATCGGCCCATCCTTTCTCGAGGCCGCCTCCGGGACGGTGGGTCTGGCGATCGACCCCAAGCG ACCGATGACCTGTGTTCAGCGCACGGTGGCCGCGTTGTGGGAATGGGGATGTGACGTGCGCTG GGCTGCGTTCACCTCGTCGACCGGGCGTCGGGTTCCCCTGCCTACCTATCCCTTCGTGCGGGT AATTCCCACGATCGGCGACCCCCTTCGCGGAGCAGGCGCGGAGGATGACTTGATTGCGGCGAG CGCTTCCGCGTCGGCCGGATCGCCGCCCGAGCCGTCGGCAAACTCGGCAGCGGAACGCCCACG CGCCCAGTCAAGCATCGCCTCGGCAACCACACCGGCTCCGTCTCATACGTCGGCCAGCGTGGC CGTGGCCACCATTCTCGAAACCGTCCGTGCCTATTTCGGGTTCGCCGCCGTGCGTTCCACCGA CGCCTTCTTCGAATTGGGCGCGTCCTCGCTGGATTTGGTCAACCTGGGCCAGCTCCTTTCCGA TCGTCTCGGCCGCGAGGTTCCGACCCTGCTCCTCTACGACCACCCAACACCGGACCAGTTGGC GCTGGCCCTGACATCCGCGGCGCTCAGCGCAGAGGCGCCGCCCTTAAGGGGCGGTCATCGCGC ATCGACTTCCGGCACAGCCGCGAGCTCGGCCGCCTCCACCGCACCGACGTTCCCGGGGGACGC TCACTCGCAGCCCAGCTTCGTTCGCGAGCAGGACATCGCCATCATCGGGATGGCCTTCCGGGG ACCGGGCGCCGACGACCTGGACGCGTTCTGGAACAACCTGGTCGAAGGGGTCGAGTCGATCAC CTTCTTCAGCGAGGACGAGCTGCTGGCGGCGGGCGTCCCCCGCGAACATCTGGCCTCGACGCG CTACGTGCGGGCCAAGGGGGAACTGACTGGGATGATGGATTTCGAACCGGAATTTTTCGGTTA TTCGGCGCGCGAGGCGGCGGTCATGGACCCGCAGTTCCGCGTGTTCCACGAATGCTCCTGGCA CGCACTGGAGCACGGCGGCTACGATCCGACCCGATGCGCGGCATCGATTGGCGTCTACGCCGG CGTGACCAACCACCTGCCTTGGCTGATGCGAACTTTGCCGCACCTGACCGAGGAGGAGCAATT CGGCGCGCTGCTCCTCACCGACCGCGAGTTTTTCGCACCGCTGCTCTCCTACAAGGTCGGCCT GCGCGGACCCGCTATTTCGCTGCAAACCGCCTGTTCGACGTCGTTGGTGGCGATCGGCACGGC CTGTCGCGAATTGCGCGCGGGTGCCTGTCAGATGGCCCTAGCGGGCGGCGTGACGGCCAGCAT CGAGCGCTGCGGCTACTTCCACCAAGAAGGCTACATCCTCTCGCCTGACGGCCACACGCGCAG CTTCGACGCGGCGGCCGCCGGCACGGTCTTCGGCGACGGAGTCGGCATGGTGCTGCTGAAGCC GCTGGCCCAAGCCTTGGCCGACGGCGACACGATCCACGCGGTGATCAAGGGAATCGGCATCAA CAACGACGGCGCGCGCAAGGTCGGCTTCACCGCACCTAGCCGGGCCGGTCAGACCGAGGCGAT TCGGGCCGCGCTGCGCGACGCCGGGGTGGCGTCGAACCGCGTCAGCTACGTGGAGGCGCATGG
AACCGCGACCAGAATGGGCGACCCGATCGAGGTCGAGGCCTTGACCCAAGCCTTTCGCGCCGA AGCCGACGGTCCGCTTCCGCCCGGCTCCTGCCTACTCGGCTCGGTGAAGTCCAACGTGGGCCA CCTGAACGCCGCGGCCGGCGTGGCTGGTCTGGTAAAAACCGTGCTGGCGCTCCAACACCGCCG CCTGCCGACCAGCCTGTTCTACCAGTCGCCCAATCCACACATCGACTTTGCGGCGAGTCCGTT CCGC3TGAACGGCCAGACTTCGGATTGGGTCGCGCCAGAGGGGACGCGGTTGCTGGCGGGAGT GAGTTCGTTCGGTATCGGGGGAACCAACGCCCACCTGATCGTCGAGGAGGCGCCGAAAGCGCT ACCGACGACAGCGGCACCTCTGTCGACGGAGCCGAATGACCTCGACGCGGGCGACGCCGACGG GCTAGTGCTGCCGATCTCGGCCCGCACGCCGACCGCCCTGGCGCACATCGCGACCAACCTCGC CAATCACCTGGAACGACATCCGACCATCGCCCTGGCCGACGTCGCCCTGACCCTTCAGCTGGG CCGTCGCCAATGGCCCCATCGCCACAGCCTGATCTGCCGGAATCGAACGGAGGCGATCAAGCT GCTGCGCGCCGTCGTCCACTCCGCGGAGGTGCCGCCAGCTCAGGCGCCGGTCTCGGATGCGCC GCGCTGTGTTTTTCTTTTTCCCGGCCAGGGCGCCCAATACCCGAGCATGGCCCGCGACCTGGT TCGAAACTGTCCCGACTTCGCCCTGCACCTGGACCCCTGCCTCGACCAGTTGGCCGAACTGCT TCCCGAAGATCCGCGTTGCATCCTGTTCGGCGATGGCCCCGCCGATCGGCTCGACCAGACGGC CTACACTCAGCCGCTGCTCTTCTCCGTGTCCTACGCCTTGGCGCGCTGGTTGGGCGATTTCGG CATTCGCCCCGATGCGATGATCGGCCACAGCCTGGGCGAATACGTGGCGGCCTGCTTGGCCGG GCTTTTCTCGCTGAGCGATGCCCTGCTGCTGGTGAGTGAACGCGGCCGCCTGATGGGCTCGGC CGCGCGCGGAGCGATGCTGGCCGTCCCCTTGCCCGAATGGGAACTGGAGGAACGCCTGGAGCT TCTGGCCGACGACCGAATCAGCATCGCGGCGGTCAACACCGCCGAGAGCTGCGTCATCGCGGG ACCCAGCGAGGCGATCGAGCGCTGCGCCCAGCGCTGGGCCGCGCAAGGCCTGACCTGTACGCC GCTGCGCACGTCCCACGCCTTCCACTCCGCGATGATGGAGCCGATTGTCGAACCCTTCGGCCA TGTCTTGGCACGGGTCACCTTCGCGCCGCCGCGCGCGCGCTGGATCTCGAACCTCGACGGCAA GCCGATCGATTCCGCGGCGGTGATGCAGCCCGACTATTGGGTGCGCCACCTGCGCCAACCGGT CCGCTTTCACGAGGGACTCAGTCACCTGTTGGCCGAGGACACCCATGCTTGGGTCGAAGTGGG TCCCGGCCGAACCCTGTCCTCCTTCGTCCGCCGCCACCCGGCCTACCGTCACCAGCCAATCGT CAACCCCATGCGCCATGCAGTCGAGTCGACGGGCGACGTGCGCCGGTGGCGCCAAGCGCTGGG CGAACTATGGCGGGCCGGCATGCCGGTCGCCTGGGAGCGGCAGCGGCGCGGCCGGCATGCCGG ACGACGTGTGCCGCTGCCGGGCTACCCCTTCGAGCGGCGGCCCTTCGCGGCCCGAAGACCGGT GGAGCTGGCGCAGCCCGCGCCCAAGGCGGAGCTGGTGAAAAACCCCGATCCCGCGCGGTGGCT GTACCGCCGCGTCTGGCGCCCTGCCCAGGCTGCGGCCGGCGGACTGGCGGTGCAGGCGACCGT
TCTGGTCTTCGGCGACGGGTCCGAGCTGTGCCGCGCGGCGGTCGCTCAGGTGCAGCGCCAGGG GCTGAAGTGCGTCTCGATCACCGCGGGCCGCCAATTCGCGCGGGAGAGCGACATGCGCTTCAC GCTTGACCCCGCTGATCCGCGCCAGCTCGACCAGCTCTTCGCGGCCCTCGATGGCTCAGGCTC GCGGCCGCGGTACGTCCTGCACCTGCTGACCCTGAACCCGCCCCCGGATGCCTCGGCGATCAT CGCTCACAGCTACTACAGCCCGATGGCCTTGGCTCATGCCTTGGGCGCCCACGAGATCGCGCC TGTCTCGATCACCGTCGTCACCGCCGGGGTCGTCGCCGTCGCGGACGAAGCGATTCGCGAGCC GCTGCAGGCGCTGATCGTGGGCCCGTGCCTGGTCATCCCGCAGGAGTTTCCCGGGCTCAGCGT TCGGCTGCTGGACGTCAACGTCGACGATCCGGCACCGCGTCTGGCGGAGCGGCTCGTGGCCGA GCTCTCGGGCACGGATCACATGGTGGCGCTGCGCGGCGGCGAGCGCCTAGTGGCCGATGTCGA TCAAGTCGATGGCCTCGGTGTGGGGATCGCCAAGGTGCCCTTGCGCCGCGAGGGCCACTACCT GATTCTCGGCGGCCTGGGCGATATCGGCTACCACTGTGCCCGCTATCTGGCCCAAACCTACCG CGCCAAGCTGACGCTGACCGCGCGTTCGTCACTCCCGCCGCGCGCGTCGTGGGAGCGAATGCT GCGCGAGGGAAACCTGGATTCCCGGCAGCGCACGCGCATCGAGCGCGTGTTGTCGCTAGAGGC GTGCGGGGCCGAAGTCCAGACGGCTGCGGTCGACTTGGGCGATCGCCATCGCTTGGCCGATGT GTTCCGCGAAGCACGGGGCCGATTCGGCGCCATCGCGGGCGTGATTCACTCGGCGGGGATTCC GGGACACGTCCACTCGATCGACGAGCTGGTGCGCGTCCGCGACGAAGCCCAATTCACCGCGAA GGTTCGAGGGCTGCACCACCTGGCCGAGGTCGTCGATCCGCTGAACCTCGACTTTTGTCTGCT GTTCTCCTCGCTCTCGACCGTCCTCGGCGGGCTCGGCTACGGCGCCTATGCAGCGGCCAACGC CTACATGGACAGCTTCGCCCGCCGCCACGATCGGCCGGACGAATGTCGTTGGATCGCGGTCAA CTGGGACGCCTGGCTGTTCGAAGCCAAGACGTCGTCGGTCGGCGCCGAATTGGCGCGCCTGGC GATCGTGCCCGAGGACGCTCCGGCCCTGTTCGCGCGGGTGCTAGAGCGACTTCCGCAATCGTT CATCGTGTCCACCGCCGACCTTCGGGCCCGCATCGACACTTGGATCCGGGACAAGAACCGCGT CCCGCCCGCCGAGATCCGAGCGGTTCAACCGCGACCGGACCTGAGCCAGGCGTACGCCCCGCC GATCGGCCCGCTGGAGATTCAACTCTGCGGGCTGGTCTCCGCCTATTGCCGGTTCGACCGGAT CGGGCGGGACGATTCCTTCTTCGAAATCGGCCTCAGCTCGTTCGACTTGATCCAGCTCAGCTC GCGCATTCACCGCATCACCGGCAAGGATCTCAATACGACCCAACTGTTCAGCTACCCCACCGT GCGCGCCTTGGCGCTCTTCCTCGGCGGCGAACCGGAGGGGCTCGCGGCGGAGGAGCCCGCCAT GGAGAACCTGTGGCTGCAACGAAGCGATGCGACCCTCGATGAGTGAGACCGAGGTCGCCGACT GCGGCGCTACCGACCGCGGTCGAGGATTTTCCGCGCAGCGATCCGGGACGACTCGCTGAAGAA GCGCGATAGAAGAACGGAATCGTGTATGAAATACGAAACCACCGGATTGGAATTGGCCGTCAT
CGGTCTCGCTTGCCGCTTTCCAGGCTCACCCGATCCCGAACAGTTCTGGTCGAATCTGCGCGC AGGTCGCTCCGGAATCCGCCATTTCAGCGATGCCGAGCTGAGCCACATCCCCGCATCCCTGCG TCACCATCCGCATTACGTCAAGGCCAAAGGCGCGCTGGACCACGCCGATTTCGAACCAGCCTT CTTCGGCTACTCGCCCAAAGAGGCGGAGGTGATGGACCCTCAATTCCGGCTGCTCCATGAGTG CTGCTGGGAGGCGCTGGAGTCAGGCGGCTATGCGCCGAGCCAATTCGCGGGTCGGATCGGCTT GTTCGCGGCGGCGGCCTTCAACGACGGATGGATCGCCGGTACCCTCGACCGGCTGCGCACCGG CGTGGGTTTGAGCTCCCTGGAAACCGCGTTCTTGACCCTGCGCGATTACCTGACCACCCAGAT CTCCTATCGGCTCGATCTGCGGGGCCCCAGCCTGCTTGTCCAAACCGCCTGCTCGTCGTCGCT GGTGGCGGTCCAGCTCGCCCAGCAGGCGCTGATCTCCGGCGAATGCGCCCTGGCCTTGGCTGG CGGCGTGTGCGCGACCGATCCGCTGCATTCGGGATACCTCTATGAACCCGGCAACATCTACGC GCGCGACGGCGTCTGCCGACCGTTCGACGAGGCAGGCGCCGGTACGGTCTTCGGCGACGGGTG CGGCATGGTCCTGCTCAAGCGGCTGAGCGACGCCCAGCGCGACGGCGATACGATCTGGGCGGT CATTCGCGGGGCGGGCGTGAACAACGACGGGCACCACAAGGTTGGCTACACGGCTCCTGGCAC GAGGGGCCAGGTGGCTTTGCTTAAAAGTGTTTATCGCGCGAGCCGGGTCGACCCGGCGACGCT CGGCTACCTGGAGGCCCATGGCACCGGCACCGCGCTCGGCGATCCAATCGAGGTCGAGGCGCT TACCCAGGCCTTCGCCAGCAAACGTCGCGGCACCTGCGGCTTGGGCTCGGTCAAGGGCAACCT GGGTCACCTCAACACGGCGGCCGGCATCGCTGGACTGATCAAGGTGGTGCTGGCGCTGAAACA TCGCGAAGTGCCACCCACCCTCAATCTGCGCCGTCCCAATCCGAAAATCCGCTTCGACGAGAC GCCGTTTTTCCCAGTCGTCGAGTTGCAACCCTGGCCAAGCGGGACCGGCCCCTTGCGAGCCGG CGTGAGCTCCTTCGGCATCGGCGGTACGAACGCCCACGTCATCCTCGAGGAGGCACCGCCGAC GGCCAACCCGGCGCCACACGGCAGATTCCGACTGTTGCCGCTTTCGGCCAAGACACCGGCTGC GCTCGAAGCGAAGCGCCGCGATCTGGCCGGCTTCCTCGAACGCCACCCGGAGACCTCCTTGGC CGACCTCGCCTTTACCCTGCAACGCGGCCGCGAGGTCTTCAGTCACCGCGCCTGCCTCGCCGT GGAGACCTTAACGTCCGCGCGCACGCGGCTGAGCGGCGAGTCGTCGAGCACTTGCGTGGTGGG CCCCGCGCCCAGCGCCATATTTCTGTTCCCTGGTCAAGGCAGCCAGCTCGCCGGGATGGGCCG CGGTCTGTATCACCATTTCGAGCCGTTCCGCACGGCCGTCGATGCCTGTCTGCGCGAGCTGGA GCCAGGACTGCGGCAAGCGCTCAGCGCCCATTTCGATCCGAATCGCGGCGCGGACCCACCCGA TTCGACGACCTTCGTCCAACCCTTGTTGTTCCTCGTCGAGTACGGGGTGACCGAGTGGCTACG CTGCTTGGGTGTGCGGCCAACAATGGTGTTGGGTCACAGCTCTGGCGAGTATGCCGCAGCCTG CGTCGCGGGCGTTCTGTCGCCGTCCGCGGCGGTCTCGCTGCTGGCCGAGCGCGAGCGGCTGCT
GCGCGACCTGCCAGCCGGCGCCATGCTCGGCGTCCCGCTGGCCGCCGAGGCGCTCGAGGCGAT GTTGCCCGACGCTCTCGATCTGGCGGCGATCAACGGCTGTCAGCTTTGCGCCGTGTCCGGGCC GGTCGCGGCGGTCCACGCCTTCAAGGCCCAACTGGAAGCCGCCGGACATCACGCCCGCCTGTT GCACACCGATCGCGCCTTCCACTCGCGGCTGGTAGCACCGGTGCTTGACCGGTTCCAGGCAGC CGTTCAACACGTGGAGCTGCGGCGGCCGCAAGTACCTTACCTCTCGACCGTCAGCGGGCGATT GGAGGCGGATGGGCCGGCGAACCCGCACTACTGGGTGCGTCACCTGCGCGACACGGTGCGGTT TGGTCCAGCCCTGGAGGCGCTGCCGCCGGTGGATTCCTTCGTGTGCATCGAGGTGGGACCAGG CTCGGCCTTGAGCACCATGGCGCGCGAAACGTTGGGTTCCCAGGCGCGACTGATTTCGTTGCT GCCGCGGCCGCGAACGGGGCAAATCGAGCCCGGTCCGGTATTCGAACGACTGGCGGCGCTTTG GCGCAGCGGGTTGACATTGGATTGGTCTAAATTGACGGGCGGCGAAGAGGGTCATCGAATTCC CTTGCCAGTCTACCCGTTTCAGCGCAGCCATCTGTCGAGCTCCCTGGCGGCGGGCCACACGCC TTCGTCGCGGCCTGCAGTCGAATCAGGCGCCATCCTTGCCGAGCGATCCGCAGGGGAAAACGC TGAAACCCGGGATTGCCCGCTGCCAACCGCCACGCTCGAGCCCAAGGCGGTCGCTCCGGCCCC ACTCGAGGCTACCGACGCCGCAGGTACTCGCGAGCGACTGGCCGAACTTTGGCGCGAGTTGCT AGGGTTGACCTCGATTGGGCCCGACGACCATTTCTTCGACCTGGGCGGCCACTCGCTGACCGC CACGCGGCTGCGCGCCCTGATTCACCAGCGGTTCGATGTCGATCTCGGGCTCGACGAAATCTT CGCTCATTCGCGTCTCTCCCAGCTGGCCGCCCGTATCGAGGCGGCGGCCAAGAGCCGATTTTC CTCCATTCCCAGCGCGCCGGACCAGGACGACTATCCCTTGTCATCCGCCCAGCAGCGGATTCA CAGCATCGTCACGAGGGCCGAGGTCGGCACTGCTTATAATTTTCCGATCGTCCTCGAGCTGCA GGGCGCTCTGGATCGAGTGCGATTCGAGGCGACGTTCGCGGCATTGTTCCGGCGTCATGAGGG GTTCCGCACCCGCTTTGTGATGCGCGATGGCGGGCCGCGCCAGCGCATTGTACCGGACGTGGC GTTTCGCCTGCCGCTCACCCAGGTCGAGCCAGAGCAGGTTCCCGGGCGCATCGAGGCCTTCAT CCGTCCCTTCGATTTGGAACGCGCGCCGCTGTTCCGCGCGGAGCTGTTGCAGTTGGCCGAGCA GCGCCATCTGCTACTTTTCGACATGCACAACTTAATTGCCGACGGTATCTCGCTCAACCTGTT CGTCGCCGATTTCGCGGCCCTGTACCATGGTCGTCCGCTGGCGCCGCTGAAACTCCGCTATCG CGACTATGCCGTTTGGCAAGAGGCGCGGCTGGCCTCCGATGACCTGCGCAGCCAGCGCGAATG GTGGCACCGGCGGCTTTCGCCGCCGGTCGCCACGCTGGCGCTCCCTCCCGATTTCCCGCGTCC GGCGGTGCGCCGCTACAAGGGCCGTAATGTGGTGTTCCACCTGGACCGGGAGATCCGCGACCG CCTGGTGGCCCTGGCTCGAACCCAGGGGGTCACCATGAACGTGATGATGCTGGCGCTCTGGGC TGCGCTGCTGCATCGCGAAACCGGCCAATCGGAGCTGGTGGTCGGATCGCTGCTCGGCGGGCG
GCCGCACAGCGAGCTGCATCCCGTGATCGGGCTCTTCACCAACTTTTTGCCCTTGCGGTTGGC GGTCGAGGGATCGACCCGCTTCGATCGCTTCCTTGCCGCTTGCCACCAGGTGTTTCTCGAAGC CTATCAGCGCCAGGACTATCCGTTCCACTTGTTAGTCCAGGAACTCGTGCCGGTCAGGGACCC GTCGCGGTCGCCGCTGTTCCAGACCTCGCTCGTCTACCACAACGAAATTGACGGCAAGACCAA GCTGGAATTGGAAGGGCTGAAAGTCGAAGTGGTTCCCTTCGAAAAGGGTGTGGCGAGGCTGGA TTTGAAGCTGGATGTGACACCTTTTTCCGACCGACTCGAATGTGTTTTGCAATACGACTTGGA TCTGTTCTGCGAGGAGACGATGCGCGGCCTGATCGCGCGGTTCCAGGCGTTGGTGGCGGGGCT TGTCGCCGATCCGGCGCAATCGCTCGCCGCCGCGAGCGTTTCCGGGAAGCGGGCGCTGCGCGC GGGCGTGGCCACGGCAAGCGAATCGTCGCCGCAGTCACTGCCGCCGCAACCATCGACGGCGTA CGCCACTCCCTCACCGCAGTCACCGTCGCCGGTAGTCCTGACGGGACCCGCCGACCTGCCCGC GATCTTGGCGGCCTACGTGGGGCAGAACCCCCATCCGTTCGCGATCCATCGGGGTCTCATTTT GGAGGCGCCGCTGGGGTTGCGAGCGCTGCGGTCGGCGCTGGACGCAGTGCTCGGAGAACACAC CCATTGGCGCAGCGTGCGTGCGGGCGATCGCGCGCGGCGCGTGGATAAGTTGGAATTGACCAG CCTGGTGCGGCTCGACGACCTGCGCGGGTTGGTCAATCCTCAGGCGAATGCCTTCACCCTGGC TTGGCGCGATCTGGCGATGCCGTTCGGGGAGGGGCGTCCCCTGTGGCGACTCCGCCTGGCGTG GTCGGCTCCATCGCGCTGGTTGCTATTGCTGACGGTTCATCCATTGATCGGCGACAACGGCAC GGTCGACCTCTTTCTGGCGGCACTCGCCGATCACCTGCGCCGCGCGTCCGCTTTTCCCGTAGC ACCGCTCGATGAGGCCGAGCTGGAGGCGGAGCTGAAGTGGGGAGAGGAAGGGGAGGGCCTCGG GCTGACCGCGATCGCGCCGGTCCTGGGCCAATTGCGCGAAAGTCGGCTGAGTCCTGTGGCCCA GATGTGGCTGGACGAGGTCTGTCGCCGCCACGACCTCACCCCGCTAGAGGTCTTGGCGGCCCG GCTCCTCGATTGGACACGAAGCCACGGTCACGGGTCGATCGCTTTGTGGACGCCGCTGCCCGA GGACCATCCGCTTCGCGATGAAGGCCGCTGCCTCCAGGTTCGCCTGCTGGAGGGGCCGCCGTC GCAGCGAGGAGCGGGCGATCCAAGCTGGCTCGAGCAAATCGCCTTGAGACGGGGTACCCCTGC AACGGAGGTCGTTTGCCCTACTCCGACCCAACGGGCAGCCATCGACCTCGCGCTGGCCTGGCT GCCGCAGCCGCCTCTTCACGGTTTGGTCGGAACCGTTCAGCCGTGGCCGGAATCTCCATTGGT CTGTCCGTTTCCCCTCAATCTCGCGTTCCGGCCAAGCCATCCAATTGCCTACGCGCTCAAGCA CGAGGCCACGCTCGCGGTCACGGCACGGGCGCGCGATCTGATGCGTTTCCTCGACGGCTTGGG CCCGGAAAGCTGAAGATTAGCATAAGCGCCCGGCCAAGGGCATCCTAGGATGACGCAAGCCTC GGCCGCGTCGACGTCCCAGGTCGCGCCGGAGGTCACCCCCGGCCGAAAGGACGACGATGACGA TCAAATCCGAGATGTCGGCCGTTGCTCACTCTGCGGAGAGCGGCTTCCGCGCTGGGCCACGCG
TGGGCGGCGCGATGAAGCGGGGCCGGACGCCGGAGCAGGCCGGCGTGAAGCTGCTCCGCGCCC CGGTGAAGCGGAAGTGGCTGCCCCCGGCGCCCGTCCTGCGCCTGAGCGAGCGGCGTATCCCGG AGGTGTGGGCAGGCTACCGCGCGAGCGCGGGATGACCCGAGCCCCGCCCGCCGGCGCGACCAT GACGCCGCCCCACGGGGCGAGTCGTCCGGCGCGCCGGCGCGCGTCGGGGCTTCCGCCGCCGGG CGGGCAGGTGCAGGATGGTCGGGCATGGTGACGCGTCCGACGTCCGACGGCATCGAGGACGAG CTCGCGCCGTTCCCCCCGGTCCTGCGCGGCTGGCTCATCGAGGGCGAGCTCGGCCGCGGCGGG ATGGGGCGGGTGTTCCGGGCGCGGCACCCGAAGACGCGGGCGCGGGCGGCGATCAAGGTGCTG CTCGGCGACTACGCCCGCCGGCCGGACGTGGTGGCCCGCTTCCGGCAGGAGGCGATCGCCGTC AACATCATCAACCACCCGGGAATCGTCCGCGTCTTCGACTCCGGCGAGCTCGAGGACGGCTCG CCCTACATCGTGATGGAGTACCTGGACGGCCGGGGGCTGCGCGACTGGGTGCAGGCCGTGCCG CCCGCGGAGCGGCCGCGGCAGGTCGTGCGGCTCGGCTACCAGATCGCCTCGGCCATGGCCGCG GCGCACGCGTCCAAGGTCGTCCACCGCGATCTGAAGCCGGAGAACATCATGGTGGTCGAGGAC GAGCTCGCGCCCGGGGGCAGCCGCGTCAAGATCCTCGATTTCGGCATCGCGAAGGTCCTCTGG GGAGGTCTGCCCGAGGTGCTGGAGCTCGAGGGGAGAGGCTCCCTCGCGCCCGCGTCCGCGTCC ACGATCCGCACCGAGCTCTCGACGCGGCCGGCGCCGACGGTGGGCGCCACGACCGGCCCAGAG AGCCCGCTGGGCGCGAGCGCCACGCCAGAGAGCGCCCTGGGCGCGAGCGCCACGCCAGAGAGC GCCCTGGGCGCGAGCGCCACGCCAGAGAGCGAGGCCCACGAGGAAGACGCGCTCCGGAGCCTC CCCGTCGTGACCAGCGGCAGGCCCGCGATCCACCCCGCGCCGGTCGAGATCCCGCCCGAGGCG GTCTCCTCCGCGGCGTCGCGCGGGTCGCGCGCGTCGATCGAGCCAGGCGCGCCCGCGCCGCAG AGCGAGGGCGCGGGACAGCCCACGATGCCGTTCACGCAAGAGGGCGTGTGGGGCCTCGGGACG AGGAGCTACATGGCGCCGGAGCAGGAGCGCCACTCCGGGAGCGTGGACGTGAAGGCGGATGTC TACTCGCTCGGCGTCATCCTCTATGAGCTGCTCGAGGGGCGGACGCCCGACGCGCCGAGCGCC GCGTGGCCGCCCCCGATGAGCGCCGCCACGCCGCCCGATCTCGTCGCCCTCGTCCACCGGGTT CTGGCGTTCGATCCCGATGCGCGGCCGCGCATGGCGGAGGTGGCGAGCGCGCTTCACCGGCTC GGCCGGGCGAAGAAGGAGCTCGACGAGGCGCTCTCGAGGTGGGTCGTCGGCGGAGGGGCGCCG GGGCTCTTGCCGTGCGGCTATGCTCTTCTCGAACTGGTCCTCCTGGGCCCTGGGAACTTATAC GATTCTTTCCAGCCTGTAAGTGCATTTTTCTTTCAATATCGTCCTCTCTTCATATACGAGGTG AGTTCTCTGAGGTCCTCCTATAAGTCTGGGGTGTCCTATTCGGCCTCTTACTTGTTACTTCGC CTTCTTAGGAGTTTTTCCTTAATTTTGCCCTCTTACATTCCCGTATTCATTCTAACTGGGCCC TATCTCATTCGCTAATACGTTTCTGTATTGTGTACATCTCCTATCATGTGTCAATACTTGTTT CTGTTTATCATTATTCTTATTGTTTACGCTCTTATTTCATTCATAGTATAACATTAGTTTACT GATTATCGCACTTGAATTCGCG
or its complementary strand, (b) DNA-sequences which hybridise under stringent conditions to regions of DNA-sequences according to (a) encoding proteins or to fragments of said DNA-sequences,
(c) DNA-sequences which hybridise to the DNA-sequences according to (a) and (b) because of a degeneration of the genetic code,
(d) allele variations and mutants resulting by substitution, insertion or deletion of nucleotides or inversion of nucleotide segments of DNA-sequences according to (a) to (c) , wherein the variations and mutants offer isofunctional expression products.
8. DNA sequence according to claim 6 selected from the fol- lowing
(a) open reading frames:
Nucleotide Position
ORFl 1666 - 1 Seq ID No 3
ORF2 1605 3338 Seq ID No 4
ORF3 6100 - 3398 Seq ID No 5
0RF4 7110 6374 Seq ID No 6
ORFS 9590 - 8433 Seq ID No 7
ORF6 11393 - 9855 Seq ID No 8
ORF7 13656 - 12712 Seq ID No 9
ORFS 15374 - 18984 Seq ID No 10
ORFS 20003 - 27889 Seq ID No 11
ORF10 28251 - 29402 Seq ID No 12
ORF 1 31720 - 30401 Seq ID No 13 ORF12 31982 - 32932 Seq ID No 14
ORF13 33128 - 33613 Seq ID No 15
ORF14 33661 - - 34007 Seq ID No 16
ORF15 35611 - - 35255 Seq ID No 17
ORFl6 37856 - - 35730 Seq ID No 18
or DNA sequences complementary to said open reading frames,
(b) DNA-sequences which hybridise under stringent conditions to regions of DNA sequences according to (a) encoding proteins or to fragments of said DNA sequences,
(c) DNA-sequences which hybridise to the DNA-sequences according to (a) and (b) because of a degeneration of the genetic code,
(d) allele variations and mutants resulting by substitution, insertion or deletion of nucleotides or inversion of nucleotide segments of DNA-sequences according to (a) to (c) , wherein the variations and mutants offer isofunctional expression products,
and peptide sequences corresponding to said open reading frames
SEQ ID No 19 (>0RF1) λ/DPEREA¥TLGLAFNRAQGRTYARGPEARAEYIGTAMRAADVI EDRFE I ER AVSGGMGDVYR
ARDRVSGQAVALKVLQGASANDLRRFAREAEAL¥TLRLPG¥VQYVAHGVTGAGRPYLAME LD G¥TLEERLAGAPLTLAES¥ALAARVATTLGAIH LGWHRDLKPSNLMLVGGAVERVTLLDFG IARHLRLAPTLTSPGA¥LGTPGYIAPEQVRGDAPVDARD¥FALGCVLFQCLAGRPPFLGNSAL A LMRV LEEPPRLGELRDGI PEPLERLVARMLAKNAGERPRDGAAAAAELAAVAGEGLS IGA SAVAAPAAPGEAITTAERK¥MCVI LAEDGGAEAGATLSEDDGAARAEALRDIAARHGGRLDRL QAR L¥A SGAESPTDLATRAAHCALA RAALGGVPVSVATGLAEVEARLPVGELVDRVAQL IAGRDGLSPPEIRLDDATASLLASRFET¥QGPGGC LRGPKEEPDAVPRLLGKPTPC¥GRERE LSQLATEWRHCVDEPSANAVVWGAPGLGKSRLAWEFLRTLEQREGAAI
SEQ ID No 20 ( >ORF2 )
VRPCARLNASPS¥TASRSGSTAAGSVHASTSAC¥EQPATGRTQPASPR PPGAAALRLTSA P R FNTAGPCNPADHYMLPAEERLPAVRDLλ/DRKAYFVLHAPRQIGKTTSLRTLAQDLTAEGRY VAVL¥SAEVGAPFSDDPGAAELAMLAE RGTAGAQLPADLRPPPFPDAPAGQRIGAA RA AQ AAPRPL¥¥FLDEADALRDATL¥SL RQIRSGYPDRPRDFPHALA VGLRD¥RDYK¥ASVDSGR LGTSSPFNIKVESLTLRNFTRDE¥ATLYAQHTAETGQVFRPDAVDRAFELTQGQP LANALAR QL¥EVLVKDRAQPITSAN¥DRAKEILIERQDTHLDSLVDRLREPRIRAVIEPMLAGTALPSVP PDDLRFAIDLGLVRMTAEGGLDVANPIYREIIVRE AFPIRASLPQIKATW TQDGRLDADRL LDAF SF RQHGEPLLGAAPYHEIAPHLWMAF HRVNNGGGTVEREYAIGRGRMDLCVRYAG ETLAIELK RDGRPDP¥AEGLAQLDEYLAGLGLDRGWLILFDQRSGQPPIAERTRRERALSP AGRE¥A¥IRA
SEQ ID No 21 (>ORF3)
¥TIKKTFRSIDPATLPKHFDSPVAE RLADL EADGTYRYDPSRPREETFVVDTPPPTASGSL HIGH¥FSYTΗTDV RQRRMRGFNIFYPMG DDNGLPTERR¥QNYFHVRTD¥RTPYERGLTLP QAAPETIKKEPPRIVSRPNFIELCHKVTREDEQ¥FKALFRRVGLSVD RNEYATIDDHCRRTA QLSFLDLHEKGHLYSVFAPTM D¥DFQTAVAQAEVEDRPQSGAFHDIAFAVEGTAEEL¥IATT RPEL AACVGVTAHPEDPRYQHLFGKTALTPIFRAPVPIFPSPLVDREKGTGILMVCTFGDAT DVI WREQKLPLRQMLGKNGRVLPVTFGEGA ESRDPAAANAAYAPLQGRGVKQARAAWELL RREEHAAAPGRGPALRGEPRPIERAVKFYERGDQPLEFVPTRQ FVRLADKKAELLEYGDKIK HPDFMRLRYRN TEGLQGDWCISRQRYFG¥QFP¥WYPLDAEGNPDHSRPLLATREMLPVDPT
¥DVPPGYEASQRDQPGGFTAESDVFDT FTSSLTPQISSH GDDPARHARLFPADLRPQAHDI IRTWAFYTIAKAMLHESSVP HHVAISG ILDPDRKKMSKSKGNWTPMHLLDTYSSDAVRY SASAR GTDTAFDEK¥LKIGKRL¥TKI NASKYVLSQSAEVHPISEELDRALLHKLSA¥¥DDA TRSFDEHEFAAALERTEDFFWR FTDAYLE AKARARGEGGAGEAARGSAVAALRLGLS¥ LR LFAPVLPYITDEVWR VYAEETGDTSIHRAK PSAADFAAVAAPSDPGLLDLAAAAMAAVNKR KSE GASVGRWTDLALGANAATLARLKPALGDVLTAVRAGAHALVRPELADGEVLWRCELE PAAAAAAGAGGAAAS EE
SEQ ID No 22 ( >ORF4 ) MIHAEPFEARLVAARPLSPFVRELSFERADGRSFLFEAGQ VNL¥LPLPGGEVKRAYS IASAP DGSPRFDLAVTL¥QGGAGSEH HR ΞPGATLRAIGPHGLFTRDPGDSAPSLF¥ATGTGI TPLR SMLRASLRAGLAAPHLWILFGARFEΞD¥IYRDELEALARGSDRIRYEIT SRGGPSWAGRRGY VQAHVPELYRELAEKSGDPAPH¥FICGLDRMVSS¥RELARGELGVHRKHVH¥ERYD
SEQ ID No 23 ( >ORF5 )
MKSLPSDRAARLAQSDIRTMTLACAKVHGINMSQGVCDTPVPSVILQAVKEAMDRGCNTYSRF DGIVELRHAIAAK ARHNGIAADPΞTDIT¥SAGATGAFQATCMAL NPGDEV LFEPFYAYHA QAILAVEAVPRYVTARSLSWN¥DGDΞLERAITPKTKAIWNSPGNPSGKVFGRMELEQIADLA CHHDLMVITDEIYEYFIFDGREHVSVASLPRMSERTITIGGYSKTFSITG RIGYSVADARWA KAIGAMSDLLYVCAPTPLQHGVAAGIRGLPRSFYTGLAQGYERKRDRFCRALEKAGLPPCVPQ GTYYVLADVSRLPGRTGRERAIYLLDETGVAGVPGDAFFEGTQGSRFMRFCFAKTDEDLEEAC QRIEQLA
SEQ ID No 24 (>ORF6) VSDPRKERLGDMDLEEFRRIGMRI IDWAADYLGHPDRYPVFPAIRPGDVKGRLAPTPPVEPEP MDAVLTDFEQIILPGITHWNHPRFFAYFANTASGPGILGELLAACLN¥N¥ML RTSPAATELE E VLS LRQMLDLDAGLHGAIMDTASTASMVAIAAARDSAEPTIRLRGMAGQRRMRLYASEQA HSSIEKAAITLGIGQEGVRKIPTD?AFRMVPEALRAAVVEDLGAGLRPFC¥AATVGTTSTTSV DPIPAIVSVCREHGL LHVDAAYAGiAAIVPEHRDVLAGCEGADSLWNPHK LFTPMDCSVL YVRDADRLKRAFSLVPEYLRTEGDVTNYMD GIQLGRRFRALKL MIVRYFGHEGLAARIREH LRLGQQLAQW¥DADPDWERLAPTPFSTVCFRMRPSALACIMRSADEAERESIERELDRLNEAL LDEVNKSGRVFLSHTRLHGRYTIRVAIGNIRSDEVAVREA ECLRAAGARLCADERFVSCSRS ADEGRGKS SEQ ID No 25 ( >ORF7 )
MRREEPVLEAFYERYCAAPRETSYHVELPVD¥ELHQEAAPALPQARSLELAGRVALVTGSSRG IGKAIALRLAEQGAD¥AVNYHSNKDAAEQTAAEIRALGRRTMWQADVTRPNAAAELFSSVEA QLGPIDILVNN¥GDFFFKPLAAMTDDE RNVMDSNLSSVHYLCRAA¥ARMRQRKSGRIINIGL SPTYAIRGAPNVAAYSIAKTGVLILTRSLATEEAPHGILVNCVSPGLIDNGYLPPAQKE MER RVPMGRLGRASE¥ADAVAFLASDRASYVSGANIAVAGG D TDRGTEHDRR¥DLFIGHEEP
SEQ ID No 26 (>ORF8) MSGRFPGARNVEEL QKLRAGVECVVTFTEAEA AAGVSREMLANPSYVRRGAPLDGVELFDA SFFGFSPREAESMDPQQRIFLE¥AWEALERAGYDPDAHSGPIG¥FAGSAPSGYHSLAQSDPEI LGALGHYQLTLNNDKDYLTTHASYKLNLRGPSVCVQTSCSTSLVAVVMACQSLLNHECDMALA GGVGIHAHQRRGYLYQENGISSPDGHCRAFDVAAKGTVGGSGIGI LKR ADALADGDHVHA VIRGAAINNDGSSKIGYTAPSVQGQAEVIGMAQA AGVEPDDISYIEAHGTGTPLGDPIEIAA LTRVFRAKTARRQFCAIGSLKTNLGHLDAAAGVASLIKTVMALEHRELPPSLHFERPNPKLEL ESSPFYVNTRLTPWHAARGPRRAGVSSFGIGGTNAHWLEEAPAPPPSGPSRR QLLT AARS EAG ARATADMIEHLDRHSGTSIADVTYTSHVGRRA PFRRAVVGESAADLRAALASEGSPRS ISSCQAARERPV¥FLFPGQGAQHLFMARELYEVEPIFRQS DRCAELLRGPLGLDLRQVLYPA EGQRDDAEQELGRTAIAQPALFAIELSLAKLWMA GIVPQA IGHS¥GEFAAACLAGIFREED ALRLVAERGRLMQQMPPGAMLAVPLAEPELAPYLSDDISLAAINGPALSVVAGPIEAIDALAA ELLDHGLSCRRLHTRHAFHSKMMAPVVDAFTRCVSAVERRPPSGHFLSTLTGGWISPEAATIP AYWARQLVEP¥RFAQA¥RQLLSEST L LE GPGQTLSPLVRQQARADGGQVVVASLPRAKDA GADHLAVIEALGR SAGGTVDWKRFHEGEARRRVLLPTYPFERQRYWASPRHTSAPPEAIIK PLLAKNPΪWADWFFLPAWRRSDPPVSFDAQA¥TTRRSTWLVFIGDEGLGAA VEGLARRGHEV VAWTGERFEQTGTQRYTIDPAANGD¥ASLFARLEIEGRMPDRIVHAFCTSPADGARIERGAA LEIERRLGFDSLLLLAQVIAAQRHPKPLMLGVITTRAHSVIGTEIIEPLRALVLGPCRVIPQE
IPHVSCRNIDIDLPGEGGRAEIAARLIADLERESPDSWAYRGGRR VESIELTDVGRRSAGA APRLRQRGAY ITGGLGGIGLVAAΞLLAREAHARLILVGRTGLPARQG DD LAAHGAGDATS RKILRIRALEEAGAEVKIAAADVSDFNAMRSVIEEARTRFGRIDGVIHSAGIASGGMIQLRTP MAA RVMAPK¥GGTLVLDALLRDERPDFLLICSSLASL¥GGATQIDYCAANAFLDAYAQSREG EEGCRVISVQ DT SDVGMA¥DFKLPADLQEGRRESLKRGISSSEGAE¥LGRILSAGMSGPLA ICTSDLPAYKQSVTTRRSQHEQTPAARPMHSRPTTTGAYVAPETETERRIAAI QDLLGLEQV GANDDFLQLGGHS LATQVLSRVLQTLKVGISLPQFFDAPTVAGLSRLVDAARAEGAGPVAPA IGRVERDAYRIKPPAAEQAARTKP
SEQ ID No 27 (>ORF9)
MEP¥GGVDMNQPAKQQETCVFPTSFAQRR FLDQLEPGSAVYNMPASFRTRGPYD¥DSLVRS VNEIVRRHESLRTTVDVIDGEPVQ¥IAPSLRIEVP¥¥DLSEIDEPEREAEARRLMAEESRRPF D TRGPLLRAKLLR GEADHVLILTMHHI¥SDG SMDVLFKELSTLYAAFHEGRPSPLPELPI QYADFAV QRE LQGEVLESHLGY REHLRGAPTLLELPMDRPRPPAQTFRGSQRAFR PLSL QQA¥QALSRQEGATPFMTLLTAFSVLLSRYARQSDLWGTPIANRTRAELEGLIGFF¥NMLAL RIDLGGDPSFRELLGRVREVTLGAYAHQDLPFERLVEELSPGRSPSHSPLFQ¥SFTLQNTPMD ATNRADIASGGAPLVEMKAAKFDLILELSESPQGLLGTFEYNTDLFDAGTIERMAGH EVLLS SAVAAPDRPIAE PLMGAEERSRVL¥E NSTAALYPEDHCMHELFEQQVERSPEATAVLLQQQ TLTYRELNMRANQLAHHLRSLGVGPEVR¥GLYLERSIETVVAILG¥LKAGGAYVPLDPTYPSE RLGLMMADAAPS¥LLTQASLLSKLPPHGDATLVQLDALHEALSRLPHHTPRSGVTAQNLAYVM YTSGSTGRPKGVLVEHRGLCNLPTVQAKLYGIAPGDRLLQFAPLCFDTSFCEIALALLSGATL VMGTADELLPGPPLVELLKKHAVTAMLLAPT¥LAALPEQQSAALPLRVLTMAGEACPAEL¥KR WKAPGRRLFNSYGPTETTI ASSAADLSDERIPPIGRPIANTQIYVLDEALEPVPIG¥PGEIF IGGVGVARGYHGRPDLTAERF¥PDPFGQTKGARLYRTGDRAR LPDGNLEFLGRNDEQVKVRG VRIELEEIRAALLKHPA¥AQAVAVVREDTPGDKRLVAY¥VGRGGARVTAAELRQSVSERLPAT MVPSSFVALDALPLTPNGKVDRRALPEPEQSAGGEDHVAPRNAVEEE ARI ASVLRLER¥GV HDNFFEIGGDSILSIQI¥VRAQQAGLRLTPRQMFQHQTIAE STVARAVEAVH¥EQDPVTGPA PLTPVQRW LEQEAAEPHHFNQSIFLEVRERLDESALEQAIAHLIDHHDALRLRLARDERGAH QVFAAPGGSTPFQRVDLGALPSAEQISAMEKAASEAQASLDLAAGPVVRAVLFDLGEVAPQRL LVIAHHIAVDS¥S RILLDDLFGAYEQARRGEAVRLPPKTTSVKR AELLTEHAGSEA¥KAEL
GY LDSSRRT¥APLPVDRRAGED¥WGSARHIV¥SLTPEQTEQ LREVPQAYRTRIDDALLTAF AQAIAR TGSPAVLLDLEGHGREELAGVDLTRTVG FTAMYPILLRVDAADPGEALKSIKEQL RAVPGRGLGYGLLRYLRSDTIAEVRALPQAELCFNYLGQLDQAIPEAAPFRPAREYQGSERSP GAHRAHLIEVNASIANGRLYAT TYSERRHEPETIERVAASFVTALRALIAHCTLPEVGGNTP SDFDKVRLRQETIDALDAIDAGPGPSARGSRIEDVYPLSPLQEGILFHTLYATDYTAY¥EQFH WTLEGDFDAEAFTRALQDWARHAALRTSFA ERLDAPLQIVRTGA¥LPVEHQDLRGLAAEEQ TAHISRYVEAERQRRFDLRKAPLMRAGLLRLRKDA CLVETIHHLILDG STQILLKEVFTLY EAHRGHRGHLALELEQPRPYGDYIGWLAKQDQVRTAAFWRRELEGFSAPTPLGVDRAVPHDDG GPRFG RRIA SGDDAARLAAFARQHQLTMSTL¥QGA ALLLSRYSGDPDVLFGMTVSGRSAP IPGIERMTGLFINTIPVRVREPADAS¥LA LKALQEHEAELLEHEHSPL¥EVQAHSDVPRGTP LFESLVVFENYPVQVI FEAPPVEGPTRAEEGLRMIDAQYI SDPPYPLTVNAAFHGTLYLNIGY ERRRFDDQAVERMIGHVTTLLRGFVQRPETS¥RDLPLLTAEEERTQLHA NATAAPYPEGHCM HELFEQQVERSPEATAVLLQQQTLTYRELNIRANQLAHHLRSLGVGPEVR¥GLCLERSIETVV AILGVLKAGGVYVP DPTYPSERLGLMMEDAAPSVLLTQTSLLSKLPPHGDATLVQLDA HEA LSRLPHHTPRSG¥TAQNLAYVMYTSGSTGRPKGVLVEHRGLCNLPTVQAKLYAIAPSDRL QF APLCFDTSFCEIALALLSGATLVMGTADELLPGPPLVELLKKHAVTA L APS¥LAALPEQQS AALPLRVLAMAGEACPAELVKR KAPGRRLFNSYGPTETTI ASSAADLSDERIPPIGRPIAN TQIYVLDEALEPVPIGVPGEIFIGG¥GVARGYHGRPDLTAERFVPDPFGQTKGARLYRTGDRA RWLPDGNLEF GRNDEQVKVRGIRIELEEIRAALLKHPAVAQAVAWREDAPGDKRLVAYVVG RGGARLTAAELRQSVSERLPATMVPSSFVALDALPLTPNGKVDRRALPEPERSAGGEDHVAPR NAIEEELTRI AD¥LGAKRVG¥HDNFFDLGGHSLLLVRVHDRLGQRFDRPPSMVDLFTYPTVA SLARFLGERANGKQSPREAAADVTERGRRRLEARARRAKAIRGPT
SEQ ID No 28 (>ORF10) MKHNIG LPAALATLAFVPACSPNHGEDAPSVTSAESGAAPSADCVA GAKLQAALDGAAAA QKAPGAAAA¥QSGDC RGATGVSDLVASTPTKPGDLFRIGSITKTFVSTLILMLRAEGRLSL DDAVSKYVKGIPAGDQMTLRQILGHTSGLFDYTYSPALGQMIEVDPTRAFAPAELIALATAEA PYFAPGAGFRYSNTNYI¥AGLVAEA¥SGGTLAGLLRTRILDPVGLAHTYLDGAEPPVQGLIRG YGDYGAGLVDITDQLSPTEA AAGALVSNVDDLNRFFALLISHELLSSDE QDMTT TPTM P HEPGYGLGLI ERDSA GSLNGHCGI I GFQSASYGVPGRGDAITALINRSDGDAARL¥DELAK
VVKER .
SEQ ID No 2 9 ( >ORFl l ) MS IDRAVLEQLDRVGGRLAEGKALKLLEDIAWPREVEERFFAAGEDRLPEVEYRVDRDGLARR VAELRELLGAIDGDAPALG RDNVRAQIQAAELLEAAGTRAFSARSQELYGGARSRFFGGSL RNIDLAEHLTER R¥HG DEASDPEEEPLDAGALRDMLAARVAGRAPR DLEIT¥DPRVTAKV VAGMSRVRIRPEATFAA EAEGLWHHEVETHALTAHNGAAQPRCAFLRSGGPRTTRTQEGLAI FAELYSRSLSIGRLTRLAER¥RLVDMAEQGASFLDLYRHLRERGAERRDAYFDAQRVCRGGL¥ EGGAPFTKDACYLAGL E¥YAFLAAVLRGGLRDEVELL¥CGRIALDDIA¥LAELRAAGVLERP RYLPGWLRA QTLLPYFAFTSFMDGIDLGPVERHFQELLRVAADARPAGEGRRRRGRPREG
SEQ ID No 30 (>ORF12)
MSESVAQLEEHRAA TGHCYRMLGSVVDADDAVQETMVRA RSLDKFDGRSSLRT LYRIATN VCIDLRADRARRARPIEEGPVGTVDDALETRPRTHWLEPVPDAHALPADIDAAERAMLRQSIR LAFVAALQHLPPKQRAALLLTEVLG SAAEVADSLNTSVAAINSALQRARATLASRDLGDARP SLPEPQSALLDRYVNAFERYDVDALTALLHQDATLSMPPFTL LRGHESIRA LVGPGAGCRG SRLIPTAASGSPAFAQYRPAPEGGHRAWALIVLDVAGDRIVSMTSFLDTETLFPRFGLPLDLP A
SEQ ID No 31 (>ORF13)
VTIASIDHRDQDLMTGPQAKAPARAAAPDAAPSRRAV AGRVLSGLATLFLTFDAAVKVLKLF PAEASTAELGFPAHLVPTLGYLQIACLVAYLIPRTAVLGAIL TGYLGGAIAIHVRVENPLFS HTLFPIYVAAFL AGL LRDRRVRALTASPSSQGR
SEQ ID No 32 (>ORF14)
MTTKNPRKLFVNLSVRDLKRSMΞFFSKLGFEFNPQFTDEKAACMWSEEAYVMLLVESFFKTF MKKEICSTSTHTEGLFALSCSSRAΞVDDMVKKAVAAGGSHAMDPQDHGFMYG SFYDVDGHHW EVM MDPKAIQP
SEQ ID No 33 (>ORF15)
MTPSERLDATFAALADPTRRAILARLASGEASVTELAKPFAMSQPAISKHLKVLERAGLISRG RDAQRRPCRIEAKPLEDASG LDNYRRF EGSYERLDDLLEELKERESKGERSKR SEQ ID No 34 ( >ORF16 )
VAPASAPAAGGRDAAPFLDEAAQWLRGEQAPASRPAGEGPAGRLPGRVLVADDNADMREYALR LLVAEG TVEAVADGRAALERARAHPPDLVLTDVMMPRLDGFGLLRALRADDRTRGVAWMLS ARAGEEARVDSLEAGADDFLVKPFSAKELLARVRIHVELARRRREAEGQRQYLNDLFMQAPGP IAILRGPEHVFEWNPLYQRLVGGRSLVGEPIRAALPELEGQGIWELLDAWRTGEPIVGKEL PVRLDRRGDGTTEEVFFNFVYQPMRDRDGAVEGVFVFAFDVTDQVRARRRVEALVEALKLADQ RKDEFLAMLAHELRNPMASISLSLTLLDDADGDGPASARYREIARRQMGHLVRLVDDLLDVSR ITRGTVELRLEDVDLAAWQSAAAAVRPAVEARRHDVSLSVGPGDFGMRADATRLEQWTNLL TNAAKYTPPGGSISVRLTREAAVGAPEAVLRVRDTGRGIPAAMLEKVFDLFTQVDQTIDRSTG GLGLGLTLVRRLLELHGGSVAAASAGPGQGSEFTVRLPLGPGAAPQPAPSAGPPPPREGPPPA QRDEPPPPPAQRAEAPEAAADRRRVLWEDAEDVRRVMRAYIEALGHEVTVAVDGLEGVKKLL ELRPEVAFVDIGLPGIDGYEVARRARAAPGGEALYLVALSGYGGPDDQARSRRAGFDLHLTKP WGATLQDVLTAPRT
9. DNA sequence according to claim 7 selected from the following
(a) open reading frames, and peptide sequences corresponding to said open reading frames:
pEPOcosβ_ORFl sequences:
(1) nucleotide sequence
Seq ID No 35 (>pEPOcos6_ORFl . seq) GGATCACCTGCGGCGCGATCGCCGACCTCGTGCTGGTGTTCGGCTCGCTGGATGAGAAGCCGG
CGGCGCTACTGATAGAGACGGCGACGCCCGGGCTGCGGGTGGAGCGGTTGCGGGAGATGCTCG GCTTTCGGGCGGCCCACCTGGCGAAGCTGTCCTTCGACGGTTGCGAGGTCCCCGAGGCTCAGC TGATTGGCCGGCCCGGCTTTGCGCTGATGTATCTGGCCCCCTACGCCCTGGATTTCGGTCGGG TCAGCGTCGCCTGGGCCTGCCTGGGCATGATCCGCGCTTGCCTGGAGACCTGCGCACAGCACA TCCTCACCCGCCGCACCTTCGGCCACCTGCTAGCCGATCACGGCATGATCCAAACCCTGATCA CCAACCTGGGGATTCACCACCAGGCGACGCTGCTCCACACGCTGCAGGCCTGCCGCGCCAGGG ATCGCGGCGACGTGACCGCCTCCGAGGCCACCCTCGCCGCCAAATACCTCGCGTCGCGGACGG CGGTCCAGGAGACGACCAACGCGGTCCAGATCATGGGCGCGCTGGGCTGCGACGAGGAGGGCG CGATCGCCCGCCACTTCCGCGACGCCAAGACGACCGAAATCATCGAAGGCAGCAACCAGATCA TCGAGGCGCTGCTGGCCAAGAACATCGCCCGCGCCGGTCGCGACAACTATCGCCGCTTCCTCG ATGCGGAAGTCGAGCCCGGTCGGGCCGGAGGCGCACCA
(2) peptide sequence
Seq ID No 36 (>pEPOcos6_ORFl .pep) ITCGAIADLVLVFGSLDEKPAALLIETATPGLRVERLREMLGFRAAHLAKLSFDGCEVPEAQL IGRPGFALMYLAPYALDFGRVSVA ACLGMIRACLETCAQHILTRRTFGHLLADHGMIQTLIT NLGIHHQATLLHTLQACRARDRGDVTASEATLAAKYLASRTAVQETTNAVQIMGALGCDEEGA IARHFRDAKTTEIIEGSNQIIEALLAKNIARAGRDNYRRFLDAEVEPGRAGGAP*
pEP0cos6_0RF2 sequences:
(1) nucleotide sequence
Seq ID No 37 ( >pEPOcos6_ORF2. seq)
ATGACGAGCGCGGTCCCGACGCGTCAAACCAGCCTGCTCGACGACTTCGAGCGCGTCGCCGAC GTCGATCCAGAGCGGATCGCCGTCCACGCGAGCGAGACGAGCCTGCGCTATGGCGACATGAAT GCGCGCGCCAACCGCATTGCCCACGGGCTACGGGCGCGCGGGATCGGGCCCAATCAAATCGTG GCGGTGGCGATGGCCCGCACGCCCGAGCTGATGATCGTGCTGTACGGCATCCTCAAGGCCGGC GCGGCCTACATGCCCATCGCCCGCGACGCGCCGCCGCTGCGCCGCGATCATATGCTGCGCGAG AGCCAGGCTGCTCTGATGATCGCCGACGAAGAGATCGCGGGACTCGCGGCCCGGGTGCTGACG CCGGCCGACCCGTTCTTCGCGGCCATGCCGGACCACAACCCCGAGCCGCGTCACGACCCGACC
GACCTGATTTACGTCATCTACACCTCGGGCTCGACCGGCCAGCCCAAGGGCGTGGCCATGGAG CACCGCGCCGTGTGGAATCGCCTGACTTGGATGCAGGCCCAGTATCCAATCGACACGCAGGAC GTGATCCTCCAAAAGACGCCGATCGTCTTCGACGTGTCGGTCTGGGAGCTGTTCTGGTGGCCG CTGGCCGGCGCCTCGGTGGCCCTGCTGCCGCAATCCATGGAGAAGTTCCCCTGGGCGATATCG GCGACGGTGGCGCGGTGCGGGGTGACGGTGATGCATTTCGTACCATCGATGCTGATGGCCTTC CTTCAGGTGGTGGCGGGCCGGCCCGAGATGGCGGACCAGATGAAGGGCCTGCGCTACGTCTTC TGCAGCGGCGAGGCCCTGGCGCCGGCCCACGTGTCAGCCTTTCAGGAGCACATCAACCGAGCG GGCAGCATCAGCTTGACCAACCTC7ATGGACCCACCGAGGCGGCGGTCGACGTCAGCTACTTC GACTGCCCGCCCGGCGCGTCACTCGGGCGGGTGCCGATCGGACGAGCGATCACCGGCATCCAG CTGCTGGTCATGCGCGACGGCGTGCCTCAGCCGCCCGGCGTCGAGGGTGAGCTCGCCATCGGC GGCGTTGGTTTGGCGCGCGGCTACA7CTCACGGCCAGACCTGACCGCCGACCGGTTCGTGCCG CATCCAGGCGGCGACGGCCAGCGGC7CTACCGCACCGGCGATCTGGTGCGCAGGGACGCGGAC GGCGAGCTGGTCTTCCTGGGGCGCA7CGACCATCAGGTGAAAATTCGCGGTCTGCGCATCGAG CCCGGGGAAATCGAGGCCCAGATCAGCGCCCATCCCGATGTGGCCGACTGCGCGCTGATTATC GAGCAGGACTCGGAAACCCTGCCCAAGCTGACCGCCTACATTGTCGTGGCGCGACCGGGCTTG ACCCGGAAGGCGCTGCTACAGTTCC7GGGCGCGCGGCTGCCCGACTACATGCTCCCGAACCGC TTCCTGACCCTCACGGAGCTGCCCG7GACCGCCAACGGTAAGCGCGACTGGCGCGCGCTGCTC GGCCCGCTCGAGACCCTGCCTCTCCCTTTCTCC
(2) peptide sequence
Seq ID No 38 (>pEP0cos6_0RF2. ep)
MTSAVPTRQTSLLDDFERVADVDPΞRIAVHASETSLRYGDMNARANRIAHGLRARGIGPNQIV AVAMARTPELMIVLYGILKAGAAYKPIARDAPPLRRDHMLRESQAALMIADEEIAGLAARVLT PADPFFAAMPDHNPEPRHDPTDLIYVIYTSGSTGQPKGVAMEHRAVWNRLTWMQAQYPIDTQD VILQKTPIVFDVSV ELFW PLAG 3VALLPQSMEKFP AISATVARCGVTVMHFVPSMLMAF LQWAGRPEMADQMKGLRYVFCSGΞALAPAHVSAFQEHINRAGSISLTNLYGPTEAAVDVSYF DCPPGASLARVPIGRAITGIQLLVKRDGVPQPPGVEGELAIGGVGLARGYISRPDLTADRFVP HPGGDGQRLYRTGDLVRRDADGELVFLGRIDHQVKIRGLRIEPGEIEAQISAHPDVADCAL11 EQDSETLPKLTAYIWARPGLTRKALLQFLGARLPDYMLPNRFLTLTELPVTANGKRD RALL GPLETLPLPFS pEPOcos6_ORF3 sequences:
(1) nucleotide sequence
Seq ID No 39 (>pEP0cos6_0RF3. seq) ATGTTACACCCGATTCCCACCGACCGTTTCGCCCTGAGCCGACCGCTCTTTCGCGGGTACCTC GCGCACGATCCGATCGTGCAGGGCGTGCTGGCGGGCGACCATCCAGGCTGGGTCCTGGTGGAC CGCGAGCCCGAGCCGCGCACGGCGCTGCTGTGGGCCTTTTCCGATCGGCTCTTCTGCGTGGGC GCAGCTGACACGCTGACCCCGCACGCGCTGGCCGAGCTGTTCCACGACCGACTGATCCCCCAG GCCCGTAAGATCGGGCAGCCGTT77TCCAGGTTCAGGGCGAGACGGTCGACACCTGGTCGGAC CACCTGCATCAGGTGTCGCCGCACGCGACAGTCTCCTTCCGCCAGGCATTCCGCTTCGACCGC GACCTCTTCGAGCGGCTGCCAACCAAGCCGGAGCTGGCAGAGGCGCGGCTCGTGCCAATCGAC GCGCGGCTGCTGGCCGAACAGGC7GATCTGCGCGAGCGGATACTGGCCTCCTGGTCCAGCGAA GCTGCCTTCCATGCGCGCGGTTTCGGCTTCTGCTACCGCGTAGGTGACCAGCTGCCGAGCGTG TGCCTGGCATCGCACGTAGGCGGCGGCGCGGCCGAGCTGAGCATCAACACCGAGCTCGAAGCG CGCAATCGAGGTATGGCAACGCGGCTGTGCCGGCGTTTCATCGCCGAATCGCTGCAGCGCGGC CTGACGCCTTGCTGGGGCACCGAGACCTTTCGCCTGCCGTCAATCGCGCTGGCCCAGAAGCTC GGTTTCATCCCGACCTTCACCTTCCCCACCTACTGCTTCGCGACCGGCACCGAACAGCCGGAC GACAACTTCCTAGGCGAGCTGTAC7ACAGGGAATCGCGCATCGCCGGAAGTGGGACCGATGAG CCGCAAGCGGTTCGGCTGGCGCGGGGTTGGAGCCTGGCCGGCGACACCGAGCGTGCCGCGAGC TTCGCCGCACGCGCCCTGGCCGAAGGGTGGGCCGGCCACTCGACTCTGGCCACCGATCCGGAT TTCGCCCGATTGCGCGCCAGCGCCGCCTGGCCCCGCCTCAATGTCCCT
(2) peptide sequence
Seq ID No 40 ( >pEPOcos6_ORF3. pep) MLHPIPTDRFALSRPLFRGYLAHDPIVQGVLAGDHPG VLVDREPEPRTALLWAFSDRLFCVG
AADTLTPHALAELFHDRLIPQARKIGQPFFQVQGETVDT SDHLHQVSPHATVSFRQAFRFDR DLFERLPTKPELAEARLVPIDARLLAEQADLRERILASWSSEAAFHARGFGFCYRVGDQLPSV CLASHVGGGAAELSINTELEARNR3MATRLCRRFIAESLQRGLTPCWGTETFRLPSIALAQKL GFIPTFTFPTYCFATGTEQPDDNFLGELYYRESRIAGSGTDEPQAVRLARG SLAGDTERAAS FAARALAEG AGHSTLATDPDFARLRASAA PRLNVP pEPOcos6_ORF4 sequences:
(1) nucleotide sequence Seq ID No 41 ( >pEPOcos6_ORF4. seq)
ATGATTTGTCACTCCCACCGCTTCATTTTCCTCCACGTTCCCAAGGTCGCCGGCACAAGCGTC AAGGACGTCCTCGGCCAAGAGCTATTCCAGGAGGACCAGGTCACGTTCCAGATCGCTCCCAAT CCCCACTACCCACCTGAATGGACTGCGCCTTACGAGGAGCACATTATTGCCGCTGAATTGAAG AGCCAGTTGGCGCCGGAAATTTGGGACGATTACTTCAAGTTCGCCTTCGTGCGCCATCCGCTC GACTGGGCGGTCTCCAATTACTTCTTCTTCCTGCGCGACCGCAAAGGCCATCCGGCCCACGAA TTCCTGGAGCGGAAGGGCTTCGCCGGTACCATGGACATGTTTTTCGGAGCGGCCGGGCGCCAT CCGCTGGTCGCCGGCATGCGCTTCAGCCAATGGGAGTTCTTGTGCGACAGCGAGGGCCGGACG CTGGTGGACTTCGTTGGCAAGTACGAGCGGCTCGAGCAGGACTTCGCCGCCGTGTGTATCCGC ATCGGGCTGACCCCGCCCGACTTGCCGTGCCTCAACCAGACTCGCCACCAATCCTTTACCAGT TACTACGACGAGGCTTTGATGCGCCAAGTCAGCCGCGCGTTAGCTCGCGATTTCGAAATTTTT GATTATGCC
(2) peptide sequence
Seq ID No 42 ( >pEPOcos6_ORF4. pep) MICHSHRFIFLHVPKVAGTSVKDVLGQELFQEDQVTFQIAPNPHYPPE TAPYEEHI IAAELK
SQLAPEI DDYFKFAFVRHPLD AVSNYFFFLRDRKGHPAHEFLERKGFAGTMDMFFGAAGRH PLVAGMRFSQ EFLCDSEGRTLVDFVGKYERLEQDFAAVCIRIGLTPPDLPCLNQTRHQSFTS YYDEALMRQVSRALARDFEIFDYA
pEPOcos6_ORF5 sequences:
(1) nucleotide sequence
Seq ID No 43 ( >pEPOcos6_ORF5. seq) ATGAAAGTGGACAAGCGGAATGTCGACGACATTCTCGGACTCACTCCGACACAGACAGGCATC 7TGTACCACTACCTGCTGGACCCGCAGGCCGACGCCTATTTCGAACAATTGACGCTGCACCTG GAGGGGCCGCTCGACGTAGCGCGCTTCCGCCGCGCCTGGGAGCGCGTGGTGGCGGCTCACGAC CAGCTGCGCGCCGTGTTTCGCTGGCAAGGGATCGAACACCCGGTGCAGATCATCCTCAAGCAG CACGTGCCGGACCTGGAGTTGGCGGAGGTCCCGCGCGACGCCGATCCGGCAGCCTTCCTGGCG CAATGGGTCGCGGCCGACCGGGCGCGCAAGTTCGACTTCGAGACGGTGCCCTTTCGCATCGGC CTCTGCCGGACTGATACCCAACATCACGTGATGCTGCTCAGCAATCACCATATCCTGATGGAC GGTTGGAGTACGGGCCTGATTCTGCGGGACTTCCTCGCCTGCTACGGCGACTCCGAAAACTGG CGGCCACGCACCCGAACGCACTTCAAGGCGTTCATCAAGTGGCACCAGAACCGGCCACGCCGG GGCGAGGAGCGATTTTGGCGCGACCTGTTGCGCGATGCGCCCGACGGCGGCTTTCCCCGCCTG GGCGTCGAAGAAGGCACCCGCCACTCGCTTGACTTCGGCGCCCGCAGCCGCGCTCTCGACGAC CGCTTGACCCAAGGCTTGCGCGACATGGCTCGCGACCTCGACGTCACCCTCGCCGCGATGCTC CATACCGCTTGGGGCCTTCTAC7CCAGCGCTACCAGAACAGCTGCGAAGTGATATTCGGGACC ACCGTTTCCGGCCGCAACGTCGAGCTCGCCGGCCTCGACGAGGTGGTCGGCTTGTTCATCAAC ACGATTCCGTTCCGCTTCTCGGCCGCGGCCGCGACGACGCCCGTCGAGGCCTTCCGTGCGGTA CAGCGCAATCTGCTGGCGAGAAGCGAGTTCGAAGCCACCCCGCTGGTGGACATCAAGGGCTGG AGTGGTCTCGGTCCGGGCGCGGAACTGTTCGACACCATCCTGGTCATCGAGAACTATCCCTTG GACCGCGCTATCTTCGAGAGTGATTCCAGCCTGCGGTTGACCGACCACCAAATCTTCGAGCGC ACCAATTACGGGCTGACCCTGACCATCGAGACCTTCAGCCGGTTGCACGTGACGCTAGCCCAT CGCCGTGACCTGCTGGGCGACGCGGCCGCTGAGCGAATGCTAGATCATTTCACCGGCCTGCTC CAAGCCATGCTGCGCTTCCCTCACCAGCCGTTCGCGCGCCTCGAGATGAAAAGCGAACACGAG GCCCACCGCGTCCTGCACCAACTCAACCAAACGCGTCAGCCGCTGCCGTCCCAATCGGCTTTC CACCAGTTGTTCTTCGAGCAGGCCCAGGCCGATGGGGCACGACCGGCGCTGTGGTGCGGCGCC ACGCGCTGGACCTACGGCCAGCTGCTGGAACGTGCCCTGCGTCTGGCGGGACGGCTGCAGGAA GCCGGCTTCGCCCGAGGCGATGTCGCCGCCGTCAGCCTCGGCCCGGTTCCGGATCTGATTCCC GG7TTGCTGGGCCCGCTGTTCGCCGGCGGCGCCTACCTGCCGCTCGATCCCACCCTGCCGGCC CAGCGCTCGCGGTTCATCCTCGACGATGCCGGTTGCCGCTTCCTGATCAGCGACGCGCCACTC
GCGGGGCCCACGCCGATCCATCCGGACCCTGCCGGCGCCAGCCCCGTTGACGTCATTTTTGCC TGTCAGGACGGCGCCGCGCAGCCCGCCTACCTGATCTACACCTCGGGCTCCACCGGCCAGCCC AAAGGCGTCTGGGTTAGCCACCGCAACCTGATCAACTTCCTGACGGGCATGAGCGCAATCCTG CCGGTCGCGGCCGACGACGTGT7CCTCTCGCTGACTACCGTGTCGTTCGACATTTTCGGGCTC GA3ACGTGGTTCCCGCTCAGCCGCGGCTGCACGATCGTCTTGGGCACGCGCGCCGAGCAGTTG GACCCGGCCGCGGCTGCCAAGGCCATCTCCTGCCATGGCGTCACGGTTTACCAGGCGACGCCA TCGCGACTCCAACTTCAACTGGAGCACCCCACATTTGTCCGCGCCATCGGCTCCCTGACGACC CTGCTGGTAGGCGGCGAACCCCTCCCAGCCGAGCTGCTGCGGCGCGTACGCGAAGTGACCGAT GCGCGTATCTTCAACCTCTACGGTCCCACCGAAACCACCATCTGGTCCACAGCCGGGGAGGTC ACCGCGGCGGACGTCCCGGATATCGGCCGCCCGATCGCAAATACCGGCGTTTTCCTTCTGGCG CGAGACGGCTCGATCCAGCCGCCGGGCCTGGTGGGCGAGTTGTGCATCGCCGGCGAGGGCGTG GCGTTGGGCTACCACCGACGGCCGGACCTGAACCGAGAACGGTTTCGCGAGATTCCGCCGGGC CGCCTGCCCTTTGCCGGCAAGCTCTACCACACCGGCGACCTGGCCCGCTGGACCGAAGACGGA CGGCTCCTCTGCCTGGGCCGTCTGGACGACCAGCTCAAAGTGCGCGGCCATCGCGTCGAGCCG GGCGAGATCGAGGCAGTGATGGCGCGCCACCCGGCGGTCACGCAGGCGGTGGTCGTCACGCGG CCGCGCAACGGCGAGCCGGTCTTGGTCGGGTTCTGGACTGCGGAAGGTGAGCCGATGCCAGAG GAAGCGCTGAGCGCTTACCTGGCCGACCGACTGCCGAGCTACATGGTACCCGAACGGTGCATC CTCATGAAGGCCATGCCGCTAACCGGCAACGGCAAGATCGACCGGCGCGCCCTACCCAATCCC TTCGCCTTGACCGAGTCGACCCGGCAGGCGGCGCCGCGCACCTTGGCCCGCACCGCCGGCGAG CATCGGGTTGCCGAGCTGTGGCAGGCCTTGTTGCGACGCGAGGCGATCGGCTTGGACGAACCC TTTTTTCAGGCCGGCGGGAACTCATTCGGCTTGATTCGGCTTCACGCCAAGCTGGAATCCGCC TTCGGGAAGTCGTTCCCGATCACCGATTTGTTCCAGCATACCAGTATTCGCAGCCAGGCAGAA ATGCTGAGCGGCTCGTCCGTCGAGGCGCCGCTCGCGGGAGCCGTGCCGCAACCCCCGGCCGCC GCCGCCCAAGTTGCCTCCTCGGCAGCTAAATCCCCAGGGGAGCGCGGCGCGGCAGCGACGTCG AGCGGCCTGACCGCGCAACCGCCCCAACCCCACTTCCGGCCCATCGCCGTTATCGGCCTCGCC GGCCGATTCCCCGCCGCACCCGACCTCGACGCCTTCCTTGAACTGCTCACGGAGGGTCGCTGC GGCATTCGCTTCTTCAGCCAAGCCGAGCTGCGCGACGAGGGTCTCGACGCGAATCGAATCGCG TGTCATAACTATGTCCCGGCCAAAGGTTTCCTCGACCGGGCCGACCACTTTGATGCCGACTTC TTCGGCATCCCGCCGCGCGACGCAGAAATCACCGATCCGCAAATTCGGCTTCTGCTTGAGTGC TGCTGGAACGCGCTGGAGCATGCCGGCTACCCGCCCGGCGGCGGCGAGATCGGGCTCTTCGCC
GGCTCCTCGGCCAACTATCACTGGCTCGAATACGTGGGCATTTCCGAGGAGAGCAGCAATCGA TTCGCCGTCATGATTCAAAACGAAAAGGACTACCTGGCCACGCGGATCGCCTACCAGCTCGA7 TTGAAGGGCATTGCCGTCACCGTGCAAACGGCCTGCTCGTCGTCGCTGACCGCGGTCGAGCTG GCCTGCGATGCGTTACACGCCGGCCGCGTGACCATGGCTTTGGCTGGTGGCGTTGGTCTGACC TATCCGTTGCGCGCCGGATACCTGCACGAGGATGGAATGATCTTCTCCCCCGACGGTCGGTGC CGGGCCTTCGACGCCCAGGCGGCCGGCACGGTCTGCGGCAACGGTCTGGGCATGGTGGTGCTG AAACAGCTCGACGCGGCGCTGGCCGACGGCGATGCCATCCACGC7GTGATTAAGGGCATCGCG GCCAACAACGACGGCGCGGCCAAGATCGGCTACACGGCGCCCTCGCAGAACGGTCAGGCGCGG GTGATCCGCGCCGCCCATAGGCTCGCCCAAGTCGCGCCGGAGACCATCGGCTATGTAGAAGCC CACGGTTCGGGCACGCCGCTGGGCGATCCGATCGAGGTGGCGGGCCTGACCGAGGCCTTTGAC AGCCCGCGTCGCGGCTTCTGCGCCTTGGGTTCGGTCAAGTCGAATGTGGGTCATTTGGATGCG GCAGCGGGCATCGCGGGTTTCATCAAGGCGGTGCTCTCGCTGTCCCATCGGACCCTGTTCGCC AGCCTCCACGTCGACACGCCCAACCCGCAGATCCCGTTCGCCGACGGTCCGTTCCAGGTCAAC ACGGAGACCCGGCCCTGGCCAGCTGCCGACCATCCCCGCCGCGCCGGCGTCAGCTCCTTCGGC ATCGGCGGCACCAACGTGCACGCCGTCCTGGAAGAGGCGCCGCAGTTGGCCGAGCACGCGGGG CGGCGGCGCGAGCGGCAGCTGTTCCTGGTC7CGGCGCGGACTGCAGCCGATCTGGAGCGACGC ACCGCGGCGCTGGTCCGCCACCTGGCCGCGCATCCGGACCTCGCACCAGATGACGTTGCCTTT ACCTTGCACGCGGGCCGCAAACCGATGACCCACCGTCGTTTCCTGGTCGCCGCCGACCTCGCG GAAGCCGCCGCGCGTCTGGCCGAGCCCGATCCAGTCAAATCCGCCGCGGCGCGCGCCGACCGC TGCCAGGTCTGGATGTTCGCCGGTCTCGGCTCTCAATACCCCGGCATGTGTGGCGGCCTCTAT CGCACCGAGCCGGCCTTTCGCGAGCAAGTCGACCGCTGTTTCGACCTCCTCGCGCCGCGTTGC GATTTGAAGCCCTCGCTCTTCCCCGAGCCCGATCAGGCCATCGACGCATCAGCCCTCGCGGCC ATCGACACCGCCCAGATCGCCGTCTTCGTCTGCGAATACGCGCTCGCACGGATGCTGGAAGGC TGGGGGCTGCGTCCGGATCGGCTGATCGGTTACAGTTTCGGCGAATACGTGGCCGCCTGCCTG GCCGGCGTCTTCTCCCTGCCCGACGCCTTGGCAATCGTCCGCGAGCGTGGCCGGATCCTGGCG GCGGCCGAGCCGGGCGCGATGGTCAGCGTGCCCCTTCCGGCCGAGCGCGTCGCGTCGCTGCTG GAGCCGCCGCTTGCCTTGGCCATTGACAACGGCCCCTCATGCGTGGTGTCCGGGCCGGTCGAA CCGGTGCGCACCTTCACCGCTCGCATGAAGCGGGACCGGGTCTGGGTGACGCCGCTCCAGGCC GAGCGCCCGATGCATTCGCCGCTGATGGCCGAGGCCGGCGGCTCACTGCGCGCCATGTTGGCC GGGTTCCGCCTGAATGCGCCGCGAATCCCGATCTTAAGCAATGTTACAGGAACCTACCTAACC
GACGAGCAGGCCCGAGACCCCGATTACTGGGCCCGTCACCTGTGCGGCAACGTTCGCTTCGCC GACGGTGTGCGAACCTTGTTGGCCGAGCGCGATCCGGTGTTCC7TGAATTCGGGCCGGGCCGC GATCTGAGCTCCTTGGTGCGCCACCAGATGCCGGAAGGCGCCGACGAGCCGATCGCACTGATC CGTCATCGCGAAGATCCGGTGCGCGACGAAGACCTCCTGCTCGATGGCTTGGGCCGCTGCTTC CTGCGTGGGGCGACCCTCCACGGGCAGGCCTTGTACGCCGGCCGAGGCTGCCGCCGCGTGCCG CTGCCCGGTTACCCGTTCCAGGGTCCACGCTGCATGCCGGCCCGCGCCGGACTGCCCGGCCTG GCGCGACCGACCGTGGGAGCGACCACCATCAGCTACCGACCAGCCTGGAAGCGGGCGCCGCGC TTGGCGGCTGTCGAATCGCTCGCGCCGCAATCCTGGTTGGTATTCAGCGACGGCAGCGAATTG GCGGGCGAGCTGGTGGCCGGCCTGCGCGCTTCCGGTTGCGCGACCACCCTCGTCGAAGGTGGG CTGGCGTTCGCGCGCTTCGCGGGCGGCTTCCGCGCGAATCCCCGCGAGGAACAAGATCTCGCA CAGCTGTTCGCGACCCTGTCGGCCGAAGCGATGCTGCCCACCCACATCCTGCACCTGCTCAGC CTGCCGTCGCCGGAGCGCGACTCGCCGCTGGCGCGCCTGGAGCACCTCACCGAGCTGGGCTTC CACCATCTGCTGGCCCTGGCCCGCCAACTGGAGGCGGTCGGCGCCCCCGAGGTCCGCCTCGCC GTGGTGACAACCGGCCTGGCGGCGATTGGCGGCGAGTCCGAGCTGCGGCCCGAGGTCGGGCTG TTGCGGGGACCTGTCCGCGTGAT7CCCTTTGAATTCCCGAACTTGCGGCTGCGCCTGATCGAC CTCGACTCGGCCGATCCCATCTGGCGTAGCGGTTGTGAGCCGTTGCTGCGCGAAATGGGCGCT GCCCCGGGACCTGAAGAAATCGCGCTGCGCGGCACCAGCCGTTGGGAGTTGGGCTACGAGCCG GTCGAGGGGGGCACCGTGAGCACCATCTCCTCGCGACTGCGCGAGGGCGGCGTCTATCTGATC ACCGGTGGCCTCGGCGGCCTGGG7CTGGCCTTGGCCCGTCACCTCGCCCGGAAGTACCGCGCC ACCCTGATCCTCGCTGGCCGGCGAGGCGCGCCGGCGCGCGAGCTCTGGCACCAGGCGCCAGCG GAGTTCGTACCGGTCGCAGCTGCGATCGCACAGATGGAGGAGTGTGGCGCCCGCGTGATTCCC GTCGCGCTCGACGTCACCGACGCCGACCAAGTGAACGCGTTGTTCGCCACCATAGAAGCTACG GTCGGCAAGATTGAAGGCGTTTTCCACATGGCTGGCATCGTTGACGGCGGCATCATTCGAACG CGCACGCGCGCTGCCAGCGACGCCGTGCTGGCGCCCAAAACGGTCGGAACCTGGATTCTCGAT CGGGCTCTCCGCGGCGCCGGTGGCCGCTTCCTGGTGCTGTACTCCTCGATCAACGCGGTCGTC GCGCCCTTCGGCCAGGTTGCCTACGCCGCCGCCAACGCCTTCCTCGACGCCTTCGCCAGCGCC CACGAACACGACGAGCGTCTTTTCCGCGTCAGCATCGGTTGGGACACCTGGCGCGAGGCCGGC ATGGCCGTCGATGCCGCCCGCGCCCGCGGCGACCAGGCCCCGCTCGAAGGGCTTAGCGACGAG CAGGGCTTGCGCCTGCTCGAAAGCGCCTTGGTCGGTTGCGAACCGCGACTCCTCGTCTCCATC AGCGAACTGCGCGCTCGACTAGCCGAGCATCATCGCAACGGCGGCATTCCCCGGTTGCTCGGG
CCCCGCGCCAACGAGGCGGGTGCAGCTGATTCCGGCGAGGAGGGCGCCACGCAAGACGCGTCG CCGGCCCGTCGCGCCCGTCCCGA7CTGGTCGTGGCCTTCGCGCCGGCCGGCAACGAGCTGGAG CGCCGGATCGTGGCCATCATCGGCGCCTACCTGCGGCTCGGTCAGGTGGGCGTCGACGACAAC TTCAACGATTTGGGCGCCACCTCGCTCGACCTCATCCAGATCGCCCAACGCCTCGGTCGCGAG TTGGGCCGCGATGTCCCTGTCGTC7CGCTCTACCAACACCGCACCGTACGCGGGCTGAGCCGC TTCCTCGGCGGCGCGCTCCAATCCGCGCGGTCCGGCGTCCCGACGGGCGCTGCCGCACCGGGC GCCGCCACGCCGGGGGTTGCCACCCCGCCGCGGCCACAACCGTCGCGCCAGCACCTGGAAAAA CGCCGTCAATTGAGGAAAAAAGGGGGGCCTTCCCATCATGAG
(2) peptide sequence
Seq ID No 44 ( >pEP0cos6_0RF5. pep)
MKVDKRNVDDILGLTPTQTGILYHYLLDPQADAYFEQLTLHLEGPLDVARFRRAWERWAAHD QLRAVFRWQGIEHPVQIILKQHVPDLELAEVPRDADPAAFLAQ VAADRARKFDFETVPFRIG LCRTDTQHHVMLLSNHHILMDG STGLILRDFLACYGDSEN RPRTRTHFKAFIK HQNRPRR GEERF RDLLRDAPDGGFPRLGVEEGTRHSLDFGARSRALDDRLTQGLRDMARDLDVTLAAML HTA GLLLQRYQNSCEVIFGTTVSGRNVELAGLDEWGLFINTIPFRFSAAAATTPVEAFRAV QRNLLARSEFEATPLVDIKG SGLGPGAELFDTILVIENYPLDRAIFESDSSLRLTDHQIFER TNYGLTLTIETFSRLHVTLAHRRDLLGDAAAERMLDHFTGLLQAMLRFPHQPFARLEMKSEHE AHRVLHQLNQTRQPLPSQSAFHQLFFEQAQADGARPAL CGATRWTYGQLLERALRLAGRLQE AGFARGDVAAVSLGPVPDLIPGLLGPLFAGGAYLPLDPTLPAQRSRFILDDAGCRFLISDAPL AGPTPIHPDPAGASPVDVIFACQDGAAQPAYLIYTSGSTGQPKGVWVSHRNLINFLTGMSAIL PVAADDVFLSLTTVSFDIFGLET FPLSRGCTIVLGTRAEQLDPAAAAKAISCHGVTVYQATP SRLQLQLEHPTFVRAIGSLTTLLVGGEPLPAELLRRVREVTDARIFNLYGPTETTIWSTAGEV TAADVPDIGRPIANTGVFLLARDGSIQPPGLVGELCIAGEGVALGYHRRPDLNRERFREIPPG RLPFAGKLYHTGDLAR TEDGRLLCLGRLDDQLKVRGHRVEPGEIEAVMARHPAVTQAVWTR PRNGEPVLVGF TAEGEPMPEEALSAYLADRLPSYMVPERCILMKAMPLTGNGKIDRRALPNP FALTESTRQAAPRTLARTAGEHRVAEL QALLRREAIGLDEPFFQAGGNSFGLIRLHAKLESA FGKSFPITDLFQHTSIRSQAEMLSGSSVEAPLAGAVPQPPAAAAQVASSAAKSPGERGAAATS SGLTAQPPQPHFRPIAVIGLAGRFPAAPDLDAFLELLTEGRCGIRFFSQAELRDEGLDANRIA CHNYVPAKGFLDRADHFDADFFGIPPRDAEITDPQIRLLLECC NALEHAGYPPGGGEIGLFA
GSSANYH LEYVGISEESSNRFAVMIQNEKDYLATRIAYQLDLKGIAVTVQTACSSSLTAVEL ACDALHAGRVTMALAGGVGLTYPLRAGYLHEDGMIFSPDGRCRAFDAQAAGTVCGNGLGMWL KQLDAALADGDAIHAVIKGIAANNDGAAKIGYTAPSQNGQARVIRAAHRLAQVAPETIGYVEA HGSGTPLGDPIEVAGLTEAFDSPRRGFCALGSVKSNVGHLDAAAGIAGFIKAVLSLSHRTLFA SLHVDTPNPQIPFADGPFQVNTE7RP PAADHPRRAGVSSFGIGGTNVHAVLEEAPQLAEHAG RRRERQLFLVSARTAADLERRTAALVRHLAAHPDLAPDDVAFTLHAGRKPMTHRRFLVAADLA EAAARLAEPDPVKSAAARADRCQV MFAGLGSQYPGMCGGLYRTEPAFREQVDRCFDLLAPRC DLKPSLFPEPDQAIDASALAAIDTAQIAVFVCEYALARMLEG GLRPDRLIGYSFGEYVAACL AGVFSLPDALAIVRERGRILAAAEPGAMVSVPLPAERVASLLEPPLALAIDNGPSCWSGPVE PVRTFTARMKRDRVWVTPLQAERPMHSPLMAEAGGSLRAMLAGFRLNAPRIPILSNVTGTYLT DEQARDPDY ARHLCGNVRFADGVRTLLAERDPVFLEFGPGRDLSSLVRHQMPEGADEPIALI RHREDPVRDEDLLLDGLGRCFLRGATLHGQALYAGRGCRRVPLPGYPFQGPRCMPARAGLPGL ARPTVGATTISYRPAWKRAPRLAAVESLAPQS LVFSDGSELAGELVAGLRASGCATTLVEGG LAFARFAGGFRANPREEQDLAQLFATLSAEAMLPTHILHLLSLPSPERDSPLARLEHLTELGF HHLLALARQLEAVGAPEVRLAWTTGLAAIGGESELRPEVGLLRGPVRVIPFEFPNLRLRLID LDSADPIWRSGCEPLLREMGAAPGPEEIALRGTSR ELGYEPVEGGTVSTISSRLREGGVYLI TGGLGGLGLALARHLARKYRATLILAGRRGAPAREL HQAPAEFVPVAAAIAQMEECGARVIP VALDVTDADQVNALFATIEATVGKIEGVFHMAGIVDGGIIRTRTRAASDAVLAPKTVGTWILD RALRGAGGRFLVLYSSINAWAPFGQVAYAAANAFLDAFASAHEHDERLFRVSIG DT REAG MAVDAARARGDQAPLEGLSDEQGLRLLESALVGCEPRLLVSISELRARLAEHHRNGGIPRLLG PRANEAGAADSGEEGATQDASPARRARPDLWAFAPAGNELERRIVAIIGAYLRLGQVGVDDN FNDLGATSLDLIQIAQRLGRELGRDVPWSLYQHRTVRGLSRFLGGALQSARSGVPTGAAAPG AATPGVATPPRPQPSRQHLEKRRQLRKKGGPSHHE
pEPOcos6_ORF6 sequences:
(1) nucleotide sequence
Seq ID No 45 ( >pEPOcos6_ORF6. seq) ATGAGTGAAGTATCCATTCGCCCCGGCTTGGACATCGCGGTCATCGGCATGGCCTGCCGCTTT CCCGGTGCCCGCAACCTCGCCGAGTATTGGGCCAACCTGATCGAAGGCCTCGAAACGCTCAGC
TTCTTCAGCGAAGAGGAGCTGCGTGAGGCCGGCTGCGATCCGGTCCAACTGGCCCAGCACAAC TACGTGCGCACCAAGGGCCTGCTCCCTGACGCAGACCGTTTCGACGCCGATTTTTTTGGTTAT TCCCCGCGCGAAGCCCAGGTGATGGACCCCCAGATCCGCGTCTTCCACGAGGTCTGTTGGCAG GCGCTGGAGCACGCGGGCTACAACCCGCATCGCCACACCGGCACGATCGGCCTGTTCGCCGGC GCCGCGCCCAACGTTTTTTGGGAGTTTCTCTCCTATCGGTCCGA7GCCGCCAATTTAGGCAAC TTCACGCTGGGCCTGCACAACAACAAGGACTACCTGAGCTCGCGCATCGCCTACAACTTCAAC CTGACAGGGCCCAGCTACACCCTGTTCACCGCCTGCTCGACCTCGATGGTCGCCATCCACCAG GCCGTCCAGGCGCTGCTCAACGGCGAATGCGACCTGTGCATGGCCGGCTCGGTCTCCATTACG CTGCCACTGGTTGCCGGCTACACCTACACGCCGGGCATGATCGTCTCGCCCGACGGCCATTGC CGCACCTTCGACGCAGGCGCCAATGGCACTGTCTACGGCGACGGGGCCGGCGTGGTCGTTCTC AAGCGGGCCGAGGATGCGTTGGCCGACGGCGACCACATATTTGCGCTCATCAAGGGCTCGGCG CTCAACAACGATGGCAGTCGCAAGACCGGCTACACCGCGCCCAGCGTGCAGGGGCAGGTGGAG GTGATCCGCGCGGCGATGAACCTGGCGGAGGTCGAGCCGGAGGCGATCAGCTACGTGGAAACC CACGGGACGGGCACCACGGTGGGCGATCCGCTGGAGTTCGAGGCGCTAAAGGAGGCCTTCGGA GGTGGCTGCAAGGCCTTCTGTGGATTGGGTTCGGTCAAGCCGAACATCGGCCATCTGGACGTG ACGTCGGGGATCGCGAGCTTCATCAAGCTGGTCCTGGCGCTGGAGCACCGCATCCTACCGCCC ACGCTCCACTTCCAACTGCCCAACCCGAAGATGGATGTGGTCGATAGCCCCTTCTACATCGTG GCTGAGCGCGAACCCTGGCGCGAAGATCTGCTGCCGCGTCGGGCCGGTGTCAGCGCGTTCGGT CTGGGTGGCACCAACGTCCACATGATTTTGGAGGAGTTTCAGCGCGAACCGGCGGCGAACAGC GCGCGCACGCGCCACCTGACGGTGCTGACGGCGCGGTCGCCGCAAGCCCTGGCGCAGCTGGCG GCCAACCTCGCCGAACACCTGCGCGAACACCCCGAGTTGGCGCTGGCCGATGTGGCCCATACG CTGCTGCACGGCCGCAAGCCACATCCATTCGCGCGCATCCTGGTGGCGACCGATACGACGGCG GCGATCGACGCCTTGATGAACGACCGCGATCCGCGAACGCGTTTCTTCGAAGCGACCGGGCGC GGCGAGTCGGTGATCCTGTGTTTTGACGAAACGCCGCCGGAGCCGCGAAGCGCCCGCTACCTC TGGGATCACGAGCCGCTTTATCGCGCGGCGGCGACGTCGTGCTTGGCTGGTGAGGTCGCCGAC CCGGATCTGGAAGGCTGCTTTACTGCCCTGATCGCCGAGCAGGGCGCGGCAGCCGCCTTTTGC CACCAATACGCGCTGGCCGGATGGCTGCTGGCCATGGGGTTGACCCCGTCGGCGTTGATCGGC GTGGGCCAGGGCGAGTGGGTAGCAGCGGCGCTCGCGGAGGTGTTCCCGCCATCGGCCTGCTTG CGCTGGATTAGGTTCGGCGAACGGCTCCCGCAGCCGCGCGATCAACGGATTCCGTTTCTCTCC AATTTCTCTGGAAACTGGATCGTTGGGCGTGAGTTGGCCGACCCGGATTACCCCAGAAAGCAG AAGGGTAAGCGCTGCATGAAGCGCCGTCGGTCCCAACCTCGGTCAGCTGGTGCAGGATGGGGG CGATGGAACCGGCTCGGTCAGCTCGTCGCGCGCTGCTCTTCCGCGGGAAGCGGAGGCGGGACG GTGATCGGCCCGAGGGCGAGGTTCATCTCGTCGTCGACGAGCCGGGCGCGGGTGCGCGCCCAG TACCTGGGGGCGAGCTCGAGG (2) peptide sequence
Seq ID No 46 ( >pEPOcos6_ORF6. pep)
MACRFPGARNLAEYWANLIEGLETLSFFSEEELREAGCDPVQLAQHNYVRTKGLLPDADRFDA DFFGYSPREAQVMDPQIRVFHEVCWQALEHAGYNPHRHTGTIGLFAGAAPNVFWEFLSYRSDA ANLGNFTLGLHNNKDYLSSRIAYNFNLTGPSYTLFTACSTSMVAIHQAVQALLNGECDLCMAG SVSITLPLVAGYTYTPGMIVSPDGHCRTFDAGANGTVYGDGAGVWLKRAEDALADGDHIFAL IKGSALNNDGSRKTGYTAPSVQGQVEVIRAAMNLAEVEPEAISYVETHGTGTTVGDPLEFEAL KEAFGGGCKAFCGLGSVKPNIGHLDVTSGIASFIKLVLALEHRILPPTLHFQLPNPKMDWDS PFYIVAEREPWREDLLPRRAGVSAFGLGGTNVHMILEEFQREPAANSARTRHLTVLTARSPQA LAQLAANLAEHLREHPELALADVAKTLLHGRKPHPFARILVATDTTAAIDALMNDRDPRTRFF EATGRGESVILCFDETPPEPRSARYL DHEPLYRAAATSCLAGEVADPDLEGCFTALIAEQGA AAAFCHQYALAG LLAMGLTPSALIGVGQGE VAAALAEVFPPSACLR IRFGERLPQPRDQR IPFLSNFSGN IVGRELADPDYPRKQKGKRCMKRRRSQPRSAGAGWGR NRLGQLVARCSSAG SGGGTVIGPRARFISSSTSRARVRAQYLGASSR
pEPOcosβ_ORF7 sequences:
(1) nucleotide sequence
Seq ID No 47 ( >pEPOcos6_ORF7. seq) ATGGAACCGGCTCGGTCAGCTCG7CGCGCGCTGCTCTTCCGCGGGAAGCGGAGGCGGGACGGT GATCGGCCCGAGGGCGAGGTTCA7CTCGTCGTCGACGAGCCGGGCGCGGGTGCGCGCCCAGTA CCTGGGGGCGAGCTCGAGGTAGCGGTCCCGCGGCCAGTAGGGCATCGCGCGAATGACGTCGGC CAGGTAGGCCTCCGGGTCGAGCCCGTGCAGCTTGCAGCTCGCCACGAGCGAGAAGAGGTTGGC CGCGGCGGAGGCGTGGTCGTCGCTGCCGAAGAAGAGCCAGGACTTTCTCGCAACCGCAATGGA TCGCAGCGCTCGCTCGCTGGCGTTGTTCTCCAGGCGCAGCCGACCGTCGTCGAGGAAGCGCCG
CAACGGCTGCTCTTGGTTGAGGGCGTAGCCGAGCGCGGTGGAGACCAGGCCGCGCTCGCGGGG ACGAGCGTGCTCGGCCCTGGCCCAGGCAAAGAACGCGTCGACCAGAGGGCGGACGACGACATC GCGACGCACCTTGCGCTGCGCGGGCGGCAGGTCCGCCAGCGCGCGATCGGCGGCAAAGAGGGC GTTGATGCGCCGCAGCCCCTCGACACCGAGCTCGTGCTTGCAGACCGCCGCCTCCCAGAAGTT GGTACGGCAATGCGACCAGCATCCGACTTCGGTCGGGGGCGGACCGCGCTTCTCGTCGGCAGC AGCGCCTCTTGGTGGTGTGCCGCGGAAGAGGGCGTCATAGATGGCGTGAGCGTCAGCTTGAAT ATACCGAGAGAAGCCGCGGAACATCTCGCAGACCGCGGCGCTGGTATGCTTGGGCTGGTACTC GAAGAAGACGTGATCCTTGTCCGCGAGGACGACGAAGAAGTGTCCCTTGCGGCACGGCCCGGG CTTCTTGTCCTTGCGCTCCTGGATGGGCCCAGGCTGGACGGAGACCCCGGTGGCGTCCGTGGA CAGGCAGAAGGCGGTCTCGAAGGCCTCTTTGCGCGCGGCCTCGACGATGGCGCCCAGGGTCGC ACCGACGTCTTCGGCGTAGCGGCACATCGTGCCGCGATCGAGCGACGCGCCCTGAAGCTCCAG CTGCTGCTCCAGTCGATAGAACGGGACGCCGAGCAGGTACTTGCTGGTGAGGATGTGCGCAAT CATCGACGGCGCGAGGAACGACCGCCGGAACAACTCCTTCGGAAGCGGCGTCGTGATGAAGAC CG7GCAGGTCTCGCCCTTCGGCGCCGGCGGCGGCGCGTCGAGCGAGGGCGCGTCGAGCGCTGT GGAGGAAGCGCTCGGCTCGCCGGCCGCTGCCGTGTCCTCCGGGCTGACGCTCGCAGCGGGCGT CGGCGTCGGGGCTTCTCTCGCGACGACCTGGAGCGGGGCCGCTTCCTCCTCGCCCGAACTGCT CGCATCCGTGACGGACCGCTCGGCCTTGTACACGACGCGTGCGAGCACGATGCGGCGCATTCC GCCGCGCTCGTAGCCGAGTCGCGAGGTCTCCTCGACCCCGATGCGCGTCGCCGTCGCATCGAG CTCGGGGCAGGAGAGCTCGATGCGGACGACGGGCAGGTCGGACTCGGACAGGTCGCGACGGCC CTTGCCGCCGGACCTTCGTTTCGGCCCCTTGGGGTCGTCGTGCTGCCGCTCGTCGCCTGTATT GCGCTCGGCGGCGTCGAGTGCCTTCGCGAGGCGCTGGACCTCGAGGAACATCGAGTCGAACGC CAGCTGCTCCGCGCTCACCTCGGCGCGCTCCGCCTTGGCCACGAACAGTCGACGTCGCAGAAG CTGCAGCTGCTCGAGCGCACGGG7GTAGGCGCGCCGAAGCTGCGCGAGCGCATCGCGCGCTCC CACGAGCTCGCTCTTTGCCGCGGCGAGCTCCGCTTCGAGCTGCGCGATGCGCTGCTGCTCGGC CGAGAGCG7CGGCTTGGCGGCGGCGTCGTGCACGACGCCGCTCTACGTAAGCCGCGCGTACTT GTCGAGCGAATTCGTGCGGCTCAGTGGACGCGGCGCGGTGCGCGCCTTCGCGGTTTGGACGTG GGCGCGATCTCGATGCCGTCGAGCAGCGTCTCGAGCGTGGCGTCGTCCACCTCGACGTGCGTG GCGCCCTCGGTCGGGGGGTCGGGAAGTGCGAACGCTCCGCGATCAAGGCGTTTTGAAAACAGG CAGATTCCACTGCCATCGAAGAAGAGAATCTTGATCGTGGTCCGCCGCTTGCCGACGAACGCG AACAGCGCTCCGCAGCGAGCCTCGTACCCCACACGCTCACGGATGAGACCCGAAAGCCGCTCG AAGCCG (2) peptide sequence
Seq ID No 48 ( >pEPOcos6_ORF7. ep)
MEPARSARRALLFRGKRRRDGDRPEGEVHLWDEPGAGARPVPGGELEVAVPRPVGHRANDVG QVGLRVEPVQLAARHEREEVGRGGGVWAAEEEPGLSRNRNGSQRSLAGWLQAQPTWEEAP QRLLLVEGVAERGGDQAALAGTSVLGPGPGKERVDQRADDDIATHLALRGRQVRQRAIGGKEG VDAPQPLDTELVLADRRLPEVGTAMRPASDFGRGRTALLVGSSAS CAAEEGVIDGVSVSLN IPREAAEHLADRGAGMLGLVLEEDVILVREDDEEVSLAARPGLLVLALLDGPRLDGDPGGVRG QAEGGLEGLFARGLDDGAQGRTDVFGVAAHRAAIERRALKLQLLLQSIERDAEQVLAGEDVRN HRRREERPPEQLLRKRRRDEDRAGLALRRRRRRVERGRVERCGGSARLAGRCRVLRADARSGR RRRGFSRDDLERGRFLLARTARIRDGPLGLVHDACEHDAAHSAALVAESRGLLDPDARRRRIE LGAGELDADDGQVGLGQVATALAAGPSFRPLGVWLPLVACIALGGVECLREALDLEEHRVER QLLRAHLGALRLGHEQSTSQKLQLLERTGVGAPKLRERIARSHELALCRGELRFELRDALLLG RERRLGGGWHDAALRKPRVLVERIRAAQ TRRGARLRGLDVGAISMPSSSVSSVASSTSTCV APSVGGSGSANAPRSRRFENRQIPLPSKKRILIWRRLPTNANSAPQRASYPTRSRMRPESRS KP
pEPOcos6_ORF7.1 sequences:
(1) nucleotide sequence Seq ID No 49 ( >pEPOcos6_ORF7.1. seq)
ATGTTCCTCGAGGTCCAGCGCCTCGCGAAGGCACTCGACGCCGCCGAGCGCAATACAGGCGAC GAGCGGCAGCACGACGACCCCAAGGGGCCGAAACGAAGGTCCGGCGGCAAGGGCCGTCGCGAC CTGTCCGAGTCCGACCTGCCCGTCGTCCGCATCGAGCTCTCCTGCCCCGAGCTCGATGCGACG GCGACGCGCATCGGGGTCGAGGAGACCTCGCGACTCGGCTACGAGCGCGGCGGAATGCGCCGC ATCGTGCTCGCACGCGTCGTGTACAAGGCCGAGCGGTCCGTCACGGATGCGAGCAGTTCGGGC GAGGAGGAAGCGGCCCCGCTCCAGGTCGTCGCGAGAGAAGCCCCGACGCCGACGCCCGCTGCG AGCGTCAGCCCGGAGGACACGGCAGCGGCCGGCGAGCCGAGCGCTTCCTCCACAGCGCTCGAC GCGCCCTCGCTCGACGCGCCGCCGCCGGCGCCGAAGGGCGAGACCTGCACGGTCTTCATCACG ACGCCGCTTCCGAAGGAGTTGTTCCGGCGGTCGTTCCTCGCGCCGTCGATGATTGCGCACATC CTCACCAGCAAGTACCTGCTCGGCGTCCCGTTCTATCGACTGGAGCAGCAGCTGGAGCTTCAG GGCGCGTCGCTCGATCGCGGCACGATGTGCCGCTACGCCGAAGACGTCGGTGCGACCCTGGGC GCCATCGTCGAGGCCGCGCGCAAAGAGGCCTTCGAGACCGCCTTCTGCCTGTCCACGGACGCC ACCGGGGTCTCCGTCCAGCCTGGGCCCATCCAGGAGCGCAAGGACAAGAAGCCCGGGCCGTGC CGCAAGGGACACTTCTTCGTCGTCC7CGCGGACAAGGATCACGTCTTCTTCGAGTACCAGCCC AAGCATACCAGCGCCGCGGTCTGCGAGATGTTCCGCGGCTTCTCTCGGTATATTCAAGCTGAC GCTCACGCCATCTATGACGCCCTC7TCCGCGGCACACCACCAAGAGGCGCTGCTGCCGACGAG AAGCGCGGTCCGCCCCCGACCGAAGTCGGATGCTGGTCGCATTGCCGTACCAACTTCTGGGAG GCGGCGGTCTGCAAGCACGAGCTCGGTGTCGAGGGGCTGCGGCGCATCAACGCCCTCTTTGCC GCCGATCGCGCGCTGGCGGACCTGCCGCCCGCGCAGCGCAAGGTGCGTCGCGATGTCGTCGTC CGCCCTCTGGTCGACGCGTTCTTTGCCTGGGCCAGGGCCGAGCACGCTCGTCCCCGCGAGCGC GGCCTGGTCTCCACCGCGCTCGGCTACGCCCTCAACCAAGAGCAGCCGTTGCGGCGCTTCCTC GACGACGGTCGGCTGCGCCTGGAGAACAACGCCAGCGAGCGAGCGCTGCGATCCATTGCGGTT GCGAGAAAGTCCTGGCTCTTCTTCGGCAGCGACGACCACGCCTCCGCCGCGGCCAACCTCTTC TCGCTCGTGGCGAGCTGCAAGCTGCACGGGCTCGACCCGGAGGCCTACCTGGCCGACGTCATT CGCGCGATGCCCTACTGGCCGCGGGACCGCTACCTCGAGCTCGCCCCCAGGTACTGGGCGCGC ACCCGCGCCCGGCTCGTCGACGACGAGATGAACCTCGCCCTCGGGCCGATCACCGTCCCGCCT CCGCTTCCCGCGGAAGAGCAGCGCGCGACGAGC
(2) peptide sequence Seq ID No 50 ( >pEP0cos6_0RF7.1. pep)
MFLEVQRLAKALDAAERNTGDERQHDDPKGPKRRSGGKGRRDLSESDLPWRIELSCPELDAT ATRIGVEETSRLGYERGGMRRIVLARWYKAERSVTDASSSGEEEAAPLQWAREAPTPTPAA SVSPEDTAAAGEPSASSTALDAPSLDAPPPAPKGETCTVFITTPLPKELFRRSFLAPSMIAHI LTSKYLLGVPFYRLEQQLELQGASLDRGTMCRYAEDVGATLGAIVEAARKEAFETAFCLSTDA TGVSVQPGPIQERKDKKPGPCRKGKFFWLADKDHVFFEYQPKHTSAAVCEMFRGFSRYIQAD AHAIYDALFRGTPPRGAAADEKRGPPPTEVGCWSHCRTNF EAAVCKHELGVEGLRRINALFA ADRALADLPPAQRKVRRDVWRPLVDAFFA ARAEHARPRERGLVSTALGYALNQEQPLRRFL DDGRLRLENNASERALRSIAVARKS LFFGSDDHASAAANLFSLVASCKLHGLDPEAYLADVI RAMPYWPRDRYLELAPRYWARTRARLVDDEMNLALGPITVPPPLPAEEQRATS pEPOcos6_ORF7.2 sequences:
(1) nucleotide sequence
Seq ID No 51 (>pEPOcos6_ORF7.2. seq) ATGATTCCGGCGGGCGTGCAGGTGTTCGTCGCGCTGGAGCCGGTGGACATGCGCTACGGCTTC GAGCGGCTTTCGGGTCTCATCCGTGAGCGTGTGGGGTACGAGGCTCGCTGCGGAGCGCTGTTC GCGTTCGTCGGCAAGCGGCGGACCACGATCAAGATTCTCTTCTTCGATGGCAGTGGAATCTGC CTGTTTTCAAAACGCCTTGATCGCGGAGCGTTCGCACTTCCCGACCCCCCGACCGAGGGCGCC ACGCACGTCGAGGTGGACGACGCCACGCTCGAGACGCTGCTCGACGGCATCGAGATCGCGCCC ACGTCCAAACCGCGAAGGCGCGCACCGCGCCGCGTCCAC
(2) peptide sequence
Seq ID No 52 ( >pEPOcos6_ORF7.2.pep)
MIPAGVQVFVALEPVDMRYGFERLSGLIRERVGYEARCGALFAFVGKRRTTIKILFFDGSGIC LFSKRLDRGAFALPDPPTEGATHVEVDDATLETLLDGIEIAPTSKPRRRAPRRVH
pEPOcos6_ORF7.3 sequences:
(1) nucleotide sequence Seq ID No 53 ( >pEPOcos6_ORF7.3. seq)
ATGACAAGGACGAAGGCGACCGAAGTGATGTGGTCCGAGCGCGTTCGGGCGTGGCGCGAGAGT GGTGAAACGGCGGAGGAGTTCGCTCGGAGCCGCGGATTTGCGGCCTCGACGCTGCACGGCTGG TCGAGCCGGCTGTCGCGGGCCGAGCCACCGCGCTTTCTGCGCCTGGTGCCGAAGGCGCCCGCC GTGACGAGCAGCGCTGCGGAGCTCGTCGTCGAGGTCGGCGGCGCGCGGGTGCGCGTCGCCGCG GGGTTCGACCCCGCGCTGCTGGCGGAGGTGGTCCGTGCCCTCGGCGGAGCGGGGCGA
(2) peptide sequence
Seq ID No 54 ( >pEPOcos 6_ORF7 . 3 . pep ) MTRTKATEVMWSERVRAWRESGE7AEEFARSRGFAASTLHG SSRLSRAEPPRFLRLVPKAPA VTSSAAELWEVGGARVRVAAGFDPALLAEWRALGGAGR pEPOcos6_ORF8 sequences:
(1) nucleotide sequence Seq ID No 55 ( >pEPOcos6_ORF8. seq)
ACTGGACAGCGCAGCCGGGGTGAGACGGCGCTTCGCGCAGCGCTTACGCAGAAGGCGCGCCGC GCGCCATTGTCGGATGCGGTGCGCGACTTCGCCGCCGATCGGCTGTTGCTGGAACTGGGACAA CCACTGGACGTAACGGCTGAAGCGAGCCAACGGCTCCAGCTCGCGCGGGGCGACCTGTTCGGC GCCTACCAAGCGTTGGCCCAGCTCTGGATCTGCGGCGCCCTGGCCGAACCGCCGCGACTGTAT CCCGACGAACACCGCCGGCGCGTGCCGCTGCCGAGCTACCCCTTCGAGGGAAAGCGGTTCTGG ATCGAGGGCTCGCCGTTCGAAACCGCGCCCGCCGCCGGCGCCTCACCCCAACCCGCCGATTCG GGGGACATTCTCAAGGGCGACCCGGCGGACTGGTACTATCGGCCGCGTTTCGAAGCGGCGCCG CTCTTGCCCAGCCCGTTCGAGAGCGAACCCGGCGATTGGCTGGTGTTCGAAGATGAGCTGGGG CTCGGCGCCTGGCTGAGCGAGACCTTGCGCGACAAGGGCGCGCGGGTCGCGACAGTCGTTCGA GGCACCGAGTTCCGACGCCTGGCGTCACAGCGCTTCCAGCTTCGTCCCGATCGACGGGACGAT TACCGGACCCTGCTGCACGAGTTGAAGGCGCAGGGCATCGCGCCGGTCCACCTGTGCCACCTA TGGAGCGTGACCGCCGCACCGGATGCCGAGCAGTTGCTCGACGTCAGCTTTCACAGCCTGGTC CA7TTGGCGGCCGCTTTGGGTTCGGTTGGCTACTTCCACGCCATG
(2) peptide sequence
Seq ID No 56 ( >pEPOcos6_ORF8. pep)
TGQRSRGETALRAALTQKARRAPLSDAVRDFAADRLLLELGQPLDVTAEASQRLQLARGDLFG AYQALAQL ICGALAEPPRLYPDEHRRRVPLPSYPFEGKRF IEGSPFETAPAAGASPQPADS GDILKGDPADWYYRPRFEAAPLLPSPFESEPGD LVFEDELGLGA LSETLRDKGARVATWR GTEFRRLASQRFQLRPDRRDDYRTLLHELKAQGIAPVHLCHL SVTAAPDAEQLLDVSFHSLV HLAAALGSVGYFHAM pEPOcos6_ORF9 sequences:
(1) nucleotide sequence
Seq ID No 57 ( >pEP0cos6_0RF9. seq) ATGAAGTTGAACGTGGTCGCCAACCGGCTATTCGACCCCGAGTCGCCCGAGCGCACCGAGCCC GCCAAGAGTCTGTTGCTCGCGGTGACCAAAGTCCTGCCGCAAGAGGTGCCCAACGTTCGAACC CGCGCCATCAGCGTGGACCTGGATCGCTCGTTCGACGCGGCGGCGCCCGCCTGGGCCGCCAGT TTGTTGGTTGAATGCGGCGCGCCCGTCGAGGAAACGGTGGTGACCTACCATGGCGCAGCCCGA TGGCTGCGCCGCTTCGATCGCGTTGCGGTGAATGGTCTCGGCCCGTTCCACCCCGATCAACCT GCGCCGCTGCTGCGCGAGCGCGGCGTGTACCTGATCACCGGCGGCCTGGGCGGCGTGGCTGGC CAGTTGGCGCGCTACCTGGCGCGGGCCTGCCGGGCGCGGTTGGTGCTCACCGCGCGCCGGCCC CTGCCCGAGCGCGACCAGTGGGATCGGGAGTCGGCCGTGCTGTCATGGGACGACAAGACGCGC CAGCGCATCGAGCTGGTGCGCGAGCTGGAGCGGCTGGGGGCCGAAGTATTGGTGGTGGCTGCC GATGTCGCCGACGAAGCGGCCATGGCGCAGGCGATCGAGGCCTCACTGGCGCGATTCGACGCT TTGGACGGCTTGATCCACGGCGCCGGGATCGTGCGGGTCGCGTCGGGCCGCACGCCGATCGGG AGTATGACGCGGGCCATGTGCGAGGAGCAGCTCCGCCCCAAGATGTTGGGCCTCGACGTCGTC GACCGCCTCCTGCGCGATCGCCGGTTGGACTTCCGCATTGCCATCTCGTCGCTCGCCCCGATT CTCGGCGGCCTCGGCCACGTCGCCTACGCCGCCGCCAACCTCTACATGGACGCGTTCGCGACG CGCGCCGCCGCCGGCAACGCGCCTTGGATCGCGCTGAACCTGGCCGAGTGGGAATACGAGGGC CCGGCTACCTACGACGAGCGGGTGGGCCGTTCGCTCAAGCAGCTCGAGCTCACCAACGAGGAG GGTATCCGCGTCTTCCAGACGGTGTTGGCCTTGGCCGCGCGCGGCCCGCTACAGCAGATCATT ATTTCCACCGGCGACCTCCAGGCCCGCCTCGACAAATGGATTCACATCAAATCCCTGCATCGC CGACCGGGGCCGGTCCAGCTCAGTCGCCGGACCGCGGCACCCCAGGGCGGTTTCGGCTCGGAG CGCGCCGCCTTCGAGGCCGCCTTCGCTGACGCCTGGTGCGACTTCTTCGGGGTTGAAGAGGTC GACCCGAACAAAAACTTCTTCGATCTGGGCGCCAGCTCGCTCGACTTCATCCACCTCGTCAGT
CGCTTCAGCAAGGCCATCGAACAGCATGTACCGCTCGAGGCCCTGCTCGAACACTCCACCCTG CACGACCTCGCCGCCCACCTCGCGGGCGACGCGAACACCGACGCCAGCGACGAAGCGCGCATT CGCCAACGGCTGCAAGGCGCCAAG7CCGGCGACATCGCCATCATCGGCATGGCCGGCCGCTTC CCGCTCGCGCCCGACCTGGACACC7ATTGGCGCAACCTGGTCGGAGGCATCGACGCGGTCAGC TTCTTCAGCGCCGAGGAGTTGCG7GCTGCTGGCGTCACCGCGGCCGAGATCCACCACACCAAC TACGTGCCGGCCAAGGGGCGCTGCGCCGACCAGGACTTGTTCGATGCGGCCTTCTTCGAATAC ACTGCCAGCGACGCCGAGCTGATGGACCCGCAAAATCGCGTGTTACACGAGGTCGTGTGGCAC GCGCTGGAAGACGCCTGTTTCGAC7TCAACGGCGATCACGGCCAGGTCGGCCTGTTCGCGGGC GCCTCGCCGAACCTGTGGTGGCAGTTCGTGGCCAGCTTTTCCGAGGCCGCCAAGACGCAGGGC ATGTTCACCACCACCCTGCTCAACGACAAGGACTCGATCGCGACCCAGATTTCATACAAGCTC GGTCTAAAGGGCCCCGCGGTCACCTTGTTCACCGGCTGTTCCACCTCGCTGGTAGCCGTTGAC GCCGCCTGCCGCTCGATCTGGTCCGGTCAATCGGACATGGCCGTGGCCGGCGCGGTCTCGCTG ACTCTCCCCGATAAGGCCGGCTACATCTACGAAAAGGGCATGCTCTTCTCGGCCGACGGCCAT TGCCGGGCTTTCGACGCCAACGCCACCGGCATGGTCTTCGGCGACGGCGCCGGCGCGATCGTG CTCAAGCCGTTGGACGCGGCCC7GCGCGACGGCGACCCGATCCATGCGGTGATCAAGGGCTGC GCCACCAACAACGACGGCGACCGCAAAGCCGGCTACACGAGCGTCAGCGCCCAAGGCCAGGCC GAGGTGATCCGCTCGGCCCAGATCCTGGCCGACGTGGCGCCCGAATCCATCAGCTACGTGGAA GCCCACGGTACCGGCACCAAGTTGGGCGACTCGATCGAGATCAAGGCGTTGAAGCAAGCCTTC GCCAGCGACAAGAACGGATTTTGCGGCATCGGGTCGGTCAAGACCAACCTCGGTCACCTGATG GCGGCGGCGGGGATGGCCGGCCTGATCAAGACGGTTCTGGCGATGAAGCACCGCCAATTGCCG CCATCGCTGCACTGCGACGAAGTGAACCCCGACCTGGAGTTGGAGCGCAGTCCGTTCTACATC AACACCCGCCTGCGCGACTGGGTTGCACCGGGCGGGCCGCTGCGGGCCGGCGTGAGTTCGTTC GGGATCGGCGGAACCAACGCTCACGTCATCCTGGAGGAGCCGCCGACGCGCGAGAGCGGCACG CGCATGCGCCACTGGAAATTATTGATGCTGTCGGCGGCCAGCGAGGCGGCGCTCGACCGCCAG GCCGATAACCTGGCCGACTACCTGGAGCGCCATCCCGAGGCCCACCTCAGCGACGTGGCCTAT TCCCTCCAGACCGGCCGGCGCG77CTGGCCTGGCGGCGCACGGTCCTATGCGAGTACCGCGAG GACGCGGTGACCAGTCTGCGCGAGCGACAGGCCAAGCGCGTCCAGACAAGTCGCGTCCGCTGG GACCACAAGGACGTGGTCTTCATG7TTCCCGGTCAGGGCGCCCAGTACCTCAACATGGGCCGC GACTTATACGTCATGGAGCCGGTC7TCCGCGAGGTCATGGACCGCTGCTTCGAGTTGCTGGCC CCTTTGTGGTCCGAGCATCCGCGCCAGATCCTTTATCCGGAGGGCGGGGTGTCGACCCTGCTC
CACCGGACTGATTACACCCAGCCGATCGTGTTCTGCTTCGAGTACGCCCTCGCCCATTTGCTG CTCTCCTGGGGATTGAAGCCGGCCGCGACCATCGGCTACAGCTTCGGCGAGTACGTTTCTGCC TGCCTCGCCGGCGTCTTCTCCC7GGAAGATGCGATCCGTCTGGTGACCGAGCGCGGTCGGCTG ATGGCGGCTTTGCCCGCGGGCGCCATGCTCAGCGTCCCGGTTCCCGAA7GCGAGCTGCTGCGG CTGCTGGACGGCTTCCACGCCCAA7CGGCGGCCCATCTGGCGCTGGCCG7CGACAATGGCGCC TCCTGCATTGTGGCCGGCGAGCAGGCCGCCATCTCGGCCTTCGAATCGATGCTTCGCAAGAAG CGTCTGTTGACCATGCGGGTCGCGGTCAGCCACGCCGCTCATTCGCAGGTCATGACCGGCGCG ACCGACGCCCTGCGCAGCATCCTGCGGAAGATCCCCCTCTCCGCGCCGACAATTCCCTTCATT TCCTGCGTCACCGGCACCTGGATCACTGCACAGCAGGCTACGGATCGCGAGTATTGGGTGAAC CACATGTGCGGGACGGTGCGGTTCGCGGCGGGTCTGACCGAGCTGGGTCAAAACCGCGAGGCG GTGTTCCTGGAAGTAGGTCCGGGCCGCGACTTGACGTTGCTGGCCCACCGCATCCTGGCCGAC AGCGCGGCCGTGTTCGAGCTGGTCAAGGCGCCCGACGGCGGCGACGACGATGGGTTCCTCCTG CTGGATCGATTGGCCAAGCTCTGGAGGCTGGGGATTTCGATTGACTGGGCCGGCTTCTACGCG GATGAGCGGCGGCGGAAACTCTCGCTGCCGGGATATCCGTTCGAGCGGCGGCGCTTCTGGATC GAGGGCAACCCGCTGGAGATCGCCGCCGGCAGGCCCAATGTCCAGGGGCCGCTGGTCAAGGCG TCGGACATCGGCGCTTGGTTCTACGTGCCGCAATGGCGGCGGTCGGTGCTCGCCGAGCCGGGT ACAACGGCGGCGGGCGCCGCCGTCACGGCGGAGCAGGCACGCGTCGTGACCGAGCTACGGGCG GGATGCGCGTCGGCCGGCTTGGGCAGCGGGGCCTGCGGACTGAATGGCGGTGCCCCGTCCGAG CGTCCGAAAGAAAGTGTAGCGCCAGCCGGGTCGACCAGCGCAGCGGCGCAGACCGGCGCGGAC TGCCCGACACCGACTGGGGAGCCAGCGGCTGTGCCAAAGGACGGGGCCGAGCCGCGGCCGACC TGGCTTATTTTCGCCGACGCCGGCGGATTGGCCGAATCTTTCGCCAAGCGGGTTCAGGCCCGC GGCGAGAAGCTTTACCTGGTGGCTTCCGGCTCGCGCTTCGAGCGCCTGGCCGAGACCCGCTTC CGCCTCGATCCCGGGGCCAAGTCCGATCACCGCCTGCTTTTCAAGGCGCTCGACGAGGCCGAC ATCCTGCCGACCCACCTCCTCGACTTCCGCTCGCTTGACTGCGGCGGGCCCGACGCCGACCCC A7GGACCAGGCCGGCTTCTTCGGGCTGTTGCACCTGGTCCAGGCGATGGCAGAGGCCGGCTAC AGCCATCCCATTCGGCTGCTGATCGTCAGTTGCGGCGTCTACGATGTCACCGGTGCCGAACCG CTGCAGCCGGCGCGGGCCACGATGATCGGACCGGCTCTGTGCATCCCGCAACAGTATCCGCAC CTCGAAACGAGCCATGTGGATTTGGGCGTGGTCCATGCCGACGAGCTCCACGCCGCGCGCCAG C7CGACAGCCTACTTGCCGAATGCCTAAGTGCAACGGCCGAGCGCCAATTGGCGCTGCGCGGC CGACACCGCTGGCTGCTGGACTACGAGCCAGTCCGCTTGCCGCCGCTCGACCCGGGCCGTCTG
CCCTGGCGCCAGCGCGGGGTCTAC7TGATCACCGGCGGTTTGGGCGGGATCGGCCGCATCCTG GCCGAACACCTGGCCCGCACGACCTCGGCTCGCCTGGTCCTAATCGGCCGCGAAACCCTGCCC GACCGCGACGACTGGGACGCCTGGCTGAACCGCCCGCAACCGGTCGACGCCACCCACGAACGG C7GCTGCACAAGATCCGCGCGA77CGCGATCTGGAAGCGCTAGGCGCCGAAGTCCTGGTCCTC GCCGCCGACGTCGCCAACGAAGCCGCCATGCGCGAGGCCTACGATCGCGCCGAATCCCACTTC GGCACAATCCACGGGGTGATTCACGGCGCCGGCCTGATGGACGCGCAAAGCTTCTCACTGATC GACGCCCTCGACCACGACCTCTGCGCCCGCCAGTTCGAAGCAAAAATCCGCGGCGTCTGCGTG CTCGACCGCGTTCTGGCCGACCGCACGCTCGACTTCTGTCTGCTGATGTCTTCCATCTCCACC GTGCTCGGCGGCCTGGGCTATTTCGGTTACGCCGCGGCCAACGCCTTCCTCGACGCCTTCGCC CAGGCGCGCAGCCGCGACGCCGCTTTCCCCTGGCTTAGCGTGGCCTGGAGCGATTGGAAGTAC TGGACCGAGCGCAAGATGGACAACGAGGTCGGCGCCGTCATCGACAGCCTCTCGATGGAACCC GCCGAGGGCTTCGAAGCCGTCACCCGCGTCTTGGCTTGGGGCAAGGCGCCCCACATCGCCAAC TCGCCCGGTGACCTCGGTCGCCGCCGGGATCAATGGGTCAAACTGGCCAGCCTGAAATCGGCG CACTCCAGCGAGCCCGAGCCGGCTAGGCATGGACGTCCGGCGCTCTCCAGCGAATGGGTCGCG CCGCGCAACGTGGTCGAAGAGAAGCTGGTCGCCATTTTCGAGCAGGTGTTCGGCACTGCGGCA CTGGGCATCGAGGACAACTTCTTTGAGTTGCGCGGCGACTCGCTCAAGGCGGTCATGACCGCG GCCCGTATTCAAAAGGAGCTGAACGTGGAAGTGCCGCTGCCGACCTTCTTCCAGATGCCCACG GTCGCTGGCCTGGCCCAGTTCGTGACGCAAGCCAAGCGCAGCGGCCGGGAGACGATTCGGCGC ACCGCGCCGCGCCCACATTACCCGCTCTCGGCTGCCCAGGGCCGCCATTACCTGCACTACCGC ATGGACCCGCGTTGTACCGCATACAACGATCCCTTCGCCAACCTGATCGAGGGTCCGCTGGAC GTGGATCGCGTGGAGCGCATCCTGCACACCCTCATCCTACGCCACGACTGCTTCCGCACCTCG TTCCACTTCCGCGAGGGCGAGCCGGTCCAGGTGATTCACGATCGGGTGGACTTCAACCTGGCG CGGATTACCTGCGCGCCCGAGGATTTGCCCGAACGGATGCGCGATTTCATCCGCTCCTTCGAT CTGGAGCGACCGCCCGCCATGCGCGCCGGCCTCTTCGTCACGGGGCCCGAGCGCCACGTGCTG CTAATCGATTTTCACCACATTATCACCGATGGCGTGTCGTTCGAGAACTTCGTCGGCGAGTTC GCGGCGCTCTACCGCGGCGAGATCCTGCCCGAGCTGGAACTCGAGTACAAGGATTTCGCGGTG TGGCAGCATGAGAACCGGGGCCGCCGCGCCAACAGCGACCAGGCCCGCTACTGGACCGAGCAG TTGGCCAATGCGCCCGGGCCGATCGAGCTAACCACCGATTTCCCCCGTCCCAGTCGACGCAGC TTCCGCGGCGACCGCGTGCGGACCGTGCTTGATGCGGAGCTCGTTGCTCGACTCAAAGAGCAC GCGGCGCGCCTCGGCATCACCCTCTATAGCCTGCTGCTGGGCGGATTCTCGTTATTGCAGCAC
AAGCTCTCCGACTCGCACGACATCGTCATCGGTTCGCCCGTCGCGGGCCGCACCCGGAGCGAA CTCCAGGATCTGCTGGGCGCGTTCGTCAACACCCTGCCGATGCGCCACCGCATCGACCCGACC CATACCGCACGGGTCTTCTTGGAGCAGGTCCACCAGACAACCTTGGCGGCCCTCAGCTACCAG GAGCACCCTTTTGACGAAATGGTGGCGACGCTCGGGTTCGCCGCCGATCCGGCTCGCAACCCG A7CTTCGACACGATGTTCTTGCTGCAGAACATGGCCATGGGTGCAACCACCATTCCCGGTCTG CGGCTCTCGCCTCACGACACTTTTCACCGCAAGGCATTGTGCGACCTGATGCTACAGGCGACC GAGTATGACTGCCACCTGGAGCTGGTGCTCGAGTTCGCCACCGACCTGTTCCGGCTGGAAACC GCGCAAGTCTTGCTCGACCGCTACCGCCAAGTCTTGGAGTGGCTGTTGGCGTACCCCCATGAA TCGATAGACGATTTGACGCTCGCCGGCCACTTTCGCGAAGTCGAAGTGACGATGTCGGACGAG GGCGACTTTGATTTCTCAGATTTCGAACCCCGCAACGTGAGAAACCTATGGCGCGCC
(2) peptide sequence
Seq ID No 58 (>pEPOcos6_ORF9. pep) MKLNWANRLFDPESPERTEPAKSLLLAVTKVLPQEVPNVRTRAISVDLDRSFDAAAPA AAS LLVECGAPVEETWTYHGAARWLRRFDRVAVNGLGPFHPDQPAPLLRERGVYLITGGLGGVAG QLARYLARACRARLVLTARRPLPERDQ DRESAVLS DDKTRQRIELVRELERLGAEVLWAA DVADEAAMAQAIEASLARFDALDGLIHGAGIVRVASGRTPIGSMTRAMCEEQLRPKMLGLDVV DRLLRDRRLDFRIAISSLAPILGGLGHVAYAAANLYMDAFATRAAAGNAP IALNLAE EYEG PATYDERVGRSLKQLELTNEEGIRVFQTVLALAARGPLQQIIISTGDLQARLDK IHIKSLHR RPGPVQLSRRTAAPQGGFGSERAAFEAAFADA CDFFGVEEVDPNKNFFDLGASSLDFIHLVS RFSKAIEQHVPLEALLEHSTLHDLAAHLAGDANTDASDEARIRQRLQGAKSGDIAIIGMAGRF PLAPDLDTY RNLVGGIDAVSFFSAEELRAAGVTAAEIHHTNYVPAKGRCADQDLFDAAFFEY TASDAELMDPQNRVLHEWWHALEDACFDFNGDHGQVGLFAGASPNL QFVASFSEAAKTQG MFTTTLLNDKDSIATQISYKLGLKGPAVTLFTGCSTSLVAVDAACRSI SGQSDMAVAGAVSL TLPDKAGYIYEKGMLFSADGHCRAFDANATGMVFGDGAGAIVLKPLDAALRDGDPIHAVIKGC ATNNDGDRKAGYTSVSAQGQAEVIRSAQILADVAPESISYVEAHGTGTKLGDSIEIKALKQAF ASDKNGFCGIGSVKTNLGHLMAAAGMAGLIKTVLAMKHRQLPPSLHCDEVNPDLELERSPFYI NTRLRD VAPGGPLRAGVSSFGIGGTNAHVILEEPPTRESGTRMRH KLLMLSAASEAALDRQ ADNLADYLERHPEAHLSDVAYSLQTGRRVLA RRTVLCEYREDAVTSLRERQAKRVQTSRVR DHKDWFMFPGQGAQYLNMGRDLYVMEPVFREVMDRCFELLAPL SEHPRQILYPEGGVSTLL
HRTDYTQPIVFCFEYALAHLLLS GLKPAATIGYSFGEYVSACLAGVFSLEDAIRLVTERGRL MAALPAGAMLSVPVPECELLRLLDGFHAQSAAHLALAVDNGASCIVAGEQAAISAFESMLRKK RLLTMRVAVSHAAHSQVMTGATDALRSILRKIPLSAPTIPFISCVTGT ITAQQATDREYWVN HMCGTVRFAAGLTELGQNREAVFLEVGPGRDLTLLAHRILADSAAVFELVKAPDGGDDDGFLL LDRLAKL RLGISID AGFYADERRRKLSLPGYPFERRRF IEGNPLEIAAGRPNVQGPLVKA SDIGA FYVPQ RRSVLAEPGTTAAGAAVTAEQARWTELRAGCASAGLGSGACGLNGGAPSE RPKESVAPAGSTSAAAQTGADCPTPTGEPAAVPKDGAEPRPT LIFADAGGLAESFAKRVQAR GEKLYLVASGSRFERLAETRFRLDPGAKSDHRLLFKALDEADILPTHLLDFRSLDCGGPDADP MDQAGFFGLLHLVQAMAEAGYSHPIRLLIVSCGVYDVTGAEPLQPARATMIGPALCIPQQYPH LETSHVDLGWHADELHAARQLDSLLAECLSATAERQLALRGRHR LLDYEPVRLPPLDPGRL P RQRGVYLITGGLGGIGRILAEHLARTTSARLVLIGRETLPDRDDWDA LNRPQPVDATHER LLHKIRAIRDLEALGAEVLVLAADVANEAAMREAYDRAESHFGTIHGVIHGAGLMDAQSFSLI DALDHDLCARQFEAKIRGVCVLDRVLADRTLDFCLLMSSISTVLGGLGYFGYAAANAFLDAFA QARSRDAAFP LSVA SD KY TERKMDNEVGAVIDSLSMEPAEGFEAVTRVLA GKAPHIAN SPGDLGRRRDQ VKLASLKSAHSSΞPEPARHGRPALSSE VAPRNWEEKLVAIFEQVFGTAA LGIEDNFFELRGDSLKAVMTAARIQKELNVEVPLPTFFQMPTVAGLAQFVTQAKRSGRETIRR TAPRPHYPLSAAQGRHYLHYRMDPRCTAYNDPFANLIEGPLDVDRVERILHTLILRHDCFRTS FHFREGEPVQVIHDRVDFNLARI7CAPEDLPERMRDFIRSFDLERPPAMRAGLFVTGPERHVL LIDFHHIITDGVSFENFVGEFAALYRGEILPELELEYKDFAVWQHENRGRRANSDQARYWTEQ LANAPGPIELTTDFPRPSRRSFRGDRVRTVLDAELVARLKEHAARLGITLYSLLLGGFSLLQH KLSDSHDIVIGSPVAGRTRSELQDLLGAFVNTLPMRHRIDPTHTARVFLEQVHQTTLAALSYQ EHPFDEMVATLGFAADPARNPIFD7MFLLQNMAMGATTIPGLRLSPHDTFHRKALCDLMLQAT EYDCHLELVLEFATDLFRLETAQVLLDRYRQVLE LLAYPHESIDDLTLAGHFREVEVTMSDE GDFDFSDFEPRNVRNLWRA
pEPOcosδ ORF10 sequences
(1) nucleotide sequence
Seq ID No 59 ( >pEPOcos6_ORF10. seq) ATGGCGCGCCTGAGCCGCACAGA7CTCCAACTCGCCATTCACCAGCGCACCGTGGAGCGCGAA
TAT7GGCGCGCTCTGTTCGAGCGCCATCCGCAACGGTCCAGTTTGCCGGGGGTGCTCACCGCC CCGATCGGCGACGAGTCGACCCGCGAGACCTTGTCATTCGTCCTCGACGAAGATCCCCTTCGG CTGAGTAATCGTTCGCCGCAACGCC7GCTCACGGTGTTGGCGGCTGGCCTCGCGGCTTTCCTC CACCGCTGCGACGGCGCTGAGCGC7TCACCCTGGGGTTGGCCCTACCGCGCCAAGCCGATGAC CATCACCCGATCCTCAACAGCT7GA7CGCGCTGGGGGTCGCGGTCGACTCGAGTACGACCTTC CGCGATCTGCTCTATGCGCTTCGATCCGAATACCACGAGGCGATGCGCCACGCCAACTTTCCG CTGGCGACCTGGTGGCGCGGCCTACCCGGCGGAACGGCGCCGTTCGACGTCGCCCTCAGCCTG GACCCCTTCACAGACGGCGATTCGCTGGAAGACCACGCGATCGGCGCGTTGTTCCGGTTCGCA TTGGAGGGTGAGCGCCTCACCTGCCGATTGCGATTCGACCCTGCGCGCTATGACCGTCCCGCG ATCGAAAACCTCGCCGATCGTTTCGCCCGCTTCCTCACGCGCCTGTGCCGGGACGCCTCCACC GTCATCCAGGCGCTGGACCTTTCGCTGCCAAGCGATGAATCGGTGTGGCGCGTCACTGAAGGC GTGCGGCGCGGCTATTCGCAAGACCTGACGCTAGACCGCGCGTTCCGCCGCCAGGCCGCGCAA ACGCCCGATCAGCCGGCGATCACGTTGAACGGGGACGTCCAGAGCTACGCCGAGGTCGACCGC CGCAGCGACGCGCTGGCCCGCCACCTCCGTCGCCACGGCGTCGGTCCGGAAACGATTGTGGCC GTCAACGCCCGGCGCGGGCCTAATCAGCTGACGGCCCTGCTCGCGGTCCATAAGGCCGGCGGC GCC7ACCTGCCGATCGATGCCGAGGAGCCGGCTGCCCGCCAGCAATTCAAGGTGCGCGACAGC GGGGCGCGGTTGGCACTGGAGCCGTCGCCGGACCAGGCGCTGACCGTCACCGACCTGCCGCGG CTCTTCCTGGACGATGCCTCGCTCTTCGCTGACGGCGGGCTCGATGTGCCGCGCGGCGCCGAC TCGCTCAATCCGGCCTATGTGATGTACACGTCCGGCTCGACCGGACAGCCCAAGGGTGTGGTG GTTCCCCACCGCGGCGTGGTCAATCGTTTGAATTGGGGGCAGTCCCGTTTCCCGCTGGACGAA CGCGACCGAATCCTCCAAAAGACGCCGCTGCTGTTCGACGTGTCGGTCTACGAGCTGTTCTGG GGCGCATGGAGCGGGGCCACCCTGGACATCCTCGAGCCCGGCGCCGAGCGCGACCCCGACGCA GTGGCCAGGGCCCTGGCCGAGCGCGCCATTACCGTATGCCATTTCGTGCCTTCGATGCTGCTC GTCTACTTGGAAGTCATGCGGCGGCACCATGCGCCGCCCGTGCCGGACCGGCTCCGTTACGTC 7TCGTCAGTGGCGAGGCCCTCGAACCGGACCACCTCGCCGGGCTCCAGCAGATTGGTCGGCGC CTCGGCCGCACGATTCCCCTCGTTAATCTGTATGGACCAACCGAGGCCTCGATCGAAGTCTCC TGCTTCGCCTGTCCCGCCGACCATGTGCCGCGCCGGATCCCCATCGGGCAGCCGATCGACAAC GTCGCACTGCACGTTCTCGACCGGCGCGGCCGTCGCCAGCCGCCCTATCTTCCTGGCGAGCTG TTCCTGGCCGGCGACTGCCTGGCGCGCGGCTACCTCAACCGTCCCGACCTGACCGCGCTCCAC TTCGTGCCCAATCCCTTCGGCAACGGCGAGCGCATGTACCACAGCGGCGACTTGGCGCTCGTG
CGCGGCGACGGCCAAGTGGCGTT7CTCGGCCGCCGTGACCACCAAATCAAAATCCGTGGTCAA CGGGTCGAACTGGGCGAAATCGAGAGTCATTTGCGCGGGCTCGAAGGCATCGCCGCCGCCGTC GTCCAGGCCGAGTCGCAGCACCA7GAAACCCTGCTGCACGCCTACGTCGTCACCAACGACGCG GGCCTCAATGCGGCCCGGCTGCGCGCCGCCCTCGCTCAACATCTGCCCGAGTACATGATTCCC CAGCGCTTCTCGCGGCTGGCCGAGTTGCCGCTGCTGGCGGCAGGCAAGATCGACCGCGCCGCC CTCGCGCAACGTGCAACGCCGCTCGCCAGCGGCGCGCCCTTCGTGGAACCCAGCGGGCCCACC CAGCAGCGTATCGCAGAACTGTGGCGCCAGGTCTTAGCGGTCGCCGAAGTCGGCGCCGAGGAT CCCTTCTTCAGCATCGGCGGCAACTCGCTCAATGTGCTCAAGCTCAGCGCCGCGCTGAGCGAC GCCTTCGCGCGTGACATTCCCATGCCGGCCCTGTTCCAATACGACACCATCGCCGCCCAGGCC TCCTGGCTCGACGGGCAGGTTGACGAACGGGCCCAATCCGCCGCGCTCGACCGGCAGGCCGCC GAGGCGGCGCTGACCCTTCAAGAGACCGTGGCCATTTTTGAGGGATTCGATGACGAACCA
(2) peptide sequence
Seq ID No 60 (>pEPOcos6_ORF10.pep) MARLSRTDLQLAIHQRTVEREY RALFERHPQRSSLPGVLTAPIGDESTRETLSFVLDEDPLR LSNRSPQRLLTVLAAGLAAFLHRCDGAERFTLGLALPRQADDHHPILNSLIALGVAVDSSTTF RDLLYALRSEYHEAMRHANFPLAT RGLPGGTAPFDVALSLDPFTDGDSLEDHAIGALFRFA LEGERLTCRLRFDPARYDRPAIENLADRFARFLTRLCRDASTVIQALDLSLPSDESVWRVTEG VRRGYSQDLTLDRAFRRQAAQTPDQPAITLNGDVQSYAEVDRRSDALARHLRRHGVGPETIVA VNARRGPNQLTALLAVHKAGGAYLPIDAEEPAARQQFKVRDSGARLALEPSPDQALTVTDLPR LFLDDASLFADGGLDVPRGADSLNPAYVMYTSGSTGQPKGVWPHRGWNRLN GQSRFPLDE RDRILQKTPLLFDVSVYELF GA SGATLDILEPGAERDPDAVARALAERAITVCHFVPSMLL VYLEVMRRHHAPPVPDRLRYVFVSGEALEPDHLAGLQQIGRRLGRTIPLVNLYGPTEASIEVS CFACPADHVPRRIPIGQPIDNVALHVLDRRGRRQPPYLPGELFLAGDCLARGYLNRPDLTALH FVPNPFGNGERMYHSGDLALVRGDGQVAFLGRRDHQIKIRGQRVELGEIESHLRGLEGIAAAV VQAESQHHETLLHAYWTNDAGLNAARLRAALAQHLPEYMIPQRFSRLAELPLLAAGKIDRAA LAQRATPLASGAPFVEPSGPTQQRIAEL RQVLAVAEVGAEDPFFSIGGNSLNVLKLSAALSD AFARDIPMPALFQYDTIAAQAS LDGQVDERAQSAALDRQAAEAALTLQETVAIFEGFDDEP
pEPOcos6_ORFll sequences:
(1) nucleotide sequence
Seq ID No 61 (>pEP0cos6_0RFll . seq) ATGACGAACCATGACCATCACGAGGAGAGCAGCGGCCTGGAGATCGCCGTCATCAGCATGGCC TGCCGATTCCCGGGTGCTATGGCCTGCCGATTCCCGGGTGCTGCCGATTGCGACGCATTCTGG GAAAACCTGATCAACGGGACCTCC7CGATCACCCATTTCAGCGACGACGAGCTGATCGCGGCC GGCGTTGACGCGCGCGACCTGACGCCGCAGTACGTGCGCGCGGCCGGCCAGATCGATGACGCC GAACGGTTCGACGCGGCCTTCTTTGGGTACTCCCAGCGTGAGGCCGAGCTGATGGACCCCCAG TTCCGCCTGCTCCATGAATGCGCC7GGTCCTGTCTGGAACAGGCCGGCATCGATCCGCGCGTC GAAGCCGCGCCGATCGGGCTGTA7GCCGGCGCAGCCGACAACACCTACTGGAACGCGCTCTCG TCGCTCGACCGGGGCTCGGCCGAA7CGGAGCAATTCGCCGCCGAACAACTTTGCAACCGCGAT TTTCTGTGCACGCTGGTCGCCGCCGCGCTCAACCTGAAAGGCCCCGCGGTGGTGGTTCAAAGC GCCTGTTCGACCTCGCTGTTGGCGGTCCACTCGGCCTGTCGTGCGCTCCTGACCGGCGAATGC CGAGTGGCCTTGGCCGGTGGGGTGGCGCTGCGCTTCCCACGCCCGAGCGGTTATCGCTACGAA CCTGGCATGATCTTCTCGCCCGACGGGGTGTGCCGGCCGTTCGACGCGGGCGCTAACGGGACG GTGCCCGGCGAAGGCGCGGGGCTGGTAGCGTTGAAGACGCTGAAACGTGCCCTCCAGGACGGC GACACGATCCACGCCGTGATTCGCGCGACCGCGGCAAACAACGATGGTGCCCGCAAGACCGGG TTCACCGCGCCCAGCGCCCACGGCCAAGCCGAAGTCATTCGCACGGCGCTGCGCCTGGCCCGG GTGCCGGCCGAATCGATCGACTACGTCGAGGCCCACGGAACCGGCACGCCGCTAGGCGACCCG ATCGAGGTAGCCGGCTTGGTGGAG3CCTTCGCCAGCGAGAAGCGCGGCTATTGCCGGCTGGGC TCGGTCAAATCCAACCTTGGTCA7CTGGACACTGCTGCCGGCATCGCCGGCCTGATCAAGACC GTGCTGGCGCTCGAGCACGCGCACATCCCCAAGTCCTGCCACGTCGCCACGCCCAACCCCGCG GCGCGCCTACACAAGACGCCTTTC3GCATTGCCGCCGACGGGATGGCCTGGCCGCGGCGTATG GCGACGCCGCGGCGGGCGGCGGTGAGTTCGTTCGGCATCGGCGGCACCAACGTCCACGCGATT
TTGGAGGAGGCGCCGCCCCGCGCGCCCGAGCTGGCGGACGGGCGCAGTCAGGTGTTCGTCTTC TCCGCCAAGGACGAGGCGGCGC7GGACCG7GCCCTTGCCAACTATGGTGCGGCCTTGGAGAAG CGCGGCGACCTCGCGGCGGGCGCGGTGGCCTGGACGCTCCAAAACGGCCGGGCCGCATTCGAA TGGCGAGCCAGCGCGGTGGCATCCGACCTCGACGAATTGGCGGGCGCATTGCGCGGCGAGCGG CCCGGCGCCGTCAAGAAAAACCGAATGGCGCGCGAGGATAAGCCGGTGGCGTTCTTATGTTCG GGGCAGGGGAGCCAGTACCGTGGCATGGGCCACGACCTGTACCGCGAAGAGCCGCGTTTCCGG CACCACCTCGACGCCTGCCTCGCCATCCTCGCCGAACACAAGCCCGAGATCGACTGGCTGGCG TTGCTGGGCTACCGCGACGAGGACGAGCCAACCGACCAGATCGGGACGTCCTCGCAGGGCCCG AGCCGGTCAGCCGCATCGAACCCAGCGGAGCTCCTCGACAGCACCGAATTCGCCCAACCTTTG CTTTTCTCCATGTCCTACGCGCTCGGTCGGCTGTGGCTCGACTGGGGCGTGCGACCCACGGCG ATGATCGGGCACAGCCTGGGCGAGTACAGTGCTGCATGTATTGCAGATTTCTATGCACTCGAT CAGGTGCTGCCCTTCATTCTGACCCGCGGTCGAGTCATGGCGCAATTGCGGCGCGGCTCGATG TTGGCCGTCAGCGGTGACAGCGTTCTGATGCGCGAGCTGATCGCCGATGCGCTCGATTTGGCG GCGATCAACGGCGCTGACCAATTTGTCTGGAGCGGGCCGAGCGAGGCTGTCCAAGCCGCGGGG GTCCGACTGCGCGGCGCCGGCCTGCGTGCCACCGAGCTGAACACCTCACACGCGTTCCATTCA GCCATGATGGATCCCATTCTGGAGGAGCTAACGGTTGCCGGTTCGCGACTTCAGGTCGGTGTC GGGACGATTCCGGTCGTTTCATGCGTTACCGGAACCTGGTTGACGGCGAAGCAGCTGGCCGAT CCGCGCTACCACGCGCGTCACGCGCGCGAACCGGTGCGGTTCGCGGCGGGCCTAGCGACGCTG ACAGGGGAGGAGCCGCCGCTGATGCTCGAAGTGGGGCCGGGCTCGACCCTGGCGGCTTTGGCC CGCGAGCATTCGAATGCCCGCCTCCCGGTCGTCACCAGCCTGCGCCACGCTCGCCAGGCGACG CCCGATCGCCAATACCTGCTCGAAACGCTCGGCTGCCTTTGGCGACACGGGGTTTCCGTCGAT TGGGGGGCCCATGCCGGACGTTCGCGACGCTTGGTTTCGCTGCCCGGCTATCCCTTTTCCGGC GCGGTGCGCCGCTTAGCCGGCGACCCCCTCCGCCTGCTGGCCGGAGCCCGCGCCGTCGCCGCC CCGTCGGGAACGCGCCAACTCAGCGCCGACGCGCGCGACCTCCCGAACACTCCGGAGCCGACA TCCGGCGCCGTGTCGGCGATCAAAGCGCCAATCGCCGCCGCCGATCCCGGCCTCTATCGCCTC TCCTGGCGCCAGGCCGGAACGGCGCCGCTCGGTCCGCCCGATCTCGGTCCGCCCCGCGACTGG ATCGTCTTCGCCTCTGATTCTCACCTGCTCCAGGCGCTCAGGGCCAATCTCGGGACGCGCGCT CAGCGGGTGACGCTGGTGACGCCGGGCCAGGAGTACGCAGCCGAGCCGTCCGGGTTTCGGCTG CGGCCGGACCAGATCGACGATTACCGCGCCCTGTGGGCGGACTTGGCGCAAACCGGTATTGTG CCACGATACATCGCGTTCCTCGCCCCGTTCATGTACCGGGCGCGCATGGCGGGCGATGCCTCG
ACCCTGGACGAAGTGCGCGAGGGCGGCTTCCTGCCCCTGACCCGCTTGATCCAGACTCGCCCG CCAGGCGGACCGAGCGGACTTCTAAGCCTCACGATCGTCACCCCGGCCGCCCTGGCGCTGGGC GACGAAGCGACGCGCCCGGAATGGGCAATCCTGCACGGGATGGTCGCCGGCTTAAGCCGCGAT TA7CCCGAATGGCGCTTCGTCTCGATCGACGGCGGCGACCCATCCCCGCATCGGTGCGAAGGT CTGGCCCGCTTGATCGCGCTTCA7GCGGTCGACGAGGCTGGCCCGACCCGCTTGGCGCTGCGC GGCCTTCACGCTTGGGTTCCACAGTGCGAGCACGTTCAGCCGGCCACCATCCCTGGGGCGGGT ATGTGGCGCGAGGGTGGTGTGTACATGATAACGGGCGGATTCGGCGGGATCGGTCTGGCGCTG GCCCGCGCCCTGGCTCGAGAAGCTCGCGCCAAGCTGATCCTGGTCGGCCGAAACCTGCCCACC GCGCCGATCGATCTCGAGGCTTGGGACGCGCCGCCGTTGATTCTCACCGCCGACGTCGCCGAC GAAGAGGCCATGCGCCGCGTCTTCGATGCCGCGCACGCCCGGTTCGGCGCCATCGACGGCATT CTTCACGCGGCCGGTGTCCCCGGTGGCAGCCTGTTCGCCAACCAATCGGACGCGGCCTTCGAA GACGTGCTGCACGCCAAGGTTCGCGGTACCCTCGTGCTGCAAGGCCTGAGGGCAATCGATGCG CCGCTGTTGCTGATGTCCTCGCTGGACGCCTGGCTTCCCGGTCCCGGTCAGACCGCCTATGCC GCCGCCAACGCCTTCCTCGACGCCTTCGCCAGTCTGCGCCGGCGAGAGGGAGAGCCGGTGTAC AGCGTTGGCTGGGACAGTTGGTGCGAGGTGGGCATGGCTGCTCGGGTCGCTGCCCGATCGGCC GACGAACGCGGCCGCCTGGCGCGCGAGGGGATCAGCCCTCGCCAGGGTTGGCAGGCTTTGAGC CGGGCGCTCGCCCTCGACCCCCCCCACCTGATGATCTCGCGCACCGACCTGACCTCGCGCTGG CACAGTCGATCCAGCCCTACGCCGGTCGCCTCGAGCGAACCCGAGGTGGCGCTGCCGCGCTGG ACCGCATCCGCCTGCCAAGCCGTCATCGAGCGTGTTTGGTGCGAGCACTTCGCCACCGCCGCC GTGCCTCCCGATGGCAACTTTTTCGAGCTCGGCGCCAGTTCCTTCGACATCGTCCAGCTCAGC GCTCGACTTCAACAACAGTTCGGCCGAGATGTCAGCCACACCGTGCTCTACAGTCATCCCACC GTCGCCTTGCTGGCCGGCTACTTCGCCAATGACCCGACGCCGTCCGGTGCTGCTGCCGACGAA CGCGACGAAGCGGTGCGTCGCGGCCGCGACCTCTTGAAGAGCCGCCGGCGAGGAGTA
(2) peptide sequence
Seq ID No 62 ( >pEPOcos6_ORFll . pep)
MTNHDHHEESSGLEIAVISMACRFPGAADCDAF ENLINGTSSITHFSDDELIAAGVDARDLT PQYVRAAGQIDDAERFDAAFFGYSQREAELMDPQFRLLHECAWSCLEQAGIDPRVEAAPIGLY AGAADNTYWNALSSLDRGSAESEQFAAEQLCNRDFLCTLVAAALNLKGPAWVQSACSTSLLA VHSACRALLTGECRVALAGGVALRFPRPSGYRYEPGMIFSPDGVCRPFDAGANGTVPGEGAGL
VALKTLKRALQDGDTIHAVIRATAANNDGARKTGFTAPSAHGQAEVIRTALRLARVPAESIDY VEAHGTGTPLGDPIEVAGLVEAFASEKRGYCRLGSVKSNLGHLDTAAGIAGLIKTVLALEHAH IPKSCHVATPNPAARLHKTPFRIAADGMAWPRRMATPRRAAVSSFGIGGTNVHAILEEAPPRA PELADGRSQVFVFSAKDEAALDRALANYGAALEKRGDLAAGAVAWTLQNGRAAFE RASAVAS DLDELAGALRGERPGAVKKNRMAREDKPVAFLCSGQGSQYRGMGHDLYREEPRFRHHLDACLA ILAEHKPEID LALLGYRDEDEPTDQIGTSSQGPSRSAASNPAELLDSTEFAQPLLFSMSYAL GRL LD GVRPTAMIGHSLGEYSAACIADFYALDQVLPFILTRGRVMAQLRRGSMLAVSGDSV LMRELIADALDLAAINGADQFVWSGPSEAVQAAGVRLRGAGLRATELNTSHAFHSAMMDPILE ELTVAGSRLQVGVGTIPWSCVTGT LTAKQLADPRYHARHAREPVRFAAGLATLTGEEPPLM LEVGPGSTLAALAREHSNARLPWTSLRHARQATPDRQYLLETLGCL RHGVSVD GAHAGRS RRLVSLPGYPFSGAVRRLAGDPLRLLAGARAVAAPSGTRQLSADARDLPNTPEPTSGAVSAIK APIAAADPGLYRLSWRQAGTAPLGPPDLGPPRD IVFASDSHLLQALRANLGTRAQRVTLVTP GQEYAAEPSGFRLRPDQIDDYRAL ADLAQTGIVPRYIAFLAPFMYRARMAGDASTLDEVREG GFLPLTRLIQTRPPGGPSGLLSLTIVTPAALALGDEATRPE AILHGMVAGLSRDYPE RFVS IDGGDPSPHRCEGLARLIALHAVDEAGPTRLALRGLHAWVPQCEHVQPATIPGAGM REGGVY MITGGFGGIGLALARALAREARAKLILVGRNLPTAPIDLEA DAPPLILTADVADEEAMRRVF DAAHARFGAIDGILHAAGVPGGSLFANQSDAAFEDVLHAKVRGTLVLQGLRAIDAPLLLMSSL DAWLPGPGQTAYAAANAFLDAFASLRRREGEPVYSVG DS CEVGMAARVAARSADERGRLAR EGISPRQGWQALSRALALDPPHLMISRTDLTSRWHSRSSPTPVASSEPEVALPRWTASACQAV IERV CEHFATAAVPPDGNFFELGASSFDIVQLSARLQQQFGRDVSHTVLYSHPTVALLAGYF ANDPTPSGAAADERDEAVRRGRDLLKSRRRGV
pEPOcos6_ORF12 sequences:
(1) nucleotide sequence
Seq ID No 63 ( >pEPOcos6_ORF12. seq)
ATGACCGTGGAGCACGAAACCGGATTCGAAATCGCCGTCATCGGGCTGGCTTGCCGCGTTCCC GGCGCTGCCGACGTGGCCGCCTTCTGGCGCAACCTGGTCGAGGCCAAGGAGAGCGTGCGCTTC TTCGAGGACCACGAGCTGCGGGCCGCCGGCGTGCCCGAGGAGATCTTGCGCCTGCCCAACTAC GTGAAGGCCAAGCCACTGCTCGCTGATGGCGAAGCTTTCGACGCGGACTTCTTCGGGTTCCAT
CCGCGCGAGGCCGCCTACCTGGACCCGCAAGTTCGGCTCCTGCACGAATGTTGTTGGACCGCG C7GGAGGATGCCGGCTACGATCCCGCGCAGTACGCCTACCCGATCGGGTTGTTCGCGGGCGTC TCCAGCAATCTCTCGTTCCTGTTCGACCGCATCGATCCGCGCGACTCCCCCCTGCAGAAGCGC TA7GTGGCCGAGCTGAACGCGGCCTCCTTCGCCACCCAGATCGCCTACCGGCTCGATCTGAAG GGGCCGGCCATTTCGATTCAAACCGCCTGTTCGACGTCACTGGTGGCGATTCACCTGGCGGCG CAAAGCCTGATCGGCGGCGAGTGC3ACATGGCCTTGGCCGGCGGAGCGACCTTGGAGGTCCCC AAAAAGCCCGGCTATCTCTACCGCGAAGGCTACATCAACTCGCCGGACGGCCACTGCCGGGCC TTCGACGCCGACGCGGCCGGCACCATCTTCGGCGACGGCGTCGGCATCGTCCTGCTCAAACGC TACCGCGACGCCCTACGCGACGGCGATCACGTGTACGCAGTGATCAAAGGCTCGGCGATCAAC AGTGACGGCCATCGCAAGGTGTCC7ACACGGCGCCGGGCAAGAGCGGTCAAGTGGCGGTGATC CGCGCTGCGCTGGCGGCGGCCCAGGTAGAGCCGCAAACCATTCGCTTCGTCGAGGCCCACGGG ACCGGCACACTCGCCGGCGATCCGATCGAGGTAGAGGCGTTGACGGAGGTCTTTGCCGAAGCG GGTCGCGGTACCTGCGCCCTGGG77CGGTGAAGACCAACATCGGCCACTTGGATGTGGCGGCG GGCGTGGCCGGTTTCATCAAGGCGG7CTTGGCGCTCGAGCGGCGCGTCCTCCCGCCCAGCCTT CACTTCGTCCGGCCCAACCCGGCCATCGATTTCAACGGGCCCTTCTACGTTTGTCGCCAAATC GAGCGGTTGACGGAGAACGGGCGG77GCGGGCCGGGGTGAGTTCCTTTGGCATTGGCGGCACC AATGCCCACGTGATTCTGGAGGAA3CGCCGGCGCCGGAGGCGAGACTGCCGGCCGGGAGCCCG CCAGGCGCGAGTCCGTTCCTGTTC3CGCTATCGGCCAAGACGCCGGATGCGCTGGCAGGCCGT TGCCACGACCTTGCCGACCACCTGCGGGCGCACCCCGAGCTCCTCCTGGCCGATGTGGCCCTC ACTCTGCAGATGGGGCGGGCGTCG7TCGCCTACCGCCATGTGGTCCAGGCTGCGACGGCGGAG GAGCTGATTCGCGGTCTGGGAGCG7TCCGACAGGAGTCCATCCGCAAGAGGCGGAATCGAGTA CAATGGGTGTTGGCAGGCGAGGCGATGTCGCTTGACGCCGGTTTGCGGCTGTACGCCGATTGG CCGGTCTATCGGGAGCGGGTCGACGTCTGTCTGGCGATCGTCGCCAAGCTGCGCCAAATCGAC GGCCGGTCATTCCTACATGAGTGGATCGAGCGACCGCGCGAGGTTCCTGCCGAATGGTCGACG GCGCTGGCGTTCATGTTCCACTGCGCGCTGGCGCAAGCCCTGAGCCAGGCCGGCCTGCACCCG CAGCGCATGTGGAGCCGTGGGCTG33CGGACAGGTCGGCGTGGTTTTGGCCGAATCCCTGTCG TTGGAACAAGCGCTGGCGCTGGTG7TGTGCCAGACACCGGTTCCCGGCGATGCCACACCTCAG CGCGAACGCTTGGTTCGGACAC7G3AAGGCTGCCGGTTTCGTCCACCACGATTTTTGATTTCG GCAGACAGCTCGGGTCGACCCCTGGACCTCGCCGAATTCGCTCATGTCGATTTTTGGTGCGGT GGCCAAAGCGCCTCGCCCAATGAG3CGGAGCTGCGCTCATGGAGCGACGCCGCGCCCGAGCTG
GTGACCTTGGCGATCGGCCCATCC7TTCTCGAGGCCGCCTCCGGGACGGTGGGTCTGGCGATC GACCCCAAGCGACCGATGACCTG73TTCAGCGCACGGTGGCCGCGTTGTGGGAATGGGGATGT GACGTGCGCTGGGCTGCGTTCACC7CGTCGACCGGGCGTCGGGTTCCCCTGCCTACCTATCCC TTCGTGCGGGTAATTCCCACGATC3GCGACCCCCTTCGCGGAGCAGGCGCGGAGGATGACTTG ATTGCGGCGAGCGCTTCCGCGTC33CCGGATCGCCGCCCGAGCCGTCGGCAAACTCGGCAGCG GAACGCCCACGCGCCCAGTCAAGCATCGCCTCGGCAACCACACCGGCTCCGTCTCATACGTCG GCCAGCGTGGCCGTGGCCACCATTCTCGAAACCGTCCGTGCCTATTTCGGGTTCGCCGCCGTG CGTTCCACCGACGCCTTCTTCGAATTGGGCGCGTCCTCGCTGGATTTGGTCAACCTGGGCCAG CTCCTTTCCGATCGTCTCGGCCGCGAGGTTCCGACCCTGCTCCTCTACGACCACCCAACACCG GACCAGTTGGCGCTGGCCCTGACATCCGCGGCGCTCAGCGCAGAGGCGCCGCCCTTAAGGGGC GGTCATCGCGCATCGACTTCCGGCACAGCCGCGAGCTCGGCCGCCTCCACCGCACCGACGTTC CCGGGGGACGCTCACTCGCAGCCCAGCTTCGTTCGCGAGCAGGACATCGCCATCATCGGGATG GCCTTCCGGGGACCGGGCGCCGACGACCTGGACGCGTTCTGGAACAACCTGGTCGAAGGGGTC GAGTCGATCACCTTCTTCAGCGAGGACGAGCTGCTGGCGGCGGGCGTCCCCCGCGAACATCTG GCC7CGACGCGCTACGTGCGGGCCAAGGGGGAACTGACTGGGATGATGGATTTCGAACCGGAA TTT7TCGGTTATTCGGCGCGCGAGGCGGCGGTCATGGACCCGCAGTTCCGCGTGTTCCACGAA TGC7CCTGGCACGCACTGGAGCACGGCGGCTACGATCCGACCCGATGCGCGGCATCGATTGGC GTC7ACGCCGGCGTGACCAACCACCTGCCTTGGCTGATGCGAACTTTGCCGCACCTGACCGAG GAGGAGCAATTCGGCGCGCTGCTCCTCACCGACCGCGAGTTTTTCGCACCGCTGCTCTCCTAC AAGGTCGGCCTGCGCGGACCCGC7ATTTCGCTGCAAACCGCCTGTTCGACGTCGTTGGTGGCG ATCGGCACGGCCTGTCGCGAATTGCGCGCGGGTGCCTGTCAGATGGCCCTAGCGGGCGGCGTG ACGGCCAGCATCGAGCGCTGCGGCTACTTCCACCAAGAAGGCTACATCCTCTCGCCTGACGGC CACACGCGCAGCTTCGACGCGGCGGCCGCCGGCACGGTCTTCGGCGACGGAGTCGGCATGGTG CTGCTGAAGCCGCTGGCCCAAGCC7TGGCCGACGGCGACACGATCCACGCGGTGATCAAGGGA ATCGGCATCAACAACGACGGCGCGCGCAAGGTCGGCTTCACCGCACCTAGCCGGGCCGGTCAG ACCGAGGCGATTCGGGCCGCGCTGCGCGACGCCGGGGTGGCGTCGAACCGCGTCAGCTACGTG GAGGCGCATGGAACCGCGACCAGAATGGGCGACCCGATCGAGGTCGAGGCCTTGACCCAAGCC TT7CGCGCCGAAGCCGACGGTCCGCTTCCGCCCGGCTCCTGCCTACTCGGCTCGGTGAAGTCC AAC3TGGGCCACCTGAACGCCGCGGCCGGCGTGGCTGGTCTGGTAAAAACCGTGCTGGCGCTC CAACACCGCCGCCTGCCGACCAGCCTGTTCTACCAGTCGCCCAATCCACACATCGACTTTGCG
GCGAGTCCGTTCCGCGTGAACGGCCAGACTTCGGATTGGGTCGCGCCAGAGGGGACGCGGTTG CTGGCGGGAGTGAGTTCGTTCGG7ATCGGGGGAACCAACGCCCACCTGATCGTCGAGGAGGCG CC3AAAGCGCTACCGACGACAGCGGCACCTCTGTCGACGGAGCCGAATGACCTCGACGCGGGC GACGCCGACGGGCTAGTGCTGCCGATCTCGGCCCGCACGCCGACCGCCCTGGCGCACATCGCG ACCAACCTCGCCAATCACCTGGAACGACATCCGACCATCGCCCTGGCCGACGTCGCCCTGACC CTTCAGCTGGGCCGTCGCCAATGGCCCCATCGCCACAGCCTGATCTGCCGGAATCGAACGGAG GCGATCAAGCTGCTGCGCGCCGTCGTCCACTCCGCGGAGGTGCCGCCAGCTCAGGCGCCGGTC TCGGATGCGCCGCGCTGTGTTTTTCTTTTTCCCGGCCAGGGCGCCCAATACCCGAGCATGGCC CGCGACCTGGTTCGAAACTGTCCCGACTTCGCCCTGCACCTGGACCCCTGCCTCGACCAGTTG GCCGAACTGCTTCCCGAAGATCCGCGTTGCATCCTGTTCGGCGATGGCCCCGCCGATCGGCTC GACCAGACGGCCTACACTCAGCCGCTGCTCTTCTCCGTGTCCTACGCCTTGGCGCGCTGGTTG GGCGATTTCGGCATTCGCCCCGATGCGATGATCGGCCACAGCCTGGGCGAATACGTGGCGGCC TGCTTGGCCGGGCTTTTCTCGCTGAGCGATGCCCTGCTGCTGGTGAGTGAACGCGGCCGCCTG ATGGGCTCGGCCGCGCGCGGAGCGATGCTGGCCGTCCCCTTGCCCGAATGGGAACTGGAGGAA CGCCTGGAGCTTCTGGCCGACGACCGAATCAGCATCGCGGCGGTCAACACCGCCGAGAGCTGC GTCATCGCGGGACCCAGCGAGGCGATCGAGCGCTGCGCCCAGCGCTGGGCCGCGCAAGGCCTG ACCTGTACGCCGCTGCGCACGTCCCACGCCTTCCACTCCGCGATGATGGAGCCGATTGTCGAA CCCTTCGGCCATGTCTTGGCACGGGTCACCTTCGCGCCGCCGCGCGCGCGCTGGATCTCGAAC CTCGACGGCAAGCCGATCGATTCCGCGGCGGTGATGCAGCCCGACTATTGGGTGCGCCACCTG CGCCAACCGGTCCGCTTTCACGAGGGACTCAGTCACCTGTTGGCCGAGGACACCCATGCTTGG GTCGAAGTGGGTCCCGGCCGAACCCTGTCCTCCTTCGTCCGCCGCCACCCGGCCTACCGTCAC CAGCCAATCGTCAACCCCATGCGCCATGCAGTCGAGTCGACGGGCGACGTGCGCCGGTGGCGC CAA3CGCTGGGCGAACTATGGCGGGCCGGCATGCCGGTCGCCTGGGAGCGGCAGCGGCGCGGC CGGCATGCCGGACGACGTGTGCCGCTGCCGGGCTACCCCTTCGAGCGGCGGCCCTTCGCGGCC CGAAGACCGGTGGAGCTGGCGCAGCCCGCGCCCAAGGCGGAGCTGGTGAAAAACCCCGATCCC GCGCGGTGGCTGTACCGCCGCGTCTGGCGCCCTGCCCAGGCTGCGGCCGGCGGACTGGCGGTG CAGGCGACCGTTCTGGTCTTCGGCGACGGGTCCGAGCTGTGCCGCGCGGCGGTCGCTCAGGTG CAGCGCCAGGGGCTGAAGTGCGTCTCGATCACCGCGGGCCGCCAATTCGCGCGGGAGAGCGAC AT33GCTTCACGCTTGACCCCGCTGATCCGCGCCAGCTCGACCAGCTCTTCGCGGCCCTCGAT GGC7CAGGCTCGCGGCCGCGGTACGTCCTGCACCTGCTGACCCTGAACCCGCCCCCGGATGCC
TCG3CGATCATCGCTCACAGCTACTACAGCCCGATGGCCTTGGCTCATGCCTTGGGCGCCCAC GAGATCGCGCCTGTCTCGATCACCGTCGTCACCGCCGGGGTCGTCGCCGTCGCGGACGAAGCG AT7CGCGAGCCGCTGCAGGCGCTGATCGTGGGCCCGTGCCTGGTCATCCCGCAGGAGTTTCCC GGGCTCAGCGTTCGGCTGCTGGACGTCAACGTCGACGATCCGGCACCGCGTCTGGCGGAGCGG CTC3TGGCCGAGCTCTCGGGCACGGATCACATGGTGGCGCTGCGCGGCGGCGAGCGCCTAGTG GCCGATGTCGATCAAGTCGATGGCCTCGGTGTGGGGATCGCCAAGGTGCCCTTGCGCCGCGAG GGCCACTACCTGATTCTCGGCGGCCTGGGCGATATCGGCTACCACTGTGCCCGCTATCTGGCC CAAACCTACCGCGCCAAGCTGACGCTGACCGCGCGTTCGTCACTCCCGCCGCGCGCGTCGTGG GAGCGAATGCTGCGCGAGGGAAACCTGGATTCCCGGCAGCGCACGCGCATCGAGCGCGTGTTG TCGCTAGAGGCGTGCGGGGCCGAAGTCCAGACGGCTGCGGTCGACTTGGGCGATCGCCATCGC TTGGCCGATGTGTTCCGCGAAGCACGGGGCCGATTCGGCGCCATCGCGGGCGTGATTCACTCG GCGGGGATTCCGGGACACGTCCAC7CGATCGACGAGCTGGTGCGCGTCCGCGACGAAGCCCAA TTCACCGCGAAGGTTCGAGGGCTGCACCACCTGGCCGAGGTCGTCGATCCGCTGAACCTCGAC TTTTGTCTGCTGTTCTCCTCGCTCTCGACCGTCCTCGGCGGGCTCGGCTACGGCGCCTATGCA GCGGCCAACGCCTACATGGACAGCTTCGCCCGCCGCCACGATCGGCCGGACGAATGTCGTTGG ATCGCGGTCAACTGGGACGCCTGGCTGTTCGAAGCCAAGACGTCGTCGGTCGGCGCCGAATTG GCGCGCCTGGCGATCGTGCCCGAGGACGCTCCGGCCCTGTTCGCGCGGGTGCTAGAGCGACTT CCGCAATCGTTCATCGTGTCCACCGCCGACCTTCGGGCCCGCATCGACACTTGGATCCGGGAC AAGAACCGCGTCCCGCCCGCCGAGATCCGAGCGGTTCAACCGCGACCGGACCTGAGCCAGGCG TACGCCCCGCCGATCGGCCCGCTGGAGATTCAACTCTGCGGGCTGGTCTCCGCCTATTGCCGG TTCGACCGGATCGGGCGGGACGATTCCTTCTTCGAAATCGGCCTCAGCTCGTTCGACTTGATC CAGCTCAGCTCGCGCATTCACCGCATCACCGGCAAGGATCTCAATACGACCCAACTGTTCAGC TACCCCACCGTGCGCGCCTTGGCGCTCTTCCTCGGCGGCGAACCGGAGGGGCTCGCGGCGGAG GAGCCCGCCATGGAGAACCTGTGGCTGCAACGAAGCGATGCGACCCTCGATGAG
(2) peptide sequence
Seq ID No 64 ( >pEPOcos6_ORF12. pep)
MTVEHETGFEIAVIGLACRVPGAADVAAF RNLVEAKESVRFFEDHELRAAGVPEEILRLPNY
VKAKPLLADGEAFDADFFGFHPREAAYLDPQVRLLHECC TALEDAGYDPAQYAYPIGLFAGV SSNLSFLFDRIDPRDSPLQKRYVAELNAASFATQIAYRLDLKGPAISIQTACSTSLVAIHLAA
QSLIGGECHMALAGGATLEVPKKPGYLYREGYINSPDGHCRAFDADAAGTIFGDGVGIVLLKR YRDALRDGDHVYAVIKGSAINSDGHRKVSYTAPGKSGQVAVIRAALAAAQVEPQTIRFVEAHG TGTLAGDPIEVEALTEVFAEAGRG7CALGSVKTNIGHLDVAAGVAGFIKAVLALERRVLPPSL HFVRPNPAIDFNGPFYVCRQIERLTENGRLRAGVSSFGIGGTNAHVILEEAPAPEARLPAGSP PGASPFLFPLSAKTPDALAGRCHDLADHLRAHPELLLADVALTLQMGRASFAYRHWQAATAE ELIRGLGAFRQESIRKRRNRVQWVLAGEAMSLDAGLRLYAD PVYRERVDVCLAIVAKLRQID GRSFLHEWIERPREVPAE STALAFMFHCALAQALSQAGLHPQRM SRGLGGQVGWLAESLS LEQALALVLCQTPVPGDATPQRER1VRTLEGCRFRPPRFLISADSSGRPLDLAEFAHVDF CG GQSASPNEAELRS SDAAPELVTLAIGPSFLEAASGTVGLAIDPKRPMTCVQRTVAALWE GC DVR AAFTSSTGRRVPLPTYPFVRVIPTIGDPLRGAGAEDDLIAASASASAGSPPEPSANSAA ERPRAQSSIASATTPAPSHTSASVAVATILETVRAYFGFAAVRSTDAFFELGASSLDLVNLGQ LLSDRLGREVPTLLLYDHPTPDQLALALTSAALSAEAPPLRGGHRASTSGTAASSAASTAPTF PGDAHSQPSFVREQDIAIIGMAFR3PGADDLDAF NNLVEGVESITFFSEDELLAAGVPREHL AS7RYVRAKGELTGMMDFEPEFFGYSAREAAVMDPQFRVFHECS HALEHGGYDPTRCAASIG VYAGVTNHLP LMRTLPHLTEEEQFGALLLTDREFFAPLLSYKVGLRGPAISLQTACSTSLVA IGTACRELRAGACQMALAGGVTASIΞRCGYFHQEGYILSPDGHTRSFDAAAAGTVFGDGVGMV LLKPLAQALADGDTIHAVIKGIGINNDGARKVGFTAPSRAGQTEAIRAALRDAGVASNRVSYV EAHGTATRMGDPIEVEALTQAFRAΞADGPLPPGSCLLGSVKSNVGHLNAAAGVAGLVKTVLAL QHRRLPTSLFYQSPNPHIDFAASFFRVNGQTSD VAPEGTRLLAGVSSFGIGGTNAHLIVEEA PKALPTTAAPLSTEPNDLDAGDAE3LVLPISARTPTALAHIATNLANHLERHPTIALADVALT LQLGRRQ PHRHSLICRNRTEAIK LRAWHSAEVPPAQAPVSDAPRCVFLFPGQGAQYPSMA RDLVRNCPDFALHLDPCLDQLAELLPEDPRCILFGDGPADRLDQTAYTQPLLFSVSYALARWL GDFGIRPDAMIGHSLGEYVAACLA3LFSLSDALLLVSERGRLMGSAARGAMLAVPLPE ELEE RLELLADDRISIAAVNTAESCVIA3PSEAIERCAQR AAQGLTCTPLRTSHAFHSAMMEPIVE PFGHVLARVTFAPPRAR ISNLD3KPIDSAAVMQPDYWVRHLRQPVRFHEGLSHLLAEDTHA VEVGPGRTLSSFVRRHPAYRHQPIVNPMRHAVESTGDVRR RQALGEL RAGMPVAWERQRRG RHAGRRVPLPGYPFERRPFAARRFVELAQPAPKAELVKNPDPAR LYRRV RPAQAAAGGLAV QATVLVFGDGSELCRAAVAQVQRQ3LKCVSITAGRQFARESDMRFTLDPADPRQLDQLFAALD GSGSRPRYVLHLLTLNPPPDASAIIAHSYYSPMALAHALGAHEIAPVSITWTAGWAVADEA IRΞPLQALIVGPCLVI PQEFPGLSVRLLDVNVDDPAPRLAERLVAELSGTDHMVALRGGERLV
ADVDQVDGLGVGIAKVPLRREGHY1 I LGGLGDIGYHCARYLAQTYRAKLTLTARSSLPPRAS ERKLREGNLDSRQRTRIERVLSLEACGAEVQTAAVDLGDRHRLADVFREARGRFGAIAGVIHS AG: PGHVHS I DELVRVRDEAQF7AKVRGLHHLAEWDPLNLDFCLLFS SLSTVLGGLGYGAYA AA::AY DS FARRHDRPDECR IA\7 WDA LFEAKTS SVGAELARLAIVPEDAPALFARVLERL PQSFIVSTADLRARIDT IRDKNRVPPAEIRAVQPRPDLSQAYAPPIGPLEIQLCGLVSAYCR FDRIGRDDSFFEIGLSSFDLIQLSSRIHRITGKDLNTTQLFSYPTVRALALFLGGEPEGLAAE EPAMENL LQRSDATLDE
pEPOcos6_ORF13 sequences:
(1) nucleotide sequence
Seq ID No 65 (>pEPOcos6_ORF13. seq)
ATGAAATACGAAACCACCGGATTGGAATTGGCCGTCATCGGTCTCGCTTGCCGCTTTCCAGGC
TCACCCGATCCCGAACAGTTCTGGTCGAATCTGCGCGCAGGTCGCTCCGGAATCCGCCATTTC AGCGATGCCGAGCTGAGCCACATCCCCGCATCCCTGCGTCACCATCCGCATTACGTCAAGGCC AAAGGCGCGCTGGACCACGCCGATTTCGAACCAGCCTTCTTCGGCTACTCGCCCAAAGAGGCC GAGGTGATGGACCCTCAATTCCGGCTGCTCCATGAGTGCTGCTGGGAGGCGCTGGAGTCAGGC GGCTATGCGCCGAGCCAATTCGCGGGTCGGATCGGCTTGTTCGCGGCGGCGGCCTTCAACGAC GGATGGATCGCCGGTACCCTCGACCGGCTGCGCACCGGCGTGGGTTTGAGCTCCCTGGAAACC GCGTTCTTGACCCTGCGCGATTACCTGACCACCCAGATCTCCTATCGGCTCGATCTGCGGGGC CCCAGCCTGCTTGTCCAAACCGCCTGCTCGTCGTCGCTGGTGGCGGTCCAGCTCGCCCAGCAG GCGCTGATCTCCGGCGAATGCGCCCTGGCCTTGGCTGGCGGCGTGTGCGCGACCGATCCGCTG CAT7CGGGATACCTCTATGAACCCGGCAACATCTACGCGCGCGACGGCGTCTGCCGACCGTTC GACGAGGCAGGCGCCGGTACGGTCTTCGGCGACGGGTGCGGCATGGTCCTGCTCAAGCGGCTG AGCGACGCCCAGCGCGACGGCGATACGATCTGGGCGGTCATTCGCGGGGCGGGCGTGAACAAC GACGGGCACCACAAGGTTGGCTACACGGCTCCTGGCACGAGGGGCCAGGTGGCTTTGCTTAAA AGTGTTTATCGCGCGAGCCGGGTCGACCCGGCGACGCTCGGCTACCTGGAGGCCCATGGCACC GGCACCGCGCTCGGCGATCCAATCGAGGTCGAGGCGCTTACCCAGGCCTTCGCCAGCAAACGT CGCGGCACCTGCGGCTTGGGCTCGGTCAAGGGCAACCTGGGTCACCTCAACACGGCGGCCGGC A7CGCTGGACTGATCAAGGTGGTGCTGGCGCTGAAACATCGCGAAGTGCCACCCACCCTCAAT
C7GCGCCGTCCCAATCCGAAAATCCGCTTCGACGAGACGCCGTTTTTCCCAGTCGTCGAGTTG CAACCCTGGCCAAGCGGGACCGGCCCCTTGCGAGCCGGCGTGAGCTCCTTCGGCATCGGCGGT ACGAACGCCCACGTCATCCTCGAGGAGGCACCGCCGACGGCCAACCCGGCGCCACACGGCAGA T7 CGACTGTTGCCGCTTTCGGCCAAGACACCGGCTGCGCTCGAAGCGAAGCGCCGCGATCTG GCCGGCTTCCTCGAACGCCACCCGGAGACCTCCTTGGCCGACCTCGCCTTTACCCTGCAACGC GGCCGCGAGGTCTTCAGTCACCGCGCCTGCCTCGCCGTGGAGACCTTAACGTCCGCGCGCACG CGGCTGAGCGGCGAGTCGTCGAGCACTTGCGTGGTGGGCCCCGCGCCCAGCGCCATATTTCTG TTCCCTGGTCAAGGCAGCCAGCTCGCCGGGATGGGCCGCGGTCTGTATCACCATTTCGAGCCG TTCCGCACGGCCGTCGATGCCTGTCTGCGCGAGCTGGAGCCAGGACTGCGGCAAGCGCTCAGC GCCCATTTCGATCCGAATCGCGGCGCGGACCCACCCGATTCGACGACCTTCGTCCAACCCTTG TTGTTCCTCGTCGAGTACGGGGTGACCGAGTGGCTACGCTGCTTGGGTGTGCGGCCAACAATG GTGTTGGGTCACAGCTCTGGCGAGTATGCCGCAGCCTGCGTCGCGGGCGTTCTGTCGCCGTCC GCGGCGGTCTCGCTGCTGGCCGAGCGCGAGCGGCTGCTGCGCGACCTGCCAGCCGGCGCCATG CTCGGCGTCCCGCTGGCCGCCGAGGCGCTCGAGGCGATGTTGCCCGACGCTCTCGATCTGGCG GCGATCAACGGCTGTCAGCTTTGCGCCGTGTCCGGGCCGGTCGCGGCGGTCCACGCCTTCAAG GCCCAACTGGAAGCCGCCGGACATCACGCCCGCCTGTTGCACACCGATCGCGCCTTCCACTCG CGGCTGGTAGCACCGGTGCTTGACCGGTTCCAGGCAGCCGTTCAACACGTGGAGCTGCGGCGG CCGCAAGTACCTTACCTCTCGACCGTCAGCGGGCGATTGGAGGCGGATGGGCCGGCGAACCCG CACTACTGGGTGCGTCACCTGCGCGACACGGTGCGGTTTGGTCCAGCCCTGGAGGCGCTGCCG CCGGTGGATTCCTTCGTGTGCATCGAGGTGGGACCAGGCTCGGCCTTGAGCACCATGGCGCGC GAAACGTTGGGTTCCCAGGCGCGACTGATTTCGTTGCTGCCGCGGCCGCGAACGGGGCAAATC GAGCCCGGTCCGGTATTCGAACGACTGGCGGCGCTTTGGCGCAGCGGGTTGACATTGGATTGG TCTAAATTGACGGGCGGCGAAGAGGGTCATCGAATTCCCTTGCCAGTCTACCCGTTTCAGCGC AGCCATCTGTCGAGCTCCCTGGCGGCGGGCCACACGCCTTCGTCGCGGCCTGCAGTCGAATCA GGCGCCATCCTTGCCGAGCGATCCGCAGGGGAAAACGCTGAAACCCGGGATTGCCCGCTGCCA ACCGCCACGCTCGAGCCCAAGGCGGTCGCTCCGGCCCCACTCGAGGCTACCGACGCCGCAGGT ACTCGCGAGCGACTGGCCGAACTTTGGCGCGAGTTGCTAGGGTTGACCTCGATTGGGCCCGAC GACCATTTCTTCGACCTGGGCGGCCACTCGCTGACCGCCACGCGGCTGCGCGCCCTGATTCAC CAGCGGTTCGATGTCGATCTCGGGCTCGACGAAATCTTCGCTCATTCGCGTCTCTCCCAGCTG GCCGCCCGTATCGAGGCGGCGGCCAAGAGCCGATTTTCCTCCATTCCCAGCGCGCCGGACCAG
GAC3ACTATCCCTTGTCATCCGCCCAGCAGCGGATTCACAGCATCGTCACGAGGGCCGAGGTC GGCACTGCTTATAATTTTCCGATCGTCCTCGAGCTGCAGGGCGCTCTGGATCGAGTGCGATTC GAGGCGACGTTCGCGGCATTGTTCCGGCGTCATGAGGGGTTCCGCACCCGCTTTGTGATGCGC GATGGCGGGCCGCGCCAGCGCATTGTACCGGACGTGGCGTTTCGCCTGCCGCTCACCCAGGTC GAGCCAGAGCAGGTTCCCGGGCGCATCGAGGCCTTCATCCGTCCCTTCGATTTGGAACGCGCG CCGCTGTTCCGCGCGGAGCTGTTGCAGTTGGCCGAGCAGCGCCATCTGCTACTTTTCGACATG CACAACTTAATTGCCGACGGTATCTCGCTCAACCTGTTCGTCGCCGATTTCGCGGCCCTGTAC CATGGTCGTCCGCTGGCGCCGCTGAAACTCCGCTATCGCGACTATGCCGTTTGGCAAGAGGCG CGGCTGGCCTCCGATGACCTGCGCAGCCAGCGCGAATGGTGGCACCGGCGGCTTTCGCCGCCG GTCGCCACGCTGGCGCTCCCTCCCGATTTCCCGCGTCCGGCGGTGCGCCGCTACAAGGGCCGT AATGTGGTGTTCCACCTGGACCGGGAGATCCGCGACCGCCTGGTGGCCCTGGCTCGAACCCAG GGGGTCACCATGAACGTGATGATGCTGGCGCTCTGGGCTGCGCTGCTGCATCGCGAAACCGGC CAATCGGAGCTGGTGGTCGGATCGCTGCTCGGCGGGCGGCCGCACAGCGAGCTGCATCCCGTG ATCGGGCTCTTCACCAACTTTTTGCCCTTGCGGTTGGCGGTCGAGGGATCGACCCGCTTCGAT CGCTTCCTTGCCGCTTGCCACCAGG7GTTTCTCGAAGCCTATCAGCGCCAGGACTATCCGTTC CACTTGTTAGTCCAGGAACTCGTGCCGGTCAGGGACCCGTCGCGGTCGCCGCTGTTCCAGACC TCGCTCGTCTACCACAACGAAATTGACGGCAAGACCAAGCTGGAATTGGAAGGGCTGAAAGTC GAAGTGGTTCCCTTCGAAAAGGGTGTGGCGAGGCTGGATTTGAAGCTGGATGTGACACCTTTT TCCGACCGACTCGAATGTGTTTTGCAATACGACTTGGATCTGTTCTGCGAGGAGACGATGCGC GGCCTGATCGCGCGGTTCCAGGCGTTGGTGGCGGGGCTTGTCGCCGATCCGGCGCAATCGCTC GCCGCCGCGAGCGTTTCCGGGAAGCGGGCGCTGCGCGCGGGCGTGGCCACGGCAAGCGAATCG TCGCCGCAGTCACTGCCGCCGCAACCATCGACGGCGTACGCCACTCCCTCACCGCAGTCACCG TCGCCGGTAGTCCTGACGGGACCCGCCGACCTGCCCGCGATCTTGGCGGCCTACGTGGGGCAG AACCCCCATCCGTTCGCGATCCATCGGGGTCTCATTTTGGAGGCGCCGCTGGGGTTGCGAGCG CTGCGGTCGGCGCTGGACGCAGTGC7CGGAGAACACACCCATTGGCGCAGCGTGCGTGCGGGC GATCGCGCGCGGCGCGTGGATAAGT7GGAATTGACCAGCCTGGTGCGGCTCGACGACCTGCGC GGGTTGGTCAATCCTCAGGCGAATGCCTTCACCCTGGCTTGGCGCGATCTGGCGATGCCGTTC GGGGAGGGGCGTCCCCTGTGGCGACTCCGCCTGGCGTGGTCGGCTCCATCGCGCTGGTTGCTA TTGCTGACGGTTCATCCATTGATCGGCGACAACGGCACGGTCGACCTCTTTCTGGCGGCACTC GCCGATCACCTGCGCCGCGCGTCCGCTTTTCCCGTAGCACCGCTCGATGAGGCCGAGCTGGAG
GCGGAGCTGAAGTGGGGAGAGGAAGGGGAGGGCCTCGGGCTGACCGCGATCGCGCCGGTCCTG GGCCAATTGCGCGAAAGTCGGCTGAGTCCTGTGGCCCAGATGTGGCTGGACGAGGTCTGTCGC CGCCACGACCTCACCCCGCTAGAGGTCTTGGCGGCCCGGCTCCTCGATTGGACACGAAGCCAC GGTCACGGGTCGATCGCTTTGTGGACGCCGCTGCCCGAGGACCATCCGCTTCGCGATGAAGGC CGCTGCCTCCAGGTTCGCCTGCTGGAGGGGCCGCCGTCGCAGCGAGGAGCGGGCGATCCAAGC TGGCTCGAGCAAATCGCCTTGAGACGGGGTACCCCTGCAACGGAGGTCGTTTGCCCTACTCCG ACCCAACGGGCAGCCATCGACCTCGCGCTGGCCTGGCTGCCGCAGCCGCCTCTTCACGGTTTG GTCGGAACCGTTCAGCCGTGGCCGGAATCTCCATTGGTCTGTCCGTTTCCCCTCAATCTCGCG TTCCGGCCAAGCCATCCAATTGCCTACGCGCTCAAGCACGAGGCCACGCTCGCGGTCACGGCA CGGGCGCGCGATCTGATGCGTTTCCTCGACGGCTTGGGCCCGGAAAGC
(2) peptide sequence
Seq ID No 66 ( >pEPOcos6_ORF13. ep) MKYE7TGLELAVIGLACRFPGSPDPEQFWSNLRAGRSGIRHFSDAELSHIPASLRHHPHYVKA KGALDHADFEPAFFGYSPKEAEVMDPQFRLLHECC EALESGGYAPSQFAGRIGLFAAAAFND G IAGTLDRLRTGVGLSSLETAFLTLRDYLTTQISYRLDLRGPSLLVQTACSSSLVAVQLAQQ ALISGECALALAGGVCATDPLHSGYLYEPGNIYARDGVCRPFDEAGAGTVFGDGCGMVLLKRL SDAQRDGDTIWAVIRGAGVNNDGHHKVGYTAPGTRGQVALLKSVYRASRVDPATLGYLEAHGT GTALGDPIEVEALTQAFASKRRGTCGLGSVKGNLGHLNTAAGIAGLIKWLALKHREVPPTLN LRRPNPKIRFDETPFFPWELQPWPSGTGPLRAGVSSFGIGGTNAHVILEEAPPTANPAPHGR FRLLPLSAKTPAALEAKRRDLAGFLERHPETSLADLAFTLQRGREVFSHRACLAVETLTSART RLSGESSSTCWGPAPSAIFLFPGQGSQLAGMGRGLYHHFEPFRTAVDACLRELEPGLRQALS AHFDPNRGADPPDSTTFVQPLLFLVEYGVTE LRCLGVRPTMVLGHSSGEYAAACVAGVLSPS AAVSLLAERERLLRDLPAGAMLGVPLAAEALEAMLPDALDLAAINGCQLCAVSGPVAAVHAFK AQLEAAGHHARLLHTDRAFHSRLVAPVLDRFQAAVQHVELRRPQVPYLSTVSGRLEADGPANP HYWVRHLRDTVRFGPALEALPPVDSFVCIEVGPGSALSTMARETLGSQARLISLLPRPRTGQI EPGPVFERLAAL RSGLTLD SKLTGGEEGHRIPLPVYPFQRSHLSSSLAAGHTPSSRPAVES GAILAERSAGENAETRDCPLPTATLEPKAVAPAPLEATDAAGTRERLAELWRELLGLTSIGPD DHFFDLGGHSLTATRLRALIHQRFDVDLGLDEIFAHSRLSQLAARIEAAAKSRFSSIPSAPDQ DDYPLSSAQQRIHS IVTRAEVGTAYNFPIVLELQGALDRVRFEATFAALFRRHEGFRTRFVMR
DGGPRQRIVPDVAFRLPLTQVEPEQVPGRIEAFIRPFDLERAPLFRAELLQLAEQRHLLLFDM HNLIADGI SLNLFVADFAALYHGRPLAPLKLRYRDYAVWQEARLASDDLRSQRE HRRLS PP VA7LALPPDFPRPAVRRYKGRNWFHLDREIRDRLVALARTQGVTMNVMMLAL AALLHRETG QSELWGSLLGGRPHSELHPVI GLFTNFLPLRLAVEGSTRFDRFLAACHQVFLEAYQRQDYPF HLLVQELVPVRDPSRSPLFQTSLVYHNEIDGKTKLELEGLKVEWPFEKGVARLDLKLDVTPF SDRLECVLQYDLDLFCEETMRGLIARFQALVAGLVADPAQSLAAASVSGKRALRAGVATASES SPQSLPPQPSTAYATPSPQSPSPWLTGPADLPAILAAYVGQNPHPFAIHRGLILEAPLGLRA LRSALDAVLGEHTH RSVRAGDRARRVDKLELTSLVRLDDLRGLVNPQANAFTLA RDLAMPF GEGRPL RLRLAWSAPSR LLLLTVHPLIGDNGTVDLFLAALADHLRRASAFPVAPLDEAELE AELK GEEGEGLGLTAIAPVLGQLRESRLSPVAQM LDEVCRRHDLTPLEVLAARLLD TRSH GHGSIAL TPLPEDHPLRDEGRCLQVRLLEGPPSQRGAGDPS LEQIALRRGTPATEWCPTP TQRAAIDLALA LPQPPLHGLVGTVQP PESPLVCPFPLNLAFRPSHPIAYALKHEATLAVTA RARDLMRFLDGLGPES
pEPOcos6_ORF13.1 sequences:
(1) nucleotide sequence
Seq ID No 67 ( >pEPOcos6_ORF13.1. seq) ATGACGCAAGCCTCGGCCGCGTCGACGTCCCAGGTCGCGCCGGAGGTCACCCCCGGCCGAAAG GACGACGATGACGATCAAATCCGAGATGTCGGCCGTTGCTCACTCTGCGGAGAGCGGCTTCCG CGCTGGGCCACGCGTGGGCGGCGCGATGAAGCGGGGCCGGACGCCGGAGCAGGCCGGCGTGAA GCTGCTCCGCGCCCCGGTGAAGCGGAAGTGGCTGCCCCCGGCGCCCGTCCTGCGCCTGAGCGA GCGGCGTATCCCGGAGGTGTGGGCAGGCTACCGCGCGAGCGCGGGATGACCCGAGCCCCGCCC GCCGGCGCGACCATGACGCCGCCCCACGGGGCGAGTCGTCCGGCGCGCCGGCGCGCGTCGGGG CTTCCGCCGCCGGGCGGGCAGGTGCAGGATGGTCGGGCATGG
(2) peptide sequence
Seq ID No 68 ( >pEPOcos6_ORF13.1.pep)
MTQASAASTSQVAPEVTPGRKDDDDDQIRDVGRCSLCGERLPR ATRGRRDEAGPDAGAGRRE AAPRPGEAEVAAPGARPAPERAAYPGGVGRLPRERGMTRAPPAGATMTPPHGASRPARRRASG LPPPGGQVQDGRAW pEPOcos6_ORF14 sequences:
(1) nucleotide sequence
Seq ID No 69 ( >pEP0cos6_0RF14. seq) ATGGTGACGCGTCCGACGTCCGACGGCATCGAGGACGAGCTCGCGCCGTTCCCCCCGGTCCTG CGCGGCTGGCTCATCGAGGGCGAGCTCGGCCGCGGCGGGATGGGGCGGGTGTTCCGGGCGCGG CACCCGAAGACGCGGGCGCGGGCGGCGATCAAGGTGCTGCTCGGCGACTACGCCCGCCGGCCG GACGTGGTGGCCCGCTTCCGGCAGGAGGCGATCGCCGTCAACATCATCAACCACCCGGGAATC GTCCGCGTCTTCGACTCCGGCGAGCTCGAGGACGGCTCGCCCTACATCGTGATGGAGTACCTG GACGGCCGGGGGCTGCGCGACTGGGTGCAGGCCGTGCCGCCCGCGGAGCGGCCGCGGCAGGTC GTGCGGCTCGGCTACCAGATCGCC7CGGCCATGGCCGCGGCGCACGCGTCCAAGGTCGTCCAC CGCGATCTGAAGCCGGAGAACATCATGGTGGTCGAGGACGAGCTCGCGCCCGGGGGCAGCCGC GTCAAGATCCTCGATTTCGGCATCGCGAAGGTCCTCTGGGGAGGTCTGCCCGAGGTGCTGGAG CTCGAGGGGAGAGGCTCCCTCGCGCCCGCGTCCGCGTCCACGATCCGCACCGAGCTCTCGACG CGGCCGGCGCCGACGGTGGGCGCCACGACCGGCCCAGAGAGCCCGCTGGGCGCGAGCGCCACG CCAGAGAGCGCCCTGGGCGCGAGCGCCACGCCAGAGAGCGCCCTGGGCGCGAGCGCCACGCCA GAGAGCGAGGCCCACGAGGAAGACGCGCTCCGGAGCCTCCCCGTCGTGACCAGCGGCAGGCCC GCGATCCACCCCGCGCCGGTCGAGATCCCGCCCGAGGCGGTCTCCTCCGCGGCGTCGCGCGGG TCGCGCGCGTCGATCGAGCCAGGCGCGCCCGCGCCGCAGAGCGAGGGCGCGGGACAGCCCACG ATGCCGTTCACGCAAGAGGGCGTG7GGGGCCTCGGGACGAGGAGCTACATGGCGCCGGAGCAG GAGCGCCACTCCGGGAGCGTGGACGTGAAGGCGGATGTCTACTCGCTCGGCGTCATCCTCTAT GAGCTGCTCGAGGGGCGGACGCCCGACGCGCCGAGCGCCGCGTGGCCGCCCCCGATGAGCGCC GCCACGCCGCCCGATCTCGTCGCCCTCGTCCACCGGGTTCTGGCGTTCGATCCCGATGCGCGG CCGCGCATGGCGGAGGTGGCGAGCGCGCTTCACCGGCTCGGCCGGGCGAAGAAGGAGCTCGAC GAGGCGCTCTCGAGGTGGGTCGTCGGCGGAGGGGCGCCGGGGCTCTTGCCGTGCGGCTATGCT C7TCTCGAACTGGTCCTCCTGGGCCCTGGGAACTTATACGATTCTTTCCAGCCTGTAAGTGCA TT7TTCTTTCAATATCGTCCTCTC7TCATATACGAGGTGAGTTCTCTGAGGTCCTCCTATAAG TC7GGGGTGTCCTATTCGGCCTC77ACTTGTTACTTCGCCTTCTTAGGAGTTTTTCCTTAATT TTGCCCTCTTACATTCCCGTATTCATTCTAACTGGGCCCTATCTCATTCGC (2) peptide sequence
Seq ID No 70 (>pEP0cos6_0RF14.pep)
MVTRPTSDGIEDELAPFPPVLRG LIEGELGRGGMGRVFRARHPKTRARAAIKVLLGDYARRP DWARFRQEAIAVNIINHPGIVRVFDSGELEDGSPYIVMEYLDGRGLRD VQAVPPAERPRQV VRLGYQIASAMAAAHASKWHRDLKPENIMWEDELAPGGSRVKILDFGIAKVL GGLPEVLE LEGRGSLAPASASTIRTELSTRPAPTVGATTGPESPLGASATPESALGASATPESALGASATP ESEAHEEDALRSLPWTSGRPAIHPAPVEIPPEAVSSAASRGSRASIEPGAPAPQSEGAGQPT MPFTQEGV GLGTRSYMAPEQERHSGSVDVKADVYSLGVILYELLEGRTPDAPSAA PPPMSA ATPPDLVALVHRVLAFDPDARPRMAEVASALHRLGRAKKELDEALSR WGGGAPGLLPCGYA LLELVLLGPGNLYDSFQPVSAFFFQYRPLFIYEVSSLRSSYKSGVSYSASYLLLRLLRSFSLI LPSYIPVFILTGPYLIR,
or DNA sequences complementary to said open reading frames,
(b) DNA-sequences which hybridise under stringent conditions to regions of DNA sequences according to (a) encoding proteins or to fragments of said DNA sequences,
(c) DNA-sequences which hybridise to the DNA-sequences accord- mg to (a) and (b) because of a degeneration of the genetic code,
(d) allele variations and mutants resulting by substitution, insertion or deletion of nucleotides or inversion of nucleotide segments of DNA-sequences according to (a) to (c) , wherein the variations and mutants offer isofunctional expression products.
10. DNA sequence according to any of claims 1 to 5, wherein the DNA is selected from the group consisting of (a) the following DNA Sequence:
Seq ID No 71 (>Contig43)
CGGGTATTTGTGATATGTGGGCNGTAGTCGTATGCTTCATTAAGTACATC CGTCCGTNGTAGAGAGTGACTCTG7CGCAGCGATAATAGACACGCTTGTG ATGCTATAGGGAACATAGAGTCNTAGTAGATGATACGACGAGATATTNGT ATAGAGCGTATAGACCGACGTGTGAGCGTCATAAGTGTTGTGTGTCATGA GTG7GCTCAGAGGACGTGCAGACA7TATATGAGCAGATGATGAGAGAGAA TCAATGCTGCAAGNTATTCGTCGAATCTACATTATATCGAATCGTGTATG TGCG7TTGTCGCAGCGCGATNCGA7GAGATACCGAAAGGGTATGTATCTA TNT7CGTGACGCTCGATNAGAGCAAATCCGCTACCGTGGAGATATCGTGT ATCGACTCCATCACGATCAGTATCATGATACGTCAAACGAGTACACTCAT TAT7GATAACACACGTANGTGTGCATGCACAGTTATCGAGTGTATTGTGT GCA7GAGAGGTATAGGATNTATAGGCGAGCATATATATCTATATATATAG GTTAAGAGTAGAANACTATGAAGATGCAGGAAGTAGTATCTCGCGGACAA ACGGNGTACCTAGCGGGGTTGAAG7ATTATCGACAGTGTATAACGACTCA ACAG3GNTACGAGGTACATTGTAT7TACAGTGGTTGGAAGGATTGCGCGA GGAAAGGTAGTGGTACCGTGTGAGCTACGATGCTCGGGATAATGGTGATT AGA7AGAACCTTAGCGTTGCTAGATGAGTGAGTGGTGGTATGAGTAGAGT TTT7GTTCTAGCTTTGTGTCCAGCGAGGATTCGTTCAGTCTGAAGGGTAA
GAG7ACGTCCATCGCACACCCGACCGTTTTGAGGAGTTCTCGGTGCGTGG TCAGTGGGGTTTGGAGAAGACAGAGTTGATTCATAGGGTTATCAAACGAG TTA7GTGGATAGATGGTAGTGACCCCATTTGAGTGAGAGTGTTGGCGTTA ACANCAGCAGGATNTAT
SEQ ID No 72 (>Contig44'
TAG37CTTTGACACCATGGGAGC73CTACCGATGTTGCCGAGCACGATCG CGCZ3GCGCCGACGAGCGACTGCAAGCCGGCCGCGGCCATTTACGCCTGA CGA3CGAGGTGGGCGAAGTGCTGG7GCGCGCCGTGCGTGTCGAGCGCGCC CAG3TCCGCCGTTGCGCCGTCGCC3AGCAGTAGCGCGCCGTCGAAGACGA TCACCGCGATCGAGGTCAGCGTCGTGGGGGCGAGGCCGAGGAGCGCGAGG ATGCCGAGCACGACGCCGGCCGCGCCGTAGACGAGCTTGGCGCCCATGCC GCCGCCGAGCTCGGTGCGCGTGTCCCACTCGACGGGCGGCGCGGTGCGGC TCAACGCGCCGAAGCGCGAGGCGATCGCGCCGCCCTGCGCGATCAGCGCG GCGCCGAATACGATCGTGGCGATCTGAGGTGAGCTCTACTGGCATGATCC CCGTCAGCCCGAGGATGGTGAGGACAATCGTCGCGGCGCCGCACAGTACC TCGCACGAGCGAGCCTCCGAGCACGACCTTCGGCGTCGTCTCGTCCTTTG GTCTGCGTCGCGCGCCCGAGTGCGGCGTTATGTGGCTCTCCGGCTGTGCA AACCGTTCACGTTCTTCCGGTCCTGGAGTCAGCATCGGCATGATTCCCCC G7CCTGCGGTGAGGCCTTGTCGCGCTCACGCGCGCTCCGACTTGCACGTG CTGTGCCGGGTTCTCTCGCTCAGGAGGCGCCTCTCTTGGTGGTGCTTGCG TCCTGGTCCGTTTGCCCGCCTGTGCGGTAGGTTTCTTGAACCAGGTGACC TTCAGGGACCCCTTGATGCGCTCCATCGTGTCCTATGTCGATCCTTCTCT GACTTGTATGGGTCTCGAACCAACTACGCTTGATCAGGCCTTCGAAGGGT CCTTTGGGAGATCGACTCTGGATCCATACCGGGAGCCCCTGTTCTGCCGC TCTCTTAAGTTTCCCCTTCTGTATCCGTGTCGACCGGAAACGCTTTATCT CTAATGCGCTCTAATTGCGTCTCTGCCACACGTGCGCTTCACTCTGGATC TACTTCTTCTCCCTAGTCTTCTACCTCCGTACCCTTATTTGTTGGTTCTA TTTATTTCTTTTCGCTTCACCTCGCGTCATTGTCGCCTAGTGTTCCTCCC TCATATCGCCTTTGGTCTCCCTCGAGCGTACAGTCCTCTCTCTTCAGATG CT7TCCGGCTCCTCTTCTGCTGGCCCCTTATCCTTTCTAATACTTC
SEQ ID No 73 (>Contig48) ATGCGCCCAGGAACACCCCGGTGCGGCTGCCGTCGAGGGACTGGGGTGCG ATGCCGGCGTCCTCGAGCCCTTCCCAGGTGACCTCCAGCAGCAGGCGTTG
CTGAGGATCGAGCGACCGCGCCTCCCGAGGCGAGGTGCCAAAGAACGCGG CG7CGAAGCCGTCCACCGCCTCGGTGAGCAGTCCGGCCCAGCGCGGCACC TCC7CGCTGGGATGGACGCCGACCAGCGCCCAGCGCCGGTCGAGCGGCTG GACCGCGTCTCGGCCTGAGTCGAGCAGCTCCCAGAATGCCTCCGGAGTGT CCGCTCCGCCGGGGAAGCGGCAGCCAATGCCTACGATGGCGATCGGCTCG GTCCGCTCTTGCTCCAAAGACGCGTTCTTTTTCGCAAGCTTGTCCATGAG CAGAAGGGCATGCTCAAGCTTCCCGGCATTCGTGGTCGCCATACTCCCTC GGTCCCTTACTCACCAACGATCTGCGCGAGCTGCGCCAGCTTTTCGGCGA GCAACGCGTCCTTCTGCTCGTCCGTCATGCCCCGCAGAGCCTCGAGATCT GCGGCATCGTTCTCGAAGCTCTTCTCCCGCTCGGTGGCCGGAGCGTGGGT CGCGCCGGCATTCGGAAACAGAATGTCTAGCAAGCTCCCGCTCAGAGCTG CTACGTTAGGGTAGGTCCATAGCAGGGTCGCCGGCACGGTGATGCCGAGC GCGGCCTCGATGCGGTTGCGGAGC7CCAGGCCTATCAGCGAGTCCATGCC GAGATTGCTGAACGGCACGTGCCGCTCGATCCTCTCCGGCGGAAGGCGCA GCCCCCGCCCCAACAGCTCGCTCAAGTGCTTCTCCAGAATCAACTGACGA TCTTCGGGCCTGGCGCTCTGCAGCGCCTCGCGCAGGTTCGACGCGTTCGA CGCGCCTCGGTCGGCGCGGTCACGCTCCTTCAGCAGCTCCGCCCACAGCG CCAATCGGGCCGCGTTGGGATAGAACTC
SEQ ID No 74 (>Contig49)
ACCACCGCTTCACTCAGTATGTACTTTGTTATACTCGTCTTAGTACAATG ATATAATACTCATGTGTATTCTTAATCTCGGGGAGANAAAATTGGAATAC TGGACACCGTTGCCGCATGCNGACTCTAGAGATCCCCCTGCGACGGTATC CCACGGCACCGGTATGGCCGGCGCGCGCTCCGGGGGTCAACGCCCCGTGG TTGCCTTCACGACAACGCCGGTCGGGCGGGGCGCCGTTCGATGCCGCGGG CCCGCGCGCGGCGGCGCGTTATCCTGTGGAGCATCTGGAGGGCGCTCACG CACCTGTCAGTCTAGTTCTGGCCCGCCCGGAAGGAGTCCGGGAGGCCGAA GTTGAACCCGATGTAGAGCGCGATGAACGACGGGAGCACGCGCGCGGGGA TGTGCAGCGCGGCGCCGATCGGCGTCGCGAACAGGACGAGCTCGCCCGGC ATGCCGGGCACGACATACCCGAGCAGAAACACGATCGGCACCACGAGCGT
GAGCTCGAGCAGCGATATTTCATGACCGACCGCGCGGGCGGCGGCGCCCG CCATGACGAACACGCAGATCAACGTGCCGTTGACGTTGAGCCAACCGCCG AGTCCCACCACGAAGAGCCTGAGC7CCTGCGGCACCGCCGGATAACATTT GCGGACGAGGTGCAGGTTGAGCGGCGTCGCCAGCGCCTCGCTGCACGAGG CCCACAGCAGCGGATAGACCTTGAGCCAGTAGTTGACGAAATAG7CGCGC AGCGAGAACTCCGGGGCGGCGGCC77CATCCGCAGCAGGCTCGCCGCATG GAAGACGAGGCAGGCAGCGCCGACGACCCCGGACACGAGCAGATAGGAGA GCATGAGGTCTCCGGCGCCGAGCGAGGGCGCCCCCGCCGAGGCGCGCGCG AGGTCCCCCCCGTGCACCTGCGCGGCGAGCTGCGCGGGCAGCCCGCGGAG ATAGGCGCCGAGCCCGAACATGAAGAGCGGGACCAGGCACTGCACGGCGC CTCCCGCGCGCTCCAGCGCGTCCGCGGCGCGCTCCAGCGCGCGGGCGACC CGCGGCGCGCGTACGGCCGCGAACGACGTCACGATGCCGGCGTAAAGGGC GAGGAAGCACGGGCTCGAGATGACCAGGCCCGAGGCGCTATAGAGGGTGC GCGCGGCCTCGAACGGCGCGCCCG7GCTGTGGCTGGGGAGCAGCGGCAGC CCGAACACGAGCCATGTGACGACGAGCCCGAACAGGCACGCTGCCAGACG CTTGAGGGCGAGCCAGCCCATGA7G7ACGCGAGCAGCCGCCCCGGGCGCC CTTGCCGGTGCAGGCTCACGAAGG7CGGCACGAGGACGACGAAGATGACG ACCGGCGCCAGCGTGGTGTACCAA7GCAGGAGACCGTCCATGGCGCGGGT CGACCACCGCGTGACGCTGGTCTC7CTGTCTGACTCGATCATGGCCCATT CGCCTAAAACTAATGATCCGTTC7CAAATTGGTCAAAAAAAAGTTCCCTT AAGACTGTTTTACTCCGGAATATTAATATATTTCTGAGTGTGAGGTGATG TTAATCACACATTCTGATATTCTCAAGGGGAATCCGTGTCATTGTGAATA CTTCTCTCTCTACAAGAGAGGTTA7ATATGGTCTCGAATATCTCGTCCGC TCTTATATATATTCTCTTGTGATAATATATATCGAGTGTGGGTACTCAGC TCTCTTGGTGTAATCTATAACTCGGCATCTCTCATAATACCTTATATATA CACACTCTCTCGGTCATATCTCGCA7AATAGATATATTTTATATGTTCCG CGTTTTATCCGAGTGGGATACAC77TTTCTATATTTTCTTTGGTGTGACG CGTGGCGTCGAGCCTTATTATTGA7TTGGTAGTCACGATATTCTCTAGAT GACATCATACAGATGCTCATAAC7CGATAAACACAGGTCGTACACGACGA GACTCTCACTCTCACTCTT
SEQ ID No 75 (>Contig5C
TCCCCAGT77CTCCTCTCTACGCN7ACATCTCAGCAGGAAAAAANATAAT
GGAGAATCGTTGCGCTCTAGCAGCA7CTATAGGATCCCCGCTGCTCTTCT TCA7GCACC7CGTGGAGCAGAAG77CATCAACGCCTTCGCGATCATCGTG GCGGTGAGCTTCCTGGCGTTGCTCCTGTCGCTCGTCGTCGCCGACGTCGC GACGCGGAACACGTTCCCGCCCGCGCCTTTGCCGGCGCTGAGCCCGCCGG CGCCGGCGCTGATCCCGCCGGTGCCGGGCGGATCCGTTGGGCCGTCGCCG GAGCCGCTGTCGGTCGGCCGGTGATCGGTTGTGCGGGCGCCGTGCCTCGG GCTTACTACCCCCTCTCGCGGGTGGGGATATGGCCGTGGATGAGGGAGGC GATGAAAATCGTGATCGCCACGTGCGCGTTGTTCTAGATCGTCCCAGGCT GACCGTCGGGAGCGCCCAGCACGAGATGAAGAGCCACACCGCGAGGACCG TG77GAGGTACCGCACCGCAGGGGCGAGCATGGCGGTGATCGCGAAGATC ATGCAGAGCAGCCCGAGCACCCATGTGTTCGTCCGCTGCGCGTGGCTGTG CGGCCAGATGACGGCCGAGATGAGGAGCCAGAACCCGAGGACGACGTTCA CGA7GCGCGCCATGAGATTGCCAGCTCGAACCATGCTCCCTCCCACCTCC GA7CATGGGACCGATCGGGTCGCCACGGATCGATAACGGGCGTCAGGAGA CCG7CAATCGGCGAGCTCGTGAGCCATGGCGACAGCCCGCCGACCGCGCC GGCGGGTCTCTGGCCTGCTGGTCGCCGTGCCGGCGGCGGCGATGGGCCTG CCTCCGGTCGGGCCGCGCGGGCGCGCGGCGCTCCGCGCGAAGCTGGAGAC GCGGGACATCGCTCCCTCGCCCCGCGCTCGGACGAGCGGCGCGAGCCACT TCTCGACGGCCGAGCGGGCACTAAGCTTCCGTCATGAGGCTCGGCGCACG GCTCACCACGCACACGTTCTCGGCCGGCGCCGCCGGCATCAGCTTCGTCG TCCAGCCGATCCCGGGCTCGGACCAGCTGTTCGTCATTCCGATCCAGTAC CTGCTCGCGGCGTCGCTCGCGAAGGAGCGAGGCGCGCCGCTCTCGAAGGC GGCGTGGTCCCAGGTCCACCAGCTCATCTGGGGCGGCGGCGCGCTTCGCC TCATGCTCGGCTTGACCCTAGGGCTGATCCCGCTGGCCGGCGCGTTCACG AACGCGATGACGGCGTTCCTCACGACCGAATATCTCGGGTACTACGTGGA TAGAGCCCTCGACAACCCGGACAATCCGCCTCCGGCCCTGTCGATCCAGG ATG7CTTGGACGCCATCACCTCCTTCTTCACCGGGCGAGCGCGGTAGGCG
AGCGGTCCCTGGGTCGAGCCCACCCTGCGGCTCTAGGAGCCGAAGGGCGA GC7CCTCGGGAGCGGCGCGGCGTCACCACCAGATTCGCCGGCGCTTGCGG CCC-GAGCGTATCGCGACCGCCGCCACCGCCGCCACGGCGAGCACGGTGAC CGCGGCGGCCGCCGCGATGGCGACGCTCCGGGCCGTGTGCTCGGCCTCGC CC77GATGCCGCCCCACCTCGCCTTGACCGTGACCAGGTCGCGCGCGAGC CGCTCCTTGCTGTGCTCGATCTGCTCGGCAGGCCCCGCGGGCGCCACGAA ACCTGCGCCGGCGCTGACGTGACCTCGCTCGCTGAGGGGACTGGCGACCC TCTCCGTGTCGAACCGGATTGCGCTGGATGGATCCATATGTCCCGCGCTG CAATCGTTGCTCCCCGCCGGCACATCGGAGTGCTCGCCGGATCGCGCGGC AGCGCCGACGCCGTACTTCCATAGGATAGCCCACCCCATCGGACAAGCCG GCTCCTGACGGCGGGCACCGAATGTTCGCCAGACGGGCACAAGGCGCACG CCGCGGACGGATCGGCCGCACTGGCACTCCAGAGCGCATCGACGGATGGC CGACGGATGTGCAATGAGGCGCCCGCACGAAGCGAATTGTCCCGAATACA GCGAAGAAATCTATAGCGATGCGAGCAGAAGGATATGTCTATGGGGGGCA GTCAGAAACTGGGGACAGTCAACGCACATATTCTCTCCAANTGCTAACGA CAGCGTGCGCAGAGGAAGTATCCTACTAGTGTAAGAGGGACATTCGATGC GACCGCATAAACATTCAGTCTACAACGCGTGAGAGGATGGAACACCCCGC CCCTCTGAAGGCTAGACAACCATGAATATGTGCAGAGGAAACACAGAATT CCAAAGGTGAGAACATATGTAGGATCGCGCCACCCGAGATTGAGTGAAGA TATACATATATACTTATATGGATCTACAACATGGCGAACCGAACGTAGCA NAATAGTAGATATAATTGTAATACTGAGCTACCGACAGAAAGATACACAC GAGTGTACACACATCACACGCAGAGTGGTACCAAATTCACACCATGCGAG CCACAATGTGACACGGAGGAGCACAGCATGGGCGCCACTATGGAGGAGAA ACTACTGCAACCCACATCTGATGGACTGACCGCACGGACGGGACGTGTCT AT CATACAGATACATCNGATGGAGGAAGATGCATGTGCGATGATATCAT CGTCGCAAACTCATATGTCGAAGAAGATATGNGTCAACTCAGCACTACTC ACACGATACGTGAACAGGAGTGACTAGGACATCNCATGGTGTGTCGGCGC GTGCACGTGATATCAAACTCTCTGATCAACCACACACTATATAAGGAGTA TCGAGCGGCGATGGAACACCCCCTCACAGCATACGTATATGCACAACGTC TGAACACTCTNGAGACACAGTGGAAGG
SEQ ID No 76 (>Contig51)
GA7CCAGTTACGCCCCGCCGCCTCGGTCACGCCGGGGTTTTCGGCGTCGA
CCG3GGACGGTCGGGGCGACAACCGGGGGTTGTCGTCAGCGGTTCGCGTG GA73TGCGCGACGAAGTTGTCCGCCGCCGAGTCGTCCTGCGTGCCGCCGC GCTGCCGTACGAGCAGCCCGAGGCACCGCGCGAGCTCCTGCCGCTTCCGA TGCGCGGCGGCCTGCTCGGCCTCG7GCTCCGCGATTCGGCGCTGGCGCCT GCTCTCCTCGGCGCGTTCCTGCGTCCGCGCGAACAGCACCTCCTGCACCT CCGCGAGTTCCTCGGCGGTAGCGACCCGCACGTGATCCGCACGTCCTCCG AGACCGGTGACGACGTCAACCGCGGAGTAGTACGCCTCCGCGCGGACGAC GTCGTCGAGGTCGAGTTCAGCGTCGGCTCCGACGACCAGGCGCATCGCGT CGGAGGTGAGGACGACGCCACCGTCGAGCGGCGCGGCCTCCGGCTCGTAG ACGTACTGGAGGTCGGTCTCCTCG7CGTCGTCACCGTCGAGGTCGACGTG CCGCCGCAGGCCTCGGGTCCACTCGATCGCGCGTCGTCCGGCGAGGGCTT CCTCGTACTGGGCCCACCACGCGCGCAGCTGCTTCGGCGTGCCGTAGCCC TCGGCCATGTCGGGGTCGAGCCCGGCGACCTCGATGTCCCACAGTCGGTA GAGGATCTGGAACGGCGTCATGGAC7TCCGGCCCCGGCCGGTC7TGGAGT CCAGACGGGCGGTCTCCATCGCAGCTGCGCCGGCGGCTTCGAGGTCCTGG TCGACGGAGTCGGGCCGCTCTCGCTTCCCGTCCTGGTTCTTGGTGAGGTA CTCGATCAGCGCGACGTCGTCAGC7GACCGGACGATCGAGACCATCACGC CGTGGCCCTTGCCCTTGCACTTGCAGCCGGGGGTGTCGCAGTCGGTCGAG GGCTCGAACTTGGGGTCAGCCCGC7TGAGGGCGCCCGCCCACATCTCCCG GAGCCAGTCCTCCCAGTCCCCCAGGTCCGTCTCGGAGGGCTCAAAGTGTC CGACGACGTCACCCTTGGCCGGGGTGCCAGAGAGCTCGCCGCCGAGGAAG ACCAGCAGGTTGAGGTGGGGGTGGTAACCGTTCTTCTTGGACCGGGTGAC CTCAGCCGCGCGGACCATGCCGA7GTAGCCGATCCGGTGGCGGATGCCGT CCTCAGCGGGACGGACGTACTGCGTTCCG7CCTTCCGGGTGCGGCGGGCC TCAGGGCGGCCGTAGAAGGCCGGGGCCGTGAGCATCCGCTGGTAGGCACC GGGCGCGCGGCGGGGCTTGCCCGACCGGTCGAGGACCGGGGCGCCCTTGT CGTCCAGGAGAGGCCCGCCCCAGAGCGCGGCGACCAGGCTGTCGAGGTCG
GTGGTCTGGTTATGCCGGGCGGTGAGGACGACAACGGCGAGCGTGCCGCC GGCGGCGAGGTGCCGCAGAGCACCGGTCTTGATCTCCTCGGTCCGGCCAC GGCGGATCGCGGAGGAGCACTCCGGGCATAACCAGATCCGCCCGCAGCGG ACCAGGCCGATCGTGACGACGTACCCACGGCTCGACTTCGCGTAGATCAC GCCGGTGTCCGGGTCGAGGACCCG3CGCCCGCACCCGCCGCAGGCGTCGA TCCCGGAGACCCGGTTGAGCACC77GCGGCCCTGGTAGCGGCGTACGGCA GCGGTCGCCGCGCGCCTTGTGGTCTGTTCCGAAAGGGCTGCCGCCCTCTC GGACTCTCCCGTTCCTCCCACGACTGCCACTTCCGCAAAGTCGCTGGTCA GTGGGGGGTGGGAAAACTCTGTCAACCCTTTACCTAGGCGTCCCTTTTTG CCAGGGGCGGTCTCACGGGCGGCCTCGGCGGCTCGGTCGGCGGCCTTCCG GGCCCGCGCGGCCCGCTTCTTGCACGCCTCGGAGCAGAACTCCTTGGCTC GCTTGCCGGGGGTGATCGTGAGGGCGGCGCCGCAGCGGCAACGGGGGCCG GCGGGGACGCGGGCGGGCGACTGAGTCGGCGCGCCGATCAAAGAGGGGGT TGCGGACGCCAAAGCGTCCCTTAC3CTGGACACAGACGAGTACCTTGGTT GGTAGCCGGGTGGACGTCAGAAGCGGTCAGGGATTAGGACCCCTGGCCGT TTCGCTTTTTCTGGAGTTGTTCGGG7AGATCCTGCCGCATCGCCCGCCTC ACGCGTGGCTCGCCGCGCGGATGCCTCAGAGGCCCCACCGGTCGTCAGGA CGCAGACGTCGGCGTGCTCCTGGTGGTGAGTCACCAGCTCGACCACACGG GCGCGGCCGGCGACGTCTC
SEQ ID No 77 (>Contig52)
CGGGATCTGGCCTTCATTAACCAACGACGGGGCAAACATAATAGGCTGGG CATTGCGCTTCAGCTCACCACAGCCCGTTTTCTGGGAACATTTCTGACGG ATTTAACTCAGGTTCTGCCTGGTG77CAACATTTTGTCGCGGTACAGCTT AATATCCACCGTCCAGAAGTTCTC7CCCGCTATGCTGAACGGGACACTAC CCTTAGAGAACATACTGCATTAA7TAAGGAATATTACGGCTATCATGAAT TTGGTGATTTTCCATGGTCTTTCCGCCTGAAGCGTCTGCTATATACCCGG GCGTGGCTCAGTAATGAGCGACCGGGTCTGATGTTTGATTTTGCCACTGC ATGGTTGCTTCAAAATAAGGTAT7ACTGCCCGGAGCAACCACACTAGTAC GTCTCATCAGTGAAATTCGTGAAAGGGCAAATCAGCGGCTGTGGAAAAAG
CTGGCCGCACTGCCGAACAAATGGCAGGCAGCTCAAGTGATGGAGCTTCT GGTCATTCCGGAAGGTCAGCGTGTATCAGCACTGGAACAGTXXXXXXXXX XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGCTGGAACGATAT ATCCGATTACGAAGTCTTGAGTT77CCCGACTGAACTTTTCCGGTCTGCC TGCCATTCAACTGCGTAATCTGGC7CGTTATGCTGGCATGGCGTCGGTAA AATATATCGCTCGAATGCCACAGCAGAGAAAGCTTGCTGTACTTACTGCA TTCGTTAAAGCACAGGAAATAACGGCATTAGACGATGCCGTTGATGTGCT TGATATGCTAATTCTGGACATTATCCGCGAAGCAAAGAAAACCGGGCAAA AAAAAAGACTCAGGACACTGAAAGATCTTGATCAGGCCGCATTGTTACTG GCGCGGGCATGTGCATTGTTGCTGGATGATAATACAGATGTCCCAGATCT CAGGCAGGTTATCTTCAAGTGCGTACCCAAAAACAGACTGGCAGAATCTG TAAGCAAGGTTAATGAACTTGCTCGTCCACAGAACAXXXXXXXXXXXXXX XXXXXXXXXXXXXXXXXXXXXXXXXXXAAACGTTTTCTTCCGGCGGTGTT GCGGGACCTGCATTTCCGTGCGGCACCGGCAGGTGAACATGTACTGGCTG CGATTCATTATCTGGCAGAACTGAATGGTTCGAAAAAGCGCATCCTTGAT GATGCGCCTGAACATATTATCACCGGTCCCTGGAAACGCCTCGTATACGA TGCGGAGGGACGGATACAGCGTGCAGGTTATTCACTATGTTTGCTGGAAC GCCTTCAGGATGCACTGCGCCGCCGGGACATCTGGCTTGAAAACAGTGAT CGCTGGGGAGATCCTCGCGAGAAGTTGTTGCAAGGTGAAGAGTGGCAGAC TCAGCGTATTCCTGTCTGTCGGGCACTGGGACATCCTGTCGATGGACGTA AAGGTGTGCAACAACTGGCTATTCAGCTGGATGAGACCTGGAAAGCCGTG GCATCACGATTTGAAAAGAATGCGGAAGTTCATATCTGTAATGAAGGTAA ATATCCATCCCTGACTATCAGTTGTCTGGAGAAACAGGAAGAGCCACCAT CATTGCTTCGTCTAAATAATCGGATCAAACAGCTACTCCCACCGGTAGAT TTAACGGAACTGTTACTTGAGATAGATGCCCAGACAGGATTTACACATGA
G7TTGCGCATGTCAGAGAATCTGGTGCTCGAGCGCAAGATTTGCACATCA GTTTATGTGCGGTATGAATGGCTAAGCCCTGTAATATGGGCCTGAACCCG TTGATAAAGCACAATATACCAGCATTGACCCGCCATCGGCTCAGTTGGGT GAAACAGAATTACCTTCGTGCAGAAACGCTGGT
SEQ ID No 78 (>Contig53)
ATTCCACGCGCTCACGGTCAGCTTCGACCCGCGCGAGCGCCCGGCGGCCG CC7CGCAGAAGCGCGCGGTCACGCTGTCCGAGCTCGGCGCGGACGCGCAG GCGCCGGAGTGGCCGTTCCTCGTCGGCGACGAGGCGGCGACCCGCGCGCT CGCCGAGGACCTCGGGTTCCGCTACGCCTACGATCCGACCACCGATCAGT ACGCCCACCCGGCGGCCGTCTTCGTCCTGACGCCGGACGGGCGGATCTCC CGGTACCTGTACGGGACGGAGTTCCCGGCGCGCGATCTCCGGCTCGCGCT CCTGGAGGCGAGCCGCGGCGGTATCGGCACGATCGTCGATCGGGTGATCA TGACCTGCTATCGCTTCGACCCGGCGAGCCGGAGATACGCTCCGTTCCTA CTCGGCTTCCTCCGGCTCGGGGCGGCGGCCATCCTGATCACGGTCGGCGG GCTGCTCGCCGTCCTGTGGCGGCGCGAGCGCCGGCGGCCAGGTGCTCGCA CGAGCGCCGCCGTCGGTCGTGACGCCGTGGCCGACCGCCAGGGGAGGTCA CCATGATCAACGAGCTCCTGCGCAAGCTTCTTTTTCTGTCCGGCCAGTGG TCGACGATCGTGTTCGACATTTACAAGCTGCTTTACTTCGTGATCTCGGT GACGATGGCCGGCGCGACGCTCGTCGCCCTGTTCGCGGCCTACCTGATGA TCCGGTACCGCAGGCGCCAGCGGGATGTTGAAGGCCCGTTCCCCGGAGCG ACCGCGAGGCCTCCGCTCCTCCTCGAGGTCGGCATGGTGCTGGGCCTCAT CGTCCTGTTCCTCGTCTGGTGGGTCATTGGAATGCGGCAGTATGCAGAGC TCCGCGTCGCCCCCGCGGACCCGGTCGTGGTGTACGTGACCGGGAAGCAG TGGATGTGGAAGTTCGCCTACCCGGAGGGCCCGAGCTCGGTGGCGACGCT CTATGTGCCGGCGCGTCGGCCGGTGAAGCTCGTCATGACGTCCCGGGACG TGATCCACAGCTTCTTCGTCCCCGATTTTCGCATCAAGTACGATGTCGTC CCCGGCCGCTACACCACGCTGTGGTTCGAGGCGACCGCGCCGGGCGCCTA TCAGATCCTGTGCACCGAGTACTGCGGGACGAACCACTCCACCATGCGCG GCGAGGTGATCGCGCTCGAGCCCTCCGATTTCGCGCGGTGGCTCTCCGAC CGCGGGCGGGGCGCCGGTATCGCCGGACAGGAGTACACGCCGCCGTCGAC GCCGGGCGAGGGGATCCCGCGCGAGCCGCTCAGCCTCGTCCGGCTGGGCG AGAACATCGCGGCCGAGGAGGGCTGCCTGCGCTGCCACACGCCGGACGGG ACACCGCACATCGGGCCGACCTGGGCCGGCCTCTACATGTCGGTCGTCCC GCTGGAGAGCGGCGGCGCCGCGGTCGCCGACGACGCGTACATCACCGAGT
CGATGATGGATCCGCTCGCCCGGATCCACCGCGGCTACCAGCGGGTCATG CCCTCGTTCCTCGGCCGGCTCCAGCCGGCGCAGGTCGCCGCCATCGTCGA GTACATCCGGTCGTTGAGGGGCGTCGCGCCGGAGCCGGGCGCGCGGACGC CGCTGCCCGAGGGCCCGCCCTTCCTGCGCTCCGGCCCGGAGCGCCCCGCC CCGCTCAGCGGGGGCGCGCCGGTCGGCCCGATCGAGGGCGGCAAGCCCGG GGAGGAGCTCCGATGAGCACGGAA3CGTACGAATCTCTGCCCGACGCGCC GGCCGAGAGGCCCGAGCCCCGACTACCTCCATGTTTACCGCGGGGTGACG GAGTGGCTCACGACCACGGATCACAAGCGGATAGGTCTCATGTTCTACGC CGTCATCGTCGGGAAAGCTTCTTCCTCGGAGGCATATTCGCCCTCATCAT GCGGACCGAGCTCCTCACGCCCGA3CGGACCATCATCGACGCGGCGACCT ACAACCGGATGTTCACGCTGCACG33GTGATCATGGTCTGGCTGTTCATG ATCCCGTCGATCCCCAACGCGTTCGGCAACTTCGTCCTGCCGATCATGCT CGGCGCCAAGGACCTCGCGTTCCCCCGGATCAACCTCGCGAGCTTCTACA TCTACCTCCTCGGGGCGGCGATCG33ATGGGCGGCATGATCGCGGGCGGC ACGGACACCGGCTGGACGTTCTAC33GACGTACAGCCTGAAGACGCCGAT GACGCTGTTCCCGGTCGTCTTCGG3ATCTTCATCGTCGGCGTCTCGTCCA TCATGACGGCGGTCAACTTCATCG73ACCACGCACACGATGCGCGCCGAG GGGCTCACGTGGAGCCGCCTGCCG37CTTCGTCTGGAGCACCTACGCGAC GAGCATCATCCTGCTCTTCGCGAC3CCGGTCCTCGGGCTCTCGATCCTGC TCATCGGCATCGACCACGTGACCGCGCTCGGGATGTTCGATCCCCGGTTC GGCGGCGATCCGGTCCTCTTCCAGCACCTCTTCTGGTTCTACTCCCACCC CGCCGTCTACATCATGATCCTGCCGGCGTTCGGCGTGGTGAGCGAGGTCG TCTGCACGTTCGCGCACAAGCGCC3CGCGTCCTACTGGGCGATCGCCATC TCGTCGCTCGGGATCGCGTTCGTG3GGTTCTGGACGTGGGGCCACCACAT GTTCGTGGCGGGGATGAGCGAGTACGCCGCGGACGTCTTCGGCGTGCTCT CGATGTTCGTGGCCATCTTCTCGGCCATCAAGGTCTACACGTGGGTCGCG ACGCTGTACAGGGGCTCGATCCAC77CAACACGCCGCTGCTCTACTTCAT CGCCTTCCTCTTCCTGTTCGTCT73GGGGGGATGACGGGCGTGGCCGTCG CCACGCAGTCGCTGGACGTGCAC73GCACGACACATACTTCGTTGTGGCG CACTTCCACTTCATCATGGTGGGCGGGACGCTCACCATGTTCCTCGCGGC
GGCGCACTACTGGTTTCCGAAGA737TCGGGCGCCTCTACTCGGAGCGCG TCGGGCTCCTCTCGGCCGCGTCGG7GTTCCTCGGCTTCTTCTTGACCTTC TTCCCGCAGTTCCTCCTCGGGAACATGGGGATGCCCCGCCGCTATTACAG CTACCCGCCGCGCTACCAGTGGC73CACGTGCTCTCGACCGGCGGCGCCT ACCTGCTCGCCGCGGCGCTCGTGA7CTCGCTCCTGAACCTCGTCATCGCG CTCAAGTGGGGCCGGAAGGCCGGGAGGAACCCCTGGGGCGGGCGCACGCT CGAGTGGATGACCGGCGAGCCCTTGCCGCCCAAGCACAACTTCCCGGTCG CGCCGCTCGTCCGCCGCGGCCCGTACGAGTTCCAGCTCTCCGAGGAGGAC GCCCGTGCGACAACCACGCCCGCTGCGTGAGCAGTTCGAAGATCTCGAGA AGCAGACGCACGCGGCCCGCCTCGGGATGTGGTTGTTCCTCGGGAGCGAG GTGCTCCTCTTCACCGGGCTCTTCGCGCTGTACGCGGCGTACCGCGAGCT CTACCCGCGCGATTTCGCCGAGGCCATCGCGCACAACAACGTCGCGATCG GCACCACCATGACGCTCATCCTGA7CGGCAGCAGCTTCACCGTCGCCATG GCGGTGCACGCCGTCCGCGCCTCCCACCCGCGGCGCGCCGCGCTGTTCCT CGCGGTGAGCGTGGCGATCGGGATCGTGTTCCTCGTGCTGAAAGGGATCG AGTACGCGCAGCACTTCCGCGAGGGCATCTTCCCGGCCGGCGCCTACCGC TTCGCGGAGCTCCCGACGTTCGGCGCGCAGATGGCGTTCACGCTGTACTT CGCCATGACGGCGCTCCACGCCC7GCACGTCGTGGGCGGGGCCGGCCTCC TCACGGGGGTCGCGTGGGGGTGCTGGAAGGGCCGGTACTGGGCATACGAC CAGACGCCGGTGGAGCTCAGCGGCC7CTACTGGCACCTCGTGGACATCAT GTGGATCTTCATCTGGCCGCTCC7CTACCTGACGCGCAAGTGACGGCACA GCCCGGAGACGACCATGCCGCAAGAGCACGTCGCGGAAAGCACGCCCTGG ACCCGTTACCTCATGGACGCTGATGGCCCTCATTGCCCTCACGCTCCTGT CGTTAGCGCTCTCGTTCTTGCGCACGGGGGCTTGGGAAATACCGATCGCG CTGCTCCTCCCCGTGGTGAAGAGCG7GCTCGGGCTTGCTTTCTTCTTGCT CCG7CGAGGTGCATTAAGGTCATCAAACGCCTTTTTCAAACATTGGCTGC TGGTGTACTTACTTGGACGTTTCA7ACTGTCTCTTTTATGGCCCCCAACG GTCATTACGTCGACCTACACAATT7CTCTTCCCGCACCCTCCTACCTGTA TCTCTAAGCACTGCCTTGCGTCCTGCTCTATTACATTCTACTCCGGCTGT CCATGTGTGGGATTATATGCGCGAGGTACCTATTCCGCCGTGGAGTCTCC
ATTTACCTCTTGGACCTTGCCCG77CTGAATCTCGATCTCCTCATGCGTT GGTCCACCAiGTATTACCTCCTAGAATCTTATACTCCATATCTCTATATA TCTAGTTGTGCGTGTAATTGTGTCA7ATATTATCGCCACTGCTGTATGAA TACCGTGCCGACGTGCTATATACGAAANTACTCCTCGGTCGATATCTCCA CCTCATATATACCTCCGAGTGTAG7ATACGCACGAGTGTATATACTCTTC CTCTGGTCACGCGACTTCGTGCTGATATGATACCATCGTTCCATGTTACG CGAAGTTACTCATAAGATCTCCTCACACATCAACGAGTGTACTCCTATGT GTTTCATACAAACTCGATACCCTTCAGAGTAGTGTCATGCCTATGTGGTA TGCATAATGTTAGTATACTTT
SEQ ID No 79 (>Contig54)
TGGGAAAGAGGGCCACAGGGGATGTAGCAGGACGCTTAATAGTAAATGAC GAGGGTGTGCCGACGAGACCCGTAGGAAACAACGGGCACAGACGAGAGCA ATAAAGGGGGTTGGAAGGTACCCCGGATAGAGTAGAGAAGGCTAGCGGAC GAGTAAGACGCGGAGGAAATAAGTCGGCGTCGTAGAAGTTCTGTGGAGAA GGTACGACTCTTAAAGACCTAGGCGGGAGACAGTTTCCACCCGAGGCAGA GCAAGACCACAAGATTCAGAGGGAGTAAGGAGTTCCGAATTGGAGAGGTT GAGGGGCGTGTGAGCCGTCAAGTGGGGCGCGTACGCAAAGAAAGAAGCGT CCATGTCAGAGGCCCAGCGGCCGTTGCGGCCCTACATGGGTGATCACGGT GGGCCGTAGGCGGACCGGAGATGAGCGCGGCTCCGCCCACCGGACGGCGA GGGGCACGGCGCCTTCGTCCGGGGCGCTCGCGCGCGATCTCGCGCGCGCG GGGGGCGCCGGCGTCGCCGTCGTCGGCGACGGCCAGCCGCCCATCGTCCA CGCCCTCGGGCACGTCATCAACGCCGCGCTCCGCAGCCGGGCGGCCTGGA TGGTCGATCCTGTGCTGATCGACGCGGGCCCCTCCACGCAGAGCTTCTCC GAGCTCGTCGGCGAGCTCGGGCGCGGCGCGGTCGACACCTTGATCCTCCT CGACGTGAACCCCGTGTACGCCGCGCCGGCCGACGTCGATTTCGCGGGCC TCCTCGCGCGCGTGCCCACGAGCTTGAAGGCCGGGCTCTACGACGACGAG ACCGCCCGCGCTTGCACGTGGTTCGTGCCGACCCGGCATTACCTCGAGTC GTGGGGGGACGCGCGGGCGTACGACGGGACGGTCTCGTTCGTGCAACCCC TCGTCCGGCCGCTGTTCGACGGCCGGGCGGTGCCCGAGCTGCTCGCCGTC
TTCGCGGGGGACGAGCGCCCGGATCCCCGGCTGCTGCTGCGCGAGCACTG GCGCGGCGAGCGCGGAGGGGCGGATTTCGAGGCCTTCTGGGGCGAGGCAT TGAAGCGCGGCTTCCTCCCTGACAGCGCCCGGCCGAGGCAGACACCGGAG CTCGCGCCGGCCGATCTCGCTCAGGAGCTCGCGCGGCTCGCCGCCGCGCC GCGGCCGGCCGGCGGCGCGCTCGACGTGGCGTTCCTCAGGTCGCCGTCGC TCCACGACGGCAGGTTCGCCAACAACCCCTGGCTGCAAGAGCTCCCGCGG CCGATCACCAGGCTCACCTGGGGCAACGCCGCCATGATGAGCGCGGCGAC CGCGGCGCGGCTCGGCGTCGAGCGCGGCGATGTCGTCGAGCTCGCGCTGC GCGGCCGCACGATCGAGATCCCGGCCGTCGTCGTCCGCGGGCACGCCGAC GACGTGATCAGCGTCGACCTCGGCTATGGGCGCGACGCCGGCGAGGAGGT CGCGCGCGGGGTGGGCGTGTCGGCGTATCGGATCCGCCCGTCCGACGCGC GGTGGTTCGCGGGGGGCCTCTCCGTGAGGAAGACCGGCGCCACGGCCGCG CTCGCGCAGGCCCAGCTCGAGCTCTCCCAGCACGACCGTCCCATCGCGCT CAGGAGGACGCTGCCGCAGTACCGTGAACAGCCCGGTTTCGCGGAGGAGC ACAAGGGGCCGGTCCGCTCGATCCTGCCGGAGGTCCAGCACACCGGCGCG CAATGGGCGATGTCCATCGACATG7CGATCTGCACCGGGTGCTCCTCGTG CGTCGTGGCCTGTCAGGCCGAGAACAACGTCCTCGTCGTCGGCAAGGAGG AGGTGATGCACGGCCGCGAGATGCAGTGGTTGCGGATCGATCAGTACTTC GAGGGGGGAGGCGACGAGGTGAGCGTCGTCAACCAGCCGATGCTCTGCCA GCACTGCGAGAAGGCGCCGTGCGAGTACGTCTGTCCGGTGAACGCGACGG TCCACAGCCCCGACGGCCTCAACGAGATGATCTACAACCGATGCATCGGG ACGCGCTTTTGCTCCAACAACTGCCCGTACAAGATCCGGCGGTTCAATTT CTTCGACTACAATGCCCACGTCCCGTACAACGCCGGCCTCCGCAAGCTCC AGCGCAACCCGGACGTGACCGTCCGCGCCCGCGGCGTCATGGAGAAATGC ACGTACTGCGTGCAGCGGATCCGAGAGGCGGACATCCGCGCGCAGATCGA GCGGCGGCCGCTCCGGCCGGGCGAGGTGGTCACCGCCTGCCAGCAGGCCT GTCCGACCGGCGCGATCCAGTTCGGGTCGCTGGATCACGCGGATACCAAG ATGGTCGCGTGGCGCAGGGAGCCGCGCGCGTACGCCGTGCTCCACGACCT CGGCACCCGGCCGCGGACGGAGTACCTCGCCAAGATCGAGAACCCGAACC CCGAGATTGAATGAGCCATGGCGGGCCCGCTCATCCTGGACGCACCGACC
GACGATCAGCTGTCGAAGCAGCTCCTCGAGCCGGTATGGAAGCCGCGCTC CCGGCTCGGCTGGATGCTCGCGTTCGGGCTCGCGCTCGGCGGCACGGGCC TGCTCTTCCTCGCGATCACCTACACCGTCCTCACCGGGATCGGCGTGTGG GGCAACAACATCCCGGTCGCCTGGGCCTTCGCGATCACCAACTTCGTCTG GTGGATCGGGATCGGCCACGCCGGGACGTTCATCTCCGCGATCCTCCTCC TGCTCGAGCAGAAGTGGCGGACGAGCATCAACCGCTTCGCCGAGGCGATG ACGCTCTTCGCGGTCGTCCAGGCCGGCCTCTTTCCGGTCCTCCACCTCGG CCGCCCCTGGTTCGCCTACTGGA7C7TCCCGTACCCCGCGACGATGCAGG TGTGGCCGCAGTTCCGGAGCGCGC7GCCGTGGGACGCCGCCGCGATCGCG ACCTACTTCACGGTGTCGCTCCTG77CTGGTACATGGGCCTCGTCCCGGA TCTGGCGGCGCTGCGCGATCACGCCCCGGGCCGCGTCCGGCGGGTGATCT ACGGGCTCATGTCGTTCGGCTGGCACGGCGCCGCCGACCACTTCCGGCAT TACCGGGTGCTGTACGGGCTGCTCGCGGGGCTCGCGACGCCCCTCGTCGT CTCGGTGCACTCGATCGTGAGCAGCGATTTCGCGATCGCCCTGGTCCCCG GCTGGCACTCGACGCTCTTTCCGCCGTTCTTCGTCGCGGGCGCGATCTTC TCCGGGTTCGCGATGGTGCTCACGC7GCTCATCCCGGTGCGGCGGATCTA CGGGCTCCATAACGTCGTGACCGCGCGCCACCTCGACGATCTCGCGAAGA TGACGCTCGTGACCGGCTGGATCG7CATCCTCTCGTACATCATCGAGAAC TTCCTCGCCTGGTACAGCGGCTCGGCGTACGAAATGCATCAGTTTTTTCA GACACGCCTGCGCGGCCCGAACAACGCCGCCTACTGGGCCCAGCACGTCT GCAACGTGCTCGTCATCCAGCTCC7CTGGAGCGAGCGGATCCGGACGAGC CCCGTCGCGCTCTGGCTCATCTCCA7CCTCGTCAACGTCGGGATGTGGAG CGAGCGGTTCACGCTCATCGTGA7G7CGCTCGAGGAAGAGTTCCTCCCGT CCAAGTGGCACGGCTACAGCCCGACGTGGGTGGACTGGAGCCTCTTCATC GGGTCAGGCGGCTTCTTCATGCTCC7GTTCCTGAGCTTTTTGCGCGTCTT TCCGTTCATCCCCGTCGCGGAGG7CAAGGAGCTCAACCATGAAGAGCTGG AGAAGGCTCGGGGCAAGGGGGGGCGCTGATGGAGACCGGAACGCTCGGCG AGTTCGACGACCCGGAGGCGATGC7CCATGCGATCCGAGAGCTCAGGCGG CGCGGCTACCGCCGGGTGGAAGCG77CACGCCCTATCCGGTGAAGGGGCT CGACGAGGCGCTCGACCTCCCGCG77CGAACCTCAACCGGATGGTGCTGC
CCTTCGCGATCCTGGGGGTCGTGGGCGGCTACTTCGTCCAGTGGTTCTGC AACGCTTTCCACTATCCGCTGAACGTGGGCGGGCGCCCGCTGAACTCGGC GCCGGCGTTCATCCCGATCACGT7CGAGATGGGGGTGCTCTCCACCTCGA TCTTCGGCGTGCTCATCGGCTTT7ACCTGACGAGGCTGCCGAGGCTCTAC CTCCCGCTCTTCGACGCCCCGGGC77CGAGCGCGTCACGCTGGATCGGGT CCTGGTCGGGCTCGACGACACGGAACCTTCCTTCTCGAGCGCCCAGGCGG AGCGCGATCTCCTCGCGCTCGGCGCCAGGCGCGTCGTCGTGGCGAGGAGG CGCGAGGAGCCATGAGGGCCGGCGCCCCGGCTCGCCCCCTCGGGCGCGCG CTCGCGCCGTTCGCCCTCGTCCTGCTCGCCGGGTGCCGCGAGAAGGTGCT GCCCGAGCCGGACTTCGAGCGGATGATCCGCCAGGAGAAATACGGGCTCT GGGAGCCGTGCGAGCACTTCGACGACGGCCGCGCGATGCAGCACCCGCCC GAGGGGACCGTCGCGCGCGGGCGCGTCACCGGGCCGCCCGGCTATCTCCA GGGCGTCCTCGACGGGGCGTACGTCACGGAGGTGCCGCTCTCGCTCACGG TCGAGCTCGTGCAGCGCGGCCGGCAGCGCTTCGAGACCTTCTGCGCGCCG TGCCACGGGATCCTCGGCGACGGCAGCTCGCGCGTGGCGACGAACATGAC GCTGCGCCCGCCGCCGTCGCTCGTCGGACCCGAGGCGCGGAGCTTCCCGC CGGGCAGGATCTACCAGGTCATCATCGAGGGCTACGGCCTGATGCCGCGC TAC7CGGACGATCTGCCCGACATCGAAGAGCGCTGGGCCGTCGTCGCCTA CGTGAAGGCGCTTCAGCTGAGCCGCGGAGTGGCCGCGGGCGCCCTCCCGC CCGCGCTCCGCGGCCGGGCAGAGCAGGAGCTGCGATGAACAGGGATGCCA TCTAGTACAAGGGCGGCGCGACGATCGCGGCCTCGCTCGCGATCGCGGCG CTCGGCGCGGTCGCCGCGATCGTCGGCGGCTTCGTCGATCTCCGCCGGTT CTTCTTCTCGTACCTCGCCGCGTGGTCGTTCGCGGTATTCCTGTCCGTGG GCGCGCTCGTCACGCTCCTCACCTGCAACGCCATGCGCGCGGGCTGGCCC ACGGCGGTGCGCCGCCTCCTCGAGACGATGGTGGCGCCGCTGCCCCTGCT CGCGGCGCTCTCCGCGCCGATCCTGGTCGGCCTGGACACGCTGTACCCGT GGATGCACCCCGAGCGGATCGCCGGCGAGCACGCGCGGCGCATCCTCGAG CACAGAACGCCCTACTTCAATCCAGGCTTCTTCGTCGTGCGCTCGGCGAT CTAC7TCGCGATCTGGATCGCCG7CGCCCTCGTGCTCCGCCGGCGATCGT TCGCGCAGGACCGTGAGCCGAGGGCCGACGTCAAGGACGCGATGTATGGC
CTGAGCGGCGCCATGCTGCCGGTCGTGGCGATCACGATCGTCTTCTCGTC GTTCGACTGGCTCATGTCCCTCGACGCGACCTGGTACTCGACGATGTTCC CGG7CTACGTGTTCGCGAGCGCC7TCGTGACCGCCGTCGGCGCGCTCACG GTCC7CTCGTATGCCGCGCAGACGTCCGGTTACCTCGCGAGGCTCAACGA CTCGCACTATTACGCGCTCGGGCGGCTGCTCCTCGCGTTCACGATATTCT GGGCCTATGCGGCCTATTTCCAGTTCATGTTGATCTGGATCGCGAACAAG CCCGACGAGGTCGCCTTCTTCCTCGACCGCTGGGAAGGGCCCTGGCGGCC GACCTCCGTGCTCGTCGTCCTCACGCGGTTCGTCGTCCCGTTCCTGATCC TGATGTCGTACGCGATCAAGCGGCGCCCGCGCCAGCTCTCGTGGATGGCG CTCTGGGTCGTCGCCTCCGGCTACATCGACTTTCACTGGCTCGTGGTGCC GGCGACAGGGCGCCACGGGTTCGCCTATCACTGGCTCGACCTCGCGACCC TGTGCGTCGTGGGCGGCCTCTCGACCGCGTTCGCCGCGTGGCGGCTGCGA GGGCGGCCGGTGGTCCCGGTCCACGACCCGCGGCTCGAAGAGGCCTTTGC GTACCGGAGCATATGATGTTCCGTTTCCGTCACAGCGAGGTTCGCCAGGA GGAGGACACGCTCCCCTGGGGGCGCGTGATCCTCGCGTTCGCCGTCGTGC TCGCGATCGGCGGCGCGCTGACGCTCTGGGCCTGGCTCGCGATGCGGGCC CGCGAGGCGGATCTGCGGCCCTCCCTCGCGTTCCCCGAGAAGGATCTCGG GCCGCGGCGCGAGGTCAGCATGGTCCAGCAGTCGCTGTTCGACGAGGCGC GCCTGGGCCAGCAGCTCGTCGAGGCGCAGCGCGCGGAGCTCCGCCGCTTC GGCGTCGTCGATCGGGAGAGGGGCATCGTGAGCATCCCGATCGACGACGC GATCGAGCTCATGGTGGCGGAGGGCGCGCGATGAGCCGGGCCGTCGCCGT GGCCCTCCTGCTGGCAGCCGGCCTCGTGTCGCGCCCGGGCGCCGCGTCCG AGCCCGTATCGCTTTCGCCCCGCGCTGGGCCCGTCCGCGGGCGAGGCCGC GCTCTGAAACGACGGCTCCGGCGCGGATGAGCGGCCCGAGGCGACCTCCT GCAACCCACCGCGCTGCGTACAGGGGTAAATCAACTGCATTCCAGGATAC GGACCGCGCAGCATAAACCCTCACCGGACAGACTGAATAATGCCGAACCT TGAACTTCTATGCCCATGCGTGCGGGATCATCATGCCCAACTATTTAAAC TGGTCCCTCCGGCAAAAGGAACGGACCAGCACCCAGAATAACCCCTGTTT GGCCAGCGCAAAAAAATGAAAACTCTTCTCCTTGCGGCTAAAATAAACGA CTCCGGGGAACCGAATGGTATGCAATACACCATGGCAACGCGATTCGTCC
TAACCTTAAACAAACTTCCCAGCGACTCGTCCGTCGAACGCTTCTGACGC AAGACCATGACCCCACGAGAACCGGGCGGCGGACACACTGCCAGTGAAAC TCGGCCTAGGCCCGCCCTGCCTTCACGTATTCACCGTGGGGCGGTCCAAA ACTAAAAACAATACGTGACTCTTCTCAAATATCGTCGGATAAAGGCCAAC ACGCGTACCTCCCCCTAAAGGGAAGAAACCCCTACCAGGGTGGACCGTAT CCACCCGTGATCCCTGAAACATCTCCAGCCGCGTACAAATTAGGCTTTGA CAAAACC
SEQ ID No 80 (>Contig55) GGGGAAAGAAGTCGGGAGCAGCAGAAGAGCGCAGCGATTGAAAGCACGAC GCAAGGGCAAAGGACATNAGACGACAACAGAAAGGCGAAGACATGAGAAG AGGAGAAAGACTCGGAAATGGGCACAAAGCGAGAAAAAACGTAACGCTAT CTGAGAAAGAACACCATAAAGTCAGACTGGGGTAAAGCCATACACGCAGC GAAAGCAGACAAGCAAAAGTCATTAACAGATGCAACAAAGAACAAAAGAG AAAGGAAGGACACATGAAGGAGAAAGGCGGCTTCGAGAACTAAAGACGGG AGCCAACAATATACCTTTCACAGGGGCGAAGAAGGGCCCAAGGTTCAATC GAAGACGATTGAATCCAAGGAAGTCCAATCGGAAGAAAAAAGCATATTGA TAAACCAGACAACGACAGCAGGCC7GAACGTAGGAGCGAGATCGTGAGAC ATCAGTAGGCAAAACAAGAGCGCTACACCCAGGGGTCGTCAACCTAGAAA GGCGCGTCCTCAAGCCGGTAGCGGCCGCGCGCGACCAGCCCGATGCGGGC GCCTTCGCGCGCGAACCGCCGCACCGTCGCGCGCCCCACGCCCGCTGACG CCCCCGTGATCACCACGACCTCCGGCCGCCTGGAATCGCCCATGCCCGTC GCCTCCCGCCTCGCCCGCGGCGTCAAGCAACGTGAATGCCACCTGAGCGT GTCACTTCCTCAAGCTCGACAGCACGTCCTTGATCCGCTCGGTCGGACCC GCCGTGAGCGACCAGGGGCTCGGC7GCCGCACGACGTGCAGGTCGCCTCG ACGCTCGAGCACCTCGAACCGCGTCGTTCCCTGCTCGGTCCGGACGAACC GGAGCGCCACCACGGCCGTGCCCACATGGAGGTTCGAGAGGGTGATCTCG GGCAGCCACGCCGGCAGCGCCGGG7CGACCAGCAGCAGGTCGAGCGGCGC GAACGGGTACAGCCCGAGCATCGCCTGGAGCATCGCGAACACGCTCGAGC AGGACCAGGCCTGCGGCCAGTTGGCCTTCGGGTACAGCGCGGGGAACGGG TGCTCCGCGTCGCGCGGGTGGCCGCTCCAGCACTCGGGCAGCCGGTGGTG CTCGAACAGCGCCGCCGCCTCGAACACGGCGCGGCACAGGAGCGCCACGT GCCCGTGCAGGCCGTACCGCGCGAGCCCGAGCGCGATCGCGCCCTGATCG ACGGGCCAGACCGTCCCCCGGTGA7AGCTGTACGGATCGAACGCGGGGTG ACATGCGGATGGCGGTGGGCTCGGCGAGGGCCTGGTCCACGATCCCCGCC GCGAGGCAGAGCCCCGGATCCGAGCCGATCGAGCGGATCTGGCGCTTGTC CGGGCCGAGGCCCATCGCGAAGGTGCGCGCGTCGGGCATCCAGAAGGCGT CGTTGAACCGCCTCTGCAGCTCGAGCGCCTCGGCGAACAGCCGTCGCGCG TCGTCCTTGCGGCCGAACCAGAAGAGCAGCTCGGAGAGGCGCAGCTTCGA CAGGAACACGAAGCCCTGCATCTCGCACGTCCCGATCGGCGGCCGCACCT GAGAGCCGTCGGCGTGGACGATGGCGTCGTCGGAGTCCTTCCAGCCCTGG TTCTGGATCGACGCGCTCGAGCGGGGCTCGTACTCGTAGAACCCGTCGCC GTCGAGATCGCCCTCCTCGTCGA7CCAGCGCATGGCCCTGAGCGCAGGCT CGATCAGGCGGCCGACGCGCTCGCGATCGCCGGTCCAGTGCCAGAGCTCC GAGGCGGCCACCGCGTAGAACATCGTCGAGGTCGCCGACGCGTACGTGCG CCCCAGCGGGTTGTAGTTGAGGTCCGAGAGCGCTCCGTCCCTGGCCTGAT GCAGCATCCGGTCGGGCTGCTCGTCGCGCCAGTCGTCGACGACGCGCCCC TGCCAGCGCGGGAGCACGAGCGC7GTGCCTGCGAGGATGTCGGTCGTCAG CGCGGCGGCCTGCGTGCCCGCGGCGAGCGGGTCGCGGCCGAAGAGCCCGA TGTAGATCGGCAGTCCGGCGGCCACCGTCCAGGAGCGCTCGTCCTGGTCG ATATCGTACATGCGGAGCGCGATGAGATCGCGCTTGGCTCGCTCGAGCAC GGAGAAGACCGTATGCGAGAGCG7GTCGGCGCCCGGGACCGAGAAGGACG TGGCGCGGTGGTGGAAGGTGCCGCGCGCCACGTCGCGCGCGTTTCGCGTG CCGAAGAACGAGCGGCAGCCGGCGAGCAGCGGGAGCGGTTCGCCGCGGAT CAGGGCGATCACGTCGACGCAGCCGCGCCAGGTGCCATGGGGCTCGAGAT CGATCGAGAACCGGATCTCGCGGCCAGCGCAGGACGGGGGCGAGCCCGCG CTCCGGGCCCGTACGACGATCCCGGCGTCGAAGCGCGCGACGCCGGATTC TCCGGGGTGTTCATAGCGGTGCTCGGCGCGGTAGTCGCAGCGCAGCGCGC ATCCGTGCTCGGCGGGCTCGAGCGCCCACCGGGCGTCGCCGCGCTGGAGG
CGCGGGCCGTCCGTCTCCTCCGCG7CGGCGAAGTCGGCGTCGATCTCGAG CGCCAGCGTGAAGCGCACGCGCTCCTGCGTGAAGCTCGCGACGTCGATGT CCTCGTGGAAGCCGTCCCCCACGAAGCGCGACAGCCGCAGCTCGACGGTC CGCGGCGCCGCGCCGCCGCCCTCGGGCGGCGGGGCGATGTAGTAGCCGAG CCAGCTGTCCGGCTCGACGGCGGAGAGCGCGACGGGCGCCGGCGGCCTGC CGTCGATCAGGTGGCGGTAGAGGGAGAGGAGGCGGGTGTTGCGCACGAAC AGGCCGATGTGCGGCTCGGGGGCGATCGAGCCGTCCGGACGCATGCACAG GACCGTCCGGTTCTGGCTGATGTACAGGGAGCCGGCGCGGGGCCTCAGCG TGGCCAGCGAGCCGAAGGGTCTCTCGAGCGACATGGTCACCTCCAGGCGA GCCCGGGGTGAGGACTGCCAGGGGCGTGCCAACGACGTGGACGCGCTCAC GCCAAGCCGCTGGGCGGGCGCGGCGGCCCCTGCGCCCGAGGCTCAGCCGA GCGCGAGCGGTGTTTCGTCGGCGCGAGCGGGCCTCGTCGTCGCCTCCAGG TAGGCCTTTCCGATGTCGGCCTCGTCCACGAACCCGACGATCGCGCCCTC GCCGTCGACGACGGGGACCTCGCGGACGCCGTGCGCCACCATCGCCTCGG TCGCCGTCCGCAGATCGTCGGTGACCGTCACGGCCACCGGCGGCTGCATC GCG7CGGCGGCCACGGTCATCCGCTCGAGGTCGTGCTCCACCGCGATGAT CCGGAGCGACTCGGCGGTGATCATGCCGACCATCTTGCGCGACGGTTCGA GCACCGGGAACACCTCCTGCCAGCTCGCGTCGGCCGCCCGCCGGAGCATC TCGCGGGCCGGCGTCCCCGGCACGAACGTCACGAGCGCGCGCCCCTCGAT CATGATCTGCCGCACGCGGATGGTCTTGAGCACGTCGAGCGTCGGCACCG GGTGCGCGGGAGACTCGCGCTGGGTGGGGAGCTGCGCGTGGTAGAGCGAG TGCTTCCGCAGCGCGACGAAGGCGACGCCCTCGGCGAGCATCAGCGGAAC CAGGAGGTCGTAGCTGCCGGCGAGCTCGCAGACCATCACGAGGGAGCTCA CCGGCACGTGCGCGACGCCGCCGTAGAAGGTGCCCATGCCCACGAGCGCG AAGGCGCCCGGGTCGATGCGCGGATCGCCGAGCAGGAGCGCCGCCGCGCG CCCGAACGCGCCGCCGAAGAGCCCGCCGATGACGAGCGACGGCGCGAAGT CGCCGGCGCACCCGCCGCTGCCGAGCGTGAGCGACGAGGCGACGATCTTG GCGGCGCAGAGCAGGAGCAGGAGCTCCACGCCGCGCCAGCCCGGGTGGAG CCACGTGGCGCCGGTGATCGCGACCTGGACGGCGCCGTACCCGCCGCCGA GCAGCCCGAGCCCTTGCCCGGGGCTCTCGATCCTGCGGCCGACGAACCAG
AGGACGGGCACGCAGAAGAGGCCCAGCGCGAGCCCGCCGAGCCCCGGGCG CGCCCAGGGGGCGATGGGCAGGCGCGCCGCGATCGCCTTCACGCCGCCGA GGCACTTCAGGAAGCCGATCGCGACGAAGGCCAGGAGCAGCGCGAGCAGC GCATAGAGAGGGAGGTGCGACGGGACGAACGCATACTTCGGCGCGTGCGC GAAGAGCGTCGACTCGCCGTAGAACGAGATGAAGACCGAGTAGGAGACCA CGCTGGCGAGCAGCGCCGGGATCAGCGCCTCGGCCTCGAAGTCGTCGCGG TAGAGCACCTCGACGGCGAGCAGGGCGGCGCCGAGCGGCGTGCGGAAGAT GGCGGACATCCCGGCCGCGACCCCCGCGAGCATCAGGATGCGGTGCTCGC GCCGGCCGACCGCGAGCCCGCGCCCCACGAGCGAGCCGAGCGCTCCTCCG ACCTGCATGGTCGGCCCCTCGCGACCGCCGGCGCCGCCCGAGCCGAGCGT GAGGATCGACGCGACCGCCTTGACCCACGCGACCCGCTTGCGCATCCGGC CGCCGTGGTGGTGGAAGGCCTGGATCATCGCGTCGCCGCCGCCGCCCGCG GCCTCGGGGGCGAGGCGCCAGGTGAGGATGCCCCCGGCCAGCGCTCCGAG CGCCGGGATCAGCAGCAGCAGCCAGAGCCGGACGCTCCGGTGCTCGGGGC CGTCGCCGCTGAAGATGGCCTCGCCGTGGGCTCGAAGCCGCGCGTAGCCG GCGAGGCGGCCGAGGAGCAGCTCC7CGACGAGCTCGAGGGCGCCGAAGAA GAGCACCGCGACGAGGCCCGCGATGGCCCCGACGAGCACGGCATGCAGGA TCGTGCGCCCGACGAGCCTGAGATCGAGGGGCGCGACCTCCGAGAGGAGC GCGGAGAAGGGGCGCCGCCGTCGCACCACGCCGGCGCGCTCTGCTGGGAA TGGTTCCGTCACGGCGATGGT
SEQ ID No 81 (>Contig56)
GGATCCGGCCGCGAGCGGCTGCGGTGCCGATGCGCTCGACGGTGACGGGC GGGGTGATCGCGGGTCCGGAGCTCGGTGCGAGCTACTGGGCGGACAACCT TCGGCAGCCGGTGCGCTTCGCTGCGGCGGCGCAAGCGCTGCTGGAGGGTG GCCCCGCGCTGTTCATCGAGATGAGCCCGCACCCGATCCTGGTGCCGCCC CTGGACGAGATCCAGACGGCGGCCGAGCAAGGGGGCGCTGCGGTGGGCTC GCTGCGGCGAGGGCAGGACGAGCGCGCGACGCTGCTGGAGGCGCTGGGGA CGCTGTGGGCGTCCGGCTATCCGGTGAGCTGGGCTCGGCTGTTCCCCGCG GGCGGCAGGCGGGTTCCGCTGCCGACCTATCCCTGGCAGCACGAGCGGTG
CTGGATCGAGGTCGAGCCTGACGCCCGCCGCCTCGCCGCAGCCGACCCCA CCAAGGACTGGTTCTACCGAACGGACTGGCCCGAGGTGCCCCGCGCCGCC CCGAAATCGGAGACAGCTCATGGGAGCTGGCTGCTGTTGGCCGACAGGGG TGGGGTCGGTGAGGCGGTCGCTGCAGCGCTGTCGACGCGCGGACTTTCCT GCACCGTGCTTCATGCGTCGGCTGACGCCTCCACCGTCGCCGAGCAGGTA TCCGAAGCTGCCAGTCGCCGAAACGACTGGCAGGGAGTCCTCTACCTGTG GGGCCTCGACGCCGTCGTCGATGCTGGGGCATCGGCCGACGAAGTCAGCG AGGCTACCCGCCGTGCCACCGCACCCGTCCTTGGGCTGGTTCGATTCCTG AGCGCTGCGCCCCATCCTCCTCGCT7CTGGGTGGTGACCCGCGGGGCATG CACGGTGGGCGGCGAGCCAGAGGCC7CTCTTTGCCAAGCGGCGTTGTGGG GCCTCGCGCGCGTCGCGGCGCTGGAGCACCCCGCTGCCTGGGGTGGCCTC GTGGACCTGGATCCTCAGAAGAGCCCGACGGAGATCGAGCCCCTGGTGGC CGAGCTGCTTTCGCCGGACGCCGAGGATCAACTGGCGTTCCGCAGCGGTC GCAGGCACGCAGCACGCCTTGTAGCCGCCCCGCCGGAGGGCGACGTCGCA CCGATATCGCTGTCCGCGGAGGGGAGCTACCTGGTGACGGGCGGGCTGGG TGGCCTTGGTCTGCTCGTGGCTCGG7GGCTGGTGGAGCGGGGAGCTCGAC ATCTGGTGCTCACCAGCCGGCACGGGCTGCCAGAGCGACAGGCGTCGGGC GGAGAGCAGCCGCCGGAGGCCCGCGCGCGCATCGCAGCGGTCGAGGGGCT GGAAGCGCAGGGCGCGCGGGTGACCGTGGCAGCGGTGGATGTCGCCGAGG CCGATCCCATGACGGCGCTGCTGGCCGCCATCGAGCCCCCGTTGCGCGGG GTGGTGCACGCCGCCGGCGTCTTCCCCGTGCGTCACCTGGCGGAGACGGA CGAGGCCCTGCTGGAGTCGGTGCTCCGTCCCAAGGTGGCCGGGAGCTGGC TGCTGCACCGGCTGCTGCGCGACCGGCCTCTCGACCTGTTCGTGCTGTTC TCGTCGGGCGCGGCGGTGTGGGGTGGCAAAGGCCAAGGCGCATACGCCGC GGCCAATGCGTTCCTCGACGGGCTCGCGCACCATCGCCGCGCGCACTCGC TGCCGGCGTTGAGCCTCGCCTGGGGCTTATGGGCCGAGGGAGGCATGGTT GATGCAAAGGCTCATGCACGTCTGAGCGACATCGGGGTCCTGCCCATGGC CACGGGGCCGGCCTTGTCGGCGCTGGAGCGCCTGGTGAACACCAGCGCTG TCCAGCGTTCGGTCACACGGATGGACTGGGCGCGCTTCGCGCCGGTCTAT GCCGCGCGAGGGCGGCGCAACTTGCTTTCGGCTCTGGTCGCGGAGGACGA
GCGCGCTGCGTCTCCCCCGGTGCCGACGGCAAACCGGATCTGGCGCGGCC TGTCCGTTGCGGAGAGCCGCTCAGCCCTCTACGAGCTCGTTCGCGGCATC GTCGCCCGGGTGCTGGGCTTCTCCGACCCGGGCGCGCTCGACGTCGGCCG AGGCTTCGCCGAGCAGGGGCTCGACTCCCTGATGGCTCTGGAGATCCGTA ACCGCCTTCAGCGCGAGCTGGGCGAACGGCTGTCGGCGACTCTGGCCTTC GACCACCCGACGGTGGAGCGGCTGGTGGCGCATCTCCTCACCGACGTGCT GAAGCTGGAGGACCGGAGCGACACCCGGCACATCCGGTCGGTGGCGGCGG ATGACGACATCGCCATCGTCGGTGCCGCCTGCCGGTTCCCAGGTGGGGAT GAGGGCCTGGAGACATACTGGCGGCATCTGGCCGAGGGCATGGTGGTCAG CACCGAGGTGCCAGCCGACCGGTGGCGCGCGGCGGACTGGTACGACCCCG ATCCGGAGGTTCCGGGCCGGACC7ATGTGGCCAAGGGTGCCTTCCTCCGC GATGTGCGCAGCTTGGATGCGGCG7TCTTCGCCATTTCCCCTCGTGAGGC GATGAGCCTGGACCCGCAACAGCGGCTGTTGCTGGAGGTGAGCTGGGAGG CGATCGAGCGCGCTGGCCAGGACCCGATGGCGCTGCGCGAGAGCGCCACG GGCGTGTTCGTGGGCATGATCGGGAGCGAGCACGCCGAGCGGGTGCAGGG CCTCGACGACGACGCGGCGTTGC7GTACGGCACCACCGGCAACCTGCTCA GCGTCGCCGCTGGACGGCTGTCG77CTTCCTGGGTCTGCACGGCCCGACG ATGACGGTGGACACCGCCTGCTCG7CGTCGCTGGTGGCGTTGCACCTCGC CTGCCAGAGCCTGCGATTGGGCGAGTGCGACCAGGCCCTGGCCGGCGGGT CCAGCGTGCTTTTGTCGCCGCGG7CATTCGTCGCGGCGTCGCGCATGCGT TTGCTTTCGCCAGATGGGCGGTGCAAGACGTTCTCGGCCGCTGCAGACGG CTTTGCGCGGGCCGAGGGCTGCGCCGTGGTGGTGCTCAAGCGGCTCCGTG ACGCGCAGCGCGACCGCGACCCCA7CCTGGCGGTGGTCAGGAGCACGGCG ATCAACCACGATGGCCCGAGCAGCGGGCTCACGGTGCCCAGCGGTCCTGC CCAGCAGGCGTTGCTACGCCAGGC3CTGGCGCAAGCGGGCGTGGCGCCGG CCGAGGTCGATTTCGTGGAGTGCCACGGGACGGGGACAGCGCTGGGTGAC CCGATCGAGGTGCAGGCGCTGGGCGCGGTGTACGGGCGGGGCCGCCCCGC GGAGCGGCCGCTCTGGCTGGGCGC7GTCAAGGCCAACCTCGGCCACCTGG AGGCCGCGGCGGGCTTGGCCGGCGxGCTCAAGGTGCTCTTGGCGCTGGAG CACGAGCAGATTCCGGCTCAACCGGAGCTCGACGAGCTCAACCCGCACAT
CCCGTGGGCAGAGCTGCCAGTGGCCGTTGTCCGCAGGGCGGTCCCCTGGC CGCGCGGCGCGCGCCCGCGTCGTGGAGGCGTGAGCGCTTTCGGCCTGAGC GGGACCAACGCGCATGTGGTGTTGGAGGAGGCGCCGGCGGTGGAGCCTGT GGCCGCGGCCCCCGAGCGCGCAGCGGAGCTGTTCGTCCTGTCGGCGAAGA GCGCGGCGGCGCTGGATGCGCAC-3-CAGCCCGGCTGCGGGACCACCTGGAG AAGCATGTCGAGCTTGGCCTCGGCGATGTGGCGTTCAGCCTGGCGACGAC GCGCAGCGCGATGGAGCACCGGCTGGCGGTGGCCGCGAGCTCGCGCGAGG CGCTGCGAGGGGCGCTTTCGGCCGCAGCGCAGGGGCACACGCCGCCGGGA GCCGTGCGTGGGCGGGCCTCGGGCGGCAGCGCGCCGAAGGTGGTCTTCGT GTTTCCCGGCCAGGGCTCGCAGTGGGTGGGCATGGGCCGAAAGCTCATGG CCGAAGAGCCGGTCTTCCGGGCGGCGCTGGAGGGTTGCGACCGGGCCATC GAGGCGGAAGCGGGCTGGTCGCTGCTCGGGGAGCTCTCCGCCGACGAGGC CGCCTCGCAGCTCGGGCGCATCGACGTGGTTCAGCCGGTGCTGTTCGCCA TGGAAGTAGCGCTTTCTGCGCTGTGGCGGTCGTGGGGAGTGGAGCCGGAA GCGGTGGTGGGCCACAGCATGGGCGAGGTTGCGGCGGCGCACGTGGCCGG CGCGCTGTCGCTCGAGGACGCGGTGGCGATCATCTGCCGGCGCAGCCGGC TGCTGCGGCGGATCAGCGGTCAGGGGGAGATGGCGCTGGTCGAGCTGTCG CTGGAGGAGGCCGAGGCGGCGCTGCGTGGCCATGAGGGTCGGCTGAGCGT GGCGGTGAGCAACAGCCCGCGCTCGACCGTGCTCGCCGGCGAGCCGGCGG CGCTCTCGGAGGTGCTGGCGGCGCTGACGGCCAAGGGGGTGTTCTGGCGG CAGGTGAAGGTGGACGTCGCCAGCCATAGCCCGCAGGTCGACCCGCTGCG CGAAGAGCTGATCGCGGCGCTGGGAGCGATCCGGCCGCGAGCGGCTGCGG TGCCGATGCGCTCGACGGTGACGGGCGGGGTGATCGCGGGTCCGGAGCTC GGTGCGAGCTACTGGGCGGACAACCTTCGGCAGCCGGTGCGCTTCGCTGC GGCGGCGCAAGCGCTGCTGGAGGGTGGCCCCGCGCTGTTCATCGAGATGA GCCCGCACCCGATCCTGGTGCCGCCCCTGGACGAGATCCAGACGGCGGCC GAGCAAGGGGGCGCTGCGGTGGGCTCGCTGCGGCGAGGGCAGGACGAGCG CGCGACGCTGCTGGAGGCGCTGGGGACGCTGTGGGCGTCCGGCTATCCGG TGAGCTGGGCTCGGCTGTTCCCCGCGGGCGGCAGGCGGGTTCCGCTGCCG ACCTATCCCTGGCAGCACGAGCGGTACTGGATCGAGGACAGCGTGCATGG
GTCGAAGCCCTCGCTGCGGCTTCGGCAGCTTCGCAACGGCGCCACGGACC ATCCGCTGCTCGGGGCTCCATTGCTCGTCTCGGCGCGACCCGGAGCTCAC TTG7GGGAGCAAGCGCTGAGCGACGAGAGGCTATCCTACCTTTCGGAACA TAGGGTCCATGGCGAAGCCGTGTTGCCCAGCGCGGCGTATGTAGAGATGG CGCTCGCCGCCGGCGTAGATCTCTATGGCACGGCGACGCTGGTGCTGGAG CAGCTGGCGCTCGAGCGAGCCCTCGCCGTGCCCTCCGAAGGCGGACGCAT CGTGCAAGTGGCCCTCAGCGAAGAAGGTCCCGGTCGGGCCTCATTCCAGG TATCGAGTCGTGAGGAGGCAGGTAGGAGCTGGGTGCGGCACGCCACGGGG CACGTGTGTAGCGGCCAGAGCTCAGCGGTGGGAGCGTTGAAGGAAGCTCC GTGGGAGATTCAACGGCGATGTCCGAGCGTCCTGTCGTCGGAGGCGCTCT ATCCGCTGCTCAACGAGCACGCCCTCGACTATGGTCCCTGCTTCCAGGGC GTGGAGCAGGTGTGGCTCGGCACGGGGGAGGTGCTCGGCCGGGTACGCTT GCCAGGAGACATGGCATCCTCAAGTGGCGCCTACCGGATTCATCCCGCCT TGTTGGATGCATGTTTTCAGGTGCTGACAGCGCTGCTCACCACGCCGGAA TCCATCGAGATTCGGAGGCGGCTGACGGATCTCCACGAACCGGATCTCCC GCGGTCCAGGGCTCCGGTGAATCAAGCGGTGAGTGACACCTGGCTGTGGG ACGCCGCGCTGGACGGTGGACGGCGCCAGAGCGCGAGCGTGCCCGTCGAC CTGGTGCTCGGCAGCTTCCATGCGAAGTGGGAGGTCATGGAGCGCCTCGC GCAGGCGTACATCATCGGCACTCTCCGCATATGGAACGTCTTCTGCGCTG CTGGAGAGCGTCACACGATAGACGAGTTGCTCGTCAGGCTTCAAATCTCT GTCGTCTACAGGAAGGTCATCAAGCGATGGATGGAACACCTTGTCGCGAT CGGCATCCTTGTAGGGGACGGAGAGCATTTTGTGAGCTCTCAGCCGCTGC CGGAGCCTGATTTGGCGGCGGTGCTCGAGGAGGCCGGGAGGGTGTTCGCC GACCTCCCAGTCCTATTTGAGTGGTGCAAGTTTGCCGGGGAACGGCTCGC GGACGTATTGACCGGTAAGACGCTCGCGCTCGAGATCCTCTTCCCTGGTG GCTCGTTCGATATGGCGGAGCGAATCTATCGAGATTCGCCCATCGCCCGT TACTCGAACGGCATCGTGCGCGGTGTCGTCGAGTCGGCGGCGCGGGTGGT AGCACCGTCGGGAATGTTCAGCATCTTGGAGATCGGAGCAGGGACGGGCG CGACCACCGCCGCCGTCCTCCCGGTGTTGCTGCCTGACCGGACGGAGTAC CATTTCACCGATGTTTCTCCGCTCTTCCTTGCTCGCGCGGAGCAAAGATT
TCGAGATTATCCATTCCTGAAGTATGGCATTCTGGATGTCGACCAGGAGC CAGCTGGCCAGGGATACGCACATCAGAGGTTTGACGTCATCGTCGCGGCC AATGTCATCCATGCGACCCGCGATA7AAGAGCCACGGCGAAGCG7CTCCT GTCGTTGCTCGCGCCCGGAGGCCTTCTGGTGCTGGTCGAGGGCACAGGGC ATCCGATCTGGTTCGATATCACCACGGGATTGATTGAGGGGTGGCAGAAG TACGAAGATGATCTTCGTATCGACCATCCGCTCCTGCCTGCTCGGACCTG GTGTGACGTCCTGCGCCGGGTAGGC7TTGCGGACGCCGTGAGTCTGCCAG GCGACGGATCTCCGGCGGGGATCC7CGGACAGCACGTGATCCTCTCGCGC GCGCCGGGCATAGCAGGAGCCGC77GTGACAGCTCCGGTGAGTCGGCGAC CGAATCGCCGGCCGCGCGTGCAG7ACGGCAGGAATGGGCCGATGGCTCCG CTGACGTCGTCCATCGGATGGCGT7GGAGAGGATGTACTTCCACCGCCGG CCGGGCCGGCAGGTTTGGGTCCACGGTCGATTGCGTACCGGTGGAGGCGC GTTCACGAAGGCGCTCGCTGGAGA7CTGCTCCTGTTCGAAGACACCGGGC AGGTCGTGGCAGAGGTTCAGGGGC7CCGCCTGCCGCAGCTCGAGGCTTCT GCTTTCGCGCCGCGGGACCCGCGGGAAGAGTGGTTGTACGCTTTGGAATG GCAGCGCAAAGACCCTATACCAGAC-3CTCCGGCAGCCGCGTCTTCTTCCT CCGCGGGGGCTTGGCTCGTGCTGA7GGACCAGGGCGGGACAGGCGCTGCG CTCGTATCGCTGCTGGAAGGGCGAGGCGAGGCGTGCGTGCGCGTCATCGC GGGTACGGCATACGCCTGCCTCGCGCCGGGGCTGTATCAAGTCGATCCGG CGCAGCCAGATGGCTTTCATACCC7GCTCCGCGATGCATTCGGCGAGGAC CGGATTTGTCGCGCGGTAGTGCA7ATGTGGAGCCTTGATGCGACGGCAGC AGGGGAGAGGGCGACAGCGGAGTCGCTTCAGGCCGATCAACTCCTGGGGA GCCTGAGCGCGCTTTCTCTGGTGCAGGCGCTGGTGCGCCGGAGGTGGCGC AACATGCCGCGGCTTTGGCTCTTGACCCGCGCCGTGCATGCGGTGGGCGC GGAGGACGCAGCGGCCTCGGTGGCGCAGGCGCCGGTGTGGGGCCTCGGTC GGACGCTCGCGCTCGAGCATCCAGAGCTGCGGTGCACGCTCGTGGACGTG AACCCGGCGCCGTCTCCAGAGGACGCAGCCGCACTGGCGGTGGAGCTCGG GGCGAGCGACAGAGAGGACCAGG7CGCATTGCGCTCGGATGGCCGCTACG TGGCGCGCCTCGTGCGGAGCTCC77TTCCGGCAAGCCTGCTACGGATTGC GGCATCCGGGCGGACGGCAGCTA7GTGATCACCGATGGCATGGGGAGAGT
GGGGCTCTCGGTCGCGCAATGGA7GGTGATGCAGGGGGCCCGCCATGTGG TGCTCGTGGATCGCGGCGGCGCT7CCGAGGCATCCCGGGATGCCCTCCGG TCCATGGCCGAGGCTGGCGCGGAGG7GCAGATCGTGGAGGCCGACGTGGC TCGGCGCGACGATGTCGCTCGGC7CCTCTCGAAGATCGAACCGTCGATGC CGCCGCTTCGGGGGATCGTGTACC-7GGACGGGACCTTCCAGGGCGACTCC TCGATGCTGGAGCTGGATGCCCGTCGCTTCAAGGAGTGGATGTATCCCAA GGTGCTCGGAGCGTGGAACCTGCACGCGCTGACCAGGGATAGATCGCTGG ACTTCTTCGTCCTGTATTCCTCGGGCACCTCGCTTCTGGGCTTGCCAGGA CAGGGGAGCCGCGCCGCCGGTGACGCCTTCTTGGACGCCATCGCGCATCA CCGGTGCAAGGTGGGCCTTACAGCGATGAGCATCAACTGGGGATTGCTCT CCGAAGCATCATCGCCGGCGACCCCGAACGACGGCGGAGCACGGCTCGAA TACCGGGGGATGGAAGGCCTCACGCTGGAGCAGGGAGCGGCGGCGCTCGG GCGCTTGCTCGCACGACCCAGGGCGCAGGTAGGGGTGATGCGGCTGAATC TGCGCCAGTGGTTGGAGTTXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX XXXXTATGGTATAACTTATTGATTATAATACAGTATACAAAGGTCCCTTT TCAGGGACCCTTTCGTATGTTGTAGCTGATTTTATTTTCTTCTTTTCTTT TGGGTGCTTTTAATAGCCATATATGATGTTTCATATAAAGTTAAAAGTTT CCCCATCTGTCTATCTATAGCGTCATGTTTTTCAGGATTTCTTAATTTCT GCAGCAGTTTAGATATATATTTCGCAGGTAACTTTATTATTAATGAATTA TGGGATTGATACATTTCCTTTGTCCAATCTAAATCTTTTCCTGATTTCAA TACCTCACTCTCATCTCTATTAATTATAAAACCGAGCTTTCCATACGGAC CTGTTAAATATGATTGTAATTGTCTATACTCAGATGGACCTAATTCTTCA AAATTTTTAGCATCAAAAACAACTTGTCTTGTTTTATAGTCCTCCAATAC TCGTTTCCAAAAATCAGATTTGCCACCATTGGTGCCTATAATATCTCGTC TCTGAACTGCGTTACCATTTGGATGGGACTTGATGTCTGTTAGGTGGGAT GCAAATACTATTCTTAGTGCGTC7AAACACCATTGTTCAAATTCAGTGGC ACC7TCATTTCCTATCGGTATCTGATCTAGATGAGTAGTTATTTGACCTA TTGTTTTATTTCTAATGGCTGAATTATCTGAAATAATATTAATATCATAT TCGTCATTTATCTCCTCTGCTTCCTCTGGAGCAAGTGCATTACGATTTAG ATTCAAACCAAGCCAGTAACATGGATGAATTAATAATTTCTCATTACTTT
CAAAACCTTTATCTGGAGTTCGCCCGTCATGGCAAAATGAATAGGATGAG GTG7TTTTATCACGAATACCAACAAATCCTACACTATACAAACTTTGGAG AAT7CCACTTGCCTTTAACAACTGTATTTCAGACGTTATTTTAGGATCTC CAT7TTCTTCGATTAATTCGAAAGATGCTTCTATTTTTTTTAAGCACGTA TAAACTGTTAATTCAGGTTCAATGCTACGAAATGCACTAGTTATAACCTG TATTGAAGGAAAGATCTTCTGATACTCTTTCCAGAGATCTTCAAGTCTGG CCATGGAAATTGACTTGGCTGCATATTCTAGGTCAGTGTTTATGATAGTT TCTCTATTCTCTCTGAATGCGGAAAAAAAAGCTTCATTCAACAATGATAG TAAATCCCTGGGCCGGTAAAGGGTAAATTGCAAACATCGCTTAAAACCAT TCCTCCCTTTAAGATCATCCGCTGTGCATCTATCCCAAACTCGTTGATCT TTCTCAATATCTAGCTTAAATGCTACTTTCATTCTTTTAGCTGACAGCAT TAGGAGTTGTGCCCAGTCCCAATGCAACCTTATGACTTGACCCTCTATAT TTCTCGAGTAATCAGGATCTTCCTTTGATAGCGACCTAAATATATTATCC CTTAAAAAAATTATTGGACGAATGCATTTTGCTTTTTGATTTAATTCAAT AGA7GCATATGCTAGACCTGCAATGATTCCAATTCCTATATTATCCGGTT CATACGCCTCATCTAGCTTATCCATTAATATGACAACTTTCCTGTCTGAG CGTTCAAGAAGTGATACTATATTATTTTCTATTTCTGAGATATTCAAATT GAATTGAAGATCACCAATTGATTCTTCTGGGTTATTTTCATCTAAATACT CCTTTGCGACAAGCCTACACTTTCTTAAAATGTCACCTTGTGCAGAATTC CATTTTTTCAAATGTTCATTCAACAATGTTTCTGATGATATTTGAGATGA CAA7TTGTAATGAGATGATATATATGATGCTATCTCCATTAGCATAGCGT ATCGCCATAATAGTCTTGTTGCTGCCCTTGCTAAATTAAATGATCCTGTA AATGGTTTCAACATTGATCTGAAACCAATAATTTGAGAATCGTCTGGTGA GAAACTCAGGATTAATATTTTTTTGTCTTTCTTCCAATGCTCATTTAGCT GAA7AAATAAAGCACTTTTACCTGTCCCTCGTCTACCAACAACAATGGTC CTG7CATCAGTTTCAATTAGAGTCCTAAAGTCAGCAGTTTCAATGAAAGC ATTACTCAACATCTTTTTATCATTTTCTGCTGTCGTATCACCAAACGGAT TAGACTTTGAAGTAATATTCAATTCCATATTCAACCTTTTATGTTAGTTG CTTTCATTTATTACTTTATATACTGTTGAACGAGCAATATTCATTGTTTT TGATATATGTGAGGCACCTAACCCCTGTTGCCACATATTTAATACTGCAT
CTC7ATCTATTTTTCTTTTTCTACCAAAAACAACTCCTTTTGCCA
SEQ ID No 82 (>Contιg57) TCATCTATTGTATAGTTTGTATATTGATATGATATAATTATAACATATAA ACAGTAAACTTTCTCTACGTAGATCGAGGAGAAGACTCAATTTGTTGACA TCXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXACTACTCGCATACCGTTG CGCAACAGCGGCGCGAGGAGCAGGACGCATACGACATCACCGGCAATACG CTCAGCGTCGCCGACGGACGGTTG7CTTATACGCTAGGGCTGCAGGGACC CTGCCTGACCGTCGACACGGTCTGCTCGTCGTCGCTCGTGGCCATCCACC TTGCCTGCCGCAGCCTGCGCGCTCGCGAGAGCGATCTCGCGCTGGCGGGA GGCGTCAACATGCTCCTTTCGTCCAAGACGATGATAATGCTGGGGCGCAT CCAGGCGCTGTCGCCCGATGGCCACTGCCGGACATTCGACGCCTCGGCCA ACGGGTTCGTCCGTGGGGAGGGCTGCGGTATGGTCGTGCTCAAACGGCTC TCCGACGCCCAGCGACACGGCGATCGGATCTGGGCTCTGATCCGGGGTTC GGCCATGAATCAGGATGGCCGGTCGACAGGGTTGATGGCACCCAATGTGC TCGCTCAGGAGGCGCTCTTGCGCGAGGCGCTGCAGAGCGCTCGCGTCGAC GCCGGGGCCATCGGTTATGTCGAGACCCACGGAACGGGGACCTCGCTCGG CGACCCGATCGAGGTCGAGGCGCTGCGTGCCGTGTTGGGGCCGGCGCGGG CCGATGGGAGCCGCTGCGTGCTGGGCGCAGTGAAGACAAACCTCGGCCAC CTGGAGGGCGCTGCAGGCGTGGCGGGTTTGATCAAGGCGGCGCTGGCTCT GCACCACGAACTGATCCCGCGAAACCTCCATTTCCACACGCTCAATCCGC GGATCCGGATCGAGGGGACCGCGCTCGCGCTGGCGACGGAGCCGGTGCCG TGGCCGCGGGCGGGCCGACCGCGCTTCGCGGGGGTGAGCGCGTTCGGCCT CAGCGGCACCAACGTCCATGTCGTGCTGGAGGAGGCGCCGGCCACGGTGC TCGCACCGGCGACGCCGGGGCGCTCAGCGGAGCTTTTGGTGCTGTCGGCG AAGAGCGCCGCCGCGCTGGACGCACAGGCGGCGCGGCTCTCAGCGCACAT CGCCGCGTACCCGGAGCAGGGTCTCGGAGACGTCGCGTTCAGCCTGGTAT CGACGCGTAGCCCGATGGAGCACCGGCTCGCGGTGGCGGCGACCTCGCGC GAGGCGCTGCGAAGCGCGCTGGAGGTTGCGGCGCAGGGGCAGACCCCGGC AGGCGCGGCGCGCGGCAGGGCCGCTTCCTCGCCCGGCAAGCTCGCCTTCC
TGTTCGCCGGGCAGGGCGCGCAGG7GCCGGGCATGGGCCGTGGGTTGTGG GAGGCGTGGCCGGCGTTCCGCGAGACCTTCGACCGGTGCGTCACGCTCTT CGACCGGGAGCTCCATCAGCCGCTCTGCGAGGTGATGTGGGCCGAGCCGG GCAGCAGCAGGTCGTCGTTGCTGGACCAGACGGCGTTCACCCAGCCGGCG CTCTTTGCGCTGGAGTACGCGCTGGCCGCGCTCTTCCGGTCGTGGGGCGT GGAGCCGGAGCTCGTCGCTGGCCA7AGCCTCGGCGAGCTGGTGGCCGCCT GCGTGGCGGGTGTGTTCTCCCTCGAGGACGCCGTGCGCTTGGTGGTCGCG CGCGGCCGGTTGATGCAGGCGCTGCCGGCCGGCGGCGCGATGGTATCGAT CGCCGCGCCGGAGGCCGACGTGGC7GCCGCGGTGGCGCCGCACGCAGCGT TGGTGTCGATCGCGGCAGTCAATGGGCCGGAGCAGGTGGTGATCGCGGGC GCCGAGAAATTCGTGCAGCAGATCGCGGCGGCGTTCGCGGCGCGGGGGGC GCGAACCAAACCGCTGCATGTCTCGCACGCGTTCCACTCGCCGCTCATGG ATCCGATGCTGGAGGCGTTCCGGCGGGTGACTGAGTCGGTGACGTACCGG CGGCCTTCGATCGCGCTGGTGAGCAACCTGAGCGGGAAGCCCTGCACCGA TGAGGTGAGCGCGCCGGGTTACTG3GTGCGTCACGCGCGAGAGGCGGTGC GCTTCGCGGACGGAGTGAAGGCGC7GCACGCGGCCGGTGCGGGCCTCTTC GTCGAGGTGGGGCCGAAGCCGACGCTGCTCGGCCTTGTGCCGGCCTGCCT GCCGGATGCCAGGCCGGTGCTGC7CCCAGCGTCGCGCGCCGGGCGTGACG AGGCTGCGAGCGCGCTAGAGGCGC7GGGTGGGTTCTGGGTCGTCGGTGGA TCGGTCACCTGGTCGGGTGTCTTCCCTTCGGGCGGACGGCGGGTACCGCT GCCAACCTATCCCTGGCAGCGCGAGCGTTACTGGATCGAAGCGCCGGTCG ATCGTGAGGCGGACGGCACCGGCCGTGCTCGGGCGGGGGGCCACCCCCTT CTGGGTGAAGTCTTTTCCGTGTCGACCCATGCCGGTCTGCGCCTGTGGGA GACGACGCTGGACCGAAAGCGGCTGCCGTGGCTCGGCGAGCACCGGGCGC AGGGGGAGGTCGTGTTTCCTGGCGCCGGGTACCTGGAGATGGCGCTGTCG TCGGGGGCCGAGATCTTGGGCGA7GGACCGATCCAGGTCACGGATGTGGT GCTCATCGAGACGCTGACCTTCGCGGGCGATACGGCGGTACCGGTCCAGG TGGTGACGACCGAGGAGCGACCGGGACGGCTGCGGTTCCAGGTAGCGAGT CGGGAGCCGGGGGAACGTCGCGCGCCCTTCCGGATCCACGCCCGCGGCGT GCTGCGCCGGATCGGGCGCGTCGAGACCCCGGCGAGGTCGAACCTCGCCG
CCCTGCGCGCCCGGCTTCATGCCGCCGTGCCCGCTGCGGCTATCTATGGT GCGCTCGCCGAGATGGGGCTTCAA7ACGGCCCGGCGTTGCGGGGGCTCGC CGAGCTGTGGCGGGGTGAGGGCGAGGCGCTGGGCAGGGTGAGACTGCCTG AGGCCGCCGGCTCCGCGACAGCC7ACCAGCTGCATCCGGTGCTGCTGGAC GCGTGCGTCCAAATGATTGTTGGCGCGTTCGCCGATCGCGATGAGGCGAC GCCGTGGGCGCCGGTGGAGGTGGGCTCGGTGCGGCTGTTCCAGCGGTCTC CTGGGGAGCTATGGTGCCATGCGCGCGTCGTGAGCGATGGTCAACAGGCC TCCAGCCGGTGGAGCGCCGACTTTGAGTTGATGGACGGTACGGGCGCGGT GGTCGCCGAGATCTCCCGGCTGGTGGTGGAGCGGCTTGCGAGCGGTGTAC GCCGGCGCGACGCAGACGACTGGTTCCTGGAGCTGGATTGGGAGCCCGCG GCGCTCGGTGGGCCCAAGATCACAGCCGGCCGGTGGCTGCTGCTCGGCGA GGGTGGTGGGCTCGGGCGCTCGTTGTGCTCGGCGCTGAAGGCCGCCGGCC ATG7CGTCGTCCACGCCGCGGGGGACGACACGAGCACTGCAGGAATGCGC GCGCTCCTGGCCAACGCGTTCGACGGCCAGGCCCCGACGGCCGTGGTGCA CCTCAGCAGCCTCGACGGGGGCGGCCAGCTCGGCCCGGGGCTCGGGGCGC AGGGCGCGCTCGACGCGCCCCGGAGCCCAGATGTCGATGCCGATGCCCTC GAATCGGCGCTGATGCGTGGTTGCGACAGCGTGCTCTCCCTGGTGCAAGC GCTGGTCGGCATGGACCTCCGAAACGCGCCGCGGCTGTGGCTCTTGACCC GCGGGGCTCAGGCGGCCGCCGCCGGCGATGTCTCCGTGGTGCAAGCGCCG CTGTTGGGGCTGGGCCGCACCATCGCCTTGGAGCACGCCGAGCTGCGCTG TATCAGCGTCGACCTCGATCCAGCCGAGCCTGAAGGGGAAGCCGATGCTT TGCTGGCCGAGCTACTTGCAGATGATGCCGAGGAGGAGGTCGCGCTGCGC GGTGGCGACCGGCTCGTTGCGCGGCTCGTCCACCGGCTGCCCGACGCTCA GCGCCGGGAGAAGGTCGAGCCCGCCGGTGACAGGCCGTTCCGGCTAGAGA TCGATGAACCCGGCGCGCTGGACCAACTGGTGCTCCGAGCCACGGGGCGG CGCGCTCCTGGTCCGGGCGAGGTCGAGATCTCCGTCGAAGCGGCGGGGCT CGACTCCATCGACATCCAGCTGGCGTTGGGCGTTGCTCCCAATGATCTGC CTGGAGAAGAAATCGAGCCGTTGGTGCTCGGAAGCGAGTGCGCCGGGCGC ATCGTCGCTGTGGGCGAGGGCGTGAACGGCCTTGTGGTGGGCCAGCCGGT GA7CGCCCTTGCGGCGGGAGTATTTGCTACCCATGTCACCACGTCGGCCA
CGC7GGTGTTGCCTCGGCCTCTGGGGCTCTCGGCGACCGAGGCGGCCGCG ATGCCCCTCGCGTATTTGACGGCCTGGTACGCCCTCGACAAGGTCGCCCA CC7GCAGGCGGGGGAGCGGGTGCTGATCCATGCGGAGGCCGGTGGTGTCG GTC7TTGCGCGGTGCGATGGGCGCAGCGCGTGGGCGCCGAGGTGTATGCG ACCGCCGACACGCCCGAGAACCGTGCCTACCTGGAGTCGCTGGGCGTGCG GTACGTGAGCGATTCCCGCTCGGGCCGGTTCGTCACAGACGTGCATGCAT GGACGGACGGCGAGGGTGTGGACGTCGTGCTCGACTCGCTTTCGGGCGAG CGCATCGACAAGAGCCTCATGGTCCTGCGCGCCTGTGGTCGCCTTGTGAA GCTGGGCAGGCGCGACGACTGCGCCGACACGCAGCCTGGGCTGCCGCCGC TCCTACGGAATTTTTCCTTCTCGCAGGTGGACTTGCGGGGAATGATGCTC GATCAACCGGCGAGGATCCGTGCGCTCCTCGACGAGCTGTTCGGGTTGGT CGCAGCCGGTGCCATCAGCCCACTGGGGTCGGGGTTGCGCGTTGGCGGAT CCCTCACGCCACCGCCGGTCGAGACCTTCCCGATCTCTCGCGCAGCCGAG GCATTCCGGAGGATGGCGCAAGGACAGCATCTCGGGAAGCTCGTGCTCAC GCTGGACGACCCGGAGGTGCGGATCCGCGCTCCGGCCGAATCCAGCGTCG CCGTCCGCGCGGACGGCACCTACCTTGTGACCGGCGGTCTGGGTGGCCTC GGTCTGCGCGTGGCCGGATGGCTGGCCGAGCGGGGCGCGGGGCAACTGGT GCTGGTGGGCCGCTCCGGTGCGGCGAGCGCAGAGCAGCGAGCCGCCGTGG CGGCGCTGGAGGCCCACGGCGCGCGCGTCACGGTGGCGAAAGCGGACGTC GCCGATCGGTCACAGATCGAGCGGGTCCTCCGCGAGGTTACCGCGTCGGG GATGCCGCTGCGGGGTGTCGTGCATGCGGCAGGTCTCGTGGATGACGGGC TGCTGATGCAGCAGACTCCGGCGCGGTTCCGCACGGTGATGGGACCTAAG GTCCAGGGGGCCTTGCACTTGCACACGCTGACACGCGAAGCGCCTCTTTC CTTCTTCGTGCTGTACGCTTCTGCAGCTGGGCTTTTCGGCTCGCCAGGCC AGGGCAACTATGCCGCAGCCAACGCGTTCCTCGACGCCCTTTCGCATCAC CGAAGGGCGCAGGGCCTGCCGGCGCTGAGCATCGACTGGGGCATGTTCAC GGAGGTGGGGATGGCCGTTGCGCAAGAAAACCGTGGCGCGCGGCAGATCT CTCGCGGGATGCGGGGCATCACCCCCGATGAGGGTCTGTCAGCTCTGGCG CGCTTGCTCGAGGGTGATCGCGTGCAGACGGGGGTGATACCGATCACTCC GCGGCAGTGGGTGGAGTTCTACCCGGCAACAGCGGCCTCACGGAGGTTGT
CGCGGCTGGTGACCACGCAGCGCGCGGTCGCTGATCGGACCGCCGGGGAT CGGGACCTGCTCGAACAGCTTGCGTCGGCTGAGCCGAGCGCGCGGGCGGG GCTGCTGCAGGACGTCGTGCGCGTGCAGGTCTCGCATGTGCTGCGTCTCC CTGAAGACAAGATCGAGGTGGATGCCCCGCTCTCGAGCATGGGCATGGAC TCGCTGATGAGCCTGGAGCTGCGCAACCGCATCGAGGCTGCGCTGGGCGT CGCCGCGCCTGCAGCCTTGGGGTGGACGTACCCAACGGTAGCAGCGATAA CGCGCTGGCTGCTCGACGACGCCCTCGTCGTCCGGCTTGGCGGCGGGTCG GACACGGACGAATCGACGGCGAGCGCCGGTTCGTTCGTCCACGTCCTCCG CTTTCGTCCTGTCGTCAAGCCGCGGGCTCGTCTCTTCTGTTTTCACGGTT CTGGCGGCTCGCCCGAGGGCTTCCGTTCCTGGTCGGAGAAGTCTGAGTGG AGCGATCTGGAAATCGTGGCCATGTGGCACGATCGCAGCCTCGCCTCCGA GGACGCGCCTGGTAAGAAGTACGTCCAAGAGGCGGCCTCGCTGATTCAGC ACTATGCAGACGCACCGTTTGCGT7AGTAGGGTTCAGCCTGGGTGTCCGG TTCGTCATGGGGACAGCCGTGGAGCTCGCCAGTCGTTCCGGCGCACCGGC TCCGCTGGCCGTCTTCACGTTGGGCGGCAGCTTGATCTCTTCTTCAGAGA TCACCCCGGAGATGGAGACCGATA7AATAGCCAAGCTCTTCTTCCGAAAT GCCGCGGGTTTCGTGCGATCCACCCAACAAGTCCAGGCCGATGCTCGCGC AGACAAGGTCATCACAGACACCATGGTGGCTCCGGCCCCCGGGGACTCGA AGGAGCCGCCCGTGAAGATCGCGG7CCCTATCGTCGCCATCGCCGGCTCG GACGATGTGATCGTGCCTCCGAGCGACGTTCAGGATCTACAATCTCGCAC CACGGAGCGCTTCTATATGCATCTCCTTCCCGGAGATCACGAATTTCTCG TCGATCGAGGGCGCGAGATCATGCACATCGTCGACTCGCATCTCAATCCG CTGCTCGCCGCGAGGACGACGTCGTCAGGCCCCGCGTTCGAGGCAAAATG ATGGCAGCCTCCCTCGGGCGCGCGAGATGGTTGGGAGCAGCGTGGGCGCT GGCGGCCGGCGGCAGGCCGCGGAGGCGCATGAGCCTTCCTGGACGTTTGC AGTATAGGAGATTTTATGACACAGGAGCAAGCGAATCAGAGTGAGACGAA GCCTGCTTTCGACTTCAAGCCGTTCGCGCCTGGGTACGCGGAGGACCCGT TCCCCGCGATCGAGCGCCTGAGAGAGGCAACCCCCATCTTCTACTGGGAT GAAGGCCGCTCCTGGGTCCTCACCCGATACCACGACGTGTCGGCGGTGTT CCGCGACGAACGCTTCGCGGTCAG7CGAGAAGAGTGGGAATCGAGCGCGG
AGTACTCGTCGGCCATTCCCGAGC7CAGCGATATGAAGAAGTACGGATTG TTCGGGCTGCCGCCGGAGGATCACGCTCGGGTCCGCAAGCTCGTCAACCC GTCGTTTACGTCACGCGCCATCGACCTGCTGCGCGCCGAAATACAGCGCA CCGTCGACCAGCTGCTCGATGCTCGCTCCGGACAAGAGGAGTTCGACGTT GTGCGGGATTACGCGGAGGGAATCCCGATGCGCGCGATCAGCGCTCTGTT GAAGGTTCCGGCCGAGTGTGACGAGAAGTTCCGTCGCTTCGGCTCGGCGA CTGCGCGCGCGCTCGGCGTGGGTTTGGTGCCCCAGGTCGATGAGGAGACC AAGACCCTGGTCGCGTCCGTCACCGAGGGGCTCGCGCTGCTCCATGACGT CCTCGATGAGCGGCGCAGGAACCCGCTCGAAAATGACGTCTTGACGATGC TGCTTCAGGCCGAGGCCGACGGCAGCAGGCTGAGCACGAAGGAGCTGGTC GCGCTCGTGGGTGCGATTATCGCTGCTGGCACCGATACCACGATCTACCT TATCGCGTTCGCTGTGCTCAACC7GCTGCGGTCGCCCGAGGCGCTCGAGC TGGTGAAGGCCGAGCCCGGGCTCA7GAGGAACGCGCTCGATGAGGTGCTC CGCTTCGACAATATCCTCAGAATAGGAACTGTGCGTTTCGCCAGGCAGGA CCTGGAGTACTGCGGGGCATCGATCAAGAAAGGGGAGATGGTCTTTCTCC TGATCCCGAGCGCCCTGAGAGATGGGACTGTATTCTCCAGGCCAGACGTG TTTGATGTGCGACGGGACACGGGCGCGAGCCTCGCGTACGGTAGAGGCCC CCATGTCTGCCCCGGGGTGTCCCT7GCTCGCCTCGAGGCGGAGATCGCCG TGGGCACCATCTTCCGTAGGTTCCCCGAGATGAAGCTGAAAGAAACTCCC GTGTTTGGATACCACCCCGCGTTCCGGAACATCGAATCACTCAACGTCAT CTTGAAGCCCTCCAAAGCTGGATAGCTCGCGGGGGTATCGCTTCCCGAAC CTCATTCCCTCATGATACAGCTCGCGCGCGGGTGCTGTCTGCCGCGGGTG CGATTCGATCCAGCGGACAAGCCCATTGTCAGCGCGCGAAGATCGAATCC ACGGCCCGGAGAAGAGCCCGTCCGGGTGACGTCGGAAGAAGTGCCGGGCG CCGCCCTGGGAGCGCAAAGCTCGC7CGTTCGCGCTCAGCACGCCGCTCGT CATGTCCGGCCCTGCACCCGCGCCGAGGAGCCGCCCGCCCTGATGCACGG CCTCACCGAGCGGCAGGTTCTGCTCTCGCTCGTCGCCCTCGCGCTCGTCC TCCTGACCGCGCGCGCCTTCGGCGAGCTCGCGCGGCGGCTGCGCCAGCCC GAGGTGCTCGGCGAGCTCTTCGGCGGCGTGGTGCTGGGCCCGTCCGTCGT CGGCGCGCTCGCTCCTGGGTTCCA7CGAGTCCTCTTCCAGGATCCGGCGG
TCGGGGTCGTGCTCTCCGGCATC7CCTGGATAGGCGCGCTCGTCCTGCTG CTCATGGCGGGTATCGAGGTCGA7GTGAGCATCCTGCGCAAGGAGGCGCG CCCCGGGGCGCTCTCGGCGCTCGGCGCGATCGCGCCCCCGCTGCGCACGC CGGGGCCGCTGGTGCAGCGCATGCAGGGCGCGTTCACGTGGGATCTCGAC GTC7CGCCGCGACGCTCTGCGCAAGCCTGAGCCTCGGCGCCTGCTCGTAC ACCTCGCCGGTGCTCGCTCCGCCCGCGGACATCCGGCCGCCCGCCGCGGC CCAGCTCGAGCCGGACTCGCCGGATGACGAGGCCGACGAGGCCGACGAGG CGCTCCGCCCGTTCCGCGACGCGATCGCCGCGTACTCGGAGGCCGTTCGG TGGGCGGAGGCGGCGCAGCGGCCGCGGCTGGAGAGCCTCGTGCGGCTCGC GATCGTGCGGCTGGGCAAGGCGCTCGACAAGGTCCCTTTCGCGCACACGA CGGCCGGCGTCTCCCAGATCGCCGGCAGACTCCAGAACGATGCGGTCTGG TTCGATGTCGCCGCCCGGTACGCGAGCTTCCGCGCGGCGACGGAGCACGC GCTCCGCGACGCGGCGTCGGCCATGGAGGCGCTCGCGGCCGGCCCGTACC GCGGATCGAGCCGCGTGTCCGCTGCCGTAGGGGAGTTTCGGGGGGAGGCG GCGCGCCTTCACCCCGCGGACCGTGTACCCGCGTCCGACCAGCAGATCCT GACCGCGCTGCGCGCAGCCGAGCGGGCGCTCATCGCGCTCTACACTGCGT TCGCCCGTGAGGAGTGAGCCTCTCTCGGGCGCAGCCGAGCGGCGGCGTGC CGGTGGTTCCCTCTTCGCAACCATGACCGGAGCCGCGCTCGGTCCGCGCA GCGGCTAGCGCGCGTCGCGGCAGAGATCGCTGGAGCGACAGGCGACGACC CGCCCGAGGGTGTCGAACGGATTGCCGCAGCCCTCATTGCGGATCCCCTC CAGACACTCGTTCAGCTGCTTGGCGTCGATGCCGCCTGGGCACTCGCCGA AGGTCAGCTCGTCGCGCCACTCGGATCGGATCTTGTTCGAGCACGCGTCC TTGCTCGAATACTCCCGGTCTTGTCCGATGTTGTTGCACCGCGCCTCGCG GTCGCACCGCGCCGCCACGATGCTATCGACGGCGCTGCCGACTGGCACCG GCGCCTCGCCCTGCGCGCCACCCGGGGTTTGCGCCTCCCCGCCTGACCGC TTTTCGCCGCCGCACGCCGCGAGCAGGCTCATTCCCGACACCGAGATCAG GCCCACGACCAGCTTCCCAGCAATCTTTTGCATGGCTTCCCCTCCCTCAC GACACGTCACATCAGAGACTCTCCGCTCGGCTCGTCGGTTCGACAGCCGG CGACGGCCACGAGCAGAACCGTCCCCGACCAGAACAGCCGCATGCGGGTT TCTCGCAACATGCCCCGACATCCTTGCGACTAGCGTGCCTCCGCTCGTGC
CGAGATCGGCTGTCCTGTGCGACGGCAATATCCTGCGATCGGCCGGGCAG GAGGTACCGACACGGGCGCCGGGCGGGAGGTGCCGCCACGGGCTCGAAAT GTGCTGCGGCAGGCGCCTCCATGCCCGCAGCCGGGAACGCGGCGCCCGGC CAGCCTCGGGGTGACGCCGCAAACGGGAGATGCTCCCGGAGAGGCGCCGG GCACAGCCGAGCGCCGTCACCACCGTGCGCACTCGTGAGCTCCAGCTCCT CGGCATAGAAGAGACCGTCACTCCCGGTCCGTGTAGGCGATCGTGCTGAT CAGCGCGTTCTCCGCCTGACGCGAGTCGAGCCGGGTATGCTGCACGACAA TGGGAACGTCCGATTCGATCACGCTGGCATAGTCCGTATCGCGCGGGATC GGCTCGGGTTCGGTCAGATCGTTGAACCGGACGTGCCGGGTGCGCCTCGC TGGGACGGTCACCCGGTACGGCCCGGCGGGGTCGCGGTCGCTGAAGTAGA CGGTGATGGCGACCTGCGCGTCCCGGTCCGACGCATTCAACAGGCAGGCC GTCTCATGGCTCGTCATCTGCGGCTCGGGTCCGTTGCTCCGGCCTGGGAT GTAGCCCTCTGCGATTGCCCAGCGCGTCCGCCCGATCGGCTTCTCCATAT GTCCTCCCTGCTGGCTCCTCTTTGGCTGCCTCCCTCTGCTGTCCAGGAGC GACGGCCTCTTCTCCCGACGCGCTCGGGGATCCATGGCTGAGGATCCTCG CCGAGCGCTCCTTGCCGACCGGCGCGCCGAGCGCCGACGGGCTTTGAAAG CACGCGACCGGACACGTGATGCCGGCGCGACGAGGCCGCCCCGCGTCTGA TCCCGATCGTGACATCGCGACGTCCGCCGGCGCCTCTGCAGGCCGGCCTG AGCGTTGCGCGGTCATGGTCGTCCTCGCGTCACCGCCACCCGCCGATTCA CATCCCACCGCGGCACGACGCTTGCTCAAACCGCGGCGAGACGGCCGGGC GGCTGTGGTACCGGCCAGCCCGGACGCGAGGCCCGAGAGGGACAGTGGGT CCGCCGTGAAGCAGTGAGGCGATCGAGGTGGCAGATGAAACACGTTGACA CGGGCCGACGAGTCGGCCGCCGGATAGGGCTCACGCTCGGTCTCCTCGCG AGCATGGCGCTCGCCGGCTGTGGCGGCCCGAGCGAGAAAATCGTGCAGGG CACGCGGCTCGCGCCCGGCGCCGATGCGCACGTCGCCGCCGACGTCGACC CCGACGCCGCGACCACGCGGCTGGCGGTGGACGTCGTTCACCTCTCGCCG CCCGAGCGCATCGAGGCCGGCAGCGAGCGGTTCGTCGTCTGGCAGCGTCC GAGCTCCGAGTCCCCGTGGCAACGGGTCGGAGTGCTCGACTACAACGCTG CCAGCCGAAGAGGCAAGCTGGCCGAGACGACCGTGCCGCATGCCAACTTC GAGCTGCTCATCACCGTCGAGAAGCAGAGCAGCCCTCAGTCTCCATCTTC
TGCCGCCGTCATCGGGCCGACGTCCGTCGGGTAACATCGCGCTATCAGCA GCGCTGAGCCCGCCAGCAGGCCCCAGAGCCCTGCCTCGATCGCCTTCTCC ATCATATCATCCCTGCGTACTCCTCCAGCGACGGCCGCGTCGAAGCAACC GCCGTGCCGGCGCGGCTCTACGTGCGCGACAGGAGAGCGTCCTGGCGCGG CCTGCGCATCGCTGGAAGGATCGGCGGAGCATGGAGAAAGAATCGAGGAT CGCGATCTACGGCGCCATCGCAGCCAACGTGGCGATCGCGGCGGTCAAGT TCATCGCCGCCGCCGTGACCGGCAGCTCGGCGATGCTCTCCGAGGGCGTG CACTCCCTCGTCGATACTGCAGACGGGCTCCTCCTCCTGCTCGGCAAGCA CCGGAGCGCACGCCCGCCCGACGCCGAGCATCCGTTCGGCCACGGCAAGG AGCTCTATTTCTGGACGCTGATCG7CGCCATCATGATCTTCGCCGCGGGC GGCGGCGTCTCGATCTACGAAGGGATCTTGCACCTCTTGCACCCGCGCCA GATCGAGGATCCGACGTGGAACTACGTCGTCCTCGGCGCAGCGGCCGTCT TCGAGGGGACGTCGCTCATCATCTCGATCCACGAGTTCAAGAAGAAGGAC GGACAGGGCTACCTCGCGGCGATGCGGTCCAGCAAGGACCCGACGACGTT CACGATCGTCCTGGAGGACTCCGCGGCGCTCGCCGGGCTCACCATCGCCT TCCTCGGCGTCTGGCTCGGGCACCGCCTGGGAAACCCCTACCTCGACGGC GCGGCGTCGATCGGCATCGGCCTCGTGCTCGCCGCGGTCGCGGTCTTCCT CGCCAGCCAGAGCCGTGGGCTCCTCGTGGGGGAGAGCGCGGACAGGGAGC TCCTCGCCGCGATCCGCGCGCTCGCCAGCGCAGATCCTGGCGTGTCGGCG GTGGGGCGGCCCCTGACGATGCAC7TCGGTCCGCACGAAGTCCTGGTCGT GCTGCGCATCGAGTTCGACGCCGCGCTCACGGCGTCCGGGGTCGCGGAGG CGAGGGAGCGCATCGAGACCCGGA7ACGGAGCGAGCGACCCGACGTGAAG CACATCTACGTCGAGGCCAGGTCGCTCCACCAGCGCGCGAGGGCGTGACG CGCCGTGGAGAGACCGCGCGCGGCCTCCGCCATCCTCCGCGGCGCCCGGG CTCAGGTGGCCCTCGCAGCAGGGCGCGCCTGGCGGGCAAACCGTGCAGAC GTCGTCCTTCGACGCGAGGTACGC7GGTTGCAAGTCGTCACGCCGTATCG CGAGGTCCGGCAGCGCCGGAGCCCGGGCGGGCCGGGCGCACGAAGGCGCG GCGAGCGCAGGCTTCGAGGGGGGCGACGTCATGAGGAAGGCCAGGGCGCA TGGGGCGATGCTCGGCGGGCGAGA7GACGGCTGGCGTCGCGGCCTCCCCG GCGCCGGCGCGCTTCGCGCCGCGC7CCAGCGCGGTCGCTCGCGCGATCTC
GCCCGGCGCCGGCTCATCGCCTCCGTGTCCCTCGCCGGCGGCGCCAGCAT GGCGGTCGTCTCGCTGTTCCAGC7CGGGATCATCGAGCGCCTGCCCGATC CTCCGCTTCCAGGGTTCGATTCGGCCAAGGTGACGAGCTCCGATATCGCG TTCGGGCTCACGATGCCGGACGCGCCGCTCGCGCTCACCAGCTTCGCGTC CAACCTCGCGCTGGCTGGCTGGGGAGGCGCCGAGCGCGCCAGGAACACCC CCTGGATCCCCGTCGCCGTGGCGGCCAAGGCGGCCGTCGAGGCGGCCGTG TCCGGATGGCTCCTCGTCCAGATGCGACGGCGGGAGAGGGCCTGGTGCGC GTACTGCCTGGTCGCCA7GGCGGCCAACATGGCCGTGTTCGCGCTCTCGC TCCCGGAAGGGTGGGCGGCGCTGGGGAAGGCGCGAGCGCGCTCGTGACAG GACGGGCGCGGGCAGCCCCGGCCATCGGAGGCCGGCGTGCACCCGCTCCG TCACGCCCCAGCCCGCGCCGCGTGATCTCCCGCGGACAGGGCGCGTACCG TGGACCCCGCACGCGCCGCGTCGACGGACATCCCCGGCGACCCGCGCGGC GCGACCCGCGCAACTCCGGCCCGCCGCCGGGCATCGACATCTCCCGTGAG CAAGGGCACTCCGCTCCTGCCCGCGTCCGCGAACGATGGCTGCGCTGTTT CCACCCTGGAGCAACTCCGTTTACCGCGTGGCGCTCGTCGGGCTCGTCGC CTCGGCGGGCGGCGCCATCCTCGCGCTCATGATCTACGTCCGCACGCCGT GGAAGCGATACCAGTTCGAGCCCGTCGATCAGCCGGTGCAGTTCGATCAC CGCCATCACGTGCAGGACGACGCCATCGATTGCGTCTACTGCCACACCAC GGTGACCCGCTCGCCCACGGCGGGGATGCCGCCGACGGCCACGTGCATGG GGTGCCACAGCCAGATCTGGAATCAGAGCGTCATGCTCGAGCCCGTGCGG CGGAGCTGGTTCTCCGGCCACGCCGATCCCGTGGAACCGGGTGAAACTCC GTGCCCGACTTCGTCTATTTCAACCACGCGATCCACGTGAACAAGGGCGT GGGCTGGCGTGAAGCTGCCACGGGCGCGTGGACGAGATGGCGGCCGTCTA CAAGGTGGCGCCGATGACGATGGGCTGGTGCCTGGAGTGCCATCGCCTGC CGGAGCCGCACCTCCGCCCGCTC7CCGCGATCACCGACATGCGCTGGGAC CCGGGGGAGCGGAGGGATGAGCTCGGGGCGCAGCTCGCGAAGGAATACGG GGTCCGGCGGCTCACGCACTGCACAGCGTGCCATCGATGAACGATGAACA GGGGATCTCCTTGAAAGACGCAGATGAGATGAAGGAATGGTGGCTAGAAG CGCTCGGGCCGGCGGGAGAGCGCGCGTCCTACAGGCTGCTGGCGCCGCTC ATCGAGAGCCCGGAGCTCCGCGCGCTCGCCGCGGGCGAACCGCCCCGGGG
CG7GGACGAGCCGGCGGGCGTCAGCCGCCGCGCGCTGCTCAAGCTGCTCG GCGCGAGCATGGCGCTCGCCGGCGTCGCGGGCTGCACCCCGCATGAGCCC GAGAAGATCCTGCCGTACAACGAGACCCCGCCCGGCGTCGTGCCGGGTCT CTCCCAATCCTACGCGACGAGCA7GGTGCTCGACGGGTATGCCATGGGCC TCC7CGCCAAGAGCTACGCGGGGCGGCCCATCAAGATCGAGGGCAACCCC GCGCACCCGGCGAGCCTCGGCGCGACCGGCGTCCACGAGCAGGCCTCGAT CCTCTCGCTGTACGACCCGTACCGCGCGCGCGCGCCGACGCGCGGCGGCC AGGTCGCGTCGTGGGAGGCGCTCTCCGCGCGCTTCGGCGGCGACCGCGAG GACGGCGGCGCTGGCCTCCGCTTCGTCCTCCAGCCCACGAGCTCGCCCCT CATCGCCGCGCTGATCGAGCGCGTCCGGCGCAGGTTCCCCGGCGCGCGGT TCACCTTCTGGTCGCCGGTCCACGCCGAGCACGCGCTCGAAGGCGCGCGG GCGGCGCTCGGCCTCAGGCTCTTGCCTCAGCTCGACGTCGACCAGGCCGA GGTGATCCTCTCGCTGGACGCGGACTTCCTCGCGGACATGCCGTTCAGCG TGCGCTATGCGCGCGACTTCGCCGCGCGCCGCCGCCCCGCGAGCCCGGCG GCGGCCATGAGCCGCCTCTACGTCGCGGAGGCGATGTTCACGCCCACGGG GACGCTCGCCGACCACCGGCTCCGCGTGCGGCCCGCCGAGGTCGCGCGCG TCGCGGCCGGCGTCGCGGTGGGAACTCGTGCACGAGTCTTTGTCTTGCGC CCTGTCCGGGAATAACGGACACCTTATCGCGGGTCGCTCTTTGTGCGCGG CTTCTGTACCTCTCAGGACAGGTAGAAGAGGGACTCAGGGGCCCTTATGT TAACTGGGGATGCCTTCGGGACGGCCGCAAATATATCCTATCACCTCACT GGGTGTGGGGGAGCACCGCGAGGATGTACAACCTCTGTAACTCTATGTGA GATAATGTGTGCAGTGATCTGAGACTTATTTGTGTGACCGAGACGTCTCT CTTATTGGTACGCATAGTATAATATAACACGTCTCATACATACTCCCGAC ATATCCGCGGTATGCGCGCACATAGAATAGGTGATGATAAATCCCTAGTG TGTGGAACTAGAAGATGCGGGAGTTACCTGATATTTACGGAAAAAGTATT ATCTCAACTACCTCTCTGTTGAGACTATCACTTCGGTGTCGTTGTGCTGC TGGT,
or its complementary strand,
(b) DNA-sequences which hybridise under stringent conditions to regions of DNA-sequences according to (a) encoding proteins or to fragments of said DNA-sequences, (c) DNA-sequences which hybridise to the DNA-sequences according to (a) and (b) because of a degeneration of the genetic code,
(d) allele variations and mutants resulting by substitution, insertion or deletion of nucleotides or inversion of nucleotide segments of DNA-sequences according to (a) to (c) , wherein the variations and mutants offer isofunctional expression products.
11. Peptide encoded by a DNA sequence according to claim 10 selected from the group consisting of
Seq ID No 83
>Contιg56_003 2890 ammo acids M =307428 D pl=5.76 numambιg=13 IRPRAAAVPMRSTVTGGVIAGPELGASY ADNLRQPVRFAAAAQALLEGGPALFIEMSPH PILVPPLDEIQTAAEQGGAAVGSLRRGQDERATLLEALGTL ASGYPVS ARLFPAGGRR VPLPTYP QHERC IEVEPDARRLAAADPTKD FYRTD PEVPRAAPKSETAHGSWLLLA DRGGVGEAVAAALSTRGLSCTVLHASADASTVAEQVSEAASRRND QGVLYL GLDAWD AGASADEVSEATRRATAPVLGLVRFLSAAPHPPRF WTRGACTVGGEPEASLCQAAL G LARVAALEHPAA GGLVDLDPQKSPTEIEPLVAELLSPDAEDQLAFRSGRRHAARLVAAP PEGDVAPISLSAEGSYLVTGGLGGLGLLVAR LVERGARHLVLTSRHGLPERQASGGEQP PEARARIAAVEGLEAQGARVTVAAVDVAEADPMTALLAAIEPPLRGWHAAGVFPVRHLA ETDEALLESVLRPKVAGS LLHRLLRDRPLDLFVLFSSGAAVWGGKGQGAYAAANAFLDG LAHHRRAHSLPALSLA GL AEGGMVDAKAHARLSDIGVLPMATGPALSALERLVNTSAV QRSVTRMD ARFAPVYAARGRRNLLSALVAEDERAASPPVPTANRI RGLSVAESRSALY
ELVRGIVARVLGFSDPGALDVGRGFAEQGLDSLMALEIRNRLQRELGERLSATLAFDHPT VERLVAHLLTDVLKLEDRSDTRHIRSVAADDDIAIVGAACRFPGGDEGLETY RHLAEGM WSTEVPADR RAAD YDPDPEVPGRTYVAKGAFLRDVRSLDAAFFAISPREAMSLDPQQ RLLLEVS EAIERAGQDPMALRESATGVFVGMIGSEHAERVQGLDDDAALLYGTTGNLLS VAAGRLSFFLGLHGPTMTVDTACSSSLVALHLACQSLRLGECDQALAGGSSVLLSPRSFV AASRMRLLSPDGRCKTFSAAADGFARAEGCAVWLKRLRDAQRDRDPILAWRSTAINHD GPSSGLTVPSGPAQQALLRQALAQAGVAPAEVDFVECHGTGTALGDPIEVQALGAVYGRG RPAERPLWLGAVKANLGHLEAAAGLAGVLKVLLALEHEQIPAQPELDELNPHIPWAELPV AWRRAVPWPRGARPRRAGVSAFGLSGTNAHWLEEAPAVEPVAAAPERAAELFVLSAKS AAALDAQAARLRDHLEKHVELGLGDVAFSLATTRSAMEHRLAVAASSREALRGALSAAAQ GHTPPGAVRGRASGGSAPKWFVFPGQGSQWVGMGRKLMAEEPVFRAALEGCDRAIEAEA GWSLLGELSADEAASQLGRIDWQPVLFAMEVALSALWRSWGVEPEAWGHSMGEVAAAH VAGALSLEDAVAIICRRSRLLRRISGQGEMALVELSLEEAEAALRGHEGRLSVAVSNSPR STVLAGEPAALSEVLAALTAKGVFWRQVKVDVASHSPQVDPLREELIAALGAIRPRAAAV PMRSTVTGGVIAGPELGASYWADKLRQPVRFAAAAQALLEGGPALFIEMSPHPILVPPLD EIQTAAEQGGAAVGSLRRGQDERA7LLEALGTLWASGYPVSWARLFPAGGRRVPLPTYPW QHERYWIEDSVHGSKPSLRLRQLRNGATDHPLLGAPLLVSARPGAHLWEQALSDERLSYL SEHRVHGEAVLPSAAYVEMALAAGVDLYGTATLVLEQLALERALAVPSEGGRIVQVALSE EGPGRASFQVSSREEAGRSWVRHATGHVCSGQSSAVGALKEAPWEIQRRCPSVLSSEALY PLLNEHALDYGPCFQGVEQVWLGTGEVLGRVRLPGDMASSSGAYRIHPALLDACFQVLTA LLTTPESIEIRRRLTDLHEPDLPRSRAPVNQAVSDTWLWDAALDGGRRQSASVPVDLVLG SFHAKWEVMERLAQAYIIGTLRIVJNVFCAAGERHTIDELLVRLQISWYRKVIKRWMEHL VAIGILVGDGEHFVSSQPLPEPDLAAVLEEAGRVFADLPVLFEWCKFAGERLADVLTGKT LALEILFPGGSFDMAERIYRDSPIARYSNGIVRGWESAARWAPSGMFSILEIGAGTGA TTAAVLPVLLPDRTEYHFTDVSPLFLARAEQRFRDYPFLKYGILDVDQEPAGQGYAHQRF DVIVAANVIHATRDIRATAKRLLSLLAPGGLLVLVEGTGHPIWFDITTGLIEGWQKYEDD LRIDHPLLPARTWCDVLRRVGFADAVSLPGDGSPAGILGQHVILSRAPGIAGAACDSSGE SATESPAARAVRQEWADGSADWHRMALERMYFHRRPGRQVWVHGRLRTGGGAFTKALAG DLLLFEDTGQWAEVQGLRLPQLEASAFAPRDPREEWLYALEWQRKDPIPEAPAAASSSS AGAWLVLMDQGGTGAALVSLLEGRGEACVRVIAGTAYACLAPGLYQVDPAQPDGFHTLLR
DAFGEDRICRAWHMWSLDATAAGΞRATAESLQADQLLGSLSALSLVQALVRRRWRNMPR LWLLTRAVHAVGAEDAAASVAQAPVWGLGRTLALEHPELRCTLVDVNPAPSPEDAAALAV ELGASDREDQVALRSDGRYVARLVRSSFSGKPATDCGIRADGSYVITDGMGRVGLSVAQW MVMQGARHWLVDRGGASEASRDALRSMAEAGAEVQIVEADVARRDDVARLLSKIEPSMP PLRGIVYVDGTFQGDSSMLELDARRFKEWMYPKVLGAWNLHALTRDRSLDFFVLYSSGTS LLGLPGQGSRAAGDAFLDAIAHHRCKVGLTAMSINWGLLSEASSPATPNDGGARLEYRGM EGLTLEQGAAALGRLLARPRAQVGVMRLNLRQWLEXXXXXXXXXXXXXWYNLLIIIQYTK VPFQGPFRML*
Seq ID No 84
>Contig56_027 700 amino acids MW=80569 D pl=7.02 numambig=0 MNMELNITSKSNPFGDTTAENDKKMLSNAFIETADFRTLIETDDRTIWGRRGTGKSALF IQLNEHWKKDKKILILSFSPDDSQIIGFRSMLKPFTGSFNLARAATRLLWRYAMLMEIAS YISSHYKLSSQISSETLLNEHLKKWNSAQGDILRKCRLVAKEYLDENNPEESIGDLQFNL NISEIENNIVSLLERSDRKWILMDKLDEAYEPDNIGIGIIAGLAYASIELNQKAKCIRP IIFLRDNIFRSLSKEDPDYSRNIEGQVIRLHWDWAQLLMLSAKRMKVAFKLDIEKDQRVW DRCTADDLKGRNGFKRCLQFTLYRPRDLLSLLNEAFFSAFRENRETIINTDLEYAAKSIS MARLEDLWKEYQKIFPSIQVITSAFRSIEPELTVYTCLKKIEASFELIEENGDPKITSEI QLLKASGILQSLYSVGFVGIRDKNTSSYSFCHDGRTPDKGFESNEKLLIHPCYWLGLNLN RNALAPEEAEEINDEYDINIISDNSAIRNKTIGQITTHLDQIPIGNEGATEFEQWCLDAL RIVFASHLTDIKSHPNGNAVQRRDIIGTNGGKSDFWKRVLEDYKTRQWFDAKNFEELGP SEYRQLQSYLTGPYGKLGFIINRDESEVLKSGKDLDWTKEMYQSHNSLIIKLPAKYISKL LQKLRNPEKHDAIDRQMGKLLTLYETSYMAIKSTQKKRRK*
Seq ID No 85
>Contig57_001 372 amino acids MW=38411 D pl=12.39 numambig=10 MLTSXXXXXXXXXXLLAYRCATAARGAGRIRHHRQYAQRRRRTWLYARAAGTLPDRRHG LLWARGHPPCLPQPARSRERSRAGGRRQHAPFVQDDDNAGAHPGAVARWPLPDIRRLGQ RVRPWGGLRYGRAQTALRRPATRRSDLGSDPGFGHESGWPVDRVDGTQCARSGGALARGA AERSRRRRGHRLCRDPRNGDLARRPDRGRGAACRVGAGAGRWEPLRAGRSEDKPRPPGGR CRRGGFDQGGAGSAPRTDPAKPPFPHAQSADPDRGDRARAGDGAGAVAAGGPTALRGGER VRPQRHQRPCRAGGGAGHGARTGDAGALSGAFGAVGEERRRAGRTGGAALSAHRRVPGAG SRRRRVQPGIDA* Seq ID No 86
>Contig57_002 2259 amino acids MW=238258 D pl=5.92 numambig=0 MSYTLGLQGPCLTVDTVCSSSLVAIHLACRSLRARESDLALAGGVNMLLSSKTMIMLGRI QALSPDGHCRTFDASANGFVRGEGCGMWLKRLSDAQRHGDRIWALIRGSAMNQDGRSTG LMAPNVLAQEALLREALQSARVDAGAIGYVETHGTGTSLGDPIEVEALRAVLGPARADGS RCVLGAVKTNLGHLEGAAGVAGLIKAALALHHELIPRNLHFHTLNPRIRIEGTALALATE PVPWPRAGRPRFAGVSAFGLSGTNVHWLEEAPATVLAPATPGRSAELLVLSAKSAAALD AQAARLSAHIAAYPEQGLGDVAFSLVSTRSPMEHRLAVAATSREALRSALEVAAQGQTPA GAARGRAASSPGKLAFLFAGQGAQVPGMGRGLWEAWPAFRETFDRCVTLFDRELHQPLCE VMWAEPGSSRSSLLDQTAFTQPALFALEYALAALFRSWGVEPELVAGHSLGELVAACVAG VFSLEDAVRLWARGRLMQALPAGGAMVSIAAPEADVAAAVAPHAALVSIAAVNGPEQW IAGAEKFVQQIAAAFAARGARTKPLHVSHAFHSPLMDPMLEAFRRVTESVTYRRPSIALV SNLSGKPCTDEVSAPGYWVRHAREAVRFADGVKALHAAGAGLFVEVGPKPTLLGLVPACL PDARPVLLPASRAGRDEAASALEALGGFWWGGSVTWSGVFPSGGRRVPLPTYPWQRERY WIEAPVDREADGTGRARAGGHPLLGEVFSVSTHAGLRLWETTLDRKRLPWLGEHRAQGEV VFPGAGYLEMALSSGAEI GDGPIQVTDWLIETLTFAGDTAVPVQWTTEERPGRLRFQ VASREPGERRAPFRIHARGVLRRIGRVETPARSNLAALRARLHAAVPAAAIYGALAEMGL QYGPALRGLAELWRGEGEALGRVRLPEAAGSATAYQLHPVLLDACVQMIVGAFADRDEAT PWAPVEVGSVRLFQRSPGELWCHARWSDGQQASSRWSADFELMDGTGAWAEISRLWE RLASGVRRRDADDWFLELDWEPAALGGPKITAGRWLLLGEGGGLGRSLCSALKAAGHVW HAAGDDTSTAGMRALLANAFDGQAPTAWHLSSLDGGGQLGPGLGAQGALDAPRSPDVDA DALESALMRGCDSVLSLVQALVGMDLRNAPRLWLLTRGAQAAAAGDVSWQAPLLGLGRT IALEHAELRCISVDLDPAEPEGEADALLAELLADDAEEEVALRGGDRLVARLVHRLPDAQ RREKVEPAGDRPFRLEIDEPGALDQLVLRATGRRAPGPGEVEISVEAAGLDSIDIQLALG VAPNDLPGEEIEPLVLGSECAGRI VAVGEGVNGLWGQPVIALAAGVFATHVTTSATLVL
PRPLGLSATEAAAMPLAYLTAWYALDKVAHLQAGERVLIHAEAGGVGLCAVRWAQRVGAE VYA7ADTPENRAYLESLGVRYVSDSRSGRFVTDVHAWTDGEGVDWLDSLSGERIDKSLM VLRACGRLVKLGRRDDCADTQPGLPPLLRNFSFSQVDLRGMMLDQPARIRALLDELFGLV AAGAISPLGSGLRVGGSLTPPPVETFPISRAAEAFRRMAQGQHLGKLVLTLDDPEVRIRA PAΞSSVAVRADGTYLVTGGLGGLGLRVAGWLAERGAGQLVLVGRSGAASAEQRAAVAALE AHGARVTVAKADVADRSQIERVLREVTASGMPLRGWHAAGLVDDGLLMQQTPARFRTVM GPKVQGALHLHTLTREAPLSFFVLYASAAGLFGSPGQGNYAAANAFLDALSHHRRAQGLP ALSIDWGMFTEVGMAVAQENRGARQISRGMRGITPDEGLSALARLLEGDRVQTGVIPITP RQWVEFYPATAASRRLSRLVTTQRAVADRTAGDRDLLEQLASAEPSARAGLLQDWRVQV SHVLRLPEDKIEVDAPLSSMGMDSLMSLELRNRIEAALGVAAPAALGWTYPTVAAITRWL LDDALWRLGGGSDTDESTASAGSFVHVLRFRPWKPRARLFCFHGSGGSPEGFRSWSEK SEWSDLEIVAMWHDRSLASEDAPGKKYVQEAASLIQHYADAPFALVGFSLGVRFVMGTAV ELASRSGAPAPLAVFTLGGSLISSSEITPEMETDIIAKLFFRNAAGFVRSTQQVQADARA DKVITDTMVAPAPGDSKEPPVKIAVPIVAIAGSDDVIVPPSDVQDLQSRTTERFYMHLLP GDHEFLVDRGREIMHIVDSHLNPLLAARTTSSGPAFEAK*
Seq ID No 87
>Contig57_027 419 amino acids MW=46737 D pl=5.09 numambig=0
MTQEQANQSETKPAFDFKPFAPGYAEDPFPAIERLREATPIFYWDEGRSWVLTRYHDVSA VFRDERFAVSREEWESSAEYSSAIPELSDMKKYGLFGLPPEDHARVRKLVNPSFTSRAID LLRAEIQRTVDQLLDARSGQEEFDWRDYAEGIPMRAISALLKVPAECDEKFRRFGSATA RALGVGLVPQVDEETKTLVASVTEGLALLHDVLDERRRNPLENDVLTMLLQAEADGSRLS TKELVALVGAIIAAGTDTTIYLIAFAVLNLLRSPEALELVKAEPGLMRNALDEVLRFDNI LRIGTVRFARQDLEYCGASIKKGEMVFLLIPSALRDGTVFSRPDVFDVRRDTGASLAYGR GPHVCPGVSLARLEAEIAVGTIFRRFPEMKLKETPVFGYHPAFRNIESLNVILKPSKAG*
Seq ID No 88
>Contig57_043 492 amino acids MW=52617 D pl=11.54 numambig=0
MAARARKSCRARGSRPAPMRTSPPTSTPTPRPRGWRWTSFTSRRPSASRPAASGSSSGSV RAPSPRGNGSECSTTTLPAEEASWPRRPCRMPTSSCSSPSRSRAALSLHLLPPSSGRRPS
GNIALSAALSPPAGPRALPRSPSPSYHPCVLLQRRPRRSNRRAGAALRARQESVLARPAH RWKDRRSMEKESRIAIYGAIAANVAIAAVKFIAAAVTGSSAMLSEGVHSLVDTADGLLLL LGKHRSARPPDAEHPFGHGKELYFWTLIVAIMI FAAGGGVS IYEGILHLLHPRQIEDPTW NYWLGAAAVFEGTSLI I S IHEFKKKDGQGYLAAMRSSKDPTTFTIVLEDSAALAGLTIA FLGVWLGHRLGNPYLDGAASIGIGLVLAAVAVFLASQSRGLLVGESADRELLAAIRALAS ADPGVSAVGRPLTMHFGPHEVLWLRIEFDAALTASGVAEARERIETRIRSERPDVKHIY VEARSLHQRARA*
12. DNA sequence according to any of claims 1 to 5 wherein the DNA is selected from the group consisting of
(a) the following DNA sequences:
Seq ID No 89 (>Contigl0) GGTAGTGAAATATGCTGTATTCAACAGAAAGCTTGATGAATTGATCTAGA AAGTAGAGCGAGAGAATCAAGTAAGATAGTAGGATGCATTATAAATATAG AATATATACTGCATACGATGACAGCATGCGCACGAATAGAATGCATAAGA GGCAAGCCAATAACCAAAAGTGGAGCCAGAGGAGATAGTCTCGCCAGTAG AAATAATGCTCAGCCAAGCGAGGTTGGACATATCAGTTCCAGAGTAGGTC TCAACCCCGTATATGAGTCCAATGAAGCCTGTCTCATCCAGTTAACGGCC TTTTGAGCAGAGAATCCTCCCTATTTTCGGAGAGGACGCGTCGAATATAA AGCAGGTCCAAAGAAGCAAGCAATAGCCAAAAGTTTGAAAGGTTAGTACG AGCAGCGGCTGGAGGACACTATGGTCGTGCAACGGGGGTAAAGGGTTTCA CGTATTGTAGCAGAGCACGTCAGAGGGTTATTCGTGACATTCGAGGCCAA CGAGGCGGTAGGACTTCGTAAGCGCATGACCATCCCGGTCACAAACGTAG TGCGGAGCGCCTCGTCACGCTCAACAAGGCCCTAGAACGCGCGGCGCAGA TCGACCCTTTTAAACGCCGGCACCGAGCCGGACCGTCCTGCCCAGGTTGT AAAGCGCTCCATCGGCCGACTTATGGCACTCGAGCCAAATCGCCCGGTTC CCCATCGGTCAGCGCAAACGGCCCCCCCGGGCGTCGCCACCCGCGGCGAC GAGGGGCCGTCCAGACGGGTGATCTCTCTCGTGAGCTCGCGGAGAGAGCC
TCCTCGCAAGATCGATGTCAGCGGGATCGCGCGCCCCGTCCGCACCTGAA ACGCGTGCTGGAGCTCGACGGCAGCGAGGGAGTCGAGGCCGAACCGCGAT ATCGGCAGCGCGTCGTCGATCTGCCCGGCGTCCAGACGAAGCGCGCGGGC GAGGGTCGAGCGCAGCGCGTCCAGCAGGCTCCGGCCGGAGGGCTCCTCGG TCTCCGGGGGCGCGTCGTCCGGGGGCGAGGCGTCGTCGAGGAGCTCCGGC GCGAACGCGACGTGGCGCTCGCCGAGCGCGTCCTCGAGAAAGGCGCGCCG GCACTCCCTCCGGCGGACCTTCCCGCTCGACGTCTTCGGCAGCGCGCCCG GCGCGATCAGCGCGACGGCGTGCGCGACGAGCTGGTGCTCGGCGGTCACC GCCTCGCGCACGGCCGCCACGATCTCGCGCGGATCCGCGGCCACGCGCGG GTCGACCTCGCACACCACGGCGAGGCGCTCCTCGCCCTCGTGCTCCACGG AGAACGCGGCGCTGCAGCCCGGCCGGACGGCGCGATGGCTGCTCTCGACG GTCTTCTCGATGTCCTGCGGGAAGTGGTTGCGGCCTCGAAGGATGATGAG GTCCTTCGACCTCCCCACCACGAACAGCTCGCCGCCCCGGAGGAAGCCGA GATCTCCCGTGCGCAGGTAGCGCGGCGCCGCGCTGCCAGCGAGCGTGGCC CCGAACGTGGCCTCCGTCTCCTCCGGGCGCCCCCAGTAGCCGACGGCTAC GCTGGGCCCGGACACCCAGATCTCCCCGATCTCCCCCGGCCCGAGCTCGT TCCCCGCGGGATCGACGATCGCGACCGCCCGCGGATCGAGCGCCCGACCG CTGCCGACGAACACGCGCGCGCCC7CCGCCGCCGACGCGACGGCGCGCCC GAGCTCCACCTCCTCGGGGGCGAGGCGCGCCAGCACCGGCGCCTCGGCCC GCGCTCCGCCGCTCACGATGAGCG7GGCCTCGGCGAGCCCGTAGCAGGGA TAGAACGCCTCTCGCCGGAACCCGCTGACCGCGAAGGCGCGCGCGAAGCG ATCGAGCGTGTCGGCGCGCACCGGCTCGGCGCCCGTGAACGCGACCTCCC ACGACCGCAGATCGAGCGCCGCTCGCTCCTCCTCCGAGCTCTTCCGGACG CACAGGTCGTATGCGAAGTTCGGGCCGCCGCTCACCGAGGCGCCGAGCGC CGAGACGGCGCGGAGCCACCGCATCGGCCTCTGCAGGAACGAGAGCGGCG ACATGAGCGCGACGCGGATCCGCCGGTAGAGCGCCTGCAAGATCCCGCCG ATGAGCCCCATGTCGTGATACGGCGGCAGCCAGATCACCCCGACCGGATC CGGGCTCGTCAGGTCGAATCCATGCGCGATGAGCCGCGAGTTGTGCAGCA GATTCCCGTGGGTGAGCATCACCCCCTTGGGCTCGCCGGTCGAGCCGGAG GTGTATTGAAGGAACGCGACCGAC7CCGGCCGGAGCGCCGCGCCCGGCCC
CTCGATCGGGCCCGGCGACGGGCCGTCGGTCGCGATCCACCGGAGCCGCT GCAGCGCGGCGGCCGCGGCGCTGGCCGGCAGGGACGCCACGATGCCGGCG ACGGCCGATGACGTGAGCGCCGCC7CGGCGCGCGCGTCCGCGACGATGGA AGCGACGCGCGGCAGCGTCCGCTCGAGCCGGCCGAGATCCGGCGGATAGG CGGGCACGGTCCGGACTCCAGCG7AAAGACACCCGAAGAACGCGGTGATG TACTCGATCCCCGGCGGATACAGCAGCAGCGCGCGGGCCCCGGGGGCGAC GCCCGATGCCTGCAAGAGGGCCGCGACGGTTCGCGCGCGCTCGTCAATTT CCCGCAGGGTCACCCAGGTCGCCCCGGCCTCGACGTCGCCGGACTCAAGA AAGCAATAGATTGGGCGGGCGGGCTCAGCTTCGGCCCGCTGGCGCAAGAG GTCGATAACGGTGGAAGGGCGGTTCCGTTCGTTCCGTTCCAATGCAAGAA AAGCATCATTCATTGAACAGACCCCTCCGCCGCGGAGATAGCAGCTTGTC CGCTGCGACACAACCGCCGCGCGACGCGCGTGGCACGGCGGGATCCGGGC GTTACTCCACCTGCACTTCCCGTCGCGTCACGCTCGCTCCGCCGCGGGTG TCGTGAACCACCGCCCACAGCGACACGCGCCCTGGCTCCGAGGGCGGCGT CCACGTGGTCCCGTTGCCCCCGCGGGAGGCGCCGGTCGTATCGCTCACCA GGCGGCGCGCCCCGTCGAACTCGCCACCGTCCGTGTAATAGTCGACCCAG ATCGCCTCGCGCGCCGGTGGACCGCCGAGCCCGGCGGCTTCCTCGTCCAC CTCGGCGGCCTTCTCGGGCACGACAGCCTCAATCTCATAGGTCGTGCACT CGTCCTCGGCCGGCTCGGTCCGGCCGCAGCCTTGGGCCTGCTCCTCGGAC CGAACGCACCGCTTCACGACGGGCAAACCGTCCTCGCCCGGCGCGACCTC ATTGCCATCGAGCTTCAGCGTGAAGCCGTCGATGGGCGGGTTCGTGTTCA GCCGCTCCTTCTTGAAGACATAGACCTGCGTGTAGCCCACGACGAAGCTG TCCGGACCGAGCACCGTCCCGTCGTCGCCGACGCACTCCAGCGGAAACCC GGCCGTTTCGGGCGCCGAAGCCACGCGTGTCGTGCCGGCGCACACGGCGA ACAGCACGTAAGCCGACGAGTACACCGTCCCCGTCTCGGTGGGCCTCGCG TCCTTGAGGATCTCCTTGGGCAGCTTCCACCCGAACGAGACCGCATCGGG CTCGCCGCTCTTCTCCGGACCGATCTCCTGCTGCGCGAAGGGGACGGTGC GCTCCCCGTCGCCGCTGCCGCTGCCGCCGTCGCCGCTGCTGCCGCCGTCG CCGCTGGCACCGCCACCGCCGCCGCCGCCAGCGCCACCATTGCCGCCTTC GCCGCCTCCAGCGCCACCACTGCCGCCGTCGCCGCCGCTGCCACCATCGC
CACCACTGCCGCCGCCGCCGCTTCCGCCGCTGCCGGCCGGCACCGCCTCC CGGATTCGCGACGATTCCCACCGCATAGGTGCCCAGCCACTGCGGGATGC ACCCGAGGTGCTCGTCCACCCCGACCGGCGGATTCACGCAGCCGCCCACC CACGTGACCTCGACCTTCCGCGGCGCGCCGCCCTCCGCGCCTTTCGCGTC GGCGTACGTCATCCGGAACGTCACGAGCTCTTCCGCCGCCGCGTACGGCT TGTCCGCCGTCACGGCGAGGACGCGGAGCCCCTTCACCTCGGACGAAGGG GCCATGTCGCTCCCGGCGCAGCAAGGGATGCCCACGGCCAGGGTCGAGAG CGCGGCCAGCAGCGCGCGACGAGCGGGCAGTGCGGTCCGTTTCATCAGAA ATCTCCTCGCAGCCCGAGCGTGGGCAGGAAGGGGAGCCCCGTCACGTACT CCCGCTTCGTGTAGTTGAAGTTGTAGCTGATGCCCTCCGCAGCCATGTAA TTGTAGACGTTCTGGATATCGAGGTAGAGCCCGAGCTGCCACCTCTTGAA TTTCCACGTCTTGTCGGCGCGGATGTCGAGCTGGTGAAACAGCGGCATCC GCTCGCTGTAGTCACCCCCGAGCGGGATCGGCGAATACCTCGCCGAGGAC GCGTGGTAGATCGCGTTCACCCGGTTCGGATTGCACCCCTTCTCCTCCGG ATCGCAGACATAGGGCGTCTGCAGGTTGCCCGACACGAGCCGGAAGCGCG CGCCCAGCTCCCAGCCCCGGCCGAGCCGCAGGCTCCCGAGCACCGTCAGC ACGTGCGTCTGATCGAACTGGGTGAGGTGCTCCTCCTCGTCGGGGCCGTC CTTGCGCACCGACCGCGAGAGGGTGTACGCCGCCCAGCCGAAGAAGCGCT CGTCCGGCTTGTACTTCAACAAGAGCTCGCCGCCGACCGCGTATCCGGTG CCATCGTTGGCATAGTCGTCCTTCTCCGGCGAGAAGACGACCAGCCGATC GAGCTGCTTGTAGAACCCGTCCAGCGTCACCTCGATCTGCGGCGTGATCT CCTGCTCCACGCCGAGGCCGTAATGCACGGCGCGGTTCGACTTGAGCTCC GCATTGCCGAACGGCTCGATGCTCTCCGCGAACTGCGGCGCCTGATAATA AAGGCCCACGCCCCCCTTGGCCGTCGTCCGCGGGAAGCCGCTCCGGATGT CGTAGCGCGCGTTGACCCGCGGGCTCACGTCGAGCGTCTGCGTATCGAGC GCGTAGTCGACCCGCACCCCGGGGACGATCCGCGCCCGCGGCGAGGGGAC GACCTCGAGCTCGGCATACGCCGCGGGCCGCGAGTACGCGCCGTCGAACG ACCGATCCTGGAACGGGTACGTCGAGAACGGCTGGTTCGACGGGTGGCCC GCGGGCTGCTGCGACGGCGCGCGGATGTTGACCGTGGCGACGCCGCCCGA GAGGTCGGTGCCGACGTTCATCGTGAGGTACCGCGCGAACCTGTGCGAGA
GCTCCAGCCGCAGGTCGAGCGAGGTCGAGACGACGTTGAAGGCGAGGGGA GAGATCTCGAAGTCGGCGATGTCCCGGCCGAGCGCCATCGACCACAGCAG CCGATCCCGGCTCCCGATCCGGTTCTCGTAGCTGAGCTGGAAGCGCTGGA AGGCGGTGTGCAGCCCGAAATCGCCCGTCAGCGCCGGCTCGTCCTCCGGC GGCTTGTCCAGGGTGATCTTGAAGGCGTCGTCCGATCCGTAGAAGCTCGC GCGCACGCGCTCGCTCGCGGAGGGGCGGCCCTCGAGGACGAACTGGTAAT CATAGTAGACGGGCGCCTGCGTGACGCTGGAGCCCGCCTCCTTGAGCACG GGCCCGAGCCACGCGTCGACCCAGCTGCGGCGGCCCGCCGCGATGAACGT CCAGTCCTTGAGGAACGGGACGGGGCCCTCGAGGAGCACGCGCCCGTCGA TGAGGTCGAGCTGGACCACGCCGTGGTACTTGCCGTCCTGCTTCGGCGAG CGGAGCCCGACGTCGACGATGCCGCCCATGGCGCGGCCGTACACGGCGCT GAAGTTGCCCGGATAGAAGTCGATCTTCTCGAGCATCTCGGTCGGCACGA CCGAGGAGAGGCCGCCGAAGTGGTAGATGATCGGCACCGGGGTGCGATCG ACGAACGTGAGCGTGTCCTGGGGCGCGGACCCGCGCACGATGAGCAGCCC GAAGCCGCTGCGCGCGACGCCCGGCAGGCTCTGCAGCGACCGCAGCGCGT CGCCGCCGGTGCCGGGGATGCGGTCGATCTCGCGGCGCTCGATCGTCCTC CGCGTCACCTCGCGCGGCGGGCGC7CGCCCTGCACGGTCACCTCGATGCC CGGCGCCTTGCCGTCCTGCGGCGCGGCGAGCGAGATGCGGTAGCGCACCT CGATCGCCTCGCCGGCCGCGATCTCCTCCTCGGCGGCGAACGGCTCGAAC CCCGCGGCGGCGACCTCGACGCGGTACTTGCCGGGGGGGAGATTCTTGAA GCGGAACTTGCCGCCCTGGTCCGTCTTCGCCTCCTCGCGGCCGCCGTCGG GGCGCACGAGGGTGACCGCGATGTCCGGGAGCGGCTCGCCGGTGCCCGCG GACAGGACGGTCCCGACCACCGTC7CGACGTCGGCGGGCGGCGCCGGCGC GGCCGCATCGGCGGGCTTGGGCGTGAGCGTGAACGCGTACCGGTAGAGGA TGCGCGCCGCCGCGGGCGTGCCGTCCGGGCGCCGCGCCGGCGCGAACTCC AGGCCGGGCGCGGCCCGCGAGCGCCGCCTCGTTGAAGCCGTGCCCGCCGG GCGTCGCGACCTCGGCCTTGGTGACGCGCCCGGTCTTGTCGATGTCGAGC TTGAGGATGACGCTGCCCTCGACGCCGGCGCGCTGGGCCTCGATCGGATA CGCGGGCGGGGAGTACTTGATCAGCGTCGGCGGGCTGATGGCGGCGGGCG CCGGCGGGGGCGCGCCCGGCTGAGGGACGACGACCGCGCCGGCGCCGCCG
CGGGGGACCGAGGCGCCGCCCGAGTCGCCCTCGGCGGCAGGAGGAGGCTC GGGCGGGGGCGCGCCGGCGGGCTGCGCGCGCGCTGCGCTGCCGGTCATCG CGACCGCGAGCAGCAGCGCTTCCGAGACGACGAGGCGCATCACGGAGGAC GCTGTGGAAGGCATGCGGCCCGCCCTCTCGCATGGCGAGGCCGAGGCGGA AAGACGCATCGCGCAGCCAGGACCGTGCTTCACATTGCTTCACACAACGG GCGCCGCGCGCGCTCCCGGGCCGCGCGAGCGCAGGCGGCGCGCGCGCCCG CGGGCGGCGCGATCGCGAGCGGCGCGCGGTGCGATCAGCCGCCGACCTCG GCCACGAACCGGCTCACGTCGTCGCTGTCGCCCACGAGCAGCAGCGTGTC GCCGTCGCGGATCACGTAGTCCGGTGTGGGCGCCTCGAGCCGCGGCTTGT CGCCGGGCCGCTTGTTCGTGTGCGGCCGCACACCGAGCACGTTGATGCGG TACCGCTGGCGGATCTTCGAGCCGGCCAGCGTCTGCCCGACCAGCGGCCC GTGGGCGTTCCAGGGGACCACGCGGTAGTGGCTCGCGAGGTCGAGGAGGT CCTGCGCGAGCGGCATGGTGATGTCGGCGCCGACGCGGCGGCCCATCTCG GTCTCGAGCTGGATGACGCGGGTCGCGCCCACCGCGCGCAGGATGTCGGC CTGGCGATCGGTGGCGGCGCGCGCGATGATCTCGCGCACGCCCATCCGGA CGAGGGAGGCCACGCAGAGCACGGACGGCTCGAAGTGCTCGCCGAAGGTC ACGATCGCGGTCTCCACGTACTGCGCGCCGATCCCCTCGAGCACCTTGTG GACGGTGGCGTCGCCGACGAACGCGGCCGAGGTCTTGTCCTTCACGGCGT CGACGGCCTCCGGGTTGTTGTCGACCGCGATCACCTCGGCCCGGTTCTTC CAGAGGGTCTCGACGACCGACGTGCCGAACCGCCCGAGCCCCGATGACGA GGACGCTCTTCGATTTCATGGTC7CCGGTCGCGCGCCGCCCCCTCGGGGC GCGGCGCCGCGAAGGTCTCACGGA7GCGCCGGGAGCGCCACGGTTCGGCG CTGCCGTCGTCGCGACGGCGCGGCCCGCGCGGGCCGCGCCGCGCGGCGCT CAGTAGAGCTCGTCCTGGTGCTGCCACCGCTCCGAGATCCACGCCTTCAG GTACGCGATCTCCTCCTCGTACG7CGTGAAGTCGTCGCGCCAGCTCCAGC CCTCGTAGCTCCGGTACGCCTCGCCCCACCGCGCCTCGTCCCGGCGCGCG CTCGCGTCGATGCGCTCCACGTAGCCGTCCACGATCGCGTGGATCTCGGC CTCGGCGAGCGCGCCGCGCAGGACCTGATCGTAGCGGGCGCGCAGCGGGT CGCCGATCGACGGCTCCTCGAGGAGGCGCTCGAAGAGGAGGTTCACGTCG CGGTAGTCGACGCGATCCGACGCCGGCTCGCGCTCGGTCTCCCACGACTG
GCCGAAGCTCGCGTTGAAGTCCCACGGCGCGTAGCGGAATACGCCGTCCG CGGCCGGATCGCGGTAGTGGTAGC7GTTCTTTCCGGCCGAGTCGTTGGCC ACGATGAACGTGACGAAGATCCACCAGTCCTCGTAGTCGCGCAGATCGAT CCGCGACCCGATCTCGGCGGCGAACGTGGCGTCGTCGGACTCGGCCACGA AGCTCACGAGATCTTCCAGATCCGAGAACGCCTCCGGCTCGCCCTCGGCC GGCGCCCCTTCCTTCTTCTCGAAGCCGTCGTGCAGCGTGTCCTTGGGGTC GCCGGACCGGTCGGTCAGCGCGAAGTTCGCGTCGTGGCTGACCGCCTTGT AGAGGTTGCCGTCCTGCGGGTAGCCGTGGTCCTCCATCAGGTAGCCGTCG ACGTGATCCGCGACGGTGTAGAGCCCCGCGTACTCCCCGTCGAGGTACAG GACGGCGCTGTAGGTCTTGATCTGGATGTGCTCGGGATCGAGGCGGTTCC AGAGGTCATAGGCGAGGCGCTGCCGGACATAGGAGTTGTCGTCGAACGTC GTGATGAGCACGACCTTGCGGCGATCGGTGAAGCCGCCCGCCTCGTCGGG CTCGTTGAACTTGTCGTCCTTGGGGAACTTGAGGGTGTAGCTCCGCTTCG GGTACGAGAGCGAGCTCTCGCCGCGGAGCTCCGCCTCCGCGGCGTACGTG TGGCCGCGGTAGATCACCGTGGCCGGGGCGTACTCCTTGTCCTCGGGGAC GGGCGAGAGGAAGAGCACCGGCAGGCCGTACTCCTCGGGGTAGCGGGTCG GATCGACGACGGGCACGTTCGACGGATCGGCGAAGGCGTCGGCGACGCCG ACCTTGACGCGCCCGACCTCGGACGTCTGCGCGACGCGGATCTCGATGTC GTAGACGGCGGCCTGATCGAGCCCGGGCGAGAACGTCACCTCGCGCGCGA TCGGGTCGTACGCGGCGCCCTCGGGGAGCGGGCCGACCTCGAACGCGTCG CCGGCGAGCGCGAGGCCGCTCGCGCACGTCACCGGGAACGTCACGGTCTC CCCCTCGAGGAGCCAGTGCGGGCCGCCGCCCGACGGCTGGCAGCGCGAGC CCTCGGCGCTGGAGCCCCCGCCGCTGGAGCCGGAC
Seq ID No 90 (>Contigll)
GGCGACCCCACATATCACATAGTAGAATCAGTGTGAGTTAGACAAATGTC GAGTGATGAGAAGGACAGAAGTGAGAACTCTGTCGATCACTGTAGAACGA GAGAGTATGAGCCTGCATACATGATAGCGGACATGAGAACGAGTGTANTA TGATGCTACTAAGAGAGTAACAGATCAGAGACTAGAGTAGAGCAATAGAA NTCAGAGATAAGTCAATGACGAGGAGTAGTGATAGAGCTCTTAATAATGG
CTGAGGTCGAAGATAGAAGTGCATAGAGCGATAGATATACAATCGGTTGA AGCAGAGAGTAAGATAAGATCAGACACNGAGTACAGAGAGAGACGAATAG A7GGCGTGATNTCACAGAGAGGTGCGAGCGTAGCTGACGAGAGCAGAGAC GCAGAGTAAGTCACACCTAGATAGTTACGGCGAGAGACAAATGATAGGAA GGAGTGGACGAGATCAACAGNCCGGAGCACAAGAACGTGAGATGCGACCG TGTAATAAACAGGAGACAAGAGCGACTACATAAGAGAGCGAAGCGAATAG ATAAGATATAAGCCCAGAGCAAAATAGAAGGAGAGAGAGAGTATTTGTAA TAAAGCAACAAGACGGAGAGAGCGAAGCAGCAGGCAACGATTAGAAGAAA GACGACAGGAAAGTGAAAGCGAAAGAGAGCAGGTAGAAAGAGAACCAAAA AAGCACGAAGGAAAAGGAAGCTTCTATGATAGGTGCGGGACAAGGCGTAG CTACAGGAGACAGCCGGCATACGAGGAGCCGGTAAAAGCTAGCCTTTCAG AACACATCGGGAGCGCGTAAAGGCGGACCACGCTCGACGGGATCATGTAC GCCGACAGCGACGCCTTCAGCCCCGCGCGCACGTCCGGCGCGTCGCCCGC CGCGTCGCCGTCGAGCACGACGTAGGCGACCAGGCGGGCGTCGCCCGGCG CGTCCTCGCGCAGGACCACGGCCGCCTGGCCCACGCCGGGCACGCGCCGG ATCTGCGCCTCGACGTCGCCGAGCTCGATCCGGTGCCCCCGGAGCTTGAT CTGGTGGTCCGAGCGGCCCTGGAACTCGAGCATCCCGTCGGGCAAGAAGC GCGCGACGTCGCCGGTCCGGTACATCCGCCCGCCCGCGGCGCGCGCGCAC GGGTCGGGCAGGAAGCGCTCCGCGGTGAGCCCGGGCTGCCCCACGTAGCC GCGCGCGAGCGGCGCGCCCGCGATGTAAAGATCGCCGAGCGCGCCGATGG CGGGGCGGCGCATCGCGCCGTCGAGCACGAACACCTCGGCGTTCGCGACC GGCGCGCCGAGGGGGACCCACGTGACCCGCGGGTCGCTCGGCAGGACGCA GCCGGTCACCGCGATCGCGGCCTCGCTCGGCCCGTACATGTTGATGAGGT CGCCGTCGTGCTTCGCGTAGAAGCGCCGGACGAGATCGAGCGGCACCGCC TCGCCGCCCACGAGGACCTTCCGCAGGCTCGCGGGGAACGGCTGCTCGGG CCCCCCGAGGAACGCCGCGAGCATCGAGGAGACGAAGTACGCGGTCGTCG CCCCCTCGTCGCGCACGAGGCGCCGAAGGTACTCGGGATCGCGGTGCCCG CCGGCCCGGGCGACGACGATCCGCGCGCCGAACGAGAGGGGCCAGAAGAT CTCCCAGACGGAGACGTCGAAGCCGAACGCGGCCTTGAGCAGGACCCGGT CGTCCGCGGTGAGCGCCCAGTACCGCTGGATCCACTGCATCTGGTTGACG
ATGGCGCGGTGGGAGATGAGGCTCCCCTTCGGCGTGCCCGTCGATCCGGA CGTGTAGATGACGTACGCGCCGCTGTCCGGCGGCGGGCTCACGGCGGGCC GCGCGTCGGAGCACGCGGCGATCTCGGCGGCCTCGGCGTCGAGGAGCAGC G7CGTCCAGCCGCCGGTCGGGAGCTCGTCGGCGATCGCGTCGTGCGTGAC GAGGAGGCGCGCCCGCGCGTCCCGCATCATGAAGGCGAGGCGCTCGCCGG GGTACTCGTGGTCGAGCGGCAGGTAGGCGCCGCCCACCTTGAGCACCGCC AGGGTCGCGACGACCATGTCCTCGGAGCGCGGCACGCAGACCCCGACGAT CGTGTCGAGCCCGACGCCGCGGCGGCGCAGGCAGCTCGCGAGCCGGTTCG CGCGCCGCTCGAGCTCGCCGTACGTGAGCGACTTGCCCTCGCTGCGCACC GCGACGACGTCGGGGTGCTGCTCGGCGCGCTCCTCGAACCACCGGTGCAG CGCGCAGGCCGACGGCAGCTCCATCGCGGGGCCGCGCGACCACGCCTCGA TCTCGGCGCGCTCGCCGGGGCCGACGTACTCGCCCTGGGCGACCGGACGC TCGGGGTGGCGCGACAGGTCCTCGAGCAGCGCCGCGAGGCGCTCGGCGAG GCGCTCGGCGTCGCGGCGCGCGCCGGCCGACGCGTCGTAGCGGAGCTCCA GCGACGCCGACGGGCCCGCGCCGGCGCAGTGCAGCCGGCAGGCGACCTGG TCAGACGTGCTCCAGACGTCGAGCACGCGGGCCCGCGCGCCGTCGAGCGA CAGCGCCGGCGCGGCCTCCGCGGCGGAGAAGCCCCCAGCTCATCCGGTGG CTCACCCCGGGCGCGGCGTCCTGGTGCGCGGCCGCCTCGGCCTCGGCGAG CGCGAGCCGCCGCGCGACGTCGGCGAGCGTGTCCGAGGCCGAGATCTCGA TCCGCACCGGCAGGAACCGCGCGAACGGCCCCACCGCGCCCGCGAGCGCG TCCAGCGACCGCCCGTCGAAGCGGACGGCCACGGTGACCTCGGGCTCGTT GCCGCCGCTCATCCGCCACAGGAGCGACGCCCACAGGGCCAGGAGCACGA TCCGCTGCGGGACCTGCCACGACGACGACCAGCGCTCGACCTGCGCCATC CCGCCTTGTCCCAGATCGACCCGCGCGCGCCCCGAGCCGGCGCCGGCGCC GGCGCCGCCGCGGCTGAAGGCGAGG7GGAGCGGGGGCCCGAAA7GCGAGC GGCGCTCGGCCCAGAACCTGCGCCCGTCGCCGGCGTCCTCCGACTCGAGC ATCCCGTTGAGCCACTCGGCGACGTCCGCGTACTGCTGCTCGGGCGGCGC GCCCGCGCCCGCGGTCGACGCGCAGAGCTCGCGGACGAGCGGGGCGATCG ACTCCTCGTCGACGCACCACGCGGGCGCCGCGAGCACGAGCCGGCGCTCC TCCGGGCCGACGCGGACCAGGCCGACGCGCAGCCCGTCGTCCGCGCCGCG
GTCCTCCGAGAGGCGCGCGACGAGCCGCGACATCCGCTCGCCCTGCTCGG CTTCGGAGCACCCGACCCAGTCG7CCTGCTGACGCCCACGCGAAGCGCGG CTCGCCGACCACCTGCGCGGCCTCGCCCGCCCCGCCTCGACGAGGCGCGT GCGCAAGATCTCGTGCCGCTCGGCCAGCGCGAGCGCCGCCGCCGAGAGCC GTCCCTCGTCGCACGGGCCGGTCACGGCGACGACGGCCAGCGTCCGGCAC CCGGGCGCCCCCGCCTCCCGGTCGAGCGCGCGGATCGCCCGCTGCTGCGG CGAGAGGCTGAAGCCGGTCATCTCGTGGTCACTCATCCAGGTCGTCCTTC GGTGAGGTCTTCGCTTCGCCCGGCGCGCCCGGGCGCGGGAGGGTCACGGC GCGCCGGCGCGCGGAGGTCAGCTTGTCGAGCGCGGCGCCGCGCGCGGCCT TCCGCTCGAGCTCCCGGCGGGCCGCCGCGGCGCGCTCGAGCTCCCCCCGG AGCTCCGAGACCGGGGTGTCCGGCCGCGCCGTCGCAGTGGCCAGGATCTG CCGGTAATCGCTAAGGAAATTGTCGACCGTCGCCGCCCGGTACAGCTCGC TGCTGTGCTCGACGCCGAAGCGGAACGAGCCGCCGGCCTCGGCGACCGTG AGGACGAAGTCGAACGCCGTCGTGGTCGCCTCGCCCTCCAGCGCCTCGAG CTCGAGCCCCTCGAGCTTCATCGGGGGGACGTGCACGTTGCGCATGACGA ACTTCGCGTCGAAGAGGGGCACGTGCCCGACGGCCCCCTTCGGCCGCAGG GCCTCGACGAGCCGGTCGAACGGCAGGTCCTGGTGCTCGAACGCCTCGAG CGCGACGTCGCGCACGCGGCGGACCAGCGCGCCGAACGTCGGGTCGCCCC CGCAGTCGGTCCGGAGCACGAGCTGGTTGACGAAGAAGCCGATCATCGGC TCGGTCTCGACGCGGTTCCGGTTCGCGACGTCGGTGCCCACGACGAGGTC CTCGAGCCCGGTGCGCTGGTGCAGGACGAGCTTGTACGCGGCGAGCAGGG CCATGAAGGGGGAGATCGCCTCCCGCTCGCAGAACGCCTTGATCTGGCGG GTGAGCTCGGCCCCGGCGTCGAGGCTCCGCCGCGCCCCGCGCCACGTCCT TCGCCCCGCCGGCTCGTGGTCGACCGGCACGCGGGCCCGGCGCAGCGCGC CCGAGAGCTTCGTCGTCCAGTACCGGAGCTCGCCCTCCAGGACCTCGCCG GACAGCCACGCCCGCTGGGCTGCGGCGAAGTCGACGTACTGCGCCGGGAG CTCCGGCAGCCGGGAAGGCTGGCCCTGCGCGAAGCCGCCGTAGAGCGCGG CGAGCTCGCCGACGAAGACGCCGACCGACCAGACGTCGAACACGACGTGG TGCACGACGAGCGCGATGACGTGC7CGTCGTGGCGCTTCCGGATGACCCG CACGCGGAGGAGCGGCCCGCGGCTCAGGTCGAACGGCGCGAGGCTCTCCT
CGAGGACGAGCGCCGAGACCGCCGCGTCGAGGGCCTCGCCCGCGAGGTGC TCGAGGTCGGACATCCGGAACGGCACCCGGGCCTCGGGCGCGACGACCGG GAACGGCACGCCGTCCCTGGCGC7GAACGTCGTCCGCAGCGCCTCGTGGC GCCGCGCGATCTCGAACAGGCTGCGGCGGAGCGCGTCGACGTCGAGCCGG CCCGTCGCGCGCACCACGAACGGGATGTTGTACGCCGGGCTGCCCGGCTC GAGCTGATCGACGAACCACAGCCGGTGCTGCGCGAACGACAGCGGGAGCG GGCCGTCGCGGGGGATCCGCGCGATCGGGGGGAACTCGCGCCGCCGCGCC TCGCCGCGGCGCGCGGCGTCGACCTGGGCCGCGAGCGCCGCGACCGTCGG CCCCTGGAAGAGCGCCCGCAGCGGGAGCTCGACGCCGAGCTGCGCGCGGA TCCGGGACATCACCTGGGTCGCGACGAGCGAGTCGCCGTGCAGGCCGAAG AAGTCGTCGTGGACGCCGATCTCGTGGACGCCGAGGAGGGCGCTCCCAGA TCGCGGGCGATCGCGCGCTCGGACTCGGTCGACGGCGCGGCGAACGCGGC CCCGGCGTGCGCGCGGGAGACCGCCGACGTCGGGAGGGCGTTCGGCGCGG GCGCGAGGGGCGCGTCGGCCGGCGCCGGCTCGGCGGCGGGGGCGCCGCGG CGCGGCGCGATCCAGTGCCGCGCCCGCTCGAACGGGTACGTCGGCAAGCG GACGAGCGCGCCGGGGGACGCCCCCGCGGACGGGCCGTCCAGTCGACGGC GTGGCCCGCCTCCCAGAGCTGGCCGAGGGCCTCGGCCAGGCTCGCGGGCT CGGACGCGGCGTGGGTCGACCCGAGGCTCGCGATCGCGGCGCCGCCGCGC CCGGCCAGCGTCTGCCGCACCAGCGTGGTCAGCCCGCGGCCGGGGCCGAC CTCGAGGAACAGGGCGTGCCCGGACGCGAAGAGCGCCTCGACGCCGTCGC TGAAGCGGACCGGCTGGCGGAGGTGCCGCGCCCAGTAGGCCGGATCGGTC GCCTCGGCGTCGGTGAGGAGGGCGCCGGTGACGTTCGAGACCACGGGGAT CTCCGGCGGGGAGAGCCGCGCGCGCCGCACGCTCTCGAGGAACGGGGCCA CCGCGCCGTCGATGAGCGCGCAGTGGAACGCGTGGGACGTCTGCAGCGGC CGGGCGAACACCTCGCGCGCCTCGAGGCGCGCGGCGAGATCGCGGATCGC GCTCGCCGGGCCCGCGACAACCGTGAGCTTCGGGCTGTTGACCGCGGCGA TCTCCAGGCCGGCCTCGAGGAGGCCCTCGACGTCCGCGGCCGGCAGGCCG ACGGCCAGCATGCTCCCGGCCGGCGCCGCCTGCATGAAGCGCCCCCGATC GATGACCAGGGACATCGCGTCCTCGAGCGTGAACACGCCCGCGACGCAGG CCGCCACGAGCTCGCCGAGGCTGTGGCCGATCATCGCCGCGGGCTCGATC
CCCCAGCTCATCCAGAGCCTGGCGAGCGCGAGCTCGACGGCGAAGAGCGC GGGCTGCGCCAGCGCGGTGCCGAGCAGCGTGCGCCCGTCGCCCTCGCCCT CGCGGAAGACGACCTCGCCGAGATCGAGGCCGCGCGCCCGCGCCGCCGCC GCGCACGCGTTGAAGGCGCTCCGGAACGCCGCCTCCTGCGCGTAGAGCGC GCGGGCCATCCCGACGGCCTGCGCGCCCTGGCCCGGGAACGCGAAGACGG GCGCGGCTCATCGGGGCGCGCGAGCGCGCTCGCCCCCTCGCGGGCGAGCC CCTGGATCGCCTCGGCGCGCGTCCGGGCGACGACCGCCCGGCGGTACGGG TGCTCCGCGCGCCCGGTCTGGAGGGTGAACGCGACGTCGTCGAGCGGGAC GTCGGTCGCCTCGAGGTGCGCGGCGAGCTGCGCGCAGGCCGTCGACAGCG CCTCCGGCGTGCGCGCCGAGAGCGTCAGCACGTGATCGCGCTCCGGGGCC GGGGCGCGGGGCGGCAGCGGGGGCGGCTCCTCGAGCACGACGTGCGCGTT CGTCCCGCCGATCCCGAACGAGCTCACGCCCGCGCGGCGCGGGCGGAGCT CGCGGGGCCAGGGCGCCGCCTCCCGCGGGACGAAGAACGGGCTCGCCGCG AGGTCGAGCTTGGGGTTCGGCGCCTCGAAATGGACGCAGGGCGGGATCTC GCCGCTCCGCACGACGTGCGCCGCCTTGATGAGGCCCGCGACGCCCGCCG CGGCGTCGAGGTGGCCGATGTTCGCCTTGATCGAGCCGAGCGCGCAGTAC GCCTTCCTCGGGGTCTTGCGGCGGAAGGCCTGCGTGAGCGCCTCGACCTC GATCGGATCGCCGATCGCGGTCGCGGTGCCGTGGGCCTCGACGTAGCCGA TCGAGCCGGGATCGACGCCGGCGACCGACTGCGCCTCGGAGATCGCCGCC GCCTGGCCGTCGACGCTGGGCGCCATGAAGCCGACCTTGCGCCCGCCGTC GTTGTTGACGGCGGAGCCCCGGATCACCGCGTGGACCGTGTTTCGGTCGC GGAGGGCGTCCGCGAGGCGCTTCAGCGCGACGATGCCGACGCCGCTGCCG CCCACGGTCCCCTCGGCGCGCGCGTCGAACGGCCGGCAGCGGCCGTCGGG GGAGCAGATGCTGCCGGGCACGTACGGATACCCGCGCTTCTGCGGGATGC CGATGGAGACGCCGCCCGCCAGCGCGAGATCGCACTGGCCGCCGAGGAGG CTCTCGCACGCCATGTGGACGGCCACCAGCGACGTCGAGCACGCGGTCTG CACGACCACGCTCGGCCCGTGGAGGTCGAGTTTGTACGAGACCCGCGTCG CGAGGTAATCCTTCTCGCTCGCCAGCATGAGCGCGTGCGGATCGACGGTG GCCGCGAGATCCGGGTGCGAGAGGAGCTGGAGGAGGTACGTGTTGGAGCC GCACCCCCCGAAGACGCCGATCGCGCCCGGGAACCGGGCCGGA7CGCAGC
CGGCGTCCTCCAGGGCGGCGACCGCGCACTCCAGGAAGAGGCGCTGCTGC GGGTCCATGAGCTGCGCCTCGCGCGGCGAGTACCCGAAATAGGACGCGTC GAAGCGGTCGATGTCGTCGAGCAGGCCGCCCGCGCAGACGACGGGCGCCC CGGGGGCCGCGCTCGCGCCGACCGGCGGCTCCTCGCGCTCGCTCTCCGGG AAGCGCGCGATCGACTCGACGCCGCGCCGCACGTTCTCCCAGAGGGCGTC GACGCTCGGGGCGCCGGGGAAGCGGCCCGCCATGCCGACGATCGCGATGT CGCTCCCCCCGTCCTCGGTCTCGATCGGCTCTGACATGGCTATCCTCGCC CCCGGCGGCGTCGCGCGTCGCGGCGCGCCTCGGCGCGCTGCGCCCCGACG TCGGCCGGCTCGGCCTTGACCGTCGCCGCGTCGAGCCGCTGCGCCAGTTG CTCGATGGTCGGGTACTGGAACAGGTCGGTCAGCGACACGGCCTGCGCCG CGGCGCCCTCGTCGGGCGCGCGCGCCGCGATGCGCTCGGCGAGCAGGCGC TGCGCGCGCACGAGGAGCAGCGAGGTGAAGCCGAGCTCGAAGAGGTTGTC GGTCACGCCGACGGCCTCGACCTGCAAGACCTCCGCGAGCACCGAGGCGA TGAGCCGCTCGGTCGCGGTCCGCGGGGCGACGGCCGCGGCGCGCGGCGCG ACCGCGGCGGGATCCGGCAGGGCGGCGCGGTCCACCTTGCCGTTCGCGCT CAGCGGCAGCGCCGGGAGGACGACGACCTCCGCGGGGATCATGTACTCCG GCAGCTTCTTCCGGACGAAGTCGCGGAGCGCGGCGCCATCGCCGTCGGCG CCGACGACGTACGCGACCAGGCGCTTCTCGCCCGACGGATCGGTCTTCGC CGCCACGACCGCCTGCTCGACCGAGGGGTGCTGCGCGAGGGCGGCCTCGA TCTCGCCGAGCTCGATGCGGAAGCCGCGGATCTTCACCTGATGGTCGGTG CGCCCGAGCAGCTCGATGGTCCCGTCGGCGAAGTAGCGGCCCAGGTCGCC TGTCCTGTACAGCCGCTCGCCGGTCGTGGGGTGCTTCAGGAACCGCTCCC GGGTCCGCGCCTCGTCGCGCCAGTATCCGAGCGCGACGCCGATCCCGCCG ATGTGGATCTCGCCGGGGACCCCGATCGGACACGGCTCCAGCCCCTCGTC GAGCACGTAGGTGTGCTGGTTCGCGAGCGGGCGGCCGTAGGGGATGCTGC GCCACGCCGGGTCGACGTCCGCGA7CGGGTGGGCGATCGACCAGATCGAC GCCTCGGTCGCGCCGCCGAGGCTCACGACGCGGGGCGCGCGGCAGGCCGC GCGGATGCGATCGGGGAGCTTCAGCGGGATCCAGTCGCCGCTCATCATGA CGAGGCGGAGCGACGACAGCGCCGGGTCGCCCGCGCCGGGGGACGCGTCC ATGAGCATCTCCATCAGCGCCGGGACCGAGTTCCACACGGTCACCCGCTC
GCGCTCCACGAGCTCGCGCCAGTGCCCCGGATCCGAGGCGCGGGTACGGT CGGGGATCACGACGGCGCCTCCGGCGGCGAGCGTCCCGAACACGTCGTAG ACCGACAGGTCGAAGCTCAGCGACGAGAGCGCGAGCACCCGGTCCTCCGG GCCGACGTCGAAGCGGCGGTTGA7GTCGAGGACCGTGTTCACCGCGCCGC GGTGGTCGATCATCACGCCCTTG3GCAGCCCCGTGGACCCGGACGTGTAG ATCACGTAGGCCAGGTCGTCCGTGCTTCCGCCGGGCGGCCGGCGCGCGAC GGGCTGCTCGCGCCACCGCTCGTCCGCGTCGACGGCGAGGCGCTCGATGC CCGCGGGCCAGGCGATCGTCCCGTCGACCGCCGACTGCGTGAGGACGAGG CGGACCTCGGCGTGCTCCAGGAGGTGCCTGAGGCGCTCCTCGGGGAGGCG AGGGTCCAGGGGCAGGTAGGCGGCGCCGGCGCGCAGCACGCCGAGCACGG CGGCCACCTGCTCCCAGCCCTTC7CCATGACCACGGCGACGAGCGCGTTC GCGGTCGCTCCGGAGCGCGAGGCCGCCGCGGCGATCGCCTCGGCGCGCCG GGCGAGCTCCCCGTAGGTGAGGCGCCGCTCGGCGTCGACGACCGCGCACG CGTCGGGCTGCTCGACGGCGCGCTCAAAGAACGGCTCCTCCAGCCGGAGG TGATCCGGGGTTGCGACCGCGGTGTCGTTCCACGCGACGAGGGCGCGCTC GCGGTCTTCCGGCGCGACGGAGAGCGCGCGGACGCGCTGCGCGGGGTCCT GAGTGGCGCGCGAGAGCACGCTCTGCATCGTGGCGAGCATCCGGTCGATG GTCGCCGCGTCGAAGAGGTCGACGTTGTACTGGAGCGAGATCACGTCGCG CCCGCCGCGCGGCTCGACGCTGAAGCGCAGGTCGAAGCGCGTGGCCTCGA CCGGGAGATCGAGCGGCTCGATCCGCACCTCGCCGAGCTCGAGCGCCTCG GTTGGGGCGTTCTGCACGACGAGCATGACCTGGAACAGCGGCGAGCGGCT CAGGTCGCGGCGGGGGTTGACCGCCTCGACCACCTTCTCGAACGGGGCGT CCTGGTGCTCGAACGCCTCGAGCGCGACCTTCCGCGCCCGCGAGAGGAGC TCCTCGAAGGTCGGGTCGCCGCCGAGGTCGAGGCGCATGACGATCGTGTT CACGAAGAAGCCGACGAGGGGC7CGAGCTCGGGGCGAGGCCGGTTGGCGA CCGCGGTCCCGATGGCGAGGTCGTCCTGGCCCGAGCTGCGCCGGAGGAGC ACGCCGAGGGCGGCGAGCAGGACCATGAAGCGGGTGGCGCCGCGGCTCCG GGCGAGCTCGTCGAGCTGCGCCACGAGGCGCGCGTCGAGCGGGAGGACCC GCTCCGCGCCGCGGAACGTCTGGACGGGCGGCCGCGGTCGATCGGTCTGG AGCTCCAGGACCGGCAGCCCGCGGAGGGTCGCTGTCCAGTGAGCGAGCTT
G7CGGCGAGCCGCTTCCCCGCGAGGTGGCGGCGCTGCCACACCGCGAAAT CGACGTACTGGAGCGGCAGCTCGGGCATGTCCGCGGGCCCGCCGCCCCGC GCGCGCCGGTAGAGCTCCGCGAGATCGCGGACGAGGGGTTGGAAGGACCA GGCGTCCGTGACGATGTGGTGCGTGGACAGGACCAGGACGCAGACGTCGT GGTCGAGGCGGAACAGCCTGGCGCGGAACACGGGCCCGCGCGCGAGGTCG AACCCCGTGGCCTGCTCGCGCGACGCCCAGGCGCGCGCCGCGGCCTCCGC CTCGTCCGGGGGCGTGCCGCGGAGGTCGACCACCTCCGCGGGGGCCGCCT CGGGCTCGCAGATCTTCTGCGCCGGCGTGGGGCTCGCGACGAACACCGTG CGCAGGCTCCAGTGCCGCCGGACGAGCGCGGCGAGCGCGGAGGACAGCGC GTCGACGTCGACGAGGTTCCGCAGGCGGACCGCCTGCACCACGTTGTAGG CGGTCCCGCCGGGGAGCAGCTGCTCGAGGACCCACAGGCGCTCTTGCTCG TACGAGAGCGGATACGGCTCGTCCGCCGGCGCGCGGCCCAGCGAGGGCGC GATCTCGCTGGCGGGCACCGTCGCTGCGGCGGCGGTGGTCGAGGCGGCGC CGGAGGAGAGGCGATCGGCGAGCTGGTGGAGGGTTGGGTGCTCGAAGAGC GTGCGGAGGGTGGTGCGGATGCCGAGGGAGGACTCGATGCGTCCGAGGAC CTGCATGGCGAGCAGGGAGTGGCCGCCGAGGTCGAAGAAGCTGTCGTGTC GTCCGACGCGGTCGAGGTGGAGGACGGATTGCCAGATGTGCGCGAGCTCC CGCTCGAGCTCGCCCGAAGGGGGCTCGTAGTCGGCGTGCGCGGCGGGTGG CGCAGGGAGGAGCTTCTTGTCGACCTTGCCCGAGAGGGACATGGGCAAGG CGGGGAGCAGGACGAAGTGGGCGGGCACCAAGGCGTCGGGCACCAGGCGG GCCATGCCCTCGCGCAGGTCGCGCTCGGAGGGCGGGTCGGCGCCCGGCAC GACATAGGCAATCAGGCGCGCGGCGCTGCCTTGGCCGTGGAGGACGACGA CGCCCTCGCGGACGGCGGGCAAGCGTCGCAGGGCGGATTCGACCTCGCCG AGCTCGACACGGCGACCGCGGAGCTTGACCTGCTCGTCGCGGCGTCCGGC GAAGGCGAGCTGTCCGTCGGGGCGCCAGCGCACCAGGTCGCCGGTGCGGT AGAGGCGTGCGCCGGGCTGGCCGAAGGGATCGGGCAGGAAGCGCTCTGCG GTCAGGTCCGTGCGTGTGTAGCCCTGGGCGAGGCACGCTCCGCCGATGTA CAGCTCGCCGAGGACGCCGGGCGGGACGGGCTGCATGTGCGGGTCGAGGA CGTAGACGAGGGCGCTGTCGATGGGTCGGCCGAGCGGGGGCTCGTCGCCG AGGTCGGCGACCTCGGCGACGGTGGTGATGACGGTGGCCTCGGTGGGGCC
GTACATGTTGAAGAGGCGGAAAGGGAGCGGTCGCCGGAGCGGATGGAGCT TGTCGCCGCCGACGGTCATCGCGCGCAGGGCGATGCCGGTCCAGTCTTGC TCGAAGCACGCCTCGGCCAGGGGCGTGGGCATGAACGAGAGCGTGGCCCG CTGAGCGACAAGCCAGGAGACGAGCGCTGTGGGAGAGCGGAGCGCGTCGT CGTCGGCGAGGAGGAGTGCAGCGCCGCAGGCGAGCGGCGTCCAGATCTCG TAGACGGAGGCGTCGAAGCCGCTGGAGGCCAGCTGAGTCCAGCGATCGCG GGGTGAGAGCGCGAGCAGGTGCTGGAAGAAGGAGACGAGCCTTGAAAGGC TCGCATGGCGCACACAGACGCCCTTCGGCGTGCCGGAAGAGCCGGAGGTG AAGAGGACATAGGCCAGGTCGTCGGGCCTGGAGACGAGAGGAATGTGGGT GCTGGGCGCGCACGCCCCGTCCTGGACGAGGTGGACGGGGCAGGGGGCGG CGGTGAGCTTGTGGCTGGCCTGGCTGCTGGTGAGCACGAGCGCGGCGCGG CAGTCGGCGAGCATCTCGGCCAGGCGCGCCGGGGGGTTGGCGGGGTCGAG CGAGGCATAGGCGGCGCCTGCCTTGAGGACGGCGAGCTGGGCGGCGACCA TGCGGGGCGAGCGCTCGATGCAGACGCCGACGACGCTGCCGGGGCCGACG CCGCGGTCGCGCAGCCACAGGGCGAGCTCGGTGGACCAGGTGCTGAGCTC TGCGTAGGTGAAGCGCTGGTGTCCGAACTCGAGCGCCGTGGCGTCCGGCT GTCGAGCGGCGTGGGCCTCGAAGAGCGCATGGACGCAGGCGGGGGCCGGG GCGGAGGCGGCCTGTCGTGCGGCGGCAGCGCCGCTCCAGTCGTCGAGGAG CAATGCGCGCTCGGCGTCGGAGAGCATCCGGAGCTCGGAGAGCGGTCGAC CGGGGTGCTCGACGGCGCTTTCGAGCAGGAGCACGAAGTGGCGCGCCATC CGCTCGATGGTGGCGGGGTCGAAGAGCTGCTGGTCGTACTCGAAGCGCAG GGCGATGCCGGAGTCGAGCTCTGCGGCGAACAAGGCGAGATCGAACTCGG CCGCTGCCTGCTCGTCGGCGAGCGTGGTGAGCTCGAGCTCTCCCTGCGCG ATCCGCACGTCCTCCGCGCCGGTGGTGAGCGCGGCGAGGCGGGGATCCAG CGACGGCAGAGCGCCCTGGAAGGCGAAGGCGACGTCGAAGAGCGCGCCGC CTCGCCGGGCCGCGCCCCGGGGCTCTGCGAGCAGGTGCTGGAGGGCGCTG TCGCCGTGGGCCAGCCCGTCGAGGAACGCGTCGCGCACGCGGGCGACGAG CGCGTCGAAGGACGCGGCCCCGCGCAGCGCCACGCGCACGGGGAGCATCT GGACGAAATAGCCGAAAGCCCGAG7GCTCTCGTCGTCGTTCCGCCCCGCC GAGGGGACGCCCACGACAAGGTCG7TCTGCCCGCTCGCGCGATGGAGCAA
GACGGTGAGCGCCGACAGCAGGACCGAGAAGAGCGTGGTCCCGCGCTCGC GCGCGAGGCGCGCCAGCGCTCCGGTCAGGGGCTTTGGCAGCGTGATCGCG TGAGCGCGACCGCGGCGAGGGCTCGCGTCGTGGCGGGCCCGGTCGCGGGG AAGGTCGATGGCGGTCGTCGCGCCGTCGAGCGCCTTGCGCCAGTATTCTG CTCCGCCGGCCGCCTCCCGCGGCGAGGGACAGCTCACGCCGGCGGCGAAG AAGCTCGACGGCGGCGGCAGCTGCGGGGGCCGGCCCGCGCGCAGCGCCGA GTACAGCTCCCCCAGCTCGCGAACGAGCAGCGCGAACGACCAGTAGTCGA CCACGAGGTGGTGAACGACCACCGTGAGCAGCGGCGGCTGCCCCTCTCCG CGCCGCCAGACATGCACCCGGAGCAGCGGTCCGCGCTCCAGGTCGAACGC GCGGCGGCGCACCTCGTCCGCGCGGGCGACGATCTCGCGCTCGTCCAGCG CCATCGCCGGCTCTTCGGCCCATTCCAGGGCGACATGGCGGTGGACCTGC TGCAGCGGATGGCCGTCGCGCGTGAGGAACGTCGTGCGGAGCGCCTCGTG CCGCTCGACGAGGCCCTCGAACGCGCGGCGCAGCGCGGCCACGTCGACGC CGGCACCGAGCCGGACCGTCCTGCCCAGGTTGTAGAGCGCGCCGTCGGCC GACTTCTGGCACTCCAGCCACATCGCCCGCTGCCCCTCGGTCAGCGCAAA CGGCTCTTCCGGCGTCGCCACCCGCGGCGACGAGGGGCCGTCCGGACGGG TGAGCCCTCGCTCCAGGGCCGTCGCTGCGGCGGCGGAGGTCGAGGCGGCG CCGGAGGAGAGATGACTGGCGAGC7GCGCGAGGGTTGGGTGCTCGAAGAG CGTGCGGAGGGTGGTGCGGATGCCGAGGGAGGACTCGATGCGTCCGAGGA CCTGCATGGCGAGCAGGGAGTGGCCGCCGAGGTCGAAGAAGCTGTCGTGT CGTCCGACGCGGTCGAGGTGGAGGACGGATTGCCAGATGTGGGCGAGCTC GAGCTCGAGCTCGCCCGAGGGGGGCTCGTAGTCGGCGTGCGCGGCGGGGG GCGCAGGGAGGAGCTTCTTGTCGACCTTGCCCGAGAGGGACATGGGCAAG GCGGGGAGCAGGACGAAGTGGGCGGGCACCAGGGCGTCGGGCACCAGGCG GGCCATGCCTTCGCGCAGGTCGCGCTCGGAGGGCGGGTGGGCGTCTGGCA CGACATGGGCAATCAGGTGCGCGGCGCTGCCTTGGCCGTGGAGGACGACG ATGCCCTCGCGGACGCCGGGCAAGCGTCGCAGGACGGATTCGACCTCGCC GAGCTCGACGCGGCGACCGCGGAGCTTGACCTGCTCGTCGCGGCGCCCCG CGAAGGCGAGCTGTCCGTCGGGGCGCCAGCGCACCAGGTCGCCGGTGCGG TAGAGGCGTGCGCCGGGCTGGCCGAAGGGATCGGGCAGGAAGCGCTCTGC
GGTCAGGTCCGTGCGTGTGTAGCCCTGGGCGAGGCACGCTCCGCCGATGT ACAGCTCGCCGAGGGCGCCGGGCGGGACGGGCTGCATGTGCGGGTCGAGG ACGTAGACGAGGGCGCTGTCGACGGGTCGGCCGAGCGGGGGCTCGGCGCC GAGGTCGGCGATCTCGGCGACCG7GGTGATGACGGTGGCCTCGGTGGGCC CGTACATGTTGAAGAGGCGGAAAGGGAGCGGGCGCCGGAGCGGATGGAGC TTGTCGCCGCCGACGGTCATCGCGCGCAGGGCGGAGCCGGTCCAGTCTTG CTCGAAGCACGCCTCGGCCAGGGGCGTGGGCATGAATGAGAGTGTGGCCC GCTGAGCGACAAGCCATGAGACGAGCGCCGTGGGAGAGCGGAGCGCGTCG TCGTCGGCGAGGAGGAGGGCAGCGCCGCAGGCGAGCGGCGTCCAGATCTC GTAGACGGAGGCGTCGAAGCCGCTGGAGGCAACCTGAGTCCAGCGGTCGC TGGGCGAGAGATCGAGTCGGAGGTGGAGGAAGGAGACGAGCCTTGAAAGG CTCGCATGGCGCACACAGACGCCCTTGGGGGTGCCGGTGGAGCCGGAGGT GAAGAGGACATAGGCCAGGTCGTCGGGCCTGGAGACGAGAGGAATGTGGG TGCTGGGCGCGCACGCCCCGTCCTGGACGAGGTGGACGGGGCAGGGGGCG GCGGTGAGCTTGTGGCTGGCCTGGCTGCTGGTGAGCGCGAGCGAGGCGCG GCAGTCGGCGAGCATCTCGGCCAGGCGTGCCGGGGGGTTGGCGGGGTCGA GCGAGGCATAGGCGGCGCCTGCCTTGAGGACGGCGAGCTGGGCGGCGACC ATGCGGGGCGAGCGCTCGATGCAGACGCCGACGACGCTGCCGGGGCCGAC GCCGCGGTCGCGCAGCCACAGGGCGAGCTCGGTGGACCAGGTGCTGAGCT GTGCGTAGGTGAAGCGCTGGTGGCCGAACTCGAGCGCGGTGGCGTCCGGC TGTCGAGCGGCGTGGGCCTCGAACAGCGCGTGGACGCAGGCGGGGGCCGG GGCGGAGGCGGCCTGTCGTGCGGCGGCAGCGCCGCTCCAGTCGTCGAGGA GCAATGCGCGCTCGGCGTCGGAGAGCATCCGGAGCTCGGAGAGCGGTCGA CCGGGGTGCTCGACGGCGCTTTCGAGCAGGACCACGAAGTGGCGCGCCAT CCGCTCGATGGTGGCGGGGTCGAAGAGCTGCTGGTCGTACTCGAAGCGCA GGGCGATGCCGGCGTCGAGCTCTGCGGCGAACAAGGCGAGATCGAACTCG GCCGCTGCCTGCTCGTCGGCGAGCGTGGTGAGCTCGAGCTCTCCCTGCGC GATCCGCACGTCCCCCACGCCGATCGCGAGGGCTGACAGGCGTGCATCCA GCGATGGCGGGGTGCTCTGGAAGGCGAAGGCGACGTCGAACAGCGCGTCT CGCTGCGCCTCGCCCTGCGCTCGCGCGAGCAGGTGCCGGAGGGCGCTGTC
GCCGTGGGCCAGCGCGTCGAGGAACGCATCTCGCACGCGGGCGACGAGCG CGTCGAAGGACGCGGCCCCGCGCAGCGCCACGCGCACGGGGAGCATCTGG ACGAAGTAGCCAAAGGCCCTGGCGCTCTCGTCGTCGTGCCGCCCCGCCGA GGGGACGCCCACGACAAGGTCGC7CTGTCCGCTGGCGCGATGGAGCAAGA CGGTGAGCGCCGACAGCAGGACCGAGAAGAGCGTGGTCCCGCGCTCGCGC GCGAGGCGCGCCAGCGCTCCGGTCAGGGGCTTTGGCAGCGTGATCGCGTG AGCGCGACCGCGGCGAGCGCCCGCGTCGTGGCGAGCCCGGTCGCGGGGGA GGTCGATGGCGGTCGTCGCGCCGTCGAGCGCCTTGCGCCAGTATTCTGCT CCGCCGGCCGCCTCCCGCGGCGAGGGACAGCTCACGCCGGCGGCGAAGAA GCTCGACGGCGGCGGCAGCTGCGGGGGCCGGCCCGCGCGCAGCGCCGAGT ACAGCTCCCCCAGCTCGCGAACGATCAGTGCGAACGACCAGTATTCGACC ACTACGTGGTGATCCACCACCGTCAGCACTCTGCGTTTAGTCCTTTCTCC TGTTCGGCCTATTAATTGCTACTATGGATCCACACTGCTCCGCCTCTTGT ATCTCCCTTATCTGCACTTGCTGCGCTTNACCTTTACGTTCCTCTCCCTG CTCTATACTATTTCTTCCCCCGCTTCTCGTCCTATTCTGCATTTGTCATA TCGTATCTTCATATACCTTTCTTTCGCTATCCTTACTGCTTCTCGACCTT ATGTGCGTCTGTCTTCCCTTTCTNTATTATTTCTCTGTCTCACCGCTCTN TGCTCTGTCGCTCCTATCACTAAATTATGTCTCTATCACTGCTACTATCT GAAGCTGATCTTCGAGATCTCGCTNGGTGTCACTCTTTATCTCATAGNCG CCTCTGTCTTCTTGTCTCCTTAAGNCTGATTTTCTCGCTCTATTCGTGAC TACTCTGCTGTCTCTCACATACGTGTTCTTGAATCGTATTCGCGTTCTCG CTACTGTGATATCCATTGCCGACCTCTACTGCTCNTCTNTATGCTATACT TCTTAGTCTCTTACTACGTTNGTCTGATATNTTGCTGACGACGTCATGTC ACGCTCGCAACTCTTCANTTCTATCGTATACGCTGATCATCATTTTCTGT GAGGCTGATGTACTATACGTAATTACCTGTATACGTCGTCTATCTACTCT CGTGTCTTCACTCTTTCTACTCC
Seq ID No 91 (>Contigl2) CCCCCCGCCGTCCGCCGGTACGTCGCGGACCGCCGCCCCGAGCAGCTCCC CGCGCTCGCGCCGGAGGAGCGGGAGGCCGCGGCGCGCCGCCTGTCGGCCC
TCGGCGCGGCGCCGCCGCAGGTCCGGCGCCGCGGGCTGACGCGGGCGCCG CTCTCGTACGGGCAGAGCCGCATCTACTTCCTCGAGCAGCTCTCGCCCGG CAAGCCGCTCTTCAACGTCCCGGGCGCGGTCCGGCTCCGGGGCCCGGTCG ACGTCGCCCGCCTCTCGGCAGCG7TCGGCGAGATCGTGCGGCGCCACGAC GCCCTCCGCACGTCGATCGCCAACGTCGACGGCGAGCTCCTGCAGATCGC GCAGCCGCACGCGGGCTTCGCGCTCGACGTGGTGACCTCGACGCCCGAGG AGGCGGCCGAGCTCGACCGGCGGC7GCGCGCCGAGGCGTGGCGGCCCTTC GCGATCGGCGCGCCGCCGCTCCTGCGCGCCACGCTGTTCCGCCTCGCGGA GGACGAGCACGTGCTCCTCGTCACGATGCACCACGTGGTGTCGGACGACT GGTCGCTCGGCGTGATCCTCCGCGAGCTCCTCGCGCTGTACGCGGGCCGC TCGCTCCCGCCGCCGCGGCTCCAGGTCAGCGACTTCGCGGCGTGGCAGCG CGAGATGGTCGAGTCGGGGGCGCTCGACGGCCAGCGCGCGTACTGGCGAG AGCGCCTCCGGGGGCTGTCCCGGGCGAGCATCTCGGCCGGCGGCGGGGCG GAGGCGCCGAGCCACGACCCGTCCGGCGCCATCGAGGAGATCGCGCTCTC GCCGGACAAGGCGGCGGCGCTCGAGGCGCTCGCGCGGCGGGAGGGAGCGA CCCTGTTCATGGTGCTCCTCGCGC7CCTCGACCTCGTGATCCATGCGCGG TCCGGCGCACTGGACATCGCCGTGGGGACGCCCATCGCCAACCGGAACCG CCCGGAGCTCGAGGACGTGGTCGGCCTCTTGACGAACACGCTCGTGATCC GCGTCGATCTCGCGCGCGCCGGGGCGTTCCGCGACGTGCTCGCGCGGGCG CGCGTCCAGGCGCTCGACGCCTTCGCGAACCAGGACATCCCGTTCGATGT CGTCACCCAGGATCTGAAGCAGGAGCGCGACCACGCGCAGCACCCGCTCT TCCGCGTCTGGCTGGCGCTCCAGAACGCGCCGAAGCCCGCGCTGGAGGTC CGCGGGCTCCGGGTCGAGCCCCTGCCCCTCCGGCCCGAGCTCGTGCACTT CGAGGTCGCCCTCCTGCTCTGGCCGGCGGACGACGGATCGGTCGTGGGGC ACTTCGAGTTCCGGCGCGATCGCG7CGACGAGGGCGCGCGCAAGGAGATC GCGGCCGCATTCACGCACCTCGTCGACGCGGTGATCGCCCGGCCGGACGC GCCGGTGTCGACGCTCGTGGAGGGCGCCCGCGCCGAGGCCGCGCGAGCGC AGGCCGCGCTCGGCGAGGCGTTCGCCAGGGCGGCGACGGCGCGCCTCGGC CAGCTGCGGCGTCGCTCGGCGGGCGACCGGACGCCCCGCGAGTAGCGGTC AGCCCTCGGCGGCGGCCAGGCGCACGCGGAACGGCGCAGGGTAGCCGTGG
ACGCGCGGCATGGGGTCGATCGCGCTGGGGACGCCGGCCCGCAGCAGCTG CTTGATGGCGAGCGAGATGTGCAGGATGGCCACGTACTTGCCGTGGCACG TATGGATCCCTGACTCCCAGAACAGGTAGTTTCTGCTCGGCAGGCGCCCG GGCCTGAACTGGTCCGGGGCGTCGATGTGCTCGTGGTCGTGCATCGCCGA GGCGCTGCAGGCCATCACCAGCGCGCCGGCGGGGACCTTCTCCTCGTGCC GCGTGCCACGCCCGACCGTGTAGTCGCGCACGCAGAGGCTCGTGACGCCG GTCGACGGGGGACGGAAGCGCAGCGCCTCCAGCACATAGCCGGTGATGGC GGCGTCGTCCTCGACGTTCACCACG7TGAGCGCGTCGCGCAGGACGCGCG GGCGCTTCATCAGCTCGACCAGGGCGTTGACGATCGCGCCGCCGCTGAGA TCCACGCAGCCCATGAGCAGCCCCAGGATCACGTCGCGGATCCCCTCGTC GCTCTCGTAGGTCTCGGGGACCGACTGCATGACCAGGTAGCGGTCCAGCA CCGAGGGTTGCTCTGGCGGGGGCGACTTGGCCAGCTGCTTCTTCCGCGCG GCGACGATCGCGTCGATCATCGGCAGCGCCTCCTGACGAGCGGCCCTCGC CGCCGCCACGGCCGTCGGGTCGTTGGTCGGGTTGAGGAAGATCTCGTTGA ACAGCGCGTGGGTCCACGCCACCACCTTCTCGGTCGGGATCTCGCCGACG CCGAGGTACCGGGCCATCGCGCCGGCCGGCACCCTGAGCGCGTAGTCACC GGTGAGATCGAACGGCTTGTCGACGCCGACCTTGGCGAGCAGCCGGTTCG CCTCGTCCACGACGATCTGACGGTAGCGGGGCAGATCGGCGCGCGGGAAC GCGAGGCGCAGGAGCGACTTCTCGTGCTCGTACTTGGGCGAGTCGTTCAT CGCCAGGATGTTCTGGCCCACGTTCTCGACCAGCTTGGGCGCGATGTTGT CGACCGAGAAGACGTCGTTGGCG7TGAGGACCTCGACGACGTCGTTGTAC CGGGTCACGAGCGTGATGGCCGGGATGGAGAAGATGGGCTTCTCGCGCCG CAGCTGGCTGAGGAACGGGAGCGGCTCCTCCCTCAGCCACTTGAACACCA TGCCGGCCTCGATCTGCTTCCGCTTGACCGGATCGTTCTCGTGCGCCAGC GCGCTGTGGAGGGCCTGCAGGTAA7CGAACGGCGGCGCCTTGGCAGCGTC CGCTCGTCCCTCTTCTTCGATGTGAATGCTCATGGGGAGAATTCCTTTCT CGCATGCCGATCAGATCGCGACGC7CTGGGGGACCATCGACGGGAGCAGG TACAGGTACGGCTGCTCGCGGGCGCGGTTGCGCTCGGTGATCTCCCGCTC GATGTGCGCCAGGCGCTCGCGGTAGCGAGCAAACGCCTGCTTCGCCGCCG GATCTTGCAACAGGCACTTCATCG7CATCAGGCTCTCGCTGGTCAGCGGC
GGGCCCATGCTGAGCGAGCGGCTGAGCACCATCTGGCGGATGCTCTGCGC GCGCCCCGGGAGGCGCTCCAGCGGCTTGAACTGCCGCTTCTCGCTTCCGT TCAGGACATCGCCGTAGGAGCGG7AGGTGGCAAACTGAGCGTTCGGGATC CAGGTGTAATAGTCCGTCTGCCCGAAGTTCACGGCGGCGTGATATGCGGT CGCCGTGAAGATGATGTTGGTGACGATCGCGATCAGGTCGTCGAGGCTCG TGAGCTTCTCGAGCTGATCGGCTCGCTCCGGCGGGAGGAGGCTATCCATG CCGCCGAGCTGGGGGGACACGAGCTCGTGGATCCACCGCTGCAGGCTGGC GTCGCTCGACAGAGACCCCGGCGTCGGGTAGGCGATCTTCAGCACCTGTC CGACGTACTCCTGGATCGCGTCCCAGTGCAGCAGCGCGTCGTCGCGGTAG TGATAGCCGACCAGGTCGCGGACGTCGCGCGCCGACAGGTCGCGGGGGAG CGCGCTCTCGTAGAACCGCCACGGCTTGCCGCCGTACCCTTTGATGCCCT TGCCGGTGTAGGCGCGCGTCAAGAGCTCGAACGAGCCCATGGTGGCCACC GAGCTCGTGATGTCGAAGAAGCGCCCTCGCCCGAGGAAGCGCCGGCGAGC CAGCTCGTTGATGGCCAGGGTGTTGAAGAAATGCGGCCTGAGCAGCTGGT GGAGCGGATGCGTCGCGGGCAGGTTGCGGTAGGTGCTCACCGCGAACGGC TCCACGATCAGGTGCGCGTACAGCAGGTGGGTCACCTGGCCCTGGTAGAT GGCGTCGGCGCTCGCGACGGCGATCTTCGCCGTGAGCCAGTCGTCCGACG GACCCGAAGGGGTGAAGATCTTGTCGGGATGCGCCCCTTTCCCGGGGCGC GAGTGCACCAGCCTGATGGCCACGGGCAAGAGCTCACCGGCCGCGGTCTG GTGCAGCATGCACGTCGGCGCCAGCGGGTACTTGCCCAGCTCTTCCTGCA CGTCGGTGTCGACGATGTCCTTGAAGATGCGGTAGTCGAGGAAGTAGAGC TGCCCGCCCTCGCGCACCTCCTCCAGCGTGCGACCGTCGGCGATCGCGAT CGGCTTGGGCTCGGCGCCGCTCACGAAATCGGCGAGATCGGCCGGGGTCG CGCGGCGGATGTGCGCCGGGTTGATCCCCACGAGGCGCTGCCGCCCGAAC TCGGCGTCCTCGGCCCAGCGCGTCGCCACGAGGGGCTTGCGGATGAAGGT CCACGGCTTGAAGAACTCCTCGAACTGATCGAAGCTCTCCCAGTTGTCGA TGGACTCGAAGATGGCGCCCAGCCCGAGGTCGGACGTGGCCCTGAGGACG AACTTCCCCTCGCGATAGCGCTTGTACCCGTACTCGAACAGGTGAAGCGC CTGCGCGATTTGCAGTCCGGCAGTGTCCTTCCACTTGCCGAGGTTGAGCG CCAAGATCTTCTTGATCGGAAAGCCTTCCCCCGGCGGTACGGAATCGCTG
CCCGCCGGCAGGTTGGACCCGAAATTGCGCCAGTTCGGTGTCGAGCTGGG CTCCCTCATGCTCGTGCTCGCTTCTCCGTCTCAGACGGACGGTGGATTGG GTGGTTCACGTCAAACATCGCTCTCGCGTCGCAGCGGTCCGAGCGCGCGC CGGAATGGTTCCGTCTCAGTCGCAACAGGACTCAGTACATCCAGCGCCGC CCCCCGTCCTCGACCTGCCCCCGCAGCCGATCGCGCCGCCCTTCATCGTG GAATCGACAGGTGCGATTCCACGAAAAGCCGCCGCGCCGAGTTGCACGCG ACCGATGCTCACGCGTGCATTGTTGAGGCTGCTAGAAAACCGTGGAGCGT TCACGCATGTCAAGCCATTTTGTTCGGCGCCGCGGCGAGCGGCCGGATGC CGCGCGCCCCCGCGCCGGGCGTGTTCGCTCCCGACGTACCGCTACCTCGA CGACGTATGGCTTGAAGGGCAACCGCGCAAGTCGTCCGATTCGTGCTCGT ATCCTGCTCCTTCCAGCAGGATTTCCCCGCCGCCAGCGGCACAAAGGTGC CAGGGCGAGCAGAAAGAGCGCTGCCCGCCCCTCCCCGGCCGCGCTCGCCT CGATCAGCGCGCGCTCGTCGTCGGTGCCTGAGACGTTCGACGGACGTCAG TTAGTTAGCCTAGCTAACTTCAACACTGATGCGACTGATCGGGCCGACGC AACCGACGCAACCGACGCAACCGACGCAACCGACGCGACCGACGCAATCG ACGCAACCGACGTGACGGACGCTGGCGACTCGAAGAAAACCACGGACGCA CTCCACGTCATCGACGTCATCGACGTCATCGATGCGCTCGATGCAATCCA TGCACTTGACGCGATCGGTGCGAGCAGGCGACGAGGTCCTCTCGTGAAAC ACCGAACCGAGTGCCGGTAGCGGGCGCGCCGCAGTGTATGCTAGGCTCGG CCCTCTTGTCGAGGCCGCGCGCTCGGCGGTCGAGCGTGGGCTCGGGTGCC GCGGTATCCGGCTGAACCAAGGAGGAGCGAGCCATGCAGGCAGATGACGA CGCGACGATCTACAAGGTGGTGGTGAACCACGAGGAGCAATACTCCATCT GGCCGGCGGACCGAGAGAACCCGCTCGGCTGGACGGAGGCCGGCAAGACG GGCAACAAGGCGGAGTGTCTGGCGTACATCCAGGAGGTCTGGACGGACAT GCGCCCGCTCAGCCTCCGGAAGAAGATGGCCGAGAGCCCCTGAATCGCGG CCCGCCCGAGCGCCCGTCGCGAGCGGCCGGGCGGCGGGCTCAGCCGTGTC ATCGTCGCGCTCGACCGGCCGCGTCCCGCGGGATCGCGCGAGCCCGGCGG GGTCGTGCGCGCCGGCGCTTGTGCCGGGGCCCCCGCTCTCGTACGCCTCC GTCATGCCGCCCCTCGATCTGCACGTCGCCTTGTTCGGCGCCTCCGGCGC CGGCAAGACGGTCCTCCTGGCAGCCTTCTACCGGGCGCAGACCCAGCCCT
CGTTCCAGCAGGAGTACGCGTACAAGATCCAGGCGGTCAACAAGGCGCAG GGCAACCAGCTCCTCGGCCGGTTCTATCGCCTCGAAGAGGGCAGATTCCC GGACGGCAGCACGCGCTTCGACGAGTACGAGTTCGACTTCTTCCCGAGAG ATCTGCCCGAGCCGGCGGTCCGCA7CCACTGGTACGACTACCCGGGACGC TGGTGGGAGGACGAGCCGGTCGACGCGGACGAGCGGGAGGCGATGCGCCA GGGCCTCATCCGGCTCGGGATGAGCCAGGTGGGCATCCTCCTCGCGGACG GCGCGAAGTACCGGGCCGAGGGCACCGGGTACATCCGGTGGCTGTTCGAG CACTTCGCCGACGAGTGCGACCGGCTGCGCCGGGCCAGCGCCGCCACGGG CGACGAGGTGAGCTTCCCGCGGGAGTGGATCCTCGCCCTCAGCAAGGCCG ATCTCTGCCCGCCGGACTACAGCGCGCGGGACTTCGAGCGCGAGGTCTGC CGGGACGCCGACGATCAGCTGGCGAAGCTCTGCTCGGTGCTCCGCGCCGA GCACGCGTTCGGCCACCGCTTCATGCTGCTCTCGTCGGTCGCCGCCCCGG CCGGCGCGCAGGTCGATCCGAGGACCTCGCTCGGCGTGCGCACCCTCGCC CCCGCGATCCTGGTGAGCACGGTCGAGGGCGCGGTGCGCGAGGCGCAGGC GGCGAGAAAGGAGAAGTCGGCCGGAGAGACGTTCTTCCAGGGGCTGCGCG ATCTCGTGCAGTTCGTCGACTCCC7CGACGACTTCCTGCCGAAGCGATAC CAGATCGTGAGCAAGATCCTGCGG7TCATCTCGATCAAGGACTTCGCGAC CACCCGGCTCGACCGGCTCAAGAAGATGCGCGAGGACGCGATCCGGAAGG GCGACACCTTCACGGCGGTCCTGACCGCGATGGTCGCGGCCCTGCGCGAC GACGAGGGCGCCCGCGCCTACCACCAGAACCAGTGAGGTCGTCATGCCCG CGCCAGCGCCCCTCGTCGAGACATCGCGCCTCCTCTGGAGGACGCGCGGC GAGCACTGGGATTACGAGTTCATCTGTGTCCCCGAGATCCCGGCGCTGCC CGCCTGGCTCTCGACGCTCGAGGCGATGCTCGCCGACGCCGACGCCGGCG CCGGGGAGCTCCGCTATGGCCTGC7CGAGATCGACGATCGCGGGCAGAGG GCGCCGCGCGCCTATCCCTACGTGGCCGTGAGGTTCCTCGATCCGGCGCG GAGGGACTGGACCGGACGGCAGG7CCAGCACTTCGCGGCCTGGTTCCCGC CGGTCCCGCCCGAGGCGGTCGCGGAGTTGCCAGAAGCGGTCCCCGCCGAC TGGCACCTTCGCGTGCTCGACGGGCTCGCGGGGACGTACGGCTCCGGCGA GGTGTTCGGGCTCCCCGAGGCGACGATCCGCGCCTGGAAGCGGAGCCACG ACGAGAGCCGGGCCGCGCGCGCGATGGCGATCGTCAAGGCGACGCCGCCG
GTTTCGCTGGGCGGCGGCGAGGCGGCGCCGTCGCGGTGGACGCGGGTGCC GACATTAAAAAAAAAGCCGCCGGAGCCGCCGGCCGCGGCGGGCCTCCTCT CGGTGGGCGCGGTCCCTAGCGGCCAGGGCCGGCGATTCGGCTGCTTCGCG ATCGGCGCCATGATGCTCGCCGCC7TCTGTCGACTGATGCTCGCTTGCGG TGTGCGCCTCCTCGGCGCCTGACGGCTGCGCCGCGCAGGCCATCCGACGG GGGGTCGGCCCGGCCAGCGCCCGCCGGGCGACACCAGGGCATCGGCCCTC CGCTCGGGGCATCGATTGAGCTCTCCGAGCGGCGGTCCGTCGTCAATCGC CGCAGAGCTCCCACCGGGCGGAGCAGCTCTGGCCGGTGACCGCATAGGGG TTCGTCGGGCAGGTCCACCACTCGCCCTGGAAAGGACGCGGGTTGCAGTG CGGGAGGCACTCCACCCACCCCGACGAGCACGAGTTCCCTACCGAGACGG TCGGCTGAGCCGCGCAGAACCAGCGTTTTCCTGCGAACCAGCCGGGATTG CACACATCGGGCGCCCCGCCGACGGGCGGATACACCGTGGCGACCGCCTG GATGTCCACGGCGTCGAGGGTCTCGCTCCCGAAGCGCACGCCGTCCTGAT CCGAACACCCACTCCAGTAGGTCATGATGGAGTCGTAGTCGTAGAAACCA GGGTTCACGACGATGTACCGCCGGCTCGAGGGCCACCCGCTGGCGACGTC GCTCGCGGGGAGCGGCTCTCGT7GGCTGCAGGCGCTCGGGACCAACGGGT GATGCCACTCATGCATGAAGCCGATCGCATGACCCATCTCGTGGATCGCG TACTGCTCCACGCAGTCGAAGCTGTATTCGACCCGGGCTGTCTGCCAGTT GTACTTGATGCAACGGTTGAAGTCGGCGCCCCAGGGCTTGAACTGGACCG AGCCGCCCTTGTTGTAAACGCCGA7CGAGTCCGATTGGTTGGGCGCGTCG GGGTGGATCCTGACGCCGACGTAGGTCATGCGAGTGGCCGGCAGGAGCGA ATCGCAGCTCTCCCAGCCGGTGAAGCGAACCGAGCTCCAGCGTTCCCAGC TGCCCTGGAGCGCGGTGCGCACGCGCGTGATGACGTCCGCGAGCGAGGGG TTGGGCGCATGGATCAGCCCGCCCGCGGCGCCGTCGACCCTCTGCTCCGC CGAGCTCGTGGGGTCGATGCAGACCGGGATCCGGACATGGCCGTCAGCGT CCTCAGGCCAGCGACTCGCGCTG7CGAAGACGCTCGCCTCGGCGGACCGC GGCGCGGCGGAGACGGTCAGCGCGGCGCCCAGCGCTGCGAGGAGCAGCGG ACCGAGCGAAGAGCGAAACCGCACATGTCGTTCAGGGCCCCGCGTCGTGC GGTGCACCGAGACAATCTCGAGCGGGCTCATGGACGCAAACGCGTTGCGA TGGCCTTGCAGCATGTTCTTCTCCAATCGACGAGGGTTGTTCTGCTGAAC
GCGGCTCCAGCGTGGAGCTCGACGCGGTTCACCGGCTTCACGCCGGGGCC GTGGACGAGACCCGAGCACGGGGGAGGTCGCAGCCGCACCGGCTCGCGGC GCCTCCACCCTGCACCTACGACGAGCCTGCCGCTCGGTTTCGCGGAAAAT GCCACCCCGCTGCCCAGCGGGCGAAGCGCGGACGAGGCGCTCGTCCCCAC GGTAGCGCCGGTGCCGCTGCATCCACCGCGCTCCTCCATGGGTCGCTGCC CGCGGGTCGTCGAGGAGACGGACCCGGGGCGCGGATCCCTGGCTCGGCGT CGCATAGCTCGTAGGGGCGGCCTTGAGCCGGCGGTACGAGCGGCGCAGTT CAGCAGCCGACCACGTGGACGCGGCGCGCTCCGAGCTGCGCCGAGGAACC CTTCAAATATTCAGATGGAATTCACAGGGTGGCTGAGAGACGGGGAGTAA GATCTCAGAGATCTCCCTGCCTACCCGCATCCCTGTTCAATTTTCCGCCC ACAACGCGAACGGATGAGGAAATATCAGCCCGCGATCCCGACGGCCGACA GCATCAAAGGCCGCTCGAATCCAGGGGATTCGAGCGGCCTCGGTCGCGCG GACCCCCGCCGCGAGCCGCTTTGTCACCACTTCACCACTTCAGAGCTTCG ATCATCTTCTCACCATAACGCGTGCCCATGATAACAACGGACGCATGATC GAAGTGGTACTGATCCATGACGTTGGTCCCCTGCTGGCTGACCCAGTACC CCATAGGCAGCATGTCGGCCGCTTGGTGCACGAGGTTGTTATGACCGCCG CAGCACCCGCCCGCGGGGAGCTCTCCGAGAATGAAGGGAACGTCGTAGTC GACCCCCCAGGCTGCTTTCACCTCGTTATAGAGCTGAACGACCTTGCCGG GCCACGAGCTCTGGCCGTTGTCGGACTCACCCTGGTGGAAGATGATGCCC GCGAAGCGCGCGTTCTCGGCCGTCTTCGCTTTGGCGATCTTGTTCAAGAT CATCTGGTGATGCGAGCCACCAGTGATGAACGTGTTGATCGACTCGCCGC TCTCAGCGGTAGCGACCAACCCGATCGTATCCCCCTCAGGCAGCTTTCCG AGCAGGGTCTTGCCGAACCAGATGCCCGGGTCGACGGAGGTCGACAGGTT CCATCCTTTTTCACCAGGGCAATCGCTGAGCGGCGGATTGGCCAAGTTCC ACTGTCCGGCCGGCTGATTGCATCCGCCGAGGACCTTGAGCCGCGCGTCA GAATTTTTGTCGCTGTCCTGTTTGTCTGCGACACCAGCCATATTCGACTG GCCCATGAGCATGAAGATGTGAAACGTCGGACTCGCGCTCGGTGCGCCGC CGGTGCCTGCCCCGCTGCTAACGGATCCGGTCCCTCCCGTGGCGTCACCT CCAGTTCCAGCGTTCGTGCTGCCTGTCGCGTCGCCCCCGGTCCCCGCGCT CGTGCTGCCTGCCGTGGCGCCGCCGGTCCCCGCGCTCGTGCTGCCTGCCG
7GGCGCCGCCGGTCCCCGAGCCGGCCCCTCCGGTGTTGTCGTCCTCACCG GTCGCGCCGGACTCGCCACAACCGGACGCAGCGATGATGAAGAGGAATGG GAGGAGCAGGAACCTGGGTGTGCCTCGGGTCGTGCGGTTCATCTCGGTCA TGATCGTTACCTCGTCGCGCCGGGGCGCGATCTGAAGAGCATGGCGGAAT CGGTAGGCCGGCGTCGCGATGCCGGCGCGGCGAACCTCGCCCGCAAAGAG CTCAGCGCCGGGGCCTACCTTATCGCATCTTGGGCGCTTGGCGTCCAGGA TTCGGCCTTAGACAGCACAAGCAGAAGACCTTTGACACTGGATTTTTTCA TCATCGGCGGCGCTCGTTCTTCGCTGCGCCTCAAGCGCCGACCGTTCGTT TCGAAGCGAAGCGGTTTCACGATCCGGGATCGCGGAAATTTGAAACGGAC GCGTCGCGCGGGCAACGCAGGGGACTCATCACGAGGCAACCGCGCTGCGT CGCGAAATTGGCCAGCCTCTCGGAGTCCCTAGTTCCGTGCGTCAGACGCG TCACCCACCATGTCGAGCTCGGCGCGGCGCTCTACGTGCTTGAAAGACCT CCGCGAGCGCCGCGCTCTGCGCTCCGCGAGCGCACCAGGCTCCCCGTGGA TTCAGGGCAAGGCGGTCGTGATCACGTCCTTCGTGGTCGCCGTGCTCTGG CCGGGGCGGTCGACGCAGGACGGGTAGTACTGGCCGGCCCCGTTGAGCTT CTGGCAGACCCGGTCGCACGACCCGGCGATGCGGATCAGCCCGCACTCCA CGATCTGACCCCCGGAGGTCACGTGGCCCGCGGCGCAGTCGCGCTGGTAG GCGCGCGAGTTGTCGACCGTCGCGCTGTTGTAGCAAGCGTTGATATAGGG TTGCGCGGCGAACAGGTTGCCCCAGAAGGCCCCCTCGACGTCCGGATAAT CGATGAGCTCCTGGCTGGAGGAGAGCGTCTTCAGCGGATCCCGCAGCGAG CGGGCGGAGAGGAGCACCGGTACTTGATAGTAGTTCACGCGCGCCGCCAC GCAGCTGGACACGATGCGCTGCCCTGCGTCGTCGAGCGGCCCGCTCGCCC ACGCGGGCGCGACGCCGAGCAGCCCGGGGTAGCGCTCGTCGTGCCTCTTG CCGTTCGAGTCCGTCCACGAAAAATCGAAGGAGGCCGTGCTGCTCAGGGC GCAGCTCGCCGCGTAACGCAAGAAATCGCGCGCCAGCGCGCCGCTCGGCC CGGGATCCTGGATCGCGGCGAGGTTCCGCGCGCTGAGGCCGCTCAGGTTC AGGGCGTTCAGGTTCAGGGCGTTGAGGTTCAAGGCGTTGAGGTTCAGGGC GTTCGTGCTGAGCGCGTTGCCGCCCACGAGGGCCCCCTGGGATTCCCCCA CAGGCTCGCCCCACGCATCGGCGTCCACCACCTCGGCGGCGCAGCCCGAC AGCACCCCTGCCCAACCAAGCACGATGAATGTCCGCTCGAGAGACATGGA
TTCCCCCGTGTTCCTGGCGCATGACCCGACGGCGCCCTGCGCGCGGCGCG CGCGGGCTCCCATCGATTCGCTGGATGGGTTCAATATTCTACT7TTTCCC GCGCTCTCGCGCCGGTGAAAGTCGCTTCAGCGGCGGCGAGGTCGATGTCA GGAGCGTCCGACTCCGTCGCTCTCGTCAGCTCCGCGTACCAGCGACGGAG TCGCCCGCCCATGACGGTCGGAA7GGTAGAGGCGGCCGCGAGGGCGCGCT CGAGCTGCGCCCGGGCGTCGGCGCGCCGGCCGCGGCGCAGGGCGGCGAGC GCCCGCGCCTCGAGCACCTCGATCCGCTCCTGCCCGACCGAGCAGCGCGC GGAGCGCTCCTCGAGCGCGGCCCACGCGGCGCGGTCGTCGTCGCGGGTCG CGAGCTCGATCATCGCGCAGAGCACGTCCTCCGAGGGCTTCAGCGCCTCG CAGCCGGCGTCGTCTCGCGCCGCGCGGAGCCGCAGCGCGATCCGGCGAGC GCCCGCCTCGTCGCCCTGGTAGAGGCGCAGGCGCGCGATCAGGAGCGTTA CGACGACCGGCGCGTGGCGATCGCCGCAGCGCGGCGCTATCGCCTGGACG GCGCGCGCATGGGGCCTCGCGGCCGCGAGATCGTCCATCAGGTACAGGTA CTCGGCGAGGTTGTAGCGGCCGACGAGCTCGAACGCGGGCTGGCCGAGCT CGCGCCCGAGCGCGATGGTGCGC7CGAAATCGGCGATCATCCCGGCGCGA TCGCCCTGGAGCGCCCGCGCGAGCCCGCGGTTGTTGAGCGCGGCGCCGAG GTGCATGAGATCGCTGCGCTCCTCGCAGCTGAGGATCACCGCGTCGAGGT CTCGCGCCGCCTCCTCGACGCGGCCGAGGCTGGCCAGGATGAAGCCGAGC AGCAGCAGGGCGATGATGTGCGTCTCGTGGCCCTCGTCCCCGAGCCGCGC CGCCTGCGCCGCGGCGCGCGTCAGCACCGCGGCGGCCTCGTCCTCGCGGT CGGCGCGGTGGAGCGAGCGGCCCACGCCGAGGAGCAGGCGGGCGCCGAGC AGGGGCGAGGCCACCCGGCCGGCGAGGCGCTCGGCGGCCGCGACCCGCTC GCGCGCGGCCCGGTACTCGCCCG7CCAGTCGAGGATCATGGCCTCGTCGA GGAGGAGCTCGATCTCGGCCCCCGCCTCCGACGCCGCCGCCGCCGCCTCG CGCGCCGCGGCGAGGTCGGCGAGGGCCTCGGTGTGGCGCCCGAGCCGGAA GCGAGCGAGGCCCCGCGCTCGGCGGTCCTCGGGGAGCAGCGCGCCGAGCA GCGCCTCGACGCGCCCGTAGCAGCCCTCGGCGTCGAGGTAGGCCCGGCGC GCGGCCGCGAGCTCGGCGCCGCGGGCGAGGAGCGACGCCGCGCGGGCGGT CAGGCCGCCGCGCTCGCAGTGCGCCGCGAGCACCAGCGGATCGGCCTCGC CCGCGGCCTCGAGCCAGTCGGCGGCGAGGCGGTGGCCGAGCGCGCGATCG
GCAGCTCGCCGCGTAACGCAAGAAATCGCGCGCCAGCGCGCCGCTCGGCC CGGGATCC7GGATCGCGGCGAGG77CCGCGCGCTGAGGCCGCTCAGGTTC AGGGCGTTCAGGTTCAGGGCGTTGAGGTTCAAGGCGTTGAGGTTCAGGGC GTTCGTGC7GAGCGCGTTGCCGCCCACGAGGGCCCCCTGGGATTCCCCCA CAGGCTCGCCCCACGCATCGGCG7CCACCACCTCGGCGGCGCAGCCCGAC AGCACCCCTGCCCAACCAAGCACGATGAATGTCCGCTCGAGAGACATGGA TTCCCCCGTGTTCCTGGCGCATGACCCGACGGCGCCCTGCGCGCGGCGCG CGCGGGCTCCCATCGATTCGCTGGATGGGTTCAATATTCTACTTTTTCCC GCGCTCTCGCGCCGGTGAAAGTCGCTTCAGCGGCGGCGAGGTCGATGTCA GGAGCGTCCGACTCCGTCGCTCTCGTCAGCTCCGCGTACCAGCGACGGAG TCGCCCGCCCATGACGGTCGGAATGGTAGAGGCGGCCGCGAGGGCGCGCT CGAGCTGCGCCCGGGCGTCGGCGCGCCGGCCGCGGCGCAGGGCGGCGAGC GCCCGCGCCTCGAGCACCTCGATCCGCTCCTGCCCGACCGAGCAGCGCGC GGAGCGCTCCTCGAGCGCGGCCCACGCGGCGCGGTCGTCGTCGCGGGTCG CGAGCTCGATCATCGCGCAGAGCACGTCCTCCGAGGGCTTCAGCGCCTCG CAGCCGGCGTCGTCTCGCGCCGCGCGGAGCCGCAGCGCGATCCGGCGAGC GCCCGCCTCGTCGCCCTGGTAGAGGCGCAGGCGCGCGATCAGGAGCGTTA CGACGACCGGCGCGTGGCGATCGCCGCAGCGCGGCGCTATCGCCTGGACG GCGCGCGCATGGGGCCTCGCGGCCGCGAGATCGTCCATCAGGTACAGGTA CTCGGCGAGGTTGTAGCGGCCGACGAGCTCGAACGCGGGCTGGCCGAGCT CGCGCCCGAGCGCGATGGTGCGCTCGAAATCGGCGATCATCCCGGCGCGA TCGCCCTGGAGCGCCCGCGCGAGCCCGCGGTTGTTGAGCGCGGCGCCGAG GTGCATGAGATCGCTGCGCTCCTCGCAGCTGAGGATCACCGCGTCGAGGT CTCGCGCCGCCTCCTCGACGCGGCCGAGGCTGGCCAGGATGAAGCCGAGC AGCAGCAGGGCGATGATGTGCGTCTCGTGGCCCTCGTCCCCGAGCCGCGC CGCCTGCGCCGCGGCGCGCGTCAGCACCGCGGCGGCCTCGTCCTCGCGGT CGGCGCGGTGGAGCGAGCGGCCCACGCCGAGGAGCAGGCGGGCGCCGAGC AGGGGCGAGGCCACCCGGCCGGCGAGGCGCTCGGCGGCCGCGACCCGCTC GCGCGCGGCCCGGTACTCGCCCGTCCAGTCGAGGATCATGGCCTCGTCGA GGAGGAGCTCGATCTCGGCCCCCGCCTCCGACGCCGCCGCCGCCGCCTCG
CGCGCCGCGGCGAGGTCGGCGAGGGCCTCGGTGTGGCGCCCGAGCCGGAA GCGAGCGAGGCCCCGCGCTCGGCGCTCCTCGGGGAGCAGCGCGCCGAGCA GCGCCTCGACGCGCCCGTAGCAGCCCTCGGCGTCGAGGTAGGCCCGGCGC GCGGCCGCGAGCTCGGCGCCGCGGGCGAGGAGCGACGCCGCGCGGGCGGT CAGGCCGCCGCGCTCGCAGTGCGCCGCGAGCACCAGCGGATCGGCCTCGC CCGCGGCCTCGAGCCAGTCGGCGGCGAGGCGGTGGCCGAGCGCGCGATCG TCCTTGGTGAGCTGCGCGTAAGCGCCCTCGCGCAGGAGCGCCTGGCGGAA GGAGTACTCCTCCTCGCCGGGGAAGCGGCCCTCGCGGTGGCGGACGCAGA GCTCCCCGGCGACGAGCGCGGAGAGGTGCTCCGCGAGCGGAGCGGCCTCG TCGCCCCCGAGCAGGTGCGCGACGGCGCCTCGCCAGAACACCTCGCCGAG CACGCTGGCGGCCCGCAGGATCCGGCGCGCGGGGGGCGCGAGCGCCTCCA GCCGGACCTGCACCATCGCCACCACCGTCTCGGGCAGCGCGTCGCCGCGG CCCTCCGCCGTCGCGCGGATCAGCTCCTCGAGGAAGAACGGCTGGCCCTC GGACTGGGTGACCAGACGATCGATGAGGGCCCCGTCGGCCGCGTCGCCCA GCGCCTCCCGCGCGAGCTGCGCGCACGCCCTCGGCGGGAGCTGCCTGAGC CAGAGCTCCTGCCGCCCGCGCTCGGCCCAGAGATCGGGGTACGCTTGC,
or their complementary strands,
(b) DNA-sequences which hybridise under stringent conditions to regions of DNA-sequences according to (a) encoding proteins or to fragments of said DNA-sequences,
(c) DNA-sequences which hybridise to the DNA-sequences accord- ing to (a) and (b) because of a degeneration of the genetic code ,
(d) allele variations and mutants resulting by substitution, insertion or deletion of nucleotides or inversion of nucleotide segments of DNA-sequences according to (a) to (c) , wherein the variations and mutants offer isofunctional expression products.
13. Peptide encoded by a DNA sequence according to claim 12 selected from the group consisting of Seq ID No 92
>Contigll_002 591 amino acids MW=63639 D pl=5.80 numambig=0 MLDVWSTSDQVACRLHCAGAGPSASLELRYDASAGARRDAERLAERLAALLEDLSRHPER PVAQGEYVGPGERAEIEAWSRGPAMELPSACALHRWFEERAEQHPDWAVRSEGKSLTYG ELERRANRLASCLRRRGVGLDTIVGVCVPRSEDMWATLAVLKVGGAYLPLDHEYPGERL AFMMRDARARLLVTHDAIADELPTGGWTTLLLDAEAAEIAACSDARPAVSPPPDSGAYVI YTSGSTGTPKGSLISHRAIVNQMQWIQRY ALTADDRVLLKAAFGFDVSV EIF PLSFG ARIWARAGGHRDPEYLRRLVRDEGATTAYFVSSMLAAFLGGPEQPFPASLRKVLVGGEA VPLDLVRRFYAKHDGDLINMYGPSΞAAIAVTGCVLPSDPRVTWVPLGAPVANAEVFVLDG AMRRPAIGALGDLYIAGAPLARGYVGQPGLTAERFLPDPCARAAGGRMYRTGDVARFLPD GMLEFQGRSDHQIKLRGHRIELGDVEAQIRRVPGVGQAAWLREDAPGDARLVAYWLDG DAAGDAPDVRAGLKASLSAYMIPSSWRLYALPMCSERLAFTGSSYAGCLL*
Seq ID No 93
>Contigll_007 361 amino acids MW=38862 D pl=10.42 numambig=0 MSDHEMTGFSLSPQQRAIRALDREAGAPGCRTLAWAVTGPCDEGRLSAAALALAERHEI LRTRLVEAGRARPRRWSASRASRGRQQDDWVGCSEAEQGERMSRLVARLSEDRGADDGLR VGLVRVGPEERRLVLAAPAWCVDEΞSIAPLVRELCASTAGAGAPPEQQYADVAEWLNGML ESEDAGDGRRFWAERRSHFGPPLH AFSRGGAGAGAGSGRARVDLGQGGMAQVERWSSSW
QVPQRIVLLALWASLLWRMSGGNEPEVTVAVRFDGRSLDALAGAVGPFARFLPVRIEISA SDTLADVARRLALAEAEAAAHQDAAPGVSHRMSWGLLRRGGRAGAVARRRAGPRARRLEH V*
Seq ID No 94
>Contigll_012 882 amino acids MW=95015 D pl=12.69 numambig=0 MARALYAQEAAFRSAFNACAAAARARGLDLGEWFREGEGDGRTLLGTALAQPALFAVEL ALARLWMSWGIEPAAMIGHSLGELVAACVAGVFTLEDAMSLVIDRGRFMQAAPAGSMLAV GLPAADVEGLLEAGLEIAAVNSPKLTWAGPASAIRDLAARLEAREVFARPLQTSHAFHC ALIDGAVAPFLESVRRARLSPPEIPWSNVTGALLTDAEATDPAYWARHLRQPVRFSDGV EALFASGHALFLEVGPGRGLTTLVRQTLAGRGGAAIASLGSTHAASEPASLAEALGQLWE AGHAVDWTARPRGRPPARSSACRRTRSSGRGTGSRRAAAPPPPSRRRPTRPSRPRRTPSR RRRSPARTPGPRSPRRRPSPSARSPAIWERPPRRPRDRRPRRLLRPARRLARRDPGDVPD PRAARRRAPAAGALPGADGRGARGPGRRRAPRRGAAARVPPDRADPPRRPAPAWRAAPA WRRSARAGQPGVQHPVRGARDGPARRRRAPPQPVRDRAAPRGAADDVQRQGRRAVPGRR ARGPGAVPDVRPRAPRGRGPRRGGLGARPRGEPRAVRPEPRAAPPRAGHPEAPRRARHRA RRAPRRVRRLVGRRLRRRARRALRRLRAGPAFPAAGAPGAVRRLRRSPAGVAVRRGPGGR APVLDDEALGRAAPGPRAGRPRAGGAKDVARGAAEPRRRGRAHPPDQGVLRAGGDLPLHG PARRVQARPAPAHRARGPRRGHRRREPEPRRDRADDRLLRQPARAPDRLRGRPDVRRAGP PRARRRARGVRAPGPAVRPARRGPAAEGGRRARAPLRREVRHAQRARPPDEARGARARGA GGRGDHDGVRLRPHGRRGRRLVPLRRRAQQRAVPGGDGRQFP*
Seq ID No 95
>Contigll_021 1213 amino acids MW=131017 D pl=12.40 numambig=0 MRGRRRRAAPHLRGARPARRGDRRGGLALRSDRERARRRGHGEGLGAGGRRARRAARRRR LPAPGPSPPRGAPQAPPGARRGPPRPHAVGGRRDDRLARGHRAPRRRRGRAVARAARRAP AARRKHGRPGLRDLHVRVHGAAQGRDDRPPRRGEHGPRHQPPLRRRPGGPGARALVAELR PVGLRRVRDARRRRRRRDPRPYPRLGSGALARARGARAGDRVELGPGADGDAHGRVPRRG RPGAWAPPRHDERRLDPAEAPRSHPRGLPRAPRREPRRRDRGVDLVDRPPDRGRRPGVA QHPLRPPAREPAHLRARRGAGAVSDRGPRRDPHRRDRRRARILARRGADPGAVPEAPHDR RAAVQDRRPGPLLRRRDHRAARAHRPSGEDPRLPHRARRDRGRPRAAPLGRAGGRGGEDR SVGREAPGRVRRRRRRRWRRAPRLRPEEAAGVHDPRGGRRPPGAAAERERQGGPRRPAGS RRGRAARRGRRPADRDRAAHRLGARGGLAGRGRRRDRQPLRARLHLAAPRARAAPARRAH RGARARRGRRGAGRVADRPVPVPDHRATGAAARRGDGQGRAGRRRGAARRGAPRRATPPG ARIAMSEP IETEDGGSDIAIVGMAGRFPGAPSVDALWENVRRGVES IARFPESEREEPPV
GASAAPGAPWCAGGLLDDIDRFDASYFGYSPREAQLMDPQQRLFLECAVAALEDAGCDP ARFPGAIGVFGGCGSNTYLLQLLSHPDLAATVDPHALMLASEKDYLATRVSYKLDLHGPS VWQTACSTSLVAVHMACESLLGGQCDLALAGGVS I GI PQKRGYPYVPGS I CSPDGRCRP FDARAEG7VGGSGVGIVALKRLADALRDRNTVHAVIRGSAVNNDGGRKVGFMAPSVDGQA AAISEAQSVAGVDPGS IGYVEAHGxATAIGDPIEVEALTQAFRRKTPRKAYCALGS IKAN IGHLDAAAGVAGLIKAAHWRSGEIPPCVHFEAPNPKLDLAASPFFVPREAAPWPRELRP RRAGVSSFGIGGTNAHWLEEPPPLPPRAPAPERDHVLTLSARTPEALSTACAQLAAHLE ATDVPLDDVAFTLQTGRAEHPYRRAWARTRAEAIQGLAREGASALARPDEPRPSSRSRA RARRPSGWPARSTRRRRRSGAPSTRARRRRGRAASISARSSSARARATGARCSAPRWRSP RSSPSSSRSPGSG*
Seq ID No 96
>Contigll_026 3079 amino acids MW=332984 D pl=5.97 numambig=0
MLTWDHHVWEYWSFALIVRELGELYSALRAGRPPQLPPPSSFFAAGVSCPSPREAAGG AEYWRKALDGATTAIDLPRDRARHDAGARRGRAHAITLPKPLTGALARLARERGTTLFSV LLSALTVLLHRASGQSDLWGVPSAGRHDDESARAFGYFVQMLPVRVALRGAASFDALVA RVRDAFLDALAHGDSALRHLLARAQGEAQRDALFDVAFAFQSTPPSLDARLSALAIGVGD VRIAQGELELTTLADEQAAAEFDLALFAAELDAGIALRFEYDQQLFDPATIERMARHFW LLESAVEHPGRPLSELRMLSDAERALLLDDWSGAAAARQAASAPAPACVHALFEAHAARQ PDATALEFGHQRFTYAQLSTWSTELALWLRDRGVGPGSWGVCIERSPRMVAAQLAVLKA GAAYASLDPANPPARLAEMLADCRASLALTSSQASHKLTAAPCPVHLVQDGACAPSTHIP LVSRPDDLAYVLFTSGSTGTPKGVCVRHASLSRLVSFLHLRLDLSPSDRWTQVASSGFDA SVYEIWTPLACGAALLLADDDALRSPTALVSWLVAQRATLSFMPTPLAEACFEQDWTGSA LRAMTVGGDKLHPLRRPLPFRLFNMYGPTEATVITTVAEIADLGAEPPLGRPVDSALVYV LDPHMQPVPPGALGELYIGGACLAQGYTRTDLTAERFLPDPFGQPGARLYRTGDLVRWRP DGQLAFAGRRDEQVKLRGRRVELGEVESVLRRLPGVREGIWLHGQGSAAHLIAHWPDA HPPSERDLREGMARLVPDALVPAHFVLLPALPMSLSGKVDKKLLPAPPAAHADYEPPSGE LELELAHIWQSVLHLDRVGRHDSFFDLGGHSLLAMQVLGRIESSLGIRTTLRTLFEHPTL AQLASHLSSGAASTSAAAATALERGLTRPDGPSSPRVATPEEPFALTEGQRAMWLECQKS ADGALYNLGRTVRLGAGVDVAALRRAFEGLVERHEALRTTFLTRDGHPLQQVHRHVALEW
AEΞPAMALDEREIVARADEVRRRAFDLERGPLLRVHVWRRGEGQPPLLTVWHHLWDYW SFALLVRELGELYSALRAGRPPQLPPPSSFFAAGVSCPSPREAAGGAEYWRKALDGATTA ID1PRDRARHDASPRRGRAHAI TLPKPLTGALARLARERGTTLFSVLLSALTVLLHRASG QNDLWGVPSAGRNDDESTRAFGYFVQMLPVRVALRGAASFDALVARVRDAFLDGLAHGD SALQHLLAEPRGAARRGGALFDVAFAFQGALPSLDPRLAALTTGAEDVRIAQGELELTTL ADEQAAAEFDLALFAAELDSGIALRFEYDQQLFDPATIERMARHFVLLLESAVEHPGRPL SELRMLSDAERALLLDDWSGAAAARQAASAPAPACVHALFEAHAARQPDATALEFGHQRF TYAELSTWSTELALWLRDRGVGPGSWGVCIERSPRMVAAQLAVLKAGAAYASLDPANPP ARLAEMLADCRAALVLTSSQASHKLTAAPCPVHLVQDGACAPSTHIPLVSRPDDLAYVLF TSGSSGTPKGVCVRHASLSRLVSFFQHLLALSPRDRWTQLASSGFDASVYEIWTPLACGA ALLLADDDALRSPTALVSWLVAQRATLSFMPTPLAEACFEQDWTGIALRAMTVGGDKLHP LRRPLPFRLFNMYGPTEATVITTVAEVADLGDEPPLGRPIDSALVYVLDPHMQPVPPGVL GELYIGGACLAQGYTRTDLTAERFLPDPFGQPGARLYRTGDLVRWRPDGQLAFAGRRDEQ VKLRGRRVELGEVESALRRLPAVREGVWLHGQGSAARLIAYWPGADPPSERDLREGMA RLVPDALVPAHFVLLPALPMSLSGKVDKKLLPAPPAAHADYEPPSGELERELAHI QSVL HLDRVGRHDSFFDLGGHSLLAMQVLGRIESSLGIRTTLRTLFEHPTLHQLADRLSSGAAS TTAAAATVPASEIAPSLGRAPADEPYPLSYEQERLWVLEQLLPGGTAYNWQAVRLRNLV DVDALSSALAALVRRHWSLRTVFVASPTPAQKICEPEAAPAEWDLRGTPPDEAEAAARA WASREQATGFDLARGPVFRARLFRLDHDVCVLVLSTHHIVTDAWSFQPLVRDLAELYRRA RGGGPADMPELPLQYVDFAVWQRRHLAGKRLADKLAHWTATLRGLPVLELQTDRPRPPVQ TFRGAERVLPLDARLVAQLDELARSRGATRFMVLLAALGVLLRRSSGQDDLAIGTAVANR PRPELEPLVGFFVNTIVMRLDLGGDPTFEELLSRARKVALEAFEHQDAPFEKWEAVNPR RDLSRSPLFQVMLWQNAPTEALELGEVRIEPLDLPVEATRFDLRFSVEPRGGRDVISLQ YNVDLFDAATIDRMLATMQSVLSRATQDPAQRVRALSVAPEDRERALVAWNDTAVATPDH LRLEEPFFERAVEQPDACAWDAERRLTYGELARRAEAIAAAASRSGATANALVAWMEK GWEQVAAVLGVLRAGAAYLPLDPRLPEERLRHLLEHAEVRLVLTQSAVDGTIAWPAGIER LAVDADERWREQPVARRPPGGSTDDLAYVIYTSGSTGLPKGVMIDHRGAVNTVLDINRRF DVGPEDRVLALSSLSFDLSVYDVFGTLAAGGAWIPDRTRASDPGHWRELVERERVTVWN SVPALMEMLMDASPGAGDPALSSLRLVMMSGDWIPLKLPDRIRAACRAPRWSLGGATEA S IWS IAHPIADVDPAWRS I PYGRPLANQHTYVLDEGLEPCPIGVPGEIHIGGIGVALGYW
RDEARTRERFLKHPTTGERLYRTGDLGRYFADGT I ELLGRTDHQVKI RGFRI ELGE I EAA LAQHPSVEQAWAAKTDPSGEKRLVAYWGADGDGAALRDFVRKKLPEYMIPAEVWLPA LPLSANGKVDRAALPDPAAVAPRAAAVAPRTATERLIASVLAEVLQVEAVGVTDNLFELG FTSLLLVRAQRLLAERIAARAPDEGAAAQAVSLTDLFQYPTIEQLAQRLDAATVKAEPAD VGAQRAEARRDARRRRGRG* Seq ID No 97
>Contigll_011 544 amino acids MW=60164 D pl=9.10 numambig=0 MMSRIRAQLGVELPLRALFQGPTVAALAAQVDAARRGEARRREFPPIARIPRDGPLPLSF AQHRLWFVDQLEPGSPAYNIPFWRATGRLDVDALRRSLFEIARRHEALRTTFSARDGVP FPWAPEARVPFRMSDLEHLAGEALDAAVSALVLEESLAPFDLSRGPLLRVRVIRKRHDE HVIALWHHWFDVWSVGVFVGELAALYGGFAQGQPSRLPELPAQYVDFAAAQRAWLSGE VLEGELRYWTTKLSGALRRARVPVDHEPAGRRTWRGARRSLDAGAELTRQIKAFCEREAI SPFMALLAAYKLVLHQRTGLEDLWGTDVANRNRVETEPMIGFFVNQLVLRTDCGGDPTF GALVRRVRDVALEAFEHQDLPFDRLVEALRPKGAVGHVPLFDAKFVMRNVHVPPMKLEGL ELEALEGEATTTAFDFVLTVAEAGGSFRFGVEHSSELYRAATVDNFLSDYRQILATATAR PDTPVSELRGELERAAAARRELERKAARGAALDKLTSARRRAVTLPRPGAPGEAKTSPKD DLDE*
Seq ID No 98
>Contigl2_001 514 amino acids MW=56145 D pl=8.82 numambig=0 PPAVRRYVADRRPEQLPALAPEEREAAARRLSALGAAPPQVRRRGLTRAPLSYGQSRIYF LEQLSPGKPLFNVPGAVRLRGPVDVARLSAAFGEIVRRHDALRTSIANVDGELLQIAQPH AGFALDWTSTPEEAAELDRRLRAEAWRPFAIGAPPLLRATLFRLAEDEHVLLVTMHHW SDDWSLGVILRELLALYAGRSLPPPRLQVSDFAAWQREMVESGALDGQRAYWRERLRGLS RASISAGGGAEAPSHDPSGAIEEIALSPDKAAALEALARREGATLFMVLLALLDLVIHAR SGALDIAVGTPIANRNRPELEDWGLLTNTLVIRVDLARAGAFRDVLARARVQALDAFAN QDIPFDWTQDLKQERDHAQHPLFRVWLALQNAPKPALEVRGLRVEPLPLRPELVHFEVA LLLWPADDGSWGHFEFRRDRVDEGARKEIAAAFTHLVDAVIARPDAPVSTLVEGARAEA ARAQAALGEAFARAATARLGQLRRRSAGDRTPRE*
Seq ID No 99
>Contigl2_009 582 amino acids MW=65555 D pl=8.72 numambig=0
MREPSSTPNWRNFGSNLPAGSDSVPPGEGFPIKKILALNLGKWKDTAGLQIAQALHLFEY GYKRYREGKFVLRATSDLGLGAIFESIDNWESFDQFEEFFKPWTFIRKPLVATRWAEDAE FGRQRLVGINPAHIRRATPADLADFVSGAEPKPIAIADGRTLEEVREGGQLYFLDYRIFK DIVDTDVQEELGKYPLAPTCMLHQTAAGELLPVAIRLVHSRPGKGAHPDKIFTPSGPSDD WLTAKIAVASADAIYQGQVTHLLYAHLIVEPFAVSTYRNLPATHPLHQLLRPHFFNTLAI NELARRRFLGRGRFFDITSSVATMGSFELLTRAYTGKGIKGYGGKPWRFYESALPRDLSA RDVRDLVGYHYRDDALLHWDAIQEYVGQVLKIAYPTPGSLSSDASLQRWIHELVSPQLGG MDSLLPPERADQLEKLTSLDDLIAIVTNIIFTATAYHAAVNFGQTDYYTWIPNAQFATYR SYGDVLNGSEKRQFKPLERLPGRAQSIRQMVLSRSLSMGPPLTSESLMTMKCLLQDPAAK QAFARYRERLAHIEREITERNRAREQPYLYLLPSMVPQSVAI*
SEQ ID No 100 (>ORFl)
VSQRTSCYLRGGGVCSMNDAFLALERNERNRPSTVIDLLRQRAEAEPARPIYCFLESGDVEAG ATWVTLREIDERARTVAALLQASGVAPGARALLLYPPGIEYITAFFGCLYAGVRTVPAYPPDL GRLERTLPRVASIVADARAEAAL7SSAVAGIVASLPASAAAAALQRLRWIATDGPSPGPIEGP GAALRPESVAFLQYTSGSTGEPKGVMLTHGNLLHNSRLIAHGFDLTSPDPVGVIWLPPYHDMG LIGGILQALYRRIRVALMSPLSFLQRPMRWLRAVSALGASVSGGPNFAYDLCVRKSSEEERAA LDLRSWEVAFTGAEPVRADTLDRFARAFAVSGFRREAFYPCYGLAEATLIVSGGARAEAPVLA RLAPEEVELGRAVASAAEGARVFVGSGRALDPRAVAIVDPAGNELGPGEIGEIWVSGPSVAVG YWGRPEETEATFGATLAGSAAPRYLRTGDLGFLRGGELFWGRSKDLIILRGRNHFPQDIEKT VESSHRAVRPGCSAAFSVEHEGEERLAWCEVDPRVAADPREIVAAREAVTAEHQLVAHAVAL IAPGALPKTSSGKVRRRECRRAFLEDALGERHVAFAPELLDDASPPDDAPPETEEPSGRSLLD ALRSTLARALRLDAGQIDDALPISRFGLDSLAAVELQHAFQVRTGRAIPLTSILRGGSLRLTR EITRLDGPSSPRVATPGGAVCADRWGTGRFGSSAISRPMERFTTWAGRSGSVPAFKRVDLRRA F
SEQ ID No 101 (>ORF2)
VYSSAYVLFAVCAGTTRVASAPETAGFPLECVGDDGTVLGPDSFWGYTQVYVFKKERLNTNP PIDGFTLKLDGNEVAPGEDGLPWKRCVRSEEQAQGCGRTEPAEDECTTYEIEAWPEKAAEV DEEAAGLGGPPAREAIWVDYYTDGGEFDGARRLVSDTTGASRGGNGTTWTPPSEPGRVSLWAV VHDTRGGASVTRREVQVE SEQ ID No 102 ( >ORF3 )
WGTVLSAGTGEPLPDIAVTLVRPDGGREEAKTDQGGKFRFKNLPPGKYRVEVAAAGFEPFAA EEEIAAGEAIEVRYRISLAAPQDGKAPGIEVTVQGERPPREVTRRTIERREIDRIPGTGGDAL RSLQSLPGVARSGFGLLIVRGSAPQDTLTFVDRTPVPIIYHFGGLSSWPTEMLEKIDFYPGN FSAVYGRAMGGIVDVGLRSPKQDGKYHGWQLDLIDGRVLLEGPVPFLKDWTFIAAGRRSWVD AWLGPVLKEAGSSVTQAPVYYDYQFVLEGRPSASERVRASFYGSDDAFKITLDKPPEDEPALT GDFGLHTAFQRFQLSYENRIGSRDRLLWSMALGRDIADFEISPLAFNWSTSLDLRLELSHRF ARYLTMNVGTDLSGGVATVNIRAPSQQPAGHPSNQPFSTYPFQDRSFDGAYSRPAAYAELEW PSPRARIVPGVRVDYALDTQTLDVSPRVNARYDIRSGFPRTTAKGGVGLYYQAPQFAESIEPF GNAELKSNRAVHYGLGVEQEITPQIEVTLDGFYKQLDRLWFSPEKDDYADGTGYAVGGELLL KYKPDERFFGWAAYTLSRSVRKDGPDEEEHLTQFDQTHVLTVLGSLRLGRGWELARFRLVSGN LQTPYVCDPEEKGCNPNRVNAIYHASSARYSPIPLGGDYSERMPLFHQLDIRADKTWKFKRWQ LGLYLDIQNVYNYMAAEGISYNFNYTKREYVTGLPFLPTLGLRGDF
SEQ ID No 103 ( >ORF4 )
VIAVDNNPEAVDAVKDKTSAAFVGDATVHKVLEGIGAQYVETAIVTFGEHFEPSVLCVASLVR MGVRIIARAATDRQADILRAVGATRVIQLETEMGRRVGADITMPLAQDLLDLASHYRWPWNA HGPLVGQTLAGSKIRQRYRINVLGVRPHTNKRPGDKPRLEAPTPDYVIRDGDTLLLVGDSDDV SRFVAEVGG
SEQ ID No 104 (>ORF5)
SGSSGGGSSAEGSRCQPSGGGPHWLLEGETVTFPVTCASGLALAGDAFEVGPLPEGAAYDPIA REVTFSPGLDQAAVYDIEIRVAQTSEVGRVKVGVADAFADPSNVPWDPTRYPEEYGLPVLFL SPVPEDKEYAPATVIYRGHTYAAEAELRGESSLSYPKRSYTLKFPKDDKFNEPDEAGGFTDRR KWLITTFDDNSYVRQRLAYDLWNRLDPEHIQIKTYSAVLYLDGEYAGLYTVADHVDGYLMED
HGYPQDGNLYKAVSHDANFALTDRSGDPKDTLHDGFEKKEGAPAEGEPEAFSDLEDLVSFVAE SDDATFAAEIGSRIDLRDYEDWWIFVTFIVANDSAGKNSYHYRDPAADGVFRYAPWDFNASFG QSWETEREPASDRVDYRDVNLLFERLLEEPSIGDPLRARYDQVLRGALAEAEIHAIVDGYVER IDASARRDEARWGEAYRSYEGWSWRDDFTTYEEEIAYLK AWISERWQHQDELY SEQ ID No 105 (Contig 11 >0RF1)
VLDVWSTSDQVACRLHCAGAGPSASLELRYDASAGARRDAERLAERLAALLEDLSRHPERPVA QGEVGPGERAEIEAWSRGPAMELPSACALHRWFEERAEQHPDWAVRSEGKSLTYGELERRAN RLASCLRRRGVGLDTIVGVCVPRSEDMWATLAVLKVGGAYLPLDHEYPGERLAFMMRDARAR LLVTHDAIADELPTGGWTTLLLDAEAAEIAACSDARPAVSPPPDSGAYVIYTSGSTGTPKGSL ISHRAIVNQMQWIQRYWALTADDRVLLKAAFGFDVSVWEIFWPLSFGARIWARAGGHRDPEY LRRLVRDEGATTAYFVSSMLAAFLGGPEQPFPASLRKVLVGGEAVPLDLVRRFYAKHDGDLIN MYGPSEAAIAVTGCVLPSDPRVTWVPLGAPVANAEVFVLDGAMRRPAIGALGDLYIAGAPLAR GYVGQPGLTAERFLPDPCARAAGGRMYRTGDVARFLPDGMLEFQGRSDHQIKLRGHRIELGDV EAQIRRVPGVGQAAWLREDAPGDARLVAYWLDGDAAGDAPDVRAGLKASLSAYMIPSSWR LYALPMCSERLAFTGSSYAGCLL
SEQ ID No 106 (Contig 11 >ORF2 ) MSDHEMTGFSLSPQQRAIRALDREAGAPGCRTLAWAVTGPCDEGRLSAAALALAERHEILRT RLVEGRARPRRWSASRASRGRQQDD VGCSEAEQGERMSRLVARLSEDRGADDGLRVGLVRVG PEERRLVLAAPAWCVDEESIAPLVRELCASTAGAGAPPEQQYADVAEWLNGMLESEDAGDGRR FWAERRSHFGPPLHLAFSRGGAGAGAGSGRARVDLQGGMAQVERWSSSWQVPQRIVLLALWAS LLWRMSGGNEPEVTVAVRFDGRSLDALAGAVGPFARFLPVRIEISASDTLADVARRLALAEAE AAAHQDAAPGVSHRMSWGLLRRGGRAGAVARRRAGPRARRLEHV
SEQ ID No 107 (Contig 11 >ORF3 )
MSRIRAQLGVELPLRALFQGPTVAALAAQVDAARRGEARRREFPPIARIPRDGPLPLSFAQHR LWFVDQLEPGSPAYNIPFWRATGRLDVDALRRSLFEIARRHEALRTTFSARDGVPFPWAPE ARVPFRMSDLEHLAGEALDAAVSALVLEESLAPFDLSRGPLLRVRVIRKRHDEHVIALWHHV VFDVWSVGVFVGELAALYGGFAQGQPSRLPELPAQYVDFAAAQRAWLSGEVLEGELRYWTTKL SGALRRARVPVDHEPAGRRTWRGARRSLDAGAELTRQIKAFCEREAISPFMALLAAYKLVLHQ RTGLEDLWGTDVANRNRVETEPMIGFFVNQLVLRTDCGGDPTFGALVRRVRDVALEAFEHQD LPFDRLVEALRPKGAVGHVPLFDAKFVMRNVHVPPMKLEGLELEALEGEATTTAFDFVLTVAE AGGSFRFGVEHSSELYRAATVDNFLSDYRQILATATARPDTPVSELRGELERAAAARRELERK AARGAALDKLTSARRRAVTLPRPGAPGEAKTSPKDDLDE
SEQ ID No 108 (Contig 11 >ORF5) MSEPIETEDGGSDIAIVGMAGRFPGAPSVDALWENVRRGVESIARFPESEREEPPVGASAAPG APWCAGGLLDDIDRFDASYFGYSPREAQLMDPQQRLFLECAVAALEDAGCDPARFPGAIGVF GGCGSNTYLLQLLSHPDLAATVDPHALMLASEKDYLATRVSYKLDLHGPSWVQTACSTSLVA VHMACESLLGGQCDLALAGGVSIGIPQKRGYPYVPGSICSPDGRCRPFDARAEGTVGGSGVGI VALKRLADALRDRNTVHAVIRGSAVNNDGGRKVGFMAPSVDGQAAAISEAQSVAGVDPGSIGY VEAHGTATAIGDPIEVEALTQAFRRKTPRKAYCALGSIKANIGHLDAAAGVAGLIKAAHWRS GEIPPCVHFEAPNPKLDLAASPFFVPREAAPWPRELRPRRAGVSSFGIGGTNAHWLEEPPPL PPRAPAPERDHVLTLSARTPEALSTACAQLAAHLEATDVPLDDVAFTLQTGRAEHPYRRAWA RTRAEAIQGLAREGASALARPDEPRPSSRSRARARRPSGWPARSTRRRRSGAPSTRARRRRGR AASISARSSSARARATGARCSAPRWRSPRSSPSSSRSPGSG
SEQ ID No 109 ( Cont ig 11 >ORF6 )
WDHHVWEYWSFALIVRELGELYSALRAGRPPQLPPPSSFFAAGVSCPSPREAAGGAEYWRK ALDGTTAIDLPRDRARHDAGARRGRAHAI TLPKPLTGALARLARERGTTLFSVLLSALTVLLH RASGQSDLWGVPSAGRHDDESARAFGYFVQMLPVRVALRGAASFDALVARVRDAFLDALAHG DSALRHLLARAQGEAQRDALFDVAFAFQSTPPSLDARSALAIGVGDVRIAQGELELTTLADEQ AAAEFDLALFAAELDAGIALRFEYDQQLFDPATIERMARHFWLLESAVEHPGRPLSELRMLS DAERALLLDDWSGAAAARQAASAPAPACVHALFEAHAARQPDATALEFGHQRFTYAQLSTWST ELALWLRDRGVGPGSWGVCIERS PRMVAAQLAVLKAGAAYASLDPANPPARLAEMLADCRAS LALTSSQASHKLTAAPCPVHLVQDGACAPSTHI PLVSRPDDLAYVLFTSGSTGTPKGVCVRHA SLSRLVSFLHLRLDLSPSDRWTQVASSGFDASVYEIWTPLACGAALLLADDDALRSPTALVSW
LVAQRATLSFMPTPLAEACFEQDWTGSALRAMTVGGDKLHPLRRPPFRLFNMYGPTEATVITT VAEIADLGAEPPLGRPVDSALVYVLDPHMQPVPPGALGELYIGGACLAQGYTRTDLTAERFLP DPFGQPGARLYRTGDLVRWRPDGQLAFAGRRDEQVKLRGRRVELGEVESVLRRLPGVREGIW LHGQGSAAHLIAHWPDAHPPSERDLREGMARLVPDALVPAHFVLLPALPMSLSGKVDKKLLP APPAAHADYEPPSGELELELAHI QSVLHLDRVGRHDSFFDLGGHSLLAMQVLGRI ESSLGIR TTLRTLFEHPTLAQLASHLSSGAASTSAAAATALERGLTRPDGPSSPRVATPEEPFALTEGQR AMWLECQKSADG
ALYNLGRTVRLGAGVDVAALRRAFEGLVERHEALRTTFLTRDGHPLQQVHRHVALEWAEEPAM ALDEREIVARADEVRRRAFDLERGPLLRVHVWRRGEGQPPLLTWVHHLWDYWSFALLVREL GELYSALRAGRPPQLPPPSSFFAAGVSCPSPREAAGGAEYWRKALDGATTAIDLPRDRARHDA SPRRGRAHAITLPKPLTGALARLARERGTTLFSVLLSALTVLLHRASGQNDLWGVPSAGRND DESTRAFGYFVQMLPVRVALRGAASFDALVARVRDAFLDGLAHGDSALQHLLAEPRGAARRGG ALFDVAFAFQGALPSLDPRLAALTTGAEDVRIAQGELELTTLADEQAAAEFDLALFAAELDSG IALRFEYDQQLFDPATIERMARHFVLLLESAVEHPGRPLSELRMLSDAERALLLDDWSGAAAA RQAASAPAPACVHALFEAHAARQPDATALEFGHQRFTYAELSTWSTELALWLRDRGVGPGSW GVCIERSPRMVAAQLAVLKAGAAYASLDPANPPARLAEMLADCRAALVLTSSQASHKLTAAPC PVHLVQDGACAPSTHIPLVSRPDDLAYVLFTSGSSGTPKGVCVRHASLSRLVSFFQHLLALSP RDRWTQLASSGFDASVYEI TPLACGAALLLADDDALRSPTALVS LVAQRATLSFMPTPLAE ACFEQDWTGIALRAMTVGGDKLHPLRRPLPFRLFNMYGPTEATVITTVAEVADLGDEPPLGRP IDSALVYVLDPHMQPVPPGVLGELYIGGACLAQGYTRTDLTAERFLPDPFGQPGARLYRTGDL VRWRPDGQLAFAGRRDEQVKLRGRRVELGEVESALRRLPAVREGVWLHGQGSAARLIAYWP GADPPSERDLREGMARLVPDALVPAHFVLLPALPMSLSGKVDKKLLPAPPAAHADYEPPSGEL ERELAHIWQSVLHLDRVGRHDSFFDLGGHSLLAMQVLGRIESSLGIRTTLRTLFEHPTLHQLA DRLSSGAASTTAAAATVPASEIAPSLGRAPAD EPYPLSYEQERLWVLEQLLPGGTAYNWQAVRLRNLVDVDALSSALAALVRRHWSLRTVFVAS PTPQKICEPEAAPAEWDLRGTPPDΞAEAAARAWASREQATGFDLARGPVFRARLFRLDHDVC VLVLSTHHIVTDAWSFQPLVRDLAELYRRARGGGPADMPELPLQYVDFAVWQRRHLAGKRLAD KLAHWTATLRGLPVLELQTDRPRPPVQTFRGAERVLPLDARLVAQLDELARSRGATRFMVLLA ALGVLLRRSSGQDDLAIGTAVANRPRPELEPLVGFFVNTIVMRLDLGGDPTFEELLSRARKVA LEAFEHQDAPFEKWEAVNPRRDLSRSPLFQVMLWQNAPTEALELGEVRIEPLDLPVEATRF
DLRFSVEPRGGRDVISLQYNVDLFDAATIDRMLATMQSVLSRATQDPAQRVRALSVAPEDRER ALVAWNDTAVATPDHLRLEEPFFERAVEQPDACAWDAERRLTYGELARRAEAIAAAASRSGA TANALVAWMEKGWEQVAAVLGVLRAGAAYLPLDPRLPEERLRHLLEHAEVRLVLTQSAVDGT IAWPAGIERLAVDADERWREQPVARRPPGGSTDDLAYVIYTSGSTGLPKGVMIDHRGAVNTVL DINRRFDVGPEDRVLALSSLSFDLSVYDVFGTLAAGGAWIPDRTRASDPGHWRELVERERVT VWNSVPALMEMLMDASPGAGDPALSSLRLVMMSGDWIPLKLPDRIRAACRAPRWSLGGATEA SIWSIAHPIADVDPAWRSIPYGRPLANQHTYVLDEGLEPCPIGVPGEIHIGGIGVALGYWRDE ARTRERFLKHPTTGERLYRTGDLGRYFADGTIELLGRTDHQVKIRGFRIELGEIEAALAQHPS VEQAWAAKTDPSGEKRLVAYWGADGDGAALRDFVRKKLPEYMIPAEVWLPALPLSANGKV DRAALPDPAAVAPRAAAVAPRTATERLIASVLAEVLQVEAVGVTDNLFELGFTSLLLVRAQRL LAERIAARAPDEGAAAQAVSLTDLFQYPTIEQLAQRLDAATVKAEPADVGAQRAEARRDARRR RGRG
SEQ ID No 110 (Contig 12 >ORFl) PPAVRRYVADRRPEQLPALAPEEREAAARRLSALGAAPPQVRRRGLTRAPLSYGQSRIYFLEQ LSPGKPLFNVPGAVRLRGPVDVARLSAAFGEIVRRHDALRTSIANVDGELLQIAQPHAGFALD WTSTPEEAAELDRRLRAEAWRPFAIGAPPLLRATLFRLAEDEHVLLVTMHHWSDDWSLGVI LRELLALYAGRSLPPPRLQVSDFAAWQREMVESGALDGQRAYWRERLRGLSRASISAGGGAEA PSHDPSGAIEEIALSPDKAAALEALARREGATLFMVLLALLDLVIHARSGALDIAVGTPIANR NRPELEDWGLLTNTLVIRVDLARAGAFRDVLARARVQALDAFANQDIPFDWTQDLKQERDH AQHPLFRVWLALQNAPKPALEVRGLRVEPLPLRPELVHFEVALLLWPADDGSWGHFEFRRDR VDEGARKEIAAAFTHLVDAVIARPDAPVSTLVEGARAEAARAQAALGEAFARAATARLGQLRR RSAGDRTPRE
SEQ ID No 111 (Contig 12 >ORF2 )
MSIHIEEEGRADAAKAPPFDYLQALHSALAHENDPVKRKQIEAGMVFKWLREEPLPFLSQLRR EKPIFSIPAITLVTRYNDWEVLNANDVFSVDNIAPKLVENVGQNILAMNDSPKYEHEKSLLR LAFPRADLPRYRQIWDEANRLLAKVGVDKPFDLTGDYALRVPAGAMARYLGVGEIPTEKWA WTHALFNEIFLNPTNDPTAVAAARAARQEALPMIDAIVAARKKQLAKSPPPEQPSVLDRYLVM QSVPETYESDEGIRDVILGLLMGCVDLSGGAIVNALVELMKRPRVLRDALNWNVEDDAAITG YVLEALRFRPPSTGVTSLCVRDYTVGRGTRHEEKVPAGALVMACSASAMHDHEHIDAPDQFRP GRLPSRNYLFWESGIHTCHGKYVAILHISLAIKQLLRAGVPSAIDPMPRVHGYPAPFRVRLAA AEG SEQ ID No 112 (Contig 12 >ORF3 )
MREPSSTPNWRNFGSNLPAGSDSVPPGEGFPIKKILALNLGKWKDTAGLQIAQALHLFEYGYK RYREGKFVLRATSDLGLGAIFESIDNWESFDQFEEFFKPWTFIRKPLVATRWAEDAEFGRQRL VGINPAHIRRATPADLADFVSGAEPKPIAIADGRTLEEVREGGQLYFLDYRIFKDIVDTDVQE ELGKYPLAPTCMLHQTAAGELLPVAIRLVHSRPGKGAHPDKIFTPSGPSDDWLTAKIAVASAD AIYQGQVTHLLYAHLIVEPFAVSTYRNLPATHPLHQLLRPHFFNTLAINELARRRFLGRGRFF DITSSVATMGSFELLTRAYTGKGIKGYGGKPWRFYESALPRDLSARDVRDLVGYHYRDDALLH WDAIQEYVGQVLKIAYPTPGSLSSDASLQRWIHELVSPQLGGMDSLLPPERADQLEKLTSLDD LIAIVTNIIFTATAYHAAVNFGQTDYYTWIPNAQFATYRSYGDVLNGSEKRQFKPLERLPGRA QSIRQMVLSRSLSMGPPLTSESLMTMKCLLQDPAAKQAFARYRERLAHIEREITERNRAREQP YLYLLPSMVPQSVAI
SEQ ID No 113 (Contig 12 >ORF4 ) VSSSRSTGRVPRDRASPAGSCAPALVPGPPLSYASVMPPLDLHVALFGASGAGKTVLLAAFYR AQTQPSFQQEYAYKIQAVNKAQGNQLLGRFYRLEEGRFPDGSTRFDEYEFDFFPRDLPEPAVR IHWYDYPGRWWEDEPVDADEREAMRQGLIRLGMSQVGILLADGAKYRAEGTGYIRWLFEHFAD ECDRLRRASAATGDEVSFPREWILALSKADLCPPDYSARDFEREVCRDADDQLAKLCSVLRAE HAFGHRFMLLSSVAAPAGAQVDPRTSLGVRTLAPAILVSTVEGAVREAQAARKEKSAGETFFQ GLRDLVQFVDSLDDFLPKRYQIVSKILRFISIKDFATTRLDRLKKMREDAIRKGDTFTAVLTA MVAALRDDEGARAYHQNQ
SEQ ID No 114 (Contig 12 >ORF5)
MPAPAPLVETSRLLWRTRGEHWDYEFICVPEIPALPAWLSTLEAMLADADAGAGELRYGLLEI
DDRGQRAPRAYPYVAVRFLDPARRDWTGRQVQHFAAWFPPVPPEAVAELPEAVPADWHLRVLD GLAGTYGSGEVFGLPEATIRAWKRSHDESRAARAMAIVKATPPVSLGGGEAAPSRWTRVPTLK
KKPPEPPAAAGLLSVGAVPSGQGRRFGCFAIGAMMLAAFCRLMLACGVRLLGA
SEQ ID No 115 (Contig 12 >ORF6) VRFRSSLGPLLLAALGAALTVSAAPRSAEASVFDSASRWPEDADGHVRIPVCIDPTSSAEQRV DGAAGGLIHAPNPSLADVITRVRTALQGSWERWSSVRFTGWESCDSLLPATRMTYVGVRIHPD APNQSDSIGVYNKGGSVQFKPWGADFNRCIKYNWQTARVEYSFDCVEQYAIHEMGHAIGFMHE WHHPLVPSACSQREPLPASDVASGWPSSRRYIWNPGFYDYDSIMTYWSGCSDQDGVRFGSET LDAVDIQAVATVYPPVGGAPDVCNPGWFAGKRWFCAAQPTVSVGNSCSSGWVECLPHCNPRPF QGEWWTCPTNPYAVTGQSCSARWELCGD
SEQ ID No 116 (Contig 12 >ORF7)
VGESQGALVGGNALSTNALNLNALNLNALNLNALNLSGLSARNLAAIQDPGPSGALARDFLRY AASCALSSTASFDFSWTDSNGKRHDERYPGLLGVAPAWASGPLDDAGQRIVSSCVAARVNYYQ VPVLLSARSLRDPLKTLSSSQELIDYPDVEGAFWGNLFAAQPYINACYNSATVDNSRAYQRDC AAGHVTSGGQIVECGLIRIAGSCDRVCQKLNGAGQYYPSCVDRPGQSTATTKDVITTALP
SEQ ID No 117 (Contig 12 >ORF8)
VLAAHCERGGLTARAASLLARGAELAAARRAYLDAEGCYGRVEALLGALLPEERRARGLARFR LGRHTEALADLAAAREAAAAASEAGAEIELLLDEAMILDWTGEYRAARERVAAAERLAGRVAS PLLGARLLLGVGRSLHRADREDEAAAVLTRAAAQAARLGDEGHETHI IALLLLGFILASLGRV EEAARDLDAVILSCEERSDLMHLGAALNNRGLARALQGDRAGMIADFERTIALGRELGQPAFE LVGRYNLAEYLYLMDDLAAARPHARAVQAIAPRCGDRHAPVWTLLIARLRLYQGDEAGARRI ALRLRAARDDAGCEALKPSEDVLCAMIELATRDDDRAAWAALEERSARCSVGQERIEVLEARA LAALRRGRRADARAQLERALAAASTIPTVMGGRLRRWYAELTRATESDAPDIDLAAAEATFTG ARAREKVEY
SEQ ID No 118 ( Cont ig 12 >ORF9 )
QAYPDLWAERGRQELWLRQLPPRACAQLAREALGDAADGALIDRLVTQSEGQPFFLEELIRAT
AEGRGDALPETWAMVQVRLEALAPPARRILRAASVLGEVFWRGAVAHLLGGDEAAPLAEHLS ALVAGELCVRHREGRFPGEEEYSFRQALLREGAYAQLTKDDRALGHRLAADWLEAAGEADPLV
LAAHCERGGLTARAASLLARGAELAAARRAYLDAEGCYGRVEALLGALLPEERRARGLARFRL GRHTEALADLAAAREAAAAASEAGAEIELLLDEAMILDWTGEYRAARERVAAAERLAGRVASP LLGARLLLGVGRSLHRADREDEAAAVLTRAAAQAARLGDEGHETHIIALLLLGFILASLGRVE EAARDLDAVILSCEERSDLMHLGAALNNRGLARALQGDRAGMIADFERTIALGRELGQPAFEL VGRYNLAEYLYLMDDLAAARPHARAVQAIAPRCGDRHAPWVTLLIARLRLYQGDEAGARRIA LRLRAARDDAGCEALKPSEDVLCAMIELATRDDDRAAWAALEERSARCSVGQERIEVLEARAL AALRRGRRADARAQLERALAAASTIPTVMGGRLRRWYAELTRATESDAPDIDLAAAEATFTGA RAREKVEY
14. DNA sequence according to any of claims 1 to 5 wherein the DNA is selected from the group consisting of
(a) the following DNA sequences:
Seq ID No 119 (>Contigl7)
TTACGTTACTCATCCTATCTCGGCACCCTGTGTCGGTGATGTCGCTCGCC TCGAGCGCGAGCGGGACGACGTCGGCGCCGCGCTCGGTGAGCGCCGCCGC GAGGGCGCTCGCGAGATCGCTGGCGACGCCGGCCGGGGCCACGACGAGCC ACGTCCCCGCGACGTCGCCGCGTGACGCGGCGCTCACGGGTCTCCATTCG ACGCGGTAGCGCCACGCGCCCACGGTGCTCTGCTCTCGGCGGCTCCGCCG CCACGCCGACAGGGCCGGCATGAGGCTCTCGAGGGCCGAGCGCCGCCCGC TGTCGGCGACGTGGAGCGCGTCCGAGAGCGCCGCGACGTCGCCGCGCTCG ATGGCTCGCCAGAACGCGGTCTCCTCGGCGGACGCTCCCGGCGCCGCGTC CTCATCGTCCGACGCGTCGCCTGCGTCGAGCCAGAACCGCTCGCGCTGGA ACGCGTACGTCGGCAACGTCACGCGGCGCGCCCCGAGCGGAGCGAAGAAC GCACCCCAGTCGATGGCGTGCCCGCGCGCGTGGAGCTCGCCTGCCGAGAG GAGGAAGCGCTCGAGGTCGCCTTCGTCGCGGCGGAGCGAGGACACCACGG TCGCATCGCCGTCGATCGACGAGAGCGTCTCGTCGAGCGCGACGGTGAGC ACGGGGTGAGGGCTGACCTCGACGAAGAAGCGGTGGCCGTCGTCGAGCAG GGCGCGCGTGGCGTGCTCGAAGCGGACGGTGTGGCGCAGGTTTCGGTACC
AGTGGGCGGCGCCGAGGGCCTCGCCATCAAGCCTCTCGCCCGTCACCGCG GAGTAGAGCGGCACGGTCGCCGGGCGCGGCGCGATGCCGTCGAGCGCCTC CAGCATCGTCCGCTCGATGGCCTCCACGTGGGCGGAGTGGGAGGCGTACT CGACGCGGACCTTGCGGGCGAACAGCTGCGCCCCGCTCAGCTCTGCGACG AGCTCGTCGATAGCGCCGGGGTCTCCGGAGACGAGGGCCGCGTGAGGGCT GTTGATCGCCGCTATCGCCAGGCGTTCGCCCAAGGGCGCAAGGCGCGCCT CGAGCTCGGCGGTGGTGAGCTCGACGGCGGACATGGCGCCGCGTCCCGCG AGCTTCGTAATGGCGCGCGAGCGGAGCGCGACGACCCTGGCGGCGTCTTC TAGCGAGAGCGCGCCCGCGACGTACGCGGCCGCGATCTCGCCCTGGCTGT GGCCGACGACCGCGTCGGGCGTGACTCCGGCGGCGCGCCAGGTGGCGGCG AGGGCGATCATGACGGCGAACAGCACGGGCTGCACCACGTCGACGCGCTC GAGCATGGGCGCGGCGTGCGCTTCGTCGCCGCCGAGCACGGCGAGGAGCG ACCAGTCGACGTGCGGCGCCAGGGCGCGCTCGCACGCCTCGATCTCGGCC CGAAAGGCGGGCGAGGAGGCGAGCAGAGCGCGCGCCATCGATGGCCACTG CGAGCCCTGGCCGGGGAAGACGAAGGCGACCTTGCCCGGCGGGAGCGCCT CGCCCGCGACCGTTCCTGCCCCCGCGCGCCCCTCGGCGAGCGCCGCGAGC GCCGAGAGCAGCGCGGCGCGATCGTCTGCCACGACGGCGGCGCGACGCTC GAAATGCGACCGCGTGGTCGCGAGCGACGCCGCGACGTCGACGAGGGCGA CGTCCTCGTGCTCGGCGAGGTGCGCGTGGAGCTTGCCCGCCTGAGCGCGG AGCGCCGCGTCGCTCTTCGCCGAGAGGAGCACCGGCACCGGCGGCGCGAA GGGCGCGCGGGCGGGCTCCCCGGCCTGGTCGTCGCCGGCCGCCGCGCGCG GCGCTTCCTCGAGGACCACGTGCGCGTTGGTGCCGGAGATCCCGAACGAC GACACCGCCGCGCGCCGAGGAGACCCGCCTGGCTTCCACGGTACCTCCTC GGTCAAGAGGCGGATCGCGCCGGACGACCAATCGATGTGCTGCGACGGGC TCGCGGCGTGGAGCGTCCTCGGGAGGACGCCGCTCTGCAGCGCGAGCACC ATCTTGATGACGCCGCCGATCCCCGCGGCGGCCTGCGTGTGCCCGAGGTT CGACTTTAGGCTCCCGAGCCACAGCGGGCGCTCCTTCGCGTGCGCCGCGC CGTACGTCGCGAAGAGCGCGCGCGCCTCGATGGGATCGCCGAGCGTCGTG CCGGTTCCGTGCGCCTCGACGGCGTCGACGTCCGCGGGGGCGAGCCCCGC GCTCGCGAGCGCGTCCCGGATCACGCGCTCTTGCGCGGGGCCGTTCGGCG
CCGTGAGCCCTTGGCTCTTGCCGTCCTGGTTGACGGCCGATCCGCGCACG ATCGCGAGCACGGGGTGCCCGTTCTTCCGGGCGTCCGACAGGCGCTCGAG GAGCACTATCCCAGCGCCTTCCGACCAGCCCGCGCCGTTCGCGTGCGACG AGAACGACTTGCACCGCCCGTCCGGCGCGCCCGCGTGCTGCGCGCTGAAC TCGCCGAAGATCCCGGGGGTCGCCATCACGGTCACGCCGCCGGCGAGCGC GAGCGAGCACTCGCCTCGACGGATGGCGTGGCAGGCGAGGTGGAGCGCGA CGAGCGACGAGCTGCACGCCGTGTCGACGCT
Seq ID No 120 (>Contigl8) TTTTAGGANCCCCGACGTGCACGATCGGCTCGCCAACCTCGTGGCGCGCC GGGACTATTTTTACCAGCTCGCGTTGCGCGCCGCGGGGACCTACGTGCGG GGCCTCGTCCGCGCCCCGCACGACGGCGCGCGCCCCCCCGCGTTCGCGCC GCGTGGGGCGGCGCTCGTCACGGGCGGGACCGGGGCGCTCGGGGCGCACG TTGCCCGTTGGTTCGCGCGGATCGGCGCCGAGCACATCGTGCTCGCGAGC CGCCGCGGAGCCGCGGCCCCCGGCGCGGCCGCGCTCGCCGAGGAGCTTTC GGTGCTCGGCGCGCGCGTGACGCTGGTTGCGTGCGACGTCCCCGATCGTG AGGCGGTCGCGGGGCTCGTGCGCAACGTCAAGGCCGGCGGAGCGACGGTG CGCGCCGTGTTCCACGCGGGCGGTGCGATGCACGAGGCGCCGGTCGCCGC CATGCGTGTTGAGGAGCTCGCCGACGCGATCGCCGTGAAGGCCCGCGGCG CGCAGCACCTCCAAGACGTCTTCGCGCAGCGCCCGCTCAACGCGTTTGTC CTCTTCTCGTCAGAAACGGGTGTGTGGGGCGGTGGCCGGCAAGGCGCGTA CGCCGCGGCGAACGCGTTCCTCGACGCGCTCGCCGAGGCGCGTCGCGCGG ACGGCCTCGCGGCGACCTCGATCGCGTGGGGCGCGTGGGCGGGCGGCGGA ATGCTCGCGACCGACGCCGAGCGGCGCTTGAAGCATCGCGGCGTCGCGCC GATGGATCCGGAGCTCGCCGTCGCGGCCCTCGCGCACGCGCTCGATCACG CCGAGACGTGCCTCGCCGTCGCTGACGTCGACTGGGCGCGCTTCGCCCCG TCGTTCGCCTCGGCGCGTCCTCGCCCGCTCCTCGACGAGCTCGCGGAGGC GCGATCGGCGCTCGACGCGCTGCGCGAGCCACCGGACGACGCGCGCACGG CCGCCGGTCCCGAGCCCGCAAGCACGCTGAGGACCACGCTCGCGGCGCTC CCGGAGGGCGAGCGCCACCGCCACCTCCTCGCGCTCGTGCGGACGGAGAC GGCGGCGGTGCTCGGGCACGCGGACGCGTCGCGCGTCGAGCCGAACCGCG GGTTCTTTGACCTCGGGCTCGACTCGCTCATGTCCGTCGAGCTCCGCAGG CGCGTCCAGCGCGCGACCGGCATCAAGCTCCCGGCGACGCTCGCGTTCGA CCACCCGACGCCGAGCGCGCTCGCGAGCAAGGTGCTCGCCGCGATCGTCC
^GCCCGCCGCGGAGCTCGAGCGCCTC GAGGGGATGCTCTCGGCGATCTACGCGGACGAAGCGCTCCGCGACGACCT CACGGCGCGCCTCCGCGCCTTCCTGGACAAGCGCGCGGTCCGCACCGAAC GCCCCGACGACGCCGCGTTCGCCGAGAAGCTCGGCTCCGCGAGCGCCGAC GAACTCATTCGCCTGATCGATCAGAAGCTCGGAGATCGCATCGATGTCGA CCGTTACTAACGACACGCTCACGGAGTACTTGCGGCGCCTCACTCAAGAG CTCCACAGGAGCGAGACGCGCCTGCGTGCGACGGAAGAGAGGCGACATGA GCCGATCGCCATCGTCGGCCTCGGGCTCCCCTTCCGGGGCGGGATCCACG ACCGCGACACGCTCTGGACGTTCCTCGAGGAGGGCCGCGACGCCATCGCG CCGATCCTCGCGAGCCGCTGGAACGCGGACGCGACGTACGACCTCGATCC GGACGCCGTCGGCAAGAGCTACGTGCGCGACGCCGCCATGCTCGATCGCG TCGACCTTTTCGACGCCGATTTCTTCGGGATCAGCCCGCGCGAGGCGAAG TACGTCGACCCGCAGCACCGCCTC7TGCTCGAGACGTCGTGGCAAGCGCT CGAGGACGCGGGGATTGTGCCGGCGTCGCTGCGAGACTCGAAGACCGGCG TCTTCGTCGGCACGGGCGCGAGCGACTACGCGTTCCTCCAGAGCGATCGC GACGCCTCGGAGGCGTACGCGTTCATGGGGATGATCTCGTCGTTCGCGGC GGGCCGCCTCGCGTTCACGCTCGGGCTCCAAGGCCCCGCGCTATCGATCG ACACGGCGTGCTCTTCGTCGCTCGTCGCGCTCCACCTCGCGTGCCAGTCG CTGCGTCAAGGCGAGTGCGACCTCGCGCTCGTCGCGGGTGTGCAGGTCAT GTCGTCGCCGGAGGTGTTCGTGCTGCTCTCGCGCACGCGCGCGCTCGCGA GCGACGGGCGATCGAAGACGTTCTCGGCGAACGCCGACGGCTATGGCCGC GGCGAAGGCGTCGTCGTCCTGGCCGTCGAGCGCCTCCGCGACGCGCGCGC GAAAGGGCGCCCGATCCTCGCGGTGATCCGCGGCAGCGCGGTGAACCACG ACGGCACGTCGAGCGGGATCACGG7CCCGAACGGGCCCGCGCAGCAGAAG GTGCTCCGCGCCGCGCTCGACGACGCGCGGCTTGTCCCCGCCGACGTCGA CGTCGTCGAGTGCCACGGCACGGGGACCTCCATCGGCGATCCCATCGAAG
TGAACGCGCTCGCCGCCGTCTACGGCGAGGGGCGCCCCAAGGACCGCCCG CTG7TCCTGGGCGCGCTGAAGACCAACATCGGGCACCTCGAGTTCGCGTC GGGCCTCGCCGGCGTCGCGAAGATGGTCGCCTCCATGCGCCACGCGACCC TCCCCGCGACGCTGCACACGAGCCCGCTCAACCCGCTCGTCGACTGGGAC GCGCTCCCCGTGCGCGTCGTCGACGCCGCGCGCCCGTGGACGCGCCGCGA CGACGGCGCCCCCCGGCGCGCCGGCGTCACGGCGATCGTCGAGGAGGCGC CCGCCGAGCCCGAGCCCACGACGCCCGACGCCGCGCCCGCGCTTCCGGCC GTGCCCGTTCTCCTCTCGGGCAAGACCGACGAGGCGCTGCGCGCGCAGGC AGCGCGCCTCCACGCGCACCTCGCGGGGCGCCCCGACGCGCGGCTCGTCG ACATCGCCGCGTCGCTCGCGACGACGCGCACGCACTTCGATCGACGCGCG GCCGTCGTCGCGGCGGATCGCGACGAGCTCCTCGGCGCGCTCGACGCGCT CGCGCGCGGCGAGGCAGGCCCGGGGTCGGTCGTCGCGAGCGCGATCCCCG CCGGCAGGGTCGTGTTCGTGTTCCCCGGCCAAGGCTCGCAGTGGGTCGGG ATGGCGCGCGCGCTCCTCGCGTCGTCGGTGGTCTTCCGCGACGAGATCGC GGCCTGCGAGCGCGCGCTCGCGCCGCACGTCGCCTGGTCGCTCGGCGCCG TTCTCCGGGGCGACGGCGACGAGGCGACGCTCCTCGGCCGCGTCGACGTC GTGCAGCCGGTCCTCTTCGCCGTCATGGTCGCCCTCGCCGCGCTCTGGCG CTCGATCGGCGTCACGCCCGACGCCGTCGTCGGGCACAGCCAAGGCGAGA TCGCCGCCGCCTACGTCGCCGGCGCCCTCTCGCTCGAAGACGCCGCCAAG GTCGTCGCGCTGCGCGCACGAGCGCTCACGAAGATCGCGGGGCGCGGGGC GATGGCCGCCGTCGAGCTCGGCGCACGCGACACCGAGGCGCGCCTCGCGC CGTTCGGCGACGCCATCGCGATCGCGGCGATCAACAGCCCGCGCGCCACG CTCGTCGCGGGCGACACGGACGCGATCGACGCGCTCGTCCGCGACCTCGA GGCCGCGCAGATCTTCGCGCGGAAGGTGCGTGTCGACTACGCGTCGCACT CGGCGCACGTCGAGGCGATCGAGCGCGAGCTCCTCGCGGATCTCGCGGGG ATCGAACCGCGCGCGGGCGCTGTGCCGCTTTACTCCGCGGTGACGGGCGC GAAGCTCGACGGGAACCGCCTCGACCCCGCGCATTGGTTCCGGAACCTGC GCTCGACAAAAAACTTTGAGGACGCCACGCGCGCGCTCCACGACGACGGC CGCCGGGTATCCTCATNATCNNGGGCGTNCAGAGGAGTCGGTATTNCCCC CCCCCGCCTTNCCCG,
or their complementary strands, (b) DNA-sequences which hybridise under stringent conditions to regions of DNA-sequences according to (a) encoding proteins or to fragments of said DNA-sequences,
(c) DNA-sequences which hybridise to the DNA-sequences according to (a) and (b) because of a degeneration of the genetic code,
(d) allele variations and mutants resulting by substitution, insertion or deletion of nucleotides or inversion of nucleotide segments of DNA-sequences according to (a) to (c) , wherein the variations and mutants offer isofunctional expression products.
15. Peptide encoded by a DNA sequence according to claim 14 selected from the group consisting of
Seq ID No 121
>Cont ιgl 7_00 1 828 ammo ac ids MW= 86259 D pl =5 . 60 numambιg= l MTVMATPGI FGEFSAQHAGAPDGRCKSFSSHANGAGWSEGAGIVLLERLSDARKNGHPVL AIVRGSAVNQDGKSQGLTAPNGPAQERVIRDALASAGLAPADVDAVEAHGTGTTLGDPIE ARALFATYGAAHAKERPLWLGSLKSNLGHTQAAAGIGGVI KMVLALQSGVLPRTLHAASP SQHIDWSSGAIRLLTEEVPWKPGGSPRRAAVSSFGI SGTNAHWLEEAPRAAAGDDQAGE PARAPFAPPVPVLLSAKSDAALRAQAGKLHAHLAEHEDVALVDVAASLATTRSHFERRAA WADDRAALLSALAALAEGRAGAGTVAGEALPPGKVAFVFPGQGSQWPSMARALLASS PA FRAEIEACERALAPHVDWSLLAVLGGDEAHAAPMLERVDVVQPVLFAVMIALAATWRAAG
VTPDAWGHSQGEIAAAYVAGALSLEDAARWALRSRAITKLAGRGAMSAVELTTAELEA RLAPLGERLAIAAINSPHAALVSGDPGAIDELVAELSGAQLFARKVRVEYASHSAHVEAI ERTMLEALDGIAPRPATVPLYSAVTGERLDGEALGAAHWYRNLRHTVRFEHATRALLDDG HRFFVEVSPHPVLTVALDETLSS IDGDATWSSLRRDEGDLERFLLSAGELHARGHAIDW GAFFAPLGARRVTLPTYAFQRERFWLDAGDASDDEDAAPGASAEETAFWRAIERGDVAAL SDALHVADSGRRSALESLMPALSAWRRSRREQSTVGAWRYRVEWRPVSAASRGDVAGTWL WAPAGVASDLASALAAALTERGADWPLALEASDITDTGCRDRMSNVX
Seq ID No 122 >Contigl8_002 502 amino acids MW=53019 D pl=6.83 numambig=l FRXPDVHDRLANLVARRDYFYQLALRAAGTYVRGLVRAPHDGARPPAFAPRGAALVTGGT GALGAHVARWFARIGAEHIVLASRRGAAAPGAAALAEELSVLGARVTLVACDVPDREAVA GLVRNVKAGGATVRAVFHAGGAMHEAPVAAMRVEELADAIAVKARGAQHLQDVFAQRPLN AFVLFSSETGVWGGGRQGAYAAANAFLDALAEARRADGLAATSIAWGAWAGGGMLATDAE RRLKHRGVAPMDPELAVAALAHALDHAETCLAVADVDWARFAPSFASARPRPLLDELAEA RSALDALREPPDDARTAAGPEPASTLRTTLAALPEGERHRHLLALVRTETAAVLGHADAS RVEPNRGFFDLGLDSLMSVELRRRVQRATGIKLPATLAFDHPTPSALASKVLAAIVLHDA TPRASPAAELERLEGMLSAIYADEALRDDLTARLRAFLDKRAVRTERPDDAAFAEKLGSA SADELIRLIDQKLGDRIDVDRY*
Seq ID No 123
>Contigl8_010 840 amino acids MW=88062 D pl=5.74 numambig=6
MSTVTNDTLTEYLRRLTQELHRSETRLRATEERRHEPIAIVGLGLPFRGGIHDRDTLWTF
LEEGRDAIAPILASRWNADATYDLDPDAVGKSYVRDAAMLDRVDLFDADFFGISPREAKY VDPQHRLLLETSWQALEDAGIVPASLRDSKTGVFVGTGASDYAFLQSDRDASEAYAFMGM ISSFAAGRLAFTLGLQGPALSIDTACSSSLVALHLACQSLRQGECDLALVAGVQVMSSPE VFVLLSRTRALASDGRSKTFSANADGYGRGEGVWLAVERLRDARAKGRPILAVIRGSAV NHDGTSSGITVPNGPAQQKVLRAALDDARLVPADVDWECHGTGTSIGDPIEVNALAAVY GEGRPKDRPLFLGALKTNIGHLEFASGLAGVAKMVASMRHATLPATLHTSPLNPLVDWDA LPVRWDAARPWTRRDDGAPRRAGVTAIVEEAPAEPEPTTPDAAPALPAVPVLLSGKTDE
ALRAQAARLHAHLAGRPDARLVDIAASLATTRTHFDRRAAWAADRDELLGALDALARGE AGPGSWASAI PAGRWFVFPGQGSQWVGMARALLASSWFRDEIAACERALAPHVAWSL GAVLRGDGDEATLLGRVDWQPVLFAVMVALAALWRS I GVTPDAWGHSQGE I AAAYVAG ALSLEDAAKWALRARALTKIAGRGAMAAVELGARDTEARLAPFGDAIAIAAINSPRATL VAGDTDAIDALVRDLEAAQIFARKVRVDYASHSAHVEAIERELLADLAGIEPRAGAVPLY SAVTGAKLDGNRLDPAHWFRNLRSTKNFEDATRALHDDGRRVSSXSXAXRGVGIXPPRLX X
16. Recombinant expression vector which comprises a DNA- sequence according to any of claims 1 to 10, 12 and 14.
17. Procaryotic or eucaryotic cell which has been transfected or transformed with a DNA- sequence according to any of claims 1 to 10, 12 and 14 or with a recombinant expression vector ac- cording to claim 16.
18. Cell according to claim 17, wherein the cell is derived from myxobacteπa .
19. Cell according to claim 17, wherein the cell is derived from a Sorangium strain.
20. Cell according to claim 17, wherein the cell is derived from Sorangium cellulosum.
21. Cell according to claim 17, wherein the cell is derived from a Streptomyces strain.
22. Cell according to claim 17, wherein the cell is derived from Escherichia coli.
23. Process for an enzymatic biosynthesis, mutasynthesis or partial synthesis of polyketide or heteropolyketide compounds, wherein a cell according to any of claims 17 to 22 is culti- vated in a suitable culture medium and the polyketide or heteropolyketide compound is isolated from the medium.
24. Process according to claim 23, wherein the polyketide or heteropolyketide compound is an epothilone.
PCT/US1999/023535 1998-10-09 1999-10-11 Dna sequences for enzymatic synthesis of polyketide or heteropolyketide compounds WO2000022139A2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
JP2000576029A JP2002527067A (en) 1998-10-09 1999-10-11 DNA sequences for enzymatic synthesis of polyketide or heteropolyketide compounds
CA002346499A CA2346499A1 (en) 1998-10-09 1999-10-11 Dna sequences for enzymatic synthesis of polyketide or heteropolyketide compounds
AU65126/99A AU6512699A (en) 1998-10-09 1999-10-11 Dna sequences for enzymatic synthesis of polyketide or heteropolyketide compounds
EP99953117A EP1119628A2 (en) 1998-10-09 1999-10-11 Dna sequences for enzymatic synthesis of polyketide or heteropolyketide compounds

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
DE19846493.2 1998-10-09
DE1998146493 DE19846493A1 (en) 1998-10-09 1998-10-09 DNA sequence coding for products involved in the biosynthesis of polyketide or heteropolyketide compounds, especially epothilone

Publications (3)

Publication Number Publication Date
WO2000022139A2 true WO2000022139A2 (en) 2000-04-20
WO2000022139A9 WO2000022139A9 (en) 2000-09-08
WO2000022139A3 WO2000022139A3 (en) 2001-01-18

Family

ID=7883888

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1999/023535 WO2000022139A2 (en) 1998-10-09 1999-10-11 Dna sequences for enzymatic synthesis of polyketide or heteropolyketide compounds

Country Status (6)

Country Link
EP (1) EP1119628A2 (en)
JP (1) JP2002527067A (en)
AU (1) AU6512699A (en)
CA (1) CA2346499A1 (en)
DE (1) DE19846493A1 (en)
WO (1) WO2000022139A2 (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2000031247A3 (en) * 1998-11-20 2000-12-07 Kosan Biosciences Inc Recombinant methods and materials for producing epothilone and epothilone derivatives
WO2001083800A2 (en) * 2000-04-28 2001-11-08 Kosan Biosciences, Inc. Heterologous production of polyketides
US6410301B1 (en) 1998-11-20 2002-06-25 Kosan Biosciences, Inc. Myxococcus host cells for the production of epothilones
US6998256B2 (en) 2000-04-28 2006-02-14 Kosan Biosciences, Inc. Methods of obtaining epothilone D using crystallization and /or by the culture of cells in the presence of methyl oleate

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6242469B1 (en) 1996-12-03 2001-06-05 Sloan-Kettering Institute For Cancer Research Synthesis of epothilones, intermediates thereto, analogues and uses thereof
US6121029A (en) * 1998-06-18 2000-09-19 Novartis Ag Genes for the biosynthesis of epothilones
CA2496477C (en) 2002-08-23 2012-10-16 Sloan-Kettering Institute For Cancer Research Synthesis of epothilones, intermediates thereto, analogues and uses thereof
US7649006B2 (en) 2002-08-23 2010-01-19 Sloan-Kettering Institute For Cancer Research Synthesis of epothilones, intermediates thereto and analogues thereof
CN112941002B (en) * 2021-02-08 2023-04-25 中国科学院天津工业生物技术研究所 Recombinant strain of escherichia coli for producing dopamine as well as construction method and application thereof

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1993013663A1 (en) * 1992-01-17 1993-07-22 Abbott Laboratories Method of directing biosynthesis of specific polyketides
US5716849A (en) * 1994-06-08 1998-02-10 Novartis Finance Corporation Genes for the biosynthesis of soraphen
WO1998022461A1 (en) * 1996-11-18 1998-05-28 GESELLSCHAFT FüR BIOTECHNOLOGISCHE FORSCHUNG MBH (GBF) Epothilone c, d, e and f, production process, and their use as cytostatic as well as phytosanitary agents
WO1999066028A2 (en) * 1998-06-18 1999-12-23 Novartis Ag Genes for the biosynthesis of epothilones
WO2000031247A2 (en) * 1998-11-20 2000-06-02 Kosan Biosciences, Inc. Recombinant methods and materials for producing epothilone and epothilone derivatives

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1993013663A1 (en) * 1992-01-17 1993-07-22 Abbott Laboratories Method of directing biosynthesis of specific polyketides
US5716849A (en) * 1994-06-08 1998-02-10 Novartis Finance Corporation Genes for the biosynthesis of soraphen
WO1998022461A1 (en) * 1996-11-18 1998-05-28 GESELLSCHAFT FüR BIOTECHNOLOGISCHE FORSCHUNG MBH (GBF) Epothilone c, d, e and f, production process, and their use as cytostatic as well as phytosanitary agents
WO1999066028A2 (en) * 1998-06-18 1999-12-23 Novartis Ag Genes for the biosynthesis of epothilones
WO2000031247A2 (en) * 1998-11-20 2000-06-02 Kosan Biosciences, Inc. Recombinant methods and materials for producing epothilone and epothilone derivatives

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
BEYER S ET AL: "METABOLIC DIVERSITY IN MYXOBACTERIA: IDENTIFICATION OF THE MYXALAMID AND THE STIGMATELLIN BIOSYNTHETIC GENE CLUSTER OF STIGMATELLA AURANTIACA SG A15 AND A COMBINED POLYKETIDE(POLY)PEPTIDE GENE CLUSTER FROM THE EPOTHILONE PRODUCING STRAIN SORANGIUM CELLULOSUM SO CE90" BIOCHIMICA ET BIOPHYSICA ACTA. GENE STRUCTURE AND EXPRESSION, vol. 1445, no. 2, 14 May 1999 (1999-05-14), pages 185-195, XP000915670 AMSTERDAM ISSN: 0167-4781 *
SCHUPP T ET AL: "SORANGIUM CELLULOSUM (MYXOBACTERIUM) GENE CLUSTER FOR THE BIOSYNTHESIS OF THE MACROLIDE ANTIBIOTIC SORAPHEN A: CLONING, CHARACTERIZATION, AND HOMOLOGY TO POLYKETIDE SYNTHASE GENES FROM ACTINOMYCETES" JOURNAL OF BACTERIOLOGY, vol. 177, no. 13, July 1995 (1995-07), pages 3673-3679, XP000893003 WASHINGTON, DC ISSN: 0021-9193 *

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6583290B1 (en) 1998-11-20 2003-06-24 Kosam Biosciences, Inc. 14-methyl epothilone derivatives
JP2007097595A (en) * 1998-11-20 2007-04-19 Kosan Biosciences Inc Recombinant method and material for producing epothilone and epothilone derivatives
US7732186B2 (en) 1998-11-20 2010-06-08 Kosan Biosciences, Inc. Recombinant methods and materials for producing epothilone and epothilone derivatives
US6410301B1 (en) 1998-11-20 2002-06-25 Kosan Biosciences, Inc. Myxococcus host cells for the production of epothilones
JP2002530107A (en) * 1998-11-20 2002-09-17 コーサン バイオサイエンシーズ, インコーポレイテッド Recombinant methods and materials for producing epothilone and epothilone derivatives
WO2000031247A3 (en) * 1998-11-20 2000-12-07 Kosan Biosciences Inc Recombinant methods and materials for producing epothilone and epothilone derivatives
US6303342B1 (en) 1998-11-20 2001-10-16 Kason Biosciences, Inc. Recombinant methods and materials for producing epothilones C and D
US6858411B1 (en) 1998-11-20 2005-02-22 Kosan Biosciences, Inc. Recombinant methods and materials for producing epothilone and epothilone derivatives
US7402421B2 (en) 1998-11-20 2008-07-22 Kosan Biosciences, Inc. Recombinant methods and materials for producing epothilone and epothilone derivatives
KR100851418B1 (en) * 1998-11-20 2008-08-08 코산 바이오사이언시즈, 인코포레이티드 Recombinant methods and materials for producing epothilone and epothilone derivatives
US6921650B1 (en) 1998-11-20 2005-07-26 Kosan Biosciences, Inc. Recombinant methods and materials for producing epothilone and epothilone derivatives
US7067286B2 (en) 1998-11-20 2006-06-27 Kosan Biosciences, Inc. Cystobacterineae host cells containing heterologous PKS genes for the synthesis of polykedtides
KR100716272B1 (en) * 1998-11-20 2007-05-09 코산 바이오사이언시즈, 인코포레이티드 Recombinant methods and materials for producing epothilone and epothilone derivatives
US7129071B1 (en) 1998-11-20 2006-10-31 Kosan Biosciences, Inc. Recombinant methods and materials for producing epothilone and epothilone derivatives
WO2001083800A3 (en) * 2000-04-28 2003-04-10 Kosan Biosciences Inc Heterologous production of polyketides
EP1652926A3 (en) * 2000-04-28 2006-08-09 Kosan Biosciences, Inc. Heterologous production of polyketides
US7323573B2 (en) 2000-04-28 2008-01-29 Kosan Biosciences, Inc. Production of polyketides
KR100832145B1 (en) * 2000-04-28 2008-05-27 코산 바이오사이언시즈, 인코포레이티드 Production of polyketides
EP1652926A2 (en) * 2000-04-28 2006-05-03 Kosan Biosciences, Inc. Heterologous production of polyketides
US6998256B2 (en) 2000-04-28 2006-02-14 Kosan Biosciences, Inc. Methods of obtaining epothilone D using crystallization and /or by the culture of cells in the presence of methyl oleate
WO2001083800A2 (en) * 2000-04-28 2001-11-08 Kosan Biosciences, Inc. Heterologous production of polyketides

Also Published As

Publication number Publication date
DE19846493A1 (en) 2000-04-13
CA2346499A1 (en) 2000-04-20
EP1119628A2 (en) 2001-08-01
JP2002527067A (en) 2002-08-27
WO2000022139A3 (en) 2001-01-18
AU6512699A (en) 2000-05-01
WO2000022139A9 (en) 2000-09-08

Similar Documents

Publication Publication Date Title
Perlova et al. Identification and analysis of the chivosazol biosynthetic gene cluster from the myxobacterial model strain Sorangium cellulosum So ce56
JPH09224687A (en) Polyketide-synthase gene
JPH09224686A (en) Platenolide-synthase gene
KR20070033979A (en) DNA coding for polypeptides involved in biosynthesis of pladienolides
CN107868789B (en) Colimycin biosynthesis gene cluster
US7452693B2 (en) Discrete acyltransferases associated with type I polyketide synthases and methods of use
WO2000022139A2 (en) Dna sequences for enzymatic synthesis of polyketide or heteropolyketide compounds
CN108048472B (en) Engineering strain for high-efficiency heterologous expression of Disorazole Z, gene cluster for constructing strain and application of gene cluster
JP2004337169A (en) Cloning genes from streptomyces cyaneogriseus subsp.noncyanogenus for biosynthesis of antibiotics and methods of use
JP2008278895A (en) Biosynthetic gene for producing butenyl-spinosyn insecticide
CN101818158B (en) Biosynthetic gene cluster of FR901464
CN110857447B (en) Method for increasing yield of milbemycins A3/A4 or derivatives thereof
US7595187B2 (en) Elaiophylin biosynthetic gene cluster
US7364877B2 (en) Polynucleotides encoding disorazole polyketide synthase polypeptides
US7332576B2 (en) Biosynthetic gene cluster for ambruticins and the encoded proteins
US20040219645A1 (en) Polyketides and their synthesis
Stratigopoulos et al. Inactivation of a transcriptional repressor during empirical improvement of the tylosin producer, Streptomyces fradiae
KR20130097538A (en) Chejuenolide biosynthetic gene cluster from hahella chejuensis
AU2008201937B2 (en) Biosynthetic genes for butenyl-spinosyn insecticide production
KR100549690B1 (en) Genes for the Synthesis of FR-008 Polyketides
CN101142313A (en) Genes encoding the synthetic pathway for the production of disorazole
US20030157654A1 (en) Biosynthesis of enediyne compounds by manipulation of C-1027 gene pathway
JP2001112487A (en) Ml-236 biosynthesis-related dna
AU2002305118A1 (en) Biosynthetic genes for butenyl-spinosyn insecticide production

Legal Events

Date Code Title Description
ENP Entry into the national phase

Ref document number: 1999 65126

Country of ref document: AU

Kind code of ref document: A

AK Designated states

Kind code of ref document: A2

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: C2

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: C2

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

COP Corrected version of pamphlet

Free format text: PAGES 35-44, DESCRIPTION, REPLACED BY NEW PAGES 35-44; DUE TO LATE TRANSMITTAL BY THE RECEIVING OFFICE

AK Designated states

Kind code of ref document: A3

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

WWE Wipo information: entry into national phase

Ref document number: 65126/99

Country of ref document: AU

ENP Entry into the national phase

Ref document number: 2000 576029

Country of ref document: JP

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 2346499

Country of ref document: CA

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 1999953117

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1999953117

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWW Wipo information: withdrawn in national office

Ref document number: 1999953117

Country of ref document: EP

DPE2 Request for preliminary examination filed before expiration of 19th month from priority date (pct application filed from 20040101)