US20040172667A1 - Administration of transposon-based vectors to reproductive organs - Google Patents
Administration of transposon-based vectors to reproductive organs Download PDFInfo
- Publication number
- US20040172667A1 US20040172667A1 US10/746,149 US74614903A US2004172667A1 US 20040172667 A1 US20040172667 A1 US 20040172667A1 US 74614903 A US74614903 A US 74614903A US 2004172667 A1 US2004172667 A1 US 2004172667A1
- Authority
- US
- United States
- Prior art keywords
- promoter
- transposon
- sequence
- protein
- gene
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 239000013598 vector Substances 0.000 title claims abstract description 273
- 230000001850 reproductive effect Effects 0.000 title abstract description 17
- 210000000056 organ Anatomy 0.000 title abstract description 15
- 108090000623 proteins and genes Proteins 0.000 claims abstract description 374
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 226
- 241001465754 Metazoa Species 0.000 claims abstract description 141
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 139
- 238000000034 method Methods 0.000 claims abstract description 109
- 230000009261 transgenic effect Effects 0.000 claims abstract description 95
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 78
- 210000003101 oviduct Anatomy 0.000 claims abstract description 76
- 241000271566 Aves Species 0.000 claims abstract description 71
- 230000014509 gene expression Effects 0.000 claims abstract description 46
- 239000000203 mixture Substances 0.000 claims abstract description 31
- 229920001184 polypeptide Polymers 0.000 claims abstract description 24
- 108010020764 Transposases Proteins 0.000 claims description 110
- 108010058846 Ovalbumin Proteins 0.000 claims description 83
- 102000008579 Transposases Human genes 0.000 claims description 77
- 150000001413 amino acids Chemical class 0.000 claims description 74
- 229940092253 ovalbumin Drugs 0.000 claims description 67
- 102000002322 Egg Proteins Human genes 0.000 claims description 62
- 108010000912 Egg Proteins Proteins 0.000 claims description 62
- 108020004705 Codon Proteins 0.000 claims description 53
- 241000286209 Phasianidae Species 0.000 claims description 51
- 239000003623 enhancer Substances 0.000 claims description 47
- 241000287828 Gallus gallus Species 0.000 claims description 46
- 235000013601 eggs Nutrition 0.000 claims description 45
- 235000014103 egg white Nutrition 0.000 claims description 42
- QCVGEOXPDFCNHA-UHFFFAOYSA-N 5,5-dimethyl-2,4-dioxo-1,3-oxazolidine-3-carboxamide Chemical compound CC1(C)OC(=O)N(C(N)=O)C1=O QCVGEOXPDFCNHA-UHFFFAOYSA-N 0.000 claims description 40
- 210000000969 egg white Anatomy 0.000 claims description 40
- 108010076504 Protein Sorting Signals Proteins 0.000 claims description 33
- 238000003780 insertion Methods 0.000 claims description 33
- 230000037431 insertion Effects 0.000 claims description 33
- 210000001672 ovary Anatomy 0.000 claims description 31
- 235000013336 milk Nutrition 0.000 claims description 30
- 239000008267 milk Substances 0.000 claims description 30
- 210000004080 milk Anatomy 0.000 claims description 30
- 108010064983 Ovomucin Proteins 0.000 claims description 28
- 102000040430 polynucleotide Human genes 0.000 claims description 28
- 108091033319 polynucleotide Proteins 0.000 claims description 28
- 239000002157 polynucleotide Substances 0.000 claims description 28
- 108010026206 Conalbumin Proteins 0.000 claims description 20
- 239000002773 nucleotide Substances 0.000 claims description 18
- 125000003729 nucleotide group Chemical group 0.000 claims description 18
- 210000001367 artery Anatomy 0.000 claims description 17
- 230000017448 oviposition Effects 0.000 claims description 16
- 241000124008 Mammalia Species 0.000 claims description 12
- 239000012096 transfection reagent Substances 0.000 claims description 9
- 230000001965 increasing effect Effects 0.000 claims description 8
- 210000004907 gland Anatomy 0.000 claims description 5
- 244000144977 poultry Species 0.000 claims description 5
- 108010000416 ovomacroglobulin Proteins 0.000 claims description 4
- 230000035755 proliferation Effects 0.000 claims description 4
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 claims description 4
- 229930024421 Adenine Natural products 0.000 claims description 3
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 claims description 3
- 229960000643 adenine Drugs 0.000 claims description 3
- 238000003306 harvesting Methods 0.000 claims description 3
- 229940113082 thymine Drugs 0.000 claims description 2
- 108010018858 Tn10 transposase Proteins 0.000 claims 1
- 108700019146 Transgenes Proteins 0.000 abstract description 35
- 230000008021 deposition Effects 0.000 abstract description 10
- 235000018102 proteins Nutrition 0.000 description 205
- 108020004414 DNA Proteins 0.000 description 103
- 235000001014 amino acid Nutrition 0.000 description 85
- 229940024606 amino acid Drugs 0.000 description 83
- 238000001415 gene therapy Methods 0.000 description 52
- 108010013369 Enteropeptidase Proteins 0.000 description 50
- 125000006850 spacer group Chemical group 0.000 description 50
- 210000004027 cell Anatomy 0.000 description 48
- 238000003776 cleavage reaction Methods 0.000 description 48
- 230000007017 scission Effects 0.000 description 48
- 102100029727 Enteropeptidase Human genes 0.000 description 47
- 108010076181 Proinsulin Proteins 0.000 description 42
- -1 antibodies Substances 0.000 description 42
- 235000013330 chicken meat Nutrition 0.000 description 38
- 101000609762 Gallus gallus Ovalbumin Proteins 0.000 description 37
- 238000004519 manufacturing process Methods 0.000 description 36
- 238000000746 purification Methods 0.000 description 36
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 35
- 239000013599 cloning vector Substances 0.000 description 35
- 101800001690 Transmembrane protein gp41 Proteins 0.000 description 34
- 238000010367 cloning Methods 0.000 description 34
- 210000001519 tissue Anatomy 0.000 description 34
- 241000701022 Cytomegalovirus Species 0.000 description 28
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 28
- QTBSBXVTEAMEQO-UHFFFAOYSA-N acetic acid Substances CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 27
- 238000006467 substitution reaction Methods 0.000 description 27
- 239000012634 fragment Substances 0.000 description 25
- 241000283707 Capra Species 0.000 description 24
- 108091092724 Noncoding DNA Proteins 0.000 description 24
- 239000000243 solution Substances 0.000 description 24
- 108010090932 Vitellogenins Proteins 0.000 description 23
- 238000010348 incorporation Methods 0.000 description 23
- 238000002347 injection Methods 0.000 description 22
- 239000007924 injection Substances 0.000 description 22
- 230000008685 targeting Effects 0.000 description 22
- 230000001939 inductive effect Effects 0.000 description 20
- 125000003275 alpha amino acid group Chemical group 0.000 description 19
- 239000004471 Glycine Substances 0.000 description 18
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 18
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 17
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 17
- 235000003704 aspartic acid Nutrition 0.000 description 17
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 17
- 235000004400 serine Nutrition 0.000 description 17
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 16
- 108090001061 Insulin Proteins 0.000 description 16
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 16
- 108091028043 Nucleic acid sequence Proteins 0.000 description 16
- 235000004279 alanine Nutrition 0.000 description 16
- 235000013922 glutamic acid Nutrition 0.000 description 16
- 102000004877 Insulin Human genes 0.000 description 15
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 15
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 15
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 15
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 15
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 15
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 15
- 239000004473 Threonine Substances 0.000 description 15
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 15
- 210000004602 germ cell Anatomy 0.000 description 15
- 239000004220 glutamic acid Substances 0.000 description 15
- 229960000310 isoleucine Drugs 0.000 description 15
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 15
- 235000008521 threonine Nutrition 0.000 description 15
- 238000001890 transfection Methods 0.000 description 15
- 239000004474 valine Substances 0.000 description 15
- 102000004190 Enzymes Human genes 0.000 description 14
- 108090000790 Enzymes Proteins 0.000 description 14
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 14
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 14
- 239000004365 Protease Substances 0.000 description 14
- 229940088598 enzyme Drugs 0.000 description 14
- 229940125396 insulin Drugs 0.000 description 14
- 230000001105 regulatory effect Effects 0.000 description 14
- 108091026890 Coding region Proteins 0.000 description 13
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 13
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 13
- 102000035195 Peptidases Human genes 0.000 description 13
- 108091005804 Peptidases Proteins 0.000 description 13
- 238000007792 addition Methods 0.000 description 13
- 239000000427 antigen Substances 0.000 description 13
- 102000036639 antigens Human genes 0.000 description 13
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 13
- 235000018417 cysteine Nutrition 0.000 description 13
- 108010074605 gamma-Globulins Proteins 0.000 description 13
- 229940088597 hormone Drugs 0.000 description 13
- 239000005556 hormone Substances 0.000 description 13
- 229930182817 methionine Natural products 0.000 description 13
- 108010091135 Immunoglobulin Fc Fragments Proteins 0.000 description 12
- 102000018071 Immunoglobulin Fc Fragments Human genes 0.000 description 12
- 108091007433 antigens Proteins 0.000 description 12
- 238000000151 deposition Methods 0.000 description 12
- 238000011161 development Methods 0.000 description 12
- 230000018109 developmental process Effects 0.000 description 12
- 210000002969 egg yolk Anatomy 0.000 description 12
- 235000019419 proteases Nutrition 0.000 description 12
- HKZAAJSTFUZYTO-LURJTMIESA-N (2s)-2-[[2-[[2-[[2-[(2-aminoacetyl)amino]acetyl]amino]acetyl]amino]acetyl]amino]-3-hydroxypropanoic acid Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O HKZAAJSTFUZYTO-LURJTMIESA-N 0.000 description 11
- 239000003102 growth factor Substances 0.000 description 11
- 230000010354 integration Effects 0.000 description 11
- 230000004048 modification Effects 0.000 description 11
- 238000012986 modification Methods 0.000 description 11
- 239000013612 plasmid Substances 0.000 description 11
- 239000000047 product Substances 0.000 description 11
- 239000011347 resin Substances 0.000 description 11
- 229920005989 resin Polymers 0.000 description 11
- 239000000126 substance Substances 0.000 description 11
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 10
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 10
- 239000004472 Lysine Substances 0.000 description 10
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 10
- 239000003153 chemical reaction reagent Substances 0.000 description 10
- 238000012217 deletion Methods 0.000 description 10
- 230000037430 deletion Effects 0.000 description 10
- 238000005516 engineering process Methods 0.000 description 10
- 239000000499 gel Substances 0.000 description 10
- 238000002955 isolation Methods 0.000 description 10
- 235000018977 lysine Nutrition 0.000 description 10
- 102000005962 receptors Human genes 0.000 description 10
- 108020003175 receptors Proteins 0.000 description 10
- 238000000926 separation method Methods 0.000 description 10
- 238000011144 upstream manufacturing Methods 0.000 description 10
- 108050004290 Cecropin Proteins 0.000 description 9
- 108060003951 Immunoglobulin Proteins 0.000 description 9
- 206010035226 Plasma cell myeloma Diseases 0.000 description 9
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 9
- 230000008901 benefit Effects 0.000 description 9
- 230000027455 binding Effects 0.000 description 9
- IDLFZVILOHSSID-OVLDLUHVSA-N corticotropin Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)NC(=O)[C@@H](N)CO)C1=CC=C(O)C=C1 IDLFZVILOHSSID-OVLDLUHVSA-N 0.000 description 9
- 235000013345 egg yolk Nutrition 0.000 description 9
- 102000037865 fusion proteins Human genes 0.000 description 9
- 108020001507 fusion proteins Proteins 0.000 description 9
- XLXSAKCOAKORKW-AQJXLSMYSA-N gonadorelin Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@@H](CCC1)C(=O)NCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC=1N=CNC=1)NC(=O)[C@H]1NC(=O)CC1)C1=CC=C(O)C=C1 XLXSAKCOAKORKW-AQJXLSMYSA-N 0.000 description 9
- 102000018358 immunoglobulin Human genes 0.000 description 9
- 201000000050 myeloid neoplasm Diseases 0.000 description 9
- 108091008146 restriction endonucleases Proteins 0.000 description 9
- 210000002966 serum Anatomy 0.000 description 9
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 9
- 101800000414 Corticotropin Proteins 0.000 description 8
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 8
- 239000000579 Gonadotropin-Releasing Hormone Substances 0.000 description 8
- 102100039620 Granulocyte-macrophage colony-stimulating factor Human genes 0.000 description 8
- 241000282412 Homo Species 0.000 description 8
- 101000857870 Squalus acanthias Gonadoliberin Proteins 0.000 description 8
- 238000001042 affinity chromatography Methods 0.000 description 8
- 125000001931 aliphatic group Chemical group 0.000 description 8
- 235000009582 asparagine Nutrition 0.000 description 8
- 229960000258 corticotropin Drugs 0.000 description 8
- 201000010099 disease Diseases 0.000 description 8
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 8
- OPCBKDJCJYBGTQ-UHFFFAOYSA-N gamma-hydroxyarginine Chemical compound OC(=O)C(N)CC(O)CNC(N)=N OPCBKDJCJYBGTQ-UHFFFAOYSA-N 0.000 description 8
- 229940035638 gonadotropin-releasing hormone Drugs 0.000 description 8
- 230000000670 limiting effect Effects 0.000 description 8
- 210000005075 mammary gland Anatomy 0.000 description 8
- 102000039446 nucleic acids Human genes 0.000 description 8
- 108020004707 nucleic acids Proteins 0.000 description 8
- 150000007523 nucleic acids Chemical class 0.000 description 8
- 239000011780 sodium chloride Substances 0.000 description 8
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 7
- 239000004475 Arginine Substances 0.000 description 7
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 7
- 241000334119 Coturnix japonica Species 0.000 description 7
- 238000002965 ELISA Methods 0.000 description 7
- 101150048348 GP41 gene Proteins 0.000 description 7
- AHLPHDHHMVZTML-BYPYZUCNSA-N L-Ornithine Chemical compound NCCC[C@H](N)C(O)=O AHLPHDHHMVZTML-BYPYZUCNSA-N 0.000 description 7
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 7
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 7
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 7
- 102000016267 Leptin Human genes 0.000 description 7
- 108010092277 Leptin Proteins 0.000 description 7
- 206010028980 Neoplasm Diseases 0.000 description 7
- AHLPHDHHMVZTML-UHFFFAOYSA-N Orn-delta-NH2 Natural products NCCCC(N)C(O)=O AHLPHDHHMVZTML-UHFFFAOYSA-N 0.000 description 7
- UTJLXEIPEHZYQJ-UHFFFAOYSA-N Ornithine Natural products OC(=O)C(C)CCCN UTJLXEIPEHZYQJ-UHFFFAOYSA-N 0.000 description 7
- 241000714474 Rous sarcoma virus Species 0.000 description 7
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 7
- 229960001230 asparagine Drugs 0.000 description 7
- 230000015572 biosynthetic process Effects 0.000 description 7
- 239000000872 buffer Substances 0.000 description 7
- 239000000470 constituent Substances 0.000 description 7
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 7
- 239000003814 drug Substances 0.000 description 7
- 238000002474 experimental method Methods 0.000 description 7
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 7
- 235000004554 glutamine Nutrition 0.000 description 7
- 210000000987 immune system Anatomy 0.000 description 7
- 229940039781 leptin Drugs 0.000 description 7
- NRYBAZVQPHGZNS-ZSOCWYAHSA-N leptin Chemical compound O=C([C@H](CO)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)CCSC)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CS)C(O)=O NRYBAZVQPHGZNS-ZSOCWYAHSA-N 0.000 description 7
- 239000003446 ligand Substances 0.000 description 7
- 229960003104 ornithine Drugs 0.000 description 7
- 239000002953 phosphate buffered saline Substances 0.000 description 7
- 238000002360 preparation method Methods 0.000 description 7
- 230000010076 replication Effects 0.000 description 7
- 238000005096 rolling process Methods 0.000 description 7
- 238000003786 synthesis reaction Methods 0.000 description 7
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 6
- 108010088751 Albumins Proteins 0.000 description 6
- 102000009027 Albumins Human genes 0.000 description 6
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 6
- 102100023804 Coagulation factor VII Human genes 0.000 description 6
- 102400000739 Corticotropin Human genes 0.000 description 6
- 102000001301 EGF receptor Human genes 0.000 description 6
- 108060006698 EGF receptor Proteins 0.000 description 6
- 108090000723 Insulin-Like Growth Factor I Proteins 0.000 description 6
- 108010025020 Nerve Growth Factor Proteins 0.000 description 6
- 102000007072 Nerve Growth Factors Human genes 0.000 description 6
- 241000283973 Oryctolagus cuniculus Species 0.000 description 6
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 6
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 6
- 108010041407 alanylaspartic acid Proteins 0.000 description 6
- BFNBIHQBYMNNAN-UHFFFAOYSA-N ammonium sulfate Chemical compound N.N.OS(O)(=O)=O BFNBIHQBYMNNAN-UHFFFAOYSA-N 0.000 description 6
- 229910052921 ammonium sulfate Inorganic materials 0.000 description 6
- 150000007860 aryl ester derivatives Chemical class 0.000 description 6
- 210000002919 epithelial cell Anatomy 0.000 description 6
- 229940011871 estrogen Drugs 0.000 description 6
- 239000000262 estrogen Substances 0.000 description 6
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 6
- 125000000524 functional group Chemical group 0.000 description 6
- 238000001476 gene delivery Methods 0.000 description 6
- 230000002401 inhibitory effect Effects 0.000 description 6
- 210000004185 liver Anatomy 0.000 description 6
- 230000008488 polyadenylation Effects 0.000 description 6
- 125000006239 protecting group Chemical group 0.000 description 6
- 238000013518 transcription Methods 0.000 description 6
- 230000035897 transcription Effects 0.000 description 6
- XQMVBICWFFHDNN-UHFFFAOYSA-N 5-amino-4-chloro-2-phenylpyridazin-3-one;(2-ethoxy-3,3-dimethyl-2h-1-benzofuran-5-yl) methanesulfonate Chemical compound O=C1C(Cl)=C(N)C=NN1C1=CC=CC=C1.C1=C(OS(C)(=O)=O)C=C2C(C)(C)C(OCC)OC2=C1 XQMVBICWFFHDNN-UHFFFAOYSA-N 0.000 description 5
- XPGVTUBABLRGHY-BIIVOSGPSA-N Asp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N XPGVTUBABLRGHY-BIIVOSGPSA-N 0.000 description 5
- 241000283690 Bos taurus Species 0.000 description 5
- 102000003951 Erythropoietin Human genes 0.000 description 5
- 108090000394 Erythropoietin Proteins 0.000 description 5
- 108010017213 Granulocyte-Macrophage Colony-Stimulating Factor Proteins 0.000 description 5
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 5
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 5
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 5
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 5
- 241000270322 Lepidosauria Species 0.000 description 5
- 229920001213 Polysorbate 20 Polymers 0.000 description 5
- 108010057464 Prolactin Proteins 0.000 description 5
- 102000003946 Prolactin Human genes 0.000 description 5
- 241000287531 Psittacidae Species 0.000 description 5
- 241000287530 Psittaciformes Species 0.000 description 5
- 108010009583 Transforming Growth Factors Proteins 0.000 description 5
- 102000009618 Transforming Growth Factors Human genes 0.000 description 5
- 239000007983 Tris buffer Substances 0.000 description 5
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 5
- 241000700605 Viruses Species 0.000 description 5
- 108010087924 alanylproline Proteins 0.000 description 5
- 238000012870 ammonium sulfate precipitation Methods 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 230000000890 antigenic effect Effects 0.000 description 5
- 125000001797 benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])* 0.000 description 5
- 201000011510 cancer Diseases 0.000 description 5
- 238000005119 centrifugation Methods 0.000 description 5
- 210000000991 chicken egg Anatomy 0.000 description 5
- 229940105423 erythropoietin Drugs 0.000 description 5
- 108010043293 glycyl-prolyl-glycyl-glycine Proteins 0.000 description 5
- 230000012010 growth Effects 0.000 description 5
- 210000004681 ovum Anatomy 0.000 description 5
- 239000000199 parathyroid hormone Substances 0.000 description 5
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 5
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 5
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 5
- OXCMYAYHXIHQOA-UHFFFAOYSA-N potassium;[2-butyl-5-chloro-3-[[4-[2-(1,2,4-triaza-3-azanidacyclopenta-1,4-dien-5-yl)phenyl]phenyl]methyl]imidazol-4-yl]methanol Chemical compound [K+].CCCCC1=NC(Cl)=C(CO)N1CC1=CC=C(C=2C(=CC=CC=2)C2=N[N-]N=N2)C=C1 OXCMYAYHXIHQOA-UHFFFAOYSA-N 0.000 description 5
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 5
- 229940097325 prolactin Drugs 0.000 description 5
- 238000012163 sequencing technique Methods 0.000 description 5
- 241000894007 species Species 0.000 description 5
- 230000000638 stimulation Effects 0.000 description 5
- 230000009466 transformation Effects 0.000 description 5
- 230000014616 translation Effects 0.000 description 5
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 5
- 229960005486 vaccine Drugs 0.000 description 5
- 238000001262 western blot Methods 0.000 description 5
- WRLUODMOTSXWIP-BYPYZUCNSA-N (2s)-5-(diaminomethylideneamino)-2-nitramidopentanoic acid Chemical compound NC(=N)NCCC[C@@H](C(O)=O)N[N+]([O-])=O WRLUODMOTSXWIP-BYPYZUCNSA-N 0.000 description 4
- HZAXFHJVJLSVMW-UHFFFAOYSA-N 2-Aminoethan-1-ol Chemical compound NCCO HZAXFHJVJLSVMW-UHFFFAOYSA-N 0.000 description 4
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 4
- IFPQOXNWLSRZKX-UHFFFAOYSA-N 2-amino-4-(diaminomethylideneamino)butanoic acid Chemical compound OC(=O)C(N)CCN=C(N)N IFPQOXNWLSRZKX-UHFFFAOYSA-N 0.000 description 4
- 108020003589 5' Untranslated Regions Proteins 0.000 description 4
- 102000007469 Actins Human genes 0.000 description 4
- 108010085238 Actins Proteins 0.000 description 4
- 241000272517 Anseriformes Species 0.000 description 4
- 101800001288 Atrial natriuretic factor Proteins 0.000 description 4
- 102400001282 Atrial natriuretic peptide Human genes 0.000 description 4
- 101800001890 Atrial natriuretic peptide Proteins 0.000 description 4
- 108090001008 Avidin Proteins 0.000 description 4
- 241000894006 Bacteria Species 0.000 description 4
- 101800001982 Cholecystokinin Proteins 0.000 description 4
- 102100025841 Cholecystokinin Human genes 0.000 description 4
- 102100022641 Coagulation factor IX Human genes 0.000 description 4
- NBSCHQHZLSJFNQ-GASJEMHNSA-N D-Glucose 6-phosphate Chemical compound OC1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H](O)[C@H]1O NBSCHQHZLSJFNQ-GASJEMHNSA-N 0.000 description 4
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 4
- 108010076282 Factor IX Proteins 0.000 description 4
- 108010023321 Factor VII Proteins 0.000 description 4
- 241000272184 Falconiformes Species 0.000 description 4
- 102000018233 Fibroblast Growth Factor Human genes 0.000 description 4
- 108050007372 Fibroblast Growth Factor Proteins 0.000 description 4
- VFRROHXSMXFLSN-UHFFFAOYSA-N Glc6P Natural products OP(=O)(O)OCC(O)C(O)C(O)C(O)C=O VFRROHXSMXFLSN-UHFFFAOYSA-N 0.000 description 4
- 102000006395 Globulins Human genes 0.000 description 4
- 108010044091 Globulins Proteins 0.000 description 4
- 239000000095 Growth Hormone-Releasing Hormone Substances 0.000 description 4
- 108010010234 HDL Lipoproteins Proteins 0.000 description 4
- 102000015779 HDL Lipoproteins Human genes 0.000 description 4
- 108010000521 Human Growth Hormone Proteins 0.000 description 4
- 239000000854 Human Growth Hormone Substances 0.000 description 4
- 102000002265 Human Growth Hormone Human genes 0.000 description 4
- 102000004218 Insulin-Like Growth Factor I Human genes 0.000 description 4
- 108010008212 Integrin alpha4beta1 Proteins 0.000 description 4
- 108010002350 Interleukin-2 Proteins 0.000 description 4
- 102000000588 Interleukin-2 Human genes 0.000 description 4
- 102000015696 Interleukins Human genes 0.000 description 4
- 108010063738 Interleukins Proteins 0.000 description 4
- 102100035792 Kininogen-1 Human genes 0.000 description 4
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 4
- HNDVDQJCIGZPNO-YFKPBYRVSA-N L-histidine Chemical compound OC(=O)[C@@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-YFKPBYRVSA-N 0.000 description 4
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 4
- 102000016943 Muramidase Human genes 0.000 description 4
- 108010014251 Muramidase Proteins 0.000 description 4
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 4
- 125000001429 N-terminal alpha-amino-acid group Chemical group 0.000 description 4
- 102000003982 Parathyroid hormone Human genes 0.000 description 4
- 108090000445 Parathyroid hormone Proteins 0.000 description 4
- 108010039918 Polylysine Proteins 0.000 description 4
- 102100027467 Pro-opiomelanocortin Human genes 0.000 description 4
- RJKFOVLPORLFTN-LEKSSAKUSA-N Progesterone Chemical compound C1CC2=CC(=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H](C(=O)C)[C@@]1(C)CC2 RJKFOVLPORLFTN-LEKSSAKUSA-N 0.000 description 4
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 4
- 101710142969 Somatoliberin Proteins 0.000 description 4
- 102100022831 Somatoliberin Human genes 0.000 description 4
- 108091008874 T cell receptors Proteins 0.000 description 4
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 4
- MUMGGOZAMZWBJJ-DYKIIFRCSA-N Testostosterone Chemical compound O=C1CC[C@]2(C)[C@H]3CC[C@](C)([C@H](CC4)O)[C@@H]4[C@@H]3CCC2=C1 MUMGGOZAMZWBJJ-DYKIIFRCSA-N 0.000 description 4
- 108010061174 Thyrotropin Proteins 0.000 description 4
- 102000011923 Thyrotropin Human genes 0.000 description 4
- 102000004887 Transforming Growth Factor beta Human genes 0.000 description 4
- 108090001012 Transforming Growth Factor beta Proteins 0.000 description 4
- 108010073929 Vascular Endothelial Growth Factor A Proteins 0.000 description 4
- 102000005789 Vascular Endothelial Growth Factors Human genes 0.000 description 4
- 108010019530 Vascular Endothelial Growth Factors Proteins 0.000 description 4
- 102000002852 Vasopressins Human genes 0.000 description 4
- 108010004977 Vasopressins Proteins 0.000 description 4
- 229960000446 abciximab Drugs 0.000 description 4
- 239000011543 agarose gel Substances 0.000 description 4
- 230000002303 anti-venom Effects 0.000 description 4
- KBZOIRJILGZLEJ-LGYYRGKSSA-N argipressin Chemical compound C([C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CSSC[C@@H](C(N[C@@H](CC=2C=CC(O)=CC=2)C(=O)N1)=O)N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(N)=O)C1=CC=CC=C1 KBZOIRJILGZLEJ-LGYYRGKSSA-N 0.000 description 4
- 230000003115 biocidal effect Effects 0.000 description 4
- NSQLIUXCMFBZME-MPVJKSABSA-N carperitide Chemical compound C([C@H]1C(=O)NCC(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CSSC[C@@H](C(=O)N1)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)=O)[C@@H](C)CC)C1=CC=CC=C1 NSQLIUXCMFBZME-MPVJKSABSA-N 0.000 description 4
- 238000006243 chemical reaction Methods 0.000 description 4
- 229940107137 cholecystokinin Drugs 0.000 description 4
- 210000000349 chromosome Anatomy 0.000 description 4
- 210000003555 cloaca Anatomy 0.000 description 4
- 230000029087 digestion Effects 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 239000012149 elution buffer Substances 0.000 description 4
- 238000001914 filtration Methods 0.000 description 4
- 244000144992 flock Species 0.000 description 4
- 239000012530 fluid Substances 0.000 description 4
- 238000005194 fractionation Methods 0.000 description 4
- 229940045189 glucose-6-phosphate Drugs 0.000 description 4
- RWSXRVCMGQZWBV-WDSKDSINSA-N glutathione Chemical compound OC(=O)[C@@H](N)CCC(=O)N[C@@H](CS)C(=O)NCC(O)=O RWSXRVCMGQZWBV-WDSKDSINSA-N 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 239000000122 growth hormone Substances 0.000 description 4
- HNDVDQJCIGZPNO-UHFFFAOYSA-N histidine Natural products OC(=O)C(N)CC1=CN=CN1 HNDVDQJCIGZPNO-UHFFFAOYSA-N 0.000 description 4
- 235000014304 histidine Nutrition 0.000 description 4
- 229960000274 lysozyme Drugs 0.000 description 4
- 239000004325 lysozyme Substances 0.000 description 4
- 235000010335 lysozyme Nutrition 0.000 description 4
- 235000013372 meat Nutrition 0.000 description 4
- 239000012528 membrane Substances 0.000 description 4
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 4
- 210000000287 oocyte Anatomy 0.000 description 4
- 229960001319 parathyroid hormone Drugs 0.000 description 4
- 229920000656 polylysine Polymers 0.000 description 4
- 239000011148 porous material Substances 0.000 description 4
- 238000001556 precipitation Methods 0.000 description 4
- 230000006337 proteolytic cleavage Effects 0.000 description 4
- 238000011160 research Methods 0.000 description 4
- 210000002955 secretory cell Anatomy 0.000 description 4
- IZTQOLKUZKXIRV-YRVFCXMDSA-N sincalide Chemical compound C([C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(N)=O)NC(=O)[C@@H](N)CC(O)=O)C1=CC=C(OS(O)(=O)=O)C=C1 IZTQOLKUZKXIRV-YRVFCXMDSA-N 0.000 description 4
- ZRKFYGHZFMAOKI-QMGMOQQFSA-N tgfbeta Chemical compound C([C@H](NC(=O)[C@H](C(C)C)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC(C)C)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(C)C)[C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O)C1=CC=C(O)C=C1 ZRKFYGHZFMAOKI-QMGMOQQFSA-N 0.000 description 4
- 230000001225 therapeutic effect Effects 0.000 description 4
- 108010060175 trypsinogen activation peptide Proteins 0.000 description 4
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 4
- 229960003726 vasopressin Drugs 0.000 description 4
- 108010049392 vitellogenin receptor Proteins 0.000 description 4
- VCOPTHOUUNAYKQ-WBTCAYNUSA-N (3s)-3,6-diamino-n-[[(2s,5s,8e,11s,15s)-15-amino-11-[(6r)-2-amino-1,4,5,6-tetrahydropyrimidin-6-yl]-8-[(carbamoylamino)methylidene]-2-(hydroxymethyl)-3,6,9,12,16-pentaoxo-1,4,7,10,13-pentazacyclohexadec-5-yl]methyl]hexanamide;(3s)-3,6-diamino-n-[[(2s,5s,8 Chemical compound N1C(=O)\C(=C/NC(N)=O)NC(=O)[C@H](CNC(=O)C[C@@H](N)CCCN)NC(=O)[C@H](C)NC(=O)[C@@H](N)CNC(=O)[C@@H]1[C@@H]1NC(N)=NCC1.N1C(=O)\C(=C/NC(N)=O)NC(=O)[C@H](CNC(=O)C[C@@H](N)CCCN)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CNC(=O)[C@@H]1[C@@H]1NC(N)=NCC1 VCOPTHOUUNAYKQ-WBTCAYNUSA-N 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- PEZMQPADLFXCJJ-ZETCQYMHSA-N 2-[[2-[[(2s)-1-(2-aminoacetyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]acetic acid Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(=O)NCC(O)=O PEZMQPADLFXCJJ-ZETCQYMHSA-N 0.000 description 3
- 239000000275 Adrenocorticotropic Hormone Substances 0.000 description 3
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 3
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 3
- 102400000068 Angiostatin Human genes 0.000 description 3
- 108010079709 Angiostatins Proteins 0.000 description 3
- 102000004506 Blood Proteins Human genes 0.000 description 3
- 108010017384 Blood Proteins Proteins 0.000 description 3
- 108090000715 Brain-derived neurotrophic factor Proteins 0.000 description 3
- 102000004219 Brain-derived neurotrophic factor Human genes 0.000 description 3
- 102000055006 Calcitonin Human genes 0.000 description 3
- 108060001064 Calcitonin Proteins 0.000 description 3
- 108010065839 Capreomycin Proteins 0.000 description 3
- 241000272201 Columbiformes Species 0.000 description 3
- 239000000055 Corticotropin-Releasing Hormone Substances 0.000 description 3
- OXFOKRAFNYSREH-BJDJZHNGSA-N Cys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N OXFOKRAFNYSREH-BJDJZHNGSA-N 0.000 description 3
- 102400001047 Endostatin Human genes 0.000 description 3
- 108010079505 Endostatins Proteins 0.000 description 3
- 108010092674 Enkephalins Proteins 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- 102000001690 Factor VIII Human genes 0.000 description 3
- 108010054218 Factor VIII Proteins 0.000 description 3
- 108010071289 Factor XIII Proteins 0.000 description 3
- 102000012673 Follicle Stimulating Hormone Human genes 0.000 description 3
- 108010079345 Follicle Stimulating Hormone Proteins 0.000 description 3
- 102400000921 Gastrin Human genes 0.000 description 3
- 108010052343 Gastrins Proteins 0.000 description 3
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 3
- 101800001586 Ghrelin Proteins 0.000 description 3
- 102400000442 Ghrelin-28 Human genes 0.000 description 3
- 102000034615 Glial cell line-derived neurotrophic factor Human genes 0.000 description 3
- 108091010837 Glial cell line-derived neurotrophic factor Proteins 0.000 description 3
- 108010017080 Granulocyte Colony-Stimulating Factor Proteins 0.000 description 3
- 102000004269 Granulocyte Colony-Stimulating Factor Human genes 0.000 description 3
- 108010051696 Growth Hormone Proteins 0.000 description 3
- 102000018997 Growth Hormone Human genes 0.000 description 3
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 3
- 108010007622 LDL Lipoproteins Proteins 0.000 description 3
- 102000007330 LDL Lipoproteins Human genes 0.000 description 3
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 3
- URLZCHNOLZSCCA-VABKMULXSA-N Leu-enkephalin Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CNC(=O)CNC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=CC=C1 URLZCHNOLZSCCA-VABKMULXSA-N 0.000 description 3
- 102000009151 Luteinizing Hormone Human genes 0.000 description 3
- 108010073521 Luteinizing Hormone Proteins 0.000 description 3
- 101710151321 Melanostatin Proteins 0.000 description 3
- YJPIGAIKUZMOQA-UHFFFAOYSA-N Melatonin Natural products COC1=CC=C2N(C(C)=O)C=C(CCN)C2=C1 YJPIGAIKUZMOQA-UHFFFAOYSA-N 0.000 description 3
- 241001529936 Murinae Species 0.000 description 3
- 102400000064 Neuropeptide Y Human genes 0.000 description 3
- 108090000573 Osteocalcin Proteins 0.000 description 3
- 102000004067 Osteocalcin Human genes 0.000 description 3
- 101800000989 Oxytocin Proteins 0.000 description 3
- 102400000050 Oxytocin Human genes 0.000 description 3
- XNOPRXBHLZRZKH-UHFFFAOYSA-N Oxytocin Natural products N1C(=O)C(N)CSSCC(C(=O)N2C(CCC2)C(=O)NC(CC(C)C)C(=O)NCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(CCC(N)=O)NC(=O)C(C(C)CC)NC(=O)C1CC1=CC=C(O)C=C1 XNOPRXBHLZRZKH-UHFFFAOYSA-N 0.000 description 3
- 101800004937 Protein C Proteins 0.000 description 3
- 108010029485 Protein Isoforms Proteins 0.000 description 3
- 102000001708 Protein Isoforms Human genes 0.000 description 3
- 230000008305 RNA mechanism Effects 0.000 description 3
- 108091027981 Response element Proteins 0.000 description 3
- 102100037505 Secretin Human genes 0.000 description 3
- 108010086019 Secretin Proteins 0.000 description 3
- 108010056088 Somatostatin Proteins 0.000 description 3
- 102000005157 Somatostatin Human genes 0.000 description 3
- 239000012505 Superdex™ Substances 0.000 description 3
- 102000036693 Thrombopoietin Human genes 0.000 description 3
- 108010041111 Thrombopoietin Proteins 0.000 description 3
- 108060008682 Tumor Necrosis Factor Proteins 0.000 description 3
- 102000000852 Tumor Necrosis Factor-alpha Human genes 0.000 description 3
- 206010067584 Type 1 diabetes mellitus Diseases 0.000 description 3
- GXBMIBRIOWHPDT-UHFFFAOYSA-N Vasopressin Natural products N1C(=O)C(CC=2C=C(O)C=CC=2)NC(=O)C(N)CSSCC(C(=O)N2C(CCC2)C(=O)NC(CCCN=C(N)N)C(=O)NCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(CCC(N)=O)NC(=O)C1CC1=CC=CC=C1 GXBMIBRIOWHPDT-UHFFFAOYSA-N 0.000 description 3
- 238000002835 absorbance Methods 0.000 description 3
- 230000004913 activation Effects 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 125000003277 amino group Chemical group 0.000 description 3
- 230000001512 anti-cytomegaloviral effect Effects 0.000 description 3
- 230000003302 anti-idiotype Effects 0.000 description 3
- 229940009098 aspartate Drugs 0.000 description 3
- FZCSTZYAHCUGEM-UHFFFAOYSA-N aspergillomarasmine B Natural products OC(=O)CNC(C(O)=O)CNC(C(O)=O)CC(O)=O FZCSTZYAHCUGEM-UHFFFAOYSA-N 0.000 description 3
- 239000011324 bead Substances 0.000 description 3
- 210000004369 blood Anatomy 0.000 description 3
- 239000008280 blood Substances 0.000 description 3
- 229940077737 brain-derived neurotrophic factor Drugs 0.000 description 3
- 238000009395 breeding Methods 0.000 description 3
- 230000001488 breeding effect Effects 0.000 description 3
- 210000004899 c-terminal region Anatomy 0.000 description 3
- 229960004015 calcitonin Drugs 0.000 description 3
- BBBFJLBPOGFECG-VJVYQDLKSA-N calcitonin Chemical compound N([C@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N1[C@@H](CCC1)C(N)=O)C(C)C)C(=O)[C@@H]1CSSC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1 BBBFJLBPOGFECG-VJVYQDLKSA-N 0.000 description 3
- 229960004602 capreomycin Drugs 0.000 description 3
- AOXOCDRNSPFDPE-UKEONUMOSA-N chembl413654 Chemical compound C([C@H](C(=O)NCC(=O)N[C@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@H](CCSC)C(=O)N[C@H](CC(O)=O)C(=O)N[C@H](CC=1C=CC=CC=1)C(N)=O)NC(=O)[C@@H](C)NC(=O)[C@@H](CCC(O)=O)NC(=O)[C@@H](CCC(O)=O)NC(=O)[C@@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H]1N(CCC1)C(=O)CNC(=O)[C@@H](N)CCC(O)=O)C1=CC=C(O)C=C1 AOXOCDRNSPFDPE-UKEONUMOSA-N 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 238000011033 desalting Methods 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 206010012601 diabetes mellitus Diseases 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 229940012413 factor vii Drugs 0.000 description 3
- 229960000301 factor viii Drugs 0.000 description 3
- 239000007850 fluorescent dye Substances 0.000 description 3
- 229940028334 follicle stimulating hormone Drugs 0.000 description 3
- 230000002496 gastric effect Effects 0.000 description 3
- 238000002523 gelfiltration Methods 0.000 description 3
- GNKDKYIHGQKHHM-RJKLHVOGSA-N ghrelin Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)CN)COC(=O)CCCCCCC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 GNKDKYIHGQKHHM-RJKLHVOGSA-N 0.000 description 3
- 238000004128 high performance liquid chromatography Methods 0.000 description 3
- 238000007901 in situ hybridization Methods 0.000 description 3
- ZPNFWUPYTFPOJU-LPYSRVMUSA-N iniprol Chemical compound C([C@H]1C(=O)NCC(=O)NCC(=O)N[C@H]2CSSC[C@H]3C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC=4C=CC=CC=4)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC=4C=CC=CC=4)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC2=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC=2C=CC=CC=2)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H]2N(CCC2)C(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N2[C@@H](CCC2)C(=O)N2[C@@H](CCC2)C(=O)N[C@@H](CC=2C=CC(O)=CC=2)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N2[C@@H](CCC2)C(=O)N3)C(=O)NCC(=O)NCC(=O)N[C@@H](C)C(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@H](C(=O)N1)C(C)C)[C@@H](C)O)[C@@H](C)CC)=O)[C@@H](C)CC)C1=CC=C(O)C=C1 ZPNFWUPYTFPOJU-LPYSRVMUSA-N 0.000 description 3
- 108010034529 leucyl-lysine Proteins 0.000 description 3
- 238000011068 loading method Methods 0.000 description 3
- 229940040129 luteinizing hormone Drugs 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 229960003987 melatonin Drugs 0.000 description 3
- DRLFMBDRBRZALE-UHFFFAOYSA-N melatonin Chemical compound COC1=CC=C2NC=C(CCNC(C)=O)C2=C1 DRLFMBDRBRZALE-UHFFFAOYSA-N 0.000 description 3
- 230000011987 methylation Effects 0.000 description 3
- 238000007069 methylation reaction Methods 0.000 description 3
- URPYMXQQVHTUDU-OFGSCBOVSA-N nucleopeptide y Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(N)=O)NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 URPYMXQQVHTUDU-OFGSCBOVSA-N 0.000 description 3
- XNOPRXBHLZRZKH-DSZYJQQASA-N oxytocin Chemical compound C([C@H]1C(=O)N[C@H](C(N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CSSC[C@H](N)C(=O)N1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(N)=O)=O)[C@@H](C)CC)C1=CC=C(O)C=C1 XNOPRXBHLZRZKH-DSZYJQQASA-N 0.000 description 3
- 229960001723 oxytocin Drugs 0.000 description 3
- 235000013594 poultry meat Nutrition 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000002829 reductive effect Effects 0.000 description 3
- 229940107685 reopro Drugs 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 229960002101 secretin Drugs 0.000 description 3
- OWMZNFCDEHGFEP-NFBCVYDUSA-N secretin human Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(N)=O)[C@@H](C)O)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)C1=CC=CC=C1 OWMZNFCDEHGFEP-NFBCVYDUSA-N 0.000 description 3
- 230000035945 sensitivity Effects 0.000 description 3
- 230000035938 sexual maturation Effects 0.000 description 3
- 238000001542 size-exclusion chromatography Methods 0.000 description 3
- 229960000553 somatostatin Drugs 0.000 description 3
- NHXLMOGPVYXJNR-ATOGVRKGSA-N somatostatin Chemical compound C([C@H]1C(=O)N[C@H](C(N[C@@H](CO)C(=O)N[C@@H](CSSC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(=O)N1)[C@@H](C)O)NC(=O)CNC(=O)[C@H](C)N)C(O)=O)=O)[C@H](O)C)C1=CC=CC=C1 NHXLMOGPVYXJNR-ATOGVRKGSA-N 0.000 description 3
- 150000003431 steroids Chemical class 0.000 description 3
- 239000000724 thymus hormone Substances 0.000 description 3
- 239000013603 viral vector Substances 0.000 description 3
- 230000003612 virological effect Effects 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- SFLSHLFXELFNJZ-QMMMGPOBSA-N (-)-norepinephrine Chemical compound NC[C@H](O)C1=CC=C(O)C(O)=C1 SFLSHLFXELFNJZ-QMMMGPOBSA-N 0.000 description 2
- WCSPDMCSKYUFBX-ZJZGAYNASA-N (2s)-n-[(2s)-1-amino-1-oxo-3-phenylpropan-2-yl]-2-[[(2s)-2-[[(2s)-2-amino-3-phenylpropanoyl]amino]-4-methylsulfanylbutanoyl]amino]-5-(diaminomethylideneamino)pentanamide Chemical compound C([C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(N)=O)C1=CC=CC=C1 WCSPDMCSKYUFBX-ZJZGAYNASA-N 0.000 description 2
- NVEXXUGCBSXDLS-LNEXRSTESA-N (4S)-4-[[2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[[(2S)-1-[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S,3R)-2-[[(2S)-2-[[(2S)-2-[[(2S)-6-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[2-[[2-[[(2S)-2-amino-1-hydroxy-3-(4-hydroxyphenyl)propylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxyethylidene]amino]-1-hydroxy-3-phenylpropylidene]amino]-1-hydroxy-4-methylpentylidene]amino]-5-carbamimidamido-1-hydroxypentylidene]amino]-5-carbamimidamido-1-hydroxypentylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-1-hydroxy-3-phenylpropylidene]amino]-1-hydroxyhexylidene]amino]-1-hydroxy-3-methylbutylidene]amino]-1-hydroxy-3-methylbutylidene]amino]-1,3-dihydroxybutylidene]amino]-5-carbamimidamido-1-hydroxypentylidene]amino]-1,3-dihydroxypropylidene]amino]-1,5-dihydroxy-5-iminopentylidene]amino]-4-carboxy-1-hydroxybutylidene]amino]-3-carboxypropanoyl]pyrrolidin-2-yl]-hydroxymethylidene]amino]-1,4-dihydroxy-4-iminobutylidene]amino]-1-hydroxypropylidene]amino]-1-hydroxy-3-(4-hydroxyphenyl)propylidene]amino]-1,3-dihydroxypropylidene]amino]-1-hydroxyethylidene]amino]-5-[(2S)-1-[(2S)-1-[(2S)-3-carboxy-1-[(1S)-1-carboxyethyl]imino-1-hydroxypropan-2-yl]imino-1-hydroxy-3-phenylpropan-2-yl]imino-1-hydroxy-4-methylpentan-2-yl]imino-5-hydroxypentanoic acid Chemical compound CC(C)C[C@H](\N=C(/O)[C@H](CCC(O)=O)\N=C(/O)C\N=C(/O)[C@H](CO)\N=C(/O)[C@H](Cc1ccc(O)cc1)\N=C(/O)[C@H](C)\N=C(/O)[C@H](CC(O)=N)\N=C(/O)[C@@H]1CCCN1C(=O)[C@H](CC(O)=O)\N=C(/O)[C@H](CCC(O)=O)\N=C(/O)[C@H](CCC(O)=N)\N=C(/O)[C@H](CO)\N=C(/O)[C@H](CCCNC(N)=N)\N=C(/O)[C@@H](\N=C(/O)[C@@H](\N=C(/O)[C@@H](\N=C(/O)[C@H](CCCCN)\N=C(/O)[C@H](Cc1ccccc1)\N=C(/O)[C@H](CCC(O)=N)\N=C(/O)[C@H](CCCNC(N)=N)\N=C(/O)[C@H](CCCNC(N)=N)\N=C(/O)[C@H](CC(C)C)\N=C(/O)[C@H](Cc1ccccc1)\N=C(/O)C\N=C(/O)C\N=C(/O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(C)C)[C@@H](C)O)C(\O)=N\[C@@H](Cc1ccccc1)C(\O)=N\[C@@H](CC(O)=O)C(\O)=N\[C@@H](C)C(O)=O NVEXXUGCBSXDLS-LNEXRSTESA-N 0.000 description 2
- VOXZDWNPVJITMN-ZBRFXRBCSA-N 17β-estradiol Chemical compound OC1=CC=C2[C@H]3CC[C@](C)([C@H](CC4)O)[C@@H]4[C@@H]3CCC2=C1 VOXZDWNPVJITMN-ZBRFXRBCSA-N 0.000 description 2
- 108010041801 2',3'-Cyclic Nucleotide 3'-Phosphodiesterase Proteins 0.000 description 2
- VLEIUWBSEKKKFX-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;2-[2-[bis(carboxymethyl)amino]ethyl-(carboxymethyl)amino]acetic acid Chemical compound OCC(N)(CO)CO.OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O VLEIUWBSEKKKFX-UHFFFAOYSA-N 0.000 description 2
- HVCOBJNICQPDBP-UHFFFAOYSA-N 3-[3-[3,5-dihydroxy-6-methyl-4-(3,4,5-trihydroxy-6-methyloxan-2-yl)oxyoxan-2-yl]oxydecanoyloxy]decanoic acid;hydrate Chemical compound O.OC1C(OC(CC(=O)OC(CCCCCCC)CC(O)=O)CCCCCCC)OC(C)C(O)C1OC1C(O)C(O)C(O)C(C)O1 HVCOBJNICQPDBP-UHFFFAOYSA-N 0.000 description 2
- WBLZUCOIBUDNBV-UHFFFAOYSA-N 3-nitropropanoic acid Chemical compound OC(=O)CC[N+]([O-])=O WBLZUCOIBUDNBV-UHFFFAOYSA-N 0.000 description 2
- QXZBMSIDSOZZHK-DOPDSADYSA-N 31362-50-2 Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(N)=O)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H]1NC(=O)CC1)C(C)C)C1=CNC=N1 QXZBMSIDSOZZHK-DOPDSADYSA-N 0.000 description 2
- VOUAQYXWVJDEQY-QENPJCQMSA-N 33017-11-7 Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)NCC(=O)NCC(=O)N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N1[C@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)CCC1 VOUAQYXWVJDEQY-QENPJCQMSA-N 0.000 description 2
- 101710169336 5'-deoxyadenosine deaminase Proteins 0.000 description 2
- LVRVABPNVHYXRT-BQWXUCBYSA-N 52906-92-0 Chemical compound C([C@H](N)C(=O)N[C@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O)C(C)C)C1=CC=CC=C1 LVRVABPNVHYXRT-BQWXUCBYSA-N 0.000 description 2
- HFDKKNHCYWNNNQ-YOGANYHLSA-N 75976-10-2 Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(N)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CNC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@@H](NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](C)N)C(C)C)[C@@H](C)O)C1=CC=C(O)C=C1 HFDKKNHCYWNNNQ-YOGANYHLSA-N 0.000 description 2
- 101150079978 AGRN gene Proteins 0.000 description 2
- 235000009434 Actinidia chinensis Nutrition 0.000 description 2
- 235000009436 Actinidia deliciosa Nutrition 0.000 description 2
- 108010059616 Activins Proteins 0.000 description 2
- 102000005606 Activins Human genes 0.000 description 2
- 102000055025 Adenosine deaminases Human genes 0.000 description 2
- 229920000936 Agarose Polymers 0.000 description 2
- 102100040026 Agrin Human genes 0.000 description 2
- 108700019743 Agrin Proteins 0.000 description 2
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 2
- 102000015427 Angiotensins Human genes 0.000 description 2
- 108010064733 Angiotensins Proteins 0.000 description 2
- 108010071619 Apolipoproteins Proteins 0.000 description 2
- 102000007592 Apolipoproteins Human genes 0.000 description 2
- 102000013585 Bombesin Human genes 0.000 description 2
- 108010051479 Bombesin Proteins 0.000 description 2
- 101800004538 Bradykinin Proteins 0.000 description 2
- 102400000667 Brain natriuretic peptide 32 Human genes 0.000 description 2
- 101800000407 Brain natriuretic peptide 32 Proteins 0.000 description 2
- 101800002247 Brain natriuretic peptide 45 Proteins 0.000 description 2
- YNXLOPYTAAFMTN-SBUIBGKBSA-N C([C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(N)=O)C1=CC=C(O)C=C1 Chemical compound C([C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(N)=O)C1=CC=C(O)C=C1 YNXLOPYTAAFMTN-SBUIBGKBSA-N 0.000 description 2
- 108010075254 C-Peptide Proteins 0.000 description 2
- 102100031478 C-type natriuretic peptide Human genes 0.000 description 2
- 108090000932 Calcitonin Gene-Related Peptide Proteins 0.000 description 2
- 102000004414 Calcitonin Gene-Related Peptide Human genes 0.000 description 2
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 2
- 102000014914 Carrier Proteins Human genes 0.000 description 2
- 108010078791 Carrier Proteins Proteins 0.000 description 2
- 241000271560 Casuariidae Species 0.000 description 2
- 102000006433 Chemokine CCL22 Human genes 0.000 description 2
- 108010083701 Chemokine CCL22 Proteins 0.000 description 2
- 102000004410 Cholesterol 7-alpha-monooxygenases Human genes 0.000 description 2
- 108090000943 Cholesterol 7-alpha-monooxygenases Proteins 0.000 description 2
- 102100021809 Chorionic somatomammotropin hormone 1 Human genes 0.000 description 2
- 108050006018 Coagulation factor VII Proteins 0.000 description 2
- 102100030563 Coagulation factor XI Human genes 0.000 description 2
- 206010053567 Coagulopathies Diseases 0.000 description 2
- 108060005980 Collagenase Proteins 0.000 description 2
- 102000029816 Collagenase Human genes 0.000 description 2
- 108010071942 Colony-Stimulating Factors Proteins 0.000 description 2
- 208000035473 Communicable disease Diseases 0.000 description 2
- 102100032768 Complement receptor type 2 Human genes 0.000 description 2
- 101710143772 Complement receptor type 2 Proteins 0.000 description 2
- 102100030851 Cortistatin Human genes 0.000 description 2
- 229930185483 Cortistatin Natural products 0.000 description 2
- 241000272177 Cuculiformes Species 0.000 description 2
- JDHMXPSXWMPYQZ-AAEUAGOBSA-N Cys-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N JDHMXPSXWMPYQZ-AAEUAGOBSA-N 0.000 description 2
- 108010019673 Darbepoetin alfa Proteins 0.000 description 2
- 241000255581 Drosophila <fruit fly, genus> Species 0.000 description 2
- 108010065372 Dynorphins Proteins 0.000 description 2
- 238000012286 ELISA Assay Methods 0.000 description 2
- 241000272060 Elapidae Species 0.000 description 2
- 108010049140 Endorphins Proteins 0.000 description 2
- 102000009025 Endorphins Human genes 0.000 description 2
- 102000002045 Endothelin Human genes 0.000 description 2
- 108050009340 Endothelin Proteins 0.000 description 2
- 241000283073 Equus caballus Species 0.000 description 2
- 108010008165 Etanercept Proteins 0.000 description 2
- 101800000164 FMRF-amide Proteins 0.000 description 2
- 108010014173 Factor X Proteins 0.000 description 2
- 108010074864 Factor XI Proteins 0.000 description 2
- 108010080865 Factor XII Proteins 0.000 description 2
- 102000000429 Factor XII Human genes 0.000 description 2
- 102000008946 Fibrinogen Human genes 0.000 description 2
- 108010049003 Fibrinogen Proteins 0.000 description 2
- 102000003972 Fibroblast growth factor 7 Human genes 0.000 description 2
- 108090000385 Fibroblast growth factor 7 Proteins 0.000 description 2
- 108090001126 Furin Proteins 0.000 description 2
- 102000004961 Furin Human genes 0.000 description 2
- 102400001370 Galanin Human genes 0.000 description 2
- 101800002068 Galanin Proteins 0.000 description 2
- 241000272496 Galliformes Species 0.000 description 2
- 241000287830 Gallus sp. Species 0.000 description 2
- 102000004862 Gastrin releasing peptide Human genes 0.000 description 2
- 108090001053 Gastrin releasing peptide Proteins 0.000 description 2
- 102000051325 Glucagon Human genes 0.000 description 2
- 108060003199 Glucagon Proteins 0.000 description 2
- 108010088406 Glucagon-Like Peptides Proteins 0.000 description 2
- 102100031132 Glucose-6-phosphate isomerase Human genes 0.000 description 2
- 108010070600 Glucose-6-phosphate isomerase Proteins 0.000 description 2
- 108010024636 Glutathione Proteins 0.000 description 2
- 102000005720 Glutathione transferase Human genes 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 2
- 229930186217 Glycolipid Natural products 0.000 description 2
- 102100034221 Growth-regulated alpha protein Human genes 0.000 description 2
- QXZGBUJJYSLZLT-UHFFFAOYSA-N H-Arg-Pro-Pro-Gly-Phe-Ser-Pro-Phe-Arg-OH Natural products NC(N)=NCCCC(N)C(=O)N1CCCC1C(=O)N1C(C(=O)NCC(=O)NC(CC=2C=CC=CC=2)C(=O)NC(CO)C(=O)N2C(CCC2)C(=O)NC(CC=2C=CC=CC=2)C(=O)NC(CCCN=C(N)N)C(O)=O)CCC1 QXZGBUJJYSLZLT-UHFFFAOYSA-N 0.000 description 2
- 239000012981 Hank's balanced salt solution Substances 0.000 description 2
- 102000002812 Heat-Shock Proteins Human genes 0.000 description 2
- 108010004889 Heat-Shock Proteins Proteins 0.000 description 2
- 101710154606 Hemagglutinin Proteins 0.000 description 2
- 108010000487 High-Molecular-Weight Kininogen Proteins 0.000 description 2
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 2
- 101001069921 Homo sapiens Growth-regulated alpha protein Proteins 0.000 description 2
- 101000603417 Homo sapiens Neuropeptide B Proteins 0.000 description 2
- 101001108235 Homo sapiens Neuropeptide W Proteins 0.000 description 2
- 101001012157 Homo sapiens Receptor tyrosine-protein kinase erbB-2 Proteins 0.000 description 2
- 241000700588 Human alphaherpesvirus 1 Species 0.000 description 2
- 102000002746 Inhibins Human genes 0.000 description 2
- 108010004250 Inhibins Proteins 0.000 description 2
- 102100025306 Integrin alpha-IIb Human genes 0.000 description 2
- 101710149643 Integrin alpha-IIb Proteins 0.000 description 2
- 102000006992 Interferon-alpha Human genes 0.000 description 2
- 108010047761 Interferon-alpha Proteins 0.000 description 2
- 102000014150 Interferons Human genes 0.000 description 2
- 108010050904 Interferons Proteins 0.000 description 2
- 108090000174 Interleukin-10 Proteins 0.000 description 2
- 102000010789 Interleukin-2 Receptors Human genes 0.000 description 2
- 108010038453 Interleukin-2 Receptors Proteins 0.000 description 2
- 108091092195 Intron Proteins 0.000 description 2
- 102000036770 Islet Amyloid Polypeptide Human genes 0.000 description 2
- 108010041872 Islet Amyloid Polypeptide Proteins 0.000 description 2
- 102000002397 Kinins Human genes 0.000 description 2
- 108010093008 Kinins Proteins 0.000 description 2
- XNSAINXGIQZQOO-UHFFFAOYSA-N L-pyroglutamyl-L-histidyl-L-proline amide Natural products NC(=O)C1CCCN1C(=O)C(NC(=O)C1NC(=O)CC1)CC1=CN=CN1 XNSAINXGIQZQOO-UHFFFAOYSA-N 0.000 description 2
- 102000004407 Lactalbumin Human genes 0.000 description 2
- 108090000942 Lactalbumin Proteins 0.000 description 2
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 2
- 102000004058 Leukemia inhibitory factor Human genes 0.000 description 2
- 108090000581 Leukemia inhibitory factor Proteins 0.000 description 2
- 102400000236 Leumorphin Human genes 0.000 description 2
- 102000003960 Ligases Human genes 0.000 description 2
- 108090000364 Ligases Proteins 0.000 description 2
- 102000004895 Lipoproteins Human genes 0.000 description 2
- 108090001030 Lipoproteins Proteins 0.000 description 2
- 102000004083 Lymphotoxin-alpha Human genes 0.000 description 2
- 108090000542 Lymphotoxin-alpha Proteins 0.000 description 2
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 2
- 102400001132 Melanin-concentrating hormone Human genes 0.000 description 2
- 101800002739 Melanin-concentrating hormone Proteins 0.000 description 2
- 239000000637 Melanocyte-Stimulating Hormone Substances 0.000 description 2
- 108010007013 Melanocyte-Stimulating Hormones Proteins 0.000 description 2
- 241000289390 Monotremata Species 0.000 description 2
- 101800002372 Motilin Proteins 0.000 description 2
- 102000002419 Motilin Human genes 0.000 description 2
- 108091061960 Naked DNA Proteins 0.000 description 2
- 102100038842 Neuropeptide B Human genes 0.000 description 2
- 102100021875 Neuropeptide W Human genes 0.000 description 2
- 101800001814 Neurotensin Proteins 0.000 description 2
- 102400001103 Neurotensin Human genes 0.000 description 2
- 102000007999 Nuclear Proteins Human genes 0.000 description 2
- 108010089610 Nuclear Proteins Proteins 0.000 description 2
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 2
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 2
- 102400000978 PACAP-related peptide Human genes 0.000 description 2
- 101800002869 PACAP-related peptide Proteins 0.000 description 2
- 101800005322 Pancreastatin Proteins 0.000 description 2
- 102400000203 Pancreastatin Human genes 0.000 description 2
- 102000018886 Pancreatic Polypeptide Human genes 0.000 description 2
- 241000288108 Passeriformes Species 0.000 description 2
- 108010088847 Peptide YY Proteins 0.000 description 2
- 102100029909 Peptide YY Human genes 0.000 description 2
- 102100039087 Peptidyl-alpha-hydroxyglycine alpha-amidating lyase Human genes 0.000 description 2
- 101710189920 Peptidyl-alpha-hydroxyglycine alpha-amidating lyase Proteins 0.000 description 2
- 108010069013 Phenylalanine Hydroxylase Proteins 0.000 description 2
- 102100038223 Phenylalanine-4-hydroxylase Human genes 0.000 description 2
- 241000287509 Piciformes Species 0.000 description 2
- 108010004684 Pituitary adenylate cyclase-activating polypeptide Proteins 0.000 description 2
- 102000002808 Pituitary adenylate cyclase-activating polypeptide Human genes 0.000 description 2
- 108010003044 Placental Lactogen Proteins 0.000 description 2
- 239000000381 Placental Lactogen Substances 0.000 description 2
- 108010069820 Pro-Opiomelanocortin Proteins 0.000 description 2
- 239000000683 Pro-Opiomelanocortin Substances 0.000 description 2
- 102100024622 Proenkephalin-B Human genes 0.000 description 2
- 108010072866 Prostate-Specific Antigen Proteins 0.000 description 2
- 102100038358 Prostate-specific antigen Human genes 0.000 description 2
- 101710176177 Protein A56 Proteins 0.000 description 2
- 108010094028 Prothrombin Proteins 0.000 description 2
- 206010037742 Rabies Diseases 0.000 description 2
- 102100030086 Receptor tyrosine-protein kinase erbB-2 Human genes 0.000 description 2
- 241000725643 Respiratory syncytial virus Species 0.000 description 2
- 108010039491 Ricin Proteins 0.000 description 2
- 108091006774 SLC18A3 Proteins 0.000 description 2
- 102400000827 Saposin-D Human genes 0.000 description 2
- 101800001700 Saposin-D Proteins 0.000 description 2
- 101710205037 Sarafotoxin Proteins 0.000 description 2
- 108020004459 Small interfering RNA Proteins 0.000 description 2
- CDBYLPFSWZWCQE-UHFFFAOYSA-L Sodium Carbonate Chemical compound [Na+].[Na+].[O-]C([O-])=O CDBYLPFSWZWCQE-UHFFFAOYSA-L 0.000 description 2
- UIIMBOGNXHQVGW-UHFFFAOYSA-M Sodium bicarbonate Chemical compound [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 2
- 102000013275 Somatomedins Human genes 0.000 description 2
- 108010023197 Streptokinase Proteins 0.000 description 2
- 241001415849 Strigiformes Species 0.000 description 2
- 241000271567 Struthioniformes Species 0.000 description 2
- 108090000787 Subtilisin Proteins 0.000 description 2
- 101000983124 Sus scrofa Pancreatic prohormone precursor Proteins 0.000 description 2
- 102100025237 T-cell surface antigen CD2 Human genes 0.000 description 2
- 210000001744 T-lymphocyte Anatomy 0.000 description 2
- 239000004098 Tetracycline Substances 0.000 description 2
- 102000002933 Thioredoxin Human genes 0.000 description 2
- 108010000499 Thromboplastin Proteins 0.000 description 2
- 102000002262 Thromboplastin Human genes 0.000 description 2
- 239000000627 Thyrotropin-Releasing Hormone Substances 0.000 description 2
- 101800004623 Thyrotropin-releasing hormone Proteins 0.000 description 2
- 102400000336 Thyrotropin-releasing hormone Human genes 0.000 description 2
- 102100030951 Tissue factor pathway inhibitor Human genes 0.000 description 2
- 102400001320 Transforming growth factor alpha Human genes 0.000 description 2
- 101800004564 Transforming growth factor alpha Proteins 0.000 description 2
- 108090000631 Trypsin Proteins 0.000 description 2
- 102000004142 Trypsin Human genes 0.000 description 2
- 102000005630 Urocortins Human genes 0.000 description 2
- 108010059705 Urocortins Proteins 0.000 description 2
- 108010062497 VLDL Lipoproteins Proteins 0.000 description 2
- 108010003205 Vasoactive Intestinal Peptide Proteins 0.000 description 2
- 102000055135 Vasoactive Intestinal Peptide Human genes 0.000 description 2
- 102000045965 Vesicular Acetylcholine Transport Proteins Human genes 0.000 description 2
- 108010020033 Vesicular Monoamine Transport Proteins Proteins 0.000 description 2
- 102000009659 Vesicular Monoamine Transport Proteins Human genes 0.000 description 2
- 101710087237 Whey acidic protein Proteins 0.000 description 2
- 230000002378 acidificating effect Effects 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 239000000488 activin Substances 0.000 description 2
- UCTWMZQNUQWSLP-UHFFFAOYSA-N adrenaline Chemical compound CNCC(O)C1=CC=C(O)C(O)=C1 UCTWMZQNUQWSLP-UHFFFAOYSA-N 0.000 description 2
- 238000001261 affinity purification Methods 0.000 description 2
- 108010050122 alpha 1-Antitrypsin Proteins 0.000 description 2
- 102000015395 alpha 1-Antitrypsin Human genes 0.000 description 2
- 229940024142 alpha 1-antitrypsin Drugs 0.000 description 2
- 108010030291 alpha-Galactosidase Proteins 0.000 description 2
- 102000005840 alpha-Galactosidase Human genes 0.000 description 2
- 150000001408 amides Chemical class 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 235000011130 ammonium sulphate Nutrition 0.000 description 2
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 2
- 229960000723 ampicillin Drugs 0.000 description 2
- 238000003975 animal breeding Methods 0.000 description 2
- 230000000844 anti-bacterial effect Effects 0.000 description 2
- 230000002942 anti-growth Effects 0.000 description 2
- 230000002001 anti-metastasis Effects 0.000 description 2
- 230000003097 anti-respiratory effect Effects 0.000 description 2
- 230000001494 anti-thymocyte effect Effects 0.000 description 2
- 230000000840 anti-viral effect Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 125000003118 aryl group Chemical group 0.000 description 2
- 238000003556 assay Methods 0.000 description 2
- 210000003719 b-lymphocyte Anatomy 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 238000010923 batch production Methods 0.000 description 2
- 108010042362 beta-Lipotropin Proteins 0.000 description 2
- 230000004071 biological effect Effects 0.000 description 2
- 229940057336 black widow spider venom Drugs 0.000 description 2
- QXZGBUJJYSLZLT-FDISYFBBSA-N bradykinin Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(=O)NCC(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CO)C(=O)N2[C@@H](CCC2)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CCC1 QXZGBUJJYSLZLT-FDISYFBBSA-N 0.000 description 2
- 244000309466 calf Species 0.000 description 2
- 238000002619 cancer immunotherapy Methods 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 239000005018 casein Substances 0.000 description 2
- BECPQYXYKAMYBN-UHFFFAOYSA-N casein, tech. Chemical compound NCCCCC(C(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(CC(C)C)N=C(O)C(CCC(O)=O)N=C(O)C(CC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(C(C)O)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=N)N=C(O)C(CCC(O)=O)N=C(O)C(CCC(O)=O)N=C(O)C(COP(O)(O)=O)N=C(O)C(CCC(O)=N)N=C(O)C(N)CC1=CC=CC=C1 BECPQYXYKAMYBN-UHFFFAOYSA-N 0.000 description 2
- 235000021240 caseins Nutrition 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- XQNAUQUKWRBODG-UHFFFAOYSA-N chlornitrofen Chemical compound C1=CC([N+](=O)[O-])=CC=C1OC1=C(Cl)C=C(Cl)C=C1Cl XQNAUQUKWRBODG-UHFFFAOYSA-N 0.000 description 2
- 230000035602 clotting Effects 0.000 description 2
- 229960002424 collagenase Drugs 0.000 description 2
- 230000000052 comparative effect Effects 0.000 description 2
- 210000002808 connective tissue Anatomy 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- 108010005430 cortistatin Proteins 0.000 description 2
- DDRPLNQJNRBRNY-WYYADCIBSA-N cortistatin-14 Chemical compound C([C@H]1C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CSSC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N1)NC(=O)[C@H]1NCCC1)C(=O)N[C@@H](CCCCN)C(O)=O)=O)[C@H](O)C)C1=CC=CC=C1 DDRPLNQJNRBRNY-WYYADCIBSA-N 0.000 description 2
- 230000008878 coupling Effects 0.000 description 2
- 238000010168 coupling process Methods 0.000 description 2
- 238000005859 coupling reaction Methods 0.000 description 2
- 239000000179 crotalid venom Substances 0.000 description 2
- 108010016616 cysteinylglycine Proteins 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 238000004925 denaturation Methods 0.000 description 2
- 230000036425 denaturation Effects 0.000 description 2
- 229960005156 digoxin Drugs 0.000 description 2
- 238000010790 dilution Methods 0.000 description 2
- 239000012895 dilution Substances 0.000 description 2
- 238000006471 dimerization reaction Methods 0.000 description 2
- VYFYYTLLBUKUHU-UHFFFAOYSA-N dopamine Chemical compound NCCC1=CC=C(O)C(O)=C1 VYFYYTLLBUKUHU-UHFFFAOYSA-N 0.000 description 2
- 238000004520 electroporation Methods 0.000 description 2
- 238000010828 elution Methods 0.000 description 2
- 229940073621 enbrel Drugs 0.000 description 2
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 2
- ZUBDGKVDJUIMQQ-UBFCDGJISA-N endothelin-1 Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)NC(=O)[C@H]1NC(=O)[C@H](CC=2C=CC=CC=2)NC(=O)[C@@H](CC=2C=CC(O)=CC=2)NC(=O)[C@H](C(C)C)NC(=O)[C@H]2CSSC[C@@H](C(N[C@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N2)=O)NC(=O)[C@@H](CO)NC(=O)[C@H](N)CSSC1)C1=CNC=N1 ZUBDGKVDJUIMQQ-UBFCDGJISA-N 0.000 description 2
- 150000002148 esters Chemical class 0.000 description 2
- 229930182833 estradiol Natural products 0.000 description 2
- 229960005309 estradiol Drugs 0.000 description 2
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 2
- 229960005542 ethidium bromide Drugs 0.000 description 2
- 210000003527 eukaryotic cell Anatomy 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 229960004222 factor ix Drugs 0.000 description 2
- 229940012952 fibrinogen Drugs 0.000 description 2
- PUBCCFNQJQKCNC-XKNFJVFFSA-N gastrin-releasingpeptide Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(N)=O)NC(=O)CNC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC=1N=CNC=1)NC(=O)[C@H](CC(N)=O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)CNC(=O)CNC(=O)CNC(=O)[C@H](C)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC(C)C)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(C)C)[C@@H](C)O)C(C)C)C1=CNC=N1 PUBCCFNQJQKCNC-XKNFJVFFSA-N 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- MASNOZXLGMXCHN-ZLPAWPGGSA-N glucagon Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)C(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)[C@@H](C)O)C1=CC=CC=C1 MASNOZXLGMXCHN-ZLPAWPGGSA-N 0.000 description 2
- 229960004666 glucagon Drugs 0.000 description 2
- 229960003180 glutathione Drugs 0.000 description 2
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 2
- 239000000185 hemagglutinin Substances 0.000 description 2
- JYGXADMDTFJGBT-VWUMJDOOSA-N hydrocortisone Chemical compound O=C1CC[C@]2(C)[C@H]3[C@@H](O)C[C@](C)([C@@](CC4)(O)C(=O)CO)[C@@H]4[C@@H]3CCC2=C1 JYGXADMDTFJGBT-VWUMJDOOSA-N 0.000 description 2
- 229940072221 immunoglobulins Drugs 0.000 description 2
- 238000001114 immunoprecipitation Methods 0.000 description 2
- 230000002637 immunotoxin Effects 0.000 description 2
- 239000002596 immunotoxin Substances 0.000 description 2
- 229940051026 immunotoxin Drugs 0.000 description 2
- 231100000608 immunotoxin Toxicity 0.000 description 2
- 238000011534 incubation Methods 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 208000015181 infectious disease Diseases 0.000 description 2
- 229960003971 influenza vaccine Drugs 0.000 description 2
- 239000000893 inhibin Substances 0.000 description 2
- 229940079322 interferon Drugs 0.000 description 2
- 239000003407 interleukin 1 receptor blocking agent Substances 0.000 description 2
- 210000000936 intestine Anatomy 0.000 description 2
- 238000005342 ion exchange Methods 0.000 description 2
- 238000004255 ion exchange chromatography Methods 0.000 description 2
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 2
- 238000011031 large-scale manufacturing process Methods 0.000 description 2
- 108010013555 lipoprotein-associated coagulation inhibitor Proteins 0.000 description 2
- 210000002751 lymph Anatomy 0.000 description 2
- 210000004324 lymphatic system Anatomy 0.000 description 2
- 210000004698 lymphocyte Anatomy 0.000 description 2
- 229940070813 lymphocyte immune globulin Drugs 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- ORRDHOMWDPJSNL-UHFFFAOYSA-N melanin concentrating hormone Chemical compound N1C(=O)C(C(C)C)NC(=O)C(CCCNC(N)=N)NC(=O)CNC(=O)C(C(C)C)NC(=O)C(CCSC)NC(=O)C(NC(=O)C(CCCNC(N)=N)NC(=O)C(NC(=O)C(NC(=O)C(N)CC(O)=O)C(C)O)CCSC)CSSCC(C(=O)NC(CC=2C3=CC=CC=C3NC=2)C(=O)NC(CCC(O)=O)C(=O)NC(C(C)C)C(O)=O)NC(=O)C2CCCN2C(=O)C(CCCNC(N)=N)NC(=O)C1CC1=CC=C(O)C=C1 ORRDHOMWDPJSNL-UHFFFAOYSA-N 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- CWWARWOPSKGELM-SARDKLJWSA-N methyl (2s)-2-[[(2s)-2-[[2-[[(2s)-2-[[(2s)-2-[[(2s)-5-amino-2-[[(2s)-5-amino-2-[[(2s)-1-[(2s)-6-amino-2-[[(2s)-1-[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carbonyl]amino]hexanoyl]pyrrolidine-2-carbonyl]amino]-5-oxopentanoyl]amino]-5 Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)OC)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CCCN=C(N)N)C1=CC=CC=C1 CWWARWOPSKGELM-SARDKLJWSA-N 0.000 description 2
- 239000002395 mineralocorticoid Substances 0.000 description 2
- 239000003068 molecular probe Substances 0.000 description 2
- SLZIZIJTGAYEKK-CIJSCKBQSA-N molport-023-220-247 Chemical compound C([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(N)=O)NC(=O)[C@H]1N(CCC1)C(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)CNC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)CN)[C@@H](C)O)C1=CNC=N1 SLZIZIJTGAYEKK-CIJSCKBQSA-N 0.000 description 2
- 229960003816 muromonab-cd3 Drugs 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 239000000709 neurohypophysis hormone Substances 0.000 description 2
- PCJGZPGTCUMMOT-ISULXFBGSA-N neurotensin Chemical compound C([C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H]1NC(=O)CC1)C1=CC=C(O)C=C1 PCJGZPGTCUMMOT-ISULXFBGSA-N 0.000 description 2
- 229960002748 norepinephrine Drugs 0.000 description 2
- SFLSHLFXELFNJZ-UHFFFAOYSA-N norepinephrine Natural products NCC(O)C1=CC=C(O)C(O)=C1 SFLSHLFXELFNJZ-UHFFFAOYSA-N 0.000 description 2
- 229940094443 oxytocics prostaglandins Drugs 0.000 description 2
- 210000000496 pancreas Anatomy 0.000 description 2
- RYZUEKXRBSXBRH-CTXORKPYSA-N pancreastatin Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](CCCCN)NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)CNC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCCCN)NC(=O)CNC(=O)[C@H](C)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)CN)CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](C)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(N)=O)C1=CN=CN1 RYZUEKXRBSXBRH-CTXORKPYSA-N 0.000 description 2
- 108010091742 peptide F Proteins 0.000 description 2
- RJSZPKZQGIKVAU-UXBJKDEOSA-N peptide f Chemical compound C([C@@H](C(=O)N[C@@H](CCSC)C(O)=O)NC(=O)CNC(=O)CNC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)CNC(=O)[C@H](CC(N)=O)NC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)CNC(=O)CNC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C(C)C)C(C)C)C1=CC=CC=C1 RJSZPKZQGIKVAU-UXBJKDEOSA-N 0.000 description 2
- 239000000813 peptide hormone Substances 0.000 description 2
- 229940037129 plain mineralocorticoids for systemic use Drugs 0.000 description 2
- 238000005498 polishing Methods 0.000 description 2
- 229940068189 posterior pituitary hormone Drugs 0.000 description 2
- 108010074732 preproenkephalin Proteins 0.000 description 2
- 239000000186 progesterone Substances 0.000 description 2
- 229960003387 progesterone Drugs 0.000 description 2
- 150000003180 prostaglandins Chemical class 0.000 description 2
- 229960000856 protein c Drugs 0.000 description 2
- 239000012460 protein solution Substances 0.000 description 2
- XNSAINXGIQZQOO-SRVKXCTJSA-N protirelin Chemical compound NC(=O)[C@@H]1CCCN1C(=O)[C@@H](NC(=O)[C@H]1NC(=O)CC1)CC1=CN=CN1 XNSAINXGIQZQOO-SRVKXCTJSA-N 0.000 description 2
- 238000003127 radioimmunoassay Methods 0.000 description 2
- 238000003753 real-time PCR Methods 0.000 description 2
- 239000003488 releasing hormone Substances 0.000 description 2
- 210000004994 reproductive system Anatomy 0.000 description 2
- 210000005000 reproductive tract Anatomy 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 238000012216 screening Methods 0.000 description 2
- 230000003248 secreting effect Effects 0.000 description 2
- 230000001568 sexual effect Effects 0.000 description 2
- 239000003998 snake venom Substances 0.000 description 2
- 239000002708 spider venom Substances 0.000 description 2
- 238000010186 staining Methods 0.000 description 2
- 238000010561 standard procedure Methods 0.000 description 2
- 229960005202 streptokinase Drugs 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 238000001356 surgical procedure Methods 0.000 description 2
- 101150047061 tag-72 gene Proteins 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 229960003604 testosterone Drugs 0.000 description 2
- 229960002180 tetracycline Drugs 0.000 description 2
- 229930101283 tetracycline Natural products 0.000 description 2
- 235000019364 tetracycline Nutrition 0.000 description 2
- 150000003522 tetracyclines Chemical class 0.000 description 2
- 238000002560 therapeutic procedure Methods 0.000 description 2
- 108060008226 thioredoxin Proteins 0.000 description 2
- 229940094937 thioredoxin Drugs 0.000 description 2
- 229940034199 thyrotropin-releasing hormone Drugs 0.000 description 2
- 239000012588 trypsin Substances 0.000 description 2
- 239000002753 trypsin inhibitor Substances 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 239000000777 urocortin Substances 0.000 description 2
- BJFIDCADFRDPIO-DZCXQCEKSA-N (2S)-N-[(2S)-6-amino-1-[(2-amino-2-oxoethyl)amino]-1-oxohexan-2-yl]-1-[[(4R,7S,10S,13S,16S,19R)-19-amino-7-(2-amino-2-oxoethyl)-10-(3-amino-3-oxopropyl)-16-[(4-hydroxyphenyl)methyl]-6,9,12,15,18-pentaoxo-13-(phenylmethyl)-1,2-dithia-5,8,11,14,17-pentazacycloeicos-4-yl]-oxomethyl]-2-pyrrolidinecarboxamide Chemical compound NCCCC[C@@H](C(=O)NCC(N)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=2C=CC=CC=2)NC(=O)[C@H](CC=2C=CC(O)=CC=2)NC(=O)[C@@H](N)CSSC1 BJFIDCADFRDPIO-DZCXQCEKSA-N 0.000 description 1
- BIIBYWQGRFWQKM-JVVROLKMSA-N (2S)-N-[4-(cyclopropylamino)-3,4-dioxo-1-[(3S)-2-oxopyrrolidin-3-yl]butan-2-yl]-2-[[(E)-3-(2,4-dichlorophenyl)prop-2-enoyl]amino]-4,4-dimethylpentanamide Chemical compound CC(C)(C)C[C@@H](C(NC(C[C@H](CCN1)C1=O)C(C(NC1CC1)=O)=O)=O)NC(/C=C/C(C=CC(Cl)=C1)=C1Cl)=O BIIBYWQGRFWQKM-JVVROLKMSA-N 0.000 description 1
- DQJCDTNMLBYVAY-ZXXIYAEKSA-N (2S,5R,10R,13R)-16-{[(2R,3S,4R,5R)-3-{[(2S,3R,4R,5S,6R)-3-acetamido-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy}-5-(ethylamino)-6-hydroxy-2-(hydroxymethyl)oxan-4-yl]oxy}-5-(4-aminobutyl)-10-carbamoyl-2,13-dimethyl-4,7,12,15-tetraoxo-3,6,11,14-tetraazaheptadecan-1-oic acid Chemical compound NCCCC[C@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CC[C@H](C(N)=O)NC(=O)[C@@H](C)NC(=O)C(C)O[C@@H]1[C@@H](NCC)C(O)O[C@H](CO)[C@H]1O[C@H]1[C@H](NC(C)=O)[C@@H](O)[C@H](O)[C@@H](CO)O1 DQJCDTNMLBYVAY-ZXXIYAEKSA-N 0.000 description 1
- XMQUEQJCYRFIQS-YFKPBYRVSA-N (2s)-2-amino-5-ethoxy-5-oxopentanoic acid Chemical compound CCOC(=O)CC[C@H](N)C(O)=O XMQUEQJCYRFIQS-YFKPBYRVSA-N 0.000 description 1
- MZOFCQQQCNRIBI-VMXHOPILSA-N (3s)-4-[[(2s)-1-[[(2s)-1-[[(1s)-1-carboxy-2-hydroxyethyl]amino]-4-methyl-1-oxopentan-2-yl]amino]-5-(diaminomethylideneamino)-1-oxopentan-2-yl]amino]-3-[[2-[[(2s)-2,6-diaminohexanoyl]amino]acetyl]amino]-4-oxobutanoic acid Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN MZOFCQQQCNRIBI-VMXHOPILSA-N 0.000 description 1
- NMWKYTGJWUAZPZ-WWHBDHEGSA-N (4S)-4-[[(4R,7S,10S,16S,19S,25S,28S,31R)-31-[[(2S)-2-[[(1R,6R,9S,12S,18S,21S,24S,27S,30S,33S,36S,39S,42R,47R,53S,56S,59S,62S,65S,68S,71S,76S,79S,85S)-47-[[(2S)-2-[[(2S)-4-amino-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-amino-3-methylbutanoyl]amino]-3-methylbutanoyl]amino]-3-hydroxypropanoyl]amino]-3-(1H-imidazol-4-yl)propanoyl]amino]-3-phenylpropanoyl]amino]-4-oxobutanoyl]amino]-3-carboxypropanoyl]amino]-18-(4-aminobutyl)-27,68-bis(3-amino-3-oxopropyl)-36,71,76-tribenzyl-39-(3-carbamimidamidopropyl)-24-(2-carboxyethyl)-21,56-bis(carboxymethyl)-65,85-bis[(1R)-1-hydroxyethyl]-59-(hydroxymethyl)-62,79-bis(1H-imidazol-4-ylmethyl)-9-methyl-33-(2-methylpropyl)-8,11,17,20,23,26,29,32,35,38,41,48,54,57,60,63,66,69,72,74,77,80,83,86-tetracosaoxo-30-propan-2-yl-3,4,44,45-tetrathia-7,10,16,19,22,25,28,31,34,37,40,49,55,58,61,64,67,70,73,75,78,81,84,87-tetracosazatetracyclo[40.31.14.012,16.049,53]heptaoctacontane-6-carbonyl]amino]-3-methylbutanoyl]amino]-7-(3-carbamimidamidopropyl)-25-(hydroxymethyl)-19-[(4-hydroxyphenyl)methyl]-28-(1H-imidazol-4-ylmethyl)-10-methyl-6,9,12,15,18,21,24,27,30-nonaoxo-16-propan-2-yl-1,2-dithia-5,8,11,14,17,20,23,26,29-nonazacyclodotriacontane-4-carbonyl]amino]-5-[[(2S)-1-[[(2S)-1-[[(2S)-3-carboxy-1-[[(2S)-1-[[(2S)-1-[[(1S)-1-carboxyethyl]amino]-4-methyl-1-oxopentan-2-yl]amino]-4-methyl-1-oxopentan-2-yl]amino]-1-oxopropan-2-yl]amino]-1-oxopropan-2-yl]amino]-3-(1H-imidazol-4-yl)-1-oxopropan-2-yl]amino]-5-oxopentanoic acid Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](Cc1c[nH]cn1)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CSSC[C@H](NC(=O)[C@@H](NC(=O)[C@@H]2CSSC[C@@H]3NC(=O)[C@H](Cc4ccccc4)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](NC(=O)[C@H](Cc4c[nH]cn4)NC(=O)[C@H](CO)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]4CCCN4C(=O)[C@H](CSSC[C@H](NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](Cc4c[nH]cn4)NC(=O)[C@H](Cc4ccccc4)NC3=O)[C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](Cc3ccccc3)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N3CCC[C@H]3C(=O)N[C@@H](C)C(=O)N2)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](Cc2ccccc2)NC(=O)[C@H](Cc2c[nH]cn2)NC(=O)[C@H](CO)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)C(C)C)[C@@H](C)O)C(C)C)C(=O)N[C@@H](Cc2c[nH]cn2)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](Cc2ccc(O)cc2)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1)C(=O)N[C@@H](C)C(O)=O NMWKYTGJWUAZPZ-WWHBDHEGSA-N 0.000 description 1
- DEQANNDTNATYII-OULOTJBUSA-N (4r,7s,10s,13r,16s,19r)-10-(4-aminobutyl)-19-[[(2r)-2-amino-3-phenylpropanoyl]amino]-16-benzyl-n-[(2r,3r)-1,3-dihydroxybutan-2-yl]-7-[(1r)-1-hydroxyethyl]-13-(1h-indol-3-ylmethyl)-6,9,12,15,18-pentaoxo-1,2-dithia-5,8,11,14,17-pentazacycloicosane-4-carboxa Chemical compound C([C@@H](N)C(=O)N[C@H]1CSSC[C@H](NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](CC=2C3=CC=CC=C3NC=2)NC(=O)[C@H](CC=2C=CC=CC=2)NC1=O)C(=O)N[C@H](CO)[C@H](O)C)C1=CC=CC=C1 DEQANNDTNATYII-OULOTJBUSA-N 0.000 description 1
- UCTWMZQNUQWSLP-VIFPVBQESA-N (R)-adrenaline Chemical compound CNC[C@H](O)C1=CC=C(O)C(O)=C1 UCTWMZQNUQWSLP-VIFPVBQESA-N 0.000 description 1
- 229930182837 (R)-adrenaline Natural products 0.000 description 1
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 1
- 102100025573 1-alkyl-2-acetylglycerophosphocholine esterase Human genes 0.000 description 1
- NVKAWKQGWWIWPM-ABEVXSGRSA-N 17-β-hydroxy-5-α-Androstan-3-one Chemical compound C1C(=O)CC[C@]2(C)[C@H]3CC[C@](C)([C@H](CC4)O)[C@@H]4[C@@H]3CC[C@H]21 NVKAWKQGWWIWPM-ABEVXSGRSA-N 0.000 description 1
- KSXTUUUQYQYKCR-LQDDAWAPSA-M 2,3-bis[[(z)-octadec-9-enoyl]oxy]propyl-trimethylazanium;chloride Chemical compound [Cl-].CCCCCCCC\C=C/CCCCCCCC(=O)OCC(C[N+](C)(C)C)OC(=O)CCCCCCC\C=C/CCCCCCCC KSXTUUUQYQYKCR-LQDDAWAPSA-M 0.000 description 1
- DWKNOLCXIFYNFV-HSZRJFAPSA-N 2-[[(2r)-1-[1-[(4-chloro-3-methylphenyl)methyl]piperidin-4-yl]-5-oxopyrrolidine-2-carbonyl]amino]-n,n,6-trimethylpyridine-4-carboxamide Chemical compound CN(C)C(=O)C1=CC(C)=NC(NC(=O)[C@@H]2N(C(=O)CC2)C2CCN(CC=3C=C(C)C(Cl)=CC=3)CC2)=C1 DWKNOLCXIFYNFV-HSZRJFAPSA-N 0.000 description 1
- CFBILACNYSPRPM-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;2-[[1,3-dihydroxy-2-(hydroxymethyl)propan-2-yl]amino]acetic acid Chemical compound OCC(N)(CO)CO.OCC(CO)(CO)NCC(O)=O CFBILACNYSPRPM-UHFFFAOYSA-N 0.000 description 1
- IVLXQGJVBGMLRR-UHFFFAOYSA-N 2-aminoacetic acid;hydron;chloride Chemical compound Cl.NCC(O)=O IVLXQGJVBGMLRR-UHFFFAOYSA-N 0.000 description 1
- OZDAOHVKBFBBMZ-UHFFFAOYSA-N 2-aminopentanedioic acid;hydrate Chemical compound O.OC(=O)C(N)CCC(O)=O OZDAOHVKBFBBMZ-UHFFFAOYSA-N 0.000 description 1
- NLJVXZFCYKWXLH-DXTIXLATSA-N 3-[(3r,6s,9s,12s,15s,17s,20s,22r,25s,28s)-20-(2-amino-2-oxoethyl)-9-(3-aminopropyl)-3,22,25-tribenzyl-15-[(4-hydroxyphenyl)methyl]-6-(2-methylpropyl)-2,5,8,11,14,18,21,24,27-nonaoxo-12-propan-2-yl-1,4,7,10,13,16,19,23,26-nonazabicyclo[26.3.0]hentriacontan Chemical compound C([C@H]1C(=O)N[C@H](C(=O)N[C@@H](CCCN)C(=O)N[C@H](C(N[C@H](CC=2C=CC=CC=2)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@H](CC=2C=CC=CC=2)C(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(O)=O)N1)=O)CC(C)C)C(C)C)C1=CC=C(O)C=C1 NLJVXZFCYKWXLH-DXTIXLATSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- XZKIHKMTEMTJQX-UHFFFAOYSA-N 4-Nitrophenyl Phosphate Chemical compound OP(O)(=O)OC1=CC=C([N+]([O-])=O)C=C1 XZKIHKMTEMTJQX-UHFFFAOYSA-N 0.000 description 1
- UXHQLGLGLZKHTC-CUNXSJBXSA-N 4-[(3s,3ar)-3-cyclopentyl-7-(4-hydroxypiperidine-1-carbonyl)-3,3a,4,5-tetrahydropyrazolo[3,4-f]quinolin-2-yl]-2-chlorobenzonitrile Chemical compound C1CC(O)CCN1C(=O)C1=CC=C(C=2[C@@H]([C@H](C3CCCC3)N(N=2)C=2C=C(Cl)C(C#N)=CC=2)CC2)C2=N1 UXHQLGLGLZKHTC-CUNXSJBXSA-N 0.000 description 1
- HFGHRUCCKVYFKL-UHFFFAOYSA-N 4-ethoxy-2-piperazin-1-yl-7-pyridin-4-yl-5h-pyrimido[5,4-b]indole Chemical compound C1=C2NC=3C(OCC)=NC(N4CCNCC4)=NC=3C2=CC=C1C1=CC=NC=C1 HFGHRUCCKVYFKL-UHFFFAOYSA-N 0.000 description 1
- QTBSBXVTEAMEQO-UHFFFAOYSA-M Acetate Chemical compound CC([O-])=O QTBSBXVTEAMEQO-UHFFFAOYSA-M 0.000 description 1
- 244000298697 Actinidia deliciosa Species 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- PQSUYGKTWSAVDQ-ZVIOFETBSA-N Aldosterone Chemical compound C([C@@]1([C@@H](C(=O)CO)CC[C@H]1[C@@H]1CC2)C=O)[C@H](O)[C@@H]1[C@]1(C)C2=CC(=O)CC1 PQSUYGKTWSAVDQ-ZVIOFETBSA-N 0.000 description 1
- PQSUYGKTWSAVDQ-UHFFFAOYSA-N Aldosterone Natural products C1CC2C3CCC(C(=O)CO)C3(C=O)CC(O)C2C2(C)C1=CC(=O)CC2 PQSUYGKTWSAVDQ-UHFFFAOYSA-N 0.000 description 1
- 241001455272 Amniota Species 0.000 description 1
- 108090000886 Ananain Proteins 0.000 description 1
- 244000099147 Ananas comosus Species 0.000 description 1
- 235000007119 Ananas comosus Nutrition 0.000 description 1
- 241000272525 Anas platyrhynchos Species 0.000 description 1
- 108090001067 Angiotensinogen Proteins 0.000 description 1
- 102000004881 Angiotensinogen Human genes 0.000 description 1
- 235000002198 Annona diversifolia Nutrition 0.000 description 1
- 241000272814 Anser sp. Species 0.000 description 1
- 108090000935 Antithrombin III Proteins 0.000 description 1
- 102000004411 Antithrombin III Human genes 0.000 description 1
- 101710095342 Apolipoprotein B Proteins 0.000 description 1
- 102100040202 Apolipoprotein B-100 Human genes 0.000 description 1
- 102000013918 Apolipoproteins E Human genes 0.000 description 1
- 108010025628 Apolipoproteins E Proteins 0.000 description 1
- 108010039627 Aprotinin Proteins 0.000 description 1
- 241000726096 Aratinga Species 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- OAMLVOVXNKILLQ-BQBZGAKWSA-N Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O OAMLVOVXNKILLQ-BQBZGAKWSA-N 0.000 description 1
- 108010024976 Asparaginase Proteins 0.000 description 1
- 208000023275 Autoimmune disease Diseases 0.000 description 1
- 108010001478 Bacitracin Proteins 0.000 description 1
- 102100033735 Bactericidal permeability-increasing protein Human genes 0.000 description 1
- UYIFTLBWAOGQBI-BZDYCCQFSA-N Benzhormovarine Chemical compound C([C@@H]1[C@@H](C2=CC=3)CC[C@]4([C@H]1CC[C@@H]4O)C)CC2=CC=3OC(=O)C1=CC=CC=C1 UYIFTLBWAOGQBI-BZDYCCQFSA-N 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 108010049931 Bone Morphogenetic Protein 2 Proteins 0.000 description 1
- 108010049870 Bone Morphogenetic Protein 7 Proteins 0.000 description 1
- 108010007726 Bone Morphogenetic Proteins Proteins 0.000 description 1
- 102000007350 Bone Morphogenetic Proteins Human genes 0.000 description 1
- 102100024506 Bone morphogenetic protein 2 Human genes 0.000 description 1
- 102100022544 Bone morphogenetic protein 7 Human genes 0.000 description 1
- 241000589969 Borreliella burgdorferi Species 0.000 description 1
- 241001416153 Bos grunniens Species 0.000 description 1
- 241000283699 Bos indicus Species 0.000 description 1
- 241000030939 Bubalus bubalis Species 0.000 description 1
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- 241000282832 Camelidae Species 0.000 description 1
- KXDHJXZQYSOELW-UHFFFAOYSA-N Carbamic acid Chemical group NC(O)=O KXDHJXZQYSOELW-UHFFFAOYSA-N 0.000 description 1
- BVKZGUZCCUSVTD-UHFFFAOYSA-L Carbonate Chemical compound [O-]C([O-])=O BVKZGUZCCUSVTD-UHFFFAOYSA-L 0.000 description 1
- 102000004031 Carboxy-Lyases Human genes 0.000 description 1
- 108090000489 Carboxy-Lyases Proteins 0.000 description 1
- 102000003670 Carboxypeptidase B Human genes 0.000 description 1
- 108090000087 Carboxypeptidase B Proteins 0.000 description 1
- 102000000496 Carboxypeptidases A Human genes 0.000 description 1
- 108010080937 Carboxypeptidases A Proteins 0.000 description 1
- 102000055007 Cartilage Oligomeric Matrix Human genes 0.000 description 1
- 108700005376 Cartilage Oligomeric Matrix Proteins 0.000 description 1
- 102000011632 Caseins Human genes 0.000 description 1
- 108010076119 Caseins Proteins 0.000 description 1
- 108090000994 Catalytic RNA Proteins 0.000 description 1
- 102000053642 Catalytic RNA Human genes 0.000 description 1
- 108010059892 Cellulase Proteins 0.000 description 1
- 108090000751 Ceramidases Proteins 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- GHXZTYHSJHQHIJ-UHFFFAOYSA-N Chlorhexidine Chemical compound C=1C=C(Cl)C=CC=1NC(N)=NC(N)=NCCCCCCN=C(N)N=C(N)NC1=CC=C(Cl)C=C1 GHXZTYHSJHQHIJ-UHFFFAOYSA-N 0.000 description 1
- 102000011022 Chorionic Gonadotropin Human genes 0.000 description 1
- 108010062540 Chorionic Gonadotropin Proteins 0.000 description 1
- 102100037529 Coagulation factor V Human genes 0.000 description 1
- 102100026735 Coagulation factor VIII Human genes 0.000 description 1
- 102100029117 Coagulation factor X Human genes 0.000 description 1
- 108010078777 Colistin Proteins 0.000 description 1
- 108010035532 Collagen Proteins 0.000 description 1
- 102000008186 Collagen Human genes 0.000 description 1
- PMATZTZNYRCHOR-CGLBZJNRSA-N Cyclosporin A Chemical compound CC[C@@H]1NC(=O)[C@H]([C@H](O)[C@H](C)C\C=C\C)N(C)C(=O)[C@H](C(C)C)N(C)C(=O)[C@H](CC(C)C)N(C)C(=O)[C@H](CC(C)C)N(C)C(=O)[C@@H](C)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)N(C)C(=O)[C@H](C(C)C)NC(=O)[C@H](CC(C)C)N(C)C(=O)CN(C)C1=O PMATZTZNYRCHOR-CGLBZJNRSA-N 0.000 description 1
- 108010036949 Cyclosporine Proteins 0.000 description 1
- 102000015833 Cystatin Human genes 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108010000437 Deamino Arginine Vasopressin Proteins 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 108010016626 Dipeptides Proteins 0.000 description 1
- 241000271559 Dromaiidae Species 0.000 description 1
- UPEZCKBFRMILAV-JNEQICEOSA-N Ecdysone Natural products O=C1[C@H]2[C@@](C)([C@@H]3C([C@@]4(O)[C@@](C)([C@H]([C@H]([C@@H](O)CCC(O)(C)C)C)CC4)CC3)=C1)C[C@H](O)[C@H](O)C2 UPEZCKBFRMILAV-JNEQICEOSA-N 0.000 description 1
- 241001144268 Echidna Species 0.000 description 1
- 108010014258 Elastin Proteins 0.000 description 1
- 102000016942 Elastin Human genes 0.000 description 1
- 102000005593 Endopeptidases Human genes 0.000 description 1
- 108010059378 Endopeptidases Proteins 0.000 description 1
- 241000709661 Enterovirus Species 0.000 description 1
- 108010056764 Eptifibatide Proteins 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 241000283074 Equus asinus Species 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108010091443 Exopeptidases Proteins 0.000 description 1
- 102000018389 Exopeptidases Human genes 0.000 description 1
- 102000010834 Extracellular Matrix Proteins Human genes 0.000 description 1
- 108010037362 Extracellular Matrix Proteins Proteins 0.000 description 1
- 108010014172 Factor V Proteins 0.000 description 1
- 108010000196 Factor XIIIa Proteins 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- 206010016275 Fear Diseases 0.000 description 1
- 102000009123 Fibrin Human genes 0.000 description 1
- 108010073385 Fibrin Proteins 0.000 description 1
- BWGVNKXGVNDBDI-UHFFFAOYSA-N Fibrin monomer Chemical compound CNC(=O)CNC(=O)CN BWGVNKXGVNDBDI-UHFFFAOYSA-N 0.000 description 1
- 108010088842 Fibrinolysin Proteins 0.000 description 1
- 102000003974 Fibroblast growth factor 2 Human genes 0.000 description 1
- 108090000379 Fibroblast growth factor 2 Proteins 0.000 description 1
- 102100037362 Fibronectin Human genes 0.000 description 1
- 108010067306 Fibronectins Proteins 0.000 description 1
- 241000287227 Fringillidae Species 0.000 description 1
- 102000001390 Fructose-Bisphosphate Aldolase Human genes 0.000 description 1
- 108010068561 Fructose-Bisphosphate Aldolase Proteins 0.000 description 1
- 102100039556 Galectin-4 Human genes 0.000 description 1
- 101000930822 Giardia intestinalis Dipeptidyl-peptidase 4 Proteins 0.000 description 1
- LLEUXCDZPQOJMY-AAEUAGOBSA-N Glu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 LLEUXCDZPQOJMY-AAEUAGOBSA-N 0.000 description 1
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 1
- 108010017544 Glucosylceramidase Proteins 0.000 description 1
- 102000004547 Glucosylceramidase Human genes 0.000 description 1
- 102000006485 Glutamyl Aminopeptidase Human genes 0.000 description 1
- 108010058940 Glutamyl Aminopeptidase Proteins 0.000 description 1
- 108010051815 Glutamyl endopeptidase Proteins 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- WTUSRDZLLWGYAT-KCTSRDHCSA-N Gly-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN WTUSRDZLLWGYAT-KCTSRDHCSA-N 0.000 description 1
- 102000002068 Glycopeptides Human genes 0.000 description 1
- 108010015899 Glycopeptides Proteins 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 102400000932 Gonadoliberin-1 Human genes 0.000 description 1
- 108010069236 Goserelin Proteins 0.000 description 1
- BLCLNMBMMGCOAS-URPVMXJPSA-N Goserelin Chemical compound C([C@@H](C(=O)N[C@H](COC(C)(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(=O)NNC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1NC(=O)CC1)C1=CC=C(O)C=C1 BLCLNMBMMGCOAS-URPVMXJPSA-N 0.000 description 1
- 108010026389 Gramicidin Proteins 0.000 description 1
- 229940033330 HIV vaccine Drugs 0.000 description 1
- 108090000100 Hepatocyte Growth Factor Proteins 0.000 description 1
- 102100021866 Hepatocyte growth factor Human genes 0.000 description 1
- 108010068250 Herpes Simplex Virus Protein Vmw65 Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 101000871785 Homo sapiens Bactericidal permeability-increasing protein Proteins 0.000 description 1
- 101100220047 Homo sapiens CD36 gene Proteins 0.000 description 1
- 101001027836 Homo sapiens Coagulation factor V Proteins 0.000 description 1
- 101000911390 Homo sapiens Coagulation factor VIII Proteins 0.000 description 1
- 101000608765 Homo sapiens Galectin-4 Proteins 0.000 description 1
- 101500026183 Homo sapiens Gonadoliberin-1 Proteins 0.000 description 1
- 101000599951 Homo sapiens Insulin-like growth factor I Proteins 0.000 description 1
- 101000904196 Homo sapiens Pancreatic secretory granule membrane major glycoprotein GP2 Proteins 0.000 description 1
- 101000574060 Homo sapiens Progesterone receptor Proteins 0.000 description 1
- 101000824318 Homo sapiens Protocadherin Fat 1 Proteins 0.000 description 1
- 101000652736 Homo sapiens Transgelin Proteins 0.000 description 1
- 241000701828 Human papillomavirus type 11 Species 0.000 description 1
- 108010003272 Hyaluronate lyase Proteins 0.000 description 1
- 102000001974 Hyaluronidases Human genes 0.000 description 1
- AVXURJPOCDRRFD-UHFFFAOYSA-N Hydroxylamine Chemical compound ON AVXURJPOCDRRFD-UHFFFAOYSA-N 0.000 description 1
- 208000013016 Hypoglycemia Diseases 0.000 description 1
- 102000015611 Hypothalamic Hormones Human genes 0.000 description 1
- 108010024118 Hypothalamic Hormones Proteins 0.000 description 1
- 102000004627 Iduronidase Human genes 0.000 description 1
- 108010003381 Iduronidase Proteins 0.000 description 1
- 108010002231 IgA-specific serine endopeptidase Proteins 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102100037852 Insulin-like growth factor I Human genes 0.000 description 1
- 102100032999 Integrin beta-3 Human genes 0.000 description 1
- 108010020950 Integrin beta3 Proteins 0.000 description 1
- 102000005755 Intercellular Signaling Peptides and Proteins Human genes 0.000 description 1
- 108010070716 Intercellular Signaling Peptides and Proteins Proteins 0.000 description 1
- 102000003996 Interferon-beta Human genes 0.000 description 1
- 108090000467 Interferon-beta Proteins 0.000 description 1
- 102000008070 Interferon-gamma Human genes 0.000 description 1
- 108010074328 Interferon-gamma Proteins 0.000 description 1
- 108090000177 Interleukin-11 Proteins 0.000 description 1
- 108010065805 Interleukin-12 Proteins 0.000 description 1
- 108090000176 Interleukin-13 Proteins 0.000 description 1
- 108010002386 Interleukin-3 Proteins 0.000 description 1
- 108090000978 Interleukin-4 Proteins 0.000 description 1
- 108010002616 Interleukin-5 Proteins 0.000 description 1
- 108090001005 Interleukin-6 Proteins 0.000 description 1
- 108010002586 Interleukin-7 Proteins 0.000 description 1
- 108090001007 Interleukin-8 Proteins 0.000 description 1
- 108010002335 Interleukin-9 Proteins 0.000 description 1
- 102100024319 Intestinal-type alkaline phosphatase Human genes 0.000 description 1
- 101710184243 Intestinal-type alkaline phosphatase Proteins 0.000 description 1
- 206010022998 Irritability Diseases 0.000 description 1
- ZGUNAGUHMKGQNY-ZETCQYMHSA-N L-alpha-phenylglycine zwitterion Chemical group OC(=O)[C@@H](N)C1=CC=CC=C1 ZGUNAGUHMKGQNY-ZETCQYMHSA-N 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- 108010001831 LDL receptors Proteins 0.000 description 1
- 108010059881 Lactase Proteins 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 241000282838 Lama Species 0.000 description 1
- 102000007547 Laminin Human genes 0.000 description 1
- 108010085895 Laminin Proteins 0.000 description 1
- 241000238867 Latrodectus Species 0.000 description 1
- 108010000817 Leuprolide Proteins 0.000 description 1
- 101800000171 Lipovitellin I Proteins 0.000 description 1
- 101800001557 Lipovitellin-1 Proteins 0.000 description 1
- 102100024640 Low-density lipoprotein receptor Human genes 0.000 description 1
- 239000006137 Luria-Bertani broth Substances 0.000 description 1
- 108010048179 Lypressin Proteins 0.000 description 1
- 102000009571 Macrophage Inflammatory Proteins Human genes 0.000 description 1
- 108010009474 Macrophage Inflammatory Proteins Proteins 0.000 description 1
- 201000009906 Meningitis Diseases 0.000 description 1
- 241000272038 Micrurus Species 0.000 description 1
- 101710151805 Mitochondrial intermediate peptidase 1 Proteins 0.000 description 1
- 101100446513 Mus musculus Fgf4 gene Proteins 0.000 description 1
- 102000047918 Myelin Basic Human genes 0.000 description 1
- 108700028031 Myelin Basic Proteins 0.000 description 1
- 108060008487 Myosin Proteins 0.000 description 1
- 102000003505 Myosin Human genes 0.000 description 1
- 102100035044 Myosin light chain kinase, smooth muscle Human genes 0.000 description 1
- ZBJNZFQKYZCUJU-PAHFEQBRSA-N N-[(2S)-4-amino-1-[[(2S,3R)-1-[[(2S)-4-amino-1-oxo-1-[[(3S,6S,9S,12S,15R,18R,21S)-6,9,18-tris(2-aminoethyl)-15-benzyl-3-[(1R)-1-hydroxyethyl]-12-(2-methylpropyl)-2,5,8,11,14,17,20-heptaoxo-1,4,7,10,13,16,19-heptazacyclotricos-21-yl]amino]butan-2-yl]amino]-3-hydroxy-1-oxobutan-2-yl]amino]-1-oxobutan-2-yl]-6-methylheptanamide (6S)-N-[(2S)-4-amino-1-[[(2S,3R)-1-[[(2S)-4-amino-1-oxo-1-[[(3S,6S,9S,12S,15R,18R,21S)-6,9,18-tris(2-aminoethyl)-15-benzyl-3-[(1R)-1-hydroxyethyl]-12-(2-methylpropyl)-2,5,8,11,14,17,20-heptaoxo-1,4,7,10,13,16,19-heptazacyclotricos-21-yl]amino]butan-2-yl]amino]-3-hydroxy-1-oxobutan-2-yl]amino]-1-oxobutan-2-yl]-6-methyloctanamide Polymers CC(C)CCCCC(=O)N[C@@H](CCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCN)C(=O)N[C@H]1CCNC(=O)[C@@H](NC(=O)[C@H](CCN)NC(=O)[C@H](CCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](Cc2ccccc2)NC(=O)[C@@H](CCN)NC1=O)[C@@H](C)O.CC[C@H](C)CCCCC(=O)N[C@@H](CCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCN)C(=O)N[C@H]1CCNC(=O)[C@@H](NC(=O)[C@H](CCN)NC(=O)[C@H](CCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](Cc2ccccc2)NC(=O)[C@@H](CCN)NC1=O)[C@@H](C)O ZBJNZFQKYZCUJU-PAHFEQBRSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010021717 Nafarelin Proteins 0.000 description 1
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 1
- 108010016076 Octreotide Proteins 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 241000289371 Ornithorhynchus anatinus Species 0.000 description 1
- 108010077077 Osteonectin Proteins 0.000 description 1
- 102000009890 Osteonectin Human genes 0.000 description 1
- 240000007019 Oxalis corniculata Species 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 102100024019 Pancreatic secretory granule membrane major glycoprotein GP2 Human genes 0.000 description 1
- 108010019160 Pancreatin Proteins 0.000 description 1
- 108090000526 Papain Proteins 0.000 description 1
- 241000287127 Passeridae Species 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 102100027913 Peptidyl-prolyl cis-trans isomerase FKBP1A Human genes 0.000 description 1
- 241000288049 Perdix perdix Species 0.000 description 1
- 239000000474 Pituitary Hormone-Releasing Hormone Substances 0.000 description 1
- 108010031037 Pituitary Hormone-Releasing Hormones Proteins 0.000 description 1
- 102000005726 Pituitary Hormone-Releasing Hormones Human genes 0.000 description 1
- 108090000113 Plasma Kallikrein Proteins 0.000 description 1
- 108010001014 Plasminogen Activators Proteins 0.000 description 1
- 102000001938 Plasminogen Activators Human genes 0.000 description 1
- 108010038512 Platelet-Derived Growth Factor Proteins 0.000 description 1
- 102000010780 Platelet-Derived Growth Factor Human genes 0.000 description 1
- 229940124867 Poliovirus vaccine Drugs 0.000 description 1
- 102100037935 Polyubiquitin-C Human genes 0.000 description 1
- 108010070873 Posterior Pituitary Hormones Proteins 0.000 description 1
- 102000005320 Posterior Pituitary Hormones Human genes 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- 102000007327 Protamines Human genes 0.000 description 1
- 108010007568 Protamines Proteins 0.000 description 1
- 101800001491 Protease 3C Proteins 0.000 description 1
- 229940096437 Protein S Drugs 0.000 description 1
- 108010066124 Protein S Proteins 0.000 description 1
- 102000029301 Protein S Human genes 0.000 description 1
- 102100038103 Protein-glutamine gamma-glutamyltransferase 4 Human genes 0.000 description 1
- 102100027378 Prothrombin Human genes 0.000 description 1
- 241000125945 Protoparvovirus Species 0.000 description 1
- 241000289388 Prototheria Species 0.000 description 1
- 108010041520 Pulmonary Surfactant-Associated Proteins Proteins 0.000 description 1
- 102000000528 Pulmonary Surfactant-Associated Proteins Human genes 0.000 description 1
- 241000282941 Rangifer tarandus Species 0.000 description 1
- 101100029566 Rattus norvegicus Rabggta gene Proteins 0.000 description 1
- 101100431670 Rattus norvegicus Ybx3 gene Proteins 0.000 description 1
- 102000003743 Relaxin Human genes 0.000 description 1
- 108090000103 Relaxin Proteins 0.000 description 1
- 241000271569 Rhea Species 0.000 description 1
- 229940124859 Rotavirus vaccine Drugs 0.000 description 1
- 101000733770 Schizosaccharomyces pombe (strain 972 / ATCC 24843) Aminopeptidase 1 Proteins 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- 241000287219 Serinus canaria Species 0.000 description 1
- 241000700584 Simplexvirus Species 0.000 description 1
- 108010061312 Sphingomyelin Phosphodiesterase Proteins 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 229940124858 Streptococcus pneumoniae vaccine Drugs 0.000 description 1
- 101710172711 Structural protein Proteins 0.000 description 1
- 241000272534 Struthio camelus Species 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 102000019197 Superoxide Dismutase Human genes 0.000 description 1
- 108010012715 Superoxide dismutase Proteins 0.000 description 1
- 108700012411 TNFSF10 Proteins 0.000 description 1
- 108010006877 Tacrolimus Binding Protein 1A Proteins 0.000 description 1
- 241000270666 Testudines Species 0.000 description 1
- TZQWJCGVCIJDMU-HEIBUPTGSA-N Thr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N)O TZQWJCGVCIJDMU-HEIBUPTGSA-N 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- 102100026966 Thrombomodulin Human genes 0.000 description 1
- 108010079274 Thrombomodulin Proteins 0.000 description 1
- 108060008245 Thrombospondin Proteins 0.000 description 1
- 102000002938 Thrombospondin Human genes 0.000 description 1
- AUYYCJSJGJYCDS-LBPRGKRZSA-N Thyrolar Chemical class IC1=CC(C[C@H](N)C(O)=O)=CC(I)=C1OC1=CC=C(O)C(I)=C1 AUYYCJSJGJYCDS-LBPRGKRZSA-N 0.000 description 1
- 108010023603 Transcobalamins Proteins 0.000 description 1
- 102000011409 Transcobalamins Human genes 0.000 description 1
- 102100031013 Transgelin Human genes 0.000 description 1
- 102000004243 Tubulin Human genes 0.000 description 1
- 108090000704 Tubulin Proteins 0.000 description 1
- 206010054094 Tumour necrosis Diseases 0.000 description 1
- 241000287436 Turdus merula Species 0.000 description 1
- 108010021006 Tyrothricin Proteins 0.000 description 1
- 108010056354 Ubiquitin C Proteins 0.000 description 1
- 108090000435 Urokinase-type plasminogen activator Proteins 0.000 description 1
- 102000003990 Urokinase-type plasminogen activator Human genes 0.000 description 1
- 102100040613 Uromodulin Human genes 0.000 description 1
- 108010027007 Uromodulin Proteins 0.000 description 1
- 108010059993 Vancomycin Proteins 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- MECHNRXZTMCUDQ-UHFFFAOYSA-N Vitamin D2 Natural products C1CCC2(C)C(C(C)C=CC(C)C(C)C)CCC2C1=CC=C1CC(O)CCC1=C MECHNRXZTMCUDQ-UHFFFAOYSA-N 0.000 description 1
- MCRWZBYTLVCCJJ-DKALBXGISA-N [(1s,3r)-3-[[(3s,4s)-3-methoxyoxan-4-yl]amino]-1-propan-2-ylcyclopentyl]-[(1s,4s)-5-[6-(trifluoromethyl)pyrimidin-4-yl]-2,5-diazabicyclo[2.2.1]heptan-2-yl]methanone Chemical compound C([C@]1(N(C[C@]2([H])C1)C(=O)[C@@]1(C[C@@H](CC1)N[C@@H]1[C@@H](COCC1)OC)C(C)C)[H])N2C1=CC(C(F)(F)F)=NC=N1 MCRWZBYTLVCCJJ-DKALBXGISA-N 0.000 description 1
- 239000008351 acetate buffer Substances 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 102000010126 acid sphingomyelin phosphodiesterase activity proteins Human genes 0.000 description 1
- 230000001154 acute effect Effects 0.000 description 1
- 125000002252 acyl group Chemical group 0.000 description 1
- 239000003329 adenohypophysis hormone Substances 0.000 description 1
- 230000001919 adrenal effect Effects 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- VFRROHXSMXFLSN-KCDKBNATSA-N aldehydo-D-galactose 6-phosphate Chemical compound OP(=O)(O)OC[C@@H](O)[C@H](O)[C@H](O)[C@@H](O)C=O VFRROHXSMXFLSN-KCDKBNATSA-N 0.000 description 1
- 229960002478 aldosterone Drugs 0.000 description 1
- 125000003545 alkoxy group Chemical group 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- UPEZCKBFRMILAV-UHFFFAOYSA-N alpha-Ecdysone Natural products C1C(O)C(O)CC2(C)C(CCC3(C(C(C(O)CCC(C)(C)O)C)CCC33O)C)C3=CC(=O)C21 UPEZCKBFRMILAV-UHFFFAOYSA-N 0.000 description 1
- 125000006242 amine protecting group Chemical group 0.000 description 1
- 150000001412 amines Chemical class 0.000 description 1
- PBSXKCQOTWYLMQ-LWECRCKRSA-N anaritide Chemical compound C([C@@H](C(=O)NCC(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCNC(N)=N)C1=CC=CC=C1 PBSXKCQOTWYLMQ-LWECRCKRSA-N 0.000 description 1
- 108010005565 anaritide Proteins 0.000 description 1
- 229950004772 anaritide Drugs 0.000 description 1
- 239000003098 androgen Substances 0.000 description 1
- 229940030486 androgens Drugs 0.000 description 1
- 238000005571 anion exchange chromatography Methods 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000003092 anti-cytokine Effects 0.000 description 1
- 230000003172 anti-dna Effects 0.000 description 1
- 230000000843 anti-fungal effect Effects 0.000 description 1
- 230000000603 anti-haemophilic effect Effects 0.000 description 1
- 230000003388 anti-hormonal effect Effects 0.000 description 1
- 108010018823 anti-inhibitor coagulant complex Proteins 0.000 description 1
- 229940070435 anti-inhibitor coagulant complex Drugs 0.000 description 1
- 230000002141 anti-parasite Effects 0.000 description 1
- 230000002788 anti-peptide Effects 0.000 description 1
- 230000000702 anti-platelet effect Effects 0.000 description 1
- 230000000842 anti-protozoal effect Effects 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 230000000259 anti-tumor effect Effects 0.000 description 1
- 238000011091 antibody purification Methods 0.000 description 1
- 239000003146 anticoagulant agent Substances 0.000 description 1
- 108091000831 antigen binding proteins Proteins 0.000 description 1
- 239000003096 antiparasitic agent Substances 0.000 description 1
- 239000003904 antiprotozoal agent Substances 0.000 description 1
- 229960005348 antithrombin iii Drugs 0.000 description 1
- 229960004405 aprotinin Drugs 0.000 description 1
- 125000005161 aryl oxy carbonyl group Chemical group 0.000 description 1
- 150000001508 asparagines Chemical class 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 229940092117 atgam Drugs 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 229960003071 bacitracin Drugs 0.000 description 1
- 229930184125 bacitracin Natural products 0.000 description 1
- CLKOFPXJLQSYAH-ABRJDSQDSA-N bacitracin A Chemical compound C1SC([C@@H](N)[C@@H](C)CC)=N[C@@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]1C(=O)N[C@H](CCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](CC=2C=CC=CC=2)C(=O)N[C@@H](CC=2N=CNC=2)C(=O)N[C@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCCCC1 CLKOFPXJLQSYAH-ABRJDSQDSA-N 0.000 description 1
- 229960004669 basiliximab Drugs 0.000 description 1
- 210000000227 basophil cell of anterior lobe of hypophysis Anatomy 0.000 description 1
- 210000003323 beak Anatomy 0.000 description 1
- XMQFTWRPUQYINF-UHFFFAOYSA-N bensulfuron-methyl Chemical compound COC(=O)C1=CC=CC=C1CS(=O)(=O)NC(=O)NC1=NC(OC)=CC(OC)=N1 XMQFTWRPUQYINF-UHFFFAOYSA-N 0.000 description 1
- FFBHFFJDDLITSX-UHFFFAOYSA-N benzyl N-[2-hydroxy-4-(3-oxomorpholin-4-yl)phenyl]carbamate Chemical compound OC1=C(NC(=O)OCC2=CC=CC=C2)C=CC(=C1)N1CCOCC1=O FFBHFFJDDLITSX-UHFFFAOYSA-N 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 108010020169 beta-microseminoprotein Proteins 0.000 description 1
- 108010015799 bilirubin glucuronoside glucuronosyltransferase Proteins 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 102000043871 biotin binding protein Human genes 0.000 description 1
- 108700021042 biotin binding protein Proteins 0.000 description 1
- 230000000903 blocking effect Effects 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 230000008468 bone growth Effects 0.000 description 1
- 210000002805 bone matrix Anatomy 0.000 description 1
- 229940112869 bone morphogenetic protein Drugs 0.000 description 1
- 244000309464 bull Species 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 150000001720 carbohydrates Chemical class 0.000 description 1
- 235000014633 carbohydrates Nutrition 0.000 description 1
- 150000001732 carboxylic acid derivatives Chemical class 0.000 description 1
- 125000002843 carboxylic acid group Chemical group 0.000 description 1
- 125000006244 carboxylic acid protecting group Chemical group 0.000 description 1
- 239000000969 carrier Substances 0.000 description 1
- 210000000845 cartilage Anatomy 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000036755 cellular response Effects 0.000 description 1
- 229940106157 cellulase Drugs 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 230000009920 chelation Effects 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 229960003260 chlorhexidine Drugs 0.000 description 1
- 229940015047 chorionic gonadotropin Drugs 0.000 description 1
- 238000013375 chromatographic separation Methods 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 239000012539 chromatography resin Substances 0.000 description 1
- 229960001265 ciclosporin Drugs 0.000 description 1
- 239000011248 coating agent Substances 0.000 description 1
- 238000000576 coating method Methods 0.000 description 1
- AGVAZMGAQJOSFJ-WZHZPDAFSA-M cobalt(2+);[(2r,3s,4r,5s)-5-(5,6-dimethylbenzimidazol-1-yl)-4-hydroxy-2-(hydroxymethyl)oxolan-3-yl] [(2r)-1-[3-[(1r,2r,3r,4z,7s,9z,12s,13s,14z,17s,18s,19r)-2,13,18-tris(2-amino-2-oxoethyl)-7,12,17-tris(3-amino-3-oxopropyl)-3,5,8,8,13,15,18,19-octamethyl-2 Chemical compound [Co+2].N#[C-].[N-]([C@@H]1[C@H](CC(N)=O)[C@@]2(C)CCC(=O)NC[C@@H](C)OP(O)(=O)O[C@H]3[C@H]([C@H](O[C@@H]3CO)N3C4=CC(C)=C(C)C=C4N=C3)O)\C2=C(C)/C([C@H](C\2(C)C)CCC(N)=O)=N/C/2=C\C([C@H]([C@@]/2(CC(N)=O)C)CCC(N)=O)=N\C\2=C(C)/C2=N[C@]1(C)[C@@](C)(CC(N)=O)[C@@H]2CCC(N)=O AGVAZMGAQJOSFJ-WZHZPDAFSA-M 0.000 description 1
- 229960003346 colistin Drugs 0.000 description 1
- 229920001436 collagen Polymers 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 238000006482 condensation reaction Methods 0.000 description 1
- 230000006552 constitutive activation Effects 0.000 description 1
- 230000001054 cortical effect Effects 0.000 description 1
- 239000003246 corticosteroid Substances 0.000 description 1
- 229960001334 corticosteroids Drugs 0.000 description 1
- 239000006071 cream Substances 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 1
- 125000000113 cyclohexyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])(*)C([H])([H])C1([H])[H] 0.000 description 1
- 229930182912 cyclosporin Natural products 0.000 description 1
- 108050004038 cystatin Proteins 0.000 description 1
- 230000009089 cytolysis Effects 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000034994 death Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 229960004281 desmopressin Drugs 0.000 description 1
- NFLWUMRGJYTJIN-NXBWRCJVSA-N desmopressin Chemical compound C([C@H]1C(=O)N[C@H](C(N[C@@H](CC(N)=O)C(=O)N[C@@H](CSSCCC(=O)N[C@@H](CC=2C=CC(O)=CC=2)C(=O)N1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(N)=O)=O)CCC(=O)N)C1=CC=CC=C1 NFLWUMRGJYTJIN-NXBWRCJVSA-N 0.000 description 1
- 239000000032 diagnostic agent Substances 0.000 description 1
- DHQUQYYPAWHGAR-UHFFFAOYSA-N dibenzyl 2-aminopentanedioate Chemical compound C=1C=CC=CC=1COC(=O)C(N)CCC(=O)OCC1=CC=CC=C1 DHQUQYYPAWHGAR-UHFFFAOYSA-N 0.000 description 1
- 235000005911 diet Nutrition 0.000 description 1
- 230000000378 dietary effect Effects 0.000 description 1
- ZBCBWPMODOFKDW-UHFFFAOYSA-N diethanolamine Chemical compound OCCNCCO ZBCBWPMODOFKDW-UHFFFAOYSA-N 0.000 description 1
- 238000007865 diluting Methods 0.000 description 1
- 229960001188 diphtheria antitoxin Drugs 0.000 description 1
- 238000010494 dissociation reaction Methods 0.000 description 1
- 230000005593 dissociations Effects 0.000 description 1
- YEJSPQZHMWGIGP-UHFFFAOYSA-N dl-glutamic acid dimethyl ester Natural products COC(=O)CCC(N)C(=O)OC YEJSPQZHMWGIGP-UHFFFAOYSA-N 0.000 description 1
- 229960003638 dopamine Drugs 0.000 description 1
- 230000036267 drug metabolism Effects 0.000 description 1
- UPEZCKBFRMILAV-JMZLNJERSA-N ecdysone Chemical compound C1[C@@H](O)[C@@H](O)C[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@@H]([C@H](O)CCC(C)(C)O)C)CC[C@]33O)C)C3=CC(=O)[C@@H]21 UPEZCKBFRMILAV-JMZLNJERSA-N 0.000 description 1
- 229920002549 elastin Polymers 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 210000002257 embryonic structure Anatomy 0.000 description 1
- 210000000750 endocrine system Anatomy 0.000 description 1
- 229940066758 endopeptidases Drugs 0.000 description 1
- 210000003989 endothelium vascular Anatomy 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 229960005139 epinephrine Drugs 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- CZKPOZZJODAYPZ-LROMGURASA-N eptifibatide Chemical compound N1C(=O)[C@H](CC(O)=O)NC(=O)CNC(=O)[C@H](CCCCNC(=N)N)NC(=O)CCSSC[C@@H](C(N)=O)NC(=O)[C@@H]2CCCN2C(=O)[C@@H]1CC1=CNC2=CC=CC=C12 CZKPOZZJODAYPZ-LROMGURASA-N 0.000 description 1
- 229960002061 ergocalciferol Drugs 0.000 description 1
- 210000003743 erythrocyte Anatomy 0.000 description 1
- 229950002007 estradiol benzoate Drugs 0.000 description 1
- 230000007717 exclusion Effects 0.000 description 1
- 229940012444 factor xiii Drugs 0.000 description 1
- 210000003746 feather Anatomy 0.000 description 1
- 230000006408 female gonad development Effects 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 229950003499 fibrin Drugs 0.000 description 1
- 229940001501 fibrinolysin Drugs 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 238000002376 fluorescence recovery after photobleaching Methods 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- BTCSSZJGUNDROE-UHFFFAOYSA-N gamma-aminobutyric acid Chemical compound NCCCC(O)=O BTCSSZJGUNDROE-UHFFFAOYSA-N 0.000 description 1
- 229940044627 gamma-interferon Drugs 0.000 description 1
- 210000005095 gastrointestinal system Anatomy 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000001641 gel filtration chromatography Methods 0.000 description 1
- 238000003197 gene knockdown Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 230000002518 glial effect Effects 0.000 description 1
- 239000003862 glucocorticoid Substances 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 150000004676 glycans Chemical class 0.000 description 1
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 229960001442 gonadorelin Drugs 0.000 description 1
- 229960002913 goserelin Drugs 0.000 description 1
- 229960004905 gramicidin Drugs 0.000 description 1
- ZWCXYZRRTRDGQE-SORVKSEFSA-N gramicidina Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](CC(C)C)NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)NC(=O)[C@@H](CC(C)C)NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)NC(=O)[C@@H](CC(C)C)NC(=O)[C@H](CC=3C4=CC=CC=C4NC=3)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](C(C)C)NC(=O)[C@H](C)NC(=O)[C@H](NC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](NC=O)C(C)C)CC(C)C)C(=O)NCCO)=CNC2=C1 ZWCXYZRRTRDGQE-SORVKSEFSA-N 0.000 description 1
- 229960001036 haemophilus b conjugate vaccine Drugs 0.000 description 1
- 229910052736 halogen Inorganic materials 0.000 description 1
- 150000002367 halogens Chemical class 0.000 description 1
- 208000006454 hepatitis Diseases 0.000 description 1
- 208000002672 hepatitis B Diseases 0.000 description 1
- 229940036107 hepatitis b immunoglobulin Drugs 0.000 description 1
- 239000013628 high molecular weight specie Substances 0.000 description 1
- 108700020746 histrelin Proteins 0.000 description 1
- 229960002193 histrelin Drugs 0.000 description 1
- HHXHVIJIIXKSOE-QILQGKCVSA-N histrelin Chemical compound CCNC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC=1N=CNC=1)NC(=O)[C@H]1NC(=O)CC1)CC(N=C1)=CN1CC1=CC=CC=C1 HHXHVIJIIXKSOE-QILQGKCVSA-N 0.000 description 1
- 239000012510 hollow fiber Substances 0.000 description 1
- 230000003054 hormonal effect Effects 0.000 description 1
- 102000046621 human FAT1 Human genes 0.000 description 1
- 229960002773 hyaluronidase Drugs 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 229960000890 hydrocortisone Drugs 0.000 description 1
- 230000007062 hydrolysis Effects 0.000 description 1
- 238000006460 hydrolysis reaction Methods 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 230000003345 hyperglycaemic effect Effects 0.000 description 1
- 201000001421 hyperglycemia Diseases 0.000 description 1
- 230000002218 hypoglycaemic effect Effects 0.000 description 1
- 229940043650 hypothalamic hormone Drugs 0.000 description 1
- 239000000601 hypothalamic hormone Substances 0.000 description 1
- 125000001841 imino group Chemical group [H]N=* 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 239000000367 immunologic factor Substances 0.000 description 1
- 230000001506 immunosuppresive effect Effects 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 238000007850 in situ PCR Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 230000004941 influx Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000011081 inoculation Methods 0.000 description 1
- 239000012212 insulator Substances 0.000 description 1
- 229940056984 integrilin Drugs 0.000 description 1
- 238000001361 intraarterial administration Methods 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000007913 intrathecal administration Methods 0.000 description 1
- 210000004153 islets of langerhan Anatomy 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 229940116108 lactase Drugs 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- GFIJNRVAKGFPGQ-LIJARHBVSA-N leuprolide Chemical compound CCNC(=O)[C@@H]1CCCN1C(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](CC(C)C)NC(=O)[C@@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC=1N=CNC=1)NC(=O)[C@H]1NC(=O)CC1)CC1=CC=C(O)C=C1 GFIJNRVAKGFPGQ-LIJARHBVSA-N 0.000 description 1
- 229960004338 leuprorelin Drugs 0.000 description 1
- 125000005647 linker group Chemical group 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 150000002634 lipophilic molecules Chemical class 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 210000005229 liver cell Anatomy 0.000 description 1
- 230000033001 locomotion Effects 0.000 description 1
- 239000003055 low molecular weight heparin Substances 0.000 description 1
- 239000013627 low molecular weight specie Substances 0.000 description 1
- 229940127215 low-molecular weight heparin Drugs 0.000 description 1
- 229940042470 lyme disease vaccine Drugs 0.000 description 1
- 210000001165 lymph node Anatomy 0.000 description 1
- 230000002535 lyotropic effect Effects 0.000 description 1
- 229960003837 lypressin Drugs 0.000 description 1
- 230000002101 lytic effect Effects 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 230000035800 maturation Effects 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 239000002609 medium Substances 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- SEWIYICDCVPBEW-UHFFFAOYSA-N methyl glutamate Chemical compound COC(=O)C(N)CCC(O)=O SEWIYICDCVPBEW-UHFFFAOYSA-N 0.000 description 1
- 125000000325 methylidene group Chemical group [H]C([H])=* 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 208000012268 mitochondrial disease Diseases 0.000 description 1
- 108010032806 molgramostim Proteins 0.000 description 1
- 229960003063 molgramostim Drugs 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- JORAUNFTUVJTNG-BSTBCYLQSA-N n-[(2s)-4-amino-1-[[(2s,3r)-1-[[(2s)-4-amino-1-oxo-1-[[(3s,6s,9s,12s,15r,18s,21s)-6,9,18-tris(2-aminoethyl)-3-[(1r)-1-hydroxyethyl]-12,15-bis(2-methylpropyl)-2,5,8,11,14,17,20-heptaoxo-1,4,7,10,13,16,19-heptazacyclotricos-21-yl]amino]butan-2-yl]amino]-3-h Chemical compound CC(C)CCCCC(=O)N[C@@H](CCN)C(=O)N[C@H]([C@@H](C)O)CN[C@@H](CCN)C(=O)N[C@H]1CCNC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCN)NC(=O)[C@H](CCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](CC(C)C)NC(=O)[C@H](CCN)NC1=O.CCC(C)CCCCC(=O)N[C@@H](CCN)C(=O)N[C@H]([C@@H](C)O)CN[C@@H](CCN)C(=O)N[C@H]1CCNC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CCN)NC(=O)[C@H](CCN)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](CC(C)C)NC(=O)[C@H](CCN)NC1=O JORAUNFTUVJTNG-BSTBCYLQSA-N 0.000 description 1
- VOVZXURTCKPRDQ-CQSZACIVSA-N n-[4-[chloro(difluoro)methoxy]phenyl]-6-[(3r)-3-hydroxypyrrolidin-1-yl]-5-(1h-pyrazol-5-yl)pyridine-3-carboxamide Chemical compound C1[C@H](O)CCN1C1=NC=C(C(=O)NC=2C=CC(OC(F)(F)Cl)=CC=2)C=C1C1=CC=NN1 VOVZXURTCKPRDQ-CQSZACIVSA-N 0.000 description 1
- 125000004123 n-propyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 229960002333 nafarelin Drugs 0.000 description 1
- RWHUEXWOYVBUCI-ITQXDASVSA-N nafarelin Chemical compound C([C@@H](C(=O)N[C@H](CC=1C=C2C=CC=CC2=CC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(=O)NCC(N)=O)NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1NC(=O)CC1)C1=CC=C(O)C=C1 RWHUEXWOYVBUCI-ITQXDASVSA-N 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 239000000765 neuroimmunophilin Substances 0.000 description 1
- 230000000508 neurotrophic effect Effects 0.000 description 1
- 125000000449 nitro group Chemical group [O-][N+](*)=O 0.000 description 1
- 231100001160 nonlethal Toxicity 0.000 description 1
- 230000009871 nonspecific binding Effects 0.000 description 1
- 229960002700 octreotide Drugs 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- VHFGEBVPHAGQPI-MYYQHNLBSA-N oritavancin Chemical compound O([C@@H]1C2=CC=C(C(=C2)Cl)OC=2C=C3C=C(C=2O[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O[C@@H]2O[C@@H](C)[C@H](O)[C@@](C)(NCC=4C=CC(=CC=4)C=4C=CC(Cl)=CC=4)C2)OC2=CC=C(C=C2Cl)[C@@H](O)[C@H](C(N[C@@H](CC(N)=O)C(=O)N[C@H]3C(=O)N[C@H]2C(=O)N[C@@H]1C(N[C@H](C1=CC(O)=CC(O)=C1C=1C(O)=CC=C2C=1)C(O)=O)=O)=O)NC(=O)[C@@H](CC(C)C)NC)[C@H]1C[C@](C)(N)[C@@H](O)[C@H](C)O1 VHFGEBVPHAGQPI-MYYQHNLBSA-N 0.000 description 1
- 230000003204 osmotic effect Effects 0.000 description 1
- 210000002394 ovarian follicle Anatomy 0.000 description 1
- 230000016087 ovulation Effects 0.000 description 1
- 229940055695 pancreatin Drugs 0.000 description 1
- 229940055729 papain Drugs 0.000 description 1
- 235000019834 papain Nutrition 0.000 description 1
- 230000037361 pathway Effects 0.000 description 1
- 229940023041 peptide vaccine Drugs 0.000 description 1
- 230000000737 periodic effect Effects 0.000 description 1
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 229940127126 plasminogen activator Drugs 0.000 description 1
- 229940031999 pneumococcal conjugate vaccine Drugs 0.000 description 1
- 229920000768 polyamine Polymers 0.000 description 1
- XDJYMJULXQKGMM-UHFFFAOYSA-N polymyxin E1 Natural products CCC(C)CCCCC(=O)NC(CCN)C(=O)NC(C(C)O)C(=O)NC(CCN)C(=O)NC1CCNC(=O)C(C(C)O)NC(=O)C(CCN)NC(=O)C(CCN)NC(=O)C(CC(C)C)NC(=O)C(CC(C)C)NC(=O)C(CCN)NC1=O XDJYMJULXQKGMM-UHFFFAOYSA-N 0.000 description 1
- KNIWPHSUTGNZST-UHFFFAOYSA-N polymyxin E2 Natural products CC(C)CCCCC(=O)NC(CCN)C(=O)NC(C(C)O)C(=O)NC(CCN)C(=O)NC1CCNC(=O)C(C(C)O)NC(=O)C(CCN)NC(=O)C(CCN)NC(=O)C(CC(C)C)NC(=O)C(CC(C)C)NC(=O)C(CCN)NC1=O KNIWPHSUTGNZST-UHFFFAOYSA-N 0.000 description 1
- 239000003910 polypeptide antibiotic agent Substances 0.000 description 1
- 229920001282 polysaccharide Polymers 0.000 description 1
- 239000005017 polysaccharide Substances 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 230000029279 positive regulation of transcription, DNA-dependent Effects 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 238000011085 pressure filtration Methods 0.000 description 1
- 230000000135 prohibitive effect Effects 0.000 description 1
- 229940048914 protamine Drugs 0.000 description 1
- 230000006920 protein precipitation Effects 0.000 description 1
- 230000007026 protein scission Effects 0.000 description 1
- 230000018883 protein targeting Effects 0.000 description 1
- 229940039716 prothrombin Drugs 0.000 description 1
- 230000010346 psychosocial stress Effects 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- ZAHRKKWIAAJSAO-UHFFFAOYSA-N rapamycin Natural products COCC(O)C(=C/C(C)C(=O)CC(OC(=O)C1CCCCN1C(=O)C(=O)C2(O)OC(CC(OC)C(=CC=CC=CC(C)CC(C)C(=O)C)C)CCC2C)C(C)CC3CCC(O)C(C3)OC)C ZAHRKKWIAAJSAO-UHFFFAOYSA-N 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 230000000384 rearing effect Effects 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 238000004153 renaturation Methods 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 230000001718 repressive effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000000284 resting effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 1
- 108010053455 riboflavin-binding protein Proteins 0.000 description 1
- 108091092562 ribozyme Proteins 0.000 description 1
- 238000009394 selective breeding Methods 0.000 description 1
- 230000036301 sexual development Effects 0.000 description 1
- 229960002930 sirolimus Drugs 0.000 description 1
- QFJCIRLUMZQUOT-HPLJOQBZSA-N sirolimus Chemical compound C1C[C@@H](O)[C@H](OC)C[C@@H]1C[C@@H](C)[C@H]1OC(=O)[C@@H]2CCCCN2C(=O)C(=O)[C@](O)(O2)[C@H](C)CC[C@H]2C[C@H](OC)/C(C)=C/C=C/C=C/[C@@H](C)C[C@@H](C)C(=O)[C@H](OC)[C@H](O)/C(C)=C/[C@@H](C)C(=O)C1 QFJCIRLUMZQUOT-HPLJOQBZSA-N 0.000 description 1
- 230000009645 skeletal growth Effects 0.000 description 1
- 210000003491 skin Anatomy 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 210000002460 smooth muscle Anatomy 0.000 description 1
- 229910000030 sodium bicarbonate Inorganic materials 0.000 description 1
- 235000017557 sodium bicarbonate Nutrition 0.000 description 1
- 229910000029 sodium carbonate Inorganic materials 0.000 description 1
- 210000001082 somatic cell Anatomy 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 238000002798 spectrophotometry method Methods 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 208000010110 spontaneous platelet aggregation Diseases 0.000 description 1
- 239000012192 staining solution Substances 0.000 description 1
- 210000000130 stem cell Anatomy 0.000 description 1
- 239000003270 steroid hormone Substances 0.000 description 1
- 238000003756 stirring Methods 0.000 description 1
- 210000002536 stromal cell Anatomy 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 230000001360 synchronised effect Effects 0.000 description 1
- 229940037128 systemic glucocorticoids Drugs 0.000 description 1
- 108010009889 telokin Proteins 0.000 description 1
- 210000001550 testis Anatomy 0.000 description 1
- 229940036116 tetanus immunoglobulin Drugs 0.000 description 1
- 229960000814 tetanus toxoid Drugs 0.000 description 1
- 229940124597 therapeutic agent Drugs 0.000 description 1
- 108010065722 thiamine binding protein Proteins 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 210000001541 thymus gland Anatomy 0.000 description 1
- 239000005495 thyroid hormone Substances 0.000 description 1
- 229940036555 thyroid hormone Drugs 0.000 description 1
- 229960000874 thyrotropin Drugs 0.000 description 1
- 230000001748 thyrotropin Effects 0.000 description 1
- 231100000167 toxic agent Toxicity 0.000 description 1
- 239000003440 toxic substance Substances 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 230000032258 transport Effects 0.000 description 1
- 229960001322 trypsin Drugs 0.000 description 1
- 239000000439 tumor marker Substances 0.000 description 1
- 229960003281 tyrothricin Drugs 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 208000019206 urinary tract infection Diseases 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 229960005356 urokinase Drugs 0.000 description 1
- MYPYJXKWCTUITO-LYRMYLQWSA-N vancomycin Chemical compound O([C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1OC1=C2C=C3C=C1OC1=CC=C(C=C1Cl)[C@@H](O)[C@H](C(N[C@@H](CC(N)=O)C(=O)N[C@H]3C(=O)N[C@H]1C(=O)N[C@H](C(N[C@@H](C3=CC(O)=CC(O)=C3C=3C(O)=CC=C1C=3)C(O)=O)=O)[C@H](O)C1=CC=C(C(=C1)Cl)O2)=O)NC(=O)[C@@H](CC(C)C)NC)[C@H]1C[C@](C)(N)[C@H](O)[C@H](C)O1 MYPYJXKWCTUITO-LYRMYLQWSA-N 0.000 description 1
- 229960003165 vancomycin Drugs 0.000 description 1
- MYPYJXKWCTUITO-UHFFFAOYSA-N vancomycin Natural products O1C(C(=C2)Cl)=CC=C2C(O)C(C(NC(C2=CC(O)=CC(O)=C2C=2C(O)=CC=C3C=2)C(O)=O)=O)NC(=O)C3NC(=O)C2NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(CC(C)C)NC)C(O)C(C=C3Cl)=CC=C3OC3=CC2=CC1=C3OC1OC(CO)C(O)C(O)C1OC1CC(C)(N)C(O)C(C)O1 MYPYJXKWCTUITO-UHFFFAOYSA-N 0.000 description 1
- 229940021648 varicella vaccine Drugs 0.000 description 1
- MECHNRXZTMCUDQ-RKHKHRCZSA-N vitamin D2 Chemical compound C1(/[C@@H]2CC[C@@H]([C@]2(CCC1)C)[C@H](C)/C=C/[C@H](C)C(C)C)=C\C=C1\C[C@@H](O)CCC1=C MECHNRXZTMCUDQ-RKHKHRCZSA-N 0.000 description 1
- 235000001892 vitamin D2 Nutrition 0.000 description 1
- 239000011653 vitamin D2 Substances 0.000 description 1
- 108010047303 von Willebrand Factor Proteins 0.000 description 1
- 102100036537 von Willebrand factor Human genes 0.000 description 1
- 229960001134 von willebrand factor Drugs 0.000 description 1
- KMIOJWCYOHBUJS-HAKPAVFJSA-N vorolanib Chemical compound C1N(C(=O)N(C)C)CC[C@@H]1NC(=O)C1=C(C)NC(\C=C/2C3=CC(F)=CC=C3NC\2=O)=C1C KMIOJWCYOHBUJS-HAKPAVFJSA-N 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/62—Insulins
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K67/00—Rearing or breeding animals, not otherwise provided for; New breeds of animals
- A01K67/027—New breeds of vertebrates
- A01K67/0275—Genetically modified vertebrates, e.g. transgenic
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23L—FOODS, FOODSTUFFS, OR NON-ALCOHOLIC BEVERAGES, NOT COVERED BY SUBCLASSES A21D OR A23B-A23J; THEIR PREPARATION OR TREATMENT, e.g. COOKING, MODIFICATION OF NUTRITIVE QUALITIES, PHYSICAL TREATMENT; PRESERVATION OF FOODS OR FOODSTUFFS, IN GENERAL
- A23L15/00—Egg products; Preparation or treatment thereof
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/02—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies from eggs
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/08—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses
- C07K16/10—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from viruses from RNA viruses
- C07K16/1036—Retroviridae, e.g. leukemia viruses
- C07K16/1045—Lentiviridae, e.g. HIV, FIV, SIV
- C07K16/1063—Lentiviridae, e.g. HIV, FIV, SIV env, e.g. gp41, gp110/120, gp160, V3, PND, CD4 binding site
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/18—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
- C07K16/26—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against hormones ; against hormone releasing or inhibiting factors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/85—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells
- C12N15/8509—Vectors or expression systems specially adapted for eukaryotic hosts for animal cells for producing genetically modified animals, e.g. transgenic
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2217/00—Genetically modified animals
- A01K2217/05—Animals comprising random inserted nucleic acids (transgenic)
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2227/00—Animals characterised by species
- A01K2227/10—Mammal
- A01K2227/105—Murine
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2227/00—Animals characterised by species
- A01K2227/30—Bird
-
- A—HUMAN NECESSITIES
- A01—AGRICULTURE; FORESTRY; ANIMAL HUSBANDRY; HUNTING; TRAPPING; FISHING
- A01K—ANIMAL HUSBANDRY; CARE OF BIRDS, FISHES, INSECTS; FISHING; REARING OR BREEDING ANIMALS, NOT OTHERWISE PROVIDED FOR; NEW BREEDS OF ANIMALS
- A01K2267/00—Animals characterised by purpose
- A01K2267/01—Animal expressing industrially exogenous proteins
-
- A—HUMAN NECESSITIES
- A23—FOODS OR FOODSTUFFS; TREATMENT THEREOF, NOT COVERED BY OTHER CLASSES
- A23V—INDEXING SCHEME RELATING TO FOODS, FOODSTUFFS OR NON-ALCOHOLIC BEVERAGES AND LACTIC OR PROPIONIC ACID BACTERIA USED IN FOODSTUFFS OR FOOD PREPARATION
- A23V2002/00—Food compositions, function of food ingredients or processes for food or foodstuffs
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/505—Medicinal preparations containing antigens or antibodies comprising antibodies
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2317/00—Immunoglobulins specific features
- C07K2317/20—Immunoglobulins specific features characterized by taxonomic origin
- C07K2317/23—Immunoglobulins specific features characterized by taxonomic origin from birds
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2800/00—Nucleic acids vectors
- C12N2800/90—Vectors containing a transposable element
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/001—Vector systems having a special element relevant for transcription controllable enhancer/promoter combination
- C12N2830/002—Vector systems having a special element relevant for transcription controllable enhancer/promoter combination inducible enhancer/promoter combination, e.g. hypoxia, iron, transcription factor
- C12N2830/003—Vector systems having a special element relevant for transcription controllable enhancer/promoter combination inducible enhancer/promoter combination, e.g. hypoxia, iron, transcription factor tet inducible
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/008—Vector systems having a special element relevant for transcription cell type or tissue specific enhancer/promoter combination
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2830/00—Vector systems having a special element relevant for transcription
- C12N2830/80—Vector systems having a special element relevant for transcription from vertebrates
- C12N2830/90—Vector systems having a special element relevant for transcription from vertebrates avian
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N2840/00—Vectors comprising a special translation-regulating system
- C12N2840/20—Vectors comprising a special translation-regulating system translation of more than one cistron
Definitions
- the present invention relates generally to administration of a transposon-based vector to the reproductive tract in an animal.
- the reproductive tract includes an ovary, ova within an ovary, and an oviduct.
- Such administration results in incorporation of a gene of interest contained in the vector in the ovary, the oviduct or an ovum of the animal.
- the present invention further includes production of a protein encoded by the gene in an egg produced by the animal.
- Transgenic animals are desirable for a variety of reasons, including their potential as biological factories to produce desired molecules for pharmaceutical, diagnostic and industrial uses. This potential is attractive to the industry due to the inadequate capacity in facilities used for recombinant production of desired molecules and the increasing demand by the pharmaceutical industry for use of these facilities. Numerous attempts to produce transgenic animals have met several problems, including low rates of gene incorporation and unstable gene incorporation. Accordingly, improved gene technologies are needed for the development of transgenic animals for the production of desired molecules.
- Type 1 diabetes is an autoimmune disease that ultimately results in destruction of the insulin producing ⁇ -cells in the pancreas.
- patients with Type 1 diabetes may be treated adequately with insulin injections or insulin pumps, these therapies are only partially effective. Insulin replacement, such as via insulin injection or pump administration, cannot fully reverse the defect in the vascular endothelium found in the hyperglycemic state (Pieper et al., 1996. Diabetes Res. Clin. Pract. Suppl. S157-S162).
- virus-based delivery vectors such as adeno and adeno-associated viruses, retroviruses, and other viruses, which have been attenuated to no longer replicate.
- viral vectors There are multiple problems associated with the use of viral vectors. Firstly, they are not tissue-specific. In fact, a gene therapy trial using adenovirus was recently halted because the vector was present in the patient's sperm (Gene trial to proceed despite fears that therapy could change child's genetic makeup. The New York Times, Dec. 23, 2001). Secondly, viral vectors are likely to be transiently incorporated, which necessitates re-treating a patient at specified time intervals. (Kay, M. A., et al. 2001. Nature Medicine 7:33-40). Thirdly, there is a concern that a viral-based vector could revert to its virulent form and cause disease.
- viral-based vectors require a dividing cell for stable integration.
- viral-based vectors indiscriminately integrate into various cells, which can result in undesirable germline integration.
- the required high titers needed to achieve the desired effect have resulted in the death of one patient and they are believed to be responsible for induction of cancer in a separate study. (Science, News of the Week, Oct. 4, 2002).
- inducible promoters are regulated by substances either produced or recognized by the transcription control elements within the cell in which the gene is incorporated.
- control of gene expression is desired in transgenic animals or humans so that incorporated genes are selectively activated at desired times and/or under the influence of specific substances. Accordingly, what is needed is a means to selectively activate genes introduced into the genome of cells of a transgenic animal or human.
- Transgenic animals include all egg-laying animals and milk-producing animals. Transgenic animals further include but are not limited to avians, fish, amphibians, reptiles, insects, mammals and humans.
- the animal is a milk-producing animal, including but not limited to bovine, porcine, ovine and equine animals.
- the animal is an avian animal.
- the animal is a mammal. Animals are made transgenic through administration of a composition comprising a transposon-based vector designed for incorporation of a gene of interest for production of a desired protein, together with an acceptable carrier.
- compositions of the present invention are introduced into the reproductive system of an animal.
- the compositions of the present invention are administered to a reproductive organ including, but not limited to, an oviduct, an ovary, or into the duct system of the mammary gland.
- the compositions of the present invention are may be administered to a reproductive organ of an animal through the cloaca.
- the compositions of the present invention may be directly administered to a reproductive organ or can be administered to an artery leading to the reproductive organ.
- the compositions of the present invention are introduced into the the reproductive system of an avian animal.
- the compositions of the present invention are introduced into the the intramammary duct system of a mammal.
- a transfection reagent is optionally added to the composition before administration.
- the transposon-based vectors of the present invention include a transposase, operably-linked to a first promoter, and a coding sequence for a protein or peptide of interest operably-linked to a second promoter, wherein the coding sequence for the protein or peptide of interest and its operably-linked promoter are flanked by transposase insertion sequences recognized by the transposase.
- the transposon-based vector also includes the following characteristics: a) one or more modified Kozak sequences at the 3′ end of the first promoter to enhance expression of the transposase; b) modifications of the codons for the first several N-terminal amino acids of the transposase, wherein the nucleotide at the third base position of each codon is changed to an A or a T without changing the corresponding amino acid; c) addition of one or more stop codons to enhance the termination of transposase synthesis; and/or, d) addition of an effective polyA sequence operably-linked to the transposase to further enhance expression of the transposase gene.
- the effective polyA sequence is an avian optimized polyA sequence.
- the present invention also provides for tissue-specific incorporation and/or expression of a gene of interest.
- Tissue-specific incorporation of a gene of interest may be achieved by placing the transposase gene under the control of a tissue-specific promoter, whereas tissue-specific expression of a gene of interest may be achieved by placing the gene of interest under the control of a tissue-specific promoter.
- the gene of interest is transcribed under the influence of an ovalbumin, or other oviduct specific, promoter. Linking the gene of interest to an oviduct specific promoter in an egg-laying animal results in synthesis of a desired molecule and deposition of the desired molecule in a developing egg.
- the present invention advantageously produces a high number of transgenic animals having a gene of interest stably incorporated.
- these transgenic animals successfully pass the desired gene to their progeny.
- the present invention can be used to obtain transgenic animals having the gene of interest incorporated into the germline through transfection of the ovary or the present invention can be used to obtain transgenic animals having the gene of interest incorporated into the oviduct in a tissue-specific manner.
- Both types of transgenic animals of the present invention produce large amounts of a desired molecule encoded by the transgene.
- Transgenic egg-laying animals, particularly avians produce large amounts of a desired protein that is deposited in the egg for rapid harvest and purification.
- Any desired gene may be incorporated into the novel transposon-based vectors of the present invention in order to synthesize a desired molecule in the transgenic animals.
- Proteins, peptides and nucleic acids are preferred desired molecules to be produced by the transgenic animals of the present invention.
- Particularly preferred proteins are antibody proteins and other immunopharmecuetical proteins.
- This invention provides a composition useful for the production of transgenic hens capable of producing substantially high amounts of a desired protein or peptide. Entire flocks of transgenic birds may be developed very quickly in order to produce industrial amounts of desired molecules.
- the present invention solves the problems inherent in the inadequate capacity of fermentation facilities used for bacterial production of molecules and provides a more efficient and economical way to produce desired molecules. Accordingly, the present invention provides a means to produce large amounts of therapeutic, diagnostic and reagent molecules.
- Transgenic chickens are excellent in terms of convenience and efficiency of manufacturing molecules such as proteins and peptides. Starting with a single transgenic rooster, thousands of transgenic offspring can be produced within a year. (In principle, up to forty million offspring could be produced in just three generations). Each transgenic female is expected to lay at least 250 eggs/year, each potentially containing hundreds of milligrams of the selected protein. Flocks of chickens numbering in the hundreds of thousands are readily handled through established commercial systems. The technologies for obtaining eggs and fractionating them are also well known and widely accepted. Thus, for each therapeutic, diagnostic, or other protein of interest, large amounts of a substantially pure material can be produced at relatively low incremental cost.
- a wide range of recombinant peptides and proteins can be produced in transgenic egg-laying animals. Enzymes, hormones, antibodies, growth factors, serum proteins, commodity proteins, biological response modifiers, peptides and designed proteins may all be made through practice of the present invention. For example, rough estimates suggest that it is possible to produce in bulk growth hormone, insulin, or Factor VIII, and deposit them in egg whites, for an incremental cost in the order of one dollar per gram. At such prices it is feasible to consider administering such medical agents by inhalation or even orally, instead of through injection. Even if bioavailability rates through these avenues were low, the cost of a much higher effective-dose would not be prohibitive.
- the egg-laying transgenic animal is an avian.
- the method of the present invention may be used in avians including Ratites, Psittaciformes, Falconiformes, Piciformes, Strigiformes, Passeriformes, Coraciformes, Ralliformes, Cuculiformes, Columbiformes, Galliformes, Anseriformes, and Herodiones.
- the egg-laying transgenic animal is a poultry bird. More preferably, the bird is a chicken, turkey, duck, goose or quail.
- Another preferred bird is a ratite, such as, an emu, an ostrich, a rhea, or a cassowary.
- Other preferred birds are partridge, pheasant, kiwi, parrot, parakeet, macaw, falcon, eagle, hawk, pigeon, cockatoo, song birds, jay bird, blackbird, finch, warbler, canary, toucan, mynah, or sparrow.
- Another object of the present invention is to produce transgenic animals through intraoviduct or intraovarian administration of a transposon-based vector, wherein the transgenic animals produce desired proteins or peptides.
- Yet another object of the present invention is to provide a method to produce transgenic animals through intraoviduct or intraovarian administration of a transposon-based vector that are capable of producing a desired molecule, such as a protein, peptide or nucleic acid.
- Another object of the present invention is to provide a method to produce transgenic animals through intraoviduct or intraovarian administration of a transposon-based vector, wherein such administration results in modulation of endogenous gene expression.
- Still another object of the present invention is to provide a method to produce transgenic avians through intraoviduct or intraovarian administration of a transposon-based vector that are capable of producing proteins or peptides and depositing these proteins or peptides in the egg.
- Another object of the present invention is to provide transgenic avians that contain a stably incorporated transgene.
- Still another object of the present invention is to provide eggs containing desired proteins or peptides encoded by a transgene incorporated into the transgenic avian that produces the egg.
- Still another object of the present invention is to provide a method to produce transgenic milk-producing animals through administration of a transposon-based vector that are capable of producing proteins or peptides and depositing these proteins or peptides in their milk.
- Another object of the present invention is to provide transgenic milk-producing animals that contain a stably incorporated transgene.
- Another object of the present invention is to provide transgenic milk-producing animals that are capable of producing proteins or peptides and depositing these proteins or peptides in their milk.
- Yet another object of the present invention is to provide milk containing desired molecules encoded by a transgene incorporated into the transgenic milk-producing animals that produce the milk.
- Still another object of the present invention is to provide milk containing desired proteins or peptides encoded by a transgene incorporated into the transgenic milk-producing animals that produce the milk.
- An advantage of the present invention is that transgenic animals are produced with higher efficiencies than observed in the prior art.
- Another advantage of the present invention is that these transgenic animals possess high copy numbers of the transgene.
- Another advantage of the present invention is that the transgenic animals produce large amounts of desired molecules encoded by the transgene.
- Still another advantage of the present invention is that desired molecules are produced by the transgenic animals much more efficiently and economically than prior art methods, thereby providing a means for large scale production of desired molecules, particularly proteins and peptides.
- Yet another advantage of the present invention is that the desired proteins and peptides are produced rapidly after making animals transgenic through introduction of the vectors of the present invention.
- FIG. 1 depicts schematically a transposon-based vector containing a transposase operably linked to a first promoter and a gene of interest operably-linked to a second promoter, wherein the gene of interest and its operably-linked promoter are flanked by insertion sequences (IS) recognized by the transposase.
- IS insertion sequences
- FIG. 2 depicts schematically a transposon-based vector for targeting deposition of a polypeptide in an egg white wherein Ov pro is the ovalbumin promoter, Ov protein is the ovalbumin protein and PolyA is a polyadenylation sequence.
- the TAG sequence includes a spacer sequence, the gp41 hairpin loop from HIV I and a protease cleavage site.
- FIG. 3 depicts schematically a transposon-based vector for targeting deposition of a polypeptide in an egg white wherein Ovo pro is the ovomucoid promoter and Ovo SS is the ovomucoid signal sequence.
- the TAG sequence includes a spacer, the gp41 hairpin loop from HIV I and a protease cleavage site.
- FIG. 4 depicts schematically a transposon based-vector for expression of an RNAi molecule.
- Tet i pro indicates a tetracycline inducible promoter whereas “pro” indicates the pro portion of a prepro sequence as described herein.
- Ovgen indicates approximately 60 base pairs of an ovalbumin gene
- Ovotrans indicates approximately 60 base pairs of an ovotransferrin gene
- Ovomucin indicates approximately 60 base pairs of an ovomucin gene.
- FIG. 5 is a picture of an SDS-PAGE gel wherein a pooled fraction of an isolated proinsulin fusion protein was run in lanes 4 and 6 .
- Lanes 1 and 10 of the gel contain molecular weight standards, lanes 2 and 8 contain non-trangenic chicken egg white, and lanes 3 , 5 , 7 and 9 are blank.
- the present invention provides a new, effective and efficient method of producing transgenic animals, particularly egg-laying animals and milk-producing animals, through administration of a composition comprising a transposon-based vector designed for incorporation of a gene of interest and production of a desired molecule.
- the transposon-based vectors are administered to a reproductive organ including, but not limited to, an oviduct, an ovary, or into the duct system of the mammary gland.
- the vectors may be directly administered to a reproductive organ or can be administered to an artery leading to the reproductive organ or to a lymph system proximate to the cells to be genetically altered.
- the vectors may be administered to a reproductive organ of an animal through the cloaca.
- One method of direct administration is by injection, and in one embodiment, the lumen of the magnum of the oviduct is injected with a transposon-based vector.
- Another method of direct administration is by injection, and in one embodiment, the lumen of the infundibulum of the oviduct is injected with a transposon-based vector.
- a preferred intrarterial administration is an administration into an artery that supplies the oviduct or the ovary.
- administration of the transposon-based vector to an oviduct or an artery that leads to the oviduct results in incorporation of the vector into the epithelial and/or secretory cells of the oviduct.
- administration of the transposon-based vector to an ovary or an artery that leads to the ovary or a lymphatic system proximal to the ovary results in incorporation of the vector into an oocyte or a germinal disk inside the ovary.
- antibody is used interchangeably with the term “immunoglobulin” and is defined herein as a protein synthesized by an animal or a cell of the immune system in response to the presence of a foreign substance commonly referred to as an “antigen” or an “immunogen”.
- the term antibody includes fragments of antibodies.
- Antibodies are characterized by specific affinity to a site on the antigen, wherein the site is referred to an “antigenic determinant” or an “epitope”. Antigens can be naturally occurring or artificially engineered.
- Artificially engineered antigens include, but are not limited to, small molecules, such as small peptides, attached to haptens such as macromolecules, for example proteins, nucleic acids, or polysaccharides.
- Artificially designed or engineered variants of naturally occurring antibodies and artificially designed or engineered antibodies not occurring in nature are all included in the current definition. Such variants include conservatively substituted amino acids and other forms of substitution as described in the section concerning proteins and polypeptides.
- the term “egg-laying animal” includes all amniotes such as birds, turtles, lizards and monotremes. Monotremes are egg-laying mammals and include the platypus and echidna.
- the term “bird” or “fowl,” as used herein, is defined as a member of the Aves class of animals which are characterized as warm-blooded, egg-laying vertebrates primarily adapted for flying.
- Avians include, without limitation, Ratites, Psittaciformes, Falconiformes, Piciformes, Strigiformes, Passeriformes, Coraciformes, Ralliformes, Cuculiformes, Columbiformes, Galliformes, Anseriformes, and Herodiones.
- Ratite is defined as a group of flightless, mostly large, running birds comprising several orders and including the emus, ostriches, kiwis, and cassowaries.
- Psittaciformes includes parrots and refers to a monofamilial order of birds that exhibit zygodactylism and have a strong hooked bill.
- a “parrot” is defined as any member of the avian family Psittacidae (the single family of the Psittaciformes), distinguished by the short, stout, strongly hooked beak.
- Avians include all poultry birds, especially chickens, geese, turkeys, ducks and quail.
- chickens used for table egg production such as egg-type chickens, chickens reared for public meat consumption, or broilers, and chickens reared for both egg and meat production (“dual-purpose” chickens).
- the term “chicken” also denotes chickens produced by primary breeder companies, or chickens that are the parents, grandparents, great-grandparents, etc. of those chickens reared for public table egg, meat, or table egg and meat consumption.
- egg is defined herein as including a large female sex cell enclosed in a porous, calcarous or leathery shell, produced by birds and reptiles.
- ovum is defined as a female gamete, and is also known as an egg. Therefore, egg production in all animals other than birds and reptiles, as used herein, is defined as the production and discharge of an ovum from an ovary, or “ovulation”. Accordingly, it is to be understood that the term “egg” as used herein is defined as a large female sex cell enclosed in a porous, calcarous or leathery shell, when a bird or reptile produces it, or it is an ovum when it is produced by all other animals.
- milk-producing animal refers herein to mammals including, but not limited to, bovine, ovine, porcine, equine, and primate animals. Milk-producing animals include but are not limited to cows, llamas, camels, goats, reindeer, zebu, water buffalo, yak, horses, pigs, rabbits, non-human primates, and humans.
- gene is defined herein to include a coding region for a protein, peptide or polypeptide.
- transgenic animal refers to an animal having at least a portion of the transposon-based vector DNA incorporated into its DNA. While a transgenic animal includes an animal wherein the transposon-based vector DNA is incorporated into the germline DNA, a transgenic animal also includes an animal having DNA in one or more cells that contain a portion of the transposon-based vector DNA for any period of time. In a preferred embodiment, a portion of the transposon-based vector comprises a gene of interest. More preferably, the gene of interest is incorporated into the animal's DNA for a period of at least five days, more preferably the reproductive life of the animal, and most preferably the life of the animal. In a further preferred embodiment, the animal is an avian.
- vector is used interchangeably with the terms “construct”, “DNA construct” and “genetic construct” to denote synthetic nucleotide sequences used for manipulation of genetic material, including but not limited to cloning, subcloning, sequencing, or introduction of exogenous genetic material into cells, tissues or organisms, such as birds. It is understood by one skilled in the art that vectors may contain synthetic DNA sequences, naturally occurring DNA sequences, or both.
- the vectors of the present invention are transposon-based vectors as described herein.
- operably-linked is defined herein to mean that the two sequences are associated in a manner that allows the regulatory sequence to affect expression of the other nucleotide sequence. It is not required that the operably-linked sequences be directly adjacent to one another with no intervening sequence(s).
- regulatory sequence is defined herein as including promoters, enhancers and other expression control elements such as polyadenylation sequences, matrix attachment sites, insulator regions for expression of multiple genes on a single construct, ribosome entry/attachment sites, introns that are able to enhance expression, and silencers.
- DNA construct is an important factor in successfully producing transgenic animals.
- the DNA (or RNA) constructs previously used often do not integrate into the host DNA, or integrate only at low frequencies. Other factors may have also played a part, such as poor entry of the vector into target cells.
- the present invention provides transposon-based vectors that can be administered to an animal that overcome the prior art problems relating to low transgene integration frequencies.
- pTnMCS SEQ ID NO:2
- pTnMod SEQ ID NO:3
- the transposon-based vectors of the present invention produce integration frequencies an order of magnitude greater than has been achieved with previous vectors. More specifically, intratesticular injections performed with a prior art transposon-based vector (described in U.S. Pat. No. 5,719,055) resulted in 41% sperm positive roosters whereas intratesticular injections performed with the novel transposon-based vectors of the present invention resulted in 77% sperm positive roosters. Actual frequencies of integration were estimated by either or both comparative strength of the PCR signal from the sperm and histological evaluation of the testes and sperm by quantitative PCR.
- the transposon-based vectors of the present invention include a transposase gene operably-linked to a first promoter, and a coding sequence for a desired protein or peptide operably-linked to a second promoter, wherein the coding sequence for the desired protein or peptide and its operably-linked promoter are flanked by transposase insertion sequences recognized by the transposase.
- the transposon-based vector also includes one or more of the following characteristics: a) one or more modified Kozak sequences comprising ACCATG (SEQ ID NO:1) at the 3′ end of the first promoter to enhance expression of the transposase; b) modifications of the codons for the first several N-terminal amino acids of the transposase, wherein the third base of each codon was changed to an A or a T without changing the corresponding amino acid; c) addition of one or more stop codons to enhance the termination of transposase synthesis; and/or, d) addition of an effective polyA sequence operably-linked to the transposase to further enhance expression of the transposase gene.
- the transposon-based vector may additionally or alternatively include one or more of the following Kozak sequences at the 3′ end of any promoter, including the promoter operably-linked to the transposase: ACCATGG (SEQ ID NO:4), AAGATGT (SEQ ID NO:5), ACGATGA (SEQ ID NO:6), AAGATGG (SEQ ID NO:7), GACATGA (SEQ ID NO:8), ACCATGA (SEQ ID NO:9), and ACCATGA (SEQ ID NO:10), ACCATGT (SEQ ID NO:52).
- FIG. 1 shows a schematic representation of several components of the transposon-based vector.
- the present invention further includes vectors containing more than one gene of interest, wherein a second or subsequent gene of interest is operably-linked to the second promoter or to a different promoter.
- the transposon-based vectors shown in the Figures are representative of the present invention and that the order of the vector elements may be different than that shown in the Figures, that the elements may be present in various orientations, and that the vectors may contain additional elements not shown in the Figures.
- the transposase found in the transposase-based vector is an altered target site (ATS) transposase and the insertion sequences are those recognized by the ATS transposase.
- ATS target site
- the transposase located in the transposase-based vectors is not limited to a modified ATS transposase and can be derived from any transposase.
- Transposases known in the prior art include those found in AC7, Tn5SEQ1, Tn916, Tn951, Tn1721, Tn 2410, Tn1681, Tn1, Tn2, Tn3, Tn4, Tn5, Tn6, Tn9, Tn10, Tn30, Tn101, Tn903, Tn501, Tn1000 ( ⁇ ), Tn1681, Tn2901, AC transposons, Mp transposons, Spm transposons, En transposons, Dotted transposons, Mu transposons, Ds transposons, dSpm transposons and I transposons.
- these transposases and their regulatory sequences are modified for improved functioning as follows: a) the addition one or more modified Kozak sequences comprising ACCATG (SEQ ID NO:1) at the 3′ end of the promoter operably-linked to the transposase; b) a change of the codons for the first several amino acids of the transposase, wherein the third base of each codon was changed to an A or a T without changing the corresponding amino acid; c) the addition of one or more stop codons to enhance the termination of transposase synthesis; and/or, d) the addition of an effective polyA sequence operably-linked to the transposase to further enhance expression of the transposase gene.
- ACCATG SEQ ID NO:1
- a change of the codons for the first several amino acids of the transposase wherein the third base of each codon was changed to an A or a T without changing the corresponding amino acid
- the modifications of the first several N-terminal codons of the transposase gene increase transcription of the transposase gene, in part, by increasing strand dissociation. It is preferable that between approximately 1 and 20, more preferably 3 and 15, and most preferably between 4 and 12 of the first N-terminal codons of the transposase are modified such that the third base of each codon is changed to an A or a T without changing the encoded amino acid. In one embodiment, the first ten N-terminal codons of the transposase gene are modified in this manner. It is also preferred that the transposase contain mutations that make it less specific for preferred insertion sites and thus increases the rate of transgene insertion as discussed in U.S. Pat. No. 5,719,055.
- the transposon-based vectors are optimized for expression in a particular host by changing the methylation patterns of the vector DNA. For example, prokaryotic methylation may be reduced by using a methylation deficient organism for production of the transposon-based vector.
- the transposon-based vectors may also be methylated to resemble eukaryotic DNA for expression in a eukaryotic host.
- Transposases and insertion sequences from other analogous eukaryotic transposon-based vectors that can also be modified and used are, for example, the Drosophila P element derived vectors disclosed in U.S. Pat. No. 6,291,243; the Drosophila mariner element described in Sherman et al. (1998); or the sleeping beauty transposon. See also Hackett et al. (1999); D. Lampe et al., 1999. Proc. Natl. Acad. Sci. USA, 96:11428-11433; S. Fischer et al., 2001. Proc. Natl. Acad. Sci. USA, 98:6759-6764; L. Zagoraiou et al., 2001.
- transposases recognize different insertion sequences, and therefore, it is to be understood that a transposase-based vector will contain insertion sequences recognized by the particular transposase also found in the transposase-based vector.
- the insertion sequences have been shortened to about 70 base pairs in length as compared to those found in wild-type transposons that typically contain insertion sequences of well over 100 base pairs.
- the present invention also encompasses the use of a “rolling replication” type transposon-based vector.
- Use of a rolling replication type transposon allows multiple copies of the transposon/transgene to be made from a single transgene construct and the copies inserted. This type of transposon-based system thereby provides for insertion of multiple copies of a transgene into a single genome.
- a rolling replication type transposon-based vector may be preferred when the promoter operably-linked to gene of interest is endogenous to the host cell and present in a high copy number or highly expressed.
- Tn1, Tn2, Tn3, Tn4, Tn5, Tn9, Tn21, Tn501, Tn551, Tn951, Tn1721, Tn2410 and Tn2603 are examples of a rolling replication type transposon, although Tn5 could be both a rolling replication and a cut and insert type transposon.
- the transposon-based vector contains two stop codons operably-linked to the transposase and/or to the gene of interest.
- one stop codon of UAA or UGA is operably linked to the transposase and/or to the gene of interest.
- an “effective polyA sequence” refers to either a synthetic or non-synthetic sequence that contains multiple and sequential nucleotides containing an adenine base (an A polynucleotide string) and that increases expression of the gene to which it is operably-linked.
- a polyA sequence may be operably-linked to any gene in the transposon-based vector including, but not limited to, a transposase gene and a gene of interest.
- a preferred polyA sequence is optimized for use in the host animal or human. In one embodiment, the polyA sequence is optimized for use in an avian species and more specifically, a chicken.
- An avian optimized polyA sequence generally contains a minimum of 40 base pairs, preferably between approximately 40 and several hundred base pairs, and more preferably approximately 75 base pairs that precede the A polynucleotide string and thereby separate the stop codon from the A polynucleotide string.
- the polyA sequence comprises a conalbumin polyA sequence as provided in SEQ ID NO:11 and as taken from GenBank accession #Y00407, base pairs 10651-11058.
- the polyA sequence comprises a synthetic polynucleotide sequence shown in SEQ ID NO:12.
- the polyA sequence comprises an avian optimized polyA sequence provided in SEQ ID NO:13.
- a chicken optimized polyA sequence may also have a reduced amount of CT repeats as compared to a synthetic polyA sequence.
- the present invention includes methods of or increasing incorporation of a gene of interest wherein the gene of interest resides in a transposon-based vector containing a transposase gene and wherein the transposase gene is operably linked to an avian optimized polyA sequence.
- the present invention also includes methods of increasing expression of a gene of interest in an avian that includes administering a gene of interest to the avian, wherein the gene of interest is operably-linked to an avian optimized polyA sequence.
- An avian optimized polyA nucleotide string is defined herein as a polynucleotide containing an A polynucleotide string and a minimum of 40 base pairs, preferably between approximately 40 and several hundred base pairs, and more preferably approximately 60 base pairs that precede the A polynucleotide string.
- the present invention further provides transposon-based vectors containing a gene of interest or transposase gene operably linked to an avian optimized polyA sequence.
- the first promoter operably-linked to the transposase gene and the second promoter operably-linked to the gene of interest can be a constitutive promoter or an inducible promoter.
- Constitutive promoters include, but are not limited to, immediate early cytomegalovirus (CMV) promoter, herpes simplex virus 1 (HSV1) immediate early promoter, SV40 promoter, lysozyme promoter, early and late CMV promoters, early and late HSV promoters, ⁇ -actin promoter, tubulin promoter, Rous-Sarcoma virus (RSV) promoter, and heat-shock protein (HSP) promoter.
- CMV immediate early cytomegalovirus
- HSV40 promoter herpes simplex virus 1 immediate early promoter
- lysozyme promoter early and late CMV promoters
- early and late HSV promoters early and late HSV promoters
- ⁇ -actin promoter tubulin promoter
- Inducible promoters include tissue-specific promoters, developmentally-regulated promoters and chemically inducible promoters.
- tissue-specific promoters include the glucose 6 phosphate (G6P) promoter, vitellogenin promoter, ovalbumin promoter, ovomucoid promoter, conalbumin promoter, ovotransferrin promoter, prolactin promoter, kidney uromodulin promoter, and placental lactogen promoter.
- the vitellogenin promoter includes a polynucleotide sequence of SEQ ID NO:14.
- the G6P promoter sequence may be deduced from a rat G6P gene untranslated upstream region provided in GenBank accession number U57552.1.
- Examples of developmentally-regulated promoters include the homeobox promoters and several hormone induced promoters.
- Examples of chemically inducible promoters include reproductive hormone induced promoters and antibiotic inducible promoters such as the tetracycline inducible promoter and the zinc-inducible metallothionine promoter.
- inducible promoter systems include the Lac operator repressor system inducible by IPTG (isopropyl beta-D-thiogalactoside) (Cronin, A. et al. 2001. Genes and Development, v. 15), ecdysone-based inducible systems (Hoppe, U. C. et al. 2000. Mol. Ther. 1:159-164); estrogen-based inducible systems (Braselmann, S. et al. 1993. Proc. Natl. Acad. Sci.
- progesterone-based inducible systems using a chimeric regulator, GLVP, which is a hybrid protein consisting of the GAL4 binding domain and the herpes simplex virus transcriptional activation domain, VP16, and a truncated form of the human progesterone receptor that retains the ability to bind ligand and can be turned on by RU486 (Wang, et al. 1994. Proc. Natl. Acad. Sci.
- GLVP chimeric regulator
- CID-based inducible systems using chemical inducers of dimerization (CIDs) to regulate gene expression, such as a system wherein rapamycin induces dimerization of the cellular proteins FKBP12 and FRAP (Belshaw, P. J. et al. 1996. J. Chem. Biol. 3:731-738; Fan, L. et al. 1999. Hum. Gene Ther. 10:2273-2285; Shariat, S. F. et al. 2001. Cancer Res. 61:2562-2571; Spencer, D. M. 1996. Curr. Biol. 6:839-847).
- Chemical substances that activate the chemically inducible promoters can be administered to the animal containing the transgene of interest via any method known to those of skill in the art.
- cell or tissue-specific and constitutive promoters include but are not limited to smooth-muscle SM22 promoter, including chimeric SM22alpha/telokin promoters (Hoggatt A. M. et al., 2002. Circ Res. 91(12):1151-9); ubiquitin C promoter (Biochim Biophys Acta, 2003. Jan. 3;1625(l):52-63); Hsf2 promoter; murine COMP (cartilage oligomeric matrix protein) promoter; early B cell-specific mb-1 promoter (Sigvardsson M., et al., 2002. Mol. Cell Biol.
- PSA prostate specific antigen
- promoter of the human FAT/CD36 gene (Kuriki C., et al., 2002. Biol. Pharm. Bull. 25(11):1476-8); VL30 promoter (Staplin W. R. et al., 2002. Blood Oct. 24, 2002); and, IL-10 promoter (Brenner S., et al., 2002. J. Biol. Chem. Dec. 18, 2002).
- avian promoters include, but are not limited to, promoters controlling expression of egg white proteins, such as ovalbumin, ovotransferrin (conalbumin), ovomucoid, lysozyme, ovomucin, g2 ovoglobulin, g3 ovoglobulin, ovoflavoprotein, ovostatin (ovomacroglobin), cystatin, avidin, thiamine-binding protein, glutamyl aminopeptidase minor glycoprotein 1, minor glycoprotein 2; and promoters controlling expression of egg-yolk proteins, such as vitellogenin, very low-density lipoproteins, low density lipoprotein, cobalamin-binding protein, riboflavin-binding protein, biotin-binding protein (Awade, 1996.
- egg white proteins such as ovalbumin, ovotransferrin (conalbumin), ovomucoid, lysozyme, ovo
- vitellogenin promoter is that it is active during the egg-laying stage of an animal's life-cycle, which allows for the production of the protein of interest to be temporally connected to the import of the protein of interest into the egg yolk when the protein of interest is equipped with an appropriate targeting sequence.
- the avian promoter is an oviduct-specific promoter.
- oviduct-specific promoter includes, but is not limited to, ovalbumin; ovotransferrin (conalbumin); ovomucoid; 01, 02, 03, 04 or 05 avidin; ovomucin; g2 ovoglobulin; g3 ovoglobulin; ovoflavoprotein; and ovostatin (ovomacroglobin) promoters.
- liver-specific promoters may be operably-linked to the gene of interest to achieve liver-specific expression of the transgene.
- Liver-specific promoters of the present invention include, but are not limited to, the following promoters, vitellogenin promoter, G6P promoter, cholesterol-7-alpha-hydroxylase (CYP7A) promoter, phenylalanine hydroxylase (PAH) promoter, protein C gene promoter, insulin-like growth factor I (IGF-I) promoter, bilirubin UDP-glucuronosyltransferase promoter, aldolase B promoter, furin promoter, metallothioneine promoter, albumin promoter, and insulin promoter.
- promoters that can be used to target expression of a protein of interest into the milk of a milk-producing animal including, but not limited to, ⁇ lactoglobin promoter, whey acidic protein promoter, lactalbumin promoter and casein promoter.
- immune system-specific promoters may be operably-linked to the gene of interest to achieve immune system-specific expression of the transgene. Accordingly, promoters associated with cells of the immune system may also be used. Acute phase promoters such as interleukin (IL)-1 and IL-2 may be employed. Promoters for heavy and light chain Ig may also be employed. The promoters of the T cell receptor components CD4 and CD8, B cell promoters and the promoters of CR2 (complement receptor type 2) may also be employed. Immune system promoters are preferably used when the desired protein is an antibody protein.
- modified promoters/enhancers wherein elements of a single promoter are duplicated, modified, or otherwise changed.
- steroid hormone-binding domains of the ovalbumin promoter are moved from about ⁇ 6.5 kb to within approximately the first 1000 base pairs of the gene of interest. Modifying an existing promoter with promoter/enhancer elements not found naturally in the promoter, as well as building an entirely synthetic promoter, or drawing promoter/enhancer elements from various genes together on a non-natural backbone, are all encompassed by the current invention.
- the promoters contained within the transposon-based vectors of the present invention may be entire promoter sequences or fragments of promoter sequences.
- the promoter operably linked to a gene of interest is an approximately 900 base pair fragment of a chicken ovalbumin promoter (SEQ ID NO:15).
- the constitutive and inducible promoters contained within the transposon-based vectors may also be modified by the addition of one or more modified Kozak sequences of ACCATG (SEQ ID NO:1).
- the present invention includes transposon-based vectors containing one or more enhancers.
- enhancers may or may not be operably-linked to their native promoter and may be located at any distance from their operably-linked promoter.
- a promoter operably-linked to an enhancer and a promoter modified to eliminate repressive regulatory effects are referred to herein as an “enhanced promoter.”
- the enhancers contained within the transposon-based vectors are preferably enhancers found in birds, and more preferably, an ovalbumin enhancer, but are not limited to these types of enhancers.
- an approximately 675 base pair enhancer element of an ovalbumin promoter is cloned upstream of an ovalbumin promoter with 300 base pairs of spacer DNA separating the enhancer and promoter.
- the enhancer used as a part of the present invention comprises base pairs 1-675 of a chicken ovalbumin enhancer from GenBank accession #S82527.1. The polynucleotide sequence of this enhancer is provided in SEQ ID NO:16.
- cap sites and fragments of cap sites are also included in some of the transposon-based vectors of the present invention.
- approximately 50 base pairs of a 5′ untranslated region wherein the capsite resides are added on the 3′ end of an enhanced promoter or promoter.
- An exemplary 5′ untranslated region is provided in SEQ ID NO:17.
- a putative cap-site residing in this 5′ untranslated region preferably comprises the polynucleotide sequence provided in SEQ ID NO:18.
- the first promoter operably-linked to the transposase gene is a constitutive promoter and the second promoter operably-linked to the gene of interest is a tissue-specific promoter.
- the first constitutive promoter allows for constitutive activation of the transposase gene and incorporation of the gene of interest into virtually all cell types, including the germline of the recipient animal. Although the gene of interest is incorporated into the germline generally, the gene of interest may only be expressed in a tissue-specific manner.
- a transposon-based vector having a constitutive promoter operably-linked to the transposase gene can be administered by any route, and in one embodiment, the vector is administered to an ovary, to an artery leading to the ovary or to a lymphatic system or fluid proximal to the ovary.
- cell-or tissue-specific expression as described herein does not require a complete absence of expression in cells or tissues other than the preferred cell or tissue. Instead, “cell-specific” or “tissue-specific” expression refers to a majority of the expression of a particular gene of interest in the preferred cell or tissue, respectively.
- the first promoter operably-linked to the transposase gene can be a tissue-specific promoter.
- transfection of a transposon-based vector containing a transposase gene operably-linked to an oviduct specific promoter such as the ovalbumin promoter provides for activation of the transposase gene and incorporation of the gene of interest in the cells of the oviduct but not into the germline and other cells generally.
- the second promoter operably-linked to the gene of interest can be a constitutive promoter or an inducible promoter. In a preferred embodiment, both the first promoter and the second promoter are an ovalbumin promoter.
- the transposon-based vector is administered directly to the tissue of interest, to an artery leading to the tissue of interest or to fluids surrounding the tissue of interest.
- the tissue of interest is the oviduct and administration is achieved by direct injection into the oviduct or an artery leading to the oviduct.
- administration is achieved by direct injection into the lumen of the magnum or the infundibulum of the oviduct. Indirect administration to the oviduct may occur through the cloaca.
- cell specific promoters may be used to enhance transcription in selected tissues.
- promoters that are found in cells of the fallopian tube such as ovalbumin, conalbumin, ovomucoid and/or lysozyme, are used in the vectors to ensure transcription of the gene of interest in the epithelial cells and tubular gland cells of the fallopian tube, leading to synthesis of the desired protein encoded by the gene and deposition into the egg white.
- promoters specific for the epithelial cells of the alveoli of the mammary gland such as prolactin, insulin, beta lactoglobin, whey acidic protein, lactalbumin, casein, and/or placental lactogen, are used in the design of vectors used for transfection of these cells for the production of desired proteins for deposition into the milk.
- the G6P promoter may be employed to drive transcription of the gene of interest for protein production. Proteins made in the liver of birds may be delivered to the egg yolk.
- the promoter and other regulatory sequences operably-linked to the transposase gene may be those derived from the host. These host specific regulatory sequences can be tissue specific as described above or can be of a constitutive nature. For example, an avian actin promoter and its associated polyA sequence can be operably-linked to a transposase in a transposase-based vector for transfection into an avian. Examples of other host specific promoters that could be operably-linked to the transposase include the myosin and DNA or RNA polymerase promoters.
- the gene of interest is operably-linked to a directing sequence or a sequence that provides proper conformation to the desired protein encoded by the gene of interest.
- directing sequence refers to both signal sequences and targeting sequences.
- An egg directing sequence includes, but is not limited to, an ovomucoid signal sequence, an ovalbumin signal sequence, a cecropin pre pro signal sequence, and a vitellogenin targeting sequence.
- signal sequence refers to an amino acid sequence, or the polynucleotide sequence that encodes the amino acid sequence, that directs the protein to which it is linked to the endoplasmic reticulum in a eukaryote, and more preferably the translocational pores in the endoplasmic reticulum, or the plasma membrane in a prokaryote, or mitochondria, such as for the purpose of gene therapy for mitochondrial diseases.
- Signal and targeting sequences can be used to direct a desired protein into, for example, the milk, when the transposon-based vectors are administered to a milk-producing animal.
- Signal sequences can also be used to direct a desired protein into, for example, a secretory pathway for incorporation into the egg yolk or the egg white, when the transposon-based vectors are administered to a bird or other egg-laying animal.
- a transposon-based vector is provided in FIG. 3 wherein the gene of interest is operably linked to the ovomucoid signal sequence.
- the present invention also includes a gene of interest operably-linked to a second gene containing a signal sequence.
- FIG. 2 An example of such an embodiment is shown in FIG. 2 wherein the gene of interest is operably-linked to the ovalbumin gene that contains an ovalbumin signal sequence.
- the signal sequence is an ovalbumin signal sequence including a sequence shown in SEQ ID NO:19.
- the signal sequence is a modified ovalbumin signal sequence including a sequence shown in SEQ ID NO:20 or SEQ ID NO:21.
- targeting sequence refers to an amino acid sequence, or the polynucleotide sequence encoding the amino acid sequence, which amino acid sequence is recognized by a receptor located on the exterior of a cell. Binding of the receptor to the targeting sequence results in uptake of the protein or peptide operably-linked to the targeting sequence by the cell.
- a targeting sequence is a vitellogenin targeting sequence that is recognized by a vitellogenin receptor (or the low density lipoprotein receptor) on the exterior of an oocyte.
- the vitellogenin targeting sequence includes the polynucleotide sequence of SEQ ID NO:22.
- the vitellogenin targeting sequence includes all or part of the vitellogenin gene.
- Other targeting sequences include VLDL and Apo E, which are also capable of binding the vitellogenin receptor. Since the ApoE protein is not endogenously expressed in birds, its presence may be used advantageously to identify birds carrying the transposon-based vectors of the present invention.
- a gene of interest selected for stable incorporation is designed to encode any desired protein or peptide or to regulate any cellular response.
- the desired proteins or peptides are deposited in an egg or in milk.
- the present invention encompasses transposon-based vectors containing multiple genes of interest.
- the multiple genes of interest may each be operably-linked to a separate promoter and other regulatory sequence(s) or may all be operably-linked to the same promoter and other regulatory sequences(s).
- multiple gene of interest are linked to a single promoter and other regulatory sequence(s) and each gene of interest is separated by a cleavage site or a pro portion of a signal sequence.
- a gene of interest may contain modifications of the codons for the first several N-terminal amino acids of the gene of interest, wherein the third base of each codon is changed to an A or a T without changing the corresponding amino acid.
- Protein and peptide hormones are a preferred class of proteins in the present invention. Such protein and peptide hormones are synthesized throughout the endocrine system and include, but are not limited to, hypothalamic hormones and hypophysiotropic hormones, anterior, intermediate and posterior pituitary hormones, pancreatic islet hormones, hormones made in the gastrointestinal system, renal hormones, thymic hormones, parathyroid hormones, adrenal cortical and medullary hormones.
- hormones that can be produced using the present invention include, but are not limited to, chorionic gonadotropin, corticotropin, erythropoietin, glucagons, IGF-1, oxytocin, platelet-derived growth factor, calcitonin, follicle-stimulating hormone, luteinizing hormone, thyroid-stimulating hormone, insulin, gonadotropin-releasing hormone and its analogs, vasopressin, octreotide, somatostatin, prolactin, adrenocorticotropic hormone, antidiuretic hormone, thyrotropin-releasing hormone (TRH), growth hormone-releasing hormone (GHRH), dopamine, melatonin, thyroxin (T 4 ), parathyroid hormone (PTH), glucocorticoids such as cortisol, mineralocorticoids such as aldosterone, androgens such as testosterone, adrenaline (epinephrine), noradrenaline (
- the gene of interest is a proinsulin gene and the desired molecule is insulin.
- Proinsulin consists of three parts: a C-peptide and two strands of amino acids (the alpha and beta chains) that later become linked together to form the insulin molecule.
- FIGS. 2 and 3 are schematics of transposon-based vector constructs containing a proinsulin gene operably-linked to an ovalbumin promoter and ovalbumin protein or an ovomucoid promoter and ovomucoid signal sequence, respectively.
- proinsulin is expressed in the oviduct tubular gland cells and then deposited in the egg white.
- SEQ ID NO:23 One example of a proinsulin polynucleotide sequence is shown in SEQ ID NO:23, wherein the C-peptide cleavage site spans from Arg at position 31 to Arg at position 65.
- Serum proteins including lipoproteins such as high density lipoprotein (HDL), HDL-Milano and low density lipoprotein, albumin, clotting cascade factors, factor VIII, factor IX, fibrinogen, and globulins are also included in the group of desired proteins of the present invention.
- Immunoglobulins are one class of desired globulin molecules and include but are not limited to IgG, IgM, IgA, IgD, IgE, IgY, lambda chains, kappa chains and fragments thereof; Fc fragments, and Fab fragments. Desired antibodies include, but are not limited to, naturally occurring antibodies, human antibodies, humanized antibodies, and hybrid antibodies.
- transposon-based vectors of the present invention may be incorporated into the transposon-based vectors of the present invention. Desired antibodies also include antibodies with the ability to bind specific ligands, for example, antibodies against proteins associated with cancer-related molecules, such as anti-her 2, or anti-CA125. Accordingly, the present invention encompasses a transposon-based vector containing one or more genes encoding a heavy immunoglobulin (Ig) chain and a light Ig chain. Further, more than one gene encoding for more than one antibody may be administered in one or more transposon-based vectors of the present invention. In this manner, an egg may contain more than one type of antibody in the egg white, the egg yolk or both. In one embodiment, a transposon-based vector contains a heavy Ig chain and a light Ig chain, both operably linked to a promoter.
- Ig immunoglobulin
- a transposon-based vector contains a heavy Ig chain and a light Ig chain, both operably linked to a promote
- Antibodies used as therapeutic reagents include but are not limited to antibodies for use in cancer immunotherapy against specific antigens, or for providing passive immunity to an animal or a human against an infectious disease or a toxic agent.
- Antibodies used as diagnostic reagents include, but are not limited to antibodies that may be labeled and detected with a detector, for example antibodies with a fluorescent label attached that may be detected following exposure to specific wavelengths.
- Such labeled antibodies may be primary antibodies directed to a specific antigen, for example, rhodamine-labeled rabbit anti-growth hormone, or may be labeled secondary antibodies, such as fluorescein-labeled goat-anti chicken IgG.
- Such labeled antibodies are known to one of ordinary skill in the art.
- Labels useful for attachment to antibodies are also known to one of ordinary skill in the art. Some of these labels are described in the “Handbook of Fluorescent Probes and Research Products”, ninth edition, Richard P. Haugland (ed) Molecular Probes, Inc. Eugene, Oreg.), which is incorporated herein in its entirety.
- Antibodies produced with using the present invention may be used as laboratory reagents for numerous applications including radioimmunoassay, western blots, dot blots, ELISA, immunoaffinity columns and other procedures requiring antibodies as known to one of ordinary skill in the art.
- Such antibodies include primary antibodies, secondary antibodies and tertiary antibodies, which may be labeled or unlabeled.
- Antibodies that may be made with the practice of the present invention include, but are not limited to primary antibodies, secondary antibodies, designer antibodies, anti-protein antibodies, anti-peptide antibodies, anti-DNA antibodies, anti-RNA antibodies, anti-hormone antibodies, anti-hypophysiotropic peptides, antibodies against non-natural antigens, anti-anterior pituitary hormone antibodies, anti-posterior pituitary hormone antibodies, anti-venom antibodies, anti-tumor marker antibodies, antibodies directed against epitopes associated with infectious disease, including, anti-viral, anti-bacterial, anti-protozoal, anti-fungal, anti-parasitic, anti-receptor, anti-lipid, anti-phospholipid, anti-growth factor, anti-cytokine, anti-monokine, anti-idiotype, and anti-accessory (presentation) protein antibodies. Antibodies made with the present invention, as well as light chains or heavy chains, may also be used to inhibit enzyme activity.
- Antibodies that may be produced using the present invention include, but are not limited to, antibodies made against the following proteins: Bovine ⁇ -Globulin, Serum; Bovine IgG, Plasma; Chicken ⁇ -Globulin, Serum; Human ⁇ -Globulin, Serum; Human IgA, Plasma; Human IgA 1 , Myeloma; Human IgA 2 , Myeloma; Human IgA 2 , Plasma; Human IgD, Plasma; Human IgE, Myeloma; Human IgG, Plasma; Human IgG, Fab Fragment, Plasma; Human IgG, F(ab′) 2 Fragment, Plasma; Human IgG, Fc Fragment, Plasma; Human IgG 1 , Myeloma; Human IgG 2 , Myeloma; Human IgG 3 , Myeloma; Human IgG 4 , Myeloma; Human IgM, Myelom
- the transposon-based vector comprises the coding sequence of light and heavy chains of a murine monoclonal antibody that shows specificity for human seminoprotein (GenBank Accession numbers AY129006 and AY129304 for the light and heavy chains, respectively).
- a further non-limiting list of antibodies that recognize other antibodies is as follows: Anti-Chicken IgG, heavy (H) & light (L) Chain Specific (Sheep); Anti-Goat ⁇ -Globulin (Donkey); Anti-Goat IgG, Fc Fragment Specific (Rabbit); Anti-Guinea Pig ⁇ -Globulin (Goat); Anti-Human Ig, Light Chain, Type ⁇ Specific; Anti-Human Ig, Light Chain, Type ⁇ Specific; Anti-Human IgA, ⁇ -Chain Specific (Goat); Anti-Human IgA, Fab Fragment Specific; Anti-Human IgA, Fc Fragment Specific; Anti-Human IgA, Secretory; Anti-Human IgE, ⁇ -Chain Specific (Goat); Anti-Human IgE, Fc Fragment Specific; Anti-Human IgG, Fc Fragment Specific (Goat); Anti-Human IgG, ⁇ -Chain Specific
- Antibodies made by the transgenic animals of the present invention include antibodies that may be used as therapeutic reagents, for example in cancer immunotherapy against specific antigens, as diagnostic reagents and as laboratory reagents for numerous applications including immunoneutralization, radioimmunoassay, western blots, dot blots, ELISA, immunoprecipitation and immunoaffinity columns.
- antibodies include, but are not limited to, antibodies which bind the following ligands: adrenomedulin, amylin, calcitonin, amyloid, calcitonin gene-related peptide, cholecystokinin, gastrin, gastric inhibitory peptide, gastrin releasing peptide, interleukin, interferon, cortistatin, somatostatin, endothelin, sarafotoxin, glucagon, glucagon-like peptide, insulin, atrial natriuretic peptide, BNP, CNP, neurokinin, substance P, leptin, neuropeptide Y, melanin concentrating hormone, melanocyte stimulating hormone, orphanin, endorphin, dynorphin, enkephalin, enkephalin, leumorphin, peptide F, PACAP, PACAP-related peptide, parathyroid hormone, urocortin, corticotrophin
- abciximab (ReoPro), abciximab anti-platelet aggregation monoclonal antibody, anti-CD11a (hu1124), anti-CD 18 antibody, anti-CD20 antibody, anti-cytomegalovirus (CMV) antibody, anti-digoxin antibody, anti-hepatitis B antibody, anti-HER-2 antibody, anti-idiotype antibody to GD3 glycolipid, anti-IgE antibody, anti-IL-2R antibody, antimetastatic cancer antibody (mAb 17-1A), anti-rabies antibody, anti-respiratory syncytial virus (RSV) antibody, anti-Rh antibody, anti-TCR, anti-TNF antibody, anti-VEGF antibody and fab fragment thereof, rattlesnake venom antibody, black widow spider venom antibody, coral snake venom antibody, antibody against very late antigen-4 (VLA-4), C225 humanized antibody to EGF receptor, chi
- the antibodies prepared using the methods of the present invention may also be designed to possess specific labels that may be detected through means known to one of ordinary skill in the art.
- the antibodies may also be designed to possess specific sequences useful for purification through means known to one of ordinary skill in the art.
- Specialty antibodies designed for binding specific antigens may also be made in transgenic animals using the transposon-based vectors of the present invention.
- Production of a monoclonal antibody using the transposon-based vectors of the present invention can be accomplished in a variety of ways.
- two vectors may be constructed: one that encodes the light chain, and a second vector that encodes the heavy chain of the monoclonal antibody. These vectors may then be incorporated into the genome of the target animal by methods disclosed herein.
- the sequences encoding light and heavy chains of a monoclonal antibody may be included on a single DNA construct.
- the coding sequence of light and heavy chains of a murine monoclonal antibody that show specificity for human seminoprotein can be expressed using transposon-based constructs of the present invention (GenBank Accession numbers AY129006 and AY129304 for the light and heavy chains, respectively).
- proteins and peptides synthesized by the immune system including those synthesized by the thymus, lymph nodes, spleen, and the gastrointestinal associated lymph tissues (GALT) system.
- the immune system proteins and peptides proteins that can be made in transgenic animals using the transposon-based vectors of the present invention include, but are not limited to, alpha-interferon, beta-interferon, gamma-interferon, alpha-interferon A, alpha-interferon 1, G-CSF, GM-CSF, interlukin-1 (IL-1), IL-2, IL-3, IL4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-13, TNF- ⁇ , and TNF- ⁇ .
- cytokines included in the present invention include cardiotrophin, stromal cell derived factor, macrophage derived chemokine (MDC), melanoma growth stimulatory activity (MGSA), macrophage inflammatory proteins 1 alpha (MIP-1 alpha), 2, 3 alpha, 3 beta, 4 and 5.
- Lytic peptides such as p146 are also included in the desired molecules of the present invention.
- the p146 peptide comprises an amino acid sequence of SEQ ID NO:24.
- the present invention also encompasses a transposon-based vector comprising a p146 nucleic acid comprising a polynucleotide sequence of SEQ ID NO:25.
- Enzymes are another class of proteins that may be made through the use of the transposon-based vectors of the present invention.
- Such enzymes include but are not limited to adenosine deaminase, alpha-galactosidase, cellulase, collagenase, dnaseI, hyaluronidase, lactase, L-asparaginase, pancreatin, papain, streptokinase B, subtilisin, superoxide dismutase, thrombin, trypsin, urokinase, fibrinolysin, glucocerebrosidase and plasminogen activator.
- additional amino acids and a protease cleavage site are added to the carboxy end of the enzyme of interest in order to prevent expression of a functional enzyme. Subsequent digestion of the enzyme with a protease results in activation of the enzyme.
- Extracellular matrix proteins are one class of desired proteins that may be made through the use of the present invention. Examples include but are not limited to collagen, fibrin, elastin, laminin, and fibronectin and subtypes thereof. Intracellular proteins and structural proteins are other classes of desired proteins in the present invention.
- Growth factors are another desired class of proteins that may be made through the use of the present invention and include, but are not limited to, transforming growth factor- ⁇ (“TGF- ⁇ ”), transforming growth factor- ⁇ (TGF- ⁇ ), platelet-derived growth factors (PDGF), fibroblast growth factors (FGF), including FGF acidic isoforms 1 and 2, FGF basic form 2 and FGF 4, 8, 9 and 10, nerve growth factors (NGF) including NGF 2.5s, NGF 7.0s and beta NGF and neurotrophins, brain derived neurotrophic factor, cartilage derived factor, growth factors for stimulation of the production of red blood cells, growth factors for stimulation of the production of white blood cells, bone growth factors (BGF), basic fibroblast growth factor, vascular endothelial growth factor (VEGF), granulocyte colony stimulating factor (G-CSF), insulin like growth factor (IGF) I and II, hepatocyte growth factor, glial neurotrophic growth factor (GDNF), stem cell factor (SCF), keratinocyte growth factor (K
- Another desired class of proteins that may be made may be made through the use of the present invention include, but are not limited to, leptin, leukemia inhibitory factor (LIF), tumor necrosis factor alpha and beta, ENBREL, angiostatin, endostatin, thrombospondin, osteogenic protein-1, bone morphogenetic proteins 2 and 7, osteonectin, somatomedin-like peptide, and osteocalcin.
- LIF leukemia inhibitory factor
- ENBREL ENBREL
- angiostatin endostatin
- thrombospondin thrombospondin
- osteogenic protein-1 bone morphogenetic proteins 2 and 7, osteonectin, somatomedin-like peptide, and osteocalcin.
- Yet another desired class of proteins are blood proteins or clotting cascade protein including albumin, Prekallikrein, High molecular weight kininogen (HMWK) (contact activation cofactor; Fitzgerald, Flaujeac Williams factor), Factor I (Fibrinogen), Factor II (prothrombin), Factor III (Tissue Factor), Factor IV (calcium), Factor V (proaccelerin, labile factor, accelerator (Ac-) globulin), Factor VI (Va) (accelerin), Factor VII (proconvertin), serum prothrombin conversion accelerator (SPCA), cothromboplastin), Factor VIII (antihemophiliac factor A, antihemophilic globulin (AHG)), Factor IX (Christmas Factor, antihemophilic factor B,plasma thromboplastin component (PTC)), Factor X (Stuart-Prower Factor), Factor XI (Plasma thromboplastin antecedent (PTA)), Factor
- a non-limiting list of the peptides and proteins that may be made may be made through the use of the present invention is provided in product catalogs of companies such as Phoenix Pharmaceuticals, Inc. (www.phoenixpeptide.com; 530 Harbor Boulevard, Belmont, Calif.), Peninsula Labs (San Carlos Calif.), SIGMA, (St.Louis, Mo. www.sigma-aldrich.com), Cappel ICN (Irvine, Calif., www.icnbiomed.com), and Calbiochem (La Jolla, Calif., www.calbiochem.com).
- the polynucleotide sequences encoding these proteins and peptides of interest may be obtained from the scientific literature, from patents, and from databases such as GenBank. Alternatively, one of ordinary skill in the art may design the polynucleotide sequence to be incorporated into the genome by choosing the codons that encode for each amino acid in the desired protein or peptide.
- Other desired proteins that may be made by the transgenic animals of the present invention include bacitracin, polymixin b, vancomycin, cyclosporine, anti-RSV antibody, alpha-1 antitrypsin (AAT), anti-cytomegalovirus antibody, anti-hepatitis antibody, anti-inhibitor coagulant complex, anti-rabies antibody, anti-Rh(D) antibody, adenosine deaminase, anti-digoxin antibody, antivenin crotalidae (rattlesnake venom antibody), antivenin latrodectus (black widow spider venom antibody), antivenin micrurus (coral snake venom antibody), aprotinin, corticotropin (ACTH), diphtheria antitoxin, lymphocyte immune globulin (anti-thymocyte antibody), protamine, thyrotropin, capreomycin, ⁇ -galactosidase, gramicidin, strepto
- IR502 IR502
- IR501 BI 1050/1272 mAb against very late antigen 4
- VLA-4 very late antigen 4
- C225 humanized mAb to EGF receptor anti-idiotype antibody to GD3 glycolipid, antibacterial peptide against H. pylori , MDX-447 bispecific humanized mAb to EGF receptor, anti-cytomegalovirus (CMV), Medi-491 B 19 parvovirus vaccine, humanized recombinant mAb (IgG1k) against respiratory syncytial virus (RSV), urinary tract infection vaccine (against “pili” on Escherechia coli strains), proteins of lyme disease vaccine against B.
- IgG1k humanized recombinant mAb against respiratory syncytial virus
- RSV respiratory syncytial virus
- urinary tract infection vaccine against “pili” on Escherechia coli strains
- burgdorferi protein DbpA
- proteins of Medi-501 human papilloma virus-11 vaccine HPV
- Streptococcus pneumoniae vaccine Medi-507 mAb (humanized form of BTI-322) against CD2 receptor on T-cells
- MDX-33 mAb to Fc ⁇ R1 receptor MDX-RA immunotoxin (ricin A linked) mAb
- MDX-210 bi-specific mAb against HER-2 MDX-447 bi-specific mAb against EGF receptor
- MDX-22 MDX-220 bi-specific mAb against TAG-72 on tumors
- colony-stimulating factor (CSF) molgramostim
- humanized mAb to the IL-2 R ⁇ -chain basiciliximab
- mAb to IgE IGE 025A
- myelin basic protein-altered peptide MSP771A
- humanized mAb against the epidermal growth receptor-2 humanized mAb against the ⁇ subunit of the inter
- the peptides and proteins made using the present invention may be labeled using labels and techniques known to one of ordinary skill in the art. Some of these labels are described in the “Handbook of Fluorescent Probes and Research Products”, ninth edition, Richard P. Haugland (ed) Molecular Probes, Inc. Eugene, Oreg.), which is incorporated herein in its entirety. Some of these labels may be genetically engineered into the polynucleotide sequence for the expression of the selected protein or peptide. The peptides and proteins may also have label-incorporation “handles” incorporated to allow labeling of an otherwise difficult or impossible to label protein.
- the present invention may also be used to produce desired molecules other than proteins and peptides including, but not limited to, lipoproteins such as high density lipoprotein (HDL), HDL-Milano, and low density lipoprotein, lipids, carbohydrates, siRNA and ribozymes.
- lipoproteins such as high density lipoprotein (HDL), HDL-Milano, and low density lipoprotein, lipids, carbohydrates, siRNA and ribozymes.
- a gene of interest encodes a nucleic acid molecule or a protein that directs production of the desired molecule.
- the present invention further encompasses the use of inhibitory molecules to inhibit endogenous (i.e., non-vector) protein production.
- inhibitory molecules include antisense nucleic acids, siRNA and inhibitory proteins.
- the endogenous protein whose expression is inhibited is an egg white protein including, but not limited to ovalbumin, ovotransferrin, and ovomucin.
- a transposon-based vector containing an ovalbumin DNA sequence, that upon transcription forms a double stranded RNA molecule, is transfected into an animal such as a bird and the bird's production of endogenous ovalbumin protein is reduced by the interference RNA mechanism (RNAi).
- RNAi interference RNA mechanism
- One exemplary prepro sequence is that of cecropin and comprising base pairs 563-733 of the Cecropin cap site and Prepro provided in Genbank accession number X07404. Additional cecropin prepro and pro sequences are provided in SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, and SEQ ID NO:51. Additionally, inducible knockouts or knockdowns of the endogenous protein may be created to achieve a reduction or inhibition of endogenous protein production. Endogenous egg white production can be inhibited in an avian at any time, but is preferably inhibited preceding, or immediately preceding, the harvest of eggs.
- Proteins are chains of amino acids (typically L-amino acids) whose alpha carbons are linked through peptide bonds formed by a condensation reaction between the carboxyl group of the alpha carbon of one amino acid and the amino group of the alpha carbon of another amino acid.
- the terminal amino acid at one end of the chain i.e., the amino terminal
- the terminal amino acid at the other end of the chain i.e., the carboxy terminal
- amino terminus refers to the free alpha-amino group on the amino acid at the amino terminal of the protein, or to the alpha-amino group (imino group when participating in a peptide bond) of an amino acid at any other location within the protein.
- carboxy terminus refers to the free carboxyl group on the amino acid at the carboxy terminus of a protein, or to the carboxyl group of an amino acid at any other location within the protein.
- amino acids making up a protein are numbered in order, starting at the amino terminal and increasing in the direction toward the carboxy terminal of the protein. Thus, when one amino acid is said to “follow” another, that amino acid is positioned closer to the carboxy terminal of the protein than the preceding amino acid.
- amino acid is used herein to refer to an amino acid (D or L) or an amino acid mimetic that is incorporated into a protein by an amide bond.
- the amino acid may be a naturally occurring amino acid or, unless otherwise limited, may encompass known analogs of natural amino acids that function in a manner similar to the naturally occurring amino acids (i.e., amino acid mimetics).
- an amide bond mimetic includes peptide backbone modifications well known to those skilled in the art.
- Suitable protecting groups are described in Green and Wuts, “Protecting Groups in Organic Synthesis”, John Wiley and Sons, Chapters 5 and 7, 1991, the teachings of which are incorporated herein by reference.
- Preferred protecting groups are those which facilitate transport of the peptide through membranes, for example, by reducing the hydrophilicity and increasing the lipophilicity of the peptide, and which can be cleaved, either by hydrolysis or enzymatically (Ditter et al., 1968. J. Pharm. Sci. 57:783; Ditter et al., 1968. J. Pharm. Sci. 57:828; Ditter et al., 1969. J. Pharm. Sci.
- Suitable hydroxyl protecting groups include ester, carbonate and carbamate protecting groups.
- Suitable amine protecting groups include acyl groups and alkoxy or aryloxy carbonyl groups, as described above for N-terminal protecting groups.
- Suitable carboxylic acid protecting groups include aliphatic, benzyl and aryl esters, as described below for C-terminal protecting groups.
- the carboxylic acid group in the side chain of one or more glutamic acid or aspartic acid residues in a peptide of the present invention is protected, preferably as a methyl, ethyl, benzyl or substituted benzyl ester, more preferably as a benzyl ester.
- each amino acid in a group has similar electronic and steric properties.
- a conservative substitution can be made by substituting an amino acid with another amino acid from the same group. It is to be understood that these groups are non-limiting, i.e. that there are additional modified amino acids which could be included in each group.
- Group I includes leucine, isoleucine, valine, methionine and modified amino acids having the following side chains: ethyl, n-propyl n-butyl.
- Group I includes leucine, isoleucine, valine and methionine.
- Group II includes glycine, alanine, valine and a modified amino acid having an ethyl side chain.
- Group II includes glycine and alanine.
- Group III includes phenylalanine, phenylglycine, tyrosine, tryptophan, cyclohexylmethyl glycine, and modified amino residues having substituted benzyl or phenyl side chains.
- Preferred substituents include one or more of the following: halogen, methyl, ethyl, nitro, —NH 2 , methoxy, ethoxy and —CN.
- Group III includes phenylalanine, tyrosine and tryptophan.
- Group IV includes glutamic acid, aspartic acid, a substituted or unsubstituted aliphatic, aromatic or benzylic ester of glutamic or aspartic acid (e.g., methyl, ethyl, n-propyl iso-propyl, cyclohexyl, benzyl or substituted benzyl), glutamine, asparagine, —CO—NH— alkylated glutamine or asparagines (e.g., methyl, ethyl, n-propyl and iso-propyl) and modified amino acids having the side chain —CH 2 ) 3 —COOH, an ester thereof (substituted or unsubstituted aliphatic, aromatic or benzylic ester), an amide thereof and a substituted or unsubstituted N-alkylated amide thereof.
- glutamic acid e.g., methyl, ethyl, n-propyl iso
- Group IV includes glutamic acid, aspartic acid, methyl aspartate, ethyl aspartate, benzyl aspartate and methyl glutamate, ethyl glutamate and benzyl glutamate, glutamine and asparagine.
- Group V includes histidine, lysine, ornithine, arginine, N-nitroarginine, ⁇ -cycloarginine, ⁇ -hydroxyarginine, N-amidinocitruline and 2-amino-4-guanidinobutanoic acid, homologs of lysine, homologs of arginine and homologs of ornithine.
- Group V includes histidine, lysine, arginine and ornithine.
- a homolog of an amino acid includes from 1 to about 3 additional or subtracted methylene units in the side chain.
- Group VI includes serine, threonine, cysteine and modified amino acids having C1-C5 straight or branched alkyl side chains substituted with —OH or —SH, for example, —CH 2 CH 2 OH, —CH 2 CH 2 CH 2 OH or —CH 2 CH 2 OHCH 3 .
- Group VI includes serine, cysteine or threonine.
- suitable substitutions for amino acid residues include “severe” substitutions.
- a “severe substitution” is a substitution in which the substituting amino acid (naturally occurring or modified) has significantly different size and/or electronic properties compared with the amino acid being substituted.
- the side chain of the substituting amino acid can be significantly larger (or smaller) than the side chain of the amino acid being substituted and/or can have functional groups with significantly different electronic properties than the amino acid being substituted.
- severe substitutions of this type include the substitution of phenylalanine or cyclohexylmethyl glycine for alanine, isoleucine for glycine, a D amino acid for the corresponding L amino acid, or —NH—CH[(—CH 2 ) 5 —COOH]—CO— for aspartic acid.
- a functional group may be added to the side chain, deleted from the side chain or exchanged with another functional group.
- severe substitutions of this type include adding of valine, leucine or isoleucine, exchanging the carboxylic acid in the side chain of aspartic acid or glutamic acid with an amine, or deleting the amine group in the side chain of lysine or ornithine.
- the side chain of the substituting amino acid can have significantly different steric and electronic properties that the functional group of the amino acid being substituted.
- modifications include tryptophan for glycine, lysine for aspartic acid and —(CH 2 ) 4 COOH for the side chain of serine. These examples are not meant to be limiting.
- the individual amino acids may be substituted according in the following manner:
- AA 1 is serine, glycine, alanine, cysteine or threonine
- AA 2 is alanine, threonine, glycine, cysteine or serine;
- AA 3 is valine, arginine, leucine, isoleucine, methionine, omithine, lysine, N-nitroarginine, ⁇ -cycloarginine, ⁇ -hydroxyarginine, N-amidinocitruline or 2-amino-4-guanidinobutanoic acid;
- AA 4 is proline, leucine, valine, isoleucine or methionine;
- AA 5 is tryptophan, alanine, phenylalanine, tyrosine or glycine;
- AA 6 is serine, glycine, alanine, cysteine or threonine
- AA 7 is proline, leucine, valine, isoleucine or methionine;
- AA 8 is alanine, threonine, glycine, cysteine or serine;
- AA 9 is alanine, threonine, glycine, cysteine or serine;
- AA 10 is leucine, isoleucine, methionine or valine;
- AA 11 is serine, glycine, alanine, cysteine or threonine
- AA 12 is leucine, isoleucine, methionine or valine;
- AA 13 is leucine, isoleucine, methionine or valine;
- AA 14 is glutamine, glutamic acid, aspartic acid, asparagine, or a substituted or unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid;
- AA 15 is arginine, N-nitroarginine, ⁇ -cycloarginine, ⁇ -hydroxy-arginine, N-amidinocitruline or 2-amino4-guanidino-butanoic acid
- AA 16 is proline, leucine, valine, isoleucine or methionine;
- AA 17 is serine, glycine, alanine, cysteine or threonine
- AA 18 is glutamic acid, aspartic acid, asparagine, glutamine or a substituted or unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid;
- AA 19 is aspartic acid, asparagine, glutamic acid, glutamine, leucine, valine, isoleucine, methionine or a substituted or unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid;
- AA 20 is valine, arginine, leucine, isoleucine, methionine, ornithine, lysine, N-nitroarginine, ⁇ -cycloarginine, ⁇ -hydroxyarginine, N-amidinocitruline or 2-amino-4-guanidinobutanoic acid;
- AA 21 is alanine, threonine, glycine, cysteine or serine;
- AA 22 is alanine, threonine, glycine, cysteine or serine;
- AA 23 is histidine, serine, threonine, cysteine, lysine or ornithine;
- AA 24 is threonine, aspartic acid, serine, glutamic acid or a substituted or unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid;
- AA 25 is asparagine, aspartic acid, glutamic acid, glutamine, leucine, valine, isoleucine, methionine or a substituted or unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid;
- AA 26 is cysteine, histidine, serine, threonine, lysine or ornithine.
- codons for the first several N-terminal amino acids of the transposase are modified such that the third base of each codon is changed to an A or a T without changing the corresponding amino acid. It is preferable that between approximately 1 and 20, more preferably 3 and 15, and most preferably between 4 and 12 of the first N-terminal codons of the gene of interest are modified such that the third base of each codon is changed to an A or a T without changing the corresponding amino acid. In one embodiment, the first ten N-terminal codons of the gene of interest are modified in this manner.
- proteins, protein fragments or peptides may be separated by a spacer molecule such as, for example, a peptide, consisting of one or more amino acids.
- a spacer molecule such as, for example, a peptide, consisting of one or more amino acids.
- the spacer will have no specific biological activity other than to join the desired proteins, protein fragments or peptides together, or to preserve some minimum distance or other spatial relationship between them.
- the constituent amino acids of the spacer may be selected to influence some property of the molecule such as the folding, net charge, or hydrophobicity.
- the spacer may also be contained within a nucleotide sequence with a purification handle or be flanked by cleavage sites, such as proteolytic cleavage sites.
- Such polypeptide spacers may have from about 5 to about 40 amino acid residues.
- the spacers in a polypeptide are independently chosen, but are preferably all the same.
- the spacers should allow for flexibility of movement in space and are therefore typically rich in small amino acids, for example, glycine, serine, proline or alanine.
- peptide spacers contain at least 60%, more preferably at least 80% glycine or alanine.
- peptide spacers generally have little or no biological and antigenic activity.
- Preferred spacers are (Gly-Pro-Gly-Gly) x (SEQ ID NO:26) and (Gly 4 -Ser) y , wherein x is an integer from about 3 to about 9 and y is an integer from about 1 to about 8.
- suitable spacers include (Gly-Pro-Gly-Gly) 3 SEQ ID NO:27 Gly Pro Gly Gly Gly Pro Gly Gly Gly Pro Gly Gly Gly Pro Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly (Gly 4 -Ser) 3 SEQ ID NO:28 Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser or (Gly 4 -Ser) 4 SEQ ID NO:29 Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser.
- Nucleotide sequences encoding for the production of residues which may be useful in purification of the expressed recombinant protein may also be built into the vector. Such sequences are known in the art and include the glutathione binding domain from glutathione S-transferase, polylysine, hexa-histidine or other cationic amino acids, thioredoxin, hemagglutinin antigen and maltose binding protein.
- nucleotide sequences may be inserted into the gene of interest to be incorporated so that the protein or peptide can also include from one to about six amino acids that create signals for proteolytic cleavage.
- specific nucleotide sequences encoding for amino acids recognized by enzymes may be incorporated into the gene to facilitate cleavage of the large protein or peptide sequence into desired peptides or proteins or both.
- nucleotides encoding a proteolytic cleavage site can be introduced into the gene of interest so that a signal sequence can be cleaved from a protein or peptide encoded by the gene of interest.
- Nucleotide sequences encoding other amino acid sequences which display pH sensitivity or chemical sensitivity may also be added to the vector to facilitate separation of the signal sequence from the peptide or protein of interest.
- Proteolytic cleavage sites include cleavage sites recognized by exopeptidases such as carboxypeptidase A, carboxypeptidase B, aminopeptidase I, and dipeptidylaminopeptidase; endopeptidases such as trypsin, V8-protease, enterokinase, factor Xa, collagenase, endoproteinase, subtilisin, and thombin; and proteases such as Protease 3C IgA protease (Igase) Rhinovirus 3C(preScission)protease. Chemical cleavage sites are also included in the defintion of cleavage site as used herein. Chemical cleavage sites include, but are not limited to, site cleaved by cyanogen bromide, hydroxylamine, formic acid, and acetic acid.
- a TAG sequence is linked to the gene of interest.
- the TAG sequence serves three purposes: 1) it allows free rotation of the peptide or protein to be isolated so there is no interference from the native protein or signal sequence, i.e. vitellogenin, 2) it provides a “purification handle” to isolate the protein using column purification, and 3) it includes a cleavage site to remove the desired protein from the signal and purification sequences.
- a TAG sequence includes a spacer sequence, a purification handle and a cleavage site.
- the spacer sequences in the TAG proteins contain one or more repeats shown in SEQ ID NO:30.
- a preferred spacer sequence comprises the sequence provided in SEQ ID NO:31.
- a purification handle is the gp41 hairpin loop from HIV I.
- Exemplary gp41 polynucleotide and polypeptide sequences are provided in SEQ ID NO:32 and SEQ ID NO:33, respectively.
- any antigenic region may be used as a purification handle, including any antigenic region of gp41.
- Preferred purification handles are those that elicit highly specific antibodies.
- the cleavage site can be any protein cleavage site known to one of ordinary skill in the art and includes an enterokinase cleavage site comprising the Asp Asp Asp Asp Lys sequence (SEQ ID NO:34) and a furin cleavage site. Constructs containing a TAG sequence are shown in FIGS. 2 and 3.
- the TAG sequence comprises a polynucleotide sequence of SEQ ID NO:35.
- the present invention also includes methods of administering the transposon-based vectors to an animal, methods of producing a transgenic animal wherein a gene of interest is incorporated into the germline of the animal and methods of producing a transgenic animal wherein a gene of interest is incorporated into cells other than the germline cells (somatic cells) of the animal.
- the transposon-based vectors of the present invention are administered to a reproductive organ of an animal via any method known to those of skill in the art.
- Preferred reproductive organs include an ovary, an oviduct, a mammary gland, and a fallopian tube.
- a transposon-based vector is directly administered to the reproductive organ.
- Direct administration encompasses injection into the organ, and in a preferred embodiment, a transposon-based vector is injected into the lumen of the oviduct, and more preferably, the lumen of the magnum or the infindibulum of the oviduct.
- the transposon-based vectors may additionally or alternatively be placed in an artery supplying the reproductive organ. Administering the vectors to the artery supplying the ovary results in transfection of follicles and oocytes in the ovary to create a germline transgenic animal.
- supplying the vectors through an artery leading to the oviduct would preferably transfect the tubular gland and epithelial cells. Such transfected cells could manufacture a desired protein or peptide for deposition in the egg white.
- a transposon-based vector is administered into the lumen of the magnum or the infundibulum of the oviduct and to an artery supplying the oviduct. Indirect administration to the oviduct epithelium may occur through the cloaca. Direct administration into the mammary gland comprises introduction into the duct system of the mammary gland.
- transposon-based vectors may occur in arteries supplying the ovary and or through direct intrathecal administration into the ovary through injection.
- the transposon-based vectors may be administered in a single administration, multiple administrations, continuously, or intermittently.
- the transposon-based vectors may be administered by injection, via a catheter, an osmotic mini-pump or any other method.
- the transposon-based vector is administered to an animal in multiple administrations, each administration containing the vector and a different transfecting reagent.
- the transposon-based vectors may be administered to the animal at any point during the lifetime of the animal, however, it is preferable that the vectors are administered prior to the animal reaching sexual maturity.
- the transposon-based vectors are preferably administered to a chicken between approximately 14 and 16 weeks of age and to a quail between approximately 5 and 10 weeks of age, more preferably 5 and 8 weeks of age, and most preferably between 5 and 6 weeks of age, when standard poultry rearing practices are used.
- the vectors may be administered at earlier ages when exogenous hormones are used to induce early sexual maturation in the bird.
- the transposon-based vector is administered to an animal following an increase in proliferation of the oviduct epithelial cells and/or the tubular gland cells.
- the transposon-based vector is administered following an increase in proliferation of the oviduct epithelial cells and before the avian begins to produce egg white constituents.
- the animal is an egg-laying animal, and more preferably, an avian.
- between approximately 1 and 150 ⁇ g, 1 and 100 ⁇ g, 1 and 50 ⁇ g, preferably between 1 and 20 ⁇ g, and more preferably between 5 and 10 ⁇ g of transposon-based vector DNA is administered to the oviduct of a bird.
- Optimal ranges depend upon the type of bird and the bird's stage of sexual maturity.
- In a chicken it is preferred that between approximately 1 and 100 ⁇ g, or 5 and 50 ⁇ g are administered.
- In a quail it is preferred that between approximately 5 and 10 ⁇ g are administered.
- Intraoviduct administration of the transposon-based vectors of the present invention result in incorporation of the gene of interest into the cells of the oviduct as evidenced by a PCR positive signal in the oviduct tissue.
- the transposon-based vector is administered to an artery that supplies the oviduct.
- the transposon-based vector is administered in conjunction with an acceptable carrier and/or transfection reagent.
- Acceptable carriers include, but are not limited to, water, saline, Hanks Balanced Salt Solution (HBSS), Tris-EDTA (TE) and lyotropic liquid crystals.
- Transfection reagents commonly known to one of ordinary skill in the art that may be employed include, but are not limited to, the following: cationic lipid transfection reagents, cationic lipid mixtures, polyamine reagents, liposomes and combinations thereof; SUPERFECT®, Cytofectene, BioPORTER®, GenePORTER®, NeuroPORTER®, and perfectin from Gene Therapy Systems; lipofectamine, cellfectin, DMRIE-C oligofectamine, TROJENE® and PLUS reagent from InVitrogen; Xtreme gene, fugene, DOSPER and DOTAP from Roche; Lipotaxi and Genejammer from Strategene; and Escort from SIGMA.
- the transfection reagent is SUPERFECT®.
- the ratio of DNA to transfection reagent may vary based upon the method of administration.
- the transposon-based vector is administered to the oviduct and the ratio of DNA to transfection reagent can be from 1:1.5 to 1:15, preferably 1:2 to 1:5, all expressed as wt/vol.
- Transfection may also be accomplished using other means known to one of ordinary skill in the art, including without limitation electroporation, gene guns, injection of naked DNA, and use of dimethyl sulfoxide (DMSO).
- DMSO dimethyl sulfoxide
- transposon-based vector may be important. Plasmids harvested from bacteria are generally closed circular supercoiled molecules, and this is the preferred state of a vector for gene delivery because of the ease of preparation. In some instances, transposase expression and insertion may be more efficient in a relaxed, closed circular configuration or in a linear configuration. In still other instances, a purified transposase protein may be co-injected with a transposon-based vector containing the gene of interest for more immediate insertion. This could be accomplished by using a transfection reagent complexed with both the purified transposase protein and the transposon-based vector.
- transposon-based vector Following administration of a transposon-based vector to an animal, DNA is extracted from the animal to confirm integration of the gene of interest.
- Advantages provided by the present invention include the high rates of integration, or incorporation, and transcription of the gene of interest when administered to a bird via an intraoviduct or intraovarian route (including intraarterial administrations to arteries leading to the oviduct or ovary).
- Example 6 below describes isolation of a proinsulin/ENT TAG protein from a transgenic hen following ammonium sulfate precipitation and ion exchange chromatography.
- FIG. 5 demonstrates successful administration of a transposon-based vector to a hen, successful integration of the gene of interest, successful production of a protein encoded by the gene of interest, and successful deposition of the protein in egg white produced by the transgenic hen.
- PRINS primed in situ hybridization technique
- breeding experiments are also conducted to determine if germline transmission of the transgene has occurred.
- each male bird was exposed to 2-3 different adult female birds for 3-4 days each. This procedure was continued with different females for a total period of 6-12 weeks. Eggs ae collected daily for up to 14 days after the last exposure to the transgenic male, and each egg is incubated in a standard incubator. The resulting embryos are examined for transgene presence at day 3 or 4 using PCR. It is to be understood that the above procedure can be modified to suit animals other than birds and that selective breeding techniques may be performed to amplify gene copy numbers and protein output.
- the transposon-based vectors of the present invention may be administered to a bird for production of desired proteins or peptides in the egg white.
- These transposon-based vectors preferably contain one or more of an ovalbumin promoter, an ovomucoid promoter, an ovalbumin signal sequence and an ovomucoid signal sequence.
- Oviduct-specific ovalbumin promoters are described in B. O'Malley et al., 1987. EMBO J., vol. 6, pp. 2305-12; A. Qiu et al., 1994. Proc. Nat. Acad. Sci. (USA), vol. 91, pp. 4451-4455; D. Monroe et al., 2000.
- transposon-based vectors designed for production of a desired protein in an egg white are shown in FIGS. 2 and 3.
- the present invention is particularly advantageous for production of recombinant peptides and proteins of low solubility in the egg yolk.
- proteins include, but are not limited to, membrane-associated or membrane-bound proteins, lipophilic compounds; attachment factors, receptors, and components of second messenger transduction machinery.
- Low solubility peptides and proteins are particularly challenging to produce using conventional recombinant protein production techniques (cell and tissue cultures) because they aggregate in water-based, hydrophilic environments. Such aggregation necessitates denaturation and re-folding of the recombinantly-produced proteins, which may deleteriously affect their structure and function.
- the present invention provides an advantageous resolution of the problem of protein and peptide solubility during production of large amounts of recombinant proteins.
- deposition of a desired protein into the egg yolk is accomplished in offspring by attaching a sequence encoding a protein capable of binding to the yolk vitellogenin receptor to a gene of interest that encodes a desired protein.
- This transposon-based vector can be used for the receptor-mediated uptake of the desired protein by the oocytes.
- the sequence ensuring the binding to the vitellogenin receptor is a targeting sequence of a vitellogenin protein.
- the invention encompasses various vitellogenin proteins and their targeting sequences.
- a chicken vitellogenin protein targeting sequence is used, however, due to the high degree of conservation among vitellogenin protein sequences and known cross-species reactivity of vitellogenin targeting sequences with their egg-yolk receptors, other vitellogenin targeting sequences can be substituted.
- a construct for use in the transposon-based vectors of the present invention and for deposition of an insulin protein in an egg yolk is a transposon-based vector containing a vitellogenin promoter, a vitellogenin targeting sequence, a TAG sequence, a pro-insulin sequence and a synthetic polyA sequence.
- the present invention includes, but is not limited to, vitellogenin targeting sequences residing in the N-terminal domain of vitellogenin, particularly in lipovitellin I.
- vitellogenin targeting sequence contains the polynucleotide sequence of SEQ ID NO:22.
- the transposon-based vector contains a transposase gene operably-linked to a constitutive promoter and a gene of interest operably-linked to a liver-specific promoter and a vitellogenin targeting sequence.
- an animal breeding stock that is homozygous for the transgene is preferred.
- Such homozygous individuals are obtained and identified through, for example, standard animal breeding procedures or PCR protocols.
- peptides, polypeptides and proteins can be purified according to standard procedures known to one of ordinary skill in the art, including ammonium sulfate precipitation, affinity columns, column chromatography, gel electrophoresis, high performance liquid chromatography, immunoprecipitation and the like. Substantially pure compositions of about 50 to 99% homogeneity are preferred, and 80 to 95% or greater homogeneity are most preferred for use as therapeutic agents.
- the animal in which the desired protein is produced is an egg-laying animal.
- the animal is an avian and a desired peptide, polypeptide or protein is isolated from an egg white.
- Egg white containing the exogenous protein or peptide is separated from the yolk and other egg constituents on an industrial scale by any of a variety of methods known in the egg industry. See, e.g., W. Stadelman et al. (Eds.), Egg Science & Technology, Haworth Press, Binghamton, N.Y. (1995).
- Isolation of the exogenous peptide or protein from the other egg white constituents is accomplished by any of a number of polypeptide isolation and purification methods well known to one of ordinary skill in the art. These techniques include, for example, chromatographic methods such as gel permeation, ion exchange, affinity separation, metal chelation, HPLC, and the like, either alone or in combination. Another means that may be used for isolation or purification, either in lieu of or in addition to chromatographic separation methods, includes electrophoresis. Successful isolation and purification is confirmed by standard analytic techniques, including HPLC, mass spectroscopy, and spectrophotometry. These separation methods are often facilitated if the first step in the separation is the removal of the endogenous ovalbumin fraction of egg white, as doing so will reduce the total protein content to be further purified by about 50%.
- transposon-based vectors may include one or more additional epitopes or domains.
- epitopes or domains include DNA sequences encoding enzymatic or chemical cleavage sites including, but not limited to, an enterokinase cleavage site; the glutathione binding domain from glutathione S-transferase; polylysine; hexa-histidine or other cationic amino acids; thioredoxin; hemagglutinin antigen; maltose binding protein; a fragment of gp41 from HIV; and other purification epitopes or domains commonly known to one of skill in the art.
- purification of desired proteins from egg white utilizes the antigenicity of the ovalbumin carrier protein and particular attributes of a TAG linker sequence that spans ovalbumin and the desired protein.
- the TAG sequence is particularly useful in this process because it contains 1) a highly antigenic epitope, a fragment of gp41 from HIV, allowing for stringent affinity purification, and, 2) a recognition site for the protease enterokinase immediately juxtaposed to the desired protein.
- the TAG sequence comprises approximately 50 amino acids.
- a representative TAG sequence is provided below.
- the underlined sequences were taken from the hairpin loop domain of HIV gp-41 (SEQ ID NO:33). Sequences in italics represent the cleavage site for enterokinase (SEQ ID NO:34).
- the spacer sequence upstream of the loop domain was made from repeats of (Pro Ala Asp Asp Ala) (SEQ ID NO:31) to provide free rotation and promote surface availability of the hairpin loop from the ovalbumin carrier protein.
- transgenic ovalbumin-TAG-desired protein is left attached to the gp41 affinity resin (beads) from step 4 and the protease enterokinase is added. This liberates the transgene target protein from the gp41 affinity resin while the ovalbumin-TAG sequence is retained. Separation by centrifugation (in a batch process) or flow through (in a column purification), leaves the desired protein together with enterokinase in solution. Enterokinase is recovered and reused.
- enterokinase is immobilized on resin (beads) by the addition of poly-lysine moieties to a non-catalytic area of the protease.
- the transgenic ovalbumin-TAG-desired protein eluted from the affinity column of step 4 is then applied to the protease resin.
- Protease action cleaves the ovalbumin-TAG sequence from the desired protein and leaves both entities in solution.
- the immobilized enterokinase resin is recharged and reused.
- a final separation of either of these two (5a or 5b) protein mixtures is made using size exclusion, or enterokinase affinity chromatography. This step allows for desalting, buffer exchange and/or polishing, as needed.
- egg whites containing a protein of interest were pooled and separated, in any order, from the yolks and other egg constituents by methods known to one skilled in the art. A variety of such methods is described in manuals known in the art, such as Egg Science & Technology , W. Stadelman, et al. (Eds.), Haworth Press, Binghamton, N.Y. (1995).
- One non-limiting example of a method for isolating a desired peptide, polypeptide or protein from an egg white is as follows. It is to be understood that this method may be employed to isolate any desired peptide, polypeptide or protein from the eggs of transgenic animals of the present invention. This present example involved transgenes that used a portion of or the entire ovalbumin protein, or specific ovalbumin epitopes, as a carrier, linked to the protein of interest via the specified TAG sequence, or another affinity/cleavage sequence.
- the TAG sequence contains the hairpin loop epitope from HIV I followed by an enterokinase cleavage site.
- the viscosity of the egg white was lowered by subjecting the egg white to low shear forces of 3140 cps (Tung et al., 1969). The resulting pourable solution was then filtered to remove chalazae. An ammonium sulfate precipitation was then used to enrich the fraction of transgenic protein (see, for example, Practical Protein Chemistry A Handbook A. Darbre (Ed.), John Wiley & Sons Ltd., 1986). Other methods of crude fractionation known in the art are also used as needed. The supernatant of this separation was then fractionated using size-exclusion chromatography, further enriching the transgenic fusion protein fraction and eliminating the ammonium sulfate from the material.
- the fusion protein was isolated by anti-ovalbumin affinity chromatography (batch or column) using methods known to one skilled in the art. This step may capture native ovalbumin in addition to an ovalbumin-transgene fusion protein. After elution from the anti-ovalbumin affinity resin, the transgenic protein was specifically isolated using anti-gp41 affinity chromatography (batch or column) using methods known to one skilled in the art.
- transgenic ovalbumin-TAG-transgene target protein was left attached to the gp41 affinity resin and the protease enterokinase was added. Cleavage of the transgene by enterokinase liberated the transgene target protein from the gp41 affinity resin while the ovalbumin-TAG sequence was retained. Separation by centrifugation (in a batch process) or flow through (in a column purification), kept the transgene target protein together with enterokinase in solution. Enterokinase was recovered and reused.
- enterokinase was immobilized on resin (beads) by the addition of poly-lysine moieties to a non-catalytic area of the protease.
- the transgenic ovalbumin-TAG-transgene target protein was eluted from the gp41 affinity chromatography resin and then applied to the protease resin. Protease action cleaved the ovalbumin-TAG sequence from the transgene target protein and left both entities in solution.
- the immobilized enterokinase resin was recharged and reused. The choice between these alternatives is made on a case-by case basis, depending upon the size and chemical composition of the transgene target protein.
- a final separation of either of these two (process 1 or 2) protein mixtures was made using size exclusion chromatography, or enterokinase affinity chromatography. This step also allows for desalting, concentrating, buffer exchange and/or polishing, as needed.
- a typical chicken egg produced by a transgenic animal of the present invention will contain at least 0.001 mg, from about 0.001 to 1.0 mg, or from about 0.001 to 100.0 mg of exogenous protein, peptide or polypeptide, in addition to the normal constituents of egg white (or possibly replacing a small fraction of the latter). In some embodiments, a chicken egg will contain between 50 and 75 mg of exogenous protein.
- the desired proteins, fragments thereof and peptides may possess a conformation substantially different than the native conformations of the proteins, fragments thereof and peptides. In this case, it is often necessary to denature and reduce protein and then to cause the protein to re-fold into the preferred conformation. Methods of reducing and denaturing proteins and inducing re-folding are well known to those of skill in the art.
- the present invention encompasses methods for the production of milk containing transgenic proteins or peptides. These methods include the administration of a transposon-based vector described above to a mammal through the duct system.
- the transposon-based vector contains a transposase operably-linked to a constitutive promoter and a gene of interest operably-linked to mammary specific promoter. Genes of interest can include, but are not limited to antiviral and antibacterial proteins and immunoglobulins.
- a transposon-based vector is administered to the ovary of an animal and gerrnline transformation is obtained. In these embodiments, offspring of the transfected animal express a gene of interest in the mammary gland under the control of a mammary gland-specific promoter.
- Quail or chicken were selected for administration of the transposon-based vectors of the present invention. Feathers were removed from the area where surgery was performed and the area was cleansed and sterilized by rinsing it with ethanol (alcohol) and 0.5% chlorhexidine. Using the scalpel, a dorsolateral incision was made through the skin over the ovary approximately 2 cm in length. Using blunt scissors, a second incision was made through the muscle between the last two ribs to expose the oviduct beneath. A small animal retractor was used to spread the last two ribs, exposing the oviduct beneath. The oviduct was further exposed using retractors to pull the intestines to one side.
- a delivery solution containing a transposon-based vector and SUPERFECT® was prepared fresh immediately before surgery. Specific ratios of vector and SUPERFECT® that were used in each experiment are provided in the Examples below.
- the delivery solution was warmed to room temperature prior to injection into the bird. Approximately 250-500 ⁇ 1 of the delivery solution was injected into the lumen of the magnum of the oviduct using a 1 cc syringe with a 27 gauge needle attached. The wound was closed and antibiotic cream liberally applied to the area surrounding the wound.
- a vector was designed for inserting a desired coding sequence into the genome of eukaryotic cells, given below as SEQ ID NO:3.
- This vector employed a cytomegalovirus (CMV) promoter.
- CMV cytomegalovirus
- a modified Kozak sequence (ACCATG) (SEQ ID NO:1) was added to the promoter.
- the nucleotide in the wobble position in nucleotide triplet codons encoding the first 10 amino acids of transposase was changed to an adenine (A) or thymine (T), which did not alter the amino acid encoded by this codon.
- Two stop codons were added and a synthetic polyA was used to provide a strong termination sequence.
- This vector uses a promoter designed to be active soon after entering the cell (without any induction) to increase the likelihood of stable integration. The additional stop codons and synthetic polyA insures proper termination without read through to potential genes downstream.
- the first step in constructing this vector was to modify the transposase to have the desired changes. Modifications to the transposase were accomplished with the primers High Efficiency forward primer (Hef) Altered transposase (ATS)-Hef 5′ ATCTCGAGACCATGTG T GAACT T GATATTTTACATGA T TCTCTTTACC 3′ (SEQ ID NO:36) and Altered transposase-High efficiency reverse primer (Her) 5′ GATTGATCATTATCATAATTTCCCCAAAGCGTAACC 3′ (SEQ ID NO:37, a reverse complement primer).
- the sequence CTCGAG (SEQ ID NO:38) is the recognition site for the restriction enzyme Xho I, which permits directional cloning of the amplified gene.
- the sequence ACCATG (SEQ ID NO:1) contains the Kozak sequence and start codon for the transposase and the underlined bases represent changes in the wobble position to an A or T of codons for the first 10 amino acids (without changing the amino acid coded by the codon).
- Primer ATS-Her (SEQ ID NO:37) contains an additional stop codon TAA in addition to native stop codon TGA and adds a Bcl I restriction site, TGATCA (SEQ ID NO:39), to allow directional cloning.
- pTnLac plasmid
- tn defines transposon
- lac defines the beta fragment of the lactose gene, which contains a multiple cloning site
- Amplified PCR product was electrophoresed on a 1% agarose gel, stained with ethidium bromide, and visualized on an ultraviolet transilluminator.
- a band corresponding to the expected size was excised from the gel and purified from the agarose using a Zymo Clean Gel Recovery Kit (Zymo Research, Orange, Calif.). Purified DNA was digested with restriction enzymes Xho 1 (5′) and Bcl 1 (3′) (New England Biolabs, Beverly, Mass.) according to the manufacturer's protocol. Digested DNA was purified from restriction enzymes using a Zymo DNA Clean and Concentrator kit (Zymo Research).
- Plasmid gWhiz (Gene Therapy Systems, San Diego, Calif.) was digested with restriction enzymes Sal I and BamH I (New England Biolabs), which are compatible with Xho I and Bcl I, but destroy the restriction sites. Digested gwhiz was separated on an agarose gel, the desired band excised and purified as described above. Cutting the vector in this manner facilitated directional cloning of the modified transposase (mATS) between the CMV promoter and synthetic polyA.
- mATS modified transposase
- Colonies producing a plasmid of the expected size were cultured in at least 250 ml of LB/amp broth and plasmid DNA harvested using a Qiagen Maxi-Prep Kit (column purification) according to the manufacturer's protocol (Qiagen, Inc., Chatsworth, Calif.). Column purified DNA was used as template for sequencing to verify the changes made in the transposase were the desired changes and no further changes or mutations occurred due to PCR amplification. For sequencing, Perkin-Elmer's Big Dye Sequencing Kit was used. All samples were sent to the Gene Probes and Expression Laboratory (LSU School of Veterinary Medicine) for sequencing on a Perkin-Elmer Model 377 Automated Sequencer.
- primers CMVf-NgoM IV (5′ TT GCCGGC ATCAGATTGGCTAT (SEQ ID NO:40); underlined bases denote a NgoM IV recognition site) and Syn-polyA-BstE II (5′ AGA GGTCACC GGGTCAATTCTTCAGCACCTGGTA (SEQ ID NO:41); underlined bases denote a BstE II recognition site) were used to PCR amplify the entire CMV promoter, mATS, and synthetic polyA for cloning upstream of the transposon in pTnLac.
- PCR was conducted with FailSafeTM as described above, purified using the Zymo Clean and Concentrator kit, the ends digested with NgoM IV and BstE II (New England Biolabs), purified with the Zymo kit again and cloned upstream of the transposon in pTnLac as described below.
- Plasmid pTnLac was digested with NgoM IV and BstE II to remove the ptac promoter and transposase and the fragments separated on an agarose gel.
- the band corresponding to the vector and transposon was excised, purified from the agarose, and dephosphorylated with calf intestinal alkaline phosphatase (New England Biolabs) to prevent self-annealing.
- the enzyme was removed from the vector using a Zymo DNA Clean and Concentrator-5.
- the purified vector and CMVp/mATS/polyA were ligated together using a Stratagene T4 Ligase Kit and transformed into E. coli as described above.
- Base pairs 1-130 are a remainder of F1( ⁇ ) on from pBluescriptII sk( ⁇ ) (Stratagene), corresponding to base pairs 1-130 of pBluescriptII sk( ⁇ ).
- Base pairs 131-132 are a residue from ligation of restriction enzyme sites used in constructing the vector.
- Base pairs 133-1777 are the CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems), corresponding to bp 229-1873 of pGWiz.
- the CMV promoter was modified by the addition of an ACC sequence upstream of ATG.
- Base pairs 1778-1779 are a residue from ligation of restriction enzyme sites used in constructing the vector.
- Base pairs 1780-2987 are the coding sequence for the transposase, modified from Tn10 (GenBank accession J01829) by optimizing codons for stability of the transposase mRNA and for the expression of protein. More specifically, in each of the codons for the first ten amino acids of the transposase, G or C was changed to A or T when such a substitution would not alter the amino acid that was encoded.
- Base pairs 2988-2993 are two engineered stop codons.
- Base pair 2994 is a residue from ligation of restriction enzyme sites used in constructing the vector.
- Base pairs 2995-3410 are a synthetic polyA sequence taken from the pGWiz vector (Gene Therapy Systems), corresponding to bp 1922-2337 of 10 pGWiz.
- Base pairs 3415-3718 are non-coding DNA that is residual from vector pNK2859.
- Base pairs 3719-3761 are non-coding ⁇ DNA that is residual from pNK2859.
- Base pairs 3762-3831 are the 70 bp of the left insertion sequence recognized by the transposon Tn10.
- Base pairs 3832-3837 are a residue from ligation of restriction enzyme sites used in constructing the vector.
- Base pairs 3838-4527 are the multiple cloning site from pBluescriptII sk(20), corresponding to bp 924-235 of pBluescriptll sk( ⁇ ). This multiple cloning site may be used to insert any coding sequence of interest into the vector.
- Base pairs 4528-4532 are a residue from ligation of restriction enzyme sites used in constructing the vector.
- Base pairs 4533-4602 are the 70 bp of the right insertion sequence recognized by the transposon Tn10.
- Base pairs 4603-4644 are non-coding ⁇ DNA that is residual from pNK2859.
- Base pairs 4645-5488 are non-coding DNA that is residual from pNK2859.
- Base pairs 5489-7689 are from the pBluescriptII sk( ⁇ ) base vector—(Stratagene, Inc.), corresponding to bp 761-2961 of pBluescriptII sk( ⁇ ).
- Completing pTnMod is a pBlueScript backbone that contains a colE I origin of replication and an antibiotic resistance marker (ampicillin).
- Plasmid DNA was isolated by standard procedures. Briefly, Escherichia coli containing the plasmid was grown in 500 mL aliquots of LB broth (supplemented with an appropriate antibiotic) at 37° C. overnight with shaking. Plasmid DNA was recovered from the bacteria using a Qiagen Maxi-Prep kit (Qiagen, Inc., Chatsworth, Calif.) according to the manufacturer's protocol. Plasmid DNA was resuspended in 500 ⁇ L of PCR-grade water and stored at ⁇ 20° C. until used.
- transposon-based vector was designed for inserting a desired coding sequence into the genome of eukaryotic cells.
- This vector was termed pTnMCS and its constituents are provided below.
- the sequence of the pTnMCS vector is provided in SEQ ID NO:2.
- the pTnMCS vector contains an avian optimized polyA sequence operably-linked to the transposase gene.
- the avian optimized polyA sequence contains approximately 40 nucleotides that precede the A nucleotide string.
- Bp 133-1777 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems) bp 229-1873
- a vector was designed to insert a humsan proinsulin coding sequence under the control of a chicken ovalbumin promoter, and a ovalbumin gene including an ovalbumin signal sequence, into the genome of a bird given below as SEQ ID NO:42.
- Base pairs 1-130 are a remainder of F1( ⁇ ) ori of pBluescriptII sk( ⁇ ) (Stratagene) corresponding to base pairs 1-130 of pBluescriptll sk( ⁇ ).
- Base pairs 133-1777 are a CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems) corresponding to base pairs 229-1873 of pGWiz.
- Base pairs 1780-2987 are a transposase, modified from Tn10 (GenBank accession number J01829).
- Base pairs 2988-2993 are two engineered stop codons.
- Base pairs 2995-3410 are a synthetic polyA from pGWiz (Gene Therapy Systems) corresponding to base pairs 1922-2337 of pGWiz.
- Base pairs 3415-3718 are non coding DNA that is residual from vector pNK2859.
- Base pairs 3719-3761 are ⁇ DNA that is residual from pNK2859.
- Base pairs 3762-3831 are the 70 base pairs of the left insertion sequence (IS10) recognized by the transposon Tn10.
- Base pairs 3838-4044 are a multiple cloning site from pBluescriptII sk( ⁇ ) corresponding to base pairs 924-718 of pBluescriptII sk( ⁇ ).
- Base pairs 4050-4951 are a chicken ovalbumin promoter (including SDRE) that corresponds to base pairs 431-1332 of the chicken ovalbumin promoter in GenBank Accession Number J00895 M24999.
- Base pairs 4958-6115 are a chicken ovalbumin signal sequence and ovalbumin gene that correspond to base pairs 66-1223 of GenBank Accession Number V00383.1. (The STOP codon being omitted).
- Base pairs 6122-6271 are a TAG sequence containing a gp41 hairpin loop from HIV I, an enterokinase cleavage site and a spacer (synthetic).
- Base pairs 6272-6531 are a proinsulin gene.
- Base pairs 6539-6891 are a synthetic polyadenylation sequence from pGWiz (Gene Therapy Systems) corresponding to base pairs 1920-2272 of pGWiz.
- Base pairs 6897-7329 are a multiple cloning site from pBlueScriptII sk( ⁇ ) corresponding to base pairs 667-235 of pBluescriptll sk( ⁇ ).
- Base pairs 7335-7404 are the 70 base pairs of the right insertion sequence (IS10) recognized by the transposon Tn10.
- Base pairs 7405-7446 are ⁇ DNA that is residual from pNK2859.
- Base pairs 7447-8311 are non coding DNA that is residual from pNK2859.
- Base pairs 8312-10512 are pBlueScript sk( ⁇ ) base vector (Stratagene, Inc.) corresponding to base pairs 761-2961 of pBluescriptll sk( ⁇ ).
- a vector was designed to insert a proinsulin coding sequence under the control of a quail ovalbumin promoter, and a ovalbumin gene including an ovalbumin signal sequence, into the genome of a bird given below as SEQ ID NO:43.
- Bp 4051-5695 CMV promoter/enhancer taken from vector pGWIZ (Gene therapy systems), bp 230-1864
- Bp 5702-6855 Chicken ovalbumin gene taken from GenBank accession #V00383, bp 66-1219
- the Oval promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A containing construct was injected into the lumen of the oviduct of sexually mature quail; three hens received 5 ⁇ g at a 1:3 SUPERFECT® ratio and three received 10 ⁇ g at a 1:3 SUPERFECT® ratio.
- at least one bird that received above-mentioned construct was producing human proinsulin in egg white (other birds remain to be tested).
- each quail egg contains approximately 1.4 ⁇ g/ml of the proinsulin protein. It is also estimated that each transgenic chicken egg contains 50-75 mg of protein encoded by the gene of interest.
- the transposon-based vector containing CMV promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A was injected into the lumen of the oviduct of sexually immature Japanese quail. A total of 9 birds were injected. Of the 8 survivors, 3 produced human proinsulin in the white of their eggs for over 6 weeks.
- An ELISA assay described in detail below was developed to detect GP41 in the fusion peptide (Oval gene/GP41 Enterokinase TAG/Proinsulin) since the GP41 peptide sequence is unique and not found as part of normal egg white protein. In all ELISA assays, the same birds produced positive results and all controls worked as expected.
- ELISA Procedure Individual egg white samples were diluted in sodium carbonate buffer, pH 9.6, and added to individual wells of 96 well microtiter ELISA plates at a total volume of 0.1 ml. These plates were then allowed to coat overnight at 4° C. Prior to ELISA development, the plates were allowed warm to room temperature. Upon decanting the coating solutions and blotting away any excess, non-specific binding of antibodies was blocked by adding a solution of phosphate buffered saline (PBS), 1% (w/v) BSA, and 0.05% (v/v) Tween 20 and allowing it to incubate with shaking for a minimum of 45 minutes.
- PBS phosphate buffered saline
- BSA 1%
- v/v 0.05%
- This blocking solution was subsequently decanted and replaced with a solution of the primary antibody (Goat Anti-GP41 TAG) diluted in fresh PBS/BSA/Tween 20. After a two hour period of incubation with the primary antibody, each plate was washed with a solution of PBS and 0.05% Tween 20 in an automated plate washer to remove unbound antibody. Next, the secondary antibody, Rabbit anti-Goat Alkaline Phosphatase-conjugated, was diluted in PBS/BSA/Tween 20 and allowed to incubate 1 hour. The plates were then subjected to a second wash with PBS/Tween 20. Antigen was detected using a solution of p-Nitrophenyl Phosphate in Diethanolamine Substrate Buffer for Alkaline Phosphatase and measuring the absorbance at 30 minutes and 1 hour.
- the primary antibody Goat Anti-GP41 TAG
- a proinsulin fusion protein produced using a construct described above was isolated from egg white using ammonium sulfate precipitation and ion exchange chromotgraphy.
- a pooled fraction of the isolated fusion protein was run on an SDS-PAGE gel shown in FIG. 5, lanes 4 and 6 .
- Lanes 1 and 10 of the gel contain molecular weight standards, lanes 2 and 8 contain non-trangenic chicken egg white, whereas lanes 3 , 5 , 7 and 9 are blank.
- a HiTrap NHS-activated 1 mL column (Amersham) was charged with a 30 amino acid peptide that contained the gp-41 epitope containing gp-41's native disulfide bond that stabilizes the formation of the gp-41 hairpin loop.
- the 30 amino acid gp41 peptide is provided as SEQ ID NO:32.
- Approximately 10 mg of the peptide was dissolved in coupling buffer (0.2 M NaHCO3, 0.5 M NaCl, pH 8.3 and the ligand was circulated on the column for 2 hours at room temperature at 0.5 mL/minute.
- Antibodies to gp-41 were raised in goats by inoculation with the gp-41 peptide described above. More specifically, goats were inoculated, given a booster injection of the gp-41 peptide and blood samples were obtained by veinupuncture. Serum was harvested by centrifugation. Approximately 30 mL of goat serum was filtered to 0.45 uM and passed over a TAG column at a rate of 0.5 mL/min. The column was washed with 75 mM Tris, pH 8.0 until absorbance at 280 nm reached a baseline.
- the pooled fractions from the Anti-TAG affinity column were characterized by SDS-PAGE and western blot analysis.
- SDS-PAGE of the pooled fractions revealed a 60 kDal molecular weight band not present in control egg white fluid, consistent with the predicted molecular weight of the transgenic protein. Although some contaminating bands were observed, the 60 kDal species was greatly enriched compared to the other proteins.
- An aliquot of the pooled fractions was cleaved overnight at room temperature with the protease, enterokinase.
- SDS-PAGE analysis of the cleavage product revealed a band not present in the uncut material that co-migrated with a commercial human proinsulin positive control.
- An ELISA was employed for the initial screening of eggs and, thereby, identification of hens producing positive eggs. With further modifications this procedure was used for the initial quantification of recombinant protein amounts. These procedures were aided by the successful purification of an initial stock of the recombinant proinsulin (RPI). This stock of protein is used in the development of a double antibody assay that increases the sensitivity and reduces the background in the assay. Subsequent identification of hens producing positive eggs obviate the need to screen each egg collected. Only periodic checks are needed to determine if production levels are consistent.
- RPI proinsulin
- the egg white solution was filtered to at least 0.45 um.
- Amersham's hollow-fiber ultrafiltration apparatus was used to produced a column-ready solution filtered down to ⁇ 0.2 um with an undiluted starting solution. This approach minimized the time and the solution dilution needed to prepare the egg white solution for column loading.
- the egg white solution was subjected to protein precipitation using a 40% ammonium sulfate fractionation.
- the precipitated protein was subsequently collected via centrifugation and resuspended in 50 mM Tris-HCl, pH 8.
- the resuspended protein solution was dialyzed to remove residual (NH 4 ) 2 SO 4 or subjected to gel filtration to remove the (NH 4 ) 2 SO 4 and partially isolate the RPI from the remaining egg white protein.
- the RPI was further isolated via anion exchange chromatography using a 0 to 0.5M NaCl gradient in 50 mM Tris-HCl, pH 8. Two possible elution profiles were observed.
- Cleavage of the RPI Enterokinase recognition site was accomplished using purified enterokinase from Sigma. Enterokinase, 0.004 Unit/ ⁇ l per reaction, was applied to the pooled and, if necessary, concentrated protein solution. The digestion reaction was incubated at room temperature (up to 30° C. in a rolling hybridization oven) for a minimum of 16 h and in some cases up to 48 hrs of incubation. The digestion efficiency was followed using 16.5% Tris-Tricine SDS-PAGE peptide gels. All gel staining utilized Simply Blue Coomassie Staining Solutions. Free Proinsulin was observed on gels after digestion.
- a subsequent gel filtration separation was employed to obtain purified Proinsulin, and to remove the remaining Ovalbumin portion of the RPI and residual native EW proteins. Select steps in the purification process were analyzed using the 2-dimensional Beckman Coulter ProteomeLab PF2D Protein Fractionation System.
- the lumens of the oviducts of treated hens are injected with the transposon-based vector. Hens are subjected to additional estrogen stimulation after an optimized time during which the transposon-based vector is taken up into oviduct secretory cells. Re-stimulation by estrogen activates transposon expression, causing the integration of the gene of interest into the host genome. Estrogen stimulation is then withdrawn and hens continue normal sexual development. If a developmentally regulated promoter such as the ovalbumin promoter is used, expression of the transposon-based vector initiates in the oviduct at the time of sexual maturation. Intra-ovarian artery injection during this window allows for high and uniform transfection efficiencies of ovarian follicles to produce germ-line transfections and possibly oviduct expression.
- a developmentally regulated promoter such as the ovalbumin promoter
- a vector is designed for inserting a proinsulin gene under the control of a quail ovalbumin promoter, and a ovalbumin gene including an ovalbumin signal sequence, into the genome of a bird given below as SEQ ID NO:44.
- Base pairs 1-130 are a remainder of F1( ⁇ ) ori of pBluescriptII sk( ⁇ ) (Stratagene) corresponding to base pairs 1-130 of pBluescriptII sk( ⁇ ).
- Base pairs 133-1777 are a CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems) corresponding to base pairs 229-1873 of pGWiz.
- Base pairs 1780-2987 are a transposase, modified from Tn10 (GenBank accession number J01829).
- Base pairs 2988-2993 are an engineered stop codon.
- Base pairs 2995-3410 are a synthetic polyA from pGWiz (Gene Therapy Systems) corresponding to base pairs 1922-2337 of pGWiz.
- Base pairs 3415-3718 are non coding DNA that is residual from vector pNK2859.
- Base pairs 3719-3761 are ⁇ DNA that is residual from pNK2859.
- Base pairs 3762-3831 are the 70 base pairs of the left insertion sequence (IS10) recognized by the transposon Tn10.
- Base pairs 3838-4044 are a multiple cloning site from pBlueScriptII sk( ⁇ ) corresponding to base pairs 924-718 of pBluescriptII sk( ⁇ ).
- Base pairs 4050-4938 are the Japanese quail ovalbumin promoter (including SDRE, steroid-dependent response element).
- the Japanese quail ovalbumin promoter was isolated by its high degree of homology to the chicken ovalbumin promoter (GenBank accession number J00895 M24999, base pairs 431-1332). Some deletions were noted in the quail sequence, as compared to the chicken sequence.
- Base pairs 4945-6092 are a quail ovalbumin signal sequence and ovalbumin gene that corresponds to base pairs 54-1201 of GenBank accession number X53964.1. (The STOP codon being omitted).
- Base pairs 6093-6246 are a TAG sequence containing a gp41 hairpin loop from HIV I an enterokinase cleavage site and a spacer (synthetic).
- Base pairs 6247-6507 are a proinsulin gene.
- Base pairs 6514-6866 are a synthetic polyadenylation sequence from pGWiz (Gene Therapy Systems) corresponding to base pairs 1920-2272 of pGWiz.
- Base pairs 6867-7303 are a multiple cloning site from pBlueScriptll sk( ⁇ ) corresponding to base pairs 667-235 of pBluescriptII sk( ⁇ ).
- Base pairs 7304-7379 are the 70 base pairs of the right insertion sequence (IS10) recognized by the transposon Tn10.
- Base pairs 7380-7421 are ⁇ DNA that is residual from pNK2859.
- Base pairs 7422-8286 are non coding DNA that is residual from pNK2859.
- Base pairs 8287-10487 are pBlueScript sk( ⁇ ) base vector (Stratagene, Inc.) corresponding to base pairs 761-2961 of pBluescriptII sk( ⁇ ).
- a vector was designed for inserting a p146 gene under the control of a chicken ovalbumin promoter, and a ovalbumin gene including an ovalbumin signal sequence, into the genome of a bird.
- the vector sequence is provided below as SEQ ID NO:45.
- Base pairs 1-130 are a remainder of F1( ⁇ ) ori of pBluescriptlI sk( ⁇ ) (Stratagene) corresponding to base pairs 1-130 of pBluescriptll sk( ⁇ ).
- Base pairs 133-1777 are a CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems) corresponding to base pairs 229-1873 of pGWiz.
- Base pairs 1780-2987 are a transposase, modified from Tn10 (GenBank accession number J01829).
- Base pairs 2988-2993 are an engineered stop codon.
- Base pairs 2995-3410 are a synthetic polyA from pGWiz (Gene Therapy Systems) corresponding to base pairs 1922-2337 of pGWiz.
- Base pairs 3415-3718 are non coding DNA that is residual from vector pNK2859.
- Base pairs 3719-3761 are ⁇ DNA that is residual from pNK2859.
- Base pairs 3762-3831 are the 70 base pairs of the left insertion sequence (IS10) recognized by the transposon Tn10.
- Base pairs 3838-4044 are a multiple cloning site from pBlueScriptII sk( ⁇ ) corresponding to base pairs 924-718 of pBluescriptll sk( ⁇ ).
- Base pairs 4050-4951 are a chicken ovalbumin promoter (including SDRE, steroid-dependent response element) that corresponds to base pairs 431-1332 of the chicken ovalbumin promoter in GenBank Accession Number J00895 M24999.
- Base pairs 4958-6115 are a chicken ovalbumin signal sequence and Ovalbumin gene that correspond to base pairs 66-1223 of GenBank Accession Number V00383.1 (The STOP codon being omitted).
- Base pairs 6122-6271 are a TAG sequence containing a gp41 hairpin loop from HIV I, an enterokinase cleavage site and a spacer (synthetic).
- Base pairs 6272-6316 are a p146 sequence (synthetic) with 2 added stop codons.
- Base pairs 6324-6676 are a synthetic polyadenylation sequence from pGWiz (Gene Therapy Systems) corresponding to base pairs 1920-2272 of pGWiz.
- Base pairs 6682-7114 are a multiple cloning site from pBlueScriptII sk( ⁇ ) corresponding to base pairs 667-235 of pBluescriptll sk( ⁇ ).
- Base pairs 7120-7189 are the 70 base pairs of the right insertion sequence (IS10) recognized by the transposon Tn10.
- Base pairs 7190-7231 are ⁇ DNA that is residual from pNK2859.
- Base pairs 7232-8096 are non coding DNA that is residual from pNK2859.
- Base pairs 8097-10297 are pBlueScript sk( ⁇ ) base vector (Stratagene, Inc.) corresponding to base pairs 761-2961 of pBluescriptll sk( ⁇ ).
- a vector was designed for inserting a p146 gene under the control of a quail ovalbumin promoter, and a ovalbumin gene including an ovalbumin signal sequence, into the genome of a bird.
- the vector sequence is given below as SEQ ID NO:46.
- Base pairs 1-130 are a remainder of F1( ⁇ ) ori of pBluescriptII sk( ⁇ ) (Stratagene) corresponding to base pairs 1-130 of pBluescriptll sk( ⁇ ).
- Base pairs 133-1777 are a CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems) corresponding to base pairs 229-1873 of pGWiz.
- Base pairs 1780-2987 are a transposase, modified from Tn10 (GenBank accession number J01829).
- Base pairs 2988-2993 are an engineered stop codon.
- Base pairs 2995-3410 are a synthetic polyA from pGWiz (Gene Therapy Systems) corresponding to base pairs 1922-2337 of pGWiz.
- Base pairs 3415-3718 are non coding DNA that is residual from vector pNK2859.
- Base pairs 3719-3761 are ⁇ DNA that is residual from pNK2859.
- Base pairs 3762-3831 are the 70 base pairs of the left insertion sequence (IS10) recognized by the transposon Tn10.
- Base pairs 3838-4044 are a multiple cloning site from pBlueScriptII sk( ⁇ ) corresponding to base pairs 924-718 of pBluescriptll sk( ⁇ ).
- Base pairs 4050-4938 are the Japanese quail ovalbumin promoter (including SDRE, steroid-dependent response element).
- the Japanese quail ovalbumin promoter was isolated by its high degree of homology to the chicken ovalbumin promoter (GenBank accession number J00895 M24999, base pairs 431-1332).
- Bp 4945-6092 are a quail ovalbumin signal sequence and ovalbumin gene that corresponds to base pairs 54-1201 of GenBank accession number X53964.1. (The STOP codon being omitted).
- Base pairs 6097-6246 are a TAG sequence containing a gp41 hairpin loop from HIV I, an enterokinase cleavage site and a spacer (synthetic).
- Base pairs 6247-6291 are a p146 sequence (synthetic) with 2 added stop codons.
- Base pairs 6299-6651 are a synthetic polyadenylation sequence from pGWiz (Gene Therapy Systems) corresponding to base pairs 1920-2272of pGWiz.
- Base pairs 6657-7089 are a multiple cloning site from pBlueScriptII sk( ⁇ ) corresponding to base pairs 667-235 of pBluescriptll sk( ⁇ ).
- Base pairs 7095-7164 are the 70 base pairs of the right insertion sequence (IS10) recognized by the transposon Tn10.
- Base pairs 7165-7206 are ⁇ DNA that is residual from pNK2859.
- Base pairs 7207-8071 are non coding DNA that is residual from pNK2859.
- Base pairs 8072-10272 are pBlueScript sk( ⁇ ) base vector (Stratagene, Inc.) corresponding to base pairs 761-2961of pBluescriptll sk( ⁇ ).
- transposon-based vectors of the present invention provides a description of various transposon-based vectors of the present invention and several constructs that have been made for insertion into the transposon-based vectors of the present invention, all for intraoviduct administration. These examples are not meant to be limiting in any way.
- the constructs for insertion into a transposon-based vector are provided in a cloning vector pTnMCS or pTnMod, both described above.
- pTnMCS CMV-CHOVg-ent-Prolnsulin-synPA
- Bp 3676-5320 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems), bp 230-1864
- Bp 6487-6636 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 3676-5320 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems), bp 230-1864
- Bp 5914-5958 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and pGWIZ (Gene Therapy Systems)
- Bp 7335-7379 Spacer DNA derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
- Bp 3676-4333 Quail Ovalbumin enhancer 658 bp sequence, amplified in-house from quail genomic DNA, roughly equivalent to the far-upstream chicken ovalbumin enhancer, GenBank accession #S82527.1, bp 1-675. (There are multiple base pair substitutions and deletions in the quail sequence, relative tochicken, so the number of bases does not correspond exactly.)
- Bp 4340-5705 Quail Ovalbumin promoter 1366 bp sequence, amplified in-house from quail genomic DNA, roughly corresponding to chicken ovalbumin promoter, GenBank accession #J00895-M24999 bp 1-1336. (There are multiple base pair substitutions and deletions between the quail and chicken sequences, so the number of bases does not correspond exactly.)
- Bp 7328-7372 Spacer DNA derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
- Bp 3676-4333 Quail Ovalbumin enhancer 658 bp sequence, amplified from quail genomic DNA, roughly equivalent to the far-upstream chicken ovalbumin enhancer, GenBank accession #S82527.1, bp 1-675. (There are multiple base pair substitutions and deletions in the quail sequence, relative to chicken, so the number of bases does not correspond exactly.)
- Bp 4340-5705 Quail Ovalbumin promoter 1366 bp sequence, amplified from quail genomic DNA, roughly corresponding to chicken ovalbumin promoter, GenBank accession #J00895-M24999 bp 1-1336. (There are multiple base pair substitutions and deletions between the quail and chicken sequences, so the number of bases does not correspond exactly.)
- Bp 4051-5695 CMV promoter/enhancer taken from vector pGWIZ (Gene therapy systems), bp 230-1864
- Bp 7710-7754 Spacer DNA derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
- Bp 6251-6400 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 4051-4708 Quail Ovalbumin enhancer 658 bp sequence, amplified in-house from quail genomic DNA, roughly equivalent to the far-upstream chicken ovalbumin enhancer, GenBank accession #S82527.1, bp 1-675. (There are multiple base pair substitutions and deletions in the quail sequence, relative to chicken, so the number of bases does not correspond exactly.)
- Bp 4715-6080 Quail Ovalbumin promoter 1366 bp sequence, amplified in-house from quail genomic DNA, roughly corresponding to chicken ovalbumin promoter, GenBank accession #J00895-M24999 bp 1-1336. (There are multiple base pair substitutions and deletions between the quail and chicken sequences, so the number of bases does not correspond exactly.)
- Bp 4051-4708 Quail Ovalbumin enhancer 658 bp sequence, amplified in-housefrom quail genomic DNA, roughly equivalent to the far-upstream chicken ovalbumin enhancer, GenBank accession #S82527.1, bp 1-675. (There are multiple base pair substitutions and deletions in the quail sequence, relative to chicken, so the number of bases does not correspond exactly.)
- Bp 4715-6080 Quail Ovalbumin promoter 1366 bp sequence, amplified in-house from quail genomic DNA, roughly corresponding to chicken ovalbumin promoter, GenBank accession #J00895-M24999 bp 1-1336. (There are multiple base pair substitutions and deletions between the quail and chicken sequences, so the number of bases does not correspond exactly.)
- Bp 4051-5694 CMV promoter/enhancer taken from vector pGWIZ (Gene therapy systems), bp 230-1873
- Bp 3676-5319 CMV promoter/enhancer taken from vector pGWIZ (Gene therapy systems), bp 230-1873
- BP 133-1777 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems) bp 229-1873.
- BP 1780-2987 Transposase, modified from Tn10 (GenBank #J01829).
- BP 2994-3343 non coding DNA from vector pNK2859.
- BP 3457-3674 multiple cloning site from pBluescriptII sk( ⁇ ) bp 924-707.
- BP 5698-5865 prepro with Cap site amplified from cecropin of pMON200 GenBank #X07404 (5′BamHI, 3′KpnI)
- BP 5872-7338 Protein A gene from GenBank#J01786, mature peptide bp 292-1755 (5′Kpnl, 3′SacII)
- BP 7753-8195 multiple cloning site from pBluescriptII sk( ⁇ ) bp 677-235.
Abstract
Description
- The present application is a continuation-in-part of U.S. patent application No. 10/609,019 filed on Jun. 26, 2003, and claims the priority benefit of U.S. Provisional Patent Application No. 60/441,392 filed Jan. 21, 2003; U.S. Provisional Patent Application No. 60/441,377 filed Jan. 21, 2003; U.S. Provisional Patent Application No. 60/441,502 filed Jan. 21, 2003; U.S. Provisional Patent Application No. 60/441,405 filed Jan. 21, 2003; U.S. Provisional Patent Application No. 60/441,447 filed Jan. 21, 2003; U.S. Provisional Patent Application No. 60/441,381 filed Jan. 21, 2003; and U.S. Provisional Patent Application No. 60/392,415 filed Jun. 26, 2002.
- [0002] The U.S. Government has certain rights in this invention. The development of this invention was partially funded by the United States Government under a HATCH grant from the United States Department of Agriculture, partially funded by the United States Government with Formula 1433 funds from the United States Department of Agriculture and partially funded by the United States Government under contract DAAD 19-02016 awarded by the Army.
- The present invention relates generally to administration of a transposon-based vector to the reproductive tract in an animal. The reproductive tract includes an ovary, ova within an ovary, and an oviduct. Such administration results in incorporation of a gene of interest contained in the vector in the ovary, the oviduct or an ovum of the animal. In some embodiments, the present invention further includes production of a protein encoded by the gene in an egg produced by the animal.
- Transgenic animals are desirable for a variety of reasons, including their potential as biological factories to produce desired molecules for pharmaceutical, diagnostic and industrial uses. This potential is attractive to the industry due to the inadequate capacity in facilities used for recombinant production of desired molecules and the increasing demand by the pharmaceutical industry for use of these facilities. Numerous attempts to produce transgenic animals have met several problems, including low rates of gene incorporation and unstable gene incorporation. Accordingly, improved gene technologies are needed for the development of transgenic animals for the production of desired molecules.
- Improved gene delivery technologies are also needed for the treatment of disease in animals and humans. Many diseases and conditions can be treated with gene-delivery technologies, which provide a gene of interest to a patient suffering from the disease or the condition. An example of such disease is Type 1 diabetes. Type 1 diabetes is an autoimmune disease that ultimately results in destruction of the insulin producing β-cells in the pancreas. Although patients with Type 1 diabetes may be treated adequately with insulin injections or insulin pumps, these therapies are only partially effective. Insulin replacement, such as via insulin injection or pump administration, cannot fully reverse the defect in the vascular endothelium found in the hyperglycemic state (Pieper et al., 1996. Diabetes Res. Clin. Pract. Suppl. S157-S162). In addition, hyper- and hypoglycemia occurs frequently despite intensive home blood glucose monitoring. Finally, careful dietary constraints are needed to maintain an adequate ratio of calories consumed. This often causes major psychosocial stress for many diabetic patients. Development of gene therapies providing delivery of the insulin gene into the pancreas of diabetic patients could overcome many of these problems and result in improved life expectancy and quality of life.
- Several of the prior art gene delivery technologies employed viruses that are associated with potentially undesirable side effects and safety concerns. The majority of current gene-delivery technologies useful for gene therapy rely on virus-based delivery vectors, such as adeno and adeno-associated viruses, retroviruses, and other viruses, which have been attenuated to no longer replicate. (Kay, M. A., et al. 2001. Nature Medicine 7:33-40).
- There are multiple problems associated with the use of viral vectors. Firstly, they are not tissue-specific. In fact, a gene therapy trial using adenovirus was recently halted because the vector was present in the patient's sperm (Gene trial to proceed despite fears that therapy could change child's genetic makeup. The New York Times, Dec. 23, 2001). Secondly, viral vectors are likely to be transiently incorporated, which necessitates re-treating a patient at specified time intervals. (Kay, M. A., et al. 2001. Nature Medicine 7:33-40). Thirdly, there is a concern that a viral-based vector could revert to its virulent form and cause disease. Fourthly, viral-based vectors require a dividing cell for stable integration. Fifthly, viral-based vectors indiscriminately integrate into various cells, which can result in undesirable germline integration. Sixthly, the required high titers needed to achieve the desired effect have resulted in the death of one patient and they are believed to be responsible for induction of cancer in a separate study. (Science, News of the Week, Oct. 4, 2002).
- Accordingly, what is needed is a new method to produce transgenic animals and humans with stably incorporated genes, in which the vector containing those genes does not cause disease or other unwanted side effects. There is also a need for DNA constructs that would be stably incorporated into the tissues and cells of animals and humans, including cells in the resting state that are not replicating. There is a further recognized need in the art for DNA constructs capable of delivering genes to specific tissues and cells of animals and humans.
- When incorporating a gene of interest into an animal for the production of a desired protein or when incorporating a gene of interest in an animal or human for the treatment of a disease, it is often desirable to selectively activate incorporated genes using inducible promoters. These inducible promoters are regulated by substances either produced or recognized by the transcription control elements within the cell in which the gene is incorporated. In many instances, control of gene expression is desired in transgenic animals or humans so that incorporated genes are selectively activated at desired times and/or under the influence of specific substances. Accordingly, what is needed is a means to selectively activate genes introduced into the genome of cells of a transgenic animal or human. This can be taken a step further to cause incorporation to be tissue-specific, which prevents widespread gene incorporation throughout a patient's body (animal or human). This decreases the amount of DNA needed for a treatment, decreases the chance of incorporation in gametes, and targets gene delivery, incorporation, and expression to the desired tissue where the gene is needed to function. What is also needed is a rapid expression method for rapidly producing a protein or peptide of interest in eggs and milk of transgenic animals.
- The present invention addresses the problems described above by providing new, effective and efficient compositions for producing transgenic animals and for treating disease in animals or humans. Transgenic animals include all egg-laying animals and milk-producing animals. Transgenic animals further include but are not limited to avians, fish, amphibians, reptiles, insects, mammals and humans. In another preferred embodiment, the animal is a milk-producing animal, including but not limited to bovine, porcine, ovine and equine animals. In a preferred embodiment, the animal is an avian animal. In another preferred embodiment, the animal is a mammal. Animals are made transgenic through administration of a composition comprising a transposon-based vector designed for incorporation of a gene of interest for production of a desired protein, together with an acceptable carrier. The compositions of the present invention are introduced into the reproductive system of an animal. The compositions of the present invention are administered to a reproductive organ including, but not limited to, an oviduct, an ovary, or into the duct system of the mammary gland. The compositions of the present invention are may be administered to a reproductive organ of an animal through the cloaca. The compositions of the present invention may be directly administered to a reproductive organ or can be administered to an artery leading to the reproductive organ. In a preferred embodiment, the compositions of the present invention are introduced into the the reproductive system of an avian animal. In another preferred embodiment, the compositions of the present invention are introduced into the the intramammary duct system of a mammal. A transfection reagent is optionally added to the composition before administration.
- The transposon-based vectors of the present invention include a transposase, operably-linked to a first promoter, and a coding sequence for a protein or peptide of interest operably-linked to a second promoter, wherein the coding sequence for the protein or peptide of interest and its operably-linked promoter are flanked by transposase insertion sequences recognized by the transposase. The transposon-based vector also includes the following characteristics: a) one or more modified Kozak sequences at the 3′ end of the first promoter to enhance expression of the transposase; b) modifications of the codons for the first several N-terminal amino acids of the transposase, wherein the nucleotide at the third base position of each codon is changed to an A or a T without changing the corresponding amino acid; c) addition of one or more stop codons to enhance the termination of transposase synthesis; and/or, d) addition of an effective polyA sequence operably-linked to the transposase to further enhance expression of the transposase gene. In some embodiments, the effective polyA sequence is an avian optimized polyA sequence.
- The present invention also provides for tissue-specific incorporation and/or expression of a gene of interest. Tissue-specific incorporation of a gene of interest may be achieved by placing the transposase gene under the control of a tissue-specific promoter, whereas tissue-specific expression of a gene of interest may be achieved by placing the gene of interest under the control of a tissue-specific promoter. In some embodiments, the gene of interest is transcribed under the influence of an ovalbumin, or other oviduct specific, promoter. Linking the gene of interest to an oviduct specific promoter in an egg-laying animal results in synthesis of a desired molecule and deposition of the desired molecule in a developing egg.
- The present invention advantageously produces a high number of transgenic animals having a gene of interest stably incorporated. In some embodiments wherein the transposon-based vector is administered to the ovary, these transgenic animals successfully pass the desired gene to their progeny. Accordingly, the present invention can be used to obtain transgenic animals having the gene of interest incorporated into the germline through transfection of the ovary or the present invention can be used to obtain transgenic animals having the gene of interest incorporated into the oviduct in a tissue-specific manner. Both types of transgenic animals of the present invention produce large amounts of a desired molecule encoded by the transgene. Transgenic egg-laying animals, particularly avians, produce large amounts of a desired protein that is deposited in the egg for rapid harvest and purification.
- Any desired gene may be incorporated into the novel transposon-based vectors of the present invention in order to synthesize a desired molecule in the transgenic animals. Proteins, peptides and nucleic acids are preferred desired molecules to be produced by the transgenic animals of the present invention. Particularly preferred proteins are antibody proteins and other immunopharmecuetical proteins.
- This invention provides a composition useful for the production of transgenic hens capable of producing substantially high amounts of a desired protein or peptide. Entire flocks of transgenic birds may be developed very quickly in order to produce industrial amounts of desired molecules. The present invention solves the problems inherent in the inadequate capacity of fermentation facilities used for bacterial production of molecules and provides a more efficient and economical way to produce desired molecules. Accordingly, the present invention provides a means to produce large amounts of therapeutic, diagnostic and reagent molecules.
- Transgenic chickens are excellent in terms of convenience and efficiency of manufacturing molecules such as proteins and peptides. Starting with a single transgenic rooster, thousands of transgenic offspring can be produced within a year. (In principle, up to forty million offspring could be produced in just three generations). Each transgenic female is expected to lay at least 250 eggs/year, each potentially containing hundreds of milligrams of the selected protein. Flocks of chickens numbering in the hundreds of thousands are readily handled through established commercial systems. The technologies for obtaining eggs and fractionating them are also well known and widely accepted. Thus, for each therapeutic, diagnostic, or other protein of interest, large amounts of a substantially pure material can be produced at relatively low incremental cost.
- A wide range of recombinant peptides and proteins can be produced in transgenic egg-laying animals. Enzymes, hormones, antibodies, growth factors, serum proteins, commodity proteins, biological response modifiers, peptides and designed proteins may all be made through practice of the present invention. For example, rough estimates suggest that it is possible to produce in bulk growth hormone, insulin, or Factor VIII, and deposit them in egg whites, for an incremental cost in the order of one dollar per gram. At such prices it is feasible to consider administering such medical agents by inhalation or even orally, instead of through injection. Even if bioavailability rates through these avenues were low, the cost of a much higher effective-dose would not be prohibitive.
- In one embodiment, the egg-laying transgenic animal is an avian. The method of the present invention may be used in avians including Ratites, Psittaciformes, Falconiformes, Piciformes, Strigiformes, Passeriformes, Coraciformes, Ralliformes, Cuculiformes, Columbiformes, Galliformes, Anseriformes, and Herodiones. Preferably, the egg-laying transgenic animal is a poultry bird. More preferably, the bird is a chicken, turkey, duck, goose or quail. Another preferred bird is a ratite, such as, an emu, an ostrich, a rhea, or a cassowary. Other preferred birds are partridge, pheasant, kiwi, parrot, parakeet, macaw, falcon, eagle, hawk, pigeon, cockatoo, song birds, jay bird, blackbird, finch, warbler, canary, toucan, mynah, or sparrow.
- Accordingly, it is an object of the present invention to provide novel transposon-based vectors.
- It is another object of the present invention to provide novel transposon-based vectors that encode for the production of desired proteins or peptides in cells.
- It is an object of the present invention to produce transgenic animals through intraoviduct or intraovarian administration of a transposon-based vector.
- Another object of the present invention is to produce transgenic animals through intraoviduct or intraovarian administration of a transposon-based vector, wherein the transgenic animals produce desired proteins or peptides.
- It is further an object of the present invention to provide a method to produce transgenic animals through intraovarian administration of a transposon-based vector that are capable of producing transgenic progeny.
- Yet another object of the present invention is to provide a method to produce transgenic animals through intraoviduct or intraovarian administration of a transposon-based vector that are capable of producing a desired molecule, such as a protein, peptide or nucleic acid.
- Another object of the present invention is to provide a method to produce transgenic animals through intraoviduct or intraovarian administration of a transposon-based vector, wherein such administration results in modulation of endogenous gene expression.
- It is yet another object of the present invention to provide a method to produce transgenic avians through intraoviduct or intraovarian administration of a transposon-based vector that are capable of producing proteins, peptides or nucleic acids.
- It is another object of the present invention to produce transgenic animals through intraoviduct or intraovarian administration of a transposon-based vector encoding an antibody or a fragment thereof.
- Still another object of the present invention is to provide a method to produce transgenic avians through intraoviduct or intraovarian administration of a transposon-based vector that are capable of producing proteins or peptides and depositing these proteins or peptides in the egg.
- Another object of the present invention is to provide transgenic avians that contain a stably incorporated transgene.
- Still another object of the present invention is to provide eggs containing desired proteins or peptides encoded by a transgene incorporated into the transgenic avian that produces the egg.
- It is further an object of the present invention to provide a method to produce transgenic milk-producing animals through administration of a transposon-based vector that are capable of producing proteins, peptides or nucleic acids.
- Still another object of the present invention is to provide a method to produce transgenic milk-producing animals through administration of a transposon-based vector that are capable of producing proteins or peptides and depositing these proteins or peptides in their milk.
- Another object of the present invention is to provide transgenic milk-producing animals that contain a stably incorporated transgene.
- Another object of the present invention is to provide transgenic milk-producing animals that are capable of producing proteins or peptides and depositing these proteins or peptides in their milk.
- Yet another object of the present invention is to provide milk containing desired molecules encoded by a transgene incorporated into the transgenic milk-producing animals that produce the milk.
- Still another object of the present invention is to provide milk containing desired proteins or peptides encoded by a transgene incorporated into the transgenic milk-producing animals that produce the milk.
- An advantage of the present invention is that transgenic animals are produced with higher efficiencies than observed in the prior art.
- Another advantage of the present invention is that these transgenic animals possess high copy numbers of the transgene.
- Another advantage of the present invention is that the transgenic animals produce large amounts of desired molecules encoded by the transgene.
- Still another advantage of the present invention is that desired molecules are produced by the transgenic animals much more efficiently and economically than prior art methods, thereby providing a means for large scale production of desired molecules, particularly proteins and peptides.
- Yet another advantage of the present invention is that the desired proteins and peptides are produced rapidly after making animals transgenic through introduction of the vectors of the present invention.
- These and other objects, features and advantages of the present invention will become apparent after a review of the following detailed description of the disclosed embodiments and claims.
- FIG. 1 depicts schematically a transposon-based vector containing a transposase operably linked to a first promoter and a gene of interest operably-linked to a second promoter, wherein the gene of interest and its operably-linked promoter are flanked by insertion sequences (IS) recognized by the transposase. “Pro” designates a promoter. In this and subsequent figures, the size of the actual nucleotide sequence is not necessarily proportionate to the box representing that sequence.
- FIG. 2 depicts schematically a transposon-based vector for targeting deposition of a polypeptide in an egg white wherein Ov pro is the ovalbumin promoter, Ov protein is the ovalbumin protein and PolyA is a polyadenylation sequence. The TAG sequence includes a spacer sequence, the gp41 hairpin loop from HIV I and a protease cleavage site.
- FIG. 3 depicts schematically a transposon-based vector for targeting deposition of a polypeptide in an egg white wherein Ovo pro is the ovomucoid promoter and Ovo SS is the ovomucoid signal sequence. The TAG sequence includes a spacer, the gp41 hairpin loop from HIV I and a protease cleavage site.
- FIG. 4 depicts schematically a transposon based-vector for expression of an RNAi molecule. “Teti pro” indicates a tetracycline inducible promoter whereas “pro” indicates the pro portion of a prepro sequence as described herein. “Ovgen” indicates approximately 60 base pairs of an ovalbumin gene, “Ovotrans” indicates approximately 60 base pairs of an ovotransferrin gene and “Ovomucin” indicates approximately 60 base pairs of an ovomucin gene.
- FIG. 5 is a picture of an SDS-PAGE gel wherein a pooled fraction of an isolated proinsulin fusion protein was run in lanes4 and 6. Lanes 1 and 10 of the gel contain molecular weight standards, lanes 2 and 8 contain non-trangenic chicken egg white, and lanes 3, 5, 7 and 9 are blank.
- The present invention provides a new, effective and efficient method of producing transgenic animals, particularly egg-laying animals and milk-producing animals, through administration of a composition comprising a transposon-based vector designed for incorporation of a gene of interest and production of a desired molecule. The transposon-based vectors are administered to a reproductive organ including, but not limited to, an oviduct, an ovary, or into the duct system of the mammary gland. The vectors may be directly administered to a reproductive organ or can be administered to an artery leading to the reproductive organ or to a lymph system proximate to the cells to be genetically altered. The vectors may be administered to a reproductive organ of an animal through the cloaca. One method of direct administration is by injection, and in one embodiment, the lumen of the magnum of the oviduct is injected with a transposon-based vector. Another method of direct administration is by injection, and in one embodiment, the lumen of the infundibulum of the oviduct is injected with a transposon-based vector. A preferred intrarterial administration is an administration into an artery that supplies the oviduct or the ovary. In some embodiments, administration of the transposon-based vector to an oviduct or an artery that leads to the oviduct results in incorporation of the vector into the epithelial and/or secretory cells of the oviduct. In other embodiments, administration of the transposon-based vector to an ovary or an artery that leads to the ovary or a lymphatic system proximal to the ovary results in incorporation of the vector into an oocyte or a germinal disk inside the ovary.
- It is to be understood that as used in the specification and in the claims, “a” or “an” can mean one or more, depending upon the context in which it is used. Thus, for example, reference to “a cell” can mean that at least one cell can be utilized.
- The term “antibody” is used interchangeably with the term “immunoglobulin” and is defined herein as a protein synthesized by an animal or a cell of the immune system in response to the presence of a foreign substance commonly referred to as an “antigen” or an “immunogen”. The term antibody includes fragments of antibodies. Antibodies are characterized by specific affinity to a site on the antigen, wherein the site is referred to an “antigenic determinant” or an “epitope”. Antigens can be naturally occurring or artificially engineered. Artificially engineered antigens include, but are not limited to, small molecules, such as small peptides, attached to haptens such as macromolecules, for example proteins, nucleic acids, or polysaccharides. Artificially designed or engineered variants of naturally occurring antibodies and artificially designed or engineered antibodies not occurring in nature are all included in the current definition. Such variants include conservatively substituted amino acids and other forms of substitution as described in the section concerning proteins and polypeptides.
- As used herein, the term “egg-laying animal” includes all amniotes such as birds, turtles, lizards and monotremes. Monotremes are egg-laying mammals and include the platypus and echidna. The term “bird” or “fowl,” as used herein, is defined as a member of the Aves class of animals which are characterized as warm-blooded, egg-laying vertebrates primarily adapted for flying. Avians include, without limitation, Ratites, Psittaciformes, Falconiformes, Piciformes, Strigiformes, Passeriformes, Coraciformes, Ralliformes, Cuculiformes, Columbiformes, Galliformes, Anseriformes, and Herodiones. The term “Ratite,” as used herein, is defined as a group of flightless, mostly large, running birds comprising several orders and including the emus, ostriches, kiwis, and cassowaries. The term “Psittaciformes”, as used herein, includes parrots and refers to a monofamilial order of birds that exhibit zygodactylism and have a strong hooked bill. A “parrot” is defined as any member of the avian family Psittacidae (the single family of the Psittaciformes), distinguished by the short, stout, strongly hooked beak. Avians include all poultry birds, especially chickens, geese, turkeys, ducks and quail. The term “chicken” as used herein denotes chickens used for table egg production, such as egg-type chickens, chickens reared for public meat consumption, or broilers, and chickens reared for both egg and meat production (“dual-purpose” chickens). The term “chicken” also denotes chickens produced by primary breeder companies, or chickens that are the parents, grandparents, great-grandparents, etc. of those chickens reared for public table egg, meat, or table egg and meat consumption.
- The term “egg” is defined herein as including a large female sex cell enclosed in a porous, calcarous or leathery shell, produced by birds and reptiles. The term “ovum” is defined as a female gamete, and is also known as an egg. Therefore, egg production in all animals other than birds and reptiles, as used herein, is defined as the production and discharge of an ovum from an ovary, or “ovulation”. Accordingly, it is to be understood that the term “egg” as used herein is defined as a large female sex cell enclosed in a porous, calcarous or leathery shell, when a bird or reptile produces it, or it is an ovum when it is produced by all other animals.
- The term “milk-producing animal” refers herein to mammals including, but not limited to, bovine, ovine, porcine, equine, and primate animals. Milk-producing animals include but are not limited to cows, llamas, camels, goats, reindeer, zebu, water buffalo, yak, horses, pigs, rabbits, non-human primates, and humans.
- The term “gene” is defined herein to include a coding region for a protein, peptide or polypeptide.
- The term “transgenic animal” refers to an animal having at least a portion of the transposon-based vector DNA incorporated into its DNA. While a transgenic animal includes an animal wherein the transposon-based vector DNA is incorporated into the germline DNA, a transgenic animal also includes an animal having DNA in one or more cells that contain a portion of the transposon-based vector DNA for any period of time. In a preferred embodiment, a portion of the transposon-based vector comprises a gene of interest. More preferably, the gene of interest is incorporated into the animal's DNA for a period of at least five days, more preferably the reproductive life of the animal, and most preferably the life of the animal. In a further preferred embodiment, the animal is an avian.
- The term “vector” is used interchangeably with the terms “construct”, “DNA construct” and “genetic construct” to denote synthetic nucleotide sequences used for manipulation of genetic material, including but not limited to cloning, subcloning, sequencing, or introduction of exogenous genetic material into cells, tissues or organisms, such as birds. It is understood by one skilled in the art that vectors may contain synthetic DNA sequences, naturally occurring DNA sequences, or both. The vectors of the present invention are transposon-based vectors as described herein.
- When referring to two nucleotide sequences, one being a regulatory sequence, the term “operably-linked” is defined herein to mean that the two sequences are associated in a manner that allows the regulatory sequence to affect expression of the other nucleotide sequence. It is not required that the operably-linked sequences be directly adjacent to one another with no intervening sequence(s).
- The term “regulatory sequence” is defined herein as including promoters, enhancers and other expression control elements such as polyadenylation sequences, matrix attachment sites, insulator regions for expression of multiple genes on a single construct, ribosome entry/attachment sites, introns that are able to enhance expression, and silencers.
- While not wanting to be bound by the following statement, it is believed that the nature of the DNA construct is an important factor in successfully producing transgenic animals. The “standard” types of plasmid and viral vectors that have previously been almost universally used for transgenic work in all species, especially avians, have low efficiencies and may constitute a major reason for the low rates of transformation previously observed. The DNA (or RNA) constructs previously used often do not integrate into the host DNA, or integrate only at low frequencies. Other factors may have also played a part, such as poor entry of the vector into target cells. The present invention provides transposon-based vectors that can be administered to an animal that overcome the prior art problems relating to low transgene integration frequencies. Two preferred transposon-based vectors of the present invention in which a tranposase, gene of interest and other polynucleotide sequences may be introduced are termed pTnMCS (SEQ ID NO:2) and pTnMod (SEQ ID NO:3).
- The transposon-based vectors of the present invention produce integration frequencies an order of magnitude greater than has been achieved with previous vectors. More specifically, intratesticular injections performed with a prior art transposon-based vector (described in U.S. Pat. No. 5,719,055) resulted in 41% sperm positive roosters whereas intratesticular injections performed with the novel transposon-based vectors of the present invention resulted in 77% sperm positive roosters. Actual frequencies of integration were estimated by either or both comparative strength of the PCR signal from the sperm and histological evaluation of the testes and sperm by quantitative PCR.
- The transposon-based vectors of the present invention include a transposase gene operably-linked to a first promoter, and a coding sequence for a desired protein or peptide operably-linked to a second promoter, wherein the coding sequence for the desired protein or peptide and its operably-linked promoter are flanked by transposase insertion sequences recognized by the transposase. The transposon-based vector also includes one or more of the following characteristics: a) one or more modified Kozak sequences comprising ACCATG (SEQ ID NO:1) at the 3′ end of the first promoter to enhance expression of the transposase; b) modifications of the codons for the first several N-terminal amino acids of the transposase, wherein the third base of each codon was changed to an A or a T without changing the corresponding amino acid; c) addition of one or more stop codons to enhance the termination of transposase synthesis; and/or, d) addition of an effective polyA sequence operably-linked to the transposase to further enhance expression of the transposase gene. The transposon-based vector may additionally or alternatively include one or more of the following Kozak sequences at the 3′ end of any promoter, including the promoter operably-linked to the transposase: ACCATGG (SEQ ID NO:4), AAGATGT (SEQ ID NO:5), ACGATGA (SEQ ID NO:6), AAGATGG (SEQ ID NO:7), GACATGA (SEQ ID NO:8), ACCATGA (SEQ ID NO:9), and ACCATGA (SEQ ID NO:10), ACCATGT (SEQ ID NO:52).
- FIG. 1 shows a schematic representation of several components of the transposon-based vector. The present invention further includes vectors containing more than one gene of interest, wherein a second or subsequent gene of interest is operably-linked to the second promoter or to a different promoter. It is also to be understood that the transposon-based vectors shown in the Figures are representative of the present invention and that the order of the vector elements may be different than that shown in the Figures, that the elements may be present in various orientations, and that the vectors may contain additional elements not shown in the Figures.
- In a further embodiment of the present invention, the transposase found in the transposase-based vector is an altered target site (ATS) transposase and the insertion sequences are those recognized by the ATS transposase. However, the transposase located in the transposase-based vectors is not limited to a modified ATS transposase and can be derived from any transposase. Transposases known in the prior art include those found in AC7, Tn5SEQ1, Tn916, Tn951, Tn1721, Tn 2410, Tn1681, Tn1, Tn2, Tn3, Tn4, Tn5, Tn6, Tn9, Tn10, Tn30, Tn101, Tn903, Tn501, Tn1000 (γδ), Tn1681, Tn2901, AC transposons, Mp transposons, Spm transposons, En transposons, Dotted transposons, Mu transposons, Ds transposons, dSpm transposons and I transposons. According to the present invention, these transposases and their regulatory sequences are modified for improved functioning as follows: a) the addition one or more modified Kozak sequences comprising ACCATG (SEQ ID NO:1) at the 3′ end of the promoter operably-linked to the transposase; b) a change of the codons for the first several amino acids of the transposase, wherein the third base of each codon was changed to an A or a T without changing the corresponding amino acid; c) the addition of one or more stop codons to enhance the termination of transposase synthesis; and/or, d) the addition of an effective polyA sequence operably-linked to the transposase to further enhance expression of the transposase gene.
- Although not wanting to be bound by the following statement, it is believed that the modifications of the first several N-terminal codons of the transposase gene increase transcription of the transposase gene, in part, by increasing strand dissociation. It is preferable that between approximately 1 and 20, more preferably 3 and 15, and most preferably between 4 and 12 of the first N-terminal codons of the transposase are modified such that the third base of each codon is changed to an A or a T without changing the encoded amino acid. In one embodiment, the first ten N-terminal codons of the transposase gene are modified in this manner. It is also preferred that the transposase contain mutations that make it less specific for preferred insertion sites and thus increases the rate of transgene insertion as discussed in U.S. Pat. No. 5,719,055.
- In some embodiments, the transposon-based vectors are optimized for expression in a particular host by changing the methylation patterns of the vector DNA. For example, prokaryotic methylation may be reduced by using a methylation deficient organism for production of the transposon-based vector. The transposon-based vectors may also be methylated to resemble eukaryotic DNA for expression in a eukaryotic host.
- Transposases and insertion sequences from other analogous eukaryotic transposon-based vectors that can also be modified and used are, for example, the Drosophila P element derived vectors disclosed in U.S. Pat. No. 6,291,243; the Drosophila mariner element described in Sherman et al. (1998); or the sleeping beauty transposon. See also Hackett et al. (1999); D. Lampe et al., 1999. Proc. Natl. Acad. Sci. USA, 96:11428-11433; S. Fischer et al., 2001. Proc. Natl. Acad. Sci. USA, 98:6759-6764; L. Zagoraiou et al., 2001. Proc. Natl. Acad. Sci. USA, 98:11474-11478; and D. Berg et al. (Eds.), Mobile DNA, Amer. Soc. Microbiol. (Washington, D.C., 1989). However, it should be noted that bacterial transposon-based elements are preferred, as there is less likelihood that a eukaryotic transposase in the recipient species will recognize prokaryotic insertion sequences bracketing the transgene.
- Many transposases recognize different insertion sequences, and therefore, it is to be understood that a transposase-based vector will contain insertion sequences recognized by the particular transposase also found in the transposase-based vector. In a preferred embodiment of the invention, the insertion sequences have been shortened to about 70 base pairs in length as compared to those found in wild-type transposons that typically contain insertion sequences of well over 100 base pairs.
- While the examples provided below incorporate a “cut and insert” Tn10 based vector that is destroyed following the insertion event, the present invention also encompasses the use of a “rolling replication” type transposon-based vector. Use of a rolling replication type transposon allows multiple copies of the transposon/transgene to be made from a single transgene construct and the copies inserted. This type of transposon-based system thereby provides for insertion of multiple copies of a transgene into a single genome. A rolling replication type transposon-based vector may be preferred when the promoter operably-linked to gene of interest is endogenous to the host cell and present in a high copy number or highly expressed. However, use of a rolling replication system may require tight control to limit the insertion events to non-lethal levels. Tn1, Tn2, Tn3, Tn4, Tn5, Tn9, Tn21, Tn501, Tn551, Tn951, Tn1721, Tn2410 and Tn2603 are examples of a rolling replication type transposon, although Tn5 could be both a rolling replication and a cut and insert type transposon.
- In one embodiment, the transposon-based vector contains two stop codons operably-linked to the transposase and/or to the gene of interest. In an alternate embodiment, one stop codon of UAA or UGA is operably linked to the transposase and/or to the gene of interest.
- As used herein an “effective polyA sequence” refers to either a synthetic or non-synthetic sequence that contains multiple and sequential nucleotides containing an adenine base (an A polynucleotide string) and that increases expression of the gene to which it is operably-linked. A polyA sequence may be operably-linked to any gene in the transposon-based vector including, but not limited to, a transposase gene and a gene of interest. A preferred polyA sequence is optimized for use in the host animal or human. In one embodiment, the polyA sequence is optimized for use in an avian species and more specifically, a chicken. An avian optimized polyA sequence generally contains a minimum of 40 base pairs, preferably between approximately 40 and several hundred base pairs, and more preferably approximately 75 base pairs that precede the A polynucleotide string and thereby separate the stop codon from the A polynucleotide string. In one embodiment of the present invention, the polyA sequence comprises a conalbumin polyA sequence as provided in SEQ ID NO:11 and as taken from GenBank accession #Y00407, base pairs 10651-11058. In another embodiment, the polyA sequence comprises a synthetic polynucleotide sequence shown in SEQ ID NO:12. In yet another embodiment, the polyA sequence comprises an avian optimized polyA sequence provided in SEQ ID NO:13. A chicken optimized polyA sequence may also have a reduced amount of CT repeats as compared to a synthetic polyA sequence.
- It is a surprising discovery of the present invention that such an avian optimized poly A sequence increases expression of a polynucleotide to which it is operably-linked in an avian as compared to a non-avian optimized polyA sequence. Accordingly, the present invention includes methods of or increasing incorporation of a gene of interest wherein the gene of interest resides in a transposon-based vector containing a transposase gene and wherein the transposase gene is operably linked to an avian optimized polyA sequence. The present invention also includes methods of increasing expression of a gene of interest in an avian that includes administering a gene of interest to the avian, wherein the gene of interest is operably-linked to an avian optimized polyA sequence. An avian optimized polyA nucleotide string is defined herein as a polynucleotide containing an A polynucleotide string and a minimum of 40 base pairs, preferably between approximately 40 and several hundred base pairs, and more preferably approximately 60 base pairs that precede the A polynucleotide string. The present invention further provides transposon-based vectors containing a gene of interest or transposase gene operably linked to an avian optimized polyA sequence.
- The first promoter operably-linked to the transposase gene and the second promoter operably-linked to the gene of interest can be a constitutive promoter or an inducible promoter. Constitutive promoters include, but are not limited to, immediate early cytomegalovirus (CMV) promoter, herpes simplex virus 1 (HSV1) immediate early promoter, SV40 promoter, lysozyme promoter, early and late CMV promoters, early and late HSV promoters, β-actin promoter, tubulin promoter, Rous-Sarcoma virus (RSV) promoter, and heat-shock protein (HSP) promoter. Inducible promoters include tissue-specific promoters, developmentally-regulated promoters and chemically inducible promoters. Examples of tissue-specific promoters include the glucose 6 phosphate (G6P) promoter, vitellogenin promoter, ovalbumin promoter, ovomucoid promoter, conalbumin promoter, ovotransferrin promoter, prolactin promoter, kidney uromodulin promoter, and placental lactogen promoter. In one embodiment, the vitellogenin promoter includes a polynucleotide sequence of SEQ ID NO:14. The G6P promoter sequence may be deduced from a rat G6P gene untranslated upstream region provided in GenBank accession number U57552.1. Examples of developmentally-regulated promoters include the homeobox promoters and several hormone induced promoters. Examples of chemically inducible promoters include reproductive hormone induced promoters and antibiotic inducible promoters such as the tetracycline inducible promoter and the zinc-inducible metallothionine promoter.
- Other inducible promoter systems include the Lac operator repressor system inducible by IPTG (isopropyl beta-D-thiogalactoside) (Cronin, A. et al. 2001. Genes and Development, v. 15), ecdysone-based inducible systems (Hoppe, U. C. et al. 2000. Mol. Ther. 1:159-164); estrogen-based inducible systems (Braselmann, S. et al. 1993. Proc. Natl. Acad. Sci. 90:1657-1661); progesterone-based inducible systems using a chimeric regulator, GLVP, which is a hybrid protein consisting of the GAL4 binding domain and the herpes simplex virus transcriptional activation domain, VP16, and a truncated form of the human progesterone receptor that retains the ability to bind ligand and can be turned on by RU486 (Wang, et al. 1994. Proc. Natl. Acad. Sci. 91:8180-8184); CID-based inducible systems using chemical inducers of dimerization (CIDs) to regulate gene expression, such as a system wherein rapamycin induces dimerization of the cellular proteins FKBP12 and FRAP (Belshaw, P. J. et al. 1996. J. Chem. Biol. 3:731-738; Fan, L. et al. 1999. Hum. Gene Ther. 10:2273-2285; Shariat, S. F. et al. 2001. Cancer Res. 61:2562-2571; Spencer, D. M. 1996. Curr. Biol. 6:839-847). Chemical substances that activate the chemically inducible promoters can be administered to the animal containing the transgene of interest via any method known to those of skill in the art.
- Other examples of cell or tissue-specific and constitutive promoters include but are not limited to smooth-muscle SM22 promoter, including chimeric SM22alpha/telokin promoters (Hoggatt A. M. et al., 2002. Circ Res. 91(12):1151-9); ubiquitin C promoter (Biochim Biophys Acta, 2003. Jan. 3;1625(l):52-63); Hsf2 promoter; murine COMP (cartilage oligomeric matrix protein) promoter; early B cell-specific mb-1 promoter (Sigvardsson M., et al., 2002. Mol. Cell Biol. 22(24):8539-51); prostate specific antigen (PSA) promoter (Yoshimura I. et al., 2002, J. Urol. 168(6):2659-64); exorh promoter and pineal expression-promoting element (Asaoka Y., et al., 2002. Proc. Natl. Acad. Sci. 99(24):15456-61); neural and liver ceramidase gene promoters (Okino N. et al., 2002. Biochem. Biophys. Res. Commun. 299(1):160-6); PSP94 gene promoter/enhancer (Gabril M. Y. et al., 2002. Gene Ther. 9(23): 1589-99); promoter of the human FAT/CD36 gene (Kuriki C., et al., 2002. Biol. Pharm. Bull. 25(11):1476-8); VL30 promoter (Staplin W. R. et al., 2002. Blood Oct. 24, 2002); and, IL-10 promoter (Brenner S., et al., 2002. J. Biol. Chem. Dec. 18, 2002).
- Examples of avian promoters include, but are not limited to, promoters controlling expression of egg white proteins, such as ovalbumin, ovotransferrin (conalbumin), ovomucoid, lysozyme, ovomucin, g2 ovoglobulin, g3 ovoglobulin, ovoflavoprotein, ovostatin (ovomacroglobin), cystatin, avidin, thiamine-binding protein, glutamyl aminopeptidase minor glycoprotein 1, minor glycoprotein 2; and promoters controlling expression of egg-yolk proteins, such as vitellogenin, very low-density lipoproteins, low density lipoprotein, cobalamin-binding protein, riboflavin-binding protein, biotin-binding protein (Awade, 1996. Z. Lebensm. Unters. Forsch. 202:1-14). An advantage of using the vitellogenin promoter is that it is active during the egg-laying stage of an animal's life-cycle, which allows for the production of the protein of interest to be temporally connected to the import of the protein of interest into the egg yolk when the protein of interest is equipped with an appropriate targeting sequence. In some embodiments, the avian promoter is an oviduct-specific promoter. As used herein, the term “oviduct-specific promoter” includes, but is not limited to, ovalbumin; ovotransferrin (conalbumin); ovomucoid; 01, 02, 03, 04 or 05 avidin; ovomucin; g2 ovoglobulin; g3 ovoglobulin; ovoflavoprotein; and ovostatin (ovomacroglobin) promoters.
- When germline transformation occurs via intraovarian administration, liver-specific promoters may be operably-linked to the gene of interest to achieve liver-specific expression of the transgene. Liver-specific promoters of the present invention include, but are not limited to, the following promoters, vitellogenin promoter, G6P promoter, cholesterol-7-alpha-hydroxylase (CYP7A) promoter, phenylalanine hydroxylase (PAH) promoter, protein C gene promoter, insulin-like growth factor I (IGF-I) promoter, bilirubin UDP-glucuronosyltransferase promoter, aldolase B promoter, furin promoter, metallothioneine promoter, albumin promoter, and insulin promoter.
- Also included in the present invention are promoters that can be used to target expression of a protein of interest into the milk of a milk-producing animal including, but not limited to, β lactoglobin promoter, whey acidic protein promoter, lactalbumin promoter and casein promoter.
- When germline transformation occurs via intraovarian administration, immune system-specific promoters may be operably-linked to the gene of interest to achieve immune system-specific expression of the transgene. Accordingly, promoters associated with cells of the immune system may also be used. Acute phase promoters such as interleukin (IL)-1 and IL-2 may be employed. Promoters for heavy and light chain Ig may also be employed. The promoters of the T cell receptor components CD4 and CD8, B cell promoters and the promoters of CR2 (complement receptor type 2) may also be employed. Immune system promoters are preferably used when the desired protein is an antibody protein.
- Also included in this invention are modified promoters/enhancers wherein elements of a single promoter are duplicated, modified, or otherwise changed. In one embodiment, steroid hormone-binding domains of the ovalbumin promoter are moved from about −6.5 kb to within approximately the first 1000 base pairs of the gene of interest. Modifying an existing promoter with promoter/enhancer elements not found naturally in the promoter, as well as building an entirely synthetic promoter, or drawing promoter/enhancer elements from various genes together on a non-natural backbone, are all encompassed by the current invention.
- Accordingly, it is to be understood that the promoters contained within the transposon-based vectors of the present invention may be entire promoter sequences or fragments of promoter sequences. For example, in one embodiment, the promoter operably linked to a gene of interest is an approximately 900 base pair fragment of a chicken ovalbumin promoter (SEQ ID NO:15). The constitutive and inducible promoters contained within the transposon-based vectors may also be modified by the addition of one or more modified Kozak sequences of ACCATG (SEQ ID NO:1).
- As indicated above, the present invention includes transposon-based vectors containing one or more enhancers. These enhancers may or may not be operably-linked to their native promoter and may be located at any distance from their operably-linked promoter. A promoter operably-linked to an enhancer and a promoter modified to eliminate repressive regulatory effects are referred to herein as an “enhanced promoter.” The enhancers contained within the transposon-based vectors are preferably enhancers found in birds, and more preferably, an ovalbumin enhancer, but are not limited to these types of enhancers. In one embodiment, an approximately 675 base pair enhancer element of an ovalbumin promoter is cloned upstream of an ovalbumin promoter with 300 base pairs of spacer DNA separating the enhancer and promoter. In one embodiment, the enhancer used as a part of the present invention comprises base pairs 1-675 of a chicken ovalbumin enhancer from GenBank accession #S82527.1. The polynucleotide sequence of this enhancer is provided in SEQ ID NO:16.
- Also included in some of the transposon-based vectors of the present invention are cap sites and fragments of cap sites. In one embodiment, approximately 50 base pairs of a 5′ untranslated region wherein the capsite resides are added on the 3′ end of an enhanced promoter or promoter. An exemplary 5′ untranslated region is provided in SEQ ID NO:17. A putative cap-site residing in this 5′ untranslated region preferably comprises the polynucleotide sequence provided in SEQ ID NO:18.
- In one embodiment of the present invention, the first promoter operably-linked to the transposase gene is a constitutive promoter and the second promoter operably-linked to the gene of interest is a tissue-specific promoter. In the second embodiment, use of the first constitutive promoter allows for constitutive activation of the transposase gene and incorporation of the gene of interest into virtually all cell types, including the germline of the recipient animal. Although the gene of interest is incorporated into the germline generally, the gene of interest may only be expressed in a tissue-specific manner. A transposon-based vector having a constitutive promoter operably-linked to the transposase gene can be administered by any route, and in one embodiment, the vector is administered to an ovary, to an artery leading to the ovary or to a lymphatic system or fluid proximal to the ovary.
- It should be noted that cell-or tissue-specific expression as described herein does not require a complete absence of expression in cells or tissues other than the preferred cell or tissue. Instead, “cell-specific” or “tissue-specific” expression refers to a majority of the expression of a particular gene of interest in the preferred cell or tissue, respectively.
- When incorporation of the gene of interest into the germline is not preferred, the first promoter operably-linked to the transposase gene can be a tissue-specific promoter. For example, transfection of a transposon-based vector containing a transposase gene operably-linked to an oviduct specific promoter such as the ovalbumin promoter provides for activation of the transposase gene and incorporation of the gene of interest in the cells of the oviduct but not into the germline and other cells generally. In this embodiment, the second promoter operably-linked to the gene of interest can be a constitutive promoter or an inducible promoter. In a preferred embodiment, both the first promoter and the second promoter are an ovalbumin promoter. In embodiments wherein tissue-specific expression or incorporation is desired, it is preferred that the transposon-based vector is administered directly to the tissue of interest, to an artery leading to the tissue of interest or to fluids surrounding the tissue of interest. In a preferred embodiment, the tissue of interest is the oviduct and administration is achieved by direct injection into the oviduct or an artery leading to the oviduct. In a further preferred embodiment, administration is achieved by direct injection into the lumen of the magnum or the infundibulum of the oviduct. Indirect administration to the oviduct may occur through the cloaca.
- Accordingly, cell specific promoters may be used to enhance transcription in selected tissues. In birds, for example, promoters that are found in cells of the fallopian tube, such as ovalbumin, conalbumin, ovomucoid and/or lysozyme, are used in the vectors to ensure transcription of the gene of interest in the epithelial cells and tubular gland cells of the fallopian tube, leading to synthesis of the desired protein encoded by the gene and deposition into the egg white. In mammals, promoters specific for the epithelial cells of the alveoli of the mammary gland, such as prolactin, insulin, beta lactoglobin, whey acidic protein, lactalbumin, casein, and/or placental lactogen, are used in the design of vectors used for transfection of these cells for the production of desired proteins for deposition into the milk. In liver cells, the G6P promoter may be employed to drive transcription of the gene of interest for protein production. Proteins made in the liver of birds may be delivered to the egg yolk.
- In order to achieve higher or more efficient expression of the transposase gene, the promoter and other regulatory sequences operably-linked to the transposase gene may be those derived from the host. These host specific regulatory sequences can be tissue specific as described above or can be of a constitutive nature. For example, an avian actin promoter and its associated polyA sequence can be operably-linked to a transposase in a transposase-based vector for transfection into an avian. Examples of other host specific promoters that could be operably-linked to the transposase include the myosin and DNA or RNA polymerase promoters.
- Directing Sequences
- In some embodiments of the present invention, the gene of interest is operably-linked to a directing sequence or a sequence that provides proper conformation to the desired protein encoded by the gene of interest. As used herein, the term “directing sequence” refers to both signal sequences and targeting sequences. An egg directing sequence includes, but is not limited to, an ovomucoid signal sequence, an ovalbumin signal sequence, a cecropin pre pro signal sequence, and a vitellogenin targeting sequence. The term “signal sequence” refers to an amino acid sequence, or the polynucleotide sequence that encodes the amino acid sequence, that directs the protein to which it is linked to the endoplasmic reticulum in a eukaryote, and more preferably the translocational pores in the endoplasmic reticulum, or the plasma membrane in a prokaryote, or mitochondria, such as for the purpose of gene therapy for mitochondrial diseases. Signal and targeting sequences can be used to direct a desired protein into, for example, the milk, when the transposon-based vectors are administered to a milk-producing animal.
- Signal sequences can also be used to direct a desired protein into, for example, a secretory pathway for incorporation into the egg yolk or the egg white, when the transposon-based vectors are administered to a bird or other egg-laying animal. One example of such a transposon-based vector is provided in FIG. 3 wherein the gene of interest is operably linked to the ovomucoid signal sequence. The present invention also includes a gene of interest operably-linked to a second gene containing a signal sequence. An example of such an embodiment is shown in FIG. 2 wherein the gene of interest is operably-linked to the ovalbumin gene that contains an ovalbumin signal sequence. Other signal sequences that can be included in the transposon-based vectors include, but are not limited to the ovotransferrin and lysozyme signal sequences. In one embodiment, the signal sequence is an ovalbumin signal sequence including a sequence shown in SEQ ID NO:19. In another embodiment, the signal sequence is a modified ovalbumin signal sequence including a sequence shown in SEQ ID NO:20 or SEQ ID NO:21.
- As also used herein, the term “targeting sequence” refers to an amino acid sequence, or the polynucleotide sequence encoding the amino acid sequence, which amino acid sequence is recognized by a receptor located on the exterior of a cell. Binding of the receptor to the targeting sequence results in uptake of the protein or peptide operably-linked to the targeting sequence by the cell. One example of a targeting sequence is a vitellogenin targeting sequence that is recognized by a vitellogenin receptor (or the low density lipoprotein receptor) on the exterior of an oocyte. In one embodiment, the vitellogenin targeting sequence includes the polynucleotide sequence of SEQ ID NO:22. In another embodiment, the vitellogenin targeting sequence includes all or part of the vitellogenin gene. Other targeting sequences include VLDL and Apo E, which are also capable of binding the vitellogenin receptor. Since the ApoE protein is not endogenously expressed in birds, its presence may be used advantageously to identify birds carrying the transposon-based vectors of the present invention.
- A gene of interest selected for stable incorporation is designed to encode any desired protein or peptide or to regulate any cellular response. In some embodiments, the desired proteins or peptides are deposited in an egg or in milk. It is to be understood that the present invention encompasses transposon-based vectors containing multiple genes of interest. The multiple genes of interest may each be operably-linked to a separate promoter and other regulatory sequence(s) or may all be operably-linked to the same promoter and other regulatory sequences(s). In one embodiment, multiple gene of interest are linked to a single promoter and other regulatory sequence(s) and each gene of interest is separated by a cleavage site or a pro portion of a signal sequence. A gene of interest may contain modifications of the codons for the first several N-terminal amino acids of the gene of interest, wherein the third base of each codon is changed to an A or a T without changing the corresponding amino acid.
- Protein and peptide hormones are a preferred class of proteins in the present invention. Such protein and peptide hormones are synthesized throughout the endocrine system and include, but are not limited to, hypothalamic hormones and hypophysiotropic hormones, anterior, intermediate and posterior pituitary hormones, pancreatic islet hormones, hormones made in the gastrointestinal system, renal hormones, thymic hormones, parathyroid hormones, adrenal cortical and medullary hormones. Specifically, hormones that can be produced using the present invention include, but are not limited to, chorionic gonadotropin, corticotropin, erythropoietin, glucagons, IGF-1, oxytocin, platelet-derived growth factor, calcitonin, follicle-stimulating hormone, luteinizing hormone, thyroid-stimulating hormone, insulin, gonadotropin-releasing hormone and its analogs, vasopressin, octreotide, somatostatin, prolactin, adrenocorticotropic hormone, antidiuretic hormone, thyrotropin-releasing hormone (TRH), growth hormone-releasing hormone (GHRH), dopamine, melatonin, thyroxin (T4), parathyroid hormone (PTH), glucocorticoids such as cortisol, mineralocorticoids such as aldosterone, androgens such as testosterone, adrenaline (epinephrine), noradrenaline (norepinephrine), estrogens such as estradiol, progesterone, glucagons, calcitrol, calciferol, atrial-natriuretic peptide, gastrin, secretin, cholecystokinin (CCK), neuropeptide Y, ghrelin, PYY3-36, angiotensinogen, thrombopoietin, and leptin. By using appropriate polynucleotide sequences, species-specific hormones may be made by transgenic animals.
- In one embodiment of the present invention, the gene of interest is a proinsulin gene and the desired molecule is insulin. Proinsulin consists of three parts: a C-peptide and two strands of amino acids (the alpha and beta chains) that later become linked together to form the insulin molecule. FIGS. 2 and 3 are schematics of transposon-based vector constructs containing a proinsulin gene operably-linked to an ovalbumin promoter and ovalbumin protein or an ovomucoid promoter and ovomucoid signal sequence, respectively. In these embodiments, proinsulin is expressed in the oviduct tubular gland cells and then deposited in the egg white. One example of a proinsulin polynucleotide sequence is shown in SEQ ID NO:23, wherein the C-peptide cleavage site spans from Arg at position 31 to Arg at position 65.
- Serum proteins including lipoproteins such as high density lipoprotein (HDL), HDL-Milano and low density lipoprotein, albumin, clotting cascade factors, factor VIII, factor IX, fibrinogen, and globulins are also included in the group of desired proteins of the present invention. Immunoglobulins are one class of desired globulin molecules and include but are not limited to IgG, IgM, IgA, IgD, IgE, IgY, lambda chains, kappa chains and fragments thereof; Fc fragments, and Fab fragments. Desired antibodies include, but are not limited to, naturally occurring antibodies, human antibodies, humanized antibodies, and hybrid antibodies. Genes encoding modified versions of naturally occurring antibodies or fragments thereof and genes encoding artificially designed antibodies or fragments thereof may be incorporated into the transposon-based vectors of the present invention. Desired antibodies also include antibodies with the ability to bind specific ligands, for example, antibodies against proteins associated with cancer-related molecules, such as anti-her 2, or anti-CA125. Accordingly, the present invention encompasses a transposon-based vector containing one or more genes encoding a heavy immunoglobulin (Ig) chain and a light Ig chain. Further, more than one gene encoding for more than one antibody may be administered in one or more transposon-based vectors of the present invention. In this manner, an egg may contain more than one type of antibody in the egg white, the egg yolk or both. In one embodiment, a transposon-based vector contains a heavy Ig chain and a light Ig chain, both operably linked to a promoter.
- Antibodies used as therapeutic reagents include but are not limited to antibodies for use in cancer immunotherapy against specific antigens, or for providing passive immunity to an animal or a human against an infectious disease or a toxic agent. Antibodies used as diagnostic reagents include, but are not limited to antibodies that may be labeled and detected with a detector, for example antibodies with a fluorescent label attached that may be detected following exposure to specific wavelengths. Such labeled antibodies may be primary antibodies directed to a specific antigen, for example, rhodamine-labeled rabbit anti-growth hormone, or may be labeled secondary antibodies, such as fluorescein-labeled goat-anti chicken IgG. Such labeled antibodies are known to one of ordinary skill in the art. Labels useful for attachment to antibodies are also known to one of ordinary skill in the art. Some of these labels are described in the “Handbook of Fluorescent Probes and Research Products”, ninth edition, Richard P. Haugland (ed) Molecular Probes, Inc. Eugene, Oreg.), which is incorporated herein in its entirety.
- Antibodies produced with using the present invention may be used as laboratory reagents for numerous applications including radioimmunoassay, western blots, dot blots, ELISA, immunoaffinity columns and other procedures requiring antibodies as known to one of ordinary skill in the art. Such antibodies include primary antibodies, secondary antibodies and tertiary antibodies, which may be labeled or unlabeled.
- Antibodies that may be made with the practice of the present invention include, but are not limited to primary antibodies, secondary antibodies, designer antibodies, anti-protein antibodies, anti-peptide antibodies, anti-DNA antibodies, anti-RNA antibodies, anti-hormone antibodies, anti-hypophysiotropic peptides, antibodies against non-natural antigens, anti-anterior pituitary hormone antibodies, anti-posterior pituitary hormone antibodies, anti-venom antibodies, anti-tumor marker antibodies, antibodies directed against epitopes associated with infectious disease, including, anti-viral, anti-bacterial, anti-protozoal, anti-fungal, anti-parasitic, anti-receptor, anti-lipid, anti-phospholipid, anti-growth factor, anti-cytokine, anti-monokine, anti-idiotype, and anti-accessory (presentation) protein antibodies. Antibodies made with the present invention, as well as light chains or heavy chains, may also be used to inhibit enzyme activity.
- Antibodies that may be produced using the present invention include, but are not limited to, antibodies made against the following proteins: Bovine γ-Globulin, Serum; Bovine IgG, Plasma; Chicken γ-Globulin, Serum; Human γ-Globulin, Serum; Human IgA, Plasma; Human IgA1, Myeloma; Human IgA2, Myeloma; Human IgA2, Plasma; Human IgD, Plasma; Human IgE, Myeloma; Human IgG, Plasma; Human IgG, Fab Fragment, Plasma; Human IgG, F(ab′)2 Fragment, Plasma; Human IgG, Fc Fragment, Plasma; Human IgG1, Myeloma; Human IgG2, Myeloma; Human IgG3, Myeloma; Human IgG4, Myeloma; Human IgM, Myeloma; Human IgM, Plasma; Human Immunoglobulin, Light Chain K, Urine; Human Immunoglobulin, Light Chains κ and λ, Plasma; Mouse γ-Globulin, Serum; Mouse IgG, Serum; Mouse IgM, Myeloma; Rabbit γ-Globulin, Serum; Rabbit IgG, Plasma; and Rat γ-Globulin, Serum. In one embodiment, the transposon-based vector comprises the coding sequence of light and heavy chains of a murine monoclonal antibody that shows specificity for human seminoprotein (GenBank Accession numbers AY129006 and AY129304 for the light and heavy chains, respectively).
- A further non-limiting list of antibodies that recognize other antibodies is as follows: Anti-Chicken IgG, heavy (H) & light (L) Chain Specific (Sheep); Anti-Goat γ-Globulin (Donkey); Anti-Goat IgG, Fc Fragment Specific (Rabbit); Anti-Guinea Pig γ-Globulin (Goat); Anti-Human Ig, Light Chain, Type κ Specific; Anti-Human Ig, Light Chain, Type λ Specific; Anti-Human IgA, α-Chain Specific (Goat); Anti-Human IgA, Fab Fragment Specific; Anti-Human IgA, Fc Fragment Specific; Anti-Human IgA, Secretory; Anti-Human IgE, ε-Chain Specific (Goat); Anti-Human IgE, Fc Fragment Specific; Anti-Human IgG, Fc Fragment Specific (Goat); Anti-Human IgG, γ-Chain Specific (Goat); Anti-Human IgG, Fc Fragment Specific; Anti-Human IgG, Fd Fragment Specific; Anti-Human IgG, H & L Chain Specific (Goat); Anti-Human IgG1, Fc Fragment Specific; Anti-Human IgG2, Fc Fragment Specific; Anti-Human IgG2, Fd Fragment Specific; Anti-Human IgG3, Hinge Specific; Anti-Human IgG4, Fc Fragment Specific; Anti-Human IgM, Fc Fragment Specific; Anti-Human IgM, μ-Chain Specific; Anti-Mouse IgE, ε-Chain Specific; Anti-Mouse γ-Globulin (Goat); Anti-Mouse IgG, γ-Chain Specific (Goat); Anti-Mouse IgG, γ-Chain Specific (Goat) F(ab′)2 Fragment; Anti-Mouse IgG, H & L Chain Specific (Goat); Anti-Mouse IgM, μ-Chain Specific (Goat); Anti-Mouse IgM, H & L Chain Specific (Goat); Anti-Rabbit γ-Globulin (Goat); Anti-Rabbit IgG, Fc Fragment Specific (Goat); Anti-Rabbit IgG, H & L Chain Specific (Goat); Anti-Rat γ-Globulin (Goat); Anti-Rat IgG, H & L Chain Specific; Anti-Rhesus Monkey γ-Globulin (Goat); and, Anti-Sheep IgG, H & L Chain Specific.
- Another non-limiting list of the antibodies that may be produced using the present invention is provided in product catalogs of companies such as Phoenix Pharmaceuticals, Inc. (www.phoenixpeptide.com; 530 Harbor Boulevard, Belmont, Calif.), Peninsula Labs (San Carlos Calif.), SIGMA (St.Louis, Mo. www.sigma-aldrich.com), Cappel ICN (Irvine, Calif., www.icnbiomed.com), and Calbiochem (La Jolla, Calif., www.calbiochem.com), which are all incorporated herein by reference in their entirety. The polynucleotide sequences encoding these antibodies may be obtained from the scientific literature, from patents, and from databases such as GenBank. Alternatively, one of ordinary skill in the art may design the polynucleotide sequence to be incorporated into the genome by choosing the codons that encode for each amino acid in the desired antibody. Antibodies made by the transgenic animals of the present invention include antibodies that may be used as therapeutic reagents, for example in cancer immunotherapy against specific antigens, as diagnostic reagents and as laboratory reagents for numerous applications including immunoneutralization, radioimmunoassay, western blots, dot blots, ELISA, immunoprecipitation and immunoaffinity columns. Some of these antibodies include, but are not limited to, antibodies which bind the following ligands: adrenomedulin, amylin, calcitonin, amyloid, calcitonin gene-related peptide, cholecystokinin, gastrin, gastric inhibitory peptide, gastrin releasing peptide, interleukin, interferon, cortistatin, somatostatin, endothelin, sarafotoxin, glucagon, glucagon-like peptide, insulin, atrial natriuretic peptide, BNP, CNP, neurokinin, substance P, leptin, neuropeptide Y, melanin concentrating hormone, melanocyte stimulating hormone, orphanin, endorphin, dynorphin, enkephalin, enkephalin, leumorphin, peptide F, PACAP, PACAP-related peptide, parathyroid hormone, urocortin, corticotrophin releasing hormone, PHM, PHI, vasoactive intestinal polypeptide, secretin, ACTH, angiotensin, angiostatin, bombesin, endostatin, bradykinin, FMRF amide, galanin, gonadotropin releasing hormone (GnRH) associated peptide, GnRH, growth hormone releasing hormone, inhibin, granulocyte-macrophage colony stimulating factor (GM-CSF), motilin, neurotensin, oxytocin, vasopressin, osteocalcin, pancreastatin, pancreatic polypeptide, peptide YY, proopiomelanocortin, transforming growth factor, vascular endothelial growth factor, vesicular monoamine transporter, vesicular acetylcholine transporter, ghrelin, NPW, NPB, C3d, prokinetican, thyroid stimulating hormone, luteinizing hormone, follicle stimulating hormone, prolactin, growth hormone, beta-lipotropin, melatonin, kallikriens, kinins, prostaglandins, erythropoietin, p146 (SEQ ID NO:24 amino acid sequence, SEQ ID NO:25, nucleotide sequence), estrogen, testosterone, corticosteroids, mineralocorticoids, thyroid hormone, thymic hormones, connective tissue proteins, nuclear proteins, actin, avidin, activin, agrin, albumin, and prohormones, propeptides, splice variants, fragments and analogs thereof.
- The following is yet another non-limiting list of antibodies that can be produced by the methods of present invention: abciximab (ReoPro), abciximab anti-platelet aggregation monoclonal antibody, anti-CD11a (hu1124), anti-CD 18 antibody, anti-CD20 antibody, anti-cytomegalovirus (CMV) antibody, anti-digoxin antibody, anti-hepatitis B antibody, anti-HER-2 antibody, anti-idiotype antibody to GD3 glycolipid, anti-IgE antibody, anti-IL-2R antibody, antimetastatic cancer antibody (mAb 17-1A), anti-rabies antibody, anti-respiratory syncytial virus (RSV) antibody, anti-Rh antibody, anti-TCR, anti-TNF antibody, anti-VEGF antibody and fab fragment thereof, rattlesnake venom antibody, black widow spider venom antibody, coral snake venom antibody, antibody against very late antigen-4 (VLA-4), C225 humanized antibody to EGF receptor, chimeric (human & mouse) antibody against TNFo, antibody directed against GPIIb/IIIa receptor on human platelets, gamma globulin, anti-hepatitis B immunoglobulin, human anti-D immunoglobulin, human antibodies against S aureus, human tetanus immunoglobulin, humanized antibody against the epidermal growth receptor-2, humanized antibody against the a subunit of the interleukin-2 receptor, humanized antibody CTLA4IG, humanized antibody to the IL-2 R α-chain, humanized anti-CD40-ligand monoclonal antibody (5c8), humanized mAb against the epidermal growth receptor-2, humanized mAb to rous sarcoma virus, humanized recombinant antibody (IgGlk) against respiratory syncytial virus (RSV), lymphocyte immunoglobulin (anti-thymocyte antibody), lymphocyte immunoglobulin, mAb against factor VII, MDX-210 bi-specific antibody against HER-2, MDX-22, MDX-220 bi-specific antibody against TAG-72 on tumors, MDX-33 antibody to FcγR1 receptor, MDX-447 bi-specific antibody against EGF receptor, MDX-447 bispecific humanized antibody to EGF receptor, MDX-RA immunotoxin (ricin A linked) antibody, Medi-507 antibody (humanized form of BTI-322) against CD2 receptor on T-cells, monoclonal antibody LDP-02, muromonab-CD3(OKT3) antibody, OKT3 (“muromomab-CD3”) antibody, PRO 542 antibody, ReoPro (“abciximab”) antibody, and TNF-IgG fusion protein.
- The antibodies prepared using the methods of the present invention may also be designed to possess specific labels that may be detected through means known to one of ordinary skill in the art. The antibodies may also be designed to possess specific sequences useful for purification through means known to one of ordinary skill in the art. Specialty antibodies designed for binding specific antigens may also be made in transgenic animals using the transposon-based vectors of the present invention.
- Production of a monoclonal antibody using the transposon-based vectors of the present invention can be accomplished in a variety of ways. In one embodiment, two vectors may be constructed: one that encodes the light chain, and a second vector that encodes the heavy chain of the monoclonal antibody. These vectors may then be incorporated into the genome of the target animal by methods disclosed herein. In an alternative embodiment, the sequences encoding light and heavy chains of a monoclonal antibody may be included on a single DNA construct. For example, the coding sequence of light and heavy chains of a murine monoclonal antibody that show specificity for human seminoprotein can be expressed using transposon-based constructs of the present invention (GenBank Accession numbers AY129006 and AY129304 for the light and heavy chains, respectively).
- Further included in the present invention are proteins and peptides synthesized by the immune system including those synthesized by the thymus, lymph nodes, spleen, and the gastrointestinal associated lymph tissues (GALT) system. The immune system proteins and peptides proteins that can be made in transgenic animals using the transposon-based vectors of the present invention include, but are not limited to, alpha-interferon, beta-interferon, gamma-interferon, alpha-interferon A, alpha-interferon 1, G-CSF, GM-CSF, interlukin-1 (IL-1), IL-2, IL-3, IL4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-13, TNF-α, and TNF-β. Other cytokines included in the present invention include cardiotrophin, stromal cell derived factor, macrophage derived chemokine (MDC), melanoma growth stimulatory activity (MGSA), macrophage inflammatory proteins 1 alpha (MIP-1 alpha), 2, 3 alpha, 3 beta, 4 and 5.
- Lytic peptides such as p146 are also included in the desired molecules of the present invention. In one embodiment, the p146 peptide comprises an amino acid sequence of SEQ ID NO:24. The present invention also encompasses a transposon-based vector comprising a p146 nucleic acid comprising a polynucleotide sequence of SEQ ID NO:25.
- Enzymes are another class of proteins that may be made through the use of the transposon-based vectors of the present invention. Such enzymes include but are not limited to adenosine deaminase, alpha-galactosidase, cellulase, collagenase, dnaseI, hyaluronidase, lactase, L-asparaginase, pancreatin, papain, streptokinase B, subtilisin, superoxide dismutase, thrombin, trypsin, urokinase, fibrinolysin, glucocerebrosidase and plasminogen activator. In some embodiments wherein the enzyme could have deleterious effects, additional amino acids and a protease cleavage site are added to the carboxy end of the enzyme of interest in order to prevent expression of a functional enzyme. Subsequent digestion of the enzyme with a protease results in activation of the enzyme.
- Extracellular matrix proteins are one class of desired proteins that may be made through the use of the present invention. Examples include but are not limited to collagen, fibrin, elastin, laminin, and fibronectin and subtypes thereof. Intracellular proteins and structural proteins are other classes of desired proteins in the present invention.
- Growth factors are another desired class of proteins that may be made through the use of the present invention and include, but are not limited to, transforming growth factor-α (“TGF-α”), transforming growth factor-β (TGF-β), platelet-derived growth factors (PDGF), fibroblast growth factors (FGF), including FGF acidic isoforms 1 and 2, FGF basic form 2 and FGF 4, 8, 9 and 10, nerve growth factors (NGF) including NGF 2.5s, NGF 7.0s and beta NGF and neurotrophins, brain derived neurotrophic factor, cartilage derived factor, growth factors for stimulation of the production of red blood cells, growth factors for stimulation of the production of white blood cells, bone growth factors (BGF), basic fibroblast growth factor, vascular endothelial growth factor (VEGF), granulocyte colony stimulating factor (G-CSF), insulin like growth factor (IGF) I and II, hepatocyte growth factor, glial neurotrophic growth factor (GDNF), stem cell factor (SCF), keratinocyte growth factor (KGF), transforming growth factors (TGF), including TGFs alpha, beta, beta1, beta2, beta3, skeletal growth factor, bone matrix derived growth factors, bone derived growth factors, erythropoietin (EPO) and mixtures thereof.
- Another desired class of proteins that may be made may be made through the use of the present invention include, but are not limited to, leptin, leukemia inhibitory factor (LIF), tumor necrosis factor alpha and beta, ENBREL, angiostatin, endostatin, thrombospondin, osteogenic protein-1, bone morphogenetic proteins 2 and 7, osteonectin, somatomedin-like peptide, and osteocalcin.
- Yet another desired class of proteins are blood proteins or clotting cascade protein including albumin, Prekallikrein, High molecular weight kininogen (HMWK) (contact activation cofactor; Fitzgerald, Flaujeac Williams factor), Factor I (Fibrinogen), Factor II (prothrombin), Factor III (Tissue Factor), Factor IV (calcium), Factor V (proaccelerin, labile factor, accelerator (Ac-) globulin), Factor VI (Va) (accelerin), Factor VII (proconvertin), serum prothrombin conversion accelerator (SPCA), cothromboplastin), Factor VIII (antihemophiliac factor A, antihemophilic globulin (AHG)), Factor IX (Christmas Factor, antihemophilic factor B,plasma thromboplastin component (PTC)), Factor X (Stuart-Prower Factor), Factor XI (Plasma thromboplastin antecedent (PTA)), Factor XII (Hageman Factor), Factor XIII (rotransglutaminase, fibrin stabilizing factor (FSF), fibrinoligase), von Willebrand factor, Protein C, Protein S, Thrombomodulin, Antithrombin III.
- A non-limiting list of the peptides and proteins that may be made may be made through the use of the present invention is provided in product catalogs of companies such as Phoenix Pharmaceuticals, Inc. (www.phoenixpeptide.com; 530 Harbor Boulevard, Belmont, Calif.), Peninsula Labs (San Carlos Calif.), SIGMA, (St.Louis, Mo. www.sigma-aldrich.com), Cappel ICN (Irvine, Calif., www.icnbiomed.com), and Calbiochem (La Jolla, Calif., www.calbiochem.com). The polynucleotide sequences encoding these proteins and peptides of interest may be obtained from the scientific literature, from patents, and from databases such as GenBank. Alternatively, one of ordinary skill in the art may design the polynucleotide sequence to be incorporated into the genome by choosing the codons that encode for each amino acid in the desired protein or peptide.
- Some of these desired proteins or peptides that may be made through the use of the present invention include but are not limited to the following: adrenomedulin, amylin, calcitonin, amyloid, calcitonin gene-related peptide, cholecystokinin, gastrin, gastric inhibitory peptide, gastrin releasing peptide, interleukin, interferon, cortistatin, somatostatin, endothelin, sarafotoxin, glucagon, glucagon-like peptide, insulin, atrial natriuretic peptide, BNP, CNP, neurokinin, substance P, leptin, neuropeptide Y, melanin concentrating hormone, melanocyte stimulating hormone, orphanin, endorphin, dynorphin, enkephalin, leumorphin, peptide F, PACAP, PACAP-related peptide, parathyroid hormone, urocortin, corticotrophin releasing hormone, PHM, PHI, vasoactive intestinal polypeptide, secretin, ACTH, angiotensin, angiostatin, bombesin, endostatin, bradykinin, FMRF amide, galanin, gonadotropin releasing hormone (GnRH) associated peptide, GnRH, growth hormone releasing hormone, inhibin, granulocyte-macrophage colony stimulating factor (GM-CSF), motilin, neurotensin, oxytocin, vasopressin, osteocalcin, pancreastatin, pancreatic polypeptide, peptide YY, proopiomelanocortin, transforming growth factor, vascular endothelial growth factor, vesicular monoamine transporter, vesicular acetylcholine transporter, ghrelin, NPW, NPB, C3d, prokinetican, thyroid stimulating hormone, luteinizing hormone, follicle stimulating hormone, prolactin, growth hormone, beta-lipotropin, melatonin, kallikriens, kinins, prostaglandins, erythropoietin, p146 (SEQ ID NO:24, amino acid sequence, SEQ ID NO:25, nucleotide sequence), thymic hormones, connective tissue proteins, nuclear proteins, actin, avidin, activin, agrin, albumin, apolipoproteins, apolipoprotein A, apolipoprotein B, and prohormones, propeptides, splice variants, fragments and analogs thereof.
- Other desired proteins that may be made by the transgenic animals of the present invention include bacitracin, polymixin b, vancomycin, cyclosporine, anti-RSV antibody, alpha-1 antitrypsin (AAT), anti-cytomegalovirus antibody, anti-hepatitis antibody, anti-inhibitor coagulant complex, anti-rabies antibody, anti-Rh(D) antibody, adenosine deaminase, anti-digoxin antibody, antivenin crotalidae (rattlesnake venom antibody), antivenin latrodectus (black widow spider venom antibody), antivenin micrurus (coral snake venom antibody), aprotinin, corticotropin (ACTH), diphtheria antitoxin, lymphocyte immune globulin (anti-thymocyte antibody), protamine, thyrotropin, capreomycin, α-galactosidase, gramicidin, streptokinase, tetanus toxoid, tyrothricin, IGF-1, proteins of varicella vaccine, anti-TNF antibody, anti-IL-2r antibody, anti-HER-2 antibody, OKT3 (“muromonab-CD3”) antibody, TNF-IgG fusion protein, ReoPro (“abciximab”) antibody, ACTH fragment 1-24, desmopressin, gonadotropin-releasing hormone, histrelin, leuprolide, lypressin, nafarelin, peptide that binds GPIIb/GPIIIa on platelets (integrilin), goserelin, capreomycin, colistin, anti-respiratory syncytial virus, lymphocyte immune globulin (Thymoglovin, Atgam), panorex, alpha-antitrypsin, botulinin, lung surfactant protein, tumor necrosis receptor-IgG fusion protein (enbrel), gonadorelin, proteins of influenza vaccine, proteins of rotavirus vaccine, proteins of haemophilus b conjugate vaccine, proteins of poliovirus vaccine, proteins of pneumococcal conjugate vaccine, proteins of meningococcal C vaccine, proteins of influenza vaccine, megakaryocyte growth and development factor (MGDF), neuroimmunophilin ligand-A (NIL-A), brain-derived neurotrophic factor (BDNF), glial cell line-derived neurotrophic factor (GDNF), leptin (native), leptin B, leptin C, IL-1RA (interleukin-1RA), R-568, novel erythropoiesis-stimulating protein (NESP), humanized mAb to rous sarcoma virus (MEDI-493), glutamyl-tryptophan dipeptide IM862, LFA-3TIP immunosuppressive, humanized anti-CD40-ligand monoclonal antibody (5c8), gelsonin enzyme, tissue factor pathway inhibitor (TFPI), proteins of meningitis B vaccine, antimetastatic cancer antibody (mAb 17-1A), chimeric (human & mouse) mAb against TNFα, mAb against factor VII, relaxin, capreomycin, glycopeptide (LY333328), recombinant human activated protein C (rhAPC), humanized mAb against the epidermal growth receptor-2, altepase, anti-CD20 antigen, C2B8 antibody, insulin-like growth factor-1, atrial natriuretic peptide (anaritide), tenectaplase, anti-CD11a antibody (hu 1124), anti-CD18 antibody, mAb LDP-02, anti-VEGF antibody, fab fragment of anti-VEGF Ab, AP02 ligand (tumor necrosis factor-related apoptosis-inducing ligand), rTGF-β (transforming growth factor-β), alpha-antitrypsin, ananain (a pineapple enzyme), humanized mAb CTLA4IG, PRO 542 (mAb), D2E7 (mAb), calf intestine alkaline phosphatase, α-L-iduronidase, α-L-galactosidase (humanglutamic acid decarboxylase, acid sphingomyelinase, bone morphogenetic protein-2 (rhBMP-2), proteins of HIV vaccine, T cell receptor (TCR) peptide vaccine, TCR peptides, V beta 3 and V beta 13.1. (IR502), (IR501), BI 1050/1272 mAb against very late antigen 4 (VLA-4), C225 humanized mAb to EGF receptor, anti-idiotype antibody to GD3 glycolipid, antibacterial peptide againstH. pylori, MDX-447 bispecific humanized mAb to EGF receptor, anti-cytomegalovirus (CMV), Medi-491 B 19 parvovirus vaccine, humanized recombinant mAb (IgG1k) against respiratory syncytial virus (RSV), urinary tract infection vaccine (against “pili” on Escherechia coli strains), proteins of lyme disease vaccine against B. burgdorferi protein (DbpA), proteins of Medi-501 human papilloma virus-11 vaccine (HPV), Streptococcus pneumoniae vaccine, Medi-507 mAb (humanized form of BTI-322) against CD2 receptor on T-cells, MDX-33 mAb to FcγR1 receptor, MDX-RA immunotoxin (ricin A linked) mAb, MDX-210 bi-specific mAb against HER-2, MDX-447 bi-specific mAb against EGF receptor, MDX-22, MDX-220 bi-specific mAb against TAG-72 on tumors, colony-stimulating factor (CSF) (molgramostim), humanized mAb to the IL-2 R α-chain (basiliximab), mAb to IgE (IGE 025A), myelin basic protein-altered peptide (MSP771A), humanized mAb against the epidermal growth receptor-2, humanized mAb against the α subunit of the interleukin-2 receptor, low molecular weight heparin, anti-hemophillic factor, and bactericidal/permeability-increasing protein (r-BPI).
- The peptides and proteins made using the present invention may be labeled using labels and techniques known to one of ordinary skill in the art. Some of these labels are described in the “Handbook of Fluorescent Probes and Research Products”, ninth edition, Richard P. Haugland (ed) Molecular Probes, Inc. Eugene, Oreg.), which is incorporated herein in its entirety. Some of these labels may be genetically engineered into the polynucleotide sequence for the expression of the selected protein or peptide. The peptides and proteins may also have label-incorporation “handles” incorporated to allow labeling of an otherwise difficult or impossible to label protein.
- It is to be understood that the various classes of desired peptides and proteins, as well as specific peptides and proteins described in this section may be modified as described below by inserting selected codons for desired amino acid substitutions into the gene incorporated into the transgenic animal.
- The present invention may also be used to produce desired molecules other than proteins and peptides including, but not limited to, lipoproteins such as high density lipoprotein (HDL), HDL-Milano, and low density lipoprotein, lipids, carbohydrates, siRNA and ribozymes. In these embodiments, a gene of interest encodes a nucleic acid molecule or a protein that directs production of the desired molecule.
- The present invention further encompasses the use of inhibitory molecules to inhibit endogenous (i.e., non-vector) protein production. These inhibitory molecules include antisense nucleic acids, siRNA and inhibitory proteins. In a preferred embodiment, the endogenous protein whose expression is inhibited is an egg white protein including, but not limited to ovalbumin, ovotransferrin, and ovomucin. In one embodiment, a transposon-based vector containing an ovalbumin DNA sequence, that upon transcription forms a double stranded RNA molecule, is transfected into an animal such as a bird and the bird's production of endogenous ovalbumin protein is reduced by the interference RNA mechanism (RNAi). In other embodiments, a transposon-based vector encodes an inhibitory RNA molecule that inhibits the expression of more than one egg white protein. One exemplary construct is provided in FIG. 4 wherein “Ovgen” indicates approximately 60 base pairs of an ovalbumin gene, “Ovotrans” indicates approximately 60 base pairs of an ovotransferrin gene and “Ovomucin” indicates approximately 60 base pairs of an ovomucin gene. These ovalbumin, ovotransferrin and ovomucin can be from any avian species, and in some embodiments, are from a chicken or quail. The term “pro” indicates the pro portion of a prepro sequence. One exemplary prepro sequence is that of cecropin and comprising base pairs 563-733 of the Cecropin cap site and Prepro provided in Genbank accession number X07404. Additional cecropin prepro and pro sequences are provided in SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, and SEQ ID NO:51. Additionally, inducible knockouts or knockdowns of the endogenous protein may be created to achieve a reduction or inhibition of endogenous protein production. Endogenous egg white production can be inhibited in an avian at any time, but is preferably inhibited preceding, or immediately preceding, the harvest of eggs.
- “Proteins”, “peptides,” “polypeptides” and “oligopeptides” are chains of amino acids (typically L-amino acids) whose alpha carbons are linked through peptide bonds formed by a condensation reaction between the carboxyl group of the alpha carbon of one amino acid and the amino group of the alpha carbon of another amino acid. The terminal amino acid at one end of the chain (i.e., the amino terminal) has a free amino group, while the terminal amino acid at the other end of the chain (i.e., the carboxy terminal) has a free carboxyl group. As such, the term “amino terminus” (abbreviated N-terminus) refers to the free alpha-amino group on the amino acid at the amino terminal of the protein, or to the alpha-amino group (imino group when participating in a peptide bond) of an amino acid at any other location within the protein. Similarly, the term “carboxy terminus” (abbreviated C-terminus) refers to the free carboxyl group on the amino acid at the carboxy terminus of a protein, or to the carboxyl group of an amino acid at any other location within the protein.
- Typically, the amino acids making up a protein are numbered in order, starting at the amino terminal and increasing in the direction toward the carboxy terminal of the protein. Thus, when one amino acid is said to “follow” another, that amino acid is positioned closer to the carboxy terminal of the protein than the preceding amino acid.
- The term “residue” is used herein to refer to an amino acid (D or L) or an amino acid mimetic that is incorporated into a protein by an amide bond. As such, the amino acid may be a naturally occurring amino acid or, unless otherwise limited, may encompass known analogs of natural amino acids that function in a manner similar to the naturally occurring amino acids (i.e., amino acid mimetics). Moreover, an amide bond mimetic includes peptide backbone modifications well known to those skilled in the art.
- Furthermore, one of skill will recognize that, as mentioned above, individual substitutions, deletions or additions which alter, add or delete a single amino acid or a small percentage of amino acids (typically less than about 5%, more typically less than about 1%) in an encoded sequence are conservatively modified variations where the alterations result in the substitution of an amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino acids are well known in the art. The following six groups each contain amino acids that are conservative substitutions for one another:
- 1) Alanine (A), Serine (S), Threonine (T);
- 2) Aspartic acid (D), Glutamic acid (E);
- 3) Asparagine (N), Glutamine (Q);
- 4) Arginine (R), Lysine (K);
- 5) Isoleucine (I), Leucine (L), Methionine (M), Valine (V); and
- 6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).
- A conservative substitution is a substitution in which the substituting amino acid (naturally occurring or modified) is structurally related to the amino acid being substituted, i.e., has about the same size and electronic properties as the amino acid being substituted. Thus, the substituting amino acid would have the same or a similar functional group in the side chain as the original amino acid. A “conservative substitution” also refers to utilizing a substituting amino acid which is identical to the amino acid being substituted except that a functional group in the side chain is protected with a suitable protecting group.
- Suitable protecting groups are described in Green and Wuts, “Protecting Groups in Organic Synthesis”, John Wiley and Sons, Chapters 5 and 7, 1991, the teachings of which are incorporated herein by reference. Preferred protecting groups are those which facilitate transport of the peptide through membranes, for example, by reducing the hydrophilicity and increasing the lipophilicity of the peptide, and which can be cleaved, either by hydrolysis or enzymatically (Ditter et al., 1968. J. Pharm. Sci. 57:783; Ditter et al., 1968. J. Pharm. Sci. 57:828; Ditter et al., 1969. J. Pharm. Sci. 58:557; King et al., 1987. Biochemistry 26:2294; Lindberg et al., 1989. Drug Metabolism and Disposition 17:311; Tunek et al., 1988. Biochem. Pharm. 37:3867; Anderson et al., 1985 Arch. Biochem. Biophys. 239:538; and Singhal et al., 1987. FASEB J. 1:220). Suitable hydroxyl protecting groups include ester, carbonate and carbamate protecting groups. Suitable amine protecting groups include acyl groups and alkoxy or aryloxy carbonyl groups, as described above for N-terminal protecting groups. Suitable carboxylic acid protecting groups include aliphatic, benzyl and aryl esters, as described below for C-terminal protecting groups. In one embodiment, the carboxylic acid group in the side chain of one or more glutamic acid or aspartic acid residues in a peptide of the present invention is protected, preferably as a methyl, ethyl, benzyl or substituted benzyl ester, more preferably as a benzyl ester.
- Provided below are groups of naturally occurring and modified amino acids in which each amino acid in a group has similar electronic and steric properties. Thus, a conservative substitution can be made by substituting an amino acid with another amino acid from the same group. It is to be understood that these groups are non-limiting, i.e. that there are additional modified amino acids which could be included in each group.
- Group I includes leucine, isoleucine, valine, methionine and modified amino acids having the following side chains: ethyl, n-propyl n-butyl. Preferably, Group I includes leucine, isoleucine, valine and methionine.
- Group II includes glycine, alanine, valine and a modified amino acid having an ethyl side chain. Preferably, Group II includes glycine and alanine.
- Group III includes phenylalanine, phenylglycine, tyrosine, tryptophan, cyclohexylmethyl glycine, and modified amino residues having substituted benzyl or phenyl side chains. Preferred substituents include one or more of the following: halogen, methyl, ethyl, nitro, —NH2, methoxy, ethoxy and —CN. Preferably, Group III includes phenylalanine, tyrosine and tryptophan.
- Group IV includes glutamic acid, aspartic acid, a substituted or unsubstituted aliphatic, aromatic or benzylic ester of glutamic or aspartic acid (e.g., methyl, ethyl, n-propyl iso-propyl, cyclohexyl, benzyl or substituted benzyl), glutamine, asparagine, —CO—NH— alkylated glutamine or asparagines (e.g., methyl, ethyl, n-propyl and iso-propyl) and modified amino acids having the side chain —CH2)3—COOH, an ester thereof (substituted or unsubstituted aliphatic, aromatic or benzylic ester), an amide thereof and a substituted or unsubstituted N-alkylated amide thereof. Preferably, Group IV includes glutamic acid, aspartic acid, methyl aspartate, ethyl aspartate, benzyl aspartate and methyl glutamate, ethyl glutamate and benzyl glutamate, glutamine and asparagine.
- Group V includes histidine, lysine, ornithine, arginine, N-nitroarginine, β-cycloarginine, γ-hydroxyarginine, N-amidinocitruline and 2-amino-4-guanidinobutanoic acid, homologs of lysine, homologs of arginine and homologs of ornithine. Preferably, Group V includes histidine, lysine, arginine and ornithine. A homolog of an amino acid includes from 1 to about 3 additional or subtracted methylene units in the side chain.
- Group VI includes serine, threonine, cysteine and modified amino acids having C1-C5 straight or branched alkyl side chains substituted with —OH or —SH, for example, —CH2CH2OH, —CH2CH2CH2OH or —CH2CH2OHCH3. Preferably, Group VI includes serine, cysteine or threonine.
- In another aspect, suitable substitutions for amino acid residues include “severe” substitutions. A “severe substitution” is a substitution in which the substituting amino acid (naturally occurring or modified) has significantly different size and/or electronic properties compared with the amino acid being substituted. Thus, the side chain of the substituting amino acid can be significantly larger (or smaller) than the side chain of the amino acid being substituted and/or can have functional groups with significantly different electronic properties than the amino acid being substituted. Examples of severe substitutions of this type include the substitution of phenylalanine or cyclohexylmethyl glycine for alanine, isoleucine for glycine, a D amino acid for the corresponding L amino acid, or —NH—CH[(—CH2)5—COOH]—CO— for aspartic acid. Alternatively, a functional group may be added to the side chain, deleted from the side chain or exchanged with another functional group. Examples of severe substitutions of this type include adding of valine, leucine or isoleucine, exchanging the carboxylic acid in the side chain of aspartic acid or glutamic acid with an amine, or deleting the amine group in the side chain of lysine or ornithine. In yet another alternative, the side chain of the substituting amino acid can have significantly different steric and electronic properties that the functional group of the amino acid being substituted. Examples of such modifications include tryptophan for glycine, lysine for aspartic acid and —(CH2)4COOH for the side chain of serine. These examples are not meant to be limiting.
- In another embodiment, for example in the synthesis of a peptide 26 amino acids in length, the individual amino acids may be substituted according in the following manner:
- AA1 is serine, glycine, alanine, cysteine or threonine;
- AA2 is alanine, threonine, glycine, cysteine or serine;
- AA3 is valine, arginine, leucine, isoleucine, methionine, omithine, lysine, N-nitroarginine, β-cycloarginine, γ-hydroxyarginine, N-amidinocitruline or 2-amino-4-guanidinobutanoic acid;
- AA4 is proline, leucine, valine, isoleucine or methionine;
- AA5 is tryptophan, alanine, phenylalanine, tyrosine or glycine;
- AA6 is serine, glycine, alanine, cysteine or threonine;
- AA7 is proline, leucine, valine, isoleucine or methionine;
- AA8 is alanine, threonine, glycine, cysteine or serine;
- AA9 is alanine, threonine, glycine, cysteine or serine;
- AA10 is leucine, isoleucine, methionine or valine;
- AA11 is serine, glycine, alanine, cysteine or threonine;
- AA12 is leucine, isoleucine, methionine or valine;
- AA13 is leucine, isoleucine, methionine or valine;
- AA14 is glutamine, glutamic acid, aspartic acid, asparagine, or a substituted or unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid;
- AA15 is arginine, N-nitroarginine, β-cycloarginine, γ-hydroxy-arginine, N-amidinocitruline or 2-amino4-guanidino-butanoic acid
- AA16 is proline, leucine, valine, isoleucine or methionine;
- AA17 is serine, glycine, alanine, cysteine or threonine;
- AA18 is glutamic acid, aspartic acid, asparagine, glutamine or a substituted or unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid;
- AA19 is aspartic acid, asparagine, glutamic acid, glutamine, leucine, valine, isoleucine, methionine or a substituted or unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid;
- AA20 is valine, arginine, leucine, isoleucine, methionine, ornithine, lysine, N-nitroarginine, β-cycloarginine, γ-hydroxyarginine, N-amidinocitruline or 2-amino-4-guanidinobutanoic acid;
- AA21 is alanine, threonine, glycine, cysteine or serine;
- AA22 is alanine, threonine, glycine, cysteine or serine;
- AA23 is histidine, serine, threonine, cysteine, lysine or ornithine;
- AA24 is threonine, aspartic acid, serine, glutamic acid or a substituted or unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid;
- AA25 is asparagine, aspartic acid, glutamic acid, glutamine, leucine, valine, isoleucine, methionine or a substituted or unsubstituted aliphatic or aryl ester of glutamic acid or aspartic acid; and
- AA26 is cysteine, histidine, serine, threonine, lysine or ornithine.
- It is to be understood that these amino acid substitutions may be made for longer or shorter peptides than the 26 mer in the preceding example above, and for proteins.
- In one embodiment of the present invention, codons for the first several N-terminal amino acids of the transposase are modified such that the third base of each codon is changed to an A or a T without changing the corresponding amino acid. It is preferable that between approximately 1 and 20, more preferably 3 and 15, and most preferably between 4 and 12 of the first N-terminal codons of the gene of interest are modified such that the third base of each codon is changed to an A or a T without changing the corresponding amino acid. In one embodiment, the first ten N-terminal codons of the gene of interest are modified in this manner.
- When several desired proteins, protein fragments or peptides are encoded in the gene of interest to be incorporated into the genome, one of skill in the art will appreciate that the proteins, protein fragments or peptides may be separated by a spacer molecule such as, for example, a peptide, consisting of one or more amino acids. Generally, the spacer will have no specific biological activity other than to join the desired proteins, protein fragments or peptides together, or to preserve some minimum distance or other spatial relationship between them. However, the constituent amino acids of the spacer may be selected to influence some property of the molecule such as the folding, net charge, or hydrophobicity. The spacer may also be contained within a nucleotide sequence with a purification handle or be flanked by cleavage sites, such as proteolytic cleavage sites.
- Such polypeptide spacers may have from about 5 to about 40 amino acid residues. The spacers in a polypeptide are independently chosen, but are preferably all the same. The spacers should allow for flexibility of movement in space and are therefore typically rich in small amino acids, for example, glycine, serine, proline or alanine. Preferably, peptide spacers contain at least 60%, more preferably at least 80% glycine or alanine. In addition, peptide spacers generally have little or no biological and antigenic activity. Preferred spacers are (Gly-Pro-Gly-Gly)x (SEQ ID NO:26) and (Gly4-Ser)y, wherein x is an integer from about 3 to about 9 and y is an integer from about 1 to about 8. Specific examples of suitable spacers include
(Gly-Pro-Gly-Gly)3 SEQ ID NO:27 Gly Pro Gly Gly Gly Pro Gly Gly Gly Pro Gly Gly (Gly4-Ser)3 SEQ ID NO:28 Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser or (Gly4-Ser)4 SEQ ID NO:29 Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser. - Nucleotide sequences encoding for the production of residues which may be useful in purification of the expressed recombinant protein may also be built into the vector. Such sequences are known in the art and include the glutathione binding domain from glutathione S-transferase, polylysine, hexa-histidine or other cationic amino acids, thioredoxin, hemagglutinin antigen and maltose binding protein.
- Additionally, nucleotide sequences may be inserted into the gene of interest to be incorporated so that the protein or peptide can also include from one to about six amino acids that create signals for proteolytic cleavage. In this manner, if a gene is designed to make one or more peptides or proteins of interest in the transgenic animal, specific nucleotide sequences encoding for amino acids recognized by enzymes may be incorporated into the gene to facilitate cleavage of the large protein or peptide sequence into desired peptides or proteins or both. For example, nucleotides encoding a proteolytic cleavage site can be introduced into the gene of interest so that a signal sequence can be cleaved from a protein or peptide encoded by the gene of interest. Nucleotide sequences encoding other amino acid sequences which display pH sensitivity or chemical sensitivity may also be added to the vector to facilitate separation of the signal sequence from the peptide or protein of interest.
- Proteolytic cleavage sites include cleavage sites recognized by exopeptidases such as carboxypeptidase A, carboxypeptidase B, aminopeptidase I, and dipeptidylaminopeptidase; endopeptidases such as trypsin, V8-protease, enterokinase, factor Xa, collagenase, endoproteinase, subtilisin, and thombin; and proteases such as Protease 3C IgA protease (Igase) Rhinovirus 3C(preScission)protease. Chemical cleavage sites are also included in the defintion of cleavage site as used herein. Chemical cleavage sites include, but are not limited to, site cleaved by cyanogen bromide, hydroxylamine, formic acid, and acetic acid.
- In one embodiment of the present invention, a TAG sequence is linked to the gene of interest. The TAG sequence serves three purposes: 1) it allows free rotation of the peptide or protein to be isolated so there is no interference from the native protein or signal sequence, i.e. vitellogenin, 2) it provides a “purification handle” to isolate the protein using column purification, and 3) it includes a cleavage site to remove the desired protein from the signal and purification sequences. Accordingly, as used herein, a TAG sequence includes a spacer sequence, a purification handle and a cleavage site. The spacer sequences in the TAG proteins contain one or more repeats shown in SEQ ID NO:30. A preferred spacer sequence comprises the sequence provided in SEQ ID NO:31. One example of a purification handle is the gp41 hairpin loop from HIV I. Exemplary gp41 polynucleotide and polypeptide sequences are provided in SEQ ID NO:32 and SEQ ID NO:33, respectively. However, it should be understood that any antigenic region may be used as a purification handle, including any antigenic region of gp41. Preferred purification handles are those that elicit highly specific antibodies. Additionally, the cleavage site can be any protein cleavage site known to one of ordinary skill in the art and includes an enterokinase cleavage site comprising the Asp Asp Asp Asp Lys sequence (SEQ ID NO:34) and a furin cleavage site. Constructs containing a TAG sequence are shown in FIGS. 2 and 3. In one embodiment of the present invention, the TAG sequence comprises a polynucleotide sequence of SEQ ID NO:35.
- In addition to the transposon-based vectors described above, the present invention also includes methods of administering the transposon-based vectors to an animal, methods of producing a transgenic animal wherein a gene of interest is incorporated into the germline of the animal and methods of producing a transgenic animal wherein a gene of interest is incorporated into cells other than the germline cells (somatic cells) of the animal. The transposon-based vectors of the present invention are administered to a reproductive organ of an animal via any method known to those of skill in the art. Preferred reproductive organs include an ovary, an oviduct, a mammary gland, and a fallopian tube.
- In some embodiments, a transposon-based vector is directly administered to the reproductive organ. Direct administration encompasses injection into the organ, and in a preferred embodiment, a transposon-based vector is injected into the lumen of the oviduct, and more preferably, the lumen of the magnum or the infindibulum of the oviduct. The transposon-based vectors may additionally or alternatively be placed in an artery supplying the reproductive organ. Administering the vectors to the artery supplying the ovary results in transfection of follicles and oocytes in the ovary to create a germline transgenic animal. Alternatively, supplying the vectors through an artery leading to the oviduct would preferably transfect the tubular gland and epithelial cells. Such transfected cells could manufacture a desired protein or peptide for deposition in the egg white. In one embodiment, a transposon-based vector is administered into the lumen of the magnum or the infundibulum of the oviduct and to an artery supplying the oviduct. Indirect administration to the oviduct epithelium may occur through the cloaca. Direct administration into the mammary gland comprises introduction into the duct system of the mammary gland.
- Administration of transposon-based vectors may occur in arteries supplying the ovary and or through direct intrathecal administration into the ovary through injection.
- The transposon-based vectors may be administered in a single administration, multiple administrations, continuously, or intermittently. The transposon-based vectors may be administered by injection, via a catheter, an osmotic mini-pump or any other method. In some embodiments, the transposon-based vector is administered to an animal in multiple administrations, each administration containing the vector and a different transfecting reagent.
- The transposon-based vectors may be administered to the animal at any point during the lifetime of the animal, however, it is preferable that the vectors are administered prior to the animal reaching sexual maturity. The transposon-based vectors are preferably administered to a chicken between approximately 14 and 16 weeks of age and to a quail between approximately 5 and 10 weeks of age, more preferably 5 and 8 weeks of age, and most preferably between 5 and 6 weeks of age, when standard poultry rearing practices are used. The vectors may be administered at earlier ages when exogenous hormones are used to induce early sexual maturation in the bird. In some embodiments, the transposon-based vector is administered to an animal following an increase in proliferation of the oviduct epithelial cells and/or the tubular gland cells. Such an increase in proliferation normally follows an influx of reproductive hormones in the area of the oviduct. When the animal is an avian, the transposon-based vector is administered following an increase in proliferation of the oviduct epithelial cells and before the avian begins to produce egg white constituents.
- In a preferred embodiment, the animal is an egg-laying animal, and more preferably, an avian. In one embodiment, between approximately 1 and 150 μg, 1 and 100 μg, 1 and 50 μg, preferably between 1 and 20 μg, and more preferably between 5 and 10 μg of transposon-based vector DNA is administered to the oviduct of a bird. Optimal ranges depend upon the type of bird and the bird's stage of sexual maturity. In a chicken, it is preferred that between approximately 1 and 100 μg, or 5 and 50 μg are administered. In a quail, it is preferred that between approximately 5 and 10 μg are administered. Intraoviduct administration of the transposon-based vectors of the present invention result in incorporation of the gene of interest into the cells of the oviduct as evidenced by a PCR positive signal in the oviduct tissue. In other embodiments, the transposon-based vector is administered to an artery that supplies the oviduct. These methods of administration may also be combined with any methods for facilitating transfection, including without limitation, electroporation, gene guns, injection of naked DNA, and use of dimethyl sulfoxide (DMSO).
- According to the present invention, the transposon-based vector is administered in conjunction with an acceptable carrier and/or transfection reagent. Acceptable carriers include, but are not limited to, water, saline, Hanks Balanced Salt Solution (HBSS), Tris-EDTA (TE) and lyotropic liquid crystals. Transfection reagents commonly known to one of ordinary skill in the art that may be employed include, but are not limited to, the following: cationic lipid transfection reagents, cationic lipid mixtures, polyamine reagents, liposomes and combinations thereof; SUPERFECT®, Cytofectene, BioPORTER®, GenePORTER®, NeuroPORTER®, and perfectin from Gene Therapy Systems; lipofectamine, cellfectin, DMRIE-C oligofectamine, TROJENE® and PLUS reagent from InVitrogen; Xtreme gene, fugene, DOSPER and DOTAP from Roche; Lipotaxi and Genejammer from Strategene; and Escort from SIGMA. In one embodiment, the transfection reagent is SUPERFECT®. The ratio of DNA to transfection reagent may vary based upon the method of administration. In one embodiment, the transposon-based vector is administered to the oviduct and the ratio of DNA to transfection reagent can be from 1:1.5 to 1:15, preferably 1:2 to 1:5, all expressed as wt/vol. Transfection may also be accomplished using other means known to one of ordinary skill in the art, including without limitation electroporation, gene guns, injection of naked DNA, and use of dimethyl sulfoxide (DMSO).
- Depending upon the cell or tissue type targeted for transfection, the form of the transposon-based vector may be important. Plasmids harvested from bacteria are generally closed circular supercoiled molecules, and this is the preferred state of a vector for gene delivery because of the ease of preparation. In some instances, transposase expression and insertion may be more efficient in a relaxed, closed circular configuration or in a linear configuration. In still other instances, a purified transposase protein may be co-injected with a transposon-based vector containing the gene of interest for more immediate insertion. This could be accomplished by using a transfection reagent complexed with both the purified transposase protein and the transposon-based vector.
- Following administration of a transposon-based vector to an animal, DNA is extracted from the animal to confirm integration of the gene of interest. Advantages provided by the present invention include the high rates of integration, or incorporation, and transcription of the gene of interest when administered to a bird via an intraoviduct or intraovarian route (including intraarterial administrations to arteries leading to the oviduct or ovary). Example 6 below describes isolation of a proinsulin/ENT TAG protein from a transgenic hen following ammonium sulfate precipitation and ion exchange chromatography. FIG. 5 demonstrates successful administration of a transposon-based vector to a hen, successful integration of the gene of interest, successful production of a protein encoded by the gene of interest, and successful deposition of the protein in egg white produced by the transgenic hen.
- Actual frequencies of integration may be estimated both by comparative strength of the PCR signal, and by histological evaluation of the tissues by quantitative PCR. Another method for estimating the rate of transgene insertion is the so-called primed in situ hybridization technique (PRINS). This method determines not only which cells carry a transgene of interest, but also into which chromosome the gene has inserted, and even what portion of the chromosome. Briefly, labeled primers are annealed to chromosome spreads (affixed to glass slides) through one round of PCR, and the slides are then developed through normal in situ hybridization procedures. This technique combines the best features of in situ PCR and fluorescence in situ hybridization (FISH) to provide distinct chromosome location and copy number of the gene in question.
- Breeding experiments are also conducted to determine if germline transmission of the transgene has occurred. In a general bird breeding experiment performed according to the present invention, each male bird was exposed to 2-3 different adult female birds for 3-4 days each. This procedure was continued with different females for a total period of 6-12 weeks. Eggs ae collected daily for up to 14 days after the last exposure to the transgenic male, and each egg is incubated in a standard incubator. The resulting embryos are examined for transgene presence at day 3 or 4 using PCR. It is to be understood that the above procedure can be modified to suit animals other than birds and that selective breeding techniques may be performed to amplify gene copy numbers and protein output.
- In one embodiment, the transposon-based vectors of the present invention may be administered to a bird for production of desired proteins or peptides in the egg white. These transposon-based vectors preferably contain one or more of an ovalbumin promoter, an ovomucoid promoter, an ovalbumin signal sequence and an ovomucoid signal sequence. Oviduct-specific ovalbumin promoters are described in B. O'Malley et al., 1987. EMBO J., vol. 6, pp. 2305-12; A. Qiu et al., 1994. Proc. Nat. Acad. Sci. (USA), vol. 91, pp. 4451-4455; D. Monroe et al., 2000. Biochim. Biophys. Acta, 1517 (1):27-32; H. Park et al., 2000. Biochem., 39:8537-8545; and T. Muramatsu et al., 1996. Poult. Avian Biol. Rev., 6:107-123. Examples of transposon-based vectors designed for production of a desired protein in an egg white are shown in FIGS. 2 and 3.
- The present invention is particularly advantageous for production of recombinant peptides and proteins of low solubility in the egg yolk. Such proteins include, but are not limited to, membrane-associated or membrane-bound proteins, lipophilic compounds; attachment factors, receptors, and components of second messenger transduction machinery. Low solubility peptides and proteins are particularly challenging to produce using conventional recombinant protein production techniques (cell and tissue cultures) because they aggregate in water-based, hydrophilic environments. Such aggregation necessitates denaturation and re-folding of the recombinantly-produced proteins, which may deleteriously affect their structure and function. Moreover, even highly soluble recombinant peptides and proteins may precipitate and require denaturation and renaturation when produced in sufficiently high amounts in recombinant protein production systems. The present invention provides an advantageous resolution of the problem of protein and peptide solubility during production of large amounts of recombinant proteins.
- In one embodiment of the present invention wherein germline transfection is obtained via intraovarian administration of the transposon-based vector, deposition of a desired protein into the egg yolk is accomplished in offspring by attaching a sequence encoding a protein capable of binding to the yolk vitellogenin receptor to a gene of interest that encodes a desired protein. This transposon-based vector can be used for the receptor-mediated uptake of the desired protein by the oocytes. In a preferred embodiment, the sequence ensuring the binding to the vitellogenin receptor is a targeting sequence of a vitellogenin protein. The invention encompasses various vitellogenin proteins and their targeting sequences. In a preferred embodiment, a chicken vitellogenin protein targeting sequence is used, however, due to the high degree of conservation among vitellogenin protein sequences and known cross-species reactivity of vitellogenin targeting sequences with their egg-yolk receptors, other vitellogenin targeting sequences can be substituted. One example of a construct for use in the transposon-based vectors of the present invention and for deposition of an insulin protein in an egg yolk is a transposon-based vector containing a vitellogenin promoter, a vitellogenin targeting sequence, a TAG sequence, a pro-insulin sequence and a synthetic polyA sequence. The present invention includes, but is not limited to, vitellogenin targeting sequences residing in the N-terminal domain of vitellogenin, particularly in lipovitellin I. In one embodiment, the vitellogenin targeting sequence contains the polynucleotide sequence of SEQ ID NO:22. In a preferred embodiment, the transposon-based vector contains a transposase gene operably-linked to a constitutive promoter and a gene of interest operably-linked to a liver-specific promoter and a vitellogenin targeting sequence.
- For large-scale production of protein, an animal breeding stock that is homozygous for the transgene is preferred. Such homozygous individuals are obtained and identified through, for example, standard animal breeding procedures or PCR protocols.
- Once expressed, peptides, polypeptides and proteins can be purified according to standard procedures known to one of ordinary skill in the art, including ammonium sulfate precipitation, affinity columns, column chromatography, gel electrophoresis, high performance liquid chromatography, immunoprecipitation and the like. Substantially pure compositions of about 50 to 99% homogeneity are preferred, and 80 to 95% or greater homogeneity are most preferred for use as therapeutic agents.
- In one embodiment of the present invention, the animal in which the desired protein is produced is an egg-laying animal. In a preferred embodiment of the present invention, the animal is an avian and a desired peptide, polypeptide or protein is isolated from an egg white. Egg white containing the exogenous protein or peptide is separated from the yolk and other egg constituents on an industrial scale by any of a variety of methods known in the egg industry. See, e.g., W. Stadelman et al. (Eds.), Egg Science & Technology, Haworth Press, Binghamton, N.Y. (1995). Isolation of the exogenous peptide or protein from the other egg white constituents is accomplished by any of a number of polypeptide isolation and purification methods well known to one of ordinary skill in the art. These techniques include, for example, chromatographic methods such as gel permeation, ion exchange, affinity separation, metal chelation, HPLC, and the like, either alone or in combination. Another means that may be used for isolation or purification, either in lieu of or in addition to chromatographic separation methods, includes electrophoresis. Successful isolation and purification is confirmed by standard analytic techniques, including HPLC, mass spectroscopy, and spectrophotometry. These separation methods are often facilitated if the first step in the separation is the removal of the endogenous ovalbumin fraction of egg white, as doing so will reduce the total protein content to be further purified by about 50%.
- To facilitate or enable purification of a desired protein or peptide, transposon-based vectors may include one or more additional epitopes or domains. Such epitopes or domains include DNA sequences encoding enzymatic or chemical cleavage sites including, but not limited to, an enterokinase cleavage site; the glutathione binding domain from glutathione S-transferase; polylysine; hexa-histidine or other cationic amino acids; thioredoxin; hemagglutinin antigen; maltose binding protein; a fragment of gp41 from HIV; and other purification epitopes or domains commonly known to one of skill in the art.
- In one representative embodiment, purification of desired proteins from egg white utilizes the antigenicity of the ovalbumin carrier protein and particular attributes of a TAG linker sequence that spans ovalbumin and the desired protein. The TAG sequence is particularly useful in this process because it contains 1) a highly antigenic epitope, a fragment of gp41 from HIV, allowing for stringent affinity purification, and, 2) a recognition site for the protease enterokinase immediately juxtaposed to the desired protein. In a preferred embodiment, the TAG sequence comprises approximately 50 amino acids. A representative TAG sequence is provided below.
- Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp AspAla Thr Thr Cys Ile Leu Lys Gly Ser Cys Gly Trp, Ile Gly Leu Leu Asp Asp Asp Asp Lys (SEQ ID NO:35)
- The underlined sequences were taken from the hairpin loop domain of HIV gp-41 (SEQ ID NO:33). Sequences in italics represent the cleavage site for enterokinase (SEQ ID NO:34). The spacer sequence upstream of the loop domain was made from repeats of (Pro Ala Asp Asp Ala) (SEQ ID NO:31) to provide free rotation and promote surface availability of the hairpin loop from the ovalbumin carrier protein.
- Isolation and purification of a desired protein is performed as follows:
- 1. Enrichment of the egg white protein fraction containing ovalbumin and the transgenic ovalbumin-TAG-desired protein.
- 2. Size exclusion chromatography to isolate only those proteins within a narrow range of molecular weights (a further enrichment of step 1).
- 3. Ovalbumin affinity chromatography. Highly specific antibodies to ovalbumin will eliminate virtually all extraneous egg white proteins except ovalbumin and the transgenic ovalbumin-TAG-desired protein.
- 4. gp41 affinity chromatography using anti-gp41 antibodies. Stringent application of this step will result in virtually pure transgenic ovalbumin-TAG-desired protein.
- 5. Cleavage of the transgene product can be accomplished in at least one of two ways:
- a. The transgenic ovalbumin-TAG-desired protein is left attached to the gp41 affinity resin (beads) from step 4 and the protease enterokinase is added. This liberates the transgene target protein from the gp41 affinity resin while the ovalbumin-TAG sequence is retained. Separation by centrifugation (in a batch process) or flow through (in a column purification), leaves the desired protein together with enterokinase in solution. Enterokinase is recovered and reused.
- b. Alternatively, enterokinase is immobilized on resin (beads) by the addition of poly-lysine moieties to a non-catalytic area of the protease. The transgenic ovalbumin-TAG-desired protein eluted from the affinity column of step 4 is then applied to the protease resin. Protease action cleaves the ovalbumin-TAG sequence from the desired protein and leaves both entities in solution. The immobilized enterokinase resin is recharged and reused.
- c. The choice of these alternatives is made depending upon the size and chemical composition of the transgene target protein.
- 6. A final separation of either of these two (5a or 5b) protein mixtures is made using size exclusion, or enterokinase affinity chromatography. This step allows for desalting, buffer exchange and/or polishing, as needed.
- Cleavage of the transgene product (ovalbumin-TAG-desired protein) by enterokinase, then, results in two products: ovalbumin-TAG and the desired protein. More specific methods for isolation using the TAG label is provided in the Examples. Some desired proteins may require additions or modifications of the above-described approach as known to one of ordinary skill in the art. The method is scaleable from the laboratory bench to pilot and production facility largely because the techniques applied are well documented in each of these settings.
- In another representative embodiment, egg whites containing a protein of interest were pooled and separated, in any order, from the yolks and other egg constituents by methods known to one skilled in the art. A variety of such methods is described in manuals known in the art, such asEgg Science & Technology, W. Stadelman, et al. (Eds.), Haworth Press, Binghamton, N.Y. (1995).
- One non-limiting example of a method for isolating a desired peptide, polypeptide or protein from an egg white is as follows. It is to be understood that this method may be employed to isolate any desired peptide, polypeptide or protein from the eggs of transgenic animals of the present invention. This present example involved transgenes that used a portion of or the entire ovalbumin protein, or specific ovalbumin epitopes, as a carrier, linked to the protein of interest via the specified TAG sequence, or another affinity/cleavage sequence. The TAG sequence contains the hairpin loop epitope from HIV I followed by an enterokinase cleavage site.
- First, the viscosity of the egg white was lowered by subjecting the egg white to low shear forces of 3140 cps (Tung et al., 1969). The resulting pourable solution was then filtered to remove chalazae. An ammonium sulfate precipitation was then used to enrich the fraction of transgenic protein (see, for example,Practical Protein Chemistry A Handbook A. Darbre (Ed.), John Wiley & Sons Ltd., 1986). Other methods of crude fractionation known in the art are also used as needed. The supernatant of this separation was then fractionated using size-exclusion chromatography, further enriching the transgenic fusion protein fraction and eliminating the ammonium sulfate from the material. The fusion protein was isolated by anti-ovalbumin affinity chromatography (batch or column) using methods known to one skilled in the art. This step may capture native ovalbumin in addition to an ovalbumin-transgene fusion protein. After elution from the anti-ovalbumin affinity resin, the transgenic protein was specifically isolated using anti-gp41 affinity chromatography (batch or column) using methods known to one skilled in the art.
- Cleavage of the transgene product from the carrier and the TAG sequences was accomplished in one of at least two ways:
- 1) The transgenic ovalbumin-TAG-transgene target protein was left attached to the gp41 affinity resin and the protease enterokinase was added. Cleavage of the transgene by enterokinase liberated the transgene target protein from the gp41 affinity resin while the ovalbumin-TAG sequence was retained. Separation by centrifugation (in a batch process) or flow through (in a column purification), kept the transgene target protein together with enterokinase in solution. Enterokinase was recovered and reused.
- 2) Alternatively, enterokinase was immobilized on resin (beads) by the addition of poly-lysine moieties to a non-catalytic area of the protease. The transgenic ovalbumin-TAG-transgene target protein was eluted from the gp41 affinity chromatography resin and then applied to the protease resin. Protease action cleaved the ovalbumin-TAG sequence from the transgene target protein and left both entities in solution. The immobilized enterokinase resin was recharged and reused. The choice between these alternatives is made on a case-by case basis, depending upon the size and chemical composition of the transgene target protein.
- A final separation of either of these two (process 1 or 2) protein mixtures was made using size exclusion chromatography, or enterokinase affinity chromatography. This step also allows for desalting, concentrating, buffer exchange and/or polishing, as needed.
- It is believed that a typical chicken egg produced by a transgenic animal of the present invention will contain at least 0.001 mg, from about 0.001 to 1.0 mg, or from about 0.001 to 100.0 mg of exogenous protein, peptide or polypeptide, in addition to the normal constituents of egg white (or possibly replacing a small fraction of the latter). In some embodiments, a chicken egg will contain between 50 and 75 mg of exogenous protein.
- One of skill in the art will recognize that after biological expression or purification, the desired proteins, fragments thereof and peptides may possess a conformation substantially different than the native conformations of the proteins, fragments thereof and peptides. In this case, it is often necessary to denature and reduce protein and then to cause the protein to re-fold into the preferred conformation. Methods of reducing and denaturing proteins and inducing re-folding are well known to those of skill in the art.
- In addition to methods of producing eggs containing transgenic proteins or peptides, the present invention encompasses methods for the production of milk containing transgenic proteins or peptides. These methods include the administration of a transposon-based vector described above to a mammal through the duct system. In one embodiment, the transposon-based vector contains a transposase operably-linked to a constitutive promoter and a gene of interest operably-linked to mammary specific promoter. Genes of interest can include, but are not limited to antiviral and antibacterial proteins and immunoglobulins. In other embodiments, a transposon-based vector is administered to the ovary of an animal and gerrnline transformation is obtained. In these embodiments, offspring of the transfected animal express a gene of interest in the mammary gland under the control of a mammary gland-specific promoter.
- The following examples will serve to further illustrate the present invention without, at the same time, however, constituting any limitation thereof. On the contrary, it is to be clearly understood that resort may be had to various embodiments, modifications and equivalents thereof which, after reading the description herein, may suggest themselves to those skilled in the art without departing from the spirit of the invention.
- Quail or chicken were selected for administration of the transposon-based vectors of the present invention. Feathers were removed from the area where surgery was performed and the area was cleansed and sterilized by rinsing it with ethanol (alcohol) and 0.5% chlorhexidine. Using the scalpel, a dorsolateral incision was made through the skin over the ovary approximately 2 cm in length. Using blunt scissors, a second incision was made through the muscle between the last two ribs to expose the oviduct beneath. A small animal retractor was used to spread the last two ribs, exposing the oviduct beneath. The oviduct was further exposed using retractors to pull the intestines to one side.
- A delivery solution containing a transposon-based vector and SUPERFECT® was prepared fresh immediately before surgery. Specific ratios of vector and SUPERFECT® that were used in each experiment are provided in the Examples below. The delivery solution was warmed to room temperature prior to injection into the bird. Approximately 250-500 μ1 of the delivery solution was injected into the lumen of the magnum of the oviduct using a 1 cc syringe with a 27 gauge needle attached. The wound was closed and antibiotic cream liberally applied to the area surrounding the wound.
- A vector was designed for inserting a desired coding sequence into the genome of eukaryotic cells, given below as SEQ ID NO:3. The vector of SEQ ID NO:3, termed pTnMod, was constructed and its sequence verified.
- This vector employed a cytomegalovirus (CMV) promoter. A modified Kozak sequence (ACCATG) (SEQ ID NO:1) was added to the promoter. The nucleotide in the wobble position in nucleotide triplet codons encoding the first 10 amino acids of transposase was changed to an adenine (A) or thymine (T), which did not alter the amino acid encoded by this codon. Two stop codons were added and a synthetic polyA was used to provide a strong termination sequence. This vector uses a promoter designed to be active soon after entering the cell (without any induction) to increase the likelihood of stable integration. The additional stop codons and synthetic polyA insures proper termination without read through to potential genes downstream.
- The first step in constructing this vector was to modify the transposase to have the desired changes. Modifications to the transposase were accomplished with the primers High Efficiency forward primer (Hef) Altered transposase (ATS)-Hef 5′ ATCTCGAGACCATGTGTGAACTTGATATTTTACATGATTCTCTTTACC 3′ (SEQ ID NO:36) and Altered transposase-High efficiency reverse primer (Her) 5′ GATTGATCATTATCATAATTTCCCCAAAGCGTAACC 3′ (SEQ ID NO:37, a reverse complement primer). In the 5′ forward primer ATS-Hef, the sequence CTCGAG (SEQ ID NO:38) is the recognition site for the restriction enzyme Xho I, which permits directional cloning of the amplified gene. The sequence ACCATG (SEQ ID NO:1) contains the Kozak sequence and start codon for the transposase and the underlined bases represent changes in the wobble position to an A or T of codons for the first 10 amino acids (without changing the amino acid coded by the codon). Primer ATS-Her (SEQ ID NO:37) contains an additional stop codon TAA in addition to native stop codon TGA and adds a Bcl I restriction site, TGATCA (SEQ ID NO:39), to allow directional cloning. These primers were used in a PCR reaction with pTnLac (p defines plasmid, tn defines transposon, and lac defines the beta fragment of the lactose gene, which contains a multiple cloning site) as the template for the transposase and a FailSafe™ PCR System (which includes enzyme, buffers, dNTP's, MgCl2 and PCR Enhancer; Epicentre Technologies, Madison, Wis.). Amplified PCR product was electrophoresed on a 1% agarose gel, stained with ethidium bromide, and visualized on an ultraviolet transilluminator. A band corresponding to the expected size was excised from the gel and purified from the agarose using a Zymo Clean Gel Recovery Kit (Zymo Research, Orange, Calif.). Purified DNA was digested with restriction enzymes Xho 1 (5′) and Bcl 1 (3′) (New England Biolabs, Beverly, Mass.) according to the manufacturer's protocol. Digested DNA was purified from restriction enzymes using a Zymo DNA Clean and Concentrator kit (Zymo Research).
- Plasmid gWhiz (Gene Therapy Systems, San Diego, Calif.) was digested with restriction enzymes Sal I and BamH I (New England Biolabs), which are compatible with Xho I and Bcl I, but destroy the restriction sites. Digested gwhiz was separated on an agarose gel, the desired band excised and purified as described above. Cutting the vector in this manner facilitated directional cloning of the modified transposase (mATS) between the CMV promoter and synthetic polyA.
- To insert the mATS between the CMV promoter and synthetic polyA in gWhiz, a Stratagene T4 Ligase Kit (Stratagene, Inc. La Jolla, Calif.) was used and the ligation set up according to the manufacturer's protocol. Ligated product was transformed intoE. coli Top10 competent cells (Invitrogen Life Technologies, Carlsbad, Calif.) using chemical transformation according to Invitrogen's protocol. Transformed bacteria were incubated in 1 ml of SOC (GIBCO BRL, CAT#15544-042) medium for 1 hour at 37° C. before being spread to LB (Luria-Bertani media (broth or agar)) plates supplemented with 100 μg/ml ampicillin (LB/amp plates). These plates were incubated overnight at 37° C. and resulting colonies picked to LB/amp broth for overnight growth at 37° C. Plasmid DNA was isolated using a modified alkaline lysis protocol (Sambrook et al., 1989), electrophoresed on a 1% agarose gel, and visualized on a U.V. transilluminator after ethidium bromide staining. Colonies producing a plasmid of the expected size (approximately 6.4 kbp) were cultured in at least 250 ml of LB/amp broth and plasmid DNA harvested using a Qiagen Maxi-Prep Kit (column purification) according to the manufacturer's protocol (Qiagen, Inc., Chatsworth, Calif.). Column purified DNA was used as template for sequencing to verify the changes made in the transposase were the desired changes and no further changes or mutations occurred due to PCR amplification. For sequencing, Perkin-Elmer's Big Dye Sequencing Kit was used. All samples were sent to the Gene Probes and Expression Laboratory (LSU School of Veterinary Medicine) for sequencing on a Perkin-Elmer Model 377 Automated Sequencer.
- Once a clone was identified that contained the desired mATS in the correct orientation, primers CMVf-NgoM IV (5′ TTGCCGGCATCAGATTGGCTAT (SEQ ID NO:40); underlined bases denote a NgoM IV recognition site) and Syn-polyA-BstE II (5′ AGAGGTCACCGGGTCAATTCTTCAGCACCTGGTA (SEQ ID NO:41); underlined bases denote a BstE II recognition site) were used to PCR amplify the entire CMV promoter, mATS, and synthetic polyA for cloning upstream of the transposon in pTnLac. The PCR was conducted with FailSafe™ as described above, purified using the Zymo Clean and Concentrator kit, the ends digested with NgoM IV and BstE II (New England Biolabs), purified with the Zymo kit again and cloned upstream of the transposon in pTnLac as described below.
- Plasmid pTnLac was digested with NgoM IV and BstE II to remove the ptac promoter and transposase and the fragments separated on an agarose gel. The band corresponding to the vector and transposon was excised, purified from the agarose, and dephosphorylated with calf intestinal alkaline phosphatase (New England Biolabs) to prevent self-annealing. The enzyme was removed from the vector using a Zymo DNA Clean and Concentrator-5. The purified vector and CMVp/mATS/polyA were ligated together using a Stratagene T4 Ligase Kit and transformed intoE. coli as described above.
- Colonies resulting from this transformation were screened (mini-preps) as describe above and clones that were the correct size were verified by DNA sequence analysis as described above. The vector was given the name pTnMod (SEQ ID NO:3) and includes the following components:
- Base pairs 1-130 are a remainder of F1(−) on from pBluescriptII sk(−) (Stratagene), corresponding to base pairs 1-130 of pBluescriptII sk(−).
- Base pairs 131-132 are a residue from ligation of restriction enzyme sites used in constructing the vector.
- Base pairs 133-1777 are the CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems), corresponding to bp 229-1873 of pGWiz. The CMV promoter was modified by the addition of an ACC sequence upstream of ATG.
- Base pairs 1778-1779 are a residue from ligation of restriction enzyme sites used in constructing the vector.
- Base pairs 1780-2987 are the coding sequence for the transposase, modified from Tn10 (GenBank accession J01829) by optimizing codons for stability of the transposase mRNA and for the expression of protein. More specifically, in each of the codons for the first ten amino acids of the transposase, G or C was changed to A or T when such a substitution would not alter the amino acid that was encoded.
- Base pairs 2988-2993 are two engineered stop codons.
- Base pair 2994 is a residue from ligation of restriction enzyme sites used in constructing the vector.
- Base pairs 2995-3410 are a synthetic polyA sequence taken from the pGWiz vector (Gene Therapy Systems), corresponding to bp 1922-2337 of 10 pGWiz.
- Base pairs 3415-3718 are non-coding DNA that is residual from vector pNK2859.
- Base pairs 3719-3761 are non-coding λ DNA that is residual from pNK2859.
- Base pairs 3762-3831 are the 70 bp of the left insertion sequence recognized by the transposon Tn10.
- Base pairs 3832-3837 are a residue from ligation of restriction enzyme sites used in constructing the vector.
- Base pairs 3838-4527 are the multiple cloning site from pBluescriptII sk(20), corresponding to bp 924-235 of pBluescriptll sk(−). This multiple cloning site may be used to insert any coding sequence of interest into the vector.
- Base pairs 4528-4532 are a residue from ligation of restriction enzyme sites used in constructing the vector.
- Base pairs 4533-4602 are the 70 bp of the right insertion sequence recognized by the transposon Tn10.
- Base pairs 4603-4644 are non-coding λ DNA that is residual from pNK2859.
- Base pairs 4645-5488 are non-coding DNA that is residual from pNK2859.
- Base pairs 5489-7689 are from the pBluescriptII sk(−) base vector—(Stratagene, Inc.), corresponding to bp 761-2961 of pBluescriptII sk(−).
- Completing pTnMod is a pBlueScript backbone that contains a colE I origin of replication and an antibiotic resistance marker (ampicillin).
- It should be noted that all non-coding DNA sequences described above can be replaced with any other non-coding DNA sequence(s). Missing nucleotide sequences in the above construct represent restriction site remnants.
- All plasmid DNA was isolated by standard procedures. Briefly,Escherichia coli containing the plasmid was grown in 500 mL aliquots of LB broth (supplemented with an appropriate antibiotic) at 37° C. overnight with shaking. Plasmid DNA was recovered from the bacteria using a Qiagen Maxi-Prep kit (Qiagen, Inc., Chatsworth, Calif.) according to the manufacturer's protocol. Plasmid DNA was resuspended in 500 μL of PCR-grade water and stored at −20° C. until used.
- Another transposon-based vector was designed for inserting a desired coding sequence into the genome of eukaryotic cells. This vector was termed pTnMCS and its constituents are provided below. The sequence of the pTnMCS vector is provided in SEQ ID NO:2. The pTnMCS vector contains an avian optimized polyA sequence operably-linked to the transposase gene. The avian optimized polyA sequence contains approximately 40 nucleotides that precede the A nucleotide string.
- Bp 1-130 Remainder of F1 (−) ori of pBluescriptII sk(−) (Stratagene) bp1-130
- Bp 133-1777 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems) bp 229-1873
- Bp 1783-2991 Transposase, from Tn10 (GenBank accession #J01829) bp 108-1316
- Bp 2992-3344 Non coding DNA from vector pNK2859
- Bp 3345-3387 Lambda DNA from pNK2859
- Bp 3388-3457 70 bp of IS10 left from Tn10
- Bp 3464-3670 Multiple cloning site from pBluescriptII sk(−), thru the XmaI site bp 924-718
- Bp 3671-3715 Multiple cloning site from pBluescriptII sk(−), from the XmaI site thru the XhoI site. These base pairs are usually lost when cloning into pTnMCS bp 717-673
- Bp 3716-4153 Multiple cloning site from pBluescriptII sk(−), from the XhoI site bp 672-235
- Bp 4159-4228 70 bp of IS10 right from Tn10
- Bp 4229-4270 Lambda DNA from pNK2859
- Bp 4271-5114 Non-coding DNA from pNK2859
- Bp 5115-7315 pBluescript sk(−) base vector (Stratagene, Inc.) bp 761-2961.
- A vector was designed to insert a humsan proinsulin coding sequence under the control of a chicken ovalbumin promoter, and a ovalbumin gene including an ovalbumin signal sequence, into the genome of a bird given below as SEQ ID NO:42.
- Base pairs 1-130 are a remainder of F1(−) ori of pBluescriptII sk(−) (Stratagene) corresponding to base pairs 1-130 of pBluescriptll sk(−).
- Base pairs 133-1777 are a CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems) corresponding to base pairs 229-1873 of pGWiz.
- Base pairs 1780-2987 are a transposase, modified from Tn10 (GenBank accession number J01829).
- Base pairs 2988-2993 are two engineered stop codons.
- Base pairs 2995-3410 are a synthetic polyA from pGWiz (Gene Therapy Systems) corresponding to base pairs 1922-2337 of pGWiz.
- Base pairs 3415-3718 are non coding DNA that is residual from vector pNK2859.
- Base pairs 3719-3761 are λ DNA that is residual from pNK2859.
- Base pairs 3762-3831 are the 70 base pairs of the left insertion sequence (IS10) recognized by the transposon Tn10.
- Base pairs 3838-4044 are a multiple cloning site from pBluescriptII sk(−) corresponding to base pairs 924-718 of pBluescriptII sk(−).
- Base pairs 4050-4951 are a chicken ovalbumin promoter (including SDRE) that corresponds to base pairs 431-1332 of the chicken ovalbumin promoter in GenBank Accession Number J00895 M24999.
- Base pairs 4958-6115 are a chicken ovalbumin signal sequence and ovalbumin gene that correspond to base pairs 66-1223 of GenBank Accession Number V00383.1. (The STOP codon being omitted).
- Base pairs 6122-6271 are a TAG sequence containing a gp41 hairpin loop from HIV I, an enterokinase cleavage site and a spacer (synthetic).
- Base pairs 6272-6531 are a proinsulin gene.
- Base pairs 6539-6891 are a synthetic polyadenylation sequence from pGWiz (Gene Therapy Systems) corresponding to base pairs 1920-2272 of pGWiz.
- Base pairs 6897-7329 are a multiple cloning site from pBlueScriptII sk(−) corresponding to base pairs 667-235 of pBluescriptll sk(−).
- Base pairs 7335-7404 are the 70 base pairs of the right insertion sequence (IS10) recognized by the transposon Tn10.
- Base pairs 7405-7446 are λ DNA that is residual from pNK2859.
- Base pairs 7447-8311 are non coding DNA that is residual from pNK2859.
- Base pairs 8312-10512 are pBlueScript sk(−) base vector (Stratagene, Inc.) corresponding to base pairs 761-2961 of pBluescriptll sk(−).
- It should be noted that all non-coding DNA sequences described above can be replaced with any other non-coding DNA sequence(s). Missing nucleotide sequences in the above construct represent restriction site remnants.
- A vector was designed to insert a proinsulin coding sequence under the control of a quail ovalbumin promoter, and a ovalbumin gene including an ovalbumin signal sequence, into the genome of a bird given below as SEQ ID NO:43.
- Bp 1-4045 from vector pTnMod, bp 1-4045
- Bp 4051-5695 CMV promoter/enhancer taken from vector pGWIZ (Gene therapy systems), bp 230-1864
- Bp 5702-6855 Chicken ovalbumin gene taken from GenBank accession #V00383, bp 66-1219
- Bp 6862-7011 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 7012-7272 Human Proinsulin taken from GenBank accession #NM000207, bp 117-377
- Bp 7273-7317 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and pGWIZ (Gene Therapy Systems)
- Bp 7318-7670 Synthetic polyA from the cloning vector pGWIZ (Gene Therapy Systems), bp 1920-2271
- Bp 7672-11271 from cloning vector pTnMCS, bp 3716-7315
- Two experiments were conducted in Japanese quail using transpson-based vectors containing either Oval promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A (SEQ ID NO:42) or CMV promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A (SEQ ID NO:43).
- In the first experiment, the Oval promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A containing construct was injected into the lumen of the oviduct of sexually mature quail; three hens received 5 μg at a 1:3 SUPERFECT® ratio and three received 10 μg at a 1:3 SUPERFECT® ratio. As of the writing of the present application, at least one bird that received above-mentioned construct was producing human proinsulin in egg white (other birds remain to be tested). This experiment indicates that 1) the DNA has been stable for at least 3 months; 2) protein levels are comparable to those observed with a constitutive promoter such as the CMV promoter; and 3) sexually mature birds can be injected and results obtained without the need for cell culture. It is estimated that each quail egg contains approximately 1.4 μg/ml of the proinsulin protein. It is also estimated that each transgenic chicken egg contains 50-75 mg of protein encoded by the gene of interest.
- In the second experiment, the transposon-based vector containing CMV promoter/Oval gene/GP41 Enterokinase TAG/Proinsulin/Poly A was injected into the lumen of the oviduct of sexually immature Japanese quail. A total of 9 birds were injected. Of the 8 survivors, 3 produced human proinsulin in the white of their eggs for over 6 weeks. An ELISA assay described in detail below was developed to detect GP41 in the fusion peptide (Oval gene/GP41 Enterokinase TAG/Proinsulin) since the GP41 peptide sequence is unique and not found as part of normal egg white protein. In all ELISA assays, the same birds produced positive results and all controls worked as expected.
- ELISA Procedure: Individual egg white samples were diluted in sodium carbonate buffer, pH 9.6, and added to individual wells of 96 well microtiter ELISA plates at a total volume of 0.1 ml. These plates were then allowed to coat overnight at 4° C. Prior to ELISA development, the plates were allowed warm to room temperature. Upon decanting the coating solutions and blotting away any excess, non-specific binding of antibodies was blocked by adding a solution of phosphate buffered saline (PBS), 1% (w/v) BSA, and 0.05% (v/v) Tween 20 and allowing it to incubate with shaking for a minimum of 45 minutes. This blocking solution was subsequently decanted and replaced with a solution of the primary antibody (Goat Anti-GP41 TAG) diluted in fresh PBS/BSA/Tween 20. After a two hour period of incubation with the primary antibody, each plate was washed with a solution of PBS and 0.05% Tween 20 in an automated plate washer to remove unbound antibody. Next, the secondary antibody, Rabbit anti-Goat Alkaline Phosphatase-conjugated, was diluted in PBS/BSA/Tween 20 and allowed to incubate 1 hour. The plates were then subjected to a second wash with PBS/Tween 20. Antigen was detected using a solution of p-Nitrophenyl Phosphate in Diethanolamine Substrate Buffer for Alkaline Phosphatase and measuring the absorbance at 30 minutes and 1 hour.
- Additionally, a proinsulin fusion protein produced using a construct described above was isolated from egg white using ammonium sulfate precipitation and ion exchange chromotgraphy. A pooled fraction of the isolated fusion protein was run on an SDS-PAGE gel shown in FIG. 5, lanes4 and 6. Lanes 1 and 10 of the gel contain molecular weight standards, lanes 2 and 8 contain non-trangenic chicken egg white, whereas lanes 3, 5, 7 and 9 are blank.
- A HiTrap NHS-activated 1 mL column (Amersham) was charged with a 30 amino acid peptide that contained the gp-41 epitope containing gp-41's native disulfide bond that stabilizes the formation of the gp-41 hairpin loop. The 30 amino acid gp41 peptide is provided as SEQ ID NO:32. Approximately 10 mg of the peptide was dissolved in coupling buffer (0.2 M NaHCO3, 0.5 M NaCl, pH 8.3 and the ligand was circulated on the column for 2 hours at room temperature at 0.5 mL/minute. Excess active groups were then deactivated using 6 column volumes of 0.5 M ethanolamine, 0.5 M NaCl, pH 8.3 and the column was washed alternately with 6 column volumes of acetate buffer (0.1 M acetate, 0.5 M NaCl, pH 4.0) and ethanolamine (above). The column was neutralized using 1×PBS. The column was then washed with buffers to be used in affinity purification: 75 mM Tris, pH 8.0 and elution buffer, 100 mM glycine-HCl, 0.5 M NaCl, pH 2.7. Finally, the column was equilibrated in 75 mM Tris buffer, pH 8.0.
- Antibodies to gp-41 were raised in goats by inoculation with the gp-41 peptide described above. More specifically, goats were inoculated, given a booster injection of the gp-41 peptide and blood samples were obtained by veinupuncture. Serum was harvested by centrifugation. Approximately 30 mL of goat serum was filtered to 0.45 uM and passed over a TAG column at a rate of 0.5 mL/min. The column was washed with 75 mM Tris, pH 8.0 until absorbance at 280 nm reached a baseline. Three column volumes (3 mL) of elution buffer (100 mM glycine, 0.5 M NaCl, pH 2.7) was applied, followed by 75 mM Tris buffer, pH 8.0, all at a rate of 0.5 mL/min. One milliliter fractions were collected. Fractions were collected into 200 uL 1 M Tris, pH 9.0 to neutralize acidic factions as rapidly as possible. A large peak eluted from the column, coincident with the application the elution buffer. Fractions were pooled. Analysis by SDS-PAGE showed a high molecular weight species that separated into two fragments under reducing condition, in keeping with the heavy and light chain structure of IgG.
- Pooled antibody fractions were used to charge two 1 mL HiTrap NHS-activated columns, attached in series. Coupling was carried out in the same manner as that used for charging the TAG column.
- Egg white from quail and chickens treated by intra-oviduct injection of the CMV-ovalbumin-TAG-proinsulin construct were pooled. Viscosity was lowered by subjecting the allantoid fluid to successively finer pore sizes using negative pressure filtration, finishing with a 0.22 μM pore size. Through the process, egg white was diluted approximately 1:16. The clarified sample was loaded on the Anti-TAG column and eluted in the same manner as described for the purification of the anti-TAG antibodies. A peak of absorbance at 280 nm, coincident with the application of the elution buffer, indicated that protein had been specifically eluted from the Anti-TAG column. Fractions containing the eluted peak were pooled for analysis.
- The pooled fractions from the Anti-TAG affinity column were characterized by SDS-PAGE and western blot analysis. SDS-PAGE of the pooled fractions revealed a 60 kDal molecular weight band not present in control egg white fluid, consistent with the predicted molecular weight of the transgenic protein. Although some contaminating bands were observed, the 60 kDal species was greatly enriched compared to the other proteins. An aliquot of the pooled fractions was cleaved overnight at room temperature with the protease, enterokinase. SDS-PAGE analysis of the cleavage product, revealed a band not present in the uncut material that co-migrated with a commercial human proinsulin positive control. Western blot analysis showed specific binding to the 60 kDal species under non-reducing condition (which preserved the hairpin epitope of gp-41 by retaining the disulfide bond). Western analysis of the low molecular weight species that appeared upon cleavage with an anti-human proinsulin antibody, conclusively identified the cleaved fragment as human proinsulin.
- I. ELISA data for egg characterization/identification
- An ELISA was employed for the initial screening of eggs and, thereby, identification of hens producing positive eggs. With further modifications this procedure was used for the initial quantification of recombinant protein amounts. These procedures were aided by the successful purification of an initial stock of the recombinant proinsulin (RPI). This stock of protein is used in the development of a double antibody assay that increases the sensitivity and reduces the background in the assay. Subsequent identification of hens producing positive eggs obviate the need to screen each egg collected. Only periodic checks are needed to determine if production levels are consistent.
- II. Egg White (EW) or Albumin Preparation
- A. Clarification—Ovomucin precipitation
- Eggs from hens positively identified as producing RPI are pooled for RPI purification. The initial purification step involved diluting the pool 1:1 with 100 mM Tris-HCl, pH 8 for a final concentration of 50 mM Tris-HCl. The pH of this solution was then adjusted to 6 and ovomucin was allowed to precipitate at 4° C. for a minimum of 3 hrs (preferably overnight) with constant stirring. The precipitated ovomucin was then pelleted and removed by centrifugation at 2400×g. After collection of the RPI containing supernatant, the pH of this solution was readjusted to 8.
- B. Filtration
- To prepare the egg white for loading onto the column and, thereby, minimize the potential for clogging the columns during loading, the egg white solution was filtered to at least 0.45 um.
- Initially, the ovomucin precipitated egg white solution was subjected to successive filtration steps with the pore size of the filtration membrane decreasing at each step. This procedure involved time and dilution of the egg white solution to reach 0.45 um filtration.
- Amersham's hollow-fiber ultrafiltration apparatus was used to produced a column-ready solution filtered down to <0.2 um with an undiluted starting solution. This approach minimized the time and the solution dilution needed to prepare the egg white solution for column loading.
- III. Purification
- A. Affinity Chromatography
- Using antibody with specificity to a synthetic peptide modeled after the enterokinase recognition site, initial purification schemes involved developing a one-step column purification procedure for the RPI.
- Goats immunized with the synthetic Ent peptide were employed to produce anti-Ent Tag antiserum which was used in the egg screening ELISAs followed by antibody purification. The purified goat Anti-Ent Tag antibodies were covalently bound to the matrix of HiTrap NHS-activated HP columns (Amersham) and subsequently used to specifically bind and purify the RPI.
- An initial attempt was made to direct the first purification step against the ovalbumin portion of the recombinant protein using an antibody specific for the ovalbumin portion. The present purification scheme employed a combination of classical techniques such as ammonium sulfate precipitation, ion exchange, and gel filtration chromatography.
- After the initial ovomucin precipitation, the egg white solution was subjected to protein precipitation using a 40% ammonium sulfate fractionation. The precipitated protein was subsequently collected via centrifugation and resuspended in 50 mM Tris-HCl, pH 8. The resuspended protein solution was dialyzed to remove residual (NH4)2SO4 or subjected to gel filtration to remove the (NH4)2SO4 and partially isolate the RPI from the remaining egg white protein. The RPI was further isolated via anion exchange chromatography using a 0 to 0.5M NaCl gradient in 50 mM Tris-HCl, pH 8. Two possible elution profiles were observed. One at approximately 25% of the 0.5 M NaCl gradient without (NH4)2SO4 precipitation. The second was observed at less than 16% gradient (approximately 7%) following 40% (NH4)2SO4 precipitation and a longer gradient. Fractions containing RPI were identified by SDS-PAGE analysis and pooled.
- Three gel filtration columns, differing by column size and fractionation range, were employed in RPI purification and/or desalting: Superdex 75 10/300 GL, Hiload 26/60 Superdex 75, and Hiload 26/60 Superdex 200. Using these individual columns at different steps in the purification scheme increased the efficiency of the process. Fractions containing RPI were identified by SDS-PAGE analysis and pooled.
- Cleavage of the RPI Enterokinase recognition site was accomplished using purified enterokinase from Sigma. Enterokinase, 0.004 Unit/μl per reaction, was applied to the pooled and, if necessary, concentrated protein solution. The digestion reaction was incubated at room temperature (up to 30° C. in a rolling hybridization oven) for a minimum of 16 h and in some cases up to 48 hrs of incubation. The digestion efficiency was followed using 16.5% Tris-Tricine SDS-PAGE peptide gels. All gel staining utilized Simply Blue Coomassie Staining Solutions. Free Proinsulin was observed on gels after digestion.
- A subsequent gel filtration separation was employed to obtain purified Proinsulin, and to remove the remaining Ovalbumin portion of the RPI and residual native EW proteins. Select steps in the purification process were analyzed using the 2-dimensional Beckman Coulter ProteomeLab PF2D Protein Fractionation System.
- Overall transfection rates of oviduct cells in a flock of chicken or quail hens are enhanced by synchronizing the development of the oviduct and ovary within the flock. When the development of the oviducts and ovaries are uniform across a group of hens and when the stage of oviduct and ovarian development can be determined or predicted, timing of injections is optimized to transfect the greatest number of cells. Accordingly, oviduct development is synchronized as described below to ensure that a large and uniform proportion of oviduct secretory cells are transfected with the gene of interest.
- Hens are treated with estradiol to stimulate oviduct maturation as described in Oka and Schimke (T. Oka and RT Schimke, J. Cell Biol., 41, 816 (1969)), Palmiter, Christensen and Schimke (J Biol. Chem. 245(4):833-845, 1970). Specifically, repeated daily injections of 1 mg estradiol benzoate are performed sometime before the onset of sexual maturation, a period ranging from 1-14 weeks of age. After a stimulation period sufficient to maximize development of the oviduct, hormone treatment is withdrawn thereby causing regression in oviduct secretory cell size but not cell number. At an optimum time after hormone withdrawal, the lumens of the oviducts of treated hens are injected with the transposon-based vector. Hens are subjected to additional estrogen stimulation after an optimized time during which the transposon-based vector is taken up into oviduct secretory cells. Re-stimulation by estrogen activates transposon expression, causing the integration of the gene of interest into the host genome. Estrogen stimulation is then withdrawn and hens continue normal sexual development. If a developmentally regulated promoter such as the ovalbumin promoter is used, expression of the transposon-based vector initiates in the oviduct at the time of sexual maturation. Intra-ovarian artery injection during this window allows for high and uniform transfection efficiencies of ovarian follicles to produce germ-line transfections and possibly oviduct expression.
- Other means are also used to synchronize the development, or regression, of the oviduct and ovary to allow high and uniform transfection efficiencies. Alterations of lighting and/or feed regimens, for example, cause hens to ‘molt’ during which time the oviduct and ovary regress. Molting is used to synchronize hens for transfection, and may be used in conjunction with other hormonal methods to control regression and/or development of the oviduct and ovary.
- A vector is designed for inserting a proinsulin gene under the control of a quail ovalbumin promoter, and a ovalbumin gene including an ovalbumin signal sequence, into the genome of a bird given below as SEQ ID NO:44.
- Base pairs 1-130 are a remainder of F1(−) ori of pBluescriptII sk(−) (Stratagene) corresponding to base pairs 1-130 of pBluescriptII sk(−).
- Base pairs 133-1777 are a CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems) corresponding to base pairs 229-1873 of pGWiz.
- Base pairs 1780-2987 are a transposase, modified from Tn10 (GenBank accession number J01829).
- Base pairs 2988-2993 are an engineered stop codon.
- Base pairs 2995-3410 are a synthetic polyA from pGWiz (Gene Therapy Systems) corresponding to base pairs 1922-2337 of pGWiz.
- Base pairs 3415-3718 are non coding DNA that is residual from vector pNK2859.
- Base pairs 3719-3761 are λ DNA that is residual from pNK2859.
- Base pairs 3762-3831 are the 70 base pairs of the left insertion sequence (IS10) recognized by the transposon Tn10.
- Base pairs 3838-4044 are a multiple cloning site from pBlueScriptII sk(−) corresponding to base pairs 924-718 of pBluescriptII sk(−).
- Base pairs 4050-4938 are the Japanese quail ovalbumin promoter (including SDRE, steroid-dependent response element). The Japanese quail ovalbumin promoter was isolated by its high degree of homology to the chicken ovalbumin promoter (GenBank accession number J00895 M24999, base pairs 431-1332). Some deletions were noted in the quail sequence, as compared to the chicken sequence.
- Base pairs 4945-6092 are a quail ovalbumin signal sequence and ovalbumin gene that corresponds to base pairs 54-1201 of GenBank accession number X53964.1. (The STOP codon being omitted).
- Base pairs 6093-6246 are a TAG sequence containing a gp41 hairpin loop from HIV I an enterokinase cleavage site and a spacer (synthetic).
- Base pairs 6247-6507 are a proinsulin gene.
- Base pairs 6514-6866 are a synthetic polyadenylation sequence from pGWiz (Gene Therapy Systems) corresponding to base pairs 1920-2272 of pGWiz.
- Base pairs 6867-7303 are a multiple cloning site from pBlueScriptll sk(−) corresponding to base pairs 667-235 of pBluescriptII sk(−).
- Base pairs 7304-7379 are the 70 base pairs of the right insertion sequence (IS10) recognized by the transposon Tn10.
- Base pairs 7380-7421 are λ DNA that is residual from pNK2859.
- Base pairs 7422-8286 are non coding DNA that is residual from pNK2859.
- Base pairs 8287-10487 are pBlueScript sk(−) base vector (Stratagene, Inc.) corresponding to base pairs 761-2961 of pBluescriptII sk(−).
- It should be noted that all non-coding DNA sequences described above can be replaced with any other non-coding DNA sequence(s). Missing nucleotide sequences in the above construct represent restriction site remnants.
- A vector was designed for inserting a p146 gene under the control of a chicken ovalbumin promoter, and a ovalbumin gene including an ovalbumin signal sequence, into the genome of a bird. The vector sequence is provided below as SEQ ID NO:45.
- Base pairs 1-130 are a remainder of F1(−) ori of pBluescriptlI sk(−) (Stratagene) corresponding to base pairs 1-130 of pBluescriptll sk(−).
- Base pairs 133-1777 are a CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems) corresponding to base pairs 229-1873 of pGWiz.
- Base pairs 1780-2987 are a transposase, modified from Tn10 (GenBank accession number J01829).
- Base pairs 2988-2993 are an engineered stop codon.
- Base pairs 2995-3410 are a synthetic polyA from pGWiz (Gene Therapy Systems) corresponding to base pairs 1922-2337 of pGWiz.
- Base pairs 3415-3718 are non coding DNA that is residual from vector pNK2859.
- Base pairs 3719-3761 are λ DNA that is residual from pNK2859.
- Base pairs 3762-3831 are the 70 base pairs of the left insertion sequence (IS10) recognized by the transposon Tn10.
- Base pairs 3838-4044 are a multiple cloning site from pBlueScriptII sk(−) corresponding to base pairs 924-718 of pBluescriptll sk(−).
- Base pairs 4050-4951 are a chicken ovalbumin promoter (including SDRE, steroid-dependent response element) that corresponds to base pairs 431-1332 of the chicken ovalbumin promoter in GenBank Accession Number J00895 M24999.
- Base pairs 4958-6115 are a chicken ovalbumin signal sequence and Ovalbumin gene that correspond to base pairs 66-1223 of GenBank Accession Number V00383.1 (The STOP codon being omitted).
- Base pairs 6122-6271 are a TAG sequence containing a gp41 hairpin loop from HIV I, an enterokinase cleavage site and a spacer (synthetic).
- Base pairs 6272-6316 are a p146 sequence (synthetic) with 2 added stop codons.
- Base pairs 6324-6676 are a synthetic polyadenylation sequence from pGWiz (Gene Therapy Systems) corresponding to base pairs 1920-2272 of pGWiz.
- Base pairs 6682-7114 are a multiple cloning site from pBlueScriptII sk(−) corresponding to base pairs 667-235 of pBluescriptll sk(−).
- Base pairs 7120-7189 are the 70 base pairs of the right insertion sequence (IS10) recognized by the transposon Tn10.
- Base pairs 7190-7231 are λ DNA that is residual from pNK2859.
- Base pairs 7232-8096 are non coding DNA that is residual from pNK2859.
- Base pairs 8097-10297 are pBlueScript sk(−) base vector (Stratagene, Inc.) corresponding to base pairs 761-2961 of pBluescriptll sk(−).
- It should be noted that all non-coding DNA sequences described above can be replaced with any other non-coding DNA sequence(s). Missing nucleotide sequences in the above construct represent restriction site remnants.
- A vector was designed for inserting a p146 gene under the control of a quail ovalbumin promoter, and a ovalbumin gene including an ovalbumin signal sequence, into the genome of a bird. The vector sequence is given below as SEQ ID NO:46.
- Base pairs 1-130 are a remainder of F1(−) ori of pBluescriptII sk(−) (Stratagene) corresponding to base pairs 1-130 of pBluescriptll sk(−).
- Base pairs 133-1777 are a CMV promoter/enhancer taken from vector pGWiz (Gene Therapy Systems) corresponding to base pairs 229-1873 of pGWiz.
- Base pairs 1780-2987 are a transposase, modified from Tn10 (GenBank accession number J01829).
- Base pairs 2988-2993 are an engineered stop codon.
- Base pairs 2995-3410 are a synthetic polyA from pGWiz (Gene Therapy Systems) corresponding to base pairs 1922-2337 of pGWiz.
- Base pairs 3415-3718 are non coding DNA that is residual from vector pNK2859.
- Base pairs 3719-3761 are λ DNA that is residual from pNK2859.
- Base pairs 3762-3831 are the 70 base pairs of the left insertion sequence (IS10) recognized by the transposon Tn10.
- Base pairs 3838-4044 are a multiple cloning site from pBlueScriptII sk(−) corresponding to base pairs 924-718 of pBluescriptll sk(−).
- Base pairs 4050-4938 are the Japanese quail ovalbumin promoter (including SDRE, steroid-dependent response element). The Japanese quail ovalbumin promoter was isolated by its high degree of homology to the chicken ovalbumin promoter (GenBank accession number J00895 M24999, base pairs 431-1332).
- Bp 4945-6092 are a quail ovalbumin signal sequence and ovalbumin gene that corresponds to base pairs 54-1201 of GenBank accession number X53964.1. (The STOP codon being omitted).
- Base pairs 6097-6246 are a TAG sequence containing a gp41 hairpin loop from HIV I, an enterokinase cleavage site and a spacer (synthetic).
- Base pairs 6247-6291 are a p146 sequence (synthetic) with 2 added stop codons.
- Base pairs 6299-6651 are a synthetic polyadenylation sequence from pGWiz (Gene Therapy Systems) corresponding to base pairs 1920-2272of pGWiz.
- Base pairs 6657-7089 are a multiple cloning site from pBlueScriptII sk(−) corresponding to base pairs 667-235 of pBluescriptll sk(−).
- Base pairs 7095-7164 are the 70 base pairs of the right insertion sequence (IS10) recognized by the transposon Tn10.
- Base pairs 7165-7206 are λ DNA that is residual from pNK2859.
- Base pairs 7207-8071 are non coding DNA that is residual from pNK2859.
- Base pairs 8072-10272 are pBlueScript sk(−) base vector (Stratagene, Inc.) corresponding to base pairs 761-2961of pBluescriptll sk(−).
- It should be noted that all non-coding DNA sequences described above can be replaced with any other non-coding DNA sequence(s). Missing nucleotide sequences in the above construct represent restriction site remnants.
- The following example provides a description of various transposon-based vectors of the present invention and several constructs that have been made for insertion into the transposon-based vectors of the present invention, all for intraoviduct administration. These examples are not meant to be limiting in any way. The constructs for insertion into a transposon-based vector are provided in a cloning vector pTnMCS or pTnMod, both described above.
- pTnMCS (CMV-CHOVg-ent-Prolnsulin-synPA) (SEQ ID NO:47)
- Bp 1-3670 from vector PTnMCS, bp 1-3670
- Bp 3676-5320 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems), bp 230-1864
- Bp 5327-6480 Chicken ovalbumin gene taken from GenBank accession #V00383, bp 66-1219
- Bp 6487-6636 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 6637-6897 Human Proinsulin taken from GenBank accession #NM000207, bp 117-377
- Bp 6898-6942 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and pGWIZ (Gene Therapy Systems)
- Bp 6943-7295 Synthetic polyA from the cloning vector pGWIZ (Gene Therapy Systems), bp 1920-2271
- Bp 7296-10895 from cloning vector pTnMCS, bp 3716-7315
- pTnMCS (CMV-prepro-ent-ProInsulin-synPA)
- Bp 1-3670 from vector PTnMCS, bp 1-3670
- Bp 3676-5320 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems), bp 230-1864
- Bp 5326-5496 Capsite/prepro taken fron GenBank accession #X07404, bp 563-733
- Bp 5504-5652 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 5653-5913 Human Proinsulin taken from GenBank accession #NM000207, bp 117-377
- Bp 5914-5958 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and pGWIZ (Gene Therapy Systems)
- Bp 5959-6310 Synthetic polyA from the cloning vector pGWIZ (Gene Therapy Systems), bp 1920-2271
- Bp 6313-9912 from cloning vector pTnMCS, bp 3716-7315
- pTnMCS(Chicken OVep+OVg′+ENT+proins+syn polyA)
- Bp 1-3670 from vector pTnMCS, bp 1-3670
- Bp 3676-4350 Chicken Ovalbumin enhancer taken from GenBank accession #S82527.1 bp 1-675
- Bp 4357-5692 Chicken Ovalbumin promoter taken from GenBank accession #J00895M24999 bp 1-1336
- Bp 5699-6917 Chicken Ovalbumin gene from GenBank Accession #V00383.1 bp 2-1220. (This sequence includes the 5′UTR, containing putative cap site, bp 5699-5762.)
- Bp 6924-7073 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 7074-7334 Human proinsulin GenBank Accession #NM000207 bp 117-377
- Bp 7335-7379 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
- Bp 7380-7731 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920-2271
- Bp 7733-11332 from vector pTnMCS, bp 3716-7315
- pTnMCS(Chicken OVep+prepro+ENT+proins+syn polyA)
- Bp 1-3670 from cloning vector pTnMCS, bp 1-3670
- Bp 3676-4350 Chicken Ovalbumin enhancer taken from GenBank accession #S82527.1 bp 1-675
- Bp 4357-5692 Chicken Ovalbumin promoter taken from GenBank accession #J00895-M24999bp 1-1336
- Bp 5699-5869 Cecropin cap site and prepro, Genbank accession #X07404 bp 563-733
- Bp 5876-6025 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 6026-6286 Human proinsulin GenBank Accession #NM000207 bp 117-377
- Bp 6287-6331 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
- Bp 6332-6683 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920-2271
- Bp 6685-10284 from cloning vector pTnMCS, bp 3716-7315
- pTnMCS(Quail OVep+OVg′+ENT+proins+syn polyA)
- Bp 1-3670 from cloning vector pTnMCS, bp 1-3670
- Bp 3676-4333 Quail Ovalbumin enhancer: 658 bp sequence, amplified in-house from quail genomic DNA, roughly equivalent to the far-upstream chicken ovalbumin enhancer, GenBank accession #S82527.1, bp 1-675. (There are multiple base pair substitutions and deletions in the quail sequence, relative tochicken, so the number of bases does not correspond exactly.)
- Bp 4340-5705 Quail Ovalbumin promoter: 1366 bp sequence, amplified in-house from quail genomic DNA, roughly corresponding to chicken ovalbumin promoter, GenBank accession #J00895-M24999 bp 1-1336. (There are multiple base pair substitutions and deletions between the quail and chicken sequences, so the number of bases does not correspond exactly.)
- Bp 5712-6910 Quail Ovalbumin gene, EMBL accession #X53964, bp 1-1199. (This sequence includes the 5′UTR, containing putative cap site bp 5712-5764.)
- Bp 6917-7066 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 7067-7327 Human proinsulin GenBank Accession #NM000207 bp 117-377
- Bp 7328-7372 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
- Bp 7373-7724 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920-2271
- Bp 7726-11325 from cloning vector pTnMCS, bp 3716-7315
- pTnMCS(Ouail OVep+prepro+ENT+proins+syn polyA)
- Bp 1-3670 from cloning vector pTnMCS, bp 1-3670
- Bp 3676-4333 Quail Ovalbumin enhancer: 658 bp sequence, amplified from quail genomic DNA, roughly equivalent to the far-upstream chicken ovalbumin enhancer, GenBank accession #S82527.1, bp 1-675. (There are multiple base pair substitutions and deletions in the quail sequence, relative to chicken, so the number of bases does not correspond exactly.)
- Bp 4340-5705 Quail Ovalbumin promoter: 1366 bp sequence, amplified from quail genomic DNA, roughly corresponding to chicken ovalbumin promoter, GenBank accession #J00895-M24999 bp 1-1336. (There are multiple base pair substitutions and deletions between the quail and chicken sequences, so the number of bases does not correspond exactly.)
- Bp 5712-5882 Cecropin cap site and prepro, Genbank accession #X07404 bp 563-733
- Bp 5889-6038 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 6039-6299 Human proinsulin GenBank Accession #NM000207 bp 117-377
- Bp 6300-6344 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
- Bp 6345-6696 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920-2271
- Bp 6698-10297 from cloning vector pTnMCS, bp 3716-7315.
- pTnMOD (CMV-prepro-ent-proins-synPA)
- Bp 1-4045 from vector PTnMCS, bp 1-4045
- Bp 4051-5695 CMV promoter/enhancer taken from vector pGWIZ (Gene therapy systems), bp 230-1864
- Bp 5701-5871 Capsite/prepro taken from GenBank accession #X07404, bp 563-733
- Bp 5879-6027 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 6028-6288 Human Proinsulin taken from GenBank accession #NM000207, bp 117-377
- Bp 6289-6333 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and pGWIZ (Gene Therapy Systems)
- Bp 6334-6685 Synthetic polyA from the cloning vector pGWIZ (Gene Therapy Systems), bp 1920-2271
- Bp 6687-10286 from cloning vector pTnMCS, bp 3716-7315
- pTnMOD(Chicken OVep+OVg′+ENT+proins+syn polyA)
- Bp 1-4045 from cloning vector pTnMod, bp 1-4045
- Bp 4051-4725 Chicken Ovalbumin enhancer taken from GenBank accession #S82527.1 bp 1-675
- Bp 4732-6067 Chicken Ovalbumin promoter taken from GenBank accession #J00895-M24999 bp 1-1336
- Bp 6074-7292 Chicken Ovalbumin gene from GenBank Accession #V00383.1 bp 2-1220. (This sequence includes the 5′UTR, containing putative cap site bp 6074-6137.)
- Bp 7299-7448 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 7449-7709 Human proinsulin GenBank Accession #NM000207 bp 117-377
- Bp 7710-7754 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
- Bp 7755-8106 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920-2271
- Bp 8108-11707 from cloning vector pTnMod, bp 3716-7315
- pTnMOD(Chicken OVep+prepro+ENT+proins+syn polyA)
- Bp 1-4045 from cloning vector pTnMCS, bp 1-4045
- Bp 4051-4725 Chicken Ovalbumin enhancer taken from GenBank accession #S82527.1 bp 1-675
- Bp 4732-6067 Chicken Ovalbumin promoter taken from GenBank accession #J00895-M24999 bp 1-1336
- Bp 6074-6244 Cecropin cap site and prepro, Genbank accession #X07404 bp 563-733
- Bp 6251-6400 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 6401-6661 Human proinsulin GenBank Accession #NM000207 bp 117-377
- Bp 6662-6706 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
- Bp 6707-7058 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920-2271
- Bp 7060-10659 from cloning vector pTnMCS, bp 3716-7315
- pTnMOD(Quail OVep+OVg′+ENT+proins+syn polyA)
- Bp 1-4045 from cloning vector pTnMCS, bp 1-4045
- Bp 4051-4708 Quail Ovalbumin enhancer: 658 bp sequence, amplified in-house from quail genomic DNA, roughly equivalent to the far-upstream chicken ovalbumin enhancer, GenBank accession #S82527.1, bp 1-675. (There are multiple base pair substitutions and deletions in the quail sequence, relative to chicken, so the number of bases does not correspond exactly.)
- Bp 4715-6080 Quail Ovalbumin promoter: 1366 bp sequence, amplified in-house from quail genomic DNA, roughly corresponding to chicken ovalbumin promoter, GenBank accession #J00895-M24999 bp 1-1336. (There are multiple base pair substitutions and deletions between the quail and chicken sequences, so the number of bases does not correspond exactly.)
- Bp 6087-7285 Quail Ovalbumin gene, EMBL accession #X53964, bp 1-1199. (This sequence includes the 5′UTR, containing putative cap site bp 6087-6139.)
- Bp 7292-7441 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 7442-7702 Human proinsulin GenBank Accession #NM000207 bp 117-377
- Bp 7703-7747 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
- Bp 7748-8099 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920-2271
- Bp 8101-11700 from cloning vector pTnMCS, bp 3716-7315
- pTnMOD(Quail OVep+prepro+ENT+proins+svn polyA)
- Bp 1-4045 from cloning vector pTnMCS, bp 1-4045
- Bp 4051-4708 Quail Ovalbumin enhancer: 658 bp sequence, amplified in-housefrom quail genomic DNA, roughly equivalent to the far-upstream chicken ovalbumin enhancer, GenBank accession #S82527.1, bp 1-675. (There are multiple base pair substitutions and deletions in the quail sequence, relative to chicken, so the number of bases does not correspond exactly.)
- Bp 4715-6080 Quail Ovalbumin promoter: 1366 bp sequence, amplified in-house from quail genomic DNA, roughly corresponding to chicken ovalbumin promoter, GenBank accession #J00895-M24999 bp 1-1336. (There are multiple base pair substitutions and deletions between the quail and chicken sequences, so the number of bases does not correspond exactly.)
- Bp 6087-6257 Cecropin cap site and Prepro, Genbank accession #X07404 bp 563-733
- Bp 6264-6413 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 6414-6674 Human proinsulin GenBank Accession #NM000207 bp 117-377
- Bp 6675-6719 Spacer DNA, derived as an artifact from the cloning vectors pTOPO Blunt II (Invitrogen) and gWIZ (Gene Therapy Systems)
- Bp 6720-7071 Synthetic polyA from the cloning vector gWIZ (Gene Therapy Systems) bp 1920-2271
- Bp 7073-10672 from cloning vector pTnMCS, bp 3716-7315
- pTnMOD (CMV-prepro-ent-hGH-CPA)
- Bp 1-4045 from vector PTnMOD, bp 1-4045
- Bp 4051-5694 CMV promoter/enhancer taken from vector pGWIZ (Gene therapy systems), bp 230-1873
- Bp 5701-5871 Capsite/Prepro taken fron GenBank accession #X07404, bp 563-733
- Bp 5878-6012 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 6013-6666 Human growth hormone taken from GenBank accession #V00519, bp 1-654
- Bp 6673-7080 Conalbumin polyA taken from GenBank accession #Y00407, bp 10651-11058
- Bp 7082-10681 from cloning vector pTnMOD, bp 4091-7690
- pTnMCS (CHOVep-prepro-ent-hGH-CPA)
- Bp 1-3670 from vector PTnMCS, bp 1-3670
- Bp 3676-4350 Chicken Ovalbumin enhancer taken from GenBank accession #S82527.1, bp 1-675
- Bp 4357-5692 Chicken Ovalbumin promoter taken from GenBank accession #J00899-M24999, bp 1-1336
- Bp 5699-5869 Capsite/Prepro taken fron GenBank accession #X07404, bp 563-733
- Bp 5876-6010 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 6011-6664 Human growth hormone taken from GenBank accession #V00519, bp 1-654
- Bp 6671-7078 Conalbumin polyA taken from GenBank accession #Y00407, bp 10651-11058
- Bp 7080-10679 from cloning vector pTnMCS, bp 3716-7315
- pTnMCS (CMV-prepro-ent-hGH-CPA)
- Bp 1-3670 from vector PTnMCS, bp 1-3670
- Bp 3676-5319 CMV promoter/enhancer taken from vector pGWIZ (Gene therapy systems), bp 230-1873
- Bp 5326-5496 Capsite/Prepro taken fron GenBank accession #X07404, bp 563-733
- Bp 5503-5637 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 5638-6291 Human growth hormone taken from GenBank accession #V00519, bp 1-654
- Bp 6298-6705 Conalbumin polyA taken from GenBank accession #Y00407, bp 10651-11058
- Bp 6707-10306 from cloning vector pTnMCS, bp 3716-7315
- pTnMOD (CHOVep-prepro-ent-hGH-CPA)
- Bp 1-4045 from vector PTnMOD, bp 1-4045
- Bp 4051-4725 Chicken Ovalbumin enhancer taken from GenBank accession #S82527.1, bp 1-675
- Bp 4732-6067 Chicken Ovalbumin promoter taken from GenBank accession #J00899-M24999, bp 1-1336
- Bp 6074-6244 Capsite/Prepro taken fron GenBank accession #X07404, bp 563-733
- Bp 6251-6385 Synthetic spacer sequence and hairpin loop of HIV gp41 with an added enterokinase cleavage site
- Bp 6386-7039 Human growth hormone taken from GenBank accession #V00519, bp 1-654
- Bp 7046-7453 Conalbumin polyA taken from GenBank accession #Y00407, bp 10651-11058
- Bp 7455-11054 from cloning vector pTnMOD, bp 4091-7690
- PTnMod(CMV/Transposase/ChickOvep/prepro/ProteinA/ConpolyA)
- BP 1-130 remainder of F1 (−) ori of pBluescriptII sk(−) (Stragagene) bp 1-130.
- BP 133-1777 CMV promoter/enhancer taken from vector pGWIZ (Gene Therapy Systems) bp 229-1873.
- BP 1780-2987 Transposase, modified from Tn10 (GenBank #J01829).
- BP 2988-2993 Engineered DOUBLE stop codon.
- BP 2994-3343 non coding DNA from vector pNK2859.
- BP 3344-3386 Lambda DNA from pNK2859.
- BP 3387-3456 70bp of IS10 left from Tn10.
- BP 3457-3674 multiple cloning site from pBluescriptII sk(−) bp 924-707.
- BP 3675-5691 Chicken Ovalbumin enhancer plus promoter from a Topo Clone 10 maxi 040303 (5′ XmaI, 3′ BamHI)
- BP 5698-5865 prepro with Cap site amplified from cecropin of pMON200 GenBank #X07404 (5′BamHI, 3′KpnI)
- BP 5872-7338 Protein A gene from GenBank#J01786, mature peptide bp 292-1755 (5′Kpnl, 3′SacII)
- BP 7345-7752 ConPolyA from Chicken conalbumin polyA from GenBank #Y00407 bp 10651-11058. (5′SacII, 3′XhoI)
- BP 7753-8195 multiple cloning site from pBluescriptII sk(−) bp 677-235.
- BP 8196-8265 70 bp of IS10 left from Tn10.
- BP 8266-8307 Lamda DNA from pNK2859
- BP 8308-9151 noncoding DNA from pNK2859
- BP 9152-11352 pBluescriptIl sk(−) base vector (Stratagene, INC.) bp 761-2961
- All patents, publications and abstracts cited above are incorporated herein by reference in their entirety. It should be understood that the foregoing relates only to preferred embodiments of the present invention and that numerous modifications or alterations may be made therein without departing from the spirit and the scope of the present invention as defined in the following claims.
-
1 52 1 680 DNA Gallus sp. 1 ccgggctgca gaaaaatgcc aggtggacta tgaactcaca tccaaaggag cttgacctga 60 tacctgattt tcttcaaact ggggaaacaa cacaatccca caaaacagct cagagagaaa 120 ccatcactga tggctacagc accaaggtat gcaatggcaa tccattcgac attcatctgt 180 gacctgagca aaatgattta tctctccatg aatggttgct tctttccctc atgaaaaggc 240 aatttccaca ctcacaatat gcaacaaaga caaacagaga acaattaatg tgctccttcc 300 taatgtcaaa attgtagtgg caaagaggag aacaaaatct caagttctga gtaggtttta 360 gtgattggat aagaggcttt gacctgtgag ctcacctgga cttcatatcc ttttggataa 420 aaagtgcttt tataactttc aggtctccga gtctttattc atgagactgt tggtttaggg 480 acagacccac aatgaaatgc ctggcatagg aaagggcagc agagccttag ctgacctttt 540 cttgggacaa gcattgtcaa acaatgtgtg acaaaactat ttgtactgct ttgcacagct 600 gtgctgggca gggcaatcca ttgccaccta tcccaggtaa ccttccaact gcaagaagat 660 tgttgcttac tctctctaga 680 2 7315 DNA Artificial Sequence Synthetic 2 ctgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 60 ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 120 ccacgttcgc cggcatcaga ttggctattg gccattgcat acgttgtatc catatcataa 180 tatgtacatt tatattggct catgtccaac attaccgcca tgttgacatt gattattgac 240 tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg 300 cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt 360 gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca 420 atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc 480 aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta 540 catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac 600 catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg 660 atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg 720 ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt 780 acggtgggag gtctatataa gcagagctcg tttagtgaac cgtcagatcg cctggagacg 840 ccatccacgc tgttttgacc tccatagaag acaccgggac cgatccagcc tccgcggccg 900 ggaacggtgc attggaacgc ggattccccg tgccaagagt gacgtaagta ccgcctatag 960 actctatagg cacacccctt tggctcttat gcatgctata ctgtttttgg cttggggcct 1020 atacaccccc gcttccttat gctataggtg atggtatagc ttagcctata ggtgtgggtt 1080 attgaccatt attgaccact cccctattgg tgacgatact ttccattact aatccataac 1140 atggctcttt gccacaacta tctctattgg ctatatgcca atactctgtc cttcagagac 1200 tgacacggac tctgtatttt tacaggatgg ggtcccattt attatttaca aattcacata 1260 tacaacaacg ccgtcccccg tgcccgcagt ttttattaaa catagcgtgg gatctccacg 1320 cgaatctcgg gtacgtgttc cggacatggg ctcttctccg gtagcggcgg agcttccaca 1380 tccgagccct ggtcccatgc ctccagcggc tcatggtcgc tcggcagctc cttgctccta 1440 acagtggagg ccagacttag gcacagcaca atgcccacca ccaccagtgt gccgcacaag 1500 gccgtggcgg tagggtatgt gtctgaaaat gagcgtggag attgggctcg cacggctgac 1560 gcagatggaa gacttaaggc agcggcagaa gaagatgcag gcagctgagt tgttgtattc 1620 tgataagagt cagaggtaac tcccgttgcg gtgctgttaa cggtggaggg cagtgtagtc 1680 tgagcagtac tcgttgctgc cgcgcgcgcc accagacata atagctgaca gactaacaga 1740 ctgttccttt ccatgggtct tttctgcagt caccgtcgga ccatgtgcga actcgatatt 1800 ttacacgact ctctttacca attctgcccc gaattacact taaaacgact caacagctta 1860 acgttggctt gccacgcatt acttgactgt aaaactctca ctcttaccga acttggccgt 1920 aacctgccaa ccaaagcgag aacaaaacat aacatcaaac gaatcgaccg attgttaggt 1980 aatcgtcacc tccacaaaga gcgactcgct gtataccgtt ggcatgctag ctttatctgt 2040 tcgggcaata cgatgcccat tgtacttgtt gactggtctg atattcgtga gcaaaaacga 2100 cttatggtat tgcgagcttc agtcgcacta cacggtcgtt ctgttactct ttatgagaaa 2160 gcgttcccgc tttcagagca atgttcaaag aaagctcatg accaatttct agccgacctt 2220 gcgagcattc taccgagtaa caccacaccg ctcattgtca gtgatgctgg ctttaaagtg 2280 ccatggtata aatccgttga gaagctgggt tggtactggt taagtcgagt aagaggaaaa 2340 gtacaatatg cagacctagg agcggaaaac tggaaaccta tcagcaactt acatgatatg 2400 tcatctagtc actcaaagac tttaggctat aagaggctga ctaaaagcaa tccaatctca 2460 tgccaaattc tattgtataa atctcgctct aaaggccgaa aaaatcagcg ctcgacacgg 2520 actcattgtc accacccgtc acctaaaatc tactcagcgt cggcaaagga gccatgggtt 2580 ctagcaacta acttacctgt tgaaattcga acacccaaac aacttgttaa tatctattcg 2640 aagcgaatgc agattgaaga aaccttccga gacttgaaaa gtcctgccta cggactaggc 2700 ctacgccata gccgaacgag cagctcagag cgttttgata tcatgctgct aatcgccctg 2760 atgcttcaac taacatgttg gcttgcgggc gttcatgctc agaaacaagg ttgggacaag 2820 cacttccagg ctaacacagt cagaaatcga aacgtactct caacagttcg cttaggcatg 2880 gaagttttgc ggcattctgg ctacacaata acaagggaag acttactcgt ggctgcaacc 2940 ctactagctc aaaatttatt cacacatggt tacgctttgg ggaaattatg aggggatcgc 3000 tctagagcga tccgggatct cgggaaaagc gttggtgacc aaaggtgcct tttatcatca 3060 ctttaaaaat aaaaaacaat tactcagtgc ctgttataag cagcaattaa ttatgattga 3120 tgcctacatc acaacaaaaa ctgatttaac aaatggttgg tctgccttag aaagtatatt 3180 tgaacattat cttgattata ttattgataa taataaaaac cttatcccta tccaagaagt 3240 gatgcctatc attggttgga atgaacttga aaaaaattag ccttgaatac attactggta 3300 aggtaaacgc cattgtcagc aaattgatcc aagagaacca acttaaagct ttcctgacgg 3360 aatgttaatt ctcgttgacc ctgagcactg atgaatcccc taatgatttt ggtaaaaatc 3420 attaagttaa ggtggataca catcttgtca tatgatcccg gtaatgtgag ttagctcact 3480 cattaggcac cccaggcttt acactttatg cttccggctc gtatgttgtg tggaattgtg 3540 agcggataac aatttcacac aggaaacagc tatgaccatg attacgccaa gcgcgcaatt 3600 aaccctcact aaagggaaca aaagctggag ctccaccgcg gtggcggccg ctctagaact 3660 agtggatccc ccgggctgca ggaattcgat atcaagctta tcgataccgc tgacctcgag 3720 ggggggcccg gtacccaatt cgccctatag tgagtcgtat tacgcgcgct cactggccgt 3780 cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc caacttaatc gccttgcagc 3840 acatccccct ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca 3900 acagttgcgc agcctgaatg gcgaatggaa attgtaagcg ttaatatttt gttaaaattc 3960 gcgttaaatt tttgttaaat cagctcattt tttaaccaat aggccgaaat cggcaaaatc 4020 ccttataaat caaaagaata gaccgagata gggttgagtg ttgttccagt ttggaacaag 4080 agtccactat taaagaacgt ggactccaac gtcaaagggc gaaaaaccgt ctatcagggc 4140 gatggcccac tactccggga tcatatgaca agatgtgtat ccaccttaac ttaatgattt 4200 ttaccaaaat cattagggga ttcatcagtg ctcagggtca acgagaatta acattccgtc 4260 aggaaagctt atgatgatga tgtgcttaaa aacttactca atggctggtt atgcatatcg 4320 caatacatgc gaaaaaccta aaagagcttg ccgataaaaa aggccaattt attgctattt 4380 accgcggctt tttattgagc ttgaaagata aataaaatag ataggtttta tttgaagcta 4440 aatcttcttt atcgtaaaaa atgccctctt gggttatcaa gagggtcatt atatttcgcg 4500 gaataacatc atttggtgac gaaataacta agcacttgtc tcctgtttac tcccctgagc 4560 ttgaggggtt aacatgaagg tcatcgatag caggataata atacagtaaa acgctaaacc 4620 aataatccaa atccagccat cccaaattgg tagtgaatga ttataaataa cagcaaacag 4680 taatgggcca ataacaccgg ttgcattggt aaggctcacc aataatccct gtaaagcacc 4740 ttgctgatga ctctttgttt ggatagacat cactccctgt aatgcaggta aagcgatccc 4800 accaccagcc aataaaatta aaacagggaa aactaaccaa ccttcagata taaacgctaa 4860 aaaggcaaat gcactactat ctgcaataaa tccgagcagt actgccgttt tttcgcccat 4920 ttagtggcta ttcttcctgc cacaaaggct tggaatactg agtgtaaaag accaagaccc 4980 gtaatgaaaa gccaaccatc atgctattca tcatcacgat ttctgtaata gcaccacacc 5040 gtgctggatt ggctatcaat gcgctgaaat aataatcaac aaatggcatc gttaaataag 5100 tgatgtatac cgatcagctt ttgttccctt tagtgagggt taattgcgcg cttggcgtaa 5160 tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc tcacaattcc acacaacata 5220 cgagccggaa gcataaagtg taaagcctgg ggtgcctaat gagtgagcta actcacatta 5280 attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa 5340 tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg 5400 ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag 5460 gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa 5520 ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc 5580 cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca 5640 ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg 5700 accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct 5760 catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt 5820 gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag 5880 tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc 5940 agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac 6000 actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga 6060 gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc 6120 aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg 6180 gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca 6240 aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt 6300 atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca 6360 gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta gataactacg 6420 atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga cccacgctca 6480 ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg cagaagtggt 6540 cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc tagagtaagt 6600 agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca 6660 cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag gcgagttaca 6720 tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat cgttgtcaga 6780 agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa ttctcttact 6840 gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa gtcattctga 6900 gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga taataccgcg 6960 ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg gcgaaaactc 7020 tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc acccaactga 7080 tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg aaggcaaaat 7140 gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact cttccttttt 7200 caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat atttgaatgt 7260 atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt gccac 7315 3 7689 DNA Artificial Sequence Synthetic 3 ctgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 60 ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 120 ccacgttcgc cggcatcaga ttggctattg gccattgcat acgttgtatc catatcataa 180 tatgtacatt tatattggct catgtccaac attaccgcca tgttgacatt gattattgac 240 tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg 300 cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt 360 gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca 420 atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc 480 aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta 540 catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac 600 catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg 660 atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg 720 ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt 780 acggtgggag gtctatataa gcagagctcg tttagtgaac cgtcagatcg cctggagacg 840 ccatccacgc tgttttgacc tccatagaag acaccgggac cgatccagcc tccgcggccg 900 ggaacggtgc attggaacgc ggattccccg tgccaagagt gacgtaagta ccgcctatag 960 actctatagg cacacccctt tggctcttat gcatgctata ctgtttttgg cttggggcct 1020 atacaccccc gcttccttat gctataggtg atggtatagc ttagcctata ggtgtgggtt 1080 attgaccatt attgaccact cccctattgg tgacgatact ttccattact aatccataac 1140 atggctcttt gccacaacta tctctattgg ctatatgcca atactctgtc cttcagagac 1200 tgacacggac tctgtatttt tacaggatgg ggtcccattt attatttaca aattcacata 1260 tacaacaacg ccgtcccccg tgcccgcagt ttttattaaa catagcgtgg gatctccacg 1320 cgaatctcgg gtacgtgttc cggacatggg ctcttctccg gtagcggcgg agcttccaca 1380 tccgagccct ggtcccatgc ctccagcggc tcatggtcgc tcggcagctc cttgctccta 1440 acagtggagg ccagacttag gcacagcaca atgcccacca ccaccagtgt gccgcacaag 1500 gccgtggcgg tagggtatgt gtctgaaaat gagcgtggag attgggctcg cacggctgac 1560 gcagatggaa gacttaaggc agcggcagaa gaagatgcag gcagctgagt tgttgtattc 1620 tgataagagt cagaggtaac tcccgttgcg gtgctgttaa cggtggaggg cagtgtagtc 1680 tgagcagtac tcgttgctgc cgcgcgcgcc accagacata atagctgaca gactaacaga 1740 ctgttccttt ccatgggtct tttctgcagt caccgtcgga ccatgtgtga acttgatatt 1800 ttacatgatt ctctttacca attctgcccc gaattacact taaaacgact caacagctta 1860 acgttggctt gccacgcatt acttgactgt aaaactctca ctcttaccga acttggccgt 1920 aacctgccaa ccaaagcgag aacaaaacat aacatcaaac gaatcgaccg attgttaggt 1980 aatcgtcacc tccacaaaga gcgactcgct gtataccgtt ggcatgctag ctttatctgt 2040 tcgggaatac gatgcccatt gtacttgttg actggtctga tattcgtgag caaaaacgac 2100 ttatggtatt gcgagcttca gtcgcactac acggtcgttc tgttactctt tatgagaaag 2160 cgttcccgct ttcagagcaa tgttcaaaga aagctcatga ccaatttcta gccgaccttg 2220 cgagcattct accgagtaac accacaccgc tcattgtcag tgatgctggc tttaaagtgc 2280 catggtataa atccgttgag aagctgggtt ggtactggtt aagtcgagta agaggaaaag 2340 tacaatatgc agacctagga gcggaaaact ggaaacctat cagcaactta catgatatgt 2400 catctagtca ctcaaagact ttaggctata agaggctgac taaaagcaat ccaatctcat 2460 gccaaattct attgtataaa tctcgctcta aaggccgaaa aaatcagcgc tcgacacgga 2520 ctcattgtca ccacccgtca cctaaaatct actcagcgtc ggcaaaggag ccatgggttc 2580 tagcaactaa cttacctgtt gaaattcgaa cacccaaaca acttgttaat atctattcga 2640 agcgaatgca gattgaagaa accttccgag acttgaaaag tcctgcctac ggactaggcc 2700 tacgccatag ccgaacgagc agctcagagc gttttgatat catgctgcta atcgccctga 2760 tgcttcaact aacatgttgg cttgcgggcg ttcatgctca gaaacaaggt tgggacaagc 2820 acttccaggc taacacagtc agaaatcgaa acgtactctc aacagttcgc ttaggcatgg 2880 aagttttgcg gcattctggc tacacaataa caagggaaga cttactcgtg gctgcaaccc 2940 tactagctca aaatttattc acacatggtt acgctttggg gaaattatga taatgatcca 3000 gatcacttct ggctaataaa agatcagagc tctagagatc tgtgtgttgg ttttttgtgg 3060 atctgctgtg ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt 3120 gaccctggaa ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca 3180 ttgtctgagt aggtgtcatt ctattctggg gggtggggtg gggcagcaca gcaaggggga 3240 ggattgggaa gacaatagca ggcatgctgg ggatgcggtg ggctctatgg gtacctctct 3300 ctctctctct ctctctctct ctctctctct ctctcggtac ctctctctct ctctctctct 3360 ctctctctct ctctctctct cggtaccagg tgctgaagaa ttgacccggt gaccaaaggt 3420 gccttttatc atcactttaa aaataaaaaa caattactca gtgcctgtta taagcagcaa 3480 ttaattatga ttgatgccta catcacaaca aaaactgatt taacaaatgg ttggtctgcc 3540 ttagaaagta tatttgaaca ttatcttgat tatattattg ataataataa aaaccttatc 3600 cctatccaag aagtgatgcc tatcattggt tggaatgaac ttgaaaaaaa ttagccttga 3660 atacattact ggtaaggtaa acgccattgt cagcaaattg atccaagaga accaacttaa 3720 agctttcctg acggaatgtt aattctcgtt gaccctgagc actgatgaat cccctaatga 3780 ttttggtaaa aatcattaag ttaaggtgga tacacatctt gtcatatgat cccggtaatg 3840 tgagttagct cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt 3900 tgtgtggaat tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg 3960 ccaagcgcgc aattaaccct cactaaaggg aacaaaagct ggagctccac cgcggtggcg 4020 gccgctctag aactagtgga tcccccgggc tgcaggaatt cgatatcaag cttatcgata 4080 ccgctgacct cgaggggggg cccggtaccc aattcgccct atagtgagtc gtattacgcg 4140 cgctcactgg ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt 4200 aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc 4260 gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggaaattgta agcgttaata 4320 ttttgttaaa attcgcgtta aatttttgtt aaatcagctc attttttaac caataggccg 4380 aaatcggcaa aatcccttat aaatcaaaag aatagaccga gatagggttg agtgttgttc 4440 cagtttggaa caagagtcca ctattaaaga acgtggactc caacgtcaaa gggcgaaaaa 4500 ccgtctatca gggcgatggc ccactactcc gggatcatat gacaagatgt gtatccacct 4560 taacttaatg atttttacca aaatcattag gggattcatc agtgctcagg gtcaacgaga 4620 attaacattc cgtcaggaaa gcttatgatg atgatgtgct taaaaactta ctcaatggct 4680 ggttatgcat atcgcaatac atgcgaaaaa cctaaaagag cttgccgata aaaaaggcca 4740 atttattgct atttaccgcg gctttttatt gagcttgaaa gataaataaa atagataggt 4800 tttatttgaa gctaaatctt ctttatcgta aaaaatgccc tcttgggtta tcaagagggt 4860 cattatattt cgcggaataa catcatttgg tgacgaaata actaagcact tgtctcctgt 4920 ttactcccct gagcttgagg ggttaacatg aaggtcatcg atagcaggat aataatacag 4980 taaaacgcta aaccaataat ccaaatccag ccatcccaaa ttggtagtga atgattataa 5040 ataacagcaa acagtaatgg gccaataaca ccggttgcat tggtaaggct caccaataat 5100 ccctgtaaag caccttgctg atgactcttt gtttggatag acatcactcc ctgtaatgca 5160 ggtaaagcga tcccaccacc agccaataaa attaaaacag ggaaaactaa ccaaccttca 5220 gatataaacg ctaaaaaggc aaatgcacta ctatctgcaa taaatccgag cagtactgcc 5280 gttttttcgc ccatttagtg gctattcttc ctgccacaaa ggcttggaat actgagtgta 5340 aaagaccaag acccgtaatg aaaagccaac catcatgcta ttcatcatca cgatttctgt 5400 aatagcacca caccgtgctg gattggctat caatgcgctg aaataataat caacaaatgg 5460 catcgttaaa taagtgatgt ataccgatca gcttttgttc cctttagtga gggttaattg 5520 cgcgcttggc gtaatcatgg tcatagctgt ttcctgtgtg aaattgttat ccgctcacaa 5580 ttccacacaa catacgagcc ggaagcataa agtgtaaagc ctggggtgcc taatgagtga 5640 gctaactcac attaattgcg ttgcgctcac tgcccgcttt ccagtcggga aacctgtcgt 5700 gccagctgca ttaatgaatc ggccaacgcg cggggagagg cggtttgcgt attgggcgct 5760 cttccgcttc ctcgctcact gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat 5820 cagctcactc aaaggcggta atacggttat ccacagaatc aggggataac gcaggaaaga 5880 acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt 5940 ttttccatag gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt 6000 ggcgaaaccc gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc 6060 gctctcctgt tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa 6120 gcgtggcgct ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct 6180 ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta 6240 actatcgtct tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg 6300 gtaacaggat tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc 6360 ctaactacgg ctacactaga aggacagtat ttggtatctg cgctctgctg aagccagtta 6420 ccttcggaaa aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg 6480 gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt 6540 tgatcttttc tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg 6600 tcatgagatt atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta 6660 aatcaatcta aagtatatat gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg 6720 aggcacctat ctcagcgatc tgtctatttc gttcatccat agttgcctga ctccccgtcg 6780 tgtagataac tacgatacgg gagggcttac catctggccc cagtgctgca atgataccgc 6840 gagacccacg ctcaccggct ccagatttat cagcaataaa ccagccagcc ggaagggccg 6900 agcgcagaag tggtcctgca actttatccg cctccatcca gtctattaat tgttgccggg 6960 aagctagagt aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc attgctacag 7020 gcatcgtggt gtcacgctcg tcgtttggta tggcttcatt cagctccggt tcccaacgat 7080 caaggcgagt tacatgatcc cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc 7140 cgatcgttgt cagaagtaag ttggccgcag tgttatcact catggttatg gcagcactgc 7200 ataattctct tactgtcatg ccatccgtaa gatgcttttc tgtgactggt gagtactcaa 7260 ccaagtcatt ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg gcgtcaatac 7320 gggataatac cgcgccacat agcagaactt taaaagtgct catcattgga aaacgttctt 7380 cggggcgaaa actctcaagg atcttaccgc tgttgagatc cagttcgatg taacccactc 7440 gtgcacccaa ctgatcttca gcatctttta ctttcaccag cgtttctggg tgagcaaaaa 7500 caggaaggca aaatgccgca aaaaagggaa taagggcgac acggaaatgt tgaatactca 7560 tactcttcct ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat 7620 acatatttga atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa 7680 aagtgccac 7689 4 7 DNA Artificial Sequence Synthetic 4 accatgg 7 5 7 DNA Artificial Sequence Synthetic 5 accatgt 7 6 7 DNA Artificial Sequence Synthetic 6 aagatgt 7 7 7 DNA Artificial Sequence Synthetic 7 acgatga 7 8 7 DNA Artificial Sequence Synthetic 8 aagatgg 7 9 7 DNA Artificial Sequence Synthetic 9 gacatga 7 10 7 DNA Artificial Sequence Synthetic 10 accatga 7 11 315 DNA Gallus sp. 11 tctgccattg ctgcttcctc tgcccttcct cgtcactctg aatgtggctt cttcgctact 60 gccacagcaa gaaataaaat ctcaacatct aaatgggttt cctgaggttt ttcaagagtc 120 gttaagcaca ttccttcccc agcacccctt gctgcaggcc agtgccaggc accaacttgg 180 ctactgctgc ccatgagaga aatccagttc aatattttcc aaagcaaaat ggattacata 240 tgccctagat cctgattaac aggcgtttgt attatctagt gctttcgctt cacccagatt 300 atcccattgc ctccc 315 12 361 DNA Artificial Sequence Synthetic 12 ggcgcctgga tccagatcac ttctggctaa taaaagatca gagctctaga gatctgtgtg 60 ttggtttttt gtggatctgc tgtgccttct agttgccagc catctgttgt ttgcccctcc 120 cccgtgcctt ccttgaccct ggaaggtgcc actcccactg tcctttccta ataaaatgag 180 gaaattgcat cgcattgtct gagtaggtgt cattctattc tggggggtgg ggtggggcag 240 cacagcaagg gggaggattg ggaagacaat agcaggcatg ctggggatgc ggtgggctct 300 atgggtacct ctctctctct ctctctctct ctctctctct ctctctctcg gtacctctct 360 c 361 13 350 DNA Artificial Sequence Synthetic 13 ggggatcgct ctagagcgat ccgggatctc gggaaaagcg ttggtgacca aaggtgcctt 60 ttatcatcac tttaaaaata aaaaacaatt actcagtgcc tgttataagc agcaattaat 120 tatgattgat gcctacatca caacaaaaac tgatttaaca aatggttggt ctgccttaga 180 aagtatattt gaacattatc ttgattatat tattgataat aataaaaacc ttatccctat 240 ccaagaagtg atgcctatca ttggttggaa tgaacttgaa aaaaattagc cttgaataca 300 ttactggtaa ggtaaacgcc attgtcagca aattgatcca agagaaccaa 350 14 908 DNA Artificial Sequence Synthetic 14 tgaatgtgtt cttgtgttat caatataaat cacagttagt gatgaagttg gctgcaagcc 60 tgcatcagtt cagctacttg gctgcatttt gtatttggtt ctgtaggaaa tgcaaaaggt 120 tctaggctga cctgcacttc tatccctctt gccttactgc tgagaatctc tgcaggtttt 180 aattgttcac attttgctcc catttacttt ggaagataaa atatttacag aatgcttatg 240 aaacctttgt tcatttaaaa atattcctgg tcagcgtgac cggagctgaa agaacacatt 300 gatcccgtga tttcaataaa tacatatgtt ccatatattg tttctcagta gcctcttaaa 360 tcatgtgcgt tggtgcacat atgaatacat gaatagcaaa ggtttatctg gattacgctc 420 tggcctgcag gaatggccat aaaccaaagc tgagggaaga gggagagtat agtcaatgta 480 gattatactg attgctgatt gggttattat cagctagata acaacttggg tcaggtgcca 540 ggtcaacata acctgggcaa aaccagtctc atctgtggca ggaccatgta ccagcagcca 600 gccgtgaccc aatctaggaa agcaagtagc acatcaattt taaatttatt gtaaatgccg 660 tagtagaagt gttttactgt gatacattga aacttctggt caatcagaaa aaggtttttt 720 atcagagatg ccaaggtatt atttgatttt ctttattcgc cgtgaagaga atttatgatt 780 gcaaaaagag gagtgtttac ataaactgat aaaaaacttg aggaattcag cagaaaacag 840 ccacgtgttc ctgaacattc ttccataaaa gtctcaccat gcctggcaga gccctattca 900 ccttcgct 908 15 901 DNA Gallus 15 gaggtcagaa tggtttcttt actgtttgtc aattctatta tttcaataca gaacaatagc 60 ttctataact gaaatatatt tgctattgta tattatgatt gtccctcgaa ccatgaacac 120 tcctccagct gaatttcaca attcctctgt catctgccag gccattaagt tattcatgga 180 agatctttga ggaacactgc aagttcatat cataaacaca tttgaaattg agtattgttt 240 tgcattgtat ggagctatgt tttgctgtat cctcagaaaa aaagtttgtt ataaagcatt 300 cacacccata aaaagataga tttaaatatt ccagctatag gaaagaaagt gcgtctgctc 360 ttcactctag tctcagttgg ctccttcaca tgcatgcttc tttatttctc ctattttgtc 420 aagaaaataa taggtcacgt cttgttctca cttatgtcct gcctagcatg gctcagatgc 480 acgttgtaga tacaagaagg atcaaatgaa acagacttct ggtctgttac tacaaccata 540 gtaataagca cactaactaa taattgctaa ttatgttttc catctctaag gttcccacat 600 ttttctgttt tcttaaagat cccattatct ggttgtaact gaagctcaat ggaacatgag 660 caatatttcc cagtcttctc tcccatccaa cagtcctgat ggattagcag aacaggcaga 720 aaacacattg ttacccagaa ttaaaaacta atatttgctc tccattcaat ccaaaatgga 780 cctattgaaa ctaaaatcta acccaatccc attaaatgat ttctatggcg tcaaaggtca 840 aacttctgaa gggaacctgt gggtgggtca caattcaggc tatatattcc ccagggctca 900 g 901 16 680 DNA Gallus 16 ccgggctgca gaaaaatgcc aggtggacta tgaactcaca tccaaaggag cttgacctga 60 tacctgattt tcttcaaact ggggaaacaa cacaatccca caaaacagct cagagagaaa 120 ccatcactga tggctacagc accaaggtat gcaatggcaa tccattcgac attcatctgt 180 gacctgagca aaatgattta tctctccatg aatggttgct tctttccctc atgaaaaggc 240 aatttccaca ctcacaatat gcaacaaaga caaacagaga acaattaatg tgctccttcc 300 taatgtcaaa attgtagtgg caaagaggag aacaaaatct caagttctga gtaggtttta 360 gtgattggat aagaggcttt gacctgtgag ctcacctgga cttcatatcc ttttggataa 420 aaagtgcttt tataactttc aggtctccga gtctttattc atgagactgt tggtttaggg 480 acagacccac aatgaaatgc ctggcatagg aaagggcagc agagccttag ctgacctttt 540 cttgggacaa gcattgtcaa acaatgtgtg acaaaactat ttgtactgct ttgcacagct 600 gtgctgggca gggcaatcca ttgccaccta tcccaggtaa ccttccaact gcaagaagat 660 tgttgcttac tctctctaga 680 17 72 DNA Artificial Sequence Synthetic 17 gtggatcaac atacagctag aaagctgtat tgcctttagc actcaagctc aaaagacaac 60 tcagagttca cc 72 18 62 DNA Artificial Sequence Synthetic 18 acatacagct agaaagctgt attgccttta gcactcaagc tcaaaagaca actcagagtt 60 ca 62 19 1158 DNA Gallus 19 atgggctcca tcggcgcagc aagcatggaa ttttgttttg atgtattcaa ggagctcaaa 60 gtccaccatg ccaatgagaa catcttctac tgccccattg ccatcatgtc agctctagcc 120 atggtatacc tgggtgcaaa agacagcacc aggacacaga taaataaggt tgttcgcttt 180 gataaacttc caggattcgg agacagtatt gaagctcagt gtggcacatc tgtaaacgtt 240 cactcttcac ttagagacat cctcaaccaa atcaccaaac caaatgatgt ttattcgttc 300 agccttgcca gtagacttta tgctgaagag agatacccaa tcctgccaga atacttgcag 360 tgtgtgaagg aactgtatag aggaggcttg gaacctatca actttcaaac agctgcagat 420 caagccagag agctcatcaa ttcctgggta gaaagtcaga caaatggaat tatcagaaat 480 gtccttcagc caagctccgt ggattctcaa actgcaatgg ttctggttaa tgccattgtc 540 ttcaaaggac tgtgggagaa aacatttaag gatgaagaca cacaagcaat gcctttcaga 600 gtgactgagc aagaaagcaa acctgtgcag atgatgtacc agattggttt atttagagtg 660 gcatcaatgg cttctgagaa aatgaagatc ctggagcttc catttgccag tgggacaatg 720 agcatgttgg tgctgttgcc tgatgaagtc tcaggccttg agcagcttga gagtataatc 780 aactttgaaa aactgactga atggaccagt tctaatgtta tggaagagag gaagatcaaa 840 gtgtacttac ctcgcatgaa gatggaggaa aaatacaacc tcacatctgt cttaatggct 900 atgggcatta ctgacgtgtt tagctcttca gccaatctgt ctggcatctc ctcagcagag 960 agcctgaaga tatctcaagc tgtccatgca gcacatgcag aaatcaatga agcaggcaga 1020 gaggtggtag ggtcagcaga ggctggagtg gatgctgcaa gcgtctctga agaatttagg 1080 gctgaccatc cattcctctt ctgtatcaag cacatcgcaa ccaacgccgt tctcttcttt 1140 ggcagatgtg tttcccct 1158 20 53 DNA Gallus 20 atgggctcca tcggcgcagc aagcatggaa ttttgttttg atgtattcaa gga 53 21 103 DNA Gallus 21 atgggctcca tcggcgcagc aagcatggaa ttttgttttg atgtattcaa ggagctcaaa 60 gtccaccatg ccaatgagaa catcttctac tgccccattg cca 103 22 63 DNA Artificial Sequence Synthetic 22 atgaggggga tcatactggc attagtgctc acccttgtag gcagccagaa gtttgacatt 60 ggt 63 23 260 DNA Artificial Sequence Synthetic 23 tttgtgaacc aacacctgtg cggctcacac ctggtggaag ctctctacct agtgtgcggg 60 gaacgaggct tcttctacac acccaagacc cgccgggagg cagaggacct gcaggtgggg 120 caggtggagc tgggcggggg ccctggtgca ggcagcctgc agcccttggc cctggagggg 180 tccctgcaga agcgtggcat tgtggaacaa tgctgtacca gcatctgctc cctctaccag 240 ctggagaact ctgcaactag 260 24 9 DNA Artificial Sequence Synthetic 24 kykkakkak 9 25 39 DNA Artificial Sequence Synthetic 25 aaatacaaaa aagcactgaa aaaactggca aaactgctg 39 26 4 PRT Artificial Sequence Synthetic 26 Gly Pro Gly Gly 1 27 12 PRT Artificial Sequence Synthetic 27 Gly Pro Gly Gly Gly Pro Gly Gly Gly Pro Gly Gly 1 5 10 28 15 PRT Artificial Sequence Synthetic 28 Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 1 5 10 15 29 20 PRT Artificial Sequence Synthetic 29 Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 1 5 10 15 Gly Gly Gly Ser 20 30 5 PRT Artificial Sequence Synthetic 30 Pro Ala Asp Asp Ala 1 5 31 29 PRT Artificial Sequence Synthetic 31 Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro 1 5 10 15 Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp 20 25 32 16 PRT Artificial Sequence Synthetic 32 Ala Thr Thr Cys Ile Leu Lys Gly Ser Cys Gly Trp Ile Gly Leu Leu 1 5 10 15 33 30 PRT Artificial Sequence Synthetic 33 Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Thr Thr Cys Ile Leu Lys 1 5 10 15 Gly Ser Cys Gly Trp Ile Gly Leu Leu Asp Asp Asp Asp Lys 20 25 30 34 5 PRT Artificial Sequence Synthetic 34 Asp Asp Asp Asp Lys 1 5 35 50 PRT Artificial Sequence Synthetic 35 Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro 1 5 10 15 Ala Asp Asp Ala Pro Ala Asp Asp Ala Pro Ala Asp Asp Ala Thr Thr 20 25 30 Cys Ile Leu Lys Gly Ser Cys Gly Trp Ile Gly Leu Leu Asp Asp Asp 35 40 45 Asp Lys 50 36 48 DNA Artificial Sequence Synthetic 36 atctcgagac catgtgtgaa cttgatattt tacatgattc tctttacc 48 37 36 DNA Artificial Sequence Synthetic 37 gattgatcat tatcataatt tccccaaagc gtaacc 36 38 6 DNA Artificial Sequence Synthetic 38 ctcgag 6 39 6 DNA Artificial Sequence Synthetic 39 tgatca 6 40 22 DNA Artificial Sequence Synthetic 40 ttgccggcat cagattggct at 22 41 34 DNA Artificial Sequence Synthetic 41 agaggtcacc gggtcaattc ttcagcacct ggta 34 42 10512 DNA Artificial Sequence Synthetic 42 ctgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 60 ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 120 ccacgttcgc cggcatcaga ttggctattg gccattgcat acgttgtatc catatcataa 180 tatgtacatt tatattggct catgtccaac attaccgcca tgttgacatt gattattgac 240 tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg 300 cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt 360 gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca 420 atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc 480 aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta 540 catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac 600 catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg 660 atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg 720 ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt 780 acggtgggag gtctatataa gcagagctcg tttagtgaac cgtcagatcg cctggagacg 840 ccatccacgc tgttttgacc tccatagaag acaccgggac cgatccagcc tccgcggccg 900 ggaacggtgc attggaacgc ggattccccg tgccaagagt gacgtaagta ccgcctatag 960 actctatagg cacacccctt tggctcttat gcatgctata ctgtttttgg cttggggcct 1020 atacaccccc gcttccttat gctataggtg atggtatagc ttagcctata ggtgtgggtt 1080 attgaccatt attgaccact cccctattgg tgacgatact ttccattact aatccataac 1140 atggctcttt gccacaacta tctctattgg ctatatgcca atactctgtc cttcagagac 1200 tgacacggac tctgtatttt tacaggatgg ggtcccattt attatttaca aattcacata 1260 tacaacaacg ccgtcccccg tgcccgcagt ttttattaaa catagcgtgg gatctccacg 1320 cgaatctcgg gtacgtgttc cggacatggg ctcttctccg gtagcggcgg agcttccaca 1380 tccgagccct ggtcccatgc ctccagcggc tcatggtcgc tcggcagctc cttgctccta 1440 acagtggagg ccagacttag gcacagcaca atgcccacca ccaccagtgt gccgcacaag 1500 gccgtggcgg tagggtatgt gtctgaaaat gagcgtggag attgggctcg cacggctgac 1560 gcagatggaa gacttaaggc agcggcagaa gaagatgcag gcagctgagt tgttgtattc 1620 tgataagagt cagaggtaac tcccgttgcg gtgctgttaa cggtggaggg cagtgtagtc 1680 tgagcagtac tcgttgctgc cgcgcgcgcc accagacata atagctgaca gactaacaga 1740 ctgttccttt ccatgggtct tttctgcagt caccgtcgga ccatgtgtga acttgatatt 1800 ttacatgatt ctctttacca attctgcccc gaattacact taaaacgact caacagctta 1860 acgttggctt gccacgcatt acttgactgt aaaactctca ctcttaccga acttggccgt 1920 aacctgccaa ccaaagcgag aacaaaacat aacatcaaac gaatcgaccg attgttaggt 1980 aatcgtcacc tccacaaaga gcgactcgct gtataccgtt ggcatgctag ctttatctgt 2040 tcgggaatac gatgcccatt gtacttgttg actggtctga tattcgtgag caaaaacgac 2100 ttatggtatt gcgagcttca gtcgcactac acggtcgttc tgttactctt tatgagaaag 2160 cgttcccgct ttcagagcaa tgttcaaaga aagctcatga ccaatttcta gccgaccttg 2220 cgagcattct accgagtaac accacaccgc tcattgtcag tgatgctggc tttaaagtgc 2280 catggtataa atccgttgag aagctgggtt ggtactggtt aagtcgagta agaggaaaag 2340 tacaatatgc agacctagga gcggaaaact ggaaacctat cagcaactta catgatatgt 2400 catctagtca ctcaaagact ttaggctata agaggctgac taaaagcaat ccaatctcat 2460 gccaaattct attgtataaa tctcgctcta aaggccgaaa aaatcagcgc tcgacacgga 2520 ctcattgtca ccacccgtca cctaaaatct actcagcgtc ggcaaaggag ccatgggttc 2580 tagcaactaa cttacctgtt gaaattcgaa cacccaaaca acttgttaat atctattcga 2640 agcgaatgca gattgaagaa accttccgag acttgaaaag tcctgcctac ggactaggcc 2700 tacgccatag ccgaacgagc agctcagagc gttttgatat catgctgcta atcgccctga 2760 tgcttcaact aacatgttgg cttgcgggcg ttcatgctca gaaacaaggt tgggacaagc 2820 acttccaggc taacacagtc agaaatcgaa acgtactctc aacagttcgc ttaggcatgg 2880 aagttttgcg gcattctggc tacacaataa caagggaaga cttactcgtg gctgcaaccc 2940 tactagctca aaatttattc acacatggtt acgctttggg gaaattatga taatgatcca 3000 gatcacttct ggctaataaa agatcagagc tctagagatc tgtgtgttgg ttttttgtgg 3060 atctgctgtg ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt 3120 gaccctggaa ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca 3180 ttgtctgagt aggtgtcatt ctattctggg gggtggggtg gggcagcaca gcaaggggga 3240 ggattgggaa gacaatagca ggcatgctgg ggatgcggtg ggctctatgg gtacctctct 3300 ctctctctct ctctctctct ctctctctct ctctcggtac ctctctctct ctctctctct 3360 ctctctctct ctctctctct cggtaccagg tgctgaagaa ttgacccggt gaccaaaggt 3420 gccttttatc atcactttaa aaataaaaaa caattactca gtgcctgtta taagcagcaa 3480 ttaattatga ttgatgccta catcacaaca aaaactgatt taacaaatgg ttggtctgcc 3540 ttagaaagta tatttgaaca ttatcttgat tatattattg ataataataa aaaccttatc 3600 cctatccaag aagtgatgcc tatcattggt tggaatgaac ttgaaaaaaa ttagccttga 3660 atacattact ggtaaggtaa acgccattgt cagcaaattg atccaagaga accaacttaa 3720 agctttcctg acggaatgtt aattctcgtt gaccctgagc actgatgaat cccctaatga 3780 ttttggtaaa aatcattaag ttaaggtgga tacacatctt gtcatatgat cccggtaatg 3840 tgagttagct cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt 3900 tgtgtggaat tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg 3960 ccaagcgcgc aattaaccct cactaaaggg aacaaaagct ggagctccac cgcggtggcg 4020 gccgctctag aactagtgga tcccccgggg aggtcagaat ggtttcttta ctgtttgtca 4080 attctattat ttcaatacag aacaatagct tctataactg aaatatattt gctattgtat 4140 attatgattg tccctcgaac catgaacact cctccagctg aatttcacaa ttcctctgtc 4200 atctgccagg ccattaagtt attcatggaa gatctttgag gaacactgca agttcatatc 4260 ataaacacat ttgaaattga gtattgtttt gcattgtatg gagctatgtt ttgctgtatc 4320 ctcagaaaaa aagtttgtta taaagcattc acacccataa aaagatagat ttaaatattc 4380 cagctatagg aaagaaagtg cgtctgctct tcactctagt ctcagttggc tccttcacat 4440 gcatgcttct ttatttctcc tattttgtca agaaaataat aggtcacgtc ttgttctcac 4500 ttatgtcctg cctagcatgg ctcagatgca cgttgtagat acaagaagga tcaaatgaaa 4560 cagacttctg gtctgttact acaaccatag taataagcac actaactaat aattgctaat 4620 tatgttttcc atctctaagg ttcccacatt tttctgtttt cttaaagatc ccattatctg 4680 gttgtaactg aagctcaatg gaacatgagc aatatttccc agtcttctct cccatccaac 4740 agtcctgatg gattagcaga acaggcagaa aacacattgt tacccagaat taaaaactaa 4800 tatttgctct ccattcaatc caaaatggac ctattgaaac taaaatctaa cccaatccca 4860 ttaaatgatt tctatggcgt caaaggtcaa acttctgaag ggaacctgtg ggtgggtcac 4920 aattcaggct atatattccc cagggctcag cggatccatg ggctccatcg gcgcagcaag 4980 catggaattt tgttttgatg tattcaagga gctcaaagtc caccatgcca atgagaacat 5040 cttctactgc cccattgcca tcatgtcagc tctagccatg gtatacctgg gtgcaaaaga 5100 cagcaccagg acacagataa ataaggttgt tcgctttgat aaacttccag gattcggaga 5160 cagtattgaa gctcagtgtg gcacatctgt aaacgttcac tcttcactta gagacatcct 5220 caaccaaatc accaaaccaa atgatgttta ttcgttcagc cttgccagta gactttatgc 5280 tgaagagaga tacccaatcc tgccagaata cttgcagtgt gtgaaggaac tgtatagagg 5340 aggcttggaa cctatcaact ttcaaacagc tgcagatcaa gccagagagc tcatcaattc 5400 ctgggtagaa agtcagacaa atggaattat cagaaatgtc cttcagccaa gctccgtgga 5460 ttctcaaact gcaatggttc tggttaatgc cattgtcttc aaaggactgt gggagaaaac 5520 atttaaggat gaagacacac aagcaatgcc tttcagagtg actgagcaag aaagcaaacc 5580 tgtgcagatg atgtaccaga ttggtttatt tagagtggca tcaatggctt ctgagaaaat 5640 gaagatcctg gagcttccat ttgccagtgg gacaatgagc atgttggtgc tgttgcctga 5700 tgaagtctca ggccttgagc agcttgagag tataatcaac tttgaaaaac tgactgaatg 5760 gaccagttct aatgttatgg aagagaggaa gatcaaagtg tacttacctc gcatgaagat 5820 ggaggaaaaa tacaacctca catctgtctt aatggctatg ggcattactg acgtgtttag 5880 ctcttcagcc aatctgtctg gcatctcctc agcagagagc ctgaagatat ctcaagctgt 5940 ccatgcagca catgcagaaa tcaatgaagc aggcagagag gtggtagggt cagcagaggc 6000 tggagtggat gctgcaagcg tctctgaaga atttagggct gaccatccat tcctcttctg 6060 tatcaagcac atcgcaacca acgccgttct cttctttggc agatgtgttt cccctccgcg 6120 gccagcagat gacgcaccag cagatgacgc accagcagat gacgcaccag cagatgacgc 6180 accagcagat gacgcaccag cagatgacgc aacaacatgt atcctgaaag gctcttgtgg 6240 ctggatcggc ctgctggatg acgatgacaa atttgtgaac caacacctgt gcggctcaca 6300 cctggtggaa gctctctacc tagtgtgcgg ggaacgaggc ttcttctaca cacccaagac 6360 ccgccgggag gcagaggacc tgcaggtggg gcaggtggag ctgggcgggg gccctggtgc 6420 aggcagcctg cagcccttgg ccctggaggg gtccctgcag aagcgtggca ttgtggaaca 6480 atgctgtacc agcatctgct ccctctacca gctggagaac tactgcaact agggcgcctg 6540 gatccagatc acttctggct aataaaagat cagagctcta gagatctgtg tgttggtttt 6600 ttgtggatct gctgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc 6660 ttccttgacc ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc 6720 atcgcattgt ctgagtaggt gtcattctat tctggggggt ggggtggggc agcacagcaa 6780 gggggaggat tgggaagaca atagcaggca tgctggggat gcggtgggct ctatgggtac 6840 ctctctctct ctctctctct ctctctctct ctctctctct cggtacctct ctcgaggggg 6900 ggcccggtac ccaattcgcc ctatagtgag tcgtattacg cgcgctcact ggccgtcgtt 6960 ttacaacgtc gtgactggga aaaccctggc gttacccaac ttaatcgcct tgcagcacat 7020 ccccctttcg ccagctggcg taatagcgaa gaggcccgca ccgatcgccc ttcccaacag 7080 ttgcgcagcc tgaatggcga atggaaattg taagcgttaa tattttgtta aaattcgcgt 7140 taaatttttg ttaaatcagc tcatttttta accaataggc cgaaatcggc aaaatccctt 7200 ataaatcaaa agaatagacc gagatagggt tgagtgttgt tccagtttgg aacaagagtc 7260 cactattaaa gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg 7320 gcccactact ccgggatcat atgacaagat gtgtatccac cttaacttaa tgatttttac 7380 caaaatcatt aggggattca tcagtgctca gggtcaacga gaattaacat tccgtcagga 7440 aagcttatga tgatgatgtg cttaaaaact tactcaatgg ctggttatgc atatcgcaat 7500 acatgcgaaa aacctaaaag agcttgccga taaaaaaggc caatttattg ctatttaccg 7560 cggcttttta ttgagcttga aagataaata aaatagatag gttttatttg aagctaaatc 7620 ttctttatcg taaaaaatgc cctcttgggt tatcaagagg gtcattatat ttcgcggaat 7680 aacatcattt ggtgacgaaa taactaagca cttgtctcct gtttactccc ctgagcttga 7740 ggggttaaca tgaaggtcat cgatagcagg ataataatac agtaaaacgc taaaccaata 7800 atccaaatcc agccatccca aattggtagt gaatgattat aaataacagc aaacagtaat 7860 gggccaataa caccggttgc attggtaagg ctcaccaata atccctgtaa agcaccttgc 7920 tgatgactct ttgtttggat agacatcact ccctgtaatg caggtaaagc gatcccacca 7980 ccagccaata aaattaaaac agggaaaact aaccaacctt cagatataaa cgctaaaaag 8040 gcaaatgcac tactatctgc aataaatccg agcagtactg ccgttttttc gccccattta 8100 gtggctattc ttcctgccac aaaggcttgg aatactgagt gtaaaagacc aagacccgct 8160 aatgaaaagc caaccatcat gctattccat ccaaaacgat tttcggtaaa tagcacccac 8220 accgttgcgg gaatttggcc tatcaattgc gctgaaaaat aaataatcaa caaaatggca 8280 tcgttttaaa taaagtgatg tataccgaat tcagcttttg ttccctttag tgagggttaa 8340 ttgcgcgctt ggcgtaatca tggtcatagc tgtttcctgt gtgaaattgt tatccgctca 8400 caattccaca caacatacga gccggaagca taaagtgtaa agcctggggt gcctaatgag 8460 tgagctaact cacattaatt gcgttgcgct cactgcccgc tttccagtcg ggaaacctgt 8520 cgtgccagct gcattaatga atcggccaac gcgcggggag aggcggtttg cgtattgggc 8580 gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg 8640 tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa 8700 agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg 8760 cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga 8820 ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg 8880 tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg 8940 gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc 9000 gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg 9060 gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca 9120 ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt 9180 ggcctaacta cggctacact agaaggacag tatttggtat ctgcgctctg ctgaagccag 9240 ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg 9300 gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc 9360 ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt 9420 tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt 9480 ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca 9540 gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg 9600 tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac 9660 cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg 9720 ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc 9780 gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta 9840 caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac 9900 gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc 9960 ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac 10020 tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact 10080 caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa 10140 tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt 10200 cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca 10260 ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa 10320 aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac 10380 tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg 10440 gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc 10500 gaaaagtgcc ac 10512 43 11255 DNA Artificial Sequence Synthetic 43 ctgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 60 ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 120 ccacgttcgc cggcatcaga ttggctattg gccattgcat acgttgtatc catatcataa 180 tatgtacatt tatattggct catgtccaac attaccgcca tgttgacatt gattattgac 240 tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg 300 cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt 360 gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca 420 atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc 480 aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta 540 catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac 600 catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg 660 atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg 720 ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt 780 acggtgggag gtctatataa gcagagctcg tttagtgaac cgtcagatcg cctggagacg 840 ccatccacgc tgttttgacc tccatagaag acaccgggac cgatccagcc tccgcggccg 900 ggaacggtgc attggaacgc ggattccccg tgccaagagt gacgtaagta ccgcctatag 960 actctatagg cacacccctt tggctcttat gcatgctata ctgtttttgg cttggggcct 1020 atacaccccc gcttccttat gctataggtg atggtatagc ttagcctata ggtgtgggtt 1080 attgaccatt attgaccact cccctattgg tgacgatact ttccattact aatccataac 1140 atggctcttt gccacaacta tctctattgg ctatatgcca atactctgtc cttcagagac 1200 tgacacggac tctgtatttt tacaggatgg ggtcccattt attatttaca aattcacata 1260 tacaacaacg ccgtcccccg tgcccgcagt ttttattaaa catagcgtgg gatctccacg 1320 cgaatctcgg gtacgtgttc cggacatggg ctcttctccg gtagcggcgg agcttccaca 1380 tccgagccct ggtcccatgc ctccagcggc tcatggtcgc tcggcagctc cttgctccta 1440 acagtggagg ccagacttag gcacagcaca atgcccacca ccaccagtgt gccgcacaag 1500 gccgtggcgg tagggtatgt gtctgaaaat gagcgtggag attgggctcg cacggctgac 1560 gcagatggaa gacttaaggc agcggcagaa gaagatgcag gcagctgagt tgttgtattc 1620 tgataagagt cagaggtaac tcccgttgcg gtgctgttaa cggtggaggg cagtgtagtc 1680 tgagcagtac tcgttgctgc cgcgcgcgcc accagacata atagctgaca gactaacaga 1740 ctgttccttt ccatgggtct tttctgcagt caccgtcgga ccatgtgtga acttgatatt 1800 ttacatgatt ctctttacca attctgcccc gaattacact taaaacgact caacagctta 1860 acgttggctt gccacgcatt acttgactgt aaaactctca ctcttaccga acttggccgt 1920 aacctgccaa ccaaagcgag aacaaaacat aacatcaaac gaatcgaccg attgttaggt 1980 aatcgtcacc tccacaaaga gcgactcgct gtataccgtt ggcatgctag ctttatctgt 2040 tcgggcaata cgatgcccat tgtacttgtt gactggtctg atattcgtga gcaaaaacga 2100 cttatggtat tgcgagcttc agtcgcacta cacggtcgtt ctgttactct ttatgagaaa 2160 gcgttcccgc tttcagagca atgttcaaag aaagctcatg accaatttct agccgacctt 2220 gcgagcattc taccgagtaa caccacaccg ctcattgtca gtgatgctgg ctttaaagtg 2280 ccatggtata aatccgttga gaagctgggt tggtactggt taagtcgagt aagaggaaaa 2340 gtacaatatg cagacctagg agcggaaaac tggaaaccta tcagcaactt acatgatatg 2400 tcatctagtc actcaaagac tttaggctat aagaggctga ctaaaagcaa tccaatctca 2460 tgccaaattc tattgtataa atctcgctct aaaggccgaa aaaatcagcg ctcgacacgg 2520 actcattgtc accacccgtc acctaaaatc tactcagcgt cggcaaagga gccatgggtt 2580 ctagcaacta acttacctgt tgaaattcga acacccaaac aacttgttaa tatctattcg 2640 aagcgaatgc agattgaaga aaccttccga gacttgaaaa gtcctgccta cggactaggc 2700 ctacgccata gccgaacgag cagctcagag cgttttgata tcatgctgct aatcgccctg 2760 atgcttcaac taacatgttg gcttgcgggc gttcatgctc agaaacaagg ttgggacaag 2820 cacttccagg ctaacacagt cagaaatcga aacgtactct caacagttcg cttaggcatg 2880 gaagttttgc ggcattctgg ctacacaata acaagggaag acttactcgt ggctgcaacc 2940 ctactagctc aaaatttatt cacacatggt tacgctttgg ggaaattatg ataatgatcc 3000 agatcacttc tggctaataa aagatcagag ctctagagat ctgtgtgttg gttttttgtg 3060 gatctgctgt gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct 3120 tgaccctgga aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc 3180 attgtctgag taggtgtcat tctattctgg ggggtggggt ggggcagcac agcaaggggg 3240 aggattggga agacaatagc aggcatgctg gggatgcggt gggctctatg ggtacctctc 3300 tctctctctc tctctctctc tctctctctc tctctcggta cctctctctc tctctctctc 3360 tctctctctc tctctctctc tcggtaccag gtgctgaaga attgacccgg tgaccaaagg 3420 tgccttttat catcacttta aaaataaaaa acaattactc agtgcctgtt ataagcagca 3480 attaattatg attgatgcct acatcacaac aaaaactgat ttaacaaatg gttggtctgc 3540 cttagaaagt atatttgaac attatcttga ttatattatt gataataata aaaaccttat 3600 ccctatccaa gaagtgatgc ctatcattgg ttggaatgaa cttgaaaaaa attagccttg 3660 aatacattac tggtaaggta aacgccattg tcagcaaatt gatccaagag aaccaactta 3720 aagctttcct gacggaatgt taattctcgt tgaccctgag cactgatgaa tcccctaatg 3780 attttggtaa aaatcattaa gttaaggtgg atacacatct tgtcatatga tcccggtaat 3840 gtgagttagc tcactcatta ggcaccccag gctttacact ttatgcttcc ggctcgtatg 3900 ttgtgtggaa ttgtgagcgg ataacaattt cacacaggaa acagctatga ccatgattac 3960 gccaagcgcg caattaaccc tcactaaagg gaacaaaagc tggagctcca ccgcggtggc 4020 ggccgctcta gaactagtgg atcccccggg catcagattg gctattggcc attgcatacg 4080 ttgtatccat atcataatat gtacatttat attggctcat gtccaacatt accgccatgt 4140 tgacattgat tattgactag ttattaatag taatcaatta cggggtcatt agttcatagc 4200 ccatatatgg agttccgcgt tacataactt acggtaaatg gcccgcctgg ctgaccgccc 4260 aacgaccccc gcccattgac gtcaataatg acgtatgttc ccatagtaac gccaataggg 4320 actttccatt gacgtcaatg ggtggagtat ttacggtaaa ctgcccactt ggcagtacat 4380 caagtgtatc atatgccaag tacgccccct attgacgtca atgacggtaa atggcccgcc 4440 tggcattatg cccagtacat gaccttatgg gactttccta cttggcagta catctacgta 4500 ttagtcatcg ctattaccat ggtgatgcgg ttttggcagt acatcaatgg gcgtggatag 4560 cggtttgact cacggggatt tccaagtctc caccccattg acgtcaatgg gagtttgttt 4620 tggcaccaaa atcaacggga ctttccaaaa tgtcgtaaca actccgcccc attgacgcaa 4680 atgggcggta ggcgtgtacg gtgggaggtc tatataagca gagctcgttt agtgaaccgt 4740 cagatcgcct ggagacgcca tccacgctgt tttgacctcc atagaagaca ccgggaccga 4800 tccagcctcc gcggccggga acggtgcatt ggaacgcgga ttccccgtgc caagagtgac 4860 gtaagtaccg cctatagact ctataggcac acccctttgg ctcttatgca tgctatactg 4920 tttttggctt ggggcctata cacccccgct tccttatgct ataggtgatg gtatagctta 4980 gcctataggt gtgggttatt gaccattatt gaccactccc ctattggtga cgatactttc 5040 cattactaat ccataacatg gctctttgcc acaactatct ctattggcta tatgccaata 5100 ctctgtcctt cagagactga cacggactct gtatttttac aggatggggt cccatttatt 5160 atttacaaat tcacatatac aacaacgccg tcccccgtgc ccgcagtttt tattaaacat 5220 agcgtgggat ctccacgcga atctcgggta cgtgttccgg acatgggctc ttctccggta 5280 gcggcggagc ttccacatcc gagccctggt cccatgcctc cagcggctca tggtcgctcg 5340 gcagctcctt gctcctaaca gtggaggcca gacttaggca cagcacaatg cccaccacca 5400 ccagtgtgcc gcacaaggcc gtggcggtag ggtatgtgtc tgaaaatgag cgtggagatt 5460 gggctcgcac ggctgacgca gatggaagac ttaaggcagc ggcagaagaa gatgcaggca 5520 gctgagttgt tgtattctga taagagtcag aggtaactcc cgttgcggtg ctgttaacgg 5580 tggagggcag tgtagtctga gcagtactcg ttgctgccgc gcgcgccacc agacataata 5640 gctgacagac taacagactg ttcctttcca tgggtctttt ctgcagtcac cgtcggatca 5700 atgggctcca tcggtgcagc aagcatggaa ttttgttttg atgtattcaa ggagctcaaa 5760 gtccaccatg ccaatgagaa catcttctac tgccccattg ccatcatgtc agctctagcc 5820 atggtatacc tgggtgcaaa agacagcacc aggacacaaa taaataaggt tgttcgcttt 5880 gataaacttc caggattcgg agacagtatt gaagctcagt gtggcacatc tgtaaacgtt 5940 cactcttcac ttagagacat cctcaaccaa atcaccaaac caaatgatgt ttattcgttc 6000 agccttgcca gtagacttta tgctgaagag agatacccaa tcctgccaga atacttgcag 6060 tgtgtgaagg aactgtatag aggaggcttg gaacctatca actttcaaac agctgcagat 6120 caagccagag agctcatcaa ttcctgggta gaaagtcaga caaatggaat tatcagaaat 6180 gtccttcagc caagctccgt ggattctcaa actgcaatgg ttctggttaa tgccattgtc 6240 ttcaaaggac tgtgggagaa agcatttaag gatgaagaca cacaagcaat gcctttcaga 6300 gtgactgagc aagaaagcaa acctgtgcag atgatgtacc agattggttt atttagagtg 6360 gcatcaatgg cttctgagaa aatgaagatc ctggagcttc catttgccag tgggacaatg 6420 agcatgttgg tgctgttgcc tgatgaagtc tcaggccttg agcagcttga gagtataatc 6480 aactttgaaa aactgactga atggaccagt tctaatgtta tggaagagag aagatcaaag 6540 tgtacttacc tcgcatgaag atggaggaaa aatacaacct cacatctgtc ttaatggcta 6600 tgggcattac tgacgtgttt agctcttcag ccaatctgtc tggcatctcc tcagcagaga 6660 gcctgaagat atctcaagct gtccatgcag cacatgcaga aatcaatgaa gcaggcagag 6720 aggtggtagg gtcagcagag gctggagtgg atgctgcaag cgtctctgaa gaatttaggg 6780 ctgaccatcc attcctcttc tgtatcaagc acatcgcaac caacgccgtt ctcttctttt 6840 ggcagatgtg tttcccgcgg ccagcagatg acgcaccagc agatgacgca ccagcagatg 6900 acgcaccagc agatgacgca ccagcagatg acgcaacaac atgtatcctg aaaggctctt 6960 gtggctggat cggcctgctg gatgacgatg acaaatttgt gaaccaacac ctgtgcggct 7020 cacacctggt ggaagctctc tacctagtgt gcggggaacg aggcttcttc tacacaccca 7080 agacccgccg ggaggcagag gacctgcagg tggggcaggt ggagctgggc gggggccctg 7140 gtgcaggcag cctgcagccc ttggccctgg aggggtccct gcagaagcgt ggcattgtgg 7200 aacaatgctg taccagcatc tgctccctct accagctgga gaactactgc aactagggcg 7260 cctaaagggc gaattatcgc ggccgctcta gaccaggcgc ctggatccag atcacttctg 7320 gctaataaaa gatcagagct ctagagatct gtgtgttggt tttttgtgga tctgctgtgc 7380 cttctagttg ccagccatct gttgtttgcc cctcccccgt gccttccttg accctggaag 7440 gtgccactcc cactgtcctt tcctaataaa atgaggaaat tgcatcgcat tgtctgagta 7500 ggtgtcattc tattctgggg ggtggggtgg ggcagcacag caagggggag gattgggaag 7560 acaatagcag gcatgctggg gatgcggtgg gctctatggg tacctctctc tctctctctc 7620 tctctctcac tctctctctc tctcggtacc tctcctcgag ggggggcccg gtacccaatt 7680 cgccctatag tgagtcgtat tacgcgcgct cactggccgt cgttttacaa cgtcgtgact 7740 gggaaaaccc tggcgttacc caacttaatc gccttgcagc acatccccct ttcgccagct 7800 ggcgtaatag cgaagaggcc cgcaccgatc gcccttccca acagttgcgc agcctgaatg 7860 gcgaatggaa attgtaagcg ttaatatttt gttaaaattc gcgttaaatt tttgttaaat 7920 cagctcattt tttaaccaat aggccgaaat cggcaaaatc ccttataaat caaaagaata 7980 gaccgagata gggttgagtg ttgttccagt ttggaacaag agtccactat taaagaacgt 8040 ggactccaac gtcaaagggc gaaaaaccgt ctatcagggc gatggcccac tactccggga 8100 tcatatgaca agatgtgtat ccaccttaac ttaatgattt ttaccaaaat cattagggga 8160 ttcatcagtg ctcagggtca acgagaatta acattccgtc aggaaagctt atgatgatga 8220 tgtgcttaaa aacttactca atggctggtt atgcatatcg caatacatgc gaaaaaccta 8280 aaagagcttg ccgataaaaa aggccaattt attgctattt accgcggctt tttattgagc 8340 ttgaaagata aataaaatag ataggtttta tttgaagcta aatcttcttt atcgtaaaaa 8400 atgccctctt gggttatcaa gagggtcatt atatttcgcg gaataacatc atttggtgac 8460 gaaataacta agcacttgtc tcctgtttac tcccctgagc ttgaggggtt aacatgaagg 8520 tcatcgatag caggataata atacagtaaa acgctaaacc aataatccaa atccagccat 8580 cccaaattgg tagtgaatga ttataaataa cagcaaacag taatgggcca ataacaccgg 8640 ttgcattggt aaggctcacc aataatccct gtaaagcacc ttgctgatga ctctttgttt 8700 ggatagacat cactccctgt aatgcaggta aagcgatccc accaccagcc aataaaatta 8760 aaacagggaa aactaaccaa ccttcagata taaacgctaa aaaggcaaat gcactactat 8820 ctgcaataaa tccgagcagt actgccgttt tttcgcccat ttagtggcta ttcttcctgc 8880 cacaaaggct tggaatactg agtgtaaaag accaagaccc gtaatgaaaa gccaaccatc 8940 atgctattca tcatcacgat ttctgtaata gcaccacacc gtgctggatt ggctatcaat 9000 gcgctgaaat aataatcaac aaatggcatc gttaaataag tgatgtatac cgatcagctt 9060 ttgttccctt tagtgagggt taattgcgcg cttggcgtaa tcatggtcat agctgtttcc 9120 tgtgtgaaat tgttatccgc tcacaattcc acacaacata cgagccggaa gcataaagtg 9180 taaagcctgg ggtgcctaat gagtgagcta actcacatta attgcgttgc gctcactgcc 9240 cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg 9300 gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 9360 ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 9420 agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 9480 ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 9540 caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 9600 gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 9660 cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta 9720 tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca 9780 gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga 9840 cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg 9900 tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg 9960 tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg 10020 caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag 10080 aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa 10140 cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat 10200 ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc 10260 tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc 10320 atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc 10380 tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc 10440 aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc 10500 catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt 10560 gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc 10620 ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa 10680 aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt 10740 atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg 10800 cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc 10860 gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa 10920 agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt 10980 gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt 11040 caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag 11100 ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta 11160 tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat 11220 aggggttccg cgcacatttc cccgaaaagt gccac 11255 44 10487 DNA Artificial Sequence Synthetic 44 ctgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 60 ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 120 ccacgttcgc cggcatcaga ttggctattg gccattgcat acgttgtatc catatcataa 180 tatgtacatt tatattggct catgtccaac attaccgcca tgttgacatt gattattgac 240 tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg 300 cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt 360 gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca 420 atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc 480 aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta 540 catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac 600 catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg 660 atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg 720 ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt 780 acggtgggag gtctatataa gcagagctcg tttagtgaac cgtcagatcg cctggagacg 840 ccatccacgc tgttttgacc tccatagaag acaccgggac cgatccagcc tccgcggccg 900 ggaacggtgc attggaacgc ggattccccg tgccaagagt gacgtaagta ccgcctatag 960 actctatagg cacacccctt tggctcttat gcatgctata ctgtttttgg cttggggcct 1020 atacaccccc gcttccttat gctataggtg atggtatagc ttagcctata ggtgtgggtt 1080 attgaccatt attgaccact cccctattgg tgacgatact ttccattact aatccataac 1140 atggctcttt gccacaacta tctctattgg ctatatgcca atactctgtc cttcagagac 1200 tgacacggac tctgtatttt tacaggatgg ggtcccattt attatttaca aattcacata 1260 tacaacaacg ccgtcccccg tgcccgcagt ttttattaaa catagcgtgg gatctccacg 1320 cgaatctcgg gtacgtgttc cggacatggg ctcttctccg gtagcggcgg agcttccaca 1380 tccgagccct ggtcccatgc ctccagcggc tcatggtcgc tcggcagctc cttgctccta 1440 acagtggagg ccagacttag gcacagcaca atgcccacca ccaccagtgt gccgcacaag 1500 gccgtggcgg tagggtatgt gtctgaaaat gagcgtggag attgggctcg cacggctgac 1560 gcagatggaa gacttaaggc agcggcagaa gaagatgcag gcagctgagt tgttgtattc 1620 tgataagagt cagaggtaac tcccgttgcg gtgctgttaa cggtggaggg cagtgtagtc 1680 tgagcagtac tcgttgctgc cgcgcgcgcc accagacata atagctgaca gactaacaga 1740 ctgttccttt ccatgggtct tttctgcagt caccgtcgga ccatgtgtga acttgatatt 1800 ttacatgatt ctctttacca attctgcccc gaattacact taaaacgact caacagctta 1860 acgttggctt gccacgcatt acttgactgt aaaactctca ctcttaccga acttggccgt 1920 aacctgccaa ccaaagcgag aacaaaacat aacatcaaac gaatcgaccg attgttaggt 1980 aatcgtcacc tccacaaaga gcgactcgct gtataccgtt ggcatgctag ctttatctgt 2040 tcgggaatac gatgcccatt gtacttgttg actggtctga tattcgtgag caaaaacgac 2100 ttatggtatt gcgagcttca gtcgcactac acggtcgttc tgttactctt tatgagaaag 2160 cgttcccgct ttcagagcaa tgttcaaaga aagctcatga ccaatttcta gccgaccttg 2220 cgagcattct accgagtaac accacaccgc tcattgtcag tgatgctggc tttaaagtgc 2280 catggtataa atccgttgag aagctgggtt ggtactggtt aagtcgagta agaggaaaag 2340 tacaatatgc agacctagga gcggaaaact ggaaacctat cagcaactta catgatatgt 2400 catctagtca ctcaaagact ttaggctata agaggctgac taaaagcaat ccaatctcat 2460 gccaaattct attgtataaa tctcgctcta aaggccgaaa aaatcagcgc tcgacacgga 2520 ctcattgtca ccacccgtca cctaaaatct actcagcgtc ggcaaaggag ccatgggttc 2580 tagcaactaa cttacctgtt gaaattcgaa cacccaaaca acttgttaat atctattcga 2640 agcgaatgca gattgaagaa accttccgag acttgaaaag tcctgcctac ggactaggcc 2700 tacgccatag ccgaacgagc agctcagagc gttttgatat catgctgcta atcgccctga 2760 tgcttcaact aacatgttgg cttgcgggcg ttcatgctca gaaacaaggt tgggacaagc 2820 acttccaggc taacacagtc agaaatcgaa acgtactctc aacagttcgc ttaggcatgg 2880 aagttttgcg gcattctggc tacacaataa caagggaaga cttactcgtg gctgcaaccc 2940 tactagctca aaatttattc acacatggtt acgctttggg gaaattatga taatgatcca 3000 gatcacttct ggctaataaa agatcagagc tctagagatc tgtgtgttgg ttttttgtgg 3060 atctgctgtg ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt 3120 gaccctggaa ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca 3180 ttgtctgagt aggtgtcatt ctattctggg gggtggggtg gggcagcaca gcaaggggga 3240 ggattgggaa gacaatagca ggcatgctgg ggatgcggtg ggctctatgg gtacctctct 3300 ctctctctct ctctctctct ctctctctct ctctcggtac ctctctctct ctctctctct 3360 ctctctctct ctctctctct cggtaccagg tgctgaagaa ttgacccggt gaccaaaggt 3420 gccttttatc atcactttaa aaataaaaaa caattactca gtgcctgtta taagcagcaa 3480 ttaattatga ttgatgccta catcacaaca aaaactgatt taacaaatgg ttggtctgcc 3540 ttagaaagta tatttgaaca ttatcttgat tatattattg ataataataa aaaccttatc 3600 cctatccaag aagtgatgcc tatcattggt tggaatgaac ttgaaaaaaa ttagccttga 3660 atacattact ggtaaggtaa acgccattgt cagcaaattg atccaagaga accaacttaa 3720 agctttcctg acggaatgtt aattctcgtt gaccctgagc actgatgaat cccctaatga 3780 ttttggtaaa aatcattaag ttaaggtgga tacacatctt gtcatatgat cccggtaatg 3840 tgagttagct cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt 3900 tgtgtggaat tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg 3960 ccaagcgcgc aattaaccct cactaaaggg aacaaaagct ggagctccac cgcggtggcg 4020 gccgctctag aactagtgga tcccccgggg aggtcagaat ggtttcttta ctgtttgtca 4080 attctattat ttcaatacag aacaaaagct tctataactg aaatatattt gctattgtat 4140 attatgattg tccctcgaac catgaacact cctccagctg aatttcacaa ttcctctgtc 4200 atctgccagg ctggaagatc atggaagatc tctgaggaac attgcaagtt cataccataa 4260 actcatttgg aattgagtat tattttgctt tgaatggagc tatgttttgc agttccctca 4320 gaagaaaagc ttgttataaa gcgtctacac ccatcaaaag atatatttaa atattccaac 4380 tacagaaaga ttttgtctgc tcttcactct gatctcagtt ggtttcttca cgtacatgct 4440 tctttatttg cctattttgt caagaaaata ataggtcaag tcctgttctc acttatctcc 4500 tgcctagcat ggcttagatg cacgttgtac attcaagaag gatcaaatga aacagacttc 4560 tggtctgtta caacaaccat agtaataaac agactaacta ataattgcta attatgtttt 4620 ccatctctaa ggttcccaca tttttctgtt ttaagatccc attatctggt tgtaactgaa 4680 gctcaatgga acatgaacag tatttctcag tcttttctcc agcaatcctg acggattaga 4740 agaactggca gaaaacactt tgttacccag aattaaaaac taatatttgc tctcccttca 4800 atccaaaatg gacctattga aactaaaatc tgacccaatc ccattaaatt atttctatgg 4860 cgtcaaaggt caaacttttg aagggaacct gtgggtgggt cccaattcag gctatatatt 4920 ccccagggct cagccagtgg atccatgggc tccatcggtg cagcaagcat ggaattttgt 4980 tttgatgtat tcaaggagct caaagtccac catgccaatg acaacatgct ctactccccc 5040 tttgccatct tgtcaactct ggccatggtc ttcctaggtg caaaagacag caccaggacc 5100 cagataaata aggttgttca ctttgataaa cttccaggat tcggagacag tattgaagct 5160 cagtgtggca catctgtaaa tgttcactct tcacttagag acatactcaa ccaaatcacc 5220 aaacaaaatg atgcttattc gttcagcctt gccagtagac tttatgctca agagacatac 5280 acagtcgtgc cggaatactt gcaatgtgtg aaggaactgt atagaggagg cttagaatcc 5340 gtcaactttc aaacagctgc agatcaagcc agaggcctca tcaatgcctg ggtagaaagt 5400 cagacaaacg gaattatcag aaacatcctt cagccaagct ccgtggattc tcaaactgca 5460 atggtcctgg ttaatgccat tgccttcaag ggactgtggg agaaagcatt taaggctgaa 5520 gacacgcaaa caataccttt cagagtgact gagcaagaaa gcaaacctgt gcagatgatg 5580 taccagattg gttcatttaa agtggcatca atggcttctg agaaaatgaa gatcctggag 5640 cttccatttg ccagtggaac aatgagcatg ttggtgctgt tgcctgatga tgtctcaggc 5700 cttgagcagc ttgagagtat aatcagcttt gaaaaactga ctgaatggac cagttctagt 5760 attatggaag agaggaaggt caaagtgtac ttacctcgca tgaagatgga ggagaaatac 5820 aacctcacat ctctcttaat ggctatggga attactgacc tgttcagctc ttcagccaat 5880 ctgtctggca tctcctcagt agggagcctg aagatatctc aagctgtcca tgcagcacat 5940 gcagaaatca atgaagcggg cagagatgtg gtaggctcag cagaggctgg agtggatgct 6000 actgaagaat ttagggctga ccatccattc ctcttctgtg tcaagcacat cgaaaccaac 6060 gccattctcc tctttggcag atgtgtttct ccgcggccag cagatgacgc accagcagat 6120 gacgcaccag cagatgacgc accagcagat gacgcaccag cagatgacgc accagcagat 6180 gacgcaacaa catgtatcct gaaaggctct tgtggctgga tcggcctgct ggatgacgat 6240 gacaaatttg tgaaccaaca cctgtgcggc tcacacctgg tggaagctct ctacctagtg 6300 tgcggggaac gaggcttctt ctacacaccc aagacccgcc gggaggcaga ggacctgcag 6360 gtggggcagg tggagctggg cgggggccct ggtgcaggca gcctgcagcc cttggccctg 6420 gaggggtccc tgcagaagcg tggcattgtg gaacaatgct gtaccagcat ctgctccctc 6480 taccagctgg agaactactg caactagggc gcctggatcc agatcacttc tggctaataa 6540 aagatcagag ctctagagat ctgtgtgttg gttttttgtg gatctgctgt gccttctagt 6600 tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 6660 cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 6720 tctattctgg ggggtggggt ggggcagcac agcaaggggg aggattggga agacaatagc 6780 aggcatgctg gggatgcggt gggctctatg ggtacctctc tctctctctc tctctctctc 6840 tctctctctc tctctcggta cctctctcga gggggggccc ggtacccaat tcgccctata 6900 gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca acgtcgtgac tgggaaaacc 6960 ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata 7020 gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatgga 7080 aattgtaagc gttaatattt tgttaaaatt cgcgttaaat ttttgttaaa tcagctcatt 7140 ttttaaccaa taggccgaaa tcggcaaaat cccttataaa tcaaaagaat agaccgagat 7200 agggttgagt gttgttccag tttggaacaa gagtccacta ttaaagaacg tggactccaa 7260 cgtcaaaggg cgaaaaaccg tctatcaggg cgatggccca ctactccggg atcatatgac 7320 aagatgtgta tccaccttaa cttaatgatt tttaccaaaa tcattagggg attcatcagt 7380 gctcagggtc aacgagaatt aacattccgt caggaaagct tatgatgatg atgtgcttaa 7440 aaacttactc aatggctggt tatgcatatc gcaatacatg cgaaaaacct aaaagagctt 7500 gccgataaaa aaggccaatt tattgctatt taccgcggct ttttattgag cttgaaagat 7560 aaataaaata gataggtttt atttgaagct aaatcttctt tatcgtaaaa aatgccctct 7620 tgggttatca agagggtcat tatatttcgc ggaataacat catttggtga cgaaataact 7680 aagcacttgt ctcctgttta ctcccctgag cttgaggggt taacatgaag gtcatcgata 7740 gcaggataat aatacagtaa aacgctaaac caataatcca aatccagcca tcccaaattg 7800 gtagtgaatg attataaata acagcaaaca gtaatgggcc aataacaccg gttgcattgg 7860 taaggctcac caataatccc tgtaaagcac cttgctgatg actctttgtt tggatagaca 7920 tcactccctg taatgcaggt aaagcgatcc caccaccagc caataaaatt aaaacaggga 7980 aaactaacca accttcagat ataaacgcta aaaaggcaaa tgcactacta tctgcaataa 8040 atccgagcag tactgccgtt ttttcgcccc atttagtggc tattcttcct gccacaaagg 8100 cttggaatac tgagtgtaaa agaccaagac ccgctaatga aaagccaacc atcatgctat 8160 tccatccaaa acgattttcg gtaaatagca cccacaccgt tgcgggaatt tggcctatca 8220 attgcgctga aaaataaata atcaacaaaa tggcatcgtt ttaaataaag tgatgtatac 8280 cgaattcagc ttttgttccc tttagtgagg gttaattgcg cgcttggcgt aatcatggtc 8340 atagctgttt cctgtgtgaa attgttatcc gctcacaatt ccacacaaca tacgagccgg 8400 aagcataaag tgtaaagcct ggggtgccta atgagtgagc taactcacat taattgcgtt 8460 gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg 8520 ccaacgcgcg gggagaggcg gtttgcgtat tgggcgctct tccgcttcct cgctcactga 8580 ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat 8640 acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca 8700 aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc 8760 tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata 8820 aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc 8880 gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc 8940 acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga 9000 accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc 9060 ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag 9120 gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag 9180 gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag 9240 ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca 9300 gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga 9360 cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat 9420 cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga 9480 gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg 9540 tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga 9600 gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc 9660 agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac 9720 tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc 9780 agttaatagt ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc 9840 gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc 9900 catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt 9960 ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc 10020 atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg 10080 tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag 10140 cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat 10200 cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc 10260 atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa 10320 aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta 10380 ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa 10440 aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccac 10487 45 10297 DNA Artificial Sequence Synthetic 45 ctgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 60 ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 120 ccacgttcgc cggcatcaga ttggctattg gccattgcat acgttgtatc catatcataa 180 tatgtacatt tatattggct catgtccaac attaccgcca tgttgacatt gattattgac 240 tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg 300 cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt 360 gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca 420 atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc 480 aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta 540 catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac 600 catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg 660 atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg 720 ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt 780 acggtgggag gtctatataa gcagagctcg tttagtgaac cgtcagatcg cctggagacg 840 ccatccacgc tgttttgacc tccatagaag acaccgggac cgatccagcc tccgcggccg 900 ggaacggtgc attggaacgc ggattccccg tgccaagagt gacgtaagta ccgcctatag 960 actctatagg cacacccctt tggctcttat gcatgctata ctgtttttgg cttggggcct 1020 atacaccccc gcttccttat gctataggtg atggtatagc ttagcctata ggtgtgggtt 1080 attgaccatt attgaccact cccctattgg tgacgatact ttccattact aatccataac 1140 atggctcttt gccacaacta tctctattgg ctatatgcca atactctgtc cttcagagac 1200 tgacacggac tctgtatttt tacaggatgg ggtcccattt attatttaca aattcacata 1260 tacaacaacg ccgtcccccg tgcccgcagt ttttattaaa catagcgtgg gatctccacg 1320 cgaatctcgg gtacgtgttc cggacatggg ctcttctccg gtagcggcgg agcttccaca 1380 tccgagccct ggtcccatgc ctccagcggc tcatggtcgc tcggcagctc cttgctccta 1440 acagtggagg ccagacttag gcacagcaca atgcccacca ccaccagtgt gccgcacaag 1500 gccgtggcgg tagggtatgt gtctgaaaat gagcgtggag attgggctcg cacggctgac 1560 gcagatggaa gacttaaggc agcggcagaa gaagatgcag gcagctgagt tgttgtattc 1620 tgataagagt cagaggtaac tcccgttgcg gtgctgttaa cggtggaggg cagtgtagtc 1680 tgagcagtac tcgttgctgc cgcgcgcgcc accagacata atagctgaca gactaacaga 1740 ctgttccttt ccatgggtct tttctgcagt caccgtcgga ccatgtgtga acttgatatt 1800 ttacatgatt ctctttacca attctgcccc gaattacact taaaacgact caacagctta 1860 acgttggctt gccacgcatt acttgactgt aaaactctca ctcttaccga acttggccgt 1920 aacctgccaa ccaaagcgag aacaaaacat aacatcaaac gaatcgaccg attgttaggt 1980 aatcgtcacc tccacaaaga gcgactcgct gtataccgtt ggcatgctag ctttatctgt 2040 tcgggaatac gatgcccatt gtacttgttg actggtctga tattcgtgag caaaaacgac 2100 ttatggtatt gcgagcttca gtcgcactac acggtcgttc tgttactctt tatgagaaag 2160 cgttcccgct ttcagagcaa tgttcaaaga aagctcatga ccaatttcta gccgaccttg 2220 cgagcattct accgagtaac accacaccgc tcattgtcag tgatgctggc tttaaagtgc 2280 catggtataa atccgttgag aagctgggtt ggtactggtt aagtcgagta agaggaaaag 2340 tacaatatgc agacctagga gcggaaaact ggaaacctat cagcaactta catgatatgt 2400 catctagtca ctcaaagact ttaggctata agaggctgac taaaagcaat ccaatctcat 2460 gccaaattct attgtataaa tctcgctcta aaggccgaaa aaatcagcgc tcgacacgga 2520 ctcattgtca ccacccgtca cctaaaatct actcagcgtc ggcaaaggag ccatgggttc 2580 tagcaactaa cttacctgtt gaaattcgaa cacccaaaca acttgttaat atctattcga 2640 agcgaatgca gattgaagaa accttccgag acttgaaaag tcctgcctac ggactaggcc 2700 tacgccatag ccgaacgagc agctcagagc gttttgatat catgctgcta atcgccctga 2760 tgcttcaact aacatgttgg cttgcgggcg ttcatgctca gaaacaaggt tgggacaagc 2820 acttccaggc taacacagtc agaaatcgaa acgtactctc aacagttcgc ttaggcatgg 2880 aagttttgcg gcattctggc tacacaataa caagggaaga cttactcgtg gctgcaaccc 2940 tactagctca aaatttattc acacatggtt acgctttggg gaaattatga taatgatcca 3000 gatcacttct ggctaataaa agatcagagc tctagagatc tgtgtgttgg ttttttgtgg 3060 atctgctgtg ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt 3120 gaccctggaa ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca 3180 ttgtctgagt aggtgtcatt ctattctggg gggtggggtg gggcagcaca gcaaggggga 3240 ggattgggaa gacaatagca ggcatgctgg ggatgcggtg ggctctatgg gtacctctct 3300 ctctctctct ctctctctct ctctctctct ctctcggtac ctctctctct ctctctctct 3360 ctctctctct ctctctctct cggtaccagg tgctgaagaa ttgacccggt gaccaaaggt 3420 gccttttatc atcactttaa aaataaaaaa caattactca gtgcctgtta taagcagcaa 3480 ttaattatga ttgatgccta catcacaaca aaaactgatt taacaaatgg ttggtctgcc 3540 ttagaaagta tatttgaaca ttatcttgat tatattattg ataataataa aaaccttatc 3600 cctatccaag aagtgatgcc tatcattggt tggaatgaac ttgaaaaaaa ttagccttga 3660 atacattact ggtaaggtaa acgccattgt cagcaaattg atccaagaga accaacttaa 3720 agctttcctg acggaatgtt aattctcgtt gaccctgagc actgatgaat cccctaatga 3780 ttttggtaaa aatcattaag ttaaggtgga tacacatctt gtcatatgat cccggtaatg 3840 tgagttagct cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt 3900 tgtgtggaat tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg 3960 ccaagcgcgc aattaaccct cactaaaggg aacaaaagct ggagctccac cgcggtggcg 4020 gccgctctag aactagtgga tcccccgggg aggtcagaat ggtttcttta ctgtttgtca 4080 attctattat ttcaatacag aacaatagct tctataactg aaatatattt gctattgtat 4140 attatgattg tccctcgaac catgaacact cctccagctg aatttcacaa ttcctctgtc 4200 atctgccagg ccattaagtt attcatggaa gatctttgag gaacactgca agttcatatc 4260 ataaacacat ttgaaattga gtattgtttt gcattgtatg gagctatgtt ttgctgtatc 4320 ctcagaaaaa aagtttgtta taaagcattc acacccataa aaagatagat ttaaatattc 4380 cagctatagg aaagaaagtg cgtctgctct tcactctagt ctcagttggc tccttcacat 4440 gcatgcttct ttatttctcc tattttgtca agaaaataat aggtcacgtc ttgttctcac 4500 ttatgtcctg cctagcatgg ctcagatgca cgttgtagat acaagaagga tcaaatgaaa 4560 cagacttctg gtctgttact acaaccatag taataagcac actaactaat aattgctaat 4620 tatgttttcc atctctaagg ttcccacatt tttctgtttt cttaaagatc ccattatctg 4680 gttgtaactg aagctcaatg gaacatgagc aatatttccc agtcttctct cccatccaac 4740 agtcctgatg gattagcaga acaggcagaa aacacattgt tacccagaat taaaaactaa 4800 tatttgctct ccattcaatc caaaatggac ctattgaaac taaaatctaa cccaatccca 4860 ttaaatgatt tctatggcgt caaaggtcaa acttctgaag ggaacctgtg ggtgggtcac 4920 aattcaggct atatattccc cagggctcag cggatccatg ggctccatcg gcgcagcaag 4980 catggaattt tgttttgatg tattcaagga gctcaaagtc caccatgcca atgagaacat 5040 cttctactgc cccattgcca tcatgtcagc tctagccatg gtatacctgg gtgcaaaaga 5100 cagcaccagg acacagataa ataaggttgt tcgctttgat aaacttccag gattcggaga 5160 cagtattgaa gctcagtgtg gcacatctgt aaacgttcac tcttcactta gagacatcct 5220 caaccaaatc accaaaccaa atgatgttta ttcgttcagc cttgccagta gactttatgc 5280 tgaagagaga tacccaatcc tgccagaata cttgcagtgt gtgaaggaac tgtatagagg 5340 aggcttggaa cctatcaact ttcaaacagc tgcagatcaa gccagagagc tcatcaattc 5400 ctgggtagaa agtcagacaa atggaattat cagaaatgtc cttcagccaa gctccgtgga 5460 ttctcaaact gcaatggttc tggttaatgc cattgtcttc aaaggactgt gggagaaaac 5520 atttaaggat gaagacacac aagcaatgcc tttcagagtg actgagcaag aaagcaaacc 5580 tgtgcagatg atgtaccaga ttggtttatt tagagtggca tcaatggctt ctgagaaaat 5640 gaagatcctg gagcttccat ttgccagtgg gacaatgagc atgttggtgc tgttgcctga 5700 tgaagtctca ggccttgagc agcttgagag tataatcaac tttgaaaaac tgactgaatg 5760 gaccagttct aatgttatgg aagagaggaa gatcaaagtg tacttacctc gcatgaagat 5820 ggaggaaaaa tacaacctca catctgtctt aatggctatg ggcattactg acgtgtttag 5880 ctcttcagcc aatctgtctg gcatctcctc agcagagagc ctgaagatat ctcaagctgt 5940 ccatgcagca catgcagaaa tcaatgaagc aggcagagag gtggtagggt cagcagaggc 6000 tggagtggat gctgcaagcg tctctgaaga atttagggct gaccatccat tcctcttctg 6060 tatcaagcac atcgcaacca acgccgttct cttctttggc agatgtgttt cccctccgcg 6120 gccagcagat gacgcaccag cagatgacgc accagcagat gacgcaccag cagatgacgc 6180 accagcagat gacgcaccag cagatgacgc aacaacatgt atcctgaaag gctcttgtgg 6240 ctggatcggc ctgctggatg acgatgacaa aaaatacaaa aaagcactga aaaaactggc 6300 aaaactgctg taatgagggc gcctggatcc agatcacttc tggctaataa aagatcagag 6360 ctctagagat ctgtgtgttg gttttttgtg gatctgctgt gccttctagt tgccagccat 6420 ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact cccactgtcc 6480 tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat tctattctgg 6540 ggggtggggt ggggcagcac agcaaggggg aggattggga agacaatagc aggcatgctg 6600 gggatgcggt gggctctatg ggtacctctc tctctctctc tctctctctc tctctctctc 6660 tctctcggta cctctctcga gggggggccc ggtacccaat tcgccctata gtgagtcgta 6720 ttacgcgcgc tcactggccg tcgttttaca acgtcgtgac tgggaaaacc ctggcgttac 6780 ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata gcgaagaggc 6840 ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatgga aattgtaagc 6900 gttaatattt tgttaaaatt cgcgttaaat ttttgttaaa tcagctcatt ttttaaccaa 6960 taggccgaaa tcggcaaaat cccttataaa tcaaaagaat agaccgagat agggttgagt 7020 gttgttccag tttggaacaa gagtccacta ttaaagaacg tggactccaa cgtcaaaggg 7080 cgaaaaaccg tctatcaggg cgatggccca ctactccggg atcatatgac aagatgtgta 7140 tccaccttaa cttaatgatt tttaccaaaa tcattagggg attcatcagt gctcagggtc 7200 aacgagaatt aacattccgt caggaaagct tatgatgatg atgtgcttaa aaacttactc 7260 aatggctggt tatgcatatc gcaatacatg cgaaaaacct aaaagagctt gccgataaaa 7320 aaggccaatt tattgctatt taccgcggct ttttattgag cttgaaagat aaataaaata 7380 gataggtttt atttgaagct aaatcttctt tatcgtaaaa aatgccctct tgggttatca 7440 agagggtcat tatatttcgc ggaataacat catttggtga cgaaataact aagcacttgt 7500 ctcctgttta ctcccctgag cttgaggggt taacatgaag gtcatcgata gcaggataat 7560 aatacagtaa aacgctaaac caataatcca aatccagcca tcccaaattg gtagtgaatg 7620 attataaata acagcaaaca gtaatgggcc aataacaccg gttgcattgg taaggctcac 7680 caataatccc tgtaaagcac cttgctgatg actctttgtt tggatagaca tcactccctg 7740 taatgcaggt aaagcgatcc caccaccagc caataaaatt aaaacaggga aaactaacca 7800 accttcagat ataaacgcta aaaaggcaaa tgcactacta tctgcaataa atccgagcag 7860 tactgccgtt ttttcgcccc atttagtggc tattcttcct gccacaaagg cttggaatac 7920 tgagtgtaaa agaccaagac ccgctaatga aaagccaacc atcatgctat tccatccaaa 7980 acgattttcg gtaaatagca cccacaccgt tgcgggaatt tggcctatca attgcgctga 8040 aaaataaata atcaacaaaa tggcatcgtt ttaaataaag tgatgtatac cgaattcagc 8100 ttttgttccc tttagtgagg gttaattgcg cgcttggcgt aatcatggtc atagctgttt 8160 cctgtgtgaa attgttatcc gctcacaatt ccacacaaca tacgagccgg aagcataaag 8220 tgtaaagcct ggggtgccta atgagtgagc taactcacat taattgcgtt gcgctcactg 8280 cccgctttcc agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg 8340 gggagaggcg gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc 8400 tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc 8460 acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg 8520 aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat 8580 cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag 8640 gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga 8700 tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg 8760 tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt 8820 cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac 8880 gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc 8940 ggtgctacag agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt 9000 ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc 9060 ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc 9120 agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg 9180 aacgaaaact cacgttaagg gattttggtc atgagattat caaaaaggat cttcacctag 9240 atccttttaa attaaaaatg aagttttaaa tcaatctaaa gtatatatga gtaaacttgg 9300 tctgacagtt accaatgctt aatcagtgag gcacctatct cagcgatctg tctatttcgt 9360 tcatccatag ttgcctgact ccccgtcgtg tagataacta cgatacggga gggcttacca 9420 tctggcccca gtgctgcaat gataccgcga gacccacgct caccggctcc agatttatca 9480 gcaataaacc agccagccgg aagggccgag cgcagaagtg gtcctgcaac tttatccgcc 9540 tccatccagt ctattaattg ttgccgggaa gctagagtaa gtagttcgcc agttaatagt 9600 ttgcgcaacg ttgttgccat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg 9660 gcttcattca gctccggttc ccaacgatca aggcgagtta catgatcccc catgttgtgc 9720 aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg 9780 ttatcactca tggttatggc agcactgcat aattctctta ctgtcatgcc atccgtaaga 9840 tgcttttctg tgactggtga gtactcaacc aagtcattct gagaatagtg tatgcggcga 9900 ccgagttgct cttgcccggc gtcaatacgg gataataccg cgccacatag cagaacttta 9960 aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg 10020 ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact 10080 ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata 10140 agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt 10200 tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa 10260 ataggggttc cgcgcacatt tccccgaaaa gtgccac 10297 46 10272 DNA Artificial Sequence Synthetic 46 ctgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 60 ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 120 ccacgttcgc cggcatcaga ttggctattg gccattgcat acgttgtatc catatcataa 180 tatgtacatt tatattggct catgtccaac attaccgcca tgttgacatt gattattgac 240 tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg 300 cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt 360 gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca 420 atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc 480 aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta 540 catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac 600 catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg 660 atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg 720 ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt 780 acggtgggag gtctatataa gcagagctcg tttagtgaac cgtcagatcg cctggagacg 840 ccatccacgc tgttttgacc tccatagaag acaccgggac cgatccagcc tccgcggccg 900 ggaacggtgc attggaacgc ggattccccg tgccaagagt gacgtaagta ccgcctatag 960 actctatagg cacacccctt tggctcttat gcatgctata ctgtttttgg cttggggcct 1020 atacaccccc gcttccttat gctataggtg atggtatagc ttagcctata ggtgtgggtt 1080 attgaccatt attgaccact cccctattgg tgacgatact ttccattact aatccataac 1140 atggctcttt gccacaacta tctctattgg ctatatgcca atactctgtc cttcagagac 1200 tgacacggac tctgtatttt tacaggatgg ggtcccattt attatttaca aattcacata 1260 tacaacaacg ccgtcccccg tgcccgcagt ttttattaaa catagcgtgg gatctccacg 1320 cgaatctcgg gtacgtgttc cggacatggg ctcttctccg gtagcggcgg agcttccaca 1380 tccgagccct ggtcccatgc ctccagcggc tcatggtcgc tcggcagctc cttgctccta 1440 acagtggagg ccagacttag gcacagcaca atgcccacca ccaccagtgt gccgcacaag 1500 gccgtggcgg tagggtatgt gtctgaaaat gagcgtggag attgggctcg cacggctgac 1560 gcagatggaa gacttaaggc agcggcagaa gaagatgcag gcagctgagt tgttgtattc 1620 tgataagagt cagaggtaac tcccgttgcg gtgctgttaa cggtggaggg cagtgtagtc 1680 tgagcagtac tcgttgctgc cgcgcgcgcc accagacata atagctgaca gactaacaga 1740 ctgttccttt ccatgggtct tttctgcagt caccgtcgga ccatgtgtga acttgatatt 1800 ttacatgatt ctctttacca attctgcccc gaattacact taaaacgact caacagctta 1860 acgttggctt gccacgcatt acttgactgt aaaactctca ctcttaccga acttggccgt 1920 aacctgccaa ccaaagcgag aacaaaacat aacatcaaac gaatcgaccg attgttaggt 1980 aatcgtcacc tccacaaaga gcgactcgct gtataccgtt ggcatgctag ctttatctgt 2040 tcgggaatac gatgcccatt gtacttgttg actggtctga tattcgtgag caaaaacgac 2100 ttatggtatt gcgagcttca gtcgcactac acggtcgttc tgttactctt tatgagaaag 2160 cgttcccgct ttcagagcaa tgttcaaaga aagctcatga ccaatttcta gccgaccttg 2220 cgagcattct accgagtaac accacaccgc tcattgtcag tgatgctggc tttaaagtgc 2280 catggtataa atccgttgag aagctgggtt ggtactggtt aagtcgagta agaggaaaag 2340 tacaatatgc agacctagga gcggaaaact ggaaacctat cagcaactta catgatatgt 2400 catctagtca ctcaaagact ttaggctata agaggctgac taaaagcaat ccaatctcat 2460 gccaaattct attgtataaa tctcgctcta aaggccgaaa aaatcagcgc tcgacacgga 2520 ctcattgtca ccacccgtca cctaaaatct actcagcgtc ggcaaaggag ccatgggttc 2580 tagcaactaa cttacctgtt gaaattcgaa cacccaaaca acttgttaat atctattcga 2640 agcgaatgca gattgaagaa accttccgag acttgaaaag tcctgcctac ggactaggcc 2700 tacgccatag ccgaacgagc agctcagagc gttttgatat catgctgcta atcgccctga 2760 tgcttcaact aacatgttgg cttgcgggcg ttcatgctca gaaacaaggt tgggacaagc 2820 acttccaggc taacacagtc agaaatcgaa acgtactctc aacagttcgc ttaggcatgg 2880 aagttttgcg gcattctggc tacacaataa caagggaaga cttactcgtg gctgcaaccc 2940 tactagctca aaatttattc acacatggtt acgctttggg gaaattatga taatgatcca 3000 gatcacttct ggctaataaa agatcagagc tctagagatc tgtgtgttgg ttttttgtgg 3060 atctgctgtg ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt 3120 gaccctggaa ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca 3180 ttgtctgagt aggtgtcatt ctattctggg gggtggggtg gggcagcaca gcaaggggga 3240 ggattgggaa gacaatagca ggcatgctgg ggatgcggtg ggctctatgg gtacctctct 3300 ctctctctct ctctctctct ctctctctct ctctcggtac ctctctctct ctctctctct 3360 ctctctctct ctctctctct cggtaccagg tgctgaagaa ttgacccggt gaccaaaggt 3420 gccttttatc atcactttaa aaataaaaaa caattactca gtgcctgtta taagcagcaa 3480 ttaattatga ttgatgccta catcacaaca aaaactgatt taacaaatgg ttggtctgcc 3540 ttagaaagta tatttgaaca ttatcttgat tatattattg ataataataa aaaccttatc 3600 cctatccaag aagtgatgcc tatcattggt tggaatgaac ttgaaaaaaa ttagccttga 3660 atacattact ggtaaggtaa acgccattgt cagcaaattg atccaagaga accaacttaa 3720 agctttcctg acggaatgtt aattctcgtt gaccctgagc actgatgaat cccctaatga 3780 ttttggtaaa aatcattaag ttaaggtgga tacacatctt gtcatatgat cccggtaatg 3840 tgagttagct cactcattag gcaccccagg ctttacactt tatgcttccg gctcgtatgt 3900 tgtgtggaat tgtgagcgga taacaatttc acacaggaaa cagctatgac catgattacg 3960 ccaagcgcgc aattaaccct cactaaaggg aacaaaagct ggagctccac cgcggtggcg 4020 gccgctctag aactagtgga tcccccgggg aggtcagaat ggtttcttta ctgtttgtca 4080 attctattat ttcaatacag aacaaaagct tctataactg aaatatattt gctattgtat 4140 attatgattg tccctcgaac catgaacact cctccagctg aatttcacaa ttcctctgtc 4200 atctgccagg ctggaagatc atggaagatc tctgaggaac attgcaagtt cataccataa 4260 actcatttgg aattgagtat tattttgctt tgaatggagc tatgttttgc agttccctca 4320 gaagaaaagc ttgttataaa gcgtctacac ccatcaaaag atatatttaa atattccaac 4380 tacagaaaga ttttgtctgc tcttcactct gatctcagtt ggtttcttca cgtacatgct 4440 tctttatttg cctattttgt caagaaaata ataggtcaag tcctgttctc acttatctcc 4500 tgcctagcat ggcttagatg cacgttgtac attcaagaag gatcaaatga aacagacttc 4560 tggtctgtta caacaaccat agtaataaac agactaacta ataattgcta attatgtttt 4620 ccatctctaa ggttcccaca tttttctgtt ttaagatccc attatctggt tgtaactgaa 4680 gctcaatgga acatgaacag tatttctcag tcttttctcc agcaatcctg acggattaga 4740 agaactggca gaaaacactt tgttacccag aattaaaaac taatatttgc tctcccttca 4800 atccaaaatg gacctattga aactaaaatc tgacccaatc ccattaaatt atttctatgg 4860 cgtcaaaggt caaacttttg aagggaacct gtgggtgggt cccaattcag gctatatatt 4920 ccccagggct cagccagtgg atccatgggc tccatcggtg cagcaagcat ggaattttgt 4980 tttgatgtat tcaaggagct caaagtccac catgccaatg acaacatgct ctactccccc 5040 tttgccatct tgtcaactct ggccatggtc ttcctaggtg caaaagacag caccaggacc 5100 cagataaata aggttgttca ctttgataaa cttccaggat tcggagacag tattgaagct 5160 cagtgtggca catctgtaaa tgttcactct tcacttagag acatactcaa ccaaatcacc 5220 aaacaaaatg atgcttattc gttcagcctt gccagtagac tttatgctca agagacatac 5280 acagtcgtgc cggaatactt gcaatgtgtg aaggaactgt atagaggagg cttagaatcc 5340 gtcaactttc aaacagctgc agatcaagcc agaggcctca tcaatgcctg ggtagaaagt 5400 cagacaaacg gaattatcag aaacatcctt cagccaagct ccgtggattc tcaaactgca 5460 atggtcctgg ttaatgccat tgccttcaag ggactgtggg agaaagcatt taaggctgaa 5520 gacacgcaaa caataccttt cagagtgact gagcaagaaa gcaaacctgt gcagatgatg 5580 taccagattg gttcatttaa agtggcatca atggcttctg agaaaatgaa gatcctggag 5640 cttccatttg ccagtggaac aatgagcatg ttggtgctgt tgcctgatga tgtctcaggc 5700 cttgagcagc ttgagagtat aatcagcttt gaaaaactga ctgaatggac cagttctagt 5760 attatggaag agaggaaggt caaagtgtac ttacctcgca tgaagatgga ggagaaatac 5820 aacctcacat ctctcttaat ggctatggga attactgacc tgttcagctc ttcagccaat 5880 ctgtctggca tctcctcagt agggagcctg aagatatctc aagctgtcca tgcagcacat 5940 gcagaaatca atgaagcggg cagagatgtg gtaggctcag cagaggctgg agtggatgct 6000 actgaagaat ttagggctga ccatccattc ctcttctgtg tcaagcacat cgaaaccaac 6060 gccattctcc tctttggcag atgtgtttct ccgcggccag cagatgacgc accagcagat 6120 gacgcaccag cagatgacgc accagcagat gacgcaccag cagatgacgc accagcagat 6180 gacgcaacaa catgtatcct gaaaggctct tgtggctgga tcggcctgct ggatgacgat 6240 gacaaaaaat acaaaaaagc actgaaaaaa ctggcaaaac tgctgtaatg agggcgcctg 6300 gatccagatc acttctggct aataaaagat cagagctcta gagatctgtg tgttggtttt 6360 ttgtggatct gctgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc 6420 ttccttgacc ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc 6480 atcgcattgt ctgagtaggt gtcattctat tctggggggt ggggtggggc agcacagcaa 6540 gggggaggat tgggaagaca atagcaggca tgctggggat gcggtgggct ctatgggtac 6600 ctctctctct ctctctctct ctctctctct ctctctctct cggtacctct ctcgaggggg 6660 ggcccggtac ccaattcgcc ctatagtgag tcgtattacg cgcgctcact ggccgtcgtt 6720 ttacaacgtc gtgactggga aaaccctggc gttacccaac ttaatcgcct tgcagcacat 6780 ccccctttcg ccagctggcg taatagcgaa gaggcccgca ccgatcgccc ttcccaacag 6840 ttgcgcagcc tgaatggcga atggaaattg taagcgttaa tattttgtta aaattcgcgt 6900 taaatttttg ttaaatcagc tcatttttta accaataggc cgaaatcggc aaaatccctt 6960 ataaatcaaa agaatagacc gagatagggt tgagtgttgt tccagtttgg aacaagagtc 7020 cactattaaa gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg 7080 gcccactact ccgggatcat atgacaagat gtgtatccac cttaacttaa tgatttttac 7140 caaaatcatt aggggattca tcagtgctca gggtcaacga gaattaacat tccgtcagga 7200 aagcttatga tgatgatgtg cttaaaaact tactcaatgg ctggttatgc atatcgcaat 7260 acatgcgaaa aacctaaaag agcttgccga taaaaaaggc caatttattg ctatttaccg 7320 cggcttttta ttgagcttga aagataaata aaatagatag gttttatttg aagctaaatc 7380 ttctttatcg taaaaaatgc cctcttgggt tatcaagagg gtcattatat ttcgcggaat 7440 aacatcattt ggtgacgaaa taactaagca cttgtctcct gtttactccc ctgagcttga 7500 ggggttaaca tgaaggtcat cgatagcagg ataataatac agtaaaacgc taaaccaata 7560 atccaaatcc agccatccca aattggtagt gaatgattat aaataacagc aaacagtaat 7620 gggccaataa caccggttgc attggtaagg ctcaccaata atccctgtaa agcaccttgc 7680 tgatgactct ttgtttggat agacatcact ccctgtaatg caggtaaagc gatcccacca 7740 ccagccaata aaattaaaac agggaaaact aaccaacctt cagatataaa cgctaaaaag 7800 gcaaatgcac tactatctgc aataaatccg agcagtactg ccgttttttc gccccattta 7860 gtggctattc ttcctgccac aaaggcttgg aatactgagt gtaaaagacc aagacccgct 7920 aatgaaaagc caaccatcat gctattccat ccaaaacgat tttcggtaaa tagcacccac 7980 accgttgcgg gaatttggcc tatcaattgc gctgaaaaat aaataatcaa caaaatggca 8040 tcgttttaaa taaagtgatg tataccgaat tcagcttttg ttccctttag tgagggttaa 8100 ttgcgcgctt ggcgtaatca tggtcatagc tgtttcctgt gtgaaattgt tatccgctca 8160 caattccaca caacatacga gccggaagca taaagtgtaa agcctggggt gcctaatgag 8220 tgagctaact cacattaatt gcgttgcgct cactgcccgc tttccagtcg ggaaacctgt 8280 cgtgccagct gcattaatga atcggccaac gcgcggggag aggcggtttg cgtattgggc 8340 gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg 8400 tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa 8460 agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg 8520 cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga 8580 ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg 8640 tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg 8700 gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc 8760 gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg 8820 gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca 8880 ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt 8940 ggcctaacta cggctacact agaaggacag tatttggtat ctgcgctctg ctgaagccag 9000 ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg 9060 gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc 9120 ctttgatctt ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt 9180 tggtcatgag attatcaaaa aggatcttca cctagatcct tttaaattaa aaatgaagtt 9240 ttaaatcaat ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca 9300 gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc tgactccccg 9360 tcgtgtagat aactacgata cgggagggct taccatctgg ccccagtgct gcaatgatac 9420 cgcgagaccc acgctcaccg gctccagatt tatcagcaat aaaccagcca gccggaaggg 9480 ccgagcgcag aagtggtcct gcaactttat ccgcctccat ccagtctatt aattgttgcc 9540 gggaagctag agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta 9600 caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc ggttcccaac 9660 gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa agcggttagc tccttcggtc 9720 ctccgatcgt tgtcagaagt aagttggccg cagtgttatc actcatggtt atggcagcac 9780 tgcataattc tcttactgtc atgccatccg taagatgctt ttctgtgact ggtgagtact 9840 caaccaagtc attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa 9900 tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt ggaaaacgtt 9960 cttcggggcg aaaactctca aggatcttac cgctgttgag atccagttcg atgtaaccca 10020 ctcgtgcacc caactgatct tcagcatctt ttactttcac cagcgtttct gggtgagcaa 10080 aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc gacacggaaa tgttgaatac 10140 tcatactctt cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg 10200 gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc acatttcccc 10260 gaaaagtgcc ac 10272 47 10880 DNA Artificial Sequence Synthetic 47 ctgacgcgcc ctgtagcggc gcattaagcg cggcgggtgt ggtggttacg cgcagcgtga 60 ccgctacact tgccagcgcc ctagcgcccg ctcctttcgc tttcttccct tcctttctcg 120 ccacgttcgc cggcatcaga ttggctattg gccattgcat acgttgtatc catatcataa 180 tatgtacatt tatattggct catgtccaac attaccgcca tgttgacatt gattattgac 240 tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg 300 cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt 360 gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca 420 atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc 480 aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta 540 catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac 600 catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg 660 atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg 720 ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt 780 acggtgggag gtctatataa gcagagctcg tttagtgaac cgtcagatcg cctggagacg 840 ccatccacgc tgttttgacc tccatagaag acaccgggac cgatccagcc tccgcggccg 900 ggaacggtgc attggaacgc ggattccccg tgccaagagt gacgtaagta ccgcctatag 960 actctatagg cacacccctt tggctcttat gcatgctata ctgtttttgg cttggggcct 1020 atacaccccc gcttccttat gctataggtg atggtatagc ttagcctata ggtgtgggtt 1080 attgaccatt attgaccact cccctattgg tgacgatact ttccattact aatccataac 1140 atggctcttt gccacaacta tctctattgg ctatatgcca atactctgtc cttcagagac 1200 tgacacggac tctgtatttt tacaggatgg ggtcccattt attatttaca aattcacata 1260 tacaacaacg ccgtcccccg tgcccgcagt ttttattaaa catagcgtgg gatctccacg 1320 cgaatctcgg gtacgtgttc cggacatggg ctcttctccg gtagcggcgg agcttccaca 1380 tccgagccct ggtcccatgc ctccagcggc tcatggtcgc tcggcagctc cttgctccta 1440 acagtggagg ccagacttag gcacagcaca atgcccacca ccaccagtgt gccgcacaag 1500 gccgtggcgg tagggtatgt gtctgaaaat gagcgtggag attgggctcg cacggctgac 1560 gcagatggaa gacttaaggc agcggcagaa gaagatgcag gcagctgagt tgttgtattc 1620 tgataagagt cagaggtaac tcccgttgcg gtgctgttaa cggtggaggg cagtgtagtc 1680 tgagcagtac tcgttgctgc cgcgcgcgcc accagacata atagctgaca gactaacaga 1740 ctgttccttt ccatgggtct tttctgcagt caccgtcgga ccatgtgcga actcgatatt 1800 ttacacgact ctctttacca attctgcccc gaattacact taaaacgact caacagctta 1860 acgttggctt gccacgcatt acttgactgt aaaactctca ctcttaccga acttggccgt 1920 aacctgccaa ccaaagcgag aacaaaacat aacatcaaac gaatcgaccg attgttaggt 1980 aatcgtcacc tccacaaaga gcgactcgct gtataccgtt ggcatgctag ctttatctgt 2040 tcgggcaata cgatgcccat tgtacttgtt gactggtctg atattcgtga gcaaaaacga 2100 cttatggtat tgcgagcttc agtcgcacta cacggtcgtt ctgttactct ttatgagaaa 2160 gcgttcccgc tttcagagca atgttcaaag aaagctcatg accaatttct agccgacctt 2220 gcgagcattc taccgagtaa caccacaccg ctcattgtca gtgatgctgg ctttaaagtg 2280 ccatggtata aatccgttga gaagctgggt tggtactggt taagtcgagt aagaggaaaa 2340 gtacaatatg cagacctagg agcggaaaac tggaaaccta tcagcaactt acatgatatg 2400 tcatctagtc actcaaagac tttaggctat aagaggctga ctaaaagcaa tccaatctca 2460 tgccaaattc tattgtataa atctcgctct aaaggccgaa aaaatcagcg ctcgacacgg 2520 actcattgtc accacccgtc acctaaaatc tactcagcgt cggcaaagga gccatgggtt 2580 ctagcaacta acttacctgt tgaaattcga acacccaaac aacttgttaa tatctattcg 2640 aagcgaatgc agattgaaga aaccttccga gacttgaaaa gtcctgccta cggactaggc 2700 ctacgccata gccgaacgag cagctcagag cgttttgata tcatgctgct aatcgccctg 2760 atgcttcaac taacatgttg gcttgcgggc gttcatgctc agaaacaagg ttgggacaag 2820 cacttccagg ctaacacagt cagaaatcga aacgtactct caacagttcg cttaggcatg 2880 gaagttttgc ggcattctgg ctacacaata acaagggaag acttactcgt ggctgcaacc 2940 ctactagctc aaaatttatt cacacatggt tacgctttgg ggaaattatg aggggatcgc 3000 tctagagcga tccgggatct cgggaaaagc gttggtgacc aaaggtgcct tttatcatca 3060 ctttaaaaat aaaaaacaat tactcagtgc ctgttataag cagcaattaa ttatgattga 3120 tgcctacatc acaacaaaaa ctgatttaac aaatggttgg tctgccttag aaagtatatt 3180 tgaacattat cttgattata ttattgataa taataaaaac cttatcccta tccaagaagt 3240 gatgcctatc attggttgga atgaacttga aaaaaattag ccttgaatac attactggta 3300 aggtaaacgc cattgtcagc aaattgatcc aagagaacca acttaaagct ttcctgacgg 3360 aatgttaatt ctcgttgacc ctgagcactg atgaatcccc taatgatttt ggtaaaaatc 3420 attaagttaa ggtggataca catcttgtca tatgatcccg gtaatgtgag ttagctcact 3480 cattaggcac cccaggcttt acactttatg cttccggctc gtatgttgtg tggaattgtg 3540 agcggataac aatttcacac aggaaacagc tatgaccatg attacgccaa gcgcgcaatt 3600 aaccctcact aaagggaaca aaagctggag ctccaccgcg gtggcggccg ctctagaact 3660 agtggatccc ccgggcatca gattggctat tggccattgc atacgttgta tccatatcat 3720 aatatgtaca tttatattgg ctcatgtcca acattaccgc catgttgaca ttgattattg 3780 actagttatt aatagtaatc aattacgggg tcattagttc atagcccata tatggagttc 3840 cgcgttacat aacttacggt aaatggcccg cctggctgac cgcccaacga cccccgccca 3900 ttgacgtcaa taatgacgta tgttcccata gtaacgccaa tagggacttt ccattgacgt 3960 caatgggtgg agtatttacg gtaaactgcc cacttggcag tacatcaagt gtatcatatg 4020 ccaagtacgc cccctattga cgtcaatgac ggtaaatggc ccgcctggca ttatgcccag 4080 tacatgacct tatgggactt tcctacttgg cagtacatct acgtattagt catcgctatt 4140 accatggtga tgcggttttg gcagtacatc aatgggcgtg gatagcggtt tgactcacgg 4200 ggatttccaa gtctccaccc cattgacgtc aatgggagtt tgttttggca ccaaaatcaa 4260 cgggactttc caaaatgtcg taacaactcc gccccattga cgcaaatggg cggtaggcgt 4320 gtacggtggg aggtctatat aagcagagct cgtttagtga accgtcagat cgcctggaga 4380 cgccatccac gctgttttga cctccataga agacaccggg accgatccag cctccgcggc 4440 cgggaacggt gcattggaac gcggattccc cgtgccaaga gtgacgtaag taccgcctat 4500 agactctata ggcacacccc tttggctctt atgcatgcta tactgttttt ggcttggggc 4560 ctatacaccc ccgcttcctt atgctatagg tgatggtata gcttagccta taggtgtggg 4620 ttattgacca ttattgacca ctcccctatt ggtgacgata ctttccatta ctaatccata 4680 acatggctct ttgccacaac tatctctatt ggctatatgc caatactctg tccttcagag 4740 actgacacgg actctgtatt tttacaggat ggggtcccat ttattattta caaattcaca 4800 tatacaacaa cgccgtcccc cgtgcccgca gtttttatta aacatagcgt gggatctcca 4860 cgcgaatctc gggtacgtgt tccggacatg ggctcttctc cggtagcggc ggagcttcca 4920 catccgagcc ctggtcccat gcctccagcg gctcatggtc gctcggcagc tccttgctcc 4980 taacagtgga ggccagactt aggcacagca caatgcccac caccaccagt gtgccgcaca 5040 aggccgtggc ggtagggtat gtgtctgaaa atgagcgtgg agattgggct cgcacggctg 5100 acgcagatgg aagacttaag gcagcggcag aagaagatgc aggcagctga gttgttgtat 5160 tctgataaga gtcagaggta actcccgttg cggtgctgtt aacggtggag ggcagtgtag 5220 tctgagcagt actcgttgct gccgcgcgcg ccaccagaca taatagctga cagactaaca 5280 gactgttcct ttccatgggt cttttctgca gtcaccgtcg gatcaatggg ctccatcggt 5340 gcagcaagca tggaattttg ttttgatgta ttcaaggagc tcaaagtcca ccatgccaat 5400 gagaacatct tctactgccc cattgccatc atgtcagctc tagccatggt atacctgggt 5460 gcaaaagaca gcaccaggac acaaataaat aaggttgttc gctttgataa acttccagga 5520 ttcggagaca gtattgaagc tcagtgtggc acatctgtaa acgttcactc ttcacttaga 5580 gacatcctca accaaatcac caaaccaaat gatgtttatt cgttcagcct tgccagtaga 5640 ctttatgctg aagagagata cccaatcctg ccagaatact tgcagtgtgt gaaggaactg 5700 tatagaggag gcttggaacc tatcaacttt caaacagctg cagatcaagc cagagagctc 5760 atcaattcct gggtagaaag tcagacaaat ggaattatca gaaatgtcct tcagccaagc 5820 tccgtggatt ctcaaactgc aatggttctg gttaatgcca ttgtcttcaa aggactgtgg 5880 gagaaagcat ttaaggatga agacacacaa gcaatgcctt tcagagtgac tgagcaagaa 5940 agcaaacctg tgcagatgat gtaccagatt ggtttattta gagtggcatc aatggcttct 6000 gagaaaatga agatcctgga gcttccattt gccagtggga caatgagcat gttggtgctg 6060 ttgcctgatg aagtctcagg ccttgagcag cttgagagta taatcaactt tgaaaaactg 6120 actgaatgga ccagttctaa tgttatggaa gagagaagat caaagtgtac ttacctcgca 6180 tgaagatgga ggaaaaatac aacctcacat ctgtcttaat ggctatgggc attactgacg 6240 tgtttagctc ttcagccaat ctgtctggca tctcctcagc agagagcctg aagatatctc 6300 aagctgtcca tgcagcacat gcagaaatca atgaagcagg cagagaggtg gtagggtcag 6360 cagaggctgg agtggatgct gcaagcgtct ctgaagaatt tagggctgac catccattcc 6420 tcttctgtat caagcacatc gcaaccaacg ccgttctctt cttttggcag atgtgtttcc 6480 cgcggccagc agatgacgca ccagcagatg acgcaccagc agatgacgca ccagcagatg 6540 acgcaccagc agatgacgca acaacatgta tcctgaaagg ctcttgtggc tggatcggcc 6600 tgctggatga cgatgacaaa tttgtgaacc aacacctgtg cggctcacac ctggtggaag 6660 ctctctacct agtgtgcggg gaacgaggct tcttctacac acccaagacc cgccgggagg 6720 cagaggacct gcaggtgggg caggtggagc tgggcggggg ccctggtgca ggcagcctgc 6780 agcccttggc cctggagggg tccctgcaga agcgtggcat tgtggaacaa tgctgtacca 6840 gcatctgctc cctctaccag ctggagaact actgcaacta gggcgcctaa agggcgaatt 6900 atcgcggccg ctctagacca ggcgcctgga tccagatcac ttctggctaa taaaagatca 6960 gagctctaga gatctgtgtg ttggtttttt gtggatctgc tgtgccttct agttgccagc 7020 catctgttgt ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc actcccactg 7080 tcctttccta ataaaatgag gaaattgcat cgcattgtct gagtaggtgt cattctattc 7140 tggggggtgg ggtggggcag cacagcaagg gggaggattg ggaagacaat agcaggcatg 7200 ctggggatgc ggtgggctct atgggtacct ctctctctct ctctctctct ctcactctct 7260 ctctctctcg gtacctctcc tcgagggggg gcccggtacc caattcgccc tatagtgagt 7320 cgtattacgc gcgctcactg gccgtcgttt tacaacgtcg tgactgggaa aaccctggcg 7380 ttacccaact taatcgcctt gcagcacatc cccctttcgc cagctggcgt aatagcgaag 7440 aggcccgcac cgatcgccct tcccaacagt tgcgcagcct gaatggcgaa tggaaattgt 7500 aagcgttaat attttgttaa aattcgcgtt aaatttttgt taaatcagct cattttttaa 7560 ccaataggcc gaaatcggca aaatccctta taaatcaaaa gaatagaccg agatagggtt 7620 gagtgttgtt ccagtttgga acaagagtcc actattaaag aacgtggact ccaacgtcaa 7680 agggcgaaaa accgtctatc agggcgatgg cccactactc cgggatcata tgacaagatg 7740 tgtatccacc ttaacttaat gatttttacc aaaatcatta ggggattcat cagtgctcag 7800 ggtcaacgag aattaacatt ccgtcaggaa agcttatgat gatgatgtgc ttaaaaactt 7860 actcaatggc tggttatgca tatcgcaata catgcgaaaa acctaaaaga gcttgccgat 7920 aaaaaaggcc aatttattgc tatttaccgc ggctttttat tgagcttgaa agataaataa 7980 aatagatagg ttttatttga agctaaatct tctttatcgt aaaaaatgcc ctcttgggtt 8040 atcaagaggg tcattatatt tcgcggaata acatcatttg gtgacgaaat aactaagcac 8100 ttgtctcctg tttactcccc tgagcttgag gggttaacat gaaggtcatc gatagcagga 8160 taataataca gtaaaacgct aaaccaataa tccaaatcca gccatcccaa attggtagtg 8220 aatgattata aataacagca aacagtaatg ggccaataac accggttgca ttggtaaggc 8280 tcaccaataa tccctgtaaa gcaccttgct gatgactctt tgtttggata gacatcactc 8340 cctgtaatgc aggtaaagcg atcccaccac cagccaataa aattaaaaca gggaaaacta 8400 accaaccttc agatataaac gctaaaaagg caaatgcact actatctgca ataaatccga 8460 gcagtactgc cgttttttcg cccatttagt ggctattctt cctgccacaa aggcttggaa 8520 tactgagtgt aaaagaccaa gacccgtaat gaaaagccaa ccatcatgct attcatcatc 8580 acgatttctg taatagcacc acaccgtgct ggattggcta tcaatgcgct gaaataataa 8640 tcaacaaatg gcatcgttaa ataagtgatg tataccgatc agcttttgtt ccctttagtg 8700 agggttaatt gcgcgcttgg cgtaatcatg gtcatagctg tttcctgtgt gaaattgtta 8760 tccgctcaca attccacaca acatacgagc cggaagcata aagtgtaaag cctggggtgc 8820 ctaatgagtg agctaactca cattaattgc gttgcgctca ctgcccgctt tccagtcggg 8880 aaacctgtcg tgccagctgc attaatgaat cggccaacgc gcggggagag gcggtttgcg 8940 tattgggcgc tcttccgctt cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg 9000 gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat caggggataa 9060 cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc 9120 gttgctggcg tttttccata ggctccgccc ccctgacgag catcacaaaa atcgacgctc 9180 aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc cccctggaag 9240 ctccctcgtg cgctctcctg ttccgaccct gccgcttacc ggatacctgt ccgcctttct 9300 cccttcggga agcgtggcgc tttctcatag ctcacgctgt aggtatctca gttcggtgta 9360 ggtcgttcgc tccaagctgg gctgtgtgca cgaacccccc gttcagcccg accgctgcgc 9420 cttatccggt aactatcgtc ttgagtccaa cccggtaaga cacgacttat cgccactggc 9480 agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta cagagttctt 9540 gaagtggtgg cctaactacg gctacactag aaggacagta tttggtatct gcgctctgct 9600 gaagccagtt accttcggaa aaagagttgg tagctcttga tccggcaaac aaaccaccgc 9660 tggtagcggt ggtttttttg tttgcaagca gcagattacg cgcagaaaaa aaggatctca 9720 agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa actcacgtta 9780 agggattttg gtcatgagat tatcaaaaag gatcttcacc tagatccttt taaattaaaa 9840 atgaagtttt aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg 9900 cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg 9960 actccccgtc gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc 10020 aatgataccg cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc 10080 cggaagggcc gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa 10140 ttgttgccgg gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc 10200 cattgctaca ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg 10260 ttcccaacga tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc 10320 cttcggtcct ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat 10380 ggcagcactg cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg 10440 tgagtactca accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc 10500 ggcgtcaata cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg 10560 aaaacgttct tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat 10620 gtaacccact cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg 10680 gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg 10740 ttgaatactc atactcttcc tttttcaata ttattgaagc atttatcagg gttattgtct 10800 catgagcgga tacatatttg aatgtattta gaaaaataaa caaatagggg ttccgcgcac 10860 atttccccga aaagtgccac 10880 48 78 DNA Artificial Sequence Synthetic 48 aatttctcaa ggatattttt cttcgtgttc gctttggttc tggctttgtc aacagtttcg 60 gctgcgccag agccgaaa 78 49 93 DNA Artificial Sequence Synthetic 49 aatttctcaa ggatattttt cttcgtgttc gctttggttc tggctttgtc aacagtttcg 60 gctgcgccag agccgaaatg gaaagtcttc aag 93 50 15 DNA Artificial Sequence Synthetic 50 gcgccagagc cgaaa 15 51 30 DNA Artificial Sequence Synthetic 51 gcgccagagc cgaaatggaa agtcttcaag 30 52 7 DNA Artificial Sequence Synthetic 52 accatgt 7
Claims (44)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/746,149 US20040172667A1 (en) | 2002-06-26 | 2003-12-24 | Administration of transposon-based vectors to reproductive organs |
US11/981,629 US8283518B2 (en) | 2002-06-26 | 2007-10-31 | Administration of transposon-based vectors to reproductive organs |
Applications Claiming Priority (9)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US39241502P | 2002-06-26 | 2002-06-26 | |
US44150203P | 2003-01-21 | 2003-01-21 | |
US44138103P | 2003-01-21 | 2003-01-21 | |
US44137703P | 2003-01-21 | 2003-01-21 | |
US44139203P | 2003-01-21 | 2003-01-21 | |
US44140503P | 2003-01-21 | 2003-01-21 | |
US44144703P | 2003-01-21 | 2003-01-21 | |
US10/609,019 US7527966B2 (en) | 2002-06-26 | 2003-06-26 | Gene regulation in transgenic animals using a transposon-based vector |
US10/746,149 US20040172667A1 (en) | 2002-06-26 | 2003-12-24 | Administration of transposon-based vectors to reproductive organs |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/609,019 Continuation-In-Part US7527966B2 (en) | 2002-06-26 | 2003-06-26 | Gene regulation in transgenic animals using a transposon-based vector |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/981,629 Continuation US8283518B2 (en) | 2002-06-26 | 2007-10-31 | Administration of transposon-based vectors to reproductive organs |
Publications (1)
Publication Number | Publication Date |
---|---|
US20040172667A1 true US20040172667A1 (en) | 2004-09-02 |
Family
ID=46123530
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/746,149 Abandoned US20040172667A1 (en) | 2002-06-26 | 2003-12-24 | Administration of transposon-based vectors to reproductive organs |
US11/981,629 Expired - Fee Related US8283518B2 (en) | 2002-06-26 | 2007-10-31 | Administration of transposon-based vectors to reproductive organs |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/981,629 Expired - Fee Related US8283518B2 (en) | 2002-06-26 | 2007-10-31 | Administration of transposon-based vectors to reproductive organs |
Country Status (1)
Country | Link |
---|---|
US (2) | US20040172667A1 (en) |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040197910A1 (en) * | 2002-06-26 | 2004-10-07 | Cooper Richard K. | Gene regulation in transgenic animals using a transposon-based vector |
US20040235011A1 (en) * | 2002-06-26 | 2004-11-25 | Cooper Richard K. | Production of multimeric proteins |
US20050273873A1 (en) * | 2003-03-07 | 2005-12-08 | Avigenics, Inc. | Genomic modification |
US20060123504A1 (en) * | 2004-12-07 | 2006-06-08 | Avigenics, Inc. | Methods of producing polyclonal antibodies |
US20090131272A1 (en) * | 2005-05-17 | 2009-05-21 | Temasek Life Sciences Laboratory Limited | Transposition of maize ac/ds elements in vertebrates |
US8071364B2 (en) | 2003-12-24 | 2011-12-06 | Transgenrx, Inc. | Gene therapy using transposon-based vectors |
US8283518B2 (en) | 2002-06-26 | 2012-10-09 | Transgenrx, Inc. | Administration of transposon-based vectors to reproductive organs |
US9150880B2 (en) | 2008-09-25 | 2015-10-06 | Proteovec Holding, L.L.C. | Vectors for production of antibodies |
US9150881B2 (en) | 2009-04-09 | 2015-10-06 | Proteovec Holding, L.L.C. | Production of proteins using transposon-based vectors |
US9157097B2 (en) | 2008-09-25 | 2015-10-13 | Proteovec Holding, L.L.C. | Vectors for production of growth hormone |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012051615A1 (en) | 2010-10-15 | 2012-04-19 | Transgenrx, Inc. | Novel vectors for production of glycosylated interferon |
AU2013204327B2 (en) | 2012-04-20 | 2016-09-01 | Aviagen | Cell transfection method |
US10287622B2 (en) | 2013-11-07 | 2019-05-14 | Agilent Technologies, Inc. | Plurality of transposase adapters for DNA manipulations |
Citations (81)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4670388A (en) * | 1982-12-30 | 1987-06-02 | Carnegie Institution Of Washington | Method of incorporating DNA into genome of drosophila |
US4870009A (en) * | 1982-11-22 | 1989-09-26 | The Salk Institute For Biological Studies | Method of obtaining gene product through the generation of transgenic animals |
US4914025A (en) * | 1985-12-05 | 1990-04-03 | Colin Manoil | Export of intra-cellular substances |
US5102797A (en) * | 1989-05-26 | 1992-04-07 | Dna Plant Technology Corporation | Introduction of heterologous genes into bacteria using transposon flanked expression cassette and a binary vector system |
US5212080A (en) * | 1987-10-05 | 1993-05-18 | Washington University | Method of DNA sequencing using DNA transposon Tn5seql |
US5512483A (en) * | 1993-05-21 | 1996-04-30 | Mcgill University | Expression vectors responsive to steroid hormones |
US5556782A (en) * | 1993-06-30 | 1996-09-17 | Board Of Supervisors Of Louisiana State University And Agricultural & Mechanical College | Transformed mammalian cells capable of expressing cecropin b |
US5645991A (en) * | 1993-05-04 | 1997-07-08 | Univ. Of Connecticut | Transposon-containing DNA cloning vector and uses thereof |
US5719055A (en) * | 1993-06-30 | 1998-02-17 | Board Of Supervisors Of Louisiana State University And Agricultural And Mechanical College | Transposon-based transformation vectors |
US5733779A (en) * | 1992-11-13 | 1998-03-31 | Idec Pharmaceuticals Corporation | Impaired dominant selectable marker sequence and intronic insertion strategies for enhancement of expression of gene product and expression vector systems comprising same |
US5753502A (en) * | 1993-08-05 | 1998-05-19 | Icos Corporation | Neuron-specific ICAM-4 promoter |
US5861478A (en) * | 1987-07-06 | 1999-01-19 | Helix Biomedix, Inc. | Lytic peptides |
US5869296A (en) * | 1987-10-05 | 1999-02-09 | Washington University | DNA transposon Tn5seq1 |
US5925545A (en) * | 1996-09-09 | 1999-07-20 | Wisconsin Alumni Research Foundation | System for in vitro transposition |
US5958775A (en) * | 1997-07-25 | 1999-09-28 | Thomas Jefferson University | Composition and method for targeted integration into cells |
US6080912A (en) * | 1997-03-20 | 2000-06-27 | Wisconsin Alumni Research Foundation | Methods for creating transgenic animals |
US6107477A (en) * | 1996-09-26 | 2000-08-22 | Aurora Biosciences Corporation | Non-optimal Kozaks sequences |
US6171861B1 (en) * | 1995-06-07 | 2001-01-09 | Life Technologies, Inc. | Recombinational cloning using engineered recombination sites |
US6218185B1 (en) * | 1996-04-19 | 2001-04-17 | The United States Of America As Represented By The Secretary Of Agriculture | Piggybac transposon-based genetic transformation system for insects |
US6258571B1 (en) * | 1998-04-10 | 2001-07-10 | Genset | High throughput DNA sequencing vector |
US6261554B1 (en) * | 1995-07-25 | 2001-07-17 | Introgene B.V. | Compositions for targeted gene delivery |
US6291214B1 (en) * | 1998-05-11 | 2001-09-18 | Glaxo Wellcome Inc. | System for generating recombinant viruses |
US6291243B1 (en) * | 1999-04-28 | 2001-09-18 | The Board Of Trustees Of The Leland Stanford Jr. University | P element derived vector and methods for its use |
US20020007051A1 (en) * | 1999-12-10 | 2002-01-17 | David Cheo | Use of multiple recombination sites with unique specificity in recombinational cloning |
US20020013955A1 (en) * | 1998-06-10 | 2002-01-31 | Sharon Ogden | Production of recombinant protein in transgenic fish |
US20020016975A1 (en) * | 1997-03-11 | 2002-02-07 | Regents Of The University Of Minnesota | Dna-based transposon system for the introduction of nucleic acid into dna of a cell |
US20020028488A1 (en) * | 2000-06-19 | 2002-03-07 | Sujay Singh | Transgenic avian species for making human and chimeric antibodies |
US6358710B1 (en) * | 1996-06-07 | 2002-03-19 | Neorx Corporation | Humanized antibodies that bind to the antigen bound by antibody NR-LU-13 |
US6376743B1 (en) * | 1998-08-11 | 2002-04-23 | University Of Hawaii | Mammalian transgenesis by intracytoplasmic sperm injection |
US20020053092A1 (en) * | 1997-11-14 | 2002-05-02 | Readhead Carol W. | Nucleic acid constructs containing a cyclin A1 promoter, and kit |
US20020052047A1 (en) * | 2000-06-22 | 2002-05-02 | Akira Hasebe | Insertion sequence element derived from ralstonia solanacearum |
US20020055172A1 (en) * | 1999-10-07 | 2002-05-09 | Harrington John J. | Multiple promoter expression constructs and methods of use |
US20020072097A1 (en) * | 2000-07-07 | 2002-06-13 | Delcardayre Stephen | Molecular breeding of transposable elements |
US20020076797A1 (en) * | 1998-12-04 | 2002-06-20 | Haifan Lin | Purified and isolated piwi family genes and gene products and methods using same |
US20020083479A1 (en) * | 1997-11-14 | 2002-06-27 | Robert Winston | In vitro transfection, storage and transfer of male germ cells for generation of transgenic species |
US20020099015A1 (en) * | 2000-09-30 | 2002-07-25 | Barber Elizabeth K. | Gene expression control DNA element and associated protein |
US20020108132A1 (en) * | 2001-02-02 | 2002-08-08 | Avigenics Inc. | Production of a monoclonal antibody by a transgenic chicken |
US20020119573A1 (en) * | 2001-02-28 | 2002-08-29 | Shaw Karen J. | Footprinting plasmid |
US20020132349A1 (en) * | 2000-12-05 | 2002-09-19 | Goryshin Igor Yu | Double transposition methods for manipulating nucleic acids |
US6503729B1 (en) * | 1996-08-22 | 2003-01-07 | The Board Of Trustees Of The University Of Illinois | Selected polynucleotide and polypeptide sequences of the methanogenic archaeon, methanococcus jannashii |
US20030017534A1 (en) * | 2000-08-03 | 2003-01-23 | Roland Buelow | Production of humanized antibodies in transgenic animals |
US6515199B1 (en) * | 1992-01-27 | 2003-02-04 | North Carolina State University | Gene transfer in poultry by introduction of embryo cells in ovo |
US6514728B1 (en) * | 1998-11-09 | 2003-02-04 | Nippon Biocaptal Limited | Process for preparation of cytokines using Sendai virus expression system |
US6528699B1 (en) * | 1997-02-25 | 2003-03-04 | Genzyme Transgenics Corporation | Transgenically produced non-secreted proteins |
US20030056241A1 (en) * | 2001-06-07 | 2003-03-20 | Haruo Matsuda | Chicken leukemia inhibitory factor (LIF) and gene thereof |
US20030055017A1 (en) * | 1997-07-24 | 2003-03-20 | Baylor College Of Medicine And Genemedicine | Growth hormone releasing hormone expression system and methods of use, including use in animals |
US20030061629A1 (en) * | 2001-09-21 | 2003-03-27 | Pramod Sutrave | Production of transgenic birds using stage X primordial germ cells |
US20030074681A1 (en) * | 1996-06-12 | 2003-04-17 | Macarthur William C. | Vectors and methods for tissue specific synthesis of protein in eggs of transgenic hens |
US20030074680A1 (en) * | 1993-03-19 | 2003-04-17 | Johns Hopkins University School Of Medicine | Growth differentiation factor-8 |
US6563017B2 (en) * | 1996-07-08 | 2003-05-13 | Dnavec Research Inc. | In vivo electroporation method for early stage embryo of chickens |
US20030101472A1 (en) * | 2001-09-13 | 2003-05-29 | David Baltimore | Method for producing transgenic animals |
US20030115622A1 (en) * | 1997-08-04 | 2003-06-19 | Ponce De Leon F. Abel | Production of avian embryonic germ (eg) cell lines by prolonged culturing of pgc's, use thereof for cloning and chimerization |
US20030121062A1 (en) * | 2001-12-21 | 2003-06-26 | Oxford Biomedica (Uk) Limited | Transgenic organism |
US20030126628A1 (en) * | 2001-11-30 | 2003-07-03 | Harvey Alex J. | Ovomucoid promoter and methods of use |
US20030126629A1 (en) * | 2001-09-18 | 2003-07-03 | Rapp Jeffrey C. | Production of a transgenic avian by cytoplasmic injection |
US20030140363A1 (en) * | 2001-03-30 | 2003-07-24 | Rapp Jeffrey C. | Avian lysozyme promoter |
US20030143740A1 (en) * | 2001-10-15 | 2003-07-31 | Christine Wooddell | Processes for transposase mediated integration into mammalian cells |
US6602686B1 (en) * | 1997-09-26 | 2003-08-05 | Athersys, Inc. | Compositions and method for non-targeted activation of endogenous genes |
US20030150007A1 (en) * | 2000-03-21 | 2003-08-07 | Charalambos Savakis | Method of generating transgenic organisms using transposons |
US20030154502A1 (en) * | 1999-08-12 | 2003-08-14 | Wimmer Ernst A. | Universal markers of transgenesis |
US20040006776A1 (en) * | 1993-12-20 | 2004-01-08 | Genzyme Transgenics Corporation | Transgenic production of antibodies in milk |
US20040018624A1 (en) * | 2001-10-22 | 2004-01-29 | Athersys, Inc. | Compositions and methods for making mutations in cell lines and animals |
US20040019922A1 (en) * | 1997-10-16 | 2004-01-29 | Avigenics, Inc. | Exogenous proteins expressed in avians and their eggs |
US20040040052A1 (en) * | 2001-12-21 | 2004-02-26 | Oxford Biomedica (Uk) Limited | Transgenic organism |
US6716823B1 (en) * | 1997-08-13 | 2004-04-06 | The Uab Research Foundation | Noninvasive genetic immunization, expression products therefrom, and uses thereof |
US6730822B1 (en) * | 1997-10-16 | 2004-05-04 | Avigenics, Inc. | Vectors in avian transgenesis |
US6759573B2 (en) * | 1999-12-15 | 2004-07-06 | Regents Of The University Of Minnesota | Method to enhance agrobacterium-mediated transformation of plants |
US20040142475A1 (en) * | 2000-06-02 | 2004-07-22 | Barman Shikha P. | Delivery systems for bioactive agents |
US20050004030A1 (en) * | 2002-05-17 | 2005-01-06 | Fischetti Vincent A. | Phage-associated lytic enzymes for treatment of Bacillus anthracis and related conditions |
US6852510B2 (en) * | 2000-07-03 | 2005-02-08 | Gala Design Inc | Host cells containing multiple integrating vectors |
US7005296B1 (en) * | 1999-08-19 | 2006-02-28 | The United States Of America As Represented By The Secretary Of Agriculture | PiggyBac transformation system |
US20060046248A1 (en) * | 2004-08-25 | 2006-03-02 | Avigenics, Inc. | RNA interference in avians |
US7019193B2 (en) * | 1995-02-21 | 2006-03-28 | Gtc Biotherapeutics, Inc. | Treatments using transgenic goat produced antithrombin III |
US7034115B1 (en) * | 1999-12-03 | 2006-04-25 | Japan Science And Technology Corporation | Transposase and method of gene modification |
US20060123504A1 (en) * | 2004-12-07 | 2006-06-08 | Avigenics, Inc. | Methods of producing polyclonal antibodies |
US20060121509A1 (en) * | 2004-12-01 | 2006-06-08 | Schering Aktiengesellschaft | Generation of replication competent viruses for therapeutic use |
US7083980B2 (en) * | 2003-04-17 | 2006-08-01 | Wisconsin Alumni Research Foundation | Tn5 transposase mutants and the use thereof |
US7160682B2 (en) * | 1998-11-13 | 2007-01-09 | Regents Of The University Of Minnesota | Nucleic acid transfer vector for the introduction of nucleic acid into the DNA of a cell |
US20070009991A1 (en) * | 2002-03-14 | 2007-01-11 | Avigenics, Inc. | Gene expression in transgenic avians |
US20070022485A1 (en) * | 2003-07-08 | 2007-01-25 | Japan Science Techonology Agency | Method of preparing transgenic organism with use of methylation and system therefor |
US20070113299A1 (en) * | 2001-11-30 | 2007-05-17 | Avigenics, Inc. | Transgenic avians containing recombinant ovomucoid promoters |
Family Cites Families (64)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0832981A1 (en) | 1987-02-17 | 1998-04-01 | Pharming B.V. | DNA sequences to target proteins to the mammary gland for efficient secretion |
CA1327311C (en) | 1987-07-06 | 1994-03-01 | Jesse M. Jaynes | Therapeutic antimicrobial polypeptides, their use and methods for preparation |
JP2962555B2 (en) | 1987-07-06 | 1999-10-12 | ヘリックス バイオメディックス,インコーポレイテッド | Suppression of eukaryotic pathogens and neoplasms by lytic peptides and stimulation of fibroblasts and lymphocytes |
US5162215A (en) | 1988-09-22 | 1992-11-10 | Amgen Inc. | Method of gene transfer into chickens and other avian species |
US5703055A (en) | 1989-03-21 | 1997-12-30 | Wisconsin Alumni Research Foundation | Generation of antibodies through lipid mediated DNA delivery |
US6607884B1 (en) | 1993-03-19 | 2003-08-19 | The Johns Hopkins University School Of Medicine | Methods of detecting growth differentiation factor-8 |
US6156568A (en) | 1993-06-30 | 2000-12-05 | Board Of Supervisors Of Louisiana State University And Agricultural And Mechanical College | Transformed eukaryotic cells |
US5648244A (en) * | 1993-09-27 | 1997-07-15 | President And Fellows Of Harvard College | Production, purification, cleavage and use of fusion peptides |
US20030167492A1 (en) | 1994-07-08 | 2003-09-04 | Johns Hopkins University School Of Medicine | Transgenic non-human animals expressing a gdf-11 dominant negative polypeptide, and methods of making and using same |
WO1996001845A1 (en) | 1994-07-08 | 1996-01-25 | The Johns Hopkins University School Of Medicine | Growth differentiation factor-11 |
US5998698A (en) | 1995-06-07 | 1999-12-07 | Board Of Supervisors Of Louisiana State University And Agricultural And Mechanical College | Transgenic fish capable of expressing exogenous lytic peptides |
US5965443A (en) | 1996-09-09 | 1999-10-12 | Wisconsin Alumni Research Foundation | System for in vitro transposition |
EP1036183B1 (en) | 1997-02-20 | 2007-10-03 | The Johns Hopkins University School Of Medicine | Mutations in atp-dependent transposition proteins that reduce target-site specificity |
EP1006790A1 (en) | 1997-08-22 | 2000-06-14 | Biotechnology and Biological Sciences Research Council | Use of mariner transposon in the production of transgenic animals |
US6140129A (en) | 1997-09-17 | 2000-10-31 | Wisconsin Alumni Research Foundation | Chromosomal targeting in bacteria using FLP recombinase |
US7511120B2 (en) | 1997-10-16 | 2009-03-31 | Synageva Biopharma Corp. | Glycosylated G-CSF obtained from a transgenic chicken |
TW445295B (en) * | 1997-12-31 | 2001-07-11 | Shiu Li Wei | Expression vector pcDNA3.1-HC for human erythropoietin, BHK-21 host cell line transformed therewith, and production of human erythropoietin using the transformed cell |
US20030217375A1 (en) | 1998-08-31 | 2003-11-20 | Eyal Zcharia | Transgenic animals expressing heparanase and uses thereof |
US6159736A (en) | 1998-09-23 | 2000-12-12 | Wisconsin Alumni Research Foundation | Method for making insertional mutations using a Tn5 synaptic complex |
EP1375654A3 (en) | 1998-11-02 | 2008-01-16 | GTC Biotherapeutics, Inc. | Transgenic and cloned mammals |
US20020148000A1 (en) | 1998-12-04 | 2002-10-10 | Shen Che-Kun James | HS-40 enhancer-containing vector in transgenic animals |
US6217185B1 (en) * | 1999-03-08 | 2001-04-17 | International Business Machines Corporation | Efficient backlighting for a portable display |
AU2103601A (en) | 1999-12-17 | 2001-06-25 | Oregon Health And Science University | Methods for producing transgenic animals |
WO2001070949A1 (en) | 2000-03-17 | 2001-09-27 | Benitec Australia Ltd | Genetic silencing |
US6589783B2 (en) | 2000-04-13 | 2003-07-08 | Novagen, Inc. | Multiple host expression vector |
US7354755B2 (en) | 2000-05-01 | 2008-04-08 | Midwest Research Institute | Stable zymomonas mobilis xylose and arabinose fermenting strains |
US7105343B1 (en) | 2000-10-31 | 2006-09-12 | University Of Notre Dame Du Lac | Methods and compositions for transposition using minimal segments of the eukaryotic transformation vector Piggybac |
US7176300B2 (en) | 2001-03-30 | 2007-02-13 | Avigenics, Inc. | Avian lysozyme promoter |
US7972853B2 (en) | 2001-10-22 | 2011-07-05 | Abt Holding Company | Compositions and methods for making mutations in cell lines and animals |
AU2002365184A1 (en) | 2001-10-26 | 2003-07-30 | Id Biomedical Corporation Of Washington | Efficient protein expression system |
US7294507B2 (en) * | 2001-11-30 | 2007-11-13 | Avigenics, Inc. | Ovomucoid promoters and methods of use |
US20040210954A1 (en) | 2003-03-07 | 2004-10-21 | Alex Harvey | Integrase mediated avian transgenesis |
US7323618B2 (en) | 2002-02-01 | 2008-01-29 | Origen Therapeutics, Inc. | Tissue specific expression of exogenous proteins in transgenic chickens |
US7145057B2 (en) | 2002-02-01 | 2006-12-05 | Origen Therapeutics, Inc. | Chimeric bird from embryonic stem cells |
US7135562B2 (en) | 2002-03-14 | 2006-11-14 | University Of Cincinnati | Avian iFABP gene expression controlling region |
US20030182675A1 (en) | 2002-03-22 | 2003-09-25 | Origen Therapeutics | Functional disruption of avian immunoglobulin genes |
US20040172667A1 (en) | 2002-06-26 | 2004-09-02 | Cooper Richard K. | Administration of transposon-based vectors to reproductive organs |
WO2004003157A2 (en) | 2002-06-26 | 2004-01-08 | Transgenrx, Inc. | Gene regulation in transgenic animals using a transposon-based vector |
US7527966B2 (en) | 2002-06-26 | 2009-05-05 | Transgenrx, Inc. | Gene regulation in transgenic animals using a transposon-based vector |
US20040235011A1 (en) | 2002-06-26 | 2004-11-25 | Cooper Richard K. | Production of multimeric proteins |
ATE494370T1 (en) | 2002-07-24 | 2011-01-15 | Manoa Biosciences Inc | TRANSPOSON-BASED VECTORS AND METHOD FOR INTEGRATION OF NUCLEIC ACIDS |
US7700356B2 (en) | 2002-11-08 | 2010-04-20 | The United States Of America As Represented By The Secretary Of Agriculture | System for gene targeting and producing stable genomic transgene insertions |
GB0227645D0 (en) | 2002-11-27 | 2003-01-08 | Viragen Inc | Protein production in transgenic avians |
WO2004065581A2 (en) | 2003-01-15 | 2004-08-05 | Discovery Genomics, Inc. | Transposon-insulator element delivery systems |
JP2006517104A (en) | 2003-02-10 | 2006-07-20 | マックス−デルブルック−セントラム フユール モレクラーレ メディツィン(エムディーシー) | Targeting system based on transposon |
ATE536419T1 (en) | 2003-02-10 | 2011-12-15 | Max Delbrueck Centrum | TRANSPOSON BASED TARGETING SYSTEM |
US20050273873A1 (en) | 2003-03-07 | 2005-12-08 | Avigenics, Inc. | Genomic modification |
US20050034186A1 (en) * | 2003-03-07 | 2005-02-10 | Harvey Alex J. | Site specific nucleic acid integration |
US20050198700A1 (en) | 2003-03-07 | 2005-09-08 | Avigenics, Inc. | Genomic modification |
US20040255345A1 (en) | 2003-03-07 | 2004-12-16 | Rapp Jeffrey C. | Production of transgenic avians |
US7381712B2 (en) * | 2003-05-09 | 2008-06-03 | Avigenics, Inc. | In vivo transfection in avians |
WO2005040215A2 (en) | 2003-06-06 | 2005-05-06 | Avigenics, Inc. | Ovomucoid promoters and mehtods of use |
WO2005054463A1 (en) | 2003-11-21 | 2005-06-16 | Osaka Industrial Promotion Organization | Development of mammalian genome modification technique using retrotransposon |
US7569223B2 (en) | 2004-03-22 | 2009-08-04 | The Rockefeller University | Phage-associated lytic enzymes for treatment of Streptococcus pneumoniae and related conditions |
GB0419424D0 (en) | 2004-09-02 | 2004-10-06 | Viragen Scotland Ltd | Transgene optimisation |
WO2006053245A2 (en) | 2004-11-12 | 2006-05-18 | Iogenetics, Llc | Retroviral vectors with introns |
US20060141627A1 (en) | 2004-11-18 | 2006-06-29 | Stratatech Corporation | Vectors for stable gene expression |
WO2006055040A2 (en) | 2004-11-19 | 2006-05-26 | Government Of The United States Of America, Department Of Health And Human Services | Identification of proteins in a genome |
US20060153800A1 (en) | 2004-12-14 | 2006-07-13 | Roland Buelow | DNA immunization with recombinase/transposase |
WO2006093847A1 (en) | 2005-02-28 | 2006-09-08 | Avigenics, Inc. | Artificial chromosomes and transchromosomic avians |
US9150880B2 (en) * | 2008-09-25 | 2015-10-06 | Proteovec Holding, L.L.C. | Vectors for production of antibodies |
US9157097B2 (en) * | 2008-09-25 | 2015-10-13 | Proteovec Holding, L.L.C. | Vectors for production of growth hormone |
US20100081789A1 (en) * | 2008-09-25 | 2010-04-01 | Cooper Richard K | Novel Vectors for Production of Interferon |
US9150881B2 (en) | 2009-04-09 | 2015-10-06 | Proteovec Holding, L.L.C. | Production of proteins using transposon-based vectors |
-
2003
- 2003-12-24 US US10/746,149 patent/US20040172667A1/en not_active Abandoned
-
2007
- 2007-10-31 US US11/981,629 patent/US8283518B2/en not_active Expired - Fee Related
Patent Citations (99)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4870009A (en) * | 1982-11-22 | 1989-09-26 | The Salk Institute For Biological Studies | Method of obtaining gene product through the generation of transgenic animals |
US4670388A (en) * | 1982-12-30 | 1987-06-02 | Carnegie Institution Of Washington | Method of incorporating DNA into genome of drosophila |
US4914025A (en) * | 1985-12-05 | 1990-04-03 | Colin Manoil | Export of intra-cellular substances |
US5861478A (en) * | 1987-07-06 | 1999-01-19 | Helix Biomedix, Inc. | Lytic peptides |
US6255282B1 (en) * | 1987-07-06 | 2001-07-03 | Helix Biomedix, Inc. | Lytic peptides |
US5869296A (en) * | 1987-10-05 | 1999-02-09 | Washington University | DNA transposon Tn5seq1 |
US5212080A (en) * | 1987-10-05 | 1993-05-18 | Washington University | Method of DNA sequencing using DNA transposon Tn5seql |
US5102797A (en) * | 1989-05-26 | 1992-04-07 | Dna Plant Technology Corporation | Introduction of heterologous genes into bacteria using transposon flanked expression cassette and a binary vector system |
US20030150006A1 (en) * | 1992-01-27 | 2003-08-07 | James Petitte | Gene transfer in poultry by introduction of embryo cells in ovo |
US6515199B1 (en) * | 1992-01-27 | 2003-02-04 | North Carolina State University | Gene transfer in poultry by introduction of embryo cells in ovo |
US5733779A (en) * | 1992-11-13 | 1998-03-31 | Idec Pharmaceuticals Corporation | Impaired dominant selectable marker sequence and intronic insertion strategies for enhancement of expression of gene product and expression vector systems comprising same |
US20030074680A1 (en) * | 1993-03-19 | 2003-04-17 | Johns Hopkins University School Of Medicine | Growth differentiation factor-8 |
US5645991A (en) * | 1993-05-04 | 1997-07-08 | Univ. Of Connecticut | Transposon-containing DNA cloning vector and uses thereof |
US5512483A (en) * | 1993-05-21 | 1996-04-30 | Mcgill University | Expression vectors responsive to steroid hormones |
US5719055A (en) * | 1993-06-30 | 1998-02-17 | Board Of Supervisors Of Louisiana State University And Agricultural And Mechanical College | Transposon-based transformation vectors |
US5556782A (en) * | 1993-06-30 | 1996-09-17 | Board Of Supervisors Of Louisiana State University And Agricultural & Mechanical College | Transformed mammalian cells capable of expressing cecropin b |
US5753502A (en) * | 1993-08-05 | 1998-05-19 | Icos Corporation | Neuron-specific ICAM-4 promoter |
US20040006776A1 (en) * | 1993-12-20 | 2004-01-08 | Genzyme Transgenics Corporation | Transgenic production of antibodies in milk |
US7019193B2 (en) * | 1995-02-21 | 2006-03-28 | Gtc Biotherapeutics, Inc. | Treatments using transgenic goat produced antithrombin III |
US6171861B1 (en) * | 1995-06-07 | 2001-01-09 | Life Technologies, Inc. | Recombinational cloning using engineered recombination sites |
US6261554B1 (en) * | 1995-07-25 | 2001-07-17 | Introgene B.V. | Compositions for targeted gene delivery |
US6218185B1 (en) * | 1996-04-19 | 2001-04-17 | The United States Of America As Represented By The Secretary Of Agriculture | Piggybac transposon-based genetic transformation system for insects |
US6358710B1 (en) * | 1996-06-07 | 2002-03-19 | Neorx Corporation | Humanized antibodies that bind to the antigen bound by antibody NR-LU-13 |
US20030074681A1 (en) * | 1996-06-12 | 2003-04-17 | Macarthur William C. | Vectors and methods for tissue specific synthesis of protein in eggs of transgenic hens |
US6563017B2 (en) * | 1996-07-08 | 2003-05-13 | Dnavec Research Inc. | In vivo electroporation method for early stage embryo of chickens |
US6503729B1 (en) * | 1996-08-22 | 2003-01-07 | The Board Of Trustees Of The University Of Illinois | Selected polynucleotide and polypeptide sequences of the methanogenic archaeon, methanococcus jannashii |
US5948622A (en) * | 1996-09-09 | 1999-09-07 | Wisconsin Alumni Research Foundation | System for in vitro transposition |
US5925545A (en) * | 1996-09-09 | 1999-07-20 | Wisconsin Alumni Research Foundation | System for in vitro transposition |
US6107477A (en) * | 1996-09-26 | 2000-08-22 | Aurora Biosciences Corporation | Non-optimal Kozaks sequences |
US6528699B1 (en) * | 1997-02-25 | 2003-03-04 | Genzyme Transgenics Corporation | Transgenically produced non-secreted proteins |
US20020016975A1 (en) * | 1997-03-11 | 2002-02-07 | Regents Of The University Of Minnesota | Dna-based transposon system for the introduction of nucleic acid into dna of a cell |
US20020104109A1 (en) * | 1997-03-20 | 2002-08-01 | Bremel Robert D. | Transgenic animals |
US6291740B1 (en) * | 1997-03-20 | 2001-09-18 | Wisconsin Alumni Research Foundation | Transgenic animals |
US6080912A (en) * | 1997-03-20 | 2000-06-27 | Wisconsin Alumni Research Foundation | Methods for creating transgenic animals |
US20030055017A1 (en) * | 1997-07-24 | 2003-03-20 | Baylor College Of Medicine And Genemedicine | Growth hormone releasing hormone expression system and methods of use, including use in animals |
US5958775A (en) * | 1997-07-25 | 1999-09-28 | Thomas Jefferson University | Composition and method for targeted integration into cells |
US20030115622A1 (en) * | 1997-08-04 | 2003-06-19 | Ponce De Leon F. Abel | Production of avian embryonic germ (eg) cell lines by prolonged culturing of pgc's, use thereof for cloning and chimerization |
US6716823B1 (en) * | 1997-08-13 | 2004-04-06 | The Uab Research Foundation | Noninvasive genetic immunization, expression products therefrom, and uses thereof |
US6602686B1 (en) * | 1997-09-26 | 2003-08-05 | Athersys, Inc. | Compositions and method for non-targeted activation of endogenous genes |
US20040019922A1 (en) * | 1997-10-16 | 2004-01-29 | Avigenics, Inc. | Exogenous proteins expressed in avians and their eggs |
US20060188478A1 (en) * | 1997-10-16 | 2006-08-24 | Avigenics, Inc | Glycosylated interferon |
US6730822B1 (en) * | 1997-10-16 | 2004-05-04 | Avigenics, Inc. | Vectors in avian transgenesis |
US20040158882A1 (en) * | 1997-10-16 | 2004-08-12 | Avigenics, Inc. | Novel vectors in avian transgenesis |
US20060123488A1 (en) * | 1997-10-16 | 2006-06-08 | Avigenics, Inc. And University Of Georgia Research Foundation, Inc. | Avian eggs and exogenous proteins |
US20060171921A1 (en) * | 1997-10-16 | 2006-08-03 | Avigenics, Inc. | Glycosylated interferon |
US20060185029A1 (en) * | 1997-10-16 | 2006-08-17 | Avigenics, Inc. | Avians that produce eggs containing exogenous proteins |
US20060185024A1 (en) * | 1997-10-16 | 2006-08-17 | Avigenics, Inc. | Avians that produce eggs containing exogenous proteins |
US20020133835A1 (en) * | 1997-11-14 | 2002-09-19 | Robert Winston | Kit for transfection, storage and transfer of male germ cells for generation of transgenic species |
US20020053092A1 (en) * | 1997-11-14 | 2002-05-02 | Readhead Carol W. | Nucleic acid constructs containing a cyclin A1 promoter, and kit |
US20020056148A1 (en) * | 1997-11-14 | 2002-05-09 | Readhead Carol W. | Transfection, storage and transfer of male germ cells for generation of selectable transgenic stem cells |
US20020129398A1 (en) * | 1997-11-14 | 2002-09-12 | Robert Winston | Transfection, storage and transfer of male germ cells for generation of trangenic species |
US20020083479A1 (en) * | 1997-11-14 | 2002-06-27 | Robert Winston | In vitro transfection, storage and transfer of male germ cells for generation of transgenic species |
US6258571B1 (en) * | 1998-04-10 | 2001-07-10 | Genset | High throughput DNA sequencing vector |
US6291214B1 (en) * | 1998-05-11 | 2001-09-18 | Glaxo Wellcome Inc. | System for generating recombinant viruses |
US20020042137A1 (en) * | 1998-05-11 | 2002-04-11 | Richards Cynthia Ann | System for generating recombinant viruses |
US20020013955A1 (en) * | 1998-06-10 | 2002-01-31 | Sharon Ogden | Production of recombinant protein in transgenic fish |
US6376743B1 (en) * | 1998-08-11 | 2002-04-23 | University Of Hawaii | Mammalian transgenesis by intracytoplasmic sperm injection |
US6514728B1 (en) * | 1998-11-09 | 2003-02-04 | Nippon Biocaptal Limited | Process for preparation of cytokines using Sendai virus expression system |
US7160682B2 (en) * | 1998-11-13 | 2007-01-09 | Regents Of The University Of Minnesota | Nucleic acid transfer vector for the introduction of nucleic acid into the DNA of a cell |
US20020076797A1 (en) * | 1998-12-04 | 2002-06-20 | Haifan Lin | Purified and isolated piwi family genes and gene products and methods using same |
US6291243B1 (en) * | 1999-04-28 | 2001-09-18 | The Board Of Trustees Of The Leland Stanford Jr. University | P element derived vector and methods for its use |
US20020028513A1 (en) * | 1999-04-28 | 2002-03-07 | Patrick Fogarty | P element derived vector and methods for its use |
US20030154502A1 (en) * | 1999-08-12 | 2003-08-14 | Wimmer Ernst A. | Universal markers of transgenesis |
US7005296B1 (en) * | 1999-08-19 | 2006-02-28 | The United States Of America As Represented By The Secretary Of Agriculture | PiggyBac transformation system |
US20020055172A1 (en) * | 1999-10-07 | 2002-05-09 | Harrington John J. | Multiple promoter expression constructs and methods of use |
US7034115B1 (en) * | 1999-12-03 | 2006-04-25 | Japan Science And Technology Corporation | Transposase and method of gene modification |
US20020007051A1 (en) * | 1999-12-10 | 2002-01-17 | David Cheo | Use of multiple recombination sites with unique specificity in recombinational cloning |
US6759573B2 (en) * | 1999-12-15 | 2004-07-06 | Regents Of The University Of Minnesota | Method to enhance agrobacterium-mediated transformation of plants |
US20030150007A1 (en) * | 2000-03-21 | 2003-08-07 | Charalambos Savakis | Method of generating transgenic organisms using transposons |
US20040142475A1 (en) * | 2000-06-02 | 2004-07-22 | Barman Shikha P. | Delivery systems for bioactive agents |
US20020028488A1 (en) * | 2000-06-19 | 2002-03-07 | Sujay Singh | Transgenic avian species for making human and chimeric antibodies |
US20020052047A1 (en) * | 2000-06-22 | 2002-05-02 | Akira Hasebe | Insertion sequence element derived from ralstonia solanacearum |
US20030009026A1 (en) * | 2000-06-22 | 2003-01-09 | Akira Hasebe | Insertion sequence element derived from ralstonia solanacearum |
US6852510B2 (en) * | 2000-07-03 | 2005-02-08 | Gala Design Inc | Host cells containing multiple integrating vectors |
US20020072097A1 (en) * | 2000-07-07 | 2002-06-13 | Delcardayre Stephen | Molecular breeding of transposable elements |
US20030017534A1 (en) * | 2000-08-03 | 2003-01-23 | Roland Buelow | Production of humanized antibodies in transgenic animals |
US20020099015A1 (en) * | 2000-09-30 | 2002-07-25 | Barber Elizabeth K. | Gene expression control DNA element and associated protein |
US20020132349A1 (en) * | 2000-12-05 | 2002-09-19 | Goryshin Igor Yu | Double transposition methods for manipulating nucleic acids |
US20020108132A1 (en) * | 2001-02-02 | 2002-08-08 | Avigenics Inc. | Production of a monoclonal antibody by a transgenic chicken |
US20020119573A1 (en) * | 2001-02-28 | 2002-08-29 | Shaw Karen J. | Footprinting plasmid |
US7199279B2 (en) * | 2001-03-30 | 2007-04-03 | Avigenics, Inc. | Recombinant promoters in avian cells |
US20030140363A1 (en) * | 2001-03-30 | 2003-07-24 | Rapp Jeffrey C. | Avian lysozyme promoter |
US20030056241A1 (en) * | 2001-06-07 | 2003-03-20 | Haruo Matsuda | Chicken leukemia inhibitory factor (LIF) and gene thereof |
US20030101472A1 (en) * | 2001-09-13 | 2003-05-29 | David Baltimore | Method for producing transgenic animals |
US20030126629A1 (en) * | 2001-09-18 | 2003-07-03 | Rapp Jeffrey C. | Production of a transgenic avian by cytoplasmic injection |
US20030061629A1 (en) * | 2001-09-21 | 2003-03-27 | Pramod Sutrave | Production of transgenic birds using stage X primordial germ cells |
US20030143740A1 (en) * | 2001-10-15 | 2003-07-31 | Christine Wooddell | Processes for transposase mediated integration into mammalian cells |
US20040018624A1 (en) * | 2001-10-22 | 2004-01-29 | Athersys, Inc. | Compositions and methods for making mutations in cell lines and animals |
US20070113299A1 (en) * | 2001-11-30 | 2007-05-17 | Avigenics, Inc. | Transgenic avians containing recombinant ovomucoid promoters |
US20030126628A1 (en) * | 2001-11-30 | 2003-07-03 | Harvey Alex J. | Ovomucoid promoter and methods of use |
US20040040052A1 (en) * | 2001-12-21 | 2004-02-26 | Oxford Biomedica (Uk) Limited | Transgenic organism |
US20030121062A1 (en) * | 2001-12-21 | 2003-06-26 | Oxford Biomedica (Uk) Limited | Transgenic organism |
US20070009991A1 (en) * | 2002-03-14 | 2007-01-11 | Avigenics, Inc. | Gene expression in transgenic avians |
US20050004030A1 (en) * | 2002-05-17 | 2005-01-06 | Fischetti Vincent A. | Phage-associated lytic enzymes for treatment of Bacillus anthracis and related conditions |
US7083980B2 (en) * | 2003-04-17 | 2006-08-01 | Wisconsin Alumni Research Foundation | Tn5 transposase mutants and the use thereof |
US20070022485A1 (en) * | 2003-07-08 | 2007-01-25 | Japan Science Techonology Agency | Method of preparing transgenic organism with use of methylation and system therefor |
US20060046248A1 (en) * | 2004-08-25 | 2006-03-02 | Avigenics, Inc. | RNA interference in avians |
US20060121509A1 (en) * | 2004-12-01 | 2006-06-08 | Schering Aktiengesellschaft | Generation of replication competent viruses for therapeutic use |
US20060123504A1 (en) * | 2004-12-07 | 2006-06-08 | Avigenics, Inc. | Methods of producing polyclonal antibodies |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040235011A1 (en) * | 2002-06-26 | 2004-11-25 | Cooper Richard K. | Production of multimeric proteins |
US8283518B2 (en) | 2002-06-26 | 2012-10-09 | Transgenrx, Inc. | Administration of transposon-based vectors to reproductive organs |
US20040197910A1 (en) * | 2002-06-26 | 2004-10-07 | Cooper Richard K. | Gene regulation in transgenic animals using a transposon-based vector |
US20050273873A1 (en) * | 2003-03-07 | 2005-12-08 | Avigenics, Inc. | Genomic modification |
US8236294B2 (en) | 2003-12-24 | 2012-08-07 | The Board Of Supervisors Of Louisiana State University And Agricultural And Mechanical College | Gene therapy using transposon-based vectors |
US8071364B2 (en) | 2003-12-24 | 2011-12-06 | Transgenrx, Inc. | Gene therapy using transposon-based vectors |
US20060123504A1 (en) * | 2004-12-07 | 2006-06-08 | Avigenics, Inc. | Methods of producing polyclonal antibodies |
US8137974B2 (en) * | 2005-05-17 | 2012-03-20 | Temasek Life Sciences Laboratory Limited | Transposition of maize AC/DS elements in vertebrates |
US20090131272A1 (en) * | 2005-05-17 | 2009-05-21 | Temasek Life Sciences Laboratory Limited | Transposition of maize ac/ds elements in vertebrates |
US8399257B2 (en) | 2005-05-17 | 2013-03-19 | Temasek Life Sciences Laboratory Limited | Transposition of maize Ac/Ds elements in vertebrates |
US9150880B2 (en) | 2008-09-25 | 2015-10-06 | Proteovec Holding, L.L.C. | Vectors for production of antibodies |
US9157097B2 (en) | 2008-09-25 | 2015-10-13 | Proteovec Holding, L.L.C. | Vectors for production of growth hormone |
US9150881B2 (en) | 2009-04-09 | 2015-10-06 | Proteovec Holding, L.L.C. | Production of proteins using transposon-based vectors |
Also Published As
Publication number | Publication date |
---|---|
US8283518B2 (en) | 2012-10-09 |
US20080235815A1 (en) | 2008-09-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8283518B2 (en) | Administration of transposon-based vectors to reproductive organs | |
US7527966B2 (en) | Gene regulation in transgenic animals using a transposon-based vector | |
KR102609858B1 (en) | Adeno-Associated Viral Vectors for the Treatment of Mucopolysaccharidosis | |
KR100880509B1 (en) | A Novel vector and expression cell line for mass production of recombinant protein and a process of producing recombinant protein using same | |
US20040235011A1 (en) | Production of multimeric proteins | |
AU775988B2 (en) | Ligand activated transcriptional regulator proteins | |
US20040077572A1 (en) | Transposon system and methods of use | |
CN108495685B (en) | Yeast-based immunotherapy against clostridium difficile infection | |
US20030119104A1 (en) | Chromosome-based platforms | |
US20040003420A1 (en) | Modified recombinase | |
CN110023500A (en) | The attenuation glutamine synthelase alternatively marked | |
CN111094569A (en) | Light-controlled viral protein, gene thereof, and viral vector containing same | |
CN114181957B (en) | Stable T7 expression system based on virus capping enzyme and method for expressing protein in eukaryote | |
WO2005081716A2 (en) | DNA VACCINES TARGETING ANTIGENS OF THE SEVERE ACUTE RESPIRATORY SYNDROME CORONAVIRUS (SARS-CoV) | |
CN112877292A (en) | Human antibody producing cell | |
CN109762846B (en) | Repair of GALC associated with krabbe disease using base editingC1586TMutational reagents and methods | |
WO2002038613A2 (en) | Modified recombinase | |
KR20220161297A (en) | new cell line | |
KR102523209B1 (en) | Foot-and-mouth disease type O recombinant virus (O TWN-3A) inserted with T cell epitope for enhancing cellular immunity and vaccine composition containing inactivated antigen isolated and purified from the virus | |
CN111471665B (en) | DNA cyclization molecule and application thereof | |
CN102071213B (en) | Secretory type alkaline phosphatase traced TA cloning eukaryon expression vector and construction method thereof | |
KR20230072149A (en) | Recombinant foot-and-mouth disease type O virus that induces strong adaptive immune response and overcomes maternally-derived antibody and foot-and-mouth disease vaccine composition comprising the same | |
KR20230072150A (en) | Recombinant foot-and-mouth disease type A virus that induces strong adaptive immune response and overcomes maternally-derived antibody and foot-and-mouth disease vaccine composition comprising the same | |
CA2522166C (en) | Lambda integrase mutein for use in recombination | |
KR20230117327A (en) | An expression vector comprising a soluble alkaline phosphatase construct and a polynucleotide encoding the soluble alkaline phosphatase construct. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: TRANSGENRX, INC., TEXAS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:CADD, GARY C.;FIORETTI, WILLIAM C.;REEL/FRAME:015010/0117 Effective date: 20040120 Owner name: BOARD OF SUPERVISORS OF LOUISIANA STATE UNIVERSITY Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:COOPER, RICHARD K.;REEL/FRAME:015012/0049 Effective date: 20040120 |
|
AS | Assignment |
Owner name: TRANSGENRX, INC., TEXAS Free format text: TO CORRECT NAME OF ASSIGNOR GARY C. CADD TO GARY G. CADD ON NOTICE OF ASSIGNMENT RECORDATION DOCUMENT ON REEL 015010, FRAME 0117; RECORDED ON 02/26/2004;ASSIGNORS:CADD, GARY G.;FIORETTI, WILLIAM C.;REEL/FRAME:015954/0963 Effective date: 20040120 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |