US20030104361A1 - Method of detection of nucleic acids with a specific sequence composition - Google Patents
Method of detection of nucleic acids with a specific sequence composition Download PDFInfo
- Publication number
- US20030104361A1 US20030104361A1 US08/860,844 US86084497A US2003104361A1 US 20030104361 A1 US20030104361 A1 US 20030104361A1 US 86084497 A US86084497 A US 86084497A US 2003104361 A1 US2003104361 A1 US 2003104361A1
- Authority
- US
- United States
- Prior art keywords
- nucleic acid
- seq
- binding
- tba
- sequence
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 150000007523 nucleic acids Chemical class 0.000 title claims abstract description 380
- 102000039446 nucleic acids Human genes 0.000 title claims abstract description 325
- 108020004707 nucleic acids Proteins 0.000 title claims abstract description 325
- 238000000034 method Methods 0.000 title claims abstract description 181
- 238000001514 detection method Methods 0.000 title claims abstract description 47
- 239000000203 mixture Substances 0.000 title claims abstract description 40
- 230000027455 binding Effects 0.000 claims abstract description 325
- 239000000523 sample Substances 0.000 claims abstract description 143
- 238000000429 assembly Methods 0.000 claims abstract description 33
- 230000000712 assembly Effects 0.000 claims abstract description 33
- 238000004519 manufacturing process Methods 0.000 claims abstract description 31
- 230000004807 localization Effects 0.000 claims abstract description 21
- 230000001717 pathogenic effect Effects 0.000 claims abstract description 21
- 238000012544 monitoring process Methods 0.000 claims abstract description 4
- 230000035945 sensitivity Effects 0.000 claims abstract description 4
- 239000012634 fragment Substances 0.000 claims description 89
- 238000009396 hybridization Methods 0.000 claims description 84
- 108020004414 DNA Proteins 0.000 claims description 80
- 102000003945 NF-kappa B Human genes 0.000 claims description 77
- 108010057466 NF-kappa B Proteins 0.000 claims description 76
- 108090000623 proteins and genes Proteins 0.000 claims description 75
- 102000004169 proteins and genes Human genes 0.000 claims description 70
- 230000000295 complement effect Effects 0.000 claims description 64
- 150000001413 amino acids Chemical group 0.000 claims description 63
- 229920000642 polymer Polymers 0.000 claims description 38
- 102000044158 nucleic acid binding protein Human genes 0.000 claims description 32
- 108700020942 nucleic acid binding protein Proteins 0.000 claims description 32
- 238000012360 testing method Methods 0.000 claims description 31
- 239000011324 bead Substances 0.000 claims description 28
- 102000052510 DNA-Binding Proteins Human genes 0.000 claims description 26
- 239000002773 nucleotide Substances 0.000 claims description 23
- 125000003729 nucleotide group Chemical group 0.000 claims description 23
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 22
- 102000040430 polynucleotide Human genes 0.000 claims description 20
- 108091033319 polynucleotide Proteins 0.000 claims description 20
- 239000002157 polynucleotide Substances 0.000 claims description 20
- 239000013598 vector Substances 0.000 claims description 20
- 101710096438 DNA-binding protein Proteins 0.000 claims description 17
- 108010077850 Nuclear Localization Signals Proteins 0.000 claims description 16
- 239000000047 product Substances 0.000 claims description 16
- 239000000126 substance Substances 0.000 claims description 16
- 108010083127 phage repressor proteins Proteins 0.000 claims description 15
- 101710149951 Protein Tat Proteins 0.000 claims description 14
- 244000052769 pathogen Species 0.000 claims description 14
- 238000002360 preparation method Methods 0.000 claims description 14
- 241000701959 Escherichia virus Lambda Species 0.000 claims description 13
- 230000001225 therapeutic effect Effects 0.000 claims description 11
- 108010038807 Oligopeptides Proteins 0.000 claims description 10
- 102000015636 Oligopeptides Human genes 0.000 claims description 10
- 238000006243 chemical reaction Methods 0.000 claims description 10
- 108091008324 binding proteins Proteins 0.000 claims description 9
- 239000000872 buffer Substances 0.000 claims description 9
- 230000000069 prophylactic effect Effects 0.000 claims description 9
- 102000004190 Enzymes Human genes 0.000 claims description 8
- 108090000790 Enzymes Proteins 0.000 claims description 8
- WEDDFMCSUNNZJR-WDSKDSINSA-N Met-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(O)=O WEDDFMCSUNNZJR-WDSKDSINSA-N 0.000 claims description 8
- 239000007790 solid phase Substances 0.000 claims description 8
- 230000002596 correlated effect Effects 0.000 claims description 7
- 238000011282 treatment Methods 0.000 claims description 7
- 238000011161 development Methods 0.000 claims description 6
- 238000001727 in vivo Methods 0.000 claims description 6
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 claims description 6
- 102000044126 RNA-Binding Proteins Human genes 0.000 claims description 5
- 102000006467 TATA-Box Binding Protein Human genes 0.000 claims description 5
- 108010044281 TATA-Box Binding Protein Proteins 0.000 claims description 5
- 230000006872 improvement Effects 0.000 claims description 5
- 108090000103 Relaxin Proteins 0.000 claims description 4
- 102000003743 Relaxin Human genes 0.000 claims description 4
- 238000000855 fermentation Methods 0.000 claims description 4
- 230000004151 fermentation Effects 0.000 claims description 4
- 230000003100 immobilizing effect Effects 0.000 claims description 4
- 230000030648 nucleus localization Effects 0.000 claims description 4
- 101710159080 Aconitate hydratase A Proteins 0.000 claims description 3
- 101710159078 Aconitate hydratase B Proteins 0.000 claims description 3
- 108090001061 Insulin Proteins 0.000 claims description 3
- 102000004877 Insulin Human genes 0.000 claims description 3
- 101710105008 RNA-binding protein Proteins 0.000 claims description 3
- 101710172711 Structural protein Proteins 0.000 claims description 3
- 241000251539 Vertebrata <Metazoa> Species 0.000 claims description 3
- 239000007795 chemical reaction product Substances 0.000 claims description 3
- 238000011065 in-situ storage Methods 0.000 claims description 3
- 229940125396 insulin Drugs 0.000 claims description 3
- 238000013508 migration Methods 0.000 claims description 3
- 230000005012 migration Effects 0.000 claims description 3
- 238000012163 sequencing technique Methods 0.000 claims description 3
- 239000000055 Corticotropin-Releasing Hormone Substances 0.000 claims description 2
- IDLFZVILOHSSID-OVLDLUHVSA-N corticotropin Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](C(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)NC(=O)[C@@H](N)CO)C1=CC=C(O)C=C1 IDLFZVILOHSSID-OVLDLUHVSA-N 0.000 claims description 2
- 229960000258 corticotropin Drugs 0.000 claims description 2
- 239000003937 drug carrier Substances 0.000 claims description 2
- 230000001456 gonadotroph Effects 0.000 claims description 2
- 239000005556 hormone Substances 0.000 claims description 2
- 229940088597 hormone Drugs 0.000 claims description 2
- 239000007788 liquid Substances 0.000 claims description 2
- 101150015482 BBR gene Proteins 0.000 claims 44
- 108020004711 Nucleic Acid Probes Proteins 0.000 claims 16
- 239000002853 nucleic acid probe Substances 0.000 claims 16
- 101150006308 botA gene Proteins 0.000 claims 11
- 102100024554 Tetranectin Human genes 0.000 claims 2
- 101000742658 Human papillomavirus type 1 Regulatory protein E2 Proteins 0.000 claims 1
- 102000023732 binding proteins Human genes 0.000 claims 1
- 230000001771 impaired effect Effects 0.000 claims 1
- 230000002194 synthesizing effect Effects 0.000 claims 1
- 230000003321 amplification Effects 0.000 abstract description 20
- 238000003199 nucleic acid amplification method Methods 0.000 abstract description 20
- 230000009870 specific binding Effects 0.000 abstract description 11
- 238000004458 analytical method Methods 0.000 abstract description 4
- 201000010099 disease Diseases 0.000 abstract description 4
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 4
- 230000002068 genetic effect Effects 0.000 abstract description 3
- 238000003745 diagnosis Methods 0.000 abstract 1
- 230000000692 anti-sense effect Effects 0.000 description 119
- 241000725303 Human immunodeficiency virus Species 0.000 description 85
- 239000002299 complementary DNA Substances 0.000 description 64
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 58
- 239000002253 acid Substances 0.000 description 55
- 150000007513 acids Chemical class 0.000 description 55
- 108090000765 processed proteins & peptides Proteins 0.000 description 43
- 230000015572 biosynthetic process Effects 0.000 description 29
- 230000004568 DNA-binding Effects 0.000 description 28
- 210000004027 cell Anatomy 0.000 description 28
- 241000701806 Human papillomavirus Species 0.000 description 25
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 23
- 239000000539 dimer Substances 0.000 description 23
- 108010054155 lysyllysine Proteins 0.000 description 22
- 108010006519 Molecular Chaperones Proteins 0.000 description 18
- 238000003556 assay Methods 0.000 description 18
- 108010050848 glycylleucine Proteins 0.000 description 18
- 108010047857 aspartylglycine Proteins 0.000 description 17
- 108010061238 threonyl-glycine Proteins 0.000 description 17
- 238000007792 addition Methods 0.000 description 15
- 108010073969 valyllysine Proteins 0.000 description 15
- 108010010147 glycylglutamine Proteins 0.000 description 14
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 13
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 13
- 108091034117 Oligonucleotide Proteins 0.000 description 13
- 102100035348 Serine/threonine-protein phosphatase 2B catalytic subunit alpha isoform Human genes 0.000 description 13
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 12
- KYJHWKAMFISDJE-RCWTZXSCSA-N Met-Thr-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCSC KYJHWKAMFISDJE-RCWTZXSCSA-N 0.000 description 12
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 12
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 11
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 11
- 239000003814 drug Substances 0.000 description 11
- 108010040030 histidinoalanine Proteins 0.000 description 11
- 241000341655 Human papillomavirus type 16 Species 0.000 description 10
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 10
- 108010008355 arginyl-glutamine Proteins 0.000 description 10
- 108010062796 arginyllysine Proteins 0.000 description 10
- 238000010586 diagram Methods 0.000 description 10
- 108010025306 histidylleucine Proteins 0.000 description 10
- 238000006116 polymerization reaction Methods 0.000 description 10
- 108091008146 restriction endonucleases Proteins 0.000 description 10
- 238000013518 transcription Methods 0.000 description 10
- 230000035897 transcription Effects 0.000 description 10
- 108700020911 DNA-Binding Proteins Proteins 0.000 description 9
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 9
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 9
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 9
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 9
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 9
- VGNYHOBZJKWRGI-CIUDSAMLSA-N Ser-Asn-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO VGNYHOBZJKWRGI-CIUDSAMLSA-N 0.000 description 9
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 9
- 108010068265 aspartyltyrosine Proteins 0.000 description 9
- 210000004369 blood Anatomy 0.000 description 9
- 239000008280 blood Substances 0.000 description 9
- 235000013305 food Nutrition 0.000 description 9
- 230000014509 gene expression Effects 0.000 description 9
- 239000007787 solid Substances 0.000 description 9
- 108010029384 tryptophyl-histidine Proteins 0.000 description 9
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 8
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 8
- NMXKFWOEASXOGB-QSFUFRPTSA-N Ala-Ile-His Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NMXKFWOEASXOGB-QSFUFRPTSA-N 0.000 description 8
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 8
- 102000014914 Carrier Proteins Human genes 0.000 description 8
- QADCTXFNLZBZAB-GHCJXIJMSA-N Ile-Asn-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C)C(=O)O)N QADCTXFNLZBZAB-GHCJXIJMSA-N 0.000 description 8
- OBVHKUFUDCPZDW-JYJNAYRXSA-N Met-Arg-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OBVHKUFUDCPZDW-JYJNAYRXSA-N 0.000 description 8
- 108700026244 Open Reading Frames Proteins 0.000 description 8
- 241001631646 Papillomaviridae Species 0.000 description 8
- ODRUTDLAONAVDV-IHRRRGAJSA-N Ser-Val-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ODRUTDLAONAVDV-IHRRRGAJSA-N 0.000 description 8
- 241000700605 Viruses Species 0.000 description 8
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 8
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 8
- 230000008901 benefit Effects 0.000 description 8
- 239000003153 chemical reaction reagent Substances 0.000 description 8
- 108010015792 glycyllysine Proteins 0.000 description 8
- 230000003993 interaction Effects 0.000 description 8
- 108010034529 leucyl-lysine Proteins 0.000 description 8
- 108010003700 lysyl aspartic acid Proteins 0.000 description 8
- 230000008569 process Effects 0.000 description 8
- 108010048818 seryl-histidine Proteins 0.000 description 8
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 7
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 7
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 7
- 206010028980 Neoplasm Diseases 0.000 description 7
- OJFFAQFRCVPHNN-JYBASQMISA-N Ser-Thr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O OJFFAQFRCVPHNN-JYBASQMISA-N 0.000 description 7
- 239000012491 analyte Substances 0.000 description 7
- 108010092854 aspartyllysine Proteins 0.000 description 7
- 230000001413 cellular effect Effects 0.000 description 7
- 108010016616 cysteinylglycine Proteins 0.000 description 7
- 238000012217 deletion Methods 0.000 description 7
- 230000037430 deletion Effects 0.000 description 7
- 229940079593 drug Drugs 0.000 description 7
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 7
- 230000003902 lesion Effects 0.000 description 7
- 108010057821 leucylproline Proteins 0.000 description 7
- 239000013612 plasmid Substances 0.000 description 7
- 108010079317 prolyl-tyrosine Proteins 0.000 description 7
- 230000002285 radioactive effect Effects 0.000 description 7
- 239000000243 solution Substances 0.000 description 7
- 239000003656 tris buffered saline Substances 0.000 description 7
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 6
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 6
- 108700025685 HIV Enhancer Proteins 0.000 description 6
- YERBCFWVWITTEJ-NAZCDGGXSA-N His-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N)O YERBCFWVWITTEJ-NAZCDGGXSA-N 0.000 description 6
- 241000880493 Leptailurus serval Species 0.000 description 6
- 101710163270 Nuclease Proteins 0.000 description 6
- WFIVLLFYUZZWOD-RHYQMDGZSA-N Pro-Lys-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WFIVLLFYUZZWOD-RHYQMDGZSA-N 0.000 description 6
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 6
- 108010038633 aspartylglutamate Proteins 0.000 description 6
- 238000001574 biopsy Methods 0.000 description 6
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 6
- 230000000694 effects Effects 0.000 description 6
- 108010049041 glutamylalanine Proteins 0.000 description 6
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 6
- 108010081551 glycylphenylalanine Proteins 0.000 description 6
- 108010085325 histidylproline Proteins 0.000 description 6
- 238000002372 labelling Methods 0.000 description 6
- 239000012071 phase Substances 0.000 description 6
- 108010051242 phenylalanylserine Proteins 0.000 description 6
- 108010029020 prolylglycine Proteins 0.000 description 6
- 108010015796 prolylisoleucine Proteins 0.000 description 6
- 108010026333 seryl-proline Proteins 0.000 description 6
- QQACQIHVWCVBBR-GVARAGBVSA-N Ala-Ile-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QQACQIHVWCVBBR-GVARAGBVSA-N 0.000 description 5
- VQAVBBCZFQAAED-FXQIFTODSA-N Ala-Pro-Asn Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N VQAVBBCZFQAAED-FXQIFTODSA-N 0.000 description 5
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 5
- NIUDXSFNLBIWOB-DCAQKATOSA-N Arg-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NIUDXSFNLBIWOB-DCAQKATOSA-N 0.000 description 5
- INXWADWANGLMPJ-JYJNAYRXSA-N Arg-Phe-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)CC1=CC=CC=C1 INXWADWANGLMPJ-JYJNAYRXSA-N 0.000 description 5
- ZDJZEGYVKANKED-NRPADANISA-N Gln-Cys-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O ZDJZEGYVKANKED-NRPADANISA-N 0.000 description 5
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 5
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 5
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 5
- WTZUSCUIVPVCRH-SRVKXCTJSA-N Lys-Gln-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WTZUSCUIVPVCRH-SRVKXCTJSA-N 0.000 description 5
- 108010079364 N-glycylalanine Proteins 0.000 description 5
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 5
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 5
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 5
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 5
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 5
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 5
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 5
- 108010005233 alanylglutamic acid Proteins 0.000 description 5
- 108010047495 alanylglycine Proteins 0.000 description 5
- 108010077245 asparaginyl-proline Proteins 0.000 description 5
- 201000011510 cancer Diseases 0.000 description 5
- 208000019065 cervical carcinoma Diseases 0.000 description 5
- 108010004073 cysteinylcysteine Proteins 0.000 description 5
- 230000001747 exhibiting effect Effects 0.000 description 5
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 5
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 5
- 108010036413 histidylglycine Proteins 0.000 description 5
- 230000010354 integration Effects 0.000 description 5
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 5
- 108010027338 isoleucylcysteine Proteins 0.000 description 5
- 230000036210 malignancy Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 230000004048 modification Effects 0.000 description 5
- 108010077112 prolyl-proline Proteins 0.000 description 5
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 4
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 4
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 4
- 101100315624 Caenorhabditis elegans tyr-1 gene Proteins 0.000 description 4
- 108091026890 Coding region Proteins 0.000 description 4
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 4
- SAEVTQWAYDPXMU-KATARQTJSA-N Cys-Thr-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O SAEVTQWAYDPXMU-KATARQTJSA-N 0.000 description 4
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 4
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 4
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 4
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 4
- ZJICFHQSPWFBKP-AVGNSLFASA-N Glu-Asn-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZJICFHQSPWFBKP-AVGNSLFASA-N 0.000 description 4
- XMVLTPMCUJTJQP-FXQIFTODSA-N Glu-Gln-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N XMVLTPMCUJTJQP-FXQIFTODSA-N 0.000 description 4
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 4
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 4
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 4
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 4
- 208000031886 HIV Infections Diseases 0.000 description 4
- 208000037357 HIV infectious disease Diseases 0.000 description 4
- JCGMFFQQHJQASB-PYJNHQTQSA-N Ile-Val-His Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O JCGMFFQQHJQASB-PYJNHQTQSA-N 0.000 description 4
- 101710125507 Integrase/recombinase Proteins 0.000 description 4
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 4
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 4
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 4
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 4
- QLDHBYRUNQZIJQ-DKIMLUQUSA-N Leu-Ile-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QLDHBYRUNQZIJQ-DKIMLUQUSA-N 0.000 description 4
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 4
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 4
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 4
- IIPHCNKHEZYSNE-DCAQKATOSA-N Met-Arg-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O IIPHCNKHEZYSNE-DCAQKATOSA-N 0.000 description 4
- DJDFBVNNDAUPRW-GUBZILKMSA-N Met-Glu-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O DJDFBVNNDAUPRW-GUBZILKMSA-N 0.000 description 4
- 208000009608 Papillomavirus Infections Diseases 0.000 description 4
- ZWJKVFAYPLPCQB-UNQGMJICSA-N Phe-Arg-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O ZWJKVFAYPLPCQB-UNQGMJICSA-N 0.000 description 4
- IWRZUGHCHFZYQZ-UFYCRDLUSA-N Phe-Arg-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 IWRZUGHCHFZYQZ-UFYCRDLUSA-N 0.000 description 4
- PEFJUUYFEGBXFA-BZSNNMDCSA-N Phe-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 PEFJUUYFEGBXFA-BZSNNMDCSA-N 0.000 description 4
- GOUWCZRDTWTODO-YDHLFZDLSA-N Phe-Val-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O GOUWCZRDTWTODO-YDHLFZDLSA-N 0.000 description 4
- JUJCUYWRJMFJJF-AVGNSLFASA-N Pro-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 JUJCUYWRJMFJJF-AVGNSLFASA-N 0.000 description 4
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 4
- 108020004511 Recombinant DNA Proteins 0.000 description 4
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 4
- NIOYDASGXWLHEZ-CIUDSAMLSA-N Ser-Met-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O NIOYDASGXWLHEZ-CIUDSAMLSA-N 0.000 description 4
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 4
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 4
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 4
- CSZFFQBUTMGHAH-UAXMHLISSA-N Thr-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O CSZFFQBUTMGHAH-UAXMHLISSA-N 0.000 description 4
- BZTSQFWJNJYZSX-JRQIVUDYSA-N Thr-Tyr-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O BZTSQFWJNJYZSX-JRQIVUDYSA-N 0.000 description 4
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 4
- 102100030246 Transcription factor Sp1 Human genes 0.000 description 4
- GZUIDWDVMWZSMI-KKUMJFAQSA-N Tyr-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GZUIDWDVMWZSMI-KKUMJFAQSA-N 0.000 description 4
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 4
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 4
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 4
- 108010013835 arginine glutamate Proteins 0.000 description 4
- 239000012472 biological sample Substances 0.000 description 4
- 239000003795 chemical substances by application Substances 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 238000012790 confirmation Methods 0.000 description 4
- 239000000356 contaminant Substances 0.000 description 4
- 108010060199 cysteinylproline Proteins 0.000 description 4
- 108010054813 diprotin B Proteins 0.000 description 4
- 238000013467 fragmentation Methods 0.000 description 4
- 238000006062 fragmentation reaction Methods 0.000 description 4
- 108010077515 glycylproline Proteins 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 4
- 108010018006 histidylserine Proteins 0.000 description 4
- 239000000710 homodimer Substances 0.000 description 4
- 208000033519 human immunodeficiency virus infectious disease Diseases 0.000 description 4
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 4
- 108010078274 isoleucylvaline Proteins 0.000 description 4
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 4
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 4
- 230000003211 malignant effect Effects 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 239000011859 microparticle Substances 0.000 description 4
- 108010031719 prolyl-serine Proteins 0.000 description 4
- 108010070643 prolylglutamic acid Proteins 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- -1 ribonucleoside triphosphates Chemical class 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 230000036964 tight binding Effects 0.000 description 4
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 3
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 3
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 3
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 3
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 3
- IFKQPMZRDQZSHI-GHCJXIJMSA-N Ala-Ile-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O IFKQPMZRDQZSHI-GHCJXIJMSA-N 0.000 description 3
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 3
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 3
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 3
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 3
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 3
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 3
- LLZXKVAAEWBUPB-KKUMJFAQSA-N Arg-Gln-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLZXKVAAEWBUPB-KKUMJFAQSA-N 0.000 description 3
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 3
- AFNHFVVOJZBIJD-GUBZILKMSA-N Arg-Met-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O AFNHFVVOJZBIJD-GUBZILKMSA-N 0.000 description 3
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 3
- PRLPSDIHSRITSF-UNQGMJICSA-N Arg-Phe-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PRLPSDIHSRITSF-UNQGMJICSA-N 0.000 description 3
- RZVVKNIACROXRM-ZLUOBGJFSA-N Asn-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N RZVVKNIACROXRM-ZLUOBGJFSA-N 0.000 description 3
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 3
- OLVIPTLKNSAYRJ-YUMQZZPRSA-N Asn-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N OLVIPTLKNSAYRJ-YUMQZZPRSA-N 0.000 description 3
- KMCRKVOLRCOMBG-DJFWLOJKSA-N Asn-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KMCRKVOLRCOMBG-DJFWLOJKSA-N 0.000 description 3
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 3
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 3
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 3
- LMIWYCWRJVMAIQ-NHCYSSNCSA-N Asn-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N LMIWYCWRJVMAIQ-NHCYSSNCSA-N 0.000 description 3
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 3
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 3
- TVVYVAUGRHNTGT-UGYAYLCHSA-N Asp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O TVVYVAUGRHNTGT-UGYAYLCHSA-N 0.000 description 3
- LKIYSIYBKYLKPU-BIIVOSGPSA-N Asp-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O LKIYSIYBKYLKPU-BIIVOSGPSA-N 0.000 description 3
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 3
- VNXQRBXEQXLERQ-CIUDSAMLSA-N Asp-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N VNXQRBXEQXLERQ-CIUDSAMLSA-N 0.000 description 3
- YIDFBWRHIYOYAA-LKXGYXEUSA-N Asp-Ser-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YIDFBWRHIYOYAA-LKXGYXEUSA-N 0.000 description 3
- XQFLFQWOBXPMHW-NHCYSSNCSA-N Asp-Val-His Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O XQFLFQWOBXPMHW-NHCYSSNCSA-N 0.000 description 3
- 241000894006 Bacteria Species 0.000 description 3
- 0 CC(C)C(CC1C23)C(C4C56)*12C3C45C1C6C(C)C1 Chemical compound CC(C)C(CC1C23)C(C4C56)*12C3C45C1C6C(C)C1 0.000 description 3
- YMBAVNPKBWHDAW-CIUDSAMLSA-N Cys-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N YMBAVNPKBWHDAW-CIUDSAMLSA-N 0.000 description 3
- VBPGTULCFGKGTF-ACZMJKKPSA-N Cys-Glu-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VBPGTULCFGKGTF-ACZMJKKPSA-N 0.000 description 3
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- 241000588724 Escherichia coli Species 0.000 description 3
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 3
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 3
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 3
- LVNILKSSFHCSJZ-IHRRRGAJSA-N Gln-Gln-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LVNILKSSFHCSJZ-IHRRRGAJSA-N 0.000 description 3
- TWTWUBHEWQPMQW-ZPFDUUQYSA-N Gln-Ile-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWTWUBHEWQPMQW-ZPFDUUQYSA-N 0.000 description 3
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 3
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 3
- MFORDNZDKAVNSR-SRVKXCTJSA-N Gln-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O MFORDNZDKAVNSR-SRVKXCTJSA-N 0.000 description 3
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 3
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 3
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 3
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 3
- ZWMYUDZLXAQHCK-CIUDSAMLSA-N Glu-Met-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O ZWMYUDZLXAQHCK-CIUDSAMLSA-N 0.000 description 3
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 3
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 3
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 3
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 3
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 3
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 3
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 3
- IUKIDFVOUHZRAK-QWRGUYRKSA-N Gly-Lys-His Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IUKIDFVOUHZRAK-QWRGUYRKSA-N 0.000 description 3
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 3
- FXLVSYVJDPCIHH-STQMWFEESA-N Gly-Phe-Arg Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FXLVSYVJDPCIHH-STQMWFEESA-N 0.000 description 3
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 3
- JNGHLWWFPGIJER-STQMWFEESA-N Gly-Pro-Tyr Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JNGHLWWFPGIJER-STQMWFEESA-N 0.000 description 3
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 3
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 3
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 3
- 206010019799 Hepatitis viral Diseases 0.000 description 3
- YPQDTQJBOFOTJQ-SXTJYALSSA-N Ile-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N YPQDTQJBOFOTJQ-SXTJYALSSA-N 0.000 description 3
- SJIGTGZVQGLMGG-NAKRPEOUSA-N Ile-Cys-Arg Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)O SJIGTGZVQGLMGG-NAKRPEOUSA-N 0.000 description 3
- FADXGVVLSPPEQY-GHCJXIJMSA-N Ile-Cys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FADXGVVLSPPEQY-GHCJXIJMSA-N 0.000 description 3
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 3
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 3
- XLXPYSDGMXTTNQ-DKIMLUQUSA-N Ile-Phe-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CC(C)C)C(O)=O XLXPYSDGMXTTNQ-DKIMLUQUSA-N 0.000 description 3
- NLZVTPYXYXMCIP-XUXIUFHCSA-N Ile-Pro-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O NLZVTPYXYXMCIP-XUXIUFHCSA-N 0.000 description 3
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 3
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 3
- 108091092195 Intron Proteins 0.000 description 3
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 3
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 3
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 3
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 3
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 3
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 3
- MJTOYIHCKVQICL-ULQDDVLXSA-N Leu-Met-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MJTOYIHCKVQICL-ULQDDVLXSA-N 0.000 description 3
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 3
- LJBVRCDPWOJOEK-PPCPHDFISA-N Leu-Thr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LJBVRCDPWOJOEK-PPCPHDFISA-N 0.000 description 3
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 3
- NFLFJGGKOHYZJF-BJDJZHNGSA-N Lys-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN NFLFJGGKOHYZJF-BJDJZHNGSA-N 0.000 description 3
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 3
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 3
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 3
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 3
- YDDDRTIPNTWGIG-SRVKXCTJSA-N Lys-Lys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O YDDDRTIPNTWGIG-SRVKXCTJSA-N 0.000 description 3
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 3
- RPWQJSBMXJSCPD-XUXIUFHCSA-N Lys-Val-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)C(C)C)C(O)=O RPWQJSBMXJSCPD-XUXIUFHCSA-N 0.000 description 3
- 239000000637 Melanocyte-Stimulating Hormone Substances 0.000 description 3
- 108010007013 Melanocyte-Stimulating Hormones Proteins 0.000 description 3
- 102000005431 Molecular Chaperones Human genes 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- 108091005804 Peptidases Proteins 0.000 description 3
- UMKYAYXCMYYNHI-AVGNSLFASA-N Phe-Gln-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N UMKYAYXCMYYNHI-AVGNSLFASA-N 0.000 description 3
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 3
- BFYHIHGIHGROAT-HTUGSXCWSA-N Phe-Glu-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFYHIHGIHGROAT-HTUGSXCWSA-N 0.000 description 3
- JEBWZLWTRPZQRX-QWRGUYRKSA-N Phe-Gly-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O JEBWZLWTRPZQRX-QWRGUYRKSA-N 0.000 description 3
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 3
- XZQYIJALMGEUJD-OEAJRASXSA-N Phe-Lys-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZQYIJALMGEUJD-OEAJRASXSA-N 0.000 description 3
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 3
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 3
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 3
- SVXXJYJCRNKDDE-AVGNSLFASA-N Pro-Pro-His Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CN=CN1 SVXXJYJCRNKDDE-AVGNSLFASA-N 0.000 description 3
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 3
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 3
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 3
- 239000004365 Protease Substances 0.000 description 3
- 102000009661 Repressor Proteins Human genes 0.000 description 3
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 3
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 3
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 3
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 3
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 3
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 3
- PCJLFYBAQZQOFE-KATARQTJSA-N Ser-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N)O PCJLFYBAQZQOFE-KATARQTJSA-N 0.000 description 3
- 108020004682 Single-Stranded DNA Proteins 0.000 description 3
- VPZKQTYZIVOJDV-LMVFSUKVSA-N Thr-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(O)=O VPZKQTYZIVOJDV-LMVFSUKVSA-N 0.000 description 3
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 3
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 3
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 3
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 3
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 3
- VEENWOSZGWWKHW-SZZJOZGLSA-N Thr-Trp-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N)O VEENWOSZGWWKHW-SZZJOZGLSA-N 0.000 description 3
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 3
- 239000007983 Tris buffer Substances 0.000 description 3
- XZLHHHYSWIYXHD-XIRDDKMYSA-N Trp-Gln-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XZLHHHYSWIYXHD-XIRDDKMYSA-N 0.000 description 3
- VMBBTANKMSRJSS-JSGCOSHPSA-N Trp-Glu-Gly Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VMBBTANKMSRJSS-JSGCOSHPSA-N 0.000 description 3
- CDRYEAWHKJSGAF-BPNCWPANSA-N Tyr-Ala-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O CDRYEAWHKJSGAF-BPNCWPANSA-N 0.000 description 3
- VFJIWSJKZJTQII-SRVKXCTJSA-N Tyr-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VFJIWSJKZJTQII-SRVKXCTJSA-N 0.000 description 3
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 3
- HSBZWINKRYZCSQ-KKUMJFAQSA-N Tyr-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O HSBZWINKRYZCSQ-KKUMJFAQSA-N 0.000 description 3
- NWEGIYMHTZXVBP-JSGCOSHPSA-N Tyr-Val-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O NWEGIYMHTZXVBP-JSGCOSHPSA-N 0.000 description 3
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 3
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 3
- MDYSKHBSPXUOPV-JSGCOSHPSA-N Val-Gly-Phe Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MDYSKHBSPXUOPV-JSGCOSHPSA-N 0.000 description 3
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 3
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 3
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 3
- 108010081404 acein-2 Proteins 0.000 description 3
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 3
- 108010044940 alanylglutamine Proteins 0.000 description 3
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 3
- 108010060035 arginylproline Proteins 0.000 description 3
- 238000003149 assay kit Methods 0.000 description 3
- 230000001580 bacterial effect Effects 0.000 description 3
- 238000005119 centrifugation Methods 0.000 description 3
- 230000001876 chaperonelike Effects 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 108010069495 cysteinyltyrosine Proteins 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 238000002337 electrophoretic mobility shift assay Methods 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 239000012530 fluid Substances 0.000 description 3
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 239000000499 gel Substances 0.000 description 3
- 238000001502 gel electrophoresis Methods 0.000 description 3
- 239000003365 glass fiber Substances 0.000 description 3
- 108010078144 glutaminyl-glycine Proteins 0.000 description 3
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 3
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 3
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 3
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 238000004128 high performance liquid chromatography Methods 0.000 description 3
- 238000003018 immunoassay Methods 0.000 description 3
- 208000015181 infectious disease Diseases 0.000 description 3
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 239000000463 material Substances 0.000 description 3
- 239000000178 monomer Substances 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 238000005457 optimization Methods 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 108010018625 phenylalanylarginine Proteins 0.000 description 3
- 108010012581 phenylalanylglutamate Proteins 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 239000011541 reaction mixture Substances 0.000 description 3
- 238000012552 review Methods 0.000 description 3
- PYWVYCXTNDRMGF-UHFFFAOYSA-N rhodamine B Chemical compound [Cl-].C=12C=CC(=[N+](CC)CC)C=C2OC2=CC(N(CC)CC)=CC=C2C=1C1=CC=CC=C1C(O)=O PYWVYCXTNDRMGF-UHFFFAOYSA-N 0.000 description 3
- 239000011780 sodium chloride Substances 0.000 description 3
- 238000010561 standard procedure Methods 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 3
- 108010020532 tyrosyl-proline Proteins 0.000 description 3
- 241000701447 unidentified baculovirus Species 0.000 description 3
- 230000003612 virological effect Effects 0.000 description 3
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 2
- BJRCFZKVYNDCJE-WBSNEMHCSA-N 99489-95-9 Chemical compound C([C@@H]1NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CO)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H]2CSSC[C@@H](C(=O)N[C@H](C(N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)NCC(=O)N2)[C@@H](C)CC)=O)CSSC[C@@H](C(NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](C)C(=O)N[C@H](C(=O)N[C@@H](CSSC[C@H](NC1=O)C(O)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O)[C@@H](C)CC)[C@@H](C)CC)C(C)C)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@@H](N)CCCCN)C(C)C)[C@@H](C)CC)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC=1C=CC=CC=1)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@H](CC=1C=CC(O)=CC=1)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](N)CCCNC(N)=N)C(C)C)C1=CC=C(O)C=C1 BJRCFZKVYNDCJE-WBSNEMHCSA-N 0.000 description 2
- LWUWMHIOBPTZBA-DCAQKATOSA-N Ala-Arg-Lys Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O LWUWMHIOBPTZBA-DCAQKATOSA-N 0.000 description 2
- XQGIRPGAVLFKBJ-CIUDSAMLSA-N Ala-Asn-Lys Chemical compound N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)O XQGIRPGAVLFKBJ-CIUDSAMLSA-N 0.000 description 2
- ZIWWTZWAKYBUOB-CIUDSAMLSA-N Ala-Asp-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O ZIWWTZWAKYBUOB-CIUDSAMLSA-N 0.000 description 2
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 2
- NJIFPLAJSVUQOZ-JBDRJPRFSA-N Ala-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C)N NJIFPLAJSVUQOZ-JBDRJPRFSA-N 0.000 description 2
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 2
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 2
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 2
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 2
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 2
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 2
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 2
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 2
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 2
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 2
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 2
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 2
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 2
- UZGFHWIJWPUPOH-IHRRRGAJSA-N Arg-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UZGFHWIJWPUPOH-IHRRRGAJSA-N 0.000 description 2
- KSUALAGYYLQSHJ-RCWTZXSCSA-N Arg-Met-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KSUALAGYYLQSHJ-RCWTZXSCSA-N 0.000 description 2
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 2
- XSPKAHFVDKRGRL-DCAQKATOSA-N Arg-Pro-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XSPKAHFVDKRGRL-DCAQKATOSA-N 0.000 description 2
- STHNZYKCJHWULY-AVGNSLFASA-N Arg-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O STHNZYKCJHWULY-AVGNSLFASA-N 0.000 description 2
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 2
- PSUXEQYPYZLNER-QXEWZRGKSA-N Arg-Val-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PSUXEQYPYZLNER-QXEWZRGKSA-N 0.000 description 2
- AYOAHKWVQLNPDM-HJGDQZAQSA-N Asn-Lys-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AYOAHKWVQLNPDM-HJGDQZAQSA-N 0.000 description 2
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 2
- NYLBGYLHBDFRHL-VEVYYDQMSA-N Asp-Arg-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NYLBGYLHBDFRHL-VEVYYDQMSA-N 0.000 description 2
- JGDBHIVECJGXJA-FXQIFTODSA-N Asp-Asp-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JGDBHIVECJGXJA-FXQIFTODSA-N 0.000 description 2
- XDGBFDYXZCMYEX-NUMRIWBASA-N Asp-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)O XDGBFDYXZCMYEX-NUMRIWBASA-N 0.000 description 2
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 2
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 2
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 2
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 2
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 2
- 241000304886 Bacilli Species 0.000 description 2
- CHRHSLSUCQHLMG-UHFFFAOYSA-N CC(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C.CC(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C Chemical compound CC(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C.CC(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C CHRHSLSUCQHLMG-UHFFFAOYSA-N 0.000 description 2
- 241000606161 Chlamydia Species 0.000 description 2
- GMXSSZUVDNPRMA-FXQIFTODSA-N Cys-Arg-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GMXSSZUVDNPRMA-FXQIFTODSA-N 0.000 description 2
- SQJSYLDKQBZQTG-FXQIFTODSA-N Cys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N SQJSYLDKQBZQTG-FXQIFTODSA-N 0.000 description 2
- ASHTVGGFIMESRD-LKXGYXEUSA-N Cys-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)O ASHTVGGFIMESRD-LKXGYXEUSA-N 0.000 description 2
- DZSICRGTVPDCRN-YUMQZZPRSA-N Cys-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N DZSICRGTVPDCRN-YUMQZZPRSA-N 0.000 description 2
- WZZGXXNRSZIQFC-VGDYDELISA-N Cys-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N WZZGXXNRSZIQFC-VGDYDELISA-N 0.000 description 2
- UVZFZTWNHOQWNK-NAKRPEOUSA-N Cys-Ile-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UVZFZTWNHOQWNK-NAKRPEOUSA-N 0.000 description 2
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 2
- KFYPRIGJTICABD-XGEHTFHBSA-N Cys-Thr-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CS)N)O KFYPRIGJTICABD-XGEHTFHBSA-N 0.000 description 2
- UOEYKPDDHSFMLI-DCAQKATOSA-N Cys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N UOEYKPDDHSFMLI-DCAQKATOSA-N 0.000 description 2
- 241000701022 Cytomegalovirus Species 0.000 description 2
- 239000003298 DNA probe Substances 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 2
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 2
- 108010042407 Endonucleases Proteins 0.000 description 2
- LZRMPXRYLLTAJX-GUBZILKMSA-N Gln-Arg-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZRMPXRYLLTAJX-GUBZILKMSA-N 0.000 description 2
- MWLYSLMKFXWZPW-ZPFDUUQYSA-N Gln-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCC(N)=O MWLYSLMKFXWZPW-ZPFDUUQYSA-N 0.000 description 2
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 2
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 2
- HVQCEQTUSWWFOS-WDSKDSINSA-N Gln-Gly-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N HVQCEQTUSWWFOS-WDSKDSINSA-N 0.000 description 2
- KQOPMGBHNQBCEL-HVTMNAMFSA-N Gln-His-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KQOPMGBHNQBCEL-HVTMNAMFSA-N 0.000 description 2
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 2
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 2
- KLKYKPXITJBSNI-CIUDSAMLSA-N Gln-Met-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O KLKYKPXITJBSNI-CIUDSAMLSA-N 0.000 description 2
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 2
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 2
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 2
- UXXIVIQGOODKQC-NUMRIWBASA-N Gln-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UXXIVIQGOODKQC-NUMRIWBASA-N 0.000 description 2
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 2
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 2
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 2
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 2
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 2
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 2
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 2
- CXRWMMRLEMVSEH-PEFMBERDSA-N Glu-Ile-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CXRWMMRLEMVSEH-PEFMBERDSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 2
- SWRVAQHFBRZVNX-GUBZILKMSA-N Glu-Lys-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O SWRVAQHFBRZVNX-GUBZILKMSA-N 0.000 description 2
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 2
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 2
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 2
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 2
- UCZXXMREFIETQW-AVGNSLFASA-N Glu-Tyr-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O UCZXXMREFIETQW-AVGNSLFASA-N 0.000 description 2
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 2
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 2
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 2
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 2
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 2
- LUJVWKKYHSLULQ-ZKWXMUAHSA-N Gly-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN LUJVWKKYHSLULQ-ZKWXMUAHSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 2
- MXIULRKNFSCJHT-STQMWFEESA-N Gly-Phe-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 MXIULRKNFSCJHT-STQMWFEESA-N 0.000 description 2
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 2
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- YJDALMUYJIENAG-QWRGUYRKSA-N Gly-Tyr-Asn Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN)O YJDALMUYJIENAG-QWRGUYRKSA-N 0.000 description 2
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- COZMNNJEGNPDED-HOCLYGCPSA-N Gly-Val-Trp Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O COZMNNJEGNPDED-HOCLYGCPSA-N 0.000 description 2
- JBJNKUOMNZGQIM-PYJNHQTQSA-N His-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JBJNKUOMNZGQIM-PYJNHQTQSA-N 0.000 description 2
- IDQKGZWUPVOGPZ-GUBZILKMSA-N His-Cys-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N IDQKGZWUPVOGPZ-GUBZILKMSA-N 0.000 description 2
- VYMGAXSNYUFVCK-GUBZILKMSA-N His-Gln-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N VYMGAXSNYUFVCK-GUBZILKMSA-N 0.000 description 2
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 2
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 2
- GJMHMDKCJPQJOI-IHRRRGAJSA-N His-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 GJMHMDKCJPQJOI-IHRRRGAJSA-N 0.000 description 2
- ULRFSEJGSHYLQI-YESZJQIVSA-N His-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ULRFSEJGSHYLQI-YESZJQIVSA-N 0.000 description 2
- UIRUVUUGUYCMBY-KCTSRDHCSA-N His-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC3=CN=CN3)N UIRUVUUGUYCMBY-KCTSRDHCSA-N 0.000 description 2
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 2
- 101000617808 Homo sapiens Synphilin-1 Proteins 0.000 description 2
- 108010001336 Horseradish Peroxidase Proteins 0.000 description 2
- 241000701828 Human papillomavirus type 11 Species 0.000 description 2
- MHAJPDPJQMAIIY-UHFFFAOYSA-N Hydrogen peroxide Chemical compound OO MHAJPDPJQMAIIY-UHFFFAOYSA-N 0.000 description 2
- JXUGDUWBMKIJDC-NAKRPEOUSA-N Ile-Ala-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JXUGDUWBMKIJDC-NAKRPEOUSA-N 0.000 description 2
- ASCFJMSGKUIRDU-ZPFDUUQYSA-N Ile-Arg-Gln Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O ASCFJMSGKUIRDU-ZPFDUUQYSA-N 0.000 description 2
- KMBPQYKVZBMRMH-PEFMBERDSA-N Ile-Gln-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O KMBPQYKVZBMRMH-PEFMBERDSA-N 0.000 description 2
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 2
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 2
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 2
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 2
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 2
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 2
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 2
- SAVXZJYTTQQQDD-QEWYBTABSA-N Ile-Phe-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SAVXZJYTTQQQDD-QEWYBTABSA-N 0.000 description 2
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 2
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 2
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 2
- 108060003951 Immunoglobulin Proteins 0.000 description 2
- 108010065920 Insulin Lispro Proteins 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 2
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 2
- XYUBOFCTGPZFSA-WDSOQIARSA-N Leu-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 XYUBOFCTGPZFSA-WDSOQIARSA-N 0.000 description 2
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 2
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 2
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 2
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 2
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 2
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 2
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 2
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 2
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 2
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 2
- AEDWWMMHUGYIFD-HJGDQZAQSA-N Leu-Thr-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O AEDWWMMHUGYIFD-HJGDQZAQSA-N 0.000 description 2
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 2
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 2
- WALVCOOOKULCQM-ULQDDVLXSA-N Lys-Arg-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WALVCOOOKULCQM-ULQDDVLXSA-N 0.000 description 2
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 2
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 2
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 2
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 2
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 2
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 2
- CAVGLNOOIFHJOF-SRVKXCTJSA-N Lys-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N CAVGLNOOIFHJOF-SRVKXCTJSA-N 0.000 description 2
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 2
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 2
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 2
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 2
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 2
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 2
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 2
- KFSALEZVQJYHCE-AVGNSLFASA-N Lys-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCCN)N KFSALEZVQJYHCE-AVGNSLFASA-N 0.000 description 2
- MSSABBQOBUZFKZ-IHRRRGAJSA-N Lys-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCCN)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O MSSABBQOBUZFKZ-IHRRRGAJSA-N 0.000 description 2
- TXTZMVNJIRZABH-ULQDDVLXSA-N Lys-Val-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 TXTZMVNJIRZABH-ULQDDVLXSA-N 0.000 description 2
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 2
- ZEDVFJPQNNBMST-CYDGBPFRSA-N Met-Arg-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZEDVFJPQNNBMST-CYDGBPFRSA-N 0.000 description 2
- XOMXAVJBLRROMC-IHRRRGAJSA-N Met-Asp-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOMXAVJBLRROMC-IHRRRGAJSA-N 0.000 description 2
- CEGVMWAVGBRVFS-XGEHTFHBSA-N Met-Cys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CEGVMWAVGBRVFS-XGEHTFHBSA-N 0.000 description 2
- GPAHWYRSHCKICP-GUBZILKMSA-N Met-Glu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GPAHWYRSHCKICP-GUBZILKMSA-N 0.000 description 2
- OGAZPKJHHZPYFK-GARJFASQSA-N Met-Glu-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGAZPKJHHZPYFK-GARJFASQSA-N 0.000 description 2
- UZVKFARGHHMQGX-IUCAKERBSA-N Met-Gly-Met Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCSC UZVKFARGHHMQGX-IUCAKERBSA-N 0.000 description 2
- YGNUDKAPJARTEM-GUBZILKMSA-N Met-Val-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O YGNUDKAPJARTEM-GUBZILKMSA-N 0.000 description 2
- 206010027476 Metastases Diseases 0.000 description 2
- 241000408551 Meza Species 0.000 description 2
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 108010065395 Neuropep-1 Proteins 0.000 description 2
- 101100424716 Patiria pectinifera TBXT gene Proteins 0.000 description 2
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 2
- YYRCPTVAPLQRNC-ULQDDVLXSA-N Phe-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CC1=CC=CC=C1 YYRCPTVAPLQRNC-ULQDDVLXSA-N 0.000 description 2
- WGXOKDLDIWSOCV-MELADBBJSA-N Phe-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O WGXOKDLDIWSOCV-MELADBBJSA-N 0.000 description 2
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 2
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 2
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 2
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 2
- OSBADCBXAMSPQD-YESZJQIVSA-N Phe-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N OSBADCBXAMSPQD-YESZJQIVSA-N 0.000 description 2
- IEOHQGFKHXUALJ-JYJNAYRXSA-N Phe-Met-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IEOHQGFKHXUALJ-JYJNAYRXSA-N 0.000 description 2
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 2
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 2
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 2
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 2
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 2
- PULPZRAHVFBVTO-DCAQKATOSA-N Pro-Glu-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PULPZRAHVFBVTO-DCAQKATOSA-N 0.000 description 2
- MGDFPGCFVJFITQ-CIUDSAMLSA-N Pro-Glu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MGDFPGCFVJFITQ-CIUDSAMLSA-N 0.000 description 2
- QCARZLHECSFOGG-CIUDSAMLSA-N Pro-Glu-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O QCARZLHECSFOGG-CIUDSAMLSA-N 0.000 description 2
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 2
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 2
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 2
- FFSLAIOXRMOFIZ-GJZGRUSLSA-N Pro-Gly-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)CNC(=O)[C@@H]1CCCN1 FFSLAIOXRMOFIZ-GJZGRUSLSA-N 0.000 description 2
- GBRUQFBAJOKCTF-DCAQKATOSA-N Pro-His-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O GBRUQFBAJOKCTF-DCAQKATOSA-N 0.000 description 2
- AJCRQOHDLCBHFA-SRVKXCTJSA-N Pro-His-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O AJCRQOHDLCBHFA-SRVKXCTJSA-N 0.000 description 2
- YTWNSIDWAFSEEI-RWMBFGLXSA-N Pro-His-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N3CCC[C@@H]3C(=O)O YTWNSIDWAFSEEI-RWMBFGLXSA-N 0.000 description 2
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 2
- RNEFESSBTOQSAC-DCAQKATOSA-N Pro-Ser-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O RNEFESSBTOQSAC-DCAQKATOSA-N 0.000 description 2
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 2
- CHYAYDLYYIJCKY-OSUNSFLBSA-N Pro-Thr-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CHYAYDLYYIJCKY-OSUNSFLBSA-N 0.000 description 2
- 108700020471 RNA-Binding Proteins Proteins 0.000 description 2
- 230000004570 RNA-binding Effects 0.000 description 2
- 241000607142 Salmonella Species 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 2
- KNCJWSPMTFFJII-ZLUOBGJFSA-N Ser-Cys-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O KNCJWSPMTFFJII-ZLUOBGJFSA-N 0.000 description 2
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 2
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 2
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 2
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 2
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 2
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 2
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 2
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 2
- WUXCHQZLUHBSDJ-LKXGYXEUSA-N Ser-Thr-Asp Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WUXCHQZLUHBSDJ-LKXGYXEUSA-N 0.000 description 2
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 2
- HAUVENOGHPECML-BPUTZDHNSA-N Ser-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 HAUVENOGHPECML-BPUTZDHNSA-N 0.000 description 2
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 2
- KIEIJCFVGZCUAS-MELADBBJSA-N Ser-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CO)N)C(=O)O KIEIJCFVGZCUAS-MELADBBJSA-N 0.000 description 2
- LGIMRDKGABDMBN-DCAQKATOSA-N Ser-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N LGIMRDKGABDMBN-DCAQKATOSA-N 0.000 description 2
- 108010053551 Sp1 Transcription Factor Proteins 0.000 description 2
- 241001478880 Streptobacillus moniliformis Species 0.000 description 2
- 102100021997 Synphilin-1 Human genes 0.000 description 2
- GFDUZZACIWNMPE-KZVJFYERSA-N Thr-Ala-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O GFDUZZACIWNMPE-KZVJFYERSA-N 0.000 description 2
- XYEXCEPTALHNEV-RCWTZXSCSA-N Thr-Arg-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XYEXCEPTALHNEV-RCWTZXSCSA-N 0.000 description 2
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 2
- MFEBUIFJVPNZLO-OLHMAJIHSA-N Thr-Asp-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O MFEBUIFJVPNZLO-OLHMAJIHSA-N 0.000 description 2
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 2
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 2
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- MGJLBZFUXUGMML-VOAKCMCISA-N Thr-Lys-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MGJLBZFUXUGMML-VOAKCMCISA-N 0.000 description 2
- GFRIEEKFXOVPIR-RHYQMDGZSA-N Thr-Pro-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O GFRIEEKFXOVPIR-RHYQMDGZSA-N 0.000 description 2
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 2
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 2
- 101710085924 Transcription factor Sp1 Proteins 0.000 description 2
- ABRICLFKFRFDKS-IHPCNDPISA-N Trp-Ser-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 ABRICLFKFRFDKS-IHPCNDPISA-N 0.000 description 2
- SEXRBCGSZRCIPE-LYSGOOTNSA-N Trp-Thr-Gly Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O SEXRBCGSZRCIPE-LYSGOOTNSA-N 0.000 description 2
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 2
- JBBYKPZAPOLCPK-JYJNAYRXSA-N Tyr-Arg-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O JBBYKPZAPOLCPK-JYJNAYRXSA-N 0.000 description 2
- DKKHULUSOSWGHS-UWJYBYFXSA-N Tyr-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N DKKHULUSOSWGHS-UWJYBYFXSA-N 0.000 description 2
- ILTXFANLDMJWPR-SIUGBPQLSA-N Tyr-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N ILTXFANLDMJWPR-SIUGBPQLSA-N 0.000 description 2
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 2
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 2
- BQASAMYRHNCKQE-IHRRRGAJSA-N Tyr-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N BQASAMYRHNCKQE-IHRRRGAJSA-N 0.000 description 2
- PQPWEALFTLKSEB-DZKIICNBSA-N Tyr-Val-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PQPWEALFTLKSEB-DZKIICNBSA-N 0.000 description 2
- ISAKRJDGNUQOIC-UHFFFAOYSA-N Uracil Chemical compound O=C1C=CNC(=O)N1 ISAKRJDGNUQOIC-UHFFFAOYSA-N 0.000 description 2
- FZSPNKUFROZBSG-ZKWXMUAHSA-N Val-Ala-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O FZSPNKUFROZBSG-ZKWXMUAHSA-N 0.000 description 2
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 2
- JYVKKBDANPZIAW-AVGNSLFASA-N Val-Arg-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](C(C)C)N JYVKKBDANPZIAW-AVGNSLFASA-N 0.000 description 2
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 2
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 2
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 2
- KVRLNEILGGVBJX-IHRRRGAJSA-N Val-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CN=CN1 KVRLNEILGGVBJX-IHRRRGAJSA-N 0.000 description 2
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 2
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 2
- OFQGGTGZTOTLGH-NHCYSSNCSA-N Val-Met-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N OFQGGTGZTOTLGH-NHCYSSNCSA-N 0.000 description 2
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 2
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 2
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 2
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 2
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 2
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 2
- GVNLOVJNNDZUHS-RHYQMDGZSA-N Val-Thr-Lys Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O GVNLOVJNNDZUHS-RHYQMDGZSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- GUIYPEKUEMQBIK-JSGCOSHPSA-N Val-Tyr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)NCC(O)=O GUIYPEKUEMQBIK-JSGCOSHPSA-N 0.000 description 2
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 2
- 101100424717 Xenopus laevis tbxt gene Proteins 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 239000000427 antigen Substances 0.000 description 2
- 102000036639 antigens Human genes 0.000 description 2
- 108091007433 antigens Proteins 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 2
- 210000004899 c-terminal region Anatomy 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 230000000875 corresponding effect Effects 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000006471 dimerization reaction Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000002255 enzymatic effect Effects 0.000 description 2
- 230000007717 exclusion Effects 0.000 description 2
- 239000007850 fluorescent dye Substances 0.000 description 2
- 238000005194 fractionation Methods 0.000 description 2
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 2
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 2
- 108010079547 glutamylmethionine Proteins 0.000 description 2
- 230000005484 gravity Effects 0.000 description 2
- 238000010438 heat treatment Methods 0.000 description 2
- 239000000833 heterodimer Substances 0.000 description 2
- 102000018358 immunoglobulin Human genes 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 230000002757 inflammatory effect Effects 0.000 description 2
- 206010022000 influenza Diseases 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- PNDPGZBMCMUPRI-UHFFFAOYSA-N iodine Chemical compound II PNDPGZBMCMUPRI-UHFFFAOYSA-N 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 208000032839 leukemia Diseases 0.000 description 2
- 230000000670 limiting effect Effects 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 239000004005 microsphere Substances 0.000 description 2
- 231100000219 mutagenic Toxicity 0.000 description 2
- 230000003505 mutagenic effect Effects 0.000 description 2
- 239000013642 negative control Substances 0.000 description 2
- 238000007899 nucleic acid hybridization Methods 0.000 description 2
- 244000045947 parasite Species 0.000 description 2
- 108010024607 phenylalanylalanine Proteins 0.000 description 2
- 230000004983 pleiotropic effect Effects 0.000 description 2
- 230000000379 polymerizing effect Effects 0.000 description 2
- 108010090894 prolylleucine Proteins 0.000 description 2
- 230000010076 replication Effects 0.000 description 2
- 210000005000 reproductive tract Anatomy 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 210000002966 serum Anatomy 0.000 description 2
- 239000007858 starting material Substances 0.000 description 2
- 229940124597 therapeutic agent Drugs 0.000 description 2
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- 108010003137 tyrosyltyrosine Proteins 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 108010009962 valyltyrosine Proteins 0.000 description 2
- 239000011534 wash buffer Substances 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- VLAFRQCSFRYCLC-FXQIFTODSA-N (2s)-2-[[(2s)-2-[[2-[[(2s)-2-aminopropanoyl]amino]acetyl]amino]-3-hydroxypropanoyl]amino]pentanedioic acid Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VLAFRQCSFRYCLC-FXQIFTODSA-N 0.000 description 1
- BCHIXGBGRHLSBE-UHFFFAOYSA-N (4-methyl-2-oxochromen-7-yl) dihydrogen phosphate Chemical compound C1=C(OP(O)(O)=O)C=CC2=C1OC(=O)C=C2C BCHIXGBGRHLSBE-UHFFFAOYSA-N 0.000 description 1
- NOIIUHRQUVNIDD-UHFFFAOYSA-N 3-[[oxo(pyridin-4-yl)methyl]hydrazo]-N-(phenylmethyl)propanamide Chemical compound C=1C=CC=CC=1CNC(=O)CCNNC(=O)C1=CC=NC=C1 NOIIUHRQUVNIDD-UHFFFAOYSA-N 0.000 description 1
- IMIZPWSVYADSCN-UHFFFAOYSA-N 4-methyl-2-[[4-methyl-2-[[4-methyl-2-(pyrrolidine-2-carbonylamino)pentanoyl]amino]pentanoyl]amino]pentanoic acid Chemical compound CC(C)CC(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(CC(C)C)NC(=O)C1CCCN1 IMIZPWSVYADSCN-UHFFFAOYSA-N 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 241000186361 Actinobacteria <class> Species 0.000 description 1
- 241000186046 Actinomyces Species 0.000 description 1
- 241000186042 Actinomyces bovis Species 0.000 description 1
- 241000186045 Actinomyces naeslundii Species 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- FSBCNCKIQZZASN-GUBZILKMSA-N Ala-Arg-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O FSBCNCKIQZZASN-GUBZILKMSA-N 0.000 description 1
- PJNSIUPOXFBHDM-GUBZILKMSA-N Ala-Arg-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O PJNSIUPOXFBHDM-GUBZILKMSA-N 0.000 description 1
- XQJAFSDFQZPYCU-UWJYBYFXSA-N Ala-Asn-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N XQJAFSDFQZPYCU-UWJYBYFXSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 1
- ROLXPVQSRCPVGK-XDTLVQLUSA-N Ala-Glu-Tyr Chemical compound N[C@@H](C)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O ROLXPVQSRCPVGK-XDTLVQLUSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 1
- QUIGLPSHIFPEOV-CIUDSAMLSA-N Ala-Lys-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O QUIGLPSHIFPEOV-CIUDSAMLSA-N 0.000 description 1
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 1
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- MDNAVFBZPROEHO-DCAQKATOSA-N Ala-Lys-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MDNAVFBZPROEHO-DCAQKATOSA-N 0.000 description 1
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 1
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 1
- WQLDNOCHHRISMS-NAKRPEOUSA-N Ala-Pro-Ile Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WQLDNOCHHRISMS-NAKRPEOUSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- 241000588813 Alcaligenes faecalis Species 0.000 description 1
- 108700015979 Arabidopsis TatA Proteins 0.000 description 1
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 1
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 1
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 1
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 1
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 1
- SQKPKIJVWHAWNF-DCAQKATOSA-N Arg-Asp-Lys Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(O)=O SQKPKIJVWHAWNF-DCAQKATOSA-N 0.000 description 1
- XTGGTAWGUFXJSV-NAKRPEOUSA-N Arg-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N XTGGTAWGUFXJSV-NAKRPEOUSA-N 0.000 description 1
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 1
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 1
- JUWQNWXEGDYCIE-YUMQZZPRSA-N Arg-Gln-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O JUWQNWXEGDYCIE-YUMQZZPRSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 1
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- HCIUUZGFTDTEGM-NAKRPEOUSA-N Arg-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HCIUUZGFTDTEGM-NAKRPEOUSA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- QBQVKUNBCAFXSV-ULQDDVLXSA-N Arg-Lys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QBQVKUNBCAFXSV-ULQDDVLXSA-N 0.000 description 1
- JBIRFLWXWDSDTR-CYDGBPFRSA-N Arg-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N JBIRFLWXWDSDTR-CYDGBPFRSA-N 0.000 description 1
- VEAIMHJZTIDCIH-KKUMJFAQSA-N Arg-Phe-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEAIMHJZTIDCIH-KKUMJFAQSA-N 0.000 description 1
- XFXZKCRBBOVJKS-BVSLBCMMSA-N Arg-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 XFXZKCRBBOVJKS-BVSLBCMMSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- ADPACBMPYWJJCE-FXQIFTODSA-N Arg-Ser-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O ADPACBMPYWJJCE-FXQIFTODSA-N 0.000 description 1
- LFAUVOXPCGJKTB-DCAQKATOSA-N Arg-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N LFAUVOXPCGJKTB-DCAQKATOSA-N 0.000 description 1
- URAUIUGLHBRPMF-NAKRPEOUSA-N Arg-Ser-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O URAUIUGLHBRPMF-NAKRPEOUSA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- ZJBUILVYSXQNSW-YTWAJWBKSA-N Arg-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ZJBUILVYSXQNSW-YTWAJWBKSA-N 0.000 description 1
- ZPWMEWYQBWSGAO-ZJDVBMNYSA-N Arg-Thr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZPWMEWYQBWSGAO-ZJDVBMNYSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- XRLOBFSLPCHYLQ-ULQDDVLXSA-N Arg-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O XRLOBFSLPCHYLQ-ULQDDVLXSA-N 0.000 description 1
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 1
- 240000003291 Armoracia rusticana Species 0.000 description 1
- 235000011330 Armoracia rusticana Nutrition 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- MFFOYNGMOYFPBD-DCAQKATOSA-N Asn-Arg-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MFFOYNGMOYFPBD-DCAQKATOSA-N 0.000 description 1
- AYKKKGFJXIDYLX-ACZMJKKPSA-N Asn-Gln-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AYKKKGFJXIDYLX-ACZMJKKPSA-N 0.000 description 1
- CTQIOCMSIJATNX-WHFBIAKZSA-N Asn-Gly-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O CTQIOCMSIJATNX-WHFBIAKZSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 1
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 1
- ZKDGORKGHPCZOV-DCAQKATOSA-N Asn-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZKDGORKGHPCZOV-DCAQKATOSA-N 0.000 description 1
- KHCNTVRVAYCPQE-CIUDSAMLSA-N Asn-Lys-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O KHCNTVRVAYCPQE-CIUDSAMLSA-N 0.000 description 1
- WXVGISRWSYGEDK-KKUMJFAQSA-N Asn-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N WXVGISRWSYGEDK-KKUMJFAQSA-N 0.000 description 1
- PPCORQFLAZWUNO-QWRGUYRKSA-N Asn-Phe-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N PPCORQFLAZWUNO-QWRGUYRKSA-N 0.000 description 1
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 1
- NJSNXIOKBHPFMB-GMOBBJLQSA-N Asn-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N NJSNXIOKBHPFMB-GMOBBJLQSA-N 0.000 description 1
- FHCRKXCTKSHNOE-QEJZJMRPSA-N Asn-Trp-Glu Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FHCRKXCTKSHNOE-QEJZJMRPSA-N 0.000 description 1
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 1
- SYZWMVSXBZCOBZ-QXEWZRGKSA-N Asn-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N SYZWMVSXBZCOBZ-QXEWZRGKSA-N 0.000 description 1
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- ACEDJCOOPZFUBU-CIUDSAMLSA-N Asp-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N ACEDJCOOPZFUBU-CIUDSAMLSA-N 0.000 description 1
- PJERDVUTUDZPGX-ZKWXMUAHSA-N Asp-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CC(O)=O PJERDVUTUDZPGX-ZKWXMUAHSA-N 0.000 description 1
- VILLWIDTHYPSLC-PEFMBERDSA-N Asp-Glu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VILLWIDTHYPSLC-PEFMBERDSA-N 0.000 description 1
- RRKCPMGSRIDLNC-AVGNSLFASA-N Asp-Glu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RRKCPMGSRIDLNC-AVGNSLFASA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- KPNUCOPMVSGRCR-DCAQKATOSA-N Asp-His-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O KPNUCOPMVSGRCR-DCAQKATOSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- SPWXXPFDTMYTRI-IUKAMOBKSA-N Asp-Ile-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SPWXXPFDTMYTRI-IUKAMOBKSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- AITKTFCQOBRJTG-CIUDSAMLSA-N Asp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N AITKTFCQOBRJTG-CIUDSAMLSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- OZBXOELNJBSJOA-UBHSHLNASA-N Asp-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OZBXOELNJBSJOA-UBHSHLNASA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- AWPWHMVCSISSQK-QWRGUYRKSA-N Asp-Tyr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O AWPWHMVCSISSQK-QWRGUYRKSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- 241001225321 Aspergillus fumigatus Species 0.000 description 1
- 241000351920 Aspergillus nidulans Species 0.000 description 1
- 241000711404 Avian avulavirus 1 Species 0.000 description 1
- 241000193755 Bacillus cereus Species 0.000 description 1
- 241000193388 Bacillus thuringiensis Species 0.000 description 1
- 241000606660 Bartonella Species 0.000 description 1
- 241000606108 Bartonella quintana Species 0.000 description 1
- 102100026189 Beta-galactosidase Human genes 0.000 description 1
- 241000335423 Blastomyces Species 0.000 description 1
- 108010006654 Bleomycin Proteins 0.000 description 1
- 241000588832 Bordetella pertussis Species 0.000 description 1
- 241000589968 Borrelia Species 0.000 description 1
- 241000283690 Bos taurus Species 0.000 description 1
- 108700023463 Bovine papillomavirus E2 Proteins 0.000 description 1
- 206010006187 Breast cancer Diseases 0.000 description 1
- 208000026310 Breast neoplasm Diseases 0.000 description 1
- 241000589567 Brucella abortus Species 0.000 description 1
- 241001148106 Brucella melitensis Species 0.000 description 1
- 241001148111 Brucella suis Species 0.000 description 1
- OTMSDBZUPAUEDD-UHFFFAOYSA-N CC Chemical compound CC OTMSDBZUPAUEDD-UHFFFAOYSA-N 0.000 description 1
- XNRYHMOJNJKGJO-UHFFFAOYSA-N CC(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C Chemical compound CC(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C(C)C XNRYHMOJNJKGJO-UHFFFAOYSA-N 0.000 description 1
- 102000015280 CCAAT-Enhancer-Binding Protein-beta Human genes 0.000 description 1
- 108010064535 CCAAT-Enhancer-Binding Protein-beta Proteins 0.000 description 1
- 101100505161 Caenorhabditis elegans mel-32 gene Proteins 0.000 description 1
- 241000222122 Candida albicans Species 0.000 description 1
- 241000712083 Canine morbillivirus Species 0.000 description 1
- 206010008342 Cervix carcinoma Diseases 0.000 description 1
- 201000006082 Chickenpox Diseases 0.000 description 1
- 241000222290 Cladosporium Species 0.000 description 1
- 241000193155 Clostridium botulinum Species 0.000 description 1
- 241000186581 Clostridium novyi Species 0.000 description 1
- 241000193468 Clostridium perfringens Species 0.000 description 1
- 241000193466 Clostridium septicum Species 0.000 description 1
- 241000193470 Clostridium sporogenes Species 0.000 description 1
- 241000186528 Clostridium tertium Species 0.000 description 1
- 241000193449 Clostridium tetani Species 0.000 description 1
- 241000223205 Coccidioides immitis Species 0.000 description 1
- 108700020473 Cyclic AMP Receptor Proteins 0.000 description 1
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 1
- MBPKYKSYUAPLMY-DCAQKATOSA-N Cys-Arg-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O MBPKYKSYUAPLMY-DCAQKATOSA-N 0.000 description 1
- WXKWQSDHEXKKNC-ZKWXMUAHSA-N Cys-Asp-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WXKWQSDHEXKKNC-ZKWXMUAHSA-N 0.000 description 1
- UXUSHQYYQCZWET-WDSKDSINSA-N Cys-Glu-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O UXUSHQYYQCZWET-WDSKDSINSA-N 0.000 description 1
- MUZAUPFGPMMZSS-GUBZILKMSA-N Cys-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N MUZAUPFGPMMZSS-GUBZILKMSA-N 0.000 description 1
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 1
- ZQHQTSONVIANQR-BQBZGAKWSA-N Cys-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N ZQHQTSONVIANQR-BQBZGAKWSA-N 0.000 description 1
- IDFVDSBJNMPBSX-SRVKXCTJSA-N Cys-Lys-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O IDFVDSBJNMPBSX-SRVKXCTJSA-N 0.000 description 1
- KGIHMGPYGXBYJJ-SRVKXCTJSA-N Cys-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CS KGIHMGPYGXBYJJ-SRVKXCTJSA-N 0.000 description 1
- RAGIABZNLPZBGS-FXQIFTODSA-N Cys-Pro-Cys Chemical compound N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O RAGIABZNLPZBGS-FXQIFTODSA-N 0.000 description 1
- MBRWOKXNHTUJMB-CIUDSAMLSA-N Cys-Pro-Glu Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O MBRWOKXNHTUJMB-CIUDSAMLSA-N 0.000 description 1
- CLEFUAZULXANBU-MELADBBJSA-N Cys-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CS)N)C(=O)O CLEFUAZULXANBU-MELADBBJSA-N 0.000 description 1
- VRJZMZGGAKVSIQ-SRVKXCTJSA-N Cys-Tyr-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VRJZMZGGAKVSIQ-SRVKXCTJSA-N 0.000 description 1
- FNXOZWPPOJRBRE-XGEHTFHBSA-N Cys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CS)N)O FNXOZWPPOJRBRE-XGEHTFHBSA-N 0.000 description 1
- 108090000695 Cytokines Proteins 0.000 description 1
- 102000004127 Cytokines Human genes 0.000 description 1
- 102000012410 DNA Ligases Human genes 0.000 description 1
- 108010061982 DNA Ligases Proteins 0.000 description 1
- 108020003215 DNA Probes Proteins 0.000 description 1
- 241000450599 DNA viruses Species 0.000 description 1
- 238000009007 Diagnostic Kit Methods 0.000 description 1
- 108700034853 E coli TRPR Proteins 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 241000710945 Eastern equine encephalitis virus Species 0.000 description 1
- 102100031780 Endonuclease Human genes 0.000 description 1
- 102000004533 Endonucleases Human genes 0.000 description 1
- 241000709661 Enterovirus Species 0.000 description 1
- 241000186810 Erysipelothrix rhusiopathiae Species 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 241000223664 Exophiala jeanselmei Species 0.000 description 1
- CWYNVVGOOAEACU-UHFFFAOYSA-N Fe2+ Chemical compound [Fe+2] CWYNVVGOOAEACU-UHFFFAOYSA-N 0.000 description 1
- 241000122862 Fonsecaea Species 0.000 description 1
- 101710177291 Gag polyprotein Proteins 0.000 description 1
- 102100024375 Gamma-glutamylaminecyclotransferase Human genes 0.000 description 1
- 101710201613 Gamma-glutamylaminecyclotransferase Proteins 0.000 description 1
- 102100040004 Gamma-glutamylcyclotransferase Human genes 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- YNNXQZDEOCYJJL-CIUDSAMLSA-N Gln-Arg-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N YNNXQZDEOCYJJL-CIUDSAMLSA-N 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- JFOKLAPFYCTNHW-SRVKXCTJSA-N Gln-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N JFOKLAPFYCTNHW-SRVKXCTJSA-N 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- WMOMPXKOKASNBK-PEFMBERDSA-N Gln-Asn-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WMOMPXKOKASNBK-PEFMBERDSA-N 0.000 description 1
- ODBLJLZVLAWVMS-GUBZILKMSA-N Gln-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N ODBLJLZVLAWVMS-GUBZILKMSA-N 0.000 description 1
- KWLMLNHADZIJIS-CIUDSAMLSA-N Gln-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N KWLMLNHADZIJIS-CIUDSAMLSA-N 0.000 description 1
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 1
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- NSNUZSPSADIMJQ-WDSKDSINSA-N Gln-Gly-Asp Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NSNUZSPSADIMJQ-WDSKDSINSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- GXMBDEGTXHQBAO-NKIYYHGXSA-N Gln-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N)O GXMBDEGTXHQBAO-NKIYYHGXSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 1
- CAXXTYYGFYTBPV-IUCAKERBSA-N Gln-Leu-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CAXXTYYGFYTBPV-IUCAKERBSA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 1
- LGWNISYVKDNJRP-FXQIFTODSA-N Gln-Ser-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGWNISYVKDNJRP-FXQIFTODSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 1
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 1
- NKSGKPWXSWBRRX-ACZMJKKPSA-N Glu-Asn-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N NKSGKPWXSWBRRX-ACZMJKKPSA-N 0.000 description 1
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 1
- PKYAVRMYTBBRLS-FXQIFTODSA-N Glu-Cys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O PKYAVRMYTBBRLS-FXQIFTODSA-N 0.000 description 1
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 1
- CJWANNXUTOATSJ-DCAQKATOSA-N Glu-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N CJWANNXUTOATSJ-DCAQKATOSA-N 0.000 description 1
- LVCHEMOPBORRLB-DCAQKATOSA-N Glu-Gln-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O LVCHEMOPBORRLB-DCAQKATOSA-N 0.000 description 1
- GYCPQVFKCPPRQB-GUBZILKMSA-N Glu-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N GYCPQVFKCPPRQB-GUBZILKMSA-N 0.000 description 1
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 1
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- GGJOGFJIPPGNRK-JSGCOSHPSA-N Glu-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 GGJOGFJIPPGNRK-JSGCOSHPSA-N 0.000 description 1
- XIKYNVKEUINBGL-IUCAKERBSA-N Glu-His-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O XIKYNVKEUINBGL-IUCAKERBSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 1
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 1
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 1
- JDAYMLXPUJRSDJ-XIRDDKMYSA-N Glu-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 JDAYMLXPUJRSDJ-XIRDDKMYSA-N 0.000 description 1
- ZNOHKCPYDAYYDA-BPUTZDHNSA-N Glu-Trp-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNOHKCPYDAYYDA-BPUTZDHNSA-N 0.000 description 1
- KXRORHJIRAOQPG-SOUVJXGZSA-N Glu-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KXRORHJIRAOQPG-SOUVJXGZSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- CLODWIOAKCSBAN-BQBZGAKWSA-N Gly-Arg-Asp Chemical compound NC(N)=NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O CLODWIOAKCSBAN-BQBZGAKWSA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- MXXXVOYFNVJHMA-IUCAKERBSA-N Gly-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN MXXXVOYFNVJHMA-IUCAKERBSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- UXJHNZODTMHWRD-WHFBIAKZSA-N Gly-Asn-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O UXJHNZODTMHWRD-WHFBIAKZSA-N 0.000 description 1
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 1
- NZAFOTBEULLEQB-WDSKDSINSA-N Gly-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN NZAFOTBEULLEQB-WDSKDSINSA-N 0.000 description 1
- GNBMOZPQUXTCRW-STQMWFEESA-N Gly-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)CN)C(O)=O)=CNC2=C1 GNBMOZPQUXTCRW-STQMWFEESA-N 0.000 description 1
- XEJTYSCIXKYSHR-WDSKDSINSA-N Gly-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN XEJTYSCIXKYSHR-WDSKDSINSA-N 0.000 description 1
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- LGQZOQRDEUIZJY-YUMQZZPRSA-N Gly-Cys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CS)NC(=O)CN)C(O)=O LGQZOQRDEUIZJY-YUMQZZPRSA-N 0.000 description 1
- XLFHCWHXKSFVIB-BQBZGAKWSA-N Gly-Gln-Gln Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XLFHCWHXKSFVIB-BQBZGAKWSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- QSVCIFZPGLOZGH-WDSKDSINSA-N Gly-Glu-Ser Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QSVCIFZPGLOZGH-WDSKDSINSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 1
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 1
- BBTCXWTXOXUNFX-IUCAKERBSA-N Gly-Met-Arg Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O BBTCXWTXOXUNFX-IUCAKERBSA-N 0.000 description 1
- LXTRSHQLGYINON-DTWKUNHWSA-N Gly-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN LXTRSHQLGYINON-DTWKUNHWSA-N 0.000 description 1
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 1
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 1
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 1
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 1
- YDIDLLVFCYSXNY-RCOVLWMOSA-N Gly-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN YDIDLLVFCYSXNY-RCOVLWMOSA-N 0.000 description 1
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 1
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 1
- 102100028967 HLA class I histocompatibility antigen, alpha chain G Human genes 0.000 description 1
- 101710197836 HLA class I histocompatibility antigen, alpha chain G Proteins 0.000 description 1
- 108700038832 HPV-33 E2 Proteins 0.000 description 1
- 241000193159 Hathewaya histolytica Species 0.000 description 1
- 241000700721 Hepatitis B virus Species 0.000 description 1
- 241000709721 Hepatovirus A Species 0.000 description 1
- 208000028782 Hereditary disease Diseases 0.000 description 1
- 208000009889 Herpes Simplex Diseases 0.000 description 1
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 1
- HDXNWVLQSQFJOX-SRVKXCTJSA-N His-Arg-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HDXNWVLQSQFJOX-SRVKXCTJSA-N 0.000 description 1
- IDNNYVGVSZMQTK-IHRRRGAJSA-N His-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N IDNNYVGVSZMQTK-IHRRRGAJSA-N 0.000 description 1
- UZZXGLOJRZKYEL-DJFWLOJKSA-N His-Asn-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UZZXGLOJRZKYEL-DJFWLOJKSA-N 0.000 description 1
- JWTKVPMQCCRPQY-SRVKXCTJSA-N His-Asn-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JWTKVPMQCCRPQY-SRVKXCTJSA-N 0.000 description 1
- VOKCBYNCZVSILJ-KKUMJFAQSA-N His-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O VOKCBYNCZVSILJ-KKUMJFAQSA-N 0.000 description 1
- UJWYPUUXIAKEES-CUJWVEQBSA-N His-Cys-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UJWYPUUXIAKEES-CUJWVEQBSA-N 0.000 description 1
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 1
- ZNNNYCXPCKACHX-DCAQKATOSA-N His-Gln-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZNNNYCXPCKACHX-DCAQKATOSA-N 0.000 description 1
- VBOFRJNDIOPNDO-YUMQZZPRSA-N His-Gly-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N VBOFRJNDIOPNDO-YUMQZZPRSA-N 0.000 description 1
- RAVLQPXCMRCLKT-KBPBESRZSA-N His-Gly-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RAVLQPXCMRCLKT-KBPBESRZSA-N 0.000 description 1
- FZKFYOXDVWDELO-KBPBESRZSA-N His-Gly-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FZKFYOXDVWDELO-KBPBESRZSA-N 0.000 description 1
- WJGSTIMGSIWHJX-HVTMNAMFSA-N His-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WJGSTIMGSIWHJX-HVTMNAMFSA-N 0.000 description 1
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 1
- JENKOCSDMSVWPY-SRVKXCTJSA-N His-Leu-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O JENKOCSDMSVWPY-SRVKXCTJSA-N 0.000 description 1
- FHGVHXCQMJWQPK-SRVKXCTJSA-N His-Lys-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O FHGVHXCQMJWQPK-SRVKXCTJSA-N 0.000 description 1
- DEOQGJUXUQGUJN-KKUMJFAQSA-N His-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DEOQGJUXUQGUJN-KKUMJFAQSA-N 0.000 description 1
- SVVULKPWDBIPCO-BZSNNMDCSA-N His-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SVVULKPWDBIPCO-BZSNNMDCSA-N 0.000 description 1
- XIGFLVCAVQQGNS-IHRRRGAJSA-N His-Pro-His Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 XIGFLVCAVQQGNS-IHRRRGAJSA-N 0.000 description 1
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 1
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 1
- MDOBWSFNSNPENN-PMVVWTBXSA-N His-Thr-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O MDOBWSFNSNPENN-PMVVWTBXSA-N 0.000 description 1
- XVZJRZQIHJMUBG-TUBUOCAGSA-N His-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CC1=CN=CN1)N XVZJRZQIHJMUBG-TUBUOCAGSA-N 0.000 description 1
- NBWATNYAUVSAEQ-ZEILLAHLSA-N His-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O NBWATNYAUVSAEQ-ZEILLAHLSA-N 0.000 description 1
- FRDFAWHTPDKRHG-ULQDDVLXSA-N His-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CN=CN1 FRDFAWHTPDKRHG-ULQDDVLXSA-N 0.000 description 1
- FBOMZVOKCZMDIG-XQQFMLRXSA-N His-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBOMZVOKCZMDIG-XQQFMLRXSA-N 0.000 description 1
- 102000008949 Histocompatibility Antigens Class I Human genes 0.000 description 1
- 108010088652 Histocompatibility Antigens Class I Proteins 0.000 description 1
- 108010033040 Histones Proteins 0.000 description 1
- 102000006947 Histones Human genes 0.000 description 1
- 241000228404 Histoplasma capsulatum Species 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101000886680 Homo sapiens Gamma-glutamylcyclotransferase Proteins 0.000 description 1
- 108010070875 Human Immunodeficiency Virus tat Gene Products Proteins 0.000 description 1
- 241000598436 Human T-cell lymphotropic virus Species 0.000 description 1
- 108700003968 Human immunodeficiency virus 1 tat peptide (49-57) Proteins 0.000 description 1
- 101000622327 Human papillomavirus type 18 Regulatory protein E2 Proteins 0.000 description 1
- XQFRJNBWHJMXHO-RRKCRQDMSA-N IDUR Chemical compound C1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C(I)=C1 XQFRJNBWHJMXHO-RRKCRQDMSA-N 0.000 description 1
- QICVAHODWHIWIS-HTFCKZLJSA-N Ile-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N QICVAHODWHIWIS-HTFCKZLJSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 1
- OONBGFHNQVSUBF-KBIXCLLPSA-N Ile-Gln-Cys Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(O)=O OONBGFHNQVSUBF-KBIXCLLPSA-N 0.000 description 1
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 1
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 1
- CMNMPCTVCWWYHY-MXAVVETBSA-N Ile-His-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(C)C)C(=O)O)N CMNMPCTVCWWYHY-MXAVVETBSA-N 0.000 description 1
- KEKTTYCXKGBAAL-VGDYDELISA-N Ile-His-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N KEKTTYCXKGBAAL-VGDYDELISA-N 0.000 description 1
- SVBAHOMTJRFSIC-SXTJYALSSA-N Ile-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N SVBAHOMTJRFSIC-SXTJYALSSA-N 0.000 description 1
- HUWYGQOISIJNMK-SIGLWIIPSA-N Ile-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HUWYGQOISIJNMK-SIGLWIIPSA-N 0.000 description 1
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- XQLGNKLSPYCRMZ-HJWJTTGWSA-N Ile-Phe-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)O)N XQLGNKLSPYCRMZ-HJWJTTGWSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 1
- DRCKHKZYDLJYFQ-YWIQKCBGSA-N Ile-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DRCKHKZYDLJYFQ-YWIQKCBGSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 1
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 1
- 241001221585 Keratinomyces Species 0.000 description 1
- 241000588747 Klebsiella pneumoniae Species 0.000 description 1
- 108010054278 Lac Repressors Proteins 0.000 description 1
- 241000589902 Leptospira Species 0.000 description 1
- 241001550390 Leptospira interrogans serovar Canicola Species 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- VCSBGUACOYUIGD-CIUDSAMLSA-N Leu-Asn-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VCSBGUACOYUIGD-CIUDSAMLSA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
- WXHFZJFZWNCDNB-KKUMJFAQSA-N Leu-Asn-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WXHFZJFZWNCDNB-KKUMJFAQSA-N 0.000 description 1
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 1
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 1
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 1
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- LLBQJYDYOLIQAI-JYJNAYRXSA-N Leu-Glu-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LLBQJYDYOLIQAI-JYJNAYRXSA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 1
- BTNXKBVLWJBTNR-SRVKXCTJSA-N Leu-His-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O BTNXKBVLWJBTNR-SRVKXCTJSA-N 0.000 description 1
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 1
- QJXHMYMRGDOHRU-NHCYSSNCSA-N Leu-Ile-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O QJXHMYMRGDOHRU-NHCYSSNCSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 1
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 1
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- FYPWFNKQVVEELI-ULQDDVLXSA-N Leu-Phe-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 FYPWFNKQVVEELI-ULQDDVLXSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- 241000144128 Lichtheimia corymbifera Species 0.000 description 1
- 241000186779 Listeria monocytogenes Species 0.000 description 1
- 208000016604 Lyme disease Diseases 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- WQWZXKWOEVSGQM-DCAQKATOSA-N Lys-Ala-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN WQWZXKWOEVSGQM-DCAQKATOSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- NLOZZWJNIKKYSC-WDSOQIARSA-N Lys-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CCCCN)C(O)=O)=CNC2=C1 NLOZZWJNIKKYSC-WDSOQIARSA-N 0.000 description 1
- WLCYCADOWRMSAJ-CIUDSAMLSA-N Lys-Asn-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O WLCYCADOWRMSAJ-CIUDSAMLSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- LXNPMPIQDNSMTA-AVGNSLFASA-N Lys-Gln-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 LXNPMPIQDNSMTA-AVGNSLFASA-N 0.000 description 1
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- GTAXSKOXPIISBW-AVGNSLFASA-N Lys-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N GTAXSKOXPIISBW-AVGNSLFASA-N 0.000 description 1
- GNLJXWBNLAIPEP-MELADBBJSA-N Lys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCCN)N)C(=O)O GNLJXWBNLAIPEP-MELADBBJSA-N 0.000 description 1
- PGLGNCVOWIORQE-SRVKXCTJSA-N Lys-His-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O PGLGNCVOWIORQE-SRVKXCTJSA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 1
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 1
- PYFNONMJYNJENN-AVGNSLFASA-N Lys-Lys-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PYFNONMJYNJENN-AVGNSLFASA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- HKXSZKJMDBHOTG-CIUDSAMLSA-N Lys-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN HKXSZKJMDBHOTG-CIUDSAMLSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- DLCAXBGXGOVUCD-PPCPHDFISA-N Lys-Thr-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DLCAXBGXGOVUCD-PPCPHDFISA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- WINFHLHJTRGLCV-BZSNNMDCSA-N Lys-Tyr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 WINFHLHJTRGLCV-BZSNNMDCSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 241001444203 Madurella mycetomatis Species 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 241000712079 Measles morbillivirus Species 0.000 description 1
- 208000024556 Mendelian disease Diseases 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- TUSOIZOVPJCMFC-FXQIFTODSA-N Met-Asp-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O TUSOIZOVPJCMFC-FXQIFTODSA-N 0.000 description 1
- GODBLDDYHFTUAH-CIUDSAMLSA-N Met-Asp-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O GODBLDDYHFTUAH-CIUDSAMLSA-N 0.000 description 1
- CNUPMMXDISGXMU-CIUDSAMLSA-N Met-Cys-Glu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O CNUPMMXDISGXMU-CIUDSAMLSA-N 0.000 description 1
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 1
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 1
- ZIIMORLEZLVRIP-SRVKXCTJSA-N Met-Leu-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZIIMORLEZLVRIP-SRVKXCTJSA-N 0.000 description 1
- DBXMFHGGHMXYHY-DCAQKATOSA-N Met-Leu-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O DBXMFHGGHMXYHY-DCAQKATOSA-N 0.000 description 1
- IILAGWCGKJSBGB-IHRRRGAJSA-N Met-Phe-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IILAGWCGKJSBGB-IHRRRGAJSA-N 0.000 description 1
- SJLPOVNXMJFKHJ-ULQDDVLXSA-N Met-Phe-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N SJLPOVNXMJFKHJ-ULQDDVLXSA-N 0.000 description 1
- OIFHHODAXVWKJN-ULQDDVLXSA-N Met-Phe-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 OIFHHODAXVWKJN-ULQDDVLXSA-N 0.000 description 1
- NTYQUVLERIHPMU-HRCADAONSA-N Met-Phe-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N NTYQUVLERIHPMU-HRCADAONSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- PCTFVQATEGYHJU-FXQIFTODSA-N Met-Ser-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O PCTFVQATEGYHJU-FXQIFTODSA-N 0.000 description 1
- XTSBLBXAUIBMLW-KKUMJFAQSA-N Met-Tyr-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N XTSBLBXAUIBMLW-KKUMJFAQSA-N 0.000 description 1
- MFDDVIJCQYOOES-GUBZILKMSA-N Met-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCSC)N MFDDVIJCQYOOES-GUBZILKMSA-N 0.000 description 1
- CQRGINSEMFBACV-WPRPVWTQSA-N Met-Val-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O CQRGINSEMFBACV-WPRPVWTQSA-N 0.000 description 1
- 241001480037 Microsporum Species 0.000 description 1
- 241000893980 Microsporum canis Species 0.000 description 1
- 241000588772 Morganella morganii Species 0.000 description 1
- 241000235395 Mucor Species 0.000 description 1
- 241000711386 Mumps virus Species 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 241000187482 Mycobacterium avium subsp. paratuberculosis Species 0.000 description 1
- 241000186366 Mycobacterium bovis Species 0.000 description 1
- 241000187479 Mycobacterium tuberculosis Species 0.000 description 1
- 241000202934 Mycoplasma pneumoniae Species 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 241000588652 Neisseria gonorrhoeae Species 0.000 description 1
- 101710204212 Neocarzinostatin Proteins 0.000 description 1
- 208000009869 Neu-Laxova syndrome Diseases 0.000 description 1
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 1
- 241000187678 Nocardia asteroides Species 0.000 description 1
- 241001503696 Nocardia brasiliensis Species 0.000 description 1
- 108091007494 Nucleic acid- binding domains Proteins 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 108700020796 Oncogene Proteins 0.000 description 1
- 102000043276 Oncogene Human genes 0.000 description 1
- 241000283973 Oryctolagus cuniculus Species 0.000 description 1
- 206010061535 Ovarian neoplasm Diseases 0.000 description 1
- 108091081548 Palindromic sequence Proteins 0.000 description 1
- 206010061902 Pancreatic neoplasm Diseases 0.000 description 1
- 241000193157 Paraclostridium bifermentans Species 0.000 description 1
- 241000526686 Paracoccidioides brasiliensis Species 0.000 description 1
- 208000002606 Paramyxoviridae Infections Diseases 0.000 description 1
- 206010033976 Paravaccinia Diseases 0.000 description 1
- 241000606860 Pasteurella Species 0.000 description 1
- 208000031481 Pathologic Constriction Diseases 0.000 description 1
- LSXGADJXBDFXQU-DLOVCJGASA-N Phe-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 LSXGADJXBDFXQU-DLOVCJGASA-N 0.000 description 1
- PLNHHOXNVSYKOB-JYJNAYRXSA-N Phe-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CC=CC=C1)N PLNHHOXNVSYKOB-JYJNAYRXSA-N 0.000 description 1
- LJUUGSWZPQOJKD-JYJNAYRXSA-N Phe-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)Cc1ccccc1)C(O)=O LJUUGSWZPQOJKD-JYJNAYRXSA-N 0.000 description 1
- HHOOEUSPFGPZFP-QWRGUYRKSA-N Phe-Asn-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HHOOEUSPFGPZFP-QWRGUYRKSA-N 0.000 description 1
- UEXCHCYDPAIVDE-SRVKXCTJSA-N Phe-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEXCHCYDPAIVDE-SRVKXCTJSA-N 0.000 description 1
- RIYZXJVARWJLKS-KKUMJFAQSA-N Phe-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 RIYZXJVARWJLKS-KKUMJFAQSA-N 0.000 description 1
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 1
- HQVPQHLNOVTLDD-IHRRRGAJSA-N Phe-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N HQVPQHLNOVTLDD-IHRRRGAJSA-N 0.000 description 1
- CDQCFGOQNYOICK-IHRRRGAJSA-N Phe-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDQCFGOQNYOICK-IHRRRGAJSA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- GYEPCBNTTRORKW-PCBIJLKTSA-N Phe-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O GYEPCBNTTRORKW-PCBIJLKTSA-N 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- LRBSWBVUCLLRLU-BZSNNMDCSA-N Phe-Leu-Lys Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1ccccc1)C(=O)N[C@@H](CCCCN)C(O)=O LRBSWBVUCLLRLU-BZSNNMDCSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 1
- VHDNDCPMHQMXIR-IHRRRGAJSA-N Phe-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VHDNDCPMHQMXIR-IHRRRGAJSA-N 0.000 description 1
- WURZLPSMYZLEGH-UNQGMJICSA-N Phe-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CC=CC=C1)N)O WURZLPSMYZLEGH-UNQGMJICSA-N 0.000 description 1
- MMJJFXWMCMJMQA-STQMWFEESA-N Phe-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CC=CC=C1 MMJJFXWMCMJMQA-STQMWFEESA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 1
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 1
- JTKGCYOOJLUETJ-ULQDDVLXSA-N Phe-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JTKGCYOOJLUETJ-ULQDDVLXSA-N 0.000 description 1
- MWQXFDIQXIXPMS-UNQGMJICSA-N Phe-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O MWQXFDIQXIXPMS-UNQGMJICSA-N 0.000 description 1
- 241001531356 Phialophora verrucosa Species 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 108010021757 Polynucleotide 5'-Hydroxyl-Kinase Proteins 0.000 description 1
- 102000008422 Polynucleotide 5'-hydroxyl-kinase Human genes 0.000 description 1
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 1
- GRIRJQGZZJVANI-CYDGBPFRSA-N Pro-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 GRIRJQGZZJVANI-CYDGBPFRSA-N 0.000 description 1
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 1
- DRIJZWBRGMJCDD-DCAQKATOSA-N Pro-Gln-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O DRIJZWBRGMJCDD-DCAQKATOSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 1
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- FEVDNIBDCRKMER-IUCAKERBSA-N Pro-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@@H]1CCCN1 FEVDNIBDCRKMER-IUCAKERBSA-N 0.000 description 1
- SOACYAXADBWDDT-CYDGBPFRSA-N Pro-Ile-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SOACYAXADBWDDT-CYDGBPFRSA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 1
- YXHYJEPDKSYPSQ-AVGNSLFASA-N Pro-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 YXHYJEPDKSYPSQ-AVGNSLFASA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 1
- INDVYIOKMXFQFM-SRVKXCTJSA-N Pro-Lys-Gln Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O INDVYIOKMXFQFM-SRVKXCTJSA-N 0.000 description 1
- ABSSTGUCBCDKMU-UWVGGRQHSA-N Pro-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 ABSSTGUCBCDKMU-UWVGGRQHSA-N 0.000 description 1
- VWHJZETTZDAGOM-XUXIUFHCSA-N Pro-Lys-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VWHJZETTZDAGOM-XUXIUFHCSA-N 0.000 description 1
- VGVCNKSUVSZEIE-IHRRRGAJSA-N Pro-Phe-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O VGVCNKSUVSZEIE-IHRRRGAJSA-N 0.000 description 1
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 1
- GNADVDLLGVSXLS-ULQDDVLXSA-N Pro-Phe-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O GNADVDLLGVSXLS-ULQDDVLXSA-N 0.000 description 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 1
- NAIPAPCKKRCMBL-JYJNAYRXSA-N Pro-Pro-Phe Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=CC=C1 NAIPAPCKKRCMBL-JYJNAYRXSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- 206010036790 Productive cough Diseases 0.000 description 1
- 208000000236 Prostatic Neoplasms Diseases 0.000 description 1
- 102000002067 Protein Subunits Human genes 0.000 description 1
- 108010001267 Protein Subunits Proteins 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 241000589517 Pseudomonas aeruginosa Species 0.000 description 1
- 108020004518 RNA Probes Proteins 0.000 description 1
- 239000003391 RNA probe Substances 0.000 description 1
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 1
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 1
- 241000702263 Reovirus sp. Species 0.000 description 1
- 108010034634 Repressor Proteins Proteins 0.000 description 1
- 241000725643 Respiratory syncytial virus Species 0.000 description 1
- 240000005384 Rhizopus oryzae Species 0.000 description 1
- 241000235546 Rhizopus stolonifer Species 0.000 description 1
- 241000606701 Rickettsia Species 0.000 description 1
- 241000606723 Rickettsia akari Species 0.000 description 1
- 241000606720 Rickettsia australis Species 0.000 description 1
- 241000606697 Rickettsia prowazekii Species 0.000 description 1
- 241000606726 Rickettsia typhi Species 0.000 description 1
- 241000711897 Rinderpest morbillivirus Species 0.000 description 1
- 241000710799 Rubella virus Species 0.000 description 1
- 241000293869 Salmonella enterica subsp. enterica serovar Typhimurium Species 0.000 description 1
- 241000223598 Scedosporium boydii Species 0.000 description 1
- 241000710961 Semliki Forest virus Species 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 1
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 1
- RZUOXAKGNHXZTB-GUBZILKMSA-N Ser-Arg-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O RZUOXAKGNHXZTB-GUBZILKMSA-N 0.000 description 1
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- KAAPNMOKUUPKOE-SRVKXCTJSA-N Ser-Asn-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KAAPNMOKUUPKOE-SRVKXCTJSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- XSYJDGIDKRNWFX-SRVKXCTJSA-N Ser-Cys-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XSYJDGIDKRNWFX-SRVKXCTJSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- XWCYBVBLJRWOFR-WDSKDSINSA-N Ser-Gln-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O XWCYBVBLJRWOFR-WDSKDSINSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 1
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 1
- MLSQXWSRHURDMF-GARJFASQSA-N Ser-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N)C(=O)O MLSQXWSRHURDMF-GARJFASQSA-N 0.000 description 1
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- AABIBDJHSKIMJK-FXQIFTODSA-N Ser-Ser-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O AABIBDJHSKIMJK-FXQIFTODSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- PZHJLTWGMYERRJ-SRVKXCTJSA-N Ser-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O PZHJLTWGMYERRJ-SRVKXCTJSA-N 0.000 description 1
- 241000710960 Sindbis virus Species 0.000 description 1
- 208000001203 Smallpox Diseases 0.000 description 1
- 241000605008 Spirillum Species 0.000 description 1
- 241001085826 Sporotrichum Species 0.000 description 1
- 241000191963 Staphylococcus epidermidis Species 0.000 description 1
- 241000193998 Streptococcus pneumoniae Species 0.000 description 1
- 241000193996 Streptococcus pyogenes Species 0.000 description 1
- 241000194024 Streptococcus salivarius Species 0.000 description 1
- 210000001744 T-lymphocyte Anatomy 0.000 description 1
- 101710137500 T7 RNA polymerase Proteins 0.000 description 1
- NJEMRSFGDNECGF-GCJQMDKQSA-N Thr-Ala-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O NJEMRSFGDNECGF-GCJQMDKQSA-N 0.000 description 1
- YRNBANYVJJBGDI-VZFHVOOUSA-N Thr-Ala-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N)O YRNBANYVJJBGDI-VZFHVOOUSA-N 0.000 description 1
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- VASYSJHSMSBTDU-LKXGYXEUSA-N Thr-Asn-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O VASYSJHSMSBTDU-LKXGYXEUSA-N 0.000 description 1
- QNJZOAHSYPXTAB-VEVYYDQMSA-N Thr-Asn-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O QNJZOAHSYPXTAB-VEVYYDQMSA-N 0.000 description 1
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- GARULAKWZGFIKC-RWRJDSDZSA-N Thr-Gln-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GARULAKWZGFIKC-RWRJDSDZSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 1
- IMULJHHGAUZZFE-MBLNEYKQSA-N Thr-Gly-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IMULJHHGAUZZFE-MBLNEYKQSA-N 0.000 description 1
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 1
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- ZSPQUTWLWGWTPS-HJGDQZAQSA-N Thr-Lys-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZSPQUTWLWGWTPS-HJGDQZAQSA-N 0.000 description 1
- NWECYMJLJGCBOD-UNQGMJICSA-N Thr-Phe-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O NWECYMJLJGCBOD-UNQGMJICSA-N 0.000 description 1
- NBIIPOKZPUGATB-BWBBJGPYSA-N Thr-Ser-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O NBIIPOKZPUGATB-BWBBJGPYSA-N 0.000 description 1
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 1
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 1
- HUPLKEHTTQBXSC-YJRXYDGGSA-N Thr-Ser-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUPLKEHTTQBXSC-YJRXYDGGSA-N 0.000 description 1
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 1
- BGHVVGPELPHRCI-HZTRNQAASA-N Thr-Trp-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N)O BGHVVGPELPHRCI-HZTRNQAASA-N 0.000 description 1
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- FYBFTPLPAXZBOY-KKHAAJSZSA-N Thr-Val-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O FYBFTPLPAXZBOY-KKHAAJSZSA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- SPIFGZFZMVLPHN-UNQGMJICSA-N Thr-Val-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SPIFGZFZMVLPHN-UNQGMJICSA-N 0.000 description 1
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 1
- 239000004473 Threonine Substances 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- 241000045663 Trematosphaeria grisea Species 0.000 description 1
- 241000589886 Treponema Species 0.000 description 1
- 241001045770 Trichophyton mentagrophytes Species 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- UEFHVUQBYNRNQC-SFJXLCSZSA-N Trp-Phe-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 UEFHVUQBYNRNQC-SFJXLCSZSA-N 0.000 description 1
- WSMVEHPVOYXPAQ-XIRDDKMYSA-N Trp-Ser-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N WSMVEHPVOYXPAQ-XIRDDKMYSA-N 0.000 description 1
- QHWMVGCEQAPQDK-UMPQAUOISA-N Trp-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O QHWMVGCEQAPQDK-UMPQAUOISA-N 0.000 description 1
- VMXLNDRJXVAJFT-JYBASQMISA-N Trp-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O VMXLNDRJXVAJFT-JYBASQMISA-N 0.000 description 1
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 1
- PEVVXUGSAKEPEN-AVGNSLFASA-N Tyr-Asn-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PEVVXUGSAKEPEN-AVGNSLFASA-N 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- ZNFPUOSTMUMUDR-JRQIVUDYSA-N Tyr-Asn-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZNFPUOSTMUMUDR-JRQIVUDYSA-N 0.000 description 1
- DANHCMVVXDXOHN-SRVKXCTJSA-N Tyr-Asp-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 DANHCMVVXDXOHN-SRVKXCTJSA-N 0.000 description 1
- RCLOWEZASFJFEX-KKUMJFAQSA-N Tyr-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RCLOWEZASFJFEX-KKUMJFAQSA-N 0.000 description 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 1
- LHTGRUZSZOIAKM-SOUVJXGZSA-N Tyr-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O LHTGRUZSZOIAKM-SOUVJXGZSA-N 0.000 description 1
- JKUZFODWJGEQAP-KBPBESRZSA-N Tyr-Gly-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O JKUZFODWJGEQAP-KBPBESRZSA-N 0.000 description 1
- QAYSODICXVZUIA-WLTAIBSBSA-N Tyr-Gly-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O QAYSODICXVZUIA-WLTAIBSBSA-N 0.000 description 1
- JHORGUYURUBVOM-KKUMJFAQSA-N Tyr-His-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O JHORGUYURUBVOM-KKUMJFAQSA-N 0.000 description 1
- DWAMXBFJNZIHMC-KBPBESRZSA-N Tyr-Leu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O DWAMXBFJNZIHMC-KBPBESRZSA-N 0.000 description 1
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 1
- 241001201507 Udea Species 0.000 description 1
- 208000006105 Uterine Cervical Neoplasms Diseases 0.000 description 1
- 206010046865 Vaccinia virus infection Diseases 0.000 description 1
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 1
- GNWUWQAVVJQREM-NHCYSSNCSA-N Val-Asn-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N GNWUWQAVVJQREM-NHCYSSNCSA-N 0.000 description 1
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 1
- OGNMURQZFMHFFD-NHCYSSNCSA-N Val-Asn-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N OGNMURQZFMHFFD-NHCYSSNCSA-N 0.000 description 1
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 1
- SRWWRLKBEJZFPW-IHRRRGAJSA-N Val-Cys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N SRWWRLKBEJZFPW-IHRRRGAJSA-N 0.000 description 1
- XJFXZQKJQGYFMM-GUBZILKMSA-N Val-Cys-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N XJFXZQKJQGYFMM-GUBZILKMSA-N 0.000 description 1
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 1
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 1
- JXGWQYWDUOWQHA-DZKIICNBSA-N Val-Gln-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N JXGWQYWDUOWQHA-DZKIICNBSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 1
- YTUABZMPYKCWCQ-XQQFMLRXSA-N Val-His-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N YTUABZMPYKCWCQ-XQQFMLRXSA-N 0.000 description 1
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 1
- PYPZMFDMCCWNST-NAKRPEOUSA-N Val-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N PYPZMFDMCCWNST-NAKRPEOUSA-N 0.000 description 1
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 1
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 1
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 1
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 1
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 1
- SJLVYVZBFDTRCG-DCAQKATOSA-N Val-Lys-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N SJLVYVZBFDTRCG-DCAQKATOSA-N 0.000 description 1
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 1
- UOUIMEGEPSBZIV-ULQDDVLXSA-N Val-Lys-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UOUIMEGEPSBZIV-ULQDDVLXSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 1
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 1
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- YLBNZCJFSVJDRJ-KJEVXHAQSA-N Val-Thr-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O YLBNZCJFSVJDRJ-KJEVXHAQSA-N 0.000 description 1
- UFCHCOKFAGOQSF-BQFCYCMXSA-N Val-Trp-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N UFCHCOKFAGOQSF-BQFCYCMXSA-N 0.000 description 1
- QPJSIBAOZBVELU-BPNCWPANSA-N Val-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N QPJSIBAOZBVELU-BPNCWPANSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- AOILQMZPNLUXCM-AVGNSLFASA-N Val-Val-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN AOILQMZPNLUXCM-AVGNSLFASA-N 0.000 description 1
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 1
- 206010046980 Varicella Diseases 0.000 description 1
- 241000870995 Variola Species 0.000 description 1
- 241000700647 Variola virus Species 0.000 description 1
- 241000607626 Vibrio cholerae Species 0.000 description 1
- 241000710951 Western equine encephalitis virus Species 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 108010069490 alanyl-glycyl-seryl-glutamic acid Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 229940005347 alcaligenes faecalis Drugs 0.000 description 1
- 230000004075 alteration Effects 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 238000000137 annealing Methods 0.000 description 1
- 230000036436 anti-hiv Effects 0.000 description 1
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 1
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 1
- 229940091771 aspergillus fumigatus Drugs 0.000 description 1
- 108010005774 beta-Galactosidase Proteins 0.000 description 1
- 238000010256 biochemical assay Methods 0.000 description 1
- 239000013060 biological fluid Substances 0.000 description 1
- 229960001561 bleomycin Drugs 0.000 description 1
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- 229940056450 brucella abortus Drugs 0.000 description 1
- 229940038698 brucella melitensis Drugs 0.000 description 1
- 230000000711 cancerogenic effect Effects 0.000 description 1
- 229940095731 candida albicans Drugs 0.000 description 1
- VPKDCDLSJZCGKE-UHFFFAOYSA-N carbodiimide group Chemical group N=C=N VPKDCDLSJZCGKE-UHFFFAOYSA-N 0.000 description 1
- 150000001718 carbodiimides Chemical class 0.000 description 1
- 231100000315 carcinogenic Toxicity 0.000 description 1
- 108091092356 cellular DNA Proteins 0.000 description 1
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 1
- 201000010881 cervical cancer Diseases 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 231100000670 co-carcinogen Toxicity 0.000 description 1
- 230000004186 co-expression Effects 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 208000029742 colonic neoplasm Diseases 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 238000011109 contamination Methods 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000003936 denaturing gel electrophoresis Methods 0.000 description 1
- 238000002405 diagnostic procedure Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 108010009297 diglycyl-histidine Proteins 0.000 description 1
- 206010013023 diphtheria Diseases 0.000 description 1
- 239000012153 distilled water Substances 0.000 description 1
- VHJLVAABSRFDPM-QWWZWVQMSA-N dithiothreitol Chemical compound SC[C@@H](O)[C@H](O)CS VHJLVAABSRFDPM-QWWZWVQMSA-N 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000007824 enzymatic assay Methods 0.000 description 1
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 1
- 229960005542 ethidium bromide Drugs 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000013604 expression vector Substances 0.000 description 1
- 210000000416 exudates and transudate Anatomy 0.000 description 1
- 206010016256 fatigue Diseases 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000001917 fluorescence detection Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 210000004392 genitalia Anatomy 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010043293 glycyl-prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 239000010440 gypsum Substances 0.000 description 1
- 229910052602 gypsum Inorganic materials 0.000 description 1
- 208000006454 hepatitis Diseases 0.000 description 1
- 231100000283 hepatitis Toxicity 0.000 description 1
- 206010073071 hepatocellular carcinoma Diseases 0.000 description 1
- 210000005260 human cell Anatomy 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 238000011534 incubation Methods 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 229910010272 inorganic material Inorganic materials 0.000 description 1
- 239000011147 inorganic material Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 108010053037 kyotorphin Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010030617 leucyl-phenylalanyl-valine Proteins 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 208000020816 lung neoplasm Diseases 0.000 description 1
- 210000003712 lysosome Anatomy 0.000 description 1
- 230000001868 lysosomic effect Effects 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010064235 lysylglycine Proteins 0.000 description 1
- 201000001441 melanoma Diseases 0.000 description 1
- 108020004999 messenger RNA Proteins 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 208000020298 milker nodule Diseases 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 208000008588 molluscum contagiosum Diseases 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- QZGIWPZCWHMVQL-UIYAJPBUSA-N neocarzinostatin chromophore Chemical compound O1[C@H](C)[C@H](O)[C@H](O)[C@@H](NC)[C@H]1O[C@@H]1C/2=C/C#C[C@H]3O[C@@]3([C@@H]3OC(=O)OC3)C#CC\2=C[C@H]1OC(=O)C1=C(O)C=CC2=C(C)C=C(OC)C=C12 QZGIWPZCWHMVQL-UIYAJPBUSA-N 0.000 description 1
- 238000001668 nucleic acid synthesis Methods 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 239000011368 organic material Substances 0.000 description 1
- 238000004816 paper chromatography Methods 0.000 description 1
- 239000013610 patient sample Substances 0.000 description 1
- 238000002823 phage display Methods 0.000 description 1
- 108010084572 phenylalanyl-valine Proteins 0.000 description 1
- 108010073101 phenylalanylleucine Proteins 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 230000010287 polarization Effects 0.000 description 1
- 229920002401 polyacrylamide Polymers 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 239000013641 positive control Substances 0.000 description 1
- 108010020755 prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010025826 prolyl-leucyl-arginine Proteins 0.000 description 1
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010053725 prolylvaline Proteins 0.000 description 1
- 238000011321 prophylaxis Methods 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 238000010791 quenching Methods 0.000 description 1
- 230000000171 quenching effect Effects 0.000 description 1
- 238000003127 radioimmunoassay Methods 0.000 description 1
- 238000003259 recombinant expression Methods 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 230000008521 reorganization Effects 0.000 description 1
- 230000003362 replicative effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000001177 retroviral effect Effects 0.000 description 1
- 239000002342 ribonucleoside Substances 0.000 description 1
- 229940046939 rickettsia prowazekii Drugs 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 239000012266 salt solution Substances 0.000 description 1
- 238000013341 scale-up Methods 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 235000010378 sodium ascorbate Nutrition 0.000 description 1
- 229960005055 sodium ascorbate Drugs 0.000 description 1
- PPASLZSBLFJQEF-RKJRWTFHSA-M sodium ascorbate Substances [Na+].OC[C@@H](O)[C@H]1OC(=O)C(O)=C1[O-] PPASLZSBLFJQEF-RKJRWTFHSA-M 0.000 description 1
- YZHUMGUJCQRKBT-UHFFFAOYSA-M sodium chlorate Chemical compound [Na+].[O-]Cl(=O)=O YZHUMGUJCQRKBT-UHFFFAOYSA-M 0.000 description 1
- PPASLZSBLFJQEF-RXSVEWSESA-M sodium-L-ascorbate Chemical compound [Na+].OC[C@H](O)[C@H]1OC(=O)C(O)=C1[O-] PPASLZSBLFJQEF-RXSVEWSESA-M 0.000 description 1
- 239000002904 solvent Substances 0.000 description 1
- 238000000527 sonication Methods 0.000 description 1
- 238000001179 sorption measurement Methods 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 210000003802 sputum Anatomy 0.000 description 1
- 208000024794 sputum Diseases 0.000 description 1
- 210000004085 squamous epithelial cell Anatomy 0.000 description 1
- 230000035882 stress Effects 0.000 description 1
- 238000010998 test method Methods 0.000 description 1
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 108091006106 transcriptional activators Proteins 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 239000013638 trimer Substances 0.000 description 1
- 239000001226 triphosphate Substances 0.000 description 1
- 235000011178 triphosphate Nutrition 0.000 description 1
- 108010038745 tryptophylglycine Proteins 0.000 description 1
- 108010051110 tyrosyl-lysine Proteins 0.000 description 1
- 238000005199 ultracentrifugation Methods 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 229940035893 uracil Drugs 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 208000007089 vaccinia Diseases 0.000 description 1
- 238000012795 verification Methods 0.000 description 1
- 230000035899 viability Effects 0.000 description 1
- 229940118696 vibrio cholerae Drugs 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
- 229950009268 zinostatin Drugs 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6811—Selection methods for production or design of target specific oligonucleotides or binding molecules
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/14—Antivirals for RNA viruses
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
- A61P31/12—Antivirals
- A61P31/14—Antivirals for RNA viruses
- A61P31/18—Antivirals for RNA viruses for HIV
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6804—Nucleic acid analysis using immunogens
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6816—Hybridisation assays characterised by the detection means
- C12Q1/682—Signal amplification
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6813—Hybridisation assays
- C12Q1/6827—Hybridisation assays for detection of mutation or polymorphism
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6844—Nucleic acid amplification reactions
- C12Q1/6853—Nucleic acid amplification reactions using modified primers or templates
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/70—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving virus or bacteriophage
- C12Q1/701—Specific hybridization probes
- C12Q1/702—Specific hybridization probes for retroviruses
- C12Q1/703—Viruses associated with AIDS
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/70—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving virus or bacteriophage
- C12Q1/701—Specific hybridization probes
- C12Q1/708—Specific hybridization probes for papilloma
Definitions
- This invention provides a method and compositions for use in binding, detecting, and amplifying the detection of specific Target Nucleic Acid sequences in a sample with fidelity and accuracy, even in the presence of closely related but different nucleic acids.
- the binding may involve the chaperoning and assembly of specific molecules into Target Binding Assemblies which specifically bind Target Binding Regions formed by the hybridization of Probe Nucleic Acids and Target Nucleic Acid sequences.
- the amplifying may involve the chaperoning and/or assembly of specific molecules into Booster Binding Assemblies which specifically bind Booster Binding Regions formed by the hybridization of Booster Nucleic Acids with Probe Nucleic Acids, Target Nucleic Acids, or other Booster Nucleic Acids.
- a method, and compositions, involving Hairpin Nucleic Acids is also provided to enable control of the size of specifically or non-specifically elongated Booster Nucleic Acids and Booster Binding Assemblies used in the amplification.
- the detecting involves providing one or more detectable labels, including radioactive, light- or fluorescent-emitting, enzymatic, or other detectable or signal-generating molecules, in association with the Probe Nucleic Acid, the Target Binding Assembly, the Booster Nucleic Acid, the Booster Binding Assembly, or the Hairpin Nucleic Acid.
- a method is presented for isolating nucleic acid fragments from an organism which has TBA component binding sites in order to create a probe nucleic acid and a TBA which is unique for that fragment and/or organism. Therapeutic and prophylactic uses of the Target Binding Assemblies and compositions for such use are also provided.
- TNAs Target Nucleic Acids
- CNAs Cousin Nucleic Acids
- Booster Nucleic Acid BNA
- the instant invention provides methods and compositions for achieving the foregoing desired objectives. As revealed by the following review, the instant compositions and methods have not been reported or suggested in the art. A general and comprehensive review of the state of art of nucleic acid detection is provided in Keller, H., M. M. Manak (1989) DNA Probes , Stockton Press.
- the DNA molecules are fractionated on high percentage polyacrylamide gels such that modified and unmodified fragments can be distinguished.
- Ford et al. applied this technique in order to locate and purify DNA sequence differences responsible for phenotype variation and inherited disease.
- this method is useful for following variations in genetic material, it has a large number of steps, it requires costly components, and it does not offer a direct means of determining whether a PNA has hybridized to the TNA exclusive of CNAs in the sample.
- the probe comprises a single self-complementary sequence which, under hybridizing conditions, forms a hairpin structure having a functional promoter region, and further comprises a single stranded probe sequence extending from the 3′ end of the hairpin sequence.
- the target sequence is rendered transcribable in the presence of a suitable RNA polymerase and appropriate ribonucleoside triphosphates (rNTPs).
- Amplification is accomplished by hybridizing the desired TNA sequence with the probe, ligating the TNA to the PNA, adding the RNA polymerase and the rNTPs to the separated hybrids, and allowing transcription to proceed until a desired amount of RNA transcription product has accumulated.
- That method generally and specifically involves the use of hairpin DNA formed with a single stranded unpaired end to anneal a target sequence. When the target sequence is bound, the production of RNA transcription products is enabled.
- the method involves the detection of secondary transcription products rather than the use of a nucleic acid binding assembly to directly immobilize and/or localize a target sequence.
- a CNA could easily bind to the probe, and the lack of complementarity would not necessarily interfere with the formation of a CNA-PNA hybrid which could then support the production of unwanted transcription products.
- a CNA bound to the PNA might be detected if the lack of complementarity interferes with the susceptibility of the hybrid CNA-PNA pair to be cut by a restriction endonuclease.
- novel methods for assaying a nucleic acid analyte were provided, which employ polynucleotides having oligonucleotide sequences substantially homologous to a sequence of interest in the analyte, where the presence or absence of hybridization at a predetermined stringency provides for the release of a label from a support.
- U.S. Pat. No. 5,124,246 to Urdea et al. discloses liner or branched oligonucleotide multimers useful as amplifiers in biochemical assays which comprise (1) at least one first single-stranded oligonucleotide unit (PNA) that is complementary to a single-stranded oligomucleotide sequence of interest (TNA), and (2) a multiplicity of second single-stranded, oligonucleotide units that are complementary to a single-stranded labeled oligonucleotide.
- PNA first single-stranded oligonucleotide unit
- TAA single-stranded oligomucleotide sequence of interest
- second single-stranded, oligonucleotide units that are complementary to a single-stranded labeled oligonucleotide.
- an analyte polynucleotide strand having an analyte sequence is detected within a sample containing polynucleotides by contacting the analyte polynucleotide with a capture probe (PNA) under hybridizing conditions, where the capture probe has a first binding partner specific for the TNA, and a second binding sequence specific for a solid phase third binding partner.
- PNA capture probe
- the analyte polynucleotide is optionally displaced from the solid phase, then amplified by PCR.
- the PCR primers each have a polynucleotide region capable of hybridizing to a region of the analyte polynucleotide, and at least one of the primers further has an additional binding partner capable of binding a solid-phase binding partner.
- the amplified product is then separated from the reaction mixture by specific binding between the binding partners, and the amplified product is detected.
- U.S. Pat. No. 5,122,600 to Kawaguchi et al. discloses a DNA-immobilized microsphere comprising DNA chains having base sequences which specifically bind a particular protein, and a carrier having a particle size of not more than 50 ⁇ m and not less tan 0.01 ⁇ m which does not adsorb any protein, said carrier and said DNA chains being bound to each other by a chemical bond, and a process for purifying a protein using said microsphere.
- EP 0 453 301 a method for detecting a polynucleotide target sequence in a sample was described wherein sequences in a TNA are detected by hybridizing a first and a second PNA to the TNA.
- Each of said first and second PNAs contained a pre-formed duplex sequence, or a duplex that is formed through chin extension, capable of binding a nucleotide sequence specific binding protein.
- a method for binding a nucleotide specific binding protein to a duplex formed between a TNA and a PNA only upon formation of a duplex between the PNA and TNA is neither disclosed nor suggested.
- TBA Target Nucleic Acid
- PNA Probe Nucleic Acids
- TBA Target Binding Assemblies
- TBR Target Binding Region
- the TBA is comprised of one or more molecules, one or more of which can bind to TBR sequences in a specific and sequence or conformation dependent manner.
- the TBA may comprise one or more piloting sequences, called “PILOTS” or “Asymmetry Sequences,” which assemble and constrain the nucleotide binding components of the TBA to specific geometries.
- the PILOTS act to assemble specific nucleic acid recognition units or other pilots to which specific nucleic acid recognition units are attached into the TBAs in a predetermined fashion.
- the TBA may also contain one or more molecules which anchor or localize the TBA.
- Novel TBAs having unique discriminating characteristics which surprisingly render the TBAs useful not only as diagnostic tools but also as prophylactic or therapeutic compounds, are also disclosed.
- the PNAs in addition to TNA-specific sequences, may also contain one or more sequences, 1 ⁇ 2 BBRs, capable of hybridizing with complementary 1 ⁇ 2 BBRs in Booster Nucleic Acids (BNAs).
- BNAs Booster Nucleic Acids
- BBRs Booster Binding Regions
- Each BBR is capable of being a Booster Binding Assembly (BBA).
- BBA is comprised of molecules, one or more of which can bind to a BBR in a specific and sequence or conformation dependent manner.
- the BBA may comprise one or more piloting sequences, called “PILOTS” or “Asymmetry Sequences,” which assemble and constrain the nucleotide binding components of the TBA to specific geometries.
- the PILOTS act to assemble specific nucleic acid recognition units or other pilots to which specific nucleic acid recognition units are attached into the BBAs in a predetermined fashion.
- the BBA may contain molecules which anchor or localize the BBA or which allow for detection of the bound BBAs and thereby of the TBA-TNA-PNA complexes to which they, in turn, are bound.
- HNAs Hairpin Nucleic Acids
- the HNAs contain a self-hybridizing region and a single stranded 1 ⁇ 2 BBR which, under hybridizing conditions, can hybridize directly to the 1 ⁇ 2 BBRs in the PNAs or the 1 ⁇ 2 BBRs in BNAs already bound to the PNAs, to terminate the extension of BNAs onto the PNA or onto other BNAs.
- Methods and compositions are disclosed for test procedures and the production of a test kit containing PNAs, TBAs, TBRs, BNAs, BBRs, BBAs and HNAs for the detection, localization and differentiation of specific nucleic acid sequences, including nucleic acid sequences which are found in human cells, in the Human Immunodeficiency Virus (HIV), Human Papillomavirus (HPV), and in other nucleic acid containing systems including viruses and bacteria.
- HIV Human Immunodeficiency Virus
- HPV Human Papillomavirus
- Another object of this invention is to provide a method and compositions for the creation of Booster Binding Assemblies which specifically bind Booster Binding Regions formed by the hybridization of Booster Nucleic Acid sequences with Probe Nucleic Acids, Booster Nucleic Acids and Hairpin Nucleic Acids.
- Another object of this invention is to provide a method and compositions containing Hairpin Nucleic Acids which enable the control of the size of specifically or non-specifically elongated Booster Nucleic Acids and Booster Binding Assemblies used in amplification of PNA-TNA hybridization events.
- Another object of this invention is to provide a method and compositions for use in the selection, assembly and or chaperoning of specific molecules, each with nucleic acid binding discriminating capabilities, into Target and Booster Binding Assemblies.
- Another object of this invention is to provide a method and compositions for use in amplifying the detection of Target Binding assemblies bound to Target Binding Regions using Booster Binding Assemblies and Booster Nucleic Acids.
- Another object of this invention is to provide a method and compositions which allow the use of one or more detectable labels, including but limited to radioactive labels, light emitting, fluorescent, enzymatic or other signal generating molecules. These labels are used in association with Probe Nucleic Acids, Target Binding Assemblies, Booster Binding Assemblies, Booster Nucleic Acids or Hairpin Nucleic Acids.
- detectable labels including but limited to radioactive labels, light emitting, fluorescent, enzymatic or other signal generating molecules.
- Another object of this invention is to provide a method for isolating nucleic acid fragments form an organism which has TBA component binding sites in order to create Probe Nucleic Acids and TBAs which are unique for that fragment or organism.
- FIG. 1-I is a PNA containing a 1 ⁇ 2 TBR, which is a single-stranded sequence which is complementary to a TNA and a 1 ⁇ 2 BBR sequence.
- FIG. 1-IIa is a TNA to which is added the components of FIG. 1-I, and, under hybridizing conditions, binds the PNA to form the components of FIG. 1-IIIa, a PNA-TNA hybrid containing at least one TBR
- FIG. 1-IVa is a BNA which is a to the components of FIG. 1-IIIa and, under hybridizing conditions, binds the 1 ⁇ 2 BBR of FIG. 1-IIIa to form a PNA-BNA hybrid containing a BBR shown in FIG. 1-Va.
- FIG. I-IIb is a BNA which is added the components of FIG. 1-I, and which, under hybridizing conditions, binds the PNA to form the components of FIG. 1-IIIb, a PNA-TNA hybrid containing a BBR.
- FIG. I-IVb is a TNA to which is added the components of FIG. 1-IIIb and which, under hybridizing conditions, binds the 1 ⁇ 2 TBR of FIG. 1-IIIb to form a PNA-BNA hybrid containing a TBR shown in FIG. I-Vb.
- FIG. 1-IIc is a HNA which is added to the components of FIG. 1-I and which, under hybridizing conditions, binds the PNA to form the components of FIG. 1-IIIc, a PNA-HNA hybrid containing a BBR.
- FIG. 1-IVc is a TNA which is added to the components of FIG. 1-IIIc and which, under hybridizing conditions, binds the 1 ⁇ 2 TBR of FIG. 1-IIIc to form a PNA-BNA hybrid containing a BBR shown in FIG. I-Vc.
- the hybrids which form the TBRs and BBRs are useful in the present invention.
- the PNAs and BNAs may contain no attached support and/or indicator (OSA), or an attached support or other means of localization, including, but not limited to, attachment to heads, polymers, and surfaces, and/or indicators.
- OSA attached support and/or indicator
- FIG. 2 a is a diagram of strategies for polymerization of BNAs onto PNAs and capping by HNAs.
- FIG. 2 b is a diagram of additional strategies for amplifying PNA-TNA signals via polymerization of BNAs and capping by HNAs.
- FIG. 3 is a diagram showing the use of BNAs containing multiple 1 ⁇ 2 BBRs per BNA.
- FIG. 4 a is a diagram showing the binding of TBAs and BBAs to TBRs and BBRs, and the ability of the TBA to discriminate between TNAs and CNAs. According to this embodiment, if the TBA is immobilized, either on a bead, microtiter plate surface, or any other such surface, only complexes such as complex X would be red and detected, while complexes such as complex XI would not.
- FIG. 4 b is a diagram exemplifying events similar to those shown in FIG. 4 a but in a slightly different order of occurrence.
- FIG. 5 is a diagram exemplifying PNAs containing between one 1 ⁇ 2 TBR and no 1 ⁇ 2 BBR to PNAs containing up to five 1 ⁇ 2 TBRs and one 1 ⁇ 2 BBR.
- the (a) and (b) members of each numeral (I, II, III, IV, V) form a set which, upon hybridization to a TNA, provide TBRs either with ((a) members) or without ((b) members) an available 1 ⁇ 2 BBR for amplification via hybridization to BNAs having complementary 1 ⁇ 2 BBRs.
- FIG. 6 a is a diagram exemplifying a particular TNA having two 1 ⁇ 2 TBRs which, upon binding an appropriate PNA, forms two closely associated TBRs capable of binding two TBAs. A 1 ⁇ 2 BER is also provided for amplification.
- FIG. 6 b is a diagram showing the same events as in FIG. 6 a except here, a double TBA is used so that discrimination between single TBRs that occur in normal cellular samples may be discriminated from abnormal, double TBRs.
- FIG. 6 c is a diagram showing the same scenario as in FIG. 6 a except that here, five TBRs are identified in the TNA.
- Each TBR may be bound to a TBA same or different, and each TBA may be differentially labeled, allowing for confirmation that all five sites are present in the TNA.
- FIG. 6 d is a diagram of the same events as in FIG. 6 c except here, a double TBA is shown, extending what is shown in FIG. 6 b to the use of the double TBA.
- An example of the TNA shown in item II in FIGS. 6 a , 6 b , 6 c and 6 d is HIV single stranded DNA or RNA.
- FIG. 7 shows the HIV LTR as a TNA, and two PNAs, and a strategy for detection of the TNA using the PNAs.
- FIG. 8 is a schematic of one embodiment of the invention wherein a target binding assembly is used to bind a hybrid TNA-PNA, and booster binding assemblies are used to bind polymerized BNAs.
- FIG. 9 is a schematic of a modular TBA in which assembly sequences, linker sequences, and asymmetry sequences are used to chaperone desired nucleic acid recognition units together to form a TBA.
- FIG. 10 shows modular TBAs useful in detection of HIV-specific sequences.
- FIG. 11 shows modular TBAs useful in the detection of human papillomavirus sequences. Each unit of E2 is actually a dimer of the DNA binding portion of E2.
- FIG. 12 a is a schematic of TNA fractionation and shift in mobility due to binding of a TBA.
- FIG. 12 b is a schematic of TNA fractionation and enhanced shift in mobility due to binding of BBAs in addition to TBAs.
- FIG. 13 shows a detection strategy for deletion sequences; an example of use of this strategy is for a human papillomavirus integration assay.
- FIG. 14 shows assembly of higher order TBAs through use of nucleic acid recognition units, linker, assembly, and asymmetry sequences such that various Target Binding Assemblies specific to binding sites in the HIV LTR are formed.
- FIG. 15 shows assembly of higher order TBAs through use of DNA recognition units, linker, assembly, and asymmetry sequences such that various Target Binding Assemblies specific to binding sites in the HPV genome are formed.
- FIG. 16 shows the discrimination achieved by using a complex TBA and the ability of endogenous competitor target binding molecules to eliminate binding of the TBA to a cousin nucleic acid but not from the TNA which contains the appropriate orientation of more than one site recognized by the TBA.
- FIG. 17 shows the ability of a TBA to specifically be targeted to bind to sites of sequence mismatch and to preferentially bind those sites over cousin sites which do not contain all of the targeted mismatches.
- SEQ ID NO. 1 corresponds to FIG. 5-Ia- 1 and shows the class I MHC NF-kB binding site.
- SEQ ID NO. 2 corresponds to FIG. 5 (Ia) and shows the B2-microglobulin NF-kB binding site.
- SEQ ID NO. 3 corresponds to FIG. 5 (Ia) and shows the kappa immunoglobulin NF-kB binding site.
- SEQ ED NO. 4 corresponds to FIG. 5 (Ia) and shows one of the HIV NF-kB binding sites.
- SEQ ID NO. 5 corresponds to FIG. 5 (Ia) and shows one of the HIV NF-kB binding sites.
- SEQ ID NO. 6 corresponds to FIG. 5 (Ia) and shows the c-myc NF-kB binding site.
- SEQ ID NO. 7 corresponds to FIG. 5 (IIa) and shows a double HIV NF-kB binding site.
- SEQ ID NO. 8 corresponds to FIG. 5 (IIa) and shows a double HIV NF-kB binding site.
- SEQ ID NOS. 9-16 correspond to FIG. 5 (IIa) and show a double binding site with one site being an HIV NF-kB binding site, and the other site being an HIV SP1 binding site.
- SEQ ID NOS. 17-18 correspond to FIG. 5 (IIa) and show a double HIV SP1 binding site.
- SEQ ID NOS. 19-31 correspond to FIG. 5 (IIIa) and show a double HIV NF-kB binding site and an HIV SP1 binding site.
- SEQ ID NOS. 32-33 correspond to FIG. 5 (IVa) and show a quadruple binding site where two sites are HIV NF-kB binding sites and two sites are HIV SP1 binding sites.
- SEQ ID NO. 34 corresponds to FIG. 5 VIa) and show a quintuple binding site where two sites are HIV NF-kB binding sites and three sites are HIV SP1 binding sites.
- SEQ ID NO. 35 is an example of a 1 ⁇ 2 BBR, in this case the OL1, OL2 and OL3 elements of the bacteriophage lambda left operator, including intervening sequences.
- SEQ ID NO. 36 is an example of a 1 ⁇ 2 BBR, in this case the OR3, OR2 and OR1 elements of the bacteriophage lambda right operator, including intervening sequences.
- SEQ ID NO. 37 is the HIV LTR.
- SEQ ID NO. 38 is a PNA complementary to PNA of the HIV LTR.
- SEQ ID NO. 39 is a PNA complementary to a different PNA of the HIV LTR than SEQ ID NO. 38.
- SEQ ID NO. 40 is a PNA complementary to part of the HIV LTR and it also contains a 1 ⁇ 2 BBR and an overhang sequence for polymerizing BNAs onto the PNA.
- SEQ ID NO. 41 is a BNA complementary to the SEQ ID NO. 40 1 ⁇ 2 BBR.
- SEQ ID NO. 42 is a BNA that will polymerize onto the SEQ ID NO. 41 BNA and which, with SEQ ID NOS. 40 and 41, creates a PstI recognition site.
- SEQ ID NO. 43 is a BNA that is complementary to the SEQ ID NO. 42 BNA and which completes a BamHI recognition site.
- SEQ ID NO. 44 is an HNA which has a BamHI recognition site that will hybridize with the BamHI recognition site created by SEQ ID NOS. 42 and 43 to the growing polymer.
- SEQ ID NO. 45 is a second PNA which, like SEQ ID NO. 40, is complementary to part of the HIV LTR, but not to the same sequences as SEQ ID NO. 40.
- SEQ ID NO. 45 also encodes a 1 ⁇ 2 BBR and an overhang which will allow polymerization of BNAs starting with a Sph1 recognition site.
- SEQ ID NOS. 46-62 are human papillomavirus (HPV) specific PNAs which, upon hybridization with HPV sequences, form TBRs which bind HPV DNA binding proteins.
- HPV human papillomavirus
- SEQ ID NOS. 63-71 are NF-kB DNA recognition units for incorporation into TBAs.
- SEQ ID NO. 72 is a nuclear localization sequence.
- SEQ ID NO. 73 is a SP1sequence recognition unit.
- SEQ ID NO. 74 is a TATA binding protein recognition unit.
- SEQ ID NOS. 75-84 are papillomavirus E2 DNA recognition units.
- SEQ ID NOS. 85-92 are asymmetry sequences.
- SEQ ID NO. 93 is an arabidopsis TATA binding protein recognition unit.
- SEQ ID NO. 94 is an HPV-16-E2-1 DNA binding protein recognition unit.
- SEQ ID NO. 95 is an HPV-16E2-2 DNA binding protein recognition unit.
- SEQ ID NO. 96 is an HPV-18-E2 DNA binding protein recognition unit.
- SEQ ID NO. 97 is an HPV-33-E2 DNA binding protein recognition unit.
- SEQ ID NO. 98 is a bovine papillomavirus E2 DNA binding protein recognition unit.
- SEQ ID NOS. 99-102 are exemplary linker sequences.
- SEQ ID NO. 103 is an exemplary nuclear localization signal sequence (NLS).
- SEQ ID NOS. 104-108 are exemplary chaperone sequences.
- SEQ ID NOS. 109-116 are exemplary assembled TBA sequences
- SEQ ID NO. 117 is a consensus NF-kB binding site.
- SEQ ID NO. 118 an HIV Tat amino acid sequence.
- Abbreviations single stranded nucleic acid double-stranded nucleic acid binding region on nucleic acid no support or indicators, or solid support, or other means of localization, in- cluding, but not limited to, attachment to beads, polymers, and surfaces, or indicators OSA BBA booster binding assembly BBR booster binding region BNA booster nucleic acid CNA cousin nucleic acid 1/2 BBR single-stranded region which, when hybridized to the complementary sequence from an HNA or a BNA, can bind a BBA 1/2 TBR single-stranded region of the PNA which, when hybridized to the complementary sequence from a TNA, can bind a TBA OSA optional support or attch- ment, circle with box PNA probe nucleic acid TBA target binding assembly TBR target binding region TNA target nucleic acid HNA Hairpin Nucleic Acid
- TBAs target binding assemblies
- BBAs booster binding assemblies
- DNA binding proteins nucleic acid binding proteins or RNA binding proteins
- TBAs compositions comprised of molecules which bind to DNA or RNA target nucleic acid sequences (TNAs) irrespective of the specificity of the category of binding molecules from which they are derived.
- TBAs target nucleic acid sequences
- a TBA adapted to bind to human immunodeficiency virus sequences may be most similar to an NF-kB transcriptional factor which typically binds DNA sequences.
- the TBA may be adapted for optimal use to bind to RNA sequences of a particular sequence composition or conformation.
- TBA and BBA discrimination of TNAs from related sequences may be the formation of precise probe nucleic acid (PNA)-target nucleic acid (TNA) hybrid segments (PNA-TNA hybrids).
- PNA probe nucleic acid
- TNA-TNA hybrids precise probe nucleic acid hybrid segments
- the basis of discrimination may just as well be the formation of a particular conformation, and may not require the complete absence of mismatched-base pairing in the TNA-PNA hybrid.
- TBA or BBA operation should be understood throughout to depend on discrimination of any property unique to the TNA-PNA hybrid as opposed to any properties displayed by any PNA-CNA hybrids that may be formed in a test sample contacted with a given PNA.
- the present invention provides a method for specifically identifying a target nucleic acid (TNA) in a sample through the use of target binding assemblies (TBAs) which incorporate specific nucleic acid binding proteins.
- TSAs target binding assemblies
- PNAs probe nucleic acids
- TBA duplex target binding region
- TBA-TNA-PNA complex is formed.
- any of a number of nucleic acid amplification systems including polymerase chain reaction, or the use of branched DNA, each branch of which contains a detetable label, may be used.
- a novel method of amplification is described herein where the amplifiable portion of the PNA contains sequences onto which booster nucleic acids (BNAs) may be polymerized.
- BNAs booster nucleic acids
- BBR booster binding region
- BBA booster binding assembly
- the TNA will be understood to include specific nucleic acid sequences.
- the TBA will be understood to be any molecular assembly which can specifically and tightly bind to a formed TNA-PNA hybrid.
- the TBA will contain one or more molecules whose sequences are sufficient to bind to the TBR.
- Nucleic acid binding domains which are known can either be used directly as components of the TBA or modified according to the teachings provided herein. The most readily available molecules with such sequences are the DNA-binding domains of DNA-binding proteins. Specifically, many DNA or RNA binding proteins are known which can either used directly as the known, unmodified protein, or the TBA may be a nucleic acid binding protein, modified according to the specific teachings provided herein.
- modifications that are desirable would include optimization of binding affinities, removal of unwanted activities (such as nuclease activity and reorganization of the TBA in the presence of other molecules with an affinity for components of the TBA), optimization of selectivity of a target sequence over closely related sequences, and optimization of stability.
- DNA binding proteins which could be used according to this invention are the DNA-binding portions of the transcription factor NF-kB (p50 and p65), NF-IL6, NF-AT, rel, TBP, the papilloma virus'E2 protein, sp1, the repressors cro and CI from bacteriophage lambda, and like proteins are well known proton whose DNA binding portion has been isolated, cloned, sequenced, and characterized.
- any other DNA-binding protein or portion of a protein that is necessary and sufficient to bind to a TBR hybrid or a BBR is included.
- proteins or portions of wild-type proteins with altered DNA binding activity as well as protein created with altered DNA-binding specificity, such as the exchange of a DNA-binding recognition helix from one protein to another.
- proteins which exhibit nucleic acid biding and other nucleic acid functions such as restriction endonucleases, could be used as the nucleic acid binding function.
- Proteins which bind to target regions in DNA-RNA hybrids as well as RNA-RNA hybrids are included. (See, for example, Shi 1995, DeStefano 1993, Zhu 1995, Gonzales 1994, Salazar 1993, Jaishree 1993, Wang 1992, Roberts 1992, Kainz 1992, Salazar 1993(b)).
- the binding assemblies may be constructed with the use of a molecule which chaperones portions of the binding assembly so that specific component combinations and geometries can be achieved.
- This molecule is designated here as a PILOT. Pilots can be comprised of proteins or any combination of organic and inorganic materials which achieve the combinatorial selection and/or to induce specific geometries between members of the TBA or BBAs.
- a chaperone is a stable scaffold upon which a TBA or BBA may be constructed such that the correct conformation of the TBA or BBA is provided while at the same time eliminating undesirable properties of a naturally occurring nucleic acid binding protein.
- NF-kB pleiotropic transcription factor
- a modified version of the pleiotropic transcription factor, NF-kB is provided using a modified bacteriophage lambda cro protein as the chaperone.
- Each NF-kB binding dimer retains the picomolar binding affinity for the NF-kB binding site while at the same time the binding assembly presents several advantageous manufacturing, stability, and specificity characteristics.
- the PNAs of the present invention comprise at least three principal parts joined together.
- the first part of the PNA is one or more sequences of bases, designated “1 ⁇ 2 TBR”.
- the 1 ⁇ 2 TBR in the PNA is complementary to a sequence of interest in a sample, the TNA containing a 1 ⁇ 2 TBR.
- the TNA when added to the PNA under hybridizing conditions, forms a PNA-TNA hybrid containing a TBR.
- the second part of the PNA is a sequence of bases, designated “1 ⁇ 2 BBR”.
- the 1 ⁇ 2 BBR in the PNA is complementary to a 1 ⁇ 2 BBR contained in a BNA or a HNA.
- the BNA or HNA when added to the PNA under hybridizing conditions, forms a PNA-BNA hybrid or PNA-HNA hybrid, respectively, containing a BBR.
- the third part of the PNA is the OSA, designated by a circle with a box around it.
- the OSA is no support and/or an indicator, or solid support, or other means of localization, including but not limited to, attachment to beads, polymers, and surfaces and/or indicators which is/are covalently attached to, or non-covalently, but specifically, associated with the PNA.
- the OSA may be an atom or molecule which aids in the separation and/or localization such as a solid support binding group or label which can be detected by various physical means including, but not limited to, adsorption or imaging of emitted particles or light.
- Methods for attaching indicators to oligonucleotides or for immobilizing oligonucleotides to solid supports are well known in the art (see Keller and Manak, supra, herein incorporated by reference).
- the PNA of the present invention can be prepared by any suitable method. Such methods, in general, will include oligonucleotide synthesis and cloning in a replicable vector. Methods for nucleic acid synthesis are well-known in the art. When cloned or synthesized, strand purification and separation may be necessary to use the product as a pure PNA. Methods of preparing RNA probes are well known (see for example Blais 1993, Blais 1994, which uses in vitro transcription from a PCR reaction incorporating a T7 RNA polymerase promoter).
- PNA length and specific sequence of the PNA will be understood by those skilled in the art to depend on the length and sequence to be detected in a TNA, and the strictures for achieving tight and specific binding of the particular TBA to be used (see discussion on TBAs below).
- PNAs of sequence lengths between about 10 and about 300 nucleotides in length are adequate, with lengths of about 15-100 nucleotides being desirable for many of the embodiments specifically exemplified herein.
- the PNA may be constructed so as to contain more than one 1 ⁇ 2 TBR and to produce more than one TBR for one or more TBAs, same or different, as well as complex TBRs recognized by novel duplex and multiplex TBAs (see description below regarding these novel TBAs) upon hybridization of the PNAs and TNAs.
- FIG. 5 illustrates specific PNAs which contain one or more 1 ⁇ 2 TBRs. Specific sequences which correspond to the 1 ⁇ 2 TBR sees illustrated in FIG. 5 (Ia, IIa, IIIa, IVa, and Va) are SEQ ID NOS. 1-34 (see Description of Sequences above).
- the PNA containing a 1 ⁇ 2 TBR
- the PNA may be hybridized with one or more BNAs (see description below) and the chain of BNAs polymerized to any desired potential length for amplification of the TNA-PNA hybridization evens.
- BNAs see description below
- the PNA may contain several 1 ⁇ 2 TBRs, same or different, which can hybridize with several 1 ⁇ 2 TBRs in a TNA.
- TBR Target Binding Region
- FIGS. 6 a and 6 b the PNA may contain several 1 ⁇ 2 TBRs, same or different, which can hybridize with several 1 ⁇ 2 TBRs in a TNA.
- a Target Binding Region TBR
- two different PNAs are used to detect sequences on a particular TNA.
- FIG. 7 shows one representation of the human immunodeficiency virus (HIV) long terminal repeat (LTR).
- the HIV LTR comprises two NF-kB binding sites and three SP1 binding sites, in close proximity, wherein NF-kB and SP1 are known DNA binding proteins.
- FIG. 7 provides two PNAs, PNA1 (SEQ ID NO. 38) and PNA2 (SEQ ID NO. 39), each of which is complementary to the opposite strand shown as a TNA (SEQ ID NO. 37), which shows the two NF-kB binding sites and the three SP1 binding sites of the HIV LTR.
- PNA1 specifically hybridizes with that section of the TNA shown in FIG. 7 with bases underscored with a “+” symbol
- PNA2 specifically hybridizes with that section of the TNA shown in FIG.
- each of PNA1 or PNA2 may also contain sequences (indicated by the symbols “#” or “*”) which will hybridize with a BNA's 1 ⁇ 2 BBR sequences (see below).
- each of PNA1 and PNA2 may be differentially tagged with an OSA, such as a fluorophore such as a fluorescein or a rhodamine label, which would allow confirmation that both probes have become bound to the TNA. If only one label or neither label is detected, it is concluded that the TNA is not present in the sample being tested.
- the TBR may have a helical structure.
- PNA1 creates TBRs on one “face” of the helix
- PNA2 creates a TBR on either the same or a different face of the helix, depending on the distance between the middle of each TBR (underlined in FIG. 7). If the middle of each binding site is an integral product of 10.5 bases apart, the TBRs will be on the same side of the helix, while non-integer products of 10.5 bases apart would place the TBRs on opposite sides of the helix.
- any cooperativity in binding by the TBA recognizing the PNA1 TBR and the TBA recognizing the PNA2 TBR can be manipulated (see Hochschild, A., M. Ptashne [1986 ] Cell 44:681-687, showing this effect for the binding of bacteriophage lambda repressor to two different operator sites located at different distances from each other in a DNA helix).
- cooperativity between NF-kB and the SP1 sites is required to achieve activation of the HIV LTR.
- the double NF-kB-triple SP1 binding site motif in the HIV LTR may be taken advantage of by providing a single, novel binding protein capable of binding both sites simultaneously, but only if the spacing between the sites is geometrically feasible. This is controlled both by the structure of the selected TBA and by the PNAs used.
- the two probes may be used with a large enough interprobe region of single-stranded DNA remaining such that, even if the NF-kB and SP1 binding sites are on opposite sides of the helix, the single-stranded region between the probes provides a sufficiently flexible “hinge” so that the DNA can both bend and twist to accommodate the geometry of the TBA.
- a more stringent assay may be designed by narrowing the interprobe distance such that the DNA may only bend, but not twist.
- the probes may be so closely spaced or a single PNA used, such that the DNA can only bend but not twist.
- the TNA may be obtained from biopsy materials taken from organs and fluids suspected of containing the cancerous cells.
- the TNA may be obtained from patient samples containing the affected cells.
- the TNA may be obtained from samples taken at various stages in the fermentation or treatment process.
- the TNA sample may be obtained from the food or drug, swabs of food or surfaces in contact with the food, fluids in contact with the food, processing materials, fluids and the like associated with the manufacture of or in contact with the food, drug, or biological samples taken from those in contact with the food or drug or the like.
- BNAs Nucleic Acids
- BBRs Booster Binding Regions
- the BNAs of the present invention are comprised of at least one or more 1 ⁇ 2 BBRs coupled to an OSA.
- the 1 ⁇ 2 BBRs can hybridize to complementary 1 ⁇ 2 BBRs contained in the PNA, other BNAs or an HNA.
- the simplest BNA is comprised of two parts.
- the first part of the simplest BNA is a sequence of bases which is complementary to the sequence in the PNA which is designated “1 ⁇ 2 BBR”.
- the second part of the simplest BNA is the OSA, designate by a circle with a box around it.
- the OSA is no support and/or indicator, or solid support, or other means of localization, including but not limited to, attachment to beads, polymers, and surfaces and/or indicators which are covalently attached to, or non-covalently, but specifically, associated with the BNA.
- the BNA may contain more than one 1 ⁇ 2 BBR sequence.
- the BNA illustrated in FIG. 3(II) contains a sequence which is complementary to the PNA illustrated in FIG. 3(I) and two other 1 ⁇ 2 BBR sequences.
- the BNA illustrated in FIG. 3(III) contains two 1 ⁇ 2 BBR sequences which are complementary to two of the 1 ⁇ 2 BBR sequences in the BNA illustrated in FIG. 3(II), plus up to “n” additional 1 ⁇ 2 BBRs for polymerization of additional BNAs.
- the BNA illustrated in FIG. 3(II) when combined with the PNA illustrated in FIG. 3(I), creates the PNA-BNA hybrid illustrated in FIG. 3(IVa) containing a BBR and an unhybridized extension with two additional 1 ⁇ 2 BBR sequences or “booster” sequences.
- the BBRs created by said hybridization can be identical, similar or dissimilar in sequences.
- the BBRs created by said hybridization can bind identical, similar or dissimilar BBAs (see below).
- the BNAs may have prepared analogously to the PNAs.
- the BNA-BNA hybrid illustrated in FIG. 3(IVb) when combined with the PNA illustrated in FIG. 3(Vb), creates the PNA-BNA hybrid illustrated in FIG. 3(VI) containing a BBR, two additional BNA-BNA hybrids containing BBRs, and an unhybridized extension with an additional 1 ⁇ 2 BBR sequence, a “booster” sequence.
- the BBRs created by said hybridization can be identical, similar or dissimilar in sequence.
- the BBRs created by said hybridization can bind identical, similar or dissimilar BBAs (see below).
- the BNAs may be prepared in a fashion analogous to preparation of the PNTAs.
- TAAs Target Nucleic Acids
- the first step in detecting and amplifying signals produced through detection of a particular TNA according to the present method is the hybridization of such target with the PNA in a suitable mixture. Such hybridization is achieved under suitable conditions well known in the art.
- the sample suspected or known to contain the intended TNA may be obtained from a variety of sources. It can be a biological sample, a food or agricultural sample, an environmental sample and so forth
- the TNA may be obtained from a biopsy sample, a body fluid or exudate such as urine, blood, mill; cerebrospinal fluid, sputum, saliva, stool lung aspirates, throat or genital swabs and the like.
- detection may be in situ (see for example Embretson 1993; Patterson 1993; Adams 1994).
- PNAs specific to vertebrates may be envisioned and used according to the instant method: Corynebacteria Corynebactenum diphtheria Bacillus Bacillus thuringiensis Pneumococci Diplococcus pneumoniae uz,1/6 Streptococci Streptococcus pyogenes Streptococcus salivarius Staphylococcus Staphylococcus aureus Staphylococcus albus Pseudomonas Pseudomonas stutzen Neisseria Neisseria meningitidis Neisseria gonorrhea Enterobacteriaceae Escherichia coli Aerobacteria aerogenes Klebsiella pneumoniae The coliform bacteria Salmonella typhosa Salmonella choleroesuis The Salmonellae Salmonella typhimurium Shigellae dysenteria
- Actinomycetes (fungus-like bacteria) Actinomyces isaeli Actinomyces bovis Actinomyces naeslundii Nocardia asteroides Nocardia brasiliensis The Spirochetes Treponema pallidum Treponema per pneumonia Treponema carateum Spirillum minus Streptobacillus moniliformis Borrelia recurrens Leptospira icterohemorrhagiae Leptospira canicola Trypanasomes Mycoplasmas Mycoplasma pneumoniae Other pathogens Listeria monocytogenes Erysipelothrix rhusiopathiae Streptobacillus moniliformis Donvania granulomatis Bartonella bacillformis Rickettsiae (bacteria-like parasites) Rickettsia prowazekii Rickettsia mooser
- Retroviruses Human immunodeficiency viruses HBV Human T-cell lymphotrophic virus I & II (HTLV) Hepatitis Hepatitis A virus Hepatitis B virus Hepatitis nonA-nonB virus Hepatitis, C, D, E Tumor viruses Rauscher leukemia virus Gross virus Maloney leukemia virus Human papilloma viruses
- Fragmentation of nucleic acid samples containing TNAs is usually required to decrease the sample viscosity and to increase the accessibility of the TNAs to the PNAs. Such fragmentation is accomplished by random or specific means known in the art Thus, for example, specific nucleases known to cut with a particular frequency in the particular genome being analyzed, may be used to produce fragments of a known average molecular size. In addition, other nucleases, phosphodiesteses, exonucleases and endonucleases, physical shear and sonication are all methods amenable for this purpose. These processes are well known in the art. The use of restriction enzymes for the purpose of DNA fragmentation is generally preferred.
- DNA can also be fragmented by a variety of chemical means such as the use of the following types of reagents: EDTA-Fe(II) (according to Stroebel et al. [1988 ] J. Am. Chem. Soc. 110:7927; Dervan [1986 ] Science 232:464); Cu(II)-phenanthroline (according to Chen and Sigman [1987 ] Science 237:1197); class IIS restriction enzyme (according to Kim et al. [1988 ] Science 240:504); hybrid DNAse (according to Corey et al. [1989 ] Biochem. 28:8277); bleomycin (according to Umezawa et al.
- the TNAs of the present invention should be long enough so that there is a sufficient amount of double-stranded hybrid flanking the TBR so that a TBA can bind unpertrbed by the unligated fragment ends.
- fragments in the range of about 10 nucleotides to about 100,000 nucleotides, and preferably in the range of about 20 nucleotides to about 1,000 nucleotides are used as the average size for TNA fragments.
- TNA sequences that could be detected are sequences complementary to the PNA sequences described herein for detection of normal cellular, abnormal cellular (as in activated oncogenes, integrated foreign genes, genetically defective genes), and pathogen-specific nucleic acid sequences, for which specific nucleic acid binding proteins are known, or which can be produced according to methods described in this disclosure.
- a specific HIV-related TNA is shown as SEQ ID) NO. 37.
- BNAs can be added that hybridize to the PNAs, PNA-BNA hybrids, BNAs and/or BNA-BNA hybrids.
- the aforementioned additions can be made in a non-vectorial polymeric fashion or in a vectorial fashion, with a known order of BNAs.
- a booster polymer is produced by adding two BNAs, illustrated in FIG. 2 a (Ib and Ic), which when combined under hybridizing conditions with the PNA, form PNA-BNA-BNA hybrids, comprised of the PNA and “booster extensions”, illustrated in FIG. 2 a (IIa,IIb,IIc and IId) leaving at least one unpaired 1 ⁇ 2 BBR sequence.
- Each unpaired 1 ⁇ 2 BBR sequence, illustrated in FIG. 2 a (IIa, IIb, IIc, IId) can hybridize with additional BNAs to form additional “booster” extensions.
- FIG. 2 a (IIa,IIb,IIc and IId) can hybridize with added HNAs, illustrated in FIG. 2 a (IIIa and IIIb).
- the hybridization of the HNAs which cannot hybridize additional BNAs, acts to “cap” the addition of the BNAs onto the PNA, as illustrated in FIG. 2 a (IVa, IVb, IVc and IVd).
- Vectorial polymer extension is accomplished by adding a BNA which is specific for the PNA, as illustrated in FIG. 2 b (Ia and IIa), which when combined under hybridizing conditions with the PNA, form PNA-BNA-BNA hybrids, comprised of the PNA and “booster” extensions.
- BNA-BNA-BNA hybrids comprised of the PNA and “booster” extensions.
- BNAs Any of a number of methods may be used to prepare the BNAs, including, e.g., synthesis via known chemistry or via recombinant DNA production methods. In the latter method, an essentially unlimited number of BNAs may be produced simply and inexpensively, for example, by production in prokaryotes ( E. coli for example) of a plasmid DNA having multiple repeats of the specific BNA sequences flanked by restriction sites having overhanging ends.
- prokaryotes E. coli for example
- the bacteriophage lambda left or right operator sites or any other DNA or other nucleic acid sequence known to specifically and tightly bind a particular BBA, such as a DNA or RNA binding protein, may be produced in an essentially unlimited number of copies, with each copy flanked by an EcoRI, PstI, BamHI or any of a number of other common restriction nuclease sites.
- a polymer at repeated sites may be excised by unique restriction sites not present within the polymer.
- pBR322 Large quantities of pBR322, pUC plasmid or other plasmid having multiple copies of these sequences are produced by methods well known in the art, the plasmid cut with the restriction enzyme flanking the polymerized site, and the liberated multiple copies of the operators isolated either by chromatography or any other convenient means known in the art.
- the BNA prior to use, is then strand separated and is then amenable for polymerization onto a PNA encoding a single stranded complementary copy of the operator as a 1 ⁇ 2 BBR.
- the BNAs may be polymerized vectorially onto the PNA by using different restriction enzymes to flank each repeat of the polymer in the plasmid used to produce multiple copies of the BNA.
- the BNA polymer may be hybridized to the PNA via overhangs at one or both ends of the BNA polymer, without the need to strand separate and anneal each BNA segment.
- Examples of specific BNA sequences are provided above in the section entitled Description of Sequences, as SEQ ID NOS. 35-36.
- DNA ligase may be used to covalently link the hybridized BNAs.
- HNAs Hairpin Nucleic Acids
- the HNAs of the present invention comprise at least two principal parts joined together: A single-stranded sequence, which is complementary to a 1 ⁇ 2 BBR, and a double-stranded nucleic acid region formed, under hybridizing conditions, by the self-association of self-complementary sequences within the HNA.
- the 1 ⁇ 2 BBR in the HNA may be constructed so as to be complementary to the 1 ⁇ 2 BBR sequence in the PNA.
- the aforementioned HNA when added to the PNA under hybridizing conditions, forms a PNA-HNA hybrid containing a BBR
- a PNA-HNA hybrid under hybridizing conditions, upon addition of the TNA, can form a TNA-PNA-HNA hybrid containing a TBR and a BBR.
- the HNAs can be used to “cap” or terminate the addition of BNA extensions to the PNA.
- the two BNAs in FIG. 2 a (Ib and Ic) can associate to form the hybrid shown in FIG. 3(IVb) or can hybridize directly and individually to the PNA as illustrated in FIG. 2 a (Ia-c, IIa-d).
- the two HNAs (shown in FIG. 2 a (IIIa and IIIb)) can terminate the hybridization of the BNA to other BNAs which extend from the PNA, as illustrated in FIG. 2 a (IVa-d).
- the HNA in FIG. 2 a (IIIa) can terminate the PNA-BNA hybrids shown in FIG.
- the HNA in FIG. 2 a (IIIb) can terminate the PNA-BNA hybrids shown in FIG. 2 a (IIa and IIc) and any PNA-BNA hybrid with two single stranded 1 ⁇ 2 BBRs which are complementary to the 1 ⁇ 2 BBRs in the HNA illustrated in FIG. 2 a (IIIb).
- HNAs are constructed that will terminate PNA-BNA hybrids which are constructed from the sequential addition of BNAs to the PNA as illustrated in FIG. (2b).
- the single stranded 1 ⁇ 2 BBR sequences illustrated in FIG. 2 b (Ia, IIIa, Va, and VIIa) are specifically complementary to the single stranded 1 ⁇ 2 BBR sequences illustrated in FIG. 2 b (Ib,IIIb,Vb and VIIb) and produce the unique capped PNA-BNA-HNA hybrids illustrated in FIG. 2 b (Ic,IIIc,Vc and VIIc).
- the self-complementary sequences in the HNA and the loop sequence which links the self-complementary hairpin sequences can be of any composition and length, as long as they do not substantially impede or inhibit the presentation of the single-stranded 1 ⁇ 2 BBR that comprises part of the HNA by the HNA or selectively bind the BBA or the TBA.
- the loop sequences should be selected so that formation of the loop does not impede formation of the hairpin.
- An examples of an HNA useful in this application is provided as SEQ ID NO. 44 (see Description of Sequences above).
- TSAs Target Binding Assemblies
- a TBA may be any substance which binds a particular TBR formed by hybridization of particular TNAs and PNAs, provided that the TBA must have at least the following attributes:
- the TBA must bind the TBR(s) in a fashion that is highly specific to the TBR(s) of interest. That is, the TBA must discriminate between TBRs present in the TNA-PNA hybrid and similar duplex sequences formed by PNA-CNA hybrids.
- the TBA must bind the PNA-CNA hybrid with a sufficiently low avidity that upon washing the TBA-TNA-PNA complex, the PNA-CNA hybrid is displaced and the PNA-TNA hybrid is not displaced;
- the TBA must avidly bind the TBR(s) created by the hybridization of the TNA with the PNA. Binding affinities in the range of 10 ⁇ 5 to about 10 ⁇ 12 or higher are generally considered sufficient. As noted below, in some instances, it might be desirable to utilize a particular TBA which has a very low avidity for a particular TBR, but which has a greatly increased affinity when a particular configuration of multiple TBRs is provided so that the square of the affinity of the TBA for each TBR becomes the affinity of relevance to that particular TBA.
- Examples of the DNA binding components useful in the formation of TBAs include, but are not limited to NF-kB, papillomavirus E2 protein, transcription factor SP1, inactive restriction enzymes, antibodies, etc. Each of these proteins has been recognized in the art to contain sequences which bind to particular nucleic acid sequences and the affinities of these interactions are known. Naturally, the method of the instant invention is not limited to the use of these known DNA binding proteins or fragments thereof. From the instant disclosure, it would be apparent to one of ordinary skill that the instant method could easily be applied to the use of novel TBAs exhibiting at least the required attributes noted above.
- a sequence specific DNA binding molecule comprising an oligonucleotide conjugate formed by the covalent attachment of a DNA binding drug to a triplex forming oligonucleotide was described.
- the method of that disclosure could be used to produce novel TBAs for use according to the instant disclosure, provided that the TBAs thus formed meet the criteria described above.
- the methods of U.S. Pat. Nos. 5,096,815, 5,198,346, and WO88/06601, herein incorporated by reference may be used to generate novel TBAs for use according to the method of this invention. Specific antibodies or portions thereof could be used (see for example Blais 1994).
- the TBA is a protein, or a complex of proteins
- any of a number of methods routine in the art may be used to produce the TBA.
- the TBA may be isolated from its naturally occurring environment in nature, or if this is impractical, produced by the standard techniques of molecular biology.
- this binding assembly could be produced according to recombinant methods known in the art (see for example Ghosh [1990 ] Cell 62:1019-1029, describing the cloning of the p50 DNA binding subunit of NF-kB and the homology of that protein to rel and dorsal).
- DNA and other nucleic acid binding proteins are known which can be used as or in TBAs according to this invention.
- an appropriate DNA sequence encoding the protein can either be prepared by synthetic means, or a cDNA copy of the mRNA encoding the protein from an appropriate tissue source can be used.
- genomic copies encoding the protein may be obtained and introns spliced out according to methods known in the art.
- the TBAs may be chemically synthesized.
- site-directed mutagenesis may be used to alter the amino acid sequence encoded to produce mutant nucleic acid binding proteins exhibiting more desirable binding characteristics than those of the original nucleic acid binding protein.
- the amino acid sequence of the DNA binding portions of NF-kB can be altered so as to produce an NP-kB′ molecule which more tightly binds the NF-kB binding site (see examples below—HIV-Detect and HIV-Lock).
- NF-kB NF-kB
- a TBA may be prepared using the naturally occurring NF-kB molecule.
- this molecule is present in vanishingly small quantities in cells, and because the subunits of this DNA binding protein have been cloned, it would be more reasonable to prepare large quantities of the complex via recombinant DNA means as has already been accomplished for this protein (see for example Ghosh [1990] Cell 62:1019-1029).
- NF-kB is a pleiotropic inducer of genes involved in immune, inflammatory and growth regulatory responses to primary pathogenic (viral, bacterial or stress) challenges or secondary pathogenic (inflammatory cytokine) challenges.
- NF-kB is a dimeric DNA binding protein comprising a p50 and a p65 subunit, both of which contact and bind to specific DNA sequences. In an inactivated state, NF-kB resides in the cellular cytoplasm, complexed with a specific inhibitor, I-kB, to form a cytoplasmic heterotrimer.
- the inhibitor Upon activation, the inhibitor is decomplexed, and the p50-p65 dimer relocate via a specific nuclear localization signal (NLS) to the cell's nucleus where it can bind DNA and effect its role as a transcriptional activator of numerous genes (see Grimm and Baeuerle [1993 ] Biochem. J. 290:297-308, for a review of the state of the art regarding NF-kB).
- NLS nuclear localization signal
- the p50-p65 dimer binds with picomolar affinity to sequences matching the consensus GGGAMTNYCC (SEQ ID NO. 117), with slightly different affinities depending on the exact sequence. It is worth noting that homodimers of p50 and p65 have also been observed to occur. These homodimers display different biochemical properties as well as slightly different affinities of binding sequences within and similar to the above consensus. Thus, depending on the desired binding characteristics of the TBA, a p50-p65 heterodimer, a p50-p50 homodimer, or a p65-p65 homodimer or fragments of the aforementioned dimers may be used.
- the nucleic acid recognition units of the TBA may be assembled and associated with similar or dissimilar TBA nucleic acid recognition units via a “chaperone.”
- the chaperone is a structure on which the various TBA recognition elements are built and which confers desirable properties on the nucleic acid recognition units.
- the chaperone is comprised of any sequence which provides assembly sequences such that same or different nucleic acid recognition units are brought into close and stable association with each other.
- a TBA is assembled by providing lambda cro sequences as assembly sequences, linked to the nucleic acid binding sequences for either NF-kB p50 or p65.
- the p50 or p65 nucleic acid binding sequences are linked to the cro sequences at either the carboxy or amino terminus of cro and either the carboxy or amino terminus of the nucleic acid recognition unit of the p50 or p65.
- Linking sequences are optionally provided to allow appropriate spacing of the nucleic acid recognition units for optimal TBR binding.
- the assembly sequences comprise any stable oligopeptides which naturally and strongly bond to like sequences.
- cro it is well known that a dimer of cro binds to the bacteriophage lambda operator sites (Anderson et al. [1981 ] Nature 290:754-758; Harrison and Aggarwal [1990 ] Ann. Rev. Biochem. 59:933-969).
- the monomer units of cro tightly and specifically associate with each other.
- by linking DNA recognition unit sequences to the cro sequences close and tight association is achieved.
- the optional linker sequences comprise any amino acid sequence which does not interfere with TBA assembly or nucleic acid binding, and which is not labile so as to liberate the nucleic acid recognition unit from the complete TBA. It is desirable but not necessary that the linker sequences be covalently linked to other binding assembly components. The association should be specific so as to aid in the assembly and manufacture of the binding assemblies. Examples of such sequences include, but are not limited to, such well known sequences as are found linking various domains in structural proteins. Thus, for example, in the lambda repressor protein, there is a linking sequence between the DNA binding domain and the dimerization domain which is useful for this purpose.
- the novel TBAs of this invention optionally have asymmetry or PILOT TNA sequences and one or more OSA units.
- the asymmetry sequences are provided to encourage or prevent certain desirable or undesirable associations.
- the asymmetry sequences are provided to disrupt the naturally stronger association of NF-B p50 subunits and p65 subunits, while not disrupting the assembly sequences from bringing together p50 subunits. Examples of such sequences are provided herein as SEQ ID NOS. 85-92 and SEQ ID NOS. 105 and 106.
- NF-kB p50 subunit sequences are brought into close association with transcription factor SP1 DNA recognition unit sequences. This is desirable in the event that an NF-kB/SP1 binding motif is of significance, as in the HIV LTR where a motif of at least six DNA binding protein recognition sites, two NF-kB, three SP1, and a TATA site are mown to exist. Since it is also known that the second NF-kB and first SP1 site are significant to regulation of HIV transcription (Perkins et al. [1993 ] Embo J. 12:3551-3558), this particular configuration of TBA is useful not only the detection of HIV, but as a therapeutic or prophylactic against HIV infection (see below). In a similar fashion, the long control region (LCR) of human papillomavirus may be used as a key control region for probing according to this method.
- LCR long control region
- TBA TATA sequence DNA binding protein
- the modular structure shown in FIG. 9 is adapted to detection and or treatment or prophylaxis of a completely different pathogen.
- FIG. 11 in a similar fashion to the above described “HIV-detect I-IV” molecules, a series of “HPV-Detect I-IV” molecules is produced.
- advantage is taken of the DNA binding properties of the E2 protein of human papillomavirus (HPV).
- HPV human papillomavirus
- SP1 and TBP are taken advantage of by providing specific DNA recognition units adapted to bind to these sequences in the HPV genome.
- a TBA containing a bovine E2 dimer and a human E2 dimer DNA binding domain may be particularly useful.
- the various sequences described above may either be chemically linked using pure oligopeptide starting materials, or they may be line through provision of recombinant nucleic acids encoding, via the well known genetic code, the various subelements.
- liking cro coding sequences to sequences of nucleic acid recognition units to form TBAs is advantagous because not only does cro act as assembly sequences in the chaperone, it also acts to direct the proper folding of the nucleic acid recognition elements.
- Exemplary sequences for chaperones are provided herein as SEQ ID NOS. 104-108.
- proper design of the asymmetry sequences allows such structures to be made.
- TBAs are prepared which bind to their cognate binding sites with high affinity.
- the NF-kB DNA binding components of the TBAs of FIG. 10 are expected to bind to the HIV-LTR with an affinity of between about 10 ⁇ 8 and 10 ⁇ 12 molar.
- Sequences useful as the DNA recognition units are provided as SEQ ID NOS. 63-71, 73-84, 93-98, and 104-108 and exemplified further below.
- a DNA binding protein structure may be assembled by linking an NF-kB p50 subunit to an antigen, such as a circulated (through disulfide bonds) melanocyte stimulating hormone (MSH).
- an antigen such as a circulated (through disulfide bonds) melanocyte stimulating hormone (MSH).
- MSH melanocyte stimulating hormone
- FIG. 9 The modular structure provided by FIG. 9 reveals that a great variety of TBAs may be assembled using different combinations of components.
- representative embodiments of this general structure are provided as SEQ ID NOS. 109-116.
- a BBA may be any substance which binds a particular BBR formed by hybridization of particular PNAs and BNAs, including when multiple BNAs (up to and including “n” BNAs, i.e., BNA n , wherein “n” is theoretically 0- ⁇ , but practically is between about 0 and 100) are polymerized onto the PNA for signal amplification, provided that the BBA must have at least the following attributes:
- the BRA must bind the BBRs in a fashion that is highly specific to the BBR of interest. That is, the BBA must discriminate between BBRs present in the PNA-BNA hybrid and similar duplex sequences in BNA-CNA hybrids or other CNAs. Thus, where even a single base mismatch or conformational differences with or without base mismatches occur in the production of the PNA-BNA n or PNA-BNA n -HNA hybrid, the BBA must bind the hybrid with a sufficiently low avidity that upon washing the TBA-TNA-PNA-BNA complex, the BBA is displaced from the CNA sequences but not the BBR sequences.
- the BBA must avidly bind the BBR(s). Binding affinities in the range of 10 ⁇ 5 to about 10 ⁇ 9 or higher are generally considered sufficient.
- BBAs include, but are not limited to cro, and the bacteriophage lambda repressor protein, CI.
- CI bacteriophage lambda repressor protein
- U.S. Pat. No. 4,556,643 herein incorporated by reference, which suggests other DNA sequences and specific binding proteins such as repressors, histones, DNA modifying enzymes, and catabolite gene activator protein.
- EP 0 453 301 herein incorporated by reference, which suggests a multitude of nucleotide sequence specific binding proteins (NSSBPs) such as the tetacycline repressor, the lac repressor, and the tryptophan repressor.
- NSSBPs nucleotide sequence specific binding proteins
- novel BBAs useful according to this aspect of the invention include novel proteins based on the motif of a known DNA or RNA or DNA:RNA binding protein such as cro or the ⁇ CI repressor protein.
- modifications are made to improve the handling of these components of the invention.
- One of the negative qualities of cro is that at high concentrations, the binding of cro to its DNA target comes into competition with cro-cro interactions.
- a chaperoned or mutated cro may be produced which does not have this shortcoming. Examples of such altered chaperones are SEQ ID NOS. 105-106 and 108.
- the BBA is a protein, or a complex of proteins
- any of a number of methods routine in the art may be used to produce the BBA.
- the BBA may be isolated from its naturally occurring environment in nature, or if this is impractical, produced by the standard techniques of molecular biology.
- the sequence of the cro protein is known and any molecular clone of bacteriophage lambda may be used to obtain appropriate nucleic acids encoding cro for recombinant production thereof.
- the TBAs described herein may be used as BRAs, provided that different TBAs are used to bind TBRs and BBRs.
- the highly specific and extremely tight binding of TBAs comprised of nucleic acid binding components is used to produce an amplifiable nucleic acid sandwich assay.
- a solid support is coated with a first TBA creating an immobilized TBA.
- a PNA and TNA are contacted under hybridizing conditions and then contacted with the immobilized TBA. Only those PNA-TNA interactions which form the specific TBR recognized by the immobilized TBA are retained upon wash-out of the solid surface which binds the TBA-TBR complex
- Detection of the bound TBR is accomplished through binding of Booster Nucleic Acids, BNAs, to the 1 ⁇ 2 BBRs present on the PNAs under hybridizing conditions.
- BNAs Booster Nucleic Acids
- a large, amplified signal may be produced by polymerizing multiple BNAs to the immobilized TNA.
- Each BNA which binds to the TNA forms a BBR which can be bound by BBAs which, like the TBAs immobilized on the solid surface, may be chosen for their very tight and specific binding to particular nucleic acid structures.
- the immobilized TBA may contain the DNA binding portion of NF-kB, which very specifically and tightly binds to NF-kB binding sites formed upon hybridization of the TNA and PNA to form such a site.
- HIV human immunodeficiency virus
- discrimination between these TNAs and CNAs is accomplished by taking advantage of the fact that in the HIV LTR, there are two NF-kB binding sites, followed by three SP1 sites (see, for example, Koken et al. (1992) Virology 191:968-972), while cellular NF-kB binding sites with the same sequences are not found in tandem.
- the TNA contains more than one 1 ⁇ 2 TBR and it is desirable to pursue the therapeutic and prophylactic applications of the TBAs, it may be desirable to use more than one TBA, each with the capacity to bind a TBR in the TNA-PNA complex. In this case, it may be advantageous to select, as components of the TBAs, DNA-binding or RNA-binding domains with lesser affinity for its TBR than the wild-type DNA-binding or RNA-binding domain.
- TBAs which are involved in the binding to the multiple TBRs can either assemble together before binding to their TBRs or assemble together after binding to their TBRs, the individual TBAs will not block the corresponding TBRs in the other genomes than the target genome unless the TBRs are spatially capable of binding the assembled TBA complex.
- One feature of the multimeric assembly of TBAs which is specifically claimed here as part of this invention is that such a multimeric assembly is expected to have a much reduced affinity for a single site within the TNA.
- the TBA complex would be expected to not compete for the binding of any single TBR with the corresponding native proteins in situ but bind tightly to sequences in the PNA-TNA hybrid containing the TBRs for each of the nucleic acid-binding components assembled in the TBA.
- the TBA complex should be assembled and linkers adjusted in the individual TBAs so as to allow the nucleic acid-binding regions contained in the TBA complex to simultaneously reach and bind to these targets.
- the PNA is labeled with an OSA, such as a radionuclide, colored beads, or an enzyme capable of forming a colored reaction product.
- the PNA also may contain at least one 1 ⁇ 2 BBR.
- the 1 ⁇ 2 BBR sequences are chosen so as to be complementary to unique 1 ⁇ 2 BBR sequences in BNAs.
- the 1 ⁇ 2 BBRs may provide hybridizable (that is, single-stranded, complementary) sequences of the left or right bacteriophage lambda operators (see, for example, Ptashne [1982] Scientific American 247:128-140, and references cited therein for sequences of these operators). These may be polymerized onto the PNA 1 ⁇ 2 BBRs in a vectorial fashion (see FIGS.
- the appeal of using the DNA-binding portions of NF-kB protein as the TBA and the NF-kB recognition binding elements as the TBRs is that these elements form an important “control point” for the replication of HIV. That is, it is known that HIV is required to use NF-kB as a critical feature in its replicative life cycle. Similar control points for other pathogens are chosen and used as a basis for detection according to the methods described herein.
- the method of this invention is adaptable to a method and devices using chromatographic test kits described in U.S. Pat. Nos. 4,690,691 and 5,310,650 (the '691 and '650 patents).
- a porous medium was used to immobilize either a TNA or a capture probe, and a solvent was used to transport a mobile phase containing either a labeled PNA, if the TNA was immobilized, or the TNA, if a capture probe was immobile, into the “capture zone”.
- a labeled PNA was chromatographed through the capture zone and any bound label was detected.
- Adapting the instant invention to such a system provides the improvement of using a Target Binding Assembly in the capture zone and therefore, the capture of only perfectly matched TBR sequences or other TBRs representing nucleic acid confirmations specifically bound by the TBA within the TNA-PNA duplexes by virtue of the previously described sensitive discrimination by the TBA between TNAs and CNAs.
- the signal is amplified by adding BNAs or chromatographing BNAs through the capture zone. Finally, the signal may be further amplified by adding BBAs or chromatographing labeled BBAs through the capture zone.
- BNAs or chromatographing labeled BBAs through the capture zone.
- this invention is adaptable for use in such instruments as Abbott Laboratories' (Abbott Park, Ill.) IMx tabletop analyzer.
- the IMx is currently designed to run both fluorescent polarization immunoassay (FPZA, see Kier [1983 ] KCLA 3:13-15) and microparticle enzyme immunoassay (MEZA, see Laboratory Medicine , Vol. 20, No. 1, January 1989, pp. 47-49).
- FPZA fluorescent polarization immunoassay
- MEZA microparticle enzyme immunoassay
- the MEZA method is easily transformed into a nucleic acid detection method using the instant invention by using a TBA as a capture molecule coated onto a submicron ( ⁇ 0.5 ⁇ m on average) sized microparticle suspended in solution.
- the microparticles coated with TBA are pipetted into a reaction cell.
- the IMx then pipettes sample (Hybridized PNA-TNA) into the reaction cell, forming a complex with the TBA. After an appropriate incubation period, the solution is transferred to an inert glass fiber matrix for which the particles have a strong affinity and to which the microparticles adhere.
- BNAs and BBAs are added, or another signal amplification and detection means is used which depends on specific formation of TNA-PNA hybrids.
- the immobilized complex is washed and the unbound material flows through the glass fiber matrix.
- the bound complexes are detected by means of alkaline phosphatase labeled BBAs or otherwise (radioactively, enzymatically, fluorescently) labeled BBAs.
- the fluorescent substrate 4-methyl umbelliferyl phosphate or like reagent may be added.
- the enzyme may be bypassed by directly labeling BBAs with this or a like reagent. In any event, fluorescence or other signal is proportional to the amount of PNA-TNA hybrids present.
- the fluorescence is detected on the surface of the matrix by means of a front surface fluorometer as described by the manufacturer of the IMx.
- a front surface fluorometer as described by the manufacturer of the IMx.
- an improvement of the well known electrophoretic mobility shift assay is conducted as follows (See FIGS. 12 a and 12 b ):
- a sample of DNA is fragmented, either through random cleavage or through specific restriction endonuclease treatment.
- the DNA in the sample is then split into two equal aliquots and a specific TNA is added to the first aliquot but not to the second.
- the first and second aliquot are then electrophoresed in an acrylamide or agarose gel, and the pattern of DNA bands (either visualized through ethidium bromide binding or through being radioactively labeled prior to electrophoresis is then compared for the two aliquots. Fragments of DNA having binding sites to which the TBA is specific are retarded in their migration through the electrophoretic medium. By using an appropriate TBA, any number of DNA or other nucleic acid sequences may be tracked in this fashion.
- fragmented TNA is hybridized with a PNA and fractionated in a fist dimension.
- the fractionated DNA is then reacted with an appropriate TBA and the change in mobility of the DNA fragments is noted. Enhancement of the retardation is possible by adding BBAs as described above. (See, for example, Vijg and references cited therein for known techniques of two (2) dimensional nucleic acid electrophoresis, to which the instant method may be applied).
- a TBA comprising tight and specific binding for the HIV-LTR, by virtue of having an NF-kB p50 and an SP1 DNA recognition unit in close association (see FIG. 10, HIV-Detect II) is useful to bind up the HIV-LTR and thereby prevent transcription from this key element of the HIV genome.
- the unique features of the assembly sequences of the TBA allow recombinant vectors to induce DNA encoding such a TMA into a cell and the proper folding of the expressed sequences.
- the nuclear localization signals of the p50 subunit directs the transport of the TBA to the nucleus where it binds tightly to the LTR of any integrated HIV, effectively shutting the pathogen down.
- a prophylactic mode one that is concerned about potential HIV exposure is administered a sufficient dose of a TBA or a recombinant vector able to express the TBA, so as to lock up any HIV that might have entered the person.
- the use of the TBA is analogous to passive protection with a specific immune globulin.
- NLS sequences are used in place of the OSAs used in the diagnostic mode. Exemplary NLS sequences are provided as SEQ ID NOS.
- the TBA is administered in a pharmaceutically-acceptable carrier, known in the art such as a sterile salt solution or associated with a lysosome or in the form of a recombinant vector, preferably one which directs expression of the TBA in a chosen cell type, or by a protein delivery system.
- a pharmaceutically-acceptable carrier known in the art such as a sterile salt solution or associated with a lysosome or in the form of a recombinant vector, preferably one which directs expression of the TBA in a chosen cell type, or by a protein delivery system.
- a probe nucleic acid comprising:
- a single-stranded sequence, 1 ⁇ 2 TBR which is capable of forming, under hybridizing conditions, a hybrid, TBR, with a 1 ⁇ 2 TBR present in a target nucleic acid (TNA);
- an OSA which is no attached support and/or indicator, or an attached support or other means of localization, including, but not limited to, attachment to beads, polymers, and surfaces, and/or indicators;
- TBR is capable of binding with high affinity to a TBA
- said TBA being a substance capable of discriminating between a paired TBR and a TBR having unpaired nucleotides
- said BBR is capable of binding with high affinity to a BBA
- said BBA being a substance capable of discriminating between a paired BBR and a BBR having unpaired nucleotides.
- TBRs which are nucleic acid binding protein recognition sites, such as the HIV LTR, and other nucleic acid binding protein recognition sites in other pathogens, some of which are noted above.
- the PNA of this embodiment of the invention may produce a TBR which is a nucleic acid binding protein recognition site present in the genome of a pathogen or is a binding site associated with a pathogenic condition in the human genome or a contaminant in a fermentation process.
- a booster nucleic acid (BNA) comprising:
- a 1 ⁇ 2 BBR which has a sequence which is complementary to a 1 ⁇ 2 BBR sequence in a PNA or another BNA already hybridized to the PNA and which is capable of forming, under hybridizing conditions, a hybrid, BBR, with the PNA;
- an OSA attached support or other means of localization including, but not limited to, attachment to beads, polymers, and surfaces, and/or indicators;
- said BBR is capable of binding with high affinity to a BBA, said BBA being a substance capable of discriminating between a paired BBR and a BBR having unpaired nucleotides.
- a Hairpin Nucleic Acid comprising a single-stranded sequence, 1 ⁇ 2 BBR, which under hybridizing conditions is capable of forming a hairpin while at the same time binding to a BNA to form a BBR capable of binding a BBA, wherein said BBR is capable of binding with high affinity to a BBA, said BBA being a substance capable of discriminating between a paired BBR and a BBR having unpaired nucleotides.
- a method for detecting a specific TNA sequence comprising the steps of:
- step (d) adding BBAs to the mixture in step (c) wherein said BBA comprises:
- This method includes the use of a protein indicator, including enzymes capable of catalyzing reactions leading to production of colored reaction products. It also includes indicators such as a radionuclide or colored beads.
- a method for detecting the presence in a sample of a specific Target Nucleic Acid, TNA which comprises:
- a method for detecting or localizing specific nucleic acid sequences with a high degree of sensitivity and specificity which comprises:
- step (b) binding the TBRs formed in step (a) to an immobilized TBA to form a TBA-TNA-PNA complex;
- step (c) adding Booster Nucleic Acids, BNAs, containing booster binding regions, 1 ⁇ 2 BBRs, to the complex formed in step (b) such that the 1 ⁇ 2 BBRs in the BNAs hybridize with the 1 ⁇ 2 BBR sequences present in the PNAs or to 1 ⁇ 2 BBRs present in BNAs already bound to the PNA, to form BBRs, such that TBA-TNA-PNA-(BNA) n complexes are formed;
- step (d) adding Hairpin Nucleic Acids, HNAs, containing 1 ⁇ 2 BBR sequences, to the complex formed in step (c) such that the 1 ⁇ 2 BBRs in the HNAs hybridize with any available 1 ⁇ 2 BBR sequences present in the BNAs of the complex of step (c), thereby capping the extension of the BNAs onto the TBA-TNA-PNA-(BNA) n complexes of step (c) to form TBA-TNA-PNA-(BNA) n -HNA complexes;
- step (f) detecting the signals produced by the indicator moieties inked to the TBAs, PNAs, BNAs, BBAs or HNAs in the TBA-TNA-PNA(BNA-BBA) n -HNA complexes of step (e);
- the TNA comprises:
- the PNA comprises:
- a single stranded sequence, 1 ⁇ 2 TBR which is capable of forming, under hybridizing conditions, a hybrid, TBR, with a 1 ⁇ 2 TBR present in a target nucleic acid (TNA);
- an OSA which is no attached support and/or indicator, or an attached support or other means of localization, including, but not limited to, attachment to beads, polymers, and surfaces, and/or indicators;
- the BNA comprises:
- a 1 ⁇ 2 BBR as shown in FIG. 1(IIb), which has a sequence which is complementary to a 1 ⁇ 2 BBR sequence in a PNA and which is capable of forming, under hybridizing conditions, a hybrid, BBR, with the PNA;
- an OSA attached support or other means of localization including, but not limited to, attachment to beads, polymers, and surfaces, and/or indicators;
- the BBA comprises:
- the TBA comprises:
- An improvement to a solid phase hybridization method for detecting the presence of a target polynucleotide involving: immobilizing a target polynucleotide, if present in a test sample, directly or via an intermediate capture structure, on a solid phase at a capture site; before, during or after said immobilization, attaching a detectable label to said target polynucleotide, if present; and detecting said label, if any, at said capture site; the improvement comprising:
- TBA Target Binding Assembly
- a target binding assembly comprising one or more nucleic acid recognition units, linker sequence(s), assembly sequence(s), asymmetry sequence(s), nuclear localization sign sequence(s) (NLS) and OSA(s).
- the nucleic acid recognition unit may be an NF-kB binding unit, an SP1 binding unit, a TATA binding unit, a human papillomavirus binding unit, an HIV LTR binding unit, or a binding unit for any other fragment of specific sequence the detection of which is desirable and which can be achieved through specific association with the TBA.
- recognition units include, but are not limited to those exemplified herein as SEQ ID NO. 63, SEQ ID NO. 64, SEQ ID NO. 65, SEQ ID NO.
- Linker sequences such as oligopeptides which do not interfere with the nucleic acid recognition function of the nucleic acid recognition unit and which provide stability and control over the spacing of the nucleic acid recognition unit from the remainder of the TBA. Examples of such link sequences are well known in the art and include, but are not limited to oligopeptide sequences from the interdomain primary sequence of a structural protein. Assembly sequences include oligopeptide sequences which direct the folding and association of nucleic acid recognition units.
- a preferred example of such sequences are oligopeptides derived from the bacteriophage lambda cro protein.
- the asymmetry sequence directs the association of nucleic acid recognition and assembly sequences in a predetermined order.
- Such asymmetry sequences are exemplified by sequences derived from insulin, relaxin, gonadotropic hormone, FSH, HCG, LH, ACTH, including but not limited to SEQ ID NOS. 85-92.
- SEQ ID NO. 85 is an “A” and SEQ ID NO. 86 is a “B” sequence
- SEQ ID NO. 87 is an “A”
- SEQ ID NO. 88 is a “B” sequence SEQ ID NO.
- the TBA may contain nuclear localization signal sequences, NLS, which direct the migration and uptake of a protein or complex associated with said NLS into the nucleus of a cell. Examples of such NLS sequences are provided as SEQ ID NOS. 72 and 103.
- NLS sequences are provided as SEQ ID NOS. 72 and 103.
- Preferred embodiments of the TBA include but are not limited to HIV Detect I-IV or HPV Detect I-IV, and SEQ ID NOS. 109-116.
- Methods of using the novel TBAs of this invention include, but are not limited to a method of using the TBA to bind a particular nucleic acid sequence in a target nucleic acid sample which comprises:
- the probe nucleic acid in addition to sequences complementary to said particular nucleic acid sequence of interest, also may have additional sequences to which a booster nucleic acid can bind to form a booster binding site to which a labeled booster binding assembly can bind to provide a signal showing and amplifying the binding of the probe nucleic acid to the target nucleic acid sequence of interest.
- An additional aspect of this invention not requiring fragmentation of Target Nucleic Acid involves administration of the TBA to a patient in need of such treatment of a therapeutically or prophylactically effective amount of said TBA, which comprises administering the TBA, either in the form of a purified protein complex or in the form of a recombinant vector which, upon entry into the patient is able to express the TBA, such that the TBA binds the particular nucleic acid sequence to achieve the desired prophylactic or therapeutic result.
- This may include providing a dosage which can be determined by routine experimentation to be sufficient to prevent establishment of an active infection by a pathogen. Dosages of purified TBAs may be in the range of about 0.001 to 100 mg/kg.
- dosages of the recombinant nucleic acid may be substantially lower, particularly if provided in the form of non-pathogenic viral vector.
- the methods of using the TBAs also include monitoring the shift in mobility of nucleic acids in target nucleic acid samples as a function of the size such that binding of the TBA to a particular fragment in the sample modifies the mobility of the fragment. This aspect of the method provides a useful method of analyzing nucleic acid fragments for particular aberrations, such as might be found associated with metastases.
- a method of assembling multimeric TBAs in vivo which comprises introducing nucleic acids encoding component TBAs into a cell.
- the component TBAs should each contain a nucleic acid recognition unit, assembly sequences, asymmetry sequences, and nuclear localization signal sequences. Linker sequences, optionally included if TBA footprinting experiments indicate the need for such linkers to attain optimal geometry of the multimeric TBA.
- component expressed TBAs are directed to assemble via the included assembly and asymmetry sequences into multimeric TBAs.
- such multimeric TBAs will have the advantage of binding specifically with high affinity to TBRs in a specific target sequence, but not at all or with very low affinity to cousin nucleic acids.
- Probe nucleic acids may be prepared by means well known in the art.
- single stranded polynucleotide PNAs of defined sequence may be prepared via solid phase chemical synthesis according to Merrifield.
- PNAs may be prepared by automated synthesis using commercially available technology, such as resins and machines produced or marketed by Applied Biosystems, ABI, or other manufacturers.
- particular PNA sequences are synthesized in vivo, for example by cloning a duplex PNA into a vector which can replicate in E. coli , large quantities of the duplex PNA may be prepared.
- Multimers of the PNA may be cloned into the vector such that for each mole of vector, several moles of PNA is liberated upon digestion of the vector with a restriction fragment flanking the PNA sequence. Subsequent to synthesis or recombinant production, the PNAs are purified by methods well known in the art such as by gel electrophoresis or high pressure liquid chromatography (HPLC). If the PNA is produced as a duplex, prior to use in a hybridization assay for detection of target nucleic acid sequences the strands of the PNA are separated by heating or other methods known in the art.
- the specific sequence of bases in the PNA is chosen to reflect the sequence to be detected in a TNA, with the proviso that, according to this invention, the PNA contains a 1 ⁇ 2 TBR sequence, which is one that upon hybridization of the PNA and TNA, a TBR is formed.
- the choice of the PNA sequence is amenable to selection by the skilled researcher for any given application.
- the sequence of the HIV LTR is one such sequence, which upon hybridization of a PNA encoding portions of the LTR with TNAs encoding the HIV LTR, TBRs capable of binding the NP-kB or SP1 DNA binding proteins are formed.
- the PNA also may contain a 1 ⁇ 2 BBR.
- This sequence is one which, upon hybridization with a booster nucleic acid, BNA, forms a BBR which is capable of binding a BBA.
- the BBA is preferably a DNA binding protein having high affinity for the BBR sequence.
- hybridization between a PNA having as a 1 ⁇ 2 TBR, SEQ ID NO. 4 and, at the 3′ end of that sequence, a 1 ⁇ 2 BBR sequence shown as SEQ ID NO. 35 is either used without labeling or is labeled with a radioactive isotope such as P 32 , S 35 , or a similar isotope, according to methods known in the art.
- the PNA is bound to a bead of between 0.01 to 10 ⁇ m, which may be colored for easy visual detection.
- This label forms the OSA as described in the specification.
- This probe hybridizes with HIV LTR sequences to form a TBR that binds NF-kB.
- the PNA hybridizes with BNAs having a complementary 1 ⁇ 2 BBR to form a bacteriophage lambda left operator that binds either cro or lambda repressor proteins.
- PNAs are used wherein the 1 ⁇ 2 TBR is any one of SEQ ID NO. 5 or SEQ ID NOS. 7-34, and a 1 ⁇ 2 BBR, such as SEQ ID NO. 35 or SEQ ID NO. 36 is either at the 3′ end or 5′ end of the 1 ⁇ 2 TBR.
- BNAs are prepared and labeled according to methods known in the art. As described in U.S. Pat. No. 4,556,643, herein incorporated by reference (see particularly Example 1), nucleic acid sequences encoding particular nucleic acid binding sequences may be mass produced by cloning into a replicable vector.
- the 1 ⁇ 2 TBR and 1 ⁇ 2 BBR sequences may be co-linearly produced in this fashion, with the distinction, however, that according to the instant invention, the 1 ⁇ 2 TBR sequence itself forms a nucleic acid binding component recognition site and the 1 ⁇ 2 BBR, while forming a nucleic acid binding component recognition site, also provides a means of amplifying the signal produced upon binding of the 1 ⁇ 2 TBR to complementary sequences in the TNA by providing for polymerization of BNAs onto the TNA bound PNA.
- a sequence such as SEQ ID NO. 35, which encodes the left operator of bacteriophage lambda, is provided with additional sequences such that an overhang sequence is created on one or both ends of the BNA upon hybridization with the PNA.
- vectorial polymerization of BNAs onto a TNA is provided by SEQ ID NOS. 40-43.
- SEQ ID NO. 40 encodes two 1 ⁇ 2 TBRs which will hybridize with two 1 ⁇ 2 TBRs in a TNA to form two NT-kB binding sites, while at the same time providing a bacteriophage lambda left operator 1 ⁇ 2 BBR, which additionally is terminated at the 3′ end with the recognition site for the restriction enzyme PstI.
- Addition of the BNA, SEQ ID NO. 41, with the 1 ⁇ 2 BBR complementary to the 1 ⁇ 2 BBR on the PNA SEQ ID NO.
- SEQ ID NO. 42 which has a four base pair sequence at the 3′ end which is complementary to the four-base overhang remaining from the hybridization of SEQ ID NOS. 40 and41.
- SEQ ID NO. 42 is provided with a five base sequence at its 5′ end which forms part of a BamHI recognition site.
- the growing polymer of BNAs is extended further by addition of the BNA SEQ ID NO. 43, which is complementary to SEQ ID NO.
- BNAs may be hybridized extensively so as to greatly amplify the signal of a single PNA-TNA hybridization event.
- the BNAs may be used in an unlabeled form or may be labeled according to methods known in the art and described in Example 1. It will also be appreciated that, rather than produce the BNA polymer by sequential addition of BNAs to the PNA-TNA complex, the BNA polymer may be performed and added directly to the PNA-TNA complex.
- One simple method for performing such a BNA polymer includes the recombinant production of a vector in which multimers of the BNA are provided with a unique restriction site at either end of the polymer.
- This polymer of BNAs containing multiple BBRs is cut out of the vector and hybridizes to a single stranded 1 ⁇ 2 BBR remaining in the PNA upon hybridization of the PNA and the TNA. This is accomplished by providing a single stranded sequence in the PNA complementary to an overhang produced in the BNA polymer when it is excised from the production vector.
- the HNAs of this invention are produced according to methods known in the art for polynucleotide production as described in Examples 1 and 2 for PNAs and BNAs.
- the sequence of the HNA is specifically designed so that a substantial portion of the HNA forms a self-complementary palindrome to form a hairpin, while at the same time, leaving in single stranded form enough bases to be able to hybridize with single stranded sequences in the growing chain of BNAs described in Example 2.
- a HNA of SEQ ID NO. 44 is provided to cap the extension of BNAs onto the PNA in Example 2 after the addition of the BNA, SEQ ID NO. 43. This is accomplished because SEQ ID NO. 44, while having a palindromic sequence that forms a stable hairpin, also has a sequence at the 5′ end of the HNA which completes the BamHI sequence formed by the hybridization of SEQ ID NO. 42 and SEQ ID NO. 43. Naturally, termination of the polymer after addition of only 3 BNAs is for the purpose of simplicity in demonstrating the invention. As described above, his polymerization may be continued essentially indefinitely to amplify the signal of the PNA-TNA hybridization event. Once the HNA hybridizes to the growing chain of BNAs, the polymer is capped and no further extension of the polymer is possible.
- the TBAs and BBAs which may be used according to the instant invention include any substance which can specifically bind to the TBRs and BBRs formed by hybridization of the PNAs, TNAs and BNAs. Use of DNA binding proteins forms one example of such substances.
- the TBA is the dimer of the DNA binding portion of p50
- the BBA is the lambda cro protein.
- These proteins may be produced according to methods known in the art. The genes for both of these proteins have been cloned. Thus, these proteins are recombinantly produced and purified according to methods known in the art. Furthermore, these proteins are labeled, either with a radioisotope, such as radioactive iodine, or with an enzyme, such as beta-galactosidase or horseradish peroidase, or with a fluorescent dye such as fluorescein or rhodamine, according to methods well known in the art.
- a radioisotope such as radioactive iodine
- an enzyme such as beta-galactosidase or horseradish peroidase
- a fluorescent dye such as fluorescein or rhodamine
- either or both of the TBA and BBA may be immobilized on a solid surface such as the surface of a microfiter plate or the surface of a bead, such as a colored bead of diameter anywhere from 0.01 to 10 ⁇ m.
- the labels on the TBAs and BBAs may be the same or different.
- the TBA containing the dimeric p50 DNA binding domain is labeled with rhodamine, while the BBA, cro, is labeled with fluorescein.
- the nucleic acid hybrids, if formed, are contacted with excess labeled TBA and cro.
- the fluorescence of these labels is measured according to known methods and, detection of both signals is indicative of the presence of 1 ⁇ 2 TBR sequences in the TNA.
- the differential signal produced by the fluorescence of the NF-kB and cro is a measure of the degree to which the polymerization of BNAs onto the PNA-TBA hybrid has resulted in amplification of the signal. Amplification from one to over a thousand fold is contemplated according to the method of this invention.
- the PNAs, PNA1, SEQ ID NO. 40 and PNA2, SEQ ID NO. 45 are used in about ten-fold molar excess over the concentration of TNAs in a test sample.
- an isolated duplex HIV LTR wherein one strand of which has the sequence SEQ ID NO. 37, shown in FIG. 7, and the other stand of which is complementary to the sequence shown in FIG. 7, is used as the TNA.
- a duplex isolated CNA is also used in this example, one strand of which has the same sequence as SEQ ID NO. 37, except that, in the first NF-kB binding site shown in FIG. 7, at the center of the binding site, position 1 in FIG. 7, instead of a “T,” there is an “A,” the complementary strand of which therefore mismatches with the SEQ ID NO. 40 PNA at that location
- SEQ ID NO. 40 and SEQ ID NO. 45 are both added to separate reactions, the first containing the above described TNA and the second containing the above described CNA.
- the samples are solubilized in an appropriate hybridization buffer, such as 10 mM Tris (pH 7.5), 1 mM EDTA.
- the samples reheated to about 90° C. for about five minutes to strand separate the duplex TNAs and CNAs in the samples, and then the samples are allowed to cool to allow strands of PNAs, TNAs and CNAs to anneal.
- the SEQ ID NO. 40 PNA is polymerized by addition of BNAs as in Example 2 and the SEQ ID NO. 45 PNA2 probe is polymerized with BNAs staring with Sph1 recognition site overhang.
- the separate samples are added to beads coated with covalently immobilized NF-kB, and the NF-kB is allowed to bind to any TBRs formed in the TNA and CNA samples.
- the samples are washed twice with about three volumes of an appropriate washing buffer, such as 10 mM Tris, pH 7.5, 100 mM NaCl, or another buffer pre-determined not to interfere with NF-kB, or bacteriophage lambda CI repressor protein binding activity.
- an appropriate washing buffer such as 10 mM Tris, pH 7.5, 100 mM NaCl, or another buffer pre-determined not to interfere with NF-kB, or bacteriophage lambda CI repressor protein binding activity.
- the beads are allowed to settle under gravity or by brief centrifugation. This removes any nucleic acids which do not have a perfect NF-kB binding site formed by hybridization of the PNA1 and TNA sequences.
- bacteriophage lambda CI repressor protein labeled with a radioactive isotope such as with radioactive iodine
- an enzyme such as horseradish peroxidase
- the samples are then washed several times (about 3) with several volumes (about 2) of an appropriate washing buffer such as 10 mM Tris, pH 7.5, 100 mM NaCl, or another buffer pre-determined not to interfere with NF-kB, or bacteriophage lambda CI repressor protein binding activity.
- the beads are allowed to settle under gravity or by brief centrifugation.
- the bound label is quantitated by detecting the bound radioactivity, liberated color in an enzymatic assay, color of bound beads, or fluorescence detection.
- an anti-CI antibody can be added and a standard sandwich enzyme linked immunoassay or radioimmunoassay performed to detect bound repressor.
- a negative control background
- all of the foregoing manipulations are carried out in tandem with a sample in which beads are used having no immobilized NF-kB.
- control and CNA containing samples have similarly low signals while the TNA containing sample has a signal well above background.
- Tube containing single stranded HIV PNAs (a mixture of pre-mixed oligonucleotides encoding two NF-10 1 ⁇ 2 binding sites, i.e. a mixture of SEQ. ID. Nos. 7 and 8).
- microtiter plate (item 1) is coated with the solution of recombinantly produced NF-kB (item 2) at a concentration of 1 mg/mL in tris buffered saline overnight at 4° C. with rocking.
- nuclease (item 5) is added to each of tubes A-C using the medicine dropper and the tubes agitated and allowed to sit for 10 minutes.
- step (f) While the hybridization is allowed to occur in step (d), the excess protein is drained from the surface and the microtiter plate, from step (a), and the plate is rinsed with tris buffered saline (tube 10).
- Novel TBAs for use according to the instant invention are prepared as follows:
- NFkB/NF-kB HAV-Detect I
- a nucleic acid encoding any one of SEQ ID NOS. 63-71 or a like NF-kB DNA binding protein is fused, in frame, to a nucleotide sequence encoding an assembly sequence, such as cro, such that the NE-kB DNA recognition sequence is encoded at amino or carboxy terminus of the cro sequence.
- a linker sequence is provided between the NF-kB sequence and the cro sequence.
- a nuclear localization signal sequence such as SEQ ID NO. 72
- asymmetry sequences are optionally provided at the cro terminus unused by the NF-kB recognition sequence. Examples of complete TBAs are shown below.
- NF-kB/SP1 HV-Detect II
- a recombinant coding sequence encoding an NF-kB recognition domain is prepared.
- the coding sequence for the DNA recognition portion of SP1 is included.
- Such a sequence should encode all or a functional part of SEQ ID NO. 73, which is that portion of the SP1 transcription factor exhibiting DNA binding (see Kadonaga et al. [1987 ] Cell 51:1079-1090).
- the NF-kB-encoding vector and the SP1-encoding vector are then co-transfected into an appropriate expression system such as is well known in the art.
- a monomeric NF-kB recognition unit is added to complete the NF-kB recognition dimer after the assembly of the SP1 and NF-kB recognition units by the chaperone.
- the asymmetry sequences prevent the formation of NF-kB or SP1 dimers and direct, instead, the formation of NFkB-SP1 heterodimers (i.e., HIV-Detect II), which are then isolated from the expression system (mammalian or bacterial cells) by known methods.
- an SP1-encoding TBA recombinant is produced.
- a recombinant encoding a TBA having the binding sequence, SEQ ID NO. 74, or like sequence encoding a TATA recognition unit is prepared with asymmetry sequences complementary to those included in the SP1 TBA-encoding construct.
- An SP1-encoding construct is prepared as in (b) above.
- An E2 TBA-encoding construct is prepared by using a sequence encoding any one of SEQ ID NOS. 75-84 and 94-98 which are papillomavirus E2 DNA recognition units (see Hegde et al. [1992] Nature 359:505-512) or like recognition units, is prepared and co-transformed or co-transfected with the SP1 TBA-encoding construct.
- Monomeric E2 recognition unit is added to the complete E2 recognition dimer after the assembly of the E2-SP1 recognition unit by the chaperone.
- the heterodimer HPV-Detect I is isolated according to known methods.
- E2-E2 HPV-Detect II
- an E2 TBA-encoding construct is prepared except that asymmetry sequences are included which permit the formation of E2 dimers.
- the expressed dimers are then isolated by known methods including affinity for a dimeric E2 binding site on a DNA affinity column.
- E2-TATA HPV-Detect III
- E2 and TATA binding TBAs are prepared (respectively), except that asymmetry sequences are included which enhance the formation of heterodimers rather than homodimers. These constructs are then co-expressed and the heterodimers are isolated.
- TBAs for any given pathogen or disease state may be produced by identifying specific DNA binding proteins and forming an expression construct using appropriate linker, assembly, and asymmetry sequences.
- Example 5 In a similar fashion to the assay described in Example 5, a more stringent assay is produced by using the duplex NF-kB-SP1 binding protein prepared according to Example 6. Accordingly, the probes shown in FIG. 7 and used in Example 5 may be lengthened to reduce the interprobe distance and thereby reduce the flexibility of the DNA in the TNA.
- TBAs are produced which are dimers, trimers, tetameres, pentamers, or hexamers of particular DNA recognition units.
- a hexameric TBA is produced by making a first NF-kB p50 dimeric TBA using asymmetry sequences which enable dimer formation.
- the asymmetry sequences enable the tetramerization of the p50 dimer with an SP1-SP1 dimer.
- additional asymmetry sequences direct the hexamerization with a dimer exhibiting nuclear localization sequences. This is accomplished by incorporating, for example, asymmetry sequences from insulin, which in nature forms hexamers.
- This hexamer formation is directed by the sequences, SEQ ID NOS. 85 (A) and 86 (B), 87 (A) and 88 (B), 89 (A) and 90 (B), and 91 (A) and 92 (B) (see FIGS. 13 and 14).
- HIV-Lock Because of the extremely high affinity for the HIV-LTR that can be generated using a multimeric TBA, the compounds having this structure and which can be used for this purpose are referred to herein as “HIV-Lock.”
- An optimal HIV-Lock is defined by footprinting (according to methods well known in the art) TBAs bound to TBRs in the HIV LTR to confirm that the binding affinity of each DNA binding protein contributing to the formation of the multimeric TBA complex is downshifted relative to the affinity for any natural target sequence (i.e. CNAs) from which the DNA binding recognition unit of the TBA is derived. Any concomitant loss in binding affinity for the HIV TBRs is more than compensated for upon formation of the multimer as described below.
- the length and composition of linkers is adjusted to achieve optimal discrimination between target HIV sequences and natural sequences.
- each component TBA will have a low affinity for CNA and TBR sequences
- the multimeric complex will have an extremely high affinity for the now expanded TBR recognized by the multimeric complex (the square of the affinity of each TBR recognized by each component TBA of the multimeric TBA), while still having a low affinity for CNAs.
- other multimeric TBA complexes, aside from HIV-Lock, are prepared.
- TBAs which can be formed in this fashion include the following sequences, which are assembled by linking either the protein subunits or nucleic acid sequences encoding these subunits, as follows: Set Link Sequences from Groups A I + II + III B IV + V + III C IV + III
- groups I-V consist of sequences selected from: Group Selected from Sequences I Any of SEQ ID NOS. 85-92 II Met Ser, linked to any of SEQ ID NOS 104-106, each of which is linked to SEQ ID NO. 99. III SEQ ID NO. 100 linked to any of SEQ ID NOS. 75-84 or 94-98; SEQ ID NO. 101 linked to either SEQ ID NO. 74 or SEQ ID NO. 93; or SEQ 1D) NO. 102 linkedto SEQ ID NO. 74 or SEQ ID NO. 93; or any of SEQ ID NO. 72, 103, 73, or 63-71. IV Any of SEQ ID NOS. 104-106. V SEQ ID NO. 99.
- TBAs are SEQ ID NOS. 109-116, assembled as follows: Set SEQ ID NO. Link SEQ IDS A 109 85 + MetSer + 104 + 99 + 100 + 94 A 110 85 + Met Ser + 104 + 99 + 72 A 111 86 + MetSer + 105 + 99 + 102 + 74 A 112 86 + MetSer + 106 + 99 + 73 A 113 89 + MetSer + 106 + 99 + 63 C 114 106 + 64 C 115 105 + 64 B 116 106 + 99 + 73
- SEQ ID NOS. 114 and 115 sets of these, such as SEQ ID NOS. 114 and 115,will associate with each other but dimers of SEQ ID NO. 114 or 115 will not form due to charge repulsion in the mutated assembly sequences (SEQ ID NO. 104 is cro; SEQ ID NO. 105 is a novel mutated, negatively charged cro, and SEQ ID NO. 106 is a novel mutated, positively charged cro).
- Example 9 In much the same method as used in Example 6, the “HIV-LOCK” produced according to Example 9 is used as the TBA, reagent 2, with similar results.
- HIV-LOCK produced according to Example 9 is formulated as a 1 mg/mL solution in liposomes and injected intravenously into a subject who has been tested and confirmed to be infected with HIV. A dose of it 0.1 mg to 100 mg of “HIV-LOCK”/kilogram body mass is infused over a twenty-four hour period and the concentration of HIV p24 in the patient's serum monitored. The treatment is repeated as often as necessary, such as when elevations in the serum p24 occur.
- a recombinant retroviral or like vector is used to deliver a construct encoding an HIV-LTR binding TBA to an infected patient.
- the vector encodes a chaperone, such as cro, and sequences DNA for binding portions of p50.
- the same vector also encodes a chaperone on which an SP1 TBA folds.
- Asymmetry sequences are provided such that upon co-expression of the p50-TBA and the SP1-TBA in a single HIV infected cell in vivo, an immediate association occurs between these TBAs, while at the same time preventing any association between the DNA binding portion of p50 and endogenous p50 or p65 monomers.
- NLS sequences are also provided in the TBAs so that, upon dimer formation, the TBA immediately relocates to the nucleus of the cell and binds specifically to integrated HIV sequences, thus preventing any transcription from that locus.
- This diagnostic for human papillomavirus takes advantage of the known differential between benign and carcinogenic HPV to provide a test which indicates the susceptibility to malignancy in a patient.
- the papillomaviruses a group of small DNA viruses associated with benign squamous epithelial cell tumors in higher vertebrates.
- HPVs human types of papillomaviruses
- HPV-6, HPV-11, HPV-16, HPV-18, and HPV-33 have been associated with human genital tract lesions.
- PV-6 and HPV-11 DNAs have been found associated with benign lesions of the genital tract.
- HPV-16, HPV-18, and HPV-33 have also been found associated with premalignant and malignant lesions and are transcribed in most cell lines established from cervical carcinomas. HPV-16, HPV-18, and HPV-33 are likely to be only two members of a large set of HPV DNAs associated with malignant human cervical carcinomas.
- HPV DNA has been found in metastases of cervical carcinomas. In malignant cervical lesions, HPV DNA is usually integrated into the human genome, but there may also be extrachromosomal HPV DNA present integration of HPV to form the provirus usually results in the disruption of the viral E2 open reading frame (ORF). Despite disruption of the E2 ORF, and examination of cell lines from several cervical carcinomas has shown transcriptionally active and integrated HPV-16 and HPV-18. When HPV-16 genomes which are present in the human cervical carcinoma cell lines SiHa and CaSki have been examined, there are differences found in the integration of HPV-16.
- the single HPV-16 genome integration occurred at bases 3132 and 3384, disrupting the E1 and E2 ORFs with a deletion of 0.3 kb.
- An additional 50-basepair deletion of HPV-16 DNA resulted in the E2 and E4 OFRs being fused.
- the 5′ portion of the HPV-16 DNA, consisting of the disrupted E2 ORF, is ligated to continuous human right flanking sequences.
- a single additional guanine is detected at nucleotide 1138 in the middle of the E1 ORF. This basepair addition results in the fusion of the E1a and E1b ORFs to a single E1 ORF.
- HPV-16 is available on GenBank as accession number K02718; the complete genome of HPV-33 is available on GenBank as accession number M12732; the complete genome of HPV-18 is available on GenBank as accession number X05015.
- a PNA having SEQ ID NO. 62 is hybridized with SEQ ID NO. 41 to form a BBR which will bind to cro or ⁇ CI repressor as a BBA, leaving a single-stranded portion capable of hybridizing with the TATA site on SEQ ID NO. 61. This added to form a TBR on the 5′ end of the large deletion.
- the TBR is immobilized by a TBA having a TATA binding protein DNA recognition unit.
- the bound fragments are detected by adding BNAs and BBAs as described above.
- the TBP recognition unit used in the TBA for this assay may be chosen, for example, from a sequence such as SEQ ID NO. 70 or SEQ D NO. 93.
- the kinase treated oligonucleotides are annealed to the single-stranded template, and a mutagenic strand is synthesized and ligated according to the MutaGene protocol, with the exception that Sequenase 2.0 provides the polymerase.
- Libraries are screened using both g- 32 P end-labeled nucleotides containing sequences complementary to the introduced mutations and by isolating the plasmid DNA and identifying the mutants by the presence of the introduced restriction site. The mutations are also confirmed by sequencing with a Sequenase kit.
- the HIV-LockTM DNA is cloned into the baculovirus expression system with a polyhedron promotor.
- Sf-9 cells are cultured to a pre-determined density (about 1 ⁇ 10 6 cells/ml, log phase), infected with the baculovirus containing the HIV-LockTM instructions and harvested to recover the recombinant proteins comprising the HIV-LockTM.
- a pre-determined density about 1 ⁇ 10 6 cells/ml, log phase
- cultures are expanded from flasks to spinners and subsequently to bioreactors. Following infection the cells are harvested at 12, 24, 36 and 48 hours for the protein. Indices of viability are monitored throughout the entire process.
- the harvested proteins are first separated from particulates by flow-through ultracentrifugation to facilitate downstream purification.
- the centrifuged product is then sterile filtered. Extracts are then centrifuged at 40,000 rpm at 4° C. for 30 minutes and aliquots are immunoprecipitated with polyclonal rabbit antibody against one of the HIV-LockTM components. Immunoprecipitated proteins are run on an SDS-10% PAGE gel.
- Mobility shift assays are carried out using an oligonucleotide probe comprising elements of the HIV long terminal repeat and fragments containing NFKB binding DNA associated with kappa light chain and microglobulin regulation.
- the oligonucleotide is annealed to its complimentary strand and end-labeled with g- 32 P ATP.
- Footprinting is accomplished by combining small (10 ⁇ 15 M) of radiolabeled HIV LTR DNA with a slightly larger amount of HIV-LockTM in a buffer at room temperature for 10 minutes. Dithiothreitol is added prior to the addition of protein Iron (II), EDTA, hydrogen peroxide and sodium ascorbate are added and the reaction mixture is incubated. A quenching agent is added and the products are analyzed suing denaturing gel electrophoresis. This is done for different concentrations of protein. The resulting gel is imaged using a phosphoimager scanner and the resulting high resolution image file is analyzed to abstract the binding affinity of HIV-LockTMfor the HIV DNA relative to cellular DNA.
- II protein Iron
- EDTA hydrogen peroxide
- sodium ascorbate sodium ascorbate
- TBAs for other organisms and TNAs for sequences within these organisms can be made using the aforementioned method. This method is valid when producing binding assemblies for all nucleic acid TBRs including DNA-DNA, DNA-RNA and RNA-RNA hybrids and combinations of these hybrids.
- target binding assemblies and booster binding assemblies are assembled by identifying nucleic acid binding molecules, and linking the nucleic acid binding portions of the molecules in such a fashion as to achieve TBAs which discriminate between particular target sequences and even closely related sequences.
- One method for identifying the nucleic acid binding molecules involves the following steps:
- Obtaining a biological sample containing the target nucleic acid This could be, for example, an organism or a tissue extract infected with a pathogen.
- nucleic acid motifs are used for either diagnostic or therapeutic purposes wherein the target nucleic acid has more than a single utilizable nucleic acid binding molecule target.
- a complex target binding assembly can be generated which takes advantage of the proximity of different nucleic acid binding molecular motifs to enhance the specificity of the TBA assembled from the individual nucleic acid binding components identified.
- the various nucleic acid binding portions of the nucleic acid binding molecules are then assembled into the complete TBAs as described above, for example, for HIV-LOCKTM.
- any nucleic acid sequence can be specifically identified. Identification of target HIV RNA in a sample is achieved by obtaining a sample of a patient's blood or other biological fluid or extract which may contain the HIV RNA, and testing for the presence of TAR binding sites.
- Tat is a positive regulator of HIV replication which binds to the TAR region of the HIV RNA.
- the smallest naturally occurring, fully active form of HIV-Tat is 72 amino acids in length, SEQ Id. 118 herein. Tat contains at least two functional domains, and transactivates gene expression from the HIV long terminal repeat (HIV LTR).
- Tat binds to an RNA stem loop structure formed from the self-hybridization of sequences in TAR, which is just 5′ to the HIV LTR HIV TAR RNA forms a dinucleotide bulge and two stem-loop structures (Rhim et al. 1994 Virology:202, 202-211).
- the Tat (SEQ. Id. 118) binds to this structure with lower avidity than does Tat variants wherein Ala58 is a threonine or where His65 is an Asp residue. (Derse et al., 1993 Virology: 194,530-536). Utilizing these facts in the instant method is accomplished by:
- TBA contacting a TBA with the sample which identifies a hybrid TAR binding protein sequence and a proximate flanking sequence in the HIV genome.
- the TBA used for this purpose is assembled on cro as the chaperone using Tat as the HIV RNA specific binding molecule.
- the TBA also has an antibody component which recognizes the DNA-RNA hybrid target binding region formed when a probe nucleic acid binds to the HIV LTR RNA.
- the TBA Eliminating any “cross-talk” produced by binding of Tat to the TAR region of the HIV RNA due to such contaminants (cousin RNAs) as the CMV TAR sequence by contacting the reaction with excess Tat variant (either the Ala58 to Thr or the His65 to Asp variants) which bind more avidly. In this way, single binding events due to the TBA binding to a cousin RNAs are competed from the nucleic acid sample by the Tat variant. On the other hand, by appropriately selecting the affinity of the double binding achieved as a result of the antibody and Tat, the TBA is not displaced from true targets. This process is illustrated in FIG. 16. In another aspect of this same method, the TBA could be one in which, rather than using a variant of Tat, an antibody is used which recognizes this nucleic acid segment, and the TBA used is a double antibody TBA.
- a probe nucleic acid may be used which hybridizes with the HIV LTR RNA. Accordingly, a duplex segment of the LTR sp1 sites can be created as part of the target binding region. This region of the HIV RNA flanks the TAR region which is 5′ to the LTR but is in close proximity thereto. A TBA containing Tat and two Sp1 binding units is chaperoned to provide Tat binding to TAR and Sp1 binding to the Sp1 binding sites. Amplification and detection is then carried out by adding appropriate BNAs, BBAs and HNAs. In yet another alternative, PNAs having Seq. ID. 38 and Seq. ID. 39 (see FIG. 7) could be used.
- a TBA which contains one or more Sp1 binding units and an antibody unit which binds to the DNA-RNA hybrid produced from sample RNA and the Seq. Id. 38 PNA.
- Appropriate BNAs, BBAs and HNAs are then added to amplify the signal.
Abstract
This invention is a novel method for detecting and localizing specific nucleic acid sequences in a sample with a high degree of sensitivity and specificity. The method and novel compositions used in the method involve the use of Probe Nucleic Acids, the production of nucleic acid binding regions and the use of nucleic acid Target Binding Assemblies to detect and localize specific Target Nucleic Acids. The detection and localization of the Target Nucleic Acid is accomplished even in the presence of nucleic acids which have similar sequences. The method provides for a high degree of amplification of the signal produced by each specific binding event. In particular, methods and compositions are presented for the detection of HIV and HPV nucleic acid in samples. These methods and compositions find use in diagnosis of disease, genetic monitoring, forensics, and analysis of nucleic acid mixtures. Some of the novel compositions used in the detection method are useful in preventing or treating pathogenic conditions.
Description
- 1. Field of the Invention
- This invention provides a method and compositions for use in binding, detecting, and amplifying the detection of specific Target Nucleic Acid sequences in a sample with fidelity and accuracy, even in the presence of closely related but different nucleic acids. The binding may involve the chaperoning and assembly of specific molecules into Target Binding Assemblies which specifically bind Target Binding Regions formed by the hybridization of Probe Nucleic Acids and Target Nucleic Acid sequences. The amplifying may involve the chaperoning and/or assembly of specific molecules into Booster Binding Assemblies which specifically bind Booster Binding Regions formed by the hybridization of Booster Nucleic Acids with Probe Nucleic Acids, Target Nucleic Acids, or other Booster Nucleic Acids. A method, and compositions, involving Hairpin Nucleic Acids is also provided to enable control of the size of specifically or non-specifically elongated Booster Nucleic Acids and Booster Binding Assemblies used in the amplification. The detecting involves providing one or more detectable labels, including radioactive, light- or fluorescent-emitting, enzymatic, or other detectable or signal-generating molecules, in association with the Probe Nucleic Acid, the Target Binding Assembly, the Booster Nucleic Acid, the Booster Binding Assembly, or the Hairpin Nucleic Acid. A method is presented for isolating nucleic acid fragments from an organism which has TBA component binding sites in order to create a probe nucleic acid and a TBA which is unique for that fragment and/or organism. Therapeutic and prophylactic uses of the Target Binding Assemblies and compositions for such use are also provided.
- 2. Background and Description of Related Art
- There are an increasing number of cases in which it is important to be able to detect nucleic acids containing a specific sequence, hereinafter named Target Nucleic Acids (TNAs), in a sample. It is desirable to be able to detect the TNAs with the smallest number of processing steps, with the simplest components and to the exclusion of other similar but different nucleic acids, hereinafter named Cousin Nucleic Acids (CNAs). It is desirable to be able to detect specific TNAs to the exclusion of any and all CNAs in the detection sample without the necessity of amplification or other post-detection processing.
- There are numerous methods which use immobilized or tagged nucleic acids as probes for TNAs. However, using known methods, it is difficult to discriminate between a TNA bound to the Probe Nucleic Acid (PNA) as opposed to a CNA bound to the PNA. For example, one or more base mismatches between the PNA and a CNA can still result in a CNA-PNA hybridization which is almost indistinguishable from a TNA-PNA hybridization. Thus, hybridization alone is not an optimal indicator that a PNA has hybridized to a unique TNA.
- There are many situations in which a PNA would be used to try to determine whether a TNA was present in a sample which may contain CNAs. Hybridization of the PNA to any CNA in this situation would limit the diagnostic value that the PNA might have for the detection of a TNA, absent additional verification. Furthermore, it is desirable to be able to detect and localize TNAs with low copy numbers in samples which may contain many copies of CNAs, without the necessity of creating additional copies of the TNA. It would also be desirable to be able to confirm the presence of CNAs, independent of the TNAs, without the necessity of separating the CNAs and TNAs in the sample.
- Furthermore, it would be desirable to be able to amplify the signal of even a low frequency hybridization of a particular TNA-PNA. For this purpose, a method of polymers multiple copies of a label, hereinafter referred to as a Booster Nucleic Acid (BNA) onto the TNA-PNA would be desirable.
- The instant invention provides methods and compositions for achieving the foregoing desired objectives. As revealed by the following review, the instant compositions and methods have not been reported or suggested in the art. A general and comprehensive review of the state of art of nucleic acid detection is provided in Keller, H., M. M. Manak (1989)DNA Probes, Stockton Press.
- A method has been reported for detecting base pair mismatches by chemical means in order to determine whether a PNA has hybridized to a CNA rather than to a TNA. In U.S. Pat. No. 4,794,075 to Ford et al., a method for distinguishing fragments of DNA which contain single base mismatches from their perfectly paired homologs is discussed. Single stranded regions within a duplex fragment are modified with carbodiimide, which reacts with unpaired guanine (G) and thymine (T) residues in DNA. Linear duplex DNA molecules do not react, while DNA molecules with single base mismatches react quantitatively. Following reaction with carbodiimide, the DNA molecules are fractionated on high percentage polyacrylamide gels such that modified and unmodified fragments can be distinguished. Ford et al. applied this technique in order to locate and purify DNA sequence differences responsible for phenotype variation and inherited disease. Although this method is useful for following variations in genetic material, it has a large number of steps, it requires costly components, and it does not offer a direct means of determining whether a PNA has hybridized to the TNA exclusive of CNAs in the sample.
- There have been some attempts to assure that at least a portion of the hybridization between the PNA and another nucleic acid is complementary. One method involves the monitoring of transcription products which are produced if the PNA hybridizes to a nucleic acid sufficiently to be transcribed from a promoter site contained in the probe. U.S. Pat. No. 5,215,899 to Dattagupta discloses how specific nucleic acid sequences are amplified through the use of a hairpin probe which, upon hybridization with and ligation to a target sequence, is capable of being transcribed. The probe comprises a single self-complementary sequence which, under hybridizing conditions, forms a hairpin structure having a functional promoter region, and further comprises a single stranded probe sequence extending from the 3′ end of the hairpin sequence. Upon hybridization with a target sequence complementary to the probe sequence and ligation of the 3′ end of the hybridized target sequence to the 5′ end of the hairpin probe, the target sequence is rendered transcribable in the presence of a suitable RNA polymerase and appropriate ribonucleoside triphosphates (rNTPs). Amplification is accomplished by hybridizing the desired TNA sequence with the probe, ligating the TNA to the PNA, adding the RNA polymerase and the rNTPs to the separated hybrids, and allowing transcription to proceed until a desired amount of RNA transcription product has accumulated. That method generally and specifically involves the use of hairpin DNA formed with a single stranded unpaired end to anneal a target sequence. When the target sequence is bound, the production of RNA transcription products is enabled. Thus, the method involves the detection of secondary transcription products rather than the use of a nucleic acid binding assembly to directly immobilize and/or localize a target sequence. A CNA could easily bind to the probe, and the lack of complementarity would not necessarily interfere with the formation of a CNA-PNA hybrid which could then support the production of unwanted transcription products.
- A CNA bound to the PNA might be detected if the lack of complementarity interferes with the susceptibility of the hybrid CNA-PNA pair to be cut by a restriction endonuclease. In U.S. Pat. No. 5,118,605 to Udea and U.S. Pat. No. 4,775,619 to Urdea, novel methods for assaying a nucleic acid analyte were provided, which employ polynucleotides having oligonucleotide sequences substantially homologous to a sequence of interest in the analyte, where the presence or absence of hybridization at a predetermined stringency provides for the release of a label from a support. Various techniques are employed for binding a label to a support, whereupon cleavage of either a single or double strand a label may be released from a support, and the release of the label can be detected as indicative of the presence of a particular polynucleotide sequence in a sample. However, this technique has the shortcoming that a CNA-PNA pair could be cut by the restriction endonuclease, even if there is a mismatch, so long as the mismatch was outside of the endonuclease recognition region. This would lead to failure of the assay to identify a CNA-PNA hybrid.
- Another method uses a branched DNA probe to detect nucleic acids. U.S. Pat. No. 5,124,246 to Urdea et al. discloses liner or branched oligonucleotide multimers useful as amplifiers in biochemical assays which comprise (1) at least one first single-stranded oligonucleotide unit (PNA) that is complementary to a single-stranded oligomucleotide sequence of interest (TNA), and (2) a multiplicity of second single-stranded, oligonucleotide units that are complementary to a single-stranded labeled oligonucleotide. Although amplified sandwich nucleic acid hybridizations and immunoassays using the multimers are described, the method has the limitation that PNA-CNA hybridization could occur and would reset in production of unwanted signal.
- In addition to methods for identification of TNAs, methods have been disclosed for the amplification of this DNA. In U.S. Pat. No. 5,200,314 to Urdea, an analyte polynucleotide strand having an analyte sequence (TNA) is detected within a sample containing polynucleotides by contacting the analyte polynucleotide with a capture probe (PNA) under hybridizing conditions, where the capture probe has a first binding partner specific for the TNA, and a second binding sequence specific for a solid phase third binding partner. The resulting duplex is then immobilized by specific binding between the binding partners, and non-bound polynucleotides are separated from the bound species. The analyte polynucleotide is optionally displaced from the solid phase, then amplified by PCR. The PCR primers each have a polynucleotide region capable of hybridizing to a region of the analyte polynucleotide, and at least one of the primers further has an additional binding partner capable of binding a solid-phase binding partner. The amplified product is then separated from the reaction mixture by specific binding between the binding partners, and the amplified product is detected. Although it is possible to confirm (by PCR) that a particular nucleic acid has hybridized with the PNA, confirmation is expensive and involves multiple steps.
- As for reports that involve the interaction of a double stranded nucleic acid and a DNA-binding protein, a method has been described whereby a sequence of immobilized DNA which contains binding sites for a single protein is used to purify that protein. U.S. Pat. No. 5,122,600 to Kawaguchi et al. discloses a DNA-immobilized microsphere comprising DNA chains having base sequences which specifically bind a particular protein, and a carrier having a particle size of not more than 50 μm and not less tan 0.01 μm which does not adsorb any protein, said carrier and said DNA chains being bound to each other by a chemical bond, and a process for purifying a protein using said microsphere. As this is a purification method for a protein, it does not disclose a method of detection of a TNA nor a method whereby more than one protein is bound to a double stranded nucleic acid for the purposes of detection and localization of specific TNA sequences.
- In EP 0 453 301, a method for detecting a polynucleotide target sequence in a sample was described wherein sequences in a TNA are detected by hybridizing a first and a second PNA to the TNA. Each of said first and second PNAs contained a pre-formed duplex sequence, or a duplex that is formed through chin extension, capable of binding a nucleotide sequence specific binding protein. A method for binding a nucleotide specific binding protein to a duplex formed between a TNA and a PNA only upon formation of a duplex between the PNA and TNA is neither disclosed nor suggested.
- In US. Pat. No. 4,556,643, a method was disclosed for the non-radioactive detection of specific nucleotide sequences in a sample which involved hybridization of a probe containing DNA binding protein specific sequences. However, this disclosure neither taught nor suggested a method for binding a nucleotide specific biding protein to a duplex formed between a TNA and a PNA only upon formation of a duplex between sequences present in the PNA and sequences present in the TNA.
- Disclosed are methods by which specific Target Nucleic Acid (TNA) sequences are detected through the use of Probe Nucleic Acids (PNAs) which, upon hybridization with TNAs, are capable of binding Target Binding Assemblies (TBAs). Each TBA binds at least one specific region of the PNA-TNA hybrid pair, the Target Binding Region (TBR). The TBA is comprised of one or more molecules, one or more of which can bind to TBR sequences in a specific and sequence or conformation dependent manner. The TBA may comprise one or more piloting sequences, called “PILOTS” or “Asymmetry Sequences,” which assemble and constrain the nucleotide binding components of the TBA to specific geometries. The PILOTS act to assemble specific nucleic acid recognition units or other pilots to which specific nucleic acid recognition units are attached into the TBAs in a predetermined fashion. The TBA may also contain one or more molecules which anchor or localize the TBA. Novel TBAs having unique discriminating characteristics which surprisingly render the TBAs useful not only as diagnostic tools but also as prophylactic or therapeutic compounds, are also disclosed. Disclosed are methods and compositions for utilization of the PNAs, TBRs, TBAs, and TBA PILOTS, including their utilization as components of diagnostic and forensic test kits and the utilization of the novel TBAs as prophylactic or therapeutic agents.
- The PNAs, in addition to TNA-specific sequences, may also contain one or more sequences, ½ BBRs, capable of hybridizing with complementary ½ BBRs in Booster Nucleic Acids (BNAs). Through hybridization of added BNAs to the starter ½ BBRs present in the PNAs, extensions of the PNAs are made in the form of PNA-BNA and then BNA-BNA hybrids. These extensions can contain one or more Booster Binding Regions (BBRs). Each BBR is capable of being a Booster Binding Assembly (BBA). The BBA is comprised of molecules, one or more of which can bind to a BBR in a specific and sequence or conformation dependent manner. The BBA may comprise one or more piloting sequences, called “PILOTS” or “Asymmetry Sequences,” which assemble and constrain the nucleotide binding components of the TBA to specific geometries. The PILOTS act to assemble specific nucleic acid recognition units or other pilots to which specific nucleic acid recognition units are attached into the BBAs in a predetermined fashion. The BBA may contain molecules which anchor or localize the BBA or which allow for detection of the bound BBAs and thereby of the TBA-TNA-PNA complexes to which they, in turn, are bound. Disclosed are methods and compositions for utilization of the ½ BBRs, BNAs, BBRs, BBAs, and BBA PILOTS, including their utilization as components of diagnostic and forensic test kits.
- Methods and compositions are disclosed for the use of Hairpin Nucleic Acids (HNAs) as capping structures. The HNAs contain a self-hybridizing region and a single stranded ½ BBR which, under hybridizing conditions, can hybridize directly to the ½ BBRs in the PNAs or the ½ BBRs in BNAs already bound to the PNAs, to terminate the extension of BNAs onto the PNA or onto other BNAs.
- Methods and compositions are disclosed for test procedures and the production of a test kit containing PNAs, TBAs, TBRs, BNAs, BBRs, BBAs and HNAs for the detection, localization and differentiation of specific nucleic acid sequences, including nucleic acid sequences which are found in human cells, in the Human Immunodeficiency Virus (HIV), Human Papillomavirus (HPV), and in other nucleic acid containing systems including viruses and bacteria.
- Accordingly, it is an object of this invention to provide methods and compositions for use in binding, detecting, and amplifying the detection of specific Target Nucleic Acid sequences in a sample with fidelity and accuracy, even in the presence of closely related but different nucleic acid sequences.
- Accordingly, it is an object of this invention to provide methods and compositions for the creation of Target Binding Assemblies which specifically bind Target Binding Regions formed by the hybridization of Probe Nucleic Acids and Target Nucleic Acid sequences.
- Another object of this invention is to provide a method and compositions for the creation of Booster Binding Assemblies which specifically bind Booster Binding Regions formed by the hybridization of Booster Nucleic Acid sequences with Probe Nucleic Acids, Booster Nucleic Acids and Hairpin Nucleic Acids.
- Another object of this invention is to provide a method and compositions containing Hairpin Nucleic Acids which enable the control of the size of specifically or non-specifically elongated Booster Nucleic Acids and Booster Binding Assemblies used in amplification of PNA-TNA hybridization events.
- Another object of this invention is to provide a method and compositions for use in the selection, assembly and or chaperoning of specific molecules, each with nucleic acid binding discriminating capabilities, into Target and Booster Binding Assemblies.
- Another object of this invention is to provide a method and compositions for use in amplifying the detection of Target Binding assemblies bound to Target Binding Regions using Booster Binding Assemblies and Booster Nucleic Acids.
- Another object of this invention is to provide a method and compositions which allow the use of one or more detectable labels, including but limited to radioactive labels, light emitting, fluorescent, enzymatic or other signal generating molecules. These labels are used in association with Probe Nucleic Acids, Target Binding Assemblies, Booster Binding Assemblies, Booster Nucleic Acids or Hairpin Nucleic Acids.
- Another object of this invention is to provide a method for isolating nucleic acid fragments form an organism which has TBA component binding sites in order to create Probe Nucleic Acids and TBAs which are unique for that fragment or organism.
- The following illustrations are contained in FIG. 1: FIG. 1-I is a PNA containing a ½ TBR, which is a single-stranded sequence which is complementary to a TNA and a ½ BBR sequence. FIG. 1-IIa is a TNA to which is added the components of FIG. 1-I, and, under hybridizing conditions, binds the PNA to form the components of FIG. 1-IIIa, a PNA-TNA hybrid containing at least one TBR FIG. 1-IVa is a BNA which is a to the components of FIG. 1-IIIa and, under hybridizing conditions, binds the ½ BBR of FIG. 1-IIIa to form a PNA-BNA hybrid containing a BBR shown in FIG. 1-Va.
- FIG. I-IIb is a BNA which is added the components of FIG. 1-I, and which, under hybridizing conditions, binds the PNA to form the components of FIG. 1-IIIb, a PNA-TNA hybrid containing a BBR. FIG. I-IVb is a TNA to which is added the components of FIG. 1-IIIb and which, under hybridizing conditions, binds the ½ TBR of FIG. 1-IIIb to form a PNA-BNA hybrid containing a TBR shown in FIG. I-Vb.
- FIG. 1-IIc is a HNA which is added to the components of FIG. 1-I and which, under hybridizing conditions, binds the PNA to form the components of FIG. 1-IIIc, a PNA-HNA hybrid containing a BBR. FIG. 1-IVc is a TNA which is added to the components of FIG. 1-IIIc and which, under hybridizing conditions, binds the ½ TBR of FIG. 1-IIIc to form a PNA-BNA hybrid containing a BBR shown in FIG. I-Vc.
- The hybrids which form the TBRs and BBRs are useful in the present invention. The PNAs and BNAs, as indicated in FIG. 1, may contain no attached support and/or indicator (OSA), or an attached support or other means of localization, including, but not limited to, attachment to heads, polymers, and surfaces, and/or indicators.
- FIG. 2a is a diagram of strategies for polymerization of BNAs onto PNAs and capping by HNAs.
- FIG. 2b is a diagram of additional strategies for amplifying PNA-TNA signals via polymerization of BNAs and capping by HNAs.
- FIG. 3 is a diagram showing the use of BNAs containing multiple ½ BBRs per BNA.
- FIG. 4a is a diagram showing the binding of TBAs and BBAs to TBRs and BBRs, and the ability of the TBA to discriminate between TNAs and CNAs. According to this embodiment, if the TBA is immobilized, either on a bead, microtiter plate surface, or any other such surface, only complexes such as complex X would be red and detected, while complexes such as complex XI would not.
- FIG. 4b is a diagram exemplifying events similar to those shown in FIG. 4a but in a slightly different order of occurrence.
- FIG. 5 is a diagram exemplifying PNAs containing between one ½ TBR and no ½ BBR to PNAs containing up to five ½ TBRs and one ½ BBR. The (a) and (b) members of each numeral (I, II, III, IV, V) form a set which, upon hybridization to a TNA, provide TBRs either with ((a) members) or without ((b) members) an available ½ BBR for amplification via hybridization to BNAs having complementary ½ BBRs.
- FIG. 6a is a diagram exemplifying a particular TNA having two ½ TBRs which, upon binding an appropriate PNA, forms two closely associated TBRs capable of binding two TBAs. A ½ BER is also provided for amplification.
- FIG. 6b is a diagram showing the same events as in FIG. 6a except here, a double TBA is used so that discrimination between single TBRs that occur in normal cellular samples may be discriminated from abnormal, double TBRs.
- FIG. 6c is a diagram showing the same scenario as in FIG. 6a except that here, five TBRs are identified in the TNA. Each TBR may be bound to a TBA same or different, and each TBA may be differentially labeled, allowing for confirmation that all five sites are present in the TNA.
- FIG. 6d is a diagram of the same events as in FIG. 6c except here, a double TBA is shown, extending what is shown in FIG. 6b to the use of the double TBA. An example of the TNA shown in item II in FIGS. 6a, 6 b, 6 c and 6 d is HIV single stranded DNA or RNA.
- FIG. 7 shows the HIV LTR as a TNA, and two PNAs, and a strategy for detection of the TNA using the PNAs.
- FIG. 8 is a schematic of one embodiment of the invention wherein a target binding assembly is used to bind a hybrid TNA-PNA, and booster binding assemblies are used to bind polymerized BNAs.
- FIG. 9 is a schematic of a modular TBA in which assembly sequences, linker sequences, and asymmetry sequences are used to chaperone desired nucleic acid recognition units together to form a TBA.
- FIG. 10 shows modular TBAs useful in detection of HIV-specific sequences.
- FIG. 11 shows modular TBAs useful in the detection of human papillomavirus sequences. Each unit of E2 is actually a dimer of the DNA binding portion of E2.
- FIG. 12a is a schematic of TNA fractionation and shift in mobility due to binding of a TBA.
- FIG. 12b is a schematic of TNA fractionation and enhanced shift in mobility due to binding of BBAs in addition to TBAs.
- FIG. 13 shows a detection strategy for deletion sequences; an example of use of this strategy is for a human papillomavirus integration assay.
- FIG. 14 shows assembly of higher order TBAs through use of nucleic acid recognition units, linker, assembly, and asymmetry sequences such that various Target Binding Assemblies specific to binding sites in the HIV LTR are formed.
- FIG. 15 shows assembly of higher order TBAs through use of DNA recognition units, linker, assembly, and asymmetry sequences such that various Target Binding Assemblies specific to binding sites in the HPV genome are formed.
- FIG. 16 shows the discrimination achieved by using a complex TBA and the ability of endogenous competitor target binding molecules to eliminate binding of the TBA to a cousin nucleic acid but not from the TNA which contains the appropriate orientation of more than one site recognized by the TBA.
- FIG. 17 shows the ability of a TBA to specifically be targeted to bind to sites of sequence mismatch and to preferentially bind those sites over cousin sites which do not contain all of the targeted mismatches.
- SEQ ID NO. 1 corresponds to FIG. 5-Ia-1 and shows the class I MHC NF-kB binding site.
- SEQ ID NO. 2 corresponds to FIG. 5 (Ia) and shows the B2-microglobulin NF-kB binding site.
- SEQ ID NO. 3 corresponds to FIG. 5 (Ia) and shows the kappa immunoglobulin NF-kB binding site.
- SEQ ED NO. 4 corresponds to FIG. 5 (Ia) and shows one of the HIV NF-kB binding sites.
- SEQ ID NO. 5 corresponds to FIG. 5 (Ia) and shows one of the HIV NF-kB binding sites.
- SEQ ID NO. 6 corresponds to FIG. 5 (Ia) and shows the c-myc NF-kB binding site.
- SEQ ID NO. 7 corresponds to FIG. 5 (IIa) and shows a double HIV NF-kB binding site.
- SEQ ID NO. 8 corresponds to FIG. 5 (IIa) and shows a double HIV NF-kB binding site.
- SEQ ID NOS. 9-16 correspond to FIG. 5 (IIa) and show a double binding site with one site being an HIV NF-kB binding site, and the other site being an HIV SP1 binding site.
- SEQ ID NOS. 17-18 correspond to FIG. 5 (IIa) and show a double HIV SP1 binding site.
- SEQ ID NOS. 19-31 correspond to FIG. 5 (IIIa) and show a double HIV NF-kB binding site and an HIV SP1 binding site.
- SEQ ID NOS. 32-33 correspond to FIG. 5 (IVa) and show a quadruple binding site where two sites are HIV NF-kB binding sites and two sites are HIV SP1 binding sites.
- SEQ ID NO. 34 corresponds to FIG. 5 VIa) and show a quintuple binding site where two sites are HIV NF-kB binding sites and three sites are HIV SP1 binding sites.
- SEQ ID NO. 35 is an example of a ½ BBR, in this case the OL1, OL2 and OL3 elements of the bacteriophage lambda left operator, including intervening sequences.
- SEQ ID NO. 36 is an example of a ½ BBR, in this case the OR3, OR2 and OR1 elements of the bacteriophage lambda right operator, including intervening sequences.
- SEQ ID NO. 37 is the HIV LTR.
- SEQ ID NO. 38 is a PNA complementary to PNA of the HIV LTR.
- SEQ ID NO. 39 is a PNA complementary to a different PNA of the HIV LTR than SEQ ID NO. 38.
- SEQ ID NO. 40 is a PNA complementary to part of the HIV LTR and it also contains a ½ BBR and an overhang sequence for polymerizing BNAs onto the PNA.
- SEQ ID NO. 41 is a BNA complementary to the SEQ ID NO. 40 ½ BBR.
- SEQ ID NO. 42 is a BNA that will polymerize onto the SEQ ID NO. 41 BNA and which, with SEQ ID NOS. 40 and 41, creates a PstI recognition site.
- SEQ ID NO. 43 is a BNA that is complementary to the SEQ ID NO. 42 BNA and which completes a BamHI recognition site.
- SEQ ID NO. 44 is an HNA which has a BamHI recognition site that will hybridize with the BamHI recognition site created by SEQ ID NOS. 42 and 43 to the growing polymer.
- SEQ ID NO. 45 is a second PNA which, like SEQ ID NO. 40, is complementary to part of the HIV LTR, but not to the same sequences as SEQ ID NO. 40. SEQ ID NO. 45 also encodes a ½ BBR and an overhang which will allow polymerization of BNAs starting with a Sph1 recognition site.
- SEQ ID NOS. 46-62 are human papillomavirus (HPV) specific PNAs which, upon hybridization with HPV sequences, form TBRs which bind HPV DNA binding proteins.
- SEQ ID NOS. 63-71 are NF-kB DNA recognition units for incorporation into TBAs.
- SEQ ID NO. 72 is a nuclear localization sequence.
- SEQ ID NO. 73 is a SP1sequence recognition unit.
- SEQ ID NO. 74 is a TATA binding protein recognition unit.
- SEQ ID NOS. 75-84 are papillomavirus E2 DNA recognition units.
- SEQ ID NOS. 85-92 are asymmetry sequences.
- SEQ ID NO. 93 is an arabidopsis TATA binding protein recognition unit.
- SEQ ID NO. 94 is an HPV-16-E2-1 DNA binding protein recognition unit.
- SEQ ID NO. 95 is an HPV-16E2-2 DNA binding protein recognition unit.
- SEQ ID NO. 96 is an HPV-18-E2 DNA binding protein recognition unit.
- SEQ ID NO. 97 is an HPV-33-E2 DNA binding protein recognition unit.
- SEQ ID NO. 98 is a bovine papillomavirus E2 DNA binding protein recognition unit.
- SEQ ID NOS. 99-102 are exemplary linker sequences.
- SEQ ID NO. 103 is an exemplary nuclear localization signal sequence (NLS).
- SEQ ID NOS. 104-108 are exemplary chaperone sequences.
- SEQ ID NOS. 109-116 are exemplary assembled TBA sequences
- SEQ ID NO. 117 is a consensus NF-kB binding site.
- SEQ ID NO. 118 an HIV Tat amino acid sequence.
Abbreviations single stranded nucleic acid double-stranded nucleic acid binding region on nucleic acid no support or indicators, or solid support, or other means of localization, in- cluding, but not limited to, attachment to beads, polymers, and surfaces, or indicators = OSA BBA booster binding assembly BBR booster binding region BNA booster nucleic acid CNA cousin nucleic acid 1/2 BBR single-stranded region which, when hybridized to the complementary sequence from an HNA or a BNA, can bind a BBA 1/2 TBR single-stranded region of the PNA which, when hybridized to the complementary sequence from a TNA, can bind a TBA OSA optional support or attch- ment, circle with box PNA probe nucleic acid TBA target binding assembly TBR target binding region TNA target nucleic acid HNA Hairpin Nucleic Acid - It should also be understood from the disclosure which follows that when mention is made of such terms as target binding assemblies (TBAs), booster binding assemblies (BBAs), DNA binding proteins, nucleic acid binding proteins or RNA binding proteins, what is intended are compositions comprised of molecules which bind to DNA or RNA target nucleic acid sequences (TNAs) irrespective of the specificity of the category of binding molecules from which they are derived. Thus, for example, a TBA adapted to bind to human immunodeficiency virus sequences may be most similar to an NF-kB transcriptional factor which typically binds DNA sequences. However, as used herein, it will be understood that the TBA may be adapted for optimal use to bind to RNA sequences of a particular sequence composition or conformation.
- The fidelity of the detection method disclosed herein depends in large measure on the selective binding of TBAs and BBAs to particular nucleic acid motifs. It should be understood throughout this disclosure that the basis of TBA and BBA discrimination of TNAs from related sequences (cousin nucleic acids or CNAs) may be the formation of precise probe nucleic acid (PNA)-target nucleic acid (TNA) hybrid segments (PNA-TNA hybrids). However, the basis of discrimination may just as well be the formation of a particular conformation, and may not require the complete absence of mismatched-base pairing in the TNA-PNA hybrid. Accordingly, the basis of TBA or BBA operation should be understood throughout to depend on discrimination of any property unique to the TNA-PNA hybrid as opposed to any properties displayed by any PNA-CNA hybrids that may be formed in a test sample contacted with a given PNA.
- The present invention provides a method for specifically identifying a target nucleic acid (TNA) in a sample through the use of target binding assemblies (TBAs) which incorporate specific nucleic acid binding proteins. By using probe nucleic acids (PNAs) specific to a given TNA sequence, and a TBA which is specific to the duplex target binding region (TBR) formed upon formation of hybrid TNA-PNA sequences, a stable TBA-TNA-PNA complex is formed. By additionally providing specific amplifiable sequences in the PNA, in addition to sequences which specifically contribute to the formation of the TBR recognized by the TBA, the binding of the PNA to the TNA is detected and the detection amplified. For this purpose, any of a number of nucleic acid amplification systems, including polymerase chain reaction, or the use of branched DNA, each branch of which contains a detetable label, may be used. In particular, a novel method of amplification is described herein where the amplifiable portion of the PNA contains sequences onto which booster nucleic acids (BNAs) may be polymerized. Upon formation of each BNA-PNA hybrid, a booster binding region (BBR) is formed to which a booster binding assembly (BBA) binds specifically. If detectably labeled, the BBAs or BNAs provide essentially unlimited amplification of the original TNA-PNA binding event.
- According to this invention, the TNA will be understood to include specific nucleic acid sequences. The TBA will be understood to be any molecular assembly which can specifically and tightly bind to a formed TNA-PNA hybrid. The TBA will contain one or more molecules whose sequences are sufficient to bind to the TBR. Nucleic acid binding domains which are known can either be used directly as components of the TBA or modified according to the teachings provided herein. The most readily available molecules with such sequences are the DNA-binding domains of DNA-binding proteins. Specifically, many DNA or RNA binding proteins are known which can either used directly as the known, unmodified protein, or the TBA may be a nucleic acid binding protein, modified according to the specific teachings provided herein. In the later case, specific modifications that are desirable would include optimization of binding affinities, removal of unwanted activities (such as nuclease activity and reorganization of the TBA in the presence of other molecules with an affinity for components of the TBA), optimization of selectivity of a target sequence over closely related sequences, and optimization of stability.
- Examples of DNA binding proteins which could be used according to this invention are the DNA-binding portions of the transcription factor NF-kB (p50 and p65), NF-IL6, NF-AT, rel, TBP, the papilloma virus'E2 protein, sp1, the repressors cro and CI from bacteriophage lambda, and like proteins are well known proton whose DNA binding portion has been isolated, cloned, sequenced, and characterized. In addition, any other DNA-binding protein or portion of a protein that is necessary and sufficient to bind to a TBR hybrid or a BBR is included. This includes proteins or portions of wild-type proteins with altered DNA binding activity as well as protein created with altered DNA-binding specificity, such as the exchange of a DNA-binding recognition helix from one protein to another. In addition, proteins which exhibit nucleic acid biding and other nucleic acid functions, such as restriction endonucleases, could be used as the nucleic acid binding function. Proteins which bind to target regions in DNA-RNA hybrids as well as RNA-RNA hybrids are included. (See, for example, Shi 1995, DeStefano 1993, Zhu 1995, Gonzales 1994, Salazar 1993, Jaishree 1993, Wang 1992, Roberts 1992, Kainz 1992, Salazar 1993(b)). The binding assemblies may be constructed with the use of a molecule which chaperones portions of the binding assembly so that specific component combinations and geometries can be achieved. This molecule is designated here as a PILOT. Pilots can be comprised of proteins or any combination of organic and inorganic materials which achieve the combinatorial selection and/or to induce specific geometries between members of the TBA or BBAs. A chaperone is a stable scaffold upon which a TBA or BBA may be constructed such that the correct conformation of the TBA or BBA is provided while at the same time eliminating undesirable properties of a naturally occurring nucleic acid binding protein. As a specific example of this embodiment, a modified version of the pleiotropic transcription factor, NF-kB, is provided using a modified bacteriophage lambda cro protein as the chaperone. Each NF-kB binding dimer retains the picomolar binding affinity for the NF-kB binding site while at the same time the binding assembly presents several advantageous manufacturing, stability, and specificity characteristics.
- In view of the foregoing, the various aspects and embodiments of this invention are described below in detail.
- 1. The Probe Nucleic Acids (PNAs) and Their Preparation.
- The PNAs of the present invention comprise at least three principal parts joined together. With reference to FIG. 1(I) of the drawings, the first part of the PNA is one or more sequences of bases, designated “½ TBR”. With reference to FIG. 1(I and IIa) of the drawings, the ½ TBR in the PNA is complementary to a sequence of interest in a sample, the TNA containing a ½ TBR. With reference to FIG. I(IIIa) of the drawings, the TNA, when added to the PNA under hybridizing conditions, forms a PNA-TNA hybrid containing a TBR. With reference to FIG. 1(I) of the drawings, the second part of the PNA is a sequence of bases, designated “½ BBR”. With reference to FIG. 1(I, IIb, IIc, and IVa) of the drawings, the ½ BBR in the PNA is complementary to a ½ BBR contained in a BNA or a HNA. With reference to FIG. 1(IIIb, IIIc, and Va) of the drawings, the BNA or HNA, when added to the PNA under hybridizing conditions, forms a PNA-BNA hybrid or PNA-HNA hybrid, respectively, containing a BBR. With reference to FIG. 1(I) of the drawings, the third part of the PNA is the OSA, designated by a circle with a box around it. The OSA is no support and/or an indicator, or solid support, or other means of localization, including but not limited to, attachment to beads, polymers, and surfaces and/or indicators which is/are covalently attached to, or non-covalently, but specifically, associated with the PNA. The OSA may be an atom or molecule which aids in the separation and/or localization such as a solid support binding group or label which can be detected by various physical means including, but not limited to, adsorption or imaging of emitted particles or light. Methods for attaching indicators to oligonucleotides or for immobilizing oligonucleotides to solid supports are well known in the art (see Keller and Manak, supra, herein incorporated by reference).
- The PNA of the present invention can be prepared by any suitable method. Such methods, in general, will include oligonucleotide synthesis and cloning in a replicable vector. Methods for nucleic acid synthesis are well-known in the art. When cloned or synthesized, strand purification and separation may be necessary to use the product as a pure PNA. Methods of preparing RNA probes are well known (see for example Blais 1993, Blais 1994, which uses in vitro transcription from a PCR reaction incorporating a T7 RNA polymerase promoter).
- The length and specific sequence of the PNA will be understood by those skilled in the art to depend on the length and sequence to be detected in a TNA, and the strictures for achieving tight and specific binding of the particular TBA to be used (see discussion on TBAs below). In general, PNAs of sequence lengths between about 10 and about 300 nucleotides in length are adequate, with lengths of about 15-100 nucleotides being desirable for many of the embodiments specifically exemplified herein.
- It should also be understood that the PNA may be constructed so as to contain more than one ½ TBR and to produce more than one TBR for one or more TBAs, same or different, as well as complex TBRs recognized by novel duplex and multiplex TBAs (see description below regarding these novel TBAs) upon hybridization of the PNAs and TNAs. FIG. 5 illustrates specific PNAs which contain one or more ½ TBRs. Specific sequences which correspond to the ½ TBR sees illustrated in FIG. 5 (Ia, IIa, IIIa, IVa, and Va) are SEQ ID NOS. 1-34 (see Description of Sequences above).
- As shown in FIGS. 2a and 2 b, the PNA, containing a ½ TBR, may be hybridized with one or more BNAs (see description below) and the chain of BNAs polymerized to any desired potential length for amplification of the TNA-PNA hybridization evens. Preferably, between about 0 and about 10 ½ BBRs will be present in the PNA.
- As shown in FIGS. 6a and 6 b, the PNA may contain several ½ TBRs, same or different, which can hybridize with several ½ TBRs in a TNA. Each time a ½ TBR in the PNA matches a ½ TBR in a TNA, a Target Binding Region, TBR, is formed which can bind a TBA. Furthermore, it is not essential that all of the TBRs be on a single, contiguous PNA. Thus, in one embodiment of the invention, two different PNAs are used to detect sequences on a particular TNA. As an illustration of this aspect of the invention, FIG. 7 shows one representation of the human immunodeficiency virus (HIV) long terminal repeat (LTR). As is known in the art, the HIV LTR comprises two NF-kB binding sites and three SP1 binding sites, in close proximity, wherein NF-kB and SP1 are known DNA binding proteins. FIG. 7 provides two PNAs, PNA1 (SEQ ID NO. 38) and PNA2 (SEQ ID NO. 39), each of which is complementary to the opposite strand shown as a TNA (SEQ ID NO. 37), which shows the two NF-kB binding sites and the three SP1 binding sites of the HIV LTR. According to this aspect of the invention, PNA1 specifically hybridizes with that section of the TNA shown in FIG. 7 with bases underscored with a “+” symbol, while PNA2 specifically hybridizes with that section of the TNA shown in FIG. 7 with bases underscored with an “=” symbol. Each of PNA1 or PNA2 may also contain sequences (indicated by the symbols “#” or “*”) which will hybridize with a BNA's ½ BBR sequences (see below). In addition, each of PNA1 and PNA2 may be differentially tagged with an OSA, such as a fluorophore such as a fluorescein or a rhodamine label, which would allow confirmation that both probes have become bound to the TNA. If only one label or neither label is detected, it is concluded that the TNA is not present in the sample being tested.
- In a further aspect of the embodiment shown in FIG. 7, a method for altering the specificity of the instant assay method is shown. By changing the length of the gap between PNA1 and PNA2, such that the region of TNA remaining unhybridized is altered, one practicing this invention is able to alter the discrimination of the assay.
- In order to more clearly exemplify this aspect of the invention, it is necessary to emphasize that the TBR may have a helical structure. Thus, while PNA1 creates TBRs on one “face” of the helix, PNA2 creates a TBR on either the same or a different face of the helix, depending on the distance between the middle of each TBR (underlined in FIG. 7). If the middle of each binding site is an integral product of 10.5 bases apart, the TBRs will be on the same side of the helix, while non-integer products of 10.5 bases apart would place the TBRs on opposite sides of the helix. In this fashion, any cooperativity in binding by the TBA recognizing the PNA1 TBR and the TBA recognizing the PNA2 TBR can be manipulated (see Hochschild, A., M. Ptashne [1986] Cell 44:681-687, showing this effect for the binding of bacteriophage lambda repressor to two different operator sites located at different distances from each other in a DNA helix). As described by Perkins et al. ([1993] EMBO J. 12:3551-3558), cooperativity between NF-kB and the SP1 sites is required to achieve activation of the HIV LTR. However, for the purpose of the instant invention, the double NF-kB-triple SP1 binding site motif in the HIV LTR may be taken advantage of by providing a single, novel binding protein capable of binding both sites simultaneously, but only if the spacing between the sites is geometrically feasible. This is controlled both by the structure of the selected TBA and by the PNAs used. Thus, in the embodiment exemplified in FIG. 7, the two probes may be used with a large enough interprobe region of single-stranded DNA remaining such that, even if the NF-kB and SP1 binding sites are on opposite sides of the helix, the single-stranded region between the probes provides a sufficiently flexible “hinge” so that the DNA can both bend and twist to accommodate the geometry of the TBA. Alternatively, a more stringent assay may be designed by narrowing the interprobe distance such that the DNA may only bend, but not twist. Finally, the probes may be so closely spaced or a single PNA used, such that the DNA can only bend but not twist. Thus, this figure exemplifies and enables the production of detection systems with any given desired degree of discrimination between target nucleic acids having similar sequences, but different juxtapositions of these sequences.
- In terms of a diagnostic or forensic kit for HIV, those skilled in the art would understand that the aforementioned aspects of this invention allow for the tailoring of the components of the diagnostic or forensic kit to match what is known at any given time about the prevalent strains of HIV or another pathogen or disease condition. It will also be appreciated by those skilled in the art that, while detection of HIV infection is not the only utility of the instant invention, due to the mutability of the HIV genome, it is probably one of the most complex test environments for such a diagnostic. It is precisely in such a mutable environment, however, where the flexibility of the instant method, coupled with its ability to discriminate between very closely related sequences, may be most clearly appreciated. In less mutable environments, some of the sophistication to which this invention is amenable need not be utilized. Thus, in a diagnostic kit for papillomavirus infection, all of the discrimination characteristics of the TBA-TBR interaction are available, along with the ability to amplify the signal using the BNAs and BBAs, but a single, simple PNA, such as any one of SEQ ID NOS. 46-62, may be used which identifies unique papillomavirus sequences, which also are known to bind to a TBA such as the papillomavirus E2 protein or truncated DNA binding portions thereof (see Hegde et al. [1992] Nature 359:505-512; Monini et al. [1991] J. Virol. 65:2124-2130).
- In applying the instant method to the detection of a particular TNA for the purposes of assessing whether certain nucleic acids are present which are associated with the progression of melanoma, hepatoma, breast, cervical, lung, colon, prostate, pancreatic or ovarian cancers, the TNA may be obtained from biopsy materials taken from organs and fluids suspected of containing the cancerous cells. For the detection of genetic deficiencies, the TNA may be obtained from patient samples containing the affected cells. For detection of fermentation contaminants and products in the manufacture of food, chemical or biotechnology products or in the bioremediation of wastes, the TNA may be obtained from samples taken at various stages in the fermentation or treatment process. For detection of food or drug pathogens or contaminants the TNA sample may be obtained from the food or drug, swabs of food or surfaces in contact with the food, fluids in contact with the food, processing materials, fluids and the like associated with the manufacture of or in contact with the food, drug, or biological samples taken from those in contact with the food or drug or the like.
- 2. The Booster Nucleic Acids (BNAs). Booster Binding Regions (BBRs) and Their Preparation.
- The BNAs of the present invention are comprised of at least one or more ½ BBRs coupled to an OSA. The ½ BBRs can hybridize to complementary ½ BBRs contained in the PNA, other BNAs or an HNA.
- With referee to FIG. 1(I, IIb and IIIb) of the drawings, the simplest BNA is comprised of two parts. With reference to FIG. I (IIb) of the drawings, the first part of the simplest BNA is a sequence of bases which is complementary to the sequence in the PNA which is designated “½ BBR”. With reference to FIG. 1(IIb) of the drawings, the second part of the simplest BNA is the OSA, designate by a circle with a box around it. The OSA is no support and/or indicator, or solid support, or other means of localization, including but not limited to, attachment to beads, polymers, and surfaces and/or indicators which are covalently attached to, or non-covalently, but specifically, associated with the BNA.
- With reference to FIG. 2a(II and III) of the drawings, the BNA may contain more than one ½ BBR sequence. The BNA illustrated in FIG. 3(II) contains a sequence which is complementary to the PNA illustrated in FIG. 3(I) and two other ½ BBR sequences. The BNA illustrated in FIG. 3(III) contains two ½ BBR sequences which are complementary to two of the ½ BBR sequences in the BNA illustrated in FIG. 3(II), plus up to “n” additional ½ BBRs for polymerization of additional BNAs.
- Under hybridizing conditions, the BNA illustrated in FIG. 3(II), when combined with the PNA illustrated in FIG. 3(I), creates the PNA-BNA hybrid illustrated in FIG. 3(IVa) containing a BBR and an unhybridized extension with two additional ½ BBR sequences or “booster” sequences. The BBRs created by said hybridization can be identical, similar or dissimilar in sequences. The BBRs created by said hybridization can bind identical, similar or dissimilar BBAs (see below). The BNAs may have prepared analogously to the PNAs.
- Under hybridizing conditions, the BNA-BNA hybrid illustrated in FIG. 3(IVb), when combined with the PNA illustrated in FIG. 3(Vb), creates the PNA-BNA hybrid illustrated in FIG. 3(VI) containing a BBR, two additional BNA-BNA hybrids containing BBRs, and an unhybridized extension with an additional ½ BBR sequence, a “booster” sequence. The BBRs created by said hybridization can be identical, similar or dissimilar in sequence. The BBRs created by said hybridization can bind identical, similar or dissimilar BBAs (see below). The BNAs may be prepared in a fashion analogous to preparation of the PNTAs.
- 3. The Target Nucleic Acids (TNAs) and Their Preparation.
- The first step in detecting and amplifying signals produced through detection of a particular TNA according to the present method is the hybridization of such target with the PNA in a suitable mixture. Such hybridization is achieved under suitable conditions well known in the art.
- The sample suspected or known to contain the intended TNA may be obtained from a variety of sources. It can be a biological sample, a food or agricultural sample, an environmental sample and so forth In applying the instant method to the detection of a particular TNA for the purposes of medical diagnostics or forensics, the TNA may be obtained from a biopsy sample, a body fluid or exudate such as urine, blood, mill; cerebrospinal fluid, sputum, saliva, stool lung aspirates, throat or genital swabs and the like. In addition, detection may be in situ (see for example Embretson 1993; Patterson 1993; Adams 1994).
- Accordingly, PNAs specific to vertebrates (including mammals and including humans) or to any or all of the following microorganisms of interest may be envisioned and used according to the instant method:
Corynebacteria Corynebactenum diphtheria Bacillus Bacillus thuringiensis Pneumococci Diplococcus pneumoniae uz,1/6 Streptococci Streptococcus pyogenes Streptococcus salivarius Staphylococcus Staphylococcus aureus Staphylococcus albus Pseudomonas Pseudomonas stutzen Neisseria Neisseria meningitidis Neisseria gonorrhea Enterobacteriaceae Escherichia coli Aerobacteria aerogenes Klebsiella pneumoniae The coliform bacteria Salmonella typhosa Salmonella choleroesuis The Salmonellae Salmonella typhimurium Shigellae dysenteriae Shigellae schmzitzii Shigellae arabinotardo Shigellae flexneri The Shigellae Shigellae boydii Shigellae sonnei Other enteric bacilli Proteus vulgaris Proteus mirabilis Proteus species Proteus morgani Pseudomonas aeruginosa Alcaligenes faecalis Vibrio cholerae Hemophilus-Bordetella group Hemophilus influenza, H ducryi Hemophilus hemophilus Hemophilus aegypticus Hemophilus parainfluenzae Bordetella pertussis Pasteurellae Pasteurella pestis Pasteurella tulareusis Brucellae Brucella melitensis Brucella abortus Brucella suis Aerobic Spore-Forming Bacilli Bacillus anthracis Bacillus subtilis Bacillus megaterium Bacillus cereus Anaerobic Spore-Forming Bacilli Clostridium botulinum Clostridium tetani Clostridium perfringens Clostridium novyi Clostridium septicum Clostridium histolyticum Clostridium tertium Clostridium bifermentans Clostridium sporogenes Mycobacteria Mycobacterium tuberculosis hominis Mycobacterium bovis Mycobacterium avium Mycobacterium leprae Mycobacterium paratuberculosis. Actinomycetes (fungus-like bacteria) Actinomyces isaeli Actinomyces bovis Actinomyces naeslundii Nocardia asteroides Nocardia brasiliensis The Spirochetes Treponema pallidum Treponema pertenue Treponema carateum Spirillum minus Streptobacillus moniliformis Borrelia recurrens Leptospira icterohemorrhagiae Leptospira canicola Trypanasomes Mycoplasmas Mycoplasma pneumoniae Other pathogens Listeria monocytogenes Erysipelothrix rhusiopathiae Streptobacillus moniliformis Donvania granulomatis Bartonella bacillformis Rickettsiae (bacteria-like parasites) Rickettsia prowazekii Rickettsia mooseri Rickeasia rickettsii Rickettsia conori Rickettsia australis Rickettsia sibiricus Rickettsia akari Rickettsia tsutsugamushi Rickettsia burnetti Rickettsia quintana Chlamydia (unclassifiable parasites bacterial/viral) Chlamydia agents (naming uncertain) Fungi Cryptococcus neoformans Blastomyces dermatidis Histoplasma capsulatum Coccidioides immitis Paracoccidioides brasiliensis Candida albicans Aspergillus fumigatus Mucor corymbifera (Absidia corymbifera) Rhizopus oryzae Rhizopus arrhizus Phycomycetes Rhizopus nigricans Sporotrichum schenkii Flonsecaea pedrosoi Fonsecaea compact Fonsecacae dermatidis Cladosporium carrioni Phialophora verrucosa Aspergillus nidulans Madurella mycetomi Madurella grisea Allescheria boydii Phialophora jeanselmei Microsporum gypsum Trichophyton mentagrophytes Keratinomyces ajelloi Microsporum canis Trichophyton rubrum Microsporum adouini Viruses Adenoviruses Herpes Viruses Herpes simplex Varicella (Chicken pox) Herpes zosrer (Shingles) Virus B Cytomegalovirus Pox Viruses Variola (smallpox) Vaccinia Poxvirus bovis Paravaccinia Molluscum contagiosum Picornaviruses Poliovirus Coxsackievirus Echoviruses Rhinoviruses Myxoviruses Influenza (A, B, and C) Parainfluenza (1-4) Mumps virus Newcastle disease virus Measles virus Rinderpest virus Canine distemper virus Respiratory syncytial virus Rubella virus Arboviruses Eastern equine encephalitis virus Western equine encephalitis virus Sindbis virus Chikugunya virus Semliki forest virus Mayora virus St. Louis encephalitis virus California encephalitis virus Colorado tick fever virus Yellow fever virus Dengue virus Reoviruses Reovirus types 1-3 Retroviruses Human immunodeficiency viruses (HIV) Human T-cell lymphotrophic virus I & II (HTLV) Hepatitis Hepatitis A virus Hepatitis B virus Hepatitis nonA-nonB virus Hepatitis, C, D, E Tumor viruses Rauscher leukemia virus Gross virus Maloney leukemia virus Human papilloma viruses - It would be understood by one of skill in the art that it is generally required to treat samples suspected of containing a particular TNA in such a fashion as to produce fragments that can easily hybridize with the PNA. It may be necessary to treat the test sample to effect release of or to extract the TNA for hybridization, such as by exposing blood or other cells to a hypotonic environment, or otherwise disrupting the sample using more vigorous means. When the TNA is thought to be present in double stranded form, it would naturally be desirable to separate the stands to render the TNA hybridizable in single stranded form by methods well known in the art, including but not limited to heating or limited exposure to alkaline conditions which may be neutralized upon addition of the single stranded PNA to allow hybridization to occur. Methods for preparing RNA targets are well known (see Waterhouse 1993, Mitchell 1992).
- Fragmentation of nucleic acid samples containing TNAs is usually required to decrease the sample viscosity and to increase the accessibility of the TNAs to the PNAs. Such fragmentation is accomplished by random or specific means known in the art Thus, for example, specific nucleases known to cut with a particular frequency in the particular genome being analyzed, may be used to produce fragments of a known average molecular size. In addition, other nucleases, phosphodiesteses, exonucleases and endonucleases, physical shear and sonication are all methods amenable for this purpose. These processes are well known in the art. The use of restriction enzymes for the purpose of DNA fragmentation is generally preferred. However, DNA can also be fragmented by a variety of chemical means such as the use of the following types of reagents: EDTA-Fe(II) (according to Stroebel et al. [1988] J. Am. Chem. Soc. 110:7927; Dervan [1986] Science 232:464); Cu(II)-phenanthroline (according to Chen and Sigman [1987] Science 237:1197); class IIS restriction enzyme (according to Kim et al. [1988] Science 240:504); hybrid DNAse (according to Corey et al. [1989] Biochem. 28:8277); bleomycin (according to Umezawa et al. [1986] J. Antibiot. (Tokyo) Ser. A, 19:200); neocarzinostatin (Goldberg et al. [1981] Second Annual Bristol-Myers Symposium in Cancer Research, Academic Press, New York, p. 163); and methidiumpropyl-EDTA-Fe(II) (according to Hertzberg et al. [1982]J. Am. Chem. Soc. 104:313). Removal of proteins, as by treatment with a protease, is also generally desirable and methods for effecting protein removal from nucleic acid samples, without appreciable loss of nucleic acid, are well known in the art.
- The TNAs of the present invention should be long enough so that there is a sufficient amount of double-stranded hybrid flanking the TBR so that a TBA can bind unpertrbed by the unligated fragment ends. Typically, fragments in the range of about 10 nucleotides to about 100,000 nucleotides, and preferably in the range of about 20 nucleotides to about 1,000 nucleotides are used as the average size for TNA fragments. Examples of specific TNA sequences that could be detected are sequences complementary to the PNA sequences described herein for detection of normal cellular, abnormal cellular (as in activated oncogenes, integrated foreign genes, genetically defective genes), and pathogen-specific nucleic acid sequences, for which specific nucleic acid binding proteins are known, or which can be produced according to methods described in this disclosure. With reference to FIG. 7, a specific HIV-related TNA is shown as SEQ ID) NO. 37.
- 4. Extensions to the PNA Using BNAs, Their Preparation, and Signal Amplification.
- Under hybridizing conditions, BNAs can be added that hybridize to the PNAs, PNA-BNA hybrids, BNAs and/or BNA-BNA hybrids. The aforementioned additions can be made in a non-vectorial polymeric fashion or in a vectorial fashion, with a known order of BNAs.
- With reference to FIG. 2a, a simple booster is presented. A booster polymer is produced by adding two BNAs, illustrated in FIG. 2a(Ib and Ic), which when combined under hybridizing conditions with the PNA, form PNA-BNA-BNA hybrids, comprised of the PNA and “booster extensions”, illustrated in FIG. 2a(IIa,IIb,IIc and IId) leaving at least one unpaired ½ BBR sequence. Each unpaired ½ BBR sequence, illustrated in FIG. 2a(IIa, IIb, IIc, IId) can hybridize with additional BNAs to form additional “booster” extensions. Each unpaired ½ BBR sequence, illustrated in FIG. 2a(IIa,IIb,IIc and IId) can hybridize with added HNAs, illustrated in FIG. 2a(IIIa and IIIb). The hybridization of the HNAs, which cannot hybridize additional BNAs, acts to “cap” the addition of the BNAs onto the PNA, as illustrated in FIG. 2a(IVa, IVb, IVc and IVd).
- With reference to FIG. 2b, it is possible to control and specify the order and components of extensions to the PNA. If a single BBR is required, a HNA containing the complementary sequence to the ½ BBR in the PMA is added to the PNA to produce a single BBR and to “cap” any “booster” extensions to the PNA. If additional BBRs are to be added to the PNA, a controlled extension of the PNA can be accomplished.
- With reference to FIG. 2b, a simple booster is presented. Vectorial polymer extension is accomplished by adding a BNA which is specific for the PNA, as illustrated in FIG. 2b(Ia and IIa), which when combined under hybridizing conditions with the PNA, form PNA-BNA-BNA hybrids, comprised of the PNA and “booster” extensions. These extensions, if labeled with an OSA, provide a method for greatly amplifying any signal produced upon binding of a PNA to a TNA in the sample. Furthermore, by binding labeled BBAs to the BBRs in the polymer, additional amplification is achieved.
- Any of a number of methods may be used to prepare the BNAs, including, e.g., synthesis via known chemistry or via recombinant DNA production methods. In the latter method, an essentially unlimited number of BNAs may be produced simply and inexpensively, for example, by production in prokaryotes (E. coli for example) of a plasmid DNA having multiple repeats of the specific BNA sequences flanked by restriction sites having overhanging ends. In this fashion, for example, the bacteriophage lambda left or right operator sites, or any other DNA or other nucleic acid sequence known to specifically and tightly bind a particular BBA, such as a DNA or RNA binding protein, may be produced in an essentially unlimited number of copies, with each copy flanked by an EcoRI, PstI, BamHI or any of a number of other common restriction nuclease sites. Alternatively, a polymer at repeated sites may be excised by unique restriction sites not present within the polymer. Large quantities of pBR322, pUC plasmid or other plasmid having multiple copies of these sequences are produced by methods well known in the art, the plasmid cut with the restriction enzyme flanking the polymerized site, and the liberated multiple copies of the operators isolated either by chromatography or any other convenient means known in the art. The BNA, prior to use, is then strand separated and is then amenable for polymerization onto a PNA encoding a single stranded complementary copy of the operator as a ½ BBR. The BNAs may be polymerized vectorially onto the PNA by using different restriction enzymes to flank each repeat of the polymer in the plasmid used to produce multiple copies of the BNA. Alternatively, the BNA polymer may be hybridized to the PNA via overhangs at one or both ends of the BNA polymer, without the need to strand separate and anneal each BNA segment. Examples of specific BNA sequences are provided above in the section entitled Description of Sequences, as SEQ ID NOS. 35-36. To stabilize the BNA polymer, DNA ligase may be used to covalently link the hybridized BNAs.
- 5. The Hairpin Nucleic Acids (HNAs) and Their Preparation.
- The HNAs of the present invention comprise at least two principal parts joined together: A single-stranded sequence, which is complementary to a ½ BBR, and a double-stranded nucleic acid region formed, under hybridizing conditions, by the self-association of self-complementary sequences within the HNA. With reference to FIG. 1(IIc) of the drawings, the ½ BBR in the HNA may be constructed so as to be complementary to the ½ BBR sequence in the PNA. With reference to FIG. 1(I, IIc and IIIc) of the drawings, the aforementioned HNA, when added to the PNA under hybridizing conditions, forms a PNA-HNA hybrid containing a BBR With reference to FIG. 1(IIIc, IVc and Vc) of the drawings, a PNA-HNA hybrid, under hybridizing conditions, upon addition of the TNA, can form a TNA-PNA-HNA hybrid containing a TBR and a BBR.
- With reference to FIGS. 2a and 2 b, the HNAs can be used to “cap” or terminate the addition of BNA extensions to the PNA. The two BNAs in FIG. 2a(Ib and Ic) can associate to form the hybrid shown in FIG. 3(IVb) or can hybridize directly and individually to the PNA as illustrated in FIG. 2a(Ia-c, IIa-d). The two HNAs (shown in FIG. 2a(IIIa and IIIb)) can terminate the hybridization of the BNA to other BNAs which extend from the PNA, as illustrated in FIG. 2a(IVa-d). The HNA in FIG. 2a(IIIa) can terminate the PNA-BNA hybrids shown in FIG. 2a(IIb and IId) and any PNA-BNA hybrid with a single stranded ½ BBR which is complementary to the ½ BBR in the HNA illustrated in FIG. 2a(IIIa). Similarly, the HNA in FIG. 2a(IIIb) can terminate the PNA-BNA hybrids shown in FIG. 2a(IIa and IIc) and any PNA-BNA hybrid with two single stranded ½ BBRs which are complementary to the ½ BBRs in the HNA illustrated in FIG. 2a(IIIb).
- HNAs are constructed that will terminate PNA-BNA hybrids which are constructed from the sequential addition of BNAs to the PNA as illustrated in FIG. (2b). The single stranded ½ BBR sequences illustrated in FIG. 2b(Ia, IIIa, Va, and VIIa) are specifically complementary to the single stranded ½ BBR sequences illustrated in FIG. 2b(Ib,IIIb,Vb and VIIb) and produce the unique capped PNA-BNA-HNA hybrids illustrated in FIG. 2b(Ic,IIIc,Vc and VIIc).
- The self-complementary sequences in the HNA and the loop sequence which links the self-complementary hairpin sequences can be of any composition and length, as long as they do not substantially impede or inhibit the presentation of the single-stranded ½ BBR that comprises part of the HNA by the HNA or selectively bind the BBA or the TBA. The loop sequences should be selected so that formation of the loop does not impede formation of the hairpin. An examples of an HNA useful in this application is provided as SEQ ID NO. 44 (see Description of Sequences above).
- 6. The Target Binding Assemblies (TBAs) and Their Preparation.
- A TBA may be any substance which binds a particular TBR formed by hybridization of particular TNAs and PNAs, provided that the TBA must have at least the following attributes:
- (a) The TBA must bind the TBR(s) in a fashion that is highly specific to the TBR(s) of interest. That is, the TBA must discriminate between TBRs present in the TNA-PNA hybrid and similar duplex sequences formed by PNA-CNA hybrids. The TBA must bind the PNA-CNA hybrid with a sufficiently low avidity that upon washing the TBA-TNA-PNA complex, the PNA-CNA hybrid is displaced and the PNA-TNA hybrid is not displaced;
- (b) The TBA must avidly bind the TBR(s) created by the hybridization of the TNA with the PNA. Binding affinities in the range of 10−5 to about 10−12 or higher are generally considered sufficient. As noted below, in some instances, it might be desirable to utilize a particular TBA which has a very low avidity for a particular TBR, but which has a greatly increased affinity when a particular configuration of multiple TBRs is provided so that the square of the affinity of the TBA for each TBR becomes the affinity of relevance to that particular TBA.
- Examples of the DNA binding components useful in the formation of TBAs include, but are not limited to NF-kB, papillomavirus E2 protein, transcription factor SP1, inactive restriction enzymes, antibodies, etc. Each of these proteins has been recognized in the art to contain sequences which bind to particular nucleic acid sequences and the affinities of these interactions are known. Naturally, the method of the instant invention is not limited to the use of these known DNA binding proteins or fragments thereof. From the instant disclosure, it would be apparent to one of ordinary skill that the instant method could easily be applied to the use of novel TBAs exhibiting at least the required attributes noted above. Thus, for example, in WO 92/20698, a sequence specific DNA binding molecule comprising an oligonucleotide conjugate formed by the covalent attachment of a DNA binding drug to a triplex forming oligonucleotide was described. The method of that disclosure could be used to produce novel TBAs for use according to the instant disclosure, provided that the TBAs thus formed meet the criteria described above. In addition, the methods of U.S. Pat. Nos. 5,096,815, 5,198,346, and WO88/06601, herein incorporated by reference, may be used to generate novel TBAs for use according to the method of this invention. Specific antibodies or portions thereof could be used (see for example Blais 1994).
- Where the TBA is a protein, or a complex of proteins, it will be recognized that any of a number of methods routine in the art may be used to produce the TBA. The TBA may be isolated from its naturally occurring environment in nature, or if this is impractical, produced by the standard techniques of molecular biology. Thus, using NF-kB as an example, using the DNA binding portions of p50 or p65 subunits, this binding assembly could be produced according to recombinant methods known in the art (see for example Ghosh [1990] Cell 62:1019-1029, describing the cloning of the p50 DNA binding subunit of NF-kB and the homology of that protein to rel and dorsal).
- Many DNA and other nucleic acid binding proteins are known which can be used as or in TBAs according to this invention. Once the amino acid sequence of any DNA, RNA:DNA, RNA or other nucleic acid binding protein is known, an appropriate DNA sequence encoding the protein can either be prepared by synthetic means, or a cDNA copy of the mRNA encoding the protein from an appropriate tissue source can be used. Furthermore, genomic copies encoding the protein may be obtained and introns spliced out according to methods known in the art. Furthermore, the TBAs may be chemically synthesized.
- Once an appropriate coding sequence has been obtained, site-directed mutagenesis may be used to alter the amino acid sequence encoded to produce mutant nucleic acid binding proteins exhibiting more desirable binding characteristics than those of the original nucleic acid binding protein. As an example of this process, the amino acid sequence of the DNA binding portions of NF-kB can be altered so as to produce an NP-kB′ molecule which more tightly binds the NF-kB binding site (see examples below—HIV-Detect and HIV-Lock).
- To provide further insight into this aspect of the invention, the following considerations are to be noted. Using NF-kB as an example, a TBA may be prepared using the naturally occurring NF-kB molecule. However, because this molecule is present in vanishingly small quantities in cells, and because the subunits of this DNA binding protein have been cloned, it would be more reasonable to prepare large quantities of the complex via recombinant DNA means as has already been accomplished for this protein (see for example Ghosh [1990] Cell 62:1019-1029). NF-kB is a pleiotropic inducer of genes involved in immune, inflammatory and growth regulatory responses to primary pathogenic (viral, bacterial or stress) challenges or secondary pathogenic (inflammatory cytokine) challenges. NF-kB is a dimeric DNA binding protein comprising a p50 and a p65 subunit, both of which contact and bind to specific DNA sequences. In an inactivated state, NF-kB resides in the cellular cytoplasm, complexed with a specific inhibitor, I-kB, to form a cytoplasmic heterotrimer. Upon activation, the inhibitor is decomplexed, and the p50-p65 dimer relocate via a specific nuclear localization signal (NLS) to the cell's nucleus where it can bind DNA and effect its role as a transcriptional activator of numerous genes (see Grimm and Baeuerle [1993] Biochem. J. 290:297-308, for a review of the state of the art regarding NF-kB).
- The p50-p65 dimer binds with picomolar affinity to sequences matching the consensus GGGAMTNYCC (SEQ ID NO. 117), with slightly different affinities depending on the exact sequence. It is worth noting that homodimers of p50 and p65 have also been observed to occur. These homodimers display different biochemical properties as well as slightly different affinities of binding sequences within and similar to the above consensus. Thus, depending on the desired binding characteristics of the TBA, a p50-p65 heterodimer, a p50-p50 homodimer, or a p65-p65 homodimer or fragments of the aforementioned dimers may be used.
- One way in which various novel TBAs may be produced for use according to this invention is shown schematically in FIG. 9. The nucleic acid recognition units of the TBA may be assembled and associated with similar or dissimilar TBA nucleic acid recognition units via a “chaperone.” The chaperone is a structure on which the various TBA recognition elements are built and which confers desirable properties on the nucleic acid recognition units. The chaperone is comprised of any sequence which provides assembly sequences such that same or different nucleic acid recognition units are brought into close and stable association with each other. Thus, for example, in the case of a TBA designed to tightly bind NF-kB TBRs, a TBA is assembled by providing lambda cro sequences as assembly sequences, linked to the nucleic acid binding sequences for either NF-kB p50 or p65. The p50 or p65 nucleic acid binding sequences are linked to the cro sequences at either the carboxy or amino terminus of cro and either the carboxy or amino terminus of the nucleic acid recognition unit of the p50 or p65. Linking sequences are optionally provided to allow appropriate spacing of the nucleic acid recognition units for optimal TBR binding.
- The assembly sequences, exemplified above by cro and CI sequences (SEQ ID NOS. 104-108), comprise any stable oligopeptides which naturally and strongly bond to like sequences. Thus, in the case of cro, it is well known that a dimer of cro binds to the bacteriophage lambda operator sites (Anderson et al. [1981] Nature 290:754-758; Harrison and Aggarwal [1990] Ann. Rev. Biochem. 59:933-969). The monomer units of cro tightly and specifically associate with each other. Thus, by linking DNA recognition unit sequences to the cro sequences, close and tight association is achieved.
- The optional linker sequences comprise any amino acid sequence which does not interfere with TBA assembly or nucleic acid binding, and which is not labile so as to liberate the nucleic acid recognition unit from the complete TBA. It is desirable but not necessary that the linker sequences be covalently linked to other binding assembly components. The association should be specific so as to aid in the assembly and manufacture of the binding assemblies. Examples of such sequences include, but are not limited to, such well known sequences as are found linking various domains in structural proteins. Thus, for example, in the lambda repressor protein, there is a linking sequence between the DNA binding domain and the dimerization domain which is useful for this purpose. Many other such sequences are known and the precise sequence thereof is not critical to this invention, provided hat routine experimentation is conducted to ensure stability and non-interference with target nucleic acid binding. Examples of such sequences are provided herein as Met Ser and SEQ IN NOS. 99-102. Insertion of specific, known proteolysis sites into these linkers is also an integral part of this invention. The presence of such sites in the linker sequences would provide manufacturing advantages, allowing different molecules to be assembled on the chaperone scaffold.
- In addition to the nucleic acid recognition units, optional linking sequences, and assembly sequences, the novel TBAs of this invention optionally have asymmetry or PILOT TNA sequences and one or more OSA units. The asymmetry sequences are provided to encourage or prevent certain desirable or undesirable associations. For example, in the event that a TBA having homodimeric p50 DNA recognition units is desired, the asymmetry sequences are provided to disrupt the naturally stronger association of NF-B p50 subunits and p65 subunits, while not disrupting the assembly sequences from bringing together p50 subunits. Examples of such sequences are provided herein as SEQ ID NOS. 85-92 and SEQ ID NOS. 105 and 106.
- In a different configuration, NF-kB p50 subunit sequences are brought into close association with transcription factor SP1 DNA recognition unit sequences. This is desirable in the event that an NF-kB/SP1 binding motif is of significance, as in the HIV LTR where a motif of at least six DNA binding protein recognition sites, two NF-kB, three SP1, and a TATA site are mown to exist. Since it is also known that the second NF-kB and first SP1 site are significant to regulation of HIV transcription (Perkins et al. [1993] Embo J. 12:3551-3558), this particular configuration of TBA is useful not only the detection of HIV, but as a therapeutic or prophylactic against HIV infection (see below). In a similar fashion, the long control region (LCR) of human papillomavirus may be used as a key control region for probing according to this method.
- In view of the different elements that can be associated, cassette fashion, according to this method of TBA formation, an essentially unlimited variety of TBAs are produced. In FIG. 10, a series of different molecules, referred to as “HIV-detect I-IV” are exemplified wherein “CHAP” denotes the chaperone, “nfkb” denotes NF-kB subunits, “sp1” denotes the nucleic acid recognition unit of the SP1 transcription factor, and “TATA” denotes a dimer of the DNA recognition unit of a TATA sequence DNA binding protein (TBP), also known as a TATA binding protein, or TBP. These configurations are further exemplified below and are all integral parts of the instant invention.
- In yet another configuration, the modular structure shown in FIG. 9 is adapted to detection and or treatment or prophylaxis of a completely different pathogen. In FIG. 11, in a similar fashion to the above described “HIV-detect I-IV” molecules, a series of “HPV-Detect I-IV” molecules is produced. In this embodiment, advantage is taken of the DNA binding properties of the E2 protein of human papillomavirus (HPV). In addition, the roles of SP1 and TBP are taken advantage of by providing specific DNA recognition units adapted to bind to these sequences in the HPV genome. In the formation of the E2-specific TBAs for use in detecting HPV infection, it may be desirable to use any of SEQ ID NOS. 75-84 or 93-98 as the E2 DNA recognition units. A TBA containing a bovine E2 dimer and a human E2 dimer DNA binding domain may be particularly useful.
- The various sequences described above may either be chemically linked using pure oligopeptide starting materials, or they may be line through provision of recombinant nucleic acids encoding, via the well known genetic code, the various subelements. In the event of recombinant production, liking cro coding sequences to sequences of nucleic acid recognition units to form TBAs is advantagous because not only does cro act as assembly sequences in the chaperone, it also acts to direct the proper folding of the nucleic acid recognition elements. Exemplary sequences for chaperones are provided herein as SEQ ID NOS. 104-108. Furthermore, in the event that higher order structures comprising multiple binding sites is desired, as in a pentameric NF-kB/NF-kB/SP1/SP1/SP1 TBA, proper design of the asymmetry sequences allows such structures to be made.
- In the foregoing fashion, TBAs are prepared which bind to their cognate binding sites with high affinity. For example, the NF-kB DNA binding components of the TBAs of FIG. 10 are expected to bind to the HIV-LTR with an affinity of between about 10−8 and 10−12 molar. Sequences useful as the DNA recognition units are provided as SEQ ID NOS. 63-71, 73-84, 93-98, and 104-108 and exemplified further below.
- In view of the foregoing description of directed assembly of nucleic acid binding proteins using assembly and asymmetry (or piloting) sequences, those skilled in the art will recognize that a generally applicable method for assembling protein structures is provided by this invention. The generality of this method is demonstrated further by consideration, by way of further example, of the use of an antibody-epitope interaction in the assembly of desired structures. By way of specificity, a DNA binding protein structure may be assembled by linking an NF-kB p50 subunit to an antigen, such as a circulated (through disulfide bonds) melanocyte stimulating hormone (MSH). This pro-MSH molecule may then be bound by an anti-MSH antibody to provide a novel nucleic acid binding assembly, with the antigen and antibody acting as assembly sequences.
- The modular structure provided by FIG. 9 reveals that a great variety of TBAs may be assembled using different combinations of components. Thus, representative embodiments of this general structure are provided as SEQ ID NOS. 109-116.
- 7. The Booster Binding Assemblies (BBAs) and Their Preparation.
- A BBA may be any substance which binds a particular BBR formed by hybridization of particular PNAs and BNAs, including when multiple BNAs (up to and including “n” BNAs, i.e., BNAn, wherein “n” is theoretically 0-∞, but practically is between about 0 and 100) are polymerized onto the PNA for signal amplification, provided that the BBA must have at least the following attributes:
- (a) The BRA must bind the BBRs in a fashion that is highly specific to the BBR of interest. That is, the BBA must discriminate between BBRs present in the PNA-BNA hybrid and similar duplex sequences in BNA-CNA hybrids or other CNAs. Thus, where even a single base mismatch or conformational differences with or without base mismatches occur in the production of the PNA-BNAn or PNA-BNAn-HNA hybrid, the BBA must bind the hybrid with a sufficiently low avidity that upon washing the TBA-TNA-PNA-BNA complex, the BBA is displaced from the CNA sequences but not the BBR sequences.
- (b) The BBA must avidly bind the BBR(s). Binding affinities in the range of 10−5 to about 10−9 or higher are generally considered sufficient.
- Examples of BBAs include, but are not limited to cro, and the bacteriophage lambda repressor protein, CI. In addition, see U.S. Pat. No. 4,556,643, herein incorporated by reference, which suggests other DNA sequences and specific binding proteins such as repressors, histones, DNA modifying enzymes, and catabolite gene activator protein. See also EP 0 453 301, herein incorporated by reference, which suggests a multitude of nucleotide sequence specific binding proteins (NSSBPs) such as the tetacycline repressor, the lac repressor, and the tryptophan repressor. Each of these BBAs has been recognized in the art to bind to particular, known nucleic acid sequences and the affinities of these interactions are known. Naturally, the method of the instant invention is not limited to the use of these known BBAs. From the instant disclosure, one of ordinary skill could easily apply the use of novel BBAs exhibiting at least the required attributes noted above to the instant method.
- Examples of novel BBAs useful according to this aspect of the invention include novel proteins based on the motif of a known DNA or RNA or DNA:RNA binding protein such as cro or the λ CI repressor protein. Preferably, such modifications are made to improve the handling of these components of the invention. Thus, it may be desirable to add a high concentration of cro to an assay. One of the negative qualities of cro is that at high concentrations, the binding of cro to its DNA target comes into competition with cro-cro interactions. Thus, for example, a chaperoned or mutated cro may be produced which does not have this shortcoming. Examples of such altered chaperones are SEQ ID NOS. 105-106 and 108. Methods known in the art, such as production of novel target binding proteins using variegated populations of nucleic acids and selection of bacteriophage binding to particular, pre-selected targets (i.e., so-called phage-display technology, see discussion above for production of novel TBAs) may be used to produce such novel BBAs as well as the aforementioned novel TBAs.
- Where the BBA is a protein, or a complex of proteins, it will be recognized that any of a number of methods routine in the art may be used to produce the BBA. The BBA may be isolated from its naturally occurring environment in nature, or if this is impractical, produced by the standard techniques of molecular biology. Thus, for example, the sequence of the cro protein is known and any molecular clone of bacteriophage lambda may be used to obtain appropriate nucleic acids encoding cro for recombinant production thereof. In addition, the TBAs described herein may be used as BRAs, provided that different TBAs are used to bind TBRs and BBRs.
- 8. The Use of BBAs and BBRs to Localize and Amplify the Localization of the PNA-TNA-TBA Complexes (See FIG. 8).
- In one embodiment of this invention, the highly specific and extremely tight binding of TBAs comprised of nucleic acid binding components is used to produce an amplifiable nucleic acid sandwich assay. According to one aspect of this embodiment, a solid support is coated with a first TBA creating an immobilized TBA. In solution, a PNA and TNA are contacted under hybridizing conditions and then contacted with the immobilized TBA. Only those PNA-TNA interactions which form the specific TBR recognized by the immobilized TBA are retained upon wash-out of the solid surface which binds the TBA-TBR complex
- Detection of the bound TBR is accomplished through binding of Booster Nucleic Acids, BNAs, to the ½ BBRs present on the PNAs under hybridizing conditions. In this manner, even if only a single TBA-TBR complex is bound to the immobilized TBA, a large, amplified signal may be produced by polymerizing multiple BNAs to the immobilized TNA. Each BNA which binds to the TNA forms a BBR which can be bound by BBAs which, like the TBAs immobilized on the solid surface, may be chosen for their very tight and specific binding to particular nucleic acid structures. Thus, according to this embodiment, the immobilized TBA may contain the DNA binding portion of NF-kB, which very specifically and tightly binds to NF-kB binding sites formed upon hybridization of the TNA and PNA to form such a site.
- Because it is well known that there are NF-kB binding sites both in the normal human genome and in the long terminal repeats of human immunodeficiency virus (HIV), this invention provides a method of discriminating between the “normal” human sites and the sites present in cells due to HIV infection. Therefore, in a test designed to determine the presence or absence of HIV DNA in a sample of human DNA, the HIV NF-kB binding sites may be viewed as the TNA, and the normal human NF-kB binding sites may be viewed as CNAs. According to the method of this invention, discrimination between these TNAs and CNAs is accomplished by taking advantage of the fact that in the HIV LTR, there are two NF-kB binding sites, followed by three SP1 sites (see, for example, Koken et al. (1992)Virology 191:968-972), while cellular NF-kB binding sites with the same sequences are not found in tandem.
- In cases where the TNA contains more than one ½ TBR and it is desirable to pursue the therapeutic and prophylactic applications of the TBAs, it may be desirable to use more than one TBA, each with the capacity to bind a TBR in the TNA-PNA complex. In this case, it may be advantageous to select, as components of the TBAs, DNA-binding or RNA-binding domains with lesser affinity for its TBR than the wild-type DNA-binding or RNA-binding domain. Given that the TBAs which are involved in the binding to the multiple TBRs can either assemble together before binding to their TBRs or assemble together after binding to their TBRs, the individual TBAs will not block the corresponding TBRs in the other genomes than the target genome unless the TBRs are spatially capable of binding the assembled TBA complex. One feature of the multimeric assembly of TBAs which is specifically claimed here as part of this invention is that such a multimeric assembly is expected to have a much reduced affinity for a single site within the TNA. However, since the binding is dramatically increased relative to any one TBA, the TBA complex would be expected to not compete for the binding of any single TBR with the corresponding native proteins in situ but bind tightly to sequences in the PNA-TNA hybrid containing the TBRs for each of the nucleic acid-binding components assembled in the TBA. The TBA complex should be assembled and linkers adjusted in the individual TBAs so as to allow the nucleic acid-binding regions contained in the TBA complex to simultaneously reach and bind to these targets.
- Once the TNA-PNA hybrids have formed and been contacted with the immobilized TBA, unbound nucleic acid is washed from the immobilized surface and the immobilize hybrids detected. This is accomplished in any one of several ways. In one aspect of this invention, the PNA is labeled with an OSA, such as a radionuclide, colored beads, or an enzyme capable of forming a colored reaction product. Furthermore, in addition to having one or more ½ TBRs, the PNA also may contain at least one ½ BBR. The ½ BBR sequences are chosen so as to be complementary to unique ½ BBR sequences in BNAs. In the embodiment described above, for example, where the TBA is NF-kB and the TBR formed upon TNA-PNA hybridization is one or more NF-kB binding sites, the ½ BBRs may provide hybridizable (that is, single-stranded, complementary) sequences of the left or right bacteriophage lambda operators (see, for example, Ptashne [1982]Scientific American 247:128-140, and references cited therein for sequences of these operators). These may be polymerized onto the PNA ½ BBRs in a vectorial fashion (see FIGS. 2 and 3) providing up to “n” BBRs, and each BBR forms a cro binding site, Enzymatically, radioactively, or otherwise labeled cro, is contacted with the TBA-TNA-PNA-(BNA). complex. In this fashion, a highly selective and amplified signal is produced. Signal produced using a PNA having a single ½ TBR indicates success of the assay in achieving TBA-TBR binding and polymerization of the BNAs to produce signal from cellular sites (i.e. from CNAs). Absence of signal when a dimerized TBA is used indicates that in the TNA, there were no HIV LTRs as no double NF-kB binding sites were present. On the other hand, presence of signal using the dimer NF-kB indicates HIV infection. As a specific example of the foregoing description of this embodiment of the invention, see Example 6 describing an HIV test kit.
- Naturally, those skilled in the art will recognize that the foregoing description is subject to several modifications in the choice of PNAs, TNAs, TBAs, BNAs, and BBAs. Furthermore, in systems other than HIV, those skilled in the art will recognize that the general method described above could be likewise applied. However, these other applications may be simpler than the above described as the TBAs used may not recognize any normal cellular sites and therefore resort to dimerization or other methods of discriminating between TNAs and CNAs may be less critical. In designing probes and binding assemblies for these other systems, the skilled artisan will be guided by the following principles and considerations.
- In the above-described embodiment, the appeal of using the DNA-binding portions of NF-kB protein as the TBA and the NF-kB recognition binding elements as the TBRs is that these elements form an important “control point” for the replication of HIV. That is, it is known that HIV is required to use NF-kB as a critical feature in its replicative life cycle. Similar control points for other pathogens are chosen and used as a basis for detection according to the methods described herein.
- From the foregoing description of general features of this invention and the mode of its operation, one skilled in the art will recognize that there are a multiplicity of specific modes for practicing this invention. By way of example, the method of this invention is adaptable to a method and devices using chromatographic test kits described in U.S. Pat. Nos. 4,690,691 and 5,310,650 (the '691 and '650 patents). In those patents, a porous medium was used to immobilize either a TNA or a capture probe, and a solvent was used to transport a mobile phase containing either a labeled PNA, if the TNA was immobilized, or the TNA, if a capture probe was immobile, into the “capture zone”. Once the TNA was bound in the capture zone, either by directly immobilizing it or through capture, a labeled PNA was chromatographed through the capture zone and any bound label was detected.
- Adapting the instant invention to such a system provides the improvement of using a Target Binding Assembly in the capture zone and therefore, the capture of only perfectly matched TBR sequences or other TBRs representing nucleic acid confirmations specifically bound by the TBA within the TNA-PNA duplexes by virtue of the previously described sensitive discrimination by the TBA between TNAs and CNAs.
- Once the TNA-PNA hybrids become bound to the immobilized TBA, the signal is amplified by adding BNAs or chromatographing BNAs through the capture zone. Finally, the signal may be further amplified by adding BBAs or chromatographing labeled BBAs through the capture zone. In this fashion, the ease of performing the analysis steps described in the '691 and '650 patents is improved upon herein by providing the additional ability to increase the specificity and, through amplification, the sensitivity of the method described in those patents. The disclosure of the '691 and '650 patents is herein incorporated by reference for the purpose of showing the details of that method and for the teachings provided therein of specific operating conditions to which the compositions and methods of the instant invention are adaptable.
- Those skilled in the art will also recognize that the method of the instant invention is amenable to being run in microtiter plates or to automation. The use of machines incorporating the method of this invention therefore naturally falls within the scope of the instant disclosure and the claims appended hereto. Thus, for example, this invention is adaptable for use in such instruments as Abbott Laboratories' (Abbott Park, Ill.) IMx tabletop analyzer. The IMx is currently designed to run both fluorescent polarization immunoassay (FPZA, see Kier [1983] KCLA 3:13-15) and microparticle enzyme immunoassay (MEZA, see Laboratory Medicine, Vol. 20, No. 1, January 1989, pp. 47-49). The MEZA method is easily transformed into a nucleic acid detection method using the instant invention by using a TBA as a capture molecule coated onto a submicron (<0.5 μm on average) sized microparticle suspended in solution. The microparticles coated with TBA are pipetted into a reaction cell. The IMx then pipettes sample (Hybridized PNA-TNA) into the reaction cell, forming a complex with the TBA. After an appropriate incubation period, the solution is transferred to an inert glass fiber matrix for which the particles have a strong affinity and to which the microparticles adhere. Either prior to or after filtering the reaction mixture through the glass fiber matrix, BNAs and BBAs are added, or another signal amplification and detection means is used which depends on specific formation of TNA-PNA hybrids. The immobilized complex is washed and the unbound material flows through the glass fiber matrix.
- The bound complexes are detected by means of alkaline phosphatase labeled BBAs or otherwise (radioactively, enzymatically, fluorescently) labeled BBAs. In the case of alkaline phosphatase labeled BBAs, the fluorescent substrate 4-methyl umbelliferyl phosphate or like reagent may be added. Alternatively, the enzyme may be bypassed by directly labeling BBAs with this or a like reagent. In any event, fluorescence or other signal is proportional to the amount of PNA-TNA hybrids present.
- The fluorescence is detected on the surface of the matrix by means of a front surface fluorometer as described by the manufacturer of the IMx. With minor adjustments that can be made through routine experimentation to optimize an instrument such as the IMx for nucleic acid hybridization and nucleic acid-TBA interactions, the instant invention is completely adaptable to automated analyses of TNA samples.
- 9. Other Diagnostic Applications of this Invention.
- While the foregoing description enables the use of the instant invention in a number of different modes, many additional utilities of this invention are readily appreciated, for example, in a mobility retardation system.
- In this embodiment of the invention, an improvement of the well known electrophoretic mobility shift assay (EMSA) is conducted as follows (See FIGS. 12a and 12 b):
- A sample of DNA is fragmented, either through random cleavage or through specific restriction endonuclease treatment. The DNA in the sample is then split into two equal aliquots and a specific TNA is added to the first aliquot but not to the second. The first and second aliquot are then electrophoresed in an acrylamide or agarose gel, and the pattern of DNA bands (either visualized through ethidium bromide binding or through being radioactively labeled prior to electrophoresis is then compared for the two aliquots. Fragments of DNA having binding sites to which the TBA is specific are retarded in their migration through the electrophoretic medium. By using an appropriate TBA, any number of DNA or other nucleic acid sequences may be tracked in this fashion.
- In a modification of the EMSA described above, fragmented TNA is hybridized with a PNA and fractionated in a fist dimension. The fractionated DNA is then reacted with an appropriate TBA and the change in mobility of the DNA fragments is noted. Enhancement of the retardation is possible by adding BBAs as described above. (See, for example, Vijg and references cited therein for known techniques of two (2) dimensional nucleic acid electrophoresis, to which the instant method may be applied).
- 10. Therapeutic Applications.
- Because of the very tight and selective nucleic acid binding characteristics of the novel TBAs described herein, therapeutic utilities are contemplated in addition to the diagnostic utilities of these compounds. Thus, a TBA comprising tight and specific binding for the HIV-LTR, by virtue of having an NF-kB p50 and an SP1 DNA recognition unit in close association (see FIG. 10, HIV-Detect II) is useful to bind up the HIV-LTR and thereby prevent transcription from this key element of the HIV genome. The unique features of the assembly sequences of the TBA allow recombinant vectors to induce DNA encoding such a TMA into a cell and the proper folding of the expressed sequences. Once inside the cell, the nuclear localization signals of the p50 subunit directs the transport of the TBA to the nucleus where it binds tightly to the LTR of any integrated HIV, effectively shutting the pathogen down. In a prophylactic mode, one that is concerned about potential HIV exposure is administered a sufficient dose of a TBA or a recombinant vector able to express the TBA, so as to lock up any HIV that might have entered the person. In this mode, the use of the TBA is analogous to passive protection with a specific immune globulin. In the therapeutic or prophylactic mode, NLS sequences are used in place of the OSAs used in the diagnostic mode. Exemplary NLS sequences are provided as SEQ ID NOS. 72 and 103 (see also Heinzinger 1994 and Bukinsky 1993, describing NLS sequences of the HIV Vpr and gag proteins respectively). In any event, the TBA is administered in a pharmaceutically-acceptable carrier, known in the art such as a sterile salt solution or associated with a lysosome or in the form of a recombinant vector, preferably one which directs expression of the TBA in a chosen cell type, or by a protein delivery system.
- II. Embodiments of the Invention
- In view of the foregoing description and the examples which follow, those skilled in the art will appreciate that this disclosure describes and enables various embodiments of this invention, including:
- 1. A probe nucleic acid (PNA) comprising:
- (a) a single-stranded sequence, ½ TBR, which is capable of forming, under hybridizing conditions, a hybrid, TBR, with a ½ TBR present in a target nucleic acid (TNA);
- (b) zero, one or more, and preferably one to ten single stranded sequences, ½ BBR, which is capable of forming, under hybridizing conditions, a hybrid BBR, with a ½ BBR present in a booster nucleic acid (BNA); and
- (c) an OSA, which is no attached support and/or indicator, or an attached support or other means of localization, including, but not limited to, attachment to beads, polymers, and surfaces, and/or indicators;
- wherein said TBR is capable of binding with high affinity to a TBA, said TBA being a substance capable of discriminating between a paired TBR and a TBR having unpaired nucleotides, and further, wherein said BBR is capable of binding with high affinity to a BBA, said BBA being a substance capable of discriminating between a paired BBR and a BBR having unpaired nucleotides. This embodiment includes TBRs which are nucleic acid binding protein recognition sites, such as the HIV LTR, and other nucleic acid binding protein recognition sites in other pathogens, some of which are noted above. The PNA of this embodiment of the invention may produce a TBR which is a nucleic acid binding protein recognition site present in the genome of a pathogen or is a binding site associated with a pathogenic condition in the human genome or a contaminant in a fermentation process.
- 2. A booster nucleic acid (BNA) comprising:
- (a) a ½ BBR which has a sequence which is complementary to a ½ BBR sequence in a PNA or another BNA already hybridized to the PNA and which is capable of forming, under hybridizing conditions, a hybrid, BBR, with the PNA;
- (b) an OSA attached support or other means of localization, including, but not limited to, attachment to beads, polymers, and surfaces, and/or indicators; and
- (c) additional hybridization sites, ½ BBRs, for hybridization with additional BNAs;
- wherein said BBR is capable of binding with high affinity to a BBA, said BBA being a substance capable of discriminating between a paired BBR and a BBR having unpaired nucleotides.
- 3. A Hairpin Nucleic Acid (HNA) comprising a single-stranded sequence, ½ BBR, which under hybridizing conditions is capable of forming a hairpin while at the same time binding to a BNA to form a BBR capable of binding a BBA, wherein said BBR is capable of binding with high affinity to a BBA, said BBA being a substance capable of discriminating between a paired BBR and a BBR having unpaired nucleotides.
- 4. A method for detecting a specific TNA sequence, comprising the steps of:
- (a) hybridizing said TNA with a PNA as described above;
- (b) hybridizing said PNA with a BNA containing a ½ BBR whose sequence is complementary to a ½ BBR sequence in the PNA;
- (c) adding the products of steps (a) and (b) containing a TBR and a BBR, to a surface, liquid or other medium containing a TBA;
- (d) adding BBAs to the mixture in step (c) wherein said BBA comprises:
- (i) a molecule or a portion of a molecule which is capable of selectively binding to a BBR; and
- (ii) a detectible indicator; and
- (e) detecting signal produced by the indicator attached to the BBA. This method includes the use of a protein indicator, including enzymes capable of catalyzing reactions leading to production of colored reaction products. It also includes indicators such as a radionuclide or colored beads.
- 5. A method for detecting the presence in a sample of a specific Target Nucleic Acid, TNA, which comprises:
- (a) contacting said sample a Probe Nucleic Acid, PNA, which, upon hybridization with said TNA if present in said sample, forms a Target Binding Region, TBR, which is capable of binding a Target Binding Assembly, TBA;
- (b) contacting said sample, already in contact with said PNA, with a TBA capable of binding to any TBRs formed by the hybridization of said PNA and said TNA in the sample.
- 6. A method for detecting or localizing specific nucleic acid sequences with a high degree of sensitivity and specificity which comprises:
- (a) adding PNAs containing a ½ BBR and a ½ TBR to a sample containing or suspected of containing TNAs containing ½ TBR sequences, to form a complex having target binding regions, TBRs, formed by the hybridization of complementary ½ TBRs present in the PNAs and TNAs respectively;
- (b) binding the TBRs formed in step (a) to an immobilized TBA to form a TBA-TNA-PNA complex;
- (c) adding Booster Nucleic Acids, BNAs, containing booster binding regions, ½ BBRs, to the complex formed in step (b) such that the ½ BBRs in the BNAs hybridize with the ½ BBR sequences present in the PNAs or to ½ BBRs present in BNAs already bound to the PNA, to form BBRs, such that TBA-TNA-PNA-(BNA)n complexes are formed;
- (d) adding Hairpin Nucleic Acids, HNAs, containing ½ BBR sequences, to the complex formed in step (c) such that the ½ BBRs in the HNAs hybridize with any available ½ BBR sequences present in the BNAs of the complex of step (c), thereby capping the extension of the BNAs onto the TBA-TNA-PNA-(BNA)n complexes of step (c) to form TBA-TNA-PNA-(BNA)n-HNA complexes;
- (e) adding Booster Binding Assemblies, BBAs, linked to indicator moieties, to the TBA-TNA-PNA-(BNA)n-HNA complexes formed in step (d) to form TBA-TNA-PNA-(BNA-BBA)n-HNA complexes; and
- (f) detecting the signals produced by the indicator moieties inked to the TBAs, PNAs, BNAs, BBAs or HNAs in the TBA-TNA-PNA(BNA-BBA)n-HNA complexes of step (e);
- wherein:
- the TNA comprises:
- (i) one or more specific ½ TBR nucleic acid sequences, the presence or absence of which in a particular sample is to be confirmed;
- the PNA comprises:
- (i) a single stranded sequence, ½ TBR, which is capable of forming, under hybridizing conditions, a hybrid, TBR, with a ½ TBR present in a target nucleic acid (TNA);
- (ii) a single stranded sequence, ½ BBR, which is capable of forming, under hybridizing conditions, a hybrid BBR with a ½ BBR present in a booster nucleic acid (BNA); and
- (iii) an OSA, which is no attached support and/or indicator, or an attached support or other means of localization, including, but not limited to, attachment to beads, polymers, and surfaces, and/or indicators;
- the BNA comprises:
- (i) a ½ BBR, as shown in FIG. 1(IIb), which has a sequence which is complementary to a ½ BBR sequence in a PNA and which is capable of forming, under hybridizing conditions, a hybrid, BBR, with the PNA;
- (ii) an OSA attached support or other means of localization, including, but not limited to, attachment to beads, polymers, and surfaces, and/or indicators;
- (iii) additional hybridization sites, ½ BBRs, for other BNAs; and
- (iv) sequences, ½ BBRs, which can hybridize to BNAs already hybridized to the PNA;
- the BBA comprises:
- (i) a molecule or a portion of a molecule which is capable of selectively binding to a BBR, and
- (ii) no attached support and/or indicator, or an attached support or other means of localization, including, but not limited to, attachment to beads, polymers, and surfaces, and/or indicators;
- and the TBA comprises:
- (i) a molecule or a portion of a molecule which is capable of selectively binding to a TBR; and
- (ii) no attached support and/or indicator, or an attached support or other means of localization, including, but not limited to, attachment to beads, polymers, and surfaces, and/or indicators.
- 7. An improvement to a solid phase hybridization method for detecting the presence of a target polynucleotide involving: immobilizing a target polynucleotide, if present in a test sample, directly or via an intermediate capture structure, on a solid phase at a capture site; before, during or after said immobilization, attaching a detectable label to said target polynucleotide, if present; and detecting said label, if any, at said capture site; the improvement comprising:
- (a) using a Target Binding Assembly, TBA, as the means for achieving immobilization of said target polynucleotide, wherein said TBA binds only to a perfect hybrid formed between a specific Probe Nucleic Acid, PNA, and said target nucleic acid such that a perfect Target Binding Region, TBR, recognizable by said TBA is formed; and
- (b) including in the PNA a single stranded sequence, ½ BBR, capable of binding a Booster Nucleic Acid, BNA, containing a single stranded complementary ½ BBR which, upon hybridization with the ½ BBR in the PNA, forms a BBR capable of binding labeled Booster Binding Assemblies, BBAs.
- 8. A target binding assembly, TBA, comprising one or more nucleic acid recognition units, linker sequence(s), assembly sequence(s), asymmetry sequence(s), nuclear localization sign sequence(s) (NLS) and OSA(s). The nucleic acid recognition unit may be an NF-kB binding unit, an SP1 binding unit, a TATA binding unit, a human papillomavirus binding unit, an HIV LTR binding unit, or a binding unit for any other fragment of specific sequence the detection of which is desirable and which can be achieved through specific association with the TBA. Such recognition units include, but are not limited to those exemplified herein as SEQ ID NO. 63, SEQ ID NO. 64, SEQ ID NO. 65, SEQ ID NO. 66, SEQ ID NO. 67, SEQ ID NO. 68, SEQ ID NO. 69, SEQ ID NO. 70, SEQ ID NO. 71, SEQ ID NO. 72, and SEQ ID NO. 73. Linker sequences such as oligopeptides which do not interfere with the nucleic acid recognition function of the nucleic acid recognition unit and which provide stability and control over the spacing of the nucleic acid recognition unit from the remainder of the TBA. Examples of such link sequences are well known in the art and include, but are not limited to oligopeptide sequences from the interdomain primary sequence of a structural protein. Assembly sequences include oligopeptide sequences which direct the folding and association of nucleic acid recognition units. A preferred example of such sequences are oligopeptides derived from the bacteriophage lambda cro protein. The asymmetry sequence directs the association of nucleic acid recognition and assembly sequences in a predetermined order. Such asymmetry sequences are exemplified by sequences derived from insulin, relaxin, gonadotropic hormone, FSH, HCG, LH, ACTH, including but not limited to SEQ ID NOS. 85-92. With reference to FIGS. 14 and 15, SEQ ID NO. 85 is an “A” and SEQ ID NO. 86 is a “B” sequence; SEQ ID NO. 87 is an “A” and SEQ ID NO. 88 is a “B” sequence SEQ ID NO. 89 is a human relaxin “A” and SEQ ID NO. 90 is a human relaxin “B” sequence; SEQ ID NO. 91 is a skate relaxin “A” and SEQ ID NO. 92 is a skate relaxin “B” sequence. In addition, the TBA may contain nuclear localization signal sequences, NLS, which direct the migration and uptake of a protein or complex associated with said NLS into the nucleus of a cell. Examples of such NLS sequences are provided as SEQ ID NOS. 72 and 103. Preferred embodiments of the TBA include but are not limited to HIV Detect I-IV or HPV Detect I-IV, and SEQ ID NOS. 109-116.
- 9. Methods of using the novel TBAs of this invention include, but are not limited to a method of using the TBA to bind a particular nucleic acid sequence in a target nucleic acid sample which comprises:
- (a) fragmenting the nucleic acid in the target nucleic acid sample;
- (b) contacting, under hybridizing conditions, the fragmented nucleic acid with a probe nucleic acid complementary to the particular nucleic acid sequence of interest, wherein said probe nucleic acid, upon hybridization with said particular nucleic acid sequence of interest forms a target binding region to which said TBA specifically binds.
- In this method, the probe nucleic acid, in addition to sequences complementary to said particular nucleic acid sequence of interest, also may have additional sequences to which a booster nucleic acid can bind to form a booster binding site to which a labeled booster binding assembly can bind to provide a signal showing and amplifying the binding of the probe nucleic acid to the target nucleic acid sequence of interest.
- An additional aspect of this invention not requiring fragmentation of Target Nucleic Acid, involves administration of the TBA to a patient in need of such treatment of a therapeutically or prophylactically effective amount of said TBA, which comprises administering the TBA, either in the form of a purified protein complex or in the form of a recombinant vector which, upon entry into the patient is able to express the TBA, such that the TBA binds the particular nucleic acid sequence to achieve the desired prophylactic or therapeutic result. This may include providing a dosage which can be determined by routine experimentation to be sufficient to prevent establishment of an active infection by a pathogen. Dosages of purified TBAs may be in the range of about 0.001 to 100 mg/kg. When provided as a recombinant expression vector which will direct the in vivo expression and folding of the TBA, dosages of the recombinant nucleic acid may be substantially lower, particularly if provided in the form of non-pathogenic viral vector. The methods of using the TBAs also include monitoring the shift in mobility of nucleic acids in target nucleic acid samples as a function of the size such that binding of the TBA to a particular fragment in the sample modifies the mobility of the fragment. This aspect of the method provides a useful method of analyzing nucleic acid fragments for particular aberrations, such as might be found associated with metastases.
- 10. Diagnostic or forensic kits useful in determining the presence of an infection, the susceptibility to a disease, or the origin of a particular nucleic acid containing sample.
- 11. A method of assembling multimeric TBAs in vivo which comprises introducing nucleic acids encoding component TBAs into a cell. The component TBAs should each contain a nucleic acid recognition unit, assembly sequences, asymmetry sequences, and nuclear localization signal sequences. Linker sequences, optionally included if TBA footprinting experiments indicate the need for such linkers to attain optimal geometry of the multimeric TBA. Upon in vivo expression of each component TBA and primal binding, via the nucleic acid recognition unit of each component TBA to nucleic acid sequences encountered in the nucleus or elsewhere in the cell, component expressed TBAs are directed to assemble via the included assembly and asymmetry sequences into multimeric TBAs. As described above, such multimeric TBAs will have the advantage of binding specifically with high affinity to TBRs in a specific target sequence, but not at all or with very low affinity to cousin nucleic acids.
- The forgoing description of the invention will be appreciated by those skilled in the art to enable preferred embodiments as well as the best mode of this invention. Without limiting the subject matter to the specifics of the examples provided hereinafter, the following examples are provided to further guide those skilled in the art on methods of practicing this invention. Standard recombinant DNA techniques as disclosed in Sambrook, Fritsch, and Maniatis (1989)Molecular Cloning: A Laboratory Manual, 2nd Ed, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., and more recent texts are not disclosed as these are now well within the skill of the ordinary artisan.
- Probe nucleic acids, PNAs, may be prepared by means well known in the art. Thus, single stranded polynucleotide PNAs of defined sequence may be prepared via solid phase chemical synthesis according to Merrifield. PNAs may be prepared by automated synthesis using commercially available technology, such as resins and machines produced or marketed by Applied Biosystems, ABI, or other manufacturers. Alternatively, through known recombinant DNA methods, particular PNA sequences are synthesized in vivo, for example by cloning a duplex PNA into a vector which can replicate inE. coli, large quantities of the duplex PNA may be prepared. Multimers of the PNA may be cloned into the vector such that for each mole of vector, several moles of PNA is liberated upon digestion of the vector with a restriction fragment flanking the PNA sequence. Subsequent to synthesis or recombinant production, the PNAs are purified by methods well known in the art such as by gel electrophoresis or high pressure liquid chromatography (HPLC). If the PNA is produced as a duplex, prior to use in a hybridization assay for detection of target nucleic acid sequences the strands of the PNA are separated by heating or other methods known in the art.
- The specific sequence of bases in the PNA is chosen to reflect the sequence to be detected in a TNA, with the proviso that, according to this invention, the PNA contains a ½ TBR sequence, which is one that upon hybridization of the PNA and TNA, a TBR is formed. As there are an essentially unlimited number of such sequences known in the art, the choice of the PNA sequence is amenable to selection by the skilled researcher for any given application. The sequence of the HIV LTR is one such sequence, which upon hybridization of a PNA encoding portions of the LTR with TNAs encoding the HIV LTR, TBRs capable of binding the NP-kB or SP1 DNA binding proteins are formed.
- In addition to sequences which will form a TBR upon hybridization, the PNA also may contain a ½ BBR. This sequence is one which, upon hybridization with a booster nucleic acid, BNA, forms a BBR which is capable of binding a BBA. The BBA is preferably a DNA binding protein having high affinity for the BBR sequence.
- In this particular example, hybridization between a PNA having as a ½ TBR, SEQ ID NO. 4 and, at the 3′ end of that sequence, a ½ BBR sequence shown as SEQ ID NO. 35. The PNA encoding these sequences is either used without labeling or is labeled with a radioactive isotope such as P32, S35, or a similar isotope, according to methods known in the art. Alternatively, the PNA is bound to a bead of between 0.01 to 10 μm, which may be colored for easy visual detection. This label forms the OSA as described in the specification. This probe hybridizes with HIV LTR sequences to form a TBR that binds NF-kB. In addition, the PNA hybridizes with BNAs having a complementary ½ BBR to form a bacteriophage lambda left operator that binds either cro or lambda repressor proteins.
- In a manner similar to that described above, PNAs are used wherein the ½ TBR is any one of SEQ ID NO. 5 or SEQ ID NOS. 7-34, and a ½ BBR, such as SEQ ID NO. 35 or SEQ ID NO. 36 is either at the 3′ end or 5′ end of the ½ TBR.
- Similar to the methods described in Example 1 for preparation and labeling of PNAs, BNAs are prepared and labeled according to methods known in the art. As described in U.S. Pat. No. 4,556,643, herein incorporated by reference (see particularly Example 1), nucleic acid sequences encoding particular nucleic acid binding sequences may be mass produced by cloning into a replicable vector. Furthermore, similar to that disclosure, the ½ TBR and ½ BBR sequences may be co-linearly produced in this fashion, with the distinction, however, that according to the instant invention, the ½ TBR sequence itself forms a nucleic acid binding component recognition site and the ½ BBR, while forming a nucleic acid binding component recognition site, also provides a means of amplifying the signal produced upon binding of the ½ TBR to complementary sequences in the TNA by providing for polymerization of BNAs onto the TNA bound PNA. To enable this, a sequence such as SEQ ID NO. 35, which encodes the left operator of bacteriophage lambda, is provided with additional sequences such that an overhang sequence is created on one or both ends of the BNA upon hybridization with the PNA.
- As a specific example, vectorial polymerization of BNAs onto a TNA is provided by SEQ ID NOS. 40-43. In this example, SEQ ID NO. 40 encodes two ½ TBRs which will hybridize with two ½ TBRs in a TNA to form two NT-kB binding sites, while at the same time providing a bacteriophage lambda left operator ½ BBR, which additionally is terminated at the 3′ end with the recognition site for the restriction enzyme PstI. Addition of the BNA, SEQ ID NO. 41, with the ½ BBR complementary to the ½ BBR on the PNA, SEQ ID NO. 40, completes the BBR while at the same time completing the PstI recognition site, leaving a four base overhang for hybridization with additional BNAs. Accordingly, SEQ ID NO. 42 is added which has a four base pair sequence at the 3′ end which is complementary to the four-base overhang remaining from the hybridization of SEQ ID NOS. 40 and41. In addition, SEQ ID NO. 42 is provided with a five base sequence at its 5′ end which forms part of a BamHI recognition site. The growing polymer of BNAs is extended further by addition of the BNA SEQ ID NO. 43, which is complementary to SEQ ID NO. 42, completing the BBR while at the same time completing the BamHI recognition site and leaving a four base overhang which may be further hybridized with BNAs having complementary sequences. In this fashion, the BNAs may be hybridized extensively so as to greatly amplify the signal of a single PNA-TNA hybridization event.
- As with the PNAs described in Example 1, the BNAs may be used in an unlabeled form or may be labeled according to methods known in the art and described in Example 1. It will also be appreciated that, rather than produce the BNA polymer by sequential addition of BNAs to the PNA-TNA complex, the BNA polymer may be performed and added directly to the PNA-TNA complex. One simple method for performing such a BNA polymer includes the recombinant production of a vector in which multimers of the BNA are provided with a unique restriction site at either end of the polymer. This polymer of BNAs containing multiple BBRs is cut out of the vector and hybridizes to a single stranded ½ BBR remaining in the PNA upon hybridization of the PNA and the TNA. This is accomplished by providing a single stranded sequence in the PNA complementary to an overhang produced in the BNA polymer when it is excised from the production vector.
- The HNAs of this invention are produced according to methods known in the art for polynucleotide production as described in Examples 1 and 2 for PNAs and BNAs. In the production of the HNAs, however, the sequence of the HNA is specifically designed so that a substantial portion of the HNA forms a self-complementary palindrome to form a hairpin, while at the same time, leaving in single stranded form enough bases to be able to hybridize with single stranded sequences in the growing chain of BNAs described in Example 2.
- In this Example, a HNA of SEQ ID NO. 44 is provided to cap the extension of BNAs onto the PNA in Example 2 after the addition of the BNA, SEQ ID NO. 43. This is accomplished because SEQ ID NO. 44, while having a palindromic sequence that forms a stable hairpin, also has a sequence at the 5′ end of the HNA which completes the BamHI sequence formed by the hybridization of SEQ ID NO. 42 and SEQ ID NO. 43. Naturally, termination of the polymer after addition of only 3 BNAs is for the purpose of simplicity in demonstrating the invention. As described above, his polymerization may be continued essentially indefinitely to amplify the signal of the PNA-TNA hybridization event. Once the HNA hybridizes to the growing chain of BNAs, the polymer is capped and no further extension of the polymer is possible.
- The TBAs and BBAs which may be used according to the instant invention include any substance which can specifically bind to the TBRs and BBRs formed by hybridization of the PNAs, TNAs and BNAs. Use of DNA binding proteins forms one example of such substances.
- For this example, the TBA is the dimer of the DNA binding portion of p50, and the BBA is the lambda cro protein. These proteins may be produced according to methods known in the art. The genes for both of these proteins have been cloned. Thus, these proteins are recombinantly produced and purified according to methods known in the art. Furthermore, these proteins are labeled, either with a radioisotope, such as radioactive iodine, or with an enzyme, such as beta-galactosidase or horseradish peroidase, or with a fluorescent dye such as fluorescein or rhodamine, according to methods well known in the art. In addition, either or both of the TBA and BBA may be immobilized on a solid surface such as the surface of a microfiter plate or the surface of a bead, such as a colored bead of diameter anywhere from 0.01 to 10 μm. The labels on the TBAs and BBAs may be the same or different.
- In this example, the TBA containing the dimeric p50 DNA binding domain is labeled with rhodamine, while the BBA, cro, is labeled with fluorescein. Accordingly, upon hybridization of the PNAs, TNAs, BNAs and HNAs as described in this patent disclosure and the foregoing and following examples, the nucleic acid hybrids, if formed, are contacted with excess labeled TBA and cro. The fluorescence of these labels is measured according to known methods and, detection of both signals is indicative of the presence of ½ TBR sequences in the TNA. The differential signal produced by the fluorescence of the NF-kB and cro is a measure of the degree to which the polymerization of BNAs onto the PNA-TBA hybrid has resulted in amplification of the signal. Amplification from one to over a thousand fold is contemplated according to the method of this invention.
- The PNAs, PNA1, SEQ ID NO. 40 and PNA2, SEQ ID NO. 45, are used in about ten-fold molar excess over the concentration of TNAs in a test sample. For this example, an isolated duplex HIV LTR, wherein one strand of which has the sequence SEQ ID NO. 37, shown in FIG. 7, and the other stand of which is complementary to the sequence shown in FIG. 7, is used as the TNA. A duplex isolated CNA is also used in this example, one strand of which has the same sequence as SEQ ID NO. 37, except that, in the first NF-kB binding site shown in FIG. 7, at the center of the binding site,
position 1 in FIG. 7, instead of a “T,” there is an “A,” the complementary strand of which therefore mismatches with the SEQ ID NO. 40 PNA at that location - SEQ ID NO. 40 and SEQ ID NO. 45 are both added to separate reactions, the first containing the above described TNA and the second containing the above described CNA. The samples are solubilized in an appropriate hybridization buffer, such as 10 mM Tris (pH 7.5), 1 mM EDTA. The samples reheated to about 90° C. for about five minutes to strand separate the duplex TNAs and CNAs in the samples, and then the samples are allowed to cool to allow strands of PNAs, TNAs and CNAs to anneal.
- Once the hybridization has gone to completion, which can be determined according to known methods such as by calculating the t½ based on base compositions and annealing temperature according to known methods, the SEQ ID NO. 40 PNA is polymerized by addition of BNAs as in Example 2 and the SEQ ID NO. 45 PNA2 probe is polymerized with BNAs staring with Sph1 recognition site overhang. Following addition of the BNAs and a brief hybridization period, the separate samples are added to beads coated with covalently immobilized NF-kB, and the NF-kB is allowed to bind to any TBRs formed in the TNA and CNA samples. After about 15 minutes of binding, the samples are washed twice with about three volumes of an appropriate washing buffer, such as 10 mM Tris, pH 7.5, 100 mM NaCl, or another buffer pre-determined not to interfere with NF-kB, or bacteriophage lambda CI repressor protein binding activity. After each wash, the beads are allowed to settle under gravity or by brief centrifugation. This removes any nucleic acids which do not have a perfect NF-kB binding site formed by hybridization of the PNA1 and TNA sequences.
- After the final wash, bacteriophage lambda CI repressor protein labeled with a radioactive isotope, such as with radioactive iodine, or labeled with an enzyme, such as horseradish peroxidase, with colored beads, or with a fluorescent label is added to each sample. The samples are then washed several times (about 3) with several volumes (about 2) of an appropriate washing buffer such as 10 mM Tris, pH 7.5, 100 mM NaCl, or another buffer pre-determined not to interfere with NF-kB, or bacteriophage lambda CI repressor protein binding activity. After each wash, the beads are allowed to settle under gravity or by brief centrifugation. Following the last settling or centrifugation, the bound label is quantitated by detecting the bound radioactivity, liberated color in an enzymatic assay, color of bound beads, or fluorescence detection. Alternatively, an anti-CI antibody can be added and a standard sandwich enzyme linked immunoassay or radioimmunoassay performed to detect bound repressor. In addition, as a negative control (background), all of the foregoing manipulations are carried out in tandem with a sample in which beads are used having no immobilized NF-kB.
- As a result of the foregoing assay, the control and CNA containing samples have similarly low signals while the TNA containing sample has a signal well above background.
- A. Kit contents:
- 1. Microtiter plate.
- 2. 1 mg/mL solution of recombinantly produced NF-kB in tris-buffered saline.
- 3. Tube containing single stranded HIV PNAs (a mixture of pre-mixed oligonucleotides encoding two NF-10 ½ binding sites, i.e. a mixture of SEQ. ID. Nos. 7 and 8).
- 4. Tube containing single stranded human genomic PNA, SEQ ID NO. 1.
- 5. Tube of nuclease (PstI).
- 6. Tube of protease.
- 7. Tube containing pre-polymerized BNAs, 100 repeat units of bacteriophage lambda OR, capped with an HNA but with free ½ BBRs available for binding to PNA-TNA hybrids.
- 8. Tube of horseradish peroxidase (hrp) conjugated cro.
- 9. Tube of hrp colored substrate.
- 10. Tris buffered saline, 100 mL.
- 11. Lancet.
- 12. Reaction tubes A, B, C, each containing 250 μL of distilled water.
- 13. Medicine dropper.
- B. Assay Method:
- (a) The microtiter plate (item 1) is coated with the solution of recombinantly produced NF-kB (item 2) at a concentration of 1 mg/mL in tris buffered saline overnight at 4° C. with rocking.
- (b) Three drops of blood of the test taker is obtained by pricking a finger with the lancet (reagent 11), and a drop of blood is dispensed into each of reaction tubes A, B, and C (reagent 12).
- (c) Into each tube is dispensed one drop of protease solution (reagent 6) with the medicine dropper (item 12) and the tube agitated and allowed to sit for 5 minutes.
- (d) One drop of nuclease (item 5) is added to each of tubes A-C using the medicine dropper and the tubes agitated and allowed to sit for 10 minutes.
- (e) One drop of item 3 is added to tube A (test sample); one drop of item 4 is added to tube B (positive control); and one drop of saline (item 12) is added to tube C as a negative control. The tubes are heated to 50° C. in hot water and allowed to cool to room temperature over one hour.
- (f) While the hybridization is allowed to occur in step (d), the excess protein is drained from the surface and the microtiter plate, from step (a), and the plate is rinsed with tris buffered saline (tube 10).
- (g) The contents of tubes A-C from step (e) are transferred to three wells of the microtiter plate and allowed to stand for 1 hour with rocking.
- (h) The microtiter wells containing the contents of tubes A-C are rinsed with tris buffered saline and emptied.
- (i) One drop of item 7 is added to each well and allowed to hybridize with any ½ BBR sites bound to the plate, over one hour, followed by three rinses with tris buffered saline.
- (j) One drop of item 8 is added to each well and cro is allowed to bind to any bound BNA's over 10 minutes, followed by five, one mL washes with tris-buffered saline.
- (k) One drop of hrp substrate is added to each well and color allowed to develop.
- C. Results:
- If wells A and B both show color development, and well C does not, the test is valid and the subject has been infected wit HIV. If only well A shows color development, or if well C shows color development, the test has been performed incorrectly, and is invalid. If wells A and C show no color development but well B does, the test is valid and the individual has not been infected with HIV.
- Novel TBAs for use according to the instant invention are prepared as follows:
- (a) NFkB/NF-kB (HIV-Detect I). A nucleic acid encoding any one of SEQ ID NOS. 63-71 or a like NF-kB DNA binding protein, is fused, in frame, to a nucleotide sequence encoding an assembly sequence, such as cro, such that the NE-kB DNA recognition sequence is encoded at amino or carboxy terminus of the cro sequence. Optionally, a linker sequence is provided between the NF-kB sequence and the cro sequence. At the other terminus of cro, a nuclear localization signal sequence, such as SEQ ID NO. 72, is optionally provided. Further, asymmetry sequences are optionally provided at the cro terminus unused by the NF-kB recognition sequence. Examples of complete TBAs are shown below.
- (b) NF-kB/SP1 (HIV-Detect II). In a similar fashion to that described in (a) above, a recombinant coding sequence encoding an NF-kB recognition domain is prepared. In a separate construct, instead of SEQ ID NOS. 63-72, the coding sequence for the DNA recognition portion of SP1 is included. Such a sequence should encode all or a functional part of SEQ ID NO. 73, which is that portion of the SP1 transcription factor exhibiting DNA binding (see Kadonaga et al. [1987] Cell 51:1079-1090). The NF-kB-encoding vector and the SP1-encoding vector are then co-transfected into an appropriate expression system such as is well known in the art. A monomeric NF-kB recognition unit is added to complete the NF-kB recognition dimer after the assembly of the SP1 and NF-kB recognition units by the chaperone. The asymmetry sequences prevent the formation of NF-kB or SP1 dimers and direct, instead, the formation of NFkB-SP1 heterodimers (i.e., HIV-Detect II), which are then isolated from the expression system (mammalian or bacterial cells) by known methods.
- (c) SP1/SP1 TBAs (HIV-Detect III). As described in (b) above, an SP1-encoding TBA construct is prepared. However, only this construct is transfected into the expression system, and asymmetry sequences allowing the formation of SP1-
SP 1 dimers are included. - (d) SP1-TATA (HIV-Detect IV). As described in (b) above, an SP1-encoding TBA recombinant is produced. In addition, a recombinant encoding a TBA having the binding sequence, SEQ ID NO. 74, or like sequence encoding a TATA recognition unit is prepared with asymmetry sequences complementary to those included in the SP1 TBA-encoding construct. These constructs are co-transfected and the heterodimers isolated by standard methods, including affinity purification on a DNA column having the appropriate SP1-TATA target binding regions.
- (e) SP1-E2 (HPV-Detect I). An SP1-encoding construct is prepared as in (b) above. An E2 TBA-encoding construct is prepared by using a sequence encoding any one of SEQ ID NOS. 75-84 and 94-98 which are papillomavirus E2 DNA recognition units (see Hegde et al. [1992]Nature 359:505-512) or like recognition units, is prepared and co-transformed or co-transfected with the SP1 TBA-encoding construct. Monomeric E2 recognition unit is added to the complete E2 recognition dimer after the assembly of the E2-SP1 recognition unit by the chaperone. The heterodimer HPV-Detect I is isolated according to known methods.
- (f) E2-E2 (HPV-Detect II). As described above in (e), an E2 TBA-encoding construct is prepared except that asymmetry sequences are included which permit the formation of E2 dimers. The expressed dimers are then isolated by known methods including affinity for a dimeric E2 binding site on a DNA affinity column.
- (g) E2-TATA (HPV-Detect III). As described above in (e) and (d), E2 and TATA binding TBAs are prepared (respectively), except that asymmetry sequences are included which enhance the formation of heterodimers rather than homodimers. These constructs are then co-expressed and the heterodimers are isolated.
- (h) TATA-TATA (HPV-Detect IV). As described above in (a) and (d), a TATA binding TBA-encoding construct is prepared using asymmetry sequences that encourage this homodimer formation and the homodimer is isolated.
- (i) Other TBAs. As described above for HIV and HPV TBAs, TBAs for any given pathogen or disease state may be produced by identifying specific DNA binding proteins and forming an expression construct using appropriate linker, assembly, and asymmetry sequences.
- In a similar fashion to the assay described in Example 5, a more stringent assay is produced by using the duplex NF-kB-SP1 binding protein prepared according to Example 6. Accordingly, the probes shown in FIG. 7 and used in Example 5 may be lengthened to reduce the interprobe distance and thereby reduce the flexibility of the DNA in the TNA.
- By the appropriate use of asymmetry sequences, TBAs are produced which are dimers, trimers, tetameres, pentamers, or hexamers of particular DNA recognition units. In this fashion, a hexameric TBA is produced by making a first NF-kB p50 dimeric TBA using asymmetry sequences which enable dimer formation. In addition, the asymmetry sequences enable the tetramerization of the p50 dimer with an SP1-SP1 dimer. Finally, additional asymmetry sequences direct the hexamerization with a dimer exhibiting nuclear localization sequences. This is accomplished by incorporating, for example, asymmetry sequences from insulin, which in nature forms hexamers. This hexamer formation is directed by the sequences, SEQ ID NOS. 85 (A) and 86 (B), 87 (A) and 88 (B), 89 (A) and 90 (B), and 91 (A) and 92 (B) (see FIGS. 13 and 14).
- Because of the extremely high affinity for the HIV-LTR that can be generated using a multimeric TBA, the compounds having this structure and which can be used for this purpose are referred to herein as “HIV-Lock.”
- An optimal HIV-Lock is defined by footprinting (according to methods well known in the art) TBAs bound to TBRs in the HIV LTR to confirm that the binding affinity of each DNA binding protein contributing to the formation of the multimeric TBA complex is downshifted relative to the affinity for any natural target sequence (i.e. CNAs) from which the DNA binding recognition unit of the TBA is derived. Any concomitant loss in binding affinity for the HIV TBRs is more than compensated for upon formation of the multimer as described below.
- There may be competition between the binding of each component TBA for its TBR and assembly, via asymmetry sequences to form the multimer. This is obviated by adjusting the linkers between the chaperone and asymmetry sequences in each TBA component such that these competing events are uncoupled. The resultant reduction in the dimensionality of diffision (effective concentration increase) for the TBA asymmetry and assembly components results in efficient formation of the multimeric complex.
- On the basis of the footprinting, the length and composition of linkers is adjusted to achieve optimal discrimination between target HIV sequences and natural sequences. In this fashion, although each component TBA will have a low affinity for CNA and TBR sequences, the multimeric complex will have an extremely high affinity for the now expanded TBR recognized by the multimeric complex (the square of the affinity of each TBR recognized by each component TBA of the multimeric TBA), while still having a low affinity for CNAs. In the same fashion, other multimeric TBA complexes, aside from HIV-Lock, are prepared.
- TBAs which can be formed in this fashion include the following sequences, which are assembled by linking either the protein subunits or nucleic acid sequences encoding these subunits, as follows:
Set Link Sequences from Groups A I + II + III B IV + V + III C IV + III - wherein groups I-V consist of sequences selected from:
Group Selected from Sequences I Any of SEQ ID NOS. 85-92 II Met Ser, linked to any of SEQ ID NOS 104-106, each of which is linked to SEQ ID NO. 99. III SEQ ID NO. 100 linked to any of SEQ ID NOS. 75-84 or 94-98; SEQ ID NO. 101 linked to either SEQ ID NO. 74 or SEQ ID NO. 93; or SEQ 1D) NO. 102 linkedto SEQ ID NO. 74 or SEQ ID NO. 93; or any of SEQ ID NO. 72, 103, 73, or 63-71. IV Any of SEQ ID NOS. 104-106. V SEQ ID NO. 99. - Specific examples of such TBAs are SEQ ID NOS. 109-116, assembled as follows:
Set SEQ ID NO. Link SEQ IDS A 109 85 + MetSer + 104 + 99 + 100 + 94 A 110 85 + Met Ser + 104 + 99 + 72 A 111 86 + MetSer + 105 + 99 + 102 + 74 A 112 86 + MetSer + 106 + 99 + 73 A 113 89 + MetSer + 106 + 99 + 63 C 114 106 + 64 C 115 105 + 64 B 116 106 + 99 + 73 - In this fashion, choosing between appropriate asymmetry sequences, assembly sequences, and DNA recognition units, many different TBAs may be formed. Furthermore, sets of these, such as SEQ ID NOS. 114 and 115,will associate with each other but dimers of SEQ ID NO. 114 or 115 will not form due to charge repulsion in the mutated assembly sequences (SEQ ID NO. 104 is cro; SEQ ID NO. 105 is a novel mutated, negatively charged cro, and SEQ ID NO. 106 is a novel mutated, positively charged cro).
- Naturally, given the amino acid sequence of these TBAs, one of ordinary skill could produce recombinant nucleic acid clones encoding these, and such recombinant clones naturally form an integral part of this invention.
- In much the same method as used in Example 6, the “HIV-LOCK” produced according to Example 9 is used as the TBA,
reagent 2, with similar results. - When the quantity of blood to be tested is not limiting, as when samples of blood for donation are to be tested for HIV contamination, tests similar to Example 6 are run, but for each of tubes A-C, about 5 mL of blood is pelleted in a tabletop centrifuge. Other reagents are scaled up as necessary to handle the larger quantity of TNA present in the sample.
- EXAMPLE 12
- “HIV-LOCK” produced according to Example 9 is formulated as a 1 mg/mL solution in liposomes and injected intravenously into a subject who has been tested and confirmed to be infected with HIV. A dose of it 0.1 mg to 100 mg of “HIV-LOCK”/kilogram body mass is infused over a twenty-four hour period and the concentration of HIV p24 in the patient's serum monitored. The treatment is repeated as often as necessary, such as when elevations in the serum p24 occur.
- A recombinant retroviral or like vector is used to deliver a construct encoding an HIV-LTR binding TBA to an infected patient. The vector encodes a chaperone, such as cro, and sequences DNA for binding portions of p50. The same vector also encodes a chaperone on which an SP1 TBA folds. Asymmetry sequences are provided such that upon co-expression of the p50-TBA and the SP1-TBA in a single HIV infected cell in vivo, an immediate association occurs between these TBAs, while at the same time preventing any association between the DNA binding portion of p50 and endogenous p50 or p65 monomers. NLS sequences are also provided in the TBAs so that, upon dimer formation, the TBA immediately relocates to the nucleus of the cell and binds specifically to integrated HIV sequences, thus preventing any transcription from that locus.
- For this purpose, it is desirable to select sequences encoding DNA binding domains such that the expressed monomers are assembled into a TBA which does not bind to natural human sequences. Thus, it is only upon binding of the TBA components to their target sequences that association between all components of the TBA occurs to form a complex which tightly and specifically binds the HIV LTR.
- This diagnostic for human papillomavirus takes advantage of the known differential between benign and carcinogenic HPV to provide a test which indicates the susceptibility to malignancy in a patient. The papillomaviruses a group of small DNA viruses associated with benign squamous epithelial cell tumors in higher vertebrates. At least 27 distinct human types of papillomaviruses (HPVs) have been found; many of these have been associated with specific clinical lesions. Four of these, HPV-6, HPV-11, HPV-16, HPV-18, and HPV-33 have been associated with human genital tract lesions. In general, PV-6 and HPV-11 DNAs have been found associated with benign lesions of the genital tract. HPV-16, HPV-18, and HPV-33 have also been found associated with premalignant and malignant lesions and are transcribed in most cell lines established from cervical carcinomas. HPV-16, HPV-18, and HPV-33 are likely to be only two members of a large set of HPV DNAs associated with malignant human cervical carcinomas.
- Animal models have shown that benign papillomavirus lesions can progress to malignant lesions in the presence of a co-carcinogen. HPV DNA has been found in metastases of cervical carcinomas. In malignant cervical lesions, HPV DNA is usually integrated into the human genome, but there may also be extrachromosomal HPV DNA present integration of HPV to form the provirus usually results in the disruption of the viral E2 open reading frame (ORF). Despite disruption of the E2 ORF, and examination of cell lines from several cervical carcinomas has shown transcriptionally active and integrated HPV-16 and HPV-18. When HPV-16 genomes which are present in the human cervical carcinoma cell lines SiHa and CaSki have been examined, there are differences found in the integration of HPV-16. In the SiHa line, the single HPV-16 genome integration occurred at bases 3132 and 3384, disrupting the E1 and E2 ORFs with a deletion of 0.3 kb. An additional 50-basepair deletion of HPV-16 DNA resulted in the E2 and E4 OFRs being fused. The 5′ portion of the HPV-16 DNA, consisting of the disrupted E2 ORF, is ligated to continuous human right flanking sequences. In addition, a single additional guanine is detected at nucleotide 1138 in the middle of the E1 ORF. This basepair addition results in the fusion of the E1a and E1b ORFs to a single E1 ORF.
- The complete genome of HPV-16 is available on GenBank as accession number K02718; the complete genome of HPV-33 is available on GenBank as accession number M12732; the complete genome of HPV-18 is available on GenBank as accession number X05015.
- As a preliminary screen, the fact of an HPV infection is established for a given cervical biopsy sample by a simple “yes/no” type of analysis using, for example, any or all of the PNAs SEQ ID NOS. 46-53 and an E2 TBA as described above (i.e., fragment DNA, binding the PNA, immobilize with the TBA, and detect signal with BNAs and BBAs).
- Once a biopsy sample is found to be positive for HPV, additional information is obtained as to the malignancy potential of the HPV by analyzing the integration status of die virus in the human genome.
- 1. Fragment the DNA in the cervical biopsy sample and hybridize to a blocking probe having the sequence, SEQ ID NO. 60. This probe will bind to all the fragments in the DNA which have not spliced out the 0.3 kb fragment.
- 2. Expose the DNA in the biopsy sample to a PNA having the sequence, SEQ ID NO. 61. This probe will only bind to fragments which have deleted the 0.3 kb fragment (the blocking probe will prevent the looping out of the large deletion segments if present).
- 3. A PNA having SEQ ID NO. 62 is hybridized with SEQ ID NO. 41 to form a BBR which will bind to cro or λ CI repressor as a BBA, leaving a single-stranded portion capable of hybridizing with the TATA site on SEQ ID NO. 61. This added to form a TBR on the 5′ end of the large deletion.
- 4. The TBR is immobilized by a TBA having a TATA binding protein DNA recognition unit.
- 5. The bound fragments are detected by adding BNAs and BBAs as described above.
- Detection of signal in this assay indicates that the large fragment is deleted in HPV present in the TNA. Since this deletion is correlated with malignancy, this assay provides insight into the malignancy potential of the HPV infection. This conclusion can be confirmed by performing an analogous assay based on the deletion of the 52-basepair fragment which is also correlated with HPV-induced malignancy.
- The TBP recognition unit used in the TBA for this assay may be chosen, for example, from a sequence such as SEQ ID NO. 70 or SEQ D NO. 93.
- Phase One—Preparation of DNA to Produce the HIV-Lock™.
- In vitro mutagenesis of the coding regions of the naturally occurring, cloned components of the HMV-Lock™ which need to be modified is performed with a MutaGene Phagemid kit. The modified protocol includes the use of a Blue-script plasmid containing each of the binding components of MV-Lock™. These are transformed into competent cells and uracil-containing phagenids are grown. Single stranded DNA is extracted and used as a template for the mutagenic strand. Oligonucleotides containing the desired mutations, including the incorporation of a novel restriction site, are synthesized and treated with polynucleotide kinase and ATP. The kinase treated oligonucleotides are annealed to the single-stranded template, and a mutagenic strand is synthesized and ligated according to the MutaGene protocol, with the exception that Sequenase 2.0 provides the polymerase. Libraries are screened using both g-32P end-labeled nucleotides containing sequences complementary to the introduced mutations and by isolating the plasmid DNA and identifying the mutants by the presence of the introduced restriction site. The mutations are also confirmed by sequencing with a Sequenase kit. The HIV-Lock™ DNA is cloned into the baculovirus expression system with a polyhedron promotor.
- Phase Two—Production of HIV-Lock™ Proteins Using Baculovirus.
- Sf-9 cells are cultured to a pre-determined density (about 1×106 cells/ml, log phase), infected with the baculovirus containing the HIV-Lock™ instructions and harvested to recover the recombinant proteins comprising the HIV-Lock™. In the scale-up process, cultures are expanded from flasks to spinners and subsequently to bioreactors. Following infection the cells are harvested at 12, 24, 36 and 48 hours for the protein. Indices of viability are monitored throughout the entire process.
- Phase Three—Purification of the HIV-Lock™ Proteins.
- The harvested proteins are first separated from particulates by flow-through ultracentrifugation to facilitate downstream purification. The centrifuged product is then sterile filtered. Extracts are then centrifuged at 40,000 rpm at 4° C. for 30 minutes and aliquots are immunoprecipitated with polyclonal rabbit antibody against one of the HIV-Lock™ components. Immunoprecipitated proteins are run on an SDS-10% PAGE gel.
- Phase Four—Test of HIV-Lock™Proteins Against HIV DNA
- Mobility shift assays are carried out using an oligonucleotide probe comprising elements of the HIV long terminal repeat and fragments containing NFKB binding DNA associated with kappa light chain and microglobulin regulation. The oligonucleotide is annealed to its complimentary strand and end-labeled with g-32P ATP.
- Footprinting is accomplished by combining small (10−15 M) of radiolabeled HIV LTR DNA with a slightly larger amount of HIV-Lock™ in a buffer at room temperature for 10 minutes. Dithiothreitol is added prior to the addition of protein Iron (II), EDTA, hydrogen peroxide and sodium ascorbate are added and the reaction mixture is incubated. A quenching agent is added and the products are analyzed suing denaturing gel electrophoresis. This is done for different concentrations of protein. The resulting gel is imaged using a phosphoimager scanner and the resulting high resolution image file is analyzed to abstract the binding affinity of HIV-Lock™for the HIV DNA relative to cellular DNA.
- Multiple design and testing iterations may be used in order to refind binding of HIV-Lock™ and other TBAs for HIV and other organisms. This process makes it possible to design binding assemblies such that the binding assembly is not competitive with the wild type proteins for single binding sites in the genome samples. The development of TBAs for other organisms and TNAs for sequences within these organisms can be made using the aforementioned method. This method is valid when producing binding assemblies for all nucleic acid TBRs including DNA-DNA, DNA-RNA and RNA-RNA hybrids and combinations of these hybrids.
- In the method of this invention, target binding assemblies and booster binding assemblies are assembled by identifying nucleic acid binding molecules, and linking the nucleic acid binding portions of the molecules in such a fashion as to achieve TBAs which discriminate between particular target sequences and even closely related sequences. One method for identifying the nucleic acid binding molecules involves the following steps:
- 1. Obtaining a biological sample containing the target nucleic acid. This could be, for example, an organism or a tissue extract infected with a pathogen.
- 2. Fragmenting the sample so as to expose the nucleic acids and to reduce the size complexity of the nucleic acids contained in the sample.
- 3. Contacting a first aliquot of the fragmented nucleic acids with a control buffer medium and contacting a second aliquot of the fragmented nucleic acids with the control buffer medium containing a known profile of nucleic acid binding molecules.
- 4. Analyzing the two aliquots to identity fragments which have altered behavior in the aliquot contacted with the target binding molecules as opposed to the control aliquot. This is accomplished by single dimension gel electrophoresis, two dimension gel electrophoresis, high performance liquid chromatography, paper chromatography or any other means which reveals a different behavior of the nucleic acid fragments when bound to a nucleic acid binding molecule as opposed to when the nucleic acid fragment is unbound.
- 5. Identifying and isolating fragments which do exhibit altered behavior when contacted with the nucleic acid binding molecule and either sequencing the nucleic acid fragment to determine whether known nucleic acid binding molecule motifs are present, or directly identifying the nucleic acid binding molecule bound to the nucleic acid. The latter can be achieved, for example, by contacting a two dimensional grid of the electrophoresed nucleic acids with differentially labeled antibodies which bind to the various nucleic acid binding molecules.
- In this method, preferably nucleic acid motifs are used for either diagnostic or therapeutic purposes wherein the target nucleic acid has more than a single utilizable nucleic acid binding molecule target. In this way, a complex target binding assembly can be generated which takes advantage of the proximity of different nucleic acid binding molecular motifs to enhance the specificity of the TBA assembled from the individual nucleic acid binding components identified. The various nucleic acid binding portions of the nucleic acid binding molecules are then assembled into the complete TBAs as described above, for example, for HIV-LOCK™.
- According to the methods and compositions taught in this invention, any nucleic acid sequence can be specifically identified. Identification of target HIV RNA in a sample is achieved by obtaining a sample of a patient's blood or other biological fluid or extract which may contain the HIV RNA, and testing for the presence of TAR binding sites. Tat is a positive regulator of HIV replication which binds to the TAR region of the HIV RNA. The smallest naturally occurring, fully active form of HIV-Tat is 72 amino acids in length, SEQ Id. 118 herein. Tat contains at least two functional domains, and transactivates gene expression from the HIV long terminal repeat (HIV LTR). Tat binds to an RNA stem loop structure formed from the self-hybridization of sequences in TAR, which is just 5′ to the HIV LTR HIV TAR RNA forms a dinucleotide bulge and two stem-loop structures (Rhim et al. 1994 Virology:202, 202-211). The Tat (SEQ. Id. 118) binds to this structure with lower avidity than does Tat variants wherein Ala58 is a threonine or where His65 is an Asp residue. (Derse et al., 1993 Virology: 194,530-536). Utilizing these facts in the instant method is accomplished by:
- 1. Fragmenting a biological sample to the nucleic acids and reduce the size complexity of the nucleic acids.
- 2 Contacting a TBA with the sample which identifies a hybrid TAR binding protein sequence and a proximate flanking sequence in the HIV genome. The TBA used for this purpose is assembled on cro as the chaperone using Tat as the HIV RNA specific binding molecule. To provide specificity such that cross-talk between the HIV TAR site and closely related TAR sites which may be present due to such other pathogens as cytomegalovirus, the TBA also has an antibody component which recognizes the DNA-RNA hybrid target binding region formed when a probe nucleic acid binds to the HIV LTR RNA.
- 3. Eliminating any “cross-talk” produced by binding of Tat to the TAR region of the HIV RNA due to such contaminants (cousin RNAs) as the CMV TAR sequence by contacting the reaction with excess Tat variant (either the Ala58 to Thr or the His65 to Asp variants) which bind more avidly. In this way, single binding events due to the TBA binding to a cousin RNAs are competed from the nucleic acid sample by the Tat variant. On the other hand, by appropriately selecting the affinity of the double binding achieved as a result of the antibody and Tat, the TBA is not displaced from true targets. This process is illustrated in FIG. 16. In another aspect of this same method, the TBA could be one in which, rather than using a variant of Tat, an antibody is used which recognizes this nucleic acid segment, and the TBA used is a double antibody TBA.
- In an alternate version of this method, a probe nucleic acid may be used which hybridizes with the HIV LTR RNA. Accordingly, a duplex segment of the LTR sp1 sites can be created as part of the target binding region. This region of the HIV RNA flanks the TAR region which is 5′ to the LTR but is in close proximity thereto. A TBA containing Tat and two Sp1 binding units is chaperoned to provide Tat binding to TAR and Sp1 binding to the Sp1 binding sites. Amplification and detection is then carried out by adding appropriate BNAs, BBAs and HNAs. In yet another alternative, PNAs having Seq. ID. 38 and Seq. ID. 39 (see FIG. 7) could be used. A TBA is used which contains one or more Sp1 binding units and an antibody unit which binds to the DNA-RNA hybrid produced from sample RNA and the Seq. Id. 38 PNA. Appropriate BNAs, BBAs and HNAs are then added to amplify the signal.
- Naturally, those skilled in the art will recognize that other TBA and TNA combinations could be used to optimize the methods exemplified herein.
- It should be understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and the scope of the appended claims. It will be understood that sequences provided herein are exemplary only and that other like sequences suggested by these could be used in the methods of this invention. It will also be understood that although any sequence provided herein might be designated as linear, it could be used in a circularly or otherwise permuted form and although designated as not being anti-sense, it could be used in the coding or non-coding form or to bind to coding or non-coding complementary sequences.
# SEQUENCE LIS #TING (1) GENERAL INFORMATION: (iii) NUMBER OF SEQUENCES: 118 (2) INFORMATION FOR SEQ ID NO:1: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 13 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: TGGGGATTCC CCA # # # 13 (2) INFORMATION FOR SEQ ID NO:2: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 13 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: AAGGGACTTT CCC # # # 13 (2) INFORMATION FOR SEQ ID NO:3: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 13 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: AGGGGACTTT CCG # # # 13 (2) INFORMATION FOR SEQ ID NO:4: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: GCTGGGGACT TTCCA # # # 15 (2) INFORMATION FOR SEQ ID NO:5: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: ACAAGGGACT TTCCG # # # 15 (2) INFORMATION FOR SEQ ID NO:6: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 13 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: CCGGGTTTTC CCC # # # 13 (2) INFORMATION FOR SEQ ID NO:7: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: AAGGGACTTT CCGCTGGGGA CTTTCCA # # 27 (2) INFORMATION FOR SEQ ID NO:8: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 27 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: AAGGGACTTT CCGCTGGGGA CTTTCCG # # 27 (2) INFORMATION FOR SEQ ID NO:9: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: GCTGGGGACT TTCCAGGGAG GCGTGG # # 26 (2) INFORMATION FOR SEQ ID NO:10: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: GCTGGGGACT TTCCAGGGGA GGTGTG # # 26 (2) INFORMATION FOR SEQ ID NO:11: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11: GCTGGGGACT TTCCGGGGAG CGTGGC # # 26 (2) INFORMATION FOR SEQ ID NO:12: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: GCTGGGGACT TTCCGGGGAG GCGCGG # # 26 (2) INFORMATION FOR SEQ ID NO:13: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: GCTGGGGACT TTCCAGAGAG GCGTGG # # 26 (2) INFORMATION FOR SEQ ID NO:14: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: GCTGGGGACT TTCCAGGGGA GGCGTG # # 26 (2) INFORMATION FOR SEQ ID NO:15: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: GCTGGGGACT TTCCAGGGAG GCGTGG # # 26 (2) INFORMATION FOR SEQ ID NO:16: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 26 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: GCTGGGGACT TTCCAGGGAG GCTGCC # # 26 (2) INFORMATION FOR SEQ ID NO:17: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: TTTCCAGGGA GGCGTGGCCT GGGCGGGACT GGG # # 33 (2) INFORMATION FOR SEQ ID NO:18: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: CGTGGCCTGG GCGGGACTGG GGAGTGGCGT CCC # # 33 (2) INFORMATION FOR SEQ ID NO:19: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 45 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGCCT # #45 (2) INFORMATION FOR SEQ ID NO:20: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 46 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: CAGCAAGGGA CTTTCCGCTG GGGACTTTCC AGGGGAGGTG TGGCCT # 46 (2) INFORMATION FOR SEQ ID NO:21: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 46 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: CATCAAGGGA CTTTCCGCTG GGGACTTTCC AGGGGAGGTG TGGCCT # 46 (2) INFORMATION FOR SEQ ID NO:22: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 46 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: CAACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGGAGGTG TGGCCT # 46 (2) INFORMATION FOR SEQ ID NO:23: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 45 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGCAT # #45 (2) INFORMATION FOR SEQ ID NO:24: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 44 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: CTACAAGGGA CTTTCCGCTG GGGACTTTCC GGGGAGCGTG GCCT # # 44 (2) INFORMATION FOR SEQ ID NO:25: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 44 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: CTACAAGGGA CTTTCCGCTG GGGACTTTCC GGGGAGGCGC GGCT # # 44 (2) INFORMATION FOR SEQ ID NO:26: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 45 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGAGAGGCGT GGACT # #45 (2) INFORMATION FOR SEQ ID NO:27: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 46 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGGAGGCG TGGACT # 46 (2) INFORMATION FOR SEQ ID NO:28: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 46 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: CTACAGGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGGGAG # 46 (2) INFORMATION FOR SEQ ID NO:29: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 43 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: CTACAGGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGCTG CCT # # 43 (2) INFORMATION FOR SEQ ID NO:30: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 48 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGCCTGGGCG GGACTGGG # 48 (2) INFORMATION FOR SEQ ID NO:31: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 45 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: TTTCCAGGGA GGCGTGGCCT GGGCGGGACT GGGGAGTGGC GTCCC # #45 (2) INFORMATION FOR SEQ ID NO:32: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 59 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGCCTGGGCG GG #ACTGGGG 59 (2) INFORMATION FOR SEQ ID NO:33: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 59 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: TTTCCGCTGG GGACTTTCCA GGGAGGCGTG GCCTGGGCGG GACTGGGGAG TG #GCGTCCC 59 (2) INFORMATION FOR SEQ ID NO:34: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 70 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGCCTGGGCG GG #ACTGGGGA 60 GTGGCGTCCC # # # 70 (2) INFORMATION FOR SEQ ID NO:35: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 61 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: TATCACCGCC AGTGGTATTT ATGTCAACAC CGCCAGAGAT AATTTATCAC CG #CAGATGGT 60 T # # # 61 (2) INFORMATION FOR SEQ ID NO:36: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 64 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: TATCACCGCA AGGGATAAAT ATCTAACACC GTGCGTGTTG ACTATTTTAC CT #CTGGCGGT 60 GATA # # # 64 (2) INFORMATION FOR SEQ ID NO:37: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 70 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGCGT GGCCTGGGCG GG #ACTGGGGA 60 GTGGCGTCCC # # # 70 (2) INFORMATION FOR SEQ ID NO:38: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 37 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGG # # 37 (2) INFORMATION FOR SEQ ID NO:39: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 22 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: CGGGACTGGG GAGTGGCGTC CC # # 22 (2) INFORMATION FOR SEQ ID NO:40: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 103 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: CTACAAGGGA CTTTCCGCTG GGGACTTTCC AGGGAGGTAT CACCGCCAGT GG #TATTTATG 60 TCAACACCGC CAGAGATAAT TTATCACCGC AGATGGTTCT GCA # #103 (2) INFORMATION FOR SEQ ID NO:41: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 62 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: GAACCATCTG CGGTGATAAA TTATCTCTGG CGGTGTTGAC ATAAATACCA CT #GGCGGTGA 60 TA # # # 62 (2) INFORMATION FOR SEQ ID NO:42: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 71 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: GATCCAACCA TCTGCGGTGA TAAATTATCT CTGGCGGTGT TGACATAAAT AC #CACTGGCG 60 GTGATACTGC A # # # 71 (2) INFORMATION FOR SEQ ID NO:43: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 63 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: GTATCACCGC CAGTGGTATT TATGTCAACA CCGCCAGAGA TAATTTATCA CC #GCAGATGG 60 TTG # # # 63 (2) INFORMATION FOR SEQ ID NO:44: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: GATCCGGGGG GATACCCCCC G # # #21 (2) INFORMATION FOR SEQ ID NO:45: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 91 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: CGGGACTGGG GAGTGGCGTC CCTATCACCG CAAGGGATAA ATATCTAACA CC #GTGCGTGT 60 TGACTATTTT ACCTCTGGCG GTGATAGCAT G # # 91 (2) INFORMATION FOR SEQ ID NO:46: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 53 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: CTAAGGGCGT AACCGAAATC GGTTGAACCG AAACCGGTTA GTATAAAAGC AG #A 53 (2) INFORMATION FOR SEQ ID NO:47: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 54 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: AAAAGGGAGT AACCGAAAAC GGTCGGGACC GAAAACGGTG TATATAAAAG AT #GT 54 (2) INFORMATION FOR SEQ ID NO:48: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 54 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: AGTAGGGTGT AACCGAAAGC GGTTCAACCG AAAACGGTGC ATATATAAAG CA #AA 54 (2) INFORMATION FOR SEQ ID NO:49: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: GCTTCAACCG AATTCGGTTG CATG # # 24 (2) INFORMATION FOR SEQ ID NO:50: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: TGTGCAACCG ATTTCGGTTG CCTT # # 24 (2) INFORMATION FOR SEQ ID NO:51: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: TATGCAACCG AAATAGGTTG GGCA # # 24 (2) INFORMATION FOR SEQ ID NO:52: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: TGCCTAACCG TTTTCGGTTA CTTG # # 24 (2) INFORMATION FOR SEQ ID NO:53: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: GGACTAACCG TTTTAGGTCA TATT # # 24 (2) INFORMATION FOR SEQ ID NO:54: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 52 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: GACGACTATC CAGCGACCAA GATCAGAGCC AGACACCGGA AACCCCTGCC AC # 52 (2) INFORMATION FOR SEQ ID NO:55: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 53 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: GACGACACGG TATCCGCTAC TCAGCTTGTT AAACAGCTAC AGCACACCCC CT #C 53 (2) INFORMATION FOR SEQ ID NO:56: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 60 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: GACGACGACC TGCAGACACC ACAGACACCG CCCAGCCCCT TACAAAGCTG TT #CTGTGCAG 60 (2) INFORMATION FOR SEQ ID NO:57: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 68 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: CATACCAAAG CCGTCGCCTT GGGCACCGAA GAAACACAAC CACTAAGTTG TT #GCACAGAG 60 ACTCAGTG # # # 68 (2) INFORMATION FOR SEQ ID NO:58: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 77 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: TAATGTAATT GATTGTAATG ACTCTATGTG CAGTACCAGT ACCGTATTCC AG #CACCGTGT 60 CCGTGGGCAC CGCAAAG # # # 77 (2) INFORMATION FOR SEQ ID NO:59: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 80 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: ACAGACAACG ATAACCGACC ACCACAAGCA GCGGCCAAAC ACCCCGCCTT GG #ACAATAGA 60 ACAGCACGTA CTGCAACTAA # # # 80 (2) INFORMATION FOR SEQ ID NO:60: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 266 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: CATATGCAAT ACAATGCATT ATACAAACTG GACACATATA TATATTTGTG AA #GAAGCATC 60 AGTAACTGTG GTAGAGGGTC AAGTTGACTA TTATGGTTTA TATTATGTTC AT #GAAGGAAT 120 ACGAACATAT TTTGTGCAGT TTAAAGATGA TGCAGAAAAA TATAGTAAAA AT #AAAGTATG 180 GGAAGTTCAT GCGGGTGGTC AGGTAATATT ATGTCCTACA TCTGTGTTTA GC #AGCAACGA 240 AGTATCCTCT CCTGAAATTA TTAGGC # # 266 (2) INFORMATION FOR SEQ ID NO:61: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 95 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:61: AGGATGTATA AAAAAACATG GATATACAGT GGAAGTGCAG TTTGATGGAG AC #ATATGCTA 60 TTAGGCAGCA CTTGGCCAAC CACCCCGCCG CGACC # # 95 (2) INFORMATION FOR SEQ ID NO:62: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 81 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: CATGTTTTTT TATACATCCA TATCACCGCC AGTGGTATTT ATGTCAACAC CG #CCAGAGAT 60 AATTTATCAC CGCAGATGGT T # # #81 (2) INFORMATION FOR SEQ ID NO:63: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 322 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: Met Ala Asp Asp Asp Pro Tyr Gly Thr Gly Gl #n Met Phe His Leu Asn 1 5 # 10 # 15 Thr Ala Leu Thr His Ser Ile Phe Asn Ala Gl #u Leu Tyr Ser Pro Glu 20 # 25 # 30 Ile Pro Leu Ser Thr Asp Gly Pro Tyr Leu Gl #n Ile Leu Glu Gln Pro 35 # 40 # 45 Lys Gln Arg Gly Phe Arg Phe Arg Tyr Val Cy #s Glu Gly Pro Ser His 50 # 55 # 60 Gly Gly Leu Pro Gly Ala Ser Ser Glu Lys As #n Lys Lys Ser Tyr Pro 65 #70 #75 #80 Gln Val Lys Ile Cys Asn Tyr Val Gly Pro Al #a Lys Val Ile Val Gln 85 # 90 # 95 Leu Val Thr Asn Gly Lys Asn Ile His Leu Hi #s Ala His Ser Leu Val 100 # 105 # 110 Gly Lys His Cys Glu Asp Gly Val Cys Thr Va #l Thr Ala Gly Pro Lys 115 # 120 # 125 Asp Met Val Val Gly Phe Ala Asn Leu Gly Il #e Leu His Val Thr Lys 130 # 135 # 140 Lys Lys Val Phe Glu Thr Leu Glu Ala Arg Me #t Thr Glu Ala Cys Ile 145 1 #50 1 #55 1 #60 Arg Gly Tyr Asn Pro Gly Leu Leu Val His Se #r Asp Leu Ala Tyr Leu 165 # 170 # 175 Gln Ala Glu Gly Gly Gly Asp Arg Gln Leu Th #r Asp Arg Glu Lys Glu 180 # 185 # 190 Ile Ile Arg Gln Ala Ala Val Gln Gln Thr Ly #s Glu Met Asp Leu Ser 195 # 200 # 205 Val Val Arg Leu Met Phe Thr Ala Phe Leu Pr #o Asp Ser Thr Gly Ser 210 # 215 # 220 Phe Thr Arg Arg Leu Glu Pro Val Val Ser As #p Ala Ile Tyr Asp Ser 225 2 #30 2 #35 2 #40 Lys Ala Pro Asn Ala Ser Asn Leu Lys Ile Va #l Arg Met Asp Arg Thr 245 # 250 # 255 Ala Gly Cys Val Thr Gly Gly Glu Glu Ile Ty #r Leu Leu Cys Asp Lys 260 # 265 # 270 Val Gln Lys Asp Asp Ile Gln Ile Arg Phe Ty #r Glu Glu Glu Glu Asn 275 # 280 # 285 Gly Gly Val Trp Glu Gly Phe Gly Asp Phe Se #r Pro Thr Asp Val His 290 # 295 # 300 Arg Gln Phe Ala Ile Val Phe Lys Thr Pro Ly #s Tyr Lys Asp Val Asn 305 3 #10 3 #15 3 #20 Ile Thr (2) INFORMATION FOR SEQ ID NO:64: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 325 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: Met Ala Glu Asp Asp Pro Tyr Leu Gly Arg Pr #o Glu Gln Met Phe His 1 5 # 10 # 15 Leu Asp Pro Ser Leu Thr His Thr Ile Phe As #n Pro Glu Val Phe Gln 20 # 25 # 30 Pro Gln Met Ala Leu Pro Thr Ala Asp Gly Pr #o Tyr Leu Gln Ile Leu 35 # 40 # 45 Glu Gln Pro Lys Gln Arg Gly Phe Arg Phe Ar #g Tyr Val Cys Glu Gly 50 # 55 # 60 Pro Ser His Gly Gly Leu Pro Gly Ala Ser Se #r Glu Lys Asn Lys Lys 65 #70 #75 #80 Ser Tyr Pro Gln Val Lys Ile Cys Asn Tyr Va #l Gly Pro Ala Lys Val 85 # 90 # 95 Ile Val Gln Leu Val Thr Asn Gly Lys Asn Il #e His Leu His Ala His 100 # 105 # 110 Ser Leu Val Gly Lys His Cys Glu Asp Gly Il #e Cys Thr Val Thr Ala 115 # 120 # 125 Gly Pro Glu Asp Cys Val His Gly Phe Ala As #n Leu Gly Ile Leu His 130 # 135 # 140 Val Thr Lys Lys Lys Val Phe Glu Thr Leu Gl #u Ala Arg Met Thr Glu 145 1 #50 1 #55 1 #60 Ala Cys Ile Arg Gly Tyr Asn Pro Gly Leu Le #u Val His Pro Asp Leu 165 # 170 # 175 Ala Tyr Leu Gln Ala Glu Gly Gly Gly Asp Ar #g Gln Leu Gly Asp Arg 180 # 185 # 190 Glu Lys Glu Leu Ile Arg Gln Ala Ala Leu Gl #n Gln Thr Lys Glu Met 195 # 200 # 205 Asp Leu Ser Val Val Arg Leu Met Phe Thr Al #a Phe Leu Pro Asp Ser 210 # 215 # 220 Thr Gly Ser Phe Thr Arg Arg Leu Glu Pro Va #l Val Ser Asp Ala Ile 225 2 #30 2 #35 2 #40 Tyr Asp Ser Lys Ala Pro Asn Ala Ser Asn Le #u Lys Ile Val Arg Met 245 # 250 # 255 Asp Arg Thr Ala Gly Cys Val Thr Gly Gly Gl #u Glu Ile Tyr Leu Leu 260 # 265 # 270 Cys Asp Lys Val Gln Lys Asp Asp Ile Gln Il #e Arg Phe Tyr Glu Glu 275 # 280 # 285 Glu Glu Asn Gly Gly Val Trp Glu Gly Phe Gl #y Asp Phe Ser Pro Thr 290 # 295 # 300 Asp Val His Arg Gln Phe Ala Ile Val Phe Ly #s Thr Pro Lys Tyr Lys 305 3 #10 3 #15 3 #20 Asp Ile Asn Ile Thr 325 (2) INFORMATION FOR SEQ ID NO:65: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 268 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: Met Glu Pro Ala Asp Leu Leu Pro Leu Tyr Le #u Gln Pro Glu Trp Gly 1 5 # 10 # 15 Glu Gln Glu Pro Gly Gly Ala Thr Pro Phe Va #l Glu Ile Leu Glu Gln 20 # 25 # 30 Pro Lys Gln Arg Gly Met Arg Phe Arg Tyr Ly #s Cys Glu Gly Arg Ser 35 # 40 # 45 Ala Gly Ser Ile Pro Gly Glu His Ser Thr As #p Ser Ala Arg Thr His 50 # 55 # 60 Pro Thr Ile Arg Val Asn His Tyr Arg Gly Pr #o Gly Arg Val Arg Val 65 #70 #75 #80 Ser Leu Val Thr Lys Asp Pro Pro His Gly Pr #o His Pro His Glu Leu 85 # 90 # 95 Val Gly Arg His Cys Gln His Gly Tyr Tyr Gl #u Ala Glu Leu Ser Pro 100 # 105 # 110 Asp Arg Ser Ile His Ser Phe Gln Asn Leu Gl #y Ile Gln Cys Val Lys 115 # 120 # 125 Lys Arg Glu Leu Glu Ala Ala Val Ala Glu Ar #g Ile Arg Thr Asn Asn 130 # 135 # 140 Asn Pro Phe Asn Val Pro Met Glu Glu Arg Gl #y Ala Glu Tyr Asp Leu 145 1 #50 1 #55 1 #60 Ser Ala Val Arg Leu Cys Phe Gln Val Trp Va #l Asn Gly Pro Gly Gly 165 # 170 # 175 Leu Cys Pro Leu Pro Pro Val Leu Ser Gln Pr #o Ile Tyr Asp Asn Arg 180 # 185 # 190 Ala Pro Ser Thr Ala Glu Leu Arg Ile Leu Pr #o Gly Asp Arg Asn Ser 195 # 200 # 205 Gly Ser Cys Gln Gly Gly Asp Glu Ile Phe Le #u Leu Cys Asp Lys Val 210 # 215 # 220 Gln Lys Glu Asp Ile Glu Val Arg Phe Trp Al #a Glu Gly Trp Glu Ala 225 2 #30 2 #35 2 #40 Lys Gly Ser Phe Ala Ala Ala Asp Val His Ar #g Gln Val Ala Ile Val 245 # 250 # 255 Phe Arg Thr Pro Pro Phe Arg Glu Arg Ser Le #u Arg 260 # 265 (2) INFORMATION FOR SEQ ID NO:66: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 263 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: Met Asp Asp Leu Phe Pro Leu Ile Phe Pro Se #r Glu Pro Ala Gln Ala 1 5 # 10 # 15 Ser Gly Pro Tyr Val Glu Ile Ile Glu Gln Pr #o Lys Gln Arg Gly Met 20 # 25 # 30 Arg Phe Arg Tyr Lys Cys Glu Gly Arg Ser Al #a Gly Ser Ile Pro Gly 35 # 40 # 45 Glu Arg Ser Thr Asp Thr Thr Lys Thr His Pr #o Thr Ile Lys Ile Asn 50 # 55 # 60 Gly Tyr Thr Gly Pro Gly Thr Val Arg Ile Se #r Leu Val Thr Lys Asp 65 #70 #75 #80 Pro Pro His Arg Pro His Pro His Glu Leu Va #l Gly Lys Asp Cys Arg 85 # 90 # 95 Asp Gly Tyr Tyr Glu Ala Asp Leu Cys Pro As #p Arg Ser Ile His Ser 100 # 105 # 110 Phe Gln Asn Leu Gly Ile Gln Cys Val Lys Ly #s Arg Asp Leu Glu Gln 115 # 120 # 125 Ala Ile Ser Gln Arg Ile Gln Thr Asn Asn As #n Pro Phe His Val Pro 130 # 135 # 140 Ile Glu Glu Gln Arg Gly Asp Tyr Asp Leu As #n Ala Val Arg Leu Cys 145 1 #50 1 #55 1 #60 Phe Gln Val Thr Val Arg Asp Pro Ala Gly Ar #g Pro Leu Leu Leu Thr 165 # 170 # 175 Pro Val Leu Ser His Pro Ile Phe Asp Asn Ar #g Ala Pro Asn Thr Ala 180 # 185 # 190 Glu Leu Lys Ile Cys Arg Val Asn Arg Asn Se #r Gly Ser Cys Leu Gly 195 # 200 # 205 Gly Asp Glu Ile Phe Leu Leu Cys Asp Lys Va #l Gln Lys Glu Asp Ile 210 # 215 # 220 Glu Val Tyr Phe Thr Gly Pro Gly Trp Glu Al #a Arg Gly Ser Phe Ser 225 2 #30 2 #35 2 #40 Gln Ala Asp Val His Arg Gln Val Ala Ile Va #l Phe Arg Thr Pro Pro 245 # 250 # 255 Tyr Ala Asp Pro Ser Leu Gln 260 (2) INFORMATION FOR SEQ ID NO:67: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 263 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: Met Asp Glu Leu Phe Pro Leu Ile Phe Pro Al #a Glu Pro Ala Gln Ala 1 5 # 10 # 15 Ser Gly Pro Tyr Val Glu Ile Ile Glu Gln Pr #o Lys Gln Arg Gly Met 20 # 25 # 30 Arg Phe Arg Tyr Lys Cys Glu Gly Arg Ser Al #a Gly Ser Ile Pro Gly 35 # 40 # 45 Glu Arg Ser Thr Asp Thr Thr Lys Thr His Pr #o Thr Ile Lys Ile Asn 50 # 55 # 60 Gly Tyr Thr Gly Pro Gly Thr Val Arg Ile Se #r Leu Val Thr Lys Asp 65 #70 #75 #80 Pro Pro His Arg Pro His Pro His Glu Leu Va #l Gly Lys Asp Cys Arg 85 # 90 # 95 Asp Gly Phe Tyr Glu Ala Glu Leu Cys Pro As #p Arg Cys Ile His Ser 100 # 105 # 110 Phe Gln Asn Leu Gly Ile Gln Cys Val Lys Ly #s Arg Asp Leu Glu Gln 115 # 120 # 125 Ala Ile Ser Gln Arg Ile Gln Thr Asn Asn As #n Pro Phe Gln Val Pro 130 # 135 # 140 Ile Glu Glu Gln Arg Gly Asp Tyr Asp Leu As #n Ala Val Arg Leu Cys 145 1 #50 1 #55 1 #60 Phe Gln Val Thr Val Arg Asp Pro Ser Gly Ar #g Pro Leu Arg Leu Pro 165 # 170 # 175 Pro Val Leu Pro His Pro Ile Phe Asp Asn Ar #g Ala Pro Asn Thr Ala 180 # 185 # 190 Glu Leu Lys Ile Cys Arg Val Asn Arg Asn Se #r Gly Ser Cys Leu Gly 195 # 200 # 205 Gly Asp Glu Ile Phe Leu Leu Cys Asp Lys Va #l Gln Lys Glu Asp Ile 210 # 215 # 220 Glu Val Tyr Phe Thr Gly Pro Gly Trp Glu Al #a Arg Gly Ser Phe Ser 225 2 #30 2 #35 2 #40 Gln Ala Asp Val His Arg Gln Val Ala Ile Va #l Phe Arg Thr Pro Pro 245 # 250 # 255 Tyr Ala Asp Pro Ser Leu Gln 260 (2) INFORMATION FOR SEQ ID NO:68: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 299 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: Met Phe Pro Asn Gln Asn Asn Gly Ala Ala Pr #o Gly Gln Gly Pro Ala 1 5 # 10 # 15 Val Asp Gly Gln Gln Ser Leu Asn Tyr Asn Gl #y Leu Pro Ala Gln Gln 20 # 25 # 30 Gln Gln Gln Leu Ala Gln Ser Thr Lys Asn Va #l Arg Lys Lys Pro Tyr 35 # 40 # 45 Val Lys Ile Thr Glu Gln Pro Ala Gly Lys Al #a Leu Arg Phe Arg Tyr 50 # 55 # 60 Glu Cys Glu Gly Arg Ser Ala Gly Ser Ile Pr #o Gly Val Asn Ser Thr 65 #70 #75 #80 Pro Glu Asn Lys Thr Tyr Pro Thr Ile Glu Il #e Val Gly Tyr Lys Gly 85 # 90 # 95 Arg Ala Val Val Val Val Ser Cys Val Thr Ly #s Asp Thr Pro Tyr Arg 100 # 105 # 110 Pro His Pro His Asn Leu Val Gly Lys Glu Gl #y Cys Lys Lys Gly Val 115 # 120 # 125 Cys Thr Leu Glu Ile Asn Ser Glu Thr Met Ar #g Ala Val Phe Ser Asn 130 # 135 # 140 Leu Gly Ile Gln Cys Val Lys Lys Lys Asp Il #e Glu Ala Ala Leu Lys 145 1 #50 1 #55 1 #60 Ala Arg Glu Glu Ile Arg Val Asp Pro Phe Ly #s Thr Gly Phe Ser His 165 # 170 # 175 Arg Phe Gln Pro Ser Ser Ile Asp Leu Asn Se #r Val Arg Leu Cys Phe 180 # 185 # 190 Gln Val Phe Met Glu Ser Glu Gln Lys Gly Ar #g Phe Thr Ser Pro Leu 195 # 200 # 205 Pro Pro Val Val Ser Glu Pro Ile Phe Asp Ly #s Lys Ala Met Ser Asp 210 # 215 # 220 Leu Val Ile Cys Arg Leu Cys Ser Cys Ser Al #a Thr Val Phe Gly Asn 225 2 #30 2 #35 2 #40 Thr Gln Ile Ile Leu Leu Cys Glu Lys Val Al #a Lys Glu Asp Ile Ser 245 # 250 # 255 Val Arg Phe Phe Glu Glu Lys Asn Gly Gln Se #r Val Trp Glu Ala Phe 260 # 265 # 270 Gly Asp Phe Gln His Thr Asp Val His Lys Gl #n Thr Ala Ile Thr Phe 275 # 280 # 285 Lys Thr Pro Arg Tyr His Thr Leu Asp Ile Th #r 290 # 295 (2) INFORMATION FOR SEQ ID NO:69: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 261 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: Met Asp Phe Leu Thr Asn Leu Arg Phe Thr Gl #u Gly Ile Ser Glu Pro 1 5 # 10 # 15 Tyr Ile Glu Ile Phe Glu Gln Pro Arg Gln Ar #g Gly Thr Arg Phe Arg 20 # 25 # 30 Tyr Lys Cys Glu Gly Arg Ser Ala Gly Ser Il #e Pro Gly Glu His Ser 35 # 40 # 45 Thr Asp Asn Asn Lys Thr Phe Pro Ser Ile Gl #n Ile Leu Asn Tyr Phe 50 # 55 # 60 Gly Lys Val Lys Ile Arg Thr Thr Leu Val Th #r Lys Asn Glu Pro Tyr 65 #70 #75 #80 Lys Pro His Pro His Asp Leu Val Gly Lys Gl #y Cys Arg Asp Gly Tyr 85 # 90 # 95 Tyr Glu Ala Glu Phe Gly Pro Glu Arg Gln Va #l Leu Ser Phe Gln Asn 100 # 105 # 110 Leu Gly Ile Gln Cys Val Lys Lys Lys Asp Le #u Lys Glu Ser Ile Ser 115 # 120 # 125 Leu Arg Ile Ser Lys Lys Asn Pro Phe Asn Va #l Pro Glu Glu Gln Leu 130 # 135 # 140 His Asn Ile Asp Glu Tyr Asp Leu Asn Val Va #l Arg Leu Cys Phe Gln 145 1 #50 1 #55 1 #60 Ala Phe Leu Pro Asp Glu His Gly Asn Tyr Th #r Leu Ala Leu Pro Pro 165 # 170 # 175 Leu Ile Ser Asn Pro Ile Tyr Asp Asn Arg Al #a Pro Asn Thr Ala Glu 180 # 185 # 190 Leu Arg Ile Cys Arg Val Asn Lys Asn Cys Gl #y Ser Val Lys Gly Gly 195 # 200 # 205 Asp Glu Ile Phe Leu Leu Cys Asp Lys Val Gl #n Lys Asp Asp Ile Glu 210 # 215 # 220 Val Arg Phe Val Leu Gly Asn Trp Glu Ala Ly #s Gly Ser Phe Ser Gln 225 2 #30 2 #35 2 #40 Ala Asp Val His Arg Gln Val Ala Ile Val Ph #e Arg Thr Pro Pro Phe 245 # 250 # 255 Leu Gly Asp Ile Thr 260 (2) INFORMATION FOR SEQ ID NO:70: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 262 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: Met Asp Phe Leu Thr Asn Leu Arg Phe Thr Gl #u Gly Ile Ser Glu Pro 1 5 # 10 # 15 Tyr Ile Glu Ile Phe Glu Gln Pro Arg Gln Ar #g Gly Met Arg Phe Arg 20 # 25 # 30 Tyr Lys Cys Glu Gly Arg Ser Ala Gly Ser Il #e Pro Gly Glu His Ser 35 # 40 # 45 Thr Asp Asn Asn Lys Thr Phe Pro Ser Ile Gl #n Ile Leu Asn Tyr Phe 50 # 55 # 60 Gly Lys Val Lys Ile Arg Thr Thr Leu Val Th #r Lys Asn Glu Pro Tyr 65 #70 #75 #80 Lys Pro His Pro His Asp Leu Val Gly Lys Gl #y Cys Arg Asp Gly Tyr 85 # 90 # 95 Tyr Glu Ala Glu Phe Gly Pro Glu Arg Gln Va #l Leu Ser Phe Gln Asn 100 # 105 # 110 Leu Gly Ile Gln Cys Val Lys Lys Lys Asp Le #u Lys Glu Ser Ile Ser 115 # 120 # 125 Leu Arg Ile Ser Lys Lys Ile Asn Pro Phe As #n Val Pro Glu Glu Gln 130 # 135 # 140 Leu His Asn Ile Asp Glu Tyr Asp Leu Asn Va #l Val Arg Leu Cys Phe 145 1 #50 1 #55 1 #60 Gln Ala Phe Leu Pro Asp Glu His Gly Asn Ty #r Thr Leu Ala Leu Pro 165 # 170 # 175 Pro Leu Ile Ser Asn Pro Ile Tyr Asp Asn Ar #g Ala Pro Asn Thr Ala 180 # 185 # 190 Glu Leu Arg Ile Cys Arg Val Asn Lys Asn Cy #s Gly Ser Val Lys Gly 195 # 200 # 205 Gly Asp Glu Ile Phe Leu Leu Cys Asp Lys Va #l Gln Lys Asp Asp Ile 210 # 215 # 220 Glu Val Arg Phe Val Leu Gly Asn Trp Glu Al #a Lys Gly Ser Phe Ser 225 2 #30 2 #35 2 #40 Gln Ala Asp Val His Arg Gln Val Ala Ile Va #l Phe Arg Thr Pro Pro 245 # 250 # 255 Phe Leu Gly Asp Ile Thr 260 (2) INFORMATION FOR SEQ ID NO:71: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 314 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:71: Met Ser Asn Lys Lys Gln Ser Asn Arg Leu Th #r Glu Gln His Lys Leu 1 5 # 10 # 15 Ser Gln Gly Val Ile Gly Ile Phe Gly Asp Ty #r Ala Lys Ala His Asp 20 # 25 # 30 Leu Ala Val Gly Glu Val Ser Lys Leu Val Ly #s Lys Ala Leu Ser Asn 35 # 40 # 45 Glu Tyr Pro Gln Leu Ser Phe Arg Tyr Arg As #p Ser Ile Lys Lys Thr 50 # 55 # 60 Glu Ile Asn Glu Ala Leu Lys Lys Ile Asp Pr #o Asp Leu Gly Gly Thr 65 #70 #75 #80 Leu Phe Val Ser Asn Ser Ser Ile Lys Pro As #p Gly Gly Ile Val Glu 85 # 90 # 95 Val Lys Asp Asp Tyr Gly Glu Trp Arg Val Va #l Leu Val Ala Glu Ala 100 # 105 # 110 Lys His Gln Gly Lys Asp Ile Ile Asn Ile Ar #g Asn Gly Leu Leu Val 115 # 120 # 125 Gly Lys Arg Gly Asp Gln Asp Leu Met Ala Al #a Gly Asn Ala Ile Glu 130 # 135 # 140 Arg Ser His Asn Ile Ser Glu Ile Ala Asn Ph #e Met Leu Ser Glu Ser 145 1 #50 1 #55 1 #60 His Phe Pro Tyr Val Leu Phe Leu Glu Gly Se #r Asn Phe Leu Thr Glu 165 # 170 # 175 Asn Ile Ser Ile Thr Arg Pro Asp Gly Arg Va #l Val Asn Leu Glu Tyr 180 # 185 # 190 Asn Ser Gly Ser Glu Ser His Phe Pro Tyr Va #l Leu Phe Leu Glu Gly 195 # 200 # 205 Ser Asn Phe Leu Thr Glu Asn Ile Ser Ile Th #r Arg Pro Asp Gly Arg 210 # 215 # 220 Val Val Asn Leu Glu Tyr Asn Ser Gly Ile Le #u Asn Arg Leu Asp Arg 225 2 #30 2 #35 2 #40 Leu Thr Ala Ala Asn Tyr Gly Met Pro Ile As #n Ser Asn Leu Cys Ile 245 # 250 # 255 Asn Lys Phe Val Asn His Lys Asp Lys Ser Il #e Met Leu Gln Ala Ala 260 # 265 # 270 Ser Ile Tyr Thr Gln Gly Asp Gly Arg Glu Tr #p Asp Ser Lys Ile Met 275 # 280 # 285 Phe Glu Ile Met Phe Asp Ile Ser Thr Thr Se #r Leu Arg Val Leu Gly 290 # 295 # 300 Arg Asp Leu Phe Glu Gln Leu Thr Ser Lys 305 3 #10 (2) INFORMATION FOR SEQ ID NO:72: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 17 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: Cys Asp Thr Asp Asp Arg His Arg Ile Glu Gl #u Lys Arg Lys Arg Lys 1 5 # 10 # 15 Thr (2) INFORMATION FOR SEQ ID NO:73: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 168 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: Gly Asp Pro Gly Lys Lys Lys Gln His Ile Cy #s His Ile Gln Gly Cys 1 5 # 10 # 15 Gly Lys Val Tyr Gly Lys Thr Ser His Leu Ar #g Ala His Leu Arg Trp 20 # 25 # 30 His Thr Gly Glu Arg Pro Phe Met Cys Thr Tr #p Ser Tyr Cys Gly Lys 35 # 40 # 45 Arg Phe Thr Arg Ser Asp Glu Leu Gln Arg Hi #s Lys Arg Thr His Thr 50 # 55 # 60 Gly Glu Lys Lys Phe Ala Cys Pro Glu Cys Pr #o Lys Arg Phe Met Arg 65 #70 #75 #80 Ser Asp His Leu Ser Lys His Ile Lys Thr Hi #s Gln Asn Lys Lys Gly 85 # 90 # 95 Gly Pro Gly Val Ala Leu Ser Val Gly Thr Le #u Pro Leu Asp Ser Gly 100 # 105 # 110 Ala Gly Ser Glu Gly Ser Gly Thr Ala Thr Pr #o Ser Ala Leu Ile Thr 115 # 120 # 125 Thr Asn Met Val Ala Met Glu Ala Ile Cys Pr #o Glu Gly Ile Ala Arg 130 # 135 # 140 Leu Ala Asn Ser Gly Ile Asn Val Met Gln Va #l Ala Asp Leu Gln Ser 145 1 #50 1 #55 1 #60 Ile Asn Ile Ser Gly Asn Gly Phe 165 (2) INFORMATION FOR SEQ ID NO:74: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 181 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: Ser Gly Ile Val Pro Gln Leu Gln Asn Ile Va #l Ser Thr Val Asn Leu 1 5 # 10 # 15 Gly Cys Lys Leu Asp Leu Lys Thr Ile Ala Le #u Arg Ala Arg Asn Ala 20 # 25 # 30 Glu Tyr Asn Pro Lys Arg Phe Ala Ala Val Il #e Met Arg Ile Arg Glu 35 # 40 # 45 Pro Arg Thr Thr Ala Leu Ile Phe Ser Ser Gl #y Lys Met Val Cys Thr 50 # 55 # 60 Gly Ala Lys Ser Glu Glu Gln Ser Arg Leu Al #a Ala Arg Lys Tyr Ala 65 #70 #75 #80 Arg Val Val Gln Lys Leu Gly Phe Pro Ala Ly #s Phe Leu Asp Phe Lys 85 # 90 # 95 Ile Gln Asn Met Val Gly Ser Cys Asp Val Ly #s Phe Pro Ile Arg Leu 100 # 105 # 110 Glu Gly Leu Val Leu Thr His Gln Gln Phe Se #r Ser Tyr Glu Pro Glu 115 # 120 # 125 Leu Phe Pro Gly Leu Ile Tyr Arg Met Ile Ly #s Pro Arg Ile Val Leu 130 # 135 # 140 Leu Ile Phe Val Ser Gly Lys Val Val Leu Th #r Gly Ala Lys Val Arg 145 1 #50 1 #55 1 #60 Ala Glu Ile Tyr Glu Ala Phe Glu Asn Ile Ty #r Pro Ile Leu Lys Gly 165 # 170 # 175 Phe Arg Lys Thr Thr 180 (2) INFORMATION FOR SEQ ID NO:75: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 85 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: Ser Cys Phe Ala Leu Ile Ser Gly Thr Ala As #n Gln Val Lys Cys Tyr 1 5 # 10 # 15 Arg Phe Arg Val Lys Lys Asn His Arg His Ar #g Tyr Glu Asn Cys Thr 20 # 25 # 30 Thr Thr Trp Phe Thr Val Ala Asp Asn Gly Al #a Glu Arg Gln Gly Gln 35 # 40 # 45 Ala Gln Ile Leu Ile Thr Phe Gly Ser Pro Se #r Gln Arg Gln Asp Phe 50 # 55 # 60 Leu Lys His Val Pro Leu Pro Pro Gly Met As #n Ile Ser Gly Phe Thr 65 #70 #75 #80 Ala Ser Leu Asp Phe 85 (2) INFORMATION FOR SEQ ID NO:76: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 87 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: Cys Pro Cys Leu Leu Ile Gly Thr Ser Gly As #n Gly Asn Gln Val Lys 1 5 # 10 # 15 Cys Tyr Ser Phe Arg Val Lys Arg Trp His As #p Arg Asp Lys Tyr His 20 # 25 # 30 His Thr Thr Thr Trp Trp Ala Val Gly Gly Gl #n Gly Ser Glu Arg Pro 35 # 40 # 45 Gly Asp Ala Thr Val Ile Val Thr Phe Lys As #p Gln Ser Gln Arg Ser 50 # 55 # 60 His Phe Leu Gln Gln Val Pro Leu Pro Pro Gl #y Met Ser Ala His Gly 65 #70 #75 #80 Val Thr Met Thr Val Asp Phe 85 (2) INFORMATION FOR SEQ ID NO:77: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 84 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: Pro Pro Val Ile Cys Leu Lys Gly Gly His As #n Gln Leu Lys Cys Leu 1 5 # 10 # 15 Arg Tyr Arg Leu Lys Ser Lys His Ser Ser Le #u Phe Asp Cys Ile Ser 20 # 25 # 30 Thr Thr Trp Ser Trp Val Asp Thr Thr Ser Th #r Cys Arg Leu Gly Ser 35 # 40 # 45 Gly Arg Met Leu Ile Lys Phe Ala Asp Ser Gl #u Gln Arg Asp Lys Phe 50 # 55 # 60 Leu Ser Arg Val Pro Leu Pro Ser Thr Thr Gl #n Val Phe Leu Gly Asn 65 #70 #75 #80 Phe Tyr Gly Leu (2) INFORMATION FOR SEQ ID NO:78: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 84 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:78: Pro Pro Val Ile Leu Val Arg Gly Gly Ala As #n Thr Leu Lys Cys Phe 1 5 # 10 # 15 Arg Asn Arg Ala Arg Val Arg Tyr Arg Gly Le #u Phe Lys Tyr Phe Ser 20 # 25 # 30 Thr Thr Trp Ser Trp Val Ala Gly Asp Ser Th #r Glu Arg Leu Gly Arg 35 # 40 # 45 Ser Arg Met Leu Ile Leu Phe Thr Ser Ala Cy #s Gln Arg Glu Lys Pro 50 # 55 # 60 Asp Glu Thr Val Lys Tyr Pro Lys Gly Val As #p Thr Ser Tyr Gly Asn 65 #70 #75 #80 Leu Asp Ser Leu (2) INFORMATION FOR SEQ ID NO:79: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 84 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: Pro Pro Val Val Cys Val Lys Gly Gly Ala As #n Gln Leu Lys Cys Leu 1 5 # 10 # 15 Arg Tyr Arg Leu Lys Ala Ser Thr Gln Val As #p Phe Asp Ser Ile Ser 20 # 25 # 30 Thr Thr Trp His Trp Thr Asp Arg Lys Asn Th #r Glu Arg Ile Gly Ser 35 # 40 # 45 Ala Arg Met Leu Val Lys Phe Ile Asp Glu Al #a Gln Arg Glu Lys Phe 50 # 55 # 60 Leu Glu Arg Val Ala Leu Pro Arg Ser Val Se #r Val Phe Leu Gly Gln 65 #70 #75 #80 Phe Asn Gly Ser (2) INFORMATION FOR SEQ ID NO:80: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 84 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: Thr Pro Ile Val Gln Leu Gln Gly Asp Ser As #n Cys Leu Lys Cys Phe 1 5 # 10 # 15 Arg Tyr Arg Leu Asn Asp Lys Tyr Lys His Le #u Phe Glu Leu Ala Ser 20 # 25 # 30 Ser Thr Trp His Trp Ala Ser Pro Glu Ala Pr #o His Lys Asn Ala Ile 35 # 40 # 45 Val Thr Leu Thr Tyr Ser Ser Glu Glu Gln Ar #g Gln Gln Phe Leu Asn 50 # 55 # 60 Ser Val Lys Ile Pro Pro Thr Ile Arg His Ly #s Val Gly Phe Met Ser 65 #70 #75 #80 Leu His Leu Leu (2) INFORMATION FOR SEQ ID NO:81: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 84 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:81: Thr Pro Ile Val Gln Phe Gln Gly Glu Ser As #n Cys Leu Lys Cys Phe 1 5 # 10 # 15 Arg Tyr Arg Leu Asn Arg Asp His Arg His Le #u Phe Asp Leu Ile Ser 20 # 25 # 30 Ser Thr Trp His Trp Ala Ser Ser Lys Ala Pr #o His Lys His Ala Ile 35 # 40 # 45 Val Thr Val Thr Tyr Asp Ser Glu Glu Gln Ar #g Gln Gln Phe Leu Asp 50 # 55 # 60 Val Val Lys Ile Pro Pro Thr Ile Ser His Ly #s Leu Gly Phe Met Ser 65 #70 #75 #80 Leu His Leu Leu (2) INFORMATION FOR SEQ ID NO:82: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 80 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:82: Thr Pro Ile Ile His Leu Lys Gly Asp Arg As #n Ser Leu Lys Cys Leu 1 5 # 10 # 15 Arg Tyr Arg Leu Arg Lys His Ser Asp His Ty #r Arg Asp Ile Ser Ser 20 # 25 # 30 Thr Trp His Trp Thr Gly Ala Gly Asn Glu Ly #s Thr Gly Ile Leu Thr 35 # 40 # 45 Val Thr Tyr His Ser Glu Thr Gln Arg Thr Ly #s Phe Leu Asn Thr Val 50 # 55 # 60 Ala Ile Pro Asp Ser Val Gln Ile Leu Val Gl #y Tyr Asn Thr Met Tyr 65 #70 #75 #80 (2) INFORMATION FOR SEQ ID NO:83: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 80 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:83: Thr Pro Ile Val His Leu Lys Gly Asp Ala As #n Thr Leu Lys Cys Leu 1 5 # 10 # 15 Arg Tyr Arg Phe Lys Lys His Cys Thr Leu Ty #r Thr Ala Val Ser Ser 20 # 25 # 30 Thr Trp His Trp Thr Gly His Asn Tyr Lys Hi #s Lys Ser Ala Ile Val 35 # 40 # 45 Thr Leu Thr Tyr Asp Ser Glu Trp Gln Arg As #p Gln Phe Leu Ser Gln 50 # 55 # 60 Val Lys Ile Pro Lys Thr Ile Thr Val Ser Th #r Gly Phe Met Ser Ile 65 #70 #75 #80 (2) INFORMATION FOR SEQ ID NO:84: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 81 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: Ala Pro Ile Val His Leu Lys Gly Glu Ser As #n Ser Leu Lys Cys Leu 1 5 # 10 # 15 Arg Tyr Arg Leu Lys Pro Tyr Asn Glu Leu Ty #r Ser Ser Met Ser Ser 20 # 25 # 30 Thr Trp His Trp Thr Ser Asp Asn Lys Asn Se #r Lys Asn Gly Ile Val 35 # 40 # 45 Thr Val Thr Phe Val Thr Gly Gln Gln Gln Gl #n Met Phe Leu Gly Thr 50 # 55 # 60 Val Lys Ile Pro Pro Thr Val Gln Ile Ser Th #r Gly Phe Met Thr Leu 65 #70 #75 #80 Val (2) INFORMATION FOR SEQ ID NO:85: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:85: Gly Ile Val Glu Gln Cys Cys Thr Ser Ile Cy #s Ser Leu Tyr Gln Leu 1 5 # 10 # 15 Glu Asn Tyr Cys Asn 20 (2) INFORMATION FOR SEQ ID NO:86: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 30 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: Phe Val Asn Gln His Leu Cys Gly Ser His Le #u Val Glu Ala Leu Tyr 1 5 # 10 # 15 Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Th #r Pro Lys Thr 20 # 25 # 30 (2) INFORMATION FOR SEQ ID NO:87: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 21 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:87: Gly Ile Val Glu Gln Cys Cys Ala Ser Val Cy #s Ser Leu Tyr Gln Leu 1 5 # 10 # 15 Glu Asn Tyr Cys Asn 20 (2) INFORMATION FOR SEQ ID NO:88: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 30 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: Phe Val Asn Gln His Leu Cys Gly Ser His Le #u Val Glu Ala Leu Tyr 1 5 # 10 # 15 Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Th #r Pro Lys Thr 20 # 25 # 30 (2) INFORMATION FOR SEQ ID NO:89: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:89: Gln Leu Tyr Ser Ala Leu Ala Asn Lys Cys Cy #s His Val Gly Cys Ile 1 5 # 10 # 15 Lys Arg Ser Leu Ala Arg Phe Cys 20 (2) INFORMATION FOR SEQ ID NO:90: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 33 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:90: Asp Ser Trp Met Glu Glu Val Ile Lys Ile Cy #s Gly Arg Glu Leu Val 1 5 # 10 # 15 Arg Ala Gln Ile Ala Ile Cys Gly Met Ser Th #r Trp Ser Lys Arg Ser 20 # 25 # 30 Leu (2) INFORMATION FOR SEQ ID NO:91: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 24 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:91: Glu Glu Lys Met Gly Thr Ala Lys Lys Cys Cy #s Ala Ile Gly Cys Ser 1 5 # 10 # 15 Thr Glu Asp Phe Arg Met Val Cys 20 (2) INFORMATION FOR SEQ ID NO:92: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 40 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:92: Arg Pro Asn Trp Glu Glu Arg Ser Arg Leu Cy #s Gly Arg Asp Leu Ile 1 5 # 10 # 15 Arg Ala Phe Ile Tyr Leu Cys Gly Gly Thr Ar #g Trp Thr Arg Leu Pro 20 # 25 # 30 Asn Phe Gly Asn Tyr Pro Ile Met 35 # 40 (2) INFORMATION FOR SEQ ID NO:93: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 182 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: Ser Gly Ile Val Pro Thr Leu Gln Asn Ile Va #l Ser Thr Val Asn Leu 1 5 # 10 # 15 Asp Cys Lys Leu Asp Leu Lys Ala Ile Ala Le #u Gln Ala Arg Asn Ala 20 # 25 # 30 Glu Tyr Asn Pro Lys Arg Phe Ala Ala Val Il #e Met Arg Ile Arg Glu 35 # 40 # 45 Pro Lys Thr Thr Ala Leu Ile Phe Ala Ser Gl #y Lys Met Val Cys Thr 50 # 55 # 60 Gly Ala Lys Ser Glu Asp Phe Ser Lys Met Al #a Ala Arg Lys Tyr Ala 65 #70 #75 #80 Arg Ile Val Gln Lys Leu Gly Phe Pro Ala Ly #s Phe Lys Asp Phe Lys 85 # 90 # 95 Ile Gln Asn Ile Val Gly Ser Cys Asp Val Ly #s Phe Pro Ile Arg Leu 100 # 105 # 110 Glu Gly Leu Ala Tyr Ser His Ala Ala Phe Se #r Ser Tyr Glu Pro Glu 115 # 120 # 125 Leu Phe Pro Gly Leu Ile Tyr Arg Met Lys Va #l Pro Lys Ile Val Leu 130 # 135 # 140 Leu Ile Phe Val Ser Gly Lys Ile Val Ile Th #r Gly Ala Lys Met Arg 145 1 #50 1 #55 1 #60 Asp Glu Thr Tyr Lys Ala Phe Glu Asn Ile Ty #r Pro Val Leu Ser Glu 165 # 170 # 175 Phe Arg Lys Ile Gln Gln 180 (2) INFORMATION FOR SEQ ID NO:94: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 84 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:94: Asn Ser Asn Ser Thr Pro Ile Val His Leu Ly #s Gly Asp Ala Asn Thr 1 5 # 10 # 15 Leu Lys Cys Leu Arg Tyr Arg Phe Lys Lys Hi #s Cys Thr Leu Tyr Thr 20 # 25 # 30 Ala Val Ser Ser Thr Trp His Trp Thr Gly Hi #s Asn Val Lys His Lys 35 # 40 # 45 Ser Ala Ile Val Thr Leu Thr Tyr Asp Ser Gl #u Trp Gln Arg Asp Gln 50 # 55 # 60 Phe Leu Ser Gln Val Lys Ile Pro Lys Thr Il #e Thr Val Ser Thr Gly 65 #70 #75 #80 Phe Met Ser Ile (2) INFORMATION FOR SEQ ID NO:95: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 84 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:95: Asn Ser Asn Thr Thr Pro Ile Val His Leu Ly #s Gly Asp Ala Asn Thr 1 5 # 10 # 15 Leu Lys Cys Leu Arg Tyr Arg Phe Lys Lys Hi #s Cys Thr Leu Tyr Thr 20 # 25 # 30 Ala Val Ser Ser Thr Trp His Trp Thr Gly Hi #s Asn Val Lys His Lys 35 # 40 # 45 Ser Ala Ile Val Thr Leu Thr Tyr Asp Ser Gl #u Trp Gln Arg Asp Gln 50 # 55 # 60 Phe Leu Ser Gln Val Lys Ile Pro Lys Thr Il #e Thr Val Ser Thr Gly 65 #70 #75 #80 Phe Met Ser Ile (2) INFORMATION FOR SEQ ID NO:96: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 83 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:96: Ser Gly Asn Thr Thr Pro Ile Ile His Leu Ly #s Gly Asp Arg Asn Ser 1 5 # 10 # 15 Leu Lys Cys Leu Arg Tyr Arg Leu Arg Lys Hi #s Ser Asp His Tyr Arg 20 # 25 # 30 Asp Ile Ser Ser Thr Trp His Trp Thr Gly Al #a Gly Asn Glu Lys Thr 35 # 40 # 45 Gly Ile Leu Thr Val Thr Tyr His Ser Glu Th #r Gln Arg Thr Lys Phe 50 # 55 # 60 Leu Asn Thr Val Ala Ile Pro Asp Ser Val Gl #n Ile Leu Val Gly Tyr 65 #70 #75 #80 Met Thr Met (2) INFORMATION FOR SEQ ID NO:97: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 84 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:97: Ser Gly Asn Thr Ala Pro Ile Val His Leu Ly #s Gly Glu Ser Asn Ser 1 5 # 10 # 15 Leu Lys Cys Leu Arg Tyr Arg Leu Lys Pro Ty #r Lys Glu Leu Tyr Ser 20 # 25 # 30 Ser Met Ser Ser Thr Trp His Trp Thr Ser As #p Asn Lys Asn Ser Lys 35 # 40 # 45 Asn Gly Ile Val Thr Val Thr Phe Val Thr Gl #u Gln Gln Gln Gln Met 50 # 55 # 60 Phe Leu Gly Thr Val Lys Ile Pro Pro Thr Va #l Gln Ile Ser Thr Gly 65 #70 #75 #80 Phe Met Thr Leu (2) INFORMATION FOR SEQ ID NO:98: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 89 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:98: Ser Gly Asn Thr Ser Cys Phe Ala Leu Ile Se #r Gly Thr Ala Asn Gln 1 5 # 10 # 15 Val Lys Cys Tyr Arg Phe Arg Val Lys Lys As #n His Arg His Arg Tyr 20 # 25 # 30 Glu Asn Cys Thr Thr Thr Trp Phe Thr Val Al #a Asp Asn Gly Ala Glu 35 # 40 # 45 Arg Gln Gly Gln Ala Gln Ile Leu Ile Thr Ph #e Gly Ser Pro Ser Gln 50 # 55 # 60 Arg Gln Asp Phe Leu Lys His Val Pro Leu Pr #o Pro Gly Met Asn Ile 65 #70 #75 #80 Ser Gly Phe Thr Ala Ser Leu Asp Phe 85 (2) INFORMATION FOR SEQ ID NO:99: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 7 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: C-terminal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:99: Ser Asn Lys Lys Thr Thr Ala 1 5 (2) INFORMATION FOR SEQ ID NO:100: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 4 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:100: Asn Ser Asn Thr 1 (2) INFORMATION FOR SEQ ID NO:101: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 4 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:101: Ser Gly Asn Thr 1 (2) INFORMATION FOR SEQ ID NO:102: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 6 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:102: Ser Ser Gly Ser Ser Gly 1 5 (2) INFORMATION FOR SEQ ID NO:103: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 15 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: internal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:103: Cys Tyr Pro Glu Ile Lys Asp Lys Glu Glu Va #l Gln Arg Lys Arg 1 5 # 10 # 15 (2) INFORMATION FOR SEQ ID NO:104: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 66 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: N-terminal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: Met Glu Gln Arg Ile Thr Leu Lys Asp Tyr Al #a Met Arg Phe Gly Gln 1 5 # 10 # 15 Thr Lys Thr Ala Lys Asp Leu Gly Val Tyr Gl #n Ser Ala Ile Asn Lys 20 # 25 # 30 Ala Ile His Ala Gly Arg Lys Ile Phe Leu Th #r Ile Asn Ala Asp Gly 35 # 40 # 45 Ser Val Tyr Ala Glu Glu Val Lys Pro Phe Pr #o Ser Asn Lys Lys Thr 50 # 55 # 60 Thr Ala 65 (2) INFORMATION FOR SEQ ID NO:105: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 66 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: N-terminal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:105: Met Glu Gln Glu Ile Thr Leu Lys Asp Tyr Al #a Met Arg Phe Gly Gln 1 5 # 10 # 15 Thr Lys Thr Ala Lys Asp Leu Gly Val Tyr Gl #n Ser Ala Ile Asn Lys 20 # 25 # 30 Ala Ile His Ala Gly Arg Lys Ile Phe Leu Th #r Ile Asn Ala Asp Gly 35 # 40 # 45 Ser Val Tyr Ala Glu Glu Val Lys Pro Phe Pr #o Ser Asn Lys Lys Thr 50 # 55 # 60 Thr Ala 65 (2) INFORMATION FOR SEQ ID NO:106: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 66 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: N-terminal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:106: Met Arg Gln Arg Ile Thr Leu Lys Asp Tyr Al #a Met Arg Phe Gly Gln 1 5 # 10 # 15 Thr Lys Thr Ala Lys Asp Leu Gly Val Tyr Gl #n Ser Ala Ile Asn Lys 20 # 25 # 30 Ala Ile His Ala Gly Arg Lys Ile Phe Leu Th #r Ile Asn Ala Asp Gly 35 # 40 # 45 Ser Val Tyr Ala Glu Glu Val Lys Pro Phe Pr #o Ser Asn Lys Lys Thr 50 # 55 # 60 Thr Ala 65 (2) INFORMATION FOR SEQ ID NO:107: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 96 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: N-terminal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: Ser Thr Lys Lys Lys Pro Leu Thr Gln Glu Gl #n Leu Glu Asp Ala Arg 1 5 # 10 # 15 Arg Leu Lys Ala Ile Tyr Glu Lys Lys Lys As #n Glu Leu Gly Leu Ser 20 # 25 # 30 Gln Glu Ser Val Ala Asp Lys Met Gly Met Gl #y Gln Ser Gly Val Gly 35 # 40 # 45 Ala Leu Phe Asn Gly Ile Asn Ala Leu Asn Al #a Tyr Asn Ala Ala Leu 50 # 55 # 60 Leu Ala Lys Ile Leu Lys Val Ser Val Glu Gl #u Phe Ser Pro Ser Ile 65 #70 #75 #80 Ala Arg Glu Ile Tyr Glu Met Tyr Glu Ala Va #l Ser Met Glu Pro Ser 85 # 90 # 95 (2) INFORMATION FOR SEQ ID NO:108: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 96 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: peptide (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: N-terminal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:108: Ser Thr Lys Lys Lys Pro Leu Thr Gln Glu Gl #n Leu Glu Asp Ala Arg 1 5 # 10 # 15 Arg Leu Lys Ala Ile Tyr Glu Lys Lys Lys As #n Glu Leu Gly Leu Ser 20 # 25 # 30 Gln Glu Ser Val Ala Asp Lys Met Gly Met Gl #y Gln Ser Gly Val Gly 35 # 40 # 45 Ala Leu Phe Asn Gly Ile Asn Ala Leu Asn Al #a Tyr Asn Ala Ala Leu 50 # 55 # 60 Leu Ala Lys Ile Leu Lys Val Ser Val Glu Gl #u Phe Ser Pro Ser Ile 65 #70 #75 #80 Ala Arg Glu Ile Tyr Glu Met Cys Glu Ala Va #l Ser Met Glu Pro Ser 85 # 90 # 95 (2) INFORMATION FOR SEQ ID NO:109: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 180 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: N-terminal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:109: Gly Ile Val Glu Gln Cys Cys Thr Ser Ile Cy #s Ser Leu Tyr Gln Leu 1 5 # 10 # 15 Glu Asn Tyr Cys Asn Met Ser Met Glu Gln Ar #g Ile Thr Leu Lys Asp 20 # 25 # 30 Tyr Ala Met Arg Phe Gly Gln Thr Lys Thr Al #a Lys Asp Leu Gly Val 35 # 40 # 45 Tyr Gln Ser Ala Ile Asn Lys Ala Ile His Al #a Gly Arg Lys Ile Phe 50 # 55 # 60 Leu Thr Ile Asn Ala Asp Gly Ser Val Tyr Al #a Glu Glu Val Lys Pro 65 #70 #75 #80 Phe Pro Ser Asn Lys Lys Thr Thr Ala Ser As #n Lys Lys Thr Thr Ala 85 # 90 # 95 Asn Ser Asn Thr Thr Pro Ile Val His Leu Ly #s Gly Asp Ala Asn Thr 100 # 105 # 110 Leu Lys Cys Leu Arg Tyr Arg Phe Lys Lys Hi #s Cys Thr Leu Tyr Thr 115 # 120 # 125 Ala Val Ser Ser Thr Trp His Trp Thr Gly Hi #s Asn Val Lys His Lys 130 # 135 # 140 Ser Ala Ile Val Thr Leu Thr Tyr Asp Ser Gl #u Trp Gln Arg Asp Gln 145 1 #50 1 #55 1 #60 Phe Leu Ser Gln Val Lys Ile Pro Lys Thr Il #e Thr Val Ser Thr Gly 165 # 170 # 175 Phe Met Ser Ile 180 (2) INFORMATION FOR SEQ ID NO:110: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 113 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: N-terminal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:110: Gly Ile Val Glu Gln Cys Cys Thr Ser Ile Cy #s Ser Leu Tyr Gln Leu 1 5 # 10 # 15 Glu Asn Tyr Cys Asn Met Ser Met Glu Gln Ar #g Ile Thr Leu Lys Asp 20 # 25 # 30 Tyr Ala Met Arg Phe Gly Gln Thr Lys Thr Al #a Lys Asp Leu Gly Val 35 # 40 # 45 Tyr Gln Ser Ala Ile Asn Lys Ala Ile His Al #a Gly Arg Lys Ile Phe 50 # 55 # 60 Leu Thr Ile Asn Ala Asp Gly Ser Val Tyr Al #a Glu Glu Val Lys Pro 65 #70 #75 #80 Phe Pro Ser Asn Lys Lys Thr Thr Ala Ser As #n Lys Lys Thr Thr Ala 85 # 90 # 95 Cys Asp Thr Asp Asp Arg His Arg Ile Glu Gl #u Lys Arg Lys Arg Lys 100 # 105 # 110 Thr (2) INFORMATION FOR SEQ ID NO:111: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 292 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: N-terminal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:111: Phe Val Asn Gln His Leu Cys Gly Ser His Le #u Val Glu Ala Leu Tyr 1 5 # 10 # 15 Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Th #r Pro Lys Thr Met Ser 20 # 25 # 30 Met Glu Gln Glu Ile Thr Leu Lys Asp Tyr Al #a Met Arg Phe Gly Gln 35 # 40 # 45 Thr Lys Thr Ala Lys Asp Leu Gly Val Tyr Gl #n Ser Ala Ile Asn Lys 50 # 55 # 60 Ala Ile His Ala Gly Arg Lys Ile Phe Leu Th #r Ile Asn Ala Asp Gly 65 #70 #75 #80 Ser Val Tyr Ala Glu Glu Val Lys Pro Phe Pr #o Ser Asn Lys Lys Thr 85 # 90 # 95 Thr Ala Ser Asn Lys Lys Thr Thr Ala Ser Se #r Gly Ser Ser Gly Ser 100 # 105 # 110 Gly Ile Val Pro Gln Leu Gln Asn Ile Val Se #r Thr Val Asn Leu Gly 115 # 120 # 125 Cys Lys Leu Asp Leu Lys Thr Ile Ala Leu Ar #g Ala Arg Asn Ala Glu 130 # 135 # 140 Tyr Asn Pro Lys Arg Phe Ala Ala Val Ile Me #t Arg Ile Arg Glu Pro 145 1 #50 1 #55 1 #60 Arg Thr Thr Ala Leu Ile Phe Ser Ser Gly Ly #s Met Val Cys Thr Gly 165 # 170 # 175 Ala Lys Ser Glu Glu Gln Ser Arg Leu Ala Al #a Arg Lys Tyr Ala Arg 180 # 185 # 190 Val Val Gln Lys Leu Gly Phe Pro Ala Lys Ph #e Leu Asp Phe Lys Ile 195 # 200 # 205 Gln Asn Met Val Gly Ser Cys Asp Val Lys Ph #e Pro Ile Arg Leu Glu 210 # 215 # 220 Gly Leu Val Leu Thr His Gln Gln Phe Ser Se #r Tyr Glu Pro Glu Leu 225 2 #30 2 #35 2 #40 Phe Pro Gly Leu Ile Tyr Arg Met Ile Lys Pr #o Arg Ile Val Leu Leu 245 # 250 # 255 Ile Phe Val Ser Gly Lys Val Val Leu Thr Gl #y Ala Lys Val Arg Ala 260 # 265 # 270 Glu Ile Tyr Glu Ala Phe Glu Asn Ile Tyr Pr #o Ile Leu Lys Gly Phe 275 # 280 # 285 Arg Lys Thr Thr 290 (2) INFORMATION FOR SEQ ID NO:112: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 273 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: N-terminal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:112: Phe Val Asn Gln His Leu Cys Gly Ser His Le #u Val Glu Ala Leu Tyr 1 5 # 10 # 15 Leu Val Cys Gly Glu Arg Gly Phe Phe Tyr Th #r Pro Lys Thr Met Ser 20 # 25 # 30 Met Arg Gln Arg Ile Thr Leu Lys Asp Tyr Al #a Met Arg Phe Gly Gln 35 # 40 # 45 Thr Lys Thr Ala Lys Asp Leu Gly Val Tyr Gl #n Ser Ala Ile Asn Lys 50 # 55 # 60 Ala Ile His Ala Gly Arg Lys Ile Phe Leu Th #r Ile Asn Ala Asp Gly 65 #70 #75 #80 Ser Val Tyr Ala Glu Glu Val Lys Pro Phe Pr #o Ser Asn Lys Lys Thr 85 # 90 # 95 Thr Ala Ser Asn Lys Lys Thr Thr Ala Gly As #p Pro Gly Lys Lys Lys 100 # 105 # 110 Gln His Ile Cys His Ile Gln Gly Cys Gly Ly #s Val Tyr Gly Lys Thr 115 # 120 # 125 Ser His Leu Arg Ala His Leu Arg Trp His Th #r Gly Glu Arg Pro Phe 130 # 135 # 140 Met Cys Thr Trp Ser Tyr Cys Gly Lys Arg Ph #e Thr Arg Ser Asp Glu 145 1 #50 1 #55 1 #60 Leu Gln Arg His Lys Arg Thr His Thr Gly Gl #u Lys Lys Phe Ala Cys 165 # 170 # 175 Pro Glu Cys Pro Lys Arg Phe Met Arg Ser As #p His Leu Ser Lys His 180 # 185 # 190 Ile Lys Thr His Gln Asn Lys Lys Gly Gly Pr #o Gly Val Ala Leu Ser 195 # 200 # 205 Val Gly Thr Leu Pro Leu Asp Ser Gly Ala Gl #y Ser Glu Gly Ser Gly 210 # 215 # 220 Thr Ala Thr Pro Ser Ala Leu Ile Thr Thr As #n Met Val Ala Met Glu 225 2 #30 2 #35 2 #40 Ala Ile Cys Pro Glu Gly Ile Ala Arg Leu Al #a Asn Ser Gly Ile Asn 245 # 250 # 255 Val Met Gln Val Ala Asp Leu Gln Ser Ile As #n Ile Ser Gly Asn Gly 260 # 265 # 270 Phe (2) INFORMATION FOR SEQ ID NO:113: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 421 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: N-terminal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:113: Gln Leu Tyr Ser Ala Leu Ala Asn Lys Cys Cy #s His Val Gly Cys Ile 1 5 # 10 # 15 Lys Arg Ser Leu Ala Arg Phe Cys Met Ser Me #t Arg Gln Arg Ile Thr 20 # 25 # 30 Leu Lys Asp Tyr Ala Met Arg Phe Gly Gln Th #r Lys Thr Ala Lys Asp 35 # 40 # 45 Leu Gly Val Tyr Gln Ser Ala Ile Asn Lys Al #a Ile His Ala Gly Arg 50 # 55 # 60 Lys Ile Phe Leu Thr Ile Asn Ala Asp Gly Se #r Val Tyr Ala Glu Glu 65 #70 #75 #80 Val Lys Pro Phe Pro Ser Asn Lys Lys Thr Th #r Ala Ser Asn Lys Lys 85 # 90 # 95 Thr Thr Ala Met Ala Asp Asp Asp Pro Tyr Gl #y Thr Gly Gln Met Phe 100 # 105 # 110 His Leu Asn Thr Ala Leu Thr His Ser Ile Ph #e Asn Ala Glu Leu Tyr 115 # 120 # 125 Ser Pro Glu Ile Pro Leu Ser Thr Asp Gly Pr #o Tyr Leu Gln Ile Leu 130 # 135 # 140 Glu Gln Pro Lys Gln Arg Gly Phe Arg Phe Ar #g Tyr Val Cys Glu Gly 145 1 #50 1 #55 1 #60 Pro Ser His Gly Gly Leu Pro Gly Ala Ser Se #r Glu Lys Asn Lys Lys 165 # 170 # 175 Ser Tyr Pro Gln Val Lys Ile Cys Asn Tyr Va #l Gly Pro Ala Lys Val 180 # 185 # 190 Ile Val Gln Leu Val Thr Asn Gly Lys Asn Il #e His Leu His Ala His 195 # 200 # 205 Ser Leu Val Gly Lys His Cys Glu Asp Gly Va #l Cys Thr Val Thr Ala 210 # 215 # 220 Gly Pro Lys Asp Met Val Val Gly Phe Ala As #n Leu Gly Ile Leu His 225 2 #30 2 #35 2 #40 Val Thr Lys Lys Lys Val Phe Glu Thr Leu Gl #u Ala Arg Met Thr Glu 245 # 250 # 255 Ala Cys Ile Arg Gly Tyr Asn Pro Gly Leu Le #u Val His Ser Asp Leu 260 # 265 # 270 Ala Tyr Leu Gln Ala Glu Gly Gly Gly Asp Ar #g Gln Leu Thr Asp Arg 275 # 280 # 285 Glu Lys Glu Ile Ile Arg Gln Ala Ala Val Gl #n Gln Thr Lys Glu Met 290 # 295 # 300 Asp Leu Ser Val Val Arg Leu Met Phe Thr Al #a Phe Leu Pro Asp Ser 305 3 #10 3 #15 3 #20 Thr Gly Ser Phe Thr Arg Arg Leu Glu Pro Va #l Val Ser Asp Ala Ile 325 # 330 # 335 Tyr Asp Ser Lys Ala Pro Asn Ala Ser Asn Le #u Lys Ile Val Arg Met 340 # 345 # 350 Asp Arg Thr Ala Gly Cys Val Thr Gly Gly Gl #u Glu Ile Tyr Leu Leu 355 # 360 # 365 Cys Asp Lys Val Gln Lys Asp Asp Ile Gln Il #e Arg Phe Tyr Glu Glu 370 # 375 # 380 Glu Glu Asn Gly Gly Val Trp Glu Gly Phe Gl #y Asp Phe Ser Pro Thr 385 3 #90 3 #95 4 #00 Asp Val His Arg Gln Phe Ala Ile Val Phe Ly #s Thr Pro Lys Tyr Lys 405 # 410 # 415 Asp Val Asn Ile Thr 420 (2) INFORMATION FOR SEQ ID NO:114: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 391 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: N-terminal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:114: Met Arg Gln Arg Ile Thr Leu Lys Asp Tyr Al #a Met Arg Phe Gly Gln 1 5 # 10 # 15 Thr Lys Thr Ala Lys Asp Leu Gly Val Tyr Gl #n Ser Ala Ile Asn Lys 20 # 25 # 30 Ala Ile His Ala Gly Arg Lys Ile Phe Leu Th #r Ile Asn Ala Asp Gly 35 # 40 # 45 Ser Val Tyr Ala Glu Glu Val Lys Pro Phe Pr #o Ser Asn Lys Lys Thr 50 # 55 # 60 Thr Ala Met Ala Glu Asp Asp Pro Tyr Leu Gl #y Arg Pro Glu Gln Met 65 #70 #75 #80 Phe His Leu Asp Pro Ser Leu Thr His Thr Il #e Phe Asn Pro Glu Val 85 # 90 # 95 Phe Gln Pro Gln Met Ala Leu Pro Thr Ala As #p Gly Pro Tyr Leu Gln 100 # 105 # 110 Ile Leu Glu Gln Pro Lys Gln Arg Gly Phe Ar #g Phe Arg Tyr Val Cys 115 # 120 # 125 Glu Gly Pro Ser His Gly Gly Leu Pro Gly Al #a Ser Ser Glu Lys Asn 130 # 135 # 140 Lys Lys Ser Tyr Pro Gln Val Lys Ile Cys As #n Tyr Val Gly Pro Ala 145 1 #50 1 #55 1 #60 Lys Val Ile Val Gln Leu Val Thr Asn Gly Ly #s Asn Ile His Leu His 165 # 170 # 175 Ala His Ser Leu Val Gly Lys His Cys Glu As #p Gly Ile Cys Thr Val 180 # 185 # 190 Thr Ala Gly Pro Glu Asp Cys Val His Gly Ph #e Ala Asn Leu Gly Ile 195 # 200 # 205 Leu His Val Thr Lys Lys Lys Val Phe Glu Th #r Leu Glu Ala Arg Met 210 # 215 # 220 Thr Glu Ala Cys Ile Arg Gly Tyr Asn Pro Gl #y Leu Leu Val His Pro 225 2 #30 2 #35 2 #40 Asp Leu Ala Tyr Leu Gln Ala Glu Gly Gly Gl #y Asp Arg Gln Leu Gly 245 # 250 # 255 Asp Arg Glu Lys Glu Leu Ile Arg Gln Ala Al #a Leu Gln Gln Thr Lys 260 # 265 # 270 Glu Met Asp Leu Ser Val Val Arg Leu Met Ph #e Thr Ala Phe Leu Pro 275 # 280 # 285 Asp Ser Thr Gly Ser Phe Thr Arg Arg Leu Gl #u Pro Val Val Ser Asp 290 # 295 # 300 Ala Ile Tyr Asp Ser Lys Ala Pro Asn Ala Se #r Asn Leu Lys Ile Val 305 3 #10 3 #15 3 #20 Arg Met Asp Arg Thr Ala Gly Cys Val Thr Gl #y Gly Glu Glu Ile Tyr 325 # 330 # 335 Leu Leu Cys Asp Lys Val Gln Lys Asp Asp Il #e Gln Ile Arg Phe Tyr 340 # 345 # 350 Glu Glu Glu Glu Asn Gly Gly Val Trp Glu Gl #y Phe Gly Asp Phe Ser 355 # 360 # 365 Pro Thr Asp Val His Arg Gln Phe Ala Ile Va #l Phe Lys Thr Pro Lys 370 # 375 # 380 Tyr Lys Asp Ile Asn Ile Thr 385 3 #90 (2) INFORMATION FOR SEQ ID NO:115: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 391 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: N-terminal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:115: Met Glu Gln Glu Ile Thr Leu Lys Asp Tyr Al #a Met Arg Phe Gly Gln 1 5 # 10 # 15 Thr Lys Thr Ala Lys Asp Leu Gly Val Tyr Gl #n Ser Ala Ile Asn Lys 20 # 25 # 30 Ala Ile His Ala Gly Arg Lys Ile Phe Leu Th #r Ile Asn Ala Asp Gly 35 # 40 # 45 Ser Val Tyr Ala Glu Glu Val Lys Pro Phe Pr #o Ser Asn Lys Lys Thr 50 # 55 # 60 Thr Ala Met Ala Glu Asp Asp Pro Tyr Leu Gl #y Arg Pro Glu Gln Met 65 #70 #75 #80 Phe His Leu Asp Pro Ser Leu Thr His Thr Il #e Phe Asn Pro Glu Val 85 # 90 # 95 Phe Gln Pro Gln Met Ala Leu Pro Thr Ala As #p Gly Pro Tyr Leu Gln 100 # 105 # 110 Ile Leu Glu Gln Pro Lys Gln Arg Gly Phe Ar #g Phe Arg Tyr Val Cys 115 # 120 # 125 Glu Gly Pro Ser His Gly Gly Leu Pro Gly Al #a Ser Ser Glu Lys Asn 130 # 135 # 140 Lys Lys Ser Tyr Pro Gln Val Lys Ile Cys As #n Tyr Val Gly Pro Ala 145 1 #50 1 #55 1 #60 Lys Val Ile Val Gln Leu Val Thr Asn Gly Ly #s Asn Ile His Leu His 165 # 170 # 175 Ala His Ser Leu Val Gly Lys His Cys Glu As #p Gly Ile Cys Thr Val 180 # 185 # 190 Thr Ala Gly Pro Glu Asp Cys Val His Gly Ph #e Ala Asn Leu Gly Ile 195 # 200 # 205 Leu His Val Thr Lys Lys Lys Val Phe Glu Th #r Leu Glu Ala Arg Met 210 # 215 # 220 Thr Glu Ala Cys Ile Arg Gly Tyr Asn Pro Gl #y Leu Leu Val His Pro 225 2 #30 2 #35 2 #40 Asp Leu Ala Tyr Leu Gln Ala Glu Gly Gly Gl #y Asp Arg Gln Leu Gly 245 # 250 # 255 Asp Arg Glu Lys Glu Leu Ile Arg Gln Ala Al #a Leu Gln Gln Thr Lys 260 # 265 # 270 Glu Met Asp Leu Ser Val Val Arg Leu Met Ph #e Thr Ala Phe Leu Pro 275 # 280 # 285 Asp Ser Thr Gly Ser Phe Thr Arg Arg Leu Gl #u Pro Val Val Ser Asp 290 # 295 # 300 Ala Ile Tyr Asp Ser Lys Ala Pro Asn Ala Se #r Asn Leu Lys Ile Val 305 3 #10 3 #15 3 #20 Arg Met Asp Arg Thr Ala Gly Cys Val Thr Gl #y Gly Glu Glu Ile Tyr 325 # 330 # 335 Leu Leu Cys Asp Lys Val Gln Lys Asp Asp Il #e Gln Ile Arg Phe Tyr 340 # 345 # 350 Glu Glu Glu Glu Asn Gly Gly Val Trp Glu Gl #y Phe Gly Asp Phe Ser 355 # 360 # 365 Pro Thr Asp Val His Arg Gln Phe Ala Ile Va #l Phe Lys Thr Pro Lys 370 # 375 # 380 Tyr Lys Asp Ile Asn Ile Thr 385 3 #90 (2) INFORMATION FOR SEQ ID NO:116: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 241 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: N-terminal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:116: Met Arg Gln Arg Ile Thr Leu Lys Asp Tyr Al #a Met Arg Phe Gly Gln 1 5 # 10 # 15 Thr Lys Thr Ala Lys Asp Leu Gly Val Tyr Gl #n Ser Ala Ile Asn Lys 20 # 25 # 30 Ala Ile His Ala Gly Arg Lys Ile Phe Leu Th #r Ile Asn Ala Asp Gly 35 # 40 # 45 Ser Val Tyr Ala Glu Glu Val Lys Pro Phe Pr #o Ser Asn Lys Lys Thr 50 # 55 # 60 Thr Ala Ser Asn Lys Lys Thr Thr Ala Gly As #p Pro Gly Lys Lys Lys 65 #70 #75 #80 Gln His Ile Cys His Ile Gln Gly Cys Gly Ly #s Val Tyr Gly Lys Thr 85 # 90 # 95 Ser His Leu Arg Ala His Leu Arg Trp His Th #r Gly Glu Arg Pro Phe 100 # 105 # 110 Met Cys Thr Trp Ser Tyr Cys Gly Lys Arg Ph #e Thr Arg Ser Asp Glu 115 # 120 # 125 Leu Gln Arg His Lys Arg Thr His Thr Gly Gl #u Lys Lys Phe Ala Cys 130 # 135 # 140 Pro Glu Cys Pro Lys Arg Phe Met Arg Ser As #p His Leu Ser Lys His 145 1 #50 1 #55 1 #60 Ile Lys Thr His Gln Asn Lys Lys Gly Gly Pr #o Gly Val Ala Leu Ser 165 # 170 # 175 Val Gly Thr Leu Pro Leu Asp Ser Gly Ala Gl #y Ser Glu Gly Ser Gly 180 # 185 # 190 Thr Ala Thr Pro Ser Ala Leu Ile Thr Thr As #n Met Val Ala Met Glu 195 # 200 # 205 Ala Ile Cys Pro Glu Gly Ile Ala Arg Leu Al #a Asn Ser Gly Ile Asn 210 # 215 # 220 Val Met Gln Val Ala Asp Leu Gln Ser Ile As #n Ile Ser Gly Asn Gly 225 2 #30 2 #35 2 #40 Phe (2) INFORMATION FOR SEQ ID NO:117: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 10 base #pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: both (D) TOPOLOGY: linear (ii) MOLECULE TYPE: cDNA (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (xi) SEQUENCE DESCRIPTION: SEQ ID NO:117: GGGAMTNYCC # # # 10 (2) INFORMATION FOR SEQ ID NO:118: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 72 amino #acids (B) TYPE: amino acid (D) TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (v) FRAGMENT TYPE: N-terminal (xi) SEQUENCE DESCRIPTION: SEQ ID NO:118: Met Glu Pro Val Asp Pro Arg Leu Glu Pro Tr #p Lys His Pro Gly Ser 1 5 # 10 # 15 Gln Pro Lys Thr Ala Cys Thr Asn Cys Tyr Cy #s Lys Lys Cys Cys Phe 20 # 25 # 30 His Cys Gln Val Cys Phe Ile Thr Lys Ala Le #u Gly Ile Ser Tyr Gly 35 # 40 # 45 Arg Lys Lys Arg Arg Gln Arg Arg Arg Ala Hi #s Gln Asn Ser Gln Thr 50 # 55 # 60 His Gln Ala Ser Leu Ser Lys Gln 65 #70
Claims (51)
1. A probe nucleic acid (PNA) comprising:
(a) a single-stranded sequence, ½ TBR, which is capable of forming under hybridizing conditions, a hybrid, TBR, with a ½ TBR present in a target nucleic acid (TNA);
(b) a single stranded sequence, ½ BBR, which is capable of forming, under hybridizing conditions, a hybrid BBR, with about 0-10 ½ BBR present in a booster nucleic acid (BNA); and
(c) an OSA, which is no attached support and/or indicator, or an attached support or other means of localization, including, but not limited to, attachment to beads, polymers, and surfaces, and/or indicators;
wherein said TBR is capable of binding with high affinity to a TBA, said TBA being a substance capable of discriminating between a paired TBR and a TBR having unpaired nucleotides, and further, wherein said BBR is capable of binding with high affinity to a BBA, said BBA being a substance capable of discriminating between a paired BBR and a BBR having unpaired nucleotides.
2. A booster nucleic acid (DNA) comprising:
(a) a ½ BBR which has a sequence which is complementary to a ½ BBR sequence in a PNA or another BNA and which is capable of forming, under hybridizing conditions a hybrid, BBR, with the PNA;
(b) an OSA, which is no attached support or indicator or an attached support or other means of localization, including, but not limited to, attachment to beads, polymers, and surfaces, and/or indicators; and
(c) additional hybridization sites, ½ BBRs, for hybridization with additional BNAs;
wherein said BBR is capable of binding with high affinity to a BBA, said BBA being a substance capable of discriminating between a paired BBR and a BBR having unpaired nucleotides.
3. A Hairpin Nucleic Acid (HNA) comprising a single-stranded sequence, ½ BBR, which under hybridizing condition is capable of forming a hairpin while at the same time binding to a BNA to form a BBR capable of binding a BBA, wherein said BBR is capable of binding with high affinity to a BBA, said being a substance capable of discriminating between a perfect BBR and a BBR having impaired nucleotides.
4. The PNA of claim 1 wherein the TBR is comprised of one or more recognition sites for a nucleic acid binding protein, a DNA binding protein, a DNA-RNA hybrid binding protein or an RNA binding protein.
5. The PNA of claim 4 wherein the TBR is a nucleic acid binding protein recognition site present in the genome of a pathogen or is a binding site associated with a pathogenic condition in a vertebrate genome is a nucleic acid binding protein recognition site present in the genome of an organism which contaminates a fermentation process.
6. The PNA of claim 4 wherein the TBR is the HIV-LTR or a portion thereof.
7. A method for detecting or localizing a specific TNA sequence, comprising the steps of:
(a) hybridizing said TNA with the PNA of claim 1;
(b) hybridizing said PNA with a BNA containing a ½ BBR whose sequence is complementary to a ½ BBR sequence in the PNA;
(c) adding the products of steps (a) and (b) containing a TBR and a BBR, to a surface, liquid or other medium containing a TBA;
(d) adding BBAs to the mixture in step (c) wherein said BBA comprises:
(i) molecule or a portion of a molecule which is capable of selectively binding to a BBR;
(ii) a detectible indicator; and
(e) detecting signal produced by the indicator attached to the BBA.
8. The method of claim 7 wherein said indicator is a protein, including enzymes capable of catalyzing reactions leading to production of colored reaction products; a radionuclide; colored beads.
9. A method of detecting the presence in a sample of a specific Target Nucleic Acid, TNA, which comprises:
(a) contacting said sample with a Probe Nucleic Acid, PNA, which, upon hybridization with said TNA if present in said sample, forms a Target Binding Region, TBR, which is capable of binding a Target Binding Assembly, TBA; and
(b) contacting said sample, already in contact with said PNA, with a TBA capable of binding to any TBRs formed by the hybridization of said PNA and said TNA in the sample.
10. A method for detecting or localizing specific nucleic acid sequences with a high degree of sensitivity and specificity which comprises:
(a) adding PNAs containing a ½ BBR and a ½ TBR to a sample containing or suspected of containing TNAs containing ½ TBR sequences, to form a complex having target binding regions, TBRs, formed by the hybridization of complementary ½ TBRs present in the PNAs and TNAs respectively;
(b) binding the TBRs formed in step (a) to an immobilized TBA to form a TBA-TNA-PNA complex;
(c) adding Booster Nucleic Acids, BNAs, containing booster binding regions, ½ BBRs, to the complex formed in step (b) such that the ½ BBRs in the BNAs hybridize with the ½ BBR present in the PNAs or to ½ BBRs present in BNAs already bound to the PNA, to form BBRs, such that TBA-TNA-PNA-(BNA)n complexes are formed;
(d) adding Hairpin Nucleic Acids, HNAs, containing ½ BBR sequences, to the complex formed in step (c) such that the ½ BBRs in the HNAs hybridize with any available ½ BBR sequences present in the BNAs of the complex of step (c), thereby capping the extension of the BNAs onto the TBA-TNA-PNA-(BNA)n complexes of step (c) to form TBA-TNA-PNA-(BNA)n-HNA complexes;
(e) adding Booster Binding Assemblies, BBAs, linked to indicator moieties, to the TBA-TNA-PNA-(BNA)n-HNA complexes formed in step (d) to form TBA-TNA-PNA-(BNA-BBA)n-HNA complexes; and
(f) detecting the signals produced by the indicator moieties linked to the TBAs, PNAs, BNAs, BBAs or HNAs in the TBA-TNA-PNA-(BNA-BBA)n-HNA complexes of step (e);
wherein the TNA comprises:
(i) one or more specific ½ TBR nucleic acid sequences, the presence or absence of which in a particular sample is to be confirmed;
the PNA comprises:
(i) a single/stranded sequence, ½ TBR, which is capable of forming, under hybridizing conditions, a hybrid, TBR, with a ½ TBR present in a target nucleic acid (TNA);
(ii) a single stranded sequence, ½ BBR, which is capable of forming, under hybridizing conditions, a hybrid BBR with a ½ BBR present in a booster nucleic acid (BNA); and
(iii) an OSA, which is no attached support and/or indicator, or an attached support or other means of localization, including, but not limit to, attachment to beads, polymers, and surfaces, and/or indicators;
the BNA comprises:
(i) a ½ BBR, as shown in FIG. 1(IIb), which has a sequence which is complementary to a ½ BBR sequence in a PNA and which is capable of forming, under hybridizing conditions, a hybrid, BBR, with the PNA;
(ii) an OSA, which is no attached support or indicator or an attached support or other means of localization, including, but not limited to, attachment to beads, polymers, and surfaces, and/or indicators;
(iii) additional hybridization sites, ½ BBRs, for other BNAs; and
(iv) sequences, ½ BBRs, which can hybridize to BNAs already hybridized to the PNA;
the BBA comprises:
(i) a molecule or a portion of a molecule which is capable of selectively binding to a BBR; and
(ii) an OSA, which is no attached support and/or indicator, or an attached support or other means of localization, including, but not limited to, attachment to beads, polymers, and surfaces, and/or indicators;
and the TBA comprises:
(i) a molecule or a portion of a molecule which is capable of selectively binding to a TBR; and
(ii) no attached support and/or indicator, or an attached support or other means of localization, including, but not limited to, attachment to beads, polymers, and surfaces, and/or indicators.
11. In a solid phase hybridization method for detecting the presence of a target polynucleotide involving: immobilizing a target polynucleotide, if present in a test sample, directly or via an intermediate capture structure, on a solid phase at a capture site; before, during or after said immobilization, attaching a detectable label to said target polynucleotide, if present; and detecting said label, if any, at said capture site; the improvement comprising:
(a) using a Target Binding Assembly, TBA, as the means for achieving immobilization of said target polynucleotide, wherein said TBA binds only to a unique hybrid formed between a specific Probe Nucleic Acid, PNA, and said target nucleic acid such that a perfect Target Binding Region, TBR, recognizable by said TBA is formed; and
(b) including in the PNA a single stranded sequence, ½ BBR, capable of binding a Booster Nucleic Acid, BNA, containing a single stranded complementary ½ BBR which, upon hybridization with the ½ BBR in the PNA, forms a BBR capable of binding labeled Booster Binding Assemblies, BBAs.
12. A target binding assembly, TBA, or a booster binding assembly, BBA, comprising at least one nucleic acid recognition unit, and optional one or all of the sequences selected from the group consisting of a linker sequence, an assembly sequence, an asymmetry sequence, a nuclear localization signal sequence (NLS) and an OSA.
13. The TBA of claim 12 wherein the nucleic acid recognition unit is selected from the group consisting of an NF-kB binding unit, an SP1 binding unit, a TATA binding unit, a human papillomavirus E2 binding unit, an HPV LTR binding unit, an HIV LTR binding unit, and Tat.
14. The TBA of claim 13 wherein the nucleic acid recognition unit has the sequence selected from the group consisting of SEQ ID NO. 63, SEQ ID NO. 64, SEQ ID NO. 65, SEQ ID NO. 66, SEQ ID NO. 67, SEQ ID NO. 68 SEQ ID NO. 69, SEQ ID NO. 70, SEQ ID NO. 71, SEQ ID NO. 72, SEQ ID NO. 73, SEQ ID NO. 74, SEQ ID NO. 75, SEQ ID NO. 76, SEQ ID NO. 77, SEQ ID NO. 78, SEQ ID NO. 79, SEQ ID NO. 80, SEQ ID NO. 81, SEQ ID NO. 82, SEQ ID NO. 83, SEQ ID NO. 84, SEQ ID NO. 93, SEQ ID NO. 94, SEQ ID NO. 95, SEQ ID NO. 96, SEQ ID NO. 97,SEQ ID NO. 98, and SEQ ID NO. 118.
15. The TBA of claim 12 wherein the linker sequence is an oligopeptide which does not interfere with the nucleic acid recognition function of the nucleic acid recognition unit and which provides stability and control over the spacing of the nucleic acid recognition unit from the remainder of the TBA.
16. The TBA of claim 15 wherein the linker sequence is an oligopeptide sequence from the interdomain primary of a structural protein.
17. The TBA of claim 12 wherein the assembly sequence is an oligopeptide sequence which directs the folding and association of nucleic acid recognition units.
18. The TBA of claim 17 wherein the assembly sequence is derived from the bacteriophage lambda cro protein or the CI protein and is selected from the group consisting of SEQ ID NO. 104, SEQ ID NO. 105, SEQ IN NO. 106 , SEQ ID NO. 107, and SEQ ID NO. 108.
19. The TBA of claim 12 wherein the asymmetry sequence directs the association of nucleic acid recognition and assembly sequences in a predetermined order.
20. The TBA of claim 19 wherein the asymmetry sequence is derived from insulin, gonadotropic hormone, FSH, HCG, LH, ACTH, or relaxin.
21. The TBA of claim 20 wherein the asymmetry sequence is selected from the group consisting of SEQ ID NO. 85, SEQ ID NO. 86, SEQ ID NO. 87, SEQ ID NO. 88, SEQ ID NO. 89, SEQ ID NO. 90, SEQ ID NO. 91, and SEQ ID NO. 92.
22. The TBA of claim 12 wherein the NLS is an oligopeptide which directs the migration and uptake of a protein or complex associated with said NLS into the nucleus of a cell.
23. The TBA of claim 22 wherein the NLS is selected from the group consisting of SEQ ID NO. 72 and SEQ ID NO. 103.
24. The TBA of claim 12 which is HIV Detect I-IV or HPV Detect I-IV.
25. The TBA of claim 12 having a sequence selected from the group consisting of SEQ ID NO. 109, SEQ ID NO. 110, SEQ ID NO. 111, SEQ ID NO. 112, SEQ ID NO. 113, SEQ ID NO. 114, SEQ ID NO. 115, SEQ ID NO. 116.
26. A method of using the TBA of claim 12 to bind a particular nucleic acid sequence in a target nucleic acid sample which comprises:
(a) fragmenting the nucleic acid in the target nucleic acid sample;
(b) contacting, under hybridizing conditions, the fragmented nucleic acid with a probe nucleic acid complementary to the particular nucleic acid sequence of interest, wherein said probe nucleic acid, upon hybridization with said particular nucleic acid sequence of interest forms a target binding region to which said TBA specifically binds.
27. The method of claim 26 wherein said probe nucleic acid, in addition to sequences complementary to said particular nucleic acid sequence of interest, also has additional sequences to which a booster nucleic acid can bind to form a booster binding site to which a labeled booster binding assembly can bind to provide a signal showing and amplifying the binding of the probe nucleic acid to the target nucleic acid sequence of interest.
28. A method of using the TBA of claim 12 wherein said TBA is administered to patient in need of such treatment a therapeutically or prophylactically effective amount of said TBA, which comprises administering the TBA, either in the form of a purified protein complex or in the form of a recombinant vector which, upon entry into the patient is able to express the TBA, such that the TBA binds the particular nucleic acid sequence to achieve the desired prophylactic or therapeutic result.
29. The method of claim 28 wherein said TBA is selected from the group consisting of SEQ ID NO. 109, SEQ ID NO. 110, SEQ ID NO. 111, SEQ ID NO. 112, SEQ ID NO. 113, SEQ ID NO. 114, SEQ ID NO. 115, and SEQ ID NO. 116, and the patient is infected with HIV or HPV.
30. The method of claim 26 further comprising the step of:
(c) monitoring the shift in mobility of nucleic acids in the target nucleic acid sample as a function of the size such that binding of the TBA to a particular fragment in the sample modifies the mobility of the fragment.
31. A diagnostic or forensic test kit for the detection in a sample of nucleic acid having a specific sequence composition, which comprises:
(a) a first nucleic acid probe complementary to nucleic acid with specific sequence composition, the presence of which is to be ascertained in a test sample, wherein said first nucleic acid probe and said nucleic acid with specific sequence composition forming, upon hybridization, a binding site for a first nucleic acid binding protein, and wherein said first nucleic acid probe further comprises additional sequence complementary to a second nucleic acid probe;
(b) a first nucleic acid binding protein specific for the duplex formed by hybridization of said first nucleic acid probe and said nucleic acid with specific sequence composition;
(c) a second nucleic acid probe complementary to said additional sequence in said first nucleic acid probe, wherein, upon hybridization of said first and second nucleic acid probes, a binding site for a second nucleic acid binding protein is formed;
(d) a second nucleic acid binding protein which binds specifically to the duplex formed upon hybridization of said first nucleic acid probe and said second nucleic acid probe, wherein said second nucleic acid binding protein is labeled with a detetable label.
32. The diagnostic or forensic test kit of claim 31 wherein said first nucleic acid probe is complementary to the HIV LTR, such that upon hybridization of said first nucleic acid probe with an HIV LTR, a binding site is formed for NF-kB or a subunit thereof, SP1, TATA binding protein, HIV-Detect I, II, III, or IV, or HIV-Lock.
33. The diagnostic or forensic test kit of claim 32 wherein said first nucleic acid binding protein is NF-kB or a subunit thereon SP1, TATA binding protein HIV-Detect I, II, III, or IV, or HIV-Lock.
34. The diagnostic or forensic test kit of claim 33 wherein said first nucleic acid probe, in addition to being complementary to the HIV LTR, comprises a sequence encoding the bacteriophage lambda left or right operator and said second nucleic acid probe comprises sequences complementary to said bacteriophage lambda left or right operator sequences in said first nucleic acid probe, such that upon hybridization of said first and second nucleic acid probes, a binding site for the bacteriophage lambda CI repressor protein, the bacteriophage lambda cro protein or a derivative or homology thereof, is formed.
35. The diagnostic or forensic test kit of claim 34 wherein said second nucleic acid binding protein is the bacteriophage lambda CI repressor protein, the bacteriophage lambda cro protein or a derivative or homology thereof.
36. A composition comprising HIV-Lock or a recombinant vector encoding HIV-Lock and a pharmaceutically acceptable carrier.
37. A method of differentially binding a nucleic acid binding protein to a nucleic acid sequence correlated with a pathogenic condition which comprises:
(a) selecting a particular configuration of nucleic acid binding protein sequences present in the nucleic acid sequence correlated with a pathogenic condition as a target sequence for designing a probe nucleic acid which will hybridize to that particular configuration of nucleic acid sequences if present in a test sample, and further, ensuring that a binding site for an available nucleic acid binding protein is formed upon hybridization of said probe nucleic acid and said particular configuration of nucleic acid sequences chosen as a target;
(b) selecting a nucleic acid binding protein which specifically binds to the selected particular configuration of nucleic acid binding protein sequences correlated with a pathogenic condition, but which does not bind to sequences not correlated with said pathogenic condition;
(c) hybridizing said probe nucleic acid with a test sample suspected of containing said particular configuration of nucleic acid binding protein sequences present in nucleic acid sequences correlated with a pathogenic condition;
(d) contacting said nucleic acid binding protein with any hybrids formed in step (b); and
(e) detecting any binding of said nucleic acid binding protein with said hybrids.
38. The method of claim 37 wherein said particular configuration of nucleic acid binding protein sequences is chosen from a necessary step or control point in the development of a pathogenic condition.
39. The method of claim 9 wherein said method is carried out in an automated fashion.
40. The method of claim 39 wherein the method is carried out in the Abbott Laboratories IMx machine.
41. The method of claim 9 carried out in a microtiter plate.
42. A method of amplifying the signal obtained through binding the PNA of claim 1 to a TNA which comprises binding BNAs to the PNA-TNA hybrid and binding labeled BBAs to the BNAs.
43. A method of assembling a nucleic acid binding complex which comprises using asymmetry sequences to direct the association or non-association of components of the nucleic acid binding complex.
44. A method of assembling a nucleic acid binding complex which comprises using assembly sequences derived from bacteriophage lambda cro or CI to assemble associated components of the nucleic acid binding complex.
45. A method of using assembly, asymmetry, or piloting sequences to assemble a multimeric protein complex which comprises linking subunits to be incorporated into the multimeric protein complex to said assembly, asymmetry, or piloting sequence, and recovering said multimeric complex.
46. A composition comprising SEQ ID NO. 105, SEQ ID NO. 106, or SEQ ID NO. 108.
47. A nucleic acid encoding the TBA or BBA of claim 12 .
48. The TBA of claim 12 or a nucleic acid encoding said TBA, wherein the amino acid sequence of said TBA is selected from the group consisting of set A, set B and set C, wherein said sets are comprised as follows:
wherein groups I-V consist of sequences selected from:
Group Selected from Sequences
I Any of SEQ ID NOS. 85-92
II Met Ser, linked to any of SEQ ID NOS. 104-106, each of which is linked to SEQ ID NO. 99
III SEQ ID NO. 100 linked to any of SEQ ID NOS. 75-84 or 94-98; SEQ ID NO. 101 linked to either SEQ ID NO. 74 of SEQ ID NO. 93; or SEQ ID NO. 102 linked to SEQ ID NO. 74 or SEQ ID NO. 93; or any of SEQ ID NOS. 72, 103, 73, or 63-71
IV Any of SEQ ID NOS. 104-108
V SEQ ID NO. 99.
49. A method of assembling multimeric TBAs in vivo or in situ which comprises introducing component TBAs into a cell utilizing a covalently or non-covalently attached protein or bi-layer vesicle or by introducing nucleic acids encoding component TBAs into a cell, said component TBAs each comprising a DNA recognition unit, assembly sequences, asymmetry sequences, nuclear localization signal sequences, and optional linker sequences, such that upon proximal binding via the DNA recognition unit of each component TBA to nucleic acid sequences encountered in the nucleus or elsewhere in the cell, component expressed TBAs assemble via said assembly and asymmetry sequences into multimeric TBAs.
50. A method for identifying nucleic acid binding molecules for preparation of a target binding assembly or a booster binding assembly comprising:
a. Obtaining a sample containing the target nucleic acid;
b. Fragmenting the sample so as to expose the nucleic acids and to reduce the size complexity of the nucleic acids contained in the sample;
c. Contacting a first aliquot of the fragmented nucleic acids with a control buffer medium and contacting a second aliquot of the fragmented nucleic acids with the control buffer medium containing a known profile of nucleic acid binding molecules;
d. Analyzing the two aliquots to identify fragments which have altered behavior in the aliquot contacted with the target binding molecules as opposed to the aliquot contacted with the control buffer medium;
e. Identifying and isolating fragments which do exhibit altered behavior when contacted with the nucleic acid binding molecule and either sequencing the nucleic acid fragment to determine whether known nucleic acid binding molecule motifs are present, or directly identifying the nucleic acid binding molecule bound to the nucleic acid; and
f. synthesizing TBAs comprising the nucleic acid binding molecules which produced the altered behavior using assembly, asymmetry nuclear localization and, optionally, linker sequences.
51. A method for identifying specific nucleic acid sequences in a sample comprising:
a. Fragmenting the nucleic acids in said sample to expose the nucleic acids and reduce the size complexity of the nucleic acids;
b. Contacting a TBA with the sample, said TBA comprising two or more nucleic acid binding components each of which has a relatively weak binding for its nucleic acid recognition unit within the TBR but which in combination provides strong binding for the complete TBR; and
c. Eliminating any “cross-talk” produced by binding of the TBA to cousin nucleic acids that contain individual recognition units, which comprises contacting the sample with excess nucleic acid binding components with relatively strong binding affinity for cousin nucleic acids that contain the individual recognition units but relatively weak binding relative to the TBA's affinity for binding to the complete TBR having said two or more nucleic acid binding components.
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/860,844 US20030104361A1 (en) | 1997-09-29 | 1995-12-07 | Method of detection of nucleic acids with a specific sequence composition |
US10/407,543 US20030175789A1 (en) | 1994-12-09 | 2003-04-03 | Method of detection of nucleic acids with a specific sequence composition |
US12/493,877 US8114962B2 (en) | 1994-12-09 | 2009-06-29 | Method of detection of nucleic acids with a specific sequence composition |
US12/899,251 US20110172169A1 (en) | 1994-12-09 | 2010-10-06 | Method of Detection of Nucleic Acids with a Specific Sequence Composition |
US13/093,127 US20110263045A1 (en) | 1994-12-09 | 2011-04-25 | Method of Detection of Nucleic Acids With a Specific Sequence Composition |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/860,844 US20030104361A1 (en) | 1997-09-29 | 1995-12-07 | Method of detection of nucleic acids with a specific sequence composition |
Related Parent Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US08/353,476 Continuation-In-Part US5871902A (en) | 1994-12-09 | 1994-12-09 | Sequence-specific detection of nucleic acid hybrids using a DNA-binding molecule or assembly capable of discriminating perfect hybrids from non-perfect hybrids |
PCT/US1995/015944 A-371-Of-International WO1996017956A2 (en) | 1994-12-09 | 1995-12-07 | Method of detection of nucleic acids with a specific sequence composition |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/407,543 Continuation US20030175789A1 (en) | 1994-12-09 | 2003-04-03 | Method of detection of nucleic acids with a specific sequence composition |
Publications (1)
Publication Number | Publication Date |
---|---|
US20030104361A1 true US20030104361A1 (en) | 2003-06-05 |
Family
ID=25334160
Family Applications (5)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US08/860,844 Abandoned US20030104361A1 (en) | 1994-12-09 | 1995-12-07 | Method of detection of nucleic acids with a specific sequence composition |
US10/407,543 Abandoned US20030175789A1 (en) | 1994-12-09 | 2003-04-03 | Method of detection of nucleic acids with a specific sequence composition |
US12/493,877 Expired - Fee Related US8114962B2 (en) | 1994-12-09 | 2009-06-29 | Method of detection of nucleic acids with a specific sequence composition |
US12/899,251 Abandoned US20110172169A1 (en) | 1994-12-09 | 2010-10-06 | Method of Detection of Nucleic Acids with a Specific Sequence Composition |
US13/093,127 Abandoned US20110263045A1 (en) | 1994-12-09 | 2011-04-25 | Method of Detection of Nucleic Acids With a Specific Sequence Composition |
Family Applications After (4)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/407,543 Abandoned US20030175789A1 (en) | 1994-12-09 | 2003-04-03 | Method of detection of nucleic acids with a specific sequence composition |
US12/493,877 Expired - Fee Related US8114962B2 (en) | 1994-12-09 | 2009-06-29 | Method of detection of nucleic acids with a specific sequence composition |
US12/899,251 Abandoned US20110172169A1 (en) | 1994-12-09 | 2010-10-06 | Method of Detection of Nucleic Acids with a Specific Sequence Composition |
US13/093,127 Abandoned US20110263045A1 (en) | 1994-12-09 | 2011-04-25 | Method of Detection of Nucleic Acids With a Specific Sequence Composition |
Country Status (1)
Country | Link |
---|---|
US (5) | US20030104361A1 (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110212462A1 (en) * | 2010-03-01 | 2011-09-01 | Quanterix Corporation | Ultra-sensitive detection of molecules using dual detection methods |
US20120277114A1 (en) * | 2010-03-01 | 2012-11-01 | Quanterix Corporation | Ultra-sensitive detection of molecules or particles using beads or other capture objects |
US9110025B2 (en) | 2010-03-01 | 2015-08-18 | Quanterix Corporation | Methods and systems for extending dynamic range in assays for the detection of molecules or particles |
US9395359B2 (en) | 2006-02-21 | 2016-07-19 | Trustees Of Tufts College | Methods and arrays for target analyte detection and determination of target analyte concentration in solution |
US9551663B2 (en) | 2010-03-01 | 2017-01-24 | Quanterix Corporation | Methods and systems for extending dynamic range in assays for the detection of molecules or particles |
US9809838B2 (en) | 2007-08-30 | 2017-11-07 | Trustees Of Tufts College | Methods for determining the concentration of an analyte in solution |
US9829491B2 (en) | 2009-10-09 | 2017-11-28 | The Research Foundation For The State University Of New York | pH-insensitive glucose indicator protein |
US9932626B2 (en) | 2013-01-15 | 2018-04-03 | Quanterix Corporation | Detection of DNA or RNA using single molecule arrays and other techniques |
US9952237B2 (en) | 2011-01-28 | 2018-04-24 | Quanterix Corporation | Systems, devices, and methods for ultra-sensitive detection of molecules or particles |
US10393759B2 (en) | 2011-04-12 | 2019-08-27 | Quanterix Corporation | Methods of determining a treatment protocol for and/or a prognosis of a patient's recovery from a brain injury |
US11237171B2 (en) | 2006-02-21 | 2022-02-01 | Trustees Of Tufts College | Methods and arrays for target analyte detection and determination of target analyte concentration in solution |
Families Citing this family (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7601497B2 (en) | 2000-06-15 | 2009-10-13 | Qiagen Gaithersburg, Inc. | Detection of nucleic acids by target-specific hybrid capture method |
US7439016B1 (en) * | 2000-06-15 | 2008-10-21 | Digene Corporation | Detection of nucleic acids by type-specific hybrid capture method |
US7306915B2 (en) * | 2004-03-22 | 2007-12-11 | Sysmex Corporation | Probe set for detection of target substance and detection method using the same |
EP2262911B1 (en) * | 2008-04-17 | 2016-10-12 | QIAGEN Gaithersburg, Inc. | Compositions, methods, and kits using synthetic probes for determining the presence of a target nucleic acid |
JP2012506705A (en) | 2008-10-27 | 2012-03-22 | キアジェン ゲイサーズバーグ インコーポレイテッド | Hybrid capture analysis and system for quick results |
CA2750338C (en) * | 2009-01-28 | 2019-06-25 | Qiagen Gaithersburg, Inc. | Sequence-specific large volume sample preparation method and assay |
JP5738278B2 (en) * | 2009-05-01 | 2015-06-24 | キアジェン ゲイサーズバーグ インコーポレイテッド | Non-target amplification method for detecting RNA splicing morphology in a sample |
US20110053143A1 (en) * | 2009-09-03 | 2011-03-03 | Qiagen Gaithersburg Inc. | Methods and kits for determining viral load in clinical samples |
WO2011032124A1 (en) * | 2009-09-14 | 2011-03-17 | Qiagen Gaithersburg, Inc. | Compositions and methods for recovery of nucleic acids or proteins from tissue samples fixed in cytology media |
EP2521778B1 (en) * | 2010-01-04 | 2014-03-19 | QIAGEN Gaithersburg, Inc. | Methods, compositions, and kits for recovery of nucleic acids or proteins from fixed tissue samples |
EP2521795B8 (en) * | 2010-01-08 | 2014-01-08 | QIAGEN Gaithersburg, Inc. | Materials and methods for isothermal nucleic acid amplification |
JP6103941B2 (en) * | 2010-01-29 | 2017-03-29 | キアジェン ゲイサーズバーグ インコーポレイテッド | Methods and compositions for sequence-specific purification and multiplex analysis of nucleic acids |
JP2013517803A (en) | 2010-01-29 | 2013-05-20 | キアジェン ゲイサーズバーグ インコーポレイテッド | Method for determining and confirming the presence of HPV in a sample |
JP2013528049A (en) | 2010-05-19 | 2013-07-08 | キアゲン ガイサーズバーグ アイエヌシー. | Methods and compositions for sequence-specific purification and multiplex analysis of nucleic acids |
US9376727B2 (en) | 2010-05-25 | 2016-06-28 | Qiagen Gaithersburg, Inc. | Fast results hybrid capture assay and associated strategically truncated probes |
AU2016249392A1 (en) | 2015-04-17 | 2017-11-16 | The Regents Of The University Of California | Methods for detecting agglutination and compositions for use in practicing the same |
WO2023245145A2 (en) * | 2022-06-16 | 2023-12-21 | Nanopath Inc. | Multiplexed pathogen detection using nanoplasmonic sensor for human papillomavirus |
Citations (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4556643A (en) * | 1982-07-26 | 1985-12-03 | Agracetus | Assay method and probe for polynucleotide sequences |
US4683195A (en) * | 1986-01-30 | 1987-07-28 | Cetus Corporation | Process for amplifying, detecting, and/or-cloning nucleic acid sequences |
US4751177A (en) * | 1985-06-13 | 1988-06-14 | Amgen | Methods and kits for performing nucleic acid hybridization assays |
US4775619A (en) * | 1984-10-16 | 1988-10-04 | Chiron Corporation | Polynucleotide determination with selectable cleavage sites |
US4794075A (en) * | 1985-08-29 | 1988-12-27 | Lifecodes Corporation | Method for locating and purifying DNA containing single base mismatches |
US4868105A (en) * | 1985-12-11 | 1989-09-19 | Chiron Corporation | Solution phase nucleic acid sandwich assay |
US4882269A (en) * | 1985-12-13 | 1989-11-21 | Princeton University | Amplified hybridization assay |
US4948882A (en) * | 1983-02-22 | 1990-08-14 | Syngene, Inc. | Single-stranded labelled oligonucleotides, reactive monomers and methods of synthesis |
US4960691A (en) * | 1986-09-29 | 1990-10-02 | Abbott Laboratories | Chromatographic test strip for determining ligands or receptors |
US5096815A (en) * | 1989-01-06 | 1992-03-17 | Protein Engineering Corporation | Generation and selection of novel dna-binding proteins and polypeptides |
US5118605A (en) * | 1984-10-16 | 1992-06-02 | Chiron Corporation | Polynucleotide determination with selectable cleavage sites |
US5122600A (en) * | 1988-09-24 | 1992-06-16 | Nippon Zeon Co., Ltd. | DNA-immobilized microspheres and a process for purifying a DNA-transcription-controlling factor using the same |
US5124246A (en) * | 1987-10-15 | 1992-06-23 | Chiron Corporation | Nucleic acid multimers and amplified nucleic acid hybridization assays using same |
US5198346A (en) * | 1989-01-06 | 1993-03-30 | Protein Engineering Corp. | Generation and selection of novel DNA-binding proteins and polypeptides |
US5200314A (en) * | 1990-03-23 | 1993-04-06 | Chiron Corporation | Polynucleotide capture assay employing in vitro amplification |
US5215882A (en) * | 1989-11-30 | 1993-06-01 | Ortho Diagnostic Systems, Inc. | Method of immobilizing nucleic acid on a solid surface for use in nucleic acid hybridization assays |
US5215899A (en) * | 1989-11-09 | 1993-06-01 | Miles Inc. | Nucleic acid amplification employing ligatable hairpin probe and transcription |
US5290677A (en) * | 1990-01-24 | 1994-03-01 | The United States Of America As Represented By The Department Of Health And Human Services | Rapid and sensitive test for detecting hepatitis A virus |
US5306619A (en) * | 1991-06-27 | 1994-04-26 | Genelabs Technologies, Inc. | Screening assay for the detection of DNA-binding molecules |
US5306614A (en) * | 1986-01-22 | 1994-04-26 | Institut Pasteur | Methods and kits for diagnosing human immunodeficiency virus type 2(HIV-2) |
US5310650A (en) * | 1986-09-29 | 1994-05-10 | Abbott Laboratoires | Method and device for improved reaction kinetics in nucleic acid hybridizations |
US5324829A (en) * | 1988-12-16 | 1994-06-28 | Ortho Diagnostic Systems, Inc. | High specific activity nucleic acid probes having target recognition and signal generating moieties |
US5453362A (en) * | 1992-12-04 | 1995-09-26 | Tularik Inc. | Eukaryotic transcription protein; host cell factor |
Family Cites Families (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4623627A (en) * | 1983-08-19 | 1986-11-18 | Cetus Corporation | Monoclonal antibody having specificity for the double-stranded conformation of native DNA and diagnostic methods using same |
US4777129A (en) | 1983-12-12 | 1988-10-11 | Molecular Diagnostics, Inc. | Nucleic acid probe detectable by specific nucleic acid binding protein |
EP0146039A3 (en) | 1983-12-12 | 1986-08-27 | Miles Inc. | Hybridization assay with immobilization of hybrids by antihybrid binding |
US4724202A (en) * | 1983-12-12 | 1988-02-09 | Molecular Diagnostics, Inc. | Use of non-hybridizable nucleic acids for the detection of nucleic acid hybridization |
US5668255A (en) | 1984-06-07 | 1997-09-16 | Seragen, Inc. | Hybrid molecules having translocation region and cell-binding region |
ATE65092T1 (en) | 1985-05-31 | 1991-07-15 | Amoco Corp | AMPLIFICATION OF HYBRIDIZATION SIGNALS THROUGH USE OF COMPLEMENTARY DNA STRANDS. |
WO1988006601A1 (en) | 1987-03-02 | 1988-09-07 | Genex Corporation | Gene repressors |
US5716826A (en) * | 1988-03-21 | 1998-02-10 | Chiron Viagene, Inc. | Recombinant retroviruses |
US5185243A (en) | 1988-08-25 | 1993-02-09 | Syntex (U.S.A.) Inc. | Method for detection of specific nucleic acid sequences |
US5459039A (en) * | 1989-05-12 | 1995-10-17 | Duke University | Methods for mapping genetic mutations |
US5556750A (en) * | 1989-05-12 | 1996-09-17 | Duke University | Methods and kits for fractionating a population of DNA molecules based on the presence or absence of a base-pair mismatch utilizing mismatch repair systems |
WO1991006679A1 (en) | 1989-10-24 | 1991-05-16 | Stratagene | An improved method for hybridizing nucleic acids using single-stranded nucleic acid binding protein |
US5652122A (en) * | 1989-12-21 | 1997-07-29 | Frankel; Alan | Nucleic acids encoding and methods of making tat-derived transport polypeptides |
CA2039517C (en) | 1990-04-03 | 2006-11-07 | David Segev | Dna probe signal amplification |
JPH04222599A (en) | 1990-04-20 | 1992-08-12 | Syntex Usa Inc | Method for assaying double receptor polynucleotide |
CA2108144A1 (en) * | 1991-03-06 | 1992-09-07 | Jack A. Roth | Methods and compositions for the selective inhibition of gene expression |
AU2028792A (en) | 1991-05-17 | 1992-12-30 | Uab Research Foundation | Sequence specific dna binding drugs |
ES2091976T3 (en) * | 1991-06-20 | 1996-11-16 | Hoffmann La Roche | PERFECTED METHODS FOR THE AMPLIFICATION OF NUCLEIC ACID. |
EP0672177B1 (en) | 1992-01-28 | 2006-07-19 | President And Fellows of Harvard College | Method for nucleic acid hybridization using single-stranded dna binding protein |
WO1993020233A1 (en) | 1992-03-27 | 1993-10-14 | University Of Maryland At Baltimore | Detection of gene mutations with mismatch repair enzymes |
JPH05308999A (en) | 1992-05-08 | 1993-11-22 | Sumitomo Metal Ind Ltd | Method for determining pre-c mutation of hepatitis b virus |
AU682340B2 (en) | 1993-01-28 | 1997-10-02 | Regents Of The University Of California, The | TATA-binding protein associated factors, nucleic acids encoding TAFs, and methods of use |
US5593834A (en) * | 1993-06-17 | 1997-01-14 | The Research Foundation Of State University Of New York | Method of preparing DNA sequences with known ligand binding characteristics |
US6140466A (en) * | 1994-01-18 | 2000-10-31 | The Scripps Research Institute | Zinc finger protein derivatives and methods therefor |
US5882941A (en) * | 1994-05-04 | 1999-03-16 | Massachusette Institute Of Technology | Programmable genotoxic agents and uses therefor |
US5871902A (en) * | 1994-12-09 | 1999-02-16 | The Gene Pool, Inc. | Sequence-specific detection of nucleic acid hybrids using a DNA-binding molecule or assembly capable of discriminating perfect hybrids from non-perfect hybrids |
-
1995
- 1995-12-07 US US08/860,844 patent/US20030104361A1/en not_active Abandoned
-
2003
- 2003-04-03 US US10/407,543 patent/US20030175789A1/en not_active Abandoned
-
2009
- 2009-06-29 US US12/493,877 patent/US8114962B2/en not_active Expired - Fee Related
-
2010
- 2010-10-06 US US12/899,251 patent/US20110172169A1/en not_active Abandoned
-
2011
- 2011-04-25 US US13/093,127 patent/US20110263045A1/en not_active Abandoned
Patent Citations (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4556643A (en) * | 1982-07-26 | 1985-12-03 | Agracetus | Assay method and probe for polynucleotide sequences |
US4948882A (en) * | 1983-02-22 | 1990-08-14 | Syngene, Inc. | Single-stranded labelled oligonucleotides, reactive monomers and methods of synthesis |
US4775619A (en) * | 1984-10-16 | 1988-10-04 | Chiron Corporation | Polynucleotide determination with selectable cleavage sites |
US5118605A (en) * | 1984-10-16 | 1992-06-02 | Chiron Corporation | Polynucleotide determination with selectable cleavage sites |
US4751177A (en) * | 1985-06-13 | 1988-06-14 | Amgen | Methods and kits for performing nucleic acid hybridization assays |
US4794075A (en) * | 1985-08-29 | 1988-12-27 | Lifecodes Corporation | Method for locating and purifying DNA containing single base mismatches |
US4868105A (en) * | 1985-12-11 | 1989-09-19 | Chiron Corporation | Solution phase nucleic acid sandwich assay |
US4882269A (en) * | 1985-12-13 | 1989-11-21 | Princeton University | Amplified hybridization assay |
US5306614A (en) * | 1986-01-22 | 1994-04-26 | Institut Pasteur | Methods and kits for diagnosing human immunodeficiency virus type 2(HIV-2) |
US4683195B1 (en) * | 1986-01-30 | 1990-11-27 | Cetus Corp | |
US4683195A (en) * | 1986-01-30 | 1987-07-28 | Cetus Corporation | Process for amplifying, detecting, and/or-cloning nucleic acid sequences |
US4960691A (en) * | 1986-09-29 | 1990-10-02 | Abbott Laboratories | Chromatographic test strip for determining ligands or receptors |
US5310650A (en) * | 1986-09-29 | 1994-05-10 | Abbott Laboratoires | Method and device for improved reaction kinetics in nucleic acid hybridizations |
US5124246A (en) * | 1987-10-15 | 1992-06-23 | Chiron Corporation | Nucleic acid multimers and amplified nucleic acid hybridization assays using same |
US5122600A (en) * | 1988-09-24 | 1992-06-16 | Nippon Zeon Co., Ltd. | DNA-immobilized microspheres and a process for purifying a DNA-transcription-controlling factor using the same |
US5324829A (en) * | 1988-12-16 | 1994-06-28 | Ortho Diagnostic Systems, Inc. | High specific activity nucleic acid probes having target recognition and signal generating moieties |
US5198346A (en) * | 1989-01-06 | 1993-03-30 | Protein Engineering Corp. | Generation and selection of novel DNA-binding proteins and polypeptides |
US5096815A (en) * | 1989-01-06 | 1992-03-17 | Protein Engineering Corporation | Generation and selection of novel dna-binding proteins and polypeptides |
US5215899A (en) * | 1989-11-09 | 1993-06-01 | Miles Inc. | Nucleic acid amplification employing ligatable hairpin probe and transcription |
US5215882A (en) * | 1989-11-30 | 1993-06-01 | Ortho Diagnostic Systems, Inc. | Method of immobilizing nucleic acid on a solid surface for use in nucleic acid hybridization assays |
US5290677A (en) * | 1990-01-24 | 1994-03-01 | The United States Of America As Represented By The Department Of Health And Human Services | Rapid and sensitive test for detecting hepatitis A virus |
US5200314A (en) * | 1990-03-23 | 1993-04-06 | Chiron Corporation | Polynucleotide capture assay employing in vitro amplification |
US5306619A (en) * | 1991-06-27 | 1994-04-26 | Genelabs Technologies, Inc. | Screening assay for the detection of DNA-binding molecules |
US5453362A (en) * | 1992-12-04 | 1995-09-26 | Tularik Inc. | Eukaryotic transcription protein; host cell factor |
Cited By (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9395359B2 (en) | 2006-02-21 | 2016-07-19 | Trustees Of Tufts College | Methods and arrays for target analyte detection and determination of target analyte concentration in solution |
US11874279B2 (en) | 2006-02-21 | 2024-01-16 | Trustees Of Tufts College | Methods and arrays for target analyte detection and determination of target analyte concentration in solution |
US11237171B2 (en) | 2006-02-21 | 2022-02-01 | Trustees Of Tufts College | Methods and arrays for target analyte detection and determination of target analyte concentration in solution |
US10261089B2 (en) | 2006-02-21 | 2019-04-16 | Trustees Of Tufts College | Methods and arrays for target analyte detection and determination of target analyte concentration in solution |
US9809838B2 (en) | 2007-08-30 | 2017-11-07 | Trustees Of Tufts College | Methods for determining the concentration of an analyte in solution |
US9829491B2 (en) | 2009-10-09 | 2017-11-28 | The Research Foundation For The State University Of New York | pH-insensitive glucose indicator protein |
US9846155B2 (en) * | 2010-03-01 | 2017-12-19 | Quanterix Corporation | Methods and systems for extending dynamic range in assays for the detection of molecules or particles |
US9110025B2 (en) | 2010-03-01 | 2015-08-18 | Quanterix Corporation | Methods and systems for extending dynamic range in assays for the detection of molecules or particles |
US9551663B2 (en) | 2010-03-01 | 2017-01-24 | Quanterix Corporation | Methods and systems for extending dynamic range in assays for the detection of molecules or particles |
US9678068B2 (en) * | 2010-03-01 | 2017-06-13 | Quanterix Corporation | Ultra-sensitive detection of molecules using dual detection methods |
US9310360B2 (en) * | 2010-03-01 | 2016-04-12 | Quanterix Corporation | Ultra-sensitive detection of molecules or particles using beads or other capture objects |
US20150355182A1 (en) * | 2010-03-01 | 2015-12-10 | Quanterix Corporation | Methods and systems for extending dynamic range in assays for the detection of molecules or particles |
US20110212462A1 (en) * | 2010-03-01 | 2011-09-01 | Quanterix Corporation | Ultra-sensitive detection of molecules using dual detection methods |
US20120277114A1 (en) * | 2010-03-01 | 2012-11-01 | Quanterix Corporation | Ultra-sensitive detection of molecules or particles using beads or other capture objects |
US11619631B2 (en) | 2010-03-01 | 2023-04-04 | Quanterix Corporation | Ultra-sensitive detection of molecules or particles using beads or other capture objects |
US9482662B2 (en) * | 2010-03-01 | 2016-11-01 | Quanterix Corporation | Ultra-sensitive detection of molecules or particles using beads or other capture objects |
US20120289428A1 (en) * | 2010-03-01 | 2012-11-15 | Quanterix Corporation | Ultra-sensitive detection of molecules or particles using beads or other capture objects |
US10989713B2 (en) * | 2010-03-01 | 2021-04-27 | Quanterix Corporation | Methods and systems for extending dynamic range in assays for the detection of molecules or particles |
US10725032B2 (en) * | 2010-03-01 | 2020-07-28 | Quanterix Corporation | Ultra-sensitive detection of molecules or particles using beads or other capture objects |
US11112415B2 (en) | 2011-01-28 | 2021-09-07 | Quanterix Corporation | Systems, devices, and methods for ultra-sensitive detection of molecules or particles |
US9952237B2 (en) | 2011-01-28 | 2018-04-24 | Quanterix Corporation | Systems, devices, and methods for ultra-sensitive detection of molecules or particles |
US10393759B2 (en) | 2011-04-12 | 2019-08-27 | Quanterix Corporation | Methods of determining a treatment protocol for and/or a prognosis of a patient's recovery from a brain injury |
US11275092B2 (en) | 2011-04-12 | 2022-03-15 | Quanterix Corporation | Methods of determining a treatment protocol for and/or a prognosis of a patient's recovery from a brain injury |
US10640814B2 (en) | 2013-01-15 | 2020-05-05 | Quanterix Corporation | Detection of DNA or RNA using single molecule arrays and other techniques |
US9932626B2 (en) | 2013-01-15 | 2018-04-03 | Quanterix Corporation | Detection of DNA or RNA using single molecule arrays and other techniques |
Also Published As
Publication number | Publication date |
---|---|
US20110172169A1 (en) | 2011-07-14 |
US20030175789A1 (en) | 2003-09-18 |
US20100113344A1 (en) | 2010-05-06 |
US20110263045A1 (en) | 2011-10-27 |
US8114962B2 (en) | 2012-02-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8114962B2 (en) | Method of detection of nucleic acids with a specific sequence composition | |
EP0796344B1 (en) | Method of detection of nucleic acids with a specific sequence composition | |
US5800984A (en) | Nucleic acid sequence detection by triple helix formation at primer site in amplification reactions | |
JP2846018B2 (en) | Amplification and detection of nucleic acid sequences | |
JP3289089B2 (en) | Method for detecting nucleotide sequence by sandwich hybridization technology | |
EP0361983B1 (en) | RNA template end-linked probe constructs and methods for use | |
CN108368538B (en) | Method for generating proximity probes | |
JP3138471B2 (en) | Competitive PCR for DNA quantification | |
US5795722A (en) | Method and kit for quantitation and nucleic acid sequencing of nucleic acid analytes in a sample | |
EP0427074A2 (en) | Nucleic acid amplification employing transcribable hairpin probe | |
IE83814B1 (en) | Quantification of nucleic acid | |
Gudibande et al. | Rapid, non-separation electrochemiluminescent DNA hybridization assays for PCR products, using 3′-labelled oligonucleotide probes | |
WO1997012058A1 (en) | Method for quantifying nucleic acid using multiple competitor nucleic acids | |
AU757472B2 (en) | Method of detection of nucleic acids with a specific sequence composition | |
AU4418996A (en) | Method of detection of nucleic acids with a specific sequence composition | |
AU2007201871A1 (en) | Method of detection of nucleic acids with a specific sequence composition | |
WO2000043545A2 (en) | Detection of drug resistant organisms | |
WO2000043546A9 (en) | Detection of drug resistant organisms | |
CA2037673A1 (en) | Ribonuclease scission chain reaction | |
LE et al. | Patent 2772123 Summary | |
Lundeberg | DIANA, A DNA-diagnostic method for detection and analysis of viruses, bacteria and parasites. | |
JP2005110664A (en) | Method for detecting presence of target dna by amplification of nucleic acid signal, and method for detecting or determining immunological ligand |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: GENE POOL, INC., WASHINGTON Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:WEININGER, SUSAN;WEININGER, ARTHUR M.;REEL/FRAME:008738/0535 Effective date: 19970710 |
|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |