CA2480404A1 - Fc receptor homolog, reagents, and uses thereof - Google Patents
Fc receptor homolog, reagents, and uses thereof Download PDFInfo
- Publication number
- CA2480404A1 CA2480404A1 CA002480404A CA2480404A CA2480404A1 CA 2480404 A1 CA2480404 A1 CA 2480404A1 CA 002480404 A CA002480404 A CA 002480404A CA 2480404 A CA2480404 A CA 2480404A CA 2480404 A1 CA2480404 A1 CA 2480404A1
- Authority
- CA
- Canada
- Prior art keywords
- seq
- fcrh
- nucleic acid
- ser
- amino acid
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 108010087819 Fc receptors Proteins 0.000 title abstract description 8
- 102000009109 Fc receptors Human genes 0.000 title abstract description 8
- 239000003153 chemical reaction reagent Substances 0.000 title description 23
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 171
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 166
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 166
- 239000012634 fragment Substances 0.000 claims abstract description 109
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 89
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 77
- 230000014509 gene expression Effects 0.000 claims abstract description 73
- 229920001184 polypeptide Polymers 0.000 claims abstract description 70
- 206010028980 Neoplasm Diseases 0.000 claims abstract description 60
- 230000036210 malignancy Effects 0.000 claims abstract description 54
- 201000011510 cancer Diseases 0.000 claims abstract description 51
- 239000013598 vector Substances 0.000 claims abstract description 51
- 230000001086 cytosolic effect Effects 0.000 claims abstract description 49
- 210000003958 hematopoietic stem cell Anatomy 0.000 claims abstract description 35
- 208000023275 Autoimmune disease Diseases 0.000 claims abstract description 21
- 230000028996 humoral immune response Effects 0.000 claims abstract description 10
- 125000003275 alpha amino acid group Chemical group 0.000 claims description 142
- 210000004027 cell Anatomy 0.000 claims description 137
- 238000000034 method Methods 0.000 claims description 125
- 150000001413 amino acids Chemical class 0.000 claims description 81
- 241000282414 Homo sapiens Species 0.000 claims description 71
- 125000003729 nucleotide group Chemical group 0.000 claims description 63
- 239000002773 nucleotide Substances 0.000 claims description 61
- 230000027455 binding Effects 0.000 claims description 60
- 238000009739 binding Methods 0.000 claims description 59
- 239000000523 sample Substances 0.000 claims description 56
- 238000006467 substitution reaction Methods 0.000 claims description 52
- 238000009396 hybridization Methods 0.000 claims description 38
- 102100031511 Fc receptor-like protein 2 Human genes 0.000 claims description 36
- 101000846911 Homo sapiens Fc receptor-like protein 2 Proteins 0.000 claims description 36
- 108010029485 Protein Isoforms Proteins 0.000 claims description 28
- 102000001708 Protein Isoforms Human genes 0.000 claims description 28
- 210000003719 b-lymphocyte Anatomy 0.000 claims description 28
- 239000012472 biological sample Substances 0.000 claims description 24
- 230000000295 complement effect Effects 0.000 claims description 21
- 102000005962 receptors Human genes 0.000 claims description 18
- 108020003175 receptors Proteins 0.000 claims description 18
- 239000013604 expression vector Substances 0.000 claims description 16
- 210000001744 T-lymphocyte Anatomy 0.000 claims description 13
- 239000013068 control sample Substances 0.000 claims description 11
- 238000004519 manufacturing process Methods 0.000 claims description 11
- 238000012258 culturing Methods 0.000 claims description 10
- 230000004988 N-glycosylation Effects 0.000 claims description 9
- 230000002378 acidificating effect Effects 0.000 claims description 7
- 230000003211 malignant effect Effects 0.000 claims description 6
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 claims description 5
- 210000000066 myeloid cell Anatomy 0.000 claims description 4
- 239000003053 toxin Substances 0.000 claims description 4
- 231100000765 toxin Toxicity 0.000 claims description 4
- 229930195712 glutamate Natural products 0.000 claims description 2
- 102100031517 Fc receptor-like protein 1 Human genes 0.000 claims 1
- 101710120224 Fc receptor-like protein 1 Proteins 0.000 claims 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 abstract description 16
- 238000011282 treatment Methods 0.000 abstract description 11
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 abstract description 9
- 230000004913 activation Effects 0.000 abstract description 7
- 108010021625 Immunoglobulin Fragments Proteins 0.000 abstract description 6
- 102000008394 Immunoglobulin Fragments Human genes 0.000 abstract description 6
- 230000002401 inhibitory effect Effects 0.000 abstract description 6
- 208000027866 inflammatory disease Diseases 0.000 abstract description 5
- 230000002757 inflammatory effect Effects 0.000 abstract description 5
- 102000027257 transmembrane receptors Human genes 0.000 abstract description 3
- 108091008578 transmembrane receptors Proteins 0.000 abstract description 3
- 238000003745 diagnosis Methods 0.000 abstract description 2
- 108090000623 proteins and genes Proteins 0.000 description 104
- 108020004414 DNA Proteins 0.000 description 94
- 235000001014 amino acid Nutrition 0.000 description 83
- 229940024606 amino acid Drugs 0.000 description 60
- 101000846907 Mus musculus Fc receptor-like protein 5 Proteins 0.000 description 45
- 101000846910 Homo sapiens Fc receptor-like protein 3 Proteins 0.000 description 41
- 102100031512 Fc receptor-like protein 3 Human genes 0.000 description 40
- 102000004169 proteins and genes Human genes 0.000 description 39
- 235000018102 proteins Nutrition 0.000 description 35
- 108010076504 Protein Sorting Signals Proteins 0.000 description 34
- 108060003951 Immunoglobulin Proteins 0.000 description 23
- 102000018358 immunoglobulin Human genes 0.000 description 23
- 241000699666 Mus <mouse, genus> Species 0.000 description 22
- 239000000427 antigen Substances 0.000 description 22
- 108091007433 antigens Proteins 0.000 description 22
- 102000036639 antigens Human genes 0.000 description 22
- 108010050848 glycylleucine Proteins 0.000 description 22
- 238000004458 analytical method Methods 0.000 description 21
- 239000002299 complementary DNA Substances 0.000 description 20
- 241000880493 Leptailurus serval Species 0.000 description 19
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Natural products NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 19
- 239000000203 mixture Substances 0.000 description 19
- 210000001519 tissue Anatomy 0.000 description 19
- 108010026333 seryl-proline Proteins 0.000 description 17
- 230000000875 corresponding effect Effects 0.000 description 16
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 15
- 108010053725 prolylvaline Proteins 0.000 description 15
- 108091035707 Consensus sequence Proteins 0.000 description 14
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 14
- 108010061238 threonyl-glycine Proteins 0.000 description 14
- 241000701161 unidentified adenovirus Species 0.000 description 14
- 241000700605 Viruses Species 0.000 description 13
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 13
- 108010015796 prolylisoleucine Proteins 0.000 description 13
- 108700024394 Exon Proteins 0.000 description 12
- 210000000349 chromosome Anatomy 0.000 description 12
- 210000004408 hybridoma Anatomy 0.000 description 12
- 238000003780 insertion Methods 0.000 description 12
- 230000037431 insertion Effects 0.000 description 12
- 239000000047 product Substances 0.000 description 12
- 230000002441 reversible effect Effects 0.000 description 12
- 238000012217 deletion Methods 0.000 description 11
- 230000037430 deletion Effects 0.000 description 11
- 238000002360 preparation method Methods 0.000 description 11
- 235000002374 tyrosine Nutrition 0.000 description 11
- 108091026890 Coding region Proteins 0.000 description 10
- LURCIJSJAKFCRO-QWRGUYRKSA-N Gly-Asn-Tyr Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LURCIJSJAKFCRO-QWRGUYRKSA-N 0.000 description 10
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 10
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 10
- ORWTWZXGDBYVCP-BJDJZHNGSA-N Leu-Ile-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(C)C ORWTWZXGDBYVCP-BJDJZHNGSA-N 0.000 description 10
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 10
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 10
- 125000000539 amino acid group Chemical group 0.000 description 10
- 238000006243 chemical reaction Methods 0.000 description 10
- 108010049041 glutamylalanine Proteins 0.000 description 10
- 108010018006 histidylserine Proteins 0.000 description 10
- 210000001165 lymph node Anatomy 0.000 description 10
- 108010029020 prolylglycine Proteins 0.000 description 10
- 230000001177 retroviral effect Effects 0.000 description 10
- 108700004896 tripeptide FEG Proteins 0.000 description 10
- 230000003612 virological effect Effects 0.000 description 10
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 9
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 9
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 9
- 210000005220 cytoplasmic tail Anatomy 0.000 description 9
- 230000000694 effects Effects 0.000 description 9
- 108010003700 lysyl aspartic acid Proteins 0.000 description 9
- 230000004048 modification Effects 0.000 description 9
- 238000012986 modification Methods 0.000 description 9
- 239000000126 substance Substances 0.000 description 9
- 241001430294 unidentified retrovirus Species 0.000 description 9
- 102100024222 B-lymphocyte antigen CD19 Human genes 0.000 description 8
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 8
- 101000980825 Homo sapiens B-lymphocyte antigen CD19 Proteins 0.000 description 8
- PWUMCBLVWPCKNO-MGHWNKPDSA-N Ile-Leu-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PWUMCBLVWPCKNO-MGHWNKPDSA-N 0.000 description 8
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 8
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 8
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 8
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 8
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 8
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 8
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 8
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 8
- 108010060035 arginylproline Proteins 0.000 description 8
- 238000003556 assay Methods 0.000 description 8
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 8
- 108010085325 histidylproline Proteins 0.000 description 8
- 238000004806 packaging method and process Methods 0.000 description 8
- 108010051242 phenylalanylserine Proteins 0.000 description 8
- 239000013603 viral vector Substances 0.000 description 8
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 7
- GZUKEVBTYNNUQF-WDSKDSINSA-N Gly-Ala-Gln Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GZUKEVBTYNNUQF-WDSKDSINSA-N 0.000 description 7
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 7
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 7
- 241001465754 Metazoa Species 0.000 description 7
- 241000699670 Mus sp. Species 0.000 description 7
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 7
- 108010079364 N-glycylalanine Proteins 0.000 description 7
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 7
- KWKJGBHDYJOVCR-SRVKXCTJSA-N Tyr-Ser-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O KWKJGBHDYJOVCR-SRVKXCTJSA-N 0.000 description 7
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 7
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 7
- 108010005233 alanylglutamic acid Proteins 0.000 description 7
- 108010077245 asparaginyl-proline Proteins 0.000 description 7
- 238000009472 formulation Methods 0.000 description 7
- 230000006870 function Effects 0.000 description 7
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 7
- 230000003993 interaction Effects 0.000 description 7
- 108010034529 leucyl-lysine Proteins 0.000 description 7
- 239000003446 ligand Substances 0.000 description 7
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 7
- 108010073969 valyllysine Proteins 0.000 description 7
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 6
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 6
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 6
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 6
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 6
- 108010047041 Complementarity Determining Regions Proteins 0.000 description 6
- 108020004635 Complementary DNA Proteins 0.000 description 6
- NDNZRWUDUMTITL-FXQIFTODSA-N Cys-Ser-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NDNZRWUDUMTITL-FXQIFTODSA-N 0.000 description 6
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 6
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 6
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 6
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 6
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 6
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 6
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 6
- NHHZWPNMYQUNEH-ACRUOGEOSA-N Phe-Tyr-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N NHHZWPNMYQUNEH-ACRUOGEOSA-N 0.000 description 6
- 206010035226 Plasma cell myeloma Diseases 0.000 description 6
- 108010046644 Polymeric Immunoglobulin Receptors Proteins 0.000 description 6
- 238000010240 RT-PCR analysis Methods 0.000 description 6
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 6
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 6
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 6
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 6
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 6
- DTWMJYGOUWNWEC-IHPCNDPISA-N Tyr-Trp-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=C(O)C=C1 DTWMJYGOUWNWEC-IHPCNDPISA-N 0.000 description 6
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 6
- 108010047495 alanylglycine Proteins 0.000 description 6
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 6
- 150000001875 compounds Chemical class 0.000 description 6
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 6
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 6
- 108010010147 glycylglutamine Proteins 0.000 description 6
- 239000001963 growth medium Substances 0.000 description 6
- 230000002209 hydrophobic effect Effects 0.000 description 6
- 238000002347 injection Methods 0.000 description 6
- 239000007924 injection Substances 0.000 description 6
- 239000002609 medium Substances 0.000 description 6
- 230000035772 mutation Effects 0.000 description 6
- 201000000050 myeloid neoplasm Diseases 0.000 description 6
- 230000008488 polyadenylation Effects 0.000 description 6
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 6
- 108010031719 prolyl-serine Proteins 0.000 description 6
- 230000010076 replication Effects 0.000 description 6
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 6
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 6
- 108010003137 tyrosyltyrosine Proteins 0.000 description 6
- 102100031585 ADP-ribosyl cyclase/cyclic ADP-ribose hydrolase 1 Human genes 0.000 description 5
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 5
- XIWKVCDQMCNKOZ-UVBJJODRSA-N Ala-Met-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N XIWKVCDQMCNKOZ-UVBJJODRSA-N 0.000 description 5
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 5
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 5
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 5
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 5
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 5
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 5
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 5
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 5
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 5
- 101000777636 Homo sapiens ADP-ribosyl cyclase/cyclic ADP-ribose hydrolase 1 Proteins 0.000 description 5
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 5
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 5
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 5
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 5
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 5
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 5
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 5
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 5
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 5
- SEOXPEFQEOYURL-PMVMPFDFSA-N Leu-Tyr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O SEOXPEFQEOYURL-PMVMPFDFSA-N 0.000 description 5
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 5
- AAORVPFVUIHEAB-YUMQZZPRSA-N Lys-Asp-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O AAORVPFVUIHEAB-YUMQZZPRSA-N 0.000 description 5
- VEGLGAOVLFODGC-GUBZILKMSA-N Lys-Glu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VEGLGAOVLFODGC-GUBZILKMSA-N 0.000 description 5
- 241001529936 Murinae Species 0.000 description 5
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 5
- MECSIDWUTYRHRJ-KKUMJFAQSA-N Phe-Asn-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O MECSIDWUTYRHRJ-KKUMJFAQSA-N 0.000 description 5
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 5
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 5
- 241000283984 Rodentia Species 0.000 description 5
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 5
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 5
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 5
- YPUSXTWURJANKF-KBIXCLLPSA-N Ser-Gln-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YPUSXTWURJANKF-KBIXCLLPSA-N 0.000 description 5
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 5
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 5
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 5
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 5
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 5
- 230000003213 activating effect Effects 0.000 description 5
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 5
- 108010044940 alanylglutamine Proteins 0.000 description 5
- 108010087924 alanylproline Proteins 0.000 description 5
- 230000003321 amplification Effects 0.000 description 5
- 108010062796 arginyllysine Proteins 0.000 description 5
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 5
- 230000008901 benefit Effects 0.000 description 5
- 230000015572 biosynthetic process Effects 0.000 description 5
- 239000003795 chemical substances by application Substances 0.000 description 5
- 235000018417 cysteine Nutrition 0.000 description 5
- 230000029087 digestion Effects 0.000 description 5
- 210000001280 germinal center Anatomy 0.000 description 5
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 5
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 5
- 108010089804 glycyl-threonine Proteins 0.000 description 5
- 229940072221 immunoglobulins Drugs 0.000 description 5
- 108020004999 messenger RNA Proteins 0.000 description 5
- 108010056582 methionylglutamic acid Proteins 0.000 description 5
- 238000002703 mutagenesis Methods 0.000 description 5
- 231100000350 mutagenesis Toxicity 0.000 description 5
- 238000003199 nucleic acid amplification method Methods 0.000 description 5
- 239000000546 pharmaceutical excipient Substances 0.000 description 5
- 108010090894 prolylleucine Proteins 0.000 description 5
- 238000003127 radioimmunoassay Methods 0.000 description 5
- 238000012216 screening Methods 0.000 description 5
- 108010071207 serylmethionine Proteins 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- 210000000952 spleen Anatomy 0.000 description 5
- 230000001225 therapeutic effect Effects 0.000 description 5
- 238000013518 transcription Methods 0.000 description 5
- 230000035897 transcription Effects 0.000 description 5
- 125000001493 tyrosinyl group Chemical class [H]OC1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 5
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 4
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 4
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 4
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 4
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 4
- QIWYWCYNUMJBTC-CIUDSAMLSA-N Arg-Cys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIWYWCYNUMJBTC-CIUDSAMLSA-N 0.000 description 4
- SUMJNGAMIQSNGX-TUAOUCFPSA-N Arg-Val-Pro Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N1CCC[C@@H]1C(O)=O SUMJNGAMIQSNGX-TUAOUCFPSA-N 0.000 description 4
- DPWDPEVGACCWTC-SRVKXCTJSA-N Asn-Tyr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O DPWDPEVGACCWTC-SRVKXCTJSA-N 0.000 description 4
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 4
- XOQYDFCQPWAMSA-KKHAAJSZSA-N Asn-Val-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOQYDFCQPWAMSA-KKHAAJSZSA-N 0.000 description 4
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 4
- WSWYMRLTJVKRCE-ZLUOBGJFSA-N Asp-Ala-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O WSWYMRLTJVKRCE-ZLUOBGJFSA-N 0.000 description 4
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 4
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 4
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 4
- ZEXHDOQQYZKOIB-ACZMJKKPSA-N Cys-Glu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZEXHDOQQYZKOIB-ACZMJKKPSA-N 0.000 description 4
- OZHXXYOHPLLLMI-CIUDSAMLSA-N Cys-Lys-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OZHXXYOHPLLLMI-CIUDSAMLSA-N 0.000 description 4
- ZALVANCAZFPKIR-GUBZILKMSA-N Cys-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N ZALVANCAZFPKIR-GUBZILKMSA-N 0.000 description 4
- 108091060211 Expressed sequence tag Proteins 0.000 description 4
- 102100031508 Fc receptor-like protein 6 Human genes 0.000 description 4
- JESJDAAGXULQOP-CIUDSAMLSA-N Gln-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N JESJDAAGXULQOP-CIUDSAMLSA-N 0.000 description 4
- ICDIMQAMJGDHSE-GUBZILKMSA-N Gln-His-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O ICDIMQAMJGDHSE-GUBZILKMSA-N 0.000 description 4
- GXMBDEGTXHQBAO-NKIYYHGXSA-N Gln-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N)O GXMBDEGTXHQBAO-NKIYYHGXSA-N 0.000 description 4
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 4
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 4
- JGHNIWVNCAOVRO-DCAQKATOSA-N Glu-His-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGHNIWVNCAOVRO-DCAQKATOSA-N 0.000 description 4
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 4
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 4
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 4
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 4
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 4
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 4
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 4
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 4
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 4
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 4
- 241000282412 Homo Species 0.000 description 4
- 101000846906 Homo sapiens Fc receptor-like protein 6 Proteins 0.000 description 4
- 102000018251 Hypoxanthine Phosphoribosyltransferase Human genes 0.000 description 4
- 108010091358 Hypoxanthine Phosphoribosyltransferase Proteins 0.000 description 4
- 108010054477 Immunoglobulin Fab Fragments Proteins 0.000 description 4
- 102000001706 Immunoglobulin Fab Fragments Human genes 0.000 description 4
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 4
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 4
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 4
- JUWJEAPUNARGCF-DCAQKATOSA-N Leu-Arg-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JUWJEAPUNARGCF-DCAQKATOSA-N 0.000 description 4
- GLBNEGIOFRVRHO-JYJNAYRXSA-N Leu-Gln-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLBNEGIOFRVRHO-JYJNAYRXSA-N 0.000 description 4
- CQGSYZCULZMEDE-SRVKXCTJSA-N Leu-Gln-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O CQGSYZCULZMEDE-SRVKXCTJSA-N 0.000 description 4
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 4
- OGUUKPXUTHOIAV-SDDRHHMPSA-N Leu-Glu-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGUUKPXUTHOIAV-SDDRHHMPSA-N 0.000 description 4
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 4
- DDEMUMVXNFPDKC-SRVKXCTJSA-N Leu-His-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N DDEMUMVXNFPDKC-SRVKXCTJSA-N 0.000 description 4
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 4
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 4
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 4
- ZXEUFAVXODIPHC-GUBZILKMSA-N Lys-Glu-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZXEUFAVXODIPHC-GUBZILKMSA-N 0.000 description 4
- QOJDBRUCOXQSSK-AJNGGQMLSA-N Lys-Ile-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCCN)C(O)=O QOJDBRUCOXQSSK-AJNGGQMLSA-N 0.000 description 4
- IEVXCWPVBYCJRZ-IXOXFDKPSA-N Lys-Thr-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IEVXCWPVBYCJRZ-IXOXFDKPSA-N 0.000 description 4
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 4
- ILMLVTGTUJPQFP-FXQIFTODSA-N Pro-Asp-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ILMLVTGTUJPQFP-FXQIFTODSA-N 0.000 description 4
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 4
- RYJRPPUATSKNAY-STECZYCISA-N Pro-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@@H]2CCCN2 RYJRPPUATSKNAY-STECZYCISA-N 0.000 description 4
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 4
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 4
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 4
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 4
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 4
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 4
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 4
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 4
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 4
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 4
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 4
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 4
- UMIACFRBELJMGT-GQGQLFGLSA-N Trp-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UMIACFRBELJMGT-GQGQLFGLSA-N 0.000 description 4
- 108091005906 Type I transmembrane proteins Proteins 0.000 description 4
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 4
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 4
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 4
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 4
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 4
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 4
- IECQJCJNPJVUSB-IHRRRGAJSA-N Val-Tyr-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CO)C(O)=O IECQJCJNPJVUSB-IHRRRGAJSA-N 0.000 description 4
- 108010008355 arginyl-glutamine Proteins 0.000 description 4
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 4
- 230000001413 cellular effect Effects 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 210000001072 colon Anatomy 0.000 description 4
- 125000000151 cysteine group Chemical class N[C@@H](CS)C(=O)* 0.000 description 4
- 238000004925 denaturation Methods 0.000 description 4
- 230000036425 denaturation Effects 0.000 description 4
- 210000003527 eukaryotic cell Anatomy 0.000 description 4
- 108020001507 fusion proteins Proteins 0.000 description 4
- 102000037865 fusion proteins Human genes 0.000 description 4
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 4
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 4
- 108010037850 glycylvaline Proteins 0.000 description 4
- 239000005090 green fluorescent protein Substances 0.000 description 4
- 210000003917 human chromosome Anatomy 0.000 description 4
- 230000003053 immunization Effects 0.000 description 4
- -1 less than 103 Chemical class 0.000 description 4
- 108010000761 leucylarginine Proteins 0.000 description 4
- 210000004698 lymphocyte Anatomy 0.000 description 4
- 210000003563 lymphoid tissue Anatomy 0.000 description 4
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 4
- 108010054155 lysyllysine Proteins 0.000 description 4
- 108010017391 lysylvaline Proteins 0.000 description 4
- 210000004962 mammalian cell Anatomy 0.000 description 4
- 210000004379 membrane Anatomy 0.000 description 4
- 239000012528 membrane Substances 0.000 description 4
- 108010085203 methionylmethionine Proteins 0.000 description 4
- 230000002093 peripheral effect Effects 0.000 description 4
- 230000004481 post-translational protein modification Effects 0.000 description 4
- 108010004914 prolylarginine Proteins 0.000 description 4
- 230000001105 regulatory effect Effects 0.000 description 4
- 238000010839 reverse transcription Methods 0.000 description 4
- 238000003757 reverse transcription PCR Methods 0.000 description 4
- 150000003839 salts Chemical class 0.000 description 4
- 108010048818 seryl-histidine Proteins 0.000 description 4
- 210000002027 skeletal muscle Anatomy 0.000 description 4
- 235000002639 sodium chloride Nutrition 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 238000010186 staining Methods 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 241001529453 unidentified herpesvirus Species 0.000 description 4
- 241000220479 Acacia Species 0.000 description 3
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 3
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 3
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 3
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 3
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 3
- YVTHEZNOKSAWRW-DCAQKATOSA-N Arg-Lys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O YVTHEZNOKSAWRW-DCAQKATOSA-N 0.000 description 3
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 3
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 3
- XSGBIBGAMKTHMY-WHFBIAKZSA-N Asn-Asp-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O XSGBIBGAMKTHMY-WHFBIAKZSA-N 0.000 description 3
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 3
- YXVAESUIQFDBHN-SRVKXCTJSA-N Asn-Phe-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O YXVAESUIQFDBHN-SRVKXCTJSA-N 0.000 description 3
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 3
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 3
- YHXNKGKUDJCAHB-PBCZWWQYSA-N Asn-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O YHXNKGKUDJCAHB-PBCZWWQYSA-N 0.000 description 3
- NSTBNYOKCZKOMI-AVGNSLFASA-N Asn-Tyr-Glu Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O NSTBNYOKCZKOMI-AVGNSLFASA-N 0.000 description 3
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 3
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 3
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 3
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 3
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 3
- YUELDQUPTAYEGM-XIRDDKMYSA-N Asp-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CC(=O)O)N YUELDQUPTAYEGM-XIRDDKMYSA-N 0.000 description 3
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 3
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 3
- NMWZMKLDGZXRKP-BZSNNMDCSA-N Cys-Phe-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NMWZMKLDGZXRKP-BZSNNMDCSA-N 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 3
- XOKGKOQWADCLFQ-GARJFASQSA-N Gln-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XOKGKOQWADCLFQ-GARJFASQSA-N 0.000 description 3
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 3
- LFIVHGMKWFGUGK-IHRRRGAJSA-N Gln-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N LFIVHGMKWFGUGK-IHRRRGAJSA-N 0.000 description 3
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 3
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 3
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 3
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 3
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 3
- HPBKQFJXDUVNQV-FHWLQOOXSA-N Gln-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O HPBKQFJXDUVNQV-FHWLQOOXSA-N 0.000 description 3
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 3
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 3
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 3
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 3
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 3
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 3
- WNRZUESNGGDCJX-JYJNAYRXSA-N Glu-Leu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WNRZUESNGGDCJX-JYJNAYRXSA-N 0.000 description 3
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 3
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 3
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 3
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 3
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 3
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 3
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 3
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 3
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 3
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 3
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 3
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 3
- AFWYPMDMDYCKMD-KBPBESRZSA-N Gly-Leu-Tyr Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 AFWYPMDMDYCKMD-KBPBESRZSA-N 0.000 description 3
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 3
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 3
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 3
- 241000238631 Hexapoda Species 0.000 description 3
- RXVOMIADLXPJGW-GUBZILKMSA-N His-Asp-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RXVOMIADLXPJGW-GUBZILKMSA-N 0.000 description 3
- BDHUXUFYNUOUIT-SRVKXCTJSA-N His-Asp-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BDHUXUFYNUOUIT-SRVKXCTJSA-N 0.000 description 3
- LMMPTUVWHCFTOT-GARJFASQSA-N His-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O LMMPTUVWHCFTOT-GARJFASQSA-N 0.000 description 3
- CYHWWHKRCKHYGQ-GUBZILKMSA-N His-Cys-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N CYHWWHKRCKHYGQ-GUBZILKMSA-N 0.000 description 3
- KNNSUUOHFVVJOP-GUBZILKMSA-N His-Glu-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N KNNSUUOHFVVJOP-GUBZILKMSA-N 0.000 description 3
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 3
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 3
- HDOYNXLPTRQLAD-JBDRJPRFSA-N Ile-Ala-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(=O)O)N HDOYNXLPTRQLAD-JBDRJPRFSA-N 0.000 description 3
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 3
- DMZOUKXXHJQPTL-GRLWGSQLSA-N Ile-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N DMZOUKXXHJQPTL-GRLWGSQLSA-N 0.000 description 3
- MTFVYKQRLXYAQN-LAEOZQHASA-N Ile-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O MTFVYKQRLXYAQN-LAEOZQHASA-N 0.000 description 3
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 3
- 108091092195 Intron Proteins 0.000 description 3
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 3
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 3
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 3
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 3
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 3
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 3
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 3
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 3
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 3
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 3
- 235000010643 Leucaena leucocephala Nutrition 0.000 description 3
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 3
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 3
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 3
- IPTUBUUIFRZMJK-ACRUOGEOSA-N Lys-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 IPTUBUUIFRZMJK-ACRUOGEOSA-N 0.000 description 3
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 3
- UWHCKWNPWKTMBM-WDCWCFNPSA-N Lys-Thr-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWHCKWNPWKTMBM-WDCWCFNPSA-N 0.000 description 3
- 241000124008 Mammalia Species 0.000 description 3
- CFRRIZLGFGJEDB-SRVKXCTJSA-N Met-His-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CFRRIZLGFGJEDB-SRVKXCTJSA-N 0.000 description 3
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 3
- ZMXDDKWLCZADIW-UHFFFAOYSA-N N,N-Dimethylformamide Chemical compound CN(C)C=O ZMXDDKWLCZADIW-UHFFFAOYSA-N 0.000 description 3
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 3
- 108010066427 N-valyltryptophan Proteins 0.000 description 3
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 3
- 238000000636 Northern blotting Methods 0.000 description 3
- 108091028043 Nucleic acid sequence Proteins 0.000 description 3
- 108090000526 Papain Proteins 0.000 description 3
- 102000007079 Peptide Fragments Human genes 0.000 description 3
- 108010033276 Peptide Fragments Proteins 0.000 description 3
- AUJWXNGCAQWLEI-KBPBESRZSA-N Phe-Lys-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O AUJWXNGCAQWLEI-KBPBESRZSA-N 0.000 description 3
- MRWOVVNKSXXLRP-IHPCNDPISA-N Phe-Ser-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MRWOVVNKSXXLRP-IHPCNDPISA-N 0.000 description 3
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 3
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 3
- CPRLKHJUFAXVTD-ULQDDVLXSA-N Pro-Leu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CPRLKHJUFAXVTD-ULQDDVLXSA-N 0.000 description 3
- RTQKBZIRDWZLDF-BZSNNMDCSA-N Pro-Pro-Trp Chemical compound C([C@H]1C(=O)N[C@@H](CC=2C3=CC=CC=C3NC=2)C(=O)O)CCN1C(=O)[C@@H]1CCCN1 RTQKBZIRDWZLDF-BZSNNMDCSA-N 0.000 description 3
- GBUNEGKQPSAMNK-QTKMDUPCSA-N Pro-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2)O GBUNEGKQPSAMNK-QTKMDUPCSA-N 0.000 description 3
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 3
- DNIAPMSPPWPWGF-UHFFFAOYSA-N Propylene glycol Chemical compound CC(O)CO DNIAPMSPPWPWGF-UHFFFAOYSA-N 0.000 description 3
- 239000004365 Protease Substances 0.000 description 3
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 3
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 3
- GZFAWAQTEYDKII-YUMQZZPRSA-N Ser-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO GZFAWAQTEYDKII-YUMQZZPRSA-N 0.000 description 3
- BEAFYHFQTOTVFS-VGDYDELISA-N Ser-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N BEAFYHFQTOTVFS-VGDYDELISA-N 0.000 description 3
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 3
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 3
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 3
- FZEUTKVQGMVGHW-AVGNSLFASA-N Ser-Phe-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZEUTKVQGMVGHW-AVGNSLFASA-N 0.000 description 3
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 3
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 3
- UKKROEYWYIHWBD-ZKWXMUAHSA-N Ser-Val-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UKKROEYWYIHWBD-ZKWXMUAHSA-N 0.000 description 3
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 3
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 3
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 3
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 3
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 3
- FDALPRWYVKJCLL-PMVVWTBXSA-N Thr-His-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O FDALPRWYVKJCLL-PMVVWTBXSA-N 0.000 description 3
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 3
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 3
- XIULAFZYEKSGAJ-IXOXFDKPSA-N Thr-Leu-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CNC=N1 XIULAFZYEKSGAJ-IXOXFDKPSA-N 0.000 description 3
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 3
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 3
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 3
- NECCMBOBBANRIT-RNXOBYDBSA-N Trp-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NECCMBOBBANRIT-RNXOBYDBSA-N 0.000 description 3
- XHALUUQSNXSPLP-UFYCRDLUSA-N Tyr-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XHALUUQSNXSPLP-UFYCRDLUSA-N 0.000 description 3
- UMXSDHPSMROQRB-YJRXYDGGSA-N Tyr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O UMXSDHPSMROQRB-YJRXYDGGSA-N 0.000 description 3
- HKYTWJOWZTWBQB-AVGNSLFASA-N Tyr-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HKYTWJOWZTWBQB-AVGNSLFASA-N 0.000 description 3
- CVXURBLRELTJKO-BWAGICSOSA-N Tyr-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)O CVXURBLRELTJKO-BWAGICSOSA-N 0.000 description 3
- RZAGEHHVNYESNR-RNXOBYDBSA-N Tyr-Trp-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RZAGEHHVNYESNR-RNXOBYDBSA-N 0.000 description 3
- QVYFTFIBKCDHIE-ACRUOGEOSA-N Tyr-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O QVYFTFIBKCDHIE-ACRUOGEOSA-N 0.000 description 3
- 241000700618 Vaccinia virus Species 0.000 description 3
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 3
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 3
- VIKZGAUAKQZDOF-NRPADANISA-N Val-Ser-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VIKZGAUAKQZDOF-NRPADANISA-N 0.000 description 3
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 3
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 3
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 3
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 3
- 235000010443 alginic acid Nutrition 0.000 description 3
- 229920000615 alginic acid Polymers 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 3
- 108010068380 arginylarginine Proteins 0.000 description 3
- 108010068265 aspartyltyrosine Proteins 0.000 description 3
- 238000003149 assay kit Methods 0.000 description 3
- 230000004071 biological effect Effects 0.000 description 3
- 210000001185 bone marrow Anatomy 0.000 description 3
- 210000004899 c-terminal region Anatomy 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 239000003086 colorant Substances 0.000 description 3
- 239000002131 composite material Substances 0.000 description 3
- 108010069495 cysteinyltyrosine Proteins 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 238000002405 diagnostic procedure Methods 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 3
- 238000009826 distribution Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 108700004025 env Genes Proteins 0.000 description 3
- 229940088598 enzyme Drugs 0.000 description 3
- MHMNJMPURVTYEJ-UHFFFAOYSA-N fluorescein-5-isothiocyanate Chemical compound O1C(=O)C2=CC(N=C=S)=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 MHMNJMPURVTYEJ-UHFFFAOYSA-N 0.000 description 3
- 238000001943 fluorescence-activated cell sorting Methods 0.000 description 3
- 230000003325 follicular Effects 0.000 description 3
- 230000004927 fusion Effects 0.000 description 3
- 108700004026 gag Genes Proteins 0.000 description 3
- 238000001415 gene therapy Methods 0.000 description 3
- 210000004602 germ cell Anatomy 0.000 description 3
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 3
- 230000013595 glycosylation Effects 0.000 description 3
- 238000006206 glycosylation reaction Methods 0.000 description 3
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 3
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 3
- 108010020688 glycylhistidine Proteins 0.000 description 3
- 108010077515 glycylproline Proteins 0.000 description 3
- 210000003714 granulocyte Anatomy 0.000 description 3
- 108010028295 histidylhistidine Proteins 0.000 description 3
- 108010092114 histidylphenylalanine Proteins 0.000 description 3
- 230000001900 immune effect Effects 0.000 description 3
- 238000002649 immunization Methods 0.000 description 3
- 229940124452 immunizing agent Drugs 0.000 description 3
- 238000003018 immunoassay Methods 0.000 description 3
- 230000016784 immunoglobulin production Effects 0.000 description 3
- 230000008676 import Effects 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 208000015181 infectious disease Diseases 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 210000003734 kidney Anatomy 0.000 description 3
- 108010053037 kyotorphin Proteins 0.000 description 3
- 108010057821 leucylproline Proteins 0.000 description 3
- 208000032839 leukemia Diseases 0.000 description 3
- 108010038320 lysylphenylalanine Proteins 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 239000002674 ointment Substances 0.000 description 3
- 230000008520 organization Effects 0.000 description 3
- 229940055729 papain Drugs 0.000 description 3
- 235000019834 papain Nutrition 0.000 description 3
- 239000002245 particle Substances 0.000 description 3
- 210000005259 peripheral blood Anatomy 0.000 description 3
- 239000011886 peripheral blood Substances 0.000 description 3
- 108010018625 phenylalanylarginine Proteins 0.000 description 3
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 210000004180 plasmocyte Anatomy 0.000 description 3
- 108700004029 pol Genes Proteins 0.000 description 3
- 210000001236 prokaryotic cell Anatomy 0.000 description 3
- 108010077112 prolyl-proline Proteins 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 230000011664 signaling Effects 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 208000024891 symptom Diseases 0.000 description 3
- 238000001890 transfection Methods 0.000 description 3
- 230000014621 translational initiation Effects 0.000 description 3
- 108010080629 tryptophan-leucine Proteins 0.000 description 3
- 108010051110 tyrosyl-lysine Proteins 0.000 description 3
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 3
- 238000001262 western blot Methods 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- NNRFRJQMBSBXGO-CIUDSAMLSA-N (3s)-3-[[2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]acetyl]amino]-4-[[(1s)-1-carboxy-2-hydroxyethyl]amino]-4-oxobutanoic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NNRFRJQMBSBXGO-CIUDSAMLSA-N 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 2
- PBAMJJXWDQXOJA-FXQIFTODSA-N Ala-Asp-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PBAMJJXWDQXOJA-FXQIFTODSA-N 0.000 description 2
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 2
- CVHJIWVKTFNGHT-ACZMJKKPSA-N Ala-Gln-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N CVHJIWVKTFNGHT-ACZMJKKPSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- RGDKRCPIFODMHK-HJWJTTGWSA-N Ala-Leu-Leu-His Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RGDKRCPIFODMHK-HJWJTTGWSA-N 0.000 description 2
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 2
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 2
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 2
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 2
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 2
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 2
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 2
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 2
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 2
- LFFOJBOTZUWINF-ZANVPECISA-N Ala-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)NCC(O)=O)=CNC2=C1 LFFOJBOTZUWINF-ZANVPECISA-N 0.000 description 2
- VYMJAWXRWHJIMS-LKTVYLICSA-N Ala-Tyr-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VYMJAWXRWHJIMS-LKTVYLICSA-N 0.000 description 2
- JNJHNBXBGNJESC-KKXDTOCCSA-N Ala-Tyr-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JNJHNBXBGNJESC-KKXDTOCCSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 2
- GXCSUJQOECMKPV-CIUDSAMLSA-N Arg-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GXCSUJQOECMKPV-CIUDSAMLSA-N 0.000 description 2
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 2
- BIOCIVSVEDFKDJ-GUBZILKMSA-N Arg-Arg-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O BIOCIVSVEDFKDJ-GUBZILKMSA-N 0.000 description 2
- MFAMTAVAFBPXDC-LPEHRKFASA-N Arg-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O MFAMTAVAFBPXDC-LPEHRKFASA-N 0.000 description 2
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 2
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 2
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 2
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 2
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 2
- GRRXPUAICOGISM-RWMBFGLXSA-N Arg-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GRRXPUAICOGISM-RWMBFGLXSA-N 0.000 description 2
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 2
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 2
- KXOPYFNQLVUOAQ-FXQIFTODSA-N Arg-Ser-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KXOPYFNQLVUOAQ-FXQIFTODSA-N 0.000 description 2
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 2
- AMIQZQAAYGYKOP-FXQIFTODSA-N Arg-Ser-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O AMIQZQAAYGYKOP-FXQIFTODSA-N 0.000 description 2
- BECXEHHOZNFFFX-IHRRRGAJSA-N Arg-Ser-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O BECXEHHOZNFFFX-IHRRRGAJSA-N 0.000 description 2
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 2
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 2
- CTAPSNCVKPOOSM-KKUMJFAQSA-N Arg-Tyr-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O CTAPSNCVKPOOSM-KKUMJFAQSA-N 0.000 description 2
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 2
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 2
- 206010003445 Ascites Diseases 0.000 description 2
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 2
- PHJPKNUWWHRAOC-PEFMBERDSA-N Asn-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PHJPKNUWWHRAOC-PEFMBERDSA-N 0.000 description 2
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 2
- JWQWPRCDYWNVNM-ACZMJKKPSA-N Asn-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N JWQWPRCDYWNVNM-ACZMJKKPSA-N 0.000 description 2
- MYTHOBCLNIOFBL-SRVKXCTJSA-N Asn-Ser-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYTHOBCLNIOFBL-SRVKXCTJSA-N 0.000 description 2
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 2
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 2
- MYRLSKYSMXNLLA-LAEOZQHASA-N Asn-Val-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MYRLSKYSMXNLLA-LAEOZQHASA-N 0.000 description 2
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 2
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 2
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 2
- ZVGRHIRJLWBWGJ-ACZMJKKPSA-N Asp-Ser-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVGRHIRJLWBWGJ-ACZMJKKPSA-N 0.000 description 2
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 2
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 2
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 2
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 2
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 2
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 2
- 208000010839 B-cell chronic lymphocytic leukemia Diseases 0.000 description 2
- 241000283690 Bos taurus Species 0.000 description 2
- VTYYLEPIZMXCLO-UHFFFAOYSA-L Calcium carbonate Chemical compound [Ca+2].[O-]C([O-])=O VTYYLEPIZMXCLO-UHFFFAOYSA-L 0.000 description 2
- 229920002261 Corn starch Polymers 0.000 description 2
- VNLYIYOYUNGURO-ZLUOBGJFSA-N Cys-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N VNLYIYOYUNGURO-ZLUOBGJFSA-N 0.000 description 2
- KEBJBKIASQVRJS-WDSKDSINSA-N Cys-Gln-Gly Chemical compound C(CC(=O)N)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N KEBJBKIASQVRJS-WDSKDSINSA-N 0.000 description 2
- SBORMUFGKSCGEN-XHNCKOQMSA-N Cys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N)C(=O)O SBORMUFGKSCGEN-XHNCKOQMSA-N 0.000 description 2
- ODDOYXKAHLKKQY-MMWGEVLESA-N Cys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N ODDOYXKAHLKKQY-MMWGEVLESA-N 0.000 description 2
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 2
- NRVQLLDIJJEIIZ-VZFHVOOUSA-N Cys-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N)O NRVQLLDIJJEIIZ-VZFHVOOUSA-N 0.000 description 2
- KXHAPEPORGOXDT-UWJYBYFXSA-N Cys-Tyr-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O KXHAPEPORGOXDT-UWJYBYFXSA-N 0.000 description 2
- 230000006820 DNA synthesis Effects 0.000 description 2
- 241000702421 Dependoparvovirus Species 0.000 description 2
- VSLCIGXQLCYQTD-NPJQDHAYSA-N Dynorphin B (10-13) Chemical compound NCCCC[C@@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(C)C)C(=O)N[C@@H]([C@H](C)O)C(O)=O VSLCIGXQLCYQTD-NPJQDHAYSA-N 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- 241000588724 Escherichia coli Species 0.000 description 2
- 102100031507 Fc receptor-like protein 5 Human genes 0.000 description 2
- 241000233866 Fungi Species 0.000 description 2
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 2
- RGXXLQWXBFNXTG-CIUDSAMLSA-N Gln-Arg-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O RGXXLQWXBFNXTG-CIUDSAMLSA-N 0.000 description 2
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 2
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 2
- VNCLJDOTEPPBBD-GUBZILKMSA-N Gln-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N VNCLJDOTEPPBBD-GUBZILKMSA-N 0.000 description 2
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 2
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 2
- KHGGWBRVRPHFMH-PEFMBERDSA-N Gln-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHGGWBRVRPHFMH-PEFMBERDSA-N 0.000 description 2
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 2
- ZBKUIQNCRIYVGH-SDDRHHMPSA-N Gln-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZBKUIQNCRIYVGH-SDDRHHMPSA-N 0.000 description 2
- WEAVZFWWIPIANL-SRVKXCTJSA-N Gln-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N WEAVZFWWIPIANL-SRVKXCTJSA-N 0.000 description 2
- ZEEPYMXTJWIMSN-GUBZILKMSA-N Gln-Lys-Ser Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CO)C(O)=O)NC(=O)[C@@H](N)CCC(N)=O ZEEPYMXTJWIMSN-GUBZILKMSA-N 0.000 description 2
- ZVQZXPADLZIQFF-FHWLQOOXSA-N Gln-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 ZVQZXPADLZIQFF-FHWLQOOXSA-N 0.000 description 2
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 2
- VNTGPISAOMAXRK-CIUDSAMLSA-N Gln-Pro-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O VNTGPISAOMAXRK-CIUDSAMLSA-N 0.000 description 2
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 2
- BYKZWDGMJLNFJY-XKBZYTNZSA-N Gln-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N)O BYKZWDGMJLNFJY-XKBZYTNZSA-N 0.000 description 2
- YRHZWVKUFWCEPW-GLLZPBPUSA-N Gln-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O YRHZWVKUFWCEPW-GLLZPBPUSA-N 0.000 description 2
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 2
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 2
- GJLXZITZLUUXMJ-NHCYSSNCSA-N Gln-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GJLXZITZLUUXMJ-NHCYSSNCSA-N 0.000 description 2
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 2
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 2
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 2
- WOSRKEJQESVHGA-CIUDSAMLSA-N Glu-Arg-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O WOSRKEJQESVHGA-CIUDSAMLSA-N 0.000 description 2
- AFODTOLGSZQDSL-PEFMBERDSA-N Glu-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N AFODTOLGSZQDSL-PEFMBERDSA-N 0.000 description 2
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 2
- LSTFYPOGBGFIPP-FXQIFTODSA-N Glu-Cys-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O LSTFYPOGBGFIPP-FXQIFTODSA-N 0.000 description 2
- ALCAUWPAMLVUDB-FXQIFTODSA-N Glu-Gln-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ALCAUWPAMLVUDB-FXQIFTODSA-N 0.000 description 2
- HUFCEIHAFNVSNR-IHRRRGAJSA-N Glu-Gln-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HUFCEIHAFNVSNR-IHRRRGAJSA-N 0.000 description 2
- APHGWLWMOXGZRL-DCAQKATOSA-N Glu-Glu-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O APHGWLWMOXGZRL-DCAQKATOSA-N 0.000 description 2
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 2
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 2
- WVYJNPCWJYBHJG-YVNDNENWSA-N Glu-Ile-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O WVYJNPCWJYBHJG-YVNDNENWSA-N 0.000 description 2
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 2
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 2
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 2
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 2
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 2
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 2
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 2
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 2
- VHPVBPCCWVDGJL-IRIUXVKKSA-N Glu-Thr-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VHPVBPCCWVDGJL-IRIUXVKKSA-N 0.000 description 2
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 2
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 2
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 2
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 2
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 2
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 2
- XBWMTPAIUQIWKA-BYULHYEWSA-N Gly-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN XBWMTPAIUQIWKA-BYULHYEWSA-N 0.000 description 2
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 2
- VOCMRCVMAPSSAL-IUCAKERBSA-N Gly-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN VOCMRCVMAPSSAL-IUCAKERBSA-N 0.000 description 2
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 2
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 2
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 2
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 2
- CUYLIWAAAYJKJH-RYUDHWBXSA-N Gly-Glu-Tyr Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CUYLIWAAAYJKJH-RYUDHWBXSA-N 0.000 description 2
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- QITBQGJOXQYMOA-ZETCQYMHSA-N Gly-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)CN QITBQGJOXQYMOA-ZETCQYMHSA-N 0.000 description 2
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 2
- LRQXRHGQEVWGPV-NHCYSSNCSA-N Gly-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN LRQXRHGQEVWGPV-NHCYSSNCSA-N 0.000 description 2
- VEPBEGNDJYANCF-QWRGUYRKSA-N Gly-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN VEPBEGNDJYANCF-QWRGUYRKSA-N 0.000 description 2
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 2
- ZZJVYSAQQMDIRD-UWVGGRQHSA-N Gly-Pro-His Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ZZJVYSAQQMDIRD-UWVGGRQHSA-N 0.000 description 2
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- PYFIQROSWQERAS-LBPRGKRZSA-N Gly-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)CN)C(=O)NCC(O)=O)=CNC2=C1 PYFIQROSWQERAS-LBPRGKRZSA-N 0.000 description 2
- HQSKKSLNLSTONK-JTQLQIEISA-N Gly-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 HQSKKSLNLSTONK-JTQLQIEISA-N 0.000 description 2
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 2
- DVHGLDYMGWTYKW-GUBZILKMSA-N His-Gln-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DVHGLDYMGWTYKW-GUBZILKMSA-N 0.000 description 2
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 2
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 2
- JUIOPCXACJLRJK-AVGNSLFASA-N His-Lys-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N JUIOPCXACJLRJK-AVGNSLFASA-N 0.000 description 2
- WCHONUZTYDQMBY-PYJNHQTQSA-N His-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WCHONUZTYDQMBY-PYJNHQTQSA-N 0.000 description 2
- DLTCGJZBNFOWFL-LKTVYLICSA-N His-Tyr-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CN=CN2)N DLTCGJZBNFOWFL-LKTVYLICSA-N 0.000 description 2
- 208000017604 Hodgkin disease Diseases 0.000 description 2
- 101000846908 Homo sapiens Fc receptor-like protein 5 Proteins 0.000 description 2
- 241000725303 Human immunodeficiency virus Species 0.000 description 2
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 2
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 2
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 2
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 2
- WZDCVAWMBUNDDY-KBIXCLLPSA-N Ile-Glu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C)C(=O)O)N WZDCVAWMBUNDDY-KBIXCLLPSA-N 0.000 description 2
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 2
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 2
- FCWFBHMAJZGWRY-XUXIUFHCSA-N Ile-Leu-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N FCWFBHMAJZGWRY-XUXIUFHCSA-N 0.000 description 2
- IIWQTXMUALXGOV-PCBIJLKTSA-N Ile-Phe-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N IIWQTXMUALXGOV-PCBIJLKTSA-N 0.000 description 2
- VZSDQFZFTCVEGF-ZEWNOJEFSA-N Ile-Phe-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1ccc(O)cc1)C(O)=O VZSDQFZFTCVEGF-ZEWNOJEFSA-N 0.000 description 2
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 2
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 2
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 2
- 108090001007 Interleukin-8 Proteins 0.000 description 2
- 102000004890 Interleukin-8 Human genes 0.000 description 2
- KVRKAGGMEWNURO-CIUDSAMLSA-N Leu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N KVRKAGGMEWNURO-CIUDSAMLSA-N 0.000 description 2
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 2
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 2
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 2
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 2
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 2
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 2
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 2
- KEVYYIMVELOXCT-KBPBESRZSA-N Leu-Gly-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KEVYYIMVELOXCT-KBPBESRZSA-N 0.000 description 2
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 2
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 2
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 2
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 2
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 2
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 2
- QNTJIDXQHWUBKC-BZSNNMDCSA-N Leu-Lys-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QNTJIDXQHWUBKC-BZSNNMDCSA-N 0.000 description 2
- JVTYXRRFZCEPPK-RHYQMDGZSA-N Leu-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC(C)C)N)O JVTYXRRFZCEPPK-RHYQMDGZSA-N 0.000 description 2
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 2
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 2
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 2
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 2
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 2
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 2
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 2
- CANPXOLVTMKURR-WEDXCCLWSA-N Lys-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN CANPXOLVTMKURR-WEDXCCLWSA-N 0.000 description 2
- HAUUXTXKJNVIFY-ONGXEEELSA-N Lys-Gly-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAUUXTXKJNVIFY-ONGXEEELSA-N 0.000 description 2
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 2
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 2
- ATNKHRAIZCMCCN-BZSNNMDCSA-N Lys-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N ATNKHRAIZCMCCN-BZSNNMDCSA-N 0.000 description 2
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 2
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 2
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 2
- LMMBAXJRYSXCOQ-ACRUOGEOSA-N Lys-Tyr-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O LMMBAXJRYSXCOQ-ACRUOGEOSA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 2
- PNDCUTDWYVKBHX-IHRRRGAJSA-N Met-Asp-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PNDCUTDWYVKBHX-IHRRRGAJSA-N 0.000 description 2
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 2
- XMQZLGBUJMMODC-AVGNSLFASA-N Met-His-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O XMQZLGBUJMMODC-AVGNSLFASA-N 0.000 description 2
- YLDSJJOGQNEQJK-AVGNSLFASA-N Met-Pro-Leu Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YLDSJJOGQNEQJK-AVGNSLFASA-N 0.000 description 2
- DBMLDOWSVHMQQN-XGEHTFHBSA-N Met-Ser-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DBMLDOWSVHMQQN-XGEHTFHBSA-N 0.000 description 2
- QQPMHUCGDRJFQK-RHYQMDGZSA-N Met-Thr-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QQPMHUCGDRJFQK-RHYQMDGZSA-N 0.000 description 2
- IHRFZLQEQVHXFA-RHYQMDGZSA-N Met-Thr-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCCN IHRFZLQEQVHXFA-RHYQMDGZSA-N 0.000 description 2
- CNFMPVYIVQUJOO-NHCYSSNCSA-N Met-Val-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O CNFMPVYIVQUJOO-NHCYSSNCSA-N 0.000 description 2
- 229920000168 Microcrystalline cellulose Polymers 0.000 description 2
- 241000713869 Moloney murine leukemia virus Species 0.000 description 2
- 108700005084 Multigene Family Proteins 0.000 description 2
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 2
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 2
- 102000007999 Nuclear Proteins Human genes 0.000 description 2
- 108010089610 Nuclear Proteins Proteins 0.000 description 2
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 2
- 241000283973 Oryctolagus cuniculus Species 0.000 description 2
- 238000010222 PCR analysis Methods 0.000 description 2
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 2
- HPECNYCQLSVCHH-BZSNNMDCSA-N Phe-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N HPECNYCQLSVCHH-BZSNNMDCSA-N 0.000 description 2
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 2
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 2
- FZBGMXYQPACKNC-HJWJTTGWSA-N Phe-Pro-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FZBGMXYQPACKNC-HJWJTTGWSA-N 0.000 description 2
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 2
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 2
- APECKGGXAXNFLL-RNXOBYDBSA-N Phe-Trp-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 APECKGGXAXNFLL-RNXOBYDBSA-N 0.000 description 2
- 108010004729 Phycoerythrin Proteins 0.000 description 2
- 239000002202 Polyethylene glycol Substances 0.000 description 2
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 2
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 2
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 2
- TXPUNZXZDVJUJQ-LPEHRKFASA-N Pro-Asn-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O TXPUNZXZDVJUJQ-LPEHRKFASA-N 0.000 description 2
- ZYBUKTMPPFQSHL-JYJNAYRXSA-N Pro-Asp-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O ZYBUKTMPPFQSHL-JYJNAYRXSA-N 0.000 description 2
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 2
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 2
- OFGUOWQVEGTVNU-DCAQKATOSA-N Pro-Lys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OFGUOWQVEGTVNU-DCAQKATOSA-N 0.000 description 2
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 2
- JFBJPBZSTMXGKL-JYJNAYRXSA-N Pro-Met-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JFBJPBZSTMXGKL-JYJNAYRXSA-N 0.000 description 2
- DWPXHLIBFQLKLK-CYDGBPFRSA-N Pro-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 DWPXHLIBFQLKLK-CYDGBPFRSA-N 0.000 description 2
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 2
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 2
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 2
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 2
- VBZXFFYOBDLLFE-HSHDSVGOSA-N Pro-Trp-Thr Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@H](O)C)C(O)=O)C(=O)[C@@H]1CCCN1 VBZXFFYOBDLLFE-HSHDSVGOSA-N 0.000 description 2
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 2
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 2
- PGSWNLRYYONGPE-JYJNAYRXSA-N Pro-Val-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PGSWNLRYYONGPE-JYJNAYRXSA-N 0.000 description 2
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 2
- 241000125945 Protoparvovirus Species 0.000 description 2
- 108010003201 RGH 0205 Proteins 0.000 description 2
- 238000002123 RNA extraction Methods 0.000 description 2
- 241000700159 Rattus Species 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 2
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 2
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 2
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 2
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 2
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 2
- CTRHXXXHUJTTRZ-ZLUOBGJFSA-N Ser-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O CTRHXXXHUJTTRZ-ZLUOBGJFSA-N 0.000 description 2
- DBIDZNUXSLXVRG-FXQIFTODSA-N Ser-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N DBIDZNUXSLXVRG-FXQIFTODSA-N 0.000 description 2
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 2
- KMWFXJCGRXBQAC-CIUDSAMLSA-N Ser-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N KMWFXJCGRXBQAC-CIUDSAMLSA-N 0.000 description 2
- RNMRYWZYFHHOEV-CIUDSAMLSA-N Ser-Gln-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RNMRYWZYFHHOEV-CIUDSAMLSA-N 0.000 description 2
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 2
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 2
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 2
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 2
- UIPXCLNLUUAMJU-JBDRJPRFSA-N Ser-Ile-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UIPXCLNLUUAMJU-JBDRJPRFSA-N 0.000 description 2
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 2
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 2
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 2
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 2
- GVMUJUPXFQFBBZ-GUBZILKMSA-N Ser-Lys-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GVMUJUPXFQFBBZ-GUBZILKMSA-N 0.000 description 2
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 2
- XQAPEISNMXNKGE-FXQIFTODSA-N Ser-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CS)C(=O)O XQAPEISNMXNKGE-FXQIFTODSA-N 0.000 description 2
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 2
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 2
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 2
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 2
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 2
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 2
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 2
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- RTXKJFWHEBTABY-IHPCNDPISA-N Ser-Trp-Tyr Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)NC(=O)[C@H](CO)N RTXKJFWHEBTABY-IHPCNDPISA-N 0.000 description 2
- BIWBTRRBHIEVAH-IHPCNDPISA-N Ser-Tyr-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BIWBTRRBHIEVAH-IHPCNDPISA-N 0.000 description 2
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 2
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 2
- 241000700584 Simplexvirus Species 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- TWLMXDWFVNEFFK-FJXKBIBVSA-N Thr-Arg-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O TWLMXDWFVNEFFK-FJXKBIBVSA-N 0.000 description 2
- QGXCWPNQVCYJEL-NUMRIWBASA-N Thr-Asn-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QGXCWPNQVCYJEL-NUMRIWBASA-N 0.000 description 2
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 2
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 2
- DIPIPFHFLPTCLK-LOKLDPHHSA-N Thr-Gln-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O DIPIPFHFLPTCLK-LOKLDPHHSA-N 0.000 description 2
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 2
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 2
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 2
- KLCCPYZXGXHAGS-QTKMDUPCSA-N Thr-His-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N)O KLCCPYZXGXHAGS-QTKMDUPCSA-N 0.000 description 2
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- TZJSEJOXAIWOST-RHYQMDGZSA-N Thr-Lys-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N TZJSEJOXAIWOST-RHYQMDGZSA-N 0.000 description 2
- UJQVSMNQMQHVRY-KZVJFYERSA-N Thr-Met-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UJQVSMNQMQHVRY-KZVJFYERSA-N 0.000 description 2
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 2
- BIBYEFRASCNLAA-CDMKHQONSA-N Thr-Phe-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 BIBYEFRASCNLAA-CDMKHQONSA-N 0.000 description 2
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 2
- MEBDIIKMUUNBSB-RPTUDFQQSA-N Thr-Phe-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MEBDIIKMUUNBSB-RPTUDFQQSA-N 0.000 description 2
- VTMGKRABARCZAX-OSUNSFLBSA-N Thr-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O VTMGKRABARCZAX-OSUNSFLBSA-N 0.000 description 2
- XZUBGOYOGDRYFC-XGEHTFHBSA-N Thr-Ser-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O XZUBGOYOGDRYFC-XGEHTFHBSA-N 0.000 description 2
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 2
- GJOBRAHDRIDAPT-NGTWOADLSA-N Thr-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H]([C@@H](C)O)N GJOBRAHDRIDAPT-NGTWOADLSA-N 0.000 description 2
- CYCGARJWIQWPQM-YJRXYDGGSA-N Thr-Tyr-Ser Chemical compound C[C@@H](O)[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CO)C([O-])=O)CC1=CC=C(O)C=C1 CYCGARJWIQWPQM-YJRXYDGGSA-N 0.000 description 2
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 2
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- 108700019146 Transgenes Proteins 0.000 description 2
- YHRCLOURJWJABF-WDSOQIARSA-N Trp-His-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N YHRCLOURJWJABF-WDSOQIARSA-N 0.000 description 2
- XGFGVFMXDXALEV-XIRDDKMYSA-N Trp-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N XGFGVFMXDXALEV-XIRDDKMYSA-N 0.000 description 2
- MEZCXKYMMQJRDE-PMVMPFDFSA-N Trp-Leu-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CC(C)C)C(O)=O)C1=CC=C(O)C=C1 MEZCXKYMMQJRDE-PMVMPFDFSA-N 0.000 description 2
- KOVPHHXMHLFWPL-BPUTZDHNSA-N Trp-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CC(=O)N)C(=O)O KOVPHHXMHLFWPL-BPUTZDHNSA-N 0.000 description 2
- ARKBYVBCEOWRNR-UBHSHLNASA-N Trp-Ser-Ser Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O ARKBYVBCEOWRNR-UBHSHLNASA-N 0.000 description 2
- DTPWXZXGFAHEKL-NWLDYVSISA-N Trp-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DTPWXZXGFAHEKL-NWLDYVSISA-N 0.000 description 2
- UIDJDMVRDUANDL-BVSLBCMMSA-N Trp-Tyr-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UIDJDMVRDUANDL-BVSLBCMMSA-N 0.000 description 2
- PKZIWSHDJYIPRH-JBACZVJFSA-N Trp-Tyr-Gln Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKZIWSHDJYIPRH-JBACZVJFSA-N 0.000 description 2
- YCQKQFKXBPJXRY-PMVMPFDFSA-N Trp-Tyr-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)N[C@@H](CCCCN)C(=O)O)N YCQKQFKXBPJXRY-PMVMPFDFSA-N 0.000 description 2
- NIHNMOSRSAYZIT-BPNCWPANSA-N Tyr-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NIHNMOSRSAYZIT-BPNCWPANSA-N 0.000 description 2
- QOEZFICGUZTRFX-IHRRRGAJSA-N Tyr-Cys-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O QOEZFICGUZTRFX-IHRRRGAJSA-N 0.000 description 2
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 2
- GFJXBLSZOFWHAW-JYJNAYRXSA-N Tyr-His-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O GFJXBLSZOFWHAW-JYJNAYRXSA-N 0.000 description 2
- WPXKRJVHBXYLDT-JUKXBJQTSA-N Tyr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N WPXKRJVHBXYLDT-JUKXBJQTSA-N 0.000 description 2
- SOAUMCDLIUGXJJ-SRVKXCTJSA-N Tyr-Ser-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O SOAUMCDLIUGXJJ-SRVKXCTJSA-N 0.000 description 2
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 2
- ANHVRCNNGJMJNG-BZSNNMDCSA-N Tyr-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CS)C(=O)O)N)O ANHVRCNNGJMJNG-BZSNNMDCSA-N 0.000 description 2
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 2
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 2
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 2
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 2
- NWDOPHYLSORNEX-QXEWZRGKSA-N Val-Asn-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N NWDOPHYLSORNEX-QXEWZRGKSA-N 0.000 description 2
- SCBITHMBEJNRHC-LSJOCFKGSA-N Val-Asp-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N SCBITHMBEJNRHC-LSJOCFKGSA-N 0.000 description 2
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 2
- ROLGIBMFNMZANA-GVXVVHGQSA-N Val-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N ROLGIBMFNMZANA-GVXVVHGQSA-N 0.000 description 2
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 2
- OPGWZDIYEYJVRX-AVGNSLFASA-N Val-His-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OPGWZDIYEYJVRX-AVGNSLFASA-N 0.000 description 2
- XBRMBDFYOFARST-AVGNSLFASA-N Val-His-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N XBRMBDFYOFARST-AVGNSLFASA-N 0.000 description 2
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 2
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 2
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 2
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 2
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 2
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 2
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 2
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 2
- KRAHMIJVUPUOTQ-DCAQKATOSA-N Val-Ser-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KRAHMIJVUPUOTQ-DCAQKATOSA-N 0.000 description 2
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 2
- SVLAAUGFIHSJPK-JYJNAYRXSA-N Val-Trp-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SVLAAUGFIHSJPK-JYJNAYRXSA-N 0.000 description 2
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 2
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 2
- 239000002253 acid Substances 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 239000004480 active ingredient Substances 0.000 description 2
- 238000001042 affinity chromatography Methods 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 239000000783 alginic acid Substances 0.000 description 2
- 229960001126 alginic acid Drugs 0.000 description 2
- 150000004781 alginic acids Chemical class 0.000 description 2
- 208000026935 allergic disease Diseases 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 2
- 108010089975 arginyl-glycyl-aspartyl-serine Proteins 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 2
- 206010003246 arthritis Diseases 0.000 description 2
- 108010093581 aspartyl-proline Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 230000036755 cellular response Effects 0.000 description 2
- 238000006482 condensation reaction Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- 239000008120 corn starch Substances 0.000 description 2
- 239000006071 cream Substances 0.000 description 2
- 210000004748 cultured cell Anatomy 0.000 description 2
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 108010054812 diprotin A Proteins 0.000 description 2
- 239000003814 drug Substances 0.000 description 2
- 239000003937 drug carrier Substances 0.000 description 2
- 239000012636 effector Substances 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 210000000267 erythroid cell Anatomy 0.000 description 2
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 2
- 229960005542 ethidium bromide Drugs 0.000 description 2
- LZCLXQDLBQLTDK-UHFFFAOYSA-N ethyl 2-hydroxypropanoate Chemical compound CCOC(=O)C(C)O LZCLXQDLBQLTDK-UHFFFAOYSA-N 0.000 description 2
- 239000013613 expression plasmid Substances 0.000 description 2
- 239000012530 fluid Substances 0.000 description 2
- 238000002875 fluorescence polarization Methods 0.000 description 2
- 230000002068 genetic effect Effects 0.000 description 2
- 230000002518 glial effect Effects 0.000 description 2
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 2
- 125000000404 glutamine group Chemical group N[C@@H](CCC(N)=O)C(=O)* 0.000 description 2
- 102000006602 glyceraldehyde-3-phosphate dehydrogenase Human genes 0.000 description 2
- 108020004445 glyceraldehyde-3-phosphate dehydrogenase Proteins 0.000 description 2
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 2
- 108010087823 glycyltyrosine Proteins 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 2
- 210000005260 human cell Anatomy 0.000 description 2
- FDGQSTZJBFJUBT-UHFFFAOYSA-N hypoxanthine Chemical compound O=C1NC=NC2=C1NC=N2 FDGQSTZJBFJUBT-UHFFFAOYSA-N 0.000 description 2
- 238000003384 imaging method Methods 0.000 description 2
- 230000028993 immune response Effects 0.000 description 2
- 230000002163 immunogen Effects 0.000 description 2
- 238000007901 in situ hybridization Methods 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 230000001965 increasing effect Effects 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- XKTZWUACRZHVAN-VADRZIEHSA-N interleukin-8 Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@@H](NC(C)=O)CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(=O)N1[C@H](CCC1)C(=O)N1[C@H](CCC1)C(=O)N[C@@H](C)C(=O)N[C@H](CC(O)=O)C(=O)N[C@H](CCC(O)=O)C(=O)N[C@H](CC(O)=O)C(=O)N[C@H](CC=1C=CC(O)=CC=1)C(=O)N[C@H](CO)C(=O)N1[C@H](CCC1)C(N)=O)C1=CC=CC=C1 XKTZWUACRZHVAN-VADRZIEHSA-N 0.000 description 2
- 229940096397 interleukin-8 Drugs 0.000 description 2
- 238000010253 intravenous injection Methods 0.000 description 2
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- 238000005304 joining Methods 0.000 description 2
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 239000012669 liquid formulation Substances 0.000 description 2
- 210000004185 liver Anatomy 0.000 description 2
- 210000004072 lung Anatomy 0.000 description 2
- 108010009298 lysylglutamic acid Proteins 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 210000003519 mature b lymphocyte Anatomy 0.000 description 2
- 238000002844 melting Methods 0.000 description 2
- 230000008018 melting Effects 0.000 description 2
- 210000001806 memory b lymphocyte Anatomy 0.000 description 2
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 2
- 229920000609 methyl cellulose Polymers 0.000 description 2
- 239000001923 methylcellulose Substances 0.000 description 2
- 235000010981 methylcellulose Nutrition 0.000 description 2
- 235000019813 microcrystalline cellulose Nutrition 0.000 description 2
- 239000008108 microcrystalline cellulose Substances 0.000 description 2
- 229940016286 microcrystalline cellulose Drugs 0.000 description 2
- 210000001616 monocyte Anatomy 0.000 description 2
- 230000009871 nonspecific binding Effects 0.000 description 2
- 239000003921 oil Substances 0.000 description 2
- 235000019198 oils Nutrition 0.000 description 2
- 239000002751 oligonucleotide probe Substances 0.000 description 2
- 210000001672 ovary Anatomy 0.000 description 2
- 210000002741 palatine tonsil Anatomy 0.000 description 2
- 210000005105 peripheral blood lymphocyte Anatomy 0.000 description 2
- 239000008194 pharmaceutical composition Substances 0.000 description 2
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 2
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 2
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 2
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 2
- 229920001223 polyethylene glycol Polymers 0.000 description 2
- 239000013641 positive control Substances 0.000 description 2
- 230000002062 proliferating effect Effects 0.000 description 2
- 108010007513 prolyl-glycyl-prolyl-leucine Proteins 0.000 description 2
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 108020001580 protein domains Proteins 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 239000011347 resin Substances 0.000 description 2
- 229920005989 resin Polymers 0.000 description 2
- 238000002821 scintillation proximity assay Methods 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 239000007787 solid Substances 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 229940124597 therapeutic agent Drugs 0.000 description 2
- 125000000341 threoninyl group Chemical group [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 description 2
- 210000001541 thymus gland Anatomy 0.000 description 2
- 230000000699 topical effect Effects 0.000 description 2
- 230000009466 transformation Effects 0.000 description 2
- 230000032258 transport Effects 0.000 description 2
- 108010029384 tryptophyl-histidine Proteins 0.000 description 2
- 241000701447 unidentified baculovirus Species 0.000 description 2
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- CNKBMTKICGGSCQ-ACRUOGEOSA-N (2S)-2-[[(2S)-2-[[(2S)-2,6-diamino-1-oxohexyl]amino]-1-oxo-3-phenylpropyl]amino]-3-(4-hydroxyphenyl)propanoic acid Chemical compound C([C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 CNKBMTKICGGSCQ-ACRUOGEOSA-N 0.000 description 1
- JNTMAZFVYNDPLB-PEDHHIEDSA-N (2S,3S)-2-[[[(2S)-1-[(2S,3S)-2-amino-3-methyl-1-oxopentyl]-2-pyrrolidinyl]-oxomethyl]amino]-3-methylpentanoic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JNTMAZFVYNDPLB-PEDHHIEDSA-N 0.000 description 1
- WCDDVEOXEIYWFB-VXORFPGASA-N (2s,3s,4r,5r,6r)-3-[(2s,3r,5s,6r)-3-acetamido-5-hydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-4,5,6-trihydroxyoxane-2-carboxylic acid Chemical compound CC(=O)N[C@@H]1C[C@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](C(O)=O)O[C@@H](O)[C@H](O)[C@H]1O WCDDVEOXEIYWFB-VXORFPGASA-N 0.000 description 1
- 125000003088 (fluoren-9-ylmethoxy)carbonyl group Chemical group 0.000 description 1
- GZCWLCBFPRFLKL-UHFFFAOYSA-N 1-prop-2-ynoxypropan-2-ol Chemical compound CC(O)COCC#C GZCWLCBFPRFLKL-UHFFFAOYSA-N 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- 125000003287 1H-imidazol-4-ylmethyl group Chemical group [H]N1C([H])=NC(C([H])([H])[*])=C1[H] 0.000 description 1
- DQVAZKGVGKHQDS-UHFFFAOYSA-N 2-[[1-[2-[(2-amino-4-methylpentanoyl)amino]-4-methylpentanoyl]pyrrolidine-2-carbonyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(O)=O DQVAZKGVGKHQDS-UHFFFAOYSA-N 0.000 description 1
- YEJQWBFDKKTPNO-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-3-methylbutanoyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]-3-methylbutanoic acid Chemical compound CC(C)C(N)C(=O)N1CCCC1C(=O)NCC(=O)NC(C(C)C)C(O)=O YEJQWBFDKKTPNO-UHFFFAOYSA-N 0.000 description 1
- GOJUJUVQIVIZAV-UHFFFAOYSA-N 2-amino-4,6-dichloropyrimidine-5-carbaldehyde Chemical group NC1=NC(Cl)=C(C=O)C(Cl)=N1 GOJUJUVQIVIZAV-UHFFFAOYSA-N 0.000 description 1
- SNBCLPGEMZEWLU-QXFUBDJGSA-N 2-chloro-n-[[(2r,3s,5r)-3-hydroxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methyl]acetamide Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CNC(=O)CCl)[C@@H](O)C1 SNBCLPGEMZEWLU-QXFUBDJGSA-N 0.000 description 1
- PINRUEQFGKWBTO-UHFFFAOYSA-N 3-methyl-5-phenyl-1,3-oxazolidin-2-imine Chemical compound O1C(=N)N(C)CC1C1=CC=CC=C1 PINRUEQFGKWBTO-UHFFFAOYSA-N 0.000 description 1
- OSJPPGNTCRNQQC-UWTATZPHSA-N 3-phospho-D-glyceric acid Chemical compound OC(=O)[C@H](O)COP(O)(O)=O OSJPPGNTCRNQQC-UWTATZPHSA-N 0.000 description 1
- TVZGACDUOSZQKY-LBPRGKRZSA-N 4-aminofolic acid Chemical compound C1=NC2=NC(N)=NC(N)=C2N=C1CNC1=CC=C(C(=O)N[C@@H](CCC(O)=O)C(O)=O)C=C1 TVZGACDUOSZQKY-LBPRGKRZSA-N 0.000 description 1
- HIQIXEFWDLTDED-UHFFFAOYSA-N 4-hydroxy-1-piperidin-4-ylpyrrolidin-2-one Chemical compound O=C1CC(O)CN1C1CCNCC1 HIQIXEFWDLTDED-UHFFFAOYSA-N 0.000 description 1
- 239000013607 AAV vector Substances 0.000 description 1
- 102000013563 Acid Phosphatase Human genes 0.000 description 1
- 108010051457 Acid Phosphatase Proteins 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 208000026872 Addison Disease Diseases 0.000 description 1
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 1
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 1
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 1
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 1
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 1
- NJIFPLAJSVUQOZ-JBDRJPRFSA-N Ala-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C)N NJIFPLAJSVUQOZ-JBDRJPRFSA-N 0.000 description 1
- LGFCAXJBAZESCF-ACZMJKKPSA-N Ala-Gln-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O LGFCAXJBAZESCF-ACZMJKKPSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- FBHOPGDGELNWRH-DRZSPHRISA-N Ala-Glu-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FBHOPGDGELNWRH-DRZSPHRISA-N 0.000 description 1
- XYTNPQNAZREREP-XQXXSGGOSA-N Ala-Glu-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XYTNPQNAZREREP-XQXXSGGOSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 1
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 1
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 1
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 1
- DPNZTBKGAUAZQU-DLOVCJGASA-N Ala-Leu-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DPNZTBKGAUAZQU-DLOVCJGASA-N 0.000 description 1
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 1
- DRARURMRLANNLS-GUBZILKMSA-N Ala-Met-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O DRARURMRLANNLS-GUBZILKMSA-N 0.000 description 1
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- BHTBAVZSZCQZPT-GUBZILKMSA-N Ala-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N BHTBAVZSZCQZPT-GUBZILKMSA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 1
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 1
- XQNRANMFRPCFFW-GCJQMDKQSA-N Ala-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C)N)O XQNRANMFRPCFFW-GCJQMDKQSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 1
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 1
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 1
- 101710194180 Alcohol oxidase 1 Proteins 0.000 description 1
- 239000005995 Aluminium silicate Substances 0.000 description 1
- 102100022749 Aminopeptidase N Human genes 0.000 description 1
- 108091093088 Amplicon Proteins 0.000 description 1
- 229920000856 Amylose Polymers 0.000 description 1
- 208000032467 Aplastic anaemia Diseases 0.000 description 1
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 1
- NABSCJGZKWSNHX-RCWTZXSCSA-N Arg-Arg-Thr Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N NABSCJGZKWSNHX-RCWTZXSCSA-N 0.000 description 1
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- YSUVMPICYVWRBX-VEVYYDQMSA-N Arg-Asp-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O YSUVMPICYVWRBX-VEVYYDQMSA-N 0.000 description 1
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 1
- BEXGZLUHRXTZCC-CIUDSAMLSA-N Arg-Gln-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N BEXGZLUHRXTZCC-CIUDSAMLSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- RKRSYHCNPFGMTA-CIUDSAMLSA-N Arg-Glu-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O RKRSYHCNPFGMTA-CIUDSAMLSA-N 0.000 description 1
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 1
- NVUIWHJLPSZZQC-CYDGBPFRSA-N Arg-Ile-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NVUIWHJLPSZZQC-CYDGBPFRSA-N 0.000 description 1
- YKBHOXLMMPZPHQ-GMOBBJLQSA-N Arg-Ile-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O YKBHOXLMMPZPHQ-GMOBBJLQSA-N 0.000 description 1
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 1
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 1
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 1
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 1
- RIQBRKVTFBWEDY-RHYQMDGZSA-N Arg-Lys-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RIQBRKVTFBWEDY-RHYQMDGZSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 1
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 1
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- FBXMCPLCVYUWBO-BPUTZDHNSA-N Arg-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N FBXMCPLCVYUWBO-BPUTZDHNSA-N 0.000 description 1
- OQPAZKMGCWPERI-GUBZILKMSA-N Arg-Ser-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OQPAZKMGCWPERI-GUBZILKMSA-N 0.000 description 1
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- QHUOOCKNNURZSL-IHRRRGAJSA-N Arg-Tyr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O QHUOOCKNNURZSL-IHRRRGAJSA-N 0.000 description 1
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- 239000004475 Arginine Substances 0.000 description 1
- 206010003210 Arteriosclerosis Diseases 0.000 description 1
- SWLOHUMCUDRTCL-ZLUOBGJFSA-N Asn-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N SWLOHUMCUDRTCL-ZLUOBGJFSA-N 0.000 description 1
- LEFKSBYHUGUWLP-ACZMJKKPSA-N Asn-Ala-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LEFKSBYHUGUWLP-ACZMJKKPSA-N 0.000 description 1
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 1
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 1
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 1
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 1
- WONGRTVAMHFGBE-WDSKDSINSA-N Asn-Gly-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N WONGRTVAMHFGBE-WDSKDSINSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 1
- NKLRWRRVYGQNIH-GHCJXIJMSA-N Asn-Ile-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O NKLRWRRVYGQNIH-GHCJXIJMSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- LTZIRYMWOJHRCH-GUDRVLHUSA-N Asn-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N LTZIRYMWOJHRCH-GUDRVLHUSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- HFPXZWPUVFVNLL-GUBZILKMSA-N Asn-Leu-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HFPXZWPUVFVNLL-GUBZILKMSA-N 0.000 description 1
- HDHZCEDPLTVHFZ-GUBZILKMSA-N Asn-Leu-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O HDHZCEDPLTVHFZ-GUBZILKMSA-N 0.000 description 1
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 1
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 1
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 1
- PBFXCUOEGVJTMV-QXEWZRGKSA-N Asn-Met-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O PBFXCUOEGVJTMV-QXEWZRGKSA-N 0.000 description 1
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 1
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 1
- UGXYFDQFLVCDFC-CIUDSAMLSA-N Asn-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O UGXYFDQFLVCDFC-CIUDSAMLSA-N 0.000 description 1
- HPNDKUOLNRVRAY-BIIVOSGPSA-N Asn-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N)C(=O)O HPNDKUOLNRVRAY-BIIVOSGPSA-N 0.000 description 1
- WUQXMTITJLFXAU-JIOCBJNQSA-N Asn-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N)O WUQXMTITJLFXAU-JIOCBJNQSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- RDLYUKRPEJERMM-XIRDDKMYSA-N Asn-Trp-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O RDLYUKRPEJERMM-XIRDDKMYSA-N 0.000 description 1
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 1
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 1
- CGYKCTPUGXFPMG-IHPCNDPISA-N Asn-Tyr-Trp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O CGYKCTPUGXFPMG-IHPCNDPISA-N 0.000 description 1
- XZFONYMRYTVLPL-NHCYSSNCSA-N Asn-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N XZFONYMRYTVLPL-NHCYSSNCSA-N 0.000 description 1
- UWMIZBCTVWVMFI-FXQIFTODSA-N Asp-Ala-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UWMIZBCTVWVMFI-FXQIFTODSA-N 0.000 description 1
- RGKKALNPOYURGE-ZKWXMUAHSA-N Asp-Ala-Val Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O RGKKALNPOYURGE-ZKWXMUAHSA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 1
- BFOYULZBKYOKAN-OLHMAJIHSA-N Asp-Asp-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BFOYULZBKYOKAN-OLHMAJIHSA-N 0.000 description 1
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- RATOMFTUDRYMKX-ACZMJKKPSA-N Asp-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N RATOMFTUDRYMKX-ACZMJKKPSA-N 0.000 description 1
- VFUXXFVCYZPOQG-WDSKDSINSA-N Asp-Glu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VFUXXFVCYZPOQG-WDSKDSINSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- ZSVJVIOVABDTTL-YUMQZZPRSA-N Asp-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)O)N ZSVJVIOVABDTTL-YUMQZZPRSA-N 0.000 description 1
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 1
- NRIFEOUAFLTMFJ-AAEUAGOBSA-N Asp-Gly-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NRIFEOUAFLTMFJ-AAEUAGOBSA-N 0.000 description 1
- PGUYEUCYVNZGGV-QWRGUYRKSA-N Asp-Gly-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 PGUYEUCYVNZGGV-QWRGUYRKSA-N 0.000 description 1
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 1
- KFAFUJMGHVVYRC-DCAQKATOSA-N Asp-Leu-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O KFAFUJMGHVVYRC-DCAQKATOSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- DYDKXJWQCIVTMR-WDSKDSINSA-N Asp-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O DYDKXJWQCIVTMR-WDSKDSINSA-N 0.000 description 1
- YTXCCDCOHIYQFC-GUBZILKMSA-N Asp-Met-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTXCCDCOHIYQFC-GUBZILKMSA-N 0.000 description 1
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 1
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 1
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 1
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 1
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 1
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 1
- 241000416162 Astragalus gummifer Species 0.000 description 1
- 208000012657 Atopic disease Diseases 0.000 description 1
- 102100039341 Atrial natriuretic peptide receptor 2 Human genes 0.000 description 1
- 101710102159 Atrial natriuretic peptide receptor 2 Proteins 0.000 description 1
- 241000271566 Aves Species 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 208000023328 Basedow disease Diseases 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 241000220450 Cajanus cajan Species 0.000 description 1
- 241000222120 Candida <Saccharomycetales> Species 0.000 description 1
- 241000282472 Canis lupus familiaris Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 101710132601 Capsid protein Proteins 0.000 description 1
- 208000031229 Cardiomyopathies Diseases 0.000 description 1
- 241000700199 Cavia porcellus Species 0.000 description 1
- 108010001857 Cell Surface Receptors Proteins 0.000 description 1
- 102000000844 Cell Surface Receptors Human genes 0.000 description 1
- 241000282552 Chlorocebus aethiops Species 0.000 description 1
- 101710094648 Coat protein Proteins 0.000 description 1
- 101100007328 Cocos nucifera COS-1 gene Proteins 0.000 description 1
- 241000557626 Corvus corax Species 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- 208000011231 Crohn disease Diseases 0.000 description 1
- GRNOCLDFUNCIDW-ACZMJKKPSA-N Cys-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N GRNOCLDFUNCIDW-ACZMJKKPSA-N 0.000 description 1
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 1
- NQSUTVRXXBGVDQ-LKXGYXEUSA-N Cys-Asn-Thr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NQSUTVRXXBGVDQ-LKXGYXEUSA-N 0.000 description 1
- FEJCUYOGOBCFOQ-ACZMJKKPSA-N Cys-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N FEJCUYOGOBCFOQ-ACZMJKKPSA-N 0.000 description 1
- BVFQOPGFOQVZTE-ACZMJKKPSA-N Cys-Gln-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O BVFQOPGFOQVZTE-ACZMJKKPSA-N 0.000 description 1
- MBILEVLLOHJZMG-FXQIFTODSA-N Cys-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MBILEVLLOHJZMG-FXQIFTODSA-N 0.000 description 1
- VKAWJBQTFCBHQY-GUBZILKMSA-N Cys-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N VKAWJBQTFCBHQY-GUBZILKMSA-N 0.000 description 1
- HHABWQIFXZPZCK-ACZMJKKPSA-N Cys-Gln-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N HHABWQIFXZPZCK-ACZMJKKPSA-N 0.000 description 1
- UXUSHQYYQCZWET-WDSKDSINSA-N Cys-Glu-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O UXUSHQYYQCZWET-WDSKDSINSA-N 0.000 description 1
- BDWIZLQVVWQMTB-XKBZYTNZSA-N Cys-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N)O BDWIZLQVVWQMTB-XKBZYTNZSA-N 0.000 description 1
- VIRYODQIWJNWNU-NRPADANISA-N Cys-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N VIRYODQIWJNWNU-NRPADANISA-N 0.000 description 1
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 1
- LBSKYJOZIIOZIO-DCAQKATOSA-N Cys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N LBSKYJOZIIOZIO-DCAQKATOSA-N 0.000 description 1
- MXZYQNJCBVJHSR-KATARQTJSA-N Cys-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)O MXZYQNJCBVJHSR-KATARQTJSA-N 0.000 description 1
- UIKLEGZPIOXFHJ-DLOVCJGASA-N Cys-Phe-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O UIKLEGZPIOXFHJ-DLOVCJGASA-N 0.000 description 1
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 1
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 1
- DQGIAOGALAQBGK-BWBBJGPYSA-N Cys-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O DQGIAOGALAQBGK-BWBBJGPYSA-N 0.000 description 1
- ALNKNYKSZPSLBD-ZDLURKLDSA-N Cys-Thr-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O ALNKNYKSZPSLBD-ZDLURKLDSA-N 0.000 description 1
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 1
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- 241000450599 DNA viruses Species 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 238000009007 Diagnostic Kit Methods 0.000 description 1
- 235000019739 Dicalciumphosphate Nutrition 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- 239000006144 Dulbecco’s modified Eagle's medium Substances 0.000 description 1
- 101150029662 E1 gene Proteins 0.000 description 1
- 238000008157 ELISA kit Methods 0.000 description 1
- 241000991587 Enterovirus C Species 0.000 description 1
- 239000004593 Epoxy Substances 0.000 description 1
- 241000283086 Equidae Species 0.000 description 1
- 241000283073 Equus caballus Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 239000001856 Ethyl cellulose Substances 0.000 description 1
- ZZSNKZQZMQGXPY-UHFFFAOYSA-N Ethyl cellulose Chemical compound CCOCC1OC(OC)C(OCC)C(OCC)C1OC1C(O)C(O)C(OC)C(CO)O1 ZZSNKZQZMQGXPY-UHFFFAOYSA-N 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 108050001049 Extracellular proteins Proteins 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- 102100031513 Fc receptor-like protein 4 Human genes 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 108010010803 Gelatin Proteins 0.000 description 1
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 1
- 206010071602 Genetic polymorphism Diseases 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 1
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- QFTRCUPCARNIPZ-XHNCKOQMSA-N Gln-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)C(=O)O QFTRCUPCARNIPZ-XHNCKOQMSA-N 0.000 description 1
- MFLMFRZBAJSGHK-ACZMJKKPSA-N Gln-Cys-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N MFLMFRZBAJSGHK-ACZMJKKPSA-N 0.000 description 1
- VSXBYIJUAXPAAL-WDSKDSINSA-N Gln-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O VSXBYIJUAXPAAL-WDSKDSINSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- NXPXQIZKDOXIHH-JSGCOSHPSA-N Gln-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N NXPXQIZKDOXIHH-JSGCOSHPSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- GIVHPCWYVWUUSG-HVTMNAMFSA-N Gln-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GIVHPCWYVWUUSG-HVTMNAMFSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 1
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 1
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 1
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 1
- KSKFIECUYMYWNS-AVGNSLFASA-N Gln-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N KSKFIECUYMYWNS-AVGNSLFASA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- FALJZCPMTGJOHX-SRVKXCTJSA-N Gln-Met-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O FALJZCPMTGJOHX-SRVKXCTJSA-N 0.000 description 1
- SFAFZYYMAWOCIC-KKUMJFAQSA-N Gln-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SFAFZYYMAWOCIC-KKUMJFAQSA-N 0.000 description 1
- UESYBOXFJWJVSB-AVGNSLFASA-N Gln-Phe-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O UESYBOXFJWJVSB-AVGNSLFASA-N 0.000 description 1
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 1
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 1
- NYCVMJGIJYQWDO-CIUDSAMLSA-N Gln-Ser-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NYCVMJGIJYQWDO-CIUDSAMLSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 1
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 1
- OUBUHIODTNUUTC-WDCWCFNPSA-N Gln-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O OUBUHIODTNUUTC-WDCWCFNPSA-N 0.000 description 1
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- VCUNGPMMPNJSGS-JYJNAYRXSA-N Gln-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VCUNGPMMPNJSGS-JYJNAYRXSA-N 0.000 description 1
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 1
- MKRDNSWGJWTBKZ-GVXVVHGQSA-N Gln-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MKRDNSWGJWTBKZ-GVXVVHGQSA-N 0.000 description 1
- HNAUFGBKJLTWQE-IFFSRLJSSA-N Gln-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCC(=O)N)N)O HNAUFGBKJLTWQE-IFFSRLJSSA-N 0.000 description 1
- FTMLQFPULNGION-ZVZYQTTQSA-N Gln-Val-Trp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O FTMLQFPULNGION-ZVZYQTTQSA-N 0.000 description 1
- SOEXCCGNHQBFPV-DLOVCJGASA-N Gln-Val-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SOEXCCGNHQBFPV-DLOVCJGASA-N 0.000 description 1
- 206010018364 Glomerulonephritis Diseases 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 1
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 1
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 1
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 1
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 1
- MLCPTRRNICEKIS-FXQIFTODSA-N Glu-Asn-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLCPTRRNICEKIS-FXQIFTODSA-N 0.000 description 1
- CKRUHITYRFNUKW-WDSKDSINSA-N Glu-Asn-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O CKRUHITYRFNUKW-WDSKDSINSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 1
- ZJICFHQSPWFBKP-AVGNSLFASA-N Glu-Asn-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZJICFHQSPWFBKP-AVGNSLFASA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 1
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 1
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 1
- XMVLTPMCUJTJQP-FXQIFTODSA-N Glu-Gln-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N XMVLTPMCUJTJQP-FXQIFTODSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 1
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 1
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- GXMXPCXXKVWOSM-KQXIARHKSA-N Glu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N GXMXPCXXKVWOSM-KQXIARHKSA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 1
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 1
- RXESHTOTINOODU-JYJNAYRXSA-N Glu-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N RXESHTOTINOODU-JYJNAYRXSA-N 0.000 description 1
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 1
- ZKONLKQGTNVAPR-DCAQKATOSA-N Glu-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)O)N ZKONLKQGTNVAPR-DCAQKATOSA-N 0.000 description 1
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 1
- GMVCSRBOSIUTFC-FXQIFTODSA-N Glu-Ser-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMVCSRBOSIUTFC-FXQIFTODSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- JSIQVRIXMINMTA-ZDLURKLDSA-N Glu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](N)CCC(O)=O JSIQVRIXMINMTA-ZDLURKLDSA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 1
- DTLLNDVORUEOTM-WDCWCFNPSA-N Glu-Thr-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DTLLNDVORUEOTM-WDCWCFNPSA-N 0.000 description 1
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 1
- XOEKMEAOMXMURD-JYJNAYRXSA-N Glu-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O XOEKMEAOMXMURD-JYJNAYRXSA-N 0.000 description 1
- QLNKFGTZOBVMCS-JBACZVJFSA-N Glu-Tyr-Trp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O QLNKFGTZOBVMCS-JBACZVJFSA-N 0.000 description 1
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- QXUPRMQJDWJDFR-NRPADANISA-N Glu-Val-Ser Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXUPRMQJDWJDFR-NRPADANISA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- JBRBACJPBZNFMF-YUMQZZPRSA-N Gly-Ala-Lys Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN JBRBACJPBZNFMF-YUMQZZPRSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 1
- DUYYPIRFTLOAJQ-YUMQZZPRSA-N Gly-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN DUYYPIRFTLOAJQ-YUMQZZPRSA-N 0.000 description 1
- JVACNFOPSUPDTK-QWRGUYRKSA-N Gly-Asn-Phe Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JVACNFOPSUPDTK-QWRGUYRKSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 1
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 1
- BYYNJRSNDARRBX-YFKPBYRVSA-N Gly-Gln-Gly Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O BYYNJRSNDARRBX-YFKPBYRVSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- INLIXXRWNUKVCF-JTQLQIEISA-N Gly-Gly-Tyr Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 INLIXXRWNUKVCF-JTQLQIEISA-N 0.000 description 1
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 1
- QSVMIMFAAZPCAQ-PMVVWTBXSA-N Gly-His-Thr Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QSVMIMFAAZPCAQ-PMVVWTBXSA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- IUZGUFAJDBHQQV-YUMQZZPRSA-N Gly-Leu-Asn Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IUZGUFAJDBHQQV-YUMQZZPRSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 1
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 1
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- LOEANKRDMMVOGZ-YUMQZZPRSA-N Gly-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(O)=O)C(O)=O LOEANKRDMMVOGZ-YUMQZZPRSA-N 0.000 description 1
- MHZXESQPPXOING-KBPBESRZSA-N Gly-Lys-Phe Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MHZXESQPPXOING-KBPBESRZSA-N 0.000 description 1
- WDEHMRNSGHVNOH-VHSXEESVSA-N Gly-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)CN)C(=O)O WDEHMRNSGHVNOH-VHSXEESVSA-N 0.000 description 1
- YYXJFBMCOUSYSF-RYUDHWBXSA-N Gly-Phe-Gln Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYXJFBMCOUSYSF-RYUDHWBXSA-N 0.000 description 1
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 1
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 1
- CSMYMGFCEJWALV-WDSKDSINSA-N Gly-Ser-Gln Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O CSMYMGFCEJWALV-WDSKDSINSA-N 0.000 description 1
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- TVTZEOHWHUVYCG-KYNKHSRBSA-N Gly-Thr-Thr Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O TVTZEOHWHUVYCG-KYNKHSRBSA-N 0.000 description 1
- LKJCZEPXHOIAIW-HOTGVXAUSA-N Gly-Trp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN LKJCZEPXHOIAIW-HOTGVXAUSA-N 0.000 description 1
- ONSARSFSJHTMFJ-STQMWFEESA-N Gly-Trp-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ONSARSFSJHTMFJ-STQMWFEESA-N 0.000 description 1
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 1
- NGBGZCUWFVVJKC-IRXDYDNUSA-N Gly-Tyr-Tyr Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 NGBGZCUWFVVJKC-IRXDYDNUSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- 239000004471 Glycine Substances 0.000 description 1
- AEMRFAOFKBGASW-UHFFFAOYSA-M Glycolate Chemical compound OCC([O-])=O AEMRFAOFKBGASW-UHFFFAOYSA-M 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 1
- 208000024869 Goodpasture syndrome Diseases 0.000 description 1
- 208000009329 Graft vs Host Disease Diseases 0.000 description 1
- 108050002220 Green fluorescent protein, GFP Proteins 0.000 description 1
- 208000001204 Hashimoto Disease Diseases 0.000 description 1
- 208000030836 Hashimoto thyroiditis Diseases 0.000 description 1
- 208000002250 Hematologic Neoplasms Diseases 0.000 description 1
- 208000035186 Hemolytic Autoimmune Anemia Diseases 0.000 description 1
- 102100026122 High affinity immunoglobulin gamma Fc receptor I Human genes 0.000 description 1
- DCRODRAURLJOFY-XPUUQOCRSA-N His-Ala-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)NCC(O)=O DCRODRAURLJOFY-XPUUQOCRSA-N 0.000 description 1
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 1
- QSLKWWDKIXMWJV-SRVKXCTJSA-N His-Cys-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N QSLKWWDKIXMWJV-SRVKXCTJSA-N 0.000 description 1
- UJWYPUUXIAKEES-CUJWVEQBSA-N His-Cys-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UJWYPUUXIAKEES-CUJWVEQBSA-N 0.000 description 1
- NWGXCPUKPVISSJ-AVGNSLFASA-N His-Gln-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N NWGXCPUKPVISSJ-AVGNSLFASA-N 0.000 description 1
- AKEDPWJFQULLPE-IUCAKERBSA-N His-Glu-Gly Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O AKEDPWJFQULLPE-IUCAKERBSA-N 0.000 description 1
- KWBISLAEQZUYIC-UWJYBYFXSA-N His-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CN=CN2)N KWBISLAEQZUYIC-UWJYBYFXSA-N 0.000 description 1
- IDQNVIWPPWAFSY-AVGNSLFASA-N His-His-Gln Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O IDQNVIWPPWAFSY-AVGNSLFASA-N 0.000 description 1
- JIUYRPFQJJRSJB-QWRGUYRKSA-N His-His-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)NCC(O)=O)C1=CN=CN1 JIUYRPFQJJRSJB-QWRGUYRKSA-N 0.000 description 1
- MLZVJIREOKTDAR-SIGLWIIPSA-N His-Ile-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MLZVJIREOKTDAR-SIGLWIIPSA-N 0.000 description 1
- QEYUCKCWTMIERU-SRVKXCTJSA-N His-Lys-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QEYUCKCWTMIERU-SRVKXCTJSA-N 0.000 description 1
- LDFWDDVELNOGII-MXAVVETBSA-N His-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N LDFWDDVELNOGII-MXAVVETBSA-N 0.000 description 1
- XDIVYNSPYBLSME-DCAQKATOSA-N His-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N XDIVYNSPYBLSME-DCAQKATOSA-N 0.000 description 1
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 1
- KAXZXLSXFWSNNZ-XVYDVKMFSA-N His-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KAXZXLSXFWSNNZ-XVYDVKMFSA-N 0.000 description 1
- DGLAHESNTJWGDO-SRVKXCTJSA-N His-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DGLAHESNTJWGDO-SRVKXCTJSA-N 0.000 description 1
- CUEQQFOGARVNHU-VGDYDELISA-N His-Ser-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUEQQFOGARVNHU-VGDYDELISA-N 0.000 description 1
- UOYGZBIPZYKGSH-SRVKXCTJSA-N His-Ser-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N UOYGZBIPZYKGSH-SRVKXCTJSA-N 0.000 description 1
- RXKFKJVJVHLRIE-XIRDDKMYSA-N His-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC3=CN=CN3)N RXKFKJVJVHLRIE-XIRDDKMYSA-N 0.000 description 1
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 1
- WSXNWASHQNSMRX-GVXVVHGQSA-N His-Val-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSXNWASHQNSMRX-GVXVVHGQSA-N 0.000 description 1
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- FBOMZVOKCZMDIG-XQQFMLRXSA-N His-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N FBOMZVOKCZMDIG-XQQFMLRXSA-N 0.000 description 1
- 101000757160 Homo sapiens Aminopeptidase N Proteins 0.000 description 1
- 101000846909 Homo sapiens Fc receptor-like protein 4 Proteins 0.000 description 1
- 101000913074 Homo sapiens High affinity immunoglobulin gamma Fc receptor I Proteins 0.000 description 1
- 101000945335 Homo sapiens Killer cell immunoglobulin-like receptor 2DL5B Proteins 0.000 description 1
- 101000945339 Homo sapiens Killer cell immunoglobulin-like receptor 2DS2 Proteins 0.000 description 1
- 101000946889 Homo sapiens Monocyte differentiation antigen CD14 Proteins 0.000 description 1
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- 206010021074 Hypoplastic anaemia Diseases 0.000 description 1
- UGQMRVRMYYASKQ-UHFFFAOYSA-N Hypoxanthine nucleoside Natural products OC1C(O)C(CO)OC1N1C(NC=NC2=O)=C2N=C1 UGQMRVRMYYASKQ-UHFFFAOYSA-N 0.000 description 1
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 1
- TZCGZYWNIDZZMR-NAKRPEOUSA-N Ile-Arg-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C)C(=O)O)N TZCGZYWNIDZZMR-NAKRPEOUSA-N 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- AZEYWPUCOYXFOE-CYDGBPFRSA-N Ile-Arg-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](C(C)C)C(=O)O)N AZEYWPUCOYXFOE-CYDGBPFRSA-N 0.000 description 1
- LLZLRXBTOOFODM-QSFUFRPTSA-N Ile-Asp-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N LLZLRXBTOOFODM-QSFUFRPTSA-N 0.000 description 1
- DURWCDDDAWVPOP-JBDRJPRFSA-N Ile-Cys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N DURWCDDDAWVPOP-JBDRJPRFSA-N 0.000 description 1
- YBJWJQQBWRARLT-KBIXCLLPSA-N Ile-Gln-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O YBJWJQQBWRARLT-KBIXCLLPSA-N 0.000 description 1
- DVRDRICMWUSCBN-UKJIMTQDSA-N Ile-Gln-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DVRDRICMWUSCBN-UKJIMTQDSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- WUKLZPHVWAMZQV-UKJIMTQDSA-N Ile-Glu-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](C(C)C)C(=O)O)N WUKLZPHVWAMZQV-UKJIMTQDSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- AMSYMDIIIRJRKZ-HJPIBITLSA-N Ile-His-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AMSYMDIIIRJRKZ-HJPIBITLSA-N 0.000 description 1
- AFERFBZLVUFWRA-HTFCKZLJSA-N Ile-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)O)N AFERFBZLVUFWRA-HTFCKZLJSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- MASWXTFJVNRZPT-NAKRPEOUSA-N Ile-Met-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)O)N MASWXTFJVNRZPT-NAKRPEOUSA-N 0.000 description 1
- RENBRDSDKPSRIH-HJWJTTGWSA-N Ile-Phe-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O RENBRDSDKPSRIH-HJWJTTGWSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- BATWGBRIZANGPN-ZPFDUUQYSA-N Ile-Pro-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)N)C(=O)O)N BATWGBRIZANGPN-ZPFDUUQYSA-N 0.000 description 1
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 1
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- JZNVOBUNTWNZPW-GHCJXIJMSA-N Ile-Ser-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N JZNVOBUNTWNZPW-GHCJXIJMSA-N 0.000 description 1
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 1
- DGTOKVBDZXJHNZ-WZLNRYEVSA-N Ile-Thr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N DGTOKVBDZXJHNZ-WZLNRYEVSA-N 0.000 description 1
- RMJWFINHACYKJI-SIUGBPQLSA-N Ile-Tyr-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RMJWFINHACYKJI-SIUGBPQLSA-N 0.000 description 1
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 1
- NGKPIPCGMLWHBX-WZLNRYEVSA-N Ile-Tyr-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N NGKPIPCGMLWHBX-WZLNRYEVSA-N 0.000 description 1
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- 102000009786 Immunoglobulin Constant Regions Human genes 0.000 description 1
- 108010009817 Immunoglobulin Constant Regions Proteins 0.000 description 1
- 102000018071 Immunoglobulin Fc Fragments Human genes 0.000 description 1
- 108010091135 Immunoglobulin Fc Fragments Proteins 0.000 description 1
- 108700005091 Immunoglobulin Genes Proteins 0.000 description 1
- 102000012745 Immunoglobulin Subunits Human genes 0.000 description 1
- 108010079585 Immunoglobulin Subunits Proteins 0.000 description 1
- 102100033628 Killer cell immunoglobulin-like receptor 2DL5B Human genes 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- ODKSFYDXXFIFQN-BYPYZUCNSA-P L-argininium(2+) Chemical compound NC(=[NH2+])NCCC[C@H]([NH3+])C(O)=O ODKSFYDXXFIFQN-BYPYZUCNSA-P 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 208000031671 Large B-Cell Diffuse Lymphoma Diseases 0.000 description 1
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 1
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 1
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 1
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 1
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 1
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 1
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 1
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 1
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 1
- AOFYPTOHESIBFZ-KKUMJFAQSA-N Leu-His-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O AOFYPTOHESIBFZ-KKUMJFAQSA-N 0.000 description 1
- XBCWOTOCBXXJDG-BZSNNMDCSA-N Leu-His-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 XBCWOTOCBXXJDG-BZSNNMDCSA-N 0.000 description 1
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 1
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 1
- HGFGEMSVBMCFKK-MNXVOIDGSA-N Leu-Ile-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O HGFGEMSVBMCFKK-MNXVOIDGSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 1
- UBZGNBKMIJHOHL-BZSNNMDCSA-N Leu-Leu-Phe Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 UBZGNBKMIJHOHL-BZSNNMDCSA-N 0.000 description 1
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 1
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 1
- UCNNZELZXFXXJQ-BZSNNMDCSA-N Leu-Leu-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCNNZELZXFXXJQ-BZSNNMDCSA-N 0.000 description 1
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 1
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 1
- KQFZKDITNUEVFJ-JYJNAYRXSA-N Leu-Phe-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=CC=C1 KQFZKDITNUEVFJ-JYJNAYRXSA-N 0.000 description 1
- INCJJHQRZGQLFC-KBPBESRZSA-N Leu-Phe-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O INCJJHQRZGQLFC-KBPBESRZSA-N 0.000 description 1
- KTOIECMYZZGVSI-BZSNNMDCSA-N Leu-Phe-His Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CC=CC=C1 KTOIECMYZZGVSI-BZSNNMDCSA-N 0.000 description 1
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 1
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 1
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 1
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- LSLUTXRANSUGFY-XIRDDKMYSA-N Leu-Trp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O LSLUTXRANSUGFY-XIRDDKMYSA-N 0.000 description 1
- HOMFINRJHIIZNJ-HOCLYGCPSA-N Leu-Trp-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O HOMFINRJHIIZNJ-HOCLYGCPSA-N 0.000 description 1
- LXGSOEPHQJONMG-PMVMPFDFSA-N Leu-Trp-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O)N LXGSOEPHQJONMG-PMVMPFDFSA-N 0.000 description 1
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 1
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 1
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 1
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- MSFITIBEMPWCBD-ULQDDVLXSA-N Leu-Val-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MSFITIBEMPWCBD-ULQDDVLXSA-N 0.000 description 1
- 206010025323 Lymphomas Diseases 0.000 description 1
- XFIHDSBIPWEYJJ-YUMQZZPRSA-N Lys-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN XFIHDSBIPWEYJJ-YUMQZZPRSA-N 0.000 description 1
- ABHIXYDMILIUKV-CIUDSAMLSA-N Lys-Asn-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ABHIXYDMILIUKV-CIUDSAMLSA-N 0.000 description 1
- YKIRNDPUWONXQN-GUBZILKMSA-N Lys-Asn-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKIRNDPUWONXQN-GUBZILKMSA-N 0.000 description 1
- QUCDKEKDPYISNX-HJGDQZAQSA-N Lys-Asn-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QUCDKEKDPYISNX-HJGDQZAQSA-N 0.000 description 1
- LMVOVCYVZBBWQB-SRVKXCTJSA-N Lys-Asp-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LMVOVCYVZBBWQB-SRVKXCTJSA-N 0.000 description 1
- SSJBMGCZZXCGJJ-DCAQKATOSA-N Lys-Asp-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O SSJBMGCZZXCGJJ-DCAQKATOSA-N 0.000 description 1
- XFBBBRDEQIPGNR-KATARQTJSA-N Lys-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)O XFBBBRDEQIPGNR-KATARQTJSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- NKKFVJRLCCUJNA-QWRGUYRKSA-N Lys-Gly-Lys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN NKKFVJRLCCUJNA-QWRGUYRKSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- KZJQUYFDSCFSCO-DLOVCJGASA-N Lys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCCN)N KZJQUYFDSCFSCO-DLOVCJGASA-N 0.000 description 1
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 1
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 1
- LJADEBULDNKJNK-IHRRRGAJSA-N Lys-Leu-Val Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LJADEBULDNKJNK-IHRRRGAJSA-N 0.000 description 1
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 1
- HVAUKHLDSDDROB-KKUMJFAQSA-N Lys-Lys-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HVAUKHLDSDDROB-KKUMJFAQSA-N 0.000 description 1
- WBSCNDJQPKSPII-KKUMJFAQSA-N Lys-Lys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O WBSCNDJQPKSPII-KKUMJFAQSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 1
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 1
- RMOKGALPSPOYKE-KATARQTJSA-N Lys-Thr-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMOKGALPSPOYKE-KATARQTJSA-N 0.000 description 1
- GVKINWYYLOLEFQ-XIRDDKMYSA-N Lys-Trp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O GVKINWYYLOLEFQ-XIRDDKMYSA-N 0.000 description 1
- PSVAVKGDUAKZKU-BZSNNMDCSA-N Lys-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N)O PSVAVKGDUAKZKU-BZSNNMDCSA-N 0.000 description 1
- FPQMQEOVSKMVMA-ACRUOGEOSA-N Lys-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCCCN)N)O FPQMQEOVSKMVMA-ACRUOGEOSA-N 0.000 description 1
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 1
- BWECSLVQIWEMSC-IHRRRGAJSA-N Lys-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCCN)N BWECSLVQIWEMSC-IHRRRGAJSA-N 0.000 description 1
- OZVXDDFYCQOPFD-XQQFMLRXSA-N Lys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N OZVXDDFYCQOPFD-XQQFMLRXSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 101710125418 Major capsid protein Proteins 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 101710175625 Maltose/maltodextrin-binding periplasmic protein Proteins 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- JYCQGAGDJQYEDB-GUBZILKMSA-N Met-Gln-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O JYCQGAGDJQYEDB-GUBZILKMSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- SXWQMBGNFXAGAT-FJXKBIBVSA-N Met-Gly-Thr Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SXWQMBGNFXAGAT-FJXKBIBVSA-N 0.000 description 1
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- MSSJHBAKDDIRMJ-SRVKXCTJSA-N Met-Lys-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O MSSJHBAKDDIRMJ-SRVKXCTJSA-N 0.000 description 1
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 1
- OOXVBECOTYHTCK-WDSOQIARSA-N Met-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCSC)N OOXVBECOTYHTCK-WDSOQIARSA-N 0.000 description 1
- PNHRPOWKRRJATF-IHRRRGAJSA-N Met-Tyr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 PNHRPOWKRRJATF-IHRRRGAJSA-N 0.000 description 1
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 1
- 108091092878 Microsatellite Proteins 0.000 description 1
- 102100035877 Monocyte differentiation antigen CD14 Human genes 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- 101000713102 Mus musculus C-C motif chemokine 1 Proteins 0.000 description 1
- 101000981253 Mus musculus GPI-linked NAD(P)(+)-arginine ADP-ribosyltransferase 1 Proteins 0.000 description 1
- 208000009525 Myocarditis Diseases 0.000 description 1
- FXHOOIRPVKKKFG-UHFFFAOYSA-N N,N-Dimethylacetamide Chemical compound CN(C)C(C)=O FXHOOIRPVKKKFG-UHFFFAOYSA-N 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 1
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 1
- 101100281510 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) met-6 gene Proteins 0.000 description 1
- 208000015914 Non-Hodgkin lymphomas Diseases 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 101710141454 Nucleoprotein Proteins 0.000 description 1
- 230000004989 O-glycosylation Effects 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 241001494479 Pecora Species 0.000 description 1
- 102000057297 Pepsin A Human genes 0.000 description 1
- 108090000284 Pepsin A Proteins 0.000 description 1
- 208000031845 Pernicious anaemia Diseases 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- UEXCHCYDPAIVDE-SRVKXCTJSA-N Phe-Asp-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEXCHCYDPAIVDE-SRVKXCTJSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- WFDAEEUZPZSMOG-SRVKXCTJSA-N Phe-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O WFDAEEUZPZSMOG-SRVKXCTJSA-N 0.000 description 1
- DHZOGDVYRQOGAC-BZSNNMDCSA-N Phe-Cys-Tyr Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DHZOGDVYRQOGAC-BZSNNMDCSA-N 0.000 description 1
- GDBOREPXIRKSEQ-FHWLQOOXSA-N Phe-Gln-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GDBOREPXIRKSEQ-FHWLQOOXSA-N 0.000 description 1
- IDUCUXTUHHIQIP-SOUVJXGZSA-N Phe-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O IDUCUXTUHHIQIP-SOUVJXGZSA-N 0.000 description 1
- NKLDZIPTGKBDBB-HTUGSXCWSA-N Phe-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O NKLDZIPTGKBDBB-HTUGSXCWSA-N 0.000 description 1
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 1
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 1
- OQTDZEJJWWAGJT-KKUMJFAQSA-N Phe-Lys-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O OQTDZEJJWWAGJT-KKUMJFAQSA-N 0.000 description 1
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 1
- DSXPMZMSJHOKKK-HJOGWXRNSA-N Phe-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DSXPMZMSJHOKKK-HJOGWXRNSA-N 0.000 description 1
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 1
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 1
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 1
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- IPFXYNKCXYGSSV-KKUMJFAQSA-N Phe-Ser-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N IPFXYNKCXYGSSV-KKUMJFAQSA-N 0.000 description 1
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 1
- FGWUALWGCZJQDJ-URLPEUOOSA-N Phe-Thr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGWUALWGCZJQDJ-URLPEUOOSA-N 0.000 description 1
- SHUFSZDAIPLZLF-BEAPCOKYSA-N Phe-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N)O SHUFSZDAIPLZLF-BEAPCOKYSA-N 0.000 description 1
- VFDRDMOMHBJGKD-UFYCRDLUSA-N Phe-Tyr-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N VFDRDMOMHBJGKD-UFYCRDLUSA-N 0.000 description 1
- MMPBPRXOFJNCCN-ZEWNOJEFSA-N Phe-Tyr-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MMPBPRXOFJNCCN-ZEWNOJEFSA-N 0.000 description 1
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 1
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 1
- VIIRRNQMMIHYHQ-XHSDSOJGSA-N Phe-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N VIIRRNQMMIHYHQ-XHSDSOJGSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- 102000004160 Phosphoric Monoester Hydrolases Human genes 0.000 description 1
- 108090000608 Phosphoric Monoester Hydrolases Proteins 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 241000235648 Pichia Species 0.000 description 1
- 208000007452 Plasmacytoma Diseases 0.000 description 1
- 241000276498 Pollachius virens Species 0.000 description 1
- 241001505332 Polyomavirus sp. Species 0.000 description 1
- 239000004372 Polyvinyl alcohol Substances 0.000 description 1
- 241000288906 Primates Species 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- ALJGSKMBIUEJOB-FXQIFTODSA-N Pro-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 ALJGSKMBIUEJOB-FXQIFTODSA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 1
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 1
- HPXVFFIIGOAQRV-DCAQKATOSA-N Pro-Arg-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O HPXVFFIIGOAQRV-DCAQKATOSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 1
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- QVIZLAUEAMQKGS-GUBZILKMSA-N Pro-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 QVIZLAUEAMQKGS-GUBZILKMSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- OZAPWFHRPINHND-GUBZILKMSA-N Pro-Cys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OZAPWFHRPINHND-GUBZILKMSA-N 0.000 description 1
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 1
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 1
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 1
- VDGTVWFMRXVQCT-GUBZILKMSA-N Pro-Glu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 VDGTVWFMRXVQCT-GUBZILKMSA-N 0.000 description 1
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 1
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 1
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 1
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 1
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 1
- FKLSMYYLJHYPHH-UWVGGRQHSA-N Pro-Gly-Leu Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O FKLSMYYLJHYPHH-UWVGGRQHSA-N 0.000 description 1
- FFSLAIOXRMOFIZ-GJZGRUSLSA-N Pro-Gly-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)CNC(=O)[C@@H]1CCCN1 FFSLAIOXRMOFIZ-GJZGRUSLSA-N 0.000 description 1
- JUJGNDZIKKQMDJ-IHRRRGAJSA-N Pro-His-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O JUJGNDZIKKQMDJ-IHRRRGAJSA-N 0.000 description 1
- BWCZJGJKOFUUCN-ZPFDUUQYSA-N Pro-Ile-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O BWCZJGJKOFUUCN-ZPFDUUQYSA-N 0.000 description 1
- KWMUAKQOVYCQJQ-ZPFDUUQYSA-N Pro-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@@H]1CCCN1 KWMUAKQOVYCQJQ-ZPFDUUQYSA-N 0.000 description 1
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 1
- BCNRNJWSRFDPTQ-HJWJTTGWSA-N Pro-Ile-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BCNRNJWSRFDPTQ-HJWJTTGWSA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- RUDOLGWDSKQQFF-DCAQKATOSA-N Pro-Leu-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O RUDOLGWDSKQQFF-DCAQKATOSA-N 0.000 description 1
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 1
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 1
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- DWGFLKQSGRUQTI-IHRRRGAJSA-N Pro-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 DWGFLKQSGRUQTI-IHRRRGAJSA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- WOIFYRZPIORBRY-AVGNSLFASA-N Pro-Lys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O WOIFYRZPIORBRY-AVGNSLFASA-N 0.000 description 1
- XZBYTHCRAVAXQQ-DCAQKATOSA-N Pro-Met-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XZBYTHCRAVAXQQ-DCAQKATOSA-N 0.000 description 1
- MHBSUKYVBZVQRW-HJWJTTGWSA-N Pro-Phe-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MHBSUKYVBZVQRW-HJWJTTGWSA-N 0.000 description 1
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 1
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 1
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 1
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- VVAWNPIOYXAMAL-KJEVXHAQSA-N Pro-Thr-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O VVAWNPIOYXAMAL-KJEVXHAQSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- 101710083689 Probable capsid protein Proteins 0.000 description 1
- 206010036790 Productive cough Diseases 0.000 description 1
- 206010037549 Purpura Diseases 0.000 description 1
- 241001672981 Purpura Species 0.000 description 1
- 108010079005 RDV peptide Proteins 0.000 description 1
- 102000014450 RNA Polymerase III Human genes 0.000 description 1
- 108010078067 RNA Polymerase III Proteins 0.000 description 1
- 239000012980 RPMI-1640 medium Substances 0.000 description 1
- 108020004511 Recombinant DNA Proteins 0.000 description 1
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 1
- 108091027981 Response element Proteins 0.000 description 1
- 241000712907 Retroviridae Species 0.000 description 1
- 108010039491 Ricin Proteins 0.000 description 1
- 206010039710 Scleroderma Diseases 0.000 description 1
- 229920002684 Sepharose Polymers 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 1
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 1
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- KYKKKSWGEPFUMR-NAKRPEOUSA-N Ser-Arg-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KYKKKSWGEPFUMR-NAKRPEOUSA-N 0.000 description 1
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 1
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 1
- DKKGAAJTDKHWOD-BIIVOSGPSA-N Ser-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)C(=O)O DKKGAAJTDKHWOD-BIIVOSGPSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- ZHYMUFQVKGJNRM-ZLUOBGJFSA-N Ser-Cys-Asn Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O ZHYMUFQVKGJNRM-ZLUOBGJFSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- GYXVUTAOICLGKJ-ACZMJKKPSA-N Ser-Glu-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N GYXVUTAOICLGKJ-ACZMJKKPSA-N 0.000 description 1
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 1
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 1
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 1
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- CXBFHZLODKPIJY-AAEUAGOBSA-N Ser-Gly-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N CXBFHZLODKPIJY-AAEUAGOBSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- FYUIFUJFNCLUIX-XVYDVKMFSA-N Ser-His-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O FYUIFUJFNCLUIX-XVYDVKMFSA-N 0.000 description 1
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 1
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 1
- MLSQXWSRHURDMF-GARJFASQSA-N Ser-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N)C(=O)O MLSQXWSRHURDMF-GARJFASQSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 1
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 1
- QJKPECIAWNNKIT-KKUMJFAQSA-N Ser-Lys-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QJKPECIAWNNKIT-KKUMJFAQSA-N 0.000 description 1
- ZGFRMNZZTOVBOU-CIUDSAMLSA-N Ser-Met-Gln Chemical compound N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)O ZGFRMNZZTOVBOU-CIUDSAMLSA-N 0.000 description 1
- WOJYIMBIKTWKJO-KKUMJFAQSA-N Ser-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N WOJYIMBIKTWKJO-KKUMJFAQSA-N 0.000 description 1
- XVWDJUROVRQKAE-KKUMJFAQSA-N Ser-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=CC=C1 XVWDJUROVRQKAE-KKUMJFAQSA-N 0.000 description 1
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 1
- ZKBKUWQVDWWSRI-BZSNNMDCSA-N Ser-Phe-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKBKUWQVDWWSRI-BZSNNMDCSA-N 0.000 description 1
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 1
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 1
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 1
- WLJPJRGQRNCIQS-ZLUOBGJFSA-N Ser-Ser-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O WLJPJRGQRNCIQS-ZLUOBGJFSA-N 0.000 description 1
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- ZKOKTQPHFMRSJP-YJRXYDGGSA-N Ser-Thr-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZKOKTQPHFMRSJP-YJRXYDGGSA-N 0.000 description 1
- XTWXRUWACCXBMU-XIRDDKMYSA-N Ser-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)[C@H](CO)N XTWXRUWACCXBMU-XIRDDKMYSA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 1
- VYPSYNLAJGMNEJ-UHFFFAOYSA-N Silicium dioxide Chemical compound O=[Si]=O VYPSYNLAJGMNEJ-UHFFFAOYSA-N 0.000 description 1
- 208000021386 Sjogren Syndrome Diseases 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 235000021355 Stearic acid Nutrition 0.000 description 1
- 208000037065 Subacute sclerosing leukoencephalitis Diseases 0.000 description 1
- 206010042297 Subacute sclerosing panencephalitis Diseases 0.000 description 1
- 241000282887 Suidae Species 0.000 description 1
- 108010006785 Taq Polymerase Proteins 0.000 description 1
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 1
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 1
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 1
- DGDCHPCRMWEOJR-FQPOAREZSA-N Thr-Ala-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DGDCHPCRMWEOJR-FQPOAREZSA-N 0.000 description 1
- OJRNZRROAIAHDL-LKXGYXEUSA-N Thr-Asn-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OJRNZRROAIAHDL-LKXGYXEUSA-N 0.000 description 1
- DHPPWTOLRWYIDS-XKBZYTNZSA-N Thr-Cys-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O DHPPWTOLRWYIDS-XKBZYTNZSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- WDFPMSHYMRBLKM-NKIYYHGXSA-N Thr-Glu-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O WDFPMSHYMRBLKM-NKIYYHGXSA-N 0.000 description 1
- XFTYVCHLARBHBQ-FOHZUACHSA-N Thr-Gly-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O XFTYVCHLARBHBQ-FOHZUACHSA-N 0.000 description 1
- NIEWSKWFURSECR-FOHZUACHSA-N Thr-Gly-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O NIEWSKWFURSECR-FOHZUACHSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- NQVDGKYAUHTCME-QTKMDUPCSA-N Thr-His-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O NQVDGKYAUHTCME-QTKMDUPCSA-N 0.000 description 1
- WBCCCPZIJIJTSD-TUBUOCAGSA-N Thr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H]([C@@H](C)O)N WBCCCPZIJIJTSD-TUBUOCAGSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 1
- JLNMFGCJODTXDH-WEDXCCLWSA-N Thr-Lys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O JLNMFGCJODTXDH-WEDXCCLWSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- JWQNAFHCXKVZKZ-UVOCVTCTSA-N Thr-Lys-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWQNAFHCXKVZKZ-UVOCVTCTSA-N 0.000 description 1
- APIDTRXFGYOLLH-VQVTYTSYSA-N Thr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O APIDTRXFGYOLLH-VQVTYTSYSA-N 0.000 description 1
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 1
- NLWDSYKZUPRMBJ-IEGACIPQSA-N Thr-Trp-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O NLWDSYKZUPRMBJ-IEGACIPQSA-N 0.000 description 1
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 1
- XVHAUVJXBFGUPC-RPTUDFQQSA-N Thr-Tyr-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XVHAUVJXBFGUPC-RPTUDFQQSA-N 0.000 description 1
- DIHPMRTXPYMDJZ-KAOXEZKKSA-N Thr-Tyr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N)O DIHPMRTXPYMDJZ-KAOXEZKKSA-N 0.000 description 1
- YOPQYBJJNSIQGZ-JNPHEJMOSA-N Thr-Tyr-Tyr Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 YOPQYBJJNSIQGZ-JNPHEJMOSA-N 0.000 description 1
- BKIOKSLLAAZYTC-KKHAAJSZSA-N Thr-Val-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O BKIOKSLLAAZYTC-KKHAAJSZSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 102000002262 Thromboplastin Human genes 0.000 description 1
- 108010000499 Thromboplastin Proteins 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 229920001615 Tragacanth Polymers 0.000 description 1
- 108700009124 Transcription Initiation Site Proteins 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- MVHHTXAUJCIOMZ-WDSOQIARSA-N Trp-Arg-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N MVHHTXAUJCIOMZ-WDSOQIARSA-N 0.000 description 1
- HJTYJQVRIQXMHM-XIRDDKMYSA-N Trp-Asp-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N HJTYJQVRIQXMHM-XIRDDKMYSA-N 0.000 description 1
- OFSLQLHHDQOWDB-QEJZJMRPSA-N Trp-Cys-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 OFSLQLHHDQOWDB-QEJZJMRPSA-N 0.000 description 1
- XEHGAHOCTDKOKP-XIRDDKMYSA-N Trp-Cys-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N XEHGAHOCTDKOKP-XIRDDKMYSA-N 0.000 description 1
- OGXQLUCMJZSJPW-LYSGOOTNSA-N Trp-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N)O OGXQLUCMJZSJPW-LYSGOOTNSA-N 0.000 description 1
- RRVUOLRWIZXBRQ-IHPCNDPISA-N Trp-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RRVUOLRWIZXBRQ-IHPCNDPISA-N 0.000 description 1
- GWBWCGITOYODER-YTQUADARSA-N Trp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GWBWCGITOYODER-YTQUADARSA-N 0.000 description 1
- OSYOKZZRVGUDMO-HSCHXYMDSA-N Trp-Lys-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OSYOKZZRVGUDMO-HSCHXYMDSA-N 0.000 description 1
- ADMHZNPMMVKGJW-BPUTZDHNSA-N Trp-Ser-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N ADMHZNPMMVKGJW-BPUTZDHNSA-N 0.000 description 1
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 1
- GBEAUNVBIMLWIB-IHPCNDPISA-N Trp-Ser-Phe Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 GBEAUNVBIMLWIB-IHPCNDPISA-N 0.000 description 1
- YCQXZDHDSUHUSG-FJHTZYQYSA-N Trp-Thr-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 YCQXZDHDSUHUSG-FJHTZYQYSA-N 0.000 description 1
- 206010067584 Type 1 diabetes mellitus Diseases 0.000 description 1
- MICSYKFECRFCTJ-IHRRRGAJSA-N Tyr-Arg-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O MICSYKFECRFCTJ-IHRRRGAJSA-N 0.000 description 1
- HTHCZRWCFXMENJ-KKUMJFAQSA-N Tyr-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HTHCZRWCFXMENJ-KKUMJFAQSA-N 0.000 description 1
- GFZQWWDXJVGEMW-ULQDDVLXSA-N Tyr-Arg-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GFZQWWDXJVGEMW-ULQDDVLXSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- JRXKIVGWMMIIOF-YDHLFZDLSA-N Tyr-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JRXKIVGWMMIIOF-YDHLFZDLSA-N 0.000 description 1
- YWXMGBUGMLJMIP-IHPCNDPISA-N Tyr-Cys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC3=CC=C(C=C3)O)N YWXMGBUGMLJMIP-IHPCNDPISA-N 0.000 description 1
- FXYOYUMPUJONGW-FHWLQOOXSA-N Tyr-Gln-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 FXYOYUMPUJONGW-FHWLQOOXSA-N 0.000 description 1
- LOOCQRRBKZTPKO-AVGNSLFASA-N Tyr-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LOOCQRRBKZTPKO-AVGNSLFASA-N 0.000 description 1
- HVHJYXDXRIWELT-RYUDHWBXSA-N Tyr-Glu-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O HVHJYXDXRIWELT-RYUDHWBXSA-N 0.000 description 1
- IMXAAEFAIBRCQF-SIUGBPQLSA-N Tyr-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N IMXAAEFAIBRCQF-SIUGBPQLSA-N 0.000 description 1
- OHNXAUCZVWGTLL-KKUMJFAQSA-N Tyr-His-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O)N)O OHNXAUCZVWGTLL-KKUMJFAQSA-N 0.000 description 1
- HVPPEXXUDXAPOM-MGHWNKPDSA-N Tyr-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 HVPPEXXUDXAPOM-MGHWNKPDSA-N 0.000 description 1
- OHOVFPKXPZODHS-SJWGOKEGSA-N Tyr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OHOVFPKXPZODHS-SJWGOKEGSA-N 0.000 description 1
- MVFQLSPDMMFCMW-KKUMJFAQSA-N Tyr-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O MVFQLSPDMMFCMW-KKUMJFAQSA-N 0.000 description 1
- MNWINJDPGBNOED-ULQDDVLXSA-N Tyr-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 MNWINJDPGBNOED-ULQDDVLXSA-N 0.000 description 1
- ZSXJENBJGRHKIG-UWVGGRQHSA-N Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 ZSXJENBJGRHKIG-UWVGGRQHSA-N 0.000 description 1
- IEWKKXZRJLTIOV-AVGNSLFASA-N Tyr-Ser-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O IEWKKXZRJLTIOV-AVGNSLFASA-N 0.000 description 1
- MQGGXGKQSVEQHR-KKUMJFAQSA-N Tyr-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 MQGGXGKQSVEQHR-KKUMJFAQSA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 1
- AEOFMCAKYIQQFY-YDHLFZDLSA-N Tyr-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AEOFMCAKYIQQFY-YDHLFZDLSA-N 0.000 description 1
- 206010046865 Vaccinia virus infection Diseases 0.000 description 1
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 1
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 1
- JOQSQZFKFYJKKJ-GUBZILKMSA-N Val-Arg-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JOQSQZFKFYJKKJ-GUBZILKMSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- LNYOXPDEIZJDEI-NHCYSSNCSA-N Val-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LNYOXPDEIZJDEI-NHCYSSNCSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- DBOXBUDEAJVKRE-LSJOCFKGSA-N Val-Asn-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N DBOXBUDEAJVKRE-LSJOCFKGSA-N 0.000 description 1
- BWVHQINTNLVWGZ-ZKWXMUAHSA-N Val-Cys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N BWVHQINTNLVWGZ-ZKWXMUAHSA-N 0.000 description 1
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 1
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 1
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 1
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- APEBUJBRGCMMHP-HJWJTTGWSA-N Val-Ile-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 APEBUJBRGCMMHP-HJWJTTGWSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 1
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- ILMVQSHENUZYIZ-JYJNAYRXSA-N Val-Met-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N ILMVQSHENUZYIZ-JYJNAYRXSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- GIAZPLMMQOERPN-YUMQZZPRSA-N Val-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(O)=O GIAZPLMMQOERPN-YUMQZZPRSA-N 0.000 description 1
- WANVRBAZGSICCP-SRVKXCTJSA-N Val-Pro-Met Chemical compound CSCC[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C)C(O)=O WANVRBAZGSICCP-SRVKXCTJSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- NSUUANXHLKKHQB-BZSNNMDCSA-N Val-Pro-Trp Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC2=CC=CC=C12 NSUUANXHLKKHQB-BZSNNMDCSA-N 0.000 description 1
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- TVGWMCTYUFBXAP-QTKMDUPCSA-N Val-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N)O TVGWMCTYUFBXAP-QTKMDUPCSA-N 0.000 description 1
- USXYVSTVPHELAF-RCWTZXSCSA-N Val-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N)O USXYVSTVPHELAF-RCWTZXSCSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 1
- ZHWZDZFWBXWPDW-GUBZILKMSA-N Val-Val-Cys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O ZHWZDZFWBXWPDW-GUBZILKMSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 108020005202 Viral DNA Proteins 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000005856 abnormality Effects 0.000 description 1
- DPXJVFZANSGRMM-UHFFFAOYSA-N acetic acid;2,3,4,5,6-pentahydroxyhexanal;sodium Chemical compound [Na].CC(O)=O.OCC(O)C(O)C(O)C(O)C=O DPXJVFZANSGRMM-UHFFFAOYSA-N 0.000 description 1
- CUJRVFIICFDLGR-UHFFFAOYSA-N acetylacetonate Chemical compound CC(=O)[CH-]C(C)=O CUJRVFIICFDLGR-UHFFFAOYSA-N 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 210000005221 acidic domain Anatomy 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000002730 additional effect Effects 0.000 description 1
- 239000000443 aerosol Substances 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 125000003295 alanine group Chemical group N[C@@H](C)C(=O)* 0.000 description 1
- 230000007815 allergy Effects 0.000 description 1
- 235000012211 aluminium silicate Nutrition 0.000 description 1
- 230000009435 amidation Effects 0.000 description 1
- 238000007112 amidation reaction Methods 0.000 description 1
- 150000001412 amines Chemical group 0.000 description 1
- 229960003896 aminopterin Drugs 0.000 description 1
- 210000001691 amnion Anatomy 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 230000005875 antibody response Effects 0.000 description 1
- 210000000628 antibody-producing cell Anatomy 0.000 description 1
- 229940121375 antifungal agent Drugs 0.000 description 1
- 239000003429 antifungal agent Substances 0.000 description 1
- 102000025171 antigen binding proteins Human genes 0.000 description 1
- 108091000831 antigen binding proteins Proteins 0.000 description 1
- 239000008135 aqueous vehicle Substances 0.000 description 1
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 1
- 108010013835 arginine glutamate Proteins 0.000 description 1
- 125000000637 arginyl group Chemical group N[C@@H](CCCNC(N)=N)C(=O)* 0.000 description 1
- 208000011775 arteriosclerosis disease Diseases 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 125000000613 asparagine group Chemical group N[C@@H](CC(N)=O)C(=O)* 0.000 description 1
- 229940009098 aspartate Drugs 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010092854 aspartyllysine Proteins 0.000 description 1
- 208000006673 asthma Diseases 0.000 description 1
- 201000000448 autoimmune hemolytic anemia Diseases 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 108010058966 bacteriophage T7 induced DNA polymerase Proteins 0.000 description 1
- 229910052614 beryl Inorganic materials 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- 239000011230 binding agent Substances 0.000 description 1
- 230000000975 bioactive effect Effects 0.000 description 1
- 229960000074 biopharmaceutical Drugs 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 210000000481 breast Anatomy 0.000 description 1
- 229910000019 calcium carbonate Inorganic materials 0.000 description 1
- 235000010216 calcium carbonate Nutrition 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 208000035269 cancer or benign tumor Diseases 0.000 description 1
- 150000001720 carbohydrates Chemical group 0.000 description 1
- 239000001768 carboxy methyl cellulose Substances 0.000 description 1
- 235000010418 carrageenan Nutrition 0.000 description 1
- 239000000679 carrageenan Substances 0.000 description 1
- 229920001525 carrageenan Polymers 0.000 description 1
- 229940113118 carrageenan Drugs 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 238000004113 cell culture Methods 0.000 description 1
- 230000024245 cell differentiation Effects 0.000 description 1
- 230000006037 cell lysis Effects 0.000 description 1
- 108091092328 cellular RNA Proteins 0.000 description 1
- 230000007969 cellular immunity Effects 0.000 description 1
- 230000007541 cellular toxicity Effects 0.000 description 1
- 210000003169 central nervous system Anatomy 0.000 description 1
- 210000003679 cervix uteri Anatomy 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000012412 chemical coupling Methods 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 1
- 238000004587 chromatography analysis Methods 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 230000001684 chronic effect Effects 0.000 description 1
- 208000025302 chronic primary adrenal insufficiency Diseases 0.000 description 1
- 239000008119 colloidal silica Substances 0.000 description 1
- 238000004040 coloring Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 238000003271 compound fluorescence assay Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 229940099112 cornstarch Drugs 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 238000004132 cross linking Methods 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 108010060199 cysteinylproline Proteins 0.000 description 1
- 230000002559 cytogenic effect Effects 0.000 description 1
- 230000002380 cytological effect Effects 0.000 description 1
- 230000003013 cytotoxicity Effects 0.000 description 1
- 231100000135 cytotoxicity Toxicity 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000007123 defense Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000001212 derivatisation Methods 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 238000012631 diagnostic technique Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- NEFBYIFKOOEVPA-UHFFFAOYSA-K dicalcium phosphate Chemical compound [Ca+2].[Ca+2].[O-]P([O-])([O-])=O NEFBYIFKOOEVPA-UHFFFAOYSA-K 0.000 description 1
- 229940038472 dicalcium phosphate Drugs 0.000 description 1
- 229910000390 dicalcium phosphate Inorganic materials 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 206010013023 diphtheria Diseases 0.000 description 1
- 230000006334 disulfide bridging Effects 0.000 description 1
- 230000002222 downregulating effect Effects 0.000 description 1
- 241001493065 dsRNA viruses Species 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 210000003989 endothelium vascular Anatomy 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000006911 enzymatic reaction Methods 0.000 description 1
- 210000000981 epithelium Anatomy 0.000 description 1
- 150000002148 esters Chemical class 0.000 description 1
- 235000019325 ethyl cellulose Nutrition 0.000 description 1
- 229920001249 ethyl cellulose Polymers 0.000 description 1
- 229940116333 ethyl lactate Drugs 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 210000003608 fece Anatomy 0.000 description 1
- 230000003619 fibrillary effect Effects 0.000 description 1
- 239000000796 flavoring agent Substances 0.000 description 1
- 238000000684 flow cytometry Methods 0.000 description 1
- GNBHRKFJIUUOQI-UHFFFAOYSA-N fluorescein Chemical compound O1C(=O)C2=CC=CC=C2C21C1=CC=C(O)C=C1OC1=CC(O)=CC=C21 GNBHRKFJIUUOQI-UHFFFAOYSA-N 0.000 description 1
- 229940020947 fluorescein sodium Drugs 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 238000002594 fluoroscopy Methods 0.000 description 1
- 201000003444 follicular lymphoma Diseases 0.000 description 1
- 235000013355 food flavoring agent Nutrition 0.000 description 1
- 235000003599 food sweetener Nutrition 0.000 description 1
- 230000002496 gastric effect Effects 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 229920000159 gelatin Polymers 0.000 description 1
- 239000008273 gelatin Substances 0.000 description 1
- 235000019322 gelatine Nutrition 0.000 description 1
- 235000011852 gelatine desserts Nutrition 0.000 description 1
- 238000003500 gene array Methods 0.000 description 1
- 239000008103 glucose Substances 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 230000002414 glycolytic effect Effects 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010066198 glycyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010084389 glycyltryptophan Proteins 0.000 description 1
- 208000024908 graft versus host disease Diseases 0.000 description 1
- 230000003394 haemopoietic effect Effects 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 210000002216 heart Anatomy 0.000 description 1
- 210000002443 helper t lymphocyte Anatomy 0.000 description 1
- 210000003494 hepatocyte Anatomy 0.000 description 1
- 108010040030 histidinoalanine Proteins 0.000 description 1
- 108010036413 histidylglycine Proteins 0.000 description 1
- 238000002868 homogeneous time resolved fluorescence Methods 0.000 description 1
- 230000004727 humoral immunity Effects 0.000 description 1
- 229940014041 hyaluronate Drugs 0.000 description 1
- 125000001165 hydrophobic group Chemical group 0.000 description 1
- 125000002887 hydroxy group Chemical group [H]O* 0.000 description 1
- 238000012872 hydroxylapatite chromatography Methods 0.000 description 1
- 230000033444 hydroxylation Effects 0.000 description 1
- 238000005805 hydroxylation reaction Methods 0.000 description 1
- 239000001866 hydroxypropyl methyl cellulose Substances 0.000 description 1
- 235000010979 hydroxypropyl methyl cellulose Nutrition 0.000 description 1
- 229920003088 hydroxypropyl methyl cellulose Polymers 0.000 description 1
- UFVKGYZPFZQRLF-UHFFFAOYSA-N hydroxypropyl methyl cellulose Chemical compound OC1C(O)C(OC)OC(CO)C1OC1C(O)C(O)C(OC2C(C(O)C(OC3C(C(O)C(O)C(CO)O3)O)C(CO)O2)O)C(CO)O1 UFVKGYZPFZQRLF-UHFFFAOYSA-N 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000036039 immunity Effects 0.000 description 1
- 238000003119 immunoblot Methods 0.000 description 1
- 230000009851 immunogenic response Effects 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 230000001771 impaired effect Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 239000012678 infectious agent Substances 0.000 description 1
- 230000002458 infectious effect Effects 0.000 description 1
- 230000036512 infertility Effects 0.000 description 1
- 208000000509 infertility Diseases 0.000 description 1
- 208000021267 infertility disease Diseases 0.000 description 1
- 210000004969 inflammatory cell Anatomy 0.000 description 1
- 108091008042 inhibitory receptors Proteins 0.000 description 1
- 239000007972 injectable composition Substances 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 125000000741 isoleucyl group Chemical group [H]N([H])C(C(C([H])([H])[H])C([H])([H])C([H])([H])[H])C(=O)O* 0.000 description 1
- 229950003188 isovaleryl diethylamide Drugs 0.000 description 1
- 208000018937 joint inflammation Diseases 0.000 description 1
- NLYAJNPCOHFWQQ-UHFFFAOYSA-N kaolin Chemical compound O.O.O=[Al]O[Si](=O)O[Si](=O)O[Al]=O NLYAJNPCOHFWQQ-UHFFFAOYSA-N 0.000 description 1
- 210000003292 kidney cell Anatomy 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 238000011005 laboratory method Methods 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 125000001909 leucine group Chemical group [H]N(*)C(C(*)=O)C([H])([H])C(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 1
- 210000000265 leukocyte Anatomy 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 239000002502 liposome Substances 0.000 description 1
- 244000144972 livestock Species 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 150000004668 long chain fatty acids Chemical class 0.000 description 1
- 239000006210 lotion Substances 0.000 description 1
- 239000000314 lubricant Substances 0.000 description 1
- 230000001926 lymphatic effect Effects 0.000 description 1
- 210000000207 lymphocyte subset Anatomy 0.000 description 1
- 230000000527 lymphocytic effect Effects 0.000 description 1
- 210000005210 lymphoid organ Anatomy 0.000 description 1
- 125000003588 lysine group Chemical group [H]N([H])C([H])([H])C([H])([H])C([H])([H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical class [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- 238000007898 magnetic cell sorting Methods 0.000 description 1
- 238000002595 magnetic resonance imaging Methods 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 230000001404 mediated effect Effects 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 238000010208 microarray analysis Methods 0.000 description 1
- 230000000394 mitotic effect Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000003032 molecular docking Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- CQDGTJPVBWZJAZ-UHFFFAOYSA-N monoethyl carbonate Chemical compound CCOC(O)=O CQDGTJPVBWZJAZ-UHFFFAOYSA-N 0.000 description 1
- 201000006417 multiple sclerosis Diseases 0.000 description 1
- 230000000869 mutational effect Effects 0.000 description 1
- 206010028417 myasthenia gravis Diseases 0.000 description 1
- 210000004897 n-terminal region Anatomy 0.000 description 1
- 210000000822 natural killer cell Anatomy 0.000 description 1
- 238000007857 nested PCR Methods 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 231100000065 noncytotoxic Toxicity 0.000 description 1
- 230000002020 noncytotoxic effect Effects 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- QIQXTHQIDYTFRH-UHFFFAOYSA-N octadecanoic acid Chemical compound CCCCCCCCCCCCCCCCCC(O)=O QIQXTHQIDYTFRH-UHFFFAOYSA-N 0.000 description 1
- OQCDKBAXFALNLD-UHFFFAOYSA-N octadecanoic acid Natural products CCCCCCCC(C)CCCCCCCCC(O)=O OQCDKBAXFALNLD-UHFFFAOYSA-N 0.000 description 1
- 229940049964 oleate Drugs 0.000 description 1
- ZQPPMHVWECSIRJ-KTKRTIGZSA-N oleic acid Chemical compound CCCCCCCC\C=C/CCCCCCCC(O)=O ZQPPMHVWECSIRJ-KTKRTIGZSA-N 0.000 description 1
- 231100000590 oncogenic Toxicity 0.000 description 1
- 230000002246 oncogenic effect Effects 0.000 description 1
- 238000011275 oncology therapy Methods 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 210000003463 organelle Anatomy 0.000 description 1
- 239000012188 paraffin wax Substances 0.000 description 1
- 230000007170 pathology Effects 0.000 description 1
- 235000010987 pectin Nutrition 0.000 description 1
- 229920001277 pectin Polymers 0.000 description 1
- 239000001814 pectin Substances 0.000 description 1
- 229940111202 pepsin Drugs 0.000 description 1
- 230000007030 peptide scission Effects 0.000 description 1
- 238000010647 peptide synthesis reaction Methods 0.000 description 1
- 210000004976 peripheral blood cell Anatomy 0.000 description 1
- 210000001986 peyer's patch Anatomy 0.000 description 1
- 238000002823 phage display Methods 0.000 description 1
- 239000008250 pharmaceutical cream Substances 0.000 description 1
- 229940124531 pharmaceutical excipient Drugs 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 125000000405 phenylalanyl group Chemical group 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 238000000053 physical method Methods 0.000 description 1
- 210000002826 placenta Anatomy 0.000 description 1
- 210000005059 placental tissue Anatomy 0.000 description 1
- 210000002381 plasma Anatomy 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 229920005862 polyol Polymers 0.000 description 1
- 150000003077 polyols Chemical class 0.000 description 1
- 229920001296 polysiloxane Polymers 0.000 description 1
- 229920002451 polyvinyl alcohol Polymers 0.000 description 1
- 235000019422 polyvinyl alcohol Nutrition 0.000 description 1
- 239000001267 polyvinylpyrrolidone Substances 0.000 description 1
- 229920000036 polyvinylpyrrolidone Polymers 0.000 description 1
- 235000013855 polyvinylpyrrolidone Nutrition 0.000 description 1
- 230000001323 posttranslational effect Effects 0.000 description 1
- 239000000843 powder Substances 0.000 description 1
- 230000037452 priming Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 238000011321 prophylaxis Methods 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 238000000159 protein binding assay Methods 0.000 description 1
- 210000004777 protein coat Anatomy 0.000 description 1
- 238000001742 protein purification Methods 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 238000002601 radiography Methods 0.000 description 1
- 230000010837 receptor-mediated endocytosis Effects 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000012385 regulation of binding Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000000284 resting effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 206010039073 rheumatoid arthritis Diseases 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 238000013391 scatchard analysis Methods 0.000 description 1
- 108010078070 scavenger receptors Proteins 0.000 description 1
- 102000014452 scavenger receptors Human genes 0.000 description 1
- 238000007789 sealing Methods 0.000 description 1
- 210000005212 secondary lymphoid organ Anatomy 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 230000028327 secretion Effects 0.000 description 1
- 238000002864 sequence alignment Methods 0.000 description 1
- 125000003607 serino group Chemical group [H]N([H])[C@]([H])(C(=O)[*])C(O[H])([H])[H] 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 230000007781 signaling event Effects 0.000 description 1
- 230000037432 silent mutation Effects 0.000 description 1
- 235000019812 sodium carboxymethyl cellulose Nutrition 0.000 description 1
- 229920001027 sodium carboxymethylcellulose Polymers 0.000 description 1
- 239000011780 sodium chloride Substances 0.000 description 1
- 229940023144 sodium glycolate Drugs 0.000 description 1
- 229940080313 sodium starch Drugs 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 230000009870 specific binding Effects 0.000 description 1
- 210000004989 spleen cell Anatomy 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 210000003802 sputum Anatomy 0.000 description 1
- 208000024794 sputum Diseases 0.000 description 1
- 102000009076 src-Family Kinases Human genes 0.000 description 1
- 108010087686 src-Family Kinases Proteins 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 230000010473 stable expression Effects 0.000 description 1
- 238000003153 stable transfection Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 229940032147 starch Drugs 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000008117 stearic acid Substances 0.000 description 1
- 230000019635 sulfation Effects 0.000 description 1
- 238000005670 sulfation reaction Methods 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 239000000375 suspending agent Substances 0.000 description 1
- 239000000725 suspension Substances 0.000 description 1
- 239000003765 sweetening agent Substances 0.000 description 1
- 239000006188 syrup Substances 0.000 description 1
- 235000020357 syrup Nutrition 0.000 description 1
- 201000000596 systemic lupus erythematosus Diseases 0.000 description 1
- 239000000454 talc Substances 0.000 description 1
- 229910052623 talc Inorganic materials 0.000 description 1
- 210000001550 testis Anatomy 0.000 description 1
- MPLHNVLQVRSVEE-UHFFFAOYSA-N texas red Chemical compound [O-]S(=O)(=O)C1=CC(S(Cl)(=O)=O)=CC=C1C(C1=CC=2CCCN3CCCC(C=23)=C1O1)=C2C1=C(CCC1)C3=[N+]1CCCC3=C2 MPLHNVLQVRSVEE-UHFFFAOYSA-N 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 238000011285 therapeutic regimen Methods 0.000 description 1
- 150000007970 thio esters Chemical class 0.000 description 1
- 125000003396 thiol group Chemical group [H]S* 0.000 description 1
- 206010043554 thrombocytopenia Diseases 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 238000006257 total synthesis reaction Methods 0.000 description 1
- 235000010487 tragacanth Nutrition 0.000 description 1
- 239000000196 tragacanth Substances 0.000 description 1
- 229940116362 tragacanth Drugs 0.000 description 1
- 230000005030 transcription termination Effects 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 230000002463 transducing effect Effects 0.000 description 1
- 238000010361 transduction Methods 0.000 description 1
- 230000026683 transduction Effects 0.000 description 1
- 238000003151 transfection method Methods 0.000 description 1
- 238000011426 transformation method Methods 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 238000011830 transgenic mouse model Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 238000003146 transient transfection Methods 0.000 description 1
- 238000002054 transplantation Methods 0.000 description 1
- 230000001228 trophic effect Effects 0.000 description 1
- 230000010415 tropism Effects 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 208000035408 type 1 diabetes mellitus 1 Diseases 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 208000007089 vaccinia Diseases 0.000 description 1
- 125000002987 valine group Chemical group [H]N([H])C([H])(C(*)=O)C([H])(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 235000015112 vegetable and seed oil Nutrition 0.000 description 1
- 239000008158 vegetable oil Substances 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 108700026215 vpr Genes Proteins 0.000 description 1
- 239000008215 water for injection Substances 0.000 description 1
- 239000001993 wax Substances 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
- UHVMMEOXYDMDKI-JKYCWFKZSA-L zinc;1-(5-cyanopyridin-2-yl)-3-[(1s,2s)-2-(6-fluoro-2-hydroxy-3-propanoylphenyl)cyclopropyl]urea;diacetate Chemical compound [Zn+2].CC([O-])=O.CC([O-])=O.CCC(=O)C1=CC=C(F)C([C@H]2[C@H](C2)NC(=O)NC=2N=CC(=CC=2)C#N)=C1O UHVMMEOXYDMDKI-JKYCWFKZSA-L 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K16/00—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
- C07K16/18—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
- C07K16/28—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants
- C07K16/2803—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants against the immunoglobulin superfamily
- C07K16/283—Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against receptors, cell surface antigens or cell surface determinants against the immunoglobulin superfamily against Fc-receptors, e.g. CD16, CD32, CD64
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P35/00—Antineoplastic agents
- A61P35/02—Antineoplastic agents specific for leukemia
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P37/00—Drugs for immunological or allergic disorders
- A61P37/02—Immunomodulators
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/705—Receptors; Cell surface antigens; Cell surface determinants
- C07K14/70503—Immunoglobulin superfamily
- C07K14/70535—Fc-receptors, e.g. CD16, CD32, CD64 (CD2314/705F)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/505—Medicinal preparations containing antigens or antibodies comprising antibodies
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K38/00—Medicinal preparations containing peptides
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K48/00—Medicinal preparations containing genetic material which is inserted into cells of the living body to treat genetic diseases; Gene therapy
Abstract
The invention relates to members of the Fc receptor homolog (FcRH) subfamily, as well as fragments and variants thereof. Each FcRH is a Type I transmembrane receptor, preferably, comprises an extracellular region, a transmembrane region, and a cytoplasmic region. The cytoplasmic region preferably comprises one or more immunoreceptor tyrosine-based inhibitory or activation motifs ("ITIMs" or "ITAMs). The invention provides polypeptides, nucleic acids, vectors, expression systems, and antibodies and antibody fragments related to the FcRHs as well as uses thereof. Such uses include uses in the diagnosis and treatment of a malignancy of hematopoietic cell lineage or an inflammatory or autoimmune disease in a subject and in the modulation of a humoral immune response in a subject.
Description
MEMBERS OF THE FC RECEPTOR HOMOLOG GENE FAMILY
(FCRHl-3, 6), RELATED REAGENTS, AND USES THEREOF
This application claims the benefit of U.S. Provisional Application No.
60/367,667, filed March 25, 2002.
ACKNOWLEDGEMENTS
This invention was made with government support under Grants 2837 and AI39816 awarded by NIAID. The government has certain rights in the invention.
FIELD OF THE INVENTION
This invention relates generally to immunology and modulation of immunologic responses in the context of inflammatory diseases and cancer.
BACKGROUND OF THE INVENTION
Receptors for the Fc region (FcRs) of Igs have broad tissue distribution patterns and can modulate cellular and humoral immunity by linking their antibody ligands with effector cells of the immune system (Ravetch, J. V. & Kinet, J.-P. (1991) Annu. Rev.
Immunol. 9, 457-492; Daeron, M. (1997) Annu. Rev. Immunol. 15, 203-234. These cellular, receptors have the ability to sense humoral concentrations of antibody, initiate cellular responses in host defense, and participate in autoimmune disorders (Ravetch, J.
V. & Bolland, S. (2001) Annu. Rev. Immunol. 19, 275-290). Their diverse regulatory roles depend on the Ig isotype specificity and cellular distribution of the individual FcR.
These Ig superfamily members share similarities in their ligand binding subunits, and they may have inhibitory or activating signaling motifs in their intracellular domains or instead pair with signal transducing subunits possessing activating signaling motifs.
Recently, characterization of FcR hornologs in mice, the paired Ig-like receptors (Kubagawa, H. et al. (1997) Proc. Natl. Acad. Sci. USA 94, 5266; Hayami, K. et al. (1997) J. Biol. Chem. 272, 7320-7327), and their relatives in humans the Ig-like transcripts/ leucocyte Ig-like receptors (Borges, L. et al.
(1997) J.
Immunol. 159, 5192-5196; Samaridis, J. & Colonna, M. (1997) Eur. J. Immunol.
27, 660-665) have been elucidated. This multigene family, which includes the FcaR
(Kremer, E. J. et al. (1992) Hum. Genet. 89, 107-108) and the natural killer cell Ig-like receptors (Wagtmann, N. et al. (1997) Curr. Biol. 7, 615-618), is located in a human chromosome 19q13 region lcnown as the leucocyte receptor complex (LRC) (Wende, H.
et al. (1999) Mamm. Genome 10, 154-160; Wilson, M. J. et al. (2000) Proc.
Natl. Acad.
Sci. USA 97, 4778-4783). These Ig-like multigene families belong to a larger class of receptors characterized by their possession of common cytoplasmic tyrosine-based signaling motifs. These can be either immunoreceptor tyrosine-based activation motifs (ITAMs) containing two repeats of the consensus sequence Y-X-X-L/I spaced by 6-amino acids (E/D)-X-X-Y-X-X-(L/I)-X~_8-Y-X-X-(L/IJ (SEQ ID N0:64, with six amino acid between the consensus sequences; SEQ ff~ N0:65, with seven amino acid residues between the consensus sequences; and SEQ ID N0:66, with eight amino acid residues between the consensus sequences) or immunoreceptor tyrosine-based inhibitory motifs (ITIMs) with a 6-amino acid consensus sequence (I/V/L/S)-X-Y-X-X-(L/V) (SEQ ID
N0:67) (Beth, M. (1992) Annu. Rev. Immunol. 10, 97-121; Vely, F. & Vivier, E.
(1997) J. Immunol. 159, 2075-2077; Ravetch, J. V. ~ Lamer, L. L. (2000) Science 290, 84-89; Gergely, J. et al. (1999) Immunol. Lett. 68, 3-15). The phylogenetic conservation of these types of receptors in birds (Dennis, G. et al. (2000) Proc. Natl.
Acad. Sci. USA
97, 13245-13250) and bony fish (Yoder, J. A. et al. (2001) Proc. Natl. Acad.
Sci. USA
98, 6771-6717) is indicative of their biological value. After ligand binding of the activating receptor complexes, TTAM tyrosines are rapidly phosphorylated by Src family kinases to initiate a cascade of signaling events that trigger cellular activation. In the case of ITIM-bearing receptors, the tyrosines provide a docking site for phosphatases containing Src homology 2 domains that can abrogate cellular activation (Long, E. O. (1999) Annu. Rev. Immunol. 17, 875-904; Unkeless, J. C. & Jin, J.
(1997) Curr. Opin. hnmunol. 9, 338-343). The balance in the utilization of these activating and inhibitory receptor pairs can serve to modulate cellular responses to a variety of stimuli.
The genes encoding the classical FcyRs, FcyRI, Fc~yRII, FcyRIII, and FcsRI, lie on the long arm of chromosome 1 (1q21-23) near the polymeric Ig receptor (pIgR) and Fca/~,R genes (1q32) (20-23). Members of this FcR subfamily have relatively low extracellular homology with the FcR-related genes that reside in the LRC on chromosome 19. Like the FcyR- and FcsR-activating receptors, the ligand binding chain of the FcaR coassociates with the ITAM containing FcR common y-chain (Pfefferkorn, L. C. & Yeaman, G. R. (1994) J. Immunol. 153, 3228-3236; Morton, E.
C. et al. (1995) J. Biol. Chem. 270, 29781-29787). New members of the FcR
family were sought which could have diverse signally properties and oncogenic potential.
SUMMARY OF THE INVENTION
In accordance with the purposes) of this invention, as embodied and broadly described herein, this invention, in one aspect, relates to members of a cluster of FcR
and FcR gene relatives encoded, for example, by genes in the human chromosome 1q21-23 region, or analogous region in non-human subjects. The members are Type I
transmembrane receptors, or alternatively spliced forms thereof, with homology to the FcR family and are referred to herein as FcRHs. Each FcRH can comprise an extracellular region, a transmembrane region, and a cytoplasmic region. The cytoplasmic region preferably comprises one or more immunoreceptor tyrosine-based inhibitory or activation motifs ("ITIMs" or "ITAMs).
The invention relates to polypeptides corresponding to isolated FcRHs (e.g., huFcRH 1, 2, 3, and 6 and moFcRHl, 2, and 3), as well as fragments and isoforms thereof. The invention further relates to nucleic acids that encode the FcRHs, as well as hybridization probes related thereto and complementary sequences. The invention further provides vectors and cells related to the nucleic acids of the invention.
The invention further relates to making an FcRH, or a fragment or variant thereof, comprising culturing a cell comprising a vector of the invention under conditions permitting expression of the FcRH. The invention also provides an antibody reagent kit comprising the antibody, or a fragment or variant thereof, and reagents for detecting binding of the antibody, fragment, or antibody variant to a ligand.
The invention further relates to uses of the polypeptides, nucleic acids and antibodies of the invention. For example, the invention relates to methods of diagnosing and methods of treating a malignancy of hematopoietic cell lineage or an inflammatory or autoimmune disease in a subject. The invention also relates to modulation of a humoral immune response in a subject.
Additional advantages of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the appended claims. It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.
BRIEF DESCRIPTION OF THE DRAWINGS
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate (one) several embodiments) of the invention and together with the description, serve to explain the principles of the invention.
Figure 1 shows the relative position of the FcRH locus within the FcR cluster on chromosome 1. The cytogenetic location of the FcR genes is approximated from the GenBank Mapview database. The BAC clones (4, GenBank accession no. AL139409;
(FCRHl-3, 6), RELATED REAGENTS, AND USES THEREOF
This application claims the benefit of U.S. Provisional Application No.
60/367,667, filed March 25, 2002.
ACKNOWLEDGEMENTS
This invention was made with government support under Grants 2837 and AI39816 awarded by NIAID. The government has certain rights in the invention.
FIELD OF THE INVENTION
This invention relates generally to immunology and modulation of immunologic responses in the context of inflammatory diseases and cancer.
BACKGROUND OF THE INVENTION
Receptors for the Fc region (FcRs) of Igs have broad tissue distribution patterns and can modulate cellular and humoral immunity by linking their antibody ligands with effector cells of the immune system (Ravetch, J. V. & Kinet, J.-P. (1991) Annu. Rev.
Immunol. 9, 457-492; Daeron, M. (1997) Annu. Rev. Immunol. 15, 203-234. These cellular, receptors have the ability to sense humoral concentrations of antibody, initiate cellular responses in host defense, and participate in autoimmune disorders (Ravetch, J.
V. & Bolland, S. (2001) Annu. Rev. Immunol. 19, 275-290). Their diverse regulatory roles depend on the Ig isotype specificity and cellular distribution of the individual FcR.
These Ig superfamily members share similarities in their ligand binding subunits, and they may have inhibitory or activating signaling motifs in their intracellular domains or instead pair with signal transducing subunits possessing activating signaling motifs.
Recently, characterization of FcR hornologs in mice, the paired Ig-like receptors (Kubagawa, H. et al. (1997) Proc. Natl. Acad. Sci. USA 94, 5266; Hayami, K. et al. (1997) J. Biol. Chem. 272, 7320-7327), and their relatives in humans the Ig-like transcripts/ leucocyte Ig-like receptors (Borges, L. et al.
(1997) J.
Immunol. 159, 5192-5196; Samaridis, J. & Colonna, M. (1997) Eur. J. Immunol.
27, 660-665) have been elucidated. This multigene family, which includes the FcaR
(Kremer, E. J. et al. (1992) Hum. Genet. 89, 107-108) and the natural killer cell Ig-like receptors (Wagtmann, N. et al. (1997) Curr. Biol. 7, 615-618), is located in a human chromosome 19q13 region lcnown as the leucocyte receptor complex (LRC) (Wende, H.
et al. (1999) Mamm. Genome 10, 154-160; Wilson, M. J. et al. (2000) Proc.
Natl. Acad.
Sci. USA 97, 4778-4783). These Ig-like multigene families belong to a larger class of receptors characterized by their possession of common cytoplasmic tyrosine-based signaling motifs. These can be either immunoreceptor tyrosine-based activation motifs (ITAMs) containing two repeats of the consensus sequence Y-X-X-L/I spaced by 6-amino acids (E/D)-X-X-Y-X-X-(L/I)-X~_8-Y-X-X-(L/IJ (SEQ ID N0:64, with six amino acid between the consensus sequences; SEQ ff~ N0:65, with seven amino acid residues between the consensus sequences; and SEQ ID N0:66, with eight amino acid residues between the consensus sequences) or immunoreceptor tyrosine-based inhibitory motifs (ITIMs) with a 6-amino acid consensus sequence (I/V/L/S)-X-Y-X-X-(L/V) (SEQ ID
N0:67) (Beth, M. (1992) Annu. Rev. Immunol. 10, 97-121; Vely, F. & Vivier, E.
(1997) J. Immunol. 159, 2075-2077; Ravetch, J. V. ~ Lamer, L. L. (2000) Science 290, 84-89; Gergely, J. et al. (1999) Immunol. Lett. 68, 3-15). The phylogenetic conservation of these types of receptors in birds (Dennis, G. et al. (2000) Proc. Natl.
Acad. Sci. USA
97, 13245-13250) and bony fish (Yoder, J. A. et al. (2001) Proc. Natl. Acad.
Sci. USA
98, 6771-6717) is indicative of their biological value. After ligand binding of the activating receptor complexes, TTAM tyrosines are rapidly phosphorylated by Src family kinases to initiate a cascade of signaling events that trigger cellular activation. In the case of ITIM-bearing receptors, the tyrosines provide a docking site for phosphatases containing Src homology 2 domains that can abrogate cellular activation (Long, E. O. (1999) Annu. Rev. Immunol. 17, 875-904; Unkeless, J. C. & Jin, J.
(1997) Curr. Opin. hnmunol. 9, 338-343). The balance in the utilization of these activating and inhibitory receptor pairs can serve to modulate cellular responses to a variety of stimuli.
The genes encoding the classical FcyRs, FcyRI, Fc~yRII, FcyRIII, and FcsRI, lie on the long arm of chromosome 1 (1q21-23) near the polymeric Ig receptor (pIgR) and Fca/~,R genes (1q32) (20-23). Members of this FcR subfamily have relatively low extracellular homology with the FcR-related genes that reside in the LRC on chromosome 19. Like the FcyR- and FcsR-activating receptors, the ligand binding chain of the FcaR coassociates with the ITAM containing FcR common y-chain (Pfefferkorn, L. C. & Yeaman, G. R. (1994) J. Immunol. 153, 3228-3236; Morton, E.
C. et al. (1995) J. Biol. Chem. 270, 29781-29787). New members of the FcR
family were sought which could have diverse signally properties and oncogenic potential.
SUMMARY OF THE INVENTION
In accordance with the purposes) of this invention, as embodied and broadly described herein, this invention, in one aspect, relates to members of a cluster of FcR
and FcR gene relatives encoded, for example, by genes in the human chromosome 1q21-23 region, or analogous region in non-human subjects. The members are Type I
transmembrane receptors, or alternatively spliced forms thereof, with homology to the FcR family and are referred to herein as FcRHs. Each FcRH can comprise an extracellular region, a transmembrane region, and a cytoplasmic region. The cytoplasmic region preferably comprises one or more immunoreceptor tyrosine-based inhibitory or activation motifs ("ITIMs" or "ITAMs).
The invention relates to polypeptides corresponding to isolated FcRHs (e.g., huFcRH 1, 2, 3, and 6 and moFcRHl, 2, and 3), as well as fragments and isoforms thereof. The invention further relates to nucleic acids that encode the FcRHs, as well as hybridization probes related thereto and complementary sequences. The invention further provides vectors and cells related to the nucleic acids of the invention.
The invention further relates to making an FcRH, or a fragment or variant thereof, comprising culturing a cell comprising a vector of the invention under conditions permitting expression of the FcRH. The invention also provides an antibody reagent kit comprising the antibody, or a fragment or variant thereof, and reagents for detecting binding of the antibody, fragment, or antibody variant to a ligand.
The invention further relates to uses of the polypeptides, nucleic acids and antibodies of the invention. For example, the invention relates to methods of diagnosing and methods of treating a malignancy of hematopoietic cell lineage or an inflammatory or autoimmune disease in a subject. The invention also relates to modulation of a humoral immune response in a subject.
Additional advantages of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the appended claims. It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.
BRIEF DESCRIPTION OF THE DRAWINGS
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate (one) several embodiments) of the invention and together with the description, serve to explain the principles of the invention.
Figure 1 shows the relative position of the FcRH locus within the FcR cluster on chromosome 1. The cytogenetic location of the FcR genes is approximated from the GenBank Mapview database. The BAC clones (4, GenBank accession no. AL139409;
3, GenBank accession no. AL356276; 2, GenBank accession no. AL135929; and 1, GenBank accession no. AL353721) that span the locus are oriented in relation to their respective FcRH genes (shaded area).
Figure 2 shows the structural and sequence diversity of FcRHl, FcRH2, and FcRH3. Figure 2A is a schematic representation of FcRH molecules. The three cDNAs encode type I transmembrane proteins with similar extracellular domains, but different cytoplasmic regions. The extracellular (EC) regions contain different numbers of C2-like Ig domains and potential sites of N-linked glycosylation. The transmembrane (TM) domains are uncharged, except in the case of FcRHl. The cytoplasmic (CY) region of FcRHl contains two ITAMs (light gray boxes) and one ITAM-like region (small, lined box), whereas FcRH2 contains one ITAM and two ITIMs (dark gray boxes). FcRH3 has a long cytoplasmic tail with one ITAM, one IT1M, and an ITAM-like region. The amino acid length of each region is indicated. Figure 2B shows the multiple alignment comparison of FcRHl, FcRH2, and FcRH3 amino acid sequences (one-letter code) based on the FcRH3 sequence. Amino acid identity is represented by dots, and gaps are indicated by dashes. Predicted N-linked glycosylation sites and transmembrane domains are underlined in black. Consensus ITAM (bold) and ITIM (bold, underlined) motifs are indicated. Putative structural domains are labeled: SP, signal peptide; EC, extracellular domain; MP-TM, membrane proximal-transmembrane; and CY, cytoplasmic regions.
Amino acid lengths are indicated in parentheses.
Figure 3 shows a composite analysis of the extracellular homology among FcRH and FcR family members. Pairwise analysis of individual Ig-like subunits was performed with the CLUSTAL method algorithm using FcRH3 as the index of comparison. Individual homologous domains are coded to indicate relatedness.
Percent amino acid identities for related domains are indicated and aligned in relation to the comparative FcRH3 subunit. The amino acid identity for the membrane proximal domains (light gray subunits) of FcRHS are provided as the range of identity for all individually related domains. Comparisons that are not applicable are left blank. Amino acid sequences were derived from IRTA1 (GenBank accession no. AF343659), IRTA2 (GenBank accession no. AF34364), moFcRH (GenBank accession no. AAG28775) FcyRI (GenBank accession no. AAA35678), FcyRII (Swiss-Prot accession no.
P31994), Fc~yRIII (Swiss-Prot accession no. P08637), FcsRI (Swiss-Prot accession no.
P12319), and FcaRI (Swiss-Prot accession no. P24071).
Figure 4 shows the relative location of the mouse FcR family. Location is indicated in reference to the human FcR related genes at Ch 1q21-23 and their orthologous loci on mouse Ch 3 and Ch 1. The microsatellite marker d3Mitl87 is located within moFcRHl.
Figure 5 shows the multiple alignment comparisons of huFcRHI-5 and mouse FcRHl and 2 amino acid sequences (one-letter code) based on the FcRH3 sequence.
Amino acid gaps are indicated by dashes. Consensus ITAM (underlined) and ITIM
(italic, underlined) motifs are indicated. Amino acid lengths are indicated in parentheses.
Figure 6 shows domains marked to indicate relatedness of the Ig-like subunits.
Ig-like domain homology was determined by generation of a phylogenetic tree using DNAStar software with the CLUSTAL program and assigning arbitrary colors to individual Ig-domains of a given branch. Amino acid identities for full length, extracellular and, cytoplasmic domain comparisons are based on huFcRH3.
Closest cytoplasmic relatives are indicated in parentheses. Most identical extracellular comparisons between mouse and human relatives are highlighted in horizontal lines.
Comparisons that are not applicable are left blank.
Figure 7 shows the domains of huFcRHI-6, moFcRHl-3 and related proteins.
Domains are colored to indicate relatedness of the Ig-like subunits. Ig-like domain homology was determined by generation of a phylogenetic tree using DNAStar software with the CLUSTAL program and assigning arbitrary colors to individual Ig-domains of a given branch. Amino acid identities for full length, extracellular and, cytoplasmic domain comparisons are based on huFcRH3. Closest cytoplasmic relatives are indicated in parentheses. Most identical extracellular comparisons between mouse and human relatives are highlighted in red. Comparisons that are not applicable are left blank.
Figure 8 shows the structural characteristics of the mouse FcRH isoforms.
DESCRIPTION OF THE PREFERRED EMBODIMENTS
The present invention may be understood more readily by reference to the following detailed description of preferred embodiments of the invention and the Examples included therein and to the Figures and their previous and following description.
In this specification and in the claims that follow, reference will be made to a number of terms that shall be defined to have the following meanings:
As used in the specification and the appended claims, the singular forms "a,"
"an" and "the" include plural referents unless the context clearly dictates otherwise.
Thus, for example, reference to a receptor includes mixtures of various receptors, reference to "a pharmaceutical carrier" includes mixtures of two or more such carriers, and the like.
Ranges may be expressed herein as from "about" one particular value, and/or to "about" another particular value. When such a range is expressed, another embodiment includes from the one particular value and/or to the other particular value.
Similarly, when values are expressed as approximations, by use of the antecedent "about,"
it will be understood that the particular value forms another embodiment. It will be further understood that the endpoints of each of the ranges are significant both in relation to the other endpoint, and independently of the other endpoint.
"Optional" or "optionally" means that the subsequently described event or circumstance may or may not occur, and that the description includes instances where said event or circumstance occurs and instances where it does not. For example, the phrase "optionally contains two ITAM consensus motifs" means that the two ITAMs may or may not be present and that the description includes both the presence and absence of two ITAM consensus motifs.
As used throughout, by "subject" is meant an individual. Preferably, the subject is a mammal such as a primate, and, more preferably, a human. The term "subject"
can include domesticated animals, such as cats, dogs, etc., livestock (e.g., cattle, horses, pigs, sheep, goats, etc.), and laboratory animals (e.g., mouse, rabbit, rat, guinea pig, etc.).
By "isolated nucleic acid" is meant a nucleic acid the structure of which is not identical to that of the naturally occurring nucleic acid or to that of any fragment of the naturally occurring genomic nucleic acid spanning more than three separate genes. The term therefore covers, for example, (a) a DNA which has the sequence of part of the naturally occurring genomic DNA molecules but is not flanked by both of the coding sequences that flank that part of the molecule in the genome of the organism in which it naturally occurs; (b) a nucleic acid incorporated into a vector or into the genomic DNA
of a prokaryote or eukaryote in a manner such that the resulting molecule is not identical to any naturally occurnng vector or genomic DNA; (c) a separate molecule such as cDNA, a genomic fragment, a fragment produced by polymerase chain reaction, or a restriction fragment; and (d) a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene encoding a fusion protein.
By "label" is meant any detectable tag that can be attached directly (e.g., a fluorescent molecule integrated into a polypeptide or nucleic acid) or indirectly (e.g., by way of binging to a primary antibody a secondary antibody with an integrated fluorscent molecule) to the molecule of interest. A "label" is any tag that can be visualized with imaging methods. The detectable tag can be a radio-opaque substance, radiolabel, a fluorescent label, or a magnetic label. The detectable tag can be selected from the group consisting of gamma-emitters, beta-emitters, and alpha-emitters, gamma-emitters, positron-emitters, X-ray-emitters and fluorescence-emitters suitable for localization. Suitable fluorescent compounds include fluorescein sodium, fluorescein isothiocyanate, phycoerythrin, and Texas Red sulfonyl chloride. See, de Belder & Wik (Preparation and properties of fluorescein-labelled hyaluronate. Carbohydr.
Res.44(2):251-57 (1975). Those skilled in the art will know, or will be able to ascertain with no more than routine experimentation, other fluorescent compounds that are suitable for labeling the molecule.
Polyneptides The invention provides members of a cluster of FcR and FcR gene relatives encoded by genes in the human chromosome 1q21-23 region, or analogous region in non-human subjects, including for example, chromosome 3 in mouse. A consensus amino acid motif, based on the FcyRI, FcyRII, Fc~yRIII, and pIgR extracellular regions, was used in a GenBank protein database query to identify member of the gene subfamily. Genomic clones were identified that were found to contain FcR
relatives and are termed the Fc receptor homolog (FcRH) subfamily: specifically, FcRHI, FcRH2, FcRH3, and FcRH6. Also, found were mouse Fc receptor homologs designated moFcRl, 2, and 3.
By "homologous" is meant about 25% percent homology or greater. Homology is also characterized by proximity in the location of the genes and by similarities as identified in a composite analysis. As used herein, "percent homology" of two amino acid sequences or of two nucleic acid sequences is determined using the algorithm of Karlin and Altschul (Proc. Natl. Acad. Sci. USA 87:2264-2268 (1990)). Such an algorithm is incorporated into the NBLAST and XBLAST programs of Altschul et al.
(J. Mol. Biol. 215:403-410 (1990)). BLAST nucleotide searches are performed with the NBLAST program, score 100, wordlength = 12, to obtain nucleotide sequences homologous to a nucleic acid molecule of the invention. BLAST protein searches are performed with the XBLAST program, score = 50, wordlength = 3, to obtain amino acid sequences homologous to a reference polypeptide . To obtain gapped alignments for comparison purposes, Gapped Blast is utilized as described in Altschul et al. (Nucl.
Acids Res. 25: 3389-3402 (1997)). When utilizing BLAST and Gapped BLAST
programs, the default parameters of the respective programs (e.g., XBLAST and NBLAST) are used. See http://ww.ncbi.nlm.nih.gov.
By "FcRH" is meant a Type I transmembrane receptor, or an alternatively spliced form thereof, including, for example, a secreted form or a GPI-anchored form, with homology to the classical Fc receptor family. In a preferred embodiment, the FcRH shows homology with the extracellular regions of Fc~yRI, FcyRII, Fc~yRIII, or pIgR. More specifically, the FcRH shows homology with an amino acid sequence corresponding with the amino terminal sequences of the second Ig domains of the Fc~yRs and the third Ig domain of pIgR or Fc~yRHl. The FcRH can comprise an extracellular region, a transmembrane region, and a cytoplasmic region. The extracellular region preferably comprises one or more Ig domains, and more preferably less than 9, and even more preferably less than 7 or less than 8 Ig domains.
Preferably, the cytoplasmic region comprises more than 107 (including more than 108, 109, 110, 111, 112, 113, 114, 115, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, or 145 amino acids).
Alternatively, the cytoplasmic region comprises less than 104 amino acids (including less than 103, 102, 101, 100, 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 89, 88, 87, 86, 85, 84, 83, 82, 81, 80). The cytoplasmic region preferably comprises one or more immunoreceptor tyrosine-based inhibitory or activation motifs ("ITIMs" or "ITAMs).
The invention provides isolated FcRHs (e.g., huFcRH 1, 2, 3, and 6, and moFcRHl-3, as described in detail below), as well as fragments and isoforms thereof.
The isolated amino acid sequences provided herein optionally are combined with a human signal sequence (e.g., MLPRLLLLICAPLCEP (SEQ ID N0:29), MLLWSLLVIFDAVTEQADS (SEQ ID NO:30), MLLWLLLLILTPGREQS (SEQ ID
N0:31), MLLWTAVLLFVPCVG (SEQ ID N0:32)) or a mouse signal sequence (e.g., MPLCLLLLVFAPVGVQS (SEQ ID N0:69), MLPWLLLLICALPCEPA (SEQ ID
N0:72), MSGSFSPCVVFTQMWLTLLVVTPVN (SEQ ID NO:79)).
In one embodiment, the invention provides huFcRHl and its fragments and isoforms. Thus, in one embodiment of the isolated FcRH, the extracellular region comprises less than four Ig domains. Preferably, the cytoplasmic region comprises less than 104 amino acids and, even more preferably, comprises less than 104 and more than 86 amino acids. In one embodiment, the transmembrane region comprises an acidic amino acid (e.g., glutamate or aspartate). The isolated FcRH of the invention comprises a cytoplasmic region having the amino acid sequence of SEQ ID NO:1, in the presence or absence of conservative amino acid substitutions. Further provided is the isolated FcRH, wherein the extracellular region comprises the amino acid sequence of SEQ ID
N0:21, in the presence or absence of conservative amino acid substitutions, and in the presence and absence of a signal sequence. More specifically, the isolated FcRH
comprises the amino acid sequence of SEQ ~ N0:2, in the presence or absence of conservative amino acid substitutions, and in the presence or absence of a signal sequence. In one embodiment the signal sequence is MLPRLLLLICAPLCEP (SEQ ID
N0:29). In a preferred embodiment, the FcRH of the invention is expressed by myeloid cells (e.g., granulocytes and monocytes). Additional characteristics of the full length FcRHl include a predicted molecular weight of about 46-47 kDaltons; about 425-(e.g., 429) amino acids in length with about 35 strongly basic(+) amino acids (I~,R), about 45 strongly acidic(-) amino acids (D,E), about 144 hydrophobic amino acids (A,I,L,F,W,V), and about127 polar amino acids (N,C,Q,S,T,I~; a predicted isolelectric point of about 5-5.5 (e.g., 5.310); and charge of about -9 at PH 7Ø
In another embodiment, the invention provides an isolated FcRH corresponding to huFcRH2, its fragments, or isoforms. Thus, the invention provides a FcRH
wherein the cytoplasmic region comprises less than 99 amino acids (e.g., 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98) and wherein the receptor further 5 comprises an extracellular domain with up to four Ig domains and up to five N-linked glycosylation sites. More specifically, the isolated FcRH has a cytoplasmic region that comprises the amino acid sequence of SEQ ID N0:3, in the presence or absence of conservative amino acid substitutions, or an extracellular region comprising SEQ ID
N0:22, in the presence or absence of conservative amino acid substitutions, and in the 10 presence or absence of a signal sequence. Even more specifically, the isolated FcRH
comprises the amino acid sequence of SEQ ID N0:4, in the presence or absence of conservative amino acid substitutions, and in the presence or absence of a signal sequence. In one embodiment, the signal sequence WSLLVIFDAVTEQADS (SEQ ID
N0:30). Additional characteristics of the full length FcRHl include a predicted molecular weight of about 50-60 kDaltons; about 495-515 (e.g., 508) amino acids in length with about 44 strongly basic(+) amino acids (K,R), about 49 strongly acidic(-) amino acids (D,E), about 175 hydrophobic amino acids (A,I,L,F,W,V), and about polar amino acids (N,C,Q,S,T,Y); a predicted isolelectric point of about 6-6.5 (e.g., 6.188); and charge of about -4 at PH 7Ø
In another embodiment, the invention provides huFcRH3, its fragments, and isoforms. More specifically, the invention provides an isolated FcRH having a cytoplasmic region that comprises more than 107 amino acids (e.g., 108, 109, 110, 11 l, 112, 113, 114, 115, 116, 117, 118, 119, 120, 212, 122, 123, 124, 125, 126 ,127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 247, 148, 149, 150 amino acids). Optionally, the isolated FcRH has a cytoplasmic region comprising one ITAM and one ITIM. More specifically, the cytoplasmic region comprises the amino acid sequence of SEQ ID NO:S or SEQ ID N0:23, in the presence or absence of conservative amino acid substitutions. In one embodiment, the extracellular domain of the FcRH comprises the amino acid sequence of SEQ ID
NO:24, in the presence or absence of conservative amino acid substitutions, and in the presence or absence of a signal sequence. Also provided is an isolated FcRH
comprising the amino acid sequence of SEQ ID NO:6 or SEQ ID N0:25, in the presence or absence of one or more amino acid substitutions, and in the presence or absence of a signal sequence. In one embodiment the signal sequence comprises MLLWLLLLILTPGREQS (SEQ ID N0:31). Additional characteristics of the full length FcRHl include a predicted molecular weight of about 80-90 kDaltons;
about 725-740 (e.g., 734) amino acids in length with about 68 strongly basic(+) amino acids (K,R), about 75 strongly acidic(-) amino acids (D,E), about 232 hydrophobic amino acids (A,I,L,F,W,V), and about 224 polar amino acids (N,C,Q,S,T,Y); a predicted isolelectric point of about 6.5-7.0 (e.g., 6.852); and charge of about -2 at PH 7Ø
The invention further provides an isolated huFcRH6, its fragments, and isoforms. More specifically, the FcRH comprises a cytoplasmic region having the amino acid sequence of SEQ ID N0:26, in the presence or absence or one or more conservative amino acid substitutions. The extracellular domain comprises the amino acid sequence of SEQ ff~ N0:27, in the presence or absence of conservative amino acid substitutions, and in the presence or absence of a signal sequence. Also, provided by the invention is a FcRH having the amino acid substitutions of SEQ ID N0:28, in the presence or absence of conservative amino acid substitutions, and in the presence or absence of a signal sequence. In one embodiment the signal sequence is MLLWTAVLLFVPCVG (SEQ ff~ N0:32).
The invention further provides a polypeptide comprising the amino acid sequence of SEQ ID N0:1, 21, 2, 3, 22, 4, 5, 23, 24, 6, 25, 26, 27, or 28, in the presence or absence of conservative amino acid substitutions. The invention also provides a polypeptide having at least 80, 85, 90, or 95% homology with SEQ ID NOs: l, 21, 2, 3, 22, 4, 5, 23, 24, 6, 25, 26, 27, or 28.
The invention further provides an isolated moFcRHl isoform, its fragments, and isofornzs. The moFcRHI is an isoform of SEQ 11? N0:68. More specifically, the FcRH comprises four Ig domains, optionally having the sequence of SEQ ff~ N0:
70, in the presence or absence or one or more conservative amino acid substitutions, and in the presence or absence of a signal sequence (e.g., the sequence of SEQ ff~
N0:71).
The invention further provides an isolated moFcRH2, its fragments, and isoforms. The provided isoforms include one isoform with a transmembrane region and one isoform lacking the transmembrane region. More specifically, the FcRH
comprises a cytoplasmic region having the amino acid sequence of SEQ ID N0:76, in the presence or absence or one or more conservative amino acid substitutions. The extracellular domain comprises the amino acid sequence of SEQ ID N0:74, in the presence or absence of conservative amino acid substitutions, and in the presence or absence of a signal sequence. Also, provided by the invention is a FcRH having the amino acid sequence of SEQ ID N0:73, which comprises a transmembrane region, or SEQ ID
N0:77, which lacks the transmembrane region. In each case, the FcRH sequence can include the presence or absence of conservative amino acid substitutions, and the presence or absence of a signal sequence. In one embodiment the signal sequence is the sequence of SEQ ID N0:72.
The invention also provided a moFcRH3, its fragments and isoforms. The cytoplasmic region can comprise the amino acid sequence of SEQ ID N0:81, in the presence or absence of conservative amino acid substitutions. Optionally, the extracellular domain comprises the amino acid sequence of SEQ ff~ N0:80, in the presence or absence of conservative amino acid substitutions or in the presence or absence of a signal sequence (e.g., the sequence of SEQ ID N0:79). The full length sequence optionally has the amin oacid sequence of SEQ m N0:78, in the presence or absence of conservative amino acid substitutions or in the presence or absence of a signal sequence (e.g., the sequence of SEQ ID N0:79).
Fragments, variants, or isoforms of the FcRHs of the invention are provided.
It is understood that these terms include functional variants. Fragments can include the cytoplasmic region, the extracellular region, the transmembrane region or any portion of at least 10 amino acids or any combination of the regions or portions. The variants are produced by making amino acid substitutions, deletions, and insertions, as well as post-translational modifications. Variations in post-translational modifications can include variations in the type or amount of carbohydrate moieties of the protein core or any fragment or derivative thereof. Variations in amino acid sequence may arise naturally as allelic variations (e.g., due to genetic polymorphism) or may be produced by human intervention (e.g., by mutagenesis of cloned DNA sequences), such as induced point, deletion, insertion and substitution mutants. These modifications can result in changes in the amino acid sequence, provide silent mutations, modify a restriction site, or provide other specific mutations.
Amino acid sequence modifications fall into one or more of three classes:
substitutional, insertional or deletional variants. Insertions include amino and/or carboxyl tennii~al fusions as well as intrasequence insertions of single or multiple amino acid residues. Insertions ordinarily will be smaller insertions than those of amino or carboxyl terminal fusions, for example, on the order of one to four residues.
Deletions are characterized by the removal of one or more amino acid residues from the protein sequence. Typically, no more than about 2 to 6 residues are deleted at any one site within the protein molecule. These variants ordinarily are prepared by site-specific mutagenesis of nucleotides in the DNA encoding the protein, thereby producing DNA
encoding the variant, and thereafter expressing the DNA in recombinant cell culture.
Techniques for making substitution mutations at predetermined sites in DNA
having a lcnown sequence are well known and include, for example, M13 primer mutagenesis and PCR mutagenesis. Amino acid substitutions are typically of single residues but may include multiple substitutions at different positions; insertions usually will be on the order of about from 1 to 10 amino acid residues but can be more; and deletions will range about from 1 to 30 residues, but can be more. Deletions or insertions preferably are made in adjacent pairs, i.e. a deletion of 2 residues or insertion of 2 residues.
Substitutions, deletions, insertions or any combination thereof may be combined to arrive at a final construct. The mutations must not place the sequence out of reading frame and preferably will not create complementary regions that could produce secondary mRNA structure. Substitutional variants are those in which at least one residue has been removed and a different residue inserted in its place. Such substitutions generally are made in accordance with Table 1 and are referred to as conservative substitutions.
TABLE l: Amino Acid Substitutions Original ResidueExemplary Substitutions Ala Ser Arg Lys Asn Gln Asp Glu Cys S er Gln Asn Glu Asp Gly Pro His Gln Tle Leu; Val Leu Ile; Val Lys Arg; Gln Met Leu; Ile Phe Met; Leu; Tyr Ser T~' T~. Ser Trp Tyr Tyt. Trp; Phe Val Ile; Leu Substantial changes in function or immunological identity are made by selecting substitutions that are less conservative than those in Table 1, i.e., selecting residues that differ more significantly in their effect on maintaining (a) the structure of the polypeptide backbone in the area of the substitution, for example as a sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at the target site or (c) the bulk of the side chain. The substitutions that in general are expected to produce the greatest changes in the protein properties will be those in which (a) a hydrophilic residue, e.g. seryl or threonyl, is substituted for (or by) a hydrophobic residue, e.g.
leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue having an electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g., phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine, in this case, (e) by increasing the number of sites for sulfation and/or glycosylation.
Substitutional or deletional mutagenesis can be employed to insert sites for N-glycosylation (Asn-X-Thr/Ser) or O-glycosylation (Ser or Thr). Deletions of cysteine or other labile residues also may be desirable. Deletions or substitutions of potential proteolysis sites, e.g. Arg, is accomplished for example by deleting one of the basic residues or substituting one by glutaminyl or histidyl residues.
Certain post-translational derivatizations are the result of the action of recombinant host cells on the expressed polypeptide. Glutaminyl and asparaginyl residues are frequently post-translationally deamidated to the corresponding glutamyl and asparyl residues. Alternatively, these residues are deamidated under mildly acidic conditions. Other post-translational modifications include hydroxylation of proline and lysine, phosphorylation of hydroxyl groups of Beryl or threonyl residues, methylation of the o-amino groups of lysine, arginine, and histidine side chains (T.E.
Creighton, Proteins: Structure and Molecular Properties, W. H. Freeman & Co., San Francisco pp 79-86 [1983]), acetylation of the N-terminal amine and, in some instances, amidation of the C-terminal carboxyl. Modifications in the FcRH can also include modifications in glyco sylation.
In all mutational events, it is understood that the controlling aspect of the mutation is the function that the subsequent protein possesses. The preferred mutations are those that do not detectably change the desired function or that increase the desired function.
Nucleic Acids Also provided is an isolated nucleic acid that encodes the FcRH of the invention. The nucleic acid can be single or double stranded and can be RNA or DNA.
More specifically, the invention provides an isolated nucleic acid, comprising a 10 nucleotide sequence that encodes SEQ ID NO:1, SEQ ID N0:21, SEQ ID N0:2.
SEQ
ID NO: 3, SEQ ID N0:22, SEQ ID N0:4, SEQ ID NO:S, SEQ ID N0:23, SEQ ID
N0:24, SEQ ID N0:25, SEQ ID N0:26, SEQ ID N0:27, SEQ ID N0:28, or SEQ ID
N0:6, SEQ 1T7 N0:70, SEQ ID N0:73, SEQ ID N0:74, SEQ ID N0:76, SEQ ID
N0:77, SEQ ID N0:78, SEQ ID N0:80, SEQ ID N0:81, optionally with conservative 15 amino acid substitutions. Optionally the nucleic acid further encodes a signal sequence (e.g., the signal sequences of SEQ >D N0:29, 30, 31, 32, 71, 75, 79). The isolated nucleic acid optionally encodes the sequences with 80, 85, 90, or 95 %
identity. More specifically, the invention provides an isolated nucleic acid, comprising a nucleotide sequence of SEQ ID N0:7, SEQ ID N0:13, SEQ ID N0:8, SEQ ID N0:34, SEQ ID
N0:9, SEQ ~ N0:14, SEQ ID NO:10, SEQ ID N0:36, SEQ ID NO:11, SEQ ID
NO:15, SEQ ID N0:16, SEQ >D N0:12, SEQ ID N0:38, SEQ ID N0:17, SEQ ID
N0:18, SEQ ID N0:19, SEQ ID N0:20; SEQ m N0:40, SEQ ID N0:84, SEQ ID
N0:85, SEQ 11? N0:87, SEQ m N0:88, SEQ ID N0:89~, SEQ ff~ N0:90, SEQ ID
N0:91, SEQ ID N0:92, SEQ ID N0:93, SEQ ID N0:94, SEQ ID N0:95, SEQ ~
N0:96, SEQ ID N0:97, SEQ ID N0:98, SEQ ID N0:99, SEQ ID NO:100, SEQ ID
NO:101, or SEQ ID N0:102 . Optionally, the isolated nucleic acid can further included bases that encode a signal sequence and thus the nucleotide sequence encoding the extracellular region or full-length huFcRHl, 2, 3, or 6 can optionally further comprise the nucleotide sequence of SEQ ID N0:33, SEQ ID N0:35, SEQ 1177 N0:37, SEQ m N0:39. Optionally, the isolated nucleic acids for rnoFcRHs include nucleic acid sequences that encode signal sequences as well, including for example, those portions of nucleic acid sequences SEQ ID NO:101, SEQ D7 N0:97, SEQ m N0:94, SEQ ID
N0:91, SEQ ID N0:88, SEQ ID N0:84.
Preferably the nucleic acid that encodes the full length FcRHl includes about 1290 bases. The nucleic acid that encodes the full length FcRH2 includes about bases, and the nucleic acid that encodes the full length FcRH3 includes about bases.
The invention also provides an isolated nucleic acid comprising a sequence that hybridizes under stringent conditions to a hybridization probe, wherein the hybridization probe comprises the nucleotide sequence of SEQ ff~ N0:7, SEQ m N0:13, SEQ m N0:8, SEQ m N0:34, SEQ m N0:9, SEQ m N0:14, SEQ ff~ NO:10, SEQ m N0:36, SEQ m NO:11, SEQ m NO:15, SEQ m NO:16, SEQ m N0:12, SEQ
m N0:38, SEQ m N0:17, SEQ m N0:18, SEQ m N0:19, and SEQ m N0:20; SEQ
ff~ N0:40, SEQ ID N0:84, SEQ m N0:85, SEQ m N0:87, SEQ m N0:88, SEQ m N0:89, SEQ m NO:90, SEQ m N0:91, SEQ ff~ NO:92, SEQ m N0:93, SEQ m N0:94, SEQ B7 N0:95, SEQ m N0:96, SEQ m N0:97, SEQ m N0:98, SEQ m NO:99, SEQ m NO:100, SEQ m NO:101, or SEQ m N0:102, or the complement of either sequence.
Further provided is a single stranded nucleic acid that hybridizes under stringent conditions to a nucleic acid having the sequence of SEQ m NO:7, SEQ m N0:13, SEQ m N0:8, SEQ m NO:34, SEQ ~ N0:9, SEQ m N0:14, SEQ m NO:10, SEQ
m NO:36, SEQ ff) NO:11, SEQ m NO:15, SEQ m N0:16, SEQ ff~ N0:12, SEQ m N0:38, SEQ ~ N0:17, SEQ m NO:18, SEQ ff~ N0:19, and SEQ m N0:20; SEQ m N0:40, SEQ m N0:84, SEQ m N0:85, SEQ m NO:87, SEQ m N0:88, SEQ m N0:89, SEQ m N0:90, SEQ m N0:91, SEQ ~ N0:92, SEQ m N0:93, SEQ m N0:94, SEQ m N0:95, SEQ m N0:96, SEQ m N0:97, SEQ m N0:98, SEQ m N0:99, SEQ m NO:100, SEQ m NO:101, or SEQ m N0:102.
By "hybridizing under stringent conditions" or "hybridizing under highly stringent conditions" is meant that the hybridizing portion of the hybridizing nucleic acid, typically comprising at least 15 (e.g., 20, 25, 30, or 50 nucleotides), hybridizes to all or a portion of the provided nucleotide sequence under stringent conditions. The term "hybridization" typically means a sequence driven interaction between at least two nucleic acid molecules, such as a primer or a probe and a gene. Sequence driven interaction means an interaction that occurs between two nucleotides or nucleotide analogs or nucleotide derivatives in a nucleotide specific manner. For example, G
interacting with C or A interacting with T are sequence driven interactions.
Typically sequence driven interactions occur on the Watson-Crick face or Hoogsteen face of the nucleotide. The hybridization of two nucleic acids is affected by a number of conditions and parameters known to those of skill in the art. For example, the salt concentrations, pH, and temperature of the reaction all affect whether two nucleic acid molecules will hybridize. Generally, the hybridizing portion of the hybridizing nucleic acid is at least 80%, for example, at least 90%, 95%, or 98%, identical to the sequence of or a portion of a nucleic acid encoding an FcRH of the invention, or its complement.
Hybridizing nucleic acids of the invention can be used, for example, as a cloning probe, a primer (e.g., for PCR), a diagnostic probe, or an antisense probe.
Hybridization of the oligonucleotide probe to a nucleic acid sample typically is performed under stringent conditions. Nucleic acid duplex or hybrid stability is expressed as the melting temperature or Tm, which is the temperature at which a probe dissociates from a target DNA. This melting temperature is used to define the required stringency conditions. If sequences are to be identified that are related and substantially identical to the probe, rather than identical, then it is useful to first establish the lowest temperature at which only homologous hybridization occurs with a particular concentration of salt (e.g., SSC or SSPE). Assuming that a 1% mismatch results in a 1°C decrease in the Tm, the temperature of the final wash in the hybridization reaction is reduced accordingly (for example, if sequence having >95% identity with the probe are sought, the final wash temperature is decreased by 5 °C). In practice, the change in Tm can be between 0.5 °C and 1.5 °C per 1% mismatch. Stringent conditions involve hybridizing at 68 °C in Sx SSC/Sx Denhardt's solution/1.0% SDS, and washing in 0.2x SSC/0.1% SDS at room temperature. Moderately stringent conditions include washing in 3x SSC at 42 °C. The parameters of salt concentration and temperature can be varied to achieve the optimal level of identity between the probe and the target nucleic acid.
Additional guidance regarding such conditions is readily available in the art, for example, in Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, NY; and Ausubel et al. (eds.), 1995, Current Protocols in Molecular Biology, (John Wiley & Sons, NY) at Unit 2.10.
The nucleic acids of the present invention are optionally labeled, directly or indirectly. Such labeled nucleic acids are useful in various diagnostic techniques including for example, izz situ hybridization, FISH, izz situ PCR, and PRINS.
Both methods involve the preparation of short sequences of single-stranded nucleic acid probes that are complementary to the nucleic acid sequences that encode an FcRH.
See, e.g., M Andreeff and D Pinkel (1999), An Introduction to Flourescent In-Situ Hybridization: Principles and Clinical Applications, John Wiley ~ Sons, Ltd;
Roche Applied Sciences (2000), Nonradioactive In Situ Hybridization Application Manual;
Roche Applied Sciences (1999), PCR Manual, 2d edition, which are incorporated in their entirety for methods of using nucleic acids.
Vectors cells and methods of using Also provided is an expression vector comprising a nucleic acid of the invention, wherein the nucleic acid is operably linked to an expression control sequence. A wide variety of expression system/regulatory sequence combinations may be employed in expressing the disclosed. Such useful regulatory sequences include, for example, the early or late promoters of SV40, CMV, vaccinia, polyoma or adenovirus, the lac system, the trp system, the TAC system, the TRC system, the LTR
system, the major operator and promoter regions of phage lambda, the control regions of fd coat protein, the promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid phosphatase (for example, PhoS), the AOX 1 promoter of methylotrophic yeast, the promoters of the yeast a-mating factors, and other sequences known to control the expression of genes of prokaryotic or eukaryotic cells or their viruses, and various combinations thereof.
Such an expression vector can be designed to be expressed by eukaryotic cells or prokaryotic cells. The vectors of the present invention thus provide DNA
molecules which are capable of integration into a prokaryotic or eukaryotic chromosome and expression. The inserted genes in viral and retroviral vectors usually contain promoters, and/or enhancers to help control the expression of the desired gene product. A
promoter is generally a sequence or sequences of DNA that function when in a relatively fixed location in regard to the transcription start site. A
promoter contains core elements required for basic interaction of RNA polymerase and transcription factors, and may contain upstream elements and response elements. It has been shown that all specific regulatory elements can be cloned and used to construct expression vectors that are selectively expressed in specific cell types. For example, the glial fibrillary acetic protein (GFAP) promoter has been used to selectively express genes in cells of glial origin. Expression vectors used in eukaryotic host cells (e.g., yeast, fungi, insect, plant, animal, human or nucleated cells) may also contain sequences necessary for the ternzination of transcription which may affect mRNA expression. These regions are transcribed as polyadenylated segments in the untranslated portion of the mRNA
encoding tissue factor protein. The 3' untranslated regions also include transcription termination sites. It is preferred that the transcription unit also contain a polyadenylation region. One benefit of this region is that it increases the likelihood that the transcribed unit will be processed and transported like mRNA. The identification and use of polyadenylation signals in expression constl-ucts is well established. It is preferred that homologous polyadenylation signals be used in the transgene constructs.
In certain transcription units, the polyadenylation region is derived from the SV40 early polyadenylation signal and consists of about 400 bases. It is also preferred that the transcribed units contain other standard sequences alone or in combination with the above sequences improve expression from, or stability of, the construct.
The invention further provides transfer vectors, which include any nucleotide construction used to deliver genes into cells (e.g., a plasmid), or as part of a general strategy to deliver genes, e.g., as part of recombinant retrovirus or adenovirus (Ram et al. Cancer Res. 53:83-88, (1993)). As used herein, plasmid or viral vectors are agents that transport the disclosed nucleic acids into the cell without degradation and include a promoter yielding expression of the gene in the cells into which it is delivered. In some embodiments the FcRHs are derived from either a virus or a retrovirus. Viral vectors include, for example, Adenovirus, Adeno-associated virus, Herpes virus, Vaccinia virus, Polio virus, AIDS virus, neuronal trophic virus, Sindbis and other RNA
viruses, including these viruses with the HIV backbone. Also preferred are any viral families that share the properties of these viruses that make them suitable for use as vectors.
Retroviruses include Murine Maloney Leukemia virus, MMLV, and retroviruses that express the desirable properties of MMLV as a vector. Retroviral vectors are able to carry a larger genetic payload, i.e., a transgene or marker gene, than other viral vectors, and for this reason are a commonly used vector. However, they are not as useful in non-proliferating cells. Adenovirus vectors are relatively stable and easy to work with, have high titers, and can be delivered in aerosol formulation, and can transfect non-dividing cells. Pox viral vectors are large and have several sites for inserting genes, they are thermostable and can be stored at room temperature. A preferred embodiment is a viral vector which has been engineered so as to suppress the immune response of the host organism, elicited by the viral antigens.
Viral vectors can have higher transaction (ability to introduce genes) abilities than chemical or physical methods to introduce genes into cells. Typically, viral 5 vectors contain, nonstructural early genes, structural late genes, an RNA
polymerase III
transcript, inverted terminal repeats necessary for replication and encapsidation, and promoters to control the transcription and replication of the viral genome.
When engineered as vectors, viruses typically have one or more of the early genes removed and a gene or gene/promotor cassette is inserted into the viral genome in place of the 10 removed viral DNA. Constructs of this type can carry up to about 8 kb of foreign genetic material. The necessary functions of the removed early genes are typically supplied by cell lines that have been engineered to express the gene products of the early genes in trans.
A retrovirus is an animal virus belonging to the virus family of Retroviridae, 15 including any types, subfamilies, genus, or tropisms. Retroviral vectors, in general, are described by Verma, LM., Retroviral vectors for gene transfer. In Microbiology-1985, American Society for Microbiology, pp. 229-232, Washington, (1985), which is incorporated by reference herein. Examples of methods for using retroviral vectors for gene therapy are described in U.S. Patent Nos. 4,868,116 and 4,980,286; PCT
20 applications WO 90/02806 and WO 89/07136; and Mulligan, (Science 260:926-(1993)); the teachings of which are incorporated herein by reference. A
retrovirus is essentially a package which has packed into it nucleic acid cargo. The nucleic acid cargo carnes with it a packaging signal, which ensures that the replicated daughter molecules will be efficiently packaged within the package coat. In addition to the package signal, there are a number of molecules that are needed in cis, for the replication, and packaging of the replicated virus. Typically a retroviral genome, contains the gag, pol, and env genes which are involved in the making of the protein coat. It is the gag, pol, and env genes which are typically replaced by the foreign DNA
that it is to be transferred to the target cell. Retrovirus vectors typically contain a packaging signal for incorporation into the package coat, a sequence which signals the start of the gag transcription unit, elements necessary for reverse transcription, including a primer binding site to bind the tRNA primer of reverse transcription, terminal repeat sequences that guide the switch of RNA strands during DNA
synthesis, a purine rich sequence 5' to the 3' LTR that serve as the priming site for the synthesis of the second strand of DNA synthesis, and specific sequences near the ends of the LTRs that enable the insertion of the DNA state of the retrovirus to insert into the host genome. The removal of the gag, pol, and env genes allows for about 8 kb of foreign sequence to be inserted into the viral genome, become reverse transcribed, and upon replication be packaged into a new retroviral particle. This amount of nucleic acid is sufficient for the delivery of a one to many genes depending on the size of each transcript. It is preferable to include either positive or negative selectable markers along with other genes in the insert.
Since the replication machinery and packaging proteins in most retroviral vectors have been removed (gag, pol, and envy, the vectors are typically generated by placing them into a packaging cell line. A packaging cell line is a cell line that has been transfected or transformed with a retrovirus that contains the replication and packaging machinery, but lacks any packaging signal. When the vector carrying the DNA of choice is transfected into these cell lines, the vector containing the gene of interest is replicated and packaged into new retroviral particles, by the machinery provided in cis by the helper cell. The genomes for the machinery are not packaged because they lack the necessary signals.
The construction of replication-defective adenoviruses has been described (Berkner et al., J. Virology 61:1213-1220 (1987); Massie et al., Mol. Cell.
Biol. 6:2872-2883 (1986); Haj-Ahmad et al., J. Virology 57:267-274 (1986); Davidson et al., J.
Virology 61:1226-1239 (1987); Zhang, Generation and identification of recombinant adenovirus by liposome-mediated transfection and PCR analysis, BioTechniques 15:868-872 (1993)). The benefit of the use of these viruses as vectors is that they are limited in the extent fo which they can spread to other cell types, since they can replicate within an initial infected cell, but are unable to form new infectious viral particles. Recombinant adenoviruses have been shown to achieve high efficiency gene transfer after direct, in vivo delivery to airway epithelium, hepatocytes, vascular endothelium, CNS parenchyma and a number of other tissue sites (Morsy, J.
Clin.
Invest. 92:1580-1586 (1993); Kirshenbaum, J. Clin. Invest. 92:381-387 (1993);
Roessler, J. Clin. Invest. 92:1085-1092 (1993); Moullier, Nature Genetics 4:154-159 (1993); La Salle, Science 259:988-990 (1993); Gomez-Foix, J. Biol. Chem.
267:25129-25134 (1992); Rich, Human Gene Therapy 4:461-476 (1993); Zabner, Nature Genetics 6:75-83 (1994); Guzman, Circulation Research 73:1201-1207 (1993);
Bout, Human Gene Therapy 5:3-10 (1994); Zabner, Cell 75:207-216 (1993);
Caillaud, Eur. J. Neuroscience 5:1287-1291 (1993); and Ragot, J. Gen. Virology 74:501-(1993)). Recombinant adenoviruses achieve gene transduction by binding to specific cell surface receptors, after which the virus is internalized by receptor-mediated endocytosis, in the same manner as wild type or replication-defective adenovirus (Chardonnet and Dales, Virology 40:462-477 (1970); Brown and Burlingham, J.
Virology 12:386-396 (1973); Svensson and Persson, J. Virology 55:442-449 (1985);
Seth, et al., J. Virol. 51:650-655 (1984); Seth, et al., Mol. Cell. Biol.
4:1528-1533 (1984); Varga et al., J. Virology 65:6061-6070 (1991); Wickham et al., Cell 73:309-319 (1993)).
A viral vector can be one based on an adenovirus which has had the E1 gene removed and these virons are generated in a cell line such as the human 293 cell line.
In another preferred embodiment both the El and E3 genes are removed from the adenovirus genome.
Another type of viral vector is based on an adeno-associated virus (AAV). This defective parvovirus is a preferred vector because it can infect many cell types and is nonpathogenic to humans. AAV type vectors can transport about 4 to 5 kb and wild type AAV is known to stably insert into chromosome 19. Vectors which contain this site specific integration property are preferred. An especially preferred embodiment of this type of vector is the P4.1 C vector produced by Avigen, San Francisco, CA, which can contain the herpes simplex virus thymidine kinase gene, HSV-tk, and/or a marker gene, such as the gene encoding the green fluorescent protein, GFP.
In another type of AAV virus, the AAV contains a pair of inverted terminal repeats (ITRs) which flank at least one cassette containing a promoter which directs cell-specific expression operably linked to a heterologous gene. Heterologous in this context refers to any nucleotide sequence or gene which is not native to the AAV or B19 parvovirus.
Typically the AAV and B19 coding regions have been deleted, resulting in a safe, noncytotoxic vector. The AAV ITRs, or modifications thereof, confer infectivity and site-specific integration, but not cytotoxicity, and the promoter directs cell-specific expression. United States Patent No. 6,261,834 is herein incorproated by reference for material related to the AAV vector.
Molecular genetic experiments with large human herpesviruses have provided a means whereby large heterologous DNA fragments can be cloned, propagated and established in cells permissive for infection with herpesviruses (Sun et al., Nature genetics 8: 33-41, 1994; Cotter and Robertson, Curr Opin Mol Ther 5: 633-644, 1999).
These large DNA viruses (herpes simplex virus (HSV) and Epstein-Barr virus (EBV), have the potential to deliver fragments of human heterologous DNA > 150 kb to specific cells. EBV recombinants can maintain large pieces of DNA in the infected B-cells as episomal DNA. Individual clones carried human genomic inserts up to 330 kb appeared genetically stable The maintenance of these episomes requires a specific EBV
nuclear protein, EBNAl, constitutively expressed during infection with EBV.
Additionally, these vectors can be used for transfection, where large amounts of protein can be generated transiently in vitro. Herpesvirus amplicon systems are also being used to package pieces of DNA > 220 kb and to infect cells that can stably maintain DNA as episomes. Other useful systems include, for example, replicating and host-restricted non-replicating vaccinia virus vectors.
The invention also provides an isolated cell comprising a vector of the invention. The isolated cell can be either a eukaryotic or prokaryotic cell, such as strains of E. coli, Pseudornonas, Bacillus , S'tr~eptomyces; fungi such as yeasts (Saechaf°orrayces , and methylotrophic yeast such as Piclaia, Gandida, Hansenula, and Torulopsis); and animal cells, such as CHO, Rl.l, B-W and LM cells, African Green Monkey kidney cells (for example, COS 1, COS 7, BSC1, BSC40, and BMT10), insect cells (for example, Sue), and human cells and plant cells in tissue culture.
Also provided is a method of making a FcRH, or a fragment or variant thereof comprising culturing a cell comprising a vector of the invention under conditions permitting expression of the FcRH. The method comprises culturing a cell comprising an exogeneous nucleic acid that encodes the FcRH, fragment, or variant, wherein the exogeneous nucleic acid is operably linked to an expression control sequence, and wherein the culture conditions permit expression of the FcRH, fragment, or variant under the control of the expression control sequence; harvesting the medium from the cultured cells, and isolating the FcRH, fragment, or varinat from the cell or culture medium. Optionally the exogenous nucleic acid is the nucleotide sequence of SEQ m N0:7, SEQ ID N0:13, SEQ ID N0:8, SEQ ID N0:34, SEQ ID N0:9, SEQ ID N0:14, SEQ ID NO:10, SEQ ID N0:36, SEQ m NO:11, SEQ ID NO:15, SEQ m N0:16, SEQ
ID N0:12, SEQ ID N0:38, SEQ )D N0:17, SEQ m N0:18, SEQ ID N0:19, and SEQ
)D N0:20; SEQ ID N0:40, SEQ )D N0:84, SEQ ID N0:85, SEQ )D N0:87, SEQ 1D
N0:88, SEQ )D N0:89, SEQ )D N0:90, SEQ m N0:91, SEQ 117 N0:92, SEQ m N0:93, SEQ )D N0:94, SEQ )D N0:95, SEQ m N0:96, SEQ >D N0:97, SEQ )D
N0:98, SEQ ff~ N0:99, SEQ m NO:100, SEQ )D NO:101, or SEQ )D N0:102 or a combination thereof. Optionally, the exogenous nucleic acid further comprises a nucleotide sequence that encodes a signal sequence. In the recombinant methods, the cell can be any known host cell, including for example, a prokaryotic or eukaiyotic cell.
The nucleic acids that are delivered to cells, generally in a plasmid or other vector, typically contain expression controlling systems. For example, the inserted genes in viral and retroviral systems usually contain promoters, and/or enhancers to help control the expression of the desired gene product.
Those skilled in the art of molecular biology will understand that a wide variety of expression systems may be used to produce recombinant FcRH polypeptides (as well as fragments, fusion proteins, and amino acid sequence variants with therapeutic activity) for use in the methods of the invention. Thus, FcRH may be produced using prokaryotic host cells (e.g., Escherichia coli) or eukaryotic host cells (e.g., Saccharomyces cerevisiae, insect cells such as Sf~ cells, or mammalian cells such as CHO cells, COS-1, NIH 3T3, or HeLa cells). These cells are commercially available from, for example, the American Type Culture Collection, Rockville, MD (see also F.
Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, New York, NY, 1998). The method of transformation and the choice of expression vector will depend on the host system selected. Transformation and transfection methods are described, e.g., in Ausubel et al., supra, and expression vectors may be chosen from the numerous examples known in the art.
A nucleic acid sequence encoding an FcRH is introduced into a plasmid or other vector, which is then used to transform living cells. Constructs in which a cDNA
containing the entire FcRH coding sequence, a fragment of the FcRH coding sequence, amino acid variations of the FcRH coding sequence, or fusion proteins of the aforementioned, inserted in the correct orientation into an expression plasmid, may be used for protein expression. In some cases, for example, it may be desirable to express the FcRH coding sequence under the control of an inducible or tissue-specific promoter.
Eukaryotic expression systems permit appropriate post-translational modifications to expressed proteins. Thus, eukaryotic, and more preferably mammalian expression systems, allow glycosylations patterns comparable to naturally expressed FcRH. Transient transfection of a eukaryotic expression plasmid allows the transient 5 production of FcRH by a transfected host cell. FcRH may also be produced by a stably-transfected mammalian cell line. A number of vectors suitable for stable transfection of mammalian cells are available to the public (e.g., see Pouwels et al., Cloning Vectors:
A Laboratory Manual, 1985, Supp. 1987), as are methods for constructing such cell lines (see e.g., F. Ausubel et al., Current Protocols in Molecular Biology, John Wiley &
10 Sons, New York, NY, 1998). Another preferred eukaryotic expression system is the baculovirus system using, for example, the vector pBacPAK9, which is available from Clontech (Palo Alto, CA). If desired, this system may be used in conjunction with other protein expression techniques, for example, the myc tag approach described by Evan et al. (Mol. Cell Biol. 5:3610-3616, 1985) or analogous tagging approaches, e.g., 15 using a hemagluttinin (HA) tag.
Once the recombinant protein is expressed, it can be isolated from the expressing cells by cell lysis followed by protein purification techniques such as affinity chromatography. In this example, an antibody that specifically binds to FcRH, which may be produced by methods that are well-known in the art, can be attached to a 20 column and used to isolate FcRH. Once isolated, the recombinant protein can, if desired, be purified further, e.g., by high performance liquid chromatography (HPLC;
e.g., see Fisher, Laboratory Techniques In Biochemistry And Molecular Biology, Work and Burdon, Eds., Elsevier, 1980).
25 Antibodies The invention also provides a purified antibody or immunologic fragment thereof, wherein the antibody or fragment thereof selectively binds to an FcRH. As used herein, the term "antibody" encompasses, but is not limited to, whole immunoglobulin (i.e., an intact antibody) of any class. Native antibodies are usually heterotetrarneric glycoproteins, composed of two identical light (L) chains and two identical heavy (H) chains. Typically, each light chain is linked to a heavy chain by one covalent disulfide bond, while the number of disulfide linkages varies between the heavy chains of different immunoglobulin isotypes. Each heavy and light chain also has regularly spaced intrachain disulfide bridges. Each heavy chain has at one end a variable domain (V(I~) followed by a number of constant domains. Each light chain has a variable domain at one end (V(L)) and a constant domain at its other end; the constant domain of the light chain is aligned with the first constant domain of the heavy chain, and the light chain variable domain is aligned with the variable domain of the heavy chain. Particular amino acid residues are believed to form an interface between the light and heavy chain variable domains. The light chains of antibodies from any vertebrate species can be assigned to one of two clearly distinct types, called kappa (k) and lambda (1), based on the amino acid sequences of their constant domains.
Depending on the amino acid sequence of the constant domain of their heavy chains, immunoglobulins can be assigned to different classes. There are five major classes of immunoglobulins: IgA, IgD, IgE, IgG and IgM, and several of these may be further divided into subclasses (isotypes), e.g., IgG-1, IgG-2, IgG-3, and IgG-4; IgA-1 and IgA-2. The heavy chain constant domains that correspond to the different classes of immunoglobulins are called alpha, delta, epsilon, gamma, and mu, respectively.
The term "variable" is used herein to describe certain portions of the variable domains that differ in sequence among antibodies and are used in the binding and specificity of each particular antibody for its particular antigen. However, the variability is not usually evenly distributed through the variable domains of antibodies.
It is typically concentrated in three segments called complementarity determining regions (CDRs) or hypervariable regions both in the light chain and the heavy chain variable domains. The more highly conserved portions of the variable domains are called the framework (FR). The variable domains of native heavy and light chains each comprise four FR regions, largely adopting a b-sheet configuration, connected by three CDRs, which form loops connecting, and in some cases forming part of, the b-sheet structure. The CDRs in each chain are held together in close proximity by the FR
regions and, with the CDRs from the other chain, contribute to the formation of the antigen binding site of antibodies (see Kabat E. A. et al., "Sequences of Proteins of Ixnmunological Interest" National Institutes of Health, Bethesda, Md. (1987)).
The constant domains are not involved directly in binding an antibody to an antigen, but exhibit various effector functions, such as participation of the antibody in antibody-dependent cellular toxicity.
The term "antibody or fragments thereof' can also encompass chimeric antibodies and hybrid antibodies, with dual or multiple antigen or epitope specificities, and fragments, such as F(ab')2, Fab', Fab and the like, including hybrid fragments.
Thus, fragments of the antibodies that retain the ability to bind their specific antigens are provided. For example, fragments of antibodies which maintain FcRH binding activity are included within the meaning of the term "antibody or fragment thereof."
Such antibodies and fragments can be made by techniques known in the art and can be screened for specificity and activity according to the methods set forth in the Examples and in general 'methods for producing antibodies and screening antibodies for specificity and activity (See Harlow and Lane. Antibodies, A Laboratory Manual. Cold Spring Harbor Publications, New York, (1988)).
Also included within the meaning of "antibody or fragments thereof' are conjugates of antibody fragments and antigen binding proteins (single chain antibodies) as described, for example, in U.S. Pat. No. 4,704,692, the contents of which are hereby incorporated by reference.
In one embodiment, the antibody is a monoclonal antibody. The term "monoclonal antibody" as used herein refers to an antibody obtained from a substantially homogeneous population of antibodies, i.e., the individual antibodies comprising the population are identical except for possible naturally occurnng mutations that may be present in minor amounts. The monoclonal antibodies herein specifically include "chimeric" antibodies in which a portion of the heavy and/or light chain is identical with or homologous to corresponding sequences in antibodies derived from a particular species or belonging to a particular antibody class or subclass, while the remainder of the chains) is identical with or homologous to corresponding sequences in antibodies derived from another species or belonging to another antibody class or subclass, as well as fragments of such antibodies, so long as they exhibit the desired activity (See, U.S. Pat. No. 4,816,567 and Morrison et al., Proc.
Natl. Acad. Sci.
USA, 81:6851-6855 (1984)).
Monoclonal antibodies of the invention may be prepared using hybridoma methods, such as those described by Kohler and Milstein, Nature, 256:495 (1975) or Harlow and Lane, Antibodies, A Laboratory Manual. Cold Spring Harbor Publications, New York, (1988). In a hybridoma method, a mouse or other appropriate host animal, is typically immunized with an immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind to the immunizing agent.
Alternatively, the lymphocytes may be immunized in vitro. Preferably, the immunizing agent comprises an FcRH. Traditionally, the generation of monoclonal antibodies has depended on the availability of purified protein or peptides for use as the immunogen.
More recently DNA based immunizations have shown promise as a way to elicit strong immune responses and generate monoclonal antibodies. In this approach, DNA-based immunization can be used, wherein DNA encoding a portion of FcRH, preferably the N- or C- terminal region, is injected into the host animal according to methods known in the art.
Generally, either peripheral blood lymphocytes ("PBLs") are used in methods of producing monoclonal antibodies if cells of human origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources are desired. The lymphocytes are then fused with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, "Monoclonal Antibodies: Principles and Practice" Academic Press, (1986) pp. 59-103).
Immortalized cell lines are usually transformed mammalian cells, including myeloma cells of rodent, bovine, equine, and human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture medium that preferably contains one or more substances that inhibit the growth or survival of the unfused, immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient cells.
Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression of antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, Calif. and the American Type Culture Collection, Rockville, Md. Human myeloma and mouse-human heteromyeloma cell lines also have been described for the production of human monoclonal antibodies (I~ozbor, J.
lmmunol., 133:3001 (1984); Brodeur et al., "Monoclonal Antibody Production Techniques and Applications" Marcel Dekker, Inc., New York, (1987) pp. 51-63).
The culture medium in which the hybridoma cells are cultured can then be assayed for the presence of monoclonal antibodies directed against an FcRH.
Preferably, the binding specificity of monoclonal antibodies produced by the hybridoma cells is determined by innnunoprecipitation or by an ira vitro binding assay, such as radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in the art, and are described further in Harlow and Lane "Antibodies, A Laboratory Manual" Cold Spring Harbor Publications, New York, (1988).
After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution or FACS sorting procedures and grown by standard methods.
Suitable culture media for this purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. Alternatively, the hybridoma cells may be grown i~
vivo as ascites in a mammal.
The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture medium or ascites fluid by conventional immunoglobulin purification procedures such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography.
The monoclonal antibodies may also be made by recombinant DNA methods, such as those described in U.S. Pat. No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can be readily isolated and sequenced using conventional procedures (e.g., by using oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred source of such DNA. Once isolated, the DNA may be placed into expression vectors, which are then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, plasmacytoma cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the recombinant host cells.
The DNA also may be modified, for example, by substituting the coding sequence for human heavy and light chain constant domains in place of the homologous murine sequences (U.S. Pat. No. 4,816,567) or by covalently joining to the immunoglobulin coding sequence all or part of the coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin polypeptide can be substituted for the constant domains of an antibody of the invention, or can be substituted for the variable domains of one antigen-combining site of an antibody of the invention to create a chimeric bivalent antibody comprising one antigen-combining site having specificity 5 for FcRH and another antigen-combining site having specificity for a different antigen.
Ira vitro methods are also suitable for preparing monovalent antibodies.
Digestion of antibodies to produce fragments thereof, particularly, Fab fragments, can be accomplished using routine techniques known in the art. For instance, digestion can be performed using papain. Examples of papain digestion are described in WO
10 94/29348 published Dec. 22, 1994, U.S. Pat. No. 4,342,566, and Harlow and Lane, Antibodies, A Laboratory Manual, Cold Spring Harbor Publications, New York, (1988). Papain digestion of antibodies typically produces two identical antigen binding fragments, called Fab fragments, each with a single antigen binding site, and a residual Fc fragment. Pepsin treatment yields a fragment, called the F(ab')2 fragment, that has 15 two antigen combining sites and is still capable of cross-linking antigen.
The Fab fragments produced in the antibody digestion also contain the constant domains of the light chain and the first constant domain of the heavy chain.
Fab' fragments differ from Fab fragments by the addition of a few residues at the carboxy terminus of the heavy chain domain including one or more cysteines from the antibody 20 hinge region. The F(ab')2 fragment is a bivalent fragment comprising two Fab' fragments linked by a disulfide bridge at the hinge region. Fab'-SH is the designation herein for Fab' in which the cysteine residues) of the constant domains bear a free thiol group. Antibody fragments originally were produced as pairs of Fab' fragments which have hinge cysteines between them. Other chemical couplings of antibody fragments 25 are also known.
An isolated immunogenically specific epitope or fragment of the antibody is also provided. A specific immunogenic epitope of the antibody can be isolated from the whole antibody by chemical or mechanical disruption of the molecule. The purified fragments thus obtained can be tested to determine their immunogenicity and specificity 30 by the methods taught herein. Irnmunoreactive epitopes of the antibody can also be synthesized directly. An immunoreactive fragment is defined as an amino acid sequence of at least about two to five consecutive amino acids derived from the antibody amino acid sequence.
One method of producing proteins comprising the antibodies of the present invention is to link two or more peptides or polypeptides together by protein chemistry techniques. For example, peptides or polypeptides can be chemically synthesized using currently available laboratory equipment using either Fmoc (9-fluorenylmethyl-oxycarbonyl) or Boc (tef°t -butyloxycarbonoyl) chemistry. (Applied Biosysterns, Inc., Foster City, CA). One skilled in the art can readily appreciate that a peptide or polypeptide corresponding to the antibody of the present invention, for example, can be synthesized by standard chemical reactions. For example, a peptide or polypeptide can be synthesized and not cleaved from its synthesis resin whereas the other fragment of an antibody can be synthesized and subsequently cleaved from the resin, thereby exposing a ternzinal group that is functionally blocked on the other fragment. By peptide condensation reactions, these two fragments can be covalently joined via a peptide bond at their carboxyl and amino termini, respectively, to form an antibody, or fragment thereof. (Grant GA (1992) Synthetic Peptides: A User Guide. W.H. Freeman and Co., N.Y. (1992); Bodansky M and Trost B., Ed. (1993) Principles of Peptide Synthesis.
Springer-Verlag Inc., NY). Alternatively, the peptide or polypeptide can by independently synthesized iri vivo as described above. Once isolated, these independent peptides or polypeptides may be linked to form an antibody or fragment thereof via similar peptide condensation reactions.
For example, enzymatic ligation of cloned or synthetic peptide segments can allow relatively short peptide fragments to be joined to produce larger peptide fragments, polypeptides or whole protein domains (Abrahmsen L et al., Biochemistry, 30:4151 (1991)). Alternatively, native chemical ligation of synthetic peptides can be utilized to synthetically construct large peptides or polypeptides from shorter peptide fragments. This method consists of a two step chemical reaction (Dawson et al.
Synthesis of Proteins by Native Chemical Ligation. Science, 266:776-779 (1994)). The first step is the chemoselective reaction of an unprotected synthetic peptide-oc-thioester with another unprotected peptide segment containing an amino-terminal Cys residue to give a thioester-linked intermediate as the initial covalent product. Without a change in the reaction conditions, this intermediate undergoes spontaneous, rapid intramolecular reaction to form a native peptide bond at the ligation site. Application of this native chemical ligation method to the total synthesis of a protein molecule is illustrated by the preparation ofhuman interleukin 8 (IL-8) (Baggiolini M et al. (1992) FEBS
Lett.
307:97-101; Clark-Lewis I et al., J.Biol.Chem., 269:16075 (1994); Clark-Lewis I et al., Biochemistry, 30:3128 (1991); Rajarathnam K et al., Biochemistry 33:6623-30 (1994)).
Alternatively, unprotected peptide segments can be chemically linked where the bond formed between the peptide segments as a result of the chemical ligation is an unnatural (non-peptide) bond (Schnolzer, M et al. Science, 256:221 (1992)).
This technique has been used to synthesize analogs of protein domains as well as large amounts of relatively pure proteins with full biological activity (deLisle Milton RC et al., Techniques in Protein Chemistry IV. Academic Press, New York, pp. 257-267 (1992)).
The invention also provides fragments of antibodies that have bioactivity. The polypeptide fragments of the present invention can be recombinant proteins obtained by cloning nucleic acids encoding the polypeptide in an expression system capable of producing the polypeptide fragments thereof, such as an adenovirus or baculovirus expression system. For example, one can determine the active domain of an antibody from a specific hybridoma that can cause a biological effect associated with the interaction of the antibody with FcRH. For example, amino acids found to not contribute to either the activity or the binding specificity or affinity of the antibody can be deleted without a loss in the respective activity.
For example, amino or carboxy-terminal amino acids can be sequentially removed from either the native or the modified non-immunoglobulin molecule or the immunoglobulin molecule and the respective activity assayed in one of many available assays. In another example, a fragment of an antibody can comprise a modified antibody wherein at least one amino acid has been substituted for the naturally occurring amino acid at a specific position, and a portion of either amino terminal or carboxy terminal amino acids, or even an internal region of the antibody, has been replaced with a polypeptide fragment or other moiety, such as biotin, which can facilitate in the purification of the modified antibody. For example, a modified antibody can be fused to a maltose binding protein, through either peptide chemistry of cloning the respective nucleic acids encoding the two polypeptide fragments into an expression vector such that the expression of the coding region results in a hybrid polypeptide. The hybrid polypeptide can be affinity purified by passing it over an amylose affinity column, and the modified antibody receptor can then be separated from the maltose binding region by cleaving the hybrid polypeptide with the specific protease factor Xa. (See, for example, New England Biolabs Product Catalog, 1996, pg.
164.).
Similar purification procedures are available for isolating hybrid proteins from eukaryotic cells as well.
The fragments, whether attached to other sequences, can also include insertions, deletions, substitutions, or other selected modifications of particular regions or specific amino acids residues, provided the activity of the fragment is not significantly altered or impaired compared to the nonmodified antibody or antibody fragment. These modifications can provide for some additional property, such as to remove or add amino acids capable of disulfide bonding, to increase its bio-longevity, to alter its secretory characteristics, etc. In any case, the fragment must possess a bioactive property, such as binding activity, regulation of binding at the binding domain, etc.
Functional or active regions of the antibody may be identified by mutagenesis of a specific region of the protein, followed by expression and testing of the expressed polypeptide. Such methods are readily apparent to a skilled practitioner in the art and can include site-specific mutagenesis of the nucleic acid encoding the antigen. (Zoller MJ et al. Nucl. Acids Res. 10:6487-500 (1982).
As used herein, the phrase "specific binding" or "selective binding" refers to a binding reaction which is determinative of the presence of the FcRH in a heterogeneous population of proteins and other biologics. Thus, under designated conditions, the antibodies or fragments thereof of the present invention bind to a particular FcRH (e.g., human FcRH 1 or any variant thereof), fragment, or variant thereof and do not bind in a significant amount to other proteins (e.g., human FcRH 2, 3, 4, 5, or 6), present in the subject. 'The absence of binding in the present invention is concisderd to be binding that is less than 1.5 times background (i.e., the level of non-specific binding or slightly above non-specific binding levels), Selective binding to an antibody under such conditions may require an antibody that is selected for its specificity for a particular protein, variant, or fragment. In one embodiment the purified antibody selectively binds to the FcRH comprising a cytoplasmic region with more than 107 or less than 104 amino acids, a transmembrane region, and an extracellular region. More specifically, the antibody in alternative embodiments selectively binds FcRHl but not FcRH2-6; selectively binds FcRH2 but not 1 or 3-6; selectively binds FcRH3 but not FcRHl-2 or 4-6; selectively binds FcRH6 but not 1-5. Thus, as one embodiment, the antibody selectively binds a polypeptide comprising the amino acid sequence of SEQ ID NO:1, 21, or 2, or a subset thereof, but not to polypeptides comprising the amino acid of SEQ ID N0:3, 22, 4, 5, 23, 24, 6, 25, 26, 27, 28, or a subset thereof. In another embodiment the purified antibody binds to the FcRH comprising the amino acid sequence of SEQ ID N0:3, 22, or 4, but not to the FcRH comprising the amino acid of SEQ ID NO:1, 21, 2, 5, 23, 24, 6, 25, 26, 27, or 28.
In yet another embodiment, the purified antibody that binds to the FcRH
comprising the amino acid sequence of SEQ ID NO:S, 23, 24, or 6, but not to the FcRH
comprising the amino acid of SEQ m NO:1, 21, 2, 3, 22, 4, 26, 27, 28. Similarly, the antibodies of the present invention may bind only moFcRHl, but not moFcRH 2 or moFcRH3; may bind only FcRH2 and not FcRHl or FcRH3, and may bind only FcRH3 and not FcRHl or FcRH2.
In certain embodiments, the antibody binds the extracellular region of one or more FcRHs and in other embodiments the antibody binds the cytoplasmic region of one or more FcRHs. In other embodiments the antibody may selectively bind one isoform of a FcRH. For example, the antibody may bind a polypeptide having the amino acid sequence of SEQ 117 N0:23 but not the SEQ ID N0:24 or vice versa.
Furthermore, the antibody can bind to moFcRHl having the amino acid sequence of SEQ LD N0:70, but not to a moFcRHl having amino acid sequence of SEQ ID N0:68.
The antibody may selectively bind a moFcRH2 with a transmembrane region (e.g., having amino acid sequence of SEQ ID N0:73), but not bind to a moFcRH2 lacking a transmembrane region (e.g., having the amino acid sequence of 77). Optionally the antibody of the invention can selectively bind moFcRH but not human, or vice versa.
A variety of immunoassay formats may be used to select antibodies that selectively bind with a particular protein, variant, or fragment. For example, solid-phase ELISA immunoassays are routinely used to select antibodies selectively immunoreactive with a protein, variant, or fragment thereof. See Harlow and Lane.
Antibodies, A Laboratory Manual. Cold Spring Harbor Publications, New York, (1988), for a description of immunoassay formats and conditions that could be used to determine selective binding. The binding affinity of a monoclonal antibody can, for example, be determined by the Scatchard analysis of Munson et al., Anal.
Biochem., 107:220 (1980).
The invention also provides an antibody reagent kit comprising the antibody or fragment thereof of the invention and reagents for detecting binding of the antibody or fragment thereof to a ligand. The kit can further comprise containers containing the antibody or fragment thereof of the invention and containers containing the reagents.
Preferably the ligand is a FcRH, variant, or fragment thereof. Particularly, the kit can detect the presence of one or more FcRHs specifically reactive with the antibody or an 5 immunoreactive fragment thereof. The kit can include an antibody bound to a substrate, a secondary antibody reactive with the antigen and a reagent for detecting a reaction of the secondary antibody with the antigen. Such a kit can be an ELISA kit and can comprise the substrate, primary and secondary antibodies when appropriate, and any other necessary reagents such as detectable moieties, enzyme substrates and color 10 reagents as described above. The diagnostic kit can, alternatively, be an immunoblot kit generally comprising the components and reagents described herein.
Alternatively, the kit could be a radioimmunoassay kit, a Western blot assay kit, an immunohistological assay kit, an immunocytochemical assay kit, a dot blot assay kit, a fluorescence polarization assay kit, a scintillation proximity assay kit, a homogeneous 15 time resolved fluorescence assay kit, or a BIAcore analysis kit.
As used throughout, methods of detecting an FcRH or antigen/antibody complexes, including complexes comprising an FcRH and optionally the antibody of the present invention, can comprise an ELISA (competition or sandwich), a radioimmunoassay, a Western blot assay, an immunohistological assay, an 20 immunocytochemical assay, a dot blot assay, a fluorescence polarization assay (Jolley (1981); Jiskoot et al (1991); Seethala et al. (1998); Bicamumpaka et al.
(1998)), a scintillation proximity assay (Amersham Life Science (1995) Proximity News.
Issue 17;
Amersham Life Science (1995) Proximity News. Issue 18; Park et al. (1999)), a homogeneous time-resolved fluorescence assay (Park et al. (1999); Stenroos et al.
25 (1988); Morrison, 1988)), or a BIAcore analysis Fagerstam et al. (1992) Chromatography 597:397-410. Preferably, the antigei~/antibody complex is detestably tagged either directly or indirectly. Any desired tag can be utilized, such as a fluorescent tag, a radiolabel, a magnetic tag, or an enzymatic reaction product.
Optionally, the antibody or fragment is a humanized antibody or a fully human 30 antibody. For example, the antibodies can also be generated in other species and "humanized" for administration to humans. Alternatively, fully human antibodies can also be made by immunizing a mice or other species capable of making a fully human antibody (e.g., mice genetically modified to produce human antibodies), screening clones that bind FcRH. See, e.g., Lonberg and Huszar (1995) Human antibodies from transgenic mice, Int. Rev. Imrnunol. 13:65-93, which is incorporated herein by reference in its entirety for methods of producing fully human antibodies. As used herein, the term "humanized" and "fully human" in relation to antibodies, relate to any antibody which is expected to elicit a therapeutically tolerable weak immunogenic response in a human subject.
Humanized forms of non-human (e.g., murine) antibodies are chimeric immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab' , F(ab')Z, or other antigen-binding subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin. Humanized antibodies include human immunoglobulins (recipient antibody) in which residues from a complementary determining region (CDR) of the recipient are replaced by residues from a CDR
of a non-human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. In some instances, Fv framework residues of the human immunoglobulin are replaced by corresponding non-human residues.
Humanized antibodies may also comprise residues that are found neither in the recipient antibody nor in the imported CDR or framework sequences. In general, the humanized antibody will comprise substantially all or at least one, and typically two, variable domains, in which all or substantially all of the CDR regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR
regions are those of a human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin (Jones et al., Nature, 321:522-(1986); Riechmann et al., Nature, 332:323-327 (1988); and Presta, Curr. Op.
Struct.
Biol., 2:593-596 (1992)).
Methods for humanizing non-human antibodies are well known in the art.
Generally, a humanized antibody has one or more amino acid residues introduced into it from a source that is non-human. These non-human amino acid residues are often referred to as "import" residues, which are typically taken from an "import"
variable domain. Humanization can be essentially performed following the method of Winter and co-workers (Jones et al., Nature, 321:522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al., Science, 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the corresponding sequences of a human antibody.
Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Pat.
No.
4,816,567), wherein substantially less than an intact human variable domain has been substituted by the corresponding sequence from a non-human species. In practice, humanized antibodies are typically human antibodies in which some CDR residues and possibly some FR residues are substituted by residues from analogous sites in rodent antibodies.
The choice of human variable domains, both light and heavy, to be used in making the humanized antibodies is very important in order to reduce antigenicity.
According to the "best-fit" method, the sequence of the variable domain of a rodent antibody is screened against the entire library of known human variable domain sequences. The human sequence which is closest to that of the rodent is then accepted as the human framework (FR) for the humanized antibody (Sims et al., J.
Immunol., 151:2296 (1993) and Chothia et al., J. Mol. Biol., 196:901 (1987)). Another method uses a particular framework derived from the consensus sequence of all human antibodies of a particular subgroup of light or heavy chains. The same framework may be used for several different humanized antibodies (Carte~° et al., Proc. Natl. Acad. Sci.
USA, 89:4285 (1992); Presta et al., J. Immunol., 151:2623 (1993)).
It is further important that antibodies be humanized with retention of high affinity for the antigen and other favorable biological properties. To achieve this goal, according to a preferred method, humanized antibodies are prepared by a process of analysis of the parental sequences and various conceptual humanized products using three dimensional models of the parental and humanized sequences. Three dimensional immunoglobulin models are commonly available and are familiar to those skilled in the art. Computer programs are available which illustrate and display probable three-dimensional conformational structures of selected candidate immunoglobulin sequences. Inspection of these displays permits analysis of the likely role of the residues in the functioning of the candidate immunoglobulin sequence, i.e., the analysis of residues that influence the ability of the candidate immunoglobulin to bind its antigen. In this way, FR residues can be selected and combined from the consensus and import sequence so that the desired antibody characteristic, such as increased affinity for the target antigen(s), is achieved. In general, the CDR residues are directly and most substantially involved in influencing antigen binding (see, WO 94/04679 published 3 Mar. 1994).
Transgenic animals (e.g., mice) that are capable, upon immunization, of producing a full repertoire of human antibodies in the absence of endogenous immunoglobulin production can be employed. For example, it has been described that the homozygous deletion of the antibody heavy chain joining region (J(I~) gene in chimeric and germ-line mutant mice results in complete inhibition of endogenous antibody production. Transfer of the human germ-line immunoglobulin gene array in such germ-line mutant mice will result in the production of human antibodies upon antigen challenge (see, e.g., Jakobovits et al., Proc. Natl. Acad. Sci. USA, 90:2551-255 (1993); Jakobovits et al., Nature, 362:255-258 (1993); Bruggemann et al., Year in Immuno., 7:33 (1993)). Human antibodies can also be produced in phage display libraries (Hoogenboom et al., J. Mol. Biol., 227:381 (1991); Marks et al., J.
Mol. Biol., 222:581 (1991)). The techniques of Cote et al. and Boerner et al. are also available for the preparation of human monoclonal antibodies (Cole et al., Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985); Boerner et al., J. Immunol., 147(1):86-95 (1991)).
In one embodiment, the antibody or fragment thereof is a single chain antibody.
In another embodiment, the antibody or fragment is labeled. Optionally the antibody or fragment is conjugated or fused with a toxin or fragment thereof. Examples of the toxin or toxin moiety include diphtheria, ricin, and modifications thereof.
Dia~,nosis and Treatment The invention provides uses of the reagents described herein in if2 vitro and i~a vivo methods of diagnosing and treating a malignancy of hematopoietic cell lineage or an autoimmune disease in a subject. The reagents of the present invention are also useful in screening for disease manifestations. Such screening may be useful even before the onset of other clinical symptoms and could be used to screening subjects at risk for disease, so that prophylactic treatment can be started before the manifestation of other signs or symptoms.
By "malignancy" is meant a tumor or neoplasm whose cells possess one or more nuclear or cytoplasmic abnormalities, including, for example, high nuclear to cytoplasmic ratio, prominent nucleolar/nucleoli variations, variations in nuclear size, abnormal mitotic figures, or multinucleation. "Malignancies of hematopoietic cell lineage" include, but are not limited to, myelomas, leukemias, lymphomas (Hodgkin's and non-Hodgkin's forms), T-cell malignancies, B-cell malignancies, and lymphosarcomas or other malignancies described in the REAL classification system or the World Health Organization Classification of Hematologic Malignancies. It should be noted that the absence or presence of specific FcRHs can be diagnostic for a particular malignancy of hematopoietic cell linage or can be diagnostic for a particular form of a malignancy (e.g., a specific form of leukemia).
By "inflammatory and autoimmune diseases" illustratively including systemic lupus erythematosus, Hashimoto's disease, rheumatoid arthritis, graft-versus-host disease, Sjogren's syndrome, pernicious anemia, Addison disease, scleroderma, Goodpasture's syndrome, Crohn's disease, autoimmune hemolytic anemia, sterility, myasthenia gravis, multiple sclerosis, Basedow's disease, thrombopenia purpura, insulin-dependent diabetes mellitus, allergy; asthma, atopic disease;
arteriosclerosis;
myocarditis; cardiomyopathy; glomerular nephritis; hypoplastic anemia;
rejection after organ transplantation and numerous malignancies of lung, prostate, liver, ovary, colon, cervix, lymphatic'and breast tissues.
Specifically, the diagnostic methods comprise the steps of contacting a biological sample of the subject with an antibody or nucleic acid of the invention under conditions that allow the antibody to bind to cells of hematopoietic cell lineage or allow the nucleic acid to hybridize, preferably under stringent conditions, with nucleic acids of the biological sample; and detecting the amount or pattern of binding.
Changes in the amount or pattern of binding as compared to binding in a control sample indicate a malignancy or an inflammatory or autoimmune disease.
In various embodiments, the antibody used in the diagnostic method can selectively bind with an FcRH having the amino acid sequence of SEQ ID NO:1, 21, 2, 3, 22, 4, 5, 24, or 6.
The detecting step of the diagnostic method can be selected from methods routine in the art. For example, the detection step can be performed in vivo using a noninvasive medical technique such as radiography, fluoroscopy, sonography, imaging techniques such as magnetic resonance imaging, and the like. In vitro detection methods can be used to detect bound antibody or fragment thereof in an ELISA, RIA, immunohistochemically, FACS, IHC, FISH, or similar assays.
As used throughout, "biological sample" refers to a sample from any organism.
The sample can be, but is not limited to, peripheral blood, plasma, urine, saliva, gastric secretion, feces, bone marrow specimens, primary tumors, embedded tissue sections, frozen tissue sections, cell preparations, cytological preparations, exfoliate samples (e.g., sputum), fine needle aspirations, amnion cells, fresh tissue, dry tissue, and cultured cells or tissue. It is further contemplated that the biological sample of this invention can also be whole cells or cell organelles (e.g., nuclei). The sample can be unfixed or fixed according to standard protocols widely available in the art and can also be embedded in a suitable medium for preparation of the sample. For example, the sample can be embedded in paraffin or other suitable medium (e.g., epoxy or acrylamide) to facilitate preparation of the biological specimen for the detection 10 methods of this invention.
The invention also provides a method of treating a malignancy of hematopoietic cell lineage or an inflammatory or autoimrnune disease in a subject, comprising contacting the subject's malignant cells or inflammatory cells with a therapeutically effective amount of a reagent (e.g., an antibody or nucleic acid) or a therapeutic 15 composition of a reagent of the invention. The contacting step can occur by administration of the reagent or composition using any number of means available in the art. Typically, the reagent or composition is administered to the subject transdermally (e.g., by a transdermal patch or a topically applied cream, ointment, or the like), orally, subcutaneously, intrapulmonaryily, transmucosally, intraperitoneally, 20 intrauterinely, sublingually, intrathecally, intramuscularly, intraarticularly, etc. using conventional methods. In addition, the reagent or composition can be administered via injectable depot routes such as by using 1-, 3-, or 6-month depot injectable or biodegrable materials and methods.
Regardless of the route of administration, the amount of the reagent 25 administered or the schedule for administration will vary among individuals based on age, size, weight, condition to be treated, mode of administration, and the severity of the condition. One skilled in the art will realize that dosages are best optimized by the practicing physician and methods for determining dosage are described, for example in Remington's Pharmaceutical Science, latest edition. Guidance in selecting appropriate 30 doses for antibodies is found in the literature on therapeutic uses of antibodies, e.g., Handbook of Monoclonal Antibodies, Ferrone et al., eds., Noges Publications, Park Ridge, N.J., (1985) ch. 22 and pp. 303-357; Smith et al., Antibodies in Human Diagnosis and Therapy, Haber et al., eds., Raven Press, New York (1977) pp. 365-389. A
typical dose of the antibody used alone might range from about 1 ~g/kg to up to 100 mg/kg of body weight or more per day, and preferably 1 ~g/kg to up to 1 mg/kg, depending on the factors mentioned above. An intravenous injection of the antibody or fragment thereof, for example, could be long-lg of antibody or fragment thereof, and preferably long-lmg depending on the factors mentioned above. For local injection, a typical quantity of antibody ranges from lpg tolmg . Preferably, the local injection would be at an antibody concentration of 1-100 ~g/ml, and preferably 1-20 ~g/ml.
The nucleic acids of the invention can delivered to cells in a variety of ways.
For example, if the nucleic acid of this invention is delivered to the cells of a subject in an adenovirus vector, the dosage for administration of adenovirus to humans can range from about 10~ to 109 plaque forming units (pfu) per injection, but can be as high as 1012 pfu per injection. Ideally, a subject will receive a single injection. If additional injections are necessary, they can be repeated at six month intervals for an indefinite period and/or until the efficacy of the treatment has been established. As set forth herein, the efficacy of treatment can be determined by evaluating the clinical parameters.
The exact amount of the nucleic acid or vector required will vary as described above. Thus, it is not possible to specify an exact amount for every nucleic acid or vector. An appropriate amount can be determined by one of ordinary skill in the art using only routine experimentation given the teachings herein.
The invention further provides a therapeutic composition of the reagent of the invention. Such a composition typically contains from about 0.1 to 90% by weight (such as 1 to 20% or 1 to 10%) of a therapeutic agent of the invention in a pharmaceutically acceptable carrier. Solid formulations of the compositions for oral administration may contain suitable earners or excipients, such as corn starch, gelatin, lactose, acacia, sucrose, microcrystalline cellulose, kaolin, mannitol, dicalcium phosphate, calcium carbonate, sodium chloride, or alginic acid. Disintegrators that can be used include, without limitation, microcrystalline cellulose, corn starch, sodium starch, glycolate, and alginic acid. Tablet binders that may be used include acacia, methylcellulose, sodium carboxymethylcellulose, polyvinylpyrrolindone (PovidoneTM), hydroxypropyl methylcellulose, sucrose, starch, and ethylcellulose. Lubricants that may be used include magnesium stearates, stearic acid, silicone fluid, talc, waxes, oils, and colloidal silica.
Liquid formulations for oral administration prepared in water or other aqueous vehicles may contain various suspending agents such as methylcellulose, alginates, tragacanth, pectin, kelgin, carrageenan, acacia, polyvinylpyrrolidone, and polyvinyl alcohol. The liquid formulations may also include solutions, emulsions, syrups and elixirs containing, together with the active cornpound(s), wetting agents, sweeteners, and coloring and flavoring agents. Various liquid and powder formulations can be prepared by conventional methods for inhalation into the lungs of the mammal to be treated.
Injectable formulations of the compositions may contain various carriers such as vegetable oils, dimethylacetamide, dimethylformamide, ethyl lactate, ethyl carbonate, isopropyl myristate, ethanol, polyols (glycerol, propylene glycol, liquid polyethylene glycol, and the like). For intravenous injections, water soluble version of the compounds may be administered by the drip method, whereby a pharmaceutical formulation containing the antifungal agent and a physiologically acceptable excipient is infused. Physiologically acceptable excipients may include, for example, 5%
dextrose, 0.9% saline, Ringer's solution or other suitable excipients.
Intramuscular preparations, e.g., a sterile formulation of a suitable soluble salt form of the compounds, can be dissolved and administered in a pharmaceutical excipient such as water-for-injection, 0.9% saline, or 5% glucose solution. A suitable insoluble form of the compound may be prepared and administered as a suspension in an aqueous base or a pharmaceutically acceptable oil base, such as an ester of a long chain fatty acid (e.g., ethy; oleate).
A topical semi-solid ointment formulation typically contains a concentration of the active ingredient from about 1 to 20%, e.g., 5 to 10%, in a Garner such as a pharmaceutical cream base. Various formulations for topical use include drops, tinctures, lotions, creams, solutions, and ointments containing the active ingredient and various supports and vehicles. The optimal percentage of the therapeutic agent in each pharmaceutical formulation varies according to the formulation itself and the therapeutic effect desired in the specific pathologies and correlated therapeutic regimens.
The effectiveness of the method of treatment can be assessed by monitoring the patient for known signs or symptoms of the conditions being treated. For example, in the treatment of a malignancy of hematopoietic cell lineage, the reduction or stabilization of the number of abnormally proliferative cells would indicate successful treatment. In the treatment of arthritis, for example, a reduction in the amount of joint inflammation would indicate successful treatment. Thus, by"therapeutically effective"
is meant an amount that provides the desired treatment effect.
The invention further provides a method of modulating a humoral immune response in a subject, comprising administering to the subject an isolated FcRH, an antibody, or nucleic acid of the invention. By "modulation" is meant either up-regulating or down-regulating. Thus, in the case of an allergic response, one skilled in the art would choose to down-regulate the humoral immune response. In the case of exposure of a subject to an infectious agent (e.g., a viral or bacterial agent), one skilled in the art would choose to upregulate the humoral antibody response.
Experimental The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how the compounds, compositions, articles, devices and/or methods claimed herein are made and evaluated, and are intended to be purely exemplary of the invention and are not intended to limit the scope of what the inventors regard as their invention. Efforts have been made to ensure accuracy with respect to numbers (e.g., amounts, temperature, etc.), but some errors and deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, temperature is in °C or is at ambient temperature, and pressure is at or near atmospheric.
Identification of FcRHl FcRH2, and FcRH3 In order to isolation of FcRH cDNA Clones, rapid amplification of cDNA ends (R.ACE)-PCR was performed by using a Marathon-Ready human lymph node cDNA
library (CLONTECH). Gene-specific primers were as follows: FcRH3, forward 5'-TGAGTCTCAGGGTCACAGTTCCG-3' (SEQ H~ N0:41) and reverse 5'-GCTCTTGAACTTGGATATTTAGGGGT-3' (SEQ ID NO:42); FcRH2, forward 5'-CCAGTGTATGTCAATGTGGGCTCTG-3' (SEQ ID N0:43) and reverse 5'-CGTTGAAAGAGCTCTTGGACTTTTATC-3' (SEQ ID N0:44); and FcRHl, forward 5'-GCCTCAAAAGAAAAATAGGAAGACGTT-3' (SEQ H~ NO:45) and reverse 5'-AAGCTCACATCAGCGACAGGGAC-3' (SEQ ID N0:46). RACE products were subjected to a second round of nested PCR and visualized by agarose gel electrophoresis and ethidium bromide staining.
Primers used in end-to-end amplification to generate full-length cDNAs were as follows: FcRH3, forward 5'-TCTTGGAGATAAGTCGGGCTTT-3' (SEQ ID N0:47) and reverse 5'-ATCCTGCAGCCCAGCCTCGTAGGAG-3' (SEQ ID N0:48); FcRH2, forward 5'-GGTCCTCATGCTGCTGTGGTCATT-3' (SEQ ID N0:49)and reverse 5'-GCTGTTGATCTTCGCTTCTGATTC-3' (SEQ ID NO:50); and FcRHl, forward 5'-ATGCTGCCGAGGCTGTTGCTGTTG3' (SEQ ID NO:51) and reverse 5'-CATAGCATCTTCATAGTCCACATC-3' (SEQ ID N0:52). Each amplification reaction underwent initial denaturation of 94°C for 30 s followed by 30 cycles of denaturation at 94°C for 5 s and annealing at 68°C for 4 min, and final extension at 72°C for 6 min.
PCR products were ligated into the pCR2.1 TOPO T/A vector (Invitrogen).
Inserts were DNA-sequenced on both strands by the dideoxy chain termination method using Thermo Sequenase (Amersham Pharmacia) and an automated sequencer (Li-Cor, Lincoln, NE). Nucleotide and amino acid sequence alignment was analyzed with a DNASTAR (Madison, WI) software package, and homology searches were performed by using BLAST (Altschul, S. F. et al. (1990) J. Mol. Biol. 215, 403-410).
RNA blot analysis was subsequently performed. Northern blots (CLONTECH) were hybridized with 32P-dCTP-labeled probes: a 528-by EcoRI fragment corresponding to the 5' untranslated (UT)-EC1 regions of the FcRH3 cDNA, a 200-by PCR product corresponding to a portion of the 3' UT region of the FcRH2 cDNA, and a 257-by PCR product corresponding to a portion of the 3' UT region of the FcRHl cDNA. Membranes were hybridized for 1 h at 65°C, washed, and exposed to x-ray film (Kubagawa, H. et al. (1997) Proc. Natl. Acad. Sci. USA 94, 5261-5266).
Reverse transcription (RT)-PCR was performed. Human tonsillar cells, obtained with Institutional Review Board approval, were separated into CD19+ and CD19 subpopulations by magnetic cell sorting (Miltenyi Biotec, Auburn, CA). Viable CD19+
cells were stained with FITC-labeled anti-CD38 (Immunotech, Westbrook, ME) and phycoerythrin-labeled anti-IgD mAbs (Southern Biotechnology Associates) before sorting cells with a FACStarPlus instrument (Becton Dickinson) into Trizol reagent (Life Technologies, Grand Island, N~ for RNA isolation. Total cellular RNA was primed with random hexamers and oligo(dT) primers and reverse-transcribed with Superscript II (Life Technologies) into single-stranded cDNA. RT-PCR was performed by using RNA from tonsillar B cells and cell lines, with GIBCOIBRL Taq polymerase (Life Technologies). The following gene-specific primer pairs were used in the RT-5 PCR analysis of FcRHl-5 expresion in cell lines and tonsillar B cell subpopulations:
FcRHl forward, 5'-CTC AAC TTC ACA GTG CCT ACT GGG-3' (SEQ ff~ N0:53) and reverse, 5'-TCC TGC AGA GTC ACT AAC CTT GAG-3' (SEQ ID N0:54);
FcRH2 forward, 5'-CCA GTG TAT GTC AAT GTG GGC TCT G (SEQ ID NO:55) and reverse, 5'-CAT TCT TCC CTC AAA TCT TTA CAC-3' (SEQ ID N0:56); FcRH3 10 forward, 5'-CAG CAC GTG GAT TCG AGT CAC-3' (SEQ ID N0:57) and reverse, 5'-CAG ATC TGG GAA TAA ATC GGG TTG-3' (SEQ ID N0:58) FcRH4 forward, 5'-TCT TCA GAG ATG GCG AGG TCA-3' (SEQ ID N0:59) and reverse, 5'-TTT TGG
GGT GTA CAT CAA CAT ACA AG-3' (SEQ ID N0:60); and FcRH forward, 5'-TGT
TGC CCT GTT TCT TCC AAT ACA-3' (SEQ ID N0:61) and reverse, 5'-CAG AGT
15 TGG CCG ACC TAC GC-3' (SEQ ID N0:62). Each amplification reaction underwent initial denaturation at 94° for 5 min followed by 35 cycles of denaturation at 94° for 30 s, annealing at 60° for 30 s, extension at 72° for 1 min, and final extension at 72° for 7 min. Amplified products were visualized in 1% agarose gels containing ethidium bromide and documented with the Bio-Rad Fluor-S Imager.
20 The following human cell lines were used: REH and Nalm 16 pro-B cell lines (Korsmeyer, S. J. et al. (1983) J. Clin. Invest. 71, 301-313); 697, 207, and OBS pre-B
cell lines (Findley, H. W. et al. (1982) Blood 60, 1305-1309; Martin, D. et al. (1991) J.
Exp. Med. 173, 639-645); Ramos, Daudi, and Raji B cell lines (Pulvertaft, R.
J. V.
(1964) Lancet 1, 238-240; Klein, E. et al. (1968) Cancer Res. 28, 1300-1310;
Klein, G.
25 et al. (1975) Intervirology 5, 319-33431-33); THP-1 and U937 monocytoid cell lines, HL-60 promyelocytic and KG-1 myelocytic cell lines, Jurkat T cell line and the erythroid cell line (American Type Culture Collection).
A consensus sequence was generated that corresponds to the GenBank-derived amino terminal sequences of the second Ig-like domains of 30 FcR (FcyRI and Fc~yRII/III) and the third Ig-like domain of the polymeric Ig receptor:GEPIXLRCHSWK.DKXLXKVTYXQNGKAXI~F'FH (SEQ ID N0:63). A
search of the National Center for Biotechnology Information protein database with this sequence identified two overlapping human genomic bacterial artificial chromosome (BAC) clones, AL135929 and AL356276, which are located at 1q21.2-22. The second clone contained three putative Ig superfamily genes encoding complementary amino acid sequences that were designated FcRHl, FcRH2, and FcRH3. See Figure 1. The predicted amino acid sequences of these gene segments shared 23-57% identity with each other and 14-28% identity with human Fc~yRI (CD64). Further analysis of the FcRH locus led to the identification of two additional genes (FcRH4, and FcRHS) and one pseudogene (FcRH4yr), immediately centromeric of FcRHl-3, two of which have recently been described as IRTAl (FcRH4) and IRTA2 (FcRHS) (Hatzivassiliou, G.
et al. (2001) Immunity 14, 277-289).
To determine whether these genes are expressed by lymphocytes, the predicted amino acid sequences of their protein products were used to search the Lymphochip expressed sequence tag database with the TBLASTN algorithm (Alizadeh, A. A. et al.
(2000) Nature (London) 403, 503-511). Two expressed sequence tags (AA505046 and AA282433) were identified that share complete identity over 23 amino acids in their translated ORFs with the N terminus of FcRHI . Lymphochip microarray data analysis indicated that these expressed sequence tags are expressed at relatively high levels in peripheral lymphoid tissues, including the lymph nodes, tonsils, resting peripheral B
cells, and normal germinal center (GC) B cells. Among the different lymphoid malignancies, their expression proved to be highest in chronic lymphocytic leulcemias, follicular lymphomas, and some diffuse large cell lymphomas of B lineage.
FcRHl, FcRH2, and FcRH3 cDNAs were isolated by RACE-PCR from a human lymph node cDNA library in both 5' and 3' directions. Full-length cDNAs of the coding regions for FcRHl, FcRH2, and FcRH3 were obtained by end-to-end PCR
using unique primers generated from the cDNA sequences delineated for the 5' UT and 3'UT
regions. Southern blot analysis of human genomic DNA digested with Bar~zHI, EcoRI, or Hin.dIII using cDNA probes specific for the 3' UT regions of each cDNA revealed either one or two hybridizing fragments, suggesting that FcRHl, FcRH2, and FcRH3 are encoded by single genes. Analysis of full-length cDNA
sequences indicated that FcRHl, FcRH2, and FcRH3 have ORFs of 1,287 bp, 1,524 bp, and 2,202 bp, respectively, and encode type I transmembrane proteins of 429 aa, 508 aa, and 734 aa, respectively. Based on predicted consensus signal peptide cleavage sites (Von Heijne, G. (1986) Nucleic Acid Res. 14, 4683-4690; Nielsen, H. (1997) Protein Eng. 10, 1-6), the relative core peptide molecular masses were estimated as 45,158 for FcRHl, 53,407 for FcRH2, and 78,849 for FcRH3. These type I transmembrane proteins possess 3-6 extracellular C2 (Williams, A. F. & Barclay, A. N. (1988) Annu.
Rev. Immunol. 6, 381-405; Bork, P. et al. (1994) J. Mol. Biol. 242, 309-320;
Vaughn, D. E. & Bjorkman, P. J. (1996) Neuron 16, 261-273) type Ig-like domains with 3-potential N-linked glycosylation sites, uncharged transmembrane segments, and relatively long cytoplasmic tails containing consensus motifs for ITIMs and/or ITAMs.
See Fig. 2A.
Multiple alignment analysis of the translated cDNAs, using FcRH3 as the index sequence of comparison, indicates that FcRHl, FcRH2, and FcRH3 have highly conserved hydrophobic signal peptides and corresponding Ig-like extracellular domains (Fig. 2B). Their hydrophobic transmembrane (uncharged with the exception of FcRHl which includes an acidic domain) domains (Sonnhammer, E. L. L. et al. (1998) in A
Hidden Markov Model for Predicting Transmembrane Helices in Protein Sequences, eds. Glasgow, J., Littlejohn, T., Major, F., Lathrop, R., Sankoff, D. &
Sensen, C. (Am.
Assoc. for Artificial Intelligence, Menlo Park, CA), pp. 175-182) are also well conserved, but their cytoplasmic domains are not. FcRHl has a long cytoplasmic tail containing three potential ITAMs, the first and third of which fit the consensus sequence (E/D)-X-X-Y-X-X-(L/I)-X~_8-Y-X-X-(L/I) (SEQ ID N0:64, with six amino acid between the consensus sequences; SEQ ID N0:65, with seven amino acid residues between the consensus sequences; and SEQ ID NO:66, with eight amino acid residues between the consensus sequences), whereas, the second has only one tyrosine residue.
The shorter cytoplasmic domain of FcRH2 contains one potential ITAM and two ITIM
consensus sequences (I/V/L/S)-X-Y-X-X-(L/V) (SEQ ID N0:67) separated by 22 amino acids. FcRH3 has the longest cytoplasmic tail. It contains one potential ITAM, one ITIM, and another potential ITAM that also has a single tyrosine residue.
An RNA blot analysis with gene-specific probes was performed on 16 human tissues, including six primary or secondary lymphoid tissues. RNA blots were analyzed with discriminating a32P-dCTP-labeled probes generated from the respective FcRH
cDNAs. The following probes were used: (Top) a PCR-generated, 257-by probe specific to the 3' UT region of FcRHl; (Middle) a PCR-generated, 290-by probe corresponding to the 3' UT region of FcRH2; and (Bottom) a 528-by EcoRI-digested fragment of the 5' end of the FcRH3 cDNA corresponding to its 5' UT region, S
l, S2, and EC 1 domains. The relative mRNA abundance was indicated by (3-actin probe.
All three FcRH gene probes hybridized with transcripts in the secondary lymphoid organs, spleen and lymph node. An FcRHl-specific probe hybridized with spleen and lymph node transcripts of about 3.5 kb and about 6.0 kb. Additional hybridization bands of about 0.7 kb and about 1.5 kb were observed for heart, skeletal muscle, kidney, liver, and, in less abundance, placental tissue. Larger transcripts also were seen in skeletal muscle (about 6.0 kb) and in kidney and placenta (about 4.4 kb). An FcRH2-specific probe hybridized to about 3.0-kb, about 4.4-kb, and about 5.5-kb transcripts most abundantly in spleen and lymph node. A transcript of approximately 2.4-kb was notable in the kidney. An FcRH3 probe hybridized with about 3.5-kb, about 5.5-kb, and about 7.0-kb transcripts chiefly in spleen and lymph node. These also were seen, albeit in lesser abundance, in peripheral blood lymphocytes, thymus, and bone marrow samples.
Additionally, a unique transcript of about 1.35 kb was evident in skeletal muscle. These results indicated expression of FcRHl, FcRH2, and FcRH3 in peripheral lymphoid organs, whereas tissue specific differences in alternative splicing or polyadenylation were suggested by the differential expression of transcripts with variable size in nonlymphoid tissues. RTPCR analysis to date of non-lymphoid tissue skeletal muscle, however, does not reveal transcripts despite the Northern analysis results.
When FcRH expression was examined by RT-PCR analysis of cell lines representing different hematopoietic lineages, FcRHI, FcRH2, and FcRH3 expression was found in every mature B cell line tested (Table 2). FcRH2 and FcRH3 expression was limited to the mature B cell lines and not seen in the other types of cells examined.
In contrast, FcRHl expression was seen in pro-B, T, and myeloid cell lines, although not in an erythroid cell line.
Table 2. Expression of FcRH transcripts in human B cell lines Cell Type Cell lisae FcRIIl FcRH2 FcRH3 Pro-B ~ REH + - -Nahn 16 + -Pre-B ~ 697 - - -OBS - -B ~ Ramos + + +
Daudi + + +
Raji + + +
T Jurkat + - -Monocytic THP-1 + - -Myelomonocytic LT937 + -Promyelocytic HL-60 + - -Myelocytic KG-1 + - -Erythroid K562 - -FcRHl, FcRH2, and FcRH3 expression in cell lines was determined by RT-PCR.
RT-PCR analysis of sorted populations of peripheral blood cells indicated that FcRHl, FcRH2, FcRH3, and FcRHS are expressed at relatively high levels in CD19+ B
cells, whereas FcRH4 was expressed at only trace levels. FcRH3 expression was observed in CD3+ T cells whereas transcripts of FcRHl were barely detectable.
FcRHl expression also was observed in circulating granulocytes.
To refine the analysis of FcRH expression in secondary lymphoid tissues, tonsillar lymphocyte subpopulations were isolated. The five discrete subpopulations of B lineage cells, which can be distinguished by their differential expression of cell surface IgD and CD38, represent different stages in B cell differentiation:
follicular mantle (IgD+CD38), pre-GC (IgD+CD38+), GC (IgDCD38+), memory (IgDCD38), and mature plasma cells (CD382+) (Pascual, V. (1994) J. Exp. Med. 180: 329-339). RT-PCR analysis of FcRHl-5 expression in tonsillar B cell subpopulations was performed.
Viable cells were magnetically sorted into CD19- non-B cells and CD19+ B
cells. The latter were stained with anti-IgD and anti-CD38 mAbs, and the five subpopulations indicated (CD38- IgD-, CD38- IgD+, CD38+ IgD+, CD38+ IgD-, and CD382+) were sorted by flow cytometry. RT-PCR analysis of FcRH transcripts in non-B cells and the B cell subpopulations was also performed. After cDNA preparation, PCR
amplification was performed on the equivalent template of approximately 10 k cells.
Glyceraldehyde-3-phosphate dehydrogenase (GADPH) was amplified as a positive control.
RT-PCR analysis indicated little or no expression of FcRH transcripts in the non-B lineage CD19- cells, most of which are T cells. However, CD19+
subpopulations displayed coordinate expression of FcRHl, FcRH2, and FcRH3 transcripts in follicular mantle, naive, GC, and memory B cell subpopulations, but yielded no evidence of FcRH transcripts in pre-GC B cells or plasma cells. In contrast, FcRH4 transcripts were restricted to the follicular mantle and memory B cells, whereas FcRHS
expression extended to mature plasma cells.
The relationship between the five FcRHs was examined by comparing their full-length, extracellular, and individual Ig-like domain amino acid sequences.
This analysis, which included a recently identified mouse FcRH ortholog (moFeRH) and members of the FcR family, used the CLUSTAL method algorithm (Higgins, D. G. &
Sharp, P. M. (1989) Comput. Appl. Biosci. 5, 151-153). Comparison of the full-length 10 sequences of other FcRH family members with FcRH3 indicated 40-47%
identity. By comparison, the degree of FcRH3 homology with the moFcRH was found to be 35%
and 21-24% with FcR members residing on chromosome 1, FcyRI, FcyRII, Fc~yRIII, and FcsRI. A lower level of amino acid identity (14%) was observed for the chromosome 19 LRC member, FcaR. A slightly higher degree of extracellular homology was 15 evident. Pairwise analysis of the individual Ig-like subunits indicated conservation in membrane-distal to membrane-proximal ordering of extracellular domain composition among family members. Although similar Ig domain subunits were shared among family members, the individual receptors were found to be composed of unique domain combinations. The extracellular domain configuration of the moFcRH most closely 20 resembled that of FcRH2, with which it has 46% identity. The extended pairwise comparison of the FcRH family with known FcRs suggested the conservation of these Ig-like domains to some degree throughout the greater family. The resemblance is particularly evident in the FcRH3 membrane-distal domains that correspond to the three FcyRI domains and the two domains of FcYRII, FcyRIII, and the FcR y-chain.
This 25 analysis suggests the ancestral occurrence of differential duplication and diversification of the individual Ig-like subunits in the respective FcRH family members. The data also indicate that the FcRHs are more similar to their FcR neighbors on chromosome 1 than to their FcR relative on chromosome 19.
The genomic sequence analysis of relevant chromosome 1 q21 BAC
30 clones indicated that the entire FcRH locus spans 300 kb. The FcRH genes lie in the same transcriptional orientation toward the centomere. Exon-intron boundaries were characterized by sequence comparison of their respective cDNA clones and the AG/GT
rule. The FcRHl gene consists of 11 exons and 10 introns spanning about 28 kb.
The first exon, 5' UT/S 1, encodes the 5' UT region, the ATG translation initiation colon, and the first half of a split signal peptide. S2, the second exon, is separated from 5' UT/S1 by a long intron of 12.9 kb and, like the neighboring FcRs, is 21 by in length (van de Winkel, J. G. & Capel, P. J. (1993) Immunol. Today 14, 221; Kulczycki, A., Jr. et al. (1990) Proc. Natl. Acad. Sci. USA 87, 2856-2860; Pang, J.
et al. (1994) J. Immunol. 151, 6166-6174.). The extracellular region is encoded by three closely clustered exons, ECl-EC3, that code for the three Ig-like domains. The membrane-proximal, transmembrane, and the proximal portion of the cytoplasmic domain are encoded by a single sixth exon, TM. The cytoplasmic tail is encoded by five exons, CY1-CYS, and the CYS also encodes the beginning of the 3' UT region.
FcRH2 contains 12 exons and 11 introns that span 30 kb. It also contains two exons that encode a split signal peptide, the first of which, 5'UT/S1, includes the 5' UT
region, the ATG translation initiation colon, and first half of the signal peptide. The second exon, S2, is 21 by in length. Exons 3-6 encode the four extracellular domains, EC1-EC4. The seventh exon encodes the membrane-proximal, transmembrane, and the proximal portion of the cytoplasmic domain. The FcRH2 cytoplasmic tail is encoded by five exons, CY1-CYS, the last exon of which includes the termination of the ORF and beginning of the 3' UT region.
The FcRH3 gene consists of 16 exons and 15 introns that span about 24 kb.
Unlike FcRHl and FcRH2, its 5' UT region is encoded by two exons, 5' UTl and a second, 5'UT2/S 1, that also encodes the ATG translation initiation colon and the beginning of the split signal peptide. The third exon, S2, is also 21 by in length.
Extracellular domains encoded by six exons, EC1-EC6, are followed by exon 10 that encodes the membrane-proximal, transmembrane, and the proximal portion of the cytoplasmic domain. The cytoplasmic tail is encoded by five exons, CY1-CYS;
the last contains the beginning of the 3' UT region.
Identification of HuFcRH6 FcRH6 is located in the midst of the classical FcRs at 1q21-23. Its genomic structure indicates, like the classical FcRs and FcRHI-5, a split hydrophobic signal peptide encoded by two exons the second of which is 2lbp.
FcRH6 was characterized using the methods described in Example 1. A
composite analysis of Ig-like domains for relatedness with the other huFcRHs was performed. See Figure xxx. Sequence analysis of huFcRH6 indicates its type I
transmembrane form contains a consensus motif for a single ITAM, or a single or two ITIM's.
Initial RT-PCR analysis of huFcRH6 in human tissues and cell lines (as described in Example 1) reveals transcript expression in normal tonsil and lymph nodes. In cell lines, expression of huFcRH6 was identified in myeloid cell lines THP-1 (monocytic), U937 (myelomonocytic), and KG-1 (myelocytic). Limited expression if any was identified in the 207 pre-B cell line and the Daudi B cell line.
Generation of Transfectants and Antibodies Recombinant constructs for transfection and stable expression of huFcRHl-5 have been generated. The constructs have been ligated into a CMV driven mammalian expression vector with and without green fluorescent protein (GFP) fusion at the carboxyl terminus. Surface expression of huFcRHI and huFcRH3 was detected for both GFP and non-GFP forms by staining with antibody supernatant. The antibody supernatant was derived from hybridomas generated by mice immunized with recombinant extracellular protein of the respective FcRH. The constructs for huFcRH2, 4, and 5 have been detected by green fluorescence as well as surface expression for FcRH4.
Monoclonal antibodies have been generated, including, for example, an antibody that binds FcRHl. The preliminary analysis of FACS staining for FcRHl expression with monoclonal antibody 1-SA3 labeled with a FITC conjugate (mouse anti human FcRHl) in peripheral blood from normal volunteers indicates virtually all CD19+ B cells have huFcRHl expression, as do CD14+ monocytes and CD13+
granulocytes. CD3+ T cells have limited to no expression of FcRHl. Staining of B-CLL
samples from two different patient peripheral blood samples indicates that virtually all CDS+/CD19+ B-CLL cells are positive for the FcRHl 1-SA3 antigen. By western blot analysis of recombinant protein for FcRHl-5 extracellular regions 1-SA3 appears specific for FcRHl. 1-SA3 also stains B cell lines Daudi and Raji.
Identification of MoFcRHl-3 A family of three mouse Fc Receptor Homologs (MoFCRHs) were identified and cloned. Amino acid sequences from the membrane proximal Ig-like domains of huFcRHl-5 were used to identify putative mouse FcRH orthologs in the NCBI or Cetera genomic, EST, and protein databases using the protein BLAST (BLASTP) and the translated nucleotide BLAST (TBLASTN] algorithms, respectively. The location of moFcR family is split between chromosomes 1 and 3 in regions syntenic with human chromosome 1q21-23. See Figure 4. The mo FcRH are located on mouse Ch3. Approximate positions were determined from Genbank, Cetera, and Mouse Genome Informatics databasesContigs of ESTs were generated to determine the putative cDNA sequences.
Genomic organization was determined by comparing cDNA clones generated from RACE PCR with GenBank and Cetera genomic sequences. DNAStar software was used for analysis of exon-intron boundaries which were characterized by sequence comparison and the AG/GT rule. All three genes contain a split signal sequence with a 2lbp S2 exon (exon 2) which is found in all FcR and huFcRH genes on human chromosone 1.
A comparison of tyrosine based motifs in FcRH cytoplasmic tails indicated homology with the huFcRH family. See Figure 5. An analysis of sequence homology conservation is further shown in Figures 6 and 7.
Expression of the moFcRHs in tissue and cell lines was also characterized as described in Example 1. Briefly, RT-PCR was performed on mouse tissues and cell lines with gene specific primers. Viable tissue was placed in TRIzoI reagent for RNA
extraction. After cDNA preparation PCT amplification was performed on equivalent template amounts. Actin was amplified as a positive control. McFcRH3 appears to have preferential expression in cells of B lineage. The results are shown in Tables 3-4.
Table 3: Tissue Distribution of moFcRH Expression TISSUE MoFcRHl MoFcRH2 MoFcRH3 Bone Marrow + + +
Thymus + + +
Spleen + + +
Lymph Node + + +
Peyer's Patches+ + +
Peripheral + + +
Blood Brain + -Liver + + -Heart + -Muscle + - -Kidney + - -Lung + + -Intestine + + +
Testes I + I -Table 4. Expression of moFcRH transcripts in cell lines Cell Type Cell lirasFcRHI FcRH2 FcRH3 Pro-B SCID7 + +/- +
Raw8.1 + + -Pre-B 70Z/3 + + +
BC76 - ' + +
18-81 + + +
Imm. B WEHI-231 + + +
WEHI-279 + + +
B A20 + + +
X16C8.5 + + +
T EL4 + +/- -/+
NKT NKT + +/- -~T 2C12 + +/- -Myeloid WEHI-3 + - -Lymphoid YAC-1 + + -Fibroblast 3T3 + +/- -Expression in cell lines was determined by RT-PCR.
5 The mouse FcReceptor Homologs include secreted or type I transmembrane isolfrms that have unique cytoplasmic tails with potential activation and inhibition motifs. Their chromosomal location, Ig domain homology, and genomic organization indicate the mouse FcReceptor Homologs are orthologs of the huFcRH that have evolved a significant level of diversity. moFcRHl, moFcRH2, and moFcRH3 are 10 predicted to encode secreted or type I transmembrane proteins based on their amino acid sequences. moFcRHl has two secreted isoforms both of which have extracellular (EC) regions of four Ig-like domains with five potential sites for N-linked glycosylation. One isoform is a fusion protein with a type B scavenger receptor domain containing 8 cysteines. moFcRH2 has secreted and type I isoforms containing two Ig-15 like domains with five N-linked glycosylation sites. The type I isoform has an uncharged transmembrane region which the secreted isoform lacks. Both isoforms contain the cytoplasmic portion which is long in the transmembrane form and contains five tyrosines including a consensus sequence for one potential immunoreceptor tyrosine-based activating motif . moFcRH3 contains five Ig-like domains with six potential sites of N-linked glycosylation. Its transmembrane domain is also uncharged and the cytoplasmic region contains one potential ITAM and one potential immunoreceptor tyrosine-based inhibitory motif. The amino acid (aa) length of individual regions and full length (FL) isoforms, as well as approximate molecular weight (MW) in Daltons (Da), is indicated in the structural diagram of Figure ~.
Throughout this application, various publications are referenced. The disclosures of these publications in their entireties are hereby incorporated by reference into this application in order to more fully describe the state of the art to which this invention pertains.
It will be apparent to those skilled in the art that various modifications and variations can be made in the present invention without departing from the scope or spirit of the invention. Other embodiments of the invention will be apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the invention being indicated by the following claims.
SEQUENCE LISTING
<110> The UAB Research Foundation Davis, Randall S.
Cooper, Max D.
<120> MEMBERS OF THE FC RECEPTOR HOMOLOG GENE FAMILY (FCRH1-3, 6), RELATED REAGENTS, AND USES THEREOF
<130> 21085.0037P1 <141> 2003-03-25 <150> US 60/367,667 <151> 2002-03-25 <160> 102 <170> FastSEQ for Windows Version 4.0 <210> 1 <211> 99 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 1 Lys Arg Lys I1e Gly Arg Arg 5er Ala Arg Asp Pro Leu Arg Ser Leu 1 5 l0 15 Pro Ser Pro Leu Pro Gln Glu Phe Thr Tyr Leu Asn Ser Pro Thr Pro Gly Gln Leu Gln Pro Ile Tyr Glu Asn Val Asn Val Val Ser Gly Asp Glu Val Tyr Ser Leu,Ala Tyr Tyr Asn Gln Pro Glu G1n Glu Ser Val Ala Ala Glu Thr Leu Gly Thr His Met Glu Asp Lys Val Ser Leu Asp Ile Tyr Ser Arg Leu Arg Lys Ala Asn Ile Thr Asp Val Asp Tyr Glu Asp Ala Met <210> 2 <211> 413 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 2 Ala Glu Leu Phe Leu Tle Ala Ser Pro Ser His Pro Thr Glu Gly Ser Pro Val Thr Leu Thr Cys Lys Met Pro Phe Leu Gln Ser 5er Asp Ala Gln Phe Gln Phe Cys Phe Phe Arg Asp Thr Arg Ala Leu Gly Pro Gly Trp Ser Ser Ser Pro Lys Leu Gln Ile Ala Ala Met Trp Lys Glu Asp Thr Gly Ser Tyr Trp Cys Glu Ala Gln Thr Met Ala Ser Lys Val Leu Arg Ser Arg Arg Ser Gln Ile Asn Val His Arg Val Pro Val Ala Asp Val Ser Leu Glu Thr Gln Pro Pro Gly Gly Gln Val Met Glu Gly Asp Arg Leu Val Leu I1e Cys Ser Val Ala Met Gly Thr Gly Asp Ile Thr Phe Leu Trp Tyr Lys Gly Ala Val Gly Leu Asn Leu Gln Ser Lys Thr Gln Arg 5er Leu Thr Ala Glu Tyr Glu Ile Pro Ser Val Arg Glu Ser Asp Ala Glu Gln Tyr Tyr Cys Val Ala Glu Asn Gly Tyr Gly Pro Ser Pro Ser Gly Leu Val Ser Ile Thr Val Arg Ile Pro Val Ser Arg Pro Ile Leu Met Leu Arg Ala Pro Arg Ala Gln Ala Ala Val Glu Asp Val Leu Glu Leu His Cys Glu Ala Leu Arg Gly Ser Pro Pro Ile Leu Tyr Trp Phe Tyr His Glu Asp Ile Thr Leu Gly Ser Arg Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Glu Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asn Asn Gly Leu Gly Ala Gln Arg Ser Glu Ala Val Thr Leu Asn Phe Thr Val Pro Thr Gly Ala Arg Ser Asn His Leu Thr Ser Gly Val Ile Glu Gly Leu Leu Ser Thr Leu Gly Pro Ala Thr Val Ala Leu Leu Phe Cys Tyr Gly Leu Lys Arg Lys Ile Gly Arg Arg Ser Ala Arg Asp Pro Leu Arg Ser Leu Pro Ser Pro Leu Pro Gln Glu Phe Thr Tyr Leu Asn Ser Pro Thr Pro Gly Gln Leu Gln Pro Ile Tyr Glu Asn Val Asn Val Val Ser Gly Asp Glu Val Tyr Ser Leu Ala Tyr Tyr Asn G1n Pro Glu Gln Glu Ser Val Ala Ala Glu Thr Leu Gly Thr His Met ~Glu Asp Lys Val Ser Leu Asp Ile Tyr Ser Arg Leu Arg Lys Ala Asn Ile Thr Asp Val Asp Tyr Glu Asp Ala Met <210> 3 <211> 86 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 3 His Lys Ile Ser Gly Glu Ser Ser A1a Thr Asn Glu Pro Arg Gly Ala Ser Arg Pro Asn Pro Gln Glu Phe Thr Tyr Ser 5er Pro Thr Pro Asp Met Glu Glu Leu Gln Pro Val Tyr Val Asn Val G1y Ser Val Asp Val Asp Val Val Tyr Ser Gln Val Trp Ser Met Gln Gln Pro Glu Ser Ser Ala Asn Ile Arg Thr Leu Leu Glu Asn Lys Asp Ser Gln Val Tle Tyr Ser Ser Val Lys Lys Ser <210> 4 <21l> 489 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 4 Leu Thr Leu Val Ala Pro Ser Ser Val Phe Glu Gly Asp Ser Ile Val Leu Lys Cys Gln Gly Glu Gln Asn Trp Lys Tle Gln Lys Met Ala Tyr His Lys Asp Asn Lys Glu Leu Ser Val Phe Lys Lys Phe Ser Asp Phe Leu Tle Gln Ser Ala Val Leu Ser Asp Ser Gly Asn Tyr Phe Cys Ser Thr Lys Gly Gln Leu Phe Leu Trp Asp Lys Thr Ser Asn Ile Val Lys Ile Lys Val Gln Glu Leu Phe Gln Arg Pro Val Leu Thr Ala Ser Ser Phe Gln Pro Ile Glu Gly Gly Pro Val 5er Leu Lys Cys Glu Thr Arg Leu Ser Pro Gln Arg Leu Asp Val Gln Leu Gln Phe Cys Phe Phe Arg Glu Asn Gln Val Leu Gly Ser Gly Trp Ser Ser Ser Pro Glu Leu Gln Ile Ser Ala Val Trp Ser Glu Asp Thr G1y Ser Tyr Trp Cys Lys Ala Glu Thr Va1 Thr His Arg Ile Arg Lys Gln 5er Leu Gln Ser Gln Ile _ 165 170 175 His Val Gln Arg Ile Pro Ile Ser Asn Val Ser Leu Glu Ile Arg Ala Pro Gly Gly Gln Val Thr Glu Gly Gln Lys Leu Ile Leu Leu Cys Ser Val Ala G1y Gly Thr Gly Asn Val Thr Phe Ser Trp Tyr Arg Glu Ala Thr Gly Thr Ser Met G1y Lys Lys Thr G1n Arg Ser Leu Ser Ala Glu Leu Glu Ile Pro Ala Val Lys Glu Ser Asp Ala Gly Lys Tyr Tyr Cys Arg Ala Asp Asn Gly His Val Pro Ile Gln Ser Lys Val Val Asn Ile Pro Val Arg Ile Pro Val Ser Arg Pro Val Leu Thr Leu Arg Ser Pro Gly Ala Gln Ala Ala Val Gly Asp Leu Leu Glu Leu His Cys Glu Ala Leu Arg Gly Ser Pro Pro Tle Leu Tyr Gln Phe Tyr His Glu Asp Val Thr Leu Gly Asn Ser Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Ala Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asn Asn G1y Leu Gly Ala Gln Cys Ser Glu Ala Val Pro Val Ser Ile Ser Gly Pro Asp Gly Tyr Arg Arg Asp Leu Met Thr Ala Gly Val Leu Trp Gly Leu Phe Gly Val Leu Gly Phe Thr Gly Val Ala Leu Leu Leu Tyr Ala Leu Phe His Lys Tle Ser Gly Glu Ser Ser Ala Thr Asn Glu Pro Arg G1y Ala Ser Arg Pro Asn Pro Gln Glu Phe Thr Tyr Ser Ser Pro Thr Pro Asp Met Glu Glu Leu Gln Pro Val Tyr Val Asn Val Gly Ser Val Asp Val Asp Val Val Tyr Ser Gln Val Trp Ser Met Gln Gln Pro Glu Ser Ser Ala Asn Ile Arg Thr Leu Leu Glu Asn Lys Asp Ser Gln Val 21e Tyr Ser Ser Val Lys Lys Ser <210> 5 <211> 140 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 5 His Tyr Ala Arg Ala Arg Arg Lys Pro Gly Gly Leu Ser Ala Thr Gly Thr Ser Ser His Ser Pro Ser Glu Cys Gln Glu Pro Ser Ser Ser Arg Pro Ser Arg Ile Asp Pro Gln G1u Pro Thr His Ser Lys Pro Leu Ala Pro Met Glu Leu Glu Pro Met Tyr Ser Asn Val Asn Pro Gly Asp Ser Asn Pro Ile Tyr Ser Gln Ile Trp Ser Ile Gln His Thr Lys Glu Asn Ser Ala Asn Cys Pro Met Met His Gln Glu His Glu Glu Leu Thr Val Leu Tyr Ser Glu Leu Lys Lys Thr His Pro Asp Asp Ser Ala Gly Glu Ala Ser Ser Arg Gly Arg Ala His Glu Glu Asp Asp Glu Glu Asn Tyr Glu Asn Val Pro Arg Val Leu Leu Ala Ser Asp His <210> 6 <211> 717 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 6 Gly Val Ala Pro Lys Ala Val Leu Leu Leu Asn Pro Pro Trp Ser Thr Ala Phe Lys Gly Glu Lys Val Ala Leu Ile Cys Ser Ser Ile Ser His Ser Leu Ala Gln Gly Asp Thr Tyr Trp Tyr His Asp Glu Lys Leu Leu Lys Ile Lys His Asp Lys Ile Gln Ile Thr Glu Pro Gly Asn Tyr Gln Cys Lys Thr Arg Gly Ser Ser Leu Ser Asp Ala Val His Val Glu Phe Ser Pro Asp Trp Leu Ile Leu Gln Ala Leu His Pro Val Phe Glu Gly Asp Asn Val Ile Leu Arg Cys Gln Gly Lys Asp Asn Lys Asn Thr His Gln Lys Val Tyr Tyr Lys Asp Gly Lys Gln Leu Pro Asn Ser Tyr Asn 1l5 120 125 Leu Glu Lys Ile Thr Val Asn Ser Val Ser Arg Asp Asn Ser Lys Tyr His Cys Thr Ala Tyr Arg Lys Phe Tyr Ile Leu Asp Ile Glu Val Thr Ser Lys Pro Leu Asn Ile Gln Val Gln Glu Leu Phe Leu His Pro Val Leu Arg Ala Ser Ser Ser Thr Pro Ile Glu Gly Ser Pro Met Thr Leu Thr Cys Glu Thr Gln Leu Ser Pro Gln Arg Pro Asp Val Gln Leu Gln Phe Ser Leu Phe Arg Asp Ser Gln Thr Leu Gly Leu Gly Trp Ser Arg Ser Pro Arg Leu Gln Ile Pro Ala Met Trp Thr Glu Asp Ser Gly Ser Tyr Trp Cys G1u Va1 Glu Thr Va1 Thr His Ser Ile Lys Lys Arg Ser Leu Arg Ser Gln Ile Arg Val Gln Arg Val Pro Val Ser Asn Val Asn Leu Glu Ile Arg Pro Thr Gly Gly Gln Leu Ile Glu Gly Glu Asn Met Val Leu Ile Cys Ser Val Ala Gln Gly Ser Gly Thr Val Thr Phe Ser Trp His Lys Glu Gly Arg Val Arg Ser Leu Gly Arg Lys Thr G1n Arg Ser Leu Leu Ala G1u Leu His Val Leu Thr Val Lys Glu Ser Asp Ala Gly Arg Tyr Tyr Cys Ala Ala Asp Asn Val His Ser Pro Ile Leu Ser Thr Trp Ile Arg Val Thr Val Arg Tle Pro Val Ser His Pro Val Leu Thr Phe Arg Ala Pro Arg Ala His Thr Val Val Gly Asp Leu Leu Glu Leu His Cys Glu Ser Leu Arg Gly Ser Pro Pro Ile Leu Tyr Arg Phe Tyr His Glu Asp Val Thr Leu Gly Asn Ser Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Ala Glu His Ser Gly Asn Tyr Ser Cys Asp Ala Asp Asn Gly Leu Gly A1a Gln His Ser His Gly Val Ser Leu Arg Val Thr Val Pro Val Ser Arg Pro Val Leu Thr Leu Arg Ala Pro Gly Ala Gln Ala Val Val Gly Asp Leu Leu Glu Leu His Cys Glu Ser Leu Arg Gly Ser Phe Pro Ile Leu Tyr Trp Phe Tyr His Glu Asp Asp Thr Leu Gly Asn Ile Ser Ala His Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Thr Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asp Asn Gly Leu Gly Ala G1n His Ser Lys Val Val Thr Leu Asn Val Thr Gly Thr Ser Arg Asn Arg Thr Gly Leu Thr Ala Ala Gly Ile Thr Gly Leu Val Leu Ser Ile Leu Val Leu Ala Ala Ala Ala Ala Leu Leu His Tyr Ala Arg Ala Arg Arg Lys Pro Gly Gly Leu 5er A1a Thr Gly Thr Ser Ser His Ser Pro Ser Glu Cys Gln Glu Pro Ser Ser Ser Arg Pro Ser Arg Ile Asp Pro Gln Glu Pro Thr His Ser Lys Pro Leu Ala Pro Met Glu Leu Glu Pro Met Tyr Ser Asn Val Asn Pro Gly Asp Ser Asn Pro Ile Tyr Ser Gln Ile Trp Ser Ile Gln His Thr Lys Glu Asn Ser Ala Asn Cys Pro Met Met His Gln Glu His Glu Glu Leu Thr Val Leu Tyr Ser Glu Leu Lys Lys Thr His Pro Asp Asp Ser Ala Gly Glu Ala Ser Ser Arg Gly Arg Ala His Glu Glu Asp Asp Glu Glu Asn Tyr Glu Asn Val Pro Arg Val Leu Leu Ala Ser Asp His <210> 7 <211> 300 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 7 aaaaa taggaagacg ttcagccagggatccactcaggagccttcccagccctcta 60 aaaa g accccagggcagctacagcctatatatgaa 120 ccccaagagt tcacctacct caactcacct aatgtgaatg ttgtaagtgg ggatgaggtttattcactggcgtactataaccagccggag 180 caggaatcag tagcagcaga aaccctggggacacatatggaggacaaggtttccttagac 240 atctattcca ggctgaggaa agcaaacattacagatgtggactatgaagatgctatgtaa 300 <210> 8 <211> 2038 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 8 ctcgactctg aggtgcattc tttttttgat gagaggcatc tctaggtacc atccctgacc 60 tggtcctcat gctgccgagg ctgttgctgt tgatctgtgc tccactctgt gaacctgccg 120 agctgtttttgatagccagcccctcccatcccacagaggggagcccagtgaccctgacgt180 gtaagatgccctttctacagagttcagatgcccagttccagttctgctttttcagagaca240 cccgggccttgggcccaggctggagcagctcccccaagctccagatcgctgccatgtgga300 aagaagacacagggtcatactggtgcgaggcacagacaatggcgtccaaagtcttgagga360 gcaggagatcccagataaatgtgcacagggtccctgtcgctgatgtgagcttggagactc420 agcccccaggaggacaggtgatggagggagacaggctggtcctcatctgctcagttgcta480 tgggcacaggagacatcaccttcctttggtacaaaggggctgtaggtttaaaccttcagt540 caaagacccagcgttcactgacagcagagtatgagattccttcagtgagggagagtgatg600 ctgagcaatattactgtgtagctgaaaatggctatggtcccagccccagtgggctggtga660 gcatcactgtcagaatcccggtgtctcgcccaatcctcatgctcagggctcccagggccc720 aggctgcagtggaggatgtgctggagcttcactgtgaggccctgagaggctctcctccaa780 tcctgtactggttttatcacgaggatatcaccctggggagcaggtcggccccctctggag840 gaggagcctccttcaacctttccctgactgaagaacattctggaaactactcctgtgagg900 ccaacaatggcctgggggcccagcgcagtgaggcggtgacactcaacttcacagtgccta960 ctggggccagaagcaatcatcttacctcaggagtcattgaggggctgctcagcacccttg1020 gtccagccaccgtggccttattattttgctacggcctcaaaagaaaaataggaagacgtt1080 cagccagggatccactcaggagccttcccagccctctaccccaagagttcacctacctca1140 actcacctaccccagggcagctacagcctatatatgaaaatgtgaatgttgtaagtgggg1200 atgaggtttattcactggcgtactataaccagccggagcaggaatcagtagcagcagaaa1260 ccctggggacacatatggaggacaaggtttccttagacatctattccaggctgaggaaag1320 caaacattacagatgtggactatgaagatgctatgtaaggttatggaagattctgctctt1380 tgaaaaccatccatgaccccaagcctcaggcctgatatgttcttcagagatcctggggca1440 ttagctttccagtatacctcttctggatgccattctccatggcactattccttcatctac1500 tgtgaagtgaagttggcgcagccctgaagaaactacctaggagaactaatagacacagga1560 gtgacagggactttgttatcagaaccagattcctgccggctcctttgaaaacaggtcata1620 ttgtgctcttctgtttacaagaggaaacaagatggaataaaagaaattgggatcttgggt1680 tggagggacagtgaagcttagagcacatgaactcaaggttagtgactctgcaggacttca1740 cagagagagctgtgcccatcattcagtccaagtgctttctctgcccagacagcacagaac1800 tccagccccgctacttacatggatcatcgagtttccacctaaaatatgattctatttatt1860 ttgagtcactgttaccaaattagaactaaaacaaagttacataaaaagttattgtgactc1920 cacttaattttagtgacgtatttttgtatatataggccaacctataccacatccaaaatt1980 atgtatctattacagcccctagaagctttataaatacagtgtgtcttcttttattcac 2038 <210> 9 <211> 261 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificialequence:/note =
S
synthetic construct <400> 9 cacaagatat caggagaaag ttctgccactaatgaacccagaggggcttccaggccaaat60 cctcaagagt tcacctattc aagcccaaccccagacatggaggagctgcagccagtgtat120 gtcaatgtgg gctctgtaga tgtggatgtggtttattctcaggtctggagcatgcagcag180 ccagaaagct cagcaaacat caggacacttctggagaacaaggactcccaagtcatctac240 tcttctgtga agaaatcata a 261 <210> 10 <211> 2573 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 10 ggtgaccaag agtacatctc ttttcaaatagctggattaggtcctcatgctgctgtggtc60 attgctggtc atctttgatg cagtcactgaacaggcagattcgctgacccttgtggcgcc120 ctcttctgtcttcgaaggagacagcatcgttctgaaatgccagggagaacagaactggaa180 aattcagaagatggcttaccataaggataacaaagagttatctgttttcaaaaaattctc240 agatttccttatccaaagtgcagttttaagtgacagtggtaactatttctgtagtaccaa300 aggacaactctttctctgggataaaacttcaaatatagtaaagataaaagtccaagagct360 ctttcaacgtcctgtgctgactgccagctccttccagcccatcgaagggggtccagtgag420 cctgaaatgtgagacccggctctctccacagaggttggatgttcaactccagttctgctt480 cttcagagaaaaccaggtcctggggtcaggctggagcagctctccggagctccagatttc540 tgccgtgtggagtgaagacacagggtcttactggtgcaaggcagaaacggtgactcacag600 gatcagaaaacagagcctccaatcccagattcacgtgcagagaatccccatctctaatgt660 aagcttggagatccgggcccccgggggacaggtgactgaaggacaaaaactgatcctgct720 ctgctcagtggctgggggtacaggaaatgtcacattctcctggtacagagaggccacagg780 aaccagtatgggaaagaaaacccagcgttccctgtcagcagagctggagatcccagctgt840 gaaagagagtgatgccggcaaatattactgtagagctgacaacggccatgtgcctatcca900 gagcaaggtggtgaatatccctgtgagaattccagtgtctcgccctgtcctcaccctcag960 gtctcctggggcccaggctgcagtgggggacctgctggagcttcactgtgaggccctgag1020 aggctctcccccaatcttgtaccaattttatcatgaggatgtcacccttgggaacagctc1080 ggccccctctggaggaggggcctccttcaacctctctttgactgcagaacattctggaaa1140 ctactcctgtgaggccaacaacggcctgggggcccagtgcagtgaggcagtgccagtctc1200 catctcaggacctgatggctatagaagagacctcatgacagctggagttctctggggact1260 gtttggtgtccttggtttcactggtgttgctttgctgttgtatgccttgttccacaagat1320 atcaggagaaagttctgccactaatgaacccagaggggcttccaggccaaatcctcaaga1380 gttcacctattcaagcccaaccccagacatggaggagctgcagccagtgtatgtcaatgt1440 gggctctgtagatgtggatgtggtttattctcaggtctggagcatgcagcagccagaaag1500 ctcagcaaacatcaggacacttctggagaacaaggactcccaagtcatctactcttctgt1560 gaagaaatcataacacttggaggaatcagaagggaagatcaacagcaaggatggggcatc1620 attaagacttgctataaaaccttatgaaaatgcttgaggcttatcacctgccacagccag1680 aacgtgcctcaggaggcacctcctgtcatttttgtcctgatgatgtttcttctccaatat1740 cttcttttacctatcaatattcattgaactgctgctacatccagacactgtgcaaataaa1800 ttatttctgctaccttctcttaagcaatcagtgtgtaaagatttgagggaagaatgaata1860 agagatacaaggtctcaccttcatctactgtgaagtgatgagaacaggacttgatagtgg1920 tgtattaacttatttatgtgctgctggatacagtttgctaatattttgttgagaattttt1980 gcaaatatgttcattgggaatattggcctgaaattttcttttccactgtgtctctgccag2040 aatgtttgtatcaggctgatgctggcttcatagaatgagttaggcaggagcccttcctcc2100 ttgattttttggcatagtttcagcaggattggtaccagttattctttctgcatcttgtag2160 aattcagctatgaatccatctggtctagggcttttgtgttggttggtaagttttttatta2220 ctaattcaacttcagcgcttgatattggtctaggaggggtttctgtctcttcctggttca2280 atcttgggagattgtgtgtttccaggaatttagccgtttcctccagattttcttctttat2340 gtgcatcgacttgagtgtaaacataacttatatgcactgggaaaccaaaaaatctgtgtg2400 acttgctttattgcagcatttgttttattttggtagtctggaactgaacctgcaatatca2460 ccaaagtatgcatatagttgcaaaaatgtgatttttgacatagtaaatatgagtatttgc2520 aataaactatgatattacttttgtaagtatatagaataaaatgtaaataatct 2573 <210> 11 <211> 423 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
11 gggcccgaaggaaaccaggaggactttctgccactggaacatctagtcac 60 cattacgcca agtcctagcgagtgtcaggagccttcctcgtccaggccttccaggatagaccctcaagag 120 cccactcactctaaaccactagccccaatggagctggagccaatgtacagcaatgtaaat 180 cctggagatagcaacccgatttattcccagatctggagcatccagcatacaaaagaaaac 240 tcagctaattgtccaatgatgcatcaagagcatgaggaacttacagtcctctattcagaa 300 ctgaagaagacacacccagacgactctgcaggggaggctagcagcagaggcagggcccat 360 gaagaagatgatgaagaaaactatgagaatgtaccacgtgtattactggcctcagaccac 420 tag 423 <210> 12 <211> 2916 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
12 agtagcagcttcctgccctccttcttggagataagtcgggcttttggtga60 gtctcatctg gacagactttcccaaccctctgcccggccggtgcccatgcttctgtggctgctgctgctg120 atcctgactcctggaagagaacaatcaggggtggccccaaaagctgtacttctcctcaat180 cctccatggtccacagccttcaaaggagaaaaagtggctctcatatgcagcagcatatca240 cattccctagcccagggagacacatattggtatcacgatgagaagttgttgaaaataaaa300 catgacaagatccaaattacagagcctggaaattaccaatgtaagacccgaggatcctcc360 ctcagtgatgccgtgcatgtggaattttcacccgactggctgatcctgcaggctttacat420 cctgtctttgaaggagacaatgtcattctgagatgtcaggggaaagacaacaaaaacact480 catcaaaaggtttactacaaggatggaaaacagcttcctaatagttataatttagagaag540 atcacagtgaattcagtctccagggataatagcaaatatcattgtactgcttataggaag600 ttttacatacttgacattgaagtaacttcaaaacccctaaatatccaagttcaagagctg660 tttctacatcctgtgctgagagccagctcttccacgcccatagaggggagtcccatgacc720 ctgacctgtgagacccagctctctccacagaggccagatgtccagctgcaattctccctc780 ttcagagatagccagaccctcggattgggctggagcaggtcccccagactccagatccct840 gccatgtggactgaagactcagggtcttactggtgtgaggtggagacagtgactcacagc900 atcaaaaaaaggagcctgagatctcagatacgtgtacagagagtccctgtgtctaatgtg960 aatctagagatccggcccaccggagggcagctgattgaaggagaaaatatggtccttatt1020 tgctcagtagcccagggttcagggactgtcacattctcctggcacaaagaaggaagagta1080 agaagcctgggtagaaagacccagcgttccctgttggcagagctgcatgttctcaccgtg1140 aaggagagtgatgcagggagatactactgtgcagctgataacgttcacagccccatcctc1200 agcacgtggattcgagtcaccgtgagaattccggtatctcaccctgtcctcaccttcagg gctcccagggcccacactgtggtgggggacctgctggagcttcactgtgagtccctgaga1320 ggctctcccccgatcctgtaccgattttatcatgaggacgtcaccctggggaacagctca1380 gccccctctggaggaggagcctccttcaacctctctctgactgcagaacattctggaaac1440 tactcctgtgatgcagacaatggcctgggggcccagcacagtcatggagtgagtctcagg1500 gtcacagttccggtgtctcgccccgtcctcaccctcagggctcccggggcccaggctgtg1560 gtgggggacctgctggagcttcactgtgagtccctgagaggctccttcccgatcctgtac1620 tggttttatcacgaggatgacaccttggggaacatctcggcccactctggaggaggggca1680 tccttcaacctctctctgactacagaacattctggaaactactcatgtgaggctgacaat1740 ggcctgggggcccagcacagtaaagtggtgacactcaatgttacaggaacttccaggaac1800 agaacaggccttaccgctgcgggaatcacggggctggtgctcagcatcctcgtccttgct1860 gctgctgctgctctgctgcattacgccagggcccgaaggaaaccaggaggactttctgcc1920 actggaacatctagtcacagtcctagcgagtgtcaggagccttcctcgtccaggccttcc1980 aggatagaccctcaagagcccactcactctaaaccactagccccaatggagctggagcca2040 atgtacagcaatgcaaatcctggagatagcaacccgatttattcccagatctggagcatc2100 cagcatacaaaagaaaactcagctaattgtccaatgatgcatcaagagcatgaggaactt2160 acagtcctctattcagaactgaagaagacacacccagacgactctgcaggggaggctagc2220 agcagaggcagggcccatgaagaagatgatgaagaaaactatgagaatgtaccacgtgta2280 ttactggcctcagaccactagccccttacccagagtggcccacaggaaacagcctgcacc2340 atttttttttctgttctctccaaccacacatcatccatctctccagactctgcctcctac2400 gaggctgggctgcagg 2416 <210> 13 <211> 873 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
13 tgatagccagcccctcccatcccacagaggggagcccagtgaccctgacg 60 gagctgtttt tgtaagatgccctttctacagagttcagatgcccagttccagttctgctttttcagagac 120 acccgggccttgggcccaggctggagcagctcccccaagctccagatcgctgccatgtgg 180 aaagaagacacagggtcatactggtgcgaggcacagacaatggcgtccaaagtcttgagg 240 agcaggagatcccagataaatgtgcacagggtccctgtcgctgatgtgagcttggagact 300 cagcccccaggaggacaggtgatggagggagacaggctggtcctcatctgctcagttgct 360 atgggcacaggagacatcaccttcctttggtacaaaggggctgtaggtttaaaccttcag 420 tcaaagacccagcgttcactgacagcagagtatgagattccttcagtgagggagagtgat 480 gctgagcaatattactgtgtagctgaaaatggctatggtcccagccccagtgggctggtg 540 agcatcactgtcagaatcccggtgtctcgcccaatcctcatgctcagggctcccagggcc 600 caggctgcagtggaggatgtgctggagcttcactgtgaggccctgagaggctctcctcca 660 atcctgtactggttttatcacgaggatatcaccctggggagcaggtcggccccctctgga 720 ggaggagcctccttcaacctttccctgactgaagaacattctggaaactactcctgtgag 780 gccaacaatggcctgggggcccagcgcagtgaggcggtgacactcaacttcacagtgcct 840 actggggccagaagcaatcatcttacctcagga 873 <210>
<211>
<212>
DNA
<213> ficial Arti Sequence <220>
<223> iption equence:/note =
Descr of Artificial S
synth etic construct <400>
14 cgccctcttctgtcttcgaaggagacagcatcgttctgaaatgccaggga60 acccttgtgg gaacagaactggaaaattcagaagatggcttaccataaggataacaaagagttatctgtt120 ttcaaaaaattctcagatttccttatccaaagtgcagttttaagtgacagtggtaactat180 ttctgtagtaccaaaggacaactctttctctgggataaaacttcaaatatagtaaagata290 aaagtccaagagctctttcaacgtcctgtgctgactgccagctccttccagcccatcgaa300 gggggtccagtgagcctgaaatgtgagacccggctctctccacagaggttggatgttcaa360 ctccagttctgcttcttcagagaaaaccaggtcctggggtcaggctggagcagctctccg420 gagctccagatttctgccgtgtggagtgaagacacagggtcttactggtgcaaggcagaa480 acggtgactcacaggatcagaaaacagagcctccaatcccagattcacgtgcagagaatc540 cccatctctaatgtaagcttggagatccgggcccccgggggacaggtgactgaaggacaa600 , aaactgatcctgctctgctcagtggctgggggtacaggaaatgtcacattctcctggtac660 agagaggccacaggaaccagtatgggaaagaaaacccagcgttccctgtcagcagagctg720 gagatcccagctgtgaaagagagtgatgccggcaaatattactgtagagctgacaacggc780 catgtgcctatccagagcaaggtggtgaatatccctgtgagaattccagtgtctcgccct840 gtcctcaccctcaggtctcctggggcccaggctgcagtgggggacctgctggagcttcac900 tgtgaggccctgagaggctctcccccaatcttgtaccaattttatcatgaggatgtcacc960 cttgggaacagctcggccccctctggaggaggggcctccttcaacctctctttgactgca1020 gaacattctggaaactactcctgtgaggccaacaacggcctgggggcccagtgcagtgag1080 gcagtgccagtctccatctcaggacctgatggctatagaagagacctcatgacagct 1137 <210>
<211>
<212>
DNA
<213> ence Artificial Sequ <220>
<223> Description of Artificial Sequence:/note synthetic construct <400> 15 gtggccccaa aagctgtact tctcctcaat cctccatggt ccacagcctt caaaggagaa 60 aaagtggctc tcatatgcag cagcatatca cattccctag cccagggaga cacatattgg 120 tatcacgatg agaagttgtt gaaaataaaa catgacaaga tccaaattac agagcctgga 180 aattaccaat gtaagacccg aggatcctcc ctcagtgatg ccgtgcatgt ggaattttca 240 cccgactggctgatcctgcaggctttacatcctgtctttgaaggagacaatgtcattctg300 agatgtcaggggaaagacaacaaaaacactcatcaaaaggtttactacaaggatggaaaa360 cagcttcctaatagttataatttagagaagatcacagtgaattcagtctccagggataat420 agcaaatatcattgtactgcttataggaagttttacatacttgacattgaagtaacttca480 aaacccctaaatatccaagttcaagagctgtttctacatcctgtgctgagagccagctct540 tccacgcccatagaggggagtcccatgaccctgacctgtgagacccagctctctccacag600 aggccagatgtccagctgcaattctccctcttcagagatagccagaccctcggattgggc660 tggagcaggtcccccagaotccagatccctgccatgtggactgaagactcagggtcttac720 tggtgtgaggtggagacagtgactcacagcatcaaaaaaaggagcctgagatctcagata780 cgtgtacagagagtccctgtgtctaatgtgaatctagagatccggcccaccggagggcag840 ctgattgaaggagaaaatatggtccttatttgctcagtagcccagggttcagggactgtc900 acattctcctggcacaaagaaggaagagtaagaagcctgggtagaaagacccagcgttcc960 ctgttggcagagctgcatgttctcaccgtgaaggagagtgatgcagggagatactactgt1020 gcagctgataacgttcacagccccatcctcagcacgtggattcgagtcaccgtgagaatt1080 ccggtatctcaccctgtcctcaccttcagggctcccagggcccacactgtggtgggggac1140 ctgctggagcttcactgtgagtccctgagaggctctcccccgatcctgtaccgattttat1200 catgaggacgtcaccctggggaacagctcagccccctctggaggaggagcctccttcaac1260 ctctctctgactgcagaacattctggaaactactcctgtgatgcagacaatggcctgggg1320 gcccagcacagtcatggagtgagtctcagggtcacagttccggtgtctcgccccgtcctc1380 accctcagggctcccggggcccaggctgtggtgggggacctgctggagcttcactgtgag1440 tccctgagaggctccttcccgatcctgtactggttttatcacgaggatgacaccttgggg1500 aacatctcggcccactctggaggaggggcatccttcaacctctctctgactacagaacat1560 tctggaaactactcatgtgaggctgacaatggcctgggggcccagcacagtaaagtggtg1620 acactcaatgttacaggaacttccaggaacagaacaggc 1659 <210>
<211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223> iption rtificialequence:/note =
Descr of A S
synth etic construct <400>
16 gggcccgaaggaaaccaggaggactttctgccactggaacatctagtcac60 cattacgcca agtcctagcgagtgtcaggagccttcctcgtccaggccttccaggatagaccctcaagag120 cccactcactctaaaccactagccccaatggagctggagccaatgtacagcaatgcaaat180 cctggagatagcaacccgatttattcccagatctggagcatccagcatacaaaagaaaac240 tcagctaattgtccaatgatgcatcaagagcatgaggaacttacagtcctctattcagaa300 ctgaagaagacacacccagacgactctgcaggggaggctagcagcagaggcagggcccat360 gaagaagatgatgaagaaaactatgagaatgtaccacgtgtattactggcctcagaccac420 tag <210>
<211>
<212>
DNA
<213>
Artificial Sequence <220>
<223>
Description of Artificial Sequence:/note =
synthetic construct <400>
17 aagctgtacttctcctcaatcctccatggtccacagccttcaaaggagaa60 gtggccccaa aaagtggctctcatatgcagcagcatatcacattccctagcccagggagacacatattgg120 tatcacgatgagaagttgttgaaaataaaacatgacaagatccaaattacagagcctgga180 aattaccaatgtaagacccgaggatcctccctcagtgatgccgtgcatgtggaattttca240 cccgactggctgatcctgcaggctttacatcctgtctttgaaggagacaatgtcattctg300 agatgtcaggggaaagacaacaaaaacactcatcaaaaggtttactacaaggatggaaaa360 cagcttcctaatagttataatttagagaagatcacagtgaattcagtctccagggataat420 agcaaatatcattgtactgcttataggaagttttacatacttgacattgaagtaacttca480 aaacccctaaatatccaagttcaagagctgtttctacatcctgtgctgagagccagctct540 tccacgcccatagaggggagtcccatgaccctgacctgtgagacccagctctctccacag600 aggccagatgtccagctgcaattctccctcttcagagatagccagaccctcggattgggc660 tggagcaggtcccccagactccagatccctgccatgtggactgaagactcagggtcttac720 tggtgtgaggtggagacagtgactcacagcatcaaaaaaaggagcctgagatctcagata780 cgtgtacagagagtccctgtgtctaatgtgaatctagagatccggcccaccggagggcag840 ctgattgaaggagaaaatatggtccttatttgctcagtagcccagggttcagggactgtc900 acattctcctggcacaaagaaggaagagtaagaagcctgggtagaaagacccagcgttcc960 ctgttggcagagctgcatgttctcaccgtgaaggagagtgatgcagggagatactactgt1020 gcagctgataacgttcacagccccatcctcagcacgtggattcgagtcaccgtgagaatt1080 ccggtatctcaccctgtcctcaccttcagggctcccagggcccacactgtggtgggggac1140 ctgctggagcttcactgtgagtccctgagaggctctcccccgatcctgtaccgattttat1200 catgaggacgtcaccctggggaacagctcagccccctctggaggaggagcctccttcaac1260 ctctctctgactgcagaacattctggaaactactcctgtgatgcagacaatggcctgggg1320 gcccagcacagtcatggagtgagtctcagggtcacagttccggtgtctcgccccgtcctc1380 accctcagggctcccggggcccaggctgtggtgggggacctgctggagcttcactgtgag1440 tccctgagaggctccttcccgatcctgtactggttttatcacgaggatgacaccttgggg1500 aacatctcggcccactctggaggaggggcatccttcaacctctctctgactacagaacat1560 tctggaaactactcatgtgaggctgacaatggcctgggggcccagcacagtaaagtggtg1620 acactcaatgttacaggaacttccaggaacagaacaggccttaccgctgcgggaatcacg1680 gggctggtgctcagcatcctcgtccttgctgctgctgctgctctgctgcattacgccagg1740 gcccgaaggaaaccaggaggactttctgccactggaacatctagtcacagtcctagcgag1800 tgtcaggagccttcctcgtccaggccttccaggatagaccctcaagagcccactcactct1860 aaaccactagccccaatggagctggagccaatgtacagcaatgcaaatcctggagatagc1920 aacccgatttattcccagatctggagcatccagcatacaaaagaaaactcagctaattgt1980 ccaatgatgcatcaagagcatgaggaacttacagtcctctattcagaactgaagaagaca2040 cacccagacgactctgcaggggaggctagcagcagaggcagggcccatgaagaagatgat2100 gaagaaaactatgagaatgtaccacgtgtattactggcctcagaccactag 2151 <210> 18 <211> 315 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificialequence:/note =
S
synthetic construct <400> 18 agatcctgga gaaaagctgg gccccttccatcccagataccacccacagctccaggtgga60 gagcagtgcc cactatatgc caacgtgcatcaccagaaagggaaagatgaaggtgttgtc120 tactctgtgg tgcatagaac ctcaaagaggagtgaagccaggtctgctgagttcaccgtg180 gggagaaagg acagttctat catctgtgcggaggtgagatgcctgcagcccagtgaggtt240 tcatccacgg aggtgaatat gagaagcaggactctccaagaaccccttagcgactgtgag300 gaggttctct gctag 315 <210> 19 <211> 870 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 19 actgtctggc tgtacctcca agcctggccaaaccctgtgtttgaaggagatgccctgact60 ctgcgatgtc agggatggaa gaatacaccactgtctcaggtgaagttctacagagatgga120 aaattccttc atttctctaa ggaaaaccagactctgtccatgggagcagcaacagtgcag180 agccgtggcc agtacagctg ctctgggcaggtgatgtatattccacagacattcacacaa240 acttcagagactgccatggttcaagtccaagagctgtttccacctcctgtgctgagtgcc300 atcccctctcctgagccccgagagggtagcctggtgaccctgagatgtcagacaaagctg360 caccccctgaggtcagccttgaggctccttttctccttccacaaggacggccacaccttg920 caggacaggggccctcacccagaactctgcatcccgggagccaaggagggagactctggg480 ctttactggtgtgaggtggcccctgagggtggccaggtccagaagcagagcccccagctg590 gaggtcagagtgcaggctcctgtatcccgtcctgtgctcactctgcaccacgggcctgct600 gaccctgctgtgggggacatggtgcagctcctctgtgaggcacagaggggctcccctccg660 atcctgtattccttctaccttgatgagaagattgtggggaaccactcagctccctgtggt720 ggaaccacctccctcctcttcccagtgaagtcagaacaggatgctgggaactactcctgc780 gaggctgagaacagtgtctccagagagaggagtgagcccaagaagctgtctctgaagggt840 tctcaagtcttgttcactcccgccagcaac 870 <210>
<211>
<212>
DNA
<213> ficial enoe Arti Sequ <220>
<223> Description of Artificial Sequence:/note synthetic construct <400>
20 tgtacctccaagcctggccaaaccctgtgtttgaaggagatgccctgact60 actgtctggc ctgcgatgtcagggatggaagaatacaccactgtctcaggtgaagttctacagagatgga120 aaattccttcatttctctaaggaaaaccagactctgtccatgggagcagcaacagtgcag180 agccgtggccagtacagctgctctgggcaggtgatgtatattccacagacattcacacaa240 acttcagagactgccatggttcaagtccaagagctgtttccacctcctgtgctgagtgcc300 atcccctctcctgagccccgagagggtagcctggtgaccctgagatgtcagacaaagctg360 caccccctgaggtcagccttgaggctccttttctccttccacaaggacggccacaccttg420 caggacaggggccctcacccagaactctgcatcccgggagccaaggagggagactctggg480 ctttactggtgtgaggtggcccctgagggtggccaggtccagaagcagagcccccagctg540 gaggtcagagtgcaggctcctgtatcccgtcctgtgctcactctgcaccacgggcctgct600 gaccctgctgtgggggacatggtgcagctcctctgtgaggcacagaggggctcccctccg660 atcctgtattccttctaccttgatgagaagattgtggggaaccactcagctccctgtggt720 ggaaccacctccctcctcttcccagtgaagtcagaacaggatgctgggaactactcctgc780 gaggctgagaacagtgtctccagagagaggagtgagcccaagaagctgtctctgaagggt840 tctcaagtcttgttcactcccgccagcaactggctggttccttggcttcctgcgagcctg900 cttggcctgatggttattgctgctgcacttctggtttatgtgagatcctggagaaaagct960 gggccccttccatcccagataccacccacagctccaggtggagagcagtgcccactatat1020 gccaacgtgcatcaccagaaagggaaagatgaaggtgttgtctactctgtggtgcataga1080 acctcaaagaggagtgaagccaggtctgctgagttcaccgtggggagaaaggacagttct1140 atcatctgtgcggaggtgagatgcctgcagcccagtgaggtttcatccacggaggtgaat1200 atgagaagcaggactctccaagaaccccttagcgactgtgaggaggttctctgctag 1257 <210>
<211>
<212>
PRT
<213> ficial ence Arti Sequ <220>
<223> ription ArtificialSequence:/note =
Desc of synt hetic construct <400>
21 Pro Ser Pro Thr Ala Glu His Glu Gly Leu Phe Ser Leu Ile Ala Ser Pro Va1 Pro Phe Thr Leu Leu Thr Cys Gln Lys Met Ser Ser Asp Ala Gln Phe Asp Thr Gln Phe Arg Cys Phe Ala Phe Arg Leu Gly Pro Gly Trp Ser Ser Ser Pro Lys Leu Gln Ile Ala Ala Met Trp Lys Glu Asp Thr Gly Ser Tyr Trp Cys Glu Ala Gln Thr Met Ala Ser Lys Val Leu Arg Ser Arg Arg Ser Gln Ile Asn Val His Arg Val Pro Val Ala Asp Val Ser Leu Glu Thr Gln Pro Pro Gly Gly G1n Val Met Glu Gly Asp Arg Leu Val Leu Ile Cys Ser Val Ala Met Gly Thr Gly Asp Ile Thr Phe Leu Trp Tyr Lys Gly Ala Val Gly Leu Asn Leu Gln Ser Lys Thr Gln Arg Ser Leu Thr Ala Glu Tyr Glu Ile Pro Ser Val Arg Glu Ser Asp A1a Glu Gln Tyr Tyr Cys Val Ala Glu Asn Gly Tyr Gly Pro Ser Pro Ser Gly Leu Val Ser Ile Thr Val Arg Ile Pro Val Ser Arg Pro Ile Leu Met Leu Arg Ala Pro Arg Ala Gln Ala Ala Val Glu Asp Val Leu Glu Leu His Cys Glu Ala Leu Arg Gly Ser Pro Pro Ile Leu Tyr Trp Phe Tyr His Glu Asp Ile Thr Leu Gly Ser Arg Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Glu Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asn Asn Gly Leu Gly Ala Gln Arg Ser Glu Ala Val Thr Leu Asn Phe Thr Val Pro Thr Gly Ala Arg Ser Asn His Leu Thr Ser Gly <210> 22 <211> 380 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial 5equence:/note =
synthetic construct <400> 22 Leu Thr Leu Val Ala Pro Ser Ser Val Phe Glu Gly Asp Ser Ile Val Leu Lys Cys Gln Gly Glu Gln Asn Trp Lys Ile Gln Lys Met Ala Tyr His Lys Asp Asn Lys Glu Leu Ser Val Phe Lys Lys Phe Ser Asp Phe Leu Ile Gln Ser Ala Val Leu Ser Asp Ser G1y Asn Tyr Phe Cys Ser Thr Lys Gly Gln Leu Phe Leu Trp Asp Lys Thr Ser Asn Ile Val Lys Tle Lys Val Gln Glu Leu Phe Gln Arg Pro Val Leu Thr Ala Ser Ser Phe Gln Pro Ile Glu Gly Gly Pro Val Ser Leu Lys Cys Glu Thr Arg Leu Ser Pro Gln Arg Leu Asp Val Gln Leu Gln Phe Cys Phe Phe Arg Glu Asn Gln Val Leu Gly Ser Gly Trp Ser Ser Ser Pro Glu Leu Gln Ile Ser Ala Val Trp Ser Glu Asp Thr Gly Ser Tyr Trp Cys Lys Ala Glu Thr Val Thr His Arg Ile Arg Lys Gln Ser Leu Gln Ser Gln Ile His Val Gln Arg Ile Pro Ile Ser Asn Val Ser Leu Glu Ile Arg Ala Pro Gly Gly Gln Val Thr Glu Gly Gln Lys Leu Ile Leu Leu Cys Ser Val Ala Gly Gly Thr Gly Asn Val Thr Phe Ser Trp Tyr Arg Glu Ala Thr G1y Thr Ser Met Gly Lys Lys Thr Gln Arg Ser Leu Ser Ala Glu Leu Glu Ile Pro Ala Val Lys Glu Ser Asp Ala Gly Lys Tyr Tyr Cys Arg Ala Asp Asn Gly His Val Pro Ile Gln Ser Lys Val Val Asn Ile Pro Val Arg Ile Pro Val Ser Arg Pro Val Leu Thr Leu Arg Ser Pro Gly A1a Gln Ala Ala Val Gly Asp Leu Leu G1u Leu His Cys Glu Ala Leu Arg Gly Ser Pro Pro Ile Leu Tyr Gln Phe Tyr His Glu Asp Val Thr Leu Gly Asn 5er Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Ala Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asn Asn Gly Leu Gly Ala Gln Cys Ser Glu Ala Val Pro Val Ser Ile Ser Gly Pro Asp Gly Tyr Arg Arg Asp Leu Met Thr Ala <210> 23 <211> 140 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 23 His Tyr Ala Arg Ala Arg Arg Lys Pro Gly Gly Leu Ser A1a Thr Gly Thr Ser Ser His Ser Pro Ser Glu Cys Gln Glu Pro 5er Ser Ser Arg Pro Ser Arg Ile Asp Pro Gln Glu Pro Thr His Ser Lys Pro Leu A1a Pro Met Glu Leu Glu Pro Met Tyr Ser Asn Ala Asn Pro Gly Asp Ser Asn Pro Ile Tyr Ser Gln Tle Trp Ser Ile Gln His Thr Lys Glu Asn Ser Ala Asn Cys Pro Met Met His Gln Glu His Glu Glu Leu Thr Val Leu Tyr 5er Glu Leu Lys Lys Thr His Pro Asp Asp Ser Ala Gly Glu Ala Ser Ser Arg Gly Arg Ala His Glu G1u Asp Asp Glu Glu Asn Tyr Glu Asn Val Pro Arg Val Leu Leu Ala Ser Asp His <210> 24 <211> 554 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:lnote =
synthetic construct <400> 24 Gly Val Ala Pro Lys Ala Val Leu Leu Leu Asn Pro Pro Trp Ser Thr Ala Phe Lys Gly Glu Lys Val Ala Leu Ile Cys Ser Ser Ile Ser His Ser Leu Ala Gln Gly Asp Thr Tyr Trp Tyr His Asp Glu Lys Leu Leu Lys Ile Lys His Asp Lys Ile Gln Ile Thr Glu Pro Gly Asn Tyr G1n Cys Lys Thr Arg Gly Ser Ser Leu Ser Asp Ala Val His Val Glu Phe Ser Pro Asp Trp Leu Ile Leu Gln Ala Leu His Pro Val Phe Glu Gly Asp Asn Val Tle Leu Arg Cys Gln Gly Lys Asp Asn Lys Asn Thr His Gln Lys Val Tyr Tyr Lys Asp Gly Lys Gln Leu Pro Asn Ser Tyr Asn Leu Glu Lys Ile Thr Val Asn Ser Val Ser Arg Asp Asn Ser Lys Tyr His Cys Thr Ala Tyr Arg Lys Phe Tyr Ile Leu Asp Tle Glu Val Thr 5er Lys Pro Leu Asn Ile Gln Val Gln Glu Leu Phe Leu His Pro Val Leu Arg Ala Ser Ser Ser Thr Pro Ile Glu Gly Ser Pro Met Thr Leu Thr Cys Glu Thr Gln Leu Ser Pro Gln Arg Pro Asp Val Gln Leu Gln Phe Ser Leu Phe Arg Asp Ser G1n Thr Leu Gly Leu Gly Trp Ser Arg 5er Pro Arg Leu Gln Ile Pro Ala Met Trp Thr Glu Asp Ser G1y Ser Tyr Trp Cys Glu Val Glu Thr Val Thr His Ser Tle Lys Lys Arg Ser Leu Arg Ser Gln Ile Arg Va1 Gln Arg Val Pro Val Ser Asn Val Asn Leu G1u Ile Arg Pro Thr Gly Gly Gln Leu Ile Glu Gly Glu Asn Met Val Leu Ile Cys Ser Val Ala G1n Gly Ser Gly Thr Val Thr Phe Ser Trp His Lys Glu Gly Arg Val Arg Ser Leu Gly Arg Lys Thr Gln Arg _ Ser Leu Leu Ala Glu Leu His Val Leu Thr Val Lys Glu Ser Asp Ala Gly Arg Tyr Tyr Cys Ala A1a Asp Asn Val His Ser Pro Ile Leu Ser Thr Trp Tle Arg Val Thr Val Arg Ile Pro Val Ser His Pro Val Leu Thr Phe Arg Ala Pro Arg Ala His Thr Val Val Gly Asp Leu Leu Glu Leu His Cys Glu Ser Leu Arg Gly Ser Pro Pro Ile Leu Tyr Arg Phe Tyr His Glu Asp Val Thr Leu Gly Asn Ser Ser Ala Pro Ser Gly Gly Gly A1a Ser Phe Asn Leu Ser Leu Thr Ala Glu His Ser Gly Asn Tyr Ser Cys Asp Ala Asp Asn Gly Leu Gly Ala Gln His Ser His Gly Val Ser Leu Arg Val Thr Val Pro Val Ser Arg Pro Val Leu Thr Leu Arg Ala Pro Gly Ala Gln Ala Val Val Gly Asp Leu Leu Glu Leu His Cys Glu Ser Leu Arg Gly 5er Phe Pro Ile Leu Tyr Trp Phe Tyr His Glu Asp Asp Thr Leu Gly Asn Ile Ser Ala His Ser Gly Gly Gly A1a Ser Phe Asn Leu Ser Leu Thr Thr Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asp Asn Gly Leu Gly Ala Gln His Ser Lys Val Val Thr Leu Asn Val Thr Gly Thr Ser Arg Asn Arg Thr Gly <210> 25 <211> 717 , <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 25 Gly Va1 Ala Pro Lys Ala Val Leu Leu Leu Asn Pro Pro Trp Ser Thr Ala Phe Lys Gly Glu Lys Va1 Ala Leu Ile Cys Ser Ser Ile Ser His Ser Leu Ala Gln Gly Asp Thr Tyr Trp Tyr His Asp Glu Lys Leu Leu Lys Ile Lys His Asp Lys Ile Gln Ile Thr Glu Pro Gly Asn Tyr Gln Cys Lys Thr Arg Gly Ser Ser Leu Ser Asp Ala Val His Val Glu Phe Ser Pro Asp Trp Leu Ile Leu Gln Ala Leu His Pro Val Phe Glu Gly Asp Asn Val Ile Leu Arg Cys Gln Gly Lys Asp Asn Lys Asn Thr His Gln Lys Val Tyr Tyr Lys Asp G1y Lys Gln Leu Pro Asn Ser Tyr Asn Leu Glu Lys Ile Thr Val Asn Ser Val Ser Arg Asp Asn Ser Lys Tyr His Cys Thr Ala Tyr Arg Lys Phe Tyr Ile Leu Asp Ile Glu Val Thr Ser Lys Pro Leu Asn Ile Gln Val Gln Glu Leu Phe Leu His Pro Val Leu Arg Ala Ser Ser Ser Thr Pro Ile Glu Gly Ser Pro Met Thr Leu Thr Cys Glu Thr Gln Leu Ser Pro Gln Arg Pro Asp Val Gln Leu Gln Phe Ser Leu Phe Arg Asp Ser Gln Thr Leu Gly Leu Gly Trp Ser Arg Ser Pro Arg Leu Gln Ile Pro Ala Met Trp Thr Glu Asp 5er Gly 5er Tyr Trp Cys Glu Val Glu Thr Val Thr His Ser Ile Lys Lys Arg Ser Leu Arg Ser Gln Ile Arg Val Gln Arg Val Pro Val Ser Asn Val Asn Leu Glu Ile Arg Pro Thr Gly Gly Gln Leu Ile Glu Gly Glu Asn Met Val Leu Tle Cys Ser Val Ala Gln Gly Ser Gly Thr Val Thr Phe Ser Trp His Lys Glu Gly Arg Val Arg Ser Leu Gly Arg Lys Thr Gln Arg Ser Leu Leu Ala Glu Leu His Val Leu Thr Val Lys Glu Ser Asp Ala Gly Arg Tyr Tyr Cys Ala Ala Asp Asn Val His Ser Pro Ile Leu Ser Thr Trp Ile Arg Val Thr Val Arg Ile Pro Val Ser His Pro Val Leu Thr Phe Arg Ala Pro Arg Ala His Thr Val Val Gly Asp Leu Leu Glu Leu His Cys Glu Ser Leu Arg Gly Ser Pro Pro Ile Leu Tyr Arg Phe Tyr His Glu Asp Val Thr Leu Gly Asn Ser Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Ala Glu His Ser Gly Asn Tyr Ser Cys Asp Ala Asp Asn Gly Leu Gly Ala Gln His Ser His Gly Val Ser Leu Arg Val Thr Val Pro Val Ser Arg Pro Val Leu Thr Leu Arg Ala Pro Gly Ala Gln Ala Val Val Gly Asp Leu Leu G1u Leu His Cys Glu 5er Leu Arg Gly Ser Phe Pro I1e Leu Tyr Trp Phe Tyr His G1u Asp Asp Thr Leu Gly Asn Ile Ser Ala His Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Thr Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asp Asn Gly Leu Gly Ala Gln His Ser Lys Val Val Thr Leu Asn Va1 Thr Gly Thr Ser Arg Asn Arg Thr Gly Leu Thr Ala Ala Gly Ile Thr Gly Leu Va1 Leu Ser Ile Leu Val Leu A1a Ala Ala Ala Ala Leu Leu His Tyr Ala Arg Ala Arg Arg Lys Pro Gly Gly Leu Ser Ala Thr Gly Thr Ser Ser His Ser Pro Ser Glu Cys Gln Glu Pro Ser Ser Ser Arg Pro 5er Arg Ile Asp Pro Gln Glu Pro Thr His Ser Lys Pro Leu Ala Pro Met Glu Leu G1u Pro Met Tyr Ser Asn Ala Asn Pro Gly Asp Ser Asn Pro Ile Tyr Ser Gln Ile Trp Ser Ile Gln His Thr Lys Glu Asn Ser Ala Asn Cys Pro Met Met His G1n Glu His Glu G1u Leu Thr Val Leu Tyr Ser Glu Leu Lys Lys Thr His Pro Asp Asp Ser Ala Gly Glu Ala Ser Ser Arg Gly Arg Ala His Glu Glu Asp Asp Glu Glu Asn Tyr Glu Asn Val Pro Arg Val Leu Leu Ala Ser Asp His <210> 26 <211> 104 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 26 Arg Ser Trp Arg Lys Ala Gly Pro Leu Pro Ser Gln Ile Pro Pro Thr Ala Pro Gly Gly Glu Gln Cys Pro Leu Tyr Ala Asn Val His His Gln Lys Gly Lys Asp Glu Gly Val Val Tyr Ser Val Val His Arg Thr Ser Lys Arg Ser Glu Ala Arg Ser Ala Glu Phe Thr Val Gly Arg Lys Asp Ser Ser Ile Ile Cys Ala Glu Val Arg Cys Leu Gln Pro Ser Glu Val Ser Ser Thr G1u Val Asn Met Arg Ser Arg Thr Leu Gln Glu Pro Leu Ser Asp Cys Glu Glu Val Leu Cys <210> 27 <211> 291 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 27 Lys Thr Val Trp Leu Tyr Leu Gln Ala Trp Pro Asn Pro Val Phe Glu 1 5 10 l5 Gly Asp Ala Leu Thr Leu Arg Cys Gln Gly Trp Lys Asn Thr Pro Leu Ser Gln Val Lys Phe Tyr Arg Asp Gly Lys Phe Leu His Phe Ser Lys Glu Asn G1n Thr Leu Ser Met G1y Ala Ala Thr Val G1n Ser Arg Gly G1n Tyr Ser Cys Ser Gly Gln Val Met Tyr Ile Pro Gln Thr Phe Thr Gln Thr Ser Glu Thr Ala Met Val Gln Val Gln Glu Leu Phe Pro Pro Pro Val Leu 5er Ala Ile Pro Ser Pro Glu Pro Arg Glu Gly Ser Leu Val Thr Leu Arg Cys Gln Thr Lys Leu His Pro Leu Arg Ser Ala Leu Arg Leu Leu Phe Ser Phe His Lys Asp G1y His Thr Leu Gln Asp Arg Gly Pro His Pro Glu Leu Cys Ile Pro Gly A1a Lys Glu Gly Asp Ser Gly Leu Tyr Trp Cys Glu Va1 Ala Pro Glu Gly Gly Gln Val G1n Lys Gln Ser Pro Gln Leu Glu Val Arg Val Gln Ala Pro Val Ser Arg Pro Val Leu Thr Leu His His Gly Pro Ala Asp Pro Ala Val G1y Asp Met Va1 Gln Leu Leu Cys Glu Ala Gln Arg Gly Ser Pro Pro Ile Leu Tyr Ser Phe Tyr Leu Asp Glu Lys Ile Val Gly Asn His Ser Ala Pro Cys Gly Gly Thr Thr Ser Leu Leu Phe Pro Val Lys Ser Glu Gln Asp Ala Gly Asn Tyr Ser Cys Glu Ala Glu Asn Ser Val Ser Arg Glu Arg Ser Glu Pro Lys Lys Leu Ser Leu Lys Gly Ser Gln Val Leu Phe Thr Pro Ala Ser Asn <210> 28 <211> 919 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 28 Lys Thr Val Trp Leu Tyr Leu Gln Ala Trp Pro Asn Pro Val Phe Glu Gly Asp Ala Leu Thr Leu Arg Cys Gln Gly Trp Lys Asn Thr Pro Leu Ser Gln Val Lys Phe Tyr Arg Asp Gly Lys Phe Leu His Phe Ser Lys Glu Asn Gln Thr Leu Ser Met Gly Ala Ala Thr Val Gln Ser Arg Gly Gln Tyr Ser Cys Ser G1y Gln Val Met Tyr Ile Pro Gln Thr Phe Thr Gln Thr Ser Glu Thr Ala Met Val Gln Val Gln Glu Leu Phe Pro Pro Pro Val Leu Ser Ala Tle Pro Ser Pro Glu Pro Arg Glu Gly Ser Leu Val Thr Leu Arg Cys Gln Thr Lys Leu His Pro Leu Arg Ser Ala Leu Arg Leu Leu Phe Ser Phe His Lys Asp Gly His Thr Leu Gln Asp Arg Gly Pro His Pro Glu Leu Cys Ile Pro Gly Ala Lys Glu Gly Asp Ser Gly Leu Tyr Trp Cys Glu Val Ala Pro Glu Gly Gly Gln Val Gln Lys Gln Ser Pro Gln Leu Glu Va1 Arg Val Gln Ala Pro Val Ser Arg Pro Val Leu Thr Leu His His Gly Pro Ala Asp Pro Ala Val Gly Asp Met Val Gln Leu Leu Cys Glu Ala G1n Arg Gly Ser Pro Pro Ile Leu Tyr Ser Phe Tyr Leu Asp Glu Lys Tle Val Gly Asn His Ser Ala Pro Cys G1y Gly Thr Thr Ser Leu Leu Phe Pro Val Lys Ser Glu Gln Asp Ala Gly Asn Tyr Ser Cys Glu Ala Glu Asn Ser Val Ser Arg Glu Arg Ser Glu Pro Lys Lys Leu Ser Leu Lys Gly Ser Gln Val Leu Phe Thr Pro Ala Ser Asn Trp Leu Val Pro Trp Leu Pro Ala Ser Leu Leu Gly Leu Met Val Tle Ala Ala Ala Leu Leu Val Tyr Val Arg Ser Trp Arg Lys Ala Gly Pro Leu Pro Ser Gln Ile Pro Pro Thr Ala Pro Gly Gly Glu Gln Cys Pro Leu Tyr Ala Asn Val His His Gln Lys Gly Lys Asp Glu Gly Val Val Tyr Ser Val Val His Arg Thr Ser Lys Arg Ser Glu Ala Arg Ser Ala Glu Phe Thr Val Gly Arg Lys Asp Ser Ser Ile Tle Cys Ala Glu Val Arg Cys Leu Gln Pro Ser Glu Val 5er Ser Thr Glu Val Asn Met Arg Ser Arg Thr Leu Gln Glu Pro Leu Ser Asp Cys Glu Glu Val Leu Cys <210> 29 <211> 16 <212> PRT
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence:/note =
synthetic construct <400> 29 Met Leu Pro Arg Leu Leu Ile Cys Ala Cys Glu Leu Leu Pro Leu Pro <210> 30 <211> 19 <212> PRT
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence:/note =
synthetic construct <400> 30 Ser Leu Leu Val Ile Phe Asp Thr Glu Leu Tr Ala Val Gln t L
p 10 15 eu Me Ala Asp Ser <210> 31 <211> 17 <212> PRT
<2l3> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 31 Met Leu Leu Trp Leu Leu Leu Leu Ile Leu Thr Pro Gly Arg Glu Gln Ser <210> 32 <211> 15 <212> PRT
Figure 2 shows the structural and sequence diversity of FcRHl, FcRH2, and FcRH3. Figure 2A is a schematic representation of FcRH molecules. The three cDNAs encode type I transmembrane proteins with similar extracellular domains, but different cytoplasmic regions. The extracellular (EC) regions contain different numbers of C2-like Ig domains and potential sites of N-linked glycosylation. The transmembrane (TM) domains are uncharged, except in the case of FcRHl. The cytoplasmic (CY) region of FcRHl contains two ITAMs (light gray boxes) and one ITAM-like region (small, lined box), whereas FcRH2 contains one ITAM and two ITIMs (dark gray boxes). FcRH3 has a long cytoplasmic tail with one ITAM, one IT1M, and an ITAM-like region. The amino acid length of each region is indicated. Figure 2B shows the multiple alignment comparison of FcRHl, FcRH2, and FcRH3 amino acid sequences (one-letter code) based on the FcRH3 sequence. Amino acid identity is represented by dots, and gaps are indicated by dashes. Predicted N-linked glycosylation sites and transmembrane domains are underlined in black. Consensus ITAM (bold) and ITIM (bold, underlined) motifs are indicated. Putative structural domains are labeled: SP, signal peptide; EC, extracellular domain; MP-TM, membrane proximal-transmembrane; and CY, cytoplasmic regions.
Amino acid lengths are indicated in parentheses.
Figure 3 shows a composite analysis of the extracellular homology among FcRH and FcR family members. Pairwise analysis of individual Ig-like subunits was performed with the CLUSTAL method algorithm using FcRH3 as the index of comparison. Individual homologous domains are coded to indicate relatedness.
Percent amino acid identities for related domains are indicated and aligned in relation to the comparative FcRH3 subunit. The amino acid identity for the membrane proximal domains (light gray subunits) of FcRHS are provided as the range of identity for all individually related domains. Comparisons that are not applicable are left blank. Amino acid sequences were derived from IRTA1 (GenBank accession no. AF343659), IRTA2 (GenBank accession no. AF34364), moFcRH (GenBank accession no. AAG28775) FcyRI (GenBank accession no. AAA35678), FcyRII (Swiss-Prot accession no.
P31994), Fc~yRIII (Swiss-Prot accession no. P08637), FcsRI (Swiss-Prot accession no.
P12319), and FcaRI (Swiss-Prot accession no. P24071).
Figure 4 shows the relative location of the mouse FcR family. Location is indicated in reference to the human FcR related genes at Ch 1q21-23 and their orthologous loci on mouse Ch 3 and Ch 1. The microsatellite marker d3Mitl87 is located within moFcRHl.
Figure 5 shows the multiple alignment comparisons of huFcRHI-5 and mouse FcRHl and 2 amino acid sequences (one-letter code) based on the FcRH3 sequence.
Amino acid gaps are indicated by dashes. Consensus ITAM (underlined) and ITIM
(italic, underlined) motifs are indicated. Amino acid lengths are indicated in parentheses.
Figure 6 shows domains marked to indicate relatedness of the Ig-like subunits.
Ig-like domain homology was determined by generation of a phylogenetic tree using DNAStar software with the CLUSTAL program and assigning arbitrary colors to individual Ig-domains of a given branch. Amino acid identities for full length, extracellular and, cytoplasmic domain comparisons are based on huFcRH3.
Closest cytoplasmic relatives are indicated in parentheses. Most identical extracellular comparisons between mouse and human relatives are highlighted in horizontal lines.
Comparisons that are not applicable are left blank.
Figure 7 shows the domains of huFcRHI-6, moFcRHl-3 and related proteins.
Domains are colored to indicate relatedness of the Ig-like subunits. Ig-like domain homology was determined by generation of a phylogenetic tree using DNAStar software with the CLUSTAL program and assigning arbitrary colors to individual Ig-domains of a given branch. Amino acid identities for full length, extracellular and, cytoplasmic domain comparisons are based on huFcRH3. Closest cytoplasmic relatives are indicated in parentheses. Most identical extracellular comparisons between mouse and human relatives are highlighted in red. Comparisons that are not applicable are left blank.
Figure 8 shows the structural characteristics of the mouse FcRH isoforms.
DESCRIPTION OF THE PREFERRED EMBODIMENTS
The present invention may be understood more readily by reference to the following detailed description of preferred embodiments of the invention and the Examples included therein and to the Figures and their previous and following description.
In this specification and in the claims that follow, reference will be made to a number of terms that shall be defined to have the following meanings:
As used in the specification and the appended claims, the singular forms "a,"
"an" and "the" include plural referents unless the context clearly dictates otherwise.
Thus, for example, reference to a receptor includes mixtures of various receptors, reference to "a pharmaceutical carrier" includes mixtures of two or more such carriers, and the like.
Ranges may be expressed herein as from "about" one particular value, and/or to "about" another particular value. When such a range is expressed, another embodiment includes from the one particular value and/or to the other particular value.
Similarly, when values are expressed as approximations, by use of the antecedent "about,"
it will be understood that the particular value forms another embodiment. It will be further understood that the endpoints of each of the ranges are significant both in relation to the other endpoint, and independently of the other endpoint.
"Optional" or "optionally" means that the subsequently described event or circumstance may or may not occur, and that the description includes instances where said event or circumstance occurs and instances where it does not. For example, the phrase "optionally contains two ITAM consensus motifs" means that the two ITAMs may or may not be present and that the description includes both the presence and absence of two ITAM consensus motifs.
As used throughout, by "subject" is meant an individual. Preferably, the subject is a mammal such as a primate, and, more preferably, a human. The term "subject"
can include domesticated animals, such as cats, dogs, etc., livestock (e.g., cattle, horses, pigs, sheep, goats, etc.), and laboratory animals (e.g., mouse, rabbit, rat, guinea pig, etc.).
By "isolated nucleic acid" is meant a nucleic acid the structure of which is not identical to that of the naturally occurring nucleic acid or to that of any fragment of the naturally occurring genomic nucleic acid spanning more than three separate genes. The term therefore covers, for example, (a) a DNA which has the sequence of part of the naturally occurring genomic DNA molecules but is not flanked by both of the coding sequences that flank that part of the molecule in the genome of the organism in which it naturally occurs; (b) a nucleic acid incorporated into a vector or into the genomic DNA
of a prokaryote or eukaryote in a manner such that the resulting molecule is not identical to any naturally occurnng vector or genomic DNA; (c) a separate molecule such as cDNA, a genomic fragment, a fragment produced by polymerase chain reaction, or a restriction fragment; and (d) a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene encoding a fusion protein.
By "label" is meant any detectable tag that can be attached directly (e.g., a fluorescent molecule integrated into a polypeptide or nucleic acid) or indirectly (e.g., by way of binging to a primary antibody a secondary antibody with an integrated fluorscent molecule) to the molecule of interest. A "label" is any tag that can be visualized with imaging methods. The detectable tag can be a radio-opaque substance, radiolabel, a fluorescent label, or a magnetic label. The detectable tag can be selected from the group consisting of gamma-emitters, beta-emitters, and alpha-emitters, gamma-emitters, positron-emitters, X-ray-emitters and fluorescence-emitters suitable for localization. Suitable fluorescent compounds include fluorescein sodium, fluorescein isothiocyanate, phycoerythrin, and Texas Red sulfonyl chloride. See, de Belder & Wik (Preparation and properties of fluorescein-labelled hyaluronate. Carbohydr.
Res.44(2):251-57 (1975). Those skilled in the art will know, or will be able to ascertain with no more than routine experimentation, other fluorescent compounds that are suitable for labeling the molecule.
Polyneptides The invention provides members of a cluster of FcR and FcR gene relatives encoded by genes in the human chromosome 1q21-23 region, or analogous region in non-human subjects, including for example, chromosome 3 in mouse. A consensus amino acid motif, based on the FcyRI, FcyRII, Fc~yRIII, and pIgR extracellular regions, was used in a GenBank protein database query to identify member of the gene subfamily. Genomic clones were identified that were found to contain FcR
relatives and are termed the Fc receptor homolog (FcRH) subfamily: specifically, FcRHI, FcRH2, FcRH3, and FcRH6. Also, found were mouse Fc receptor homologs designated moFcRl, 2, and 3.
By "homologous" is meant about 25% percent homology or greater. Homology is also characterized by proximity in the location of the genes and by similarities as identified in a composite analysis. As used herein, "percent homology" of two amino acid sequences or of two nucleic acid sequences is determined using the algorithm of Karlin and Altschul (Proc. Natl. Acad. Sci. USA 87:2264-2268 (1990)). Such an algorithm is incorporated into the NBLAST and XBLAST programs of Altschul et al.
(J. Mol. Biol. 215:403-410 (1990)). BLAST nucleotide searches are performed with the NBLAST program, score 100, wordlength = 12, to obtain nucleotide sequences homologous to a nucleic acid molecule of the invention. BLAST protein searches are performed with the XBLAST program, score = 50, wordlength = 3, to obtain amino acid sequences homologous to a reference polypeptide . To obtain gapped alignments for comparison purposes, Gapped Blast is utilized as described in Altschul et al. (Nucl.
Acids Res. 25: 3389-3402 (1997)). When utilizing BLAST and Gapped BLAST
programs, the default parameters of the respective programs (e.g., XBLAST and NBLAST) are used. See http://ww.ncbi.nlm.nih.gov.
By "FcRH" is meant a Type I transmembrane receptor, or an alternatively spliced form thereof, including, for example, a secreted form or a GPI-anchored form, with homology to the classical Fc receptor family. In a preferred embodiment, the FcRH shows homology with the extracellular regions of Fc~yRI, FcyRII, Fc~yRIII, or pIgR. More specifically, the FcRH shows homology with an amino acid sequence corresponding with the amino terminal sequences of the second Ig domains of the Fc~yRs and the third Ig domain of pIgR or Fc~yRHl. The FcRH can comprise an extracellular region, a transmembrane region, and a cytoplasmic region. The extracellular region preferably comprises one or more Ig domains, and more preferably less than 9, and even more preferably less than 7 or less than 8 Ig domains.
Preferably, the cytoplasmic region comprises more than 107 (including more than 108, 109, 110, 111, 112, 113, 114, 115, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, or 145 amino acids).
Alternatively, the cytoplasmic region comprises less than 104 amino acids (including less than 103, 102, 101, 100, 99, 98, 97, 96, 95, 94, 93, 92, 91, 90, 89, 88, 87, 86, 85, 84, 83, 82, 81, 80). The cytoplasmic region preferably comprises one or more immunoreceptor tyrosine-based inhibitory or activation motifs ("ITIMs" or "ITAMs).
The invention provides isolated FcRHs (e.g., huFcRH 1, 2, 3, and 6, and moFcRHl-3, as described in detail below), as well as fragments and isoforms thereof.
The isolated amino acid sequences provided herein optionally are combined with a human signal sequence (e.g., MLPRLLLLICAPLCEP (SEQ ID N0:29), MLLWSLLVIFDAVTEQADS (SEQ ID NO:30), MLLWLLLLILTPGREQS (SEQ ID
N0:31), MLLWTAVLLFVPCVG (SEQ ID N0:32)) or a mouse signal sequence (e.g., MPLCLLLLVFAPVGVQS (SEQ ID N0:69), MLPWLLLLICALPCEPA (SEQ ID
N0:72), MSGSFSPCVVFTQMWLTLLVVTPVN (SEQ ID NO:79)).
In one embodiment, the invention provides huFcRHl and its fragments and isoforms. Thus, in one embodiment of the isolated FcRH, the extracellular region comprises less than four Ig domains. Preferably, the cytoplasmic region comprises less than 104 amino acids and, even more preferably, comprises less than 104 and more than 86 amino acids. In one embodiment, the transmembrane region comprises an acidic amino acid (e.g., glutamate or aspartate). The isolated FcRH of the invention comprises a cytoplasmic region having the amino acid sequence of SEQ ID NO:1, in the presence or absence of conservative amino acid substitutions. Further provided is the isolated FcRH, wherein the extracellular region comprises the amino acid sequence of SEQ ID
N0:21, in the presence or absence of conservative amino acid substitutions, and in the presence and absence of a signal sequence. More specifically, the isolated FcRH
comprises the amino acid sequence of SEQ ~ N0:2, in the presence or absence of conservative amino acid substitutions, and in the presence or absence of a signal sequence. In one embodiment the signal sequence is MLPRLLLLICAPLCEP (SEQ ID
N0:29). In a preferred embodiment, the FcRH of the invention is expressed by myeloid cells (e.g., granulocytes and monocytes). Additional characteristics of the full length FcRHl include a predicted molecular weight of about 46-47 kDaltons; about 425-(e.g., 429) amino acids in length with about 35 strongly basic(+) amino acids (I~,R), about 45 strongly acidic(-) amino acids (D,E), about 144 hydrophobic amino acids (A,I,L,F,W,V), and about127 polar amino acids (N,C,Q,S,T,I~; a predicted isolelectric point of about 5-5.5 (e.g., 5.310); and charge of about -9 at PH 7Ø
In another embodiment, the invention provides an isolated FcRH corresponding to huFcRH2, its fragments, or isoforms. Thus, the invention provides a FcRH
wherein the cytoplasmic region comprises less than 99 amino acids (e.g., 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98) and wherein the receptor further 5 comprises an extracellular domain with up to four Ig domains and up to five N-linked glycosylation sites. More specifically, the isolated FcRH has a cytoplasmic region that comprises the amino acid sequence of SEQ ID N0:3, in the presence or absence of conservative amino acid substitutions, or an extracellular region comprising SEQ ID
N0:22, in the presence or absence of conservative amino acid substitutions, and in the 10 presence or absence of a signal sequence. Even more specifically, the isolated FcRH
comprises the amino acid sequence of SEQ ID N0:4, in the presence or absence of conservative amino acid substitutions, and in the presence or absence of a signal sequence. In one embodiment, the signal sequence WSLLVIFDAVTEQADS (SEQ ID
N0:30). Additional characteristics of the full length FcRHl include a predicted molecular weight of about 50-60 kDaltons; about 495-515 (e.g., 508) amino acids in length with about 44 strongly basic(+) amino acids (K,R), about 49 strongly acidic(-) amino acids (D,E), about 175 hydrophobic amino acids (A,I,L,F,W,V), and about polar amino acids (N,C,Q,S,T,Y); a predicted isolelectric point of about 6-6.5 (e.g., 6.188); and charge of about -4 at PH 7Ø
In another embodiment, the invention provides huFcRH3, its fragments, and isoforms. More specifically, the invention provides an isolated FcRH having a cytoplasmic region that comprises more than 107 amino acids (e.g., 108, 109, 110, 11 l, 112, 113, 114, 115, 116, 117, 118, 119, 120, 212, 122, 123, 124, 125, 126 ,127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 247, 148, 149, 150 amino acids). Optionally, the isolated FcRH has a cytoplasmic region comprising one ITAM and one ITIM. More specifically, the cytoplasmic region comprises the amino acid sequence of SEQ ID NO:S or SEQ ID N0:23, in the presence or absence of conservative amino acid substitutions. In one embodiment, the extracellular domain of the FcRH comprises the amino acid sequence of SEQ ID
NO:24, in the presence or absence of conservative amino acid substitutions, and in the presence or absence of a signal sequence. Also provided is an isolated FcRH
comprising the amino acid sequence of SEQ ID NO:6 or SEQ ID N0:25, in the presence or absence of one or more amino acid substitutions, and in the presence or absence of a signal sequence. In one embodiment the signal sequence comprises MLLWLLLLILTPGREQS (SEQ ID N0:31). Additional characteristics of the full length FcRHl include a predicted molecular weight of about 80-90 kDaltons;
about 725-740 (e.g., 734) amino acids in length with about 68 strongly basic(+) amino acids (K,R), about 75 strongly acidic(-) amino acids (D,E), about 232 hydrophobic amino acids (A,I,L,F,W,V), and about 224 polar amino acids (N,C,Q,S,T,Y); a predicted isolelectric point of about 6.5-7.0 (e.g., 6.852); and charge of about -2 at PH 7Ø
The invention further provides an isolated huFcRH6, its fragments, and isoforms. More specifically, the FcRH comprises a cytoplasmic region having the amino acid sequence of SEQ ID N0:26, in the presence or absence or one or more conservative amino acid substitutions. The extracellular domain comprises the amino acid sequence of SEQ ff~ N0:27, in the presence or absence of conservative amino acid substitutions, and in the presence or absence of a signal sequence. Also, provided by the invention is a FcRH having the amino acid substitutions of SEQ ID N0:28, in the presence or absence of conservative amino acid substitutions, and in the presence or absence of a signal sequence. In one embodiment the signal sequence is MLLWTAVLLFVPCVG (SEQ ff~ N0:32).
The invention further provides a polypeptide comprising the amino acid sequence of SEQ ID N0:1, 21, 2, 3, 22, 4, 5, 23, 24, 6, 25, 26, 27, or 28, in the presence or absence of conservative amino acid substitutions. The invention also provides a polypeptide having at least 80, 85, 90, or 95% homology with SEQ ID NOs: l, 21, 2, 3, 22, 4, 5, 23, 24, 6, 25, 26, 27, or 28.
The invention further provides an isolated moFcRHl isoform, its fragments, and isofornzs. The moFcRHI is an isoform of SEQ 11? N0:68. More specifically, the FcRH comprises four Ig domains, optionally having the sequence of SEQ ff~ N0:
70, in the presence or absence or one or more conservative amino acid substitutions, and in the presence or absence of a signal sequence (e.g., the sequence of SEQ ff~
N0:71).
The invention further provides an isolated moFcRH2, its fragments, and isoforms. The provided isoforms include one isoform with a transmembrane region and one isoform lacking the transmembrane region. More specifically, the FcRH
comprises a cytoplasmic region having the amino acid sequence of SEQ ID N0:76, in the presence or absence or one or more conservative amino acid substitutions. The extracellular domain comprises the amino acid sequence of SEQ ID N0:74, in the presence or absence of conservative amino acid substitutions, and in the presence or absence of a signal sequence. Also, provided by the invention is a FcRH having the amino acid sequence of SEQ ID N0:73, which comprises a transmembrane region, or SEQ ID
N0:77, which lacks the transmembrane region. In each case, the FcRH sequence can include the presence or absence of conservative amino acid substitutions, and the presence or absence of a signal sequence. In one embodiment the signal sequence is the sequence of SEQ ID N0:72.
The invention also provided a moFcRH3, its fragments and isoforms. The cytoplasmic region can comprise the amino acid sequence of SEQ ID N0:81, in the presence or absence of conservative amino acid substitutions. Optionally, the extracellular domain comprises the amino acid sequence of SEQ ff~ N0:80, in the presence or absence of conservative amino acid substitutions or in the presence or absence of a signal sequence (e.g., the sequence of SEQ ID N0:79). The full length sequence optionally has the amin oacid sequence of SEQ m N0:78, in the presence or absence of conservative amino acid substitutions or in the presence or absence of a signal sequence (e.g., the sequence of SEQ ID N0:79).
Fragments, variants, or isoforms of the FcRHs of the invention are provided.
It is understood that these terms include functional variants. Fragments can include the cytoplasmic region, the extracellular region, the transmembrane region or any portion of at least 10 amino acids or any combination of the regions or portions. The variants are produced by making amino acid substitutions, deletions, and insertions, as well as post-translational modifications. Variations in post-translational modifications can include variations in the type or amount of carbohydrate moieties of the protein core or any fragment or derivative thereof. Variations in amino acid sequence may arise naturally as allelic variations (e.g., due to genetic polymorphism) or may be produced by human intervention (e.g., by mutagenesis of cloned DNA sequences), such as induced point, deletion, insertion and substitution mutants. These modifications can result in changes in the amino acid sequence, provide silent mutations, modify a restriction site, or provide other specific mutations.
Amino acid sequence modifications fall into one or more of three classes:
substitutional, insertional or deletional variants. Insertions include amino and/or carboxyl tennii~al fusions as well as intrasequence insertions of single or multiple amino acid residues. Insertions ordinarily will be smaller insertions than those of amino or carboxyl terminal fusions, for example, on the order of one to four residues.
Deletions are characterized by the removal of one or more amino acid residues from the protein sequence. Typically, no more than about 2 to 6 residues are deleted at any one site within the protein molecule. These variants ordinarily are prepared by site-specific mutagenesis of nucleotides in the DNA encoding the protein, thereby producing DNA
encoding the variant, and thereafter expressing the DNA in recombinant cell culture.
Techniques for making substitution mutations at predetermined sites in DNA
having a lcnown sequence are well known and include, for example, M13 primer mutagenesis and PCR mutagenesis. Amino acid substitutions are typically of single residues but may include multiple substitutions at different positions; insertions usually will be on the order of about from 1 to 10 amino acid residues but can be more; and deletions will range about from 1 to 30 residues, but can be more. Deletions or insertions preferably are made in adjacent pairs, i.e. a deletion of 2 residues or insertion of 2 residues.
Substitutions, deletions, insertions or any combination thereof may be combined to arrive at a final construct. The mutations must not place the sequence out of reading frame and preferably will not create complementary regions that could produce secondary mRNA structure. Substitutional variants are those in which at least one residue has been removed and a different residue inserted in its place. Such substitutions generally are made in accordance with Table 1 and are referred to as conservative substitutions.
TABLE l: Amino Acid Substitutions Original ResidueExemplary Substitutions Ala Ser Arg Lys Asn Gln Asp Glu Cys S er Gln Asn Glu Asp Gly Pro His Gln Tle Leu; Val Leu Ile; Val Lys Arg; Gln Met Leu; Ile Phe Met; Leu; Tyr Ser T~' T~. Ser Trp Tyr Tyt. Trp; Phe Val Ile; Leu Substantial changes in function or immunological identity are made by selecting substitutions that are less conservative than those in Table 1, i.e., selecting residues that differ more significantly in their effect on maintaining (a) the structure of the polypeptide backbone in the area of the substitution, for example as a sheet or helical conformation, (b) the charge or hydrophobicity of the molecule at the target site or (c) the bulk of the side chain. The substitutions that in general are expected to produce the greatest changes in the protein properties will be those in which (a) a hydrophilic residue, e.g. seryl or threonyl, is substituted for (or by) a hydrophobic residue, e.g.
leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue having an electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g., phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine, in this case, (e) by increasing the number of sites for sulfation and/or glycosylation.
Substitutional or deletional mutagenesis can be employed to insert sites for N-glycosylation (Asn-X-Thr/Ser) or O-glycosylation (Ser or Thr). Deletions of cysteine or other labile residues also may be desirable. Deletions or substitutions of potential proteolysis sites, e.g. Arg, is accomplished for example by deleting one of the basic residues or substituting one by glutaminyl or histidyl residues.
Certain post-translational derivatizations are the result of the action of recombinant host cells on the expressed polypeptide. Glutaminyl and asparaginyl residues are frequently post-translationally deamidated to the corresponding glutamyl and asparyl residues. Alternatively, these residues are deamidated under mildly acidic conditions. Other post-translational modifications include hydroxylation of proline and lysine, phosphorylation of hydroxyl groups of Beryl or threonyl residues, methylation of the o-amino groups of lysine, arginine, and histidine side chains (T.E.
Creighton, Proteins: Structure and Molecular Properties, W. H. Freeman & Co., San Francisco pp 79-86 [1983]), acetylation of the N-terminal amine and, in some instances, amidation of the C-terminal carboxyl. Modifications in the FcRH can also include modifications in glyco sylation.
In all mutational events, it is understood that the controlling aspect of the mutation is the function that the subsequent protein possesses. The preferred mutations are those that do not detectably change the desired function or that increase the desired function.
Nucleic Acids Also provided is an isolated nucleic acid that encodes the FcRH of the invention. The nucleic acid can be single or double stranded and can be RNA or DNA.
More specifically, the invention provides an isolated nucleic acid, comprising a 10 nucleotide sequence that encodes SEQ ID NO:1, SEQ ID N0:21, SEQ ID N0:2.
SEQ
ID NO: 3, SEQ ID N0:22, SEQ ID N0:4, SEQ ID NO:S, SEQ ID N0:23, SEQ ID
N0:24, SEQ ID N0:25, SEQ ID N0:26, SEQ ID N0:27, SEQ ID N0:28, or SEQ ID
N0:6, SEQ 1T7 N0:70, SEQ ID N0:73, SEQ ID N0:74, SEQ ID N0:76, SEQ ID
N0:77, SEQ ID N0:78, SEQ ID N0:80, SEQ ID N0:81, optionally with conservative 15 amino acid substitutions. Optionally the nucleic acid further encodes a signal sequence (e.g., the signal sequences of SEQ >D N0:29, 30, 31, 32, 71, 75, 79). The isolated nucleic acid optionally encodes the sequences with 80, 85, 90, or 95 %
identity. More specifically, the invention provides an isolated nucleic acid, comprising a nucleotide sequence of SEQ ID N0:7, SEQ ID N0:13, SEQ ID N0:8, SEQ ID N0:34, SEQ ID
N0:9, SEQ ~ N0:14, SEQ ID NO:10, SEQ ID N0:36, SEQ ID NO:11, SEQ ID
NO:15, SEQ ID N0:16, SEQ >D N0:12, SEQ ID N0:38, SEQ ID N0:17, SEQ ID
N0:18, SEQ ID N0:19, SEQ ID N0:20; SEQ m N0:40, SEQ ID N0:84, SEQ ID
N0:85, SEQ 11? N0:87, SEQ m N0:88, SEQ ID N0:89~, SEQ ff~ N0:90, SEQ ID
N0:91, SEQ ID N0:92, SEQ ID N0:93, SEQ ID N0:94, SEQ ID N0:95, SEQ ~
N0:96, SEQ ID N0:97, SEQ ID N0:98, SEQ ID N0:99, SEQ ID NO:100, SEQ ID
NO:101, or SEQ ID N0:102 . Optionally, the isolated nucleic acid can further included bases that encode a signal sequence and thus the nucleotide sequence encoding the extracellular region or full-length huFcRHl, 2, 3, or 6 can optionally further comprise the nucleotide sequence of SEQ ID N0:33, SEQ ID N0:35, SEQ 1177 N0:37, SEQ m N0:39. Optionally, the isolated nucleic acids for rnoFcRHs include nucleic acid sequences that encode signal sequences as well, including for example, those portions of nucleic acid sequences SEQ ID NO:101, SEQ D7 N0:97, SEQ m N0:94, SEQ ID
N0:91, SEQ ID N0:88, SEQ ID N0:84.
Preferably the nucleic acid that encodes the full length FcRHl includes about 1290 bases. The nucleic acid that encodes the full length FcRH2 includes about bases, and the nucleic acid that encodes the full length FcRH3 includes about bases.
The invention also provides an isolated nucleic acid comprising a sequence that hybridizes under stringent conditions to a hybridization probe, wherein the hybridization probe comprises the nucleotide sequence of SEQ ff~ N0:7, SEQ m N0:13, SEQ m N0:8, SEQ m N0:34, SEQ m N0:9, SEQ m N0:14, SEQ ff~ NO:10, SEQ m N0:36, SEQ m NO:11, SEQ m NO:15, SEQ m NO:16, SEQ m N0:12, SEQ
m N0:38, SEQ m N0:17, SEQ m N0:18, SEQ m N0:19, and SEQ m N0:20; SEQ
ff~ N0:40, SEQ ID N0:84, SEQ m N0:85, SEQ m N0:87, SEQ m N0:88, SEQ m N0:89, SEQ m NO:90, SEQ m N0:91, SEQ ff~ NO:92, SEQ m N0:93, SEQ m N0:94, SEQ B7 N0:95, SEQ m N0:96, SEQ m N0:97, SEQ m N0:98, SEQ m NO:99, SEQ m NO:100, SEQ m NO:101, or SEQ m N0:102, or the complement of either sequence.
Further provided is a single stranded nucleic acid that hybridizes under stringent conditions to a nucleic acid having the sequence of SEQ m NO:7, SEQ m N0:13, SEQ m N0:8, SEQ m NO:34, SEQ ~ N0:9, SEQ m N0:14, SEQ m NO:10, SEQ
m NO:36, SEQ ff) NO:11, SEQ m NO:15, SEQ m N0:16, SEQ ff~ N0:12, SEQ m N0:38, SEQ ~ N0:17, SEQ m NO:18, SEQ ff~ N0:19, and SEQ m N0:20; SEQ m N0:40, SEQ m N0:84, SEQ m N0:85, SEQ m NO:87, SEQ m N0:88, SEQ m N0:89, SEQ m N0:90, SEQ m N0:91, SEQ ~ N0:92, SEQ m N0:93, SEQ m N0:94, SEQ m N0:95, SEQ m N0:96, SEQ m N0:97, SEQ m N0:98, SEQ m N0:99, SEQ m NO:100, SEQ m NO:101, or SEQ m N0:102.
By "hybridizing under stringent conditions" or "hybridizing under highly stringent conditions" is meant that the hybridizing portion of the hybridizing nucleic acid, typically comprising at least 15 (e.g., 20, 25, 30, or 50 nucleotides), hybridizes to all or a portion of the provided nucleotide sequence under stringent conditions. The term "hybridization" typically means a sequence driven interaction between at least two nucleic acid molecules, such as a primer or a probe and a gene. Sequence driven interaction means an interaction that occurs between two nucleotides or nucleotide analogs or nucleotide derivatives in a nucleotide specific manner. For example, G
interacting with C or A interacting with T are sequence driven interactions.
Typically sequence driven interactions occur on the Watson-Crick face or Hoogsteen face of the nucleotide. The hybridization of two nucleic acids is affected by a number of conditions and parameters known to those of skill in the art. For example, the salt concentrations, pH, and temperature of the reaction all affect whether two nucleic acid molecules will hybridize. Generally, the hybridizing portion of the hybridizing nucleic acid is at least 80%, for example, at least 90%, 95%, or 98%, identical to the sequence of or a portion of a nucleic acid encoding an FcRH of the invention, or its complement.
Hybridizing nucleic acids of the invention can be used, for example, as a cloning probe, a primer (e.g., for PCR), a diagnostic probe, or an antisense probe.
Hybridization of the oligonucleotide probe to a nucleic acid sample typically is performed under stringent conditions. Nucleic acid duplex or hybrid stability is expressed as the melting temperature or Tm, which is the temperature at which a probe dissociates from a target DNA. This melting temperature is used to define the required stringency conditions. If sequences are to be identified that are related and substantially identical to the probe, rather than identical, then it is useful to first establish the lowest temperature at which only homologous hybridization occurs with a particular concentration of salt (e.g., SSC or SSPE). Assuming that a 1% mismatch results in a 1°C decrease in the Tm, the temperature of the final wash in the hybridization reaction is reduced accordingly (for example, if sequence having >95% identity with the probe are sought, the final wash temperature is decreased by 5 °C). In practice, the change in Tm can be between 0.5 °C and 1.5 °C per 1% mismatch. Stringent conditions involve hybridizing at 68 °C in Sx SSC/Sx Denhardt's solution/1.0% SDS, and washing in 0.2x SSC/0.1% SDS at room temperature. Moderately stringent conditions include washing in 3x SSC at 42 °C. The parameters of salt concentration and temperature can be varied to achieve the optimal level of identity between the probe and the target nucleic acid.
Additional guidance regarding such conditions is readily available in the art, for example, in Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, NY; and Ausubel et al. (eds.), 1995, Current Protocols in Molecular Biology, (John Wiley & Sons, NY) at Unit 2.10.
The nucleic acids of the present invention are optionally labeled, directly or indirectly. Such labeled nucleic acids are useful in various diagnostic techniques including for example, izz situ hybridization, FISH, izz situ PCR, and PRINS.
Both methods involve the preparation of short sequences of single-stranded nucleic acid probes that are complementary to the nucleic acid sequences that encode an FcRH.
See, e.g., M Andreeff and D Pinkel (1999), An Introduction to Flourescent In-Situ Hybridization: Principles and Clinical Applications, John Wiley ~ Sons, Ltd;
Roche Applied Sciences (2000), Nonradioactive In Situ Hybridization Application Manual;
Roche Applied Sciences (1999), PCR Manual, 2d edition, which are incorporated in their entirety for methods of using nucleic acids.
Vectors cells and methods of using Also provided is an expression vector comprising a nucleic acid of the invention, wherein the nucleic acid is operably linked to an expression control sequence. A wide variety of expression system/regulatory sequence combinations may be employed in expressing the disclosed. Such useful regulatory sequences include, for example, the early or late promoters of SV40, CMV, vaccinia, polyoma or adenovirus, the lac system, the trp system, the TAC system, the TRC system, the LTR
system, the major operator and promoter regions of phage lambda, the control regions of fd coat protein, the promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, the promoters of acid phosphatase (for example, PhoS), the AOX 1 promoter of methylotrophic yeast, the promoters of the yeast a-mating factors, and other sequences known to control the expression of genes of prokaryotic or eukaryotic cells or their viruses, and various combinations thereof.
Such an expression vector can be designed to be expressed by eukaryotic cells or prokaryotic cells. The vectors of the present invention thus provide DNA
molecules which are capable of integration into a prokaryotic or eukaryotic chromosome and expression. The inserted genes in viral and retroviral vectors usually contain promoters, and/or enhancers to help control the expression of the desired gene product. A
promoter is generally a sequence or sequences of DNA that function when in a relatively fixed location in regard to the transcription start site. A
promoter contains core elements required for basic interaction of RNA polymerase and transcription factors, and may contain upstream elements and response elements. It has been shown that all specific regulatory elements can be cloned and used to construct expression vectors that are selectively expressed in specific cell types. For example, the glial fibrillary acetic protein (GFAP) promoter has been used to selectively express genes in cells of glial origin. Expression vectors used in eukaryotic host cells (e.g., yeast, fungi, insect, plant, animal, human or nucleated cells) may also contain sequences necessary for the ternzination of transcription which may affect mRNA expression. These regions are transcribed as polyadenylated segments in the untranslated portion of the mRNA
encoding tissue factor protein. The 3' untranslated regions also include transcription termination sites. It is preferred that the transcription unit also contain a polyadenylation region. One benefit of this region is that it increases the likelihood that the transcribed unit will be processed and transported like mRNA. The identification and use of polyadenylation signals in expression constl-ucts is well established. It is preferred that homologous polyadenylation signals be used in the transgene constructs.
In certain transcription units, the polyadenylation region is derived from the SV40 early polyadenylation signal and consists of about 400 bases. It is also preferred that the transcribed units contain other standard sequences alone or in combination with the above sequences improve expression from, or stability of, the construct.
The invention further provides transfer vectors, which include any nucleotide construction used to deliver genes into cells (e.g., a plasmid), or as part of a general strategy to deliver genes, e.g., as part of recombinant retrovirus or adenovirus (Ram et al. Cancer Res. 53:83-88, (1993)). As used herein, plasmid or viral vectors are agents that transport the disclosed nucleic acids into the cell without degradation and include a promoter yielding expression of the gene in the cells into which it is delivered. In some embodiments the FcRHs are derived from either a virus or a retrovirus. Viral vectors include, for example, Adenovirus, Adeno-associated virus, Herpes virus, Vaccinia virus, Polio virus, AIDS virus, neuronal trophic virus, Sindbis and other RNA
viruses, including these viruses with the HIV backbone. Also preferred are any viral families that share the properties of these viruses that make them suitable for use as vectors.
Retroviruses include Murine Maloney Leukemia virus, MMLV, and retroviruses that express the desirable properties of MMLV as a vector. Retroviral vectors are able to carry a larger genetic payload, i.e., a transgene or marker gene, than other viral vectors, and for this reason are a commonly used vector. However, they are not as useful in non-proliferating cells. Adenovirus vectors are relatively stable and easy to work with, have high titers, and can be delivered in aerosol formulation, and can transfect non-dividing cells. Pox viral vectors are large and have several sites for inserting genes, they are thermostable and can be stored at room temperature. A preferred embodiment is a viral vector which has been engineered so as to suppress the immune response of the host organism, elicited by the viral antigens.
Viral vectors can have higher transaction (ability to introduce genes) abilities than chemical or physical methods to introduce genes into cells. Typically, viral 5 vectors contain, nonstructural early genes, structural late genes, an RNA
polymerase III
transcript, inverted terminal repeats necessary for replication and encapsidation, and promoters to control the transcription and replication of the viral genome.
When engineered as vectors, viruses typically have one or more of the early genes removed and a gene or gene/promotor cassette is inserted into the viral genome in place of the 10 removed viral DNA. Constructs of this type can carry up to about 8 kb of foreign genetic material. The necessary functions of the removed early genes are typically supplied by cell lines that have been engineered to express the gene products of the early genes in trans.
A retrovirus is an animal virus belonging to the virus family of Retroviridae, 15 including any types, subfamilies, genus, or tropisms. Retroviral vectors, in general, are described by Verma, LM., Retroviral vectors for gene transfer. In Microbiology-1985, American Society for Microbiology, pp. 229-232, Washington, (1985), which is incorporated by reference herein. Examples of methods for using retroviral vectors for gene therapy are described in U.S. Patent Nos. 4,868,116 and 4,980,286; PCT
20 applications WO 90/02806 and WO 89/07136; and Mulligan, (Science 260:926-(1993)); the teachings of which are incorporated herein by reference. A
retrovirus is essentially a package which has packed into it nucleic acid cargo. The nucleic acid cargo carnes with it a packaging signal, which ensures that the replicated daughter molecules will be efficiently packaged within the package coat. In addition to the package signal, there are a number of molecules that are needed in cis, for the replication, and packaging of the replicated virus. Typically a retroviral genome, contains the gag, pol, and env genes which are involved in the making of the protein coat. It is the gag, pol, and env genes which are typically replaced by the foreign DNA
that it is to be transferred to the target cell. Retrovirus vectors typically contain a packaging signal for incorporation into the package coat, a sequence which signals the start of the gag transcription unit, elements necessary for reverse transcription, including a primer binding site to bind the tRNA primer of reverse transcription, terminal repeat sequences that guide the switch of RNA strands during DNA
synthesis, a purine rich sequence 5' to the 3' LTR that serve as the priming site for the synthesis of the second strand of DNA synthesis, and specific sequences near the ends of the LTRs that enable the insertion of the DNA state of the retrovirus to insert into the host genome. The removal of the gag, pol, and env genes allows for about 8 kb of foreign sequence to be inserted into the viral genome, become reverse transcribed, and upon replication be packaged into a new retroviral particle. This amount of nucleic acid is sufficient for the delivery of a one to many genes depending on the size of each transcript. It is preferable to include either positive or negative selectable markers along with other genes in the insert.
Since the replication machinery and packaging proteins in most retroviral vectors have been removed (gag, pol, and envy, the vectors are typically generated by placing them into a packaging cell line. A packaging cell line is a cell line that has been transfected or transformed with a retrovirus that contains the replication and packaging machinery, but lacks any packaging signal. When the vector carrying the DNA of choice is transfected into these cell lines, the vector containing the gene of interest is replicated and packaged into new retroviral particles, by the machinery provided in cis by the helper cell. The genomes for the machinery are not packaged because they lack the necessary signals.
The construction of replication-defective adenoviruses has been described (Berkner et al., J. Virology 61:1213-1220 (1987); Massie et al., Mol. Cell.
Biol. 6:2872-2883 (1986); Haj-Ahmad et al., J. Virology 57:267-274 (1986); Davidson et al., J.
Virology 61:1226-1239 (1987); Zhang, Generation and identification of recombinant adenovirus by liposome-mediated transfection and PCR analysis, BioTechniques 15:868-872 (1993)). The benefit of the use of these viruses as vectors is that they are limited in the extent fo which they can spread to other cell types, since they can replicate within an initial infected cell, but are unable to form new infectious viral particles. Recombinant adenoviruses have been shown to achieve high efficiency gene transfer after direct, in vivo delivery to airway epithelium, hepatocytes, vascular endothelium, CNS parenchyma and a number of other tissue sites (Morsy, J.
Clin.
Invest. 92:1580-1586 (1993); Kirshenbaum, J. Clin. Invest. 92:381-387 (1993);
Roessler, J. Clin. Invest. 92:1085-1092 (1993); Moullier, Nature Genetics 4:154-159 (1993); La Salle, Science 259:988-990 (1993); Gomez-Foix, J. Biol. Chem.
267:25129-25134 (1992); Rich, Human Gene Therapy 4:461-476 (1993); Zabner, Nature Genetics 6:75-83 (1994); Guzman, Circulation Research 73:1201-1207 (1993);
Bout, Human Gene Therapy 5:3-10 (1994); Zabner, Cell 75:207-216 (1993);
Caillaud, Eur. J. Neuroscience 5:1287-1291 (1993); and Ragot, J. Gen. Virology 74:501-(1993)). Recombinant adenoviruses achieve gene transduction by binding to specific cell surface receptors, after which the virus is internalized by receptor-mediated endocytosis, in the same manner as wild type or replication-defective adenovirus (Chardonnet and Dales, Virology 40:462-477 (1970); Brown and Burlingham, J.
Virology 12:386-396 (1973); Svensson and Persson, J. Virology 55:442-449 (1985);
Seth, et al., J. Virol. 51:650-655 (1984); Seth, et al., Mol. Cell. Biol.
4:1528-1533 (1984); Varga et al., J. Virology 65:6061-6070 (1991); Wickham et al., Cell 73:309-319 (1993)).
A viral vector can be one based on an adenovirus which has had the E1 gene removed and these virons are generated in a cell line such as the human 293 cell line.
In another preferred embodiment both the El and E3 genes are removed from the adenovirus genome.
Another type of viral vector is based on an adeno-associated virus (AAV). This defective parvovirus is a preferred vector because it can infect many cell types and is nonpathogenic to humans. AAV type vectors can transport about 4 to 5 kb and wild type AAV is known to stably insert into chromosome 19. Vectors which contain this site specific integration property are preferred. An especially preferred embodiment of this type of vector is the P4.1 C vector produced by Avigen, San Francisco, CA, which can contain the herpes simplex virus thymidine kinase gene, HSV-tk, and/or a marker gene, such as the gene encoding the green fluorescent protein, GFP.
In another type of AAV virus, the AAV contains a pair of inverted terminal repeats (ITRs) which flank at least one cassette containing a promoter which directs cell-specific expression operably linked to a heterologous gene. Heterologous in this context refers to any nucleotide sequence or gene which is not native to the AAV or B19 parvovirus.
Typically the AAV and B19 coding regions have been deleted, resulting in a safe, noncytotoxic vector. The AAV ITRs, or modifications thereof, confer infectivity and site-specific integration, but not cytotoxicity, and the promoter directs cell-specific expression. United States Patent No. 6,261,834 is herein incorproated by reference for material related to the AAV vector.
Molecular genetic experiments with large human herpesviruses have provided a means whereby large heterologous DNA fragments can be cloned, propagated and established in cells permissive for infection with herpesviruses (Sun et al., Nature genetics 8: 33-41, 1994; Cotter and Robertson, Curr Opin Mol Ther 5: 633-644, 1999).
These large DNA viruses (herpes simplex virus (HSV) and Epstein-Barr virus (EBV), have the potential to deliver fragments of human heterologous DNA > 150 kb to specific cells. EBV recombinants can maintain large pieces of DNA in the infected B-cells as episomal DNA. Individual clones carried human genomic inserts up to 330 kb appeared genetically stable The maintenance of these episomes requires a specific EBV
nuclear protein, EBNAl, constitutively expressed during infection with EBV.
Additionally, these vectors can be used for transfection, where large amounts of protein can be generated transiently in vitro. Herpesvirus amplicon systems are also being used to package pieces of DNA > 220 kb and to infect cells that can stably maintain DNA as episomes. Other useful systems include, for example, replicating and host-restricted non-replicating vaccinia virus vectors.
The invention also provides an isolated cell comprising a vector of the invention. The isolated cell can be either a eukaryotic or prokaryotic cell, such as strains of E. coli, Pseudornonas, Bacillus , S'tr~eptomyces; fungi such as yeasts (Saechaf°orrayces , and methylotrophic yeast such as Piclaia, Gandida, Hansenula, and Torulopsis); and animal cells, such as CHO, Rl.l, B-W and LM cells, African Green Monkey kidney cells (for example, COS 1, COS 7, BSC1, BSC40, and BMT10), insect cells (for example, Sue), and human cells and plant cells in tissue culture.
Also provided is a method of making a FcRH, or a fragment or variant thereof comprising culturing a cell comprising a vector of the invention under conditions permitting expression of the FcRH. The method comprises culturing a cell comprising an exogeneous nucleic acid that encodes the FcRH, fragment, or variant, wherein the exogeneous nucleic acid is operably linked to an expression control sequence, and wherein the culture conditions permit expression of the FcRH, fragment, or variant under the control of the expression control sequence; harvesting the medium from the cultured cells, and isolating the FcRH, fragment, or varinat from the cell or culture medium. Optionally the exogenous nucleic acid is the nucleotide sequence of SEQ m N0:7, SEQ ID N0:13, SEQ ID N0:8, SEQ ID N0:34, SEQ ID N0:9, SEQ ID N0:14, SEQ ID NO:10, SEQ ID N0:36, SEQ m NO:11, SEQ ID NO:15, SEQ m N0:16, SEQ
ID N0:12, SEQ ID N0:38, SEQ )D N0:17, SEQ m N0:18, SEQ ID N0:19, and SEQ
)D N0:20; SEQ ID N0:40, SEQ )D N0:84, SEQ ID N0:85, SEQ )D N0:87, SEQ 1D
N0:88, SEQ )D N0:89, SEQ )D N0:90, SEQ m N0:91, SEQ 117 N0:92, SEQ m N0:93, SEQ )D N0:94, SEQ )D N0:95, SEQ m N0:96, SEQ >D N0:97, SEQ )D
N0:98, SEQ ff~ N0:99, SEQ m NO:100, SEQ )D NO:101, or SEQ )D N0:102 or a combination thereof. Optionally, the exogenous nucleic acid further comprises a nucleotide sequence that encodes a signal sequence. In the recombinant methods, the cell can be any known host cell, including for example, a prokaryotic or eukaiyotic cell.
The nucleic acids that are delivered to cells, generally in a plasmid or other vector, typically contain expression controlling systems. For example, the inserted genes in viral and retroviral systems usually contain promoters, and/or enhancers to help control the expression of the desired gene product.
Those skilled in the art of molecular biology will understand that a wide variety of expression systems may be used to produce recombinant FcRH polypeptides (as well as fragments, fusion proteins, and amino acid sequence variants with therapeutic activity) for use in the methods of the invention. Thus, FcRH may be produced using prokaryotic host cells (e.g., Escherichia coli) or eukaryotic host cells (e.g., Saccharomyces cerevisiae, insect cells such as Sf~ cells, or mammalian cells such as CHO cells, COS-1, NIH 3T3, or HeLa cells). These cells are commercially available from, for example, the American Type Culture Collection, Rockville, MD (see also F.
Ausubel et al., Current Protocols in Molecular Biology, John Wiley & Sons, New York, NY, 1998). The method of transformation and the choice of expression vector will depend on the host system selected. Transformation and transfection methods are described, e.g., in Ausubel et al., supra, and expression vectors may be chosen from the numerous examples known in the art.
A nucleic acid sequence encoding an FcRH is introduced into a plasmid or other vector, which is then used to transform living cells. Constructs in which a cDNA
containing the entire FcRH coding sequence, a fragment of the FcRH coding sequence, amino acid variations of the FcRH coding sequence, or fusion proteins of the aforementioned, inserted in the correct orientation into an expression plasmid, may be used for protein expression. In some cases, for example, it may be desirable to express the FcRH coding sequence under the control of an inducible or tissue-specific promoter.
Eukaryotic expression systems permit appropriate post-translational modifications to expressed proteins. Thus, eukaryotic, and more preferably mammalian expression systems, allow glycosylations patterns comparable to naturally expressed FcRH. Transient transfection of a eukaryotic expression plasmid allows the transient 5 production of FcRH by a transfected host cell. FcRH may also be produced by a stably-transfected mammalian cell line. A number of vectors suitable for stable transfection of mammalian cells are available to the public (e.g., see Pouwels et al., Cloning Vectors:
A Laboratory Manual, 1985, Supp. 1987), as are methods for constructing such cell lines (see e.g., F. Ausubel et al., Current Protocols in Molecular Biology, John Wiley &
10 Sons, New York, NY, 1998). Another preferred eukaryotic expression system is the baculovirus system using, for example, the vector pBacPAK9, which is available from Clontech (Palo Alto, CA). If desired, this system may be used in conjunction with other protein expression techniques, for example, the myc tag approach described by Evan et al. (Mol. Cell Biol. 5:3610-3616, 1985) or analogous tagging approaches, e.g., 15 using a hemagluttinin (HA) tag.
Once the recombinant protein is expressed, it can be isolated from the expressing cells by cell lysis followed by protein purification techniques such as affinity chromatography. In this example, an antibody that specifically binds to FcRH, which may be produced by methods that are well-known in the art, can be attached to a 20 column and used to isolate FcRH. Once isolated, the recombinant protein can, if desired, be purified further, e.g., by high performance liquid chromatography (HPLC;
e.g., see Fisher, Laboratory Techniques In Biochemistry And Molecular Biology, Work and Burdon, Eds., Elsevier, 1980).
25 Antibodies The invention also provides a purified antibody or immunologic fragment thereof, wherein the antibody or fragment thereof selectively binds to an FcRH. As used herein, the term "antibody" encompasses, but is not limited to, whole immunoglobulin (i.e., an intact antibody) of any class. Native antibodies are usually heterotetrarneric glycoproteins, composed of two identical light (L) chains and two identical heavy (H) chains. Typically, each light chain is linked to a heavy chain by one covalent disulfide bond, while the number of disulfide linkages varies between the heavy chains of different immunoglobulin isotypes. Each heavy and light chain also has regularly spaced intrachain disulfide bridges. Each heavy chain has at one end a variable domain (V(I~) followed by a number of constant domains. Each light chain has a variable domain at one end (V(L)) and a constant domain at its other end; the constant domain of the light chain is aligned with the first constant domain of the heavy chain, and the light chain variable domain is aligned with the variable domain of the heavy chain. Particular amino acid residues are believed to form an interface between the light and heavy chain variable domains. The light chains of antibodies from any vertebrate species can be assigned to one of two clearly distinct types, called kappa (k) and lambda (1), based on the amino acid sequences of their constant domains.
Depending on the amino acid sequence of the constant domain of their heavy chains, immunoglobulins can be assigned to different classes. There are five major classes of immunoglobulins: IgA, IgD, IgE, IgG and IgM, and several of these may be further divided into subclasses (isotypes), e.g., IgG-1, IgG-2, IgG-3, and IgG-4; IgA-1 and IgA-2. The heavy chain constant domains that correspond to the different classes of immunoglobulins are called alpha, delta, epsilon, gamma, and mu, respectively.
The term "variable" is used herein to describe certain portions of the variable domains that differ in sequence among antibodies and are used in the binding and specificity of each particular antibody for its particular antigen. However, the variability is not usually evenly distributed through the variable domains of antibodies.
It is typically concentrated in three segments called complementarity determining regions (CDRs) or hypervariable regions both in the light chain and the heavy chain variable domains. The more highly conserved portions of the variable domains are called the framework (FR). The variable domains of native heavy and light chains each comprise four FR regions, largely adopting a b-sheet configuration, connected by three CDRs, which form loops connecting, and in some cases forming part of, the b-sheet structure. The CDRs in each chain are held together in close proximity by the FR
regions and, with the CDRs from the other chain, contribute to the formation of the antigen binding site of antibodies (see Kabat E. A. et al., "Sequences of Proteins of Ixnmunological Interest" National Institutes of Health, Bethesda, Md. (1987)).
The constant domains are not involved directly in binding an antibody to an antigen, but exhibit various effector functions, such as participation of the antibody in antibody-dependent cellular toxicity.
The term "antibody or fragments thereof' can also encompass chimeric antibodies and hybrid antibodies, with dual or multiple antigen or epitope specificities, and fragments, such as F(ab')2, Fab', Fab and the like, including hybrid fragments.
Thus, fragments of the antibodies that retain the ability to bind their specific antigens are provided. For example, fragments of antibodies which maintain FcRH binding activity are included within the meaning of the term "antibody or fragment thereof."
Such antibodies and fragments can be made by techniques known in the art and can be screened for specificity and activity according to the methods set forth in the Examples and in general 'methods for producing antibodies and screening antibodies for specificity and activity (See Harlow and Lane. Antibodies, A Laboratory Manual. Cold Spring Harbor Publications, New York, (1988)).
Also included within the meaning of "antibody or fragments thereof' are conjugates of antibody fragments and antigen binding proteins (single chain antibodies) as described, for example, in U.S. Pat. No. 4,704,692, the contents of which are hereby incorporated by reference.
In one embodiment, the antibody is a monoclonal antibody. The term "monoclonal antibody" as used herein refers to an antibody obtained from a substantially homogeneous population of antibodies, i.e., the individual antibodies comprising the population are identical except for possible naturally occurnng mutations that may be present in minor amounts. The monoclonal antibodies herein specifically include "chimeric" antibodies in which a portion of the heavy and/or light chain is identical with or homologous to corresponding sequences in antibodies derived from a particular species or belonging to a particular antibody class or subclass, while the remainder of the chains) is identical with or homologous to corresponding sequences in antibodies derived from another species or belonging to another antibody class or subclass, as well as fragments of such antibodies, so long as they exhibit the desired activity (See, U.S. Pat. No. 4,816,567 and Morrison et al., Proc.
Natl. Acad. Sci.
USA, 81:6851-6855 (1984)).
Monoclonal antibodies of the invention may be prepared using hybridoma methods, such as those described by Kohler and Milstein, Nature, 256:495 (1975) or Harlow and Lane, Antibodies, A Laboratory Manual. Cold Spring Harbor Publications, New York, (1988). In a hybridoma method, a mouse or other appropriate host animal, is typically immunized with an immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind to the immunizing agent.
Alternatively, the lymphocytes may be immunized in vitro. Preferably, the immunizing agent comprises an FcRH. Traditionally, the generation of monoclonal antibodies has depended on the availability of purified protein or peptides for use as the immunogen.
More recently DNA based immunizations have shown promise as a way to elicit strong immune responses and generate monoclonal antibodies. In this approach, DNA-based immunization can be used, wherein DNA encoding a portion of FcRH, preferably the N- or C- terminal region, is injected into the host animal according to methods known in the art.
Generally, either peripheral blood lymphocytes ("PBLs") are used in methods of producing monoclonal antibodies if cells of human origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources are desired. The lymphocytes are then fused with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, "Monoclonal Antibodies: Principles and Practice" Academic Press, (1986) pp. 59-103).
Immortalized cell lines are usually transformed mammalian cells, including myeloma cells of rodent, bovine, equine, and human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells may be cultured in a suitable culture medium that preferably contains one or more substances that inhibit the growth or survival of the unfused, immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient cells.
Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression of antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, Calif. and the American Type Culture Collection, Rockville, Md. Human myeloma and mouse-human heteromyeloma cell lines also have been described for the production of human monoclonal antibodies (I~ozbor, J.
lmmunol., 133:3001 (1984); Brodeur et al., "Monoclonal Antibody Production Techniques and Applications" Marcel Dekker, Inc., New York, (1987) pp. 51-63).
The culture medium in which the hybridoma cells are cultured can then be assayed for the presence of monoclonal antibodies directed against an FcRH.
Preferably, the binding specificity of monoclonal antibodies produced by the hybridoma cells is determined by innnunoprecipitation or by an ira vitro binding assay, such as radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in the art, and are described further in Harlow and Lane "Antibodies, A Laboratory Manual" Cold Spring Harbor Publications, New York, (1988).
After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution or FACS sorting procedures and grown by standard methods.
Suitable culture media for this purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. Alternatively, the hybridoma cells may be grown i~
vivo as ascites in a mammal.
The monoclonal antibodies secreted by the subclones may be isolated or purified from the culture medium or ascites fluid by conventional immunoglobulin purification procedures such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography.
The monoclonal antibodies may also be made by recombinant DNA methods, such as those described in U.S. Pat. No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can be readily isolated and sequenced using conventional procedures (e.g., by using oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred source of such DNA. Once isolated, the DNA may be placed into expression vectors, which are then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, plasmacytoma cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the recombinant host cells.
The DNA also may be modified, for example, by substituting the coding sequence for human heavy and light chain constant domains in place of the homologous murine sequences (U.S. Pat. No. 4,816,567) or by covalently joining to the immunoglobulin coding sequence all or part of the coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin polypeptide can be substituted for the constant domains of an antibody of the invention, or can be substituted for the variable domains of one antigen-combining site of an antibody of the invention to create a chimeric bivalent antibody comprising one antigen-combining site having specificity 5 for FcRH and another antigen-combining site having specificity for a different antigen.
Ira vitro methods are also suitable for preparing monovalent antibodies.
Digestion of antibodies to produce fragments thereof, particularly, Fab fragments, can be accomplished using routine techniques known in the art. For instance, digestion can be performed using papain. Examples of papain digestion are described in WO
10 94/29348 published Dec. 22, 1994, U.S. Pat. No. 4,342,566, and Harlow and Lane, Antibodies, A Laboratory Manual, Cold Spring Harbor Publications, New York, (1988). Papain digestion of antibodies typically produces two identical antigen binding fragments, called Fab fragments, each with a single antigen binding site, and a residual Fc fragment. Pepsin treatment yields a fragment, called the F(ab')2 fragment, that has 15 two antigen combining sites and is still capable of cross-linking antigen.
The Fab fragments produced in the antibody digestion also contain the constant domains of the light chain and the first constant domain of the heavy chain.
Fab' fragments differ from Fab fragments by the addition of a few residues at the carboxy terminus of the heavy chain domain including one or more cysteines from the antibody 20 hinge region. The F(ab')2 fragment is a bivalent fragment comprising two Fab' fragments linked by a disulfide bridge at the hinge region. Fab'-SH is the designation herein for Fab' in which the cysteine residues) of the constant domains bear a free thiol group. Antibody fragments originally were produced as pairs of Fab' fragments which have hinge cysteines between them. Other chemical couplings of antibody fragments 25 are also known.
An isolated immunogenically specific epitope or fragment of the antibody is also provided. A specific immunogenic epitope of the antibody can be isolated from the whole antibody by chemical or mechanical disruption of the molecule. The purified fragments thus obtained can be tested to determine their immunogenicity and specificity 30 by the methods taught herein. Irnmunoreactive epitopes of the antibody can also be synthesized directly. An immunoreactive fragment is defined as an amino acid sequence of at least about two to five consecutive amino acids derived from the antibody amino acid sequence.
One method of producing proteins comprising the antibodies of the present invention is to link two or more peptides or polypeptides together by protein chemistry techniques. For example, peptides or polypeptides can be chemically synthesized using currently available laboratory equipment using either Fmoc (9-fluorenylmethyl-oxycarbonyl) or Boc (tef°t -butyloxycarbonoyl) chemistry. (Applied Biosysterns, Inc., Foster City, CA). One skilled in the art can readily appreciate that a peptide or polypeptide corresponding to the antibody of the present invention, for example, can be synthesized by standard chemical reactions. For example, a peptide or polypeptide can be synthesized and not cleaved from its synthesis resin whereas the other fragment of an antibody can be synthesized and subsequently cleaved from the resin, thereby exposing a ternzinal group that is functionally blocked on the other fragment. By peptide condensation reactions, these two fragments can be covalently joined via a peptide bond at their carboxyl and amino termini, respectively, to form an antibody, or fragment thereof. (Grant GA (1992) Synthetic Peptides: A User Guide. W.H. Freeman and Co., N.Y. (1992); Bodansky M and Trost B., Ed. (1993) Principles of Peptide Synthesis.
Springer-Verlag Inc., NY). Alternatively, the peptide or polypeptide can by independently synthesized iri vivo as described above. Once isolated, these independent peptides or polypeptides may be linked to form an antibody or fragment thereof via similar peptide condensation reactions.
For example, enzymatic ligation of cloned or synthetic peptide segments can allow relatively short peptide fragments to be joined to produce larger peptide fragments, polypeptides or whole protein domains (Abrahmsen L et al., Biochemistry, 30:4151 (1991)). Alternatively, native chemical ligation of synthetic peptides can be utilized to synthetically construct large peptides or polypeptides from shorter peptide fragments. This method consists of a two step chemical reaction (Dawson et al.
Synthesis of Proteins by Native Chemical Ligation. Science, 266:776-779 (1994)). The first step is the chemoselective reaction of an unprotected synthetic peptide-oc-thioester with another unprotected peptide segment containing an amino-terminal Cys residue to give a thioester-linked intermediate as the initial covalent product. Without a change in the reaction conditions, this intermediate undergoes spontaneous, rapid intramolecular reaction to form a native peptide bond at the ligation site. Application of this native chemical ligation method to the total synthesis of a protein molecule is illustrated by the preparation ofhuman interleukin 8 (IL-8) (Baggiolini M et al. (1992) FEBS
Lett.
307:97-101; Clark-Lewis I et al., J.Biol.Chem., 269:16075 (1994); Clark-Lewis I et al., Biochemistry, 30:3128 (1991); Rajarathnam K et al., Biochemistry 33:6623-30 (1994)).
Alternatively, unprotected peptide segments can be chemically linked where the bond formed between the peptide segments as a result of the chemical ligation is an unnatural (non-peptide) bond (Schnolzer, M et al. Science, 256:221 (1992)).
This technique has been used to synthesize analogs of protein domains as well as large amounts of relatively pure proteins with full biological activity (deLisle Milton RC et al., Techniques in Protein Chemistry IV. Academic Press, New York, pp. 257-267 (1992)).
The invention also provides fragments of antibodies that have bioactivity. The polypeptide fragments of the present invention can be recombinant proteins obtained by cloning nucleic acids encoding the polypeptide in an expression system capable of producing the polypeptide fragments thereof, such as an adenovirus or baculovirus expression system. For example, one can determine the active domain of an antibody from a specific hybridoma that can cause a biological effect associated with the interaction of the antibody with FcRH. For example, amino acids found to not contribute to either the activity or the binding specificity or affinity of the antibody can be deleted without a loss in the respective activity.
For example, amino or carboxy-terminal amino acids can be sequentially removed from either the native or the modified non-immunoglobulin molecule or the immunoglobulin molecule and the respective activity assayed in one of many available assays. In another example, a fragment of an antibody can comprise a modified antibody wherein at least one amino acid has been substituted for the naturally occurring amino acid at a specific position, and a portion of either amino terminal or carboxy terminal amino acids, or even an internal region of the antibody, has been replaced with a polypeptide fragment or other moiety, such as biotin, which can facilitate in the purification of the modified antibody. For example, a modified antibody can be fused to a maltose binding protein, through either peptide chemistry of cloning the respective nucleic acids encoding the two polypeptide fragments into an expression vector such that the expression of the coding region results in a hybrid polypeptide. The hybrid polypeptide can be affinity purified by passing it over an amylose affinity column, and the modified antibody receptor can then be separated from the maltose binding region by cleaving the hybrid polypeptide with the specific protease factor Xa. (See, for example, New England Biolabs Product Catalog, 1996, pg.
164.).
Similar purification procedures are available for isolating hybrid proteins from eukaryotic cells as well.
The fragments, whether attached to other sequences, can also include insertions, deletions, substitutions, or other selected modifications of particular regions or specific amino acids residues, provided the activity of the fragment is not significantly altered or impaired compared to the nonmodified antibody or antibody fragment. These modifications can provide for some additional property, such as to remove or add amino acids capable of disulfide bonding, to increase its bio-longevity, to alter its secretory characteristics, etc. In any case, the fragment must possess a bioactive property, such as binding activity, regulation of binding at the binding domain, etc.
Functional or active regions of the antibody may be identified by mutagenesis of a specific region of the protein, followed by expression and testing of the expressed polypeptide. Such methods are readily apparent to a skilled practitioner in the art and can include site-specific mutagenesis of the nucleic acid encoding the antigen. (Zoller MJ et al. Nucl. Acids Res. 10:6487-500 (1982).
As used herein, the phrase "specific binding" or "selective binding" refers to a binding reaction which is determinative of the presence of the FcRH in a heterogeneous population of proteins and other biologics. Thus, under designated conditions, the antibodies or fragments thereof of the present invention bind to a particular FcRH (e.g., human FcRH 1 or any variant thereof), fragment, or variant thereof and do not bind in a significant amount to other proteins (e.g., human FcRH 2, 3, 4, 5, or 6), present in the subject. 'The absence of binding in the present invention is concisderd to be binding that is less than 1.5 times background (i.e., the level of non-specific binding or slightly above non-specific binding levels), Selective binding to an antibody under such conditions may require an antibody that is selected for its specificity for a particular protein, variant, or fragment. In one embodiment the purified antibody selectively binds to the FcRH comprising a cytoplasmic region with more than 107 or less than 104 amino acids, a transmembrane region, and an extracellular region. More specifically, the antibody in alternative embodiments selectively binds FcRHl but not FcRH2-6; selectively binds FcRH2 but not 1 or 3-6; selectively binds FcRH3 but not FcRHl-2 or 4-6; selectively binds FcRH6 but not 1-5. Thus, as one embodiment, the antibody selectively binds a polypeptide comprising the amino acid sequence of SEQ ID NO:1, 21, or 2, or a subset thereof, but not to polypeptides comprising the amino acid of SEQ ID N0:3, 22, 4, 5, 23, 24, 6, 25, 26, 27, 28, or a subset thereof. In another embodiment the purified antibody binds to the FcRH comprising the amino acid sequence of SEQ ID N0:3, 22, or 4, but not to the FcRH comprising the amino acid of SEQ ID NO:1, 21, 2, 5, 23, 24, 6, 25, 26, 27, or 28.
In yet another embodiment, the purified antibody that binds to the FcRH
comprising the amino acid sequence of SEQ ID NO:S, 23, 24, or 6, but not to the FcRH
comprising the amino acid of SEQ m NO:1, 21, 2, 3, 22, 4, 26, 27, 28. Similarly, the antibodies of the present invention may bind only moFcRHl, but not moFcRH 2 or moFcRH3; may bind only FcRH2 and not FcRHl or FcRH3, and may bind only FcRH3 and not FcRHl or FcRH2.
In certain embodiments, the antibody binds the extracellular region of one or more FcRHs and in other embodiments the antibody binds the cytoplasmic region of one or more FcRHs. In other embodiments the antibody may selectively bind one isoform of a FcRH. For example, the antibody may bind a polypeptide having the amino acid sequence of SEQ 117 N0:23 but not the SEQ ID N0:24 or vice versa.
Furthermore, the antibody can bind to moFcRHl having the amino acid sequence of SEQ LD N0:70, but not to a moFcRHl having amino acid sequence of SEQ ID N0:68.
The antibody may selectively bind a moFcRH2 with a transmembrane region (e.g., having amino acid sequence of SEQ ID N0:73), but not bind to a moFcRH2 lacking a transmembrane region (e.g., having the amino acid sequence of 77). Optionally the antibody of the invention can selectively bind moFcRH but not human, or vice versa.
A variety of immunoassay formats may be used to select antibodies that selectively bind with a particular protein, variant, or fragment. For example, solid-phase ELISA immunoassays are routinely used to select antibodies selectively immunoreactive with a protein, variant, or fragment thereof. See Harlow and Lane.
Antibodies, A Laboratory Manual. Cold Spring Harbor Publications, New York, (1988), for a description of immunoassay formats and conditions that could be used to determine selective binding. The binding affinity of a monoclonal antibody can, for example, be determined by the Scatchard analysis of Munson et al., Anal.
Biochem., 107:220 (1980).
The invention also provides an antibody reagent kit comprising the antibody or fragment thereof of the invention and reagents for detecting binding of the antibody or fragment thereof to a ligand. The kit can further comprise containers containing the antibody or fragment thereof of the invention and containers containing the reagents.
Preferably the ligand is a FcRH, variant, or fragment thereof. Particularly, the kit can detect the presence of one or more FcRHs specifically reactive with the antibody or an 5 immunoreactive fragment thereof. The kit can include an antibody bound to a substrate, a secondary antibody reactive with the antigen and a reagent for detecting a reaction of the secondary antibody with the antigen. Such a kit can be an ELISA kit and can comprise the substrate, primary and secondary antibodies when appropriate, and any other necessary reagents such as detectable moieties, enzyme substrates and color 10 reagents as described above. The diagnostic kit can, alternatively, be an immunoblot kit generally comprising the components and reagents described herein.
Alternatively, the kit could be a radioimmunoassay kit, a Western blot assay kit, an immunohistological assay kit, an immunocytochemical assay kit, a dot blot assay kit, a fluorescence polarization assay kit, a scintillation proximity assay kit, a homogeneous 15 time resolved fluorescence assay kit, or a BIAcore analysis kit.
As used throughout, methods of detecting an FcRH or antigen/antibody complexes, including complexes comprising an FcRH and optionally the antibody of the present invention, can comprise an ELISA (competition or sandwich), a radioimmunoassay, a Western blot assay, an immunohistological assay, an 20 immunocytochemical assay, a dot blot assay, a fluorescence polarization assay (Jolley (1981); Jiskoot et al (1991); Seethala et al. (1998); Bicamumpaka et al.
(1998)), a scintillation proximity assay (Amersham Life Science (1995) Proximity News.
Issue 17;
Amersham Life Science (1995) Proximity News. Issue 18; Park et al. (1999)), a homogeneous time-resolved fluorescence assay (Park et al. (1999); Stenroos et al.
25 (1988); Morrison, 1988)), or a BIAcore analysis Fagerstam et al. (1992) Chromatography 597:397-410. Preferably, the antigei~/antibody complex is detestably tagged either directly or indirectly. Any desired tag can be utilized, such as a fluorescent tag, a radiolabel, a magnetic tag, or an enzymatic reaction product.
Optionally, the antibody or fragment is a humanized antibody or a fully human 30 antibody. For example, the antibodies can also be generated in other species and "humanized" for administration to humans. Alternatively, fully human antibodies can also be made by immunizing a mice or other species capable of making a fully human antibody (e.g., mice genetically modified to produce human antibodies), screening clones that bind FcRH. See, e.g., Lonberg and Huszar (1995) Human antibodies from transgenic mice, Int. Rev. Imrnunol. 13:65-93, which is incorporated herein by reference in its entirety for methods of producing fully human antibodies. As used herein, the term "humanized" and "fully human" in relation to antibodies, relate to any antibody which is expected to elicit a therapeutically tolerable weak immunogenic response in a human subject.
Humanized forms of non-human (e.g., murine) antibodies are chimeric immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab' , F(ab')Z, or other antigen-binding subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin. Humanized antibodies include human immunoglobulins (recipient antibody) in which residues from a complementary determining region (CDR) of the recipient are replaced by residues from a CDR
of a non-human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. In some instances, Fv framework residues of the human immunoglobulin are replaced by corresponding non-human residues.
Humanized antibodies may also comprise residues that are found neither in the recipient antibody nor in the imported CDR or framework sequences. In general, the humanized antibody will comprise substantially all or at least one, and typically two, variable domains, in which all or substantially all of the CDR regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR
regions are those of a human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin (Jones et al., Nature, 321:522-(1986); Riechmann et al., Nature, 332:323-327 (1988); and Presta, Curr. Op.
Struct.
Biol., 2:593-596 (1992)).
Methods for humanizing non-human antibodies are well known in the art.
Generally, a humanized antibody has one or more amino acid residues introduced into it from a source that is non-human. These non-human amino acid residues are often referred to as "import" residues, which are typically taken from an "import"
variable domain. Humanization can be essentially performed following the method of Winter and co-workers (Jones et al., Nature, 321:522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al., Science, 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the corresponding sequences of a human antibody.
Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Pat.
No.
4,816,567), wherein substantially less than an intact human variable domain has been substituted by the corresponding sequence from a non-human species. In practice, humanized antibodies are typically human antibodies in which some CDR residues and possibly some FR residues are substituted by residues from analogous sites in rodent antibodies.
The choice of human variable domains, both light and heavy, to be used in making the humanized antibodies is very important in order to reduce antigenicity.
According to the "best-fit" method, the sequence of the variable domain of a rodent antibody is screened against the entire library of known human variable domain sequences. The human sequence which is closest to that of the rodent is then accepted as the human framework (FR) for the humanized antibody (Sims et al., J.
Immunol., 151:2296 (1993) and Chothia et al., J. Mol. Biol., 196:901 (1987)). Another method uses a particular framework derived from the consensus sequence of all human antibodies of a particular subgroup of light or heavy chains. The same framework may be used for several different humanized antibodies (Carte~° et al., Proc. Natl. Acad. Sci.
USA, 89:4285 (1992); Presta et al., J. Immunol., 151:2623 (1993)).
It is further important that antibodies be humanized with retention of high affinity for the antigen and other favorable biological properties. To achieve this goal, according to a preferred method, humanized antibodies are prepared by a process of analysis of the parental sequences and various conceptual humanized products using three dimensional models of the parental and humanized sequences. Three dimensional immunoglobulin models are commonly available and are familiar to those skilled in the art. Computer programs are available which illustrate and display probable three-dimensional conformational structures of selected candidate immunoglobulin sequences. Inspection of these displays permits analysis of the likely role of the residues in the functioning of the candidate immunoglobulin sequence, i.e., the analysis of residues that influence the ability of the candidate immunoglobulin to bind its antigen. In this way, FR residues can be selected and combined from the consensus and import sequence so that the desired antibody characteristic, such as increased affinity for the target antigen(s), is achieved. In general, the CDR residues are directly and most substantially involved in influencing antigen binding (see, WO 94/04679 published 3 Mar. 1994).
Transgenic animals (e.g., mice) that are capable, upon immunization, of producing a full repertoire of human antibodies in the absence of endogenous immunoglobulin production can be employed. For example, it has been described that the homozygous deletion of the antibody heavy chain joining region (J(I~) gene in chimeric and germ-line mutant mice results in complete inhibition of endogenous antibody production. Transfer of the human germ-line immunoglobulin gene array in such germ-line mutant mice will result in the production of human antibodies upon antigen challenge (see, e.g., Jakobovits et al., Proc. Natl. Acad. Sci. USA, 90:2551-255 (1993); Jakobovits et al., Nature, 362:255-258 (1993); Bruggemann et al., Year in Immuno., 7:33 (1993)). Human antibodies can also be produced in phage display libraries (Hoogenboom et al., J. Mol. Biol., 227:381 (1991); Marks et al., J.
Mol. Biol., 222:581 (1991)). The techniques of Cote et al. and Boerner et al. are also available for the preparation of human monoclonal antibodies (Cole et al., Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985); Boerner et al., J. Immunol., 147(1):86-95 (1991)).
In one embodiment, the antibody or fragment thereof is a single chain antibody.
In another embodiment, the antibody or fragment is labeled. Optionally the antibody or fragment is conjugated or fused with a toxin or fragment thereof. Examples of the toxin or toxin moiety include diphtheria, ricin, and modifications thereof.
Dia~,nosis and Treatment The invention provides uses of the reagents described herein in if2 vitro and i~a vivo methods of diagnosing and treating a malignancy of hematopoietic cell lineage or an autoimmune disease in a subject. The reagents of the present invention are also useful in screening for disease manifestations. Such screening may be useful even before the onset of other clinical symptoms and could be used to screening subjects at risk for disease, so that prophylactic treatment can be started before the manifestation of other signs or symptoms.
By "malignancy" is meant a tumor or neoplasm whose cells possess one or more nuclear or cytoplasmic abnormalities, including, for example, high nuclear to cytoplasmic ratio, prominent nucleolar/nucleoli variations, variations in nuclear size, abnormal mitotic figures, or multinucleation. "Malignancies of hematopoietic cell lineage" include, but are not limited to, myelomas, leukemias, lymphomas (Hodgkin's and non-Hodgkin's forms), T-cell malignancies, B-cell malignancies, and lymphosarcomas or other malignancies described in the REAL classification system or the World Health Organization Classification of Hematologic Malignancies. It should be noted that the absence or presence of specific FcRHs can be diagnostic for a particular malignancy of hematopoietic cell linage or can be diagnostic for a particular form of a malignancy (e.g., a specific form of leukemia).
By "inflammatory and autoimmune diseases" illustratively including systemic lupus erythematosus, Hashimoto's disease, rheumatoid arthritis, graft-versus-host disease, Sjogren's syndrome, pernicious anemia, Addison disease, scleroderma, Goodpasture's syndrome, Crohn's disease, autoimmune hemolytic anemia, sterility, myasthenia gravis, multiple sclerosis, Basedow's disease, thrombopenia purpura, insulin-dependent diabetes mellitus, allergy; asthma, atopic disease;
arteriosclerosis;
myocarditis; cardiomyopathy; glomerular nephritis; hypoplastic anemia;
rejection after organ transplantation and numerous malignancies of lung, prostate, liver, ovary, colon, cervix, lymphatic'and breast tissues.
Specifically, the diagnostic methods comprise the steps of contacting a biological sample of the subject with an antibody or nucleic acid of the invention under conditions that allow the antibody to bind to cells of hematopoietic cell lineage or allow the nucleic acid to hybridize, preferably under stringent conditions, with nucleic acids of the biological sample; and detecting the amount or pattern of binding.
Changes in the amount or pattern of binding as compared to binding in a control sample indicate a malignancy or an inflammatory or autoimmune disease.
In various embodiments, the antibody used in the diagnostic method can selectively bind with an FcRH having the amino acid sequence of SEQ ID NO:1, 21, 2, 3, 22, 4, 5, 24, or 6.
The detecting step of the diagnostic method can be selected from methods routine in the art. For example, the detection step can be performed in vivo using a noninvasive medical technique such as radiography, fluoroscopy, sonography, imaging techniques such as magnetic resonance imaging, and the like. In vitro detection methods can be used to detect bound antibody or fragment thereof in an ELISA, RIA, immunohistochemically, FACS, IHC, FISH, or similar assays.
As used throughout, "biological sample" refers to a sample from any organism.
The sample can be, but is not limited to, peripheral blood, plasma, urine, saliva, gastric secretion, feces, bone marrow specimens, primary tumors, embedded tissue sections, frozen tissue sections, cell preparations, cytological preparations, exfoliate samples (e.g., sputum), fine needle aspirations, amnion cells, fresh tissue, dry tissue, and cultured cells or tissue. It is further contemplated that the biological sample of this invention can also be whole cells or cell organelles (e.g., nuclei). The sample can be unfixed or fixed according to standard protocols widely available in the art and can also be embedded in a suitable medium for preparation of the sample. For example, the sample can be embedded in paraffin or other suitable medium (e.g., epoxy or acrylamide) to facilitate preparation of the biological specimen for the detection 10 methods of this invention.
The invention also provides a method of treating a malignancy of hematopoietic cell lineage or an inflammatory or autoimrnune disease in a subject, comprising contacting the subject's malignant cells or inflammatory cells with a therapeutically effective amount of a reagent (e.g., an antibody or nucleic acid) or a therapeutic 15 composition of a reagent of the invention. The contacting step can occur by administration of the reagent or composition using any number of means available in the art. Typically, the reagent or composition is administered to the subject transdermally (e.g., by a transdermal patch or a topically applied cream, ointment, or the like), orally, subcutaneously, intrapulmonaryily, transmucosally, intraperitoneally, 20 intrauterinely, sublingually, intrathecally, intramuscularly, intraarticularly, etc. using conventional methods. In addition, the reagent or composition can be administered via injectable depot routes such as by using 1-, 3-, or 6-month depot injectable or biodegrable materials and methods.
Regardless of the route of administration, the amount of the reagent 25 administered or the schedule for administration will vary among individuals based on age, size, weight, condition to be treated, mode of administration, and the severity of the condition. One skilled in the art will realize that dosages are best optimized by the practicing physician and methods for determining dosage are described, for example in Remington's Pharmaceutical Science, latest edition. Guidance in selecting appropriate 30 doses for antibodies is found in the literature on therapeutic uses of antibodies, e.g., Handbook of Monoclonal Antibodies, Ferrone et al., eds., Noges Publications, Park Ridge, N.J., (1985) ch. 22 and pp. 303-357; Smith et al., Antibodies in Human Diagnosis and Therapy, Haber et al., eds., Raven Press, New York (1977) pp. 365-389. A
typical dose of the antibody used alone might range from about 1 ~g/kg to up to 100 mg/kg of body weight or more per day, and preferably 1 ~g/kg to up to 1 mg/kg, depending on the factors mentioned above. An intravenous injection of the antibody or fragment thereof, for example, could be long-lg of antibody or fragment thereof, and preferably long-lmg depending on the factors mentioned above. For local injection, a typical quantity of antibody ranges from lpg tolmg . Preferably, the local injection would be at an antibody concentration of 1-100 ~g/ml, and preferably 1-20 ~g/ml.
The nucleic acids of the invention can delivered to cells in a variety of ways.
For example, if the nucleic acid of this invention is delivered to the cells of a subject in an adenovirus vector, the dosage for administration of adenovirus to humans can range from about 10~ to 109 plaque forming units (pfu) per injection, but can be as high as 1012 pfu per injection. Ideally, a subject will receive a single injection. If additional injections are necessary, they can be repeated at six month intervals for an indefinite period and/or until the efficacy of the treatment has been established. As set forth herein, the efficacy of treatment can be determined by evaluating the clinical parameters.
The exact amount of the nucleic acid or vector required will vary as described above. Thus, it is not possible to specify an exact amount for every nucleic acid or vector. An appropriate amount can be determined by one of ordinary skill in the art using only routine experimentation given the teachings herein.
The invention further provides a therapeutic composition of the reagent of the invention. Such a composition typically contains from about 0.1 to 90% by weight (such as 1 to 20% or 1 to 10%) of a therapeutic agent of the invention in a pharmaceutically acceptable carrier. Solid formulations of the compositions for oral administration may contain suitable earners or excipients, such as corn starch, gelatin, lactose, acacia, sucrose, microcrystalline cellulose, kaolin, mannitol, dicalcium phosphate, calcium carbonate, sodium chloride, or alginic acid. Disintegrators that can be used include, without limitation, microcrystalline cellulose, corn starch, sodium starch, glycolate, and alginic acid. Tablet binders that may be used include acacia, methylcellulose, sodium carboxymethylcellulose, polyvinylpyrrolindone (PovidoneTM), hydroxypropyl methylcellulose, sucrose, starch, and ethylcellulose. Lubricants that may be used include magnesium stearates, stearic acid, silicone fluid, talc, waxes, oils, and colloidal silica.
Liquid formulations for oral administration prepared in water or other aqueous vehicles may contain various suspending agents such as methylcellulose, alginates, tragacanth, pectin, kelgin, carrageenan, acacia, polyvinylpyrrolidone, and polyvinyl alcohol. The liquid formulations may also include solutions, emulsions, syrups and elixirs containing, together with the active cornpound(s), wetting agents, sweeteners, and coloring and flavoring agents. Various liquid and powder formulations can be prepared by conventional methods for inhalation into the lungs of the mammal to be treated.
Injectable formulations of the compositions may contain various carriers such as vegetable oils, dimethylacetamide, dimethylformamide, ethyl lactate, ethyl carbonate, isopropyl myristate, ethanol, polyols (glycerol, propylene glycol, liquid polyethylene glycol, and the like). For intravenous injections, water soluble version of the compounds may be administered by the drip method, whereby a pharmaceutical formulation containing the antifungal agent and a physiologically acceptable excipient is infused. Physiologically acceptable excipients may include, for example, 5%
dextrose, 0.9% saline, Ringer's solution or other suitable excipients.
Intramuscular preparations, e.g., a sterile formulation of a suitable soluble salt form of the compounds, can be dissolved and administered in a pharmaceutical excipient such as water-for-injection, 0.9% saline, or 5% glucose solution. A suitable insoluble form of the compound may be prepared and administered as a suspension in an aqueous base or a pharmaceutically acceptable oil base, such as an ester of a long chain fatty acid (e.g., ethy; oleate).
A topical semi-solid ointment formulation typically contains a concentration of the active ingredient from about 1 to 20%, e.g., 5 to 10%, in a Garner such as a pharmaceutical cream base. Various formulations for topical use include drops, tinctures, lotions, creams, solutions, and ointments containing the active ingredient and various supports and vehicles. The optimal percentage of the therapeutic agent in each pharmaceutical formulation varies according to the formulation itself and the therapeutic effect desired in the specific pathologies and correlated therapeutic regimens.
The effectiveness of the method of treatment can be assessed by monitoring the patient for known signs or symptoms of the conditions being treated. For example, in the treatment of a malignancy of hematopoietic cell lineage, the reduction or stabilization of the number of abnormally proliferative cells would indicate successful treatment. In the treatment of arthritis, for example, a reduction in the amount of joint inflammation would indicate successful treatment. Thus, by"therapeutically effective"
is meant an amount that provides the desired treatment effect.
The invention further provides a method of modulating a humoral immune response in a subject, comprising administering to the subject an isolated FcRH, an antibody, or nucleic acid of the invention. By "modulation" is meant either up-regulating or down-regulating. Thus, in the case of an allergic response, one skilled in the art would choose to down-regulate the humoral immune response. In the case of exposure of a subject to an infectious agent (e.g., a viral or bacterial agent), one skilled in the art would choose to upregulate the humoral antibody response.
Experimental The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how the compounds, compositions, articles, devices and/or methods claimed herein are made and evaluated, and are intended to be purely exemplary of the invention and are not intended to limit the scope of what the inventors regard as their invention. Efforts have been made to ensure accuracy with respect to numbers (e.g., amounts, temperature, etc.), but some errors and deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, temperature is in °C or is at ambient temperature, and pressure is at or near atmospheric.
Identification of FcRHl FcRH2, and FcRH3 In order to isolation of FcRH cDNA Clones, rapid amplification of cDNA ends (R.ACE)-PCR was performed by using a Marathon-Ready human lymph node cDNA
library (CLONTECH). Gene-specific primers were as follows: FcRH3, forward 5'-TGAGTCTCAGGGTCACAGTTCCG-3' (SEQ H~ N0:41) and reverse 5'-GCTCTTGAACTTGGATATTTAGGGGT-3' (SEQ ID NO:42); FcRH2, forward 5'-CCAGTGTATGTCAATGTGGGCTCTG-3' (SEQ ID N0:43) and reverse 5'-CGTTGAAAGAGCTCTTGGACTTTTATC-3' (SEQ ID N0:44); and FcRHl, forward 5'-GCCTCAAAAGAAAAATAGGAAGACGTT-3' (SEQ H~ NO:45) and reverse 5'-AAGCTCACATCAGCGACAGGGAC-3' (SEQ ID N0:46). RACE products were subjected to a second round of nested PCR and visualized by agarose gel electrophoresis and ethidium bromide staining.
Primers used in end-to-end amplification to generate full-length cDNAs were as follows: FcRH3, forward 5'-TCTTGGAGATAAGTCGGGCTTT-3' (SEQ ID N0:47) and reverse 5'-ATCCTGCAGCCCAGCCTCGTAGGAG-3' (SEQ ID N0:48); FcRH2, forward 5'-GGTCCTCATGCTGCTGTGGTCATT-3' (SEQ ID N0:49)and reverse 5'-GCTGTTGATCTTCGCTTCTGATTC-3' (SEQ ID NO:50); and FcRHl, forward 5'-ATGCTGCCGAGGCTGTTGCTGTTG3' (SEQ ID NO:51) and reverse 5'-CATAGCATCTTCATAGTCCACATC-3' (SEQ ID N0:52). Each amplification reaction underwent initial denaturation of 94°C for 30 s followed by 30 cycles of denaturation at 94°C for 5 s and annealing at 68°C for 4 min, and final extension at 72°C for 6 min.
PCR products were ligated into the pCR2.1 TOPO T/A vector (Invitrogen).
Inserts were DNA-sequenced on both strands by the dideoxy chain termination method using Thermo Sequenase (Amersham Pharmacia) and an automated sequencer (Li-Cor, Lincoln, NE). Nucleotide and amino acid sequence alignment was analyzed with a DNASTAR (Madison, WI) software package, and homology searches were performed by using BLAST (Altschul, S. F. et al. (1990) J. Mol. Biol. 215, 403-410).
RNA blot analysis was subsequently performed. Northern blots (CLONTECH) were hybridized with 32P-dCTP-labeled probes: a 528-by EcoRI fragment corresponding to the 5' untranslated (UT)-EC1 regions of the FcRH3 cDNA, a 200-by PCR product corresponding to a portion of the 3' UT region of the FcRH2 cDNA, and a 257-by PCR product corresponding to a portion of the 3' UT region of the FcRHl cDNA. Membranes were hybridized for 1 h at 65°C, washed, and exposed to x-ray film (Kubagawa, H. et al. (1997) Proc. Natl. Acad. Sci. USA 94, 5261-5266).
Reverse transcription (RT)-PCR was performed. Human tonsillar cells, obtained with Institutional Review Board approval, were separated into CD19+ and CD19 subpopulations by magnetic cell sorting (Miltenyi Biotec, Auburn, CA). Viable CD19+
cells were stained with FITC-labeled anti-CD38 (Immunotech, Westbrook, ME) and phycoerythrin-labeled anti-IgD mAbs (Southern Biotechnology Associates) before sorting cells with a FACStarPlus instrument (Becton Dickinson) into Trizol reagent (Life Technologies, Grand Island, N~ for RNA isolation. Total cellular RNA was primed with random hexamers and oligo(dT) primers and reverse-transcribed with Superscript II (Life Technologies) into single-stranded cDNA. RT-PCR was performed by using RNA from tonsillar B cells and cell lines, with GIBCOIBRL Taq polymerase (Life Technologies). The following gene-specific primer pairs were used in the RT-5 PCR analysis of FcRHl-5 expresion in cell lines and tonsillar B cell subpopulations:
FcRHl forward, 5'-CTC AAC TTC ACA GTG CCT ACT GGG-3' (SEQ ff~ N0:53) and reverse, 5'-TCC TGC AGA GTC ACT AAC CTT GAG-3' (SEQ ID N0:54);
FcRH2 forward, 5'-CCA GTG TAT GTC AAT GTG GGC TCT G (SEQ ID NO:55) and reverse, 5'-CAT TCT TCC CTC AAA TCT TTA CAC-3' (SEQ ID N0:56); FcRH3 10 forward, 5'-CAG CAC GTG GAT TCG AGT CAC-3' (SEQ ID N0:57) and reverse, 5'-CAG ATC TGG GAA TAA ATC GGG TTG-3' (SEQ ID N0:58) FcRH4 forward, 5'-TCT TCA GAG ATG GCG AGG TCA-3' (SEQ ID N0:59) and reverse, 5'-TTT TGG
GGT GTA CAT CAA CAT ACA AG-3' (SEQ ID N0:60); and FcRH forward, 5'-TGT
TGC CCT GTT TCT TCC AAT ACA-3' (SEQ ID N0:61) and reverse, 5'-CAG AGT
15 TGG CCG ACC TAC GC-3' (SEQ ID N0:62). Each amplification reaction underwent initial denaturation at 94° for 5 min followed by 35 cycles of denaturation at 94° for 30 s, annealing at 60° for 30 s, extension at 72° for 1 min, and final extension at 72° for 7 min. Amplified products were visualized in 1% agarose gels containing ethidium bromide and documented with the Bio-Rad Fluor-S Imager.
20 The following human cell lines were used: REH and Nalm 16 pro-B cell lines (Korsmeyer, S. J. et al. (1983) J. Clin. Invest. 71, 301-313); 697, 207, and OBS pre-B
cell lines (Findley, H. W. et al. (1982) Blood 60, 1305-1309; Martin, D. et al. (1991) J.
Exp. Med. 173, 639-645); Ramos, Daudi, and Raji B cell lines (Pulvertaft, R.
J. V.
(1964) Lancet 1, 238-240; Klein, E. et al. (1968) Cancer Res. 28, 1300-1310;
Klein, G.
25 et al. (1975) Intervirology 5, 319-33431-33); THP-1 and U937 monocytoid cell lines, HL-60 promyelocytic and KG-1 myelocytic cell lines, Jurkat T cell line and the erythroid cell line (American Type Culture Collection).
A consensus sequence was generated that corresponds to the GenBank-derived amino terminal sequences of the second Ig-like domains of 30 FcR (FcyRI and Fc~yRII/III) and the third Ig-like domain of the polymeric Ig receptor:GEPIXLRCHSWK.DKXLXKVTYXQNGKAXI~F'FH (SEQ ID N0:63). A
search of the National Center for Biotechnology Information protein database with this sequence identified two overlapping human genomic bacterial artificial chromosome (BAC) clones, AL135929 and AL356276, which are located at 1q21.2-22. The second clone contained three putative Ig superfamily genes encoding complementary amino acid sequences that were designated FcRHl, FcRH2, and FcRH3. See Figure 1. The predicted amino acid sequences of these gene segments shared 23-57% identity with each other and 14-28% identity with human Fc~yRI (CD64). Further analysis of the FcRH locus led to the identification of two additional genes (FcRH4, and FcRHS) and one pseudogene (FcRH4yr), immediately centromeric of FcRHl-3, two of which have recently been described as IRTAl (FcRH4) and IRTA2 (FcRHS) (Hatzivassiliou, G.
et al. (2001) Immunity 14, 277-289).
To determine whether these genes are expressed by lymphocytes, the predicted amino acid sequences of their protein products were used to search the Lymphochip expressed sequence tag database with the TBLASTN algorithm (Alizadeh, A. A. et al.
(2000) Nature (London) 403, 503-511). Two expressed sequence tags (AA505046 and AA282433) were identified that share complete identity over 23 amino acids in their translated ORFs with the N terminus of FcRHI . Lymphochip microarray data analysis indicated that these expressed sequence tags are expressed at relatively high levels in peripheral lymphoid tissues, including the lymph nodes, tonsils, resting peripheral B
cells, and normal germinal center (GC) B cells. Among the different lymphoid malignancies, their expression proved to be highest in chronic lymphocytic leulcemias, follicular lymphomas, and some diffuse large cell lymphomas of B lineage.
FcRHl, FcRH2, and FcRH3 cDNAs were isolated by RACE-PCR from a human lymph node cDNA library in both 5' and 3' directions. Full-length cDNAs of the coding regions for FcRHl, FcRH2, and FcRH3 were obtained by end-to-end PCR
using unique primers generated from the cDNA sequences delineated for the 5' UT and 3'UT
regions. Southern blot analysis of human genomic DNA digested with Bar~zHI, EcoRI, or Hin.dIII using cDNA probes specific for the 3' UT regions of each cDNA revealed either one or two hybridizing fragments, suggesting that FcRHl, FcRH2, and FcRH3 are encoded by single genes. Analysis of full-length cDNA
sequences indicated that FcRHl, FcRH2, and FcRH3 have ORFs of 1,287 bp, 1,524 bp, and 2,202 bp, respectively, and encode type I transmembrane proteins of 429 aa, 508 aa, and 734 aa, respectively. Based on predicted consensus signal peptide cleavage sites (Von Heijne, G. (1986) Nucleic Acid Res. 14, 4683-4690; Nielsen, H. (1997) Protein Eng. 10, 1-6), the relative core peptide molecular masses were estimated as 45,158 for FcRHl, 53,407 for FcRH2, and 78,849 for FcRH3. These type I transmembrane proteins possess 3-6 extracellular C2 (Williams, A. F. & Barclay, A. N. (1988) Annu.
Rev. Immunol. 6, 381-405; Bork, P. et al. (1994) J. Mol. Biol. 242, 309-320;
Vaughn, D. E. & Bjorkman, P. J. (1996) Neuron 16, 261-273) type Ig-like domains with 3-potential N-linked glycosylation sites, uncharged transmembrane segments, and relatively long cytoplasmic tails containing consensus motifs for ITIMs and/or ITAMs.
See Fig. 2A.
Multiple alignment analysis of the translated cDNAs, using FcRH3 as the index sequence of comparison, indicates that FcRHl, FcRH2, and FcRH3 have highly conserved hydrophobic signal peptides and corresponding Ig-like extracellular domains (Fig. 2B). Their hydrophobic transmembrane (uncharged with the exception of FcRHl which includes an acidic domain) domains (Sonnhammer, E. L. L. et al. (1998) in A
Hidden Markov Model for Predicting Transmembrane Helices in Protein Sequences, eds. Glasgow, J., Littlejohn, T., Major, F., Lathrop, R., Sankoff, D. &
Sensen, C. (Am.
Assoc. for Artificial Intelligence, Menlo Park, CA), pp. 175-182) are also well conserved, but their cytoplasmic domains are not. FcRHl has a long cytoplasmic tail containing three potential ITAMs, the first and third of which fit the consensus sequence (E/D)-X-X-Y-X-X-(L/I)-X~_8-Y-X-X-(L/I) (SEQ ID N0:64, with six amino acid between the consensus sequences; SEQ ID N0:65, with seven amino acid residues between the consensus sequences; and SEQ ID NO:66, with eight amino acid residues between the consensus sequences), whereas, the second has only one tyrosine residue.
The shorter cytoplasmic domain of FcRH2 contains one potential ITAM and two ITIM
consensus sequences (I/V/L/S)-X-Y-X-X-(L/V) (SEQ ID N0:67) separated by 22 amino acids. FcRH3 has the longest cytoplasmic tail. It contains one potential ITAM, one ITIM, and another potential ITAM that also has a single tyrosine residue.
An RNA blot analysis with gene-specific probes was performed on 16 human tissues, including six primary or secondary lymphoid tissues. RNA blots were analyzed with discriminating a32P-dCTP-labeled probes generated from the respective FcRH
cDNAs. The following probes were used: (Top) a PCR-generated, 257-by probe specific to the 3' UT region of FcRHl; (Middle) a PCR-generated, 290-by probe corresponding to the 3' UT region of FcRH2; and (Bottom) a 528-by EcoRI-digested fragment of the 5' end of the FcRH3 cDNA corresponding to its 5' UT region, S
l, S2, and EC 1 domains. The relative mRNA abundance was indicated by (3-actin probe.
All three FcRH gene probes hybridized with transcripts in the secondary lymphoid organs, spleen and lymph node. An FcRHl-specific probe hybridized with spleen and lymph node transcripts of about 3.5 kb and about 6.0 kb. Additional hybridization bands of about 0.7 kb and about 1.5 kb were observed for heart, skeletal muscle, kidney, liver, and, in less abundance, placental tissue. Larger transcripts also were seen in skeletal muscle (about 6.0 kb) and in kidney and placenta (about 4.4 kb). An FcRH2-specific probe hybridized to about 3.0-kb, about 4.4-kb, and about 5.5-kb transcripts most abundantly in spleen and lymph node. A transcript of approximately 2.4-kb was notable in the kidney. An FcRH3 probe hybridized with about 3.5-kb, about 5.5-kb, and about 7.0-kb transcripts chiefly in spleen and lymph node. These also were seen, albeit in lesser abundance, in peripheral blood lymphocytes, thymus, and bone marrow samples.
Additionally, a unique transcript of about 1.35 kb was evident in skeletal muscle. These results indicated expression of FcRHl, FcRH2, and FcRH3 in peripheral lymphoid organs, whereas tissue specific differences in alternative splicing or polyadenylation were suggested by the differential expression of transcripts with variable size in nonlymphoid tissues. RTPCR analysis to date of non-lymphoid tissue skeletal muscle, however, does not reveal transcripts despite the Northern analysis results.
When FcRH expression was examined by RT-PCR analysis of cell lines representing different hematopoietic lineages, FcRHI, FcRH2, and FcRH3 expression was found in every mature B cell line tested (Table 2). FcRH2 and FcRH3 expression was limited to the mature B cell lines and not seen in the other types of cells examined.
In contrast, FcRHl expression was seen in pro-B, T, and myeloid cell lines, although not in an erythroid cell line.
Table 2. Expression of FcRH transcripts in human B cell lines Cell Type Cell lisae FcRIIl FcRH2 FcRH3 Pro-B ~ REH + - -Nahn 16 + -Pre-B ~ 697 - - -OBS - -B ~ Ramos + + +
Daudi + + +
Raji + + +
T Jurkat + - -Monocytic THP-1 + - -Myelomonocytic LT937 + -Promyelocytic HL-60 + - -Myelocytic KG-1 + - -Erythroid K562 - -FcRHl, FcRH2, and FcRH3 expression in cell lines was determined by RT-PCR.
RT-PCR analysis of sorted populations of peripheral blood cells indicated that FcRHl, FcRH2, FcRH3, and FcRHS are expressed at relatively high levels in CD19+ B
cells, whereas FcRH4 was expressed at only trace levels. FcRH3 expression was observed in CD3+ T cells whereas transcripts of FcRHl were barely detectable.
FcRHl expression also was observed in circulating granulocytes.
To refine the analysis of FcRH expression in secondary lymphoid tissues, tonsillar lymphocyte subpopulations were isolated. The five discrete subpopulations of B lineage cells, which can be distinguished by their differential expression of cell surface IgD and CD38, represent different stages in B cell differentiation:
follicular mantle (IgD+CD38), pre-GC (IgD+CD38+), GC (IgDCD38+), memory (IgDCD38), and mature plasma cells (CD382+) (Pascual, V. (1994) J. Exp. Med. 180: 329-339). RT-PCR analysis of FcRHl-5 expression in tonsillar B cell subpopulations was performed.
Viable cells were magnetically sorted into CD19- non-B cells and CD19+ B
cells. The latter were stained with anti-IgD and anti-CD38 mAbs, and the five subpopulations indicated (CD38- IgD-, CD38- IgD+, CD38+ IgD+, CD38+ IgD-, and CD382+) were sorted by flow cytometry. RT-PCR analysis of FcRH transcripts in non-B cells and the B cell subpopulations was also performed. After cDNA preparation, PCR
amplification was performed on the equivalent template of approximately 10 k cells.
Glyceraldehyde-3-phosphate dehydrogenase (GADPH) was amplified as a positive control.
RT-PCR analysis indicated little or no expression of FcRH transcripts in the non-B lineage CD19- cells, most of which are T cells. However, CD19+
subpopulations displayed coordinate expression of FcRHl, FcRH2, and FcRH3 transcripts in follicular mantle, naive, GC, and memory B cell subpopulations, but yielded no evidence of FcRH transcripts in pre-GC B cells or plasma cells. In contrast, FcRH4 transcripts were restricted to the follicular mantle and memory B cells, whereas FcRHS
expression extended to mature plasma cells.
The relationship between the five FcRHs was examined by comparing their full-length, extracellular, and individual Ig-like domain amino acid sequences.
This analysis, which included a recently identified mouse FcRH ortholog (moFeRH) and members of the FcR family, used the CLUSTAL method algorithm (Higgins, D. G. &
Sharp, P. M. (1989) Comput. Appl. Biosci. 5, 151-153). Comparison of the full-length 10 sequences of other FcRH family members with FcRH3 indicated 40-47%
identity. By comparison, the degree of FcRH3 homology with the moFcRH was found to be 35%
and 21-24% with FcR members residing on chromosome 1, FcyRI, FcyRII, Fc~yRIII, and FcsRI. A lower level of amino acid identity (14%) was observed for the chromosome 19 LRC member, FcaR. A slightly higher degree of extracellular homology was 15 evident. Pairwise analysis of the individual Ig-like subunits indicated conservation in membrane-distal to membrane-proximal ordering of extracellular domain composition among family members. Although similar Ig domain subunits were shared among family members, the individual receptors were found to be composed of unique domain combinations. The extracellular domain configuration of the moFcRH most closely 20 resembled that of FcRH2, with which it has 46% identity. The extended pairwise comparison of the FcRH family with known FcRs suggested the conservation of these Ig-like domains to some degree throughout the greater family. The resemblance is particularly evident in the FcRH3 membrane-distal domains that correspond to the three FcyRI domains and the two domains of FcYRII, FcyRIII, and the FcR y-chain.
This 25 analysis suggests the ancestral occurrence of differential duplication and diversification of the individual Ig-like subunits in the respective FcRH family members. The data also indicate that the FcRHs are more similar to their FcR neighbors on chromosome 1 than to their FcR relative on chromosome 19.
The genomic sequence analysis of relevant chromosome 1 q21 BAC
30 clones indicated that the entire FcRH locus spans 300 kb. The FcRH genes lie in the same transcriptional orientation toward the centomere. Exon-intron boundaries were characterized by sequence comparison of their respective cDNA clones and the AG/GT
rule. The FcRHl gene consists of 11 exons and 10 introns spanning about 28 kb.
The first exon, 5' UT/S 1, encodes the 5' UT region, the ATG translation initiation colon, and the first half of a split signal peptide. S2, the second exon, is separated from 5' UT/S1 by a long intron of 12.9 kb and, like the neighboring FcRs, is 21 by in length (van de Winkel, J. G. & Capel, P. J. (1993) Immunol. Today 14, 221; Kulczycki, A., Jr. et al. (1990) Proc. Natl. Acad. Sci. USA 87, 2856-2860; Pang, J.
et al. (1994) J. Immunol. 151, 6166-6174.). The extracellular region is encoded by three closely clustered exons, ECl-EC3, that code for the three Ig-like domains. The membrane-proximal, transmembrane, and the proximal portion of the cytoplasmic domain are encoded by a single sixth exon, TM. The cytoplasmic tail is encoded by five exons, CY1-CYS, and the CYS also encodes the beginning of the 3' UT region.
FcRH2 contains 12 exons and 11 introns that span 30 kb. It also contains two exons that encode a split signal peptide, the first of which, 5'UT/S1, includes the 5' UT
region, the ATG translation initiation colon, and first half of the signal peptide. The second exon, S2, is 21 by in length. Exons 3-6 encode the four extracellular domains, EC1-EC4. The seventh exon encodes the membrane-proximal, transmembrane, and the proximal portion of the cytoplasmic domain. The FcRH2 cytoplasmic tail is encoded by five exons, CY1-CYS, the last exon of which includes the termination of the ORF and beginning of the 3' UT region.
The FcRH3 gene consists of 16 exons and 15 introns that span about 24 kb.
Unlike FcRHl and FcRH2, its 5' UT region is encoded by two exons, 5' UTl and a second, 5'UT2/S 1, that also encodes the ATG translation initiation colon and the beginning of the split signal peptide. The third exon, S2, is also 21 by in length.
Extracellular domains encoded by six exons, EC1-EC6, are followed by exon 10 that encodes the membrane-proximal, transmembrane, and the proximal portion of the cytoplasmic domain. The cytoplasmic tail is encoded by five exons, CY1-CYS;
the last contains the beginning of the 3' UT region.
Identification of HuFcRH6 FcRH6 is located in the midst of the classical FcRs at 1q21-23. Its genomic structure indicates, like the classical FcRs and FcRHI-5, a split hydrophobic signal peptide encoded by two exons the second of which is 2lbp.
FcRH6 was characterized using the methods described in Example 1. A
composite analysis of Ig-like domains for relatedness with the other huFcRHs was performed. See Figure xxx. Sequence analysis of huFcRH6 indicates its type I
transmembrane form contains a consensus motif for a single ITAM, or a single or two ITIM's.
Initial RT-PCR analysis of huFcRH6 in human tissues and cell lines (as described in Example 1) reveals transcript expression in normal tonsil and lymph nodes. In cell lines, expression of huFcRH6 was identified in myeloid cell lines THP-1 (monocytic), U937 (myelomonocytic), and KG-1 (myelocytic). Limited expression if any was identified in the 207 pre-B cell line and the Daudi B cell line.
Generation of Transfectants and Antibodies Recombinant constructs for transfection and stable expression of huFcRHl-5 have been generated. The constructs have been ligated into a CMV driven mammalian expression vector with and without green fluorescent protein (GFP) fusion at the carboxyl terminus. Surface expression of huFcRHI and huFcRH3 was detected for both GFP and non-GFP forms by staining with antibody supernatant. The antibody supernatant was derived from hybridomas generated by mice immunized with recombinant extracellular protein of the respective FcRH. The constructs for huFcRH2, 4, and 5 have been detected by green fluorescence as well as surface expression for FcRH4.
Monoclonal antibodies have been generated, including, for example, an antibody that binds FcRHl. The preliminary analysis of FACS staining for FcRHl expression with monoclonal antibody 1-SA3 labeled with a FITC conjugate (mouse anti human FcRHl) in peripheral blood from normal volunteers indicates virtually all CD19+ B cells have huFcRHl expression, as do CD14+ monocytes and CD13+
granulocytes. CD3+ T cells have limited to no expression of FcRHl. Staining of B-CLL
samples from two different patient peripheral blood samples indicates that virtually all CDS+/CD19+ B-CLL cells are positive for the FcRHl 1-SA3 antigen. By western blot analysis of recombinant protein for FcRHl-5 extracellular regions 1-SA3 appears specific for FcRHl. 1-SA3 also stains B cell lines Daudi and Raji.
Identification of MoFcRHl-3 A family of three mouse Fc Receptor Homologs (MoFCRHs) were identified and cloned. Amino acid sequences from the membrane proximal Ig-like domains of huFcRHl-5 were used to identify putative mouse FcRH orthologs in the NCBI or Cetera genomic, EST, and protein databases using the protein BLAST (BLASTP) and the translated nucleotide BLAST (TBLASTN] algorithms, respectively. The location of moFcR family is split between chromosomes 1 and 3 in regions syntenic with human chromosome 1q21-23. See Figure 4. The mo FcRH are located on mouse Ch3. Approximate positions were determined from Genbank, Cetera, and Mouse Genome Informatics databasesContigs of ESTs were generated to determine the putative cDNA sequences.
Genomic organization was determined by comparing cDNA clones generated from RACE PCR with GenBank and Cetera genomic sequences. DNAStar software was used for analysis of exon-intron boundaries which were characterized by sequence comparison and the AG/GT rule. All three genes contain a split signal sequence with a 2lbp S2 exon (exon 2) which is found in all FcR and huFcRH genes on human chromosone 1.
A comparison of tyrosine based motifs in FcRH cytoplasmic tails indicated homology with the huFcRH family. See Figure 5. An analysis of sequence homology conservation is further shown in Figures 6 and 7.
Expression of the moFcRHs in tissue and cell lines was also characterized as described in Example 1. Briefly, RT-PCR was performed on mouse tissues and cell lines with gene specific primers. Viable tissue was placed in TRIzoI reagent for RNA
extraction. After cDNA preparation PCT amplification was performed on equivalent template amounts. Actin was amplified as a positive control. McFcRH3 appears to have preferential expression in cells of B lineage. The results are shown in Tables 3-4.
Table 3: Tissue Distribution of moFcRH Expression TISSUE MoFcRHl MoFcRH2 MoFcRH3 Bone Marrow + + +
Thymus + + +
Spleen + + +
Lymph Node + + +
Peyer's Patches+ + +
Peripheral + + +
Blood Brain + -Liver + + -Heart + -Muscle + - -Kidney + - -Lung + + -Intestine + + +
Testes I + I -Table 4. Expression of moFcRH transcripts in cell lines Cell Type Cell lirasFcRHI FcRH2 FcRH3 Pro-B SCID7 + +/- +
Raw8.1 + + -Pre-B 70Z/3 + + +
BC76 - ' + +
18-81 + + +
Imm. B WEHI-231 + + +
WEHI-279 + + +
B A20 + + +
X16C8.5 + + +
T EL4 + +/- -/+
NKT NKT + +/- -~T 2C12 + +/- -Myeloid WEHI-3 + - -Lymphoid YAC-1 + + -Fibroblast 3T3 + +/- -Expression in cell lines was determined by RT-PCR.
5 The mouse FcReceptor Homologs include secreted or type I transmembrane isolfrms that have unique cytoplasmic tails with potential activation and inhibition motifs. Their chromosomal location, Ig domain homology, and genomic organization indicate the mouse FcReceptor Homologs are orthologs of the huFcRH that have evolved a significant level of diversity. moFcRHl, moFcRH2, and moFcRH3 are 10 predicted to encode secreted or type I transmembrane proteins based on their amino acid sequences. moFcRHl has two secreted isoforms both of which have extracellular (EC) regions of four Ig-like domains with five potential sites for N-linked glycosylation. One isoform is a fusion protein with a type B scavenger receptor domain containing 8 cysteines. moFcRH2 has secreted and type I isoforms containing two Ig-15 like domains with five N-linked glycosylation sites. The type I isoform has an uncharged transmembrane region which the secreted isoform lacks. Both isoforms contain the cytoplasmic portion which is long in the transmembrane form and contains five tyrosines including a consensus sequence for one potential immunoreceptor tyrosine-based activating motif . moFcRH3 contains five Ig-like domains with six potential sites of N-linked glycosylation. Its transmembrane domain is also uncharged and the cytoplasmic region contains one potential ITAM and one potential immunoreceptor tyrosine-based inhibitory motif. The amino acid (aa) length of individual regions and full length (FL) isoforms, as well as approximate molecular weight (MW) in Daltons (Da), is indicated in the structural diagram of Figure ~.
Throughout this application, various publications are referenced. The disclosures of these publications in their entireties are hereby incorporated by reference into this application in order to more fully describe the state of the art to which this invention pertains.
It will be apparent to those skilled in the art that various modifications and variations can be made in the present invention without departing from the scope or spirit of the invention. Other embodiments of the invention will be apparent to those skilled in the art from consideration of the specification and practice of the invention disclosed herein. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the invention being indicated by the following claims.
SEQUENCE LISTING
<110> The UAB Research Foundation Davis, Randall S.
Cooper, Max D.
<120> MEMBERS OF THE FC RECEPTOR HOMOLOG GENE FAMILY (FCRH1-3, 6), RELATED REAGENTS, AND USES THEREOF
<130> 21085.0037P1 <141> 2003-03-25 <150> US 60/367,667 <151> 2002-03-25 <160> 102 <170> FastSEQ for Windows Version 4.0 <210> 1 <211> 99 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 1 Lys Arg Lys I1e Gly Arg Arg 5er Ala Arg Asp Pro Leu Arg Ser Leu 1 5 l0 15 Pro Ser Pro Leu Pro Gln Glu Phe Thr Tyr Leu Asn Ser Pro Thr Pro Gly Gln Leu Gln Pro Ile Tyr Glu Asn Val Asn Val Val Ser Gly Asp Glu Val Tyr Ser Leu,Ala Tyr Tyr Asn Gln Pro Glu G1n Glu Ser Val Ala Ala Glu Thr Leu Gly Thr His Met Glu Asp Lys Val Ser Leu Asp Ile Tyr Ser Arg Leu Arg Lys Ala Asn Ile Thr Asp Val Asp Tyr Glu Asp Ala Met <210> 2 <211> 413 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 2 Ala Glu Leu Phe Leu Tle Ala Ser Pro Ser His Pro Thr Glu Gly Ser Pro Val Thr Leu Thr Cys Lys Met Pro Phe Leu Gln Ser 5er Asp Ala Gln Phe Gln Phe Cys Phe Phe Arg Asp Thr Arg Ala Leu Gly Pro Gly Trp Ser Ser Ser Pro Lys Leu Gln Ile Ala Ala Met Trp Lys Glu Asp Thr Gly Ser Tyr Trp Cys Glu Ala Gln Thr Met Ala Ser Lys Val Leu Arg Ser Arg Arg Ser Gln Ile Asn Val His Arg Val Pro Val Ala Asp Val Ser Leu Glu Thr Gln Pro Pro Gly Gly Gln Val Met Glu Gly Asp Arg Leu Val Leu I1e Cys Ser Val Ala Met Gly Thr Gly Asp Ile Thr Phe Leu Trp Tyr Lys Gly Ala Val Gly Leu Asn Leu Gln Ser Lys Thr Gln Arg 5er Leu Thr Ala Glu Tyr Glu Ile Pro Ser Val Arg Glu Ser Asp Ala Glu Gln Tyr Tyr Cys Val Ala Glu Asn Gly Tyr Gly Pro Ser Pro Ser Gly Leu Val Ser Ile Thr Val Arg Ile Pro Val Ser Arg Pro Ile Leu Met Leu Arg Ala Pro Arg Ala Gln Ala Ala Val Glu Asp Val Leu Glu Leu His Cys Glu Ala Leu Arg Gly Ser Pro Pro Ile Leu Tyr Trp Phe Tyr His Glu Asp Ile Thr Leu Gly Ser Arg Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Glu Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asn Asn Gly Leu Gly Ala Gln Arg Ser Glu Ala Val Thr Leu Asn Phe Thr Val Pro Thr Gly Ala Arg Ser Asn His Leu Thr Ser Gly Val Ile Glu Gly Leu Leu Ser Thr Leu Gly Pro Ala Thr Val Ala Leu Leu Phe Cys Tyr Gly Leu Lys Arg Lys Ile Gly Arg Arg Ser Ala Arg Asp Pro Leu Arg Ser Leu Pro Ser Pro Leu Pro Gln Glu Phe Thr Tyr Leu Asn Ser Pro Thr Pro Gly Gln Leu Gln Pro Ile Tyr Glu Asn Val Asn Val Val Ser Gly Asp Glu Val Tyr Ser Leu Ala Tyr Tyr Asn G1n Pro Glu Gln Glu Ser Val Ala Ala Glu Thr Leu Gly Thr His Met ~Glu Asp Lys Val Ser Leu Asp Ile Tyr Ser Arg Leu Arg Lys Ala Asn Ile Thr Asp Val Asp Tyr Glu Asp Ala Met <210> 3 <211> 86 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 3 His Lys Ile Ser Gly Glu Ser Ser A1a Thr Asn Glu Pro Arg Gly Ala Ser Arg Pro Asn Pro Gln Glu Phe Thr Tyr Ser 5er Pro Thr Pro Asp Met Glu Glu Leu Gln Pro Val Tyr Val Asn Val G1y Ser Val Asp Val Asp Val Val Tyr Ser Gln Val Trp Ser Met Gln Gln Pro Glu Ser Ser Ala Asn Ile Arg Thr Leu Leu Glu Asn Lys Asp Ser Gln Val Tle Tyr Ser Ser Val Lys Lys Ser <210> 4 <21l> 489 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 4 Leu Thr Leu Val Ala Pro Ser Ser Val Phe Glu Gly Asp Ser Ile Val Leu Lys Cys Gln Gly Glu Gln Asn Trp Lys Tle Gln Lys Met Ala Tyr His Lys Asp Asn Lys Glu Leu Ser Val Phe Lys Lys Phe Ser Asp Phe Leu Tle Gln Ser Ala Val Leu Ser Asp Ser Gly Asn Tyr Phe Cys Ser Thr Lys Gly Gln Leu Phe Leu Trp Asp Lys Thr Ser Asn Ile Val Lys Ile Lys Val Gln Glu Leu Phe Gln Arg Pro Val Leu Thr Ala Ser Ser Phe Gln Pro Ile Glu Gly Gly Pro Val 5er Leu Lys Cys Glu Thr Arg Leu Ser Pro Gln Arg Leu Asp Val Gln Leu Gln Phe Cys Phe Phe Arg Glu Asn Gln Val Leu Gly Ser Gly Trp Ser Ser Ser Pro Glu Leu Gln Ile Ser Ala Val Trp Ser Glu Asp Thr G1y Ser Tyr Trp Cys Lys Ala Glu Thr Va1 Thr His Arg Ile Arg Lys Gln 5er Leu Gln Ser Gln Ile _ 165 170 175 His Val Gln Arg Ile Pro Ile Ser Asn Val Ser Leu Glu Ile Arg Ala Pro Gly Gly Gln Val Thr Glu Gly Gln Lys Leu Ile Leu Leu Cys Ser Val Ala G1y Gly Thr Gly Asn Val Thr Phe Ser Trp Tyr Arg Glu Ala Thr Gly Thr Ser Met G1y Lys Lys Thr G1n Arg Ser Leu Ser Ala Glu Leu Glu Ile Pro Ala Val Lys Glu Ser Asp Ala Gly Lys Tyr Tyr Cys Arg Ala Asp Asn Gly His Val Pro Ile Gln Ser Lys Val Val Asn Ile Pro Val Arg Ile Pro Val Ser Arg Pro Val Leu Thr Leu Arg Ser Pro Gly Ala Gln Ala Ala Val Gly Asp Leu Leu Glu Leu His Cys Glu Ala Leu Arg Gly Ser Pro Pro Tle Leu Tyr Gln Phe Tyr His Glu Asp Val Thr Leu Gly Asn Ser Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Ala Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asn Asn G1y Leu Gly Ala Gln Cys Ser Glu Ala Val Pro Val Ser Ile Ser Gly Pro Asp Gly Tyr Arg Arg Asp Leu Met Thr Ala Gly Val Leu Trp Gly Leu Phe Gly Val Leu Gly Phe Thr Gly Val Ala Leu Leu Leu Tyr Ala Leu Phe His Lys Tle Ser Gly Glu Ser Ser Ala Thr Asn Glu Pro Arg G1y Ala Ser Arg Pro Asn Pro Gln Glu Phe Thr Tyr Ser Ser Pro Thr Pro Asp Met Glu Glu Leu Gln Pro Val Tyr Val Asn Val Gly Ser Val Asp Val Asp Val Val Tyr Ser Gln Val Trp Ser Met Gln Gln Pro Glu Ser Ser Ala Asn Ile Arg Thr Leu Leu Glu Asn Lys Asp Ser Gln Val 21e Tyr Ser Ser Val Lys Lys Ser <210> 5 <211> 140 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 5 His Tyr Ala Arg Ala Arg Arg Lys Pro Gly Gly Leu Ser Ala Thr Gly Thr Ser Ser His Ser Pro Ser Glu Cys Gln Glu Pro Ser Ser Ser Arg Pro Ser Arg Ile Asp Pro Gln G1u Pro Thr His Ser Lys Pro Leu Ala Pro Met Glu Leu Glu Pro Met Tyr Ser Asn Val Asn Pro Gly Asp Ser Asn Pro Ile Tyr Ser Gln Ile Trp Ser Ile Gln His Thr Lys Glu Asn Ser Ala Asn Cys Pro Met Met His Gln Glu His Glu Glu Leu Thr Val Leu Tyr Ser Glu Leu Lys Lys Thr His Pro Asp Asp Ser Ala Gly Glu Ala Ser Ser Arg Gly Arg Ala His Glu Glu Asp Asp Glu Glu Asn Tyr Glu Asn Val Pro Arg Val Leu Leu Ala Ser Asp His <210> 6 <211> 717 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 6 Gly Val Ala Pro Lys Ala Val Leu Leu Leu Asn Pro Pro Trp Ser Thr Ala Phe Lys Gly Glu Lys Val Ala Leu Ile Cys Ser Ser Ile Ser His Ser Leu Ala Gln Gly Asp Thr Tyr Trp Tyr His Asp Glu Lys Leu Leu Lys Ile Lys His Asp Lys Ile Gln Ile Thr Glu Pro Gly Asn Tyr Gln Cys Lys Thr Arg Gly Ser Ser Leu Ser Asp Ala Val His Val Glu Phe Ser Pro Asp Trp Leu Ile Leu Gln Ala Leu His Pro Val Phe Glu Gly Asp Asn Val Ile Leu Arg Cys Gln Gly Lys Asp Asn Lys Asn Thr His Gln Lys Val Tyr Tyr Lys Asp Gly Lys Gln Leu Pro Asn Ser Tyr Asn 1l5 120 125 Leu Glu Lys Ile Thr Val Asn Ser Val Ser Arg Asp Asn Ser Lys Tyr His Cys Thr Ala Tyr Arg Lys Phe Tyr Ile Leu Asp Ile Glu Val Thr Ser Lys Pro Leu Asn Ile Gln Val Gln Glu Leu Phe Leu His Pro Val Leu Arg Ala Ser Ser Ser Thr Pro Ile Glu Gly Ser Pro Met Thr Leu Thr Cys Glu Thr Gln Leu Ser Pro Gln Arg Pro Asp Val Gln Leu Gln Phe Ser Leu Phe Arg Asp Ser Gln Thr Leu Gly Leu Gly Trp Ser Arg Ser Pro Arg Leu Gln Ile Pro Ala Met Trp Thr Glu Asp Ser Gly Ser Tyr Trp Cys G1u Va1 Glu Thr Va1 Thr His Ser Ile Lys Lys Arg Ser Leu Arg Ser Gln Ile Arg Val Gln Arg Val Pro Val Ser Asn Val Asn Leu Glu Ile Arg Pro Thr Gly Gly Gln Leu Ile Glu Gly Glu Asn Met Val Leu Ile Cys Ser Val Ala Gln Gly Ser Gly Thr Val Thr Phe Ser Trp His Lys Glu Gly Arg Val Arg Ser Leu Gly Arg Lys Thr G1n Arg Ser Leu Leu Ala G1u Leu His Val Leu Thr Val Lys Glu Ser Asp Ala Gly Arg Tyr Tyr Cys Ala Ala Asp Asn Val His Ser Pro Ile Leu Ser Thr Trp Ile Arg Val Thr Val Arg Tle Pro Val Ser His Pro Val Leu Thr Phe Arg Ala Pro Arg Ala His Thr Val Val Gly Asp Leu Leu Glu Leu His Cys Glu Ser Leu Arg Gly Ser Pro Pro Ile Leu Tyr Arg Phe Tyr His Glu Asp Val Thr Leu Gly Asn Ser Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Ala Glu His Ser Gly Asn Tyr Ser Cys Asp Ala Asp Asn Gly Leu Gly A1a Gln His Ser His Gly Val Ser Leu Arg Val Thr Val Pro Val Ser Arg Pro Val Leu Thr Leu Arg Ala Pro Gly Ala Gln Ala Val Val Gly Asp Leu Leu Glu Leu His Cys Glu Ser Leu Arg Gly Ser Phe Pro Ile Leu Tyr Trp Phe Tyr His Glu Asp Asp Thr Leu Gly Asn Ile Ser Ala His Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Thr Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asp Asn Gly Leu Gly Ala G1n His Ser Lys Val Val Thr Leu Asn Val Thr Gly Thr Ser Arg Asn Arg Thr Gly Leu Thr Ala Ala Gly Ile Thr Gly Leu Val Leu Ser Ile Leu Val Leu Ala Ala Ala Ala Ala Leu Leu His Tyr Ala Arg Ala Arg Arg Lys Pro Gly Gly Leu 5er A1a Thr Gly Thr Ser Ser His Ser Pro Ser Glu Cys Gln Glu Pro Ser Ser Ser Arg Pro Ser Arg Ile Asp Pro Gln Glu Pro Thr His Ser Lys Pro Leu Ala Pro Met Glu Leu Glu Pro Met Tyr Ser Asn Val Asn Pro Gly Asp Ser Asn Pro Ile Tyr Ser Gln Ile Trp Ser Ile Gln His Thr Lys Glu Asn Ser Ala Asn Cys Pro Met Met His Gln Glu His Glu Glu Leu Thr Val Leu Tyr Ser Glu Leu Lys Lys Thr His Pro Asp Asp Ser Ala Gly Glu Ala Ser Ser Arg Gly Arg Ala His Glu Glu Asp Asp Glu Glu Asn Tyr Glu Asn Val Pro Arg Val Leu Leu Ala Ser Asp His <210> 7 <211> 300 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 7 aaaaa taggaagacg ttcagccagggatccactcaggagccttcccagccctcta 60 aaaa g accccagggcagctacagcctatatatgaa 120 ccccaagagt tcacctacct caactcacct aatgtgaatg ttgtaagtgg ggatgaggtttattcactggcgtactataaccagccggag 180 caggaatcag tagcagcaga aaccctggggacacatatggaggacaaggtttccttagac 240 atctattcca ggctgaggaa agcaaacattacagatgtggactatgaagatgctatgtaa 300 <210> 8 <211> 2038 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 8 ctcgactctg aggtgcattc tttttttgat gagaggcatc tctaggtacc atccctgacc 60 tggtcctcat gctgccgagg ctgttgctgt tgatctgtgc tccactctgt gaacctgccg 120 agctgtttttgatagccagcccctcccatcccacagaggggagcccagtgaccctgacgt180 gtaagatgccctttctacagagttcagatgcccagttccagttctgctttttcagagaca240 cccgggccttgggcccaggctggagcagctcccccaagctccagatcgctgccatgtgga300 aagaagacacagggtcatactggtgcgaggcacagacaatggcgtccaaagtcttgagga360 gcaggagatcccagataaatgtgcacagggtccctgtcgctgatgtgagcttggagactc420 agcccccaggaggacaggtgatggagggagacaggctggtcctcatctgctcagttgcta480 tgggcacaggagacatcaccttcctttggtacaaaggggctgtaggtttaaaccttcagt540 caaagacccagcgttcactgacagcagagtatgagattccttcagtgagggagagtgatg600 ctgagcaatattactgtgtagctgaaaatggctatggtcccagccccagtgggctggtga660 gcatcactgtcagaatcccggtgtctcgcccaatcctcatgctcagggctcccagggccc720 aggctgcagtggaggatgtgctggagcttcactgtgaggccctgagaggctctcctccaa780 tcctgtactggttttatcacgaggatatcaccctggggagcaggtcggccccctctggag840 gaggagcctccttcaacctttccctgactgaagaacattctggaaactactcctgtgagg900 ccaacaatggcctgggggcccagcgcagtgaggcggtgacactcaacttcacagtgccta960 ctggggccagaagcaatcatcttacctcaggagtcattgaggggctgctcagcacccttg1020 gtccagccaccgtggccttattattttgctacggcctcaaaagaaaaataggaagacgtt1080 cagccagggatccactcaggagccttcccagccctctaccccaagagttcacctacctca1140 actcacctaccccagggcagctacagcctatatatgaaaatgtgaatgttgtaagtgggg1200 atgaggtttattcactggcgtactataaccagccggagcaggaatcagtagcagcagaaa1260 ccctggggacacatatggaggacaaggtttccttagacatctattccaggctgaggaaag1320 caaacattacagatgtggactatgaagatgctatgtaaggttatggaagattctgctctt1380 tgaaaaccatccatgaccccaagcctcaggcctgatatgttcttcagagatcctggggca1440 ttagctttccagtatacctcttctggatgccattctccatggcactattccttcatctac1500 tgtgaagtgaagttggcgcagccctgaagaaactacctaggagaactaatagacacagga1560 gtgacagggactttgttatcagaaccagattcctgccggctcctttgaaaacaggtcata1620 ttgtgctcttctgtttacaagaggaaacaagatggaataaaagaaattgggatcttgggt1680 tggagggacagtgaagcttagagcacatgaactcaaggttagtgactctgcaggacttca1740 cagagagagctgtgcccatcattcagtccaagtgctttctctgcccagacagcacagaac1800 tccagccccgctacttacatggatcatcgagtttccacctaaaatatgattctatttatt1860 ttgagtcactgttaccaaattagaactaaaacaaagttacataaaaagttattgtgactc1920 cacttaattttagtgacgtatttttgtatatataggccaacctataccacatccaaaatt1980 atgtatctattacagcccctagaagctttataaatacagtgtgtcttcttttattcac 2038 <210> 9 <211> 261 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificialequence:/note =
S
synthetic construct <400> 9 cacaagatat caggagaaag ttctgccactaatgaacccagaggggcttccaggccaaat60 cctcaagagt tcacctattc aagcccaaccccagacatggaggagctgcagccagtgtat120 gtcaatgtgg gctctgtaga tgtggatgtggtttattctcaggtctggagcatgcagcag180 ccagaaagct cagcaaacat caggacacttctggagaacaaggactcccaagtcatctac240 tcttctgtga agaaatcata a 261 <210> 10 <211> 2573 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 10 ggtgaccaag agtacatctc ttttcaaatagctggattaggtcctcatgctgctgtggtc60 attgctggtc atctttgatg cagtcactgaacaggcagattcgctgacccttgtggcgcc120 ctcttctgtcttcgaaggagacagcatcgttctgaaatgccagggagaacagaactggaa180 aattcagaagatggcttaccataaggataacaaagagttatctgttttcaaaaaattctc240 agatttccttatccaaagtgcagttttaagtgacagtggtaactatttctgtagtaccaa300 aggacaactctttctctgggataaaacttcaaatatagtaaagataaaagtccaagagct360 ctttcaacgtcctgtgctgactgccagctccttccagcccatcgaagggggtccagtgag420 cctgaaatgtgagacccggctctctccacagaggttggatgttcaactccagttctgctt480 cttcagagaaaaccaggtcctggggtcaggctggagcagctctccggagctccagatttc540 tgccgtgtggagtgaagacacagggtcttactggtgcaaggcagaaacggtgactcacag600 gatcagaaaacagagcctccaatcccagattcacgtgcagagaatccccatctctaatgt660 aagcttggagatccgggcccccgggggacaggtgactgaaggacaaaaactgatcctgct720 ctgctcagtggctgggggtacaggaaatgtcacattctcctggtacagagaggccacagg780 aaccagtatgggaaagaaaacccagcgttccctgtcagcagagctggagatcccagctgt840 gaaagagagtgatgccggcaaatattactgtagagctgacaacggccatgtgcctatcca900 gagcaaggtggtgaatatccctgtgagaattccagtgtctcgccctgtcctcaccctcag960 gtctcctggggcccaggctgcagtgggggacctgctggagcttcactgtgaggccctgag1020 aggctctcccccaatcttgtaccaattttatcatgaggatgtcacccttgggaacagctc1080 ggccccctctggaggaggggcctccttcaacctctctttgactgcagaacattctggaaa1140 ctactcctgtgaggccaacaacggcctgggggcccagtgcagtgaggcagtgccagtctc1200 catctcaggacctgatggctatagaagagacctcatgacagctggagttctctggggact1260 gtttggtgtccttggtttcactggtgttgctttgctgttgtatgccttgttccacaagat1320 atcaggagaaagttctgccactaatgaacccagaggggcttccaggccaaatcctcaaga1380 gttcacctattcaagcccaaccccagacatggaggagctgcagccagtgtatgtcaatgt1440 gggctctgtagatgtggatgtggtttattctcaggtctggagcatgcagcagccagaaag1500 ctcagcaaacatcaggacacttctggagaacaaggactcccaagtcatctactcttctgt1560 gaagaaatcataacacttggaggaatcagaagggaagatcaacagcaaggatggggcatc1620 attaagacttgctataaaaccttatgaaaatgcttgaggcttatcacctgccacagccag1680 aacgtgcctcaggaggcacctcctgtcatttttgtcctgatgatgtttcttctccaatat1740 cttcttttacctatcaatattcattgaactgctgctacatccagacactgtgcaaataaa1800 ttatttctgctaccttctcttaagcaatcagtgtgtaaagatttgagggaagaatgaata1860 agagatacaaggtctcaccttcatctactgtgaagtgatgagaacaggacttgatagtgg1920 tgtattaacttatttatgtgctgctggatacagtttgctaatattttgttgagaattttt1980 gcaaatatgttcattgggaatattggcctgaaattttcttttccactgtgtctctgccag2040 aatgtttgtatcaggctgatgctggcttcatagaatgagttaggcaggagcccttcctcc2100 ttgattttttggcatagtttcagcaggattggtaccagttattctttctgcatcttgtag2160 aattcagctatgaatccatctggtctagggcttttgtgttggttggtaagttttttatta2220 ctaattcaacttcagcgcttgatattggtctaggaggggtttctgtctcttcctggttca2280 atcttgggagattgtgtgtttccaggaatttagccgtttcctccagattttcttctttat2340 gtgcatcgacttgagtgtaaacataacttatatgcactgggaaaccaaaaaatctgtgtg2400 acttgctttattgcagcatttgttttattttggtagtctggaactgaacctgcaatatca2460 ccaaagtatgcatatagttgcaaaaatgtgatttttgacatagtaaatatgagtatttgc2520 aataaactatgatattacttttgtaagtatatagaataaaatgtaaataatct 2573 <210> 11 <211> 423 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
11 gggcccgaaggaaaccaggaggactttctgccactggaacatctagtcac 60 cattacgcca agtcctagcgagtgtcaggagccttcctcgtccaggccttccaggatagaccctcaagag 120 cccactcactctaaaccactagccccaatggagctggagccaatgtacagcaatgtaaat 180 cctggagatagcaacccgatttattcccagatctggagcatccagcatacaaaagaaaac 240 tcagctaattgtccaatgatgcatcaagagcatgaggaacttacagtcctctattcagaa 300 ctgaagaagacacacccagacgactctgcaggggaggctagcagcagaggcagggcccat 360 gaagaagatgatgaagaaaactatgagaatgtaccacgtgtattactggcctcagaccac 420 tag 423 <210> 12 <211> 2916 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
12 agtagcagcttcctgccctccttcttggagataagtcgggcttttggtga60 gtctcatctg gacagactttcccaaccctctgcccggccggtgcccatgcttctgtggctgctgctgctg120 atcctgactcctggaagagaacaatcaggggtggccccaaaagctgtacttctcctcaat180 cctccatggtccacagccttcaaaggagaaaaagtggctctcatatgcagcagcatatca240 cattccctagcccagggagacacatattggtatcacgatgagaagttgttgaaaataaaa300 catgacaagatccaaattacagagcctggaaattaccaatgtaagacccgaggatcctcc360 ctcagtgatgccgtgcatgtggaattttcacccgactggctgatcctgcaggctttacat420 cctgtctttgaaggagacaatgtcattctgagatgtcaggggaaagacaacaaaaacact480 catcaaaaggtttactacaaggatggaaaacagcttcctaatagttataatttagagaag540 atcacagtgaattcagtctccagggataatagcaaatatcattgtactgcttataggaag600 ttttacatacttgacattgaagtaacttcaaaacccctaaatatccaagttcaagagctg660 tttctacatcctgtgctgagagccagctcttccacgcccatagaggggagtcccatgacc720 ctgacctgtgagacccagctctctccacagaggccagatgtccagctgcaattctccctc780 ttcagagatagccagaccctcggattgggctggagcaggtcccccagactccagatccct840 gccatgtggactgaagactcagggtcttactggtgtgaggtggagacagtgactcacagc900 atcaaaaaaaggagcctgagatctcagatacgtgtacagagagtccctgtgtctaatgtg960 aatctagagatccggcccaccggagggcagctgattgaaggagaaaatatggtccttatt1020 tgctcagtagcccagggttcagggactgtcacattctcctggcacaaagaaggaagagta1080 agaagcctgggtagaaagacccagcgttccctgttggcagagctgcatgttctcaccgtg1140 aaggagagtgatgcagggagatactactgtgcagctgataacgttcacagccccatcctc1200 agcacgtggattcgagtcaccgtgagaattccggtatctcaccctgtcctcaccttcagg gctcccagggcccacactgtggtgggggacctgctggagcttcactgtgagtccctgaga1320 ggctctcccccgatcctgtaccgattttatcatgaggacgtcaccctggggaacagctca1380 gccccctctggaggaggagcctccttcaacctctctctgactgcagaacattctggaaac1440 tactcctgtgatgcagacaatggcctgggggcccagcacagtcatggagtgagtctcagg1500 gtcacagttccggtgtctcgccccgtcctcaccctcagggctcccggggcccaggctgtg1560 gtgggggacctgctggagcttcactgtgagtccctgagaggctccttcccgatcctgtac1620 tggttttatcacgaggatgacaccttggggaacatctcggcccactctggaggaggggca1680 tccttcaacctctctctgactacagaacattctggaaactactcatgtgaggctgacaat1740 ggcctgggggcccagcacagtaaagtggtgacactcaatgttacaggaacttccaggaac1800 agaacaggccttaccgctgcgggaatcacggggctggtgctcagcatcctcgtccttgct1860 gctgctgctgctctgctgcattacgccagggcccgaaggaaaccaggaggactttctgcc1920 actggaacatctagtcacagtcctagcgagtgtcaggagccttcctcgtccaggccttcc1980 aggatagaccctcaagagcccactcactctaaaccactagccccaatggagctggagcca2040 atgtacagcaatgcaaatcctggagatagcaacccgatttattcccagatctggagcatc2100 cagcatacaaaagaaaactcagctaattgtccaatgatgcatcaagagcatgaggaactt2160 acagtcctctattcagaactgaagaagacacacccagacgactctgcaggggaggctagc2220 agcagaggcagggcccatgaagaagatgatgaagaaaactatgagaatgtaccacgtgta2280 ttactggcctcagaccactagccccttacccagagtggcccacaggaaacagcctgcacc2340 atttttttttctgttctctccaaccacacatcatccatctctccagactctgcctcctac2400 gaggctgggctgcagg 2416 <210> 13 <211> 873 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
13 tgatagccagcccctcccatcccacagaggggagcccagtgaccctgacg 60 gagctgtttt tgtaagatgccctttctacagagttcagatgcccagttccagttctgctttttcagagac 120 acccgggccttgggcccaggctggagcagctcccccaagctccagatcgctgccatgtgg 180 aaagaagacacagggtcatactggtgcgaggcacagacaatggcgtccaaagtcttgagg 240 agcaggagatcccagataaatgtgcacagggtccctgtcgctgatgtgagcttggagact 300 cagcccccaggaggacaggtgatggagggagacaggctggtcctcatctgctcagttgct 360 atgggcacaggagacatcaccttcctttggtacaaaggggctgtaggtttaaaccttcag 420 tcaaagacccagcgttcactgacagcagagtatgagattccttcagtgagggagagtgat 480 gctgagcaatattactgtgtagctgaaaatggctatggtcccagccccagtgggctggtg 540 agcatcactgtcagaatcccggtgtctcgcccaatcctcatgctcagggctcccagggcc 600 caggctgcagtggaggatgtgctggagcttcactgtgaggccctgagaggctctcctcca 660 atcctgtactggttttatcacgaggatatcaccctggggagcaggtcggccccctctgga 720 ggaggagcctccttcaacctttccctgactgaagaacattctggaaactactcctgtgag 780 gccaacaatggcctgggggcccagcgcagtgaggcggtgacactcaacttcacagtgcct 840 actggggccagaagcaatcatcttacctcagga 873 <210>
<211>
<212>
DNA
<213> ficial Arti Sequence <220>
<223> iption equence:/note =
Descr of Artificial S
synth etic construct <400>
14 cgccctcttctgtcttcgaaggagacagcatcgttctgaaatgccaggga60 acccttgtgg gaacagaactggaaaattcagaagatggcttaccataaggataacaaagagttatctgtt120 ttcaaaaaattctcagatttccttatccaaagtgcagttttaagtgacagtggtaactat180 ttctgtagtaccaaaggacaactctttctctgggataaaacttcaaatatagtaaagata290 aaagtccaagagctctttcaacgtcctgtgctgactgccagctccttccagcccatcgaa300 gggggtccagtgagcctgaaatgtgagacccggctctctccacagaggttggatgttcaa360 ctccagttctgcttcttcagagaaaaccaggtcctggggtcaggctggagcagctctccg420 gagctccagatttctgccgtgtggagtgaagacacagggtcttactggtgcaaggcagaa480 acggtgactcacaggatcagaaaacagagcctccaatcccagattcacgtgcagagaatc540 cccatctctaatgtaagcttggagatccgggcccccgggggacaggtgactgaaggacaa600 , aaactgatcctgctctgctcagtggctgggggtacaggaaatgtcacattctcctggtac660 agagaggccacaggaaccagtatgggaaagaaaacccagcgttccctgtcagcagagctg720 gagatcccagctgtgaaagagagtgatgccggcaaatattactgtagagctgacaacggc780 catgtgcctatccagagcaaggtggtgaatatccctgtgagaattccagtgtctcgccct840 gtcctcaccctcaggtctcctggggcccaggctgcagtgggggacctgctggagcttcac900 tgtgaggccctgagaggctctcccccaatcttgtaccaattttatcatgaggatgtcacc960 cttgggaacagctcggccccctctggaggaggggcctccttcaacctctctttgactgca1020 gaacattctggaaactactcctgtgaggccaacaacggcctgggggcccagtgcagtgag1080 gcagtgccagtctccatctcaggacctgatggctatagaagagacctcatgacagct 1137 <210>
<211>
<212>
DNA
<213> ence Artificial Sequ <220>
<223> Description of Artificial Sequence:/note synthetic construct <400> 15 gtggccccaa aagctgtact tctcctcaat cctccatggt ccacagcctt caaaggagaa 60 aaagtggctc tcatatgcag cagcatatca cattccctag cccagggaga cacatattgg 120 tatcacgatg agaagttgtt gaaaataaaa catgacaaga tccaaattac agagcctgga 180 aattaccaat gtaagacccg aggatcctcc ctcagtgatg ccgtgcatgt ggaattttca 240 cccgactggctgatcctgcaggctttacatcctgtctttgaaggagacaatgtcattctg300 agatgtcaggggaaagacaacaaaaacactcatcaaaaggtttactacaaggatggaaaa360 cagcttcctaatagttataatttagagaagatcacagtgaattcagtctccagggataat420 agcaaatatcattgtactgcttataggaagttttacatacttgacattgaagtaacttca480 aaacccctaaatatccaagttcaagagctgtttctacatcctgtgctgagagccagctct540 tccacgcccatagaggggagtcccatgaccctgacctgtgagacccagctctctccacag600 aggccagatgtccagctgcaattctccctcttcagagatagccagaccctcggattgggc660 tggagcaggtcccccagaotccagatccctgccatgtggactgaagactcagggtcttac720 tggtgtgaggtggagacagtgactcacagcatcaaaaaaaggagcctgagatctcagata780 cgtgtacagagagtccctgtgtctaatgtgaatctagagatccggcccaccggagggcag840 ctgattgaaggagaaaatatggtccttatttgctcagtagcccagggttcagggactgtc900 acattctcctggcacaaagaaggaagagtaagaagcctgggtagaaagacccagcgttcc960 ctgttggcagagctgcatgttctcaccgtgaaggagagtgatgcagggagatactactgt1020 gcagctgataacgttcacagccccatcctcagcacgtggattcgagtcaccgtgagaatt1080 ccggtatctcaccctgtcctcaccttcagggctcccagggcccacactgtggtgggggac1140 ctgctggagcttcactgtgagtccctgagaggctctcccccgatcctgtaccgattttat1200 catgaggacgtcaccctggggaacagctcagccccctctggaggaggagcctccttcaac1260 ctctctctgactgcagaacattctggaaactactcctgtgatgcagacaatggcctgggg1320 gcccagcacagtcatggagtgagtctcagggtcacagttccggtgtctcgccccgtcctc1380 accctcagggctcccggggcccaggctgtggtgggggacctgctggagcttcactgtgag1440 tccctgagaggctccttcccgatcctgtactggttttatcacgaggatgacaccttgggg1500 aacatctcggcccactctggaggaggggcatccttcaacctctctctgactacagaacat1560 tctggaaactactcatgtgaggctgacaatggcctgggggcccagcacagtaaagtggtg1620 acactcaatgttacaggaacttccaggaacagaacaggc 1659 <210>
<211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223> iption rtificialequence:/note =
Descr of A S
synth etic construct <400>
16 gggcccgaaggaaaccaggaggactttctgccactggaacatctagtcac60 cattacgcca agtcctagcgagtgtcaggagccttcctcgtccaggccttccaggatagaccctcaagag120 cccactcactctaaaccactagccccaatggagctggagccaatgtacagcaatgcaaat180 cctggagatagcaacccgatttattcccagatctggagcatccagcatacaaaagaaaac240 tcagctaattgtccaatgatgcatcaagagcatgaggaacttacagtcctctattcagaa300 ctgaagaagacacacccagacgactctgcaggggaggctagcagcagaggcagggcccat360 gaagaagatgatgaagaaaactatgagaatgtaccacgtgtattactggcctcagaccac420 tag <210>
<211>
<212>
DNA
<213>
Artificial Sequence <220>
<223>
Description of Artificial Sequence:/note =
synthetic construct <400>
17 aagctgtacttctcctcaatcctccatggtccacagccttcaaaggagaa60 gtggccccaa aaagtggctctcatatgcagcagcatatcacattccctagcccagggagacacatattgg120 tatcacgatgagaagttgttgaaaataaaacatgacaagatccaaattacagagcctgga180 aattaccaatgtaagacccgaggatcctccctcagtgatgccgtgcatgtggaattttca240 cccgactggctgatcctgcaggctttacatcctgtctttgaaggagacaatgtcattctg300 agatgtcaggggaaagacaacaaaaacactcatcaaaaggtttactacaaggatggaaaa360 cagcttcctaatagttataatttagagaagatcacagtgaattcagtctccagggataat420 agcaaatatcattgtactgcttataggaagttttacatacttgacattgaagtaacttca480 aaacccctaaatatccaagttcaagagctgtttctacatcctgtgctgagagccagctct540 tccacgcccatagaggggagtcccatgaccctgacctgtgagacccagctctctccacag600 aggccagatgtccagctgcaattctccctcttcagagatagccagaccctcggattgggc660 tggagcaggtcccccagactccagatccctgccatgtggactgaagactcagggtcttac720 tggtgtgaggtggagacagtgactcacagcatcaaaaaaaggagcctgagatctcagata780 cgtgtacagagagtccctgtgtctaatgtgaatctagagatccggcccaccggagggcag840 ctgattgaaggagaaaatatggtccttatttgctcagtagcccagggttcagggactgtc900 acattctcctggcacaaagaaggaagagtaagaagcctgggtagaaagacccagcgttcc960 ctgttggcagagctgcatgttctcaccgtgaaggagagtgatgcagggagatactactgt1020 gcagctgataacgttcacagccccatcctcagcacgtggattcgagtcaccgtgagaatt1080 ccggtatctcaccctgtcctcaccttcagggctcccagggcccacactgtggtgggggac1140 ctgctggagcttcactgtgagtccctgagaggctctcccccgatcctgtaccgattttat1200 catgaggacgtcaccctggggaacagctcagccccctctggaggaggagcctccttcaac1260 ctctctctgactgcagaacattctggaaactactcctgtgatgcagacaatggcctgggg1320 gcccagcacagtcatggagtgagtctcagggtcacagttccggtgtctcgccccgtcctc1380 accctcagggctcccggggcccaggctgtggtgggggacctgctggagcttcactgtgag1440 tccctgagaggctccttcccgatcctgtactggttttatcacgaggatgacaccttgggg1500 aacatctcggcccactctggaggaggggcatccttcaacctctctctgactacagaacat1560 tctggaaactactcatgtgaggctgacaatggcctgggggcccagcacagtaaagtggtg1620 acactcaatgttacaggaacttccaggaacagaacaggccttaccgctgcgggaatcacg1680 gggctggtgctcagcatcctcgtccttgctgctgctgctgctctgctgcattacgccagg1740 gcccgaaggaaaccaggaggactttctgccactggaacatctagtcacagtcctagcgag1800 tgtcaggagccttcctcgtccaggccttccaggatagaccctcaagagcccactcactct1860 aaaccactagccccaatggagctggagccaatgtacagcaatgcaaatcctggagatagc1920 aacccgatttattcccagatctggagcatccagcatacaaaagaaaactcagctaattgt1980 ccaatgatgcatcaagagcatgaggaacttacagtcctctattcagaactgaagaagaca2040 cacccagacgactctgcaggggaggctagcagcagaggcagggcccatgaagaagatgat2100 gaagaaaactatgagaatgtaccacgtgtattactggcctcagaccactag 2151 <210> 18 <211> 315 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificialequence:/note =
S
synthetic construct <400> 18 agatcctgga gaaaagctgg gccccttccatcccagataccacccacagctccaggtgga60 gagcagtgcc cactatatgc caacgtgcatcaccagaaagggaaagatgaaggtgttgtc120 tactctgtgg tgcatagaac ctcaaagaggagtgaagccaggtctgctgagttcaccgtg180 gggagaaagg acagttctat catctgtgcggaggtgagatgcctgcagcccagtgaggtt240 tcatccacgg aggtgaatat gagaagcaggactctccaagaaccccttagcgactgtgag300 gaggttctct gctag 315 <210> 19 <211> 870 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 19 actgtctggc tgtacctcca agcctggccaaaccctgtgtttgaaggagatgccctgact60 ctgcgatgtc agggatggaa gaatacaccactgtctcaggtgaagttctacagagatgga120 aaattccttc atttctctaa ggaaaaccagactctgtccatgggagcagcaacagtgcag180 agccgtggcc agtacagctg ctctgggcaggtgatgtatattccacagacattcacacaa240 acttcagagactgccatggttcaagtccaagagctgtttccacctcctgtgctgagtgcc300 atcccctctcctgagccccgagagggtagcctggtgaccctgagatgtcagacaaagctg360 caccccctgaggtcagccttgaggctccttttctccttccacaaggacggccacaccttg920 caggacaggggccctcacccagaactctgcatcccgggagccaaggagggagactctggg480 ctttactggtgtgaggtggcccctgagggtggccaggtccagaagcagagcccccagctg590 gaggtcagagtgcaggctcctgtatcccgtcctgtgctcactctgcaccacgggcctgct600 gaccctgctgtgggggacatggtgcagctcctctgtgaggcacagaggggctcccctccg660 atcctgtattccttctaccttgatgagaagattgtggggaaccactcagctccctgtggt720 ggaaccacctccctcctcttcccagtgaagtcagaacaggatgctgggaactactcctgc780 gaggctgagaacagtgtctccagagagaggagtgagcccaagaagctgtctctgaagggt840 tctcaagtcttgttcactcccgccagcaac 870 <210>
<211>
<212>
DNA
<213> ficial enoe Arti Sequ <220>
<223> Description of Artificial Sequence:/note synthetic construct <400>
20 tgtacctccaagcctggccaaaccctgtgtttgaaggagatgccctgact60 actgtctggc ctgcgatgtcagggatggaagaatacaccactgtctcaggtgaagttctacagagatgga120 aaattccttcatttctctaaggaaaaccagactctgtccatgggagcagcaacagtgcag180 agccgtggccagtacagctgctctgggcaggtgatgtatattccacagacattcacacaa240 acttcagagactgccatggttcaagtccaagagctgtttccacctcctgtgctgagtgcc300 atcccctctcctgagccccgagagggtagcctggtgaccctgagatgtcagacaaagctg360 caccccctgaggtcagccttgaggctccttttctccttccacaaggacggccacaccttg420 caggacaggggccctcacccagaactctgcatcccgggagccaaggagggagactctggg480 ctttactggtgtgaggtggcccctgagggtggccaggtccagaagcagagcccccagctg540 gaggtcagagtgcaggctcctgtatcccgtcctgtgctcactctgcaccacgggcctgct600 gaccctgctgtgggggacatggtgcagctcctctgtgaggcacagaggggctcccctccg660 atcctgtattccttctaccttgatgagaagattgtggggaaccactcagctccctgtggt720 ggaaccacctccctcctcttcccagtgaagtcagaacaggatgctgggaactactcctgc780 gaggctgagaacagtgtctccagagagaggagtgagcccaagaagctgtctctgaagggt840 tctcaagtcttgttcactcccgccagcaactggctggttccttggcttcctgcgagcctg900 cttggcctgatggttattgctgctgcacttctggtttatgtgagatcctggagaaaagct960 gggccccttccatcccagataccacccacagctccaggtggagagcagtgcccactatat1020 gccaacgtgcatcaccagaaagggaaagatgaaggtgttgtctactctgtggtgcataga1080 acctcaaagaggagtgaagccaggtctgctgagttcaccgtggggagaaaggacagttct1140 atcatctgtgcggaggtgagatgcctgcagcccagtgaggtttcatccacggaggtgaat1200 atgagaagcaggactctccaagaaccccttagcgactgtgaggaggttctctgctag 1257 <210>
<211>
<212>
PRT
<213> ficial ence Arti Sequ <220>
<223> ription ArtificialSequence:/note =
Desc of synt hetic construct <400>
21 Pro Ser Pro Thr Ala Glu His Glu Gly Leu Phe Ser Leu Ile Ala Ser Pro Va1 Pro Phe Thr Leu Leu Thr Cys Gln Lys Met Ser Ser Asp Ala Gln Phe Asp Thr Gln Phe Arg Cys Phe Ala Phe Arg Leu Gly Pro Gly Trp Ser Ser Ser Pro Lys Leu Gln Ile Ala Ala Met Trp Lys Glu Asp Thr Gly Ser Tyr Trp Cys Glu Ala Gln Thr Met Ala Ser Lys Val Leu Arg Ser Arg Arg Ser Gln Ile Asn Val His Arg Val Pro Val Ala Asp Val Ser Leu Glu Thr Gln Pro Pro Gly Gly G1n Val Met Glu Gly Asp Arg Leu Val Leu Ile Cys Ser Val Ala Met Gly Thr Gly Asp Ile Thr Phe Leu Trp Tyr Lys Gly Ala Val Gly Leu Asn Leu Gln Ser Lys Thr Gln Arg Ser Leu Thr Ala Glu Tyr Glu Ile Pro Ser Val Arg Glu Ser Asp A1a Glu Gln Tyr Tyr Cys Val Ala Glu Asn Gly Tyr Gly Pro Ser Pro Ser Gly Leu Val Ser Ile Thr Val Arg Ile Pro Val Ser Arg Pro Ile Leu Met Leu Arg Ala Pro Arg Ala Gln Ala Ala Val Glu Asp Val Leu Glu Leu His Cys Glu Ala Leu Arg Gly Ser Pro Pro Ile Leu Tyr Trp Phe Tyr His Glu Asp Ile Thr Leu Gly Ser Arg Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Glu Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asn Asn Gly Leu Gly Ala Gln Arg Ser Glu Ala Val Thr Leu Asn Phe Thr Val Pro Thr Gly Ala Arg Ser Asn His Leu Thr Ser Gly <210> 22 <211> 380 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial 5equence:/note =
synthetic construct <400> 22 Leu Thr Leu Val Ala Pro Ser Ser Val Phe Glu Gly Asp Ser Ile Val Leu Lys Cys Gln Gly Glu Gln Asn Trp Lys Ile Gln Lys Met Ala Tyr His Lys Asp Asn Lys Glu Leu Ser Val Phe Lys Lys Phe Ser Asp Phe Leu Ile Gln Ser Ala Val Leu Ser Asp Ser G1y Asn Tyr Phe Cys Ser Thr Lys Gly Gln Leu Phe Leu Trp Asp Lys Thr Ser Asn Ile Val Lys Tle Lys Val Gln Glu Leu Phe Gln Arg Pro Val Leu Thr Ala Ser Ser Phe Gln Pro Ile Glu Gly Gly Pro Val Ser Leu Lys Cys Glu Thr Arg Leu Ser Pro Gln Arg Leu Asp Val Gln Leu Gln Phe Cys Phe Phe Arg Glu Asn Gln Val Leu Gly Ser Gly Trp Ser Ser Ser Pro Glu Leu Gln Ile Ser Ala Val Trp Ser Glu Asp Thr Gly Ser Tyr Trp Cys Lys Ala Glu Thr Val Thr His Arg Ile Arg Lys Gln Ser Leu Gln Ser Gln Ile His Val Gln Arg Ile Pro Ile Ser Asn Val Ser Leu Glu Ile Arg Ala Pro Gly Gly Gln Val Thr Glu Gly Gln Lys Leu Ile Leu Leu Cys Ser Val Ala Gly Gly Thr Gly Asn Val Thr Phe Ser Trp Tyr Arg Glu Ala Thr G1y Thr Ser Met Gly Lys Lys Thr Gln Arg Ser Leu Ser Ala Glu Leu Glu Ile Pro Ala Val Lys Glu Ser Asp Ala Gly Lys Tyr Tyr Cys Arg Ala Asp Asn Gly His Val Pro Ile Gln Ser Lys Val Val Asn Ile Pro Val Arg Ile Pro Val Ser Arg Pro Val Leu Thr Leu Arg Ser Pro Gly A1a Gln Ala Ala Val Gly Asp Leu Leu G1u Leu His Cys Glu Ala Leu Arg Gly Ser Pro Pro Ile Leu Tyr Gln Phe Tyr His Glu Asp Val Thr Leu Gly Asn 5er Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Ala Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asn Asn Gly Leu Gly Ala Gln Cys Ser Glu Ala Val Pro Val Ser Ile Ser Gly Pro Asp Gly Tyr Arg Arg Asp Leu Met Thr Ala <210> 23 <211> 140 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 23 His Tyr Ala Arg Ala Arg Arg Lys Pro Gly Gly Leu Ser A1a Thr Gly Thr Ser Ser His Ser Pro Ser Glu Cys Gln Glu Pro 5er Ser Ser Arg Pro Ser Arg Ile Asp Pro Gln Glu Pro Thr His Ser Lys Pro Leu A1a Pro Met Glu Leu Glu Pro Met Tyr Ser Asn Ala Asn Pro Gly Asp Ser Asn Pro Ile Tyr Ser Gln Tle Trp Ser Ile Gln His Thr Lys Glu Asn Ser Ala Asn Cys Pro Met Met His Gln Glu His Glu Glu Leu Thr Val Leu Tyr 5er Glu Leu Lys Lys Thr His Pro Asp Asp Ser Ala Gly Glu Ala Ser Ser Arg Gly Arg Ala His Glu G1u Asp Asp Glu Glu Asn Tyr Glu Asn Val Pro Arg Val Leu Leu Ala Ser Asp His <210> 24 <211> 554 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:lnote =
synthetic construct <400> 24 Gly Val Ala Pro Lys Ala Val Leu Leu Leu Asn Pro Pro Trp Ser Thr Ala Phe Lys Gly Glu Lys Val Ala Leu Ile Cys Ser Ser Ile Ser His Ser Leu Ala Gln Gly Asp Thr Tyr Trp Tyr His Asp Glu Lys Leu Leu Lys Ile Lys His Asp Lys Ile Gln Ile Thr Glu Pro Gly Asn Tyr G1n Cys Lys Thr Arg Gly Ser Ser Leu Ser Asp Ala Val His Val Glu Phe Ser Pro Asp Trp Leu Ile Leu Gln Ala Leu His Pro Val Phe Glu Gly Asp Asn Val Tle Leu Arg Cys Gln Gly Lys Asp Asn Lys Asn Thr His Gln Lys Val Tyr Tyr Lys Asp Gly Lys Gln Leu Pro Asn Ser Tyr Asn Leu Glu Lys Ile Thr Val Asn Ser Val Ser Arg Asp Asn Ser Lys Tyr His Cys Thr Ala Tyr Arg Lys Phe Tyr Ile Leu Asp Tle Glu Val Thr 5er Lys Pro Leu Asn Ile Gln Val Gln Glu Leu Phe Leu His Pro Val Leu Arg Ala Ser Ser Ser Thr Pro Ile Glu Gly Ser Pro Met Thr Leu Thr Cys Glu Thr Gln Leu Ser Pro Gln Arg Pro Asp Val Gln Leu Gln Phe Ser Leu Phe Arg Asp Ser G1n Thr Leu Gly Leu Gly Trp Ser Arg 5er Pro Arg Leu Gln Ile Pro Ala Met Trp Thr Glu Asp Ser G1y Ser Tyr Trp Cys Glu Val Glu Thr Val Thr His Ser Tle Lys Lys Arg Ser Leu Arg Ser Gln Ile Arg Va1 Gln Arg Val Pro Val Ser Asn Val Asn Leu G1u Ile Arg Pro Thr Gly Gly Gln Leu Ile Glu Gly Glu Asn Met Val Leu Ile Cys Ser Val Ala G1n Gly Ser Gly Thr Val Thr Phe Ser Trp His Lys Glu Gly Arg Val Arg Ser Leu Gly Arg Lys Thr Gln Arg _ Ser Leu Leu Ala Glu Leu His Val Leu Thr Val Lys Glu Ser Asp Ala Gly Arg Tyr Tyr Cys Ala A1a Asp Asn Val His Ser Pro Ile Leu Ser Thr Trp Tle Arg Val Thr Val Arg Ile Pro Val Ser His Pro Val Leu Thr Phe Arg Ala Pro Arg Ala His Thr Val Val Gly Asp Leu Leu Glu Leu His Cys Glu Ser Leu Arg Gly Ser Pro Pro Ile Leu Tyr Arg Phe Tyr His Glu Asp Val Thr Leu Gly Asn Ser Ser Ala Pro Ser Gly Gly Gly A1a Ser Phe Asn Leu Ser Leu Thr Ala Glu His Ser Gly Asn Tyr Ser Cys Asp Ala Asp Asn Gly Leu Gly Ala Gln His Ser His Gly Val Ser Leu Arg Val Thr Val Pro Val Ser Arg Pro Val Leu Thr Leu Arg Ala Pro Gly Ala Gln Ala Val Val Gly Asp Leu Leu Glu Leu His Cys Glu Ser Leu Arg Gly 5er Phe Pro Ile Leu Tyr Trp Phe Tyr His Glu Asp Asp Thr Leu Gly Asn Ile Ser Ala His Ser Gly Gly Gly A1a Ser Phe Asn Leu Ser Leu Thr Thr Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asp Asn Gly Leu Gly Ala Gln His Ser Lys Val Val Thr Leu Asn Val Thr Gly Thr Ser Arg Asn Arg Thr Gly <210> 25 <211> 717 , <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 25 Gly Va1 Ala Pro Lys Ala Val Leu Leu Leu Asn Pro Pro Trp Ser Thr Ala Phe Lys Gly Glu Lys Va1 Ala Leu Ile Cys Ser Ser Ile Ser His Ser Leu Ala Gln Gly Asp Thr Tyr Trp Tyr His Asp Glu Lys Leu Leu Lys Ile Lys His Asp Lys Ile Gln Ile Thr Glu Pro Gly Asn Tyr Gln Cys Lys Thr Arg Gly Ser Ser Leu Ser Asp Ala Val His Val Glu Phe Ser Pro Asp Trp Leu Ile Leu Gln Ala Leu His Pro Val Phe Glu Gly Asp Asn Val Ile Leu Arg Cys Gln Gly Lys Asp Asn Lys Asn Thr His Gln Lys Val Tyr Tyr Lys Asp G1y Lys Gln Leu Pro Asn Ser Tyr Asn Leu Glu Lys Ile Thr Val Asn Ser Val Ser Arg Asp Asn Ser Lys Tyr His Cys Thr Ala Tyr Arg Lys Phe Tyr Ile Leu Asp Ile Glu Val Thr Ser Lys Pro Leu Asn Ile Gln Val Gln Glu Leu Phe Leu His Pro Val Leu Arg Ala Ser Ser Ser Thr Pro Ile Glu Gly Ser Pro Met Thr Leu Thr Cys Glu Thr Gln Leu Ser Pro Gln Arg Pro Asp Val Gln Leu Gln Phe Ser Leu Phe Arg Asp Ser Gln Thr Leu Gly Leu Gly Trp Ser Arg Ser Pro Arg Leu Gln Ile Pro Ala Met Trp Thr Glu Asp 5er Gly 5er Tyr Trp Cys Glu Val Glu Thr Val Thr His Ser Ile Lys Lys Arg Ser Leu Arg Ser Gln Ile Arg Val Gln Arg Val Pro Val Ser Asn Val Asn Leu Glu Ile Arg Pro Thr Gly Gly Gln Leu Ile Glu Gly Glu Asn Met Val Leu Tle Cys Ser Val Ala Gln Gly Ser Gly Thr Val Thr Phe Ser Trp His Lys Glu Gly Arg Val Arg Ser Leu Gly Arg Lys Thr Gln Arg Ser Leu Leu Ala Glu Leu His Val Leu Thr Val Lys Glu Ser Asp Ala Gly Arg Tyr Tyr Cys Ala Ala Asp Asn Val His Ser Pro Ile Leu Ser Thr Trp Ile Arg Val Thr Val Arg Ile Pro Val Ser His Pro Val Leu Thr Phe Arg Ala Pro Arg Ala His Thr Val Val Gly Asp Leu Leu Glu Leu His Cys Glu Ser Leu Arg Gly Ser Pro Pro Ile Leu Tyr Arg Phe Tyr His Glu Asp Val Thr Leu Gly Asn Ser Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Ala Glu His Ser Gly Asn Tyr Ser Cys Asp Ala Asp Asn Gly Leu Gly Ala Gln His Ser His Gly Val Ser Leu Arg Val Thr Val Pro Val Ser Arg Pro Val Leu Thr Leu Arg Ala Pro Gly Ala Gln Ala Val Val Gly Asp Leu Leu G1u Leu His Cys Glu 5er Leu Arg Gly Ser Phe Pro I1e Leu Tyr Trp Phe Tyr His G1u Asp Asp Thr Leu Gly Asn Ile Ser Ala His Ser Gly Gly Gly Ala Ser Phe Asn Leu Ser Leu Thr Thr Glu His Ser Gly Asn Tyr Ser Cys Glu Ala Asp Asn Gly Leu Gly Ala Gln His Ser Lys Val Val Thr Leu Asn Va1 Thr Gly Thr Ser Arg Asn Arg Thr Gly Leu Thr Ala Ala Gly Ile Thr Gly Leu Va1 Leu Ser Ile Leu Val Leu A1a Ala Ala Ala Ala Leu Leu His Tyr Ala Arg Ala Arg Arg Lys Pro Gly Gly Leu Ser Ala Thr Gly Thr Ser Ser His Ser Pro Ser Glu Cys Gln Glu Pro Ser Ser Ser Arg Pro 5er Arg Ile Asp Pro Gln Glu Pro Thr His Ser Lys Pro Leu Ala Pro Met Glu Leu G1u Pro Met Tyr Ser Asn Ala Asn Pro Gly Asp Ser Asn Pro Ile Tyr Ser Gln Ile Trp Ser Ile Gln His Thr Lys Glu Asn Ser Ala Asn Cys Pro Met Met His G1n Glu His Glu G1u Leu Thr Val Leu Tyr Ser Glu Leu Lys Lys Thr His Pro Asp Asp Ser Ala Gly Glu Ala Ser Ser Arg Gly Arg Ala His Glu Glu Asp Asp Glu Glu Asn Tyr Glu Asn Val Pro Arg Val Leu Leu Ala Ser Asp His <210> 26 <211> 104 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 26 Arg Ser Trp Arg Lys Ala Gly Pro Leu Pro Ser Gln Ile Pro Pro Thr Ala Pro Gly Gly Glu Gln Cys Pro Leu Tyr Ala Asn Val His His Gln Lys Gly Lys Asp Glu Gly Val Val Tyr Ser Val Val His Arg Thr Ser Lys Arg Ser Glu Ala Arg Ser Ala Glu Phe Thr Val Gly Arg Lys Asp Ser Ser Ile Ile Cys Ala Glu Val Arg Cys Leu Gln Pro Ser Glu Val Ser Ser Thr G1u Val Asn Met Arg Ser Arg Thr Leu Gln Glu Pro Leu Ser Asp Cys Glu Glu Val Leu Cys <210> 27 <211> 291 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 27 Lys Thr Val Trp Leu Tyr Leu Gln Ala Trp Pro Asn Pro Val Phe Glu 1 5 10 l5 Gly Asp Ala Leu Thr Leu Arg Cys Gln Gly Trp Lys Asn Thr Pro Leu Ser Gln Val Lys Phe Tyr Arg Asp Gly Lys Phe Leu His Phe Ser Lys Glu Asn G1n Thr Leu Ser Met G1y Ala Ala Thr Val G1n Ser Arg Gly G1n Tyr Ser Cys Ser Gly Gln Val Met Tyr Ile Pro Gln Thr Phe Thr Gln Thr Ser Glu Thr Ala Met Val Gln Val Gln Glu Leu Phe Pro Pro Pro Val Leu 5er Ala Ile Pro Ser Pro Glu Pro Arg Glu Gly Ser Leu Val Thr Leu Arg Cys Gln Thr Lys Leu His Pro Leu Arg Ser Ala Leu Arg Leu Leu Phe Ser Phe His Lys Asp G1y His Thr Leu Gln Asp Arg Gly Pro His Pro Glu Leu Cys Ile Pro Gly A1a Lys Glu Gly Asp Ser Gly Leu Tyr Trp Cys Glu Va1 Ala Pro Glu Gly Gly Gln Val G1n Lys Gln Ser Pro Gln Leu Glu Val Arg Val Gln Ala Pro Val Ser Arg Pro Val Leu Thr Leu His His Gly Pro Ala Asp Pro Ala Val G1y Asp Met Va1 Gln Leu Leu Cys Glu Ala Gln Arg Gly Ser Pro Pro Ile Leu Tyr Ser Phe Tyr Leu Asp Glu Lys Ile Val Gly Asn His Ser Ala Pro Cys Gly Gly Thr Thr Ser Leu Leu Phe Pro Val Lys Ser Glu Gln Asp Ala Gly Asn Tyr Ser Cys Glu Ala Glu Asn Ser Val Ser Arg Glu Arg Ser Glu Pro Lys Lys Leu Ser Leu Lys Gly Ser Gln Val Leu Phe Thr Pro Ala Ser Asn <210> 28 <211> 919 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 28 Lys Thr Val Trp Leu Tyr Leu Gln Ala Trp Pro Asn Pro Val Phe Glu Gly Asp Ala Leu Thr Leu Arg Cys Gln Gly Trp Lys Asn Thr Pro Leu Ser Gln Val Lys Phe Tyr Arg Asp Gly Lys Phe Leu His Phe Ser Lys Glu Asn Gln Thr Leu Ser Met Gly Ala Ala Thr Val Gln Ser Arg Gly Gln Tyr Ser Cys Ser G1y Gln Val Met Tyr Ile Pro Gln Thr Phe Thr Gln Thr Ser Glu Thr Ala Met Val Gln Val Gln Glu Leu Phe Pro Pro Pro Val Leu Ser Ala Tle Pro Ser Pro Glu Pro Arg Glu Gly Ser Leu Val Thr Leu Arg Cys Gln Thr Lys Leu His Pro Leu Arg Ser Ala Leu Arg Leu Leu Phe Ser Phe His Lys Asp Gly His Thr Leu Gln Asp Arg Gly Pro His Pro Glu Leu Cys Ile Pro Gly Ala Lys Glu Gly Asp Ser Gly Leu Tyr Trp Cys Glu Val Ala Pro Glu Gly Gly Gln Val Gln Lys Gln Ser Pro Gln Leu Glu Va1 Arg Val Gln Ala Pro Val Ser Arg Pro Val Leu Thr Leu His His Gly Pro Ala Asp Pro Ala Val Gly Asp Met Val Gln Leu Leu Cys Glu Ala G1n Arg Gly Ser Pro Pro Ile Leu Tyr Ser Phe Tyr Leu Asp Glu Lys Tle Val Gly Asn His Ser Ala Pro Cys G1y Gly Thr Thr Ser Leu Leu Phe Pro Val Lys Ser Glu Gln Asp Ala Gly Asn Tyr Ser Cys Glu Ala Glu Asn Ser Val Ser Arg Glu Arg Ser Glu Pro Lys Lys Leu Ser Leu Lys Gly Ser Gln Val Leu Phe Thr Pro Ala Ser Asn Trp Leu Val Pro Trp Leu Pro Ala Ser Leu Leu Gly Leu Met Val Tle Ala Ala Ala Leu Leu Val Tyr Val Arg Ser Trp Arg Lys Ala Gly Pro Leu Pro Ser Gln Ile Pro Pro Thr Ala Pro Gly Gly Glu Gln Cys Pro Leu Tyr Ala Asn Val His His Gln Lys Gly Lys Asp Glu Gly Val Val Tyr Ser Val Val His Arg Thr Ser Lys Arg Ser Glu Ala Arg Ser Ala Glu Phe Thr Val Gly Arg Lys Asp Ser Ser Ile Tle Cys Ala Glu Val Arg Cys Leu Gln Pro Ser Glu Val 5er Ser Thr Glu Val Asn Met Arg Ser Arg Thr Leu Gln Glu Pro Leu Ser Asp Cys Glu Glu Val Leu Cys <210> 29 <211> 16 <212> PRT
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence:/note =
synthetic construct <400> 29 Met Leu Pro Arg Leu Leu Ile Cys Ala Cys Glu Leu Leu Pro Leu Pro <210> 30 <211> 19 <212> PRT
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence:/note =
synthetic construct <400> 30 Ser Leu Leu Val Ile Phe Asp Thr Glu Leu Tr Ala Val Gln t L
p 10 15 eu Me Ala Asp Ser <210> 31 <211> 17 <212> PRT
<2l3> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 31 Met Leu Leu Trp Leu Leu Leu Leu Ile Leu Thr Pro Gly Arg Glu Gln Ser <210> 32 <211> 15 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial 5equence:/note =
synthetic construct <400> 32 Met Leu Zeu Trp Thr Ala Val Zeu Zeu Phe Val Pro Cys Val Gly <210> 33 <211> 51 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 33 atgctgccga ggctgttgct gttgatctgt gctccactct gtgaacctgc c 51 <210> 34 <211> 1236 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 34 gagctgtttt tgatagccag cccctcccat cccacagagg ggagcccagt gaccctgacg 60 tgtaagatgc cctttctaca gagttcagat gcccagttcc agttctgctt tttcagagac 120 acccgggcct tgggcccagg ctggagcagc tcccccaagc tccagatcgc tgccatgtgg 180 aaagaagaca cagggtcata ctggtgcgag gcacagacaa tggcgtccaa agtcttgagg 240 agcaggagat cccagataaa tgtgcacagg gtccctgtcg ctgatgtgag cttggagact 300 cagcccccag gaggacaggt gatggaggga gacaggctgg tcctcatctg ctcagttgct 360 atgggcacag gagacatcac cttcctttgg tacaaagggg ctgtaggttt aaaccttcag 420 tcaaagaccc agcgttcact gacagcagag tatgagattc cttcagtgag ggagagtgat 480 gctgagcaat attactgtgt agctgaaaat ggctatggtc ccagccccag tgggctggtg 540 agcatcactg tcagaatccc ggtgtctcgc ccaatcctca tgctcagggc tcccagggcc 600 caggctgcag tggaggatgt gctggagctt cactgtgagg ccctgagagg ctctcctcca 660 atcctgtact ggttttatca cgaggatatc accctgggga gcaggtcggc cccctctgga 720 ggaggagcct ccttcaacct ttccctgact gaagaacatt ctggaaacta ctcctgtgag 780 gccaacaatg gcctgggggc ccagcgcagt gaggcggtga cactcaactt cacagtgcct 840 actggggcca gaagcaatca tcttacctca ggagtcattg aggggctgct cagcaccctt 900 ggtccagcca ccgtggcctt attattttgc tacggcctca aaagaaaaat aggaagacgt 960 tcagccaggg atccactcag gagccttccc agccctctac cccaagagtt cacctacctc 1020 aactcaccta ccccagggca gctacagcct atatatgaaa atgtgaatgt tgtaagtggg 1080 gatgaggttt attcactggc gtactataac cagccggagc aggaatcagt agcagcagaa 1140 accctgggga cacatatgga ggacaaggtt tccttagaca tctattccag gctgaggaaa 1200 gcaaacatta cagatgtgga ctatgaagat gctatg 1236 <210> 35 <211> 60 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
<223> Description of Artificial 5equence:/note =
synthetic construct <400> 32 Met Leu Zeu Trp Thr Ala Val Zeu Zeu Phe Val Pro Cys Val Gly <210> 33 <211> 51 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 33 atgctgccga ggctgttgct gttgatctgt gctccactct gtgaacctgc c 51 <210> 34 <211> 1236 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 34 gagctgtttt tgatagccag cccctcccat cccacagagg ggagcccagt gaccctgacg 60 tgtaagatgc cctttctaca gagttcagat gcccagttcc agttctgctt tttcagagac 120 acccgggcct tgggcccagg ctggagcagc tcccccaagc tccagatcgc tgccatgtgg 180 aaagaagaca cagggtcata ctggtgcgag gcacagacaa tggcgtccaa agtcttgagg 240 agcaggagat cccagataaa tgtgcacagg gtccctgtcg ctgatgtgag cttggagact 300 cagcccccag gaggacaggt gatggaggga gacaggctgg tcctcatctg ctcagttgct 360 atgggcacag gagacatcac cttcctttgg tacaaagggg ctgtaggttt aaaccttcag 420 tcaaagaccc agcgttcact gacagcagag tatgagattc cttcagtgag ggagagtgat 480 gctgagcaat attactgtgt agctgaaaat ggctatggtc ccagccccag tgggctggtg 540 agcatcactg tcagaatccc ggtgtctcgc ccaatcctca tgctcagggc tcccagggcc 600 caggctgcag tggaggatgt gctggagctt cactgtgagg ccctgagagg ctctcctcca 660 atcctgtact ggttttatca cgaggatatc accctgggga gcaggtcggc cccctctgga 720 ggaggagcct ccttcaacct ttccctgact gaagaacatt ctggaaacta ctcctgtgag 780 gccaacaatg gcctgggggc ccagcgcagt gaggcggtga cactcaactt cacagtgcct 840 actggggcca gaagcaatca tcttacctca ggagtcattg aggggctgct cagcaccctt 900 ggtccagcca ccgtggcctt attattttgc tacggcctca aaagaaaaat aggaagacgt 960 tcagccaggg atccactcag gagccttccc agccctctac cccaagagtt cacctacctc 1020 aactcaccta ccccagggca gctacagcct atatatgaaa atgtgaatgt tgtaagtggg 1080 gatgaggttt attcactggc gtactataac cagccggagc aggaatcagt agcagcagaa 1140 accctgggga cacatatgga ggacaaggtt tccttagaca tctattccag gctgaggaaa 1200 gcaaacatta cagatgtgga ctatgaagat gctatg 1236 <210> 35 <211> 60 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
35 ggtcatctttgatgcagtcactgaacaggcagattcgctg60 atgctgctgt ggtcattgct <210>
<211>
<212>
DNA
<213> icial nce Artif Seque <220>
<223> iption te =
Descr of Artificial Sequence:/no synth etic construct <400>
36 cgccctcttctgtcttcgaaggagacagcatcgttctgaaatgccaggga60 acccttgtgg gaacagaactggaaaattcagaagatggcttaccataaggataacaaagagttatctgtt120 ttcaaaaaattctcagatttccttatccaaagtgcagttttaagtgacagtggtaactat180 ttctgtagtaccaaaggacaactctttctctgggataaaacttcaaatatagtaaagata240 aaagtccaagagctctttcaacgtcctgtgctgactgccagctccttccagcccatcgaa300 gggggtccagtgagcctgaaatgtgagacccggctctctccacagaggttggatgttcaa360 ctccagttctgcttcttcagagaaaaccaggtcctggggtcaggctggagcagctctccg420 gagctccagatttctgccgtgtggagtgaagacacagggtcttactggtgcaaggcagaa480 acggtgactcacaggatcagaaaacagagcctccaatcccagattcacgtgcagagaatc540 cccatctctaatgtaagcttggagatccgggcccccgggggacaggtgactgaaggacaa600 aaactgatcctgctctgctcagtggctgggggtacaggaaatgtcacattctcctggtac660 agagaggccacaggaaccagtatgggaaagaaaacccagcgttccctgtcagcagagctg720 gagatcccagctgtgaaagagagtgatgccggcaaatattactgtagagctgacaacggc780 catgtgcctatccagagcaaggtggtgaatatccctgtgagaattccagtgtctcgccct840 gtcctcaccctcaggtctcctggggcccaggctgcagtgggggacctgctggagcttcac900 tgtgaggccctgagaggctctcccccaatcttgtaccaattttatcatgaggatgtcacc960 cttgggaacagctcggccccctctggaggaggggcctccttcaacctctctttgactgca1020 gaacattctggaaactactcctgtgaggccaacaacggcctgggggcccagtgcagtgag1080 gcagtgccagtctccatctcaggacctgatggctatagaagagacctcatgacagctgga1140 gttctctggggactgtttggtgtccttggtttcactggtgttgctttgctgttgtatgcc1200 ttgttccacaagatatcaggagaaagttctgccactaatgaacccagaggggcttccagg1260 ccaaatcctcaagagttcacctattcaagcccaaccccagacatggaggagctgcagcca1320 gtgtatgtcaatgtgggctctgtagatgtggatgtggtttattctcaggtctggagcatg1380 cagcagccagaaagctcagcaaacatcaggacacttctggagaacaaggactcccaagtc1440 atctactcttctgtgaagaaatca 1464 <210>
<211>
<212>
DNA
<213> ficial Arti Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 37 atgcttctgt ggctgctgct gctgatcctg actcctggaa gagaacaatc aggg 54 <210> 38 <211> 2148 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
38 tctcctcaatcctccatggtccacagccttcaaaggagaa60 gtggccccaa aagctgtact aaagtggctctcatatgcagcagcatatcacattccctagcccagggagacacatattgg120 tatcacgatgagaagttgttgaaaataaaacatgacaagatccaaattacagagcctgga180 aattaccaatgtaagacccgaggatcctccctcagtgatgccgtgcatgtggaattttca240 cccgactggctgatcctgcaggctttacatcctgtctttgaaggagacaatgtcattctg300 agatgtcaggggaaagacaacaaaaacactcatcaaaaggtttactacaaggatggaaaa360 cagcttcctaatagttataatttagagaagatcacagtgaattcagtctccagggataat420 agcaaatatcattgtactgcttataggaagttttacatacttgacattgaagtaacttca480 aaacccctaaatatccaagttcaagagctgtttctacatcctgtgctgagagccagctct540 tccacgcccatagaggggagtcccatgaccctgacctgtgagacccagctctctccacag600 aggccagatgtccagctgcaattctccctcttcagagatagccagaccctcggattgggc660 tggagcaggtcccccagactccagatccctgccatgtggactgaagactcagggtcttac720 tggtgtgaggtggagacagtgactcacagcatcaaaaaaaggagcctgagatctcagata780 cgtgtacagagagtccctgtgtctaatgtgaatctagagatccggcccaccggagggcag840 ctgattgaaggagaaaatatggtccttatttgctcagtagcccagggttcagggactgtc900 acattctcctggcacaaagaaggaagagtaagaagcctgggtagaaagacccagcgttcc960 ctgttggcagagctgcatgttctcaccgtgaaggagagtgatgcagggagatactactgt1020 gcagctgataacgttcacagccccatcctcagcacgtggattcgagtcaccgtgagaatt1080 ccggtatctcaccctgtcctcaccttcagggctcccagggcccacactgtggtgggggac1140 ctgctggagcttcactgtgagtccctgagaggctctcccccgatcctgtaccgattttat1200 catgaggacgtcaccctggggaacagctcagccccctctggaggaggagcctccttcaac1260 ctctctctgactgcagaacattctggaaactactcctgtgatgcagacaatggcctgggg1320 gcccagcacagtcatggagtgagtctcagggtcacagttccggtgtctcgccccgtcctc1380 accctcagggctcccggggcccaggctgtggtgggggacctgctggagcttcactgtgag1440 tccctgagaggctccttcccgatcctgtactggttttatcacgaggatgacaccttgggg1500 aacatctcggcccactctggaggaggggcatccttcaacctctctctgactacagaacat1560 tctggaaactactcatgtgaggctgacaatggcctgggggcccagcacagtaaagtggtg1620 acactcaatgttacaggaacttccaggaacagaacaggccttaccgctgcgggaatcacg1680 gggctggtgctcagcatcctcgtccttgctgctgctgctgctctgctgcattacgccagg1740 gcccgaaggaaaccaggaggactttctgccactggaacatctagtcacagtcctagcgag1800 tgtcaggagccttcctcgtccaggccttccaggatagaccctcaagagcccactcactct1860 aaaccactagccccaatggagctggagccaatgtacagcaatgcaaatcctggagatagc1920 aacccgatttattcccagatctggagcatccagcatacaaaagaaaactcagctaattgt1980 ccaatgatgcatcaagagcatgaggaacttacagtcctctattcagaactgaagaagaca2040 cacccagacgactctgcaggggaggctagcagcagaggcagggcccatgaagaagatgat2100 gaagaaaactatgagaatgtaccacgtgtattactggcctcagaccac 2148 <210> 39 <211> 48 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 39 atgctgctct ggacggctgt gctgctcttt gttccctgtg ttgggaaa 48 <210> 40 <211> 2003 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 40 aggtctggttgctctctgccggcttcgccctgacctgtttctgacctgtgttccctccgc60 tgtgccagaacaggccccatgctgctctggacggctgtgctgctctttgttccctgtgtt120 gggaaaactgtctggctgtacctccaagcctggccaaaccctgtgtttgaaggagatgcc180 ctgactctgcgatgtcagggatggaagaatacaccactgtctcaggtgaagttctacaga240 gatggaaaattccttcatttctctaaggaaaaccagactctgtccatgggagcagcaaca300 gtgcagagccgtggccagtacagctgctctgggcaggtgatgtatattccacagacattc360 acacaaacttcagagactgccatggttcaagtccaagagctgtttccacctcctgtgctg420 agtgccatcccctctcctgagccccgagagggtagcctggtgaccctgagatgtcagaca480 aagctgcaccccctgaggtcagccttgaggctccttttctccttccacaaggacggccac540 accttgcaggacaggggccctcacccagaactctgcatcccgggagccaaggagggagac600 tctgggctttactggtgtgaggtggcccctgagggtggccaggtccagaagcagagcccc660 cagctggaggtcagagtgcaggctcctgtatcccgtcctgtgctcactctgcaccacggg720 cctgctgaccctgctgtgggggacatggtgcagctcctctgtgaggcacagaggggctcc780 cctccgatcctgtattccttctaccttgatgagaagattgtggggaaccactcagctccc840 tgtggtggaaccacctccctcctcttcccagtgaagtcagaacaggatgctgggaactac900 tcctgcgaggctgagaacagtgtctccagagagaggagtgagcccaagaagctgtctctg960 aagggttctcaagtcttgttcactcccgccagcaactggctggttccttggcttcctgcg1020 agcctgcttggcctgatggttattgctgctgcacttctggtttatgtgagatcctggaga1080 aaagctgggccccttccatcccagataccacccacagctccaggtggagagcagtgccca1140 ctatatgccaacgtgcatcaccagaaagggaaagatgaaggtgttgtctactctgtggtg1200 catagaacctcaaagaggagtgaagccaggtctgctgagttcaccgtggggagaaaggac1260 agttctatcatctgtgcggaggtgagatgcctgcagcccagtgaggtttcatccacggag1320 gtgaatatgagaagcaggactctccaagaaccccttagcgactgtgaggaggttctctgc1380 tagtgatggtgttctcctatcaacacacgcccacccccagtctccagtgctcctcaggaa1440 gacagtggggtcctcaactctttctgtgggtccttcagttcccaagcccagcatcacaga1500 gccccctgagcccttgtcctggtcaggagcacctgaaccctgggttcttttcttagcaga1560 agaccaaccaatggaatgggaagggagatgctcccaccaacacacacacttaggttcaat1620 cagtgacactggacacataagccacagatgtcttctttccatacaagcatgttagttcgc1680 cccaatatacatatatatatgaaatagtcatgtgccgcataacaacatttcagtcagtga1740 tagactgcatacacaacagtggtcccataagactgtaatggagtttaaaaattcctactg1800 cctagtgatatcatagttgccttaacatcataacacaacacatttctcacgcgtttgtgg1860 tgatgctggtacaaacaagctacagcgccgctagtcatatacaaatatagcacatacaat1920 tatgtacagtacactatacttgataatgataataaacaactatgttactggtctaaaaaa1980 aaaaaaaaaaaaaaaaaaaaaaa <210> 41 <211> 23 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 41 tgagtctcag ggtcacagtt ccg 23 <210> 42 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 42 gctcttgaac ttggatattt aggggt 26 <210> 43 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence:/note =
synthetic construct <400> 43 ccagtgtatg tcaatgtggg ctctg <210> 44 <211> 27 r <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence:/note =
synthetic construct <400> 44 cgttgaaaga gctcttggac ttttatc <210> 45 <211> 27 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence:/note =
synthetic construct <400> 45 27 gcctcaaaag aaaaatagga agacgtt <210> 46 <211> 23 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 46 aagctcacat cagcgacagg gac <210> 47 <211> 22 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 47 tcttggagat aagtcgggct tt <210> 48 <211> 25 <212> DNA
35 ggtcatctttgatgcagtcactgaacaggcagattcgctg60 atgctgctgt ggtcattgct <210>
<211>
<212>
DNA
<213> icial nce Artif Seque <220>
<223> iption te =
Descr of Artificial Sequence:/no synth etic construct <400>
36 cgccctcttctgtcttcgaaggagacagcatcgttctgaaatgccaggga60 acccttgtgg gaacagaactggaaaattcagaagatggcttaccataaggataacaaagagttatctgtt120 ttcaaaaaattctcagatttccttatccaaagtgcagttttaagtgacagtggtaactat180 ttctgtagtaccaaaggacaactctttctctgggataaaacttcaaatatagtaaagata240 aaagtccaagagctctttcaacgtcctgtgctgactgccagctccttccagcccatcgaa300 gggggtccagtgagcctgaaatgtgagacccggctctctccacagaggttggatgttcaa360 ctccagttctgcttcttcagagaaaaccaggtcctggggtcaggctggagcagctctccg420 gagctccagatttctgccgtgtggagtgaagacacagggtcttactggtgcaaggcagaa480 acggtgactcacaggatcagaaaacagagcctccaatcccagattcacgtgcagagaatc540 cccatctctaatgtaagcttggagatccgggcccccgggggacaggtgactgaaggacaa600 aaactgatcctgctctgctcagtggctgggggtacaggaaatgtcacattctcctggtac660 agagaggccacaggaaccagtatgggaaagaaaacccagcgttccctgtcagcagagctg720 gagatcccagctgtgaaagagagtgatgccggcaaatattactgtagagctgacaacggc780 catgtgcctatccagagcaaggtggtgaatatccctgtgagaattccagtgtctcgccct840 gtcctcaccctcaggtctcctggggcccaggctgcagtgggggacctgctggagcttcac900 tgtgaggccctgagaggctctcccccaatcttgtaccaattttatcatgaggatgtcacc960 cttgggaacagctcggccccctctggaggaggggcctccttcaacctctctttgactgca1020 gaacattctggaaactactcctgtgaggccaacaacggcctgggggcccagtgcagtgag1080 gcagtgccagtctccatctcaggacctgatggctatagaagagacctcatgacagctgga1140 gttctctggggactgtttggtgtccttggtttcactggtgttgctttgctgttgtatgcc1200 ttgttccacaagatatcaggagaaagttctgccactaatgaacccagaggggcttccagg1260 ccaaatcctcaagagttcacctattcaagcccaaccccagacatggaggagctgcagcca1320 gtgtatgtcaatgtgggctctgtagatgtggatgtggtttattctcaggtctggagcatg1380 cagcagccagaaagctcagcaaacatcaggacacttctggagaacaaggactcccaagtc1440 atctactcttctgtgaagaaatca 1464 <210>
<211>
<212>
DNA
<213> ficial Arti Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 37 atgcttctgt ggctgctgct gctgatcctg actcctggaa gagaacaatc aggg 54 <210> 38 <211> 2148 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
38 tctcctcaatcctccatggtccacagccttcaaaggagaa60 gtggccccaa aagctgtact aaagtggctctcatatgcagcagcatatcacattccctagcccagggagacacatattgg120 tatcacgatgagaagttgttgaaaataaaacatgacaagatccaaattacagagcctgga180 aattaccaatgtaagacccgaggatcctccctcagtgatgccgtgcatgtggaattttca240 cccgactggctgatcctgcaggctttacatcctgtctttgaaggagacaatgtcattctg300 agatgtcaggggaaagacaacaaaaacactcatcaaaaggtttactacaaggatggaaaa360 cagcttcctaatagttataatttagagaagatcacagtgaattcagtctccagggataat420 agcaaatatcattgtactgcttataggaagttttacatacttgacattgaagtaacttca480 aaacccctaaatatccaagttcaagagctgtttctacatcctgtgctgagagccagctct540 tccacgcccatagaggggagtcccatgaccctgacctgtgagacccagctctctccacag600 aggccagatgtccagctgcaattctccctcttcagagatagccagaccctcggattgggc660 tggagcaggtcccccagactccagatccctgccatgtggactgaagactcagggtcttac720 tggtgtgaggtggagacagtgactcacagcatcaaaaaaaggagcctgagatctcagata780 cgtgtacagagagtccctgtgtctaatgtgaatctagagatccggcccaccggagggcag840 ctgattgaaggagaaaatatggtccttatttgctcagtagcccagggttcagggactgtc900 acattctcctggcacaaagaaggaagagtaagaagcctgggtagaaagacccagcgttcc960 ctgttggcagagctgcatgttctcaccgtgaaggagagtgatgcagggagatactactgt1020 gcagctgataacgttcacagccccatcctcagcacgtggattcgagtcaccgtgagaatt1080 ccggtatctcaccctgtcctcaccttcagggctcccagggcccacactgtggtgggggac1140 ctgctggagcttcactgtgagtccctgagaggctctcccccgatcctgtaccgattttat1200 catgaggacgtcaccctggggaacagctcagccccctctggaggaggagcctccttcaac1260 ctctctctgactgcagaacattctggaaactactcctgtgatgcagacaatggcctgggg1320 gcccagcacagtcatggagtgagtctcagggtcacagttccggtgtctcgccccgtcctc1380 accctcagggctcccggggcccaggctgtggtgggggacctgctggagcttcactgtgag1440 tccctgagaggctccttcccgatcctgtactggttttatcacgaggatgacaccttgggg1500 aacatctcggcccactctggaggaggggcatccttcaacctctctctgactacagaacat1560 tctggaaactactcatgtgaggctgacaatggcctgggggcccagcacagtaaagtggtg1620 acactcaatgttacaggaacttccaggaacagaacaggccttaccgctgcgggaatcacg1680 gggctggtgctcagcatcctcgtccttgctgctgctgctgctctgctgcattacgccagg1740 gcccgaaggaaaccaggaggactttctgccactggaacatctagtcacagtcctagcgag1800 tgtcaggagccttcctcgtccaggccttccaggatagaccctcaagagcccactcactct1860 aaaccactagccccaatggagctggagccaatgtacagcaatgcaaatcctggagatagc1920 aacccgatttattcccagatctggagcatccagcatacaaaagaaaactcagctaattgt1980 ccaatgatgcatcaagagcatgaggaacttacagtcctctattcagaactgaagaagaca2040 cacccagacgactctgcaggggaggctagcagcagaggcagggcccatgaagaagatgat2100 gaagaaaactatgagaatgtaccacgtgtattactggcctcagaccac 2148 <210> 39 <211> 48 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 39 atgctgctct ggacggctgt gctgctcttt gttccctgtg ttgggaaa 48 <210> 40 <211> 2003 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 40 aggtctggttgctctctgccggcttcgccctgacctgtttctgacctgtgttccctccgc60 tgtgccagaacaggccccatgctgctctggacggctgtgctgctctttgttccctgtgtt120 gggaaaactgtctggctgtacctccaagcctggccaaaccctgtgtttgaaggagatgcc180 ctgactctgcgatgtcagggatggaagaatacaccactgtctcaggtgaagttctacaga240 gatggaaaattccttcatttctctaaggaaaaccagactctgtccatgggagcagcaaca300 gtgcagagccgtggccagtacagctgctctgggcaggtgatgtatattccacagacattc360 acacaaacttcagagactgccatggttcaagtccaagagctgtttccacctcctgtgctg420 agtgccatcccctctcctgagccccgagagggtagcctggtgaccctgagatgtcagaca480 aagctgcaccccctgaggtcagccttgaggctccttttctccttccacaaggacggccac540 accttgcaggacaggggccctcacccagaactctgcatcccgggagccaaggagggagac600 tctgggctttactggtgtgaggtggcccctgagggtggccaggtccagaagcagagcccc660 cagctggaggtcagagtgcaggctcctgtatcccgtcctgtgctcactctgcaccacggg720 cctgctgaccctgctgtgggggacatggtgcagctcctctgtgaggcacagaggggctcc780 cctccgatcctgtattccttctaccttgatgagaagattgtggggaaccactcagctccc840 tgtggtggaaccacctccctcctcttcccagtgaagtcagaacaggatgctgggaactac900 tcctgcgaggctgagaacagtgtctccagagagaggagtgagcccaagaagctgtctctg960 aagggttctcaagtcttgttcactcccgccagcaactggctggttccttggcttcctgcg1020 agcctgcttggcctgatggttattgctgctgcacttctggtttatgtgagatcctggaga1080 aaagctgggccccttccatcccagataccacccacagctccaggtggagagcagtgccca1140 ctatatgccaacgtgcatcaccagaaagggaaagatgaaggtgttgtctactctgtggtg1200 catagaacctcaaagaggagtgaagccaggtctgctgagttcaccgtggggagaaaggac1260 agttctatcatctgtgcggaggtgagatgcctgcagcccagtgaggtttcatccacggag1320 gtgaatatgagaagcaggactctccaagaaccccttagcgactgtgaggaggttctctgc1380 tagtgatggtgttctcctatcaacacacgcccacccccagtctccagtgctcctcaggaa1440 gacagtggggtcctcaactctttctgtgggtccttcagttcccaagcccagcatcacaga1500 gccccctgagcccttgtcctggtcaggagcacctgaaccctgggttcttttcttagcaga1560 agaccaaccaatggaatgggaagggagatgctcccaccaacacacacacttaggttcaat1620 cagtgacactggacacataagccacagatgtcttctttccatacaagcatgttagttcgc1680 cccaatatacatatatatatgaaatagtcatgtgccgcataacaacatttcagtcagtga1740 tagactgcatacacaacagtggtcccataagactgtaatggagtttaaaaattcctactg1800 cctagtgatatcatagttgccttaacatcataacacaacacatttctcacgcgtttgtgg1860 tgatgctggtacaaacaagctacagcgccgctagtcatatacaaatatagcacatacaat1920 tatgtacagtacactatacttgataatgataataaacaactatgttactggtctaaaaaa1980 aaaaaaaaaaaaaaaaaaaaaaa <210> 41 <211> 23 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 41 tgagtctcag ggtcacagtt ccg 23 <210> 42 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 42 gctcttgaac ttggatattt aggggt 26 <210> 43 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence:/note =
synthetic construct <400> 43 ccagtgtatg tcaatgtggg ctctg <210> 44 <211> 27 r <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence:/note =
synthetic construct <400> 44 cgttgaaaga gctcttggac ttttatc <210> 45 <211> 27 <212> DNA
<213> Artificial Sequence <220>
<223> Description of ArtificialSequence:/note =
synthetic construct <400> 45 27 gcctcaaaag aaaaatagga agacgtt <210> 46 <211> 23 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 46 aagctcacat cagcgacagg gac <210> 47 <211> 22 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 47 tcttggagat aagtcgggct tt <210> 48 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 48 atcctgcagc ccagcctcgt aggag <210> 49 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 49 ggtcctcatg ctgctgtggt catt <210> 50 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 50 ~ 24 gctgttgatc ttcccttctg attc <210> 51 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 51 atgctgccga ggctgttgct gttg <210> 52 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 52 catagcatct tcatagtcca catc <210> 53 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 53 ctcaacttca cagtgcctac tggg 24 <210> 54 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 54 tcctgcagag tcactaacct tgag 24 <210> 55 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct ' <400> 55 ccagtgtatg tcaatgtggg ctctg 25 <210> 56 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 56 cattcttccc tcaaatcttt acac 24 <210> 57 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 57 cagcacgtgg attcgagtca c 21 <210> 58 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 58 cagatctggg aataaatcgg gttg 24 <210> 59 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 59 tcttcagaga tggcgaggtc a 21 <210> 60 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 60 ttttggggtg tacatcaaca tacaag 26 <210> 61 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note synthetic construct <400> 61 tgttgccctg tttcttccaa taca 24 <210> 62 <2l1> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 62 cagagttggc cgacctacgc 20 <210> 63 <211> 32 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 48 atcctgcagc ccagcctcgt aggag <210> 49 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 49 ggtcctcatg ctgctgtggt catt <210> 50 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 50 ~ 24 gctgttgatc ttcccttctg attc <210> 51 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 51 atgctgccga ggctgttgct gttg <210> 52 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 52 catagcatct tcatagtcca catc <210> 53 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 53 ctcaacttca cagtgcctac tggg 24 <210> 54 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 54 tcctgcagag tcactaacct tgag 24 <210> 55 <211> 25 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct ' <400> 55 ccagtgtatg tcaatgtggg ctctg 25 <210> 56 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 56 cattcttccc tcaaatcttt acac 24 <210> 57 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 57 cagcacgtgg attcgagtca c 21 <210> 58 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 58 cagatctggg aataaatcgg gttg 24 <210> 59 <211> 21 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 59 tcttcagaga tggcgaggtc a 21 <210> 60 <211> 26 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 60 ttttggggtg tacatcaaca tacaag 26 <210> 61 <211> 24 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note synthetic construct <400> 61 tgttgccctg tttcttccaa taca 24 <210> 62 <2l1> 20 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 62 cagagttggc cgacctacgc 20 <210> 63 <211> 32 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <221> VARIANT
<222> 5, 15, 17, 22, 28 <223> X can be any amino acid <400> 63 Gly Glu Pro Ile Xaa Leu Arg Cys His Ser Trp Lys Asp Lys Xaa Leu Xaa Lys Val Thr Tyr Xaa Gln Asn Gly Lys Ala Xaa Lys Phe Phe His <210> 64 <211> 17 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <221> VARIANT
<222> 1 <223> X can be either Glu or Asp <221> VARIANT
<222> 7 <223> X can be either Leu or Ile <221> VARIANT
<222> 17 <223> X can be either Leu or Ile <221> VARIANT
<222> 2-3, 5-6, 8-13, 15-16 <223> X can be any amino acid <400> 64 Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Xaa Xaa <210> 65 - <211> 18 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <221> VARIANT
<222> 1 <223> X can be either Glu or Asp <221> VARIANT
<222> 7 <223> X can be either Leu or Ile <221> VARIANT
<222> 18 ' <223> X can be either Leu or Ile <221> VARIANT
<222> 2-3, 5-6, 8-14, 16-17 <223> X can be any amino acid <400> 65 Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Xaa Xaa <210> 66 <211> 19 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <221> VARIANT
<222> 1 <223> X can be either Glu or Asp <221> VARIANT
<222> 7 <223> X can be either Leu or Ile <221> VARIANT
<222> 19 <223> X can be either Leu or Ile <221> VARIANT
<222> 2-3, 5-6, 8-15, 17-18 <223> X can be any amino acid <400> 66 Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Xaa Xaa <210> 67 <211> 6 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <221> VARIANT
<222> 1 <223> X can be either Ile orVal or Leu or Ser <221> VARIANT
<222> 2, 4-5 <223> X can be any amino acid <221> VARIANT
<222> 6 <223> X can be Leu or Val <400> 67 Xaa Xaa Tyr Xaa Xaa Xaa <210> 68 <211> 492 <212> PRT
<213> Artificial Sequenoe <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 68 Asp Trp Leu Ser Tle Ser Leu Pro His Arg Ser Tyr Glu Gly Asp Gln 1 5 l0 15 Val Val Ile Ser Cys Thr Gly Lys Asn Asn Gly Asp Ile Lys Arg Leu Lys Tyr Phe Lys Asp Gly Tyr His Ile Glu Thr Tyr Ser Ser Ala Ser Ser Tyr Thr Ile Arg Asn Ala Arg Arg Gly Asp Ser Gly Ser Tyr Ser Cys Lys Ala Asp Arg Lys Phe Phe Leu Phe Ile Asp Thr Thr Glu Glu Thr Gly Ser Lys Trp Leu Asn Val G1n Glu Leu Phe Pro Ala Pro Gly Leu Thr A1a Ser Pro Leu Gln Pro Val Glu Gly Ser Ser Va1 Thr Leu Ser Cys Asn Thr Trp Leu Pro Ser Asp Arg Ala Thr Thr Gln Leu Arg Tyr Ser Phe Phe Lys Asp Gly His Thr Leu Gln Ser Gly Trp Thr Ser Ser Lys Phe Thr Ile Ser Ala Ile Ser Lys Glu Asp Ser Gly Asn Tyr Trp Cys Glu Ala Met Thr Ala Ser Arg Ser Val Ser Lys Gln Ser His Arg Ser Tyr Ile Asp Val Glu Arg Ile Pro Val Ser Gln Va1 Thr Met 6 Glu Ile Gln Pro Ser Arg Gly Trp Gly Val Glu Gly Glu Pro Leu Val Val Glu Gly G1u Pro Leu Val Leu Ala Cys Ser Val Ala Lys Gly Thr Gly Leu Ile Thr Phe Ser Trp His Arg Gln Asp Thr Lys Glu Ser Va1 Gly Lys Lys Ser Gln Arg Ser Gln Arg Val Glu Leu G1u Ile Pro Thr Ile Arg Glu Ser His Ala Gly Gly Tyr Tyr Cys Thr Ala Asp Asn Asn Tyr Gly Leu Ile Gln Ser Ala Ile Va1 Asn Ile Thr Val Lys Ile Pro Val Leu Asn Pro Leu Leu Ser Ile Ser Val Pro Gly Val Leu Pro Phe Ile Gly Asp Val Ala Glu Leu His Cys Glu Asp Lys Arg A1a Ser Pro Pro Val Leu Tyr Trp Phe Tyr His Glu Asn Ile Thr Leu Ala Asn Thr Ser Ala Pro Phe Gly Gly Lys A1a Ser Phe Lys Leu Ser Leu Thr A1a Gly His Ser Gly Asn Tyr Ser Cys Glu Ala Glu Asn Ala Trp Gly Thr Lys Arg Ser Glu Val Val Thr Leu Asn Val Thr Glu Pro Pro Pro Lys Val Arg Leu Val Asn Gly Pro His His Cys Glu Gly Arg Val Glu Val Glu Gln Glu Gly Arg Trp Gly Thr Val Cys Asp Asp Gly Trp Asp Met Arg Asp Val Ala Val Val Cys Arg Glu Leu Gly Cys Gly Ala Ala Gln His Thr Pro Ile Ala Met Leu Tyr Pro Pro Ala Val Asp Glu Ala Leu Pro Val Leu Ile Gln Val Ala Leu Cys Asn Gly Thr Glu Lys Thr Leu Ala Glu Cys Asp Gln Val Glu Ala Phe Asp Cys Gly His Asp Glu Asp Ala Gly Ala Val Cys Glu Va1 Leu Pro Ser Thr Phe <210> 69 <211> 17 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 69 Met Pro Leu Cys Leu Leu Leu Leu Val Phe Ala Pro Val Gly Val Gln Ser <210> 70 <211> 383 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 70 Asp Trp Leu Ser Ile Ser Leu Pro His Arg Ser Tyr Glu Gly Asp Gln Val Val Ile Ser Cys Thr Gly Lys Asn Asn Gly Asp Tle Lys Arg Leu Lys Tyr Phe Lys Asp Gly Tyr His Ile Glu Thr Tyr Ser Ser Ala Ser Ser Tyr Thr Ile Arg Asn Ala Arg Arg Gly Asp Ser Gly Ser Tyr Ser Cys Lys Ala Asp Arg Lys Phe Phe Leu Phe Ile Asp Thr Thr Glu Glu Thr Gly Ser Lys Trp Leu Asn Va1 Gln Glu Leu Phe Pro Ala Pro Gly Leu Thr Ala 5er Pro Leu Gln Pro Val Glu Gly Ser Ser Val Thr Leu Ser Cys Asn Thr Trp Leu Pro Ser Asp Arg Ala Thr Thr Gln Leu Arg Tyr Ser Phe Phe Lys Asp Gly His Thr Leu Gln Ser Gly Trp Thr Ser 130 135 140 , Ser Lys Phe Thr Ile Ser Ala Ile 5er Lys Glu Asp Ser Gly Asn Tyr Trp Cys Glu Ala Met Thr Ala Ser Arg Ser Val Ser Lys Gln Ser His Arg Ser Tyr Ile Asp Val Glu Arg Ile Pro Val Ser Gln Val Thr Met Glu Ile Gln Pro Ser Arg Gly Trp Gly Val Glu Gly Glu Pro Leu Val Val Glu Gly Glu Pro Leu Val Leu Ala Cys Ser Val Ala Lys Gly Thr Gly Leu Ile Thr Phe Ser Trp His Arg Gln Asp Thr Lys Glu Ser Val Gly Lys Lys 5er Gln Arg Ser Gln Arg Val Glu Leu Glu Ile Pro Thr Tle Arg Glu Gly His Ala Gly Gly Tyr Tyr Cys Thr Ala Asp Asn Asn Tyr Gly Leu Tle Gln Ser Ala Ile Val Asn Ile Thr Val Lys Ile Pro _ 275 280 285 Val Leu Asn Pro Leu Leu Ser Ile Ser Val Pro Gly Val Leu Pro Phe Ile Gly Asp Val Ala Glu Leu His Cys Glu Asp Lys Arg Ala Ser Pro Pro Val Leu Tyr Trp Phe Tyr His Glu Asn Ile Thr Leu Ala Asn Thr Ser Ala Pro Phe Gly Gly Lys Ala Ser Phe Lys Leu Ser Leu Thr Ala Gly His Ser Gly Asn Tyr Ser Cys Glu Ala Glu Asn Ala Trp Gly Thr Lys Arg Ser Glu Val Val Thr Leu Asn Val Thr Gly Arg Thr Ile <210> 71 <211> 17 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 71 Met Pro Leu Cys Leu Leu Leu Leu Val Phe Ala Pro Val Gly Val Gln Ser <210> 72 <211> 17 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial 5equence:/note =
synthetic construct <400> 72 Met Leu Pro Trp Leu Leu Leu Leu Ile Cys Ala Leu Pro Cys Glu Pro Ala <210> 73 <211> 326 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 73 Gly Ile Ser Asp Val Ser Leu Lys Thr Arg Pro Pro Gly Gly Trp Val Met Glu Gly Asp Lys Leu Val Leu Ile Cys Ser Val Asp Arg Val Thr Gly Asn T1e Thr Tyr Phe Trp Tyr Arg Gly Ala Leu Gly Phe Gln Leu Glu Thr Lys Thr Gln Pro Ser Leu Thr Ala Glu Phe Glu Ile Ser Asp Met Lys Gln Ser Asp Ala Asp Gln Tyr Tyr Cys Ala A1a Asn Asp Gly His Asp Pro Ile Ala Ser Glu Leu Val Ser Ile His Val Arg Val Pro Val Ser Arg Pro Val Leu Thr Phe Gly Asp Ser Gly Thr Gln Ala Val Leu Gly Asp Leu Val Glu Leu His Cys Lys Ala Leu Arg Gly Ser Pro Pro Ile Phe Tyr Gln Phe Tyr His Glu Ser Ile I1e Leu Gly Asn Ser Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Phe Ser Leu Thr Ala Glu His Ser Gly Asn Phe Ser Cys Glu Ala Ser Asn Gly Gln Gly Ala Gln Arg Ser Glu Val Val A1a Leu Asn Leu Thr Gly Leu Ser Leu Val Pro Thr G1u Asn Gly Ile Ser His Leu Ser Leu Gly Leu Thr G1y Trp Leu Leu Gly Cys Leu Sex Pro Ile Thr Met Ala Leu Ile Phe Cys Tyr Trp Leu Lys Arg Lys Ile Gly Arg Gln Ser Glu Asp Pro Val Arg Ser Pro Pro Gln Thr Val Leu Gln Gly Ser Thr Tyr Pro Lys Ser Pro Asp Ser Arg G1n Pro Glu Pro Leu Tyr Glu Asn Va1 Asn Val Val Ser Gly Asn Glu Val Tyr Ser Leu Va1 Tyr His Thr Pro Gln Val Leu Glu Pro Ala Ala Ala Gln His Val Arg Thr His Gly Val Ser Glu Ser Phe Gln Val Ser Ser Gly Leu Tyr Ser Lys Pro Arg Ile Asn Ile Ala His Met Asp Tyr Glu Asp Ala Met <210> 74 <211> 203 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 74 Gly Ile Ser Asp Val Ser Leu Lys Thr Arg Pro Pro Gly Gly Trp Val Met Glu Gly Asp Lys Leu Val Leu Ile Cys Ser Val Asp Arg Val Thr Gly Asn Ile Thr Tyr'Phe Trp Tyr Arg Gly Ala Leu Gly Phe Gln Leu Glu Thr Lys Thr Gln Pro Ser Leu Thr Ala Glu Phe Glu Ile Ser Asp Met Lys Gln 5er Asp Ala Asp Gln Tyr Tyr Cys Ala Ala Asn Asp Gly His Asp Pro Ile Ala Ser Glu Leu Val Ser Ile His Val Arg Val Pro Val Ser Arg Pro Val Leu Thr Phe Gly Asp Ser Gly Thr Gln Ala Val Leu Gly Asp Leu Val Glu Leu His Cys Lys Ala Leu Arg Gly Ser Pro Pro Ile Phe Tyr Gln Phe Tyr His Glu Ser Ile Ile Leu Gly Asn Ser Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Phe Ser Leu Thr Ala Glu His 5er Gly Asn Phe Ser Cys Glu Ala Ser Asn Gly Gln Gly Ala Gln Arg Ser Glu Val Val Ala Leu Asn Leu Thr Gly Leu Ser Leu Val Pro Thr Glu Asn Gly Ile Ser His Leu Ser Leu <210> 75 <211> 17 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 75 Met Leu Pro Trp Leu Leu Leu Leu Ile Cys Ala Leu Pro Cys Glu Pro Ala <210> 76 <211> 100 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 76 Lys Arg Lys Ile Gly Arg Gln Ser Glu Asp Pro Val Arg Ser Pro Pro Gln Thr Val Leu G1n Gly Ser Thr Tyr Pro Lys Ser Pro Asp Ser Arg Gln Pro G1u Pro Leu Tyr Glu Asn Val Asn Val Val Ser Gly Asn Glu Val Tyr Ser Leu Val Tyr His Thr Pro Gln Val Leu Glu Pro Ala Ala Ala Gln His Val Arg Thr His Gly Val Ser Glu Ser Phe Gln Val Ser Ser Gly Leu Tyr Ser Lys Pro Arg Tle Asn Ile Ala His Met Asp Tyr Glu Asp Ala Met <210> 77 <211> 283 <2l2> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 77 Gly Ile Ser Asp Val Ser Leu Lys Thr Arg Pro Pro Gly Gly Trp Val Met Glu Gly Asp Lys Leu Val Leu Tle Cys Ser Val Asp Arg Val Thr Gly Asn Ile Thr Tyr Phe Trp Tyr Arg Gly Ala Leu Gly Phe G1n Leu Glu Thr Lys Thr Gln Pro Ser Leu Thr Ala Glu Phe Glu Ile Ser Asp Met Lys Gln Ser Asp Ala Asp Gln Tyr Tyr Cys Ala Ala Asn Asp Gly His Asp Pro Ile Ala Ser Glu Leu Val Ser Ile His Val Arg Val Pro Val 5er Arg Pro Val Leu Thr Phe Gly Asp Ser Gly Thr Gln A1a Va1 Leu Gly Asp Leu Val Glu Leu His Cys Lys A1a Leu Arg Gly Ser Pro Pro Ile Phe Tyr Gln Phe Tyr His Glu Ser Ile Ile Leu Gly Asn Ser l30 135 140 Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Phe Ser Leu Thr Ala Glu His Ser Gly Asn Phe Ser Cys Glu Ala Ser Asn Gly Gln Gly Ala 165 ' 170 175 Gln Arg Ser Glu Val Val Ala Leu Asn Leu Thr Gly Arg Gln Ser Glu Asp Pro Val Arg Ser Pro Pro G1n Thr Val Leu Gln Gly Ser Thr Tyr Pro Lys Ser Pro Asp Ser Arg Gln Pro Glu Pro Leu Tyr Glu Asn Val Asn Val Val Ser Gly Asn Glu Va1 Tyr Ser Leu Va1 Tyr His Thr Pro Gln Val Leu Glu Pro A1a Ala Ala Gln His Val Arg Thr His Gly Val Ser Glu Ser Phe Gln Val Ser Ser Gly Leu Tyr Ser Lys Pro Arg Ile Asn Ile Ala His Met Asp Tyr Glu Asp Ala Met <210> 78 <211> 570 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 78 Gly Gln His Glu Ala Ala Gln Gln Ser Val Val Ser Leu Gln Pro Pro Trp Thr Thr Phe Phe Arg Gly Glu Val Val Thr Leu Thr Cys Tyr Arg Phe Gly Phe Ser Val Pro Gln Lys Thr Lys Trp Tyr Gln Lys Arg Lys Thr Val Lys Gln Thr Pro Gly Ala Leu Val Ile Lys Ala His Thr Leu Lys Val His Glu Ser Gly Glu Tyr Trp Cys Gln A1a Asp Ser Leu Leu Pro Ser Met His Val Asn Val Glu Phe Ser Glu Asp Phe Leu Val Leu Gln Ala Pro Pro Ala Val Phe Glu Gly Asp Ser Val Val Leu Arg Cys Tyr Ala Lys Lys Gly Ile Glu Ala Glu Thr Leu Thr Phe Tyr Lys Asp Gly Lys Ala Leu Thr Leu His His Gln Ser Glu Leu Ser Tle His His Ala Asn Leu Lys Asp Asn Gly Gln Tyr Lys Cys Thr Ser Lys Lys Lys Trp Ser Phe Gly Ser Leu Tyr Thr Ser Asn Thr Val Gly Val Gln Val Gln Glu Leu Phe Pro Arg Pro Val Leu Arg Ala Arg Pro Ser His Pro Ile Asp Gly Ser Pro Val Thr Leu Thr Cys Gln Thr Gln Leu Ser Ala Gln Lys Ser Asp A1a Arg Leu Gln Phe Cys Phe Phe Arg Asn Leu Gln Leu Leu Gly Ser Gly Cys Ser Arg Ser Ser Glu Phe His I1e Pro Ala Ile Trp Thr Glu Glu Ser Arg Arg Tyr Gln Cys Lys Ala Glu Thr Val Asn Ser Gln Val Arg Lys Gln Ser Thr Ala Phe Ile Ile Pro Val Gln Arg Ala Ser Ala Arg Phe G1n Thr His Ile Ile Pro Ala Ser Lys Leu Val Phe Glu Gly Gln Leu Leu Leu Leu Asn Cys Ser Val Lys Gly Val 2g0 295 300 Pro Gly Pro Leu Lys Phe 5er Trp Tyr Lys Lys Asp Met Leu Asn Glu Glu Thr Lys Ile Leu Lys Ser Ser Asn Ala Glu Phe Lys Ile Ser Gln 325 330 . 335 Val Asn Ile Ser Asp Ala Gly Glu Tyr His Cys Glu Ala Thr Asn Ser Arg Arg Ser Phe Val Ser Arg Ala Phe Pro Ile Thr Ile Lys Val Pro Val Ser G1n Pro Val Leu Thr Leu Ser Thr Gly Lys Thr G1n Ala Leu Glu Gly Asp Leu Met Thr Leu His Cys G1n 5er Gln Arg Gly Ser Pro Cys Ile Leu Tyr Glu Phe Phe Tyr Glu Asn Val Ser Leu G1y Asn Ser Ser Ile Leu Ser Gly Gly Gly Ala Tyr Phe Asn Phe Ser Met Ser Thr Glu Arg Ser Gly Asn Tyr Tyr Cys Thr Ala Asp Asn Gly Leu Gly Ala Gln Cys Ser Glu Ala Tle Arg Ile Ser Ile Phe Asp Met Thr Lys Asn Arg Ser Val Pro Met Ala Ala Gly Ile Thr Val Gly Leu Leu Ile Met Ala Val Gly Val Phe Leu Phe Tyr Cys Trp Phe Ser Arg Lys Ala Gly Gly Lys Pro Thr Ser Asp Asp Ser Arg Asn Pro Ser Asp Ser Glu Pro Gln Glu Pro Thr Tyr Tyr Asn Val Pro Ala Cys Ile Glu Leu Gln Pro Val Tyr Ser Asn Glu Pro Glu Glu Asn Val Ile Tyr Thr Glu Val Arg Arg Thr Gln Pro Arg Gln Lys His Ala Asp Gln Glu Ser Glu Ser Pro Arg Ser Arg Cys Gln Met Ala Glu Lys Lys <210> 79 <211> 25 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 79 Met Ser Gly Ser Phe Ser Pro Cys Val Val Phe Thr G1n Met Trp Leu Thr Leu Leu Val Val Thr Pro Val Asn <210> 80 <211> 468 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 80 Gly Gln His Glu Ala Ala Gln Gln Ser Val Val Ser Leu Gln Pro Pro Trp Thr Thr Phe Phe Arg Gly Glu Val Val Thr Leu Thr Cys Tyr Arg Phe Gly Phe Ser Val Pro Gln Lys Thr Lys Trp Tyr Gln Lys Arg Lys Thr Val Lys Gln Thr Pro Gly Ala Leu Val Ile Lys A1a His Thr Leu Lys Val His Glu Ser Gly Glu Tyr Trp Cys Gln Ala Asp Ser Leu Leu Pro Ser Met His Val Asn Val Glu Phe Ser Glu Asp Phe Leu Val Leu Gln Ala Pro Pro Ala Val Phe Glu Gly Asp Ser Val Val Leu Arg Cys Tyr Ala Lys Lys Gly Ile Glu Ala Glu Thr Leu Thr Phe Tyr Lys Asp Gly Lys Ala Leu Thr Leu His His Gln Ser Glu Leu Ser Ile His His Ala Asn Leu Lys Asp Asn Gly Gln Tyr Lys Cys Thr Ser Lys Lys Lys Trp Ser Phe Gly 5er Leu Tyr Thr Ser Asn Thr Va1 Gly Val Gln Val Gln Glu Leu Phe Pro Arg Pro Val Leu Arg Ala Arg Pro Ser His Pro Ile Asp Gly Ser Pro Val Thr Leu Thr Cys Gln Thr Gln Leu Ser Ala Gln Lys Ser Asp Ala Arg Leu Gln Phe Cys Phe Phe Arg Asn Leu Gln Leu Leu Gly Ser Gly Cys Ser Arg Ser Ser Glu Phe His Ile Pro Ala Ile Trp Thr Glu G1u 5er Arg Arg Tyr Gln Cys Lys Ala Glu Thr Val Asn Ser Gln Val Arg Lys Gln Ser Thr Ala Phe Ile Ile Pro Val Gln Arg Ala Ser Ala Arg Phe Gln Thr His Ile Ile Pro Ala 5er Lys Leu Val Phe Glu Gly Gln Leu Leu Leu Leu Asn Cys Ser Val Lys Gly Val Pro Gly Pro Leu Lys Phe Ser Trp Tyr Lys Lys Asp Met Leu Asn Glu Glu Thr Lys Ile Leu Lys Ser Ser Asn Ala Glu Phe Lys I1e Ser Gln Val Asn Ile Ser Asp Ala Gly Glu Tyr His Cys Glu Ala Thr Asn Ser Arg Arg Ser Phe Val Ser Arg Ala Phe Pro Ile Thr Ile Lys Val Pro Val Ser Gln Pro Val Leu Thr Leu Ser Thr Gly Lys Thr Gln Ala Leu Glu Gly Asp Leu Met Thr Leu His Cys Gln Ser Gln Arg Gly Ser Pro Cys Ile Leu Tyr G1u Phe Phe Tyr Glu Asn Val Ser Leu Gly Asn Ser Ser Ile Leu Ser Gly Gly Gly Ala Tyr Phe Asn Phe Ser Met Ser Thr Glu Arg Ser Gly Asn Tyr Tyr Cys Thr Ala Asp Asn Gly Leu Gly Ala Gln Cys Ser Glu Ala I1e Arg Ile Ser Ile Phe Asp Met Thr Lys Asn Arg 5er Val Pro <210> 81 <211> 79 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 81 Ser Arg Lys Ala Gly Gly Lys Pro Thr Ser Asp Asp Ser Arg Asn Pro Ser Asp Ser Glu Pro Gln Glu Pro Thr Tyr Tyr Asn Val Pro Ala Cys Ile Glu Leu Gln Pro Val Tyr Ser Asn Glu Pro Glu Glu Asn Val Ile Tyr Thr G1u Val Arg Arg Thr Gln Pro Arg Gln Lys His Ala Asp Gln Glu Ser Glu 5er Pro Arg Ser Arg Cys Gln Met Ala Glu Lys Lys <210> 82 <211> 1973 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
82 ctatcccagatccgtggtccatctgcccta ctgcacctgt60 ccacagtgtt aggacttgag ctcaaagggagctacttgcctctagtctcatgcctctgtgcttgctgcttctggtcttcg120 ctcctgtcggagtccagtccgactggttgagcatcagccttccacaccgttcttatgaag180 gagaccaagtagttataagctgcacaggaaaaaataatggtgacataaagagactgaagt240 acttcaaggatggatatcacatagaaacttacagcagtgcttcaagctacaccattagga300 atgcaagacgtggtgacagtggctcctattcctgtaaggcagataggaaatttttcctat360 ttatagacacaacagaagaaacaggatctaagtggctgaatgtccaagagctgtttccag420 cacctgggctgacagccagccccctgcagcccgtagaggggagttcagtgaccctgtcct480 gcaacacctggctcccttcagatagggcaacgacccagctacgctattccttcttcaaag540 atggccacactttgcaatcgggctggacctcatcaaaatttaccatctcagcaatatcga600 aggaagactcaggaaattactggtgtgaagcaatgactgcctctcgcagtgtctcaaagc660 agagtcaccggtcctacatagatgtagagaggatccctgtatctcaagtcaccatggaaa720 tccagccttcaaggggctggggagttgaaggggagccactggtcgttgaaggggagcccc780 tggtcctggcttgttctgtggctaaaggcaccgggctaatcacgttctoctggcataggc840 aggacactaaggaaagtgtggggaagaaaagtcagcgttcccagagagtggagctggaga900 tccctactatcagggaaggccatgctggggggtactactgcacagcagacaacaactacg960 gcctgatccagagcgcaatcgtgaacatcaccgtgaaaattccagtgttgaacccgctcc1020 tctccatcagtgttcctggggtcttgcccttcatcggagatgtggcggagcttcactgtg1080 aagacaagagagcatctcctccggttctctactggttttatcatgaaaatatcactctgg1140 ctaacacctcggcaccttttggaggaaaggcatcctttaagctctctctgactgcagggc1200 attctgggaactactcttgtgaggctgaaaacgcctggggtaccaagcgcagtgaggtgg1260 taacgctcaatgtcacagagcccccacccaaagtgcgtttggtgaatggcccccaccact1320 gtgaaggacgcgtagaggtggagcaggaaggtcgctggggcactgtatgtgatgatggct1380 gggacatgagggatgtggctgtggtgtgccgagagctgggctgtggagcagcccaacaca1440 cacctatagccatgctgtatccaccagcagttgatgaagctctgcctgtgctcattcagg1500 tagccctgtgcaatggcacagaaaagaccctggctgaatgtgaccaggttgaggcctttg1560 attgtggacatgatgaggatgctggagctgtgtgtgaagtcttacccagcactttctgaa1620 gatctagagaccagagaccatcagacctcctactttctgcactgggcctcacagccctca1680 cggtctgcagctcccagtggacttccagacttcagctgtggcttatccttcaagaggact1740 cgaaactatattaatctgctctgagataatgttccaacagctccaaagaaagcccgagtc1800 ccttgtccccagaggccaagcttggaaaaattgttcccctgtccaggttccctgcctttc1860 tagttccttcttgctatctccttgggcagatgcagaggtggcacaagtaaggatcacata1920 catgtgcctgggcttccatctggtagaatgtggtctaacaaagcacatac 1973 aac <210> 83 <211> 1530 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 83 atgcctctgtgcttgctgcttctggtcttcgctcctgtcggagtccagtccgactggttg60 agcatcagccttccacaccgttcttatgaaggagaccaagtagttataagctgcacagga120 aaaaataatggtgacataaagagactgaagtacttcaaggatggatatcacatagaaact180 tacagcagtgcttcaagctacaccattaggaatgcaagacgtggtgacagtggctcctat240 tcctgtaaggcagataggaaatttttcctatttatagacacaacagaagaaacaggatct300 aagtggctgaatgtccaagagctgtttccagcacctgggctgacagccagccccctgcag360 cccgtagaggggagttcagtgaccctgtcctgcaacacctggctcccttcagatagggca420 acgacccagctacgctattccttcttcaaagatggccacactttgcaatcgggctggacc480 tcatcaaaatttaccatctcagcaatatcgaaggaagactcaggaaattactggtgtgaa540 gcaatgactgcctctcgcagtgtctcaaagcagagtcaccggtcctacatagatgtagag600 aggatccctgtatctcaagtcaccatggaaatccagccttcaaggggctggggagttgaa660 ggggagccactggtcgttgaaggggagcccctggtcctggcttgttctgtggctaaaggc720 accgggctaatcacgttctcctggcataggcaggacactaaggaaagtgtggggaagaaa780 agtcagcgttcccagagagtggagctggagatccctactatcagggaaggccatgctggg840 gggtactactgcacagcagacaacaactacggcctgatccagagcgcaatcgtgaacatc900 accgtgaaaattccagtgttgaacccgctcctctccatcagtgttcctggggtcttgccc960 ttcatcggagatgtggcggagcttcactgtgaagacaagagagcatctcctccggttctc1020 tactggttttatcatgaaaatatcactctggctaacacctcggcaccttttggaggaaag1080 gcatcctttaagctctctctgactgcagggcattctgggaactactcttgtgaggctgaa1140 aacgcctggggtaccaagcgcagtgaggtggtaacgctcaatgtcacagagcccccaccc1200 aaagtgcgtttggtgaatggcccccaccactgtgaaggacgcgtagaggtggagcaggaa1260 ggtcgctggggcactgtatgtgatgatggctgggacatgagggatgtggctgtggtgtgc1320 cgagagctgggctgtggagcagcccaacacacacctatagccatgctgtatccaccagca1380 gttgatgaagctctgcctgtgctcattcaggtagccctgtgcaatggcacagaaaagacc1440 ctggctgaatgtgaccaggttgaggcctttgattgtggacatgatgaggatgctggagct1500 gtgtgtgaagtcttacccagcactttctga 1530 <210>
<211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223> iption equence:/note =
Descr of Artificial S
synth etic construct <400>
84 ctatcccagatccgtggtccatctgccctaaggacttgagctgcacctgt60 ccacagtgtt ctcaaagggagctacttgcctctagtctcatgcctctgtgcttgctgcttctggtcttcg120 ctcctgtcggagtccagtccgactggttgagcatcagccttccacaccgttcttatgaag180 gagaccaagtagttataagctgcacaggaaaaaataatggtgacataaagagactgaagt240 acttcaaggatggatatcacatagaaacttacagcagtgcttcaagctacaccattagga300 atgcaagacgtggtgacagtggctcctattcctgtaaggcagataggaaatttttcctat360 ttatagacacaacagaagaaacaggatctaagtggctgaatgtccaagagctgtttccag420 cacctgggctgacagccagccccctgcagcccgtagaggggagttcagtgaccctgtcct480 gcaacacctggctcccttcagatagggcaacgacccagctacgctattccttcttcaaag540 atggccacactttgcaatcgggctggacctcatcaaaatttaccatctcagcaatatcga600 aggaagactcaggaaattactggtgtgaagcaatgactgcctctcgcagtgtctcaaagc660 agagtcaccggtcctacatagatgtagagaggatccctgtatctcaagtcaccatggaaa720 tccagccttcaaggggctggggagttgaaggggagccactggtcgttgaaggggagcccc780 tggtcctggcttgttctgtggctaaaggcaccgggctaatcacgttctcctggcataggc840 aggacactaaggaaagtgtggggaagaaaagtcagcgttcccagagagtggagctggaga900 tccctactatcagggaaggccatgctggggggtactactgcacagcagacaacaactacg960 gcctgatccagagcgcaatcgtgaacatcaccgtgaaaattccagtgttgaacccgctcc1020 tctccatcagtgttcctggggtcttgcccttcatcggagatgtggcggagcttcactgtg1080 aagacaagagagcatctcctccggttctctactggttttatcatgaaaatatcactctgg1140 ctaacacctcggcaccttttggaggaaaggcatcctttaagctctctctgactgcagggc1200 attctgggaactactcttgtgaggctgaaaacgcctggggtaccaagcgcagtgaggtgg1260 taacgctcaatgtcacaggtaggacaatttaatgatccattccagggtgcaacttgcctt1320 ctggccatgcccttcttctctcccttgcacctgtacctcttggtctttgaa 1371 <210> 85 <211> 1203 <212> DNA
<213> Artificial Sequence <220>
<223>
Description of Artificial 5equence:/note =
synth etic construct <400>
85 gcttgctgcttctggtcttcgctcctgtcggagtccagtccgactggttg60 atgcctctgt agcatcagccttccacaccgttcttatgaaggagaccaagtagttataagctgcacagga120 aaaaataatggtgacataaagagactgaagtacttcaaggatggatatcacatagaaact180 tacagcagtgcttcaagctacaccattaggaatgcaagacgtggtgacagtggctcctat240 tcctgtaaggcagataggaaatttttcctatttatagacacaacagaagaaacaggatct300 aagtggctgaatgtccaagagctgtttccagcacctgggctgacagccagccccctgcag360 cccgtagaggggagttcagtgaccctgtcctgcaacacctggctcccttcagatagggca420 acgacccagctacgctattccttcttcaaagatggccacactttgcaatcgggctggacc480 tcatcaaaatttaccatctcagcaatatcgaaggaagactcaggaaattactggtgtgaa540 gcaatgactgcctctcgcagtgtctcaaagcagagtcaccggtcctacatagatgtagag600 aggatccctgtatctcaagtcaccatggaaatccagccttcaaggggctggggagttgaa660 ggggagccactggtcgttgaaggggagcccctggtcctggcttgttctgtggctaaaggc720 accgggctaatcacgttctcctggcataggcaggacactaaggaaagtgtggggaagaaa780 agtcagcgttcccagagagtggagctggagatccctactatcagggaaggccatgctggg840 gggtactactgcacagcagacaacaactacggcctgatccagagcgcaatcgtgaacatc900 accgtgaaaattccagtgttgaacccgctcctctccatcagtgttcctggggtcttgccc960 ttcatcggagatgtggcggagcttcactgtgaagacaagagagcatctcctccggttctc1020 tactggttttatcatgaaaatatcactctggctaacacctcggcaccttttggaggaaag1080 gcatcctttaagctctctctgactgcagggcattctgggaactactcttgtgaggctgaa1140 aacgcctggggtaccaagcgoagtgaggtggtaacgctcaatgtcacaggtaggacaatt1200 taa <210> 86 <211> 1479 <212> DNA
<213> Artificial Sequence <220>
<223> Description rtificialequence:/note =
of A S
synth etic construct <400> 86 gactggttgagcatcagccttccacaccgttcttatgaaggagaccaagtagttataagc60 tgcacaggaaaaaataatggtgacataaagagactgaagtacttcaaggatggatatcac120 atagaaacttacagcagtgcttcaagctacaccattaggaatgcaagacgtggtgacagt180 ggctcctattcctgtaaggcagataggaaatttttcctatttatagacacaacagaagaa240 acaggatctaagtggctgaatgtccaagagctgtttccagcacctgggctgacagccagc300 cccctgcagcccgtagaggggagttcagtgaccctgtcctgcaacacctggctcccttca360 gatagggcaacgacccagctacgctattccttcttcaaagatggccacactttgcaatcg420 ggctggacctcatcaaaatttaccatctcagcaatatcgaaggaagactcaggaaattac480 tggtgtgaagcaatgactgcctctcgcagtgtctcaaagcagagtcaccggtcctacata540 gatgtagagaggatccctgtatctcaagtcaccatggaaatccagccttcaaggggctgg600 ggagttgaaggggagccactggtcgttgaaggggagcccctggtcctggcttgttctgtg660 gctaaaggcaccgggctaatcacgttctcctggcataggcaggacactaaggaaagtgtg720 gggaagaaaagtcagcgttcccagagagtggagctggagatccctactatcagggaaggc780 catgctggggggtactactgcacagcagacaacaactacggcctgatccagagcgcaatc840 gtgaacatcaccgtgaaaattccagtgttgaacccgctcctctccatcagtgttcctggg900 gtcttgcccttcatcggagatgtggcggagcttcactgtgaagacaagagagcatctcct960 ccggttctctactggttttatcatgaaaatatcactctggctaacacctcggcacctttt1020 ' ggaggaaaggcatcctttaagctctctctgactgcagggcattctgggaactactcttgt1080 gaggctgaaaacgcctggggtaccaagcgcagtgaggtggtaacgctcaatgtcacagag1140 cccccacccaaagtgcgtttggtgaatggcccccaccactgtgaaggacgcgtagaggtg1200 aag gtcgctggggcactgtatgtgatgatggctgggacatgagggatgtggct1260 ca a gg gagagctgggctgtggagcagcccaacacacacctatagccatgctgtat1320 g g gtggtgtgcc ccaccagcagttgatgaagctctgcctgtgctcattcaggtagccctgtgcaatggcaca1380 accc tggctgaatgtgaccaggttgaggcctttgattgtggacatgatgaggat1440 aaaa g tgtgtgaagtcttacccagcactttctga 1479 g gctggagctg <210>
<211>
<212>
DNA
<213>
Artificial Sequence <220>
<223>
Description of Artificial Sequence:/note =
synth etic construct <400>
87 gcatcagccttccacaccgttcttatgaaggagaccaagtagttataagc60 gactggttga tgcacaggaaaaaataatggtgacataaagagactgaagtacttcaaggatggatatcac120 atagaaacttacagcagtgcttcaagctacaccattaggaatgcaagacgtggtgacagt180 ggctcctattcctgtaaggcagataggaaatttttcctatttatagacacaacagaagaa240 acaggatctaagtggctgaatgtccaagagctgtttccagcacctgggctgacagccagc300 cccctgcagcccgtagaggggagttcagtgaccctgtcctgcaacacctggctcccttca360 gatagggcaacgacccagctacgctattccttcttcaaagatggccacactttgcaatcg420 ggctggacctcatcaaaatttaccatctcagcaatatcgaaggaagactcaggaaattac480 tggtgtgaagcaatgactgcctctcgcagtgtctcaaagcagagtcaccggtcctacata540 gatgtagagaggatccctgtatctcaagtcaccatggaaatccagccttcaaggggctgg600 ggagttgaaggggagccactggtcgttgaaggggagcccctggtcctggcttgttctgtg660 gctaaaggcaccgggctaatcacgttctcctggcataggcaggacactaaggaaagtgtg720 gggaagaaaagtcagcgttcccagagagtggagctggagatccctactatcagggaaggc780 catgctggggggtactactgcacagcagacaacaactacggcctgatccagagcgcaatc840 gtgaacatcaccgtgaaaattccagtgttgaacccgctcctctccatcagtgttcctggg900 gtcttgcccttcatcggagatgtggcggagcttcactgtgaagacaagagagcatctcct960 ccggttctctactggttttatcatgaaaatatcactctggctaacacctcggcacctttt1020 ggaggaaaggcatcctttaagctctctctgactgcagggcattctgggaactactcttgt1080 gaggctgaaaacgcctggggtaccaagcgcagtgaggtggtaacgctcaatgtcacaggt1140 aggacaatttas <210>
<211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223> iption equence:/note =
Descr of Artificial S
synth etic construct <400>
88 ggttttgcttttttttcttttggtgagaggtaccttcaagttaccatccc60 tgagtagtct cagcctggtcctcatgctaccttggctcctgctactgatctgtgctctaccgtgtgaacc120 tgctggaatctctgatgtgagcttgaagacacggcccccaggaggatgggtgatggaggg180 agacaagctggtcctcatctgctcggttgatagagtcactgggaatataacttacttctg240 gtacagaggggccctgggtttccaactggaaacaaagacacaaccttcactaacagcaga300 gtttgagatcagtgacatgaagcagagcgatgctgatcaatattactgtgcggctaacga360 tggccacgaccctatcgccagtgagctggtgagcatccacgtcagagttccagtgtctcg420 ccctgtccttacgtttggggactctggaacccaggctgtgctaggggacctggtggagct480 tcactgtaaggccctgagaggctcacccccaatcttctaccagttttatcatgagagcat540 catcctggggaacagttcagcaccctctggaggaggagcatccttcaacttctccctgac600 tgcagaacattctggaaacttctcctgtgaggccagcaatggacagggtgcccaacgaag660 tgaggtggtggctctcaacttaacaggtctctccttagtgcctactgagaatggaatcag720 ccatctctccttaggactcactgggtggctgcttggctgtcttagccccatcaccatggc780 cttaatattttgctactggctcaagagaaaaataggaagacagtcagaggatccagtcag840 gagccctcctcagactgtgctccaaggatccacgtaccccaaatcccccgactcaaggca900 gccagagcccctgtatgagaacgtgaacgttgtaagtggcaatgaagtgtactctctggt960 gtaccacaccccgcaggtgctggaaccagcagcagctcagcatgtgaggacacacggagt1020 aagtgagtcctttcaggtctcctctggactctattctaagccaaggataaacattgcaca1080 tatggactatgaagacgccatgtagaattatgtaaacagcaactatggagtgctacatac1140 aagcccaaggcctgatgtggcctccaaggatactggggacagggatagcttgccagccca1200 atttccccacacactgcggttcattagatgagtccttcacctaccctgtgtgaagctgga1260 gcaagtcctgcagaaaccacccaggaaaaccaacttagacggagaagccagaagcatttg1320 catctggttgttgcccattcatgttggcacacgaacttttatttacaggaggaaaatggt1380 gtgatgaaagcaactaaggtcttacagcagagggacaatgcgactcagagagcacaaagc1440 cgagatcaatggctttgcaggtctgctgtggagacagagccatgcttcctctgtgcacat1500 accctagagtacttctgagtcactgccatcaacttagaattaaacacagttgcataaaat1560 gtactgt <210>
<211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223>
Description of Artificial Sequence:/note =
synth etic construct <400>
89 ggctcctgctactgatctgtgctctaccgtgtgaacctgctggaatctct 60 atgctacctt gatgtgagcttgaagacacggcccccaggaggatgggtgatggagggagacaagctggtc 120 ctcatctgctcggttgatagagtcactgggaatataacttacttctggtacagaggggcc 180 ctgggtttccaactggaaacaaagacacaaccttcactaacagcagagtttgagatcagt 240 gacatgaagcagagcgatgctgatcaatattactgtgcggctaacgatggccacgaccct 300 atcgccagtgagctggtgagcatccacgtcagagttccagtgtctcgccctgtccttacg 360 tttggggactctggaacccaggctgtgctaggggacctggtggagcttcactgtaaggcc 420 ctgagaggctcacccccaatcttctaccagttttatcatgagagcatcatcctggggaac 480 agttcagcaccctctggaggaggagcatccttcaacttctccctgactgcagaacattct 540 ggaaacttctcctgtgaggccagcaatggacagggtgcccaacgaagtgaggtggtggct 600 ctcaacttaacaggtctctccttagtgcctactgagaatggaatcagccatctctcctta 660 ggactcactgggtggctgcttggctgtcttagccccatcaccatggccttaatattttgc 720 tactggctcaagagaaaaataggaagacagtcagaggatccagtcaggagccctcctcag 780 actgtgctccaaggatccacgtaccccaaatcccccgactcaaggcagccagagcccctg 840 tatgagaacgtgaacgttgtaagtggcaatgaagtgtactctctggtgtaccacaccccg 900 caggtgctggaaccagcagcagctcagcatgtgaggacacacggagtaagtgagtccttt 960 caggtctcctctggactctattctaagccaaggataaacattgcacatatggactatgaa 1020 gacgccatgtag <210>
<211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 90 ggaatctctgatgtgagcttgaagacacggcccccaggaggatgggtgatggagggagac60 aagctggtcctcatctgctcggttgatagagtcactgggaatataacttacttctggtac120 agaggggccctgggtttccaactggaaacaaagacacaaccttcactaacagcagagttt180 gagatcagtgacatgaagcagagcgatgctgatcaatattactgtgcggctaacgatggc240 cacgaccctatcgccagtgagctggtgagcatccacgtcagagttccagtgtctcgccct300 gtccttacgtttggggactctggaacccaggctgtgctaggggacctggtggagcttcac360 tgtaaggccctgagaggctcacccccaatcttctaccagttttatcatgagagcatcatc420 ctggggaacagttcagcaccctctggaggaggagcatccttcaacttctccctgactgca 480 gaacattctggaaacttctcctgtgaggccagcaatggacagggtgcccaacgaagtgag 540 gtggtggctctcaacttaacaggtctctccttagtgcctactgagaatggaatcagccat 600 ctctccttaggactcactgggtggctgcttggctgtcttagccccatcaccatggcctta 660 atattttgctactggctcaagagaaaaataggaagacagtcagaggatccagtcaggagc 720 cctcctcagactgtgctccaaggatccacgtaccccaaatcccccgactcaaggcagcca 780 gagcccctgtatgagaacgtgaacgttgtaagtggcaatgaagtgtactctctggtgtac 840 cacaccccgcaggtgctggaaccagcagcagctcagcatgtgaggacacacggagtaagt 900 gagtcctttcaggtctcctctggactctattctaagccaaggataaacattgcacatatg 960 gactatgaagacgccatgtag 981 <210>
9l <211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223>
Description of Artificial Sequence:/note =
synth etic construct <400>
91 ggctcctgctactgatctgtgctctaccgtgtgaacctgctggaatctct60 atgctacctt gatgtgagcttgaagacacggcccccaggaggatgggtgatggagggagacaagctggtc120 ctcatctgctcggttgatagagtcactgggaatataacttacttctggtacagaggggcc180 ctgggtttccaactggaaacaaagacacaaccttcactaacagcagagtttgagatcagt240 gacatgaagcagagcgatgctgatcaatattactgtgcggctaacgatggccacgaccct300 atcgccagtgagctggtgagcatccacgtcagagttccagtgtctcgccctgtccttacg360 tttggggactctggaacccaggctgtgctaggggacctggtggagcttcactgtaaggcc420 ctgagaggctcacccccaatcttctaccagttttatcatgagagcatcatcctggggaac480 agttcagcaccctctggaggaggagcatccttcaacttctccctgactgcagaacattct540 ggaaacttctcctgtgaggccagcaatggacagggtgcccaacgaagtgaggtggtggct600 ctcaacttaacaggtctctccttagtgcctactgagaatggaatcagccatctctcctta660 <210>
<211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223>
Description of Artificial Sequence:/note =
synth etic construct <400>
92 atgtgagcttgaagacacggcccccaggaggatgggtgatggagggagac60 ggaatctctg aagctggtcctcatctgctcggttgatagagtcactgggaatataacttacttctggtac120 agaggggccctgggtttccaactggaaacaaagacacaaccttcactaacagcagagttt180 gagatcagtgacatgaagcagagcgatgctgatcaatattactgtgcggctaacgatggc240 cacgaccctatcgccagtgagctggtgagcatccacgtcagagttccagtgtctcgccct300 gtccttacgtttggggactctggaacccaggctgtgctaggggacctggtggagcttcac360 tgtaaggccctgagaggctcacccccaatcttctaccagttttatcatgagagcatcatc420 ctggggaacagttcagcaccctctggaggaggagcatccttcaacttctccctgactgca480 gaacattctggaaacttctcctgtgaggccagcaatggacagggtgcccaacgaagtgag540 gtggtggctctcaacttaacaggtctctccttagtgcctactgagaatggaatcagccat600 ctctcctta <210> 93 <211> 303 <212> DNA
<213> Artificial Sequence <220>
synthetic construct <221> VARIANT
<222> 5, 15, 17, 22, 28 <223> X can be any amino acid <400> 63 Gly Glu Pro Ile Xaa Leu Arg Cys His Ser Trp Lys Asp Lys Xaa Leu Xaa Lys Val Thr Tyr Xaa Gln Asn Gly Lys Ala Xaa Lys Phe Phe His <210> 64 <211> 17 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <221> VARIANT
<222> 1 <223> X can be either Glu or Asp <221> VARIANT
<222> 7 <223> X can be either Leu or Ile <221> VARIANT
<222> 17 <223> X can be either Leu or Ile <221> VARIANT
<222> 2-3, 5-6, 8-13, 15-16 <223> X can be any amino acid <400> 64 Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Xaa Xaa <210> 65 - <211> 18 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <221> VARIANT
<222> 1 <223> X can be either Glu or Asp <221> VARIANT
<222> 7 <223> X can be either Leu or Ile <221> VARIANT
<222> 18 ' <223> X can be either Leu or Ile <221> VARIANT
<222> 2-3, 5-6, 8-14, 16-17 <223> X can be any amino acid <400> 65 Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Xaa Xaa <210> 66 <211> 19 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <221> VARIANT
<222> 1 <223> X can be either Glu or Asp <221> VARIANT
<222> 7 <223> X can be either Leu or Ile <221> VARIANT
<222> 19 <223> X can be either Leu or Ile <221> VARIANT
<222> 2-3, 5-6, 8-15, 17-18 <223> X can be any amino acid <400> 66 Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Xaa Xaa <210> 67 <211> 6 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <221> VARIANT
<222> 1 <223> X can be either Ile orVal or Leu or Ser <221> VARIANT
<222> 2, 4-5 <223> X can be any amino acid <221> VARIANT
<222> 6 <223> X can be Leu or Val <400> 67 Xaa Xaa Tyr Xaa Xaa Xaa <210> 68 <211> 492 <212> PRT
<213> Artificial Sequenoe <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 68 Asp Trp Leu Ser Tle Ser Leu Pro His Arg Ser Tyr Glu Gly Asp Gln 1 5 l0 15 Val Val Ile Ser Cys Thr Gly Lys Asn Asn Gly Asp Ile Lys Arg Leu Lys Tyr Phe Lys Asp Gly Tyr His Ile Glu Thr Tyr Ser Ser Ala Ser Ser Tyr Thr Ile Arg Asn Ala Arg Arg Gly Asp Ser Gly Ser Tyr Ser Cys Lys Ala Asp Arg Lys Phe Phe Leu Phe Ile Asp Thr Thr Glu Glu Thr Gly Ser Lys Trp Leu Asn Val G1n Glu Leu Phe Pro Ala Pro Gly Leu Thr A1a Ser Pro Leu Gln Pro Val Glu Gly Ser Ser Va1 Thr Leu Ser Cys Asn Thr Trp Leu Pro Ser Asp Arg Ala Thr Thr Gln Leu Arg Tyr Ser Phe Phe Lys Asp Gly His Thr Leu Gln Ser Gly Trp Thr Ser Ser Lys Phe Thr Ile Ser Ala Ile Ser Lys Glu Asp Ser Gly Asn Tyr Trp Cys Glu Ala Met Thr Ala Ser Arg Ser Val Ser Lys Gln Ser His Arg Ser Tyr Ile Asp Val Glu Arg Ile Pro Val Ser Gln Va1 Thr Met 6 Glu Ile Gln Pro Ser Arg Gly Trp Gly Val Glu Gly Glu Pro Leu Val Val Glu Gly G1u Pro Leu Val Leu Ala Cys Ser Val Ala Lys Gly Thr Gly Leu Ile Thr Phe Ser Trp His Arg Gln Asp Thr Lys Glu Ser Va1 Gly Lys Lys Ser Gln Arg Ser Gln Arg Val Glu Leu G1u Ile Pro Thr Ile Arg Glu Ser His Ala Gly Gly Tyr Tyr Cys Thr Ala Asp Asn Asn Tyr Gly Leu Ile Gln Ser Ala Ile Va1 Asn Ile Thr Val Lys Ile Pro Val Leu Asn Pro Leu Leu Ser Ile Ser Val Pro Gly Val Leu Pro Phe Ile Gly Asp Val Ala Glu Leu His Cys Glu Asp Lys Arg A1a Ser Pro Pro Val Leu Tyr Trp Phe Tyr His Glu Asn Ile Thr Leu Ala Asn Thr Ser Ala Pro Phe Gly Gly Lys A1a Ser Phe Lys Leu Ser Leu Thr A1a Gly His Ser Gly Asn Tyr Ser Cys Glu Ala Glu Asn Ala Trp Gly Thr Lys Arg Ser Glu Val Val Thr Leu Asn Val Thr Glu Pro Pro Pro Lys Val Arg Leu Val Asn Gly Pro His His Cys Glu Gly Arg Val Glu Val Glu Gln Glu Gly Arg Trp Gly Thr Val Cys Asp Asp Gly Trp Asp Met Arg Asp Val Ala Val Val Cys Arg Glu Leu Gly Cys Gly Ala Ala Gln His Thr Pro Ile Ala Met Leu Tyr Pro Pro Ala Val Asp Glu Ala Leu Pro Val Leu Ile Gln Val Ala Leu Cys Asn Gly Thr Glu Lys Thr Leu Ala Glu Cys Asp Gln Val Glu Ala Phe Asp Cys Gly His Asp Glu Asp Ala Gly Ala Val Cys Glu Va1 Leu Pro Ser Thr Phe <210> 69 <211> 17 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 69 Met Pro Leu Cys Leu Leu Leu Leu Val Phe Ala Pro Val Gly Val Gln Ser <210> 70 <211> 383 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 70 Asp Trp Leu Ser Ile Ser Leu Pro His Arg Ser Tyr Glu Gly Asp Gln Val Val Ile Ser Cys Thr Gly Lys Asn Asn Gly Asp Tle Lys Arg Leu Lys Tyr Phe Lys Asp Gly Tyr His Ile Glu Thr Tyr Ser Ser Ala Ser Ser Tyr Thr Ile Arg Asn Ala Arg Arg Gly Asp Ser Gly Ser Tyr Ser Cys Lys Ala Asp Arg Lys Phe Phe Leu Phe Ile Asp Thr Thr Glu Glu Thr Gly Ser Lys Trp Leu Asn Va1 Gln Glu Leu Phe Pro Ala Pro Gly Leu Thr Ala 5er Pro Leu Gln Pro Val Glu Gly Ser Ser Val Thr Leu Ser Cys Asn Thr Trp Leu Pro Ser Asp Arg Ala Thr Thr Gln Leu Arg Tyr Ser Phe Phe Lys Asp Gly His Thr Leu Gln Ser Gly Trp Thr Ser 130 135 140 , Ser Lys Phe Thr Ile Ser Ala Ile 5er Lys Glu Asp Ser Gly Asn Tyr Trp Cys Glu Ala Met Thr Ala Ser Arg Ser Val Ser Lys Gln Ser His Arg Ser Tyr Ile Asp Val Glu Arg Ile Pro Val Ser Gln Val Thr Met Glu Ile Gln Pro Ser Arg Gly Trp Gly Val Glu Gly Glu Pro Leu Val Val Glu Gly Glu Pro Leu Val Leu Ala Cys Ser Val Ala Lys Gly Thr Gly Leu Ile Thr Phe Ser Trp His Arg Gln Asp Thr Lys Glu Ser Val Gly Lys Lys 5er Gln Arg Ser Gln Arg Val Glu Leu Glu Ile Pro Thr Tle Arg Glu Gly His Ala Gly Gly Tyr Tyr Cys Thr Ala Asp Asn Asn Tyr Gly Leu Tle Gln Ser Ala Ile Val Asn Ile Thr Val Lys Ile Pro _ 275 280 285 Val Leu Asn Pro Leu Leu Ser Ile Ser Val Pro Gly Val Leu Pro Phe Ile Gly Asp Val Ala Glu Leu His Cys Glu Asp Lys Arg Ala Ser Pro Pro Val Leu Tyr Trp Phe Tyr His Glu Asn Ile Thr Leu Ala Asn Thr Ser Ala Pro Phe Gly Gly Lys Ala Ser Phe Lys Leu Ser Leu Thr Ala Gly His Ser Gly Asn Tyr Ser Cys Glu Ala Glu Asn Ala Trp Gly Thr Lys Arg Ser Glu Val Val Thr Leu Asn Val Thr Gly Arg Thr Ile <210> 71 <211> 17 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 71 Met Pro Leu Cys Leu Leu Leu Leu Val Phe Ala Pro Val Gly Val Gln Ser <210> 72 <211> 17 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial 5equence:/note =
synthetic construct <400> 72 Met Leu Pro Trp Leu Leu Leu Leu Ile Cys Ala Leu Pro Cys Glu Pro Ala <210> 73 <211> 326 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 73 Gly Ile Ser Asp Val Ser Leu Lys Thr Arg Pro Pro Gly Gly Trp Val Met Glu Gly Asp Lys Leu Val Leu Ile Cys Ser Val Asp Arg Val Thr Gly Asn T1e Thr Tyr Phe Trp Tyr Arg Gly Ala Leu Gly Phe Gln Leu Glu Thr Lys Thr Gln Pro Ser Leu Thr Ala Glu Phe Glu Ile Ser Asp Met Lys Gln Ser Asp Ala Asp Gln Tyr Tyr Cys Ala A1a Asn Asp Gly His Asp Pro Ile Ala Ser Glu Leu Val Ser Ile His Val Arg Val Pro Val Ser Arg Pro Val Leu Thr Phe Gly Asp Ser Gly Thr Gln Ala Val Leu Gly Asp Leu Val Glu Leu His Cys Lys Ala Leu Arg Gly Ser Pro Pro Ile Phe Tyr Gln Phe Tyr His Glu Ser Ile I1e Leu Gly Asn Ser Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Phe Ser Leu Thr Ala Glu His Ser Gly Asn Phe Ser Cys Glu Ala Ser Asn Gly Gln Gly Ala Gln Arg Ser Glu Val Val A1a Leu Asn Leu Thr Gly Leu Ser Leu Val Pro Thr G1u Asn Gly Ile Ser His Leu Ser Leu Gly Leu Thr G1y Trp Leu Leu Gly Cys Leu Sex Pro Ile Thr Met Ala Leu Ile Phe Cys Tyr Trp Leu Lys Arg Lys Ile Gly Arg Gln Ser Glu Asp Pro Val Arg Ser Pro Pro Gln Thr Val Leu Gln Gly Ser Thr Tyr Pro Lys Ser Pro Asp Ser Arg G1n Pro Glu Pro Leu Tyr Glu Asn Va1 Asn Val Val Ser Gly Asn Glu Val Tyr Ser Leu Va1 Tyr His Thr Pro Gln Val Leu Glu Pro Ala Ala Ala Gln His Val Arg Thr His Gly Val Ser Glu Ser Phe Gln Val Ser Ser Gly Leu Tyr Ser Lys Pro Arg Ile Asn Ile Ala His Met Asp Tyr Glu Asp Ala Met <210> 74 <211> 203 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 74 Gly Ile Ser Asp Val Ser Leu Lys Thr Arg Pro Pro Gly Gly Trp Val Met Glu Gly Asp Lys Leu Val Leu Ile Cys Ser Val Asp Arg Val Thr Gly Asn Ile Thr Tyr'Phe Trp Tyr Arg Gly Ala Leu Gly Phe Gln Leu Glu Thr Lys Thr Gln Pro Ser Leu Thr Ala Glu Phe Glu Ile Ser Asp Met Lys Gln 5er Asp Ala Asp Gln Tyr Tyr Cys Ala Ala Asn Asp Gly His Asp Pro Ile Ala Ser Glu Leu Val Ser Ile His Val Arg Val Pro Val Ser Arg Pro Val Leu Thr Phe Gly Asp Ser Gly Thr Gln Ala Val Leu Gly Asp Leu Val Glu Leu His Cys Lys Ala Leu Arg Gly Ser Pro Pro Ile Phe Tyr Gln Phe Tyr His Glu Ser Ile Ile Leu Gly Asn Ser Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Phe Ser Leu Thr Ala Glu His 5er Gly Asn Phe Ser Cys Glu Ala Ser Asn Gly Gln Gly Ala Gln Arg Ser Glu Val Val Ala Leu Asn Leu Thr Gly Leu Ser Leu Val Pro Thr Glu Asn Gly Ile Ser His Leu Ser Leu <210> 75 <211> 17 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 75 Met Leu Pro Trp Leu Leu Leu Leu Ile Cys Ala Leu Pro Cys Glu Pro Ala <210> 76 <211> 100 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 76 Lys Arg Lys Ile Gly Arg Gln Ser Glu Asp Pro Val Arg Ser Pro Pro Gln Thr Val Leu G1n Gly Ser Thr Tyr Pro Lys Ser Pro Asp Ser Arg Gln Pro G1u Pro Leu Tyr Glu Asn Val Asn Val Val Ser Gly Asn Glu Val Tyr Ser Leu Val Tyr His Thr Pro Gln Val Leu Glu Pro Ala Ala Ala Gln His Val Arg Thr His Gly Val Ser Glu Ser Phe Gln Val Ser Ser Gly Leu Tyr Ser Lys Pro Arg Tle Asn Ile Ala His Met Asp Tyr Glu Asp Ala Met <210> 77 <211> 283 <2l2> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 77 Gly Ile Ser Asp Val Ser Leu Lys Thr Arg Pro Pro Gly Gly Trp Val Met Glu Gly Asp Lys Leu Val Leu Tle Cys Ser Val Asp Arg Val Thr Gly Asn Ile Thr Tyr Phe Trp Tyr Arg Gly Ala Leu Gly Phe G1n Leu Glu Thr Lys Thr Gln Pro Ser Leu Thr Ala Glu Phe Glu Ile Ser Asp Met Lys Gln Ser Asp Ala Asp Gln Tyr Tyr Cys Ala Ala Asn Asp Gly His Asp Pro Ile Ala Ser Glu Leu Val Ser Ile His Val Arg Val Pro Val 5er Arg Pro Val Leu Thr Phe Gly Asp Ser Gly Thr Gln A1a Va1 Leu Gly Asp Leu Val Glu Leu His Cys Lys A1a Leu Arg Gly Ser Pro Pro Ile Phe Tyr Gln Phe Tyr His Glu Ser Ile Ile Leu Gly Asn Ser l30 135 140 Ser Ala Pro Ser Gly Gly Gly Ala Ser Phe Asn Phe Ser Leu Thr Ala Glu His Ser Gly Asn Phe Ser Cys Glu Ala Ser Asn Gly Gln Gly Ala 165 ' 170 175 Gln Arg Ser Glu Val Val Ala Leu Asn Leu Thr Gly Arg Gln Ser Glu Asp Pro Val Arg Ser Pro Pro G1n Thr Val Leu Gln Gly Ser Thr Tyr Pro Lys Ser Pro Asp Ser Arg Gln Pro Glu Pro Leu Tyr Glu Asn Val Asn Val Val Ser Gly Asn Glu Va1 Tyr Ser Leu Va1 Tyr His Thr Pro Gln Val Leu Glu Pro A1a Ala Ala Gln His Val Arg Thr His Gly Val Ser Glu Ser Phe Gln Val Ser Ser Gly Leu Tyr Ser Lys Pro Arg Ile Asn Ile Ala His Met Asp Tyr Glu Asp Ala Met <210> 78 <211> 570 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 78 Gly Gln His Glu Ala Ala Gln Gln Ser Val Val Ser Leu Gln Pro Pro Trp Thr Thr Phe Phe Arg Gly Glu Val Val Thr Leu Thr Cys Tyr Arg Phe Gly Phe Ser Val Pro Gln Lys Thr Lys Trp Tyr Gln Lys Arg Lys Thr Val Lys Gln Thr Pro Gly Ala Leu Val Ile Lys Ala His Thr Leu Lys Val His Glu Ser Gly Glu Tyr Trp Cys Gln A1a Asp Ser Leu Leu Pro Ser Met His Val Asn Val Glu Phe Ser Glu Asp Phe Leu Val Leu Gln Ala Pro Pro Ala Val Phe Glu Gly Asp Ser Val Val Leu Arg Cys Tyr Ala Lys Lys Gly Ile Glu Ala Glu Thr Leu Thr Phe Tyr Lys Asp Gly Lys Ala Leu Thr Leu His His Gln Ser Glu Leu Ser Tle His His Ala Asn Leu Lys Asp Asn Gly Gln Tyr Lys Cys Thr Ser Lys Lys Lys Trp Ser Phe Gly Ser Leu Tyr Thr Ser Asn Thr Val Gly Val Gln Val Gln Glu Leu Phe Pro Arg Pro Val Leu Arg Ala Arg Pro Ser His Pro Ile Asp Gly Ser Pro Val Thr Leu Thr Cys Gln Thr Gln Leu Ser Ala Gln Lys Ser Asp A1a Arg Leu Gln Phe Cys Phe Phe Arg Asn Leu Gln Leu Leu Gly Ser Gly Cys Ser Arg Ser Ser Glu Phe His I1e Pro Ala Ile Trp Thr Glu Glu Ser Arg Arg Tyr Gln Cys Lys Ala Glu Thr Val Asn Ser Gln Val Arg Lys Gln Ser Thr Ala Phe Ile Ile Pro Val Gln Arg Ala Ser Ala Arg Phe G1n Thr His Ile Ile Pro Ala Ser Lys Leu Val Phe Glu Gly Gln Leu Leu Leu Leu Asn Cys Ser Val Lys Gly Val 2g0 295 300 Pro Gly Pro Leu Lys Phe 5er Trp Tyr Lys Lys Asp Met Leu Asn Glu Glu Thr Lys Ile Leu Lys Ser Ser Asn Ala Glu Phe Lys Ile Ser Gln 325 330 . 335 Val Asn Ile Ser Asp Ala Gly Glu Tyr His Cys Glu Ala Thr Asn Ser Arg Arg Ser Phe Val Ser Arg Ala Phe Pro Ile Thr Ile Lys Val Pro Val Ser G1n Pro Val Leu Thr Leu Ser Thr Gly Lys Thr G1n Ala Leu Glu Gly Asp Leu Met Thr Leu His Cys G1n 5er Gln Arg Gly Ser Pro Cys Ile Leu Tyr Glu Phe Phe Tyr Glu Asn Val Ser Leu G1y Asn Ser Ser Ile Leu Ser Gly Gly Gly Ala Tyr Phe Asn Phe Ser Met Ser Thr Glu Arg Ser Gly Asn Tyr Tyr Cys Thr Ala Asp Asn Gly Leu Gly Ala Gln Cys Ser Glu Ala Tle Arg Ile Ser Ile Phe Asp Met Thr Lys Asn Arg Ser Val Pro Met Ala Ala Gly Ile Thr Val Gly Leu Leu Ile Met Ala Val Gly Val Phe Leu Phe Tyr Cys Trp Phe Ser Arg Lys Ala Gly Gly Lys Pro Thr Ser Asp Asp Ser Arg Asn Pro Ser Asp Ser Glu Pro Gln Glu Pro Thr Tyr Tyr Asn Val Pro Ala Cys Ile Glu Leu Gln Pro Val Tyr Ser Asn Glu Pro Glu Glu Asn Val Ile Tyr Thr Glu Val Arg Arg Thr Gln Pro Arg Gln Lys His Ala Asp Gln Glu Ser Glu Ser Pro Arg Ser Arg Cys Gln Met Ala Glu Lys Lys <210> 79 <211> 25 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 79 Met Ser Gly Ser Phe Ser Pro Cys Val Val Phe Thr G1n Met Trp Leu Thr Leu Leu Val Val Thr Pro Val Asn <210> 80 <211> 468 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 80 Gly Gln His Glu Ala Ala Gln Gln Ser Val Val Ser Leu Gln Pro Pro Trp Thr Thr Phe Phe Arg Gly Glu Val Val Thr Leu Thr Cys Tyr Arg Phe Gly Phe Ser Val Pro Gln Lys Thr Lys Trp Tyr Gln Lys Arg Lys Thr Val Lys Gln Thr Pro Gly Ala Leu Val Ile Lys A1a His Thr Leu Lys Val His Glu Ser Gly Glu Tyr Trp Cys Gln Ala Asp Ser Leu Leu Pro Ser Met His Val Asn Val Glu Phe Ser Glu Asp Phe Leu Val Leu Gln Ala Pro Pro Ala Val Phe Glu Gly Asp Ser Val Val Leu Arg Cys Tyr Ala Lys Lys Gly Ile Glu Ala Glu Thr Leu Thr Phe Tyr Lys Asp Gly Lys Ala Leu Thr Leu His His Gln Ser Glu Leu Ser Ile His His Ala Asn Leu Lys Asp Asn Gly Gln Tyr Lys Cys Thr Ser Lys Lys Lys Trp Ser Phe Gly 5er Leu Tyr Thr Ser Asn Thr Va1 Gly Val Gln Val Gln Glu Leu Phe Pro Arg Pro Val Leu Arg Ala Arg Pro Ser His Pro Ile Asp Gly Ser Pro Val Thr Leu Thr Cys Gln Thr Gln Leu Ser Ala Gln Lys Ser Asp Ala Arg Leu Gln Phe Cys Phe Phe Arg Asn Leu Gln Leu Leu Gly Ser Gly Cys Ser Arg Ser Ser Glu Phe His Ile Pro Ala Ile Trp Thr Glu G1u 5er Arg Arg Tyr Gln Cys Lys Ala Glu Thr Val Asn Ser Gln Val Arg Lys Gln Ser Thr Ala Phe Ile Ile Pro Val Gln Arg Ala Ser Ala Arg Phe Gln Thr His Ile Ile Pro Ala 5er Lys Leu Val Phe Glu Gly Gln Leu Leu Leu Leu Asn Cys Ser Val Lys Gly Val Pro Gly Pro Leu Lys Phe Ser Trp Tyr Lys Lys Asp Met Leu Asn Glu Glu Thr Lys Ile Leu Lys Ser Ser Asn Ala Glu Phe Lys I1e Ser Gln Val Asn Ile Ser Asp Ala Gly Glu Tyr His Cys Glu Ala Thr Asn Ser Arg Arg Ser Phe Val Ser Arg Ala Phe Pro Ile Thr Ile Lys Val Pro Val Ser Gln Pro Val Leu Thr Leu Ser Thr Gly Lys Thr Gln Ala Leu Glu Gly Asp Leu Met Thr Leu His Cys Gln Ser Gln Arg Gly Ser Pro Cys Ile Leu Tyr G1u Phe Phe Tyr Glu Asn Val Ser Leu Gly Asn Ser Ser Ile Leu Ser Gly Gly Gly Ala Tyr Phe Asn Phe Ser Met Ser Thr Glu Arg Ser Gly Asn Tyr Tyr Cys Thr Ala Asp Asn Gly Leu Gly Ala Gln Cys Ser Glu Ala I1e Arg Ile Ser Ile Phe Asp Met Thr Lys Asn Arg 5er Val Pro <210> 81 <211> 79 <212> PRT
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 81 Ser Arg Lys Ala Gly Gly Lys Pro Thr Ser Asp Asp Ser Arg Asn Pro Ser Asp Ser Glu Pro Gln Glu Pro Thr Tyr Tyr Asn Val Pro Ala Cys Ile Glu Leu Gln Pro Val Tyr Ser Asn Glu Pro Glu Glu Asn Val Ile Tyr Thr G1u Val Arg Arg Thr Gln Pro Arg Gln Lys His Ala Asp Gln Glu Ser Glu 5er Pro Arg Ser Arg Cys Gln Met Ala Glu Lys Lys <210> 82 <211> 1973 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
82 ctatcccagatccgtggtccatctgcccta ctgcacctgt60 ccacagtgtt aggacttgag ctcaaagggagctacttgcctctagtctcatgcctctgtgcttgctgcttctggtcttcg120 ctcctgtcggagtccagtccgactggttgagcatcagccttccacaccgttcttatgaag180 gagaccaagtagttataagctgcacaggaaaaaataatggtgacataaagagactgaagt240 acttcaaggatggatatcacatagaaacttacagcagtgcttcaagctacaccattagga300 atgcaagacgtggtgacagtggctcctattcctgtaaggcagataggaaatttttcctat360 ttatagacacaacagaagaaacaggatctaagtggctgaatgtccaagagctgtttccag420 cacctgggctgacagccagccccctgcagcccgtagaggggagttcagtgaccctgtcct480 gcaacacctggctcccttcagatagggcaacgacccagctacgctattccttcttcaaag540 atggccacactttgcaatcgggctggacctcatcaaaatttaccatctcagcaatatcga600 aggaagactcaggaaattactggtgtgaagcaatgactgcctctcgcagtgtctcaaagc660 agagtcaccggtcctacatagatgtagagaggatccctgtatctcaagtcaccatggaaa720 tccagccttcaaggggctggggagttgaaggggagccactggtcgttgaaggggagcccc780 tggtcctggcttgttctgtggctaaaggcaccgggctaatcacgttctoctggcataggc840 aggacactaaggaaagtgtggggaagaaaagtcagcgttcccagagagtggagctggaga900 tccctactatcagggaaggccatgctggggggtactactgcacagcagacaacaactacg960 gcctgatccagagcgcaatcgtgaacatcaccgtgaaaattccagtgttgaacccgctcc1020 tctccatcagtgttcctggggtcttgcccttcatcggagatgtggcggagcttcactgtg1080 aagacaagagagcatctcctccggttctctactggttttatcatgaaaatatcactctgg1140 ctaacacctcggcaccttttggaggaaaggcatcctttaagctctctctgactgcagggc1200 attctgggaactactcttgtgaggctgaaaacgcctggggtaccaagcgcagtgaggtgg1260 taacgctcaatgtcacagagcccccacccaaagtgcgtttggtgaatggcccccaccact1320 gtgaaggacgcgtagaggtggagcaggaaggtcgctggggcactgtatgtgatgatggct1380 gggacatgagggatgtggctgtggtgtgccgagagctgggctgtggagcagcccaacaca1440 cacctatagccatgctgtatccaccagcagttgatgaagctctgcctgtgctcattcagg1500 tagccctgtgcaatggcacagaaaagaccctggctgaatgtgaccaggttgaggcctttg1560 attgtggacatgatgaggatgctggagctgtgtgtgaagtcttacccagcactttctgaa1620 gatctagagaccagagaccatcagacctcctactttctgcactgggcctcacagccctca1680 cggtctgcagctcccagtggacttccagacttcagctgtggcttatccttcaagaggact1740 cgaaactatattaatctgctctgagataatgttccaacagctccaaagaaagcccgagtc1800 ccttgtccccagaggccaagcttggaaaaattgttcccctgtccaggttccctgcctttc1860 tagttccttcttgctatctccttgggcagatgcagaggtggcacaagtaaggatcacata1920 catgtgcctgggcttccatctggtagaatgtggtctaacaaagcacatac 1973 aac <210> 83 <211> 1530 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 83 atgcctctgtgcttgctgcttctggtcttcgctcctgtcggagtccagtccgactggttg60 agcatcagccttccacaccgttcttatgaaggagaccaagtagttataagctgcacagga120 aaaaataatggtgacataaagagactgaagtacttcaaggatggatatcacatagaaact180 tacagcagtgcttcaagctacaccattaggaatgcaagacgtggtgacagtggctcctat240 tcctgtaaggcagataggaaatttttcctatttatagacacaacagaagaaacaggatct300 aagtggctgaatgtccaagagctgtttccagcacctgggctgacagccagccccctgcag360 cccgtagaggggagttcagtgaccctgtcctgcaacacctggctcccttcagatagggca420 acgacccagctacgctattccttcttcaaagatggccacactttgcaatcgggctggacc480 tcatcaaaatttaccatctcagcaatatcgaaggaagactcaggaaattactggtgtgaa540 gcaatgactgcctctcgcagtgtctcaaagcagagtcaccggtcctacatagatgtagag600 aggatccctgtatctcaagtcaccatggaaatccagccttcaaggggctggggagttgaa660 ggggagccactggtcgttgaaggggagcccctggtcctggcttgttctgtggctaaaggc720 accgggctaatcacgttctcctggcataggcaggacactaaggaaagtgtggggaagaaa780 agtcagcgttcccagagagtggagctggagatccctactatcagggaaggccatgctggg840 gggtactactgcacagcagacaacaactacggcctgatccagagcgcaatcgtgaacatc900 accgtgaaaattccagtgttgaacccgctcctctccatcagtgttcctggggtcttgccc960 ttcatcggagatgtggcggagcttcactgtgaagacaagagagcatctcctccggttctc1020 tactggttttatcatgaaaatatcactctggctaacacctcggcaccttttggaggaaag1080 gcatcctttaagctctctctgactgcagggcattctgggaactactcttgtgaggctgaa1140 aacgcctggggtaccaagcgcagtgaggtggtaacgctcaatgtcacagagcccccaccc1200 aaagtgcgtttggtgaatggcccccaccactgtgaaggacgcgtagaggtggagcaggaa1260 ggtcgctggggcactgtatgtgatgatggctgggacatgagggatgtggctgtggtgtgc1320 cgagagctgggctgtggagcagcccaacacacacctatagccatgctgtatccaccagca1380 gttgatgaagctctgcctgtgctcattcaggtagccctgtgcaatggcacagaaaagacc1440 ctggctgaatgtgaccaggttgaggcctttgattgtggacatgatgaggatgctggagct1500 gtgtgtgaagtcttacccagcactttctga 1530 <210>
<211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223> iption equence:/note =
Descr of Artificial S
synth etic construct <400>
84 ctatcccagatccgtggtccatctgccctaaggacttgagctgcacctgt60 ccacagtgtt ctcaaagggagctacttgcctctagtctcatgcctctgtgcttgctgcttctggtcttcg120 ctcctgtcggagtccagtccgactggttgagcatcagccttccacaccgttcttatgaag180 gagaccaagtagttataagctgcacaggaaaaaataatggtgacataaagagactgaagt240 acttcaaggatggatatcacatagaaacttacagcagtgcttcaagctacaccattagga300 atgcaagacgtggtgacagtggctcctattcctgtaaggcagataggaaatttttcctat360 ttatagacacaacagaagaaacaggatctaagtggctgaatgtccaagagctgtttccag420 cacctgggctgacagccagccccctgcagcccgtagaggggagttcagtgaccctgtcct480 gcaacacctggctcccttcagatagggcaacgacccagctacgctattccttcttcaaag540 atggccacactttgcaatcgggctggacctcatcaaaatttaccatctcagcaatatcga600 aggaagactcaggaaattactggtgtgaagcaatgactgcctctcgcagtgtctcaaagc660 agagtcaccggtcctacatagatgtagagaggatccctgtatctcaagtcaccatggaaa720 tccagccttcaaggggctggggagttgaaggggagccactggtcgttgaaggggagcccc780 tggtcctggcttgttctgtggctaaaggcaccgggctaatcacgttctcctggcataggc840 aggacactaaggaaagtgtggggaagaaaagtcagcgttcccagagagtggagctggaga900 tccctactatcagggaaggccatgctggggggtactactgcacagcagacaacaactacg960 gcctgatccagagcgcaatcgtgaacatcaccgtgaaaattccagtgttgaacccgctcc1020 tctccatcagtgttcctggggtcttgcccttcatcggagatgtggcggagcttcactgtg1080 aagacaagagagcatctcctccggttctctactggttttatcatgaaaatatcactctgg1140 ctaacacctcggcaccttttggaggaaaggcatcctttaagctctctctgactgcagggc1200 attctgggaactactcttgtgaggctgaaaacgcctggggtaccaagcgcagtgaggtgg1260 taacgctcaatgtcacaggtaggacaatttaatgatccattccagggtgcaacttgcctt1320 ctggccatgcccttcttctctcccttgcacctgtacctcttggtctttgaa 1371 <210> 85 <211> 1203 <212> DNA
<213> Artificial Sequence <220>
<223>
Description of Artificial 5equence:/note =
synth etic construct <400>
85 gcttgctgcttctggtcttcgctcctgtcggagtccagtccgactggttg60 atgcctctgt agcatcagccttccacaccgttcttatgaaggagaccaagtagttataagctgcacagga120 aaaaataatggtgacataaagagactgaagtacttcaaggatggatatcacatagaaact180 tacagcagtgcttcaagctacaccattaggaatgcaagacgtggtgacagtggctcctat240 tcctgtaaggcagataggaaatttttcctatttatagacacaacagaagaaacaggatct300 aagtggctgaatgtccaagagctgtttccagcacctgggctgacagccagccccctgcag360 cccgtagaggggagttcagtgaccctgtcctgcaacacctggctcccttcagatagggca420 acgacccagctacgctattccttcttcaaagatggccacactttgcaatcgggctggacc480 tcatcaaaatttaccatctcagcaatatcgaaggaagactcaggaaattactggtgtgaa540 gcaatgactgcctctcgcagtgtctcaaagcagagtcaccggtcctacatagatgtagag600 aggatccctgtatctcaagtcaccatggaaatccagccttcaaggggctggggagttgaa660 ggggagccactggtcgttgaaggggagcccctggtcctggcttgttctgtggctaaaggc720 accgggctaatcacgttctcctggcataggcaggacactaaggaaagtgtggggaagaaa780 agtcagcgttcccagagagtggagctggagatccctactatcagggaaggccatgctggg840 gggtactactgcacagcagacaacaactacggcctgatccagagcgcaatcgtgaacatc900 accgtgaaaattccagtgttgaacccgctcctctccatcagtgttcctggggtcttgccc960 ttcatcggagatgtggcggagcttcactgtgaagacaagagagcatctcctccggttctc1020 tactggttttatcatgaaaatatcactctggctaacacctcggcaccttttggaggaaag1080 gcatcctttaagctctctctgactgcagggcattctgggaactactcttgtgaggctgaa1140 aacgcctggggtaccaagcgoagtgaggtggtaacgctcaatgtcacaggtaggacaatt1200 taa <210> 86 <211> 1479 <212> DNA
<213> Artificial Sequence <220>
<223> Description rtificialequence:/note =
of A S
synth etic construct <400> 86 gactggttgagcatcagccttccacaccgttcttatgaaggagaccaagtagttataagc60 tgcacaggaaaaaataatggtgacataaagagactgaagtacttcaaggatggatatcac120 atagaaacttacagcagtgcttcaagctacaccattaggaatgcaagacgtggtgacagt180 ggctcctattcctgtaaggcagataggaaatttttcctatttatagacacaacagaagaa240 acaggatctaagtggctgaatgtccaagagctgtttccagcacctgggctgacagccagc300 cccctgcagcccgtagaggggagttcagtgaccctgtcctgcaacacctggctcccttca360 gatagggcaacgacccagctacgctattccttcttcaaagatggccacactttgcaatcg420 ggctggacctcatcaaaatttaccatctcagcaatatcgaaggaagactcaggaaattac480 tggtgtgaagcaatgactgcctctcgcagtgtctcaaagcagagtcaccggtcctacata540 gatgtagagaggatccctgtatctcaagtcaccatggaaatccagccttcaaggggctgg600 ggagttgaaggggagccactggtcgttgaaggggagcccctggtcctggcttgttctgtg660 gctaaaggcaccgggctaatcacgttctcctggcataggcaggacactaaggaaagtgtg720 gggaagaaaagtcagcgttcccagagagtggagctggagatccctactatcagggaaggc780 catgctggggggtactactgcacagcagacaacaactacggcctgatccagagcgcaatc840 gtgaacatcaccgtgaaaattccagtgttgaacccgctcctctccatcagtgttcctggg900 gtcttgcccttcatcggagatgtggcggagcttcactgtgaagacaagagagcatctcct960 ccggttctctactggttttatcatgaaaatatcactctggctaacacctcggcacctttt1020 ' ggaggaaaggcatcctttaagctctctctgactgcagggcattctgggaactactcttgt1080 gaggctgaaaacgcctggggtaccaagcgcagtgaggtggtaacgctcaatgtcacagag1140 cccccacccaaagtgcgtttggtgaatggcccccaccactgtgaaggacgcgtagaggtg1200 aag gtcgctggggcactgtatgtgatgatggctgggacatgagggatgtggct1260 ca a gg gagagctgggctgtggagcagcccaacacacacctatagccatgctgtat1320 g g gtggtgtgcc ccaccagcagttgatgaagctctgcctgtgctcattcaggtagccctgtgcaatggcaca1380 accc tggctgaatgtgaccaggttgaggcctttgattgtggacatgatgaggat1440 aaaa g tgtgtgaagtcttacccagcactttctga 1479 g gctggagctg <210>
<211>
<212>
DNA
<213>
Artificial Sequence <220>
<223>
Description of Artificial Sequence:/note =
synth etic construct <400>
87 gcatcagccttccacaccgttcttatgaaggagaccaagtagttataagc60 gactggttga tgcacaggaaaaaataatggtgacataaagagactgaagtacttcaaggatggatatcac120 atagaaacttacagcagtgcttcaagctacaccattaggaatgcaagacgtggtgacagt180 ggctcctattcctgtaaggcagataggaaatttttcctatttatagacacaacagaagaa240 acaggatctaagtggctgaatgtccaagagctgtttccagcacctgggctgacagccagc300 cccctgcagcccgtagaggggagttcagtgaccctgtcctgcaacacctggctcccttca360 gatagggcaacgacccagctacgctattccttcttcaaagatggccacactttgcaatcg420 ggctggacctcatcaaaatttaccatctcagcaatatcgaaggaagactcaggaaattac480 tggtgtgaagcaatgactgcctctcgcagtgtctcaaagcagagtcaccggtcctacata540 gatgtagagaggatccctgtatctcaagtcaccatggaaatccagccttcaaggggctgg600 ggagttgaaggggagccactggtcgttgaaggggagcccctggtcctggcttgttctgtg660 gctaaaggcaccgggctaatcacgttctcctggcataggcaggacactaaggaaagtgtg720 gggaagaaaagtcagcgttcccagagagtggagctggagatccctactatcagggaaggc780 catgctggggggtactactgcacagcagacaacaactacggcctgatccagagcgcaatc840 gtgaacatcaccgtgaaaattccagtgttgaacccgctcctctccatcagtgttcctggg900 gtcttgcccttcatcggagatgtggcggagcttcactgtgaagacaagagagcatctcct960 ccggttctctactggttttatcatgaaaatatcactctggctaacacctcggcacctttt1020 ggaggaaaggcatcctttaagctctctctgactgcagggcattctgggaactactcttgt1080 gaggctgaaaacgcctggggtaccaagcgcagtgaggtggtaacgctcaatgtcacaggt1140 aggacaatttas <210>
<211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223> iption equence:/note =
Descr of Artificial S
synth etic construct <400>
88 ggttttgcttttttttcttttggtgagaggtaccttcaagttaccatccc60 tgagtagtct cagcctggtcctcatgctaccttggctcctgctactgatctgtgctctaccgtgtgaacc120 tgctggaatctctgatgtgagcttgaagacacggcccccaggaggatgggtgatggaggg180 agacaagctggtcctcatctgctcggttgatagagtcactgggaatataacttacttctg240 gtacagaggggccctgggtttccaactggaaacaaagacacaaccttcactaacagcaga300 gtttgagatcagtgacatgaagcagagcgatgctgatcaatattactgtgcggctaacga360 tggccacgaccctatcgccagtgagctggtgagcatccacgtcagagttccagtgtctcg420 ccctgtccttacgtttggggactctggaacccaggctgtgctaggggacctggtggagct480 tcactgtaaggccctgagaggctcacccccaatcttctaccagttttatcatgagagcat540 catcctggggaacagttcagcaccctctggaggaggagcatccttcaacttctccctgac600 tgcagaacattctggaaacttctcctgtgaggccagcaatggacagggtgcccaacgaag660 tgaggtggtggctctcaacttaacaggtctctccttagtgcctactgagaatggaatcag720 ccatctctccttaggactcactgggtggctgcttggctgtcttagccccatcaccatggc780 cttaatattttgctactggctcaagagaaaaataggaagacagtcagaggatccagtcag840 gagccctcctcagactgtgctccaaggatccacgtaccccaaatcccccgactcaaggca900 gccagagcccctgtatgagaacgtgaacgttgtaagtggcaatgaagtgtactctctggt960 gtaccacaccccgcaggtgctggaaccagcagcagctcagcatgtgaggacacacggagt1020 aagtgagtcctttcaggtctcctctggactctattctaagccaaggataaacattgcaca1080 tatggactatgaagacgccatgtagaattatgtaaacagcaactatggagtgctacatac1140 aagcccaaggcctgatgtggcctccaaggatactggggacagggatagcttgccagccca1200 atttccccacacactgcggttcattagatgagtccttcacctaccctgtgtgaagctgga1260 gcaagtcctgcagaaaccacccaggaaaaccaacttagacggagaagccagaagcatttg1320 catctggttgttgcccattcatgttggcacacgaacttttatttacaggaggaaaatggt1380 gtgatgaaagcaactaaggtcttacagcagagggacaatgcgactcagagagcacaaagc1440 cgagatcaatggctttgcaggtctgctgtggagacagagccatgcttcctctgtgcacat1500 accctagagtacttctgagtcactgccatcaacttagaattaaacacagttgcataaaat1560 gtactgt <210>
<211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223>
Description of Artificial Sequence:/note =
synth etic construct <400>
89 ggctcctgctactgatctgtgctctaccgtgtgaacctgctggaatctct 60 atgctacctt gatgtgagcttgaagacacggcccccaggaggatgggtgatggagggagacaagctggtc 120 ctcatctgctcggttgatagagtcactgggaatataacttacttctggtacagaggggcc 180 ctgggtttccaactggaaacaaagacacaaccttcactaacagcagagtttgagatcagt 240 gacatgaagcagagcgatgctgatcaatattactgtgcggctaacgatggccacgaccct 300 atcgccagtgagctggtgagcatccacgtcagagttccagtgtctcgccctgtccttacg 360 tttggggactctggaacccaggctgtgctaggggacctggtggagcttcactgtaaggcc 420 ctgagaggctcacccccaatcttctaccagttttatcatgagagcatcatcctggggaac 480 agttcagcaccctctggaggaggagcatccttcaacttctccctgactgcagaacattct 540 ggaaacttctcctgtgaggccagcaatggacagggtgcccaacgaagtgaggtggtggct 600 ctcaacttaacaggtctctccttagtgcctactgagaatggaatcagccatctctcctta 660 ggactcactgggtggctgcttggctgtcttagccccatcaccatggccttaatattttgc 720 tactggctcaagagaaaaataggaagacagtcagaggatccagtcaggagccctcctcag 780 actgtgctccaaggatccacgtaccccaaatcccccgactcaaggcagccagagcccctg 840 tatgagaacgtgaacgttgtaagtggcaatgaagtgtactctctggtgtaccacaccccg 900 caggtgctggaaccagcagcagctcagcatgtgaggacacacggagtaagtgagtccttt 960 caggtctcctctggactctattctaagccaaggataaacattgcacatatggactatgaa 1020 gacgccatgtag <210>
<211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 90 ggaatctctgatgtgagcttgaagacacggcccccaggaggatgggtgatggagggagac60 aagctggtcctcatctgctcggttgatagagtcactgggaatataacttacttctggtac120 agaggggccctgggtttccaactggaaacaaagacacaaccttcactaacagcagagttt180 gagatcagtgacatgaagcagagcgatgctgatcaatattactgtgcggctaacgatggc240 cacgaccctatcgccagtgagctggtgagcatccacgtcagagttccagtgtctcgccct300 gtccttacgtttggggactctggaacccaggctgtgctaggggacctggtggagcttcac360 tgtaaggccctgagaggctcacccccaatcttctaccagttttatcatgagagcatcatc420 ctggggaacagttcagcaccctctggaggaggagcatccttcaacttctccctgactgca 480 gaacattctggaaacttctcctgtgaggccagcaatggacagggtgcccaacgaagtgag 540 gtggtggctctcaacttaacaggtctctccttagtgcctactgagaatggaatcagccat 600 ctctccttaggactcactgggtggctgcttggctgtcttagccccatcaccatggcctta 660 atattttgctactggctcaagagaaaaataggaagacagtcagaggatccagtcaggagc 720 cctcctcagactgtgctccaaggatccacgtaccccaaatcccccgactcaaggcagcca 780 gagcccctgtatgagaacgtgaacgttgtaagtggcaatgaagtgtactctctggtgtac 840 cacaccccgcaggtgctggaaccagcagcagctcagcatgtgaggacacacggagtaagt 900 gagtcctttcaggtctcctctggactctattctaagccaaggataaacattgcacatatg 960 gactatgaagacgccatgtag 981 <210>
9l <211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223>
Description of Artificial Sequence:/note =
synth etic construct <400>
91 ggctcctgctactgatctgtgctctaccgtgtgaacctgctggaatctct60 atgctacctt gatgtgagcttgaagacacggcccccaggaggatgggtgatggagggagacaagctggtc120 ctcatctgctcggttgatagagtcactgggaatataacttacttctggtacagaggggcc180 ctgggtttccaactggaaacaaagacacaaccttcactaacagcagagtttgagatcagt240 gacatgaagcagagcgatgctgatcaatattactgtgcggctaacgatggccacgaccct300 atcgccagtgagctggtgagcatccacgtcagagttccagtgtctcgccctgtccttacg360 tttggggactctggaacccaggctgtgctaggggacctggtggagcttcactgtaaggcc420 ctgagaggctcacccccaatcttctaccagttttatcatgagagcatcatcctggggaac480 agttcagcaccctctggaggaggagcatccttcaacttctccctgactgcagaacattct540 ggaaacttctcctgtgaggccagcaatggacagggtgcccaacgaagtgaggtggtggct600 ctcaacttaacaggtctctccttagtgcctactgagaatggaatcagccatctctcctta660 <210>
<211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223>
Description of Artificial Sequence:/note =
synth etic construct <400>
92 atgtgagcttgaagacacggcccccaggaggatgggtgatggagggagac60 ggaatctctg aagctggtcctcatctgctcggttgatagagtcactgggaatataacttacttctggtac120 agaggggccctgggtttccaactggaaacaaagacacaaccttcactaacagcagagttt180 gagatcagtgacatgaagcagagcgatgctgatcaatattactgtgcggctaacgatggc240 cacgaccctatcgccagtgagctggtgagcatccacgtcagagttccagtgtctcgccct300 gtccttacgtttggggactctggaacccaggctgtgctaggggacctggtggagcttcac360 tgtaaggccctgagaggctcacccccaatcttctaccagttttatcatgagagcatcatc420 ctggggaacagttcagcaccctctggaggaggagcatccttcaacttctccctgactgca480 gaacattctggaaacttctcctgtgaggccagcaatggacagggtgcccaacgaagtgag540 gtggtggctctcaacttaacaggtctctccttagtgcctactgagaatggaatcagccat600 ctctcctta <210> 93 <211> 303 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 93 aaaaa taggaagaca gtcagaggatccagtcaggagccctcctcagactgtgctc60 a a g tcaaggcagccagagcccctgtatgagaac120 g a caaggatcca cgtaccccaa atcccccgac gtgaacgttg taagtggcaa tctctggtgtaccacaccccgcaggtgctg180 tgaagtgtac gaaccagcag cagctcagca cacggagtaagtgagtcctttcaggtctcc240 tgtgaggaca tctggactct attctaagcc attgcacatatggactatgaagacgccatg300 aaggataaac tag <210> 94 <211> 1567 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
94 ggttttgcttttttttcttttggtgagaggtaccttcaagttaccatccc60 tgagtagtct cagcctggtcctcatgctaccttggctcctgctactgatctgtgctctaccgtgtgaacc120 tgctggaatctctgatgtgagcttgaagacacggcccccaggaggatgggtgatggaggg180 agacaagctggtcctcatctgctcggttgatagagtcactgggaatataacttacttctg240 gtacagaggggccctgggtttccaactggaaacaaagacacaaccttcactaacagcaga300 gtttgagatcagtgacatgaagcagagcgatgctgatcaatattactgtgcggctaacga360 tggccacgaccctatcgccagtgagctggtgagcatccacgtcagagttccagtgtctcg420 ccctgtccttacgtttggggactctggaacccaggctgtgctaggggacctggtggagct480 tcactgtaaggccctgagaggctcacccccaatcttctaccagttttatcatgagagcat540 catcctggggaacagttcagcaccctctggaggaggagcatccttcaacttctccctgac600 tgcagaacattctggaaacttctcctgtgaggccagcaatggacagggtgcccaacgaag660 tgaggtggtggctctcaacttaacaggtctctccttagtgcctactgagaatggaatcag720 ccatctctccttaggactcactgggtggctgcttggctgtcttagccccatcaccatggc780 cttaatattttgctactggctcaagagaaaaataggaagacagtcagaggatccagtcag840 gagccctcctcagactgtgctccaaggatccacgtaccccaaatcccccgactcaaggca900 gccagagcccctgtatgagaacgtgaacgttgtaagtggcaatgaagtgtactctctggt960 gtaccacaccccgcaggtgctggaaccagcagcagctcagcatgtgaggacacacggagt1020 aagtgagtcctttcaggtctcctctggactctattctaagccaaggataaacattgcaca1080 tatggactatgaagacgccatgtagaattatgtaaacagcaactatggagtgctacatac1140 aagcccaaggcctgatgtggcctccaaggatactggggacagggatagcttgccagccca1200 atttccccacacactgcggttcattagatgagtccttcacctaccctgtgtgaagctgga1260 gcaagtcctgcagaaaccacccaggaaaaccaacttagacggagaagccagaagcatttg1320 catctggttgttgcccattcatgttggcacacgaacttttatttacaggaggaaaatggt1380 gtgatgaaagcaactaaggtcttacagcagagggacaatgcgactcagagagcacaaagc1440 cgagatcaatggctttgcaggtctgctgtggagacagagccatgcttcctctgtgcacat1500 accctagagtacttctgagtcactgccatcaacttagaattaaacacagttgcataaaat1560 gtactgt <210> 95 <211> 903 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 95 atgctacctt ggctcctgct actgatctgt gctctaccgt gtgaacctgc tggaatctct 60 gatgtgagct tgaagacacg gcccccagga ggatgggtga tggagggaga caagctggtc 120 ctcatctgctcggttgatagagtcactgggaatataacttacttctggtacagaggggcc180 ctgggtttccaactggaaacaaagacacaaccttcactaacagcagagtttgagatcagt240 gacatgaagcagagcgatgctgatcaatattactgtgcggctaacgatggccacgaccct300 atcgccagtgagctggtgagcatccacgtcagagttccagtgtctcgccctgtccttacg360 tttggggactctggaacccaggctgtgctaggggacctggtggagcttcactgtaaggcc920 ctgagaggctcacccccaatcttctaccagttttatcatgagagcatcatcctggggaac480 agttcagcaccctctggaggaggagcatccttcaacttctccctgactgcagaacattct540 ggaaacttctcctgtgaggccagcaatggacagggtgcccaacgaagtgaggtggtggct600 ctcaacttaacaggaagacagtcagaggatccagtcaggagccctcctcagactgtgctc660 caaggatccacgtaccccaaatcccccgactcaaggcagccagagcccctgtatgagaac720 gtgaacgttgtaagtggcaatgaagtgtactctctggtgtaccacaccccgcaggtgctg780 gaaccagcagcagctcagcatgtgaggacacacggagtaagtgagtcctttcaggtctcc840 tctggactctattctaagccaaggataaacattgcacatatggactatgaagacgccatg900 tag <2l0> 96 <211> 852 <212> DNA
<213> Artificial Sequence <220>
<223> equence:/note =
Description of Artificial S
synthetic construct <400>
96 atgtgagcttgaagacacggcccccaggaggatgggtgatggagggagac60 ggaatctctg aagctggtcctcatctgctcggttgatagagtcactgggaatataacttacttctggtac120 agaggggccctgggtttccaactggaaacaaagacacaaccttcactaacagcagagttt180 gagatcagtgacatgaagcagagcgatgctgatcaatattactgtgcggctaacgatggc240 cacgaccctatcgccagtgagctggtgagcatccacgtcagagttccagtgtctcgccct300 gtccttacgtttggggactctggaacccaggctgtgctaggggacctggtggagcttcac360 tgtaaggccctgagaggctcacccccaatcttctaccagttttatcatgagagcatcatc420 ctggggaacagttcagcaccctctggaggaggagcatccttcaacttctccctgactgca480 gaacattctggaaacttctcctgtgaggccagcaatggacagggtgcccaacgaagtgag540 gtggtggctctcaacttaacaggaagacagtcagaggatccagtcaggagccctcctcag600 actgtgctccaaggatccacgtaccccaaatcccccgactcaaggcagccagagcccctg660 tatgagaacgtgaacgttgtaagtggcaatgaagtgtactctctggtgtaccacaccccg720 caggtgctggaaccagcagcagctcagcatgtgaggacacacggagtaagtgagtccttt780 caggtctcctctggactctattctaagccaaggataaacattgcacatatggactatgaa840 gacgccatgtag <210>
<211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223>
Description of Artificial Sequence:/note =
synthetic construct <400>
97 cactcaactgttttagaagagcagttccccagatttctccttggagctgt60 attcaagtta gagtgactaccattgcgagcaagagcaagaggaaagcactacctgtgagcagatgtctgg120 ttcattctcaccctgtgtggtgttcacacagatgtggctgactctactggttgtgactcc180 tgtcaatggacagcatgaagctgcacagcagtctgtggtttcccttcagcctccatggac240 cactttctttcgaggagaggtcgtcacactgacttgttatagattcggcttctccgtacc300 ccagaaaacaaaatggtaccagaaaagaaaaacagtgaagcaaaccccaggtgctttggt360 aattaaagcacataccttaaaggtccatgagtccggagagtattggtgccaagccgacag420 cttacttccgagcatgcacgtgaacgtagagttttctgaagattttctggtgctgcaagc480 tccacctgctgtgtttgaaggagactctgtggttctgaggtgctacgcaaagaaaggcat540 agaagcagagaccctgacattttacaaggatggtaaagctctgacattacatcatcaaag600 tgagctctctattcatcatgcaaatctgaaggacaacggtcaatacaaatgcacttcgaa660 gaagaagtggtcttttgggtccctctatacttccaatacggtcggagttcaagtccaaga720 gttgttcccacggcctgtgctgagagccagaccctcccatcccatagatggaagtccagt780 gaccctgacgtgtcagacccagctctctgcacagaagtcagatgcccggctccagttctg840 tttcttcagaaacctccagcttctggggtcaggctgcagccgctcctcagagtttcacat900 tcctgccatatggactgaagagtcaaggagataccagtgcaaggcagaaacagtgaattc960 ccaagttagaaaacaaagtacagcgttcataatcccagtgcagagagcttctgcgagatt1020 ccaaacacacatcatcccagcctcaaagttggtgtttgaagggcagttgctgttactcaa1080 ctgctcagtaaaaggagtyccaggrcccctcaaattctcctggtataaaaaggacatgct1140 gaatgaagaaacaaagattcttaagtcctccaacgcagaattcaagatctcccaggtgaa1200 catcagtgacgcaggggagtatcactgtgaagctaccaacagccgccgaagctttgtcag1260 cagggcatttcccatcaccataaaagtcccagtatctcaaccagttctcaccctaagcac1320 aggcaagacccaggcccttgagggagacttgatgacacttcattgtcaatcccagagggg1380 ctctccatgtatcctgtatgaattcttctatgagaatgtctccctggggaatagctctat1440 actctctggaggaggagcatacttcaatttctctatgagcacagagcgatctggaaacta1500 ctactgcacagcagacaatggcctgggagcccagtgcagtgaagctataaggatctctat1560 ctttgacatgacaaagaacagaagtgttcctatggctgccggaatcactgtgggactgct1620 catcatggctgttggagtgtttctgttttattgctggttctctagaaaagcaggaggaaa1680 gcctacctctgatgactccagaaacccttcagattcagaaccccaggagcccacctatta1740 caacgtaccagcctgtatagaactgcagccagtgtacagcaatgagcctgaggaaaacgt1800 gatttacacagaagtacggagaactcaaccaagacagaaacatgcagatcaggagtctga1860 aagcccaagatcaaggtgccagatggctgagaaaaagtaggatatgtctcctccaagaac1920 agctccagaaaagaaacccgaagcttcgtcagtctaatctcaccgatgcttctactgggc1980 ctgcactttcctacccacggatggctccacagatcatggacagcaaggaaatggccaact2040 ctcctaagactgggccaacatccccatcttctctttggtttcccagagccacgccacccc2100 aaagtcagcaggaagttgcaaaagatcacaacgaccctattcctgttttgtaaccacccc2160 cagcctgaagcaggctgagccagaccttgaccttgctgccactaaggagattacctaggg2220 tggagcctgcctctctagatcactctattgttcagccactgccactgttctccttcaaga2280 cactgctacctgctgggaggccactgagctattccagagactacaccctatcctgcacat2340 catcacctgtagcctgttccaggctccaagaatgaattggcggcaatgggcctcccccct2400 accccctttataagtgcatttgccattaaacatttgggctttgatct 2447 <210> 98 <211> 1788 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
98 cattctcaccctgtgtggtgttcacacagatgtggctgactctactggtt60 atgtctggtt gtgactcctgtcaatggacagcatgaagctgcacagcagtctgtggtttcccttcagcct120 ccatggaccactttctttcgaggagaggtcgtcacactgacttgttatagattcggcttc180 tccgtaccccagaaaacaaaatggtaccagaaaagaaaaacagtgaagcaaaccccaggt240 gctttggtaattaaagcacataccttaaaggtccatgagtccggagagtattggtgccaa300 gccgacagcttacttccgagcatgcacgtgaacgtagagttttctgaagattttctggtg360 ctgcaagctccacctgctgtgtttgaaggagactctgtggttctgaggtgctacgcaaag420 aaaggcatagaagcagagaccctgacattttacaaggatggtaaagctctgacattacat480 catcaaagtgagctctctattcatcatgcaaatctgaaggacaacggtcaatacaaatgc540 acttcgaagaagaagtggtcttttgggtccctctatacttccaatacggtcggagttcaa600 gtccaagagttgttcccacggcctgtgctgagagccagaccctcccatcccatagatgga660 agtccagtgaccctgacgtgtcagacccagctctctgcacagaagtcagatgcccggctc720 cagttctgtttcttcagaaacctccagcttctggggtcaggctgcagccgctcctcagag780 tttcacattcctgccatatggactgaagagtcaaggagataccagtgcaaggcagaaaca840 gtgaattcccaagttagaaaacaaagtacagcgttcataatcccagtgcagagagcttct900 gcgagattccaaacacacatcatcccagcctcaaagttggtgtttgaagggcagttgctg960 ttactcaactgctcagtaaaaggagtyccaggrcccctcaaattctcctggtataaaaag1020 gacatgctgaatgaagaaacaaagattcttaagtcctccaacgcagaattcaagatctcc1080 caggtgaacatcagtgacgcaggggagtatcactgtgaagctaccaacagccgccgaagc1140 tttgtcagcagggcatttcccatcaccataaaagtcccagtatctcaaccagttctcacc1200 ctaagcacaggcaagacccaggcccttgagggagacttgatgacacttcattgtcaatcc1260 cagaggggctctccatgtatcctgtatgaattcttctatgagaatgtctccctggggaat1320 agctctatactctctggaggaggagcatacttcaatttctctatgagcacagagcgatct1380 ggaaactactactgcacagcagacaatggcctgggagcccagtgcagtgaagctataagg1440 atctctatctttgacatgacaaagaacagaagtgttcctatggctgccggaatcactgtg1500 ggactgctcatcatggctgttggagtgtttctgttttattgctggttctctagaaaagca1560 ggaggaaagcctacctctgatgactccagaaacccttcagattcagaaccccaggagccc1620 acctattacaacgtaccagcctgtatagaactgcagccagtgtacagcaatgagcctgag1680 gaaaacgtgatttacacagaagtacggagaactcaaccaagacagaaacatgcagatcag1740 gagtctgaaagcccaagatcaaggtgccagatggctgagaaaaagtag 1788 <210> 99 <211> 1710 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
99 ctgcacagcagtctgtggtttcccttcagcctccatggaccactttcttt60 cagcatgaag cgaggagaggtcgtcacactgacttgttatagattcggcttctccgtaccccagaaaaca120 aaatggtaccagaaaagaaaaacagtgaagcaaaccccaggtgctttggtaattaaagca180 cataccttaaaggtccatgagtccggagagtattggtgccaagccgacagcttacttccg240 agcatgcacgtgaacgtagagttttctgaagattttctggtgctgcaagctccacctgct300 gtgtttgaaggagactctgtggttctgaggtgctacgcaaagaaaggcatagaagcagag360 accctgacattttacaaggatggtaaagctctgacattacatcatcaaagtgagctctct420 attcatcatgcaaatctgaaggacaacggtcaatacaaatgcacttcgaagaagaagtgg480 tcttttgggtccctctatacttccaatacggtcggagttcaagtccaagagttgttccca540 cggcctgtgctgagagccagaccctcccatcccatagatggaagtccagtgaccctgacg600 tgtcagacccagctctctgcacagaagtcagatgcccggctccagttctgtttcttcaga660 aacctccagcttctggggtcaggctgcagccgctcctcagagtttcacattcctgccata720 tggactgaagagtcaaggagataccagtgcaaggcagaaacagtgaattcccaagttaga780 aaacaaagtaoagcgttcataatcccagtgcagagagcttctgcgagattccaaacacac840 atcatcccagcctcaaagttggtgtttgaagggcagttgctgttactcaactgctcagta900 aaaggagtyccaggrcccctcaaattctcctggtataaaaaggacatgctgaatgaagaa960 acaaagattcttaagtcctccaacgcagaattcaagatctcccaggtgaacatcagtgac1020 gcaggggagtatcactgtgaagctaccaacagccgccgaagctttgtcagcagggcattt1080 cccatcaccataaaagtcccagtatctcaaccagttctcaccctaagcacaggcaagacc1140 caggcccttgagggagacttgatgacacttcattgtcaatcccagaggggctctccatgt1200 atcctgtatgaattcttctatgagaatgtctccctggggaatagctctatactctctgga1260 ggaggagcatacttcaatttctctatgagcacagagcgatctggaaactactactgcaca1320 gcagacaatggcctgggagcccagtgcagtgaagctataaggatctctatctttgacatg1380 acaaagaacagaagtgttcctatggctgccggaatcactgtgggactgctcatcatggct1440 gttggagtgtttctgttttattgctggttctctagaaaagcaggaggaaagcctacctct1500 gatgactccagaaacccttcagattcagaaccccaggagcccacctattacaacgtacca1560 gcctgtatagaactgcagccagtgtacagcaatgagcctgaggaaaacgtgatttacaca1620 gaagtacggagaactcaaccaagacagaaacatgcagatcaggagtctgaaagcccaaga1680 tcaaggtgccagatggctgagaaaaagtag 1710 <210> 100 <211> 1401 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
100 ctgcacagcagtctgtggtttcccttcagcctccatggaccactttcttt60 cagcatgaag cgaggagaggtcgtcacactgacttgttatagattcggcttctccgtaccccagaaaaca120 aaatggtaccagaaaagaaaaacagtgaagcaaaccccaggtgctttggtaattaaagca180 cataccttaaaggtccatgagtccggagagtattggtgccaagccgacagcttacttccg240 agcatgcacgtgaacgtagagttttctgaagattttctggtgctgcaagctccacctgct300 gtgtttgaaggagactctgtggttctgaggtgctacgcaaagaaaggcatagaagcagag360 accctgacattttacaaggatggtaaagctctgacattacatcatcaaagtgagctctct420 attcatcatgcaaatctgaaggacaacggtcaatacaaatgcacttcgaagaagaagtgg480 tcttttgggtccctctatacttccaatacggtcggagttcaagtccaagagttgttccca540 cggcctgtgctgagagccagaccctcccatcccatagatggaagtccagtgaccctgacg600 tgtcagacccagctctctgcacagaagtcagatgcccggctccagttctgtttcttcaga660 aacctccagcttctggggtcaggctgcagccgctcctcagagtttcacattcctgccata720 tggactgaagagtcaaggagataccagtgcaaggcagaaacagtgaattcccaagttaga780 aaacaaagtacagcgttcataatcccagtgcagagagcttctgcgagattccaaacacac840 atcatcccagcctcaaagttggtgtttgaagggcagttgctgttactcaactgctcagta900 aaaggagtyccaggrcccctcaaattctcctggtataaaaaggacatgctgaatgaagaa960 acaaagattcttaagtcctccaacgcagaattcaagatctcccaggtgaacatcagtgac1020 gcaggggagtatcactgtgaagctaccaacagccgccgaagctttgtcagcagggcattt1080 cccatcaccataaaagtcccagtatctcaaccagttctcaccctaagcacaggcaagacc1140 caggcccttgagggagacttgatgacacttcattgtcaatcccagaggggctctccatgt1200 atcctgtatgaattcttctatgagaatgtctccctggggaatagctctatactctctgga1260 ggaggagcatacttcaatttctctatgagcacagagcgatctggaaactactactgcaca1320 gcagacaatggcctgggagcccagtgcagtgaagctataaggatctctatctttgacatg1380 acaaagaacagaagtgttcct 1401 <210>
<211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223> Description of Artificial Sequence:/note =
synth etic construct <400> 101 atgtctggttcattctcaccctgtgtggtgttcacacagatgtggctgactctactggtt 60 gtgactcctgtcaatggacagcatgaagctgcacagcagtctgtggtttcccttcagcct 120 ccatggaccactttctttcgaggagaggtcgtcacactgacttgttatagattcggcttc 180 tccgtaccccagaaaacaaaatggtaccagaaaagaaaaacagtgaagcaaaccccaggt 240 gctttggtaattaaagcacataccttaaaggtccatgagtccggagagtattggtgccaa 300 gccgacagcttacttccgagcatgcacgtgaacgtagagttttctgaagattttctggtg 360 ctgcaagctccacctgctgtgtttgaaggagactctgtggttctgaggtgctacgcaaag 420 aaaggcatagaagcagagaccctgacattttacaaggatggtaaagctctgacattacat 980 catcaaagtgagctctctattcatcatgcaaatctgaaggacaacggtcaatacaaatgc 540 acttcgaagaagaagtggtcttttgggtccctctatacttccaatacggtcggagttcaa 600 gtccaagagttgttcccacggcctgtgctgagagccagaccctcccatcccatagatgga 660 agtccagtgaccctgacgtgtcagacccagctctctgcacagaagtcagatgcccggctc 720 cagttctgtttcttcagaaacctccagcttctggggtcaggctgcagccgctcctcagag 780 tttcacattcctgccatatggactgaagagtcaaggagataccagtgcaaggcagaaaca 840 gtgaattcccaagttagaaaacaaagtacagcgttcataatcccagtgcagagagcttct 900 gcgagattccaaacacacatcatcccagcctcaaagttggtgtttgaagggcagttgctg 960 ttactcaactgctcagtaaaaggagtyccaggrcccctcaaattctcctggtataaaaag 1020 gacatgctgaatgaagaaacaaagattcttaagtcctccaacgcagaattcaagatctcc 1080 caggtgaacatcagtgacgcaggggagtatcactgtgaagctaccaacagccgccgaagc 1140 tttgtcagcagggcatttcccatcaccataaaagtcccagtatctcaaccagttctcacc 1200 ctaagcacaggcaagacccaggcccttgagggagacttgatgacacttcattgtcaatcc 1260 cagaggggctctccatgtatcctgtatgaattcttctatgagaatgtctccctggggaat 1320 agctctatactctctggaggaggagcatacttcaatttctctatgagcacagagcgatct 1380 ggaaactactactgcacagcagacaatggcctgggagcccagtgcagtgaagctataagg 1440 atctctatctttgacatgacaaagaacagaagtgttcct 1479 <210> 102 <211> 240 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 102 tctagaaaag caggaggaaa gcctacctct gatgactcca gaaacccttc agattcagaa 60 ccccaggagc ccacctatta caacgtacca gcctgtatag aactgcagcc agtgtacagc 120 aatgagcctg aggaaaacgt gatttacaca gaagtacgga gaactcaacc aagacagaaa 180 catgcagatc aggagtctga aagcccaaga tcaaggtgcc agatggctga gaaaaagtag 240
synthetic construct <400> 93 aaaaa taggaagaca gtcagaggatccagtcaggagccctcctcagactgtgctc60 a a g tcaaggcagccagagcccctgtatgagaac120 g a caaggatcca cgtaccccaa atcccccgac gtgaacgttg taagtggcaa tctctggtgtaccacaccccgcaggtgctg180 tgaagtgtac gaaccagcag cagctcagca cacggagtaagtgagtcctttcaggtctcc240 tgtgaggaca tctggactct attctaagcc attgcacatatggactatgaagacgccatg300 aaggataaac tag <210> 94 <211> 1567 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
94 ggttttgcttttttttcttttggtgagaggtaccttcaagttaccatccc60 tgagtagtct cagcctggtcctcatgctaccttggctcctgctactgatctgtgctctaccgtgtgaacc120 tgctggaatctctgatgtgagcttgaagacacggcccccaggaggatgggtgatggaggg180 agacaagctggtcctcatctgctcggttgatagagtcactgggaatataacttacttctg240 gtacagaggggccctgggtttccaactggaaacaaagacacaaccttcactaacagcaga300 gtttgagatcagtgacatgaagcagagcgatgctgatcaatattactgtgcggctaacga360 tggccacgaccctatcgccagtgagctggtgagcatccacgtcagagttccagtgtctcg420 ccctgtccttacgtttggggactctggaacccaggctgtgctaggggacctggtggagct480 tcactgtaaggccctgagaggctcacccccaatcttctaccagttttatcatgagagcat540 catcctggggaacagttcagcaccctctggaggaggagcatccttcaacttctccctgac600 tgcagaacattctggaaacttctcctgtgaggccagcaatggacagggtgcccaacgaag660 tgaggtggtggctctcaacttaacaggtctctccttagtgcctactgagaatggaatcag720 ccatctctccttaggactcactgggtggctgcttggctgtcttagccccatcaccatggc780 cttaatattttgctactggctcaagagaaaaataggaagacagtcagaggatccagtcag840 gagccctcctcagactgtgctccaaggatccacgtaccccaaatcccccgactcaaggca900 gccagagcccctgtatgagaacgtgaacgttgtaagtggcaatgaagtgtactctctggt960 gtaccacaccccgcaggtgctggaaccagcagcagctcagcatgtgaggacacacggagt1020 aagtgagtcctttcaggtctcctctggactctattctaagccaaggataaacattgcaca1080 tatggactatgaagacgccatgtagaattatgtaaacagcaactatggagtgctacatac1140 aagcccaaggcctgatgtggcctccaaggatactggggacagggatagcttgccagccca1200 atttccccacacactgcggttcattagatgagtccttcacctaccctgtgtgaagctgga1260 gcaagtcctgcagaaaccacccaggaaaaccaacttagacggagaagccagaagcatttg1320 catctggttgttgcccattcatgttggcacacgaacttttatttacaggaggaaaatggt1380 gtgatgaaagcaactaaggtcttacagcagagggacaatgcgactcagagagcacaaagc1440 cgagatcaatggctttgcaggtctgctgtggagacagagccatgcttcctctgtgcacat1500 accctagagtacttctgagtcactgccatcaacttagaattaaacacagttgcataaaat1560 gtactgt <210> 95 <211> 903 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 95 atgctacctt ggctcctgct actgatctgt gctctaccgt gtgaacctgc tggaatctct 60 gatgtgagct tgaagacacg gcccccagga ggatgggtga tggagggaga caagctggtc 120 ctcatctgctcggttgatagagtcactgggaatataacttacttctggtacagaggggcc180 ctgggtttccaactggaaacaaagacacaaccttcactaacagcagagtttgagatcagt240 gacatgaagcagagcgatgctgatcaatattactgtgcggctaacgatggccacgaccct300 atcgccagtgagctggtgagcatccacgtcagagttccagtgtctcgccctgtccttacg360 tttggggactctggaacccaggctgtgctaggggacctggtggagcttcactgtaaggcc920 ctgagaggctcacccccaatcttctaccagttttatcatgagagcatcatcctggggaac480 agttcagcaccctctggaggaggagcatccttcaacttctccctgactgcagaacattct540 ggaaacttctcctgtgaggccagcaatggacagggtgcccaacgaagtgaggtggtggct600 ctcaacttaacaggaagacagtcagaggatccagtcaggagccctcctcagactgtgctc660 caaggatccacgtaccccaaatcccccgactcaaggcagccagagcccctgtatgagaac720 gtgaacgttgtaagtggcaatgaagtgtactctctggtgtaccacaccccgcaggtgctg780 gaaccagcagcagctcagcatgtgaggacacacggagtaagtgagtcctttcaggtctcc840 tctggactctattctaagccaaggataaacattgcacatatggactatgaagacgccatg900 tag <2l0> 96 <211> 852 <212> DNA
<213> Artificial Sequence <220>
<223> equence:/note =
Description of Artificial S
synthetic construct <400>
96 atgtgagcttgaagacacggcccccaggaggatgggtgatggagggagac60 ggaatctctg aagctggtcctcatctgctcggttgatagagtcactgggaatataacttacttctggtac120 agaggggccctgggtttccaactggaaacaaagacacaaccttcactaacagcagagttt180 gagatcagtgacatgaagcagagcgatgctgatcaatattactgtgcggctaacgatggc240 cacgaccctatcgccagtgagctggtgagcatccacgtcagagttccagtgtctcgccct300 gtccttacgtttggggactctggaacccaggctgtgctaggggacctggtggagcttcac360 tgtaaggccctgagaggctcacccccaatcttctaccagttttatcatgagagcatcatc420 ctggggaacagttcagcaccctctggaggaggagcatccttcaacttctccctgactgca480 gaacattctggaaacttctcctgtgaggccagcaatggacagggtgcccaacgaagtgag540 gtggtggctctcaacttaacaggaagacagtcagaggatccagtcaggagccctcctcag600 actgtgctccaaggatccacgtaccccaaatcccccgactcaaggcagccagagcccctg660 tatgagaacgtgaacgttgtaagtggcaatgaagtgtactctctggtgtaccacaccccg720 caggtgctggaaccagcagcagctcagcatgtgaggacacacggagtaagtgagtccttt780 caggtctcctctggactctattctaagccaaggataaacattgcacatatggactatgaa840 gacgccatgtag <210>
<211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223>
Description of Artificial Sequence:/note =
synthetic construct <400>
97 cactcaactgttttagaagagcagttccccagatttctccttggagctgt60 attcaagtta gagtgactaccattgcgagcaagagcaagaggaaagcactacctgtgagcagatgtctgg120 ttcattctcaccctgtgtggtgttcacacagatgtggctgactctactggttgtgactcc180 tgtcaatggacagcatgaagctgcacagcagtctgtggtttcccttcagcctccatggac240 cactttctttcgaggagaggtcgtcacactgacttgttatagattcggcttctccgtacc300 ccagaaaacaaaatggtaccagaaaagaaaaacagtgaagcaaaccccaggtgctttggt360 aattaaagcacataccttaaaggtccatgagtccggagagtattggtgccaagccgacag420 cttacttccgagcatgcacgtgaacgtagagttttctgaagattttctggtgctgcaagc480 tccacctgctgtgtttgaaggagactctgtggttctgaggtgctacgcaaagaaaggcat540 agaagcagagaccctgacattttacaaggatggtaaagctctgacattacatcatcaaag600 tgagctctctattcatcatgcaaatctgaaggacaacggtcaatacaaatgcacttcgaa660 gaagaagtggtcttttgggtccctctatacttccaatacggtcggagttcaagtccaaga720 gttgttcccacggcctgtgctgagagccagaccctcccatcccatagatggaagtccagt780 gaccctgacgtgtcagacccagctctctgcacagaagtcagatgcccggctccagttctg840 tttcttcagaaacctccagcttctggggtcaggctgcagccgctcctcagagtttcacat900 tcctgccatatggactgaagagtcaaggagataccagtgcaaggcagaaacagtgaattc960 ccaagttagaaaacaaagtacagcgttcataatcccagtgcagagagcttctgcgagatt1020 ccaaacacacatcatcccagcctcaaagttggtgtttgaagggcagttgctgttactcaa1080 ctgctcagtaaaaggagtyccaggrcccctcaaattctcctggtataaaaaggacatgct1140 gaatgaagaaacaaagattcttaagtcctccaacgcagaattcaagatctcccaggtgaa1200 catcagtgacgcaggggagtatcactgtgaagctaccaacagccgccgaagctttgtcag1260 cagggcatttcccatcaccataaaagtcccagtatctcaaccagttctcaccctaagcac1320 aggcaagacccaggcccttgagggagacttgatgacacttcattgtcaatcccagagggg1380 ctctccatgtatcctgtatgaattcttctatgagaatgtctccctggggaatagctctat1440 actctctggaggaggagcatacttcaatttctctatgagcacagagcgatctggaaacta1500 ctactgcacagcagacaatggcctgggagcccagtgcagtgaagctataaggatctctat1560 ctttgacatgacaaagaacagaagtgttcctatggctgccggaatcactgtgggactgct1620 catcatggctgttggagtgtttctgttttattgctggttctctagaaaagcaggaggaaa1680 gcctacctctgatgactccagaaacccttcagattcagaaccccaggagcccacctatta1740 caacgtaccagcctgtatagaactgcagccagtgtacagcaatgagcctgaggaaaacgt1800 gatttacacagaagtacggagaactcaaccaagacagaaacatgcagatcaggagtctga1860 aagcccaagatcaaggtgccagatggctgagaaaaagtaggatatgtctcctccaagaac1920 agctccagaaaagaaacccgaagcttcgtcagtctaatctcaccgatgcttctactgggc1980 ctgcactttcctacccacggatggctccacagatcatggacagcaaggaaatggccaact2040 ctcctaagactgggccaacatccccatcttctctttggtttcccagagccacgccacccc2100 aaagtcagcaggaagttgcaaaagatcacaacgaccctattcctgttttgtaaccacccc2160 cagcctgaagcaggctgagccagaccttgaccttgctgccactaaggagattacctaggg2220 tggagcctgcctctctagatcactctattgttcagccactgccactgttctccttcaaga2280 cactgctacctgctgggaggccactgagctattccagagactacaccctatcctgcacat2340 catcacctgtagcctgttccaggctccaagaatgaattggcggcaatgggcctcccccct2400 accccctttataagtgcatttgccattaaacatttgggctttgatct 2447 <210> 98 <211> 1788 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
98 cattctcaccctgtgtggtgttcacacagatgtggctgactctactggtt60 atgtctggtt gtgactcctgtcaatggacagcatgaagctgcacagcagtctgtggtttcccttcagcct120 ccatggaccactttctttcgaggagaggtcgtcacactgacttgttatagattcggcttc180 tccgtaccccagaaaacaaaatggtaccagaaaagaaaaacagtgaagcaaaccccaggt240 gctttggtaattaaagcacataccttaaaggtccatgagtccggagagtattggtgccaa300 gccgacagcttacttccgagcatgcacgtgaacgtagagttttctgaagattttctggtg360 ctgcaagctccacctgctgtgtttgaaggagactctgtggttctgaggtgctacgcaaag420 aaaggcatagaagcagagaccctgacattttacaaggatggtaaagctctgacattacat480 catcaaagtgagctctctattcatcatgcaaatctgaaggacaacggtcaatacaaatgc540 acttcgaagaagaagtggtcttttgggtccctctatacttccaatacggtcggagttcaa600 gtccaagagttgttcccacggcctgtgctgagagccagaccctcccatcccatagatgga660 agtccagtgaccctgacgtgtcagacccagctctctgcacagaagtcagatgcccggctc720 cagttctgtttcttcagaaacctccagcttctggggtcaggctgcagccgctcctcagag780 tttcacattcctgccatatggactgaagagtcaaggagataccagtgcaaggcagaaaca840 gtgaattcccaagttagaaaacaaagtacagcgttcataatcccagtgcagagagcttct900 gcgagattccaaacacacatcatcccagcctcaaagttggtgtttgaagggcagttgctg960 ttactcaactgctcagtaaaaggagtyccaggrcccctcaaattctcctggtataaaaag1020 gacatgctgaatgaagaaacaaagattcttaagtcctccaacgcagaattcaagatctcc1080 caggtgaacatcagtgacgcaggggagtatcactgtgaagctaccaacagccgccgaagc1140 tttgtcagcagggcatttcccatcaccataaaagtcccagtatctcaaccagttctcacc1200 ctaagcacaggcaagacccaggcccttgagggagacttgatgacacttcattgtcaatcc1260 cagaggggctctccatgtatcctgtatgaattcttctatgagaatgtctccctggggaat1320 agctctatactctctggaggaggagcatacttcaatttctctatgagcacagagcgatct1380 ggaaactactactgcacagcagacaatggcctgggagcccagtgcagtgaagctataagg1440 atctctatctttgacatgacaaagaacagaagtgttcctatggctgccggaatcactgtg1500 ggactgctcatcatggctgttggagtgtttctgttttattgctggttctctagaaaagca1560 ggaggaaagcctacctctgatgactccagaaacccttcagattcagaaccccaggagccc1620 acctattacaacgtaccagcctgtatagaactgcagccagtgtacagcaatgagcctgag1680 gaaaacgtgatttacacagaagtacggagaactcaaccaagacagaaacatgcagatcag1740 gagtctgaaagcccaagatcaaggtgccagatggctgagaaaaagtag 1788 <210> 99 <211> 1710 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
99 ctgcacagcagtctgtggtttcccttcagcctccatggaccactttcttt60 cagcatgaag cgaggagaggtcgtcacactgacttgttatagattcggcttctccgtaccccagaaaaca120 aaatggtaccagaaaagaaaaacagtgaagcaaaccccaggtgctttggtaattaaagca180 cataccttaaaggtccatgagtccggagagtattggtgccaagccgacagcttacttccg240 agcatgcacgtgaacgtagagttttctgaagattttctggtgctgcaagctccacctgct300 gtgtttgaaggagactctgtggttctgaggtgctacgcaaagaaaggcatagaagcagag360 accctgacattttacaaggatggtaaagctctgacattacatcatcaaagtgagctctct420 attcatcatgcaaatctgaaggacaacggtcaatacaaatgcacttcgaagaagaagtgg480 tcttttgggtccctctatacttccaatacggtcggagttcaagtccaagagttgttccca540 cggcctgtgctgagagccagaccctcccatcccatagatggaagtccagtgaccctgacg600 tgtcagacccagctctctgcacagaagtcagatgcccggctccagttctgtttcttcaga660 aacctccagcttctggggtcaggctgcagccgctcctcagagtttcacattcctgccata720 tggactgaagagtcaaggagataccagtgcaaggcagaaacagtgaattcccaagttaga780 aaacaaagtaoagcgttcataatcccagtgcagagagcttctgcgagattccaaacacac840 atcatcccagcctcaaagttggtgtttgaagggcagttgctgttactcaactgctcagta900 aaaggagtyccaggrcccctcaaattctcctggtataaaaaggacatgctgaatgaagaa960 acaaagattcttaagtcctccaacgcagaattcaagatctcccaggtgaacatcagtgac1020 gcaggggagtatcactgtgaagctaccaacagccgccgaagctttgtcagcagggcattt1080 cccatcaccataaaagtcccagtatctcaaccagttctcaccctaagcacaggcaagacc1140 caggcccttgagggagacttgatgacacttcattgtcaatcccagaggggctctccatgt1200 atcctgtatgaattcttctatgagaatgtctccctggggaatagctctatactctctgga1260 ggaggagcatacttcaatttctctatgagcacagagcgatctggaaactactactgcaca1320 gcagacaatggcctgggagcccagtgcagtgaagctataaggatctctatctttgacatg1380 acaaagaacagaagtgttcctatggctgccggaatcactgtgggactgctcatcatggct1440 gttggagtgtttctgttttattgctggttctctagaaaagcaggaggaaagcctacctct1500 gatgactccagaaacccttcagattcagaaccccaggagcccacctattacaacgtacca1560 gcctgtatagaactgcagccagtgtacagcaatgagcctgaggaaaacgtgatttacaca1620 gaagtacggagaactcaaccaagacagaaacatgcagatcaggagtctgaaagcccaaga1680 tcaaggtgccagatggctgagaaaaagtag 1710 <210> 100 <211> 1401 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400>
100 ctgcacagcagtctgtggtttcccttcagcctccatggaccactttcttt60 cagcatgaag cgaggagaggtcgtcacactgacttgttatagattcggcttctccgtaccccagaaaaca120 aaatggtaccagaaaagaaaaacagtgaagcaaaccccaggtgctttggtaattaaagca180 cataccttaaaggtccatgagtccggagagtattggtgccaagccgacagcttacttccg240 agcatgcacgtgaacgtagagttttctgaagattttctggtgctgcaagctccacctgct300 gtgtttgaaggagactctgtggttctgaggtgctacgcaaagaaaggcatagaagcagag360 accctgacattttacaaggatggtaaagctctgacattacatcatcaaagtgagctctct420 attcatcatgcaaatctgaaggacaacggtcaatacaaatgcacttcgaagaagaagtgg480 tcttttgggtccctctatacttccaatacggtcggagttcaagtccaagagttgttccca540 cggcctgtgctgagagccagaccctcccatcccatagatggaagtccagtgaccctgacg600 tgtcagacccagctctctgcacagaagtcagatgcccggctccagttctgtttcttcaga660 aacctccagcttctggggtcaggctgcagccgctcctcagagtttcacattcctgccata720 tggactgaagagtcaaggagataccagtgcaaggcagaaacagtgaattcccaagttaga780 aaacaaagtacagcgttcataatcccagtgcagagagcttctgcgagattccaaacacac840 atcatcccagcctcaaagttggtgtttgaagggcagttgctgttactcaactgctcagta900 aaaggagtyccaggrcccctcaaattctcctggtataaaaaggacatgctgaatgaagaa960 acaaagattcttaagtcctccaacgcagaattcaagatctcccaggtgaacatcagtgac1020 gcaggggagtatcactgtgaagctaccaacagccgccgaagctttgtcagcagggcattt1080 cccatcaccataaaagtcccagtatctcaaccagttctcaccctaagcacaggcaagacc1140 caggcccttgagggagacttgatgacacttcattgtcaatcccagaggggctctccatgt1200 atcctgtatgaattcttctatgagaatgtctccctggggaatagctctatactctctgga1260 ggaggagcatacttcaatttctctatgagcacagagcgatctggaaactactactgcaca1320 gcagacaatggcctgggagcccagtgcagtgaagctataaggatctctatctttgacatg1380 acaaagaacagaagtgttcct 1401 <210>
<211>
<212>
DNA
<213> ficial ence Arti Sequ <220>
<223> Description of Artificial Sequence:/note =
synth etic construct <400> 101 atgtctggttcattctcaccctgtgtggtgttcacacagatgtggctgactctactggtt 60 gtgactcctgtcaatggacagcatgaagctgcacagcagtctgtggtttcccttcagcct 120 ccatggaccactttctttcgaggagaggtcgtcacactgacttgttatagattcggcttc 180 tccgtaccccagaaaacaaaatggtaccagaaaagaaaaacagtgaagcaaaccccaggt 240 gctttggtaattaaagcacataccttaaaggtccatgagtccggagagtattggtgccaa 300 gccgacagcttacttccgagcatgcacgtgaacgtagagttttctgaagattttctggtg 360 ctgcaagctccacctgctgtgtttgaaggagactctgtggttctgaggtgctacgcaaag 420 aaaggcatagaagcagagaccctgacattttacaaggatggtaaagctctgacattacat 980 catcaaagtgagctctctattcatcatgcaaatctgaaggacaacggtcaatacaaatgc 540 acttcgaagaagaagtggtcttttgggtccctctatacttccaatacggtcggagttcaa 600 gtccaagagttgttcccacggcctgtgctgagagccagaccctcccatcccatagatgga 660 agtccagtgaccctgacgtgtcagacccagctctctgcacagaagtcagatgcccggctc 720 cagttctgtttcttcagaaacctccagcttctggggtcaggctgcagccgctcctcagag 780 tttcacattcctgccatatggactgaagagtcaaggagataccagtgcaaggcagaaaca 840 gtgaattcccaagttagaaaacaaagtacagcgttcataatcccagtgcagagagcttct 900 gcgagattccaaacacacatcatcccagcctcaaagttggtgtttgaagggcagttgctg 960 ttactcaactgctcagtaaaaggagtyccaggrcccctcaaattctcctggtataaaaag 1020 gacatgctgaatgaagaaacaaagattcttaagtcctccaacgcagaattcaagatctcc 1080 caggtgaacatcagtgacgcaggggagtatcactgtgaagctaccaacagccgccgaagc 1140 tttgtcagcagggcatttcccatcaccataaaagtcccagtatctcaaccagttctcacc 1200 ctaagcacaggcaagacccaggcccttgagggagacttgatgacacttcattgtcaatcc 1260 cagaggggctctccatgtatcctgtatgaattcttctatgagaatgtctccctggggaat 1320 agctctatactctctggaggaggagcatacttcaatttctctatgagcacagagcgatct 1380 ggaaactactactgcacagcagacaatggcctgggagcccagtgcagtgaagctataagg 1440 atctctatctttgacatgacaaagaacagaagtgttcct 1479 <210> 102 <211> 240 <212> DNA
<213> Artificial Sequence <220>
<223> Description of Artificial Sequence:/note =
synthetic construct <400> 102 tctagaaaag caggaggaaa gcctacctct gatgactcca gaaacccttc agattcagaa 60 ccccaggagc ccacctatta caacgtacca gcctgtatag aactgcagcc agtgtacagc 120 aatgagcctg aggaaaacgt gatttacaca gaagtacgga gaactcaacc aagacagaaa 180 catgcagatc aggagtctga aagcccaaga tcaaggtgcc agatggctga gaaaaagtag 240
Claims (219)
1. An isolated FcRH, comprising a cytoplasmic region with more than 107 or less than 104 amino acids, a transmembrance region, and an extracellular region.
2. The isolated FcRH of claim 1, wherein the extracellular region comprises less than four Ig domains.
3. The isolated FcRH of claim 2, wherein the cytoplamic region comprises less than 104 amino acids.
4. The isolated FcRH of claim 3, wherein the transmembrane region comprises an acidic amino acid.
5. The isolated FcRH of claim 4, wherein the acidic amino acid is glutamate.
6. The isolated FcRH of claim 2, wherein the cytoplasmic region comprises the amino acid sequence of SEQ ID NO:1
7. The isolated FcRH of claim 2, wherein the extracellular region comprises the amino acid sequence of SEQ ID NO:21.
8. The isolated FcRH of claim 1, comprising the amino acid sequence of SEQ ID
NO:2.
NO:2.
9. The isolated FcRH of claim 1, wherein the receptor is expressed by myeloid cells.
10. The isolated FcRH of claim 9, wherein the receptor is expressed by T-cells.
11. A polypeptide comprising the amino acid sequence of SEQ ID NO: 1.
12. A polypeptide comprising the amino acid of SEQ ID NO:1 with conservative amino acid substitutions.
13. A polypeptide comprising the amino acid sequence of SEQ ID NO:21
14. A polypeptide comprising the amino acid of SEQ ID NO:21 with conservative amino acid substitutions.
15. A polypeptide comprising the amino acid of SEQ ID NO:2.
16. A polypeptide comprising the amino acid of SEQ ID NO:2 with conservative amino acid substitutions.
17. The isolated FcRH of claim 1, wherein the cytoplasmic region comprises less than 99 amino acids and wherein the receptor further comprises an extracellular region with up to four Ig domain and up to five N-linked glycosylation sites.
18. The isolated FcRH of claim 17, wherein the cytoplasmic region comprises the amino acid sequence of SEQ ID NO:3.
19. The isolated FcRH of claim 17, wherein the extracellular region comprises the amino acid sequence of SEQ ID NO:22.
20. The isolated FcRH of claim 1, comprising the amino acid sequence of SEQ ID
NO:4.
NO:4.
21. A polypeptide comprising the amino acid sequence of SEQ ID NO:3.
22. A polypeptide comprising the amino acid of SEQ ID NO:3 with conservative amino acid substitutions.
23. A polypeptide comprising the amino acid sequence of SEQ ID NO:22
24. A polypeptide comprising the amino acid of SEQ ID NO:22 with conservative amino acid substitutions.
25. A polypeptide comprising the amino acid of SEQ ID NO:4.
26. A polypeptide comprising the amino acid of SEQ ID NO:4 with conservative amino acid substitutions.
27. The isolated FcRH of claim 1, wherein the cytoplasmic region comprises more than 107 amino acids.
28. The isolated FcRH of claim 27, wherein the cytoplasmic region comprises the amino acid sequence of SEQ ID NO:5.
29. The isolated FcRH of claim 27, wherein the cytoplasmic region comprises the amino acid sequence of SEQ ID NO:23.
30. The isolated FcRH of claim 27, wherein the extracellular region comprises the amino acid sequence of SEQ ID NO:24.
31. The isolated FcRH of claim 1, comprising the amino acid sequence of SEQ ID
NO:6.
NO:6.
32. The isolated FcRH of claim 1, comprising the amino acid sequence of SEQ ID
NO:25.
NO:25.
33. A polypeptide comprising the amino acid sequence of SEQ ID NO:5
34. A polypeptide comprising the amino acid of SEQ ID NO:5 with conservative amino acid substitutions.
35. A polypeptide comprising the amino acid sequence of SEQ ID NO:24
36. A polypeptide comprising the amino acid of SEQ ID NO:24 with conservative amino acid substitutions.
37. A polypeptide comprising the amino acid sequence of SEQ ID NO:23
38. A polypeptide comprising the amino acid of SEQ ID NO:23 with conservative amino acid substitutions.
39. A polypeptide comprising the amino acid sequence of SEQ ID NO:6.
40. A polypeptide comprising the amino acid of SEQ ID NO:6 with conservative amino acid substitutions.
41. A polypeptide comprising the amino acid sequence of SEQ ID NO:25.
42. A polypeptide comprising the amino acid of SEQ ID NO:25 with conservative amino acid substitutions.
43. The isolated FcRH of claim 1, wherein the cytoplasmic region comprises the amino acid sequence of SEQ ID NO:26.
44. The isolated FcRH of claim 1, wherein the extracellular region comprises the amino acid sequence of SEQ ID NO:27.
45. The isolated FcRH of claim 1, comprising the amino acid sequence of SEQ ID
NO:28.
NO:28.
46. An isolated nucleic acid, comprising a nucleotide sequence that encodes the FcRH
of claim 2.
of claim 2.
47. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:1.
NO:1.
48. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:21.
NO:21.
49. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:2.
NO:2.
50. The nucleic acid of claim 46, comprising the nucleotide sequence of SEQ ID
NO:7.
NO:7.
51. An isolated nucleic acid comprising a sequence that hybridizes under highly stringent conditions to a hybridization probe, wherein the hybridization probe comprises the nucleotide sequence of SEQ ID NO:7, or the complement of SEQ ID
NO:7.
NO:7.
52. The nucleic acid of claim 46, comprising the nucleotide sequence of SEQ ID
NO:13.
NO:13.
53. An isolated nucleic acid comprising a sequence that hybridizes under highly stringent conditions to a hybridization probe, wherein the hybridization probe comprises the nucleotide sequence of SEQ ID NO:13 or the complement of SEQ
ID NO:13.
ID NO:13.
54. The nucleic acid of claim 46, comprising the nucleotide sequence of SEQ ID
NO:8.
NO:8.
55. An isolated nucleic acid comprising a sequence that hybridizes under highly stringent conditions to a hybridization probe, wherein the hybridization probe comprises the nucleotide sequence of SEQ ID NO:8, or the complement of SEQ ID
NO:8.
NO:8.
56. A single stranded nucleic acid that hybridizes under stringent conditions to a nucleic acid having the sequence of SEQ ID NO:7, SEQ ID NO:13 or SEQ ID
NO:8.
NO:8.
57. An isolated nucleic acid, comprising a nucleotide sequence that encodes the FcRH
of claim 17.
of claim 17.
58. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:3.
NO:3.
59. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:22.
NO:22.
60. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:4.
NO:4.
61. The nucleic acid of claim 57, comprising the nucleotide sequence of SEQ ID
NO:9.
NO:9.
62. An isolated nucleic acid comprising a sequence that hybridizes under highly stringent conditions to a hybridization probe, wherein the hybridization probe comprises the nucleotide sequence of SEQ ID NO:9, or the complement of SEQ ID
NO:9.
NO:9.
63. The nucleic acid of claim 57, comprising the nucleotide sequence of SEQ ID
NO:14.
NO:14.
64. An isolated nucleic acid comprising a sequence that hybridizes under highly stringent conditions to a hybridization probe, wherein the hybridization probe comprises the nucleotide sequence of SEQ ID NO:14, or the complement of SEQ
ID NO:14.
ID NO:14.
65. The nucleic acid of claim 57, comprising the nucleotide sequence of SEQ ID
NO:10.
NO:10.
66. An isolated nucleic acid comprising a sequence that hybridizes under highly stringent conditions to a hybridization probe, wherein the hybridization probe comprises the nucleotide sequence of SEQ ID NO:10, or the complement of SEQ
ID NO:10.
ID NO:10.
67. A single stranded nucleic acid that hybridizes under stringent conditions to a nucleic acid having the sequence of SEQ ID NO:9, SEQ ID NO:14, or SEQ ID
NO:10.
NO:10.
68. An isolated nucleic acid, comprising a nucleotide sequence that encodes the FcRH
of claim 27.
of claim 27.
69. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:5.
NO:5.
70. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:23.
NO:23.
71. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:24.
NO:24.
72. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:6.
NO:6.
73. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:25.
NO:25.
74. The nucleic acid of claim 68, comprising the nucleotide sequence of SEQ ID
NO:11.
NO:11.
75. An isolated nucleic acid comprising a sequence that hybridizes under highly stringent conditions to a hybridization probe, wherein the hybridization probe comprises the nucleotide sequence of SEQ ID NO:11, or the complement of SEQ
ID NO:11.
ID NO:11.
76. The nucleic acid of claim 68, comprising the nucleotide sequence of SEQ ID
NO:16.
NO:16.
77. An isolated nucleic acid comprising a sequence that hybridizes under highly stringent conditions to a hybridization probe, wherein the hybridization probe comprises the nucleotide sequence of SEQ ID NO:16, or the complement of SEQ
ID NO:16.
ID NO:16.
78. The nucleic acid of claim 68, comprising the nucleotide sequence of SEQ ID
NO:15.
NO:15.
79. An isolated nucleic acid comprising a sequence that hybridizes under highly stringent conditions to a hybridization probe, wherein the hybridization probe comprises the nucleotide sequence of SEQ ID NO:15, or the complement of SEQ
ID NO:15.
ID NO:15.
80. The nucleic acid of claim 68, comprising the nucleotide sequence of SEQ ID
NO:12.
NO:12.
81. An isolated nucleic acid comprising a sequence that hybridizes under highly stringent conditions to a hybridization probe, wherein the hybridization probe comprises the nucleotide sequence of SEQ ID NO:12, or the complement of SEQ
ID NO:12.
ID NO:12.
82. The nucleic acid of claim 68, comprising the nucleotide sequence of SEQ ID
NO:17.
NO:17.
83. An isolated nucleic acid comprising a sequence that hybridizes under highly stringent conditions to a hybridization probe, wherein the hybridization probe comprises the nucleotide sequence of SEQ ID NO:11, or the complement of SEQ
ID NO:17.
ID NO:17.
84. A single stranded nucleic acid that hybridizes under stringent conditions to a nucleic acid having the sequence of SEQ ID NO:11, SEQ ID NO:15, SEQ ID
NO:16, or SEQ ID NO:12.
NO:16, or SEQ ID NO:12.
85. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:26.
NO:26.
86. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:27.
NO:27.
87. An isolated nucleic acid, comprising a nucleotide sequence that encodes SEQ ID
NO:28.
NO:28.
88. An isolated nucleic acid comprising a sequence that hybridizes under highly stringent conditions to a hybridization probe, wherein the hybridization probe comprises the nucleotide sequence of SEQ ID NO:18, or the complement of SEQ
ID NO:18.
ID NO:18.
89. An isolated nucleic acid comprising a sequence that hybridizes under highly stringent conditions to a hybridization probe, wherein the hybridization probe comprises the nucleotide sequence of SEQ ID NO:19, or the complement of SEQ
ID NO:19.
ID NO:19.
90. An isolated nucleic acid comprising a sequence that hybridizes under highly stringent conditions to a hybridization probe, wherein the hybridization probe comprises the nucleotide sequence of SEQ ID NO:20, or the complement of SEQ
ID NO:20.
ID NO:20.
91. A single stranded nucleic acid that hybridizes under stringent conditions to a nucleic acid having the sequence of SEQ ID NO:18, SEQ ID NO:19, or SEQ ID
NO:20.
NO:20.
92. An expression vector comprising the nucleic acid of claim 46 operably linked to an expression control sequence.
93. An isolated cell comprising the vector of claim 92.
94. A method of making a FcRH, comprising culturing the cell of claim 93 under conditions permitting expression of the FcRH.
95. An expression vector comprising the nucleic acid of claim 57 operably linked to an expression control sequence.
96. An isolated cell comprising the vector of claim 95.
97. A method of making a FcRH, comprising culturing the cell of claim 96 under conditions permitting expression of the FcRH.
98. An expression vector comprising the nucleic acid of claim 56 operably linked to an expression control sequence.
99. An isolated cell comprising the vector of claim 98.
100. A method of making a FcRH, comprising culturing the cell of claim 99 under conditions permitting expression of the FcRH.
101. An expression vector comprising the nucleic acid of claim 57 operably linked to an expression control sequence.
102. An isolated cell comprising the vector of claim 101.
103. A method of making a FcRH, comprising culturing the cell of claim 102 under conditions permitting expression of the FcRH.
104. An expression vector comprising the nucleic acid of claim 67 operably linked to an expression control sequence.
105. An isolated cell comprising the vector of claim 104.
106. A method of making a FcRH, comprising culturing the cell of claim 105 under conditions permitting expression of the FcRH.
107. An expression vector comprising the nucleic acid of claim 68 operably linked to an expression control sequence.
108. An isolated cell comprising the vector of claim 107.
109. A method of making a FcRH, comprising culturing the cell of claim 108 under conditions permitting expression of the FcRH.
110. An expression vector comprising the nucleic acid of claim 91 operably linked to an expression control sequence.
111. An isolated cell comprising the vector of claim 110.
112. A method of making a FcRH, comprising culturing the cell of claim 111 under conditions permitting expression of the FcRH.
113. A purified antibody or immunonologic fragment thereof, wherein the antibody or fragment thereof selectively binds to the FcRH of claim 1.
114. A purified antibody or immunonologic fragment thereof, wherein the antibody or fragment thereof selectively binds to the FcRH of claim 2.
115. A purified antibody or immunonologic fragment thereof, wherein the antibody or fragment thereof selectively binds to the FcRH of claim 17.
116. A purified antibody or immunonologic fragment thereof, wherein the antibody or fragment thereof selectively binds to the FcRH of claim 27.
117. The antibody or fragment of claim 113, wherein the antibody or fragment is a monoclonal antibody or fragment thereof.
118. The antibody or fragment of claim 113, wherein the antibody or fragment thereof is a humanized antibody, a fully human antibody, or a fragment thereof.
119. The antibody or fragment of claim 113, wherein the antibody or fragment thereof is a single chain antibody or fragment thereof.
120. The antibody or fragment of claim 113, wherein the antibody or fragment thereof is labeled.
121. The antibody or fragment of claim 113, wherein the label is a radiolabel.
122. The antibody or fragment of claim 113, wherein the antibody or fragment is conjugated or fused with a toxin.
123. A purified antibody that selectively binds to the FcRH of claim 6, but not to the FcRH of claim 18, 28, or 43.
124. A purified antibody that selectively binds to the FcRH of claim 18, but not to the FcRH of claim 6, 28, or 43.
125. A purified antibody that selectively binds to the FcRH of claim 28, but not to the FcRH of claim 6, 18, or 43.
126. The purified antibody of claim 125, wherein the antibody does not bind to the FcRH of claim 29.
127. A purified antibody that selectively binds to the FcRH of claim 29, but not to the FcRH of claim 6, 18, or 43.
128. The purified antibody of claim 127, wherein the antibody does not bind to the FcRH of claim 28.
129. A purified antibody that selectively binds to the FcRH of claim 43, but not to the FcRH of claim 6, 18, or 28.
130. A purified antibody that selectively binds to the FcRH of claim 7, but not to the FcRH of claim 19, 30, or 44.
131. A purified antibody that selectively binds to the FcRH of claim 19, but not to the FcRH of claim 7, 30, or 44.
132. A purified antibody that selectively binds to the FcRH of claim 30, but not to the FcRH of claim 7, 19, or 44.
133. A purified antibody that selectively binds to the FcRH of claim 44, but not to the FcRH of claim 7, 19, or 30.
134. A method of diagnosing a malignancy of hematopoietic cell lineage in a subject, comprising:
(a) contacting a biological sample of the subject with the antibody of claim 113 under conditions that allow the antibody to bind to an FcRH in the biological sample;
(b) detecting the amount or pattern of binding by the antibody, changes in the amount or pattern of binding as compared to binding in a control sample indicating a malignancy of hematopoietic cell lineage in the subject.
(a) contacting a biological sample of the subject with the antibody of claim 113 under conditions that allow the antibody to bind to an FcRH in the biological sample;
(b) detecting the amount or pattern of binding by the antibody, changes in the amount or pattern of binding as compared to binding in a control sample indicating a malignancy of hematopoietic cell lineage in the subject.
135. The method of claim 134, wherein the malignancy of hematopoietic cell lineage is a malignancy of B cell lineage.
136. The method of claim 134, wherein the malignancy of hematopoietic cell lineage is a malignancy of T cell lineage.
137. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:1.
having the amino acid sequence of SEQ ID NO:1.
138. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:21.
having the amino acid sequence of SEQ ID NO:21.
139. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:2.
having the amino acid sequence of SEQ ID NO:2.
140. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:3.
having the amino acid sequence of SEQ ID NO:3.
141. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:22.
having the amino acid sequence of SEQ ID NO:22.
142. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:4.
having the amino acid sequence of SEQ ID NO:4.
143. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:5.
having the amino acid sequence of SEQ ID NO:5.
144. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:23.
having the amino acid sequence of SEQ ID NO:23.
145. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:24.
having the amino acid sequence of SEQ ID NO:24.
146. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:6.
having the amino acid sequence of SEQ ID NO:6.
147. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:25.
having the amino acid sequence of SEQ ID NO:25.
148. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:26.
having the amino acid sequence of SEQ ID NO:26.
149. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:27.
having the amino acid sequence of SEQ ID NO:27.
150. The method of claim 134, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:28.
having the amino acid sequence of SEQ ID NO:28.
151. A method of diagnosing a malignancy of hematopoietic cell lineage in a subject, comprising:
(a) contacting the nucleic acid of claim 56 with a biological sample of the subject under conditions that allow the nucleic acid to hybridize with an FcRH in the biological sample;
(b) detecting the amount or pattern of binding by the nucleic acid, changes in the amount or pattern of binding as compared to binding in a control sample indicating a malignancy of hematopoietic cell lineage in the subject.
(a) contacting the nucleic acid of claim 56 with a biological sample of the subject under conditions that allow the nucleic acid to hybridize with an FcRH in the biological sample;
(b) detecting the amount or pattern of binding by the nucleic acid, changes in the amount or pattern of binding as compared to binding in a control sample indicating a malignancy of hematopoietic cell lineage in the subject.
152. The method of claim 151, wherein the malignancy of hematopoietic cell lineage is a malignancy of B cell lineage.
153. The method of claim 151, wherein the malignancy of hematopoietic cell lineage is a malignancy of T cell lineage.
154. A method of diagnosing a malignancy of hematopoietic cell lineage in a subject, comprising:
(a) contacting the nucleic acid of claim 67 with a biological sample of the subject under conditions that allow the nucleic acid to hybridize with the biological sample;
(b) detecting the amount or pattern of binding by the nucleic acid, changes in the amount or pattern of binding as compared to binding in a control sample indicating a malignancy of hematopoietic cell lineage.
(a) contacting the nucleic acid of claim 67 with a biological sample of the subject under conditions that allow the nucleic acid to hybridize with the biological sample;
(b) detecting the amount or pattern of binding by the nucleic acid, changes in the amount or pattern of binding as compared to binding in a control sample indicating a malignancy of hematopoietic cell lineage.
155. The method of claim 154, wherein the malignancy of hematopoietic cell lineage is a malignancy of B cell lineage.
156. The method of claim 154, wherein the malignancy of hematopoietic cell lineage is a malignancy of T cell lineage.
157. A method of diagnosing a malignancy of hematopoietic cell lineage in a subject, comprising:
(a) contacting the nucleic acid of claim 84 with a biological sample of the subject under conditions that allow the nucleic acid to hybridize with the biological sample;
(b) detecting the amount or pattern of binding by the nucleic acid, changes in the amount or pattern of binding as compared to binding in a control sample indicating a malignancy of hematopoietic cell lineage.
(a) contacting the nucleic acid of claim 84 with a biological sample of the subject under conditions that allow the nucleic acid to hybridize with the biological sample;
(b) detecting the amount or pattern of binding by the nucleic acid, changes in the amount or pattern of binding as compared to binding in a control sample indicating a malignancy of hematopoietic cell lineage.
158. The method of claim 157, wherein the malignancy of hematopoietic cell lineage is a malignancy of B cell lineage.
159. The method of claim 157, wherein the malignancy of hematopoietic cell lineage is a malignancy of T cell lineage.
160. A method of diagnosing a malignancy of hematopoietic cell lineage in a subject, comprising:
(a) contacting the nucleic acid of claim 91 with a biological sample of the subject under conditions that allow the nucleic acid to hybridize with the biological sample;
(b) detecting the amount or pattern of binding by the nucleic acid, changes in the amount or pattern of binding as compared to binding in a control sample indicating a malignancy of hematopoietic cell lineage.
(a) contacting the nucleic acid of claim 91 with a biological sample of the subject under conditions that allow the nucleic acid to hybridize with the biological sample;
(b) detecting the amount or pattern of binding by the nucleic acid, changes in the amount or pattern of binding as compared to binding in a control sample indicating a malignancy of hematopoietic cell lineage.
161. The method of claim 160, wherein the malignancy of hematopoietic cell lineage is a malignancy of B cell lineage.
162. The method of claim 160, wherein the malignancy of hematopoietic cell lineage is a malignancy of T cell lineage.
163. A method of treating a malignancy of hematopoietic cell lineage in a subject, comprising contacting the subject's malignant cells with a therapeutically effective amount of the antibody of claim 113.
164. The method of claim 163, wherein the malignancy of hematopoietic cell lineage is a malignancy of B cell lineage.
165. The method of claim 163, wherein the malignancy of hematopoietic cell lineage is a malignancy of T cell lineage.
166. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:1.
having the amino acid sequence of SEQ ID NO:1.
167. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:21.
having the amino acid sequence of SEQ ID NO:21.
168. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:2.
having the amino acid sequence of SEQ ID NO:2.
169. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:3.
having the amino acid sequence of SEQ ID NO:3.
170. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:22.
having the amino acid sequence of SEQ ID NO:22.
171. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:4.
having the amino acid sequence of SEQ ID NO:4.
172. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:5.
having the amino acid sequence of SEQ ID NO:5.
173. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:23.
having the amino acid sequence of SEQ ID NO:23.
174. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:24.
having the amino acid sequence of SEQ ID NO:24.
175. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:6.
having the amino acid sequence of SEQ ID NO:6.
176. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:25.
having the amino acid sequence of SEQ ID NO:25.
177. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:26.
having the amino acid sequence of SEQ ID NO:26.
178. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:27.
having the amino acid sequence of SEQ ID NO:27.
179. The method of claim 163, wherein the antibody selectively binds an FcRH
having the amino acid sequence of SEQ ID NO:28.
having the amino acid sequence of SEQ ID NO:28.
180. A method of treating a malignancy of hematopoietic cell lineage in a subject, comprising contacting the subject's malignant cells with a therapeutically effective amount of the nucleic acid of claim 56.
181. A method of treating a malignancy of hematopoietic cell lineage in a subject, comprising contacting the subject's malignant cells with a therapeutically effective amount of the nucleic acid of claim 67.
182. A method of treating a malignancy of hematopoietic cell lineage in a subject, comprising contacting the subject's malignant cells with a therapeutically effective amount of the nucleic acid of claim 84.
183. A method of treating a malignancy of hematopoietic cell lineage in a subject, comprising contacting the subject's malignant cells with a therapeutically effective amount of the nucleic acid of claim 91.
184. A method of diagnosing an autoimmune disease in a subject, comprising:
(a) contacting a biological sample of the subject with the antibody of claim 113 under conditions that allow the antibody to bind to FcRH in the biological sample;
(b) detecting the amount or pattern of binding by the antibody, changes in the amount or pattern of binding as compared to binding in a control sample indicating an autoimmune disease in the subject.
(a) contacting a biological sample of the subject with the antibody of claim 113 under conditions that allow the antibody to bind to FcRH in the biological sample;
(b) detecting the amount or pattern of binding by the antibody, changes in the amount or pattern of binding as compared to binding in a control sample indicating an autoimmune disease in the subject.
185. A method of diagnosing an autoimmune disease in a subject, comprising:
(a) contacting the nucleic acid of claim 56 with a biological sample of the subject under conditions that allow the nucleic acid to hybridize with the biological sample;
(b) detecting the amount or pattern of binding by the nucleic acid, changes in the amount or pattern of binding as compared to binding in a control sample indicating an autoimmune disease.
(a) contacting the nucleic acid of claim 56 with a biological sample of the subject under conditions that allow the nucleic acid to hybridize with the biological sample;
(b) detecting the amount or pattern of binding by the nucleic acid, changes in the amount or pattern of binding as compared to binding in a control sample indicating an autoimmune disease.
186. A method of diagnosing an autoimmune disease in a subject, comprising:
(a) contacting the nucleic acid of claim 67 with a biological sample of the subject under conditions that allow the nucleic acid to hybridize with the biological sample;
(b) detecting the amount or pattern of binding by the nucleic acid, changes in the amount or pattern of binding as compared to binding in a control sample indicating an autoimmune disease.
(a) contacting the nucleic acid of claim 67 with a biological sample of the subject under conditions that allow the nucleic acid to hybridize with the biological sample;
(b) detecting the amount or pattern of binding by the nucleic acid, changes in the amount or pattern of binding as compared to binding in a control sample indicating an autoimmune disease.
187. A method of diagnosing an autoimmune disease in a subject, comprising:
(a) contacting the nucleic acid of claim 84 with a biological sample of the subject under conditions that allow the nucleic acid to hybridize with the biological sample;
(b) detecting the amount or pattern of binding by the nucleic acid, changes in the amount or pattern of binding as compared to binding in a control sample indicating an autoimmune disease.
(a) contacting the nucleic acid of claim 84 with a biological sample of the subject under conditions that allow the nucleic acid to hybridize with the biological sample;
(b) detecting the amount or pattern of binding by the nucleic acid, changes in the amount or pattern of binding as compared to binding in a control sample indicating an autoimmune disease.
188. A method of diagnosing an autoimmune disease in a subject, comprising:
(a) contacting the nucleic acid of claim 91 with a biological sample of the subject under conditions that allow the nucleic acid to hybridize with the biological sample;
(b) detecting the amount or pattern of binding by the nucleic acid, changes in the amount or pattern of binding as compared to binding in a control sample indicating an autoimmune disease.
(a) contacting the nucleic acid of claim 91 with a biological sample of the subject under conditions that allow the nucleic acid to hybridize with the biological sample;
(b) detecting the amount or pattern of binding by the nucleic acid, changes in the amount or pattern of binding as compared to binding in a control sample indicating an autoimmune disease.
189. A method of treating an autoimmune disease in a subject, comprising contacting, with a therapeutically effective amount of the antibody of claim 113, one or more FcRH expressing cells of the subject.
190. A method of treating an autoimmune disease in a subject, comprising contacting, with a therapeutically effective amount of the nucleic acid of claim 56, FcRH expressing cells of the subject.
191. A method of treating an autoimmune disease in a subject, comprising contacting, with a therapeutically effective amount of the nucleic acid of claim 67, FcRH expressing cells of the subject.
192. A method of treating an autoimmune disease in a subject, comprising contacting, with a therapeutically effective amount of the nucleic acid of claim 84, FcRH expressing cells of the subject.
193. A method of treating an autoimmune disease in a subject, comprising contacting, with a therapeutically effective amount of the nucleic acid of claim 91, FcRH expressing cells of the subject.
194. A method of modulating a humoral immune response in a subject, comprising administering to the subject the isolated FcRH of claim 1.
195. A method of modulating a humoral immune response in a subject, comprising administering to the subject the antibody of claim 113.
196. A method of modulating a humoral immune response in a subject, comprising administering to the subject the nucleic acid of claim 56.
197. A method of modulating a humoral immune response in a subject, comprising administering to the subject the nucleic acid of claim 67.
198. A method of modulating a humoral immune response in a subject, comprising administering to the subject the nucleic acid of claim 84.
199. A method of modulating a humoral immune response in a subject, comprising administering to the subject the nucleic acid of claim 91.
200. An isolated mouse FcRH isoform of FcRH1, wherein the isoform lacks a cytoplasmic region.
201. A polypeptide comprising the amino acid sequence of SEQ ID NO:70.
202. A polypeptide comprising the amino acid of SEQ ID NO:70 with conservative amino acid substitutions.
203. An isolated mouse FcRH isoform of FcRH2, wherein the FcRH lacks a transmembrane region.
204. A polypeptide comprising the amino acid sequence of SEQ ID NO:73.
205. A polypeptide comprising the amino acid of SEQ ID NO:73 with conservative amino acid substitutions.
206. A polypeptide comprising the amino acid of SEQ ID NO:.77.
207. A polypeptide comprising the amino acid of SEQ ID NO:77 with conservative amino acid substitutions.
208. A polypeptide comprising the amino acid sequence of SEQ ID NO:78.
209. A polypeptide comprising the amino acid sequence of SEQ ID NO:78 with conservative amino acid substitutions.
210. A nucleic acid encoding the isolated mouse FcRH isoform of claim 200.
211. A nucleic acid encoding the isolated mouse FcRH isoform of claim 203.
212. A nucleic acid encoding the polypeptide of claim 201.
213. A nucleic acid encoding the polypeptide of claim 202.
214. A nucleic acid encoding the polypeptide of claim 204.
215. A nucleic acid encoding the polypeptide of claim 205.
216. A nucleic acid encoding the polypeptide of claim 206.
217. A nucleic acid encoding the polypeptide of claim 207.
218. A nucleic acid encoding the polypeptide of claim 208.
219. A nucleic acid encoding the polypeptide of claim 209.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US36766702P | 2002-03-25 | 2002-03-25 | |
US60/367,667 | 2002-03-25 | ||
PCT/US2003/009600 WO2003089624A2 (en) | 2002-03-25 | 2003-03-25 | Fc receptor homolog, reagents, and uses thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CA2480404A1 true CA2480404A1 (en) | 2003-10-30 |
Family
ID=29250463
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002480404A Abandoned CA2480404A1 (en) | 2002-03-25 | 2003-03-25 | Fc receptor homolog, reagents, and uses thereof |
Country Status (6)
Country | Link |
---|---|
US (2) | US7317087B2 (en) |
EP (1) | EP1490085A2 (en) |
JP (1) | JP2005521429A (en) |
AU (1) | AU2003245239A1 (en) |
CA (1) | CA2480404A1 (en) |
WO (1) | WO2003089624A2 (en) |
Families Citing this family (101)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7858330B2 (en) | 2001-10-19 | 2010-12-28 | Genentech, Inc. | Compositions and methods for the treatment of tumor of hematopoietic origin |
US7662925B2 (en) | 2002-03-01 | 2010-02-16 | Xencor, Inc. | Optimized Fc variants and methods for their generation |
US20040132101A1 (en) | 2002-09-27 | 2004-07-08 | Xencor | Optimized Fc variants and methods for their generation |
US8188231B2 (en) | 2002-09-27 | 2012-05-29 | Xencor, Inc. | Optimized FC variants |
US7317091B2 (en) | 2002-03-01 | 2008-01-08 | Xencor, Inc. | Optimized Fc variants |
US7317087B2 (en) | 2002-03-25 | 2008-01-08 | The Uab Research Foundation | Members of the FC receptor homolog gene family (FCRH1-3, 6), related reagents, and uses thereof |
US20090010920A1 (en) | 2003-03-03 | 2009-01-08 | Xencor, Inc. | Fc Variants Having Decreased Affinity for FcyRIIb |
US8084582B2 (en) | 2003-03-03 | 2011-12-27 | Xencor, Inc. | Optimized anti-CD20 monoclonal antibodies having Fc variants |
US8388955B2 (en) | 2003-03-03 | 2013-03-05 | Xencor, Inc. | Fc variants |
US9051373B2 (en) | 2003-05-02 | 2015-06-09 | Xencor, Inc. | Optimized Fc variants |
US8101720B2 (en) | 2004-10-21 | 2012-01-24 | Xencor, Inc. | Immunoglobulin insertions, deletions and substitutions |
US9714282B2 (en) | 2003-09-26 | 2017-07-25 | Xencor, Inc. | Optimized Fc variants and methods for their generation |
CN1886506B (en) * | 2003-09-29 | 2012-05-30 | 中外制药株式会社 | Protein expressed in nk cell |
SG149815A1 (en) | 2003-11-06 | 2009-02-27 | Seattle Genetics Inc | Monomethylvaline compounds capable of conjugation to ligands |
JP4658967B2 (en) * | 2003-12-24 | 2011-03-23 | ジェネンテック, インコーポレイテッド | Compositions and methods for the treatment of tumors of hematopoietic origin |
US20050266008A1 (en) * | 2004-03-29 | 2005-12-01 | Medarex, Inc. | Human anti-IRTA-5 antibodies |
MXPA06014065A (en) | 2004-06-01 | 2007-01-31 | Genentech Inc | Antibody drug conjugates and methods. |
US20150010550A1 (en) | 2004-07-15 | 2015-01-08 | Xencor, Inc. | OPTIMIZED Fc VARIANTS |
CN101065151B (en) | 2004-09-23 | 2014-12-10 | 健泰科生物技术公司 | Cysteine engineered antibodies and conjugates |
US20100111856A1 (en) | 2004-09-23 | 2010-05-06 | Herman Gill | Zirconium-radiolabeled, cysteine engineered antibody conjugates |
US20080131433A1 (en) * | 2004-09-27 | 2008-06-05 | The Uab Research Foundation | Fc Receptor Homolog Antibodies And Uses Thereof |
WO2006039135A2 (en) * | 2004-09-29 | 2006-04-13 | Medarex, Inc. | Irta-4 antibodies and their uses |
WO2006039238A2 (en) | 2004-09-30 | 2006-04-13 | The Goverment Of The United States Of America As Represented By The Secretary Of The Department Of Health And Human Services | Irta2 antibodies and methods of use |
US8367805B2 (en) | 2004-11-12 | 2013-02-05 | Xencor, Inc. | Fc variants with altered binding to FcRn |
US8546543B2 (en) | 2004-11-12 | 2013-10-01 | Xencor, Inc. | Fc variants that extend antibody half-life |
EP2325207B1 (en) | 2004-11-12 | 2017-03-15 | Xencor, Inc. | FC variants with altered binding to FCRN |
US8802820B2 (en) | 2004-11-12 | 2014-08-12 | Xencor, Inc. | Fc variants with altered binding to FcRn |
JP4993645B2 (en) | 2004-12-01 | 2012-08-08 | ジェネンテック, インコーポレイテッド | Antibody drug conjugates and methods |
DK1931709T3 (en) | 2005-10-03 | 2017-03-13 | Xencor Inc | FC VARIETIES WITH OPTIMIZED FC RECEPTOR BINDING PROPERTIES |
US7973136B2 (en) | 2005-10-06 | 2011-07-05 | Xencor, Inc. | Optimized anti-CD30 antibodies |
ME01786B (en) | 2006-08-14 | 2014-09-20 | Xencor Inc | Optimized antibodies that target cd19 |
US8394374B2 (en) | 2006-09-18 | 2013-03-12 | Xencor, Inc. | Optimized antibodies that target HM1.24 |
WO2008103905A2 (en) * | 2007-02-23 | 2008-08-28 | The Government Of The United States Of America As Represented By The Secretary Of The Department Of Health And Human Services | Antibodies that specifically bind irta and methods of use |
US20100285007A1 (en) * | 2007-09-14 | 2010-11-11 | The Uab Research Foundation | Differential diagnosis of b-cell chronic lymphocytic leukemia |
HUE024903T2 (en) | 2007-12-26 | 2016-02-29 | Xencor Inc | Fc variants with altered binding to fcrn |
WO2011028952A1 (en) | 2009-09-02 | 2011-03-10 | Xencor, Inc. | Compositions and methods for simultaneous bivalent and monovalent co-engagement of antigens |
IN2012DN03025A (en) | 2009-09-09 | 2015-07-31 | Ct Se Llc | |
US8362210B2 (en) | 2010-01-19 | 2013-01-29 | Xencor, Inc. | Antibody variants with enhanced complement activity |
EA024730B1 (en) | 2010-04-15 | 2016-10-31 | Медимьюн Лимитед | Pyrrolobenzodiazepine compounds, conjugates thereof, pharmaceutical compositions comprising said conjugates, and use of said conjugates |
KR101839163B1 (en) | 2010-06-08 | 2018-03-15 | 제넨테크, 인크. | Cysteine engineered antibodies and conjugates |
WO2012074757A1 (en) | 2010-11-17 | 2012-06-07 | Genentech, Inc. | Alaninyl maytansinol antibody conjugates |
CA2833212C (en) | 2011-05-12 | 2020-06-09 | Genentech, Inc. | Multiple reaction monitoring lc-ms/ms method to detect therapeutic antibodies in animal samples using framework signature peptides |
KR101877598B1 (en) | 2011-10-14 | 2018-07-11 | 메디뮨 리미티드 | Pyrrolobenzodiazepines and conjugates thereof |
WO2013130093A1 (en) | 2012-03-02 | 2013-09-06 | Genentech, Inc. | Biomarkers for treatment with anti-tubulin chemotherapeutic compounds |
EP2906251B1 (en) | 2012-10-12 | 2017-09-27 | ADC Therapeutics SA | Pyrrolobenzodiazepine-anti-cd22 antibody conjugates |
LT2839860T (en) | 2012-10-12 | 2019-07-10 | Medimmune Limited | Pyrrolobenzodiazepines and conjugates thereof |
EP2906296B1 (en) | 2012-10-12 | 2018-03-21 | ADC Therapeutics SA | Pyrrolobenzodiazepine-antibody conjugates |
HUE042731T2 (en) | 2012-10-12 | 2019-07-29 | Adc Therapeutics Sa | Pyrrolobenzodiazepine-antibody conjugates |
US10695433B2 (en) | 2012-10-12 | 2020-06-30 | Medimmune Limited | Pyrrolobenzodiazepine-antibody conjugates |
ES2680153T3 (en) | 2012-10-12 | 2018-09-04 | Adc Therapeutics Sa | Anti-PSMA-pyrrolobenzodiazepine antibody conjugates |
TR201815418T4 (en) | 2012-10-12 | 2018-11-21 | Adc Therapeutics Sa | Pyrrolobenzodiazepine -anti-psma antibody conjugates. |
EP2935268B2 (en) | 2012-12-21 | 2021-02-17 | MedImmune Limited | Pyrrolobenzodiazepines and conjugates thereof |
US9567340B2 (en) | 2012-12-21 | 2017-02-14 | Medimmune Limited | Unsymmetrical pyrrolobenzodiazepines-dimers for use in the treatment of proliferative and autoimmune diseases |
CA2905181C (en) | 2013-03-13 | 2020-06-02 | Medimmune Limited | Pyrrolobenzodiazepines and conjugates thereof for providing targeted therapy |
KR102066318B1 (en) | 2013-03-13 | 2020-01-14 | 메디뮨 리미티드 | Pyrrolobenzodiazepines and conjugates thereof |
US9821074B2 (en) | 2013-03-13 | 2017-11-21 | Genentech, Inc. | Pyrrolobenzodiazepines and conjugates thereof |
TWI636792B (en) | 2013-08-12 | 2018-10-01 | 建南德克公司 | 1-(chloromethyl)-2,3-dihydro-1h-benzo[e]indole dimer antibody-drug conjugate compounds, and methods of use and treatment |
WO2015052535A1 (en) | 2013-10-11 | 2015-04-16 | Spirogen Sàrl | Pyrrolobenzodiazepine-antibody conjugates |
GB201317982D0 (en) | 2013-10-11 | 2013-11-27 | Spirogen Sarl | Pyrrolobenzodiazepines and conjugates thereof |
WO2015052532A1 (en) | 2013-10-11 | 2015-04-16 | Spirogen Sàrl | Pyrrolobenzodiazepine-antibody conjugates |
WO2015052534A1 (en) | 2013-10-11 | 2015-04-16 | Spirogen Sàrl | Pyrrolobenzodiazepine-antibody conjugates |
CN105828840B (en) | 2013-12-16 | 2020-08-04 | 基因泰克公司 | 1- (chloromethyl) -2, 3-dihydro-1H-benzo [ e ] indole dimer antibody-drug conjugate compounds and methods of use and treatment |
CN107106700B (en) | 2013-12-16 | 2020-10-30 | 基因泰克公司 | Peptidomimetic compounds and antibody-drug conjugates thereof |
JP6671292B2 (en) | 2013-12-16 | 2020-03-25 | ジェネンテック, インコーポレイテッド | Peptidomimetic compounds and antibody-drug conjugates thereof |
WO2016037644A1 (en) | 2014-09-10 | 2016-03-17 | Medimmune Limited | Pyrrolobenzodiazepines and conjugates thereof |
EP3191521A2 (en) | 2014-09-12 | 2017-07-19 | F. Hoffmann-La Roche AG | Cysteine engineered antibodies and conjugates |
GB201416112D0 (en) | 2014-09-12 | 2014-10-29 | Medimmune Ltd | Pyrrolobenzodiazepines and conjugates thereof |
CN106714844B (en) | 2014-09-12 | 2022-08-05 | 基因泰克公司 | Anthracycline disulfide intermediates, antibody-drug conjugates and methods |
EP3235820A1 (en) | 2014-09-17 | 2017-10-25 | Genentech, Inc. | Pyrrolobenzodiazepines and antibody disulfide conjugates thereof |
AU2015352545B2 (en) | 2014-11-25 | 2020-10-15 | Adc Therapeutics Sa | Pyrrolobenzodiazepine-antibody conjugates |
WO2016090050A1 (en) | 2014-12-03 | 2016-06-09 | Genentech, Inc. | Quaternary amine compounds and antibody-drug conjugates thereof |
GB201506411D0 (en) | 2015-04-15 | 2015-05-27 | Bergenbio As | Humanized anti-axl antibodies |
GB201506402D0 (en) | 2015-04-15 | 2015-05-27 | Berkel Patricius H C Van And Howard Philip W | Site-specific antibody-drug conjugates |
MA43345A (en) | 2015-10-02 | 2018-08-08 | Hoffmann La Roche | PYRROLOBENZODIAZEPINE ANTIBODY-DRUG CONJUGATES AND METHODS OF USE |
MA43354A (en) | 2015-10-16 | 2018-08-22 | Genentech Inc | CONJUGATE DRUG CONJUGATES WITH CLOUDY DISULPHIDE |
MA45326A (en) | 2015-10-20 | 2018-08-29 | Genentech Inc | CALICHEAMICIN-ANTIBODY-DRUG CONJUGATES AND METHODS OF USE |
GB201601431D0 (en) | 2016-01-26 | 2016-03-09 | Medimmune Ltd | Pyrrolobenzodiazepines |
GB201602356D0 (en) | 2016-02-10 | 2016-03-23 | Medimmune Ltd | Pyrrolobenzodiazepine Conjugates |
GB201602359D0 (en) | 2016-02-10 | 2016-03-23 | Medimmune Ltd | Pyrrolobenzodiazepine Conjugates |
CN108700598A (en) | 2016-03-25 | 2018-10-23 | 豪夫迈·罗氏有限公司 | The drug of the total antibody of multichannel and antibody conjugate quantifies measuring method |
GB201607478D0 (en) | 2016-04-29 | 2016-06-15 | Medimmune Ltd | Pyrrolobenzodiazepine Conjugates |
PL3458101T3 (en) | 2016-05-20 | 2021-05-31 | F. Hoffmann-La Roche Ag | Protac antibody conjugates and methods of use |
EP3465221B1 (en) | 2016-05-27 | 2020-07-22 | H. Hoffnabb-La Roche Ag | Bioanalytical method for the characterization of site-specific antibody-drug conjugates |
JP7043425B2 (en) | 2016-06-06 | 2022-03-29 | ジェネンテック, インコーポレイテッド | Silvestrol Antibodies-Drug Conjugates and Usage |
EP3496763A1 (en) | 2016-08-11 | 2019-06-19 | Genentech, Inc. | Pyrrolobenzodiazepine prodrugs and antibody conjugates thereof |
JP7050770B2 (en) | 2016-10-05 | 2022-04-08 | エフ・ホフマン-ラ・ロシュ・アクチェンゲゼルシャフト | Method for preparing antibody drug conjugate |
GB201617466D0 (en) | 2016-10-14 | 2016-11-30 | Medimmune Ltd | Pyrrolobenzodiazepine conjugates |
GB201702031D0 (en) | 2017-02-08 | 2017-03-22 | Medlmmune Ltd | Pyrrolobenzodiazepine-antibody conjugates |
LT3544636T (en) | 2017-02-08 | 2021-06-25 | Adc Therapeutics Sa | Pyrrolobenzodiazepine-antibody conjugates |
EP3612537B1 (en) | 2017-04-18 | 2022-07-13 | Medimmune Limited | Pyrrolobenzodiazepine conjugates |
WO2018193102A1 (en) | 2017-04-20 | 2018-10-25 | Adc Therapeutics Sa | Combination therapy with an anti-axl antibody-drug conjugate |
UA127900C2 (en) | 2017-06-14 | 2024-02-07 | Ейдісі Терапьютікс Са | Dosage regimes for the administration of an anti-cd19 adc |
MX2020001880A (en) | 2017-08-18 | 2021-07-06 | Medimmune Ltd | Pyrrolobenzodiazepine conjugates. |
WO2019060398A1 (en) | 2017-09-20 | 2019-03-28 | Ph Pharma Co., Ltd. | Thailanstatin analogs |
GB201803342D0 (en) | 2018-03-01 | 2018-04-18 | Medimmune Ltd | Methods |
GB201806022D0 (en) | 2018-04-12 | 2018-05-30 | Medimmune Ltd | Pyrrolobenzodiazepines and conjugates thereof |
GB201814281D0 (en) | 2018-09-03 | 2018-10-17 | Femtogenix Ltd | Cytotoxic agents |
EP3870235A1 (en) | 2018-10-24 | 2021-09-01 | F. Hoffmann-La Roche AG | Conjugated chemical inducers of degradation and methods of use |
CN113227119A (en) | 2018-12-10 | 2021-08-06 | 基因泰克公司 | Photocrosslinked peptides for site-specific conjugation to Fc-containing proteins |
GB201901197D0 (en) | 2019-01-29 | 2019-03-20 | Femtogenix Ltd | G-A Crosslinking cytotoxic agents |
GB2597532A (en) | 2020-07-28 | 2022-02-02 | Femtogenix Ltd | Cytotoxic compounds |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
IT1112395B (en) * | 1979-03-23 | 1986-01-13 | Giovanetti Fiorello | JUNCTION DEVICE SUITABLE FOR JOINING TWO PIECES OR PANELS IN A REMOVABLE WAY |
US4342566A (en) * | 1980-02-22 | 1982-08-03 | Scripps Clinic & Research Foundation | Solid phase anti-C3 assay for detection of immune complexes |
US4816567A (en) * | 1983-04-08 | 1989-03-28 | Genentech, Inc. | Recombinant immunoglobin preparations |
US4980286A (en) * | 1985-07-05 | 1990-12-25 | Whitehead Institute For Biomedical Research | In vivo introduction and expression of foreign genetic material in epithelial cells |
EP0228458B2 (en) * | 1985-07-05 | 1997-10-22 | Whitehead Institute For Biomedical Research | Epithelial cells expressing foreign genetic material |
EP0732397A3 (en) | 1988-02-05 | 1996-10-23 | Whitehead Institute For Biomedical Research | Modified hepatocytes and uses therefor |
EP0432216A1 (en) | 1988-09-01 | 1991-06-19 | Whitehead Institute For Biomedical Research | Recombinant retroviruses with amphotropic and ecotropic host ranges |
WO1994004679A1 (en) | 1991-06-14 | 1994-03-03 | Genentech, Inc. | Method for making humanized antibodies |
DE69405102T2 (en) | 1993-06-03 | 1998-01-15 | Therapeutic Antibodies Inc | ANTIBODY FRAGMENTS IN THERAPY |
ES2568625T3 (en) * | 1999-11-29 | 2016-05-03 | The Trustees Of Columbia University In The City Of New York | Isolation of five novel genes encoding new Fc receptor-type melanomas involved in the pathogenesis of lymphoma / melanoma |
EP1254221A2 (en) * | 2000-01-06 | 2002-11-06 | Protegene Inc. | HUMAN PROTEINS HAVING HYDROPHOBIC DOMAINS AND DNAs ENCODING THESE PROTEINS |
US20030078396A1 (en) | 2000-03-01 | 2003-04-24 | Corixa Corporation | Compositions and methods for the detection, diagnosis and therapy of hematological malignancies |
EP1201681A1 (en) * | 2000-10-30 | 2002-05-02 | Millennium Pharmaceuticals, Inc. | "Fail" molecules and uses thereof |
US7317087B2 (en) | 2002-03-25 | 2008-01-08 | The Uab Research Foundation | Members of the FC receptor homolog gene family (FCRH1-3, 6), related reagents, and uses thereof |
JP4658967B2 (en) | 2003-12-24 | 2011-03-23 | ジェネンテック, インコーポレイテッド | Compositions and methods for the treatment of tumors of hematopoietic origin |
US20080131433A1 (en) | 2004-09-27 | 2008-06-05 | The Uab Research Foundation | Fc Receptor Homolog Antibodies And Uses Thereof |
-
2003
- 2003-03-25 US US10/508,374 patent/US7317087B2/en not_active Expired - Fee Related
- 2003-03-25 AU AU2003245239A patent/AU2003245239A1/en not_active Abandoned
- 2003-03-25 CA CA002480404A patent/CA2480404A1/en not_active Abandoned
- 2003-03-25 JP JP2003586337A patent/JP2005521429A/en not_active Withdrawn
- 2003-03-25 WO PCT/US2003/009600 patent/WO2003089624A2/en not_active Application Discontinuation
- 2003-03-25 EP EP03738876A patent/EP1490085A2/en not_active Withdrawn
-
2007
- 2007-08-07 US US11/835,226 patent/US8648171B2/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
WO2003089624A2 (en) | 2003-10-30 |
US7317087B2 (en) | 2008-01-08 |
US20080089887A1 (en) | 2008-04-17 |
US20050240006A1 (en) | 2005-10-27 |
JP2005521429A (en) | 2005-07-21 |
AU2003245239A1 (en) | 2003-11-03 |
EP1490085A2 (en) | 2004-12-29 |
US8648171B2 (en) | 2014-02-11 |
WO2003089624A3 (en) | 2004-02-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7317087B2 (en) | Members of the FC receptor homolog gene family (FCRH1-3, 6), related reagents, and uses thereof | |
KR101316990B1 (en) | ANTI-α9 INTEGRIN ANTIBODY AND THE USE THEREOF | |
US20030211517A1 (en) | Gp354 nucleic acids and polypeptides | |
US5594120A (en) | Integrin alpha subunit | |
JP2009240311A (en) | Receptor on surface of activated t-cell: act-4 | |
JP2007159580A (en) | Lymphocyte activation antigen hb15: member of immunoglobulin superfamily | |
US20070065875A1 (en) | NTB-A, a surface molecule involved in natural killer cells activity | |
US20020123617A1 (en) | Novel immunoglobulin superfamily members of APEX-1, APEX-2 and APEX-3 and uses thereof | |
JPH11225774A (en) | Member of immunoglobulin gene superfamily, pigr-1 | |
WO1997021450A1 (en) | Ox-2 costimulatory molecule | |
AU2002325333A1 (en) | NTB-A, a surface molecule involved in natural killer cells activity | |
EP0856009A1 (en) | Cd6 ligand | |
US20030165495A1 (en) | Nucleic acids and polypeptides | |
EP1207905B1 (en) | Methods and compositions for treatment of inflammatory disease using cadherin-11 modulating agents | |
AU2001272982B2 (en) | GP286 nucleic acids and polypeptides | |
JP2002537272A (en) | Adjuvants and cell maturation substances | |
JP4224395B2 (en) | PanCAM nucleic acids and polypeptides | |
CA2472111A1 (en) | Human mast cell-expressed membrane proteins | |
JP2004500872A (en) | GP354 nucleic acids and polypeptides | |
US5700658A (en) | ICAM-4 materials and methods | |
WO2004048574A1 (en) | Immunoreceptor proteins | |
AU2001272982A1 (en) | GP286 nucleic acids and polypeptides | |
US20020146767A1 (en) | Human EMR1-like G protein-coupled receptor |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FZDE | Discontinued |