US20100175144A1 - Cinnamyl-alcohol dehydrogenases
- Publication number
- US20100175144A1 US12/575,991
- Authority
- US
- United States
- Prior art keywords
- seq
- cad
- plant
- sorghum
- nucleic acid
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
- 108010061190 Cinnamyl-alcohol dehydrogenase Proteins 0.000 title description 162
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 230
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 226
- 229920001184 polypeptide Polymers 0.000 claims abstract description 225
- 150000007523 nucleic acids Chemical class 0.000 claims abstract description 193
- 102000039446 nucleic acids Human genes 0.000 claims abstract description 184
- 108020004707 nucleic acids Proteins 0.000 claims abstract description 184
- 238000000034 method Methods 0.000 claims abstract description 152
- 235000011684 Sorghum saccharatum Nutrition 0.000 claims abstract description 103
- 108700028369 Alleles Proteins 0.000 claims abstract description 92
- 238000009395 breeding Methods 0.000 claims abstract description 42
- 230000001488 breeding effect Effects 0.000 claims abstract description 39
- 238000003205 genotyping method Methods 0.000 claims abstract description 24
- 102100034581 Dihydroorotase Human genes 0.000 claims abstract 39
- 241000196324 Embryophyta Species 0.000 claims description 341
- 240000006394 Sorghum bicolor Species 0.000 claims description 126
- 239000002773 nucleotide Substances 0.000 claims description 109
- 125000003729 nucleotide group Chemical group 0.000 claims description 104
- 230000009261 transgenic effect Effects 0.000 claims description 76
- 150000001413 amino acids Chemical class 0.000 claims description 61
- 229920005610 lignin Polymers 0.000 claims description 52
- 230000001105 regulatory effect Effects 0.000 claims description 50
- 108091034117 Oligonucleotide Proteins 0.000 claims description 40
- 239000000523 sample Substances 0.000 claims description 35
- 102000040430 polynucleotide Human genes 0.000 claims description 29
- 108091033319 polynucleotide Proteins 0.000 claims description 29
- 239000002157 polynucleotide Substances 0.000 claims description 29
- 239000003550 marker Substances 0.000 claims description 25
- 239000002028 Biomass Substances 0.000 claims description 24
- 239000000203 mixture Substances 0.000 claims description 22
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 claims description 20
- 230000000694 effects Effects 0.000 claims description 19
- 230000001965 increasing effect Effects 0.000 claims description 16
- 230000003247 decreasing effect Effects 0.000 claims description 15
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 claims description 10
- 230000007423 decrease Effects 0.000 claims description 10
- 229940113082 thymine Drugs 0.000 claims description 10
- 238000003976 plant breeding Methods 0.000 claims description 9
- 229920002488 Hemicellulose Polymers 0.000 claims description 5
- 239000001913 cellulose Substances 0.000 claims description 5
- 229920002678 cellulose Polymers 0.000 claims description 5
- 229920001503 Glucan Polymers 0.000 claims description 3
- 239000012472 biological sample Substances 0.000 claims description 3
- 102100040999 Catechol O-methyltransferase Human genes 0.000 claims 6
- 108020002739 Catechol O-methyltransferase Proteins 0.000 claims 6
- 125000003275 alpha amino acid group Chemical group 0.000 abstract description 27
- 241000209072 Sorghum Species 0.000 abstract 4
- 210000004027 cell Anatomy 0.000 description 67
- 108090000623 proteins and genes Proteins 0.000 description 63
- 230000014509 gene expression Effects 0.000 description 42
- 230000000875 corresponding effect Effects 0.000 description 37
- 108020004414 DNA Proteins 0.000 description 36
- 210000001519 tissue Anatomy 0.000 description 35
- 241000894007 species Species 0.000 description 32
- 230000000692 anti-sense effect Effects 0.000 description 28
- 238000013518 transcription Methods 0.000 description 28
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 27
- 230000035897 transcription Effects 0.000 description 27
- 230000000306 recurrent effect Effects 0.000 description 25
- 108020004999 messenger RNA Proteins 0.000 description 24
- 230000035772 mutation Effects 0.000 description 22
- 239000013615 primer Substances 0.000 description 21
- 108010067661 Caffeate O-methyltransferase Proteins 0.000 description 20
- 230000000295 complement effect Effects 0.000 description 20
- 230000002068 genetic effect Effects 0.000 description 20
- 108091026890 Coding region Proteins 0.000 description 18
- 238000004519 manufacturing process Methods 0.000 description 18
- 102000054765 polymorphisms of proteins Human genes 0.000 description 18
- 102000004169 proteins and genes Human genes 0.000 description 18
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 17
- 239000012634 fragment Substances 0.000 description 17
- 239000013598 vector Substances 0.000 description 17
- 244000062793 Sorghum vulgare Species 0.000 description 16
- 238000004458 analytical method Methods 0.000 description 16
- 238000003752 polymerase chain reaction Methods 0.000 description 16
- 108091028043 Nucleic acid sequence Proteins 0.000 description 14
- 230000008569 process Effects 0.000 description 13
- 230000009466 transformation Effects 0.000 description 13
- 108010021809 Alcohol dehydrogenase Proteins 0.000 description 12
- 108090000994 Catalytic RNA Proteins 0.000 description 12
- 102000053642 Catalytic RNA Human genes 0.000 description 12
- 240000008042 Zea mays Species 0.000 description 12
- 239000000463 material Substances 0.000 description 12
- 108091092562 ribozyme Proteins 0.000 description 12
- 238000001514 detection method Methods 0.000 description 11
- 102000040650 (ribonucleotides)n+m Human genes 0.000 description 10
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 10
- 238000006243 chemical reaction Methods 0.000 description 10
- 210000000349 chromosome Anatomy 0.000 description 10
- 238000011161 development Methods 0.000 description 9
- 230000018109 developmental process Effects 0.000 description 9
- 238000006467 substitution reaction Methods 0.000 description 9
- -1 10 to 50 amino acids Chemical class 0.000 description 8
- 108020004705 Codon Proteins 0.000 description 8
- 102000004190 Enzymes Human genes 0.000 description 8
- 108090000790 Enzymes Proteins 0.000 description 8
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 8
- 238000003556 assay Methods 0.000 description 8
- 210000002421 cell wall Anatomy 0.000 description 8
- 239000002299 complementary DNA Substances 0.000 description 8
- 238000009396 hybridization Methods 0.000 description 8
- 238000003780 insertion Methods 0.000 description 8
- 230000037431 insertion Effects 0.000 description 8
- 235000009973 maize Nutrition 0.000 description 8
- 238000011144 upstream manufacturing Methods 0.000 description 8
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 7
- 239000007787 solid Substances 0.000 description 7
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 6
- 229920002494 Zein Polymers 0.000 description 6
- 238000003199 nucleic acid amplification method Methods 0.000 description 6
- 238000012552 review Methods 0.000 description 6
- 230000010153 self-pollination Effects 0.000 description 6
- 238000012360 testing method Methods 0.000 description 6
- 238000013519 translation Methods 0.000 description 6
- 230000014616 translation Effects 0.000 description 6
- 230000002792 vascular Effects 0.000 description 6
- 239000005019 zein Substances 0.000 description 6
- 229940093612 zein Drugs 0.000 description 6
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 5
- 108091092878 Microsatellite Proteins 0.000 description 5
- 240000007594 Oryza sativa Species 0.000 description 5
- 235000007164 Oryza sativa Nutrition 0.000 description 5
- 238000012228 RNA interference-mediated gene silencing Methods 0.000 description 5
- 230000003321 amplification Effects 0.000 description 5
- 230000001086 cytosolic effect Effects 0.000 description 5
- 238000012217 deletion Methods 0.000 description 5
- 230000035558 fertility Effects 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 230000009368 gene silencing by RNA Effects 0.000 description 5
- 230000001939 inductive effect Effects 0.000 description 5
- 238000002703 mutagenesis Methods 0.000 description 5
- 231100000350 mutagenesis Toxicity 0.000 description 5
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 5
- 230000002829 reductive effect Effects 0.000 description 5
- 235000000346 sugar Nutrition 0.000 description 5
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 4
- 241000701489 Cauliflower mosaic virus Species 0.000 description 4
- SRBFZHDQGSBBOR-IOVATXLUSA-N D-xylopyranose Chemical compound O[C@@H]1COC(O)[C@H](O)[C@H]1O SRBFZHDQGSBBOR-IOVATXLUSA-N 0.000 description 4
- 102000053602 DNA Human genes 0.000 description 4
- 235000010469 Glycine max Nutrition 0.000 description 4
- 244000068988 Glycine max Species 0.000 description 4
- 244000020551 Helianthus annuus Species 0.000 description 4
- 235000003222 Helianthus annuus Nutrition 0.000 description 4
- 206010021929 Infertility male Diseases 0.000 description 4
- 241000209082 Lolium Species 0.000 description 4
- 208000007466 Male Infertility Diseases 0.000 description 4
- 240000004658 Medicago sativa Species 0.000 description 4
- 240000003433 Miscanthus floridulus Species 0.000 description 4
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 4
- 244000061176 Nicotiana tabacum Species 0.000 description 4
- 241000219000 Populus Species 0.000 description 4
- 235000007238 Secale cereale Nutrition 0.000 description 4
- 244000082988 Secale cereale Species 0.000 description 4
- 244000138286 Sorghum saccharatum Species 0.000 description 4
- 235000009337 Spinacia oleracea Nutrition 0.000 description 4
- 244000300264 Spinacia oleracea Species 0.000 description 4
- 108091036066 Three prime untranslated region Proteins 0.000 description 4
- 108700019146 Transgenes Proteins 0.000 description 4
- 235000021307 Triticum Nutrition 0.000 description 4
- HCHKCACWOHOZIP-UHFFFAOYSA-N Zinc Chemical compound [Zn] HCHKCACWOHOZIP-UHFFFAOYSA-N 0.000 description 4
- 239000002253 acid Substances 0.000 description 4
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 4
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 4
- 239000002551 biofuel Substances 0.000 description 4
- 239000003153 chemical reaction reagent Substances 0.000 description 4
- 230000010154 cross-pollination Effects 0.000 description 4
- 230000037430 deletion Effects 0.000 description 4
- 239000013604 expression vector Substances 0.000 description 4
- 238000002372 labelling Methods 0.000 description 4
- 239000005022 packaging material Substances 0.000 description 4
- 239000002987 primer (paints) Substances 0.000 description 4
- 230000004044 response Effects 0.000 description 4
- 238000007894 restriction fragment length polymorphism technique Methods 0.000 description 4
- 235000009566 rice Nutrition 0.000 description 4
- 238000012216 screening Methods 0.000 description 4
- 238000012163 sequencing technique Methods 0.000 description 4
- 150000008163 sugars Chemical class 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 230000014621 translational initiation Effects 0.000 description 4
- 239000011701 zinc Substances 0.000 description 4
- 229910052725 zinc Inorganic materials 0.000 description 4
- OBMBUODDCOAJQP-UHFFFAOYSA-N 2-chloro-4-phenylquinoline Chemical compound C=12C=CC=CC2=NC(Cl)=CC=1C1=CC=CC=C1 OBMBUODDCOAJQP-UHFFFAOYSA-N 0.000 description 3
- 241000208140 Acer Species 0.000 description 3
- 241000743339 Agrostis Species 0.000 description 3
- 244000099147 Ananas comosus Species 0.000 description 3
- 241000219194 Arabidopsis Species 0.000 description 3
- 235000009854 Cucurbita moschata Nutrition 0.000 description 3
- 240000001980 Cucurbita pepo Species 0.000 description 3
- 235000009852 Cucurbita pepo Nutrition 0.000 description 3
- 238000002965 ELISA Methods 0.000 description 3
- 240000002395 Euphorbia pulcherrima Species 0.000 description 3
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 3
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 3
- 240000005979 Hordeum vulgare Species 0.000 description 3
- 235000007340 Hordeum vulgare Nutrition 0.000 description 3
- 108091092195 Intron Proteins 0.000 description 3
- 241000221089 Jatropha Species 0.000 description 3
- 108020004485 Nonsense Codon Proteins 0.000 description 3
- 238000000636 Northern blotting Methods 0.000 description 3
- 238000012408 PCR amplification Methods 0.000 description 3
- 241000209117 Panicum Species 0.000 description 3
- 235000006443 Panicum miliaceum subsp. miliaceum Nutrition 0.000 description 3
- 235000009037 Panicum miliaceum subsp. ruderale Nutrition 0.000 description 3
- 241001520808 Panicum virgatum Species 0.000 description 3
- 240000007377 Petunia x hybrida Species 0.000 description 3
- 235000011613 Pinus brutia Nutrition 0.000 description 3
- 241000209051 Saccharum Species 0.000 description 3
- 241000124033 Salix Species 0.000 description 3
- 235000015503 Sorghum bicolor subsp. drummondii Nutrition 0.000 description 3
- 240000002439 Sorghum halepense Species 0.000 description 3
- 241000251131 Sphyrna Species 0.000 description 3
- 108020004566 Transfer RNA Proteins 0.000 description 3
- 244000098338 Triticum aestivum Species 0.000 description 3
- 108091023045 Untranslated Region Proteins 0.000 description 3
- 238000013459 approach Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000027455 binding Effects 0.000 description 3
- 230000032823 cell division Effects 0.000 description 3
- 238000012062 charged aerosol detection Methods 0.000 description 3
- 238000001360 collision-induced dissociation Methods 0.000 description 3
- 238000011960 computer-aided design Methods 0.000 description 3
- 239000000835 fiber Substances 0.000 description 3
- 239000008103 glucose Substances 0.000 description 3
- 239000005090 green fluorescent protein Substances 0.000 description 3
- 238000000338 in vitro Methods 0.000 description 3
- 230000006698 induction Effects 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- 238000013507 mapping Methods 0.000 description 3
- 238000002493 microarray Methods 0.000 description 3
- 230000000877 morphologic effect Effects 0.000 description 3
- 230000000243 photosynthetic effect Effects 0.000 description 3
- 239000013612 plasmid Substances 0.000 description 3
- 230000010152 pollination Effects 0.000 description 3
- 230000032361 posttranscriptional gene silencing Effects 0.000 description 3
- 230000005855 radiation Effects 0.000 description 3
- 230000010076 replication Effects 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 238000010187 selection method Methods 0.000 description 3
- 238000002864 sequence alignment Methods 0.000 description 3
- 230000005026 transcription initiation Effects 0.000 description 3
- 230000005030 transcription termination Effects 0.000 description 3
- 239000013603 viral vector Substances 0.000 description 3
- 238000001262 western blot Methods 0.000 description 3
- 241000228158 x Triticosecale Species 0.000 description 3
- LWTDZKXXJRRKDG-KXBFYZLASA-N (-)-phaseollin Chemical compound C1OC2=CC(O)=CC=C2[C@H]2[C@@H]1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-KXBFYZLASA-N 0.000 description 2
- IAKHMKGGTNLKSZ-INIZCTEOSA-N (S)-colchicine Chemical compound C1([C@@H](NC(C)=O)CC2)=CC(=O)C(OC)=CC=C1C1=C2C=C(OC)C(OC)=C1OC IAKHMKGGTNLKSZ-INIZCTEOSA-N 0.000 description 2
- 239000005631 2,4-Dichlorophenoxyacetic acid Substances 0.000 description 2
- 108020005345 3' Untranslated Regions Proteins 0.000 description 2
- 240000004507 Abelmoschus esculentus Species 0.000 description 2
- 241000218642 Abies Species 0.000 description 2
- 102000007469 Actins Human genes 0.000 description 2
- 108010085238 Actins Proteins 0.000 description 2
- 244000291564 Allium cepa Species 0.000 description 2
- 235000007119 Ananas comosus Nutrition 0.000 description 2
- 241001327399 Andropogon gerardii Species 0.000 description 2
- 241001494508 Arundo donax Species 0.000 description 2
- 235000021533 Beta vulgaris Nutrition 0.000 description 2
- 241000335053 Beta vulgaris Species 0.000 description 2
- 241000219310 Beta vulgaris subsp. vulgaris Species 0.000 description 2
- 235000011331 Brassica Nutrition 0.000 description 2
- 241000219198 Brassica Species 0.000 description 2
- 235000011299 Brassica oleracea var botrytis Nutrition 0.000 description 2
- 240000003259 Brassica oleracea var. botrytis Species 0.000 description 2
- 241000219193 Brassicaceae Species 0.000 description 2
- 101150019620 CAD gene Proteins 0.000 description 2
- 101100494448 Caenorhabditis elegans cab-1 gene Proteins 0.000 description 2
- 240000004160 Capsicum annuum Species 0.000 description 2
- WLYGSPLCNKYESI-RSUQVHIMSA-N Carthamin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@H]1[C@@]1(O)C(O)=C(C(=O)\C=C\C=2C=CC(O)=CC=2)C(=O)C(\C=C\2C([C@](O)([C@H]3[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O3)O)C(O)=C(C(=O)\C=C\C=3C=CC(O)=CC=3)C/2=O)=O)=C1O WLYGSPLCNKYESI-RSUQVHIMSA-N 0.000 description 2
- 241000208809 Carthamus Species 0.000 description 2
- 235000003255 Carthamus tinctorius Nutrition 0.000 description 2
- 244000020518 Carthamus tinctorius Species 0.000 description 2
- 244000241235 Citrullus lanatus Species 0.000 description 2
- 240000007154 Coffea arabica Species 0.000 description 2
- 241000701515 Commelina yellow mottle virus Species 0.000 description 2
- 241000219112 Cucumis Species 0.000 description 2
- 240000008067 Cucumis sativus Species 0.000 description 2
- 244000052363 Cynodon dactylon Species 0.000 description 2
- 235000009355 Dianthus caryophyllus Nutrition 0.000 description 2
- 240000006497 Dianthus caryophyllus Species 0.000 description 2
- LCGLNKUTAGEVQW-UHFFFAOYSA-N Dimethyl ether Chemical compound COC LCGLNKUTAGEVQW-UHFFFAOYSA-N 0.000 description 2
- 235000001942 Elaeis Nutrition 0.000 description 2
- 241000512897 Elaeis Species 0.000 description 2
- 244000166124 Eucalyptus globulus Species 0.000 description 2
- 241000234642 Festuca Species 0.000 description 2
- 241000234643 Festuca arundinacea Species 0.000 description 2
- 240000009088 Fragaria x ananassa Species 0.000 description 2
- 235000011363 Fragaria x ananassa Nutrition 0.000 description 2
- 229930091371 Fructose Natural products 0.000 description 2
- 239000005715 Fructose Substances 0.000 description 2
- RFSUNEUAIZKAJO-ARQDHWQXSA-N Fructose Chemical compound OC[C@H]1O[C@](O)(CO)[C@@H](O)[C@@H]1O RFSUNEUAIZKAJO-ARQDHWQXSA-N 0.000 description 2
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 2
- 102000053187 Glucuronidase Human genes 0.000 description 2
- 108010060309 Glucuronidase Proteins 0.000 description 2
- 102000005720 Glutathione transferase Human genes 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- 241000219146 Gossypium Species 0.000 description 2
- 235000009438 Gossypium Nutrition 0.000 description 2
- 244000299507 Gossypium hirsutum Species 0.000 description 2
- 241000208818 Helianthus Species 0.000 description 2
- 241000209219 Hordeum Species 0.000 description 2
- 206010020649 Hyperkeratosis Diseases 0.000 description 2
- 235000003228 Lactuca sativa Nutrition 0.000 description 2
- 240000008415 Lactuca sativa Species 0.000 description 2
- 235000008119 Larix laricina Nutrition 0.000 description 2
- 241000218653 Larix laricina Species 0.000 description 2
- 235000004431 Linum usitatissimum Nutrition 0.000 description 2
- 240000006240 Linum usitatissimum Species 0.000 description 2
- 241000219745 Lupinus Species 0.000 description 2
- 235000007688 Lycopersicon esculentum Nutrition 0.000 description 2
- 240000003183 Manihot esculenta Species 0.000 description 2
- 235000010624 Medicago sativa Nutrition 0.000 description 2
- 235000017587 Medicago sativa ssp. sativa Nutrition 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- 240000008790 Musa x paradisiaca Species 0.000 description 2
- 235000018290 Musa x paradisiaca Nutrition 0.000 description 2
- LRHPLDYGYMQRHN-UHFFFAOYSA-N N-Butanol Chemical compound CCCCO LRHPLDYGYMQRHN-UHFFFAOYSA-N 0.000 description 2
- 241001230286 Narenga Species 0.000 description 2
- 241000209094 Oryza Species 0.000 description 2
- 241001495454 Parthenium Species 0.000 description 2
- AVFIYMSJDDGDBQ-UHFFFAOYSA-N Parthenium Chemical compound C1C=C(CCC(C)=O)C(C)CC2OC(=O)C(=C)C21 AVFIYMSJDDGDBQ-UHFFFAOYSA-N 0.000 description 2
- 241000209046 Pennisetum Species 0.000 description 2
- 235000007195 Pennisetum typhoides Nutrition 0.000 description 2
- 244000081757 Phalaris arundinacea Species 0.000 description 2
- 241000746981 Phleum Species 0.000 description 2
- 235000008331 Pinus X rigitaeda Nutrition 0.000 description 2
- 241000018646 Pinus brutia Species 0.000 description 2
- 241000209048 Poa Species 0.000 description 2
- 102000006382 Ribonucleases Human genes 0.000 description 2
- 108010083644 Ribonucleases Proteins 0.000 description 2
- 241000701507 Rice tungro bacilliform virus Species 0.000 description 2
- 235000003846 Ricinus Nutrition 0.000 description 2
- 241000322381 Ricinus <louse> Species 0.000 description 2
- 235000004443 Ricinus communis Nutrition 0.000 description 2
- 235000011449 Rosa Nutrition 0.000 description 2
- 241000746444 Saccharum sp. Species 0.000 description 2
- 241000209056 Secale Species 0.000 description 2
- 108091081021 Sense strand Proteins 0.000 description 2
- 240000003768 Solanum lycopersicum Species 0.000 description 2
- 235000002597 Solanum melongena Nutrition 0.000 description 2
- 244000061458 Solanum melongena Species 0.000 description 2
- 235000002595 Solanum tuberosum Nutrition 0.000 description 2
- 244000061456 Solanum tuberosum Species 0.000 description 2
- 235000007230 Sorghum bicolor Nutrition 0.000 description 2
- 235000013457 Sorghum bicolor subsp verticilliflorum Nutrition 0.000 description 2
- 241000923571 Sporobolus michauxianus Species 0.000 description 2
- 244000099500 Sudangras Species 0.000 description 2
- 235000021536 Sugar beet Nutrition 0.000 description 2
- 244000269722 Thea sinensis Species 0.000 description 2
- 244000299461 Theobroma cacao Species 0.000 description 2
- 235000009470 Theobroma cacao Nutrition 0.000 description 2
- 108700009124 Transcription Initiation Site Proteins 0.000 description 2
- 241000209140 Triticum Species 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 240000006365 Vitis vinifera Species 0.000 description 2
- 235000014787 Vitis vinifera Nutrition 0.000 description 2
- 241000209149 Zea Species 0.000 description 2
- 235000005824 Zea mays ssp. parviglumis Nutrition 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 230000004913 activation Effects 0.000 description 2
- WQZGKKKJIJFFOK-PHYPRBDBSA-N alpha-D-galactose Chemical compound OC[C@H]1O[C@H](O)[C@H](O)[C@@H](O)[C@H]1O WQZGKKKJIJFFOK-PHYPRBDBSA-N 0.000 description 2
- 125000000539 amino acid group Chemical group 0.000 description 2
- 239000003242 anti bacterial agent Substances 0.000 description 2
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 2
- 101150099875 atpE gene Proteins 0.000 description 2
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 230000019113 chromatin silencing Effects 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- JMFRWRFFLBVWSI-NSCUHMNNSA-N coniferol Chemical compound COC1=CC(\C=C\CO)=CC=C1O JMFRWRFFLBVWSI-NSCUHMNNSA-N 0.000 description 2
- 239000000470 constituent Substances 0.000 description 2
- 235000005822 corn Nutrition 0.000 description 2
- 235000019621 digestibility Nutrition 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 239000003623 enhancer Substances 0.000 description 2
- 210000002615 epidermis Anatomy 0.000 description 2
- 239000004459 forage Substances 0.000 description 2
- 230000037433 frameshift Effects 0.000 description 2
- ZZUFCTLCJUWOSV-UHFFFAOYSA-N furosemide Chemical compound C1=C(Cl)C(S(=O)(=O)N)=CC(C(O)=O)=C1NCC1=CC=CO1 ZZUFCTLCJUWOSV-UHFFFAOYSA-N 0.000 description 2
- 229930182830 galactose Natural products 0.000 description 2
- 238000001502 gel electrophoresis Methods 0.000 description 2
- 230000002363 herbicidal effect Effects 0.000 description 2
- 239000004009 herbicide Substances 0.000 description 2
- 238000004128 high performance liquid chromatography Methods 0.000 description 2
- 238000001114 immunoprecipitation Methods 0.000 description 2
- 238000007901 in situ hybridization Methods 0.000 description 2
- 238000001727 in vivo Methods 0.000 description 2
- 238000011065 in-situ storage Methods 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 239000000411 inducer Substances 0.000 description 2
- 238000007689 inspection Methods 0.000 description 2
- 108010083942 mannopine synthase Proteins 0.000 description 2
- 238000004949 mass spectrometry Methods 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 239000000178 monomer Substances 0.000 description 2
- 229910052757 nitrogen Inorganic materials 0.000 description 2
- 108010058731 nopaline synthase Proteins 0.000 description 2
- 230000008488 polyadenylation Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 238000012175 pyrosequencing Methods 0.000 description 2
- 230000006798 recombination Effects 0.000 description 2
- 238000005215 recombination Methods 0.000 description 2
- 238000006722 reduction reaction Methods 0.000 description 2
- 230000001850 reproductive effect Effects 0.000 description 2
- 238000003757 reverse transcription PCR Methods 0.000 description 2
- 238000013077 scoring method Methods 0.000 description 2
- 238000002741 site-directed mutagenesis Methods 0.000 description 2
- 235000020354 squash Nutrition 0.000 description 2
- PVYJZLYGTZKPJE-UHFFFAOYSA-N streptonigrin Chemical compound C=1C=C2C(=O)C(OC)=C(N)C(=O)C2=NC=1C(C=1N)=NC(C(O)=O)=C(C)C=1C1=CC=C(OC)C(OC)=C1O PVYJZLYGTZKPJE-UHFFFAOYSA-N 0.000 description 2
- 239000000758 substrate Substances 0.000 description 2
- LZFOPEXOUVTGJS-ONEGZZNKSA-N trans-sinapyl alcohol Chemical compound COC1=CC(\C=C\CO)=CC(OC)=C1O LZFOPEXOUVTGJS-ONEGZZNKSA-N 0.000 description 2
- 238000012033 transcriptional gene silencing Methods 0.000 description 2
- 238000011282 treatment Methods 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- ZCKDCRKBURQZPT-OWOJBTEDSA-N (E)-caffeyl alcohol Chemical compound OC\C=C\C1=CC=C(O)C(O)=C1 ZCKDCRKBURQZPT-OWOJBTEDSA-N 0.000 description 1
- CDICDSOGTRCHMG-ONEGZZNKSA-N (E)-sinapaldehyde Chemical compound COC1=CC(\C=C\C=O)=CC(OC)=C1O CDICDSOGTRCHMG-ONEGZZNKSA-N 0.000 description 1
- HXKWSTRRCHTUEC-UHFFFAOYSA-N 2,4-Dichlorophenoxyaceticacid Chemical compound OC(=O)C(Cl)OC1=CC=C(Cl)C=C1 HXKWSTRRCHTUEC-UHFFFAOYSA-N 0.000 description 1
- MWMOPIVLTLEUJO-UHFFFAOYSA-N 2-oxopropanoic acid;phosphoric acid Chemical compound OP(O)(O)=O.CC(=O)C(O)=O MWMOPIVLTLEUJO-UHFFFAOYSA-N 0.000 description 1
- AXMVYSVVTMKQSL-OWOJBTEDSA-N 3,4-dihydroxycinnamaldehyde Chemical compound OC1=CC=C(\C=C\C=O)C=C1O AXMVYSVVTMKQSL-OWOJBTEDSA-N 0.000 description 1
- AJBZENLMTKDAEK-UHFFFAOYSA-N 3a,5a,5b,8,8,11a-hexamethyl-1-prop-1-en-2-yl-1,2,3,4,5,6,7,7a,9,10,11,11b,12,13,13a,13b-hexadecahydrocyclopenta[a]chrysene-4,9-diol Chemical compound CC12CCC(O)C(C)(C)C1CCC(C1(C)CC3O)(C)C2CCC1C1C3(C)CCC1C(=C)C AJBZENLMTKDAEK-UHFFFAOYSA-N 0.000 description 1
- CJXMVKYNVIGQBS-OWOJBTEDSA-N 4-hydroxycinnamaldehyde Chemical compound OC1=CC=C(\C=C\C=O)C=C1 CJXMVKYNVIGQBS-OWOJBTEDSA-N 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- LQLQRFGHAALLLE-UHFFFAOYSA-N 5-bromouracil Chemical compound BrC1=CNC(=O)NC1=O LQLQRFGHAALLLE-UHFFFAOYSA-N 0.000 description 1
- LCYXNYNRVOBSHK-UHFFFAOYSA-N 8-ethoxy-1,3,7-trimethylpurine-2,6-dione Chemical compound CN1C(=O)N(C)C(=O)C2=C1N=C(OCC)N2C LCYXNYNRVOBSHK-UHFFFAOYSA-N 0.000 description 1
- 241001075517 Abelmoschus Species 0.000 description 1
- 235000003934 Abelmoschus esculentus Nutrition 0.000 description 1
- 241000207965 Acanthaceae Species 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- 241000589155 Agrobacterium tumefaciens Species 0.000 description 1
- 102000007698 Alcohol dehydrogenase Human genes 0.000 description 1
- 241000123646 Allioideae Species 0.000 description 1
- 241000234282 Allium Species 0.000 description 1
- 235000005255 Allium cepa Nutrition 0.000 description 1
- 235000002732 Allium cepa var. cepa Nutrition 0.000 description 1
- 241000556588 Alstroemeria Species 0.000 description 1
- 241000556591 Alstroemeriaceae Species 0.000 description 1
- 240000008025 Alternanthera ficoidea Species 0.000 description 1
- 241000234270 Amaryllidaceae Species 0.000 description 1
- 241000746375 Andrographis Species 0.000 description 1
- 241000744007 Andropogon Species 0.000 description 1
- 108020005544 Antisense RNA Proteins 0.000 description 1
- 241000208327 Apocynaceae Species 0.000 description 1
- 241000219195 Arabidopsis thaliana Species 0.000 description 1
- 101100204308 Arabidopsis thaliana SUC2 gene Proteins 0.000 description 1
- 241000233788 Arecaceae Species 0.000 description 1
- 235000003826 Artemisia Nutrition 0.000 description 1
- 235000003261 Artemisia vulgaris Nutrition 0.000 description 1
- 241001494510 Arundo Species 0.000 description 1
- 241000208838 Asteraceae Species 0.000 description 1
- 241001106067 Atropa Species 0.000 description 1
- 229930192334 Auxin Natural products 0.000 description 1
- 244000075850 Avena orientalis Species 0.000 description 1
- 235000007319 Avena orientalis Nutrition 0.000 description 1
- 235000017166 Bambusa arundinacea Nutrition 0.000 description 1
- 235000017491 Bambusa tulda Nutrition 0.000 description 1
- 241000133570 Berberidaceae Species 0.000 description 1
- 240000000724 Berberis vulgaris Species 0.000 description 1
- 235000006011 Bixa Nutrition 0.000 description 1
- 241000934840 Bixa Species 0.000 description 1
- 241000934828 Bixaceae Species 0.000 description 1
- 108010006654 Bleomycin Proteins 0.000 description 1
- 241000339490 Brachyachne Species 0.000 description 1
- 235000011303 Brassica alboglabra Nutrition 0.000 description 1
- 235000003351 Brassica cretica Nutrition 0.000 description 1
- 244000178993 Brassica juncea Species 0.000 description 1
- 235000011332 Brassica juncea Nutrition 0.000 description 1
- 235000014698 Brassica juncea var multisecta Nutrition 0.000 description 1
- 235000014700 Brassica juncea var napiformis Nutrition 0.000 description 1
- 240000002791 Brassica napus Species 0.000 description 1
- 235000011293 Brassica napus Nutrition 0.000 description 1
- 235000006008 Brassica napus var napus Nutrition 0.000 description 1
- 240000000385 Brassica napus var. napus Species 0.000 description 1
- 240000007124 Brassica oleracea Species 0.000 description 1
- 235000011302 Brassica oleracea Nutrition 0.000 description 1
- 235000004221 Brassica oleracea var gemmifera Nutrition 0.000 description 1
- 235000017647 Brassica oleracea var italica Nutrition 0.000 description 1
- 244000308368 Brassica oleracea var. gemmifera Species 0.000 description 1
- 235000006618 Brassica rapa subsp oleifera Nutrition 0.000 description 1
- 235000003343 Brassica rupestris Nutrition 0.000 description 1
- 235000004977 Brassica sinapistrum Nutrition 0.000 description 1
- 241000234670 Bromeliaceae Species 0.000 description 1
- 235000003880 Calendula Nutrition 0.000 description 1
- 240000001432 Calendula officinalis Species 0.000 description 1
- 241000209507 Camellia Species 0.000 description 1
- 241000759909 Camptotheca Species 0.000 description 1
- 101100291915 Candida albicans (strain SC5314 / ATCC MYA-2876) MP65 gene Proteins 0.000 description 1
- 241000218235 Cannabaceae Species 0.000 description 1
- 241000218236 Cannabis Species 0.000 description 1
- 235000002566 Capsicum Nutrition 0.000 description 1
- 235000008534 Capsicum annuum var annuum Nutrition 0.000 description 1
- 240000008574 Capsicum frutescens Species 0.000 description 1
- OKTJSMMVPCPJKN-UHFFFAOYSA-N Carbon Chemical compound [C] OKTJSMMVPCPJKN-UHFFFAOYSA-N 0.000 description 1
- OKTJSMMVPCPJKN-NJFSPNSNSA-N Carbon-14 Chemical compound [14C] OKTJSMMVPCPJKN-NJFSPNSNSA-N 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 241000219321 Caryophyllaceae Species 0.000 description 1
- 241000208328 Catharanthus Species 0.000 description 1
- 241000488900 Cephalotaxaceae Species 0.000 description 1
- 241000488899 Cephalotaxus Species 0.000 description 1
- 241000871189 Chenopodiaceae Species 0.000 description 1
- 239000005496 Chlorsulfuron Substances 0.000 description 1
- 235000007516 Chrysanthemum Nutrition 0.000 description 1
- 240000005250 Chrysanthemum indicum Species 0.000 description 1
- 235000021513 Cinchona Nutrition 0.000 description 1
- 241000157855 Cinchona Species 0.000 description 1
- 108091062157 Cis-regulatory element Proteins 0.000 description 1
- 241000219109 Citrullus Species 0.000 description 1
- 235000009831 Citrullus lanatus Nutrition 0.000 description 1
- 235000012828 Citrullus lanatus var citroides Nutrition 0.000 description 1
- GUTLYIVDDKVIGB-OUBTZVSYSA-N Cobalt-60 Chemical compound [60Co] GUTLYIVDDKVIGB-OUBTZVSYSA-N 0.000 description 1
- 108700010070 Codon Usage Proteins 0.000 description 1
- 241000723377 Coffea Species 0.000 description 1
- 235000007460 Coffea arabica Nutrition 0.000 description 1
- 241000131506 Colchicaceae Species 0.000 description 1
- 241000723375 Colchicum Species 0.000 description 1
- 235000021508 Coleus Nutrition 0.000 description 1
- 244000061182 Coleus blumei Species 0.000 description 1
- 108091035707 Consensus sequence Proteins 0.000 description 1
- 229920000742 Cotton Polymers 0.000 description 1
- 244000241257 Cucumis melo Species 0.000 description 1
- 235000009842 Cucumis melo Nutrition 0.000 description 1
- 235000015510 Cucumis melo subsp melo Nutrition 0.000 description 1
- 235000010071 Cucumis prophetarum Nutrition 0.000 description 1
- 235000009849 Cucumis sativus Nutrition 0.000 description 1
- 235000010799 Cucumis sativus var sativus Nutrition 0.000 description 1
- 241000219122 Cucurbita Species 0.000 description 1
- 240000004244 Cucurbita moschata Species 0.000 description 1
- 241000219104 Cucurbitaceae Species 0.000 description 1
- 102100028717 Cytosolic 5'-nucleotidase 3A Human genes 0.000 description 1
- 108010066133 D-octopine dehydrogenase Proteins 0.000 description 1
- YAHZABJORDUQGO-NQXXGFSBSA-N D-ribulose 1,5-bisphosphate Chemical compound OP(=O)(O)OC[C@@H](O)[C@@H](O)C(=O)COP(O)(O)=O YAHZABJORDUQGO-NQXXGFSBSA-N 0.000 description 1
- 230000004544 DNA amplification Effects 0.000 description 1
- 238000000018 DNA microarray Methods 0.000 description 1
- 108010008286 DNA nucleotidylexotransferase Proteins 0.000 description 1
- 239000003155 DNA primer Substances 0.000 description 1
- 230000004543 DNA replication Effects 0.000 description 1
- 230000007023 DNA restriction-modification system Effects 0.000 description 1
- 230000006820 DNA synthesis Effects 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 102100029764 DNA-directed DNA/RNA polymerase mu Human genes 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 1
- 241000208296 Datura Species 0.000 description 1
- 101710088194 Dehydrogenase Proteins 0.000 description 1
- 240000003421 Dianthus chinensis Species 0.000 description 1
- 240000001879 Digitalis lutea Species 0.000 description 1
- 235000005903 Dioscorea Nutrition 0.000 description 1
- 244000281702 Dioscorea villosa Species 0.000 description 1
- 235000000504 Dioscorea villosa Nutrition 0.000 description 1
- 241000234272 Dioscoreaceae Species 0.000 description 1
- 240000003133 Elaeis guineensis Species 0.000 description 1
- 235000001950 Elaeis guineensis Nutrition 0.000 description 1
- 108010093099 Endoribonucleases Proteins 0.000 description 1
- 102000002494 Endoribonucleases Human genes 0.000 description 1
- 241000218671 Ephedra Species 0.000 description 1
- 241000218670 Ephedraceae Species 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 241001081474 Erythroxylaceae Species 0.000 description 1
- 241000735552 Erythroxylum Species 0.000 description 1
- VGGSQFUCUMXWEO-UHFFFAOYSA-N Ethene Chemical compound C=C VGGSQFUCUMXWEO-UHFFFAOYSA-N 0.000 description 1
- 239000005977 Ethylene Substances 0.000 description 1
- 244000004281 Eucalyptus maculata Species 0.000 description 1
- 241000552068 Eucarpia Species 0.000 description 1
- 241000221017 Euphorbiaceae Species 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 238000001134 F-test Methods 0.000 description 1
- 241000220485 Fabaceae Species 0.000 description 1
- 108010046335 Ferredoxin-NADP Reductase Proteins 0.000 description 1
- 241000701484 Figwort mosaic virus Species 0.000 description 1
- 241000220223 Fragaria Species 0.000 description 1
- 235000016623 Fragaria vesca Nutrition 0.000 description 1
- 108091092584 GDNA Proteins 0.000 description 1
- 241000234271 Galanthus Species 0.000 description 1
- 108700007698 Genetic Terminator Regions Proteins 0.000 description 1
- 108010068370 Glutens Proteins 0.000 description 1
- 108700037728 Glycine max beta-conglycinin Proteins 0.000 description 1
- 239000005562 Glyphosate Substances 0.000 description 1
- 235000009432 Gossypium hirsutum Nutrition 0.000 description 1
- 108090001102 Hammerhead ribozyme Proteins 0.000 description 1
- 101710154606 Hemagglutinin Proteins 0.000 description 1
- 244000043261 Hevea brasiliensis Species 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- AVXURJPOCDRRFD-UHFFFAOYSA-N Hydroxylamine Chemical compound ON AVXURJPOCDRRFD-UHFFFAOYSA-N 0.000 description 1
- 241000208278 Hyoscyamus Species 0.000 description 1
- 108010044467 Isoenzymes Proteins 0.000 description 1
- 241001048891 Jatropha curcas Species 0.000 description 1
- FAIXYKHYOGVFKA-UHFFFAOYSA-N Kinetin Natural products N=1C=NC=2N=CNC=2C=1N(C)C1=CC=CO1 FAIXYKHYOGVFKA-UHFFFAOYSA-N 0.000 description 1
- 241000208822 Lactuca Species 0.000 description 1
- 241000207923 Lamiaceae Species 0.000 description 1
- 102100023487 Lens fiber major intrinsic protein Human genes 0.000 description 1
- 101710087757 Lens fiber major intrinsic protein Proteins 0.000 description 1
- 241000209510 Liliopsida Species 0.000 description 1
- 241000208202 Linaceae Species 0.000 description 1
- 241000208204 Linum Species 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 235000010649 Lupinus albus Nutrition 0.000 description 1
- 240000000894 Lupinus albus Species 0.000 description 1
- 241000227653 Lycopersicon Species 0.000 description 1
- 235000002262 Lycopersicon Nutrition 0.000 description 1
- 241000195948 Lycopodiaceae Species 0.000 description 1
- 241000195947 Lycopodium Species 0.000 description 1
- 241000219071 Malvaceae Species 0.000 description 1
- 235000004456 Manihot esculenta Nutrition 0.000 description 1
- 241000219823 Medicago Species 0.000 description 1
- 241000489991 Melanthiaceae Species 0.000 description 1
- 235000014435 Mentha Nutrition 0.000 description 1
- 241001072983 Mentha Species 0.000 description 1
- 108060004795 Methyltransferase Proteins 0.000 description 1
- 102000016397 Methyltransferase Human genes 0.000 description 1
- 241001074116 Miscanthus x giganteus Species 0.000 description 1
- 241000234295 Musa Species 0.000 description 1
- 241000234615 Musaceae Species 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- 241000219926 Myrtaceae Species 0.000 description 1
- 101710202365 Napin Proteins 0.000 description 1
- 241000208125 Nicotiana Species 0.000 description 1
- IOVCWXUNBOPUCH-UHFFFAOYSA-N Nitrous acid Chemical compound ON=O IOVCWXUNBOPUCH-UHFFFAOYSA-N 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 241000209018 Nyssaceae Species 0.000 description 1
- 101710089395 Oleosin Proteins 0.000 description 1
- 101710093908 Outer capsid protein VP4 Proteins 0.000 description 1
- 101710135467 Outer capsid protein sigma-1 Proteins 0.000 description 1
- 235000011096 Papaver Nutrition 0.000 description 1
- 240000001090 Papaver somniferum Species 0.000 description 1
- 241000218180 Papaveraceae Species 0.000 description 1
- 244000130556 Pennisetum purpureum Species 0.000 description 1
- 244000038248 Pennisetum spicatum Species 0.000 description 1
- 244000115721 Pennisetum typhoides Species 0.000 description 1
- 241000745991 Phalaris Species 0.000 description 1
- 101710163504 Phaseolin Proteins 0.000 description 1
- 101000870887 Phaseolus vulgaris Glycine-rich cell wall structural protein 1.8 Proteins 0.000 description 1
- 241000746983 Phleum pratense Species 0.000 description 1
- IAJOBQBIJHVGMQ-UHFFFAOYSA-N Phosphinothricin Natural products CP(O)(=O)CCC(N)C(O)=O IAJOBQBIJHVGMQ-UHFFFAOYSA-N 0.000 description 1
- OAICVXFJPJFONN-OUBTZVSYSA-N Phosphorus-32 Chemical compound [32P] OAICVXFJPJFONN-OUBTZVSYSA-N 0.000 description 1
- 235000014676 Phragmites communis Nutrition 0.000 description 1
- 244000082204 Phyllostachys viridis Species 0.000 description 1
- 235000015334 Phyllostachys viridis Nutrition 0.000 description 1
- 241000218641 Pinaceae Species 0.000 description 1
- 235000005205 Pinus Nutrition 0.000 description 1
- 241000218602 Pinus <genus> Species 0.000 description 1
- 241000013557 Plantaginaceae Species 0.000 description 1
- 241000209049 Poa pratensis Species 0.000 description 1
- 241000209504 Poaceae Species 0.000 description 1
- 239000004952 Polyamide Substances 0.000 description 1
- 241000161288 Populus candicans Species 0.000 description 1
- 241000183024 Populus tremula Species 0.000 description 1
- 235000011263 Populus tremuloides Nutrition 0.000 description 1
- 240000004923 Populus tremuloides Species 0.000 description 1
- 235000015696 Portulacaria afra Nutrition 0.000 description 1
- 101710176177 Protein A56 Proteins 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 230000007022 RNA scission Effects 0.000 description 1
- 244000061121 Rauvolfia serpentina Species 0.000 description 1
- 108091081062 Repeated sequence (DNA) Proteins 0.000 description 1
- 108091027981 Response element Proteins 0.000 description 1
- 240000000528 Ricinus communis Species 0.000 description 1
- 241000220317 Rosa Species 0.000 description 1
- 235000004789 Rosa xanthina Nutrition 0.000 description 1
- 241000220222 Rosaceae Species 0.000 description 1
- 241001107098 Rubiaceae Species 0.000 description 1
- 241000282849 Ruminantia Species 0.000 description 1
- 241000218998 Salicaceae Species 0.000 description 1
- 241001093760 Sapindaceae Species 0.000 description 1
- 241000242873 Scopolia Species 0.000 description 1
- 108010016634 Seed Storage Proteins Proteins 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 235000008515 Setaria glauca Nutrition 0.000 description 1
- OOFWCWCUKUVTKD-UHFFFAOYSA-N Sinapaldehyde Natural products COC1=CC(C=CC(C)=O)=CC(OC)=C1O OOFWCWCUKUVTKD-UHFFFAOYSA-N 0.000 description 1
- 108020004459 Small interfering RNA Proteins 0.000 description 1
- 241000208292 Solanaceae Species 0.000 description 1
- 235000002634 Solanum Nutrition 0.000 description 1
- 241000207763 Solanum Species 0.000 description 1
- 241001271945 Sorghum amplum Species 0.000 description 1
- 241000305918 Sorghum angustum Species 0.000 description 1
- 241000305926 Sorghum arundinaceum Species 0.000 description 1
- 241001271944 Sorghum brachypodum Species 0.000 description 1
- 241000305925 Sorghum bulbosum Species 0.000 description 1
- 241001271947 Sorghum ecarinatum Species 0.000 description 1
- 241001271946 Sorghum exstans Species 0.000 description 1
- 241001271949 Sorghum grande Species 0.000 description 1
- 244000064817 Sorghum halepense var. sudanense Species 0.000 description 1
- 241001271948 Sorghum interjectum Species 0.000 description 1
- 241001271920 Sorghum intrans Species 0.000 description 1
- 241001148653 Sorghum laxiflorum Species 0.000 description 1
- 241000305924 Sorghum leiocladum Species 0.000 description 1
- 241001148654 Sorghum macrospermum Species 0.000 description 1
- 241001149257 Sorghum matarankense Species 0.000 description 1
- 244000273260 Sorghum nitidum Species 0.000 description 1
- 241001271919 Sorghum plumosum Species 0.000 description 1
- 240000003829 Sorghum propinquum Species 0.000 description 1
- 241001149264 Sorghum purpureosericeum Species 0.000 description 1
- 241001149266 Sorghum stipoideum Species 0.000 description 1
- 241000305923 Sorghum timorense Species 0.000 description 1
- 241001149255 Sorghum versicolor Species 0.000 description 1
- 241001157422 Sorghum virgatum Species 0.000 description 1
- 241001271940 Sorghum x almum Species 0.000 description 1
- 241000694025 Sorghum x drummondii Species 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 241000746413 Spartina Species 0.000 description 1
- 241000219315 Spinacia Species 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 101710154134 Stearoyl-[acyl-carrier-protein] 9-desaturase, chloroplastic Proteins 0.000 description 1
- 238000000692 Student's t-test Methods 0.000 description 1
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 1
- 206010042602 Supraventricular extrasystoles Diseases 0.000 description 1
- 102000003673 Symporters Human genes 0.000 description 1
- 108090000088 Symporters Proteins 0.000 description 1
- 108700026226 TATA Box Proteins 0.000 description 1
- 241000404542 Tanacetum Species 0.000 description 1
- 241001116495 Taxaceae Species 0.000 description 1
- 241001116500 Taxus Species 0.000 description 1
- 108020005038 Terminator Codon Proteins 0.000 description 1
- 241000248384 Tetrahymena thermophila Species 0.000 description 1
- 235000006468 Thea sinensis Nutrition 0.000 description 1
- 241001122767 Theaceae Species 0.000 description 1
- 244000152045 Themeda triandra Species 0.000 description 1
- 241000219161 Theobroma Species 0.000 description 1
- 108010089860 Thylakoid Membrane Proteins Proteins 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- 101710162629 Trypsin inhibitor Proteins 0.000 description 1
- 229940122618 Trypsin inhibitor Drugs 0.000 description 1
- 235000018747 Typha elephantina Nutrition 0.000 description 1
- 244000177175 Typha elephantina Species 0.000 description 1
- AXMVYSVVTMKQSL-UHFFFAOYSA-N UNPD142122 Natural products OC1=CC=C(C=CC=O)C=C1O AXMVYSVVTMKQSL-UHFFFAOYSA-N 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 241000145124 Uniola Species 0.000 description 1
- 235000013419 Uniola paniculata Nutrition 0.000 description 1
- 240000007492 Uniola paniculata Species 0.000 description 1
- 241000489523 Veratrum Species 0.000 description 1
- 241000863480 Vinca Species 0.000 description 1
- 241000219094 Vitaceae Species 0.000 description 1
- 241000219095 Vitis Species 0.000 description 1
- 235000009392 Vitis Nutrition 0.000 description 1
- 235000009754 Vitis X bourquina Nutrition 0.000 description 1
- 235000012333 Vitis X labruscana Nutrition 0.000 description 1
- 210000002593 Y chromosome Anatomy 0.000 description 1
- 235000007244 Zea mays Nutrition 0.000 description 1
- FJJCIZWZNKZHII-UHFFFAOYSA-N [4,6-bis(cyanoamino)-1,3,5-triazin-2-yl]cyanamide Chemical compound N#CNC1=NC(NC#N)=NC(NC#N)=N1 FJJCIZWZNKZHII-UHFFFAOYSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- JUGOREOARAHOCO-UHFFFAOYSA-M acetylcholine chloride Chemical compound [Cl-].CC(=O)OCC[N+](C)(C)C JUGOREOARAHOCO-UHFFFAOYSA-M 0.000 description 1
- 150000001251 acridines Chemical class 0.000 description 1
- 208000005652 acute fatty liver of pregnancy Diseases 0.000 description 1
- 238000007792 addition Methods 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 230000009418 agronomic effect Effects 0.000 description 1
- 229940100198 alkylating agent Drugs 0.000 description 1
- 239000002168 alkylating agent Substances 0.000 description 1
- 238000007844 allele-specific PCR Methods 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 244000030166 artemisia Species 0.000 description 1
- 235000009052 artemisia Nutrition 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 101150090348 atpC gene Proteins 0.000 description 1
- 101150035600 atpD gene Proteins 0.000 description 1
- 101150103189 atpG gene Proteins 0.000 description 1
- 101150048329 atpH gene Proteins 0.000 description 1
- 239000002363 auxin Substances 0.000 description 1
- 150000001540 azides Chemical class 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- 239000011425 bamboo Substances 0.000 description 1
- 108010019077 beta-Amylase Proteins 0.000 description 1
- 238000002306 biochemical method Methods 0.000 description 1
- 239000003139 biocide Substances 0.000 description 1
- 238000005422 blasting Methods 0.000 description 1
- 229960001561 bleomycin Drugs 0.000 description 1
- OYVAGSVQBOHSSS-UAPAGMARSA-O bleomycin A2 Chemical compound N([C@H](C(=O)N[C@H](C)[C@@H](O)[C@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)NCCC=1SC=C(N=1)C=1SC=C(N=1)C(=O)NCCC[S+](C)C)[C@@H](O[C@H]1[C@H]([C@@H](O)[C@H](O)[C@H](CO)O1)O[C@@H]1[C@H]([C@@H](OC(N)=O)[C@H](O)[C@@H](CO)O1)O)C=1N=CNC=1)C(=O)C1=NC([C@H](CC(N)=O)NC[C@H](N)C(N)=O)=NC(N)=C1C OYVAGSVQBOHSSS-UAPAGMARSA-O 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000001390 capsicum minimum Substances 0.000 description 1
- 229910052799 carbon Inorganic materials 0.000 description 1
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 1
- 108010042238 caspase-activated deoxyribonuclease Proteins 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 108091092328 cellular RNA Proteins 0.000 description 1
- 235000013339 cereals Nutrition 0.000 description 1
- TVFDJXOCXUVLDH-RNFDNDRNSA-N cesium-137 Chemical compound [137Cs] TVFDJXOCXUVLDH-RNFDNDRNSA-N 0.000 description 1
- 239000013043 chemical agent Substances 0.000 description 1
- 238000009614 chemical analysis method Methods 0.000 description 1
- 238000001311 chemical methods and process Methods 0.000 description 1
- 239000002962 chemical mutagen Substances 0.000 description 1
- 239000003795 chemical substances by application Substances 0.000 description 1
- 239000005081 chemiluminescent agent Substances 0.000 description 1
- 238000000546 chi-square test Methods 0.000 description 1
- 108010031100 chloroplast transit peptides Proteins 0.000 description 1
- VJYIFXVZLXQVHO-UHFFFAOYSA-N chlorsulfuron Chemical compound COC1=NC(C)=NC(NC(=O)NS(=O)(=O)C=2C(=CC=CC=2)Cl)=N1 VJYIFXVZLXQVHO-UHFFFAOYSA-N 0.000 description 1
- 229920003211 cis-1,4-polyisoprene Polymers 0.000 description 1
- LZFOPEXOUVTGJS-UHFFFAOYSA-N cis-sinapyl alcohol Natural products COC1=CC(C=CCO)=CC(OC)=C1O LZFOPEXOUVTGJS-UHFFFAOYSA-N 0.000 description 1
- 238000003776 cleavage reaction Methods 0.000 description 1
- 238000010367 cloning Methods 0.000 description 1
- 229960001338 colchicine Drugs 0.000 description 1
- 235000018597 common camellia Nutrition 0.000 description 1
- 238000010835 comparative analysis Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 239000003184 complementary RNA Substances 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000001143 conditioned effect Effects 0.000 description 1
- 229940119526 coniferyl alcohol Drugs 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 230000001276 controlling effect Effects 0.000 description 1
- 230000002596 correlated effect Effects 0.000 description 1
- 239000004062 cytokinin Substances 0.000 description 1
- UQHKFADEQIVWID-UHFFFAOYSA-N cytokinin Natural products C1=NC=2C(NCC=C(CO)C)=NC=NC=2N1C1CC(O)C(CO)O1 UQHKFADEQIVWID-UHFFFAOYSA-N 0.000 description 1
- 210000000805 cytoplasm Anatomy 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 239000003599 detergent Substances 0.000 description 1
- 239000005546 dideoxynucleotide Substances 0.000 description 1
- 235000004879 dioscorea Nutrition 0.000 description 1
- 210000001840 diploid cell Anatomy 0.000 description 1
- 230000008034 disappearance Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 230000003828 downregulation Effects 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000007876 drug discovery Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000007824 enzymatic assay Methods 0.000 description 1
- 150000002118 epoxides Chemical class 0.000 description 1
- 150000002148 esters Chemical group 0.000 description 1
- RTZKZFJDLAIYFH-UHFFFAOYSA-N ether Chemical group CCOCC RTZKZFJDLAIYFH-UHFFFAOYSA-N 0.000 description 1
- 241001233957 eudicotyledons Species 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 239000004744 fabric Substances 0.000 description 1
- 230000004720 fertilization Effects 0.000 description 1
- 239000003337 fertilizer Substances 0.000 description 1
- 230000004992 fission Effects 0.000 description 1
- 235000004426 flaxseed Nutrition 0.000 description 1
- 238000005188 flotation Methods 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 230000002538 fungal effect Effects 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 108020001507 fusion proteins Proteins 0.000 description 1
- 102000037865 fusion proteins Human genes 0.000 description 1
- 239000000499 gel Substances 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 102000054766 genetic haplotypes Human genes 0.000 description 1
- 230000007614 genetic variation Effects 0.000 description 1
- JLJLRLWOEMWYQK-GDUNQVSHSA-N gibberellic acid Chemical compound C([C@@]1(O)C(=C)C[C@@]2(C1)C1C(O)=O)CC2[C@@]2(OC3=O)C1[C@]3(C)[C@@H](O)CC2 JLJLRLWOEMWYQK-GDUNQVSHSA-N 0.000 description 1
- 229930002203 gibberellic acid Natural products 0.000 description 1
- IAJOBQBIJHVGMQ-BYPYZUCNSA-N glufosinate-P Chemical compound CP(O)(=O)CC[C@H](N)C(O)=O IAJOBQBIJHVGMQ-BYPYZUCNSA-N 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- XDDAORKBJWWYJS-UHFFFAOYSA-N glyphosate Chemical compound OC(=O)CNCP(O)(O)=O XDDAORKBJWWYJS-UHFFFAOYSA-N 0.000 description 1
- 229940097068 glyphosate Drugs 0.000 description 1
- 235000002532 grape seed extract Nutrition 0.000 description 1
- 230000012010 growth Effects 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 239000000185 hemagglutinin Substances 0.000 description 1
- 239000005556 hormone Substances 0.000 description 1
- 229940088597 hormone Drugs 0.000 description 1
- 125000001165 hydrophobic group Chemical group 0.000 description 1
- 238000012744 immunostaining Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 238000009399 inbreeding Methods 0.000 description 1
- SEOVTRFCIGRIMH-UHFFFAOYSA-N indole-3-acetic acid Chemical compound C1=CC=C2C(CC(=O)O)=CNC2=C1 SEOVTRFCIGRIMH-UHFFFAOYSA-N 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- QANMHLXAZMSUEX-UHFFFAOYSA-N kinetin Chemical compound N=1C=NC=2N=CNC=2C=1NCC1=CC=CO1 QANMHLXAZMSUEX-UHFFFAOYSA-N 0.000 description 1
- 229960001669 kinetin Drugs 0.000 description 1
- 150000002596 lactones Chemical class 0.000 description 1
- 235000021374 legumes Nutrition 0.000 description 1
- 239000003446 ligand Substances 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 230000007774 long-term Effects 0.000 description 1
- 235000005739 manihot Nutrition 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 108091070501 miRNA Proteins 0.000 description 1
- 239000002679 microRNA Substances 0.000 description 1
- 239000004005 microsphere Substances 0.000 description 1
- 208000024191 minimally invasive lung adenocarcinoma Diseases 0.000 description 1
- 230000002438 mitochondrial effect Effects 0.000 description 1
- 239000003147 molecular marker Substances 0.000 description 1
- 235000010460 mustard Nutrition 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 238000002966 oligonucleotide array Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 229930015763 p-coumaryl alcohol Natural products 0.000 description 1
- CJXMVKYNVIGQBS-UHFFFAOYSA-N p-hydroxycinnamaldehyde Natural products OC1=CC=C(C=CC=O)C=C1 CJXMVKYNVIGQBS-UHFFFAOYSA-N 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 239000002245 particle Substances 0.000 description 1
- 239000000816 peptidomimetic Substances 0.000 description 1
- LWTDZKXXJRRKDG-UHFFFAOYSA-N phaseollin Natural products C1OC2=CC(O)=CC=C2C2C1C1=CC=C3OC(C)(C)C=CC3=C1O2 LWTDZKXXJRRKDG-UHFFFAOYSA-N 0.000 description 1
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N phenol group Chemical group C1(=CC=CC=C1)O ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 1
- 108010082527 phosphinothricin N-acetyltransferase Proteins 0.000 description 1
- 150000004713 phosphodiesters Chemical group 0.000 description 1
- 150000008300 phosphoramidites Chemical class 0.000 description 1
- 229940097886 phosphorus 32 Drugs 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 238000000053 physical method Methods 0.000 description 1
- 230000019612 pigmentation Effects 0.000 description 1
- 239000001739 pinus spp. Substances 0.000 description 1
- 239000000419 plant extract Substances 0.000 description 1
- 230000037039 plant physiology Effects 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 210000002706 plastid Anatomy 0.000 description 1
- 229920002647 polyamide Polymers 0.000 description 1
- 229920002704 polyhistidine Polymers 0.000 description 1
- 230000004481 post-translational protein modification Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000002203 pretreatment Methods 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 101150096384 psaD gene Proteins 0.000 description 1
- 101150032357 psaE gene Proteins 0.000 description 1
- 101150027686 psaF gene Proteins 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 108020004418 ribosomal RNA Proteins 0.000 description 1
- 210000003705 ribosome Anatomy 0.000 description 1
- 229920002477 rna polymer Polymers 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 235000012420 sanguinaria Nutrition 0.000 description 1
- 230000007017 scission Effects 0.000 description 1
- 230000008117 seed development Effects 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 239000000126 substance Substances 0.000 description 1
- 150000003871 sulfonates Chemical class 0.000 description 1
- 150000003457 sulfones Chemical class 0.000 description 1
- 229910052717 sulfur Inorganic materials 0.000 description 1
- 239000011593 sulfur Substances 0.000 description 1
- 150000003467 sulfuric acid derivatives Chemical class 0.000 description 1
- 238000000672 surface-enhanced laser desorption--ionisation Methods 0.000 description 1
- 238000004114 suspension culture Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000012090 tissue culture technique Methods 0.000 description 1
- 101150007587 tpx gene Proteins 0.000 description 1
- PTNLHDGQWUGONS-UHFFFAOYSA-N trans-p-coumaric alcohol Natural products OCC=CC1=CC=C(O)C=C1 PTNLHDGQWUGONS-UHFFFAOYSA-N 0.000 description 1
- PTNLHDGQWUGONS-OWOJBTEDSA-N trans-p-coumaryl alcohol Chemical compound OC\C=C\C1=CC=C(O)C=C1 PTNLHDGQWUGONS-OWOJBTEDSA-N 0.000 description 1
- 230000002103 transcriptional effect Effects 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 239000002753 trypsin inhibitor Substances 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 241001515965 unidentified phage Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 230000003827 upregulation Effects 0.000 description 1
- JFALSRSLKYAFGM-OIOBTWANSA-N uranium-235 Chemical compound [235U] JFALSRSLKYAFGM-OIOBTWANSA-N 0.000 description 1
- 230000009105 vegetative growth Effects 0.000 description 1
- 229940057613 veratrum Drugs 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- 230000002747 voluntary effect Effects 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 229920001221 xylan Polymers 0.000 description 1
- 150000004823 xylans Chemical class 0.000 description 1
- 108091005957 yellow fluorescent proteins Proteins 0.000 description 1
- 230000004572 zinc-binding Effects 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/0004—Oxidoreductases (1.)
- C12N9/0006—Oxidoreductases (1.) acting on CH-OH groups as donors (1.1)
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
- C12N15/8245—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving modified carbohydrate or sugar alcohol metabolism, e.g. starch biosynthesis
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
- C12N15/8245—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving modified carbohydrate or sugar alcohol metabolism, e.g. starch biosynthesis
- C12N15/8246—Non-starch polysaccharides, e.g. cellulose, fructans, levans
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
- C12N15/825—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving pigment biosynthesis
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
- C12N15/79—Vectors or expression systems specially adapted for eukaryotic hosts
- C12N15/82—Vectors or expression systems specially adapted for eukaryotic hosts for plant cells, e.g. plant artificial chromosomes (PACs)
- C12N15/8241—Phenotypically and genetically modified plants via recombinant DNA technology
- C12N15/8242—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits
- C12N15/8243—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine
- C12N15/8255—Phenotypically and genetically modified plants via recombinant DNA technology with non-agronomic quality (output) traits, e.g. for industrial processing; Value added, non-agronomic traits involving biosynthetic or metabolic pathways, i.e. metabolic engineering, e.g. nicotine, caffeine involving lignin biosynthesis
Definitions
- sequence.txt was created on Oct. 9, 2008, and is 106 KB.
- the file can be accessed using Microsoft Word on a computer that uses Windows OS.
- This document relates to methods, materials, and kits involved in identifying cinnamyl-alcohol dehydrogenase (CAD) alleles in sorghum germplasm and breeding methods to incorporate CAD alleles encoding truncated CAD polypeptides into desired sorghum germplasm lines or elite sorghum breeding lines.
- Methods for generating truncated CAD coding sequences through mutation of sorghum or preparation of synthetic sequences are also described herein as well as methods for generating transgenic plants expressing truncated CAD coding sequences.
- This document also relates to sorghum plants having a novel combination of CAD alleles and/or caffeic acid O-methyltransferase (COMT) alleles encoding truncated polypeptides as well as materials and methods for making such plants.
- CAD is associated with lignin biosynthesis.
- In sorghum, there is a need for identifying germplasm having altered lignin or lignin content and for developing markers associated with such traits for use in breeding.
- the truncated CAD sequences described herein and markers associated with such truncations will expedite the selection of superior new varieties of sorghum with enhanced biofuel conversion properties and/or forage properties. For example, the introduction of sweet sorghum and/or truncated CAD traits into a high biomass staygreen sorghum germplasm may improve yields and conversion properties dramatically.
- This document provides materials and methods involved in identifying alleles encoding truncated CAD polypeptides in sorghum germplasm. This document also provides breeding methods to incorporate alleles encoding truncated CAD polypeptides into desired sorghum germplasm lines or elite sorghum breeding lines. For example, this document provides isolated nucleic acids, transgenic plant cells and plants and plant tissues produced from transgenic plant cells, as well as plants of agronomically elite varieties. This document provides methods for producing plants comprising CAD encoding nucleic acids, for incorporating a desired trait into a sorghum cultivar, for characterizing and breeding sorghum plants, and for modulating the composition of a plant.
- kits to genotype a sorghum biological sample can be used to achieve desirable cell wall composition and structure, and advance the selection of advantageous varieties of sorghum for production of biomass with improved digestibility, which may benefit both humans and animals.
- an isolated nucleic acid comprises a sequence encoding a CAD polypeptide.
- the CAD polypeptide comprises at least 98% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminates at a position corresponding to residue 131 or 320 of SEQ ID NO: 6.
- an isolated nucleic acid comprises a sequence encoding a sorghum CAD polypeptide.
- the sorghum CAD polypeptide comprises at least 80% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminates at a position corresponding to residue 131 or 320 of SEQ ID NO: 6.
- the nucleic acid encoding a CAD polypeptide having at least 98% or at least 80% sequence identity to amino acids 1-130 or 1-319, and terminating at a position corresponding to residue 131 or 320 of SEQ ID NO: 6 further comprises a thymine corresponding to position 2794 of SEQ ID NO:2, position 2800 of SEQ ID NO: 4, 7, 10, or 13, position 4083 of SEQ ID NO: 2, position 4089 of SEQ ID NOs: 4 or 7, position 4090 of SEQ ID NO: 10, position 497 of SEQ ID NO: 1, position 394 of SEQ ID NOs: 3, 5, 8, 11, or 14, position 1064 of SEQ ID NO:1, position 962 of SEQ ID NO:11, or position 961 of SEQ ID NOs: 3, 5, or 8.
- the nucleic acid encoding a polypeptide having at least 98% or at least 80% sequence identity to amino acids 1-130 or 1-319, and terminating at a position corresponding to residue 131 or 320 of SEQ ID NO: 6, further comprises at least 80% sequence identity to a nucleotide sequence selected from the group consisting of SEQ ID NO: 1, 2, 3, 4, 5, 7, 8, 10, 11, 13, and 14.
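- The percent-identity thresholds above (at least 80% or at least 98% over amino acids 1-130 or 1-319 of a reference) can be illustrated with a minimal Python sketch. It assumes an ungapped, position-by-position comparison and uses short hypothetical sequences rather than SEQ ID NO: 6; a real comparison would use a proper alignment and the identity definition given in the specification.

```python
def percent_identity_over_region(query: str, reference: str, start: int, end: int) -> float:
    """Percent identity of `query` to residues start..end (1-based, inclusive)
    of `reference`, assuming the two sequences align position by position
    with no gaps (a simplifying assumption for illustration only)."""
    if start < 1 or end > len(reference) or start > end:
        raise ValueError("region must lie within the reference sequence")
    ref_region = reference[start - 1:end]
    query_region = query[start - 1:end]
    # Positions missing from a shorter query count as mismatches because
    # the denominator is the full length of the reference region.
    matches = sum(1 for q, r in zip(query_region, ref_region) if q == r)
    return 100.0 * matches / len(ref_region)


# Hypothetical 10-residue toy sequences (not taken from the sequence listing).
toy_reference = "MGSLEKERTT"
toy_query = "MGSLEKDRTT"  # one substitution at position 7

identity = percent_identity_over_region(toy_query, toy_reference, 1, 10)
print(f"{identity:.1f}% identity")  # 9 of 10 positions match
```

- In this toy case the query would pass an 80% threshold but fail a 98% threshold.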
- Transgenic plant cells comprising nucleic acids encoding CAD polypeptides are also provided herein.
- this document provides a transgenic plant cell comprising at least one exogenous nucleic acid.
- the exogenous nucleic acid comprises a regulatory region operably linked to a nucleic acid.
- the nucleic acid comprises a sequence encoding a CAD polypeptide or a sorghum CAD polypeptide having at least 98% or at least 80% sequence identity to amino acids 1-130 or 1-319 and terminating at a position corresponding to residue 131 or 320 of SEQ ID NO: 6.
- a plant produced from the transgenic plant cell has a decrease in the level of CAD activity as compared to the corresponding level in a control plant that does not comprise the nucleic acid.
- the plant produced from the transgenic plant cell exhibits a brown midrib phenotype as compared to a control plant that does not comprise the CAD encoding nucleic acid.
- the plant produced from the transgenic plant cell has a decrease in the level of lignin as compared to the corresponding level in a control plant that does not comprise the CAD encoding nucleic acid.
- Plants and tissues comprising transgenic plant cells are also provided herein.
- this document provides a plant comprising a transgenic plant cell.
- the transgenic plant cell comprises at least one exogenous nucleic acid.
- the exogenous nucleic acid comprises a regulatory region operably linked to a nucleic acid.
- the nucleic acid comprises a sequence encoding a CAD polypeptide or a sorghum CAD polypeptide having at least 98% or at least 80% sequence identity to amino acids 1-130 or 1-319 and terminating at a position corresponding to residue 131 or 320 of SEQ ID NO: 6.
- a plant produced from the transgenic plant cell has a decrease in the level of CAD activity as compared to the corresponding level in a control plant that does not comprise the nucleic acid.
- This document also provides biomass or seed comprising tissue from plants which comprise the transgenic plant cells.
- a method comprises growing a transgenic plant cell comprising an exogenous nucleic acid.
- the nucleic acid comprises a sequence encoding a CAD polypeptide having at least 98% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminating at a position corresponding to residue 131 or 320 of SEQ ID NO: 6.
- a method comprises growing a transgenic plant cell comprising an exogenous nucleic acid encoding a sorghum CAD polypeptide.
- the sorghum CAD polypeptide comprises at least 80% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminates at a position corresponding to residue 131 or 320 of SEQ ID NO: 6.
- a method comprises detecting a nucleic acid encoding a CAD polypeptide in the sorghum plant.
- the CAD polypeptide has at least 80% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminates at a position corresponding to residue 131 or 320 of SEQ ID NO: 6.
- the nucleic acid can have a thymine corresponding to position 2794 of SEQ ID NO:2, position 2800 of SEQ ID NO: 4, 7, 10, or 13, position 4083 of SEQ ID NO: 2, position 4089 of SEQ ID NOs: 4 or 7, position 4090 of SEQ ID NO: 10, position 497 of SEQ ID NO: 1, position 394 of SEQ ID NOs: 3, 5, 8, 11, or 14, position 1064 of SEQ ID NO:1, position 962 of SEQ ID NO:11, or position 961 of SEQ ID NOs: 3, 5, or 8.
- a method comprises contacting at least one probe or primer pair with nucleic acid from the sorghum plant.
- the probe or primer pair is specific for a polynucleotide that encodes a CAD polypeptide.
- the CAD polypeptide has at least 80% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminates at a position corresponding to residue 131 or 320 of SEQ ID NO: 6.
- the method also comprises determining whether or not the polynucleotide is present in the sorghum plant.
- the probe can be an oligonucleotide, e.g., an oligonucleotide comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs: 34 and 36.
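- The detection step described above amounts to asking whether a sample carries a thymine at a defined position of a reference sequence. The following Python sketch shows that check on already-determined sequence data. The sequences and the position are hypothetical toy values, not the SEQ ID positions listed above, and the sketch does not model probe hybridization or primer amplification.

```python
def base_at(sequence: str, position: int) -> str:
    """Return the base at a 1-based position, as positions are numbered
    in a sequence listing."""
    if not 1 <= position <= len(sequence):
        raise ValueError("position lies outside the sequence")
    return sequence[position - 1].upper()


def carries_thymine_marker(sequence: str, position: int) -> bool:
    """True if the sequence has a thymine at the given 1-based position."""
    return base_at(sequence, position) == "T"


# Hypothetical toy sequences that differ only at position 4.
toy_reference_allele = "ATGCAGGCC"
toy_variant_allele = "ATGTAGGCC"
MARKER_POSITION = 4  # hypothetical position, for illustration only

for name, seq in [("reference", toy_reference_allele), ("variant", toy_variant_allele)]:
    print(name, carries_thymine_marker(seq, MARKER_POSITION))
```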
- Kits for genotyping a sorghum biological sample are provided herein.
- this document provides a kit comprising a primer pair that specifically amplifies, or a probe that specifically hybridizes to, a polynucleotide that encodes a CAD polypeptide.
- the CAD polypeptide comprises at least 80% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminates at a position corresponding to residue 131 or 320 of SEQ ID NO: 6.
- a kit comprises at least one primer of the primer pair or probe having specificity for a thymine corresponding to position 2794 of SEQ ID NO:2, position 2800 of SEQ ID NO: 4, 7, 10, or 13, position 4083 of SEQ ID NO: 2, position 4089 of SEQ ID NOs: 4 or 7, position 4090 of SEQ ID NO: 10, position 497 of SEQ ID NO: 1, position 394 of SEQ ID NOs: 3, 5, 8, 11, or 14, position 1064 of SEQ ID NO:1, position 962 of SEQ ID NO:11, or position 961 of SEQ ID NOs: 3, 5, or 8.
- a kit comprises at least one primer or probe comprising a nucleotide sequence selected from the group consisting of SEQ ID NO: 34 and 36.
- the method comprises crossing two or more sorghum plants to produce progeny plants.
- At least one sorghum plant comprises at least one CAD allele encoding a CAD polypeptide having at least 80% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6, and terminating at a position corresponding to residue 131 or 320 of SEQ ID NO: 6.
- the progeny plants can have at least one allele at a COMT locus that encodes a truncated COMT polypeptide.
- the method can also comprise identifying one or more of the progeny plants that comprise the at least one CAD allele.
- the at least one progeny plant can be homozygous for the CAD allele.
- the method can comprise identifying the CAD allele by a thymine corresponding to position 2794 of SEQ ID NO:2, position 2800 of SEQ ID NO: 4, 7, 10, or 13, position 4083 of SEQ ID NO: 2, position 4089 of SEQ ID NOs: 4 or 7, position 4090 of SEQ ID NO: 10, position 497 of SEQ ID NO: 1, position 394 of SEQ ID NOs: 3, 5, 8, 11, or 14, position 1064 of SEQ ID NO:1, position 962 of SEQ ID NO:11, or position 961 of SEQ ID NOs: 3, 5, or 8.
- the method involves identification with at least one oligonucleotide specific for the CAD allele, e.g., an oligonucleotide comprising a nucleotide sequence set forth in SEQ ID NOs: 34 or 36.
- the method can also comprise using one or more of the identified progeny plants in a next generation of plant breeding.
- a method of introducing a desired trait into a sorghum cultivar by marker assisted backcrossing is provided herein.
- the method can comprise identifying a first sorghum plant having at least one CAD allele that encodes a CAD polypeptide.
- the CAD polypeptide comprises at least 80% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminates at a position corresponding to residue 131 or 320 of SEQ ID NO: 6.
- the method can also comprise crossing the first sorghum plant with a second, genetically distinct sorghum plant having a desired trait, to produce progeny plants.
- the desired trait is not a phenotype conferred by the CAD allele.
- the method can also comprise selecting one or more progeny plants that have the desired trait and have a marker associated with the CAD allele, to produce selected progeny plants.
- the associated marker can comprise a thymine corresponding to position 2794 of SEQ ID NO:2, position 2800 of SEQ ID NO: 4, 7, 10, or 13, position 4083 of SEQ ID NO: 2, position 4089 of SEQ ID NOs: 4 or 7, position 4090 of SEQ ID NO: 10, position 497 of SEQ ID NO: 1, position 394 of SEQ ID NOs: 3, 5, 8, 11, or 14, position 1064 of SEQ ID NO:1, position 962 of SEQ ID NO:11, or position 961 of SEQ ID NOs: 3, 5, or 8.
- the selected progeny plants can be backcrossed with the first or second plants to produce backcross progeny plants, and selected for backcross progeny plants that have the desired trait and the marker.
- the backcross progeny plants can have more than one marker associated with the CAD allele, or can be homozygous for the CAD allele. Selection can also be carried out for a marker associated with the desired trait. Backcrossing and selection can be repeated at least three times to produce BC4 or higher backcross progeny plants that have the desired trait and the at least one CAD allele.
- Such progeny plants can also have the at least one allele at the COMT locus that encodes a truncated COMT polypeptide.
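- As background to the repeated backcrossing described above, the expected average share of the recurrent parent genome after n backcross generations is commonly approximated as 1 - (1/2)^(n+1). The Python sketch below tabulates that textbook expectation. It is an idealized average that assumes no selection and no linkage drag; it is not a figure taken from this document, and individual plants vary around it.

```python
def expected_recurrent_fraction(backcross_generation: int) -> float:
    """Expected average fraction of the recurrent-parent genome in BCn
    progeny, under the idealized assumption of no selection or linkage."""
    if backcross_generation < 1:
        raise ValueError("backcross generation must be 1 or greater")
    return 1.0 - 0.5 ** (backcross_generation + 1)


# Tabulate BC1 through BC4 (the F1 before any backcross would be 50%).
for n in range(1, 5):
    print(f"BC{n}: {expected_recurrent_fraction(n):.3%}")
```

- Under this approximation BC1 progeny average 75% recurrent-parent genome and BC4 progeny average about 96.9%, which is one reason marker-based selection at each generation is used to retain the donor allele of interest.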
- a method of introducing a desired trait into a sorghum cultivar comprises identifying the CAD allele with an oligonucleotide specific for the CAD allele.
- the oligonucleotide can comprise a nucleotide sequence selected from the group consisting of SEQ ID NOs: 34 and 36.
- a method comprises introducing into a plant cell an exogenous nucleic acid encoding a sorghum CAD polypeptide.
- the sorghum CAD polypeptide has at least 80% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminates at a position corresponding to residue 131 or 320 of SEQ ID NO: 6.
- the composition of a plant produced from the plant cell is modulated as compared to the composition of a control plant that does not comprise the nucleic acid, e.g., decreased lignin content, increased glucan content, increased cellulose content, or increased hemicellulose content.
- Plants of an agronomically elite sorghum variety are provided herein.
- this document provides plants that are homozygous at a CAD locus for an allele encoding a truncated CAD polypeptide.
- the plants are homozygous at a COMT locus for an allele that encodes a truncated COMT polypeptide.
- the plants can be male sterile or female sterile.
- FIG. 1(A-O) is an alignment of sorghum CAD genomic nucleotide sequences for alleles corresponding to full length CAD (SEQ ID NO:2 from Ceres germplasm ID No.: PI599692-81733680; and SEQ ID NO:4 from Ceres germplasm ID No.: 22043-81733671), a truncated CAD of 320 amino acids (SEQ ID NO:7 from Ceres germplasm ID No.: PI602730-81733686), a truncated CAD of 131 amino acids (SEQ ID NO:13 from Ceres germplasm ID No.: PI535790-81733677), and a CAD having a frameshift insertion mutation at position 4016 (SEQ ID NO:10 from Ceres germplasm ID No.: BICOLOR-81733675).
- a dash in an aligned sequence represents a gap, i.e., a lack of a nucleotide at that position. Identical nucleotides among aligned sequences are identified by boxes. FIG. 1 and the other alignment figure provided herein were generated using the program MUSCLE version 3.52.
- FIG. 2(A-F) is an alignment of sorghum CAD cDNA sequences for alleles corresponding to full length CAD (SEQ ID NO:1 from GI No. 119852230; SEQ ID NO:3 from Ceres germplasm ID No.: PI599692-81733680; and SEQ ID NO:5 from Ceres germplasm ID No.: 22043-81733671), a truncated CAD of 320 amino acids (SEQ ID NO:8 from Ceres germplasm ID No.: PI602730-81733686), a truncated CAD of 131 amino acids (SEQ ID NO:14 from Ceres germplasm ID No.: PI535790-81733677), and a CAD having a frameshift insertion mutation at position 890 (SEQ ID NO:11 from Ceres germplasm ID No.: BICOLOR-81733675).
- the brown midrib (BMR) trait results in reduced lignification, reduced cell-wall concentration, increased digestibility and increased voluntary intake of feed by ruminants (Casler et al., 2003).
- BMR phenotypes are typical of some mutants of the CAD and COMT genes.
- There are at least 28 BMR mutants in sorghum, some being spontaneous mutations and others induced by mutagenesis.
- In addition to the brown vascular tissue pigmentation of the leaf midribs and stems, these BMR mutants often exhibit decreased lignin content in stems and leaves in comparison to wild types or cultivars lacking a BMR phenotype, as CAD and COMT contribute to the lignin biosynthesis pathway.
- BMR plants have lignin that is less polymerized and contains fewer phenolic monomers that can affect digestion.
- Suzuki et al. analyzed stem samples from BMR sorghum phenotypes and found increased levels of 5-hydroxy-guaiacyl residues in the cell walls, in comparison to wild types or cultivars lacking a BMR phenotype (Suzuki et al., 1997).
- Porter et al. describe phenotypes for several sorghum BMR mutations (Porter et al., 1978), for example, the content of acid detergent fiber, lignin, cellulose, hemicellulose, percent cell wall constituent, and in vitro cell wall constituent disappearance in stems and leaves for BMR-6 and BMR-17 mutations in comparison to normal plants.
- an “allele” is any of one or more alternative forms of a gene. In a diploid cell or organism, the two alleles of a given gene occupy corresponding loci on a pair of homologous chromosomes.
- amino acid refers to one of the twenty biologically occurring amino acids and to synthetic amino acids, including D/L optical isomers.
- “Cell type-preferential promoter” or “tissue-preferential promoter” refers to a promoter that drives expression preferentially in a target cell type or tissue, respectively, but may also lead to some transcription in other cell types or tissues as well.
- Control plant refers to a plant that does not contain the exogenous nucleic acid present in a transgenic plant of interest, but otherwise has the same or similar genetic background as such a transgenic plant.
- a suitable control plant can be a non-transgenic wild type plant, a non-transgenic segregant from a transformation experiment, or a transgenic plant that contains an exogenous nucleic acid other than the exogenous nucleic acid of interest.
- Domains are groups of substantially contiguous amino acids in a polypeptide that can be used to characterize protein families and/or parts of proteins. Such domains have a “fingerprint” or “signature” that can comprise conserved primary sequence, secondary structure, and/or three-dimensional conformation. Generally, domains are correlated with specific in vitro and/or in vivo activities.
- a domain can have a length of from 10 amino acids to 400 amino acids, e.g., 10 to 50 amino acids, or 25 to 100 amino acids, or 35 to 65 amino acids, or 35 to 55 amino acids, or 45 to 60 amino acids, or 200 to 300 amino acids, or 300 to 400 amino acids.
- Down-regulation refers to regulation that decreases production of expression products (mRNA, polypeptide, or both) relative to basal or native states.
- Exogenous with respect to a nucleic acid indicates that the nucleic acid is part of a recombinant nucleic acid construct, or is not in its natural environment.
- an exogenous nucleic acid can be a sequence from one species introduced into another species, i.e., a heterologous nucleic acid. Typically, such an exogenous nucleic acid is introduced into the other species via a recombinant nucleic acid construct.
- An exogenous nucleic acid can also be a sequence that is native to an organism and that has been reintroduced into cells of that organism.
- exogenous nucleic acid that includes a native sequence can often be distinguished from the naturally occurring sequence by the presence of non-natural sequences linked to the exogenous nucleic acid, e.g., non-native regulatory sequences flanking a native sequence in a recombinant nucleic acid construct.
- stably transformed exogenous nucleic acids typically are integrated at positions other than the position where the native sequence is found. It will be appreciated that an exogenous nucleic acid may have been introduced into a progenitor and not into the cell under consideration.
- a transgenic plant containing an exogenous nucleic acid can be the progeny of a cross between a stably transformed plant and a non-transgenic plant. Such progeny are considered to contain the exogenous nucleic acid.
- “Expression” refers to the process of converting genetic information of a polynucleotide into RNA through transcription, which is catalyzed by an enzyme, RNA polymerase, and into protein, through translation of mRNA on ribosomes.
- Heterologous polypeptide refers to a polypeptide that is not a naturally occurring polypeptide in a plant cell, e.g., a transgenic Panicum virgatum plant transformed with and expressing the coding sequence for a nitrogen transporter polypeptide from a Zea mays plant.
- isolated nucleic acid includes a naturally-occurring nucleic acid, provided one or both of the sequences immediately flanking that nucleic acid in its naturally-occurring genome is removed or absent.
- an isolated nucleic acid includes, without limitation, a nucleic acid that exists as a purified molecule or a nucleic acid molecule that is incorporated into a vector or a virus.
- “Locus” refers to a position on a chromosome, for example, the region of a chromosome at which a particular gene is located.
- the allele at a particular gene locus on one chromosome may be an allele that is different from the allele at that locus on the homologous chromosome, in which case the organism is considered heterozygous for that locus. If the alleles at a particular locus are the same, the organism is considered homozygous for that locus.
- Modulation of the level of chemical composition, phenotype, or enzyme activity refers to the change in the level that is observed as a result of expression of, or transcription from, an exogenous nucleic acid in a plant cell. The change in level is measured relative to the corresponding level in control plants.
- “Nucleic acid” and “polynucleotide” are used interchangeably herein, and refer to both RNA and DNA, including cDNA, genomic DNA, synthetic DNA, and DNA or RNA containing nucleic acid analogs. Polynucleotides can have various three-dimensional structures. A nucleic acid can be double-stranded or single-stranded (i.e., a sense strand or an antisense strand).
- Non-limiting examples of polynucleotides include genes, gene fragments, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, siRNA, micro-RNA, ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, nucleic acid probes and nucleic acid primers.
- “Operably linked” refers to the positioning of a regulatory region and a sequence to be transcribed in a nucleic acid so that the regulatory region is effective for regulating transcription or translation of the sequence.
- the translation initiation site of the translational reading frame of the coding sequence is typically positioned between one and about fifty nucleotides downstream of the regulatory region.
- a regulatory region can, however, be positioned as much as about 5,000 nucleotides upstream of the translation initiation site, or about 2,000 nucleotides upstream of the transcription start site.
- Polypeptide refers to a compound of two or more subunit amino acids, amino acid analogs, or other peptidomimetics, regardless of post-translational modification, e.g., phosphorylation or glycosylation.
- the subunits may be linked by peptide bonds or other bonds such as, for example, ester or ether bonds.
- Full-length polypeptides, truncated polypeptides, point mutants, insertion mutants, splice variants, chimeric proteins, and fragments thereof are encompassed by this definition.
- Progeny includes descendants of a particular plant or plant line. Progeny of an instant plant include seeds formed on F1, F2, F3, F4, F5, F6 and subsequent generation plants, or seeds formed on BC1, BC2, BC3, and subsequent generation plants, or seeds formed on F1BC1, F1BC2, F1BC3, and subsequent generation plants.
- the designation F1 refers to the progeny of a cross between two parents that are genetically distinct.
- the designations F2, F3, F4, F5 and F6 refer to subsequent generations of self- or sib-pollinated progeny of an F1 plant.
- a “probe” is a molecule capable of distinguishing among polymorphisms in the genome of an organism.
- a nucleic acid to which a conventional detectable label or reporter molecule is attached (e.g., a radioactive isotope, ligand, chemiluminescent agent, fluorescent agent, or enzyme) can be a probe.
- a probe can be complementary to a strand of a target nucleic acid, such as to a strand of genomic DNA from sorghum having a truncated CAD, whether from a sorghum plant or from a sample that includes DNA from a sorghum plant.
- Probes include not only deoxyribonucleic or ribonucleic acids but also polyamides and other probe materials that bind specifically to a target DNA sequence and can be used to detect the presence of that target DNA sequence. Hybridization of probes with target DNA can be detected by several methods including polymerase chain reaction (PCR) based assays, electrophoresis-based assays, or the molecular beacon or dynamic allele-specific hybridization (DASH) assays.
- Primer pairs are nucleic acids, typically oligonucleotides, that can anneal to a complementary or substantially complementary target DNA strand to form a hybrid between the primer and the target DNA strand, and then can be extended along the target DNA strand by a polymerase.
- Primer pairs of the present invention can be used for amplification of a specific nucleic acid, e.g., by PCR or other conventional nucleic acid amplification methods.
- regulatory region refers to a nucleic acid having nucleotide sequences that influence transcription or translation initiation and rate, and stability and/or mobility of a transcription or translation product. Regulatory regions include, without limitation, promoter sequences, enhancer sequences, response elements, protein recognition sites, inducible elements, protein binding sequences, 5′ and 3′ untranslated regions (UTRs), transcriptional start sites, termination sequences, polyadenylation sequences, introns, and combinations thereof.
- a regulatory region typically comprises at least a core (basal) promoter.
- a regulatory region also may include at least one control element, such as an enhancer sequence, an upstream element or an upstream activation region (UAR).
- a suitable enhancer is a cis-regulatory element (−212 to −154) from the upstream region of the octopine synthase (ocs) gene. Fromm et al., The Plant Cell, 1:977-984 (1989).
- Up-regulation refers to regulation that increases the level of an expression product (mRNA, polypeptide, or both) relative to basal or native states.
- Vector refers to a replicon, such as a plasmid, phage, or cosmid, into which another DNA segment may be inserted so as to bring about the replication of the inserted segment.
- a vector is capable of replication when associated with the proper control elements.
- the term “vector” includes cloning and expression vectors, as well as viral vectors and integrating vectors.
- An “expression vector” is a vector that includes a regulatory region.
- Polypeptides described herein include C-terminus truncated CAD polypeptides. Such polypeptides can be lignin-modulating polypeptides. Lignin-modulating polypeptides can be effective to modulate lignin levels when expressed in a plant or plant cell. Such polypeptides typically contain at least one domain indicative of lignin-modulating polypeptides, as described in more detail herein. In some embodiments, lignin-modulating polypeptides have greater than 90% identity to SEQ ID NOs: 6, 9, 12, 15, 18, 21, 24, 27, 30, or 33, as described in more detail herein.
- lignin-modulating polypeptides such as a C-terminus truncated sorghum CAD polypeptide can be about 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, or 350 amino acids in length.
- lignin-modulating polypeptides such as C-terminus truncated CADs can be 131 or 320 amino acids in length.
- the truncated CADs are from sorghum.
- a lignin-modulating polypeptide can contain an Alcohol dehydrogenase GroES-like domain (ADH N), a methyltransferase small domain (MTS), and/or a Zinc-binding dehydrogenase (ADH zinc N), which is predicted to be characteristic of a CAD enzyme.
- a C-terminus truncated CAD described herein comprises all or a substantial portion of an ADH N domain.
- the C-terminus truncated CAD described herein comprises an ADH N domain and a portion of an ADH zinc N domain.
- SEQ ID NO: 9 sets forth the amino acid sequence of a truncated CAD clone, identified herein as PI602730-81733686, that is predicted to encode a polypeptide containing a portion of an ADH zinc N domain and ADH N and MTS domains.
- SEQ ID NO: 15 sets forth the amino acid sequence of a sorghum clone, identified herein as PI535790-81733677, that is predicted to encode a polypeptide containing a portion of an ADH N domain.
- the truncated CAD described herein is a naturally occurring polypeptide.
- the truncated CAD described herein is synthetic.
- an allelic variant of a sorghum CAD can be identified by BLASTing or designing primers that recognize conserved regions of the gene and amplifying said gene and then synthesizing a nucleic acid that encodes truncated CAD.
- site directed mutagenesis may be used to generate desired truncations.
- a truncated polypeptide may retain certain domains of the naturally occurring polypeptide while lacking others.
- a truncated CAD comprises about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, or 95 amino acids of an ADH N domain.
- a truncated CAD comprises about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 110, or 120 amino acids of an ADH zinc N domain.
- a truncated polypeptide is a dominant negative polypeptide.
- SEQ ID NOs: 9 and 15 set forth the amino acid sequences of lignin-modulating polypeptides that are truncated at the C-terminal end relative to a full-length sorghum CAD polypeptide. Expression in a plant of such a truncated polypeptide confers a difference in the level of lignin in a tissue of the plant as compared to the corresponding level in tissue of a control plant that does not comprise the truncation.
- one or more functional homologs of a reference lignin-modulating polypeptide defined by one or more of the Pfam descriptions indicated above are suitable for use as lignin-modulating polypeptides or truncations thereof.
- a functional homolog is a polypeptide that has sequence similarity to a reference truncated CAD polypeptide, and that exhibits a brown midrib phenotype.
- a functional homolog and the reference polypeptide may be naturally occurring polypeptides, and the sequence similarity may be due to convergent or divergent evolutionary events. As such, functional homologs are sometimes designated in the literature as homologs, or orthologs, or paralogs.
- Variants of a naturally occurring functional homolog may themselves be functional homologs.
- Functional homologs can also be created via site-directed mutagenesis of the coding sequence for a lignin-modulating polypeptide, or by combining domains from the coding sequences for different naturally-occurring lignin-modulating polypeptides (“domain swapping”).
- the term “functional homolog” is sometimes applied to the nucleic acid that encodes a functionally homologous polypeptide.
- a nucleic acid encoding a truncated CAD may be synthesized.
- Functional homologs and potential allelic variants can be identified by analysis of nucleotide and polypeptide sequence alignments. For example, performing a query on a database of nucleotide or polypeptide sequences can identify homologs of lignin-modulating polypeptides. Sequence analysis can involve BLAST, Reciprocal BLAST, or PSI-BLAST analysis of nonredundant databases using a lignin-modulating polypeptide amino acid sequence as the reference sequence. Amino acid sequence is, in some instances, deduced from the nucleotide sequence. Those polypeptides in the database that have greater than 90% sequence identity are candidates for allelic variants of a lignin-modulating polypeptide which can be used to make truncations as described herein.
- Amino acid sequence similarity allows for conservative amino acid substitutions, such as substitution of one hydrophobic residue for another or substitution of one polar residue for another. If desired, manual inspection of such candidates can be carried out in order to narrow the number of candidates to be further evaluated. Manual inspection can be performed by selecting those candidates that appear to have domains present in lignin-modulating polypeptides, e.g., conserved functional domains.
- Conserved regions can be identified by locating a region within the primary amino acid sequence of a lignin-modulating polypeptide that is a repeated sequence, forms some secondary structure (e.g., alpha helices and beta sheets), establishes positively or negatively charged domains, or represents a protein motif or domain. See, e.g., the Pfam web site describing consensus sequences for a variety of protein motifs and domains on the World Wide Web at sanger.ac.uk/Software/Pfam/ and pfam.janelia.org/. The information included in the Pfam database is described in Sonnhammer et al., Nucl.
- conserved regions also can be determined by aligning sequences of the same or related polypeptides from closely related species. Closely related species preferably are from the same family. In some embodiments, alignment of sequences from two different species is adequate. Typically, polypeptides that exhibit at least about 40% amino acid sequence identity are useful to identify conserved regions.
- conserved regions of related polypeptides exhibit at least 45% amino acid sequence identity (e.g., at least 50%, at least 60%, at least 70%, at least 80%, or at least 90% amino acid sequence identity). In some embodiments, a conserved region exhibits at least 92%, 94%, 96%, 98%, or 99% amino acid sequence identity.
- a truncated CAD may have a conserved ADH domain as compared to CAD amino acid sequences from other species.
- allelic variants of the polypeptide set forth in SEQ ID NO: 6 are provided in the Sequence Listing. Such allelic variants include PI602730-81733686 (SEQ ID NO: 9) and PI535790-81733677 (SEQ ID NO: 15).
- an allelic variant of SEQ ID NO: 6 has an amino acid sequence with at least 80% sequence identity, e.g., 50%, 52%, 56%, 59%, 61%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity, to the amino acid sequence set forth in SEQ ID NO: 6, 9, 12, or 15.
- an allelic variant of SEQ ID NO: 6 or 12 is truncated by about 5, 10, 25, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, or 300 amino acids in length.
- the allelic variants are from sorghum.
- the identification of conserved regions in a truncated lignin-modulating polypeptide facilitates production of variants of truncated lignin-modulating polypeptides.
- Variants of truncated lignin-modulating polypeptides typically have 10 or fewer conservative amino acid substitutions within the primary amino acid sequence, e.g., 7 or fewer conservative amino acid substitutions, 5 or fewer conservative amino acid substitutions, or between 1 and 5 conservative substitutions.
- a useful variant polypeptide can be constructed based on one of the alignments of nucleic acids set forth in FIG. 1 or FIG. 2 and/or alleles identified in the Sequence Listing.
- Such a polypeptide includes the conserved regions, arranged in the order from amino-terminal end to carboxy-terminal end. Such a polypeptide may also include zero, one, or more than one amino acid in positions marked by dashes. When no amino acids are present at positions marked by dashes, the length of such a polypeptide is the sum of the amino acid residues in all conserved regions. When amino acids are present at all positions marked by dashes, such a polypeptide has a length that is the sum of the amino acid residues in all conserved regions and all dashes.
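The length bounds described above can be illustrated with a minimal Python sketch. It assumes an alignment row in which letters are conserved residues and each dash holds at most one amino acid; the function name and the example row are hypothetical and are not taken from FIG. 1, FIG. 2, or the Sequence Listing.

```python
def variant_length_bounds(alignment_row: str) -> tuple[int, int]:
    """Return (shortest, longest) polypeptide length for one alignment row.

    Assumption for this sketch: letters are conserved residues and each
    '-' is a position that is either empty or filled by one amino acid.
    """
    conserved = sum(1 for ch in alignment_row if ch.isalpha())
    dashes = alignment_row.count("-")
    # No dash filled: only conserved residues. Every dash filled: add them all.
    return conserved, conserved + dashes


# Hypothetical row: 16 conserved residues and 3 dash positions.
row = "MGSLA--EKTVTG-WAARD"
print(variant_length_bounds(row))  # (16, 19)
```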
- Truncations of CAD homologs or sorghum allelic variants of CAD are also described herein.
- CAD homologs or sorghum allelic variants of CAD can be truncated artificially, or naturally occurring truncations can be identified, such that the length of the resulting polypeptide corresponds to the length of the polypeptide of SEQ ID NO: 9 or 15.
- Polypeptide sequences of CAD homologs or sorghum allelic variants of CAD can be aligned with the truncated CAD sequences of SEQ ID NOs: 9 and/or 15 using, for example, a Clustal program such as ClustalW 1.83.
- nucleotide sequences encoding CAD homologs or sorghum allelic variants of CAD can be aligned with the truncated nucleotide sequences of SEQ ID NOs: 7 and/or 13 (genomic DNA), or 8 and/or 14 (cDNA) using a Clustal program.
- the alignments of polypeptides or nucleotides can then be used to determine the corresponding position at which a truncated sequence can terminate. For example, in FIG. 1, sequences aligned with SEQ ID NO: 13 that terminate with the nucleotide in the alignment that aligns with position 2802 of SEQ ID NO: 13 are corresponding truncations.
- sequences aligned with SEQ ID NO: 7 that terminate with the nucleotide in the alignment that aligns with position 4091 of SEQ ID NO: 7 are corresponding truncations.
- sequences aligned with SEQ ID NO: 14 that terminate with the nucleotide in the alignment that aligns with position 396 of SEQ ID NO: 14 are corresponding truncations.
- sequences aligned with SEQ ID NO: 8 that terminate with the nucleotide in the alignment that aligns with position 964 of SEQ ID NO: 8 are corresponding truncations.
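One way to locate a corresponding truncation from a pairwise alignment is sketched below in Python. This is an illustrative sketch, not the procedure used to prepare the figures: the function name is hypothetical, the toy sequences are invented, and gaps are assumed to be written as dashes in two aligned strings of equal length.

```python
def corresponding_truncation(aligned_ref: str, aligned_cand: str, ref_end: int) -> str:
    """Truncate the candidate at the alignment column that aligns with
    1-based position `ref_end` of the ungapped reference sequence."""
    if len(aligned_ref) != len(aligned_cand):
        raise ValueError("aligned sequences must have equal length")
    seen = 0  # ungapped reference residues counted so far
    for col, ch in enumerate(aligned_ref):
        if ch != "-":
            seen += 1
            if seen == ref_end:
                # Keep candidate residues up to this column, dropping gap symbols.
                return aligned_cand[: col + 1].replace("-", "")
    raise ValueError("ref_end exceeds the ungapped reference length")


# Toy alignment (invented); the reference terminates at its position 6.
aligned_ref = "ATG-CCGTAAGG"
aligned_cand = "ATGACC-TAAGG"
print(corresponding_truncation(aligned_ref, aligned_cand, 6))  # ATGACC
```

With a real alignment, `ref_end` would be the terminal position given above for the reference in question, e.g., 2802 for SEQ ID NO: 13.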
- CAD is known to be involved in several reduction reactions, including, but not limited to, the reduction of p-coumaraldehyde to p-coumaryl alcohol, caffeyl aldehyde to caffeyl alcohol, coniferaldehyde to coniferyl alcohol, and sinapaldehyde to sinapyl alcohol.
- substrates can be labeled, using carbon or other means, and CAD from a plant sample or a plant extract comprising CAD can be added to the substrate to be reduced. The amount of label in the product can be used to compare the level of CAD activity among samples.
- composition of each plant sample including, but not limited to, lignin, glucose, arabinose, fructose, galactose, xylose, cellulose, hemicellulose, 5-hydroxy-guaiacyl, neutral detergent fiber, acid detergent fiber, or acid detergent lignin can be measured by independent analytical chemistry techniques known in the art, typically wet chemical techniques. For example, following pre-treatment by acid, enzymes, or other means, plant samples can be analyzed for glucose using a YSI 2700D Dual-Channel Biochemistry Analyzer (YSI Life Sciences, Yellow Springs, Ohio).
- Glucan, xylan, arabinan, and lignin contents of a plant or plant part can be determined by ASTM methods E1758-01 (Determination of Biomass Sugars by High Performance Liquid Chromatography) and/or E1721-01 (Determination of Acid Insoluble Residue (Lignin) in Biomass).
- a lignin-modulating polypeptide has an amino acid sequence with at least 40% sequence identity, e.g., 50%, 52%, 56%, 59%, 61%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity, to one of the amino acid sequences set forth in SEQ ID NOs: 6, 9, 12, 15, 18, 21, 24, 27, 30, or 33. Polypeptides having such a percent sequence identity often have a domain indicative of a lignin-modulating polypeptide as discussed above.
- Amino acid sequences of lignin-modulating polypeptides having at least 80% sequence identity to one of the amino acid sequences set forth in SEQ ID NOs: 6, 9, 12, 15, 18, 21, 24, 27, 30, or 33 can be identified by BLAST as described herein.
- Percent sequence identity refers to the degree of sequence identity between a reference sequence, e.g., SEQ ID NO:9, and a candidate sequence.
- a candidate sequence typically has a length that is from 80 percent to 200 percent of the length of the reference sequence, e.g., 82, 85, 87, 89, 90, 93, 95, 97, 99, 100, 105, 110, 115, 120, 130, 140, 150, 160, 170, 180, 190, or 200 percent of the length of the reference sequence.
- a percent identity for a candidate nucleic acid or polypeptide relative to a reference nucleic acid or polypeptide can be determined as follows.
- a reference sequence (e.g., a nucleic acid sequence or an amino acid sequence) is aligned to one or more candidate sequences using ClustalW (version 1.83, default parameters).
- ClustalW calculates the best match between a reference and one or more candidate sequences, and aligns them so that identities, similarities and differences can be determined. Gaps of one or more residues can be inserted into a reference sequence, a candidate sequence, or both, to maximize sequence alignments.
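- for fast pairwise alignment of nucleic acid sequences, the following default parameters are used: word size: 2; window size: 4; scoring method: percentage; number of top diagonals: 4; and gap penalty: 5.
- for multiple alignment of nucleic acid sequences, the following parameters are used: gap opening penalty: 10.0; gap extension penalty: 5.0; and weight transitions: yes.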
- the ClustalW output is a sequence alignment that reflects the relationship between sequences.
- ClustalW can be run, for example, at the Baylor College of Medicine Search Launcher site (searchlauncher.bcm.tmc.edu/multi-align/multi-align.html) and at the European Bioinformatics Institute site on the World Wide Web (ebi.ac.uk/clustalw).
- the sequences are aligned using ClustalW, the number of identical matches in the alignment is divided by the length of the reference sequence, and the result is multiplied by 100.
- the percent identity is based on the alignment over the length of the shorter sequence. It is noted that the percent identity value can be rounded to the nearest tenth. For example, 78.11, 78.12, 78.13, and 78.14 are rounded down to 78.1, while 78.15, 78.16, 78.17, 78.18, and 78.19 are rounded up to 78.2.
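The calculation described above can be sketched in Python as follows. The sketch assumes the alignment has already been produced (for example by ClustalW) and is supplied as two gapped strings of equal length; it divides by the ungapped reference length, which is one of the two denominators mentioned above. The function names and toy sequences are hypothetical. Decimal arithmetic with half-up rounding is used so that values such as 78.15 round to 78.2 as stated, which binary floating point does not guarantee.

```python
from decimal import Decimal, ROUND_HALF_UP


def round_to_tenth(value) -> Decimal:
    """Round to the nearest tenth, with halves rounded up (78.15 -> 78.2)."""
    return Decimal(str(value)).quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)


def percent_identity(aligned_ref: str, aligned_cand: str) -> Decimal:
    """Identical matches divided by the ungapped reference length, times 100."""
    if len(aligned_ref) != len(aligned_cand):
        raise ValueError("aligned sequences must have equal length")
    matches = sum(
        1 for r, c in zip(aligned_ref, aligned_cand) if r == c and r != "-"
    )
    ref_len = sum(1 for r in aligned_ref if r != "-")
    if ref_len == 0:
        raise ValueError("reference sequence is empty")
    return round_to_tenth(Decimal(matches) * 100 / Decimal(ref_len))


# Toy alignment (invented): 7 identical positions over a reference of length 8.
print(percent_identity("MKT-AYIAK", "MKTQAYLAK"))  # 87.5
print(round_to_tenth("78.14"), round_to_tenth("78.15"))  # 78.1 78.2
```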
- a lignin-modulating polypeptide has an amino acid sequence with at least 40% sequence identity, e.g., 50%, 52%, 56%, 59%, 61%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity, to the amino acid sequence set forth in SEQ ID NO: 6, 9, 12, 15, 18, 21, 24, 27, 30, or 33.
- Amino acid sequences of polypeptides having greater than 80% sequence identity to the polypeptide set forth in SEQ ID NO:6 are provided in the Sequence Listing.
- Truncations of a lignin-modulating polypeptide may have a length that is from 10 percent to 90 percent of the length of the reference sequence, e.g., 10, 20, 30, 40, 50, 60, 70, 80, 90, or 95 percent of the length of the reference sequence.
- a lignin-modulating polypeptide can include additional amino acids that are not directly involved in lignin modulation, and thus such a polypeptide can be longer than would otherwise be the case.
- a lignin-modulating polypeptide can include a purification tag, a chloroplast transit peptide, a mitochondrial transit peptide, an amyloplast peptide, or a leader sequence added to the amino or carboxy terminus.
- a lignin-modulating polypeptide includes an amino acid sequence that functions as a reporter, e.g., a green fluorescent protein or yellow fluorescent protein.
- the methods and compositions described herein comprise truncated COMT amino acid and nucleic acid sequences that modulate the lignin content of plants.
- truncated COMT sequences include SEQ ID NOs: 21 or 27.
- Nucleic acids described herein include nucleic acids that are effective to modulate lignin levels when transcribed in a plant or plant cell. Such nucleic acids include, without limitation, those that encode a lignin-modulating polypeptide and those that can be used to inhibit expression of a lignin-modulating polypeptide via a nucleic acid based method.
- Nucleic acids encoding lignin-modulating polypeptides are described herein. Such nucleic acids include those that are less than 80% (e.g., from 10% to less than 45, 50, 55, 60, 65, 70, 75, or 80%) of the length of the full-length nucleic acid set forth in SEQ ID NOs: 1, 2, 4, 10, 16, 22, 28, 31, 3, 5, 17, 23, 29, or 32. Examples of nucleic acids encoding lignin-modulating polypeptides include SEQ ID NOs: 7, 10, 13, 19, 25, 8, 11, 14, 20, and 26, as described in more detail below.
- a lignin-modulating nucleic acid can comprise the nucleotide sequence set forth in SEQ ID NO: 7, 8, 10, 11, 13, 14, 19, 20, 25, or 26.
- a lignin-modulating nucleic acid can be a variant of the nucleic acid having the nucleotide sequence set forth in SEQ ID NO: 1, 2, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 3, 5, 8, 11, 14, 17, 20, 23, 26, 29, or 32.
- a lignin-modulating nucleic acid can have a nucleotide sequence with at least 80% sequence identity, e.g., 81%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity, to the nucleotide sequence set forth in SEQ ID NO: 1, 2, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 3, 5, 8, 11, 14, 17, 20, 23, 26, 29, or 32.
- Isolated nucleic acid molecules can be produced by standard techniques. For example, polymerase chain reaction (PCR) techniques can be used to obtain an isolated nucleic acid containing a nucleotide sequence described herein. PCR can be used to amplify specific sequences from DNA as well as RNA, including sequences from total genomic DNA or total cellular RNA. Various PCR methods are described, for example, in PCR Primer: A Laboratory Manual, Dieffenbach and Dveksler, eds., Cold Spring Harbor Laboratory Press, 1995. Generally, sequence information from the ends of the region of interest or beyond is employed to design oligonucleotide primers that are identical or similar in sequence to opposite strands of the template to be amplified.
- Isolated nucleic acids also can be chemically synthesized, either as a single nucleic acid molecule (e.g., using automated DNA synthesis in the 3′ to 5′ direction using phosphoramidite technology) or as a series of oligonucleotides.
- one or more pairs of long oligonucleotides can be synthesized that contain the desired sequence, with each pair containing a short segment of complementarity (e.g., about 15 nucleotides) such that a duplex is formed when the oligonucleotide pair is annealed.
- DNA polymerase is used to extend the oligonucleotides, resulting in a single, double-stranded nucleic acid molecule per oligonucleotide pair, which then can be ligated into a vector.
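The oligonucleotide-pair approach can be modeled with a short Python sketch. It is a simplified illustration under stated assumptions: the target sequence is invented, the function names are hypothetical, the overlap is placed at the middle of the target, and the polymerase step is modeled as a simple fill-in of each strand against the other.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq: str) -> str:
    """Reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def design_oligo_pair(target: str, overlap: int = 15) -> tuple[str, str]:
    """Split a target into a top-strand oligo and a bottom-strand oligo whose
    3' ends are complementary over `overlap` nucleotides."""
    if len(target) < 2 * overlap:
        raise ValueError("target too short for the requested overlap")
    mid = len(target) // 2
    rev_start = mid - overlap // 2
    fwd_end = rev_start + overlap
    fwd = target[:fwd_end]             # top strand, 5' portion of the target
    rev = revcomp(target[rev_start:])  # bottom strand, covers the 3' portion
    return fwd, rev


def extend_pair(fwd: str, rev: str, overlap: int = 15) -> str:
    """Model annealing and polymerase fill-in; return the top strand of the duplex."""
    rev_top = revcomp(rev)  # top-strand sequence templated by the bottom oligo
    if fwd[-overlap:] != rev_top[:overlap]:
        raise ValueError("oligonucleotides do not anneal over the overlap")
    return fwd + rev_top[overlap:]


# Invented 60-nucleotide target, for illustration only.
target = "ATGGCTAGCTTGACCGATCGTACGGATCCATTGCAGGTCAACTGGCTTAAGCCGTATGCA"
fwd, rev = design_oligo_pair(target)
assert extend_pair(fwd, rev) == target
```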
- Isolated nucleic acids of the invention also can be obtained by mutagenesis of, e.g., a naturally occurring DNA.
- a nucleic acid encoding one of the lignin-modulating polypeptides described herein can be used to express the polypeptide in a plant species of interest, typically by transforming a plant cell with a nucleic acid having the coding sequence for the polypeptide operably linked in sense orientation to one or more regulatory regions. It will be appreciated that because of the degeneracy of the genetic code, a number of nucleic acids can encode a particular lignin-modulating polypeptide; i.e., for many amino acids, there is more than one nucleotide triplet that serves as the codon for the amino acid. Thus, codons in the coding sequence for a given lignin-modulating polypeptide can be modified such that optimal expression in a particular plant species is obtained, using appropriate codon bias tables for that species.
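Codon choice under the degeneracy of the genetic code can be sketched in Python as below. The codon frequencies are hypothetical placeholders and do not come from a real codon bias table for any species; the table covers only the residues used in the example, and the function name is likewise illustrative. In practice, a published codon usage table for the target plant species would be substituted.

```python
# Hypothetical codon frequencies for illustration only (not real usage data).
CODON_USAGE = {
    "M": {"ATG": 1.00},
    "A": {"GCC": 0.40, "GCG": 0.28, "GCT": 0.18, "GCA": 0.14},
    "K": {"AAG": 0.78, "AAA": 0.22},
    "L": {"CTC": 0.30, "CTG": 0.28, "CTT": 0.15, "TTG": 0.13, "CTA": 0.07, "TTA": 0.07},
    "*": {"TGA": 0.50, "TAG": 0.30, "TAA": 0.20},
}


def back_translate(protein: str, usage: dict) -> str:
    """Encode a protein using the most frequent codon for each residue."""
    codons = []
    for residue in protein:
        if residue not in usage:
            raise KeyError(f"no codon data for residue {residue!r}")
        options = usage[residue]
        # Several triplets can encode one amino acid; pick the most used one.
        codons.append(max(options, key=options.get))
    return "".join(codons)


print(back_translate("MAKL*", CODON_USAGE))  # ATGGCCAAGCTCTGA
```

Choosing the single most frequent codon is the simplest strategy; sampling codons in proportion to their frequencies is another common design choice.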
- expression of a lignin-modulating polypeptide inhibits one or more functions of an endogenous polypeptide.
- a nucleic acid that encodes a dominant negative polypeptide can be used to inhibit protein function.
- a dominant negative polypeptide typically is truncated relative to an endogenous wild type polypeptide, and its presence in a cell inhibits one or more functions of the wild type polypeptide in that cell, i.e., the dominant negative polypeptide is genetically dominant and confers a loss of function.
- the mechanism by which a dominant negative polypeptide confers such a phenotype can vary but often involves a protein-protein interaction or a protein-DNA interaction.
- a dominant negative polypeptide can be an enzyme that is truncated relative to a native wild type enzyme, such that the truncated polypeptide retains domains involved in binding a first protein but lacks domains involved in binding a second protein. The truncated polypeptide is thus unable to properly modulate the activity of the second protein. See, e.g., US 2007/0056058.
- Polynucleotides and recombinant constructs described herein can be used to inhibit expression of a CAD or COMT polypeptide in a plant species of interest. See, e.g., Matzke and Birchler, Nature Reviews Genetics 6:24-35 (2005); Akashi et al., Nature Reviews Mol. Cell. Biology 6:413-422 (2005); Mittal, Nature Reviews Genetics 5:355-365 (2004); Dorsett and Tuschl, Nature Reviews Drug Discovery 3: 318-329 (2004); and Nature Reviews RNA interference collection, October 2005 at nature.com/reviews/focus/mai.
- Suitable polynucleotides include full-length nucleic acids encoding lignin-modulating polypeptides or fragments of such full-length nucleic acids. In some embodiments, a complement of the full-length nucleic acid or a fragment thereof can be used.
- a fragment is at least 10 nucleotides, e.g., at least 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 30, 35, 40, 50, 80, 100, 200, 500 nucleotides or more.
- higher homology can be used to compensate for the use of a shorter sequence.
- Antisense technology is one well-known method.
- a nucleic acid of a gene to be repressed is cloned and operably linked to a regulatory region and a transcription termination sequence so that the antisense strand of RNA is transcribed.
- the recombinant construct is then transformed into plants, as described herein, and the antisense strand of RNA is produced.
- the nucleic acid need not be the entire sequence of the gene to be repressed, but typically will be substantially complementary to at least a portion of the sense strand of the gene to be repressed.
- in another method, a nucleic acid can be transcribed into a ribozyme, or catalytic RNA, that affects expression of an mRNA.
- Ribozymes can be designed to specifically pair with virtually any target RNA and cleave the phosphodiester backbone at a specific location, thereby functionally inactivating the target RNA.
- Heterologous nucleic acids can encode ribozymes designed to cleave particular mRNA transcripts, thus preventing expression of a polypeptide.
- Hammerhead ribozymes are useful for destroying particular mRNAs, although various ribozymes that cleave mRNA at site-specific recognition sequences can be used.
- Hammerhead ribozymes cleave mRNAs at locations dictated by flanking regions that form complementary base pairs with the target mRNA. The sole requirement is that the target RNA contains a 5′-UG-3′ nucleotide sequence.
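As a rough illustration of the site requirement stated above, the following Python sketch scans an mRNA for 5′-UG-3′ dinucleotides and reports the flanking sequence on each side, against which complementary ribozyme arms would be designed. The function name, the arm length, and the example sequence are assumptions made for illustration; actual ribozyme design involves further considerations that are not modeled here.

```python
def find_ug_sites(mrna: str, arm: int = 8) -> list[tuple[int, str, str]]:
    """Return (1-based position of the U, 5' flank, 3' flank) for every UG.

    DNA-style input is accepted and converted to RNA. The flanks are the
    target sequences that ribozyme arms would base-pair with.
    """
    rna = mrna.upper().replace("T", "U")
    sites = []
    start = rna.find("UG")
    while start != -1:
        left = rna[max(0, start - arm):start]
        right = rna[start + 2:start + 2 + arm]
        sites.append((start + 1, left, right))
        start = rna.find("UG", start + 1)
    return sites


# Invented example sequence.
for position, left, right in find_ug_sites("AUGGCUUGCAAUGA", arm=3):
    print(position, left, right)
```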
- the construction and production of hammerhead ribozymes is known in the art. See, for example, U.S. Pat. No. 5,254,678 and WO 02/46449 and references cited therein.
- Hammerhead ribozyme sequences can be embedded in a stable RNA such as a transfer RNA (tRNA) to increase cleavage efficiency in vivo.
- tRNA transfer RNA
- RNA endoribonucleases which have been described, such as the one that occurs naturally in Tetrahymena thermophila , can be useful. See, for example, U.S. Pat. Nos. 4,987,071 and 6,423,885.
- RNAi can also be used to inhibit the expression of a gene.
- a construct can be prepared that includes a sequence that is transcribed into an RNA that can anneal to itself, e.g., a double stranded RNA having a stem-loop structure.
- one strand of the stem portion of a double stranded RNA comprises a sequence that is similar or identical to the sense coding sequence or a fragment thereof of a lignin-modulating polypeptide, and that is from about 10 nucleotides to about 2,500 nucleotides in length.
- the length of the sequence that is similar or identical to the sense coding sequence can be from 10 nucleotides to 500 nucleotides, from 15 nucleotides to 300 nucleotides, from 20 nucleotides to 100 nucleotides, or from 25 nucleotides to 100 nucleotides.
- the other strand of the stem portion of a double stranded RNA comprises a sequence that is similar or identical to the antisense strand or a fragment thereof of the coding sequence of the lignin-modulating polypeptide, and can have a length that is shorter, the same as, or longer than the corresponding length of the sense sequence.
- one strand of the stem portion of a double stranded RNA comprises a sequence that is similar or identical to the 3′ or 5′ untranslated region, or a fragment thereof, of an mRNA encoding a lignin-modulating polypeptide, and the other strand of the stem portion of the double stranded RNA comprises a sequence that is similar or identical to the sequence that is complementary to the 3′ or 5′ untranslated region, respectively, or a fragment thereof, of the mRNA encoding the lignin-modulating polypeptide.
- one strand of the stem portion of a double stranded RNA comprises a sequence that is similar or identical to the sequence of an intron, or a fragment thereof, in the pre-mRNA encoding a lignin-modulating polypeptide, and the other strand of the stem portion comprises a sequence that is similar or identical to the sequence that is complementary to the sequence of the intron, or a fragment thereof, in the pre-mRNA.
- the loop portion of a double stranded RNA can be from 3 nucleotides to 5,000 nucleotides, e.g., from 3 nucleotides to 25 nucleotides, from 15 nucleotides to 1,000 nucleotides, from 20 nucleotides to 500 nucleotides, or from 25 nucleotides to 200 nucleotides.
- the loop portion of the RNA can include an intron or a fragment thereof.
- a double stranded RNA can have zero, one, two, three, four, five, six, seven, eight, nine, ten, or more stem-loop structures.
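- The stem-loop arrangement described above can be sketched in Python as a sense stem, a loop, and the reverse complement of the sense stem on one transcript. The function names and toy sequences are hypothetical; the length checks use the broadest stem (about 10 to about 2,500 nucleotides) and loop (3 to 5,000 nucleotides) ranges given in the text and treat the "about" limits as hard bounds, which is a simplification.

```python
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA string (A, C, G, U only)."""
    return rna.translate(COMPLEMENT)[::-1]


def build_hairpin(sense: str, loop: str) -> str:
    """Assemble sense stem + loop + antisense stem as one RNA string."""
    sense = sense.upper().replace("T", "U")
    loop = loop.upper().replace("T", "U")
    if not 10 <= len(sense) <= 2500:
        raise ValueError("stem length outside the 10 to 2,500 nt range")
    if not 3 <= len(loop) <= 5000:
        raise ValueError("loop length outside the 3 to 5,000 nt range")
    return sense + loop + reverse_complement(sense)


if __name__ == "__main__":
    # Toy 12-nt stem and 4-nt loop, chosen only to keep the output short.
    print(build_hairpin("AUGGCUAGCUAG", "GAAA"))
```

- Because the second stem is generated from the first, the two arms are perfectly complementary; the text also allows the antisense arm to be shorter or longer than the sense arm, which this sketch does not model.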
- Methods for using RNAi to inhibit the expression of a gene are known to those of skill in the art. See, e.g., U.S. Pat. Nos. 5,034,323; 6,326,527; 6,452,067; 6,573,099; 6,753,139; and 6,777,588. See also WO 97/01952; WO 98/53083; WO 99/32619; WO 98/36083; and U.S. Patent Publications 20030175965, 20030175783, 20040214330, and 20030180945.
- Constructs containing regulatory regions operably linked to nucleic acid molecules in sense orientation can also be used to inhibit the expression of a gene.
- the transcription product can be similar or identical to the sense coding sequence, or a fragment thereof, of a truncated lignin-modulating polypeptide.
- the transcription product also can be unpolyadenylated, lack a 5′ cap structure, or contain an unspliceable intron.
- a construct containing a nucleic acid having at least one strand that is a template for both sense and antisense sequences that are complementary to each other is used to inhibit the expression of a gene.
- the sense and antisense sequences can be part of a larger nucleic acid molecule or can be part of separate nucleic acid molecules having sequences that are not complementary.
- the sense or antisense sequence can be a sequence that is identical or complementary to the sequence of an mRNA, the 3′ or 5′ untranslated region of an mRNA, or an intron in a pre-mRNA encoding a lignin-modulating polypeptide, or a fragment of such sequences.
- the sense or antisense sequence is identical or complementary to a sequence of the regulatory region that drives transcription of the gene encoding a lignin-modulating polypeptide.
- the sense sequence is the sequence that is complementary to the antisense sequence.
- the sense and antisense sequences can be a length greater than about 10 nucleotides (e.g., 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more nucleotides).
- an antisense sequence can be 21 or 22 nucleotides in length.
- the sense and antisense sequences range in length from about 15 nucleotides to about 30 nucleotides, e.g., from about 18 nucleotides to about 28 nucleotides, or from about 21 nucleotides to about 25 nucleotides.
- an antisense sequence is a sequence complementary to an mRNA sequence, or a fragment thereof, encoding a lignin-modulating polypeptide described herein.
- the sense sequence complementary to the antisense sequence can be a sequence present within the mRNA of the lignin-modulating polypeptide.
- sense and antisense sequences are designed to correspond to a 15-30 nucleotide sequence of a target mRNA such that the level of that target mRNA is reduced.
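- As a rough illustration of designing sense and antisense sequences against a 15-30 nucleotide stretch of a target mRNA, the following Python sketch slides a fixed-length window along a target and reports each sense window with its complementary antisense sequence. The function name, the default length of 21, and the toy sequence are assumptions; the sketch does not rank or filter candidates.

```python
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def sense_antisense_pairs(mrna: str, length: int = 21) -> list[tuple[int, str, str]]:
    """Return (start, sense, antisense) for every window of `length` nt."""
    if not 15 <= length <= 30:
        raise ValueError("length must be between 15 and 30 nucleotides")
    rna = mrna.upper().replace("T", "U")
    pairs = []
    for start in range(len(rna) - length + 1):
        sense = rna[start:start + length]
        antisense = sense.translate(COMPLEMENT)[::-1]  # reverse complement
        pairs.append((start, sense, antisense))
    return pairs


if __name__ == "__main__":
    toy_mrna = "AUGGCUAGCUAGGAUCCGAUCGAUU"  # 25 nt, illustrative only
    for start, sense, antisense in sense_antisense_pairs(toy_mrna):
        print(start, sense, antisense)
```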
- a construct containing a nucleic acid having at least one strand that is a template for more than one sense sequence can be used to inhibit the expression of a gene.
- a construct containing a nucleic acid having at least one strand that is a template for more than one antisense sequence can be used to inhibit the expression of a gene.
- a construct can contain a nucleic acid having at least one strand that is a template for two sense sequences and two antisense sequences.
- the multiple sense sequences can be identical or different, and the multiple antisense sequences can be identical or different.
- a construct can have a nucleic acid having one strand that is a template for two identical sense sequences and two identical antisense sequences that are complementary to the two identical sense sequences.
- an isolated nucleic acid can have one strand that is a template for (1) two identical sense sequences 20 nucleotides in length, (2) one antisense sequence that is complementary to the two identical sense sequences 20 nucleotides in length, (3) a sense sequence 30 nucleotides in length, and (4) three identical antisense sequences that are complementary to the sense sequence 30 nucleotides in length.
- the constructs provided herein can be designed to have any arrangement of sense and antisense sequences. For example, two identical sense sequences can be followed by two identical antisense sequences or can be positioned between two identical antisense sequences.
- a nucleic acid having at least one strand that is a template for one or more sense and/or antisense sequences can be operably linked to a regulatory region to drive transcription of an RNA molecule containing the sense and/or antisense sequence(s).
- a nucleic acid can be operably linked to a transcription terminator sequence, such as the terminator of the nopaline synthase (nos) gene.
- two regulatory regions can direct transcription of two transcripts: one from the top strand, and one from the bottom strand. See, for example, Yan et al., Plant Physiol., 141:1508-1518 (2006). The two regulatory regions can be the same or different.
- RNA molecules can form double-stranded RNA molecules that induce degradation of the target RNA.
- a nucleic acid can be positioned within a T-DNA or plant-derived transfer DNA (P-DNA) such that the left and right T-DNA border sequences, or the left and right border-like sequences of the P-DNA, flank or are on either side of the nucleic acid. See US 2006/0265788.
- the nucleic acid sequence between the two regulatory regions can be from about 15 to about 300 nucleotides in length.
- the nucleic acid sequence between the two regulatory regions is from about 15 to about 200 nucleotides in length, from about 15 to about 100 nucleotides in length, from about 15 to about 50 nucleotides in length, from about 18 to about 50 nucleotides in length, from about 18 to about 40 nucleotides in length, from about 18 to about 30 nucleotides in length, or from about 18 to about 25 nucleotides in length.
- a recombinant nucleic acid construct can comprise a nucleic acid encoding a lignin-modulating polypeptide as described herein, operably linked to a regulatory region suitable for expressing the lignin-modulating polypeptide in the plant or cell.
- a nucleic acid can comprise a coding sequence that encodes any of the lignin-modulating polypeptides as set forth in SEQ ID NOs: 9, 15, 21, or 27, or a variant thereof. Examples of nucleic acids encoding lignin-modulating polypeptides are set forth in SEQ ID NO:7, 8, 13, 14, 19, 20, 25, or 26.
- the lignin-modulating polypeptide encoded by a recombinant nucleic acid can be a native lignin-modulating polypeptide, or can be heterologous to the cell.
- the recombinant construct contains a nucleic acid that inhibits expression of a lignin-modulating polypeptide, operably linked to a regulatory region. Examples of suitable regulatory regions are described in the section entitled “Regulatory Regions.”
- Suitable vector backbones include, for example, those routinely used in the art such as plasmids, viruses, artificial chromosomes, BACs, YACs, or PACs.
- Suitable expression vectors include, without limitation, plasmids and viral vectors derived from, for example, bacteriophage, baculoviruses, and retroviruses. Numerous vectors and expression systems are commercially available from such corporations as Novagen (Madison, Wis.), Clontech (Palo Alto, Calif.), Stratagene (La Jolla, Calif.), and Invitrogen/Life Technologies (Carlsbad, Calif.).
- the vectors provided herein also can include, for example, origins of replication, scaffold attachment regions (SARs), and/or markers.
- a marker gene can confer a selectable phenotype on a plant cell.
- a marker can confer biocide resistance, such as resistance to an antibiotic (e.g., kanamycin, G418, bleomycin, or hygromycin), or an herbicide (e.g., glyphosate, chlorsulfuron or phosphinothricin).
- an expression vector can include a tag sequence designed to facilitate manipulation or detection (e.g., purification or localization) of the expressed polypeptide.
- Tag sequences such as luciferase, β-glucuronidase (GUS), green fluorescent protein (GFP), glutathione S-transferase (GST), polyhistidine, c-myc, hemagglutinin, or Flag™ tag (Kodak, New Haven, Conn.) sequences typically are expressed as a fusion with the encoded polypeptide.
- The choice of regulatory regions to be included in a recombinant construct depends upon several factors, including, but not limited to, efficiency, selectability, inducibility, desired expression level, and cell- or tissue-preferential expression. It is a routine matter for one of skill in the art to modulate the expression of a coding sequence by appropriately selecting and positioning regulatory regions relative to the coding sequence. Transcription of a nucleic acid can be modulated in a similar manner. Some suitable regulatory regions initiate transcription only, or predominantly, in certain cell types.
- a regulatory region may meet criteria for one classification based on its activity in one plant species, and yet meet criteria for a different classification based on its activity in another plant species.
- a promoter can be said to be “broadly expressing” when it promotes transcription in many, but not necessarily all, plant tissues.
- a broadly expressing promoter can promote transcription of an operably linked sequence in one or more of the shoot, shoot tip (apex), and leaves, but weakly or not at all in tissues such as roots or stems.
- a broadly expressing promoter can promote transcription of an operably linked sequence in one or more of the stem, shoot, shoot tip (apex), and leaves, but can promote transcription weakly or not at all in tissues such as reproductive tissues of flowers and developing seeds.
- Non-limiting examples of broadly expressing promoters that can be included in the nucleic acid constructs provided herein include the p326, YP0144, YP0190, p13879, YP0050, p32449, 21876, YP0158, YP0214, YP0380, PT0848, and PT0633 promoters.
- Additional examples include the cauliflower mosaic virus (CaMV) 35S promoter, the mannopine synthase (MAS) promoter, the 1′ or 2′ promoters derived from T-DNA of Agrobacterium tumefaciens, the figwort mosaic virus 34S promoter, actin promoters such as the rice actin promoter, and ubiquitin promoters such as the maize ubiquitin-1 promoter.
- In some cases, the CaMV 35S promoter is excluded from the category of broadly expressing promoters.
- Root-active promoters confer transcription in root tissue, e.g., root endodermis, root epidermis, or root vascular tissues.
- root-active promoters are root-preferential promoters, i.e., confer transcription only or predominantly in root tissue.
- Root-preferential promoters include the YP0128, YP0275, PT0625, PT0660, PT0683, and PT0758 promoters.
- Other root-preferential promoters include the PT0613, PT0672, PT0688, and PT0837 promoters, which drive transcription primarily in root tissue and to a lesser extent in ovules and/or seeds.
- root-preferential promoters include the root-specific subdomains of the CaMV 35S promoter (Lam et al., Proc. Natl. Acad. Sci. USA, 86:7890-7894 (1989)), root cell specific promoters reported by Conkling et al., Plant Physiol., 93:1203-1211 (1990), and the tobacco RD2 promoter.
- promoters that drive transcription in maturing endosperm can be useful. Transcription from a maturing endosperm promoter typically begins after fertilization and occurs primarily in endosperm tissue during seed development and is typically highest during the cellularization phase. Most suitable are promoters that are active predominantly in maturing endosperm, although promoters that are also active in other tissues can sometimes be used.
- Non-limiting examples of maturing endosperm promoters that can be included in the nucleic acid constructs provided herein include the napin promoter, the Arcelin-5 promoter, the phaseolin promoter (Bustos et al., Plant Cell, 1(9):839-853 (1989)), the soybean trypsin inhibitor promoter (Riggs et al., Plant Cell, 1(6):609-621 (1989)), the ACP promoter (Baerson et al., Plant Mol.
- zein promoters such as the 15 kD zein promoter, the 16 kD zein promoter, 19 kD zein promoter, 22 kD zein promoter and 27 kD zein promoter.
- Osgt-1 promoter from the rice glutelin-1 gene (Zheng et al., Mol. Cell. Biol., 13:5829-5842 (1993)), the beta-amylase promoter, and the barley hordein promoter.
- Other maturing endosperm promoters include the YP0092, PT0676, and PT0708 promoters.
- Promoters active in photosynthetic tissue confer transcription in green tissues such as leaves and stems. Most suitable are promoters that drive expression only or predominantly in such tissues. Examples of such promoters include the ribulose-1,5-bisphosphate carboxylase (RbcS) promoters such as the RbcS promoter from eastern larch (Larix laricina), the pine cab6 promoter (Yamamoto et al., Plant Cell Physiol., 35:773-778 (1994)), the Cab-1 promoter from wheat (Fejes et al., Plant Mol.
- promoters that have high or preferential activity in vascular bundles include YP0087, YP0093, YP0108, YP0022, and YP0080.
- Other vascular tissue-preferential promoters include the glycine-rich cell wall protein GRP 1.8 promoter (Keller and Baumgartner, Plant Cell, 3(10):1051-1061 (1991)), the Commelina yellow mottle virus (CoYMV) promoter (Medberry et al., Plant Cell, 4(2):185-192 (1992)), and the rice tungro bacilliform virus (RTBV) promoter (Dai et al., Proc. Natl. Acad. Sci. USA, 101(2):687-692 (2004)).
- Inducible promoters confer transcription in response to external stimuli such as chemical agents or environmental stimuli.
- inducible promoters can confer transcription in response to hormones such as gibberellic acid or ethylene, or in response to light or drought.
- Examples of drought-inducible promoters include YP0380, PT0848, YP0381, YP0337, PT0633, YP0374, PT0710, YP0356, YP0385, YP0396, YP0388, YP0384, PT0688, YP0286, YP0377, PD1367, and PD0901.
- Examples of nitrogen-inducible promoters include PT0863, PT0829, PT0665, and PT0886.
- Examples of shade-inducible promoters include PR0924 and PT0678.
- An example of a promoter induced by salt is rd29A (Kasuga et al. (1999) Nature Biotech 17: 287-291).
- A basal promoter is the minimal sequence necessary for assembly of a transcription complex required for transcription initiation.
- Basal promoters frequently include a “TATA box” element that may be located between about 15 and about 35 nucleotides upstream from the site of transcription initiation.
- Basal promoters also may include a “CCAAT box” element (typically the sequence CCAAT) and/or a GGGCG sequence, which can be located between about 40 and about 200 nucleotides, typically about 60 to about 120 nucleotides, upstream from the transcription start site.
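- The positional windows given above for basal promoter elements can be checked with a short Python sketch that scans a DNA sequence for a motif starting a given number of nucleotides upstream of the transcription start site. Measuring the distance from the first base of the motif to the start site, the literal motif "TATA", and the function name are all assumptions made for illustration; the sequence is synthetic.

```python
def find_upstream(seq: str, tss: int, motif: str, min_up: int, max_up: int) -> list[int]:
    """Return upstream distances (in nt) at which `motif` starts.

    `tss` is the 0-based index of the +1 nucleotide; a distance d means the
    motif begins at index tss - d.
    """
    seq = seq.upper()
    hits = []
    for dist in range(min_up, max_up + 1):
        start = tss - dist
        if start >= 0 and seq[start:start + len(motif)] == motif:
            hits.append(dist)
    return hits


if __name__ == "__main__":
    # Synthetic promoter: CCAAT 80 nt upstream and TATAAA 30 nt upstream of +1.
    promoter = "G" * 20 + "CCAAT" + "G" * 45 + "TATAAA" + "G" * 24 + "A" + "G" * 10
    tss_index = 100
    print(find_upstream(promoter, tss_index, "TATA", 15, 35))    # TATA box window
    print(find_upstream(promoter, tss_index, "CCAAT", 40, 200))  # CCAAT box window
```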
- a stem promoter may be specific to one or more stem tissues or specific to stem and other plant parts.
- Stem promoters may have high or preferential activity in, for example, epidermis and cortex, vascular cambium, procambium, or xylem.
- Examples of stem promoters include YP0018 which is disclosed in US20060015970 and CryIA(b) and CryIA(c) (Braga et al. 2003, Journal of new seeds 5:209-221).
- promoters include, but are not limited to, shoot-preferential, callus-preferential, trichome cell-preferential, guard cell-preferential such as PT0678, tuber-preferential, parenchyma cell-preferential, and senescence-preferential promoters.
- Promoters designated YP0086, YP0188, YP0263, PT0758, PT0743, PT0829, YP0119, and YP0096 may also be useful.
- a 5′ untranslated region can be included in nucleic acid constructs described herein.
- a 5′ UTR is transcribed, but is not translated, and lies between the start site of the transcript and the translation initiation codon and may include the +1 nucleotide.
- a 3′ UTR can be positioned between the translation termination codon and the end of the transcript.
- UTRs can have particular functions such as increasing mRNA stability or attenuating translation. Examples of 3′ UTRs include, but are not limited to, polyadenylation signals and transcription termination sequences, e.g., a nopaline synthase termination sequence.
- more than one regulatory region may be present in a recombinant polynucleotide, e.g., introns, enhancers, upstream activation regions, transcription terminators, and inducible elements.
- more than one regulatory region can be operably linked to the sequence of a polynucleotide encoding a truncated lignin-modulating polypeptide.
- Regulatory regions such as promoters for endogenous genes, can be obtained by chemical synthesis or by subcloning from a genomic DNA that includes such a regulatory region.
- a nucleic acid comprising such a regulatory region can also include flanking sequences that contain restriction enzyme sites that facilitate subsequent manipulation.
- the invention also features transgenic plant cells and plants comprising at least one recombinant nucleic acid construct described herein.
- a plant or plant cell can be transformed by having a construct integrated into its genome, i.e., can be stably transformed. Stably transformed cells typically retain the introduced nucleic acid with each cell division.
- a plant or plant cell can also be transiently transformed such that the construct is not integrated into its genome. Transiently transformed cells typically lose all or some portion of the introduced nucleic acid construct with each cell division such that the introduced nucleic acid cannot be detected in daughter cells after a sufficient number of cell divisions. Both transiently transformed and stably transformed transgenic plants and plant cells can be useful in the methods described herein.
- Transgenic plant cells used in methods described herein can constitute part or all of a whole plant. Such plants can be grown in a manner suitable for the species under consideration, either in a growth chamber, a greenhouse, or in a field. Transgenic plants can be bred as desired for a particular purpose, e.g., to introduce a recombinant nucleic acid into other lines, to transfer a recombinant nucleic acid to other species, or for further selection of other desirable traits. Alternatively, transgenic plants can be propagated vegetatively for those species amenable to such techniques. As used herein, a transgenic plant also refers to progeny of an initial transgenic plant provided the progeny inherits the transgene. Seeds produced by a transgenic plant can be grown and then selfed (or outcrossed and selfed) to obtain seeds homozygous for the nucleic acid construct.
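- The selfing step mentioned above can be illustrated with a small Python sketch that enumerates the offspring genotypes of a self-cross. It assumes a single transgene insertion at one locus that segregates in a Mendelian fashion, which the text does not state; "T" denotes the transgene, "t" its absence, and the function name is hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def self_cross(genotype: str) -> dict[str, Fraction]:
    """Return expected offspring genotype fractions from selfing a diploid.

    `genotype` is a two-character string such as "Tt" (hemizygous parent).
    """
    counts = Counter("".join(sorted(pair)) for pair in product(genotype, repeat=2))
    total = sum(counts.values())
    return {g: Fraction(n, total) for g, n in counts.items()}


if __name__ == "__main__":
    for offspring, fraction in self_cross("Tt").items():
        print(offspring, fraction)
```

- Under these assumptions one quarter of the seed from a hemizygous parent is expected to be homozygous for the construct, which is why a further generation of selfing and screening is typically needed to identify homozygous lines.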
- Transgenic plants can be grown in suspension culture, or tissue or organ culture.
- solid and/or liquid tissue culture techniques can be used.
- transgenic plant cells can be placed directly onto the medium or can be placed onto a filter that is then placed in contact with the medium.
- transgenic plant cells can be placed onto a flotation device, e.g., a porous membrane that contacts the liquid medium.
- a solid medium can be, for example, Murashige and Skoog (MS) medium containing agar and a suitable concentration of an auxin, e.g., 2,4-dichlorophenoxyacetic acid (2,4-D), and a suitable concentration of a cytokinin, e.g., kinetin.
- a reporter sequence encoding a reporter polypeptide having a reporter activity can be included in the transformation procedure and an assay for reporter activity or expression can be performed at a suitable time after transformation.
- a suitable time for conducting the assay typically is about 1-21 days after transformation, e.g., about 1-14 days, about 1-7 days, or about 1-3 days.
- the use of transient assays is particularly convenient for rapid analysis in different species, or to confirm expression of a heterologous lignin-modulating polypeptide whose expression has not previously been confirmed in particular recipient cells.
- Techniques for introducing nucleic acids into monocotyledonous and dicotyledonous plants are known in the art, and include, without limitation, Agrobacterium-mediated transformation, viral vector-mediated transformation, electroporation and particle gun transformation, e.g., U.S. Pat. Nos. 5,538,880; 5,204,253; 6,329,571 and 6,013,863. If a cell or cultured tissue is used as the recipient tissue for transformation, plants can be regenerated from transformed cultures if desired, by techniques known to those skilled in the art.
- a population of transgenic plants can be screened and/or selected for those members of the population that have a trait or phenotype conferred by expression of the transgene. For example, a population of progeny of a single transformation event can be screened for those plants having a desired level of expression of a lignin-modulating polypeptide or nucleic acid. Physical and biochemical methods can be used to identify expression levels.
- Such methods include Southern analysis or PCR amplification for detection of a polynucleotide; Northern blots, S1 RNase protection, primer-extension, or RT-PCR amplification for detecting RNA transcripts; enzymatic assays for detecting enzyme or ribozyme activity of polypeptides and polynucleotides; and protein gel electrophoresis, Western blots, immunoprecipitation, and enzyme-linked immunoassays to detect polypeptides.
- Other techniques such as in situ hybridization, enzyme staining, and immunostaining also can be used to detect the presence or expression of polypeptides and/or polynucleotides. Methods for performing all of the referenced techniques are known.
- a population of plants comprising independent transformation events can be screened for those plants having a desired trait, such as a modulated level of lignin. Selection and/or screening can be carried out over one or more generations, and/or in more than one geographic location.
- transgenic plants can be grown and selected under conditions which induce a desired phenotype or are otherwise necessary to produce a desired phenotype in a transgenic plant.
- selection and/or screening can be applied during a particular developmental stage in which the phenotype is expected to be exhibited by the plant. Selection and/or screening can be carried out to choose those transgenic plants having a statistically significant difference in lignin level relative to a control plant that lacks the transgene. Selected or screened transgenic plants have an altered phenotype as compared to a corresponding control plant, as described in the “Transgenic Plant Phenotypes” section herein.
- the polynucleotides and vectors described herein can be used to transform a number of monocotyledonous and dicotyledonous plants and plant cell systems, including species from one of the following families: Acanthaceae, Alliaceae, Alstroemeriaceae, Amaryllidaceae, Apocynaceae, Arecaceae, Asteraceae, Berberidaceae, Bixaceae, Brassicaceae, Bromeliaceae, Cannabaceae, Caryophyllaceae, Cephalotaxaceae, Chenopodiaceae, Colchicaceae, Cucurbitaceae, Dioscoreaceae, Ephedraceae, Erythroxylaceae, Euphorbiaceae, Fabaceae, Lamiaceae, Linaceae, Lycopodiaceae, Malvaceae, Melanthiaceae, Musaceae, Myrtaceae, Nyssaceae, Papaverace
- Suitable species may include members of the genera Abelmoschus, Abies, Acer, Agrostis, Allium, Alstroemeria, Ananas, Andrographis, Andropogon, Artemisia, Arundo, Atropa, Berberis, Beta, Bixa, Brassica, Calendula, Camellia, Camptotheca, Cannabis, Capsicum, Carthamus, Catharanthus, Cephalotaxus, Chrysanthemum, Cinchona, Citrullus, Coffea, Colchicum, Coleus, Cucumis, Cucurbita, Cynodon, Datura, Dianthus, Digitalis, Dioscorea, Elaeis, Ephedra, Erianthus, Erythroxylum, Eucalyptus, Festuca, Fragaria, Galanthus, Glycine, Gossypium, Helianthus, Hevea, Hordeum, Hyoscyamus, Jatropha, Lactuca, Linum, Lolium, Lup
- Suitable species include Panicum spp., Sorghum spp., Miscanthus spp., Saccharum spp., Erianthus spp., Populus spp., Andropogon gerardii (big bluestem), Pennisetum purpureum (elephant grass), Phalaris arundinacea (reed canarygrass), Cynodon dactylon (bermudagrass), Festuca arundinacea (tall fescue), Spartina pectinata (prairie cord-grass), Medicago sativa (alfalfa), Arundo donax (giant reed), Secale cereale (rye), Salix spp. (willow), Eucalyptus spp. (eucalyptus), Triticosecale (triticum: wheat × rye), and bamboo.
- Suitable species also include Helianthus annuus (sunflower), Carthamus tinctorius (safflower), Jatropha curcas (jatropha), Ricinus communis (castor), Elaeis guineensis (palm), Linum usitatissimum (flax), and Brassica juncea.
- Suitable species also include Beta vulgaris (sugarbeet) and Manihot esculenta (cassava).
- Suitable species also include Lycopersicon esculentum (tomato), Lactuca sativa (lettuce), Musa paradisiaca (banana), Solanum tuberosum (potato), Brassica oleracea (broccoli, cauliflower, Brussels sprouts), Camellia sinensis (tea), Fragaria ananassa (strawberry), Theobroma cacao (cocoa), Coffea arabica (coffee), Vitis vinifera (grape), Ananas comosus (pineapple), Capsicum annuum (hot & sweet pepper), Allium cepa (onion), Cucumis melo (melon), Cucumis sativus (cucumber), Cucurbita maxima (squash), Cucurbita moschata (squash), Spinacia oleracea (spinach), Citrullus lanatus (watermelon), Abelmoschus esculentus (okra), and Solanum melongena
- Suitable species also include Rosa spp. (rose), Dianthus caryophyllus (carnation), Petunia spp. (petunia) and Poinsettia pulcherrima (poinsettia).
- Suitable species also include Nicotiana tabacum (tobacco), Lupinus albus (lupin), Uniola paniculata (oats), bentgrass ( Agrostis spp.), Populus tremuloides (aspen), Pinus spp. (pine), Abies spp. (fir), Acer spp. (maple), Hordeum vulgare (barley), Poa pratensis (bluegrass), Lolium spp. (ryegrass) and Phleum pratense (timothy).
- the methods and compositions can be used over a broad range of plant species, including species from the dicot genera Brassica, Carthamus, Glycine, Gossypium, Helianthus, Jatropha, Parthenium, Populus, and Ricinus; and the monocot genera Elaeis, Festuca, Hordeum, Lolium, Oryza, Panicum, Pennisetum, Phleum, Poa, Saccharum, Secale, Sorghum, Triticosecale, Triticum, and Zea.
- a plant is a member of the species Panicum virgatum (switchgrass), Sorghum bicolor (sorghum, sudangrass), Miscanthus giganteus (miscanthus), Saccharum sp. (energycane), Populus balsamifera (poplar), Zea mays (corn), Glycine max (soybean), Brassica napus (canola), Triticum aestivum (wheat), Gossypium hirsutum (cotton), Oryza sativa (rice), Helianthus annuus (sunflower), Medicago sativa (alfalfa), Beta vulgaris (sugarbeet), or Pennisetum glaucum (pearl millet).
- the polynucleotides and vectors described herein can be used to transform a number of monocotyledonous and dicotyledonous plants and plant cell systems, wherein such plants are hybrids of different species or varieties of a specific species (e.g., Saccharum sp. × Miscanthus sp.).
- the truncated sorghum CAD sequences of the methods and composition described herein are from wild, weedy, or cultivated sorghum species such as, but not limited to, Sorghum almum, Sorghum amplum, Sorghum angustum, Sorghum arundinaceum, Sorghum bicolor (such as bicolor, guinea, caudatum, kafir , and durra ), Sorghum brachypodum, Sorghum bulbosum, Sorghum burmahicum, Sorghum controversum, Sorghum drummondii, Sorghum ecarinatum, Sorghum exstans, Sorghum grande, Sorghum halepense, Sorghum interjectum, Sorghum intrans, Sorghum laxiflorum, Sorghum leiocladum, Sorghum macrospermum, Sorghum matarankense, Sorghum miliaceum, Sorghum nigrum, Sorghum niti
- a plant in which expression of at least one lignin-modulating polypeptide is modulated can have decreased levels of lignin.
- a lignin-modulating polypeptide described herein can be expressed in a transgenic plant, resulting in decreased levels of lignin.
- Decreased levels of lignin may mean decreased levels of total lignin, and/or ratios of syringyl lignin, guaiacyl lignin, and p-hydroxyphenyl lignin monomers.
- the lignin level can be decreased by at least 2 percent, e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, or more than 60 percent, as compared to the lignin level in a corresponding control plant that does not express the transgene.
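- The percentage comparison described above can be made concrete with a short Python sketch. The function names and the example values are hypothetical and are not measurements from the source; the 2 percent default mirrors the minimum decrease stated in the text.

```python
def percent_decrease(control: float, transgenic: float) -> float:
    """Return the decrease in lignin level relative to the control, in percent."""
    if control <= 0:
        raise ValueError("control lignin level must be positive")
    return 100.0 * (control - transgenic) / control


def meets_minimum_decrease(control: float, transgenic: float, minimum: float = 2.0) -> bool:
    """True if the decrease is at least `minimum` percent (2 percent by default)."""
    return percent_decrease(control, transgenic) >= minimum


if __name__ == "__main__":
    # Made-up lignin contents (same units for both plants), for illustration only.
    print(percent_decrease(20.0, 18.0))
    print(meets_minimum_decrease(20.0, 18.0))
```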
- a plant in which expression of a lignin-modulating polypeptide is modulated can have decreased levels of lignin in harvestable biomass. Decreases in lignin in such plants can provide improved biomass to biofuel conversion.
- a plant in which expression of a lignin-modulating polypeptide is modulated can have increased or decreased levels of lignin in one or more plant tissues, e.g., leaf tissues, or stem tissues.
- a truncated CAD described herein is transformed into and expressed in sorghum that is already positive for one or more alleles encoding truncated polypeptides of CAD and/or COMT.
- lignin content may be further decreased from the content found in the parent plants. Lignin content of a sample can be analyzed using methods standard in the art.
- a difference in the amount of lignin in a transgenic plant or cell relative to a control plant or cell is considered statistically significant at p≤0.05 with an appropriate parametric or non-parametric statistic, e.g., Chi-square test, Student's t-test, Mann-Whitney test, or F-test.
- a difference in the amount of lignin is statistically significant at p≤0.01, p≤0.005, or p≤0.001.
- a statistically significant difference in, for example, the amount of lignin in a transgenic plant compared to the amount in cells of a control plant indicates that the recombinant nucleic acid present in the transgenic plant results in altered lignin levels.
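- One of the non-parametric statistics named above, the Mann-Whitney test, can be sketched in plain Python by enumerating every way of splitting the pooled measurements into two groups, which yields an exact two-sided p-value for small samples. The function names and the lignin values are hypothetical and chosen only for illustration; a statistics package would normally be used for larger data sets.

```python
from itertools import combinations


def u_statistic(x: list[float], y: list[float]) -> float:
    """Count pairs with x_i > y_j, scoring ties as 0.5."""
    return sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in x for b in y)


def mann_whitney_exact(x: list[float], y: list[float]) -> float:
    """Exact two-sided p-value obtained by enumerating all group relabelings."""
    pooled = list(x) + list(y)
    n1, n2 = len(x), len(y)
    mean_u = n1 * n2 / 2.0
    observed = abs(u_statistic(x, y) - mean_u)
    extreme = total = 0
    for idx in combinations(range(len(pooled)), n1):
        chosen = set(idx)
        group_x = [pooled[i] for i in idx]
        group_y = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        if abs(u_statistic(group_x, group_y) - mean_u) >= observed - 1e-12:
            extreme += 1
    return extreme / total


if __name__ == "__main__":
    # Made-up lignin measurements for five control and five transgenic plants.
    control = [21.0, 22.5, 20.8, 23.1, 21.7]
    transgenic = [17.2, 18.0, 16.5, 17.9, 18.4]
    p_value = mann_whitney_exact(control, transgenic)
    print(p_value, p_value <= 0.05)
```

- With two fully separated groups of five, only 2 of the 252 possible splits are as extreme as the observed one, so the difference would be called significant at the p≤0.05 and p≤0.01 levels in this toy case.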
- the phenotype of a transgenic plant is evaluated relative to a control plant.
- a plant is said “not to express” a polypeptide when the plant exhibits less than 10%, e.g., less than 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.5%, 0.1%, 0.01%, or 0.001%, of the amount of polypeptide or mRNA encoding the polypeptide exhibited by the plant of interest.
- Expression can be evaluated using methods including, for example, RT-PCR, Northern blots, S1 RNase protection, primer extensions, Western blots, protein gel electrophoresis, immunoprecipitation, enzyme-linked immunoassays, chip assays, and mass spectrometry.
- If a polypeptide is expressed under the control of a tissue-preferential or broadly expressing promoter, expression can be evaluated in the entire plant or in a selected tissue. Similarly, if a polypeptide is expressed at a particular time, e.g., at a particular time in development or upon induction, expression can be evaluated selectively at a desired time period.
- the transgenic or non-transgenic plants identified or produced by the methods described herein have modulated lignin content in comparison to plants that do not comprise endogenous or exogenous genes encoding at least one truncated CAD allele.
- the lignin content can be decreased by about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 50, 60, 70, or 80 percent.
- the transgenic or non-transgenic plants identified or produced by the methods described herein have modified yield of fermentable sugars in comparison to plants that do not comprise endogenous or exogenous genes encoding at least one truncated CAD allele.
- Such sorghum plants having one or more truncated CAD alleles as described herein have an increase in the yield of fermentable sugars, such as but not limited to, glucose, arabinose, fructose, galactose, or xylose, wherein the yield is increased by about 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, or 90 percent.
- the transgenic plants described herein or the non-transgenic plants identified or produced by the methods described herein have altered lignin in comparison to plants that do not comprise endogenous or exogenous genes encoding at least one truncated CAD allele.
- the altered lignin has a decrease in guaiacyl and syringyl residues.
- the developmental gradient of lignin is altered.
- the cell wall composition is altered.
- lignin subunit composition is altered.
- the transgenic plants described herein or the non-transgenic plants identified or produced by the methods described herein comprise one or more truncated CAD sequences and one or more truncated COMT sequences.
- genetic markers are polymorphic regions of a genome and the complementary oligonucleotides which bind to these regions.
- the major causes of polymorphisms, and thus the major sources of genetic markers, are insertions (additions), deletions, nucleotide substitutions (point mutations), recombination events, and transposable elements within the genome of individuals in a plant population.
- point mutations can result from errors in DNA replication or damage to the DNA.
- insertions and deletions can result from inaccurate recombination events.
- variation can arise from the insertion or excision of a transposable element (a DNA sequence that has the ability to move or to jump to new locations within the genome, autonomously or non-autonomously).
- Described herein are methods and kits for determining the genotype of a sorghum plant comprising detecting in the genome of the plant at least a first polymorphism at a CAD locus.
- the methods comprise detecting a plurality of polymorphisms in the genome of the plant.
- the method may further comprise storing the results of the step of detecting the plurality of polymorphisms on a computer readable medium.
- the invention further provides a computer readable medium produced by such a method.
- a method for identifying sorghum plant lines with a truncated CAD comprising supplying a nucleic acid sample for a sorghum plant, providing amplification primers for amplifying a region of a sorghum plant corresponding to a truncated CAD gene present in said nucleic acid sample, applying said amplification primers to said nucleic acid sample such that amplification of said region of said CAD gene occurs, and identifying sorghum plants having a truncated CAD based on the presence of one or more mutations that confer a truncation in said amplified nucleic acid sample.
- Polymorphisms may be detected by means known in the art.
- molecular markers specific to CAD truncations can be used.
- molecular markers include oligonucleotides, single nucleotide polymorphisms (SNPs), multinucleotide polymorphisms, an insertion or a deletion of at least one nucleotide (indel), a simple sequence repeat (SSR), a restriction fragment length polymorphism (RFLP), an EST sequence or a unique nucleotide sequence of 20-40 bases used as a probe (oligonucleotides), a random amplified polymorphic DNA (RAPD) marker, or an amplified fragment length polymorphism (AFLP).
- markers can be used in conjunction with labeling or PCR to detect and score polymorphisms. Discovery, detection, and genotyping of various genetic markers have been well described in the literature. See, e.g., Henry, ed. (2001) Plant Genotyping. The DNA Fingerprinting of Plants Wallingford: CABI Publishing; Phillips and Vasil, eds. (2001) DNA-based Markers in Plants Dordrecht: Kluwer Academic Publishers; Pejic et al. (1998) “Comparative analysis of genetic similarity among maize inbred lines detected by RFLPs, RAPDs, SSRs and AFLPs” Theor. App. Genet.
- nucleic acids can be shorter in length than the truncated CAD sequence, and comprise the truncating stop codon or a sequence complementary to the truncating stop codon.
- nucleic acids used to identify a truncated CAD terminate with the truncating stop codon or a sequence complementary to the truncating stop codon.
- nucleic acids used to identify a truncated CAD are about 4, 5, 6, 7, 8, 9, 10, 12, 14, 16, 18, 20, 25, 30, 35, 40, 45, 50, 55, or 60 nucleotides in length. Such polynucleotides may be used as primers or probes.
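As an illustration of an oligonucleotide that terminates with the truncating stop codon, or with its complement, the following sketch cuts such a sequence out of a longer one. The input sequence, the stop codon position, and the function names are hypothetical and chosen only for the example; they are not taken from any SEQ ID NO in this document.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def oligo_ending_at_stop(seq, stop_start, length=20):
    """Oligo of at most `length` bases whose 3' end is the last base of the
    stop codon that begins at 0-based position `stop_start`."""
    end = stop_start + 3
    if stop_start < 0 or end > len(seq):
        raise ValueError("stop codon lies outside the sequence")
    start = max(0, end - length)
    return seq[start:end].upper()


# Toy sequence with a premature TAG at position 9 (hypothetical).
toy = "ATGGCCAAGTAGCTT"
forward = oligo_ending_at_stop(toy, 9, length=8)   # ends with the stop codon
reverse = reverse_complement(forward)              # begins with its complement
print(forward, reverse)
```

The forward oligo ends in the stop codon, and its reverse complement starts with the complementary triplet, so either strand can be targeted.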
- oligonucleotides specific to wild-type (wt) and mutant CAD alleles can be used to detect and score the genotype of a sorghum plant.
- the CAD alleles of SEQ ID NOs: 7 and 13 can be detected and scored using SEQ ID NOs: 34 and/or 36.
- SNP sequences can be amplified in PCR reactions to detect and score genotypes of CAD alleles.
- the polymorphism detected is a difference in a CAD nucleotide sequence which results in a stop codon.
- SEQ ID NOs: 7 and 13 have single nucleotide differences that result in stop codons at positions 4089 and 2800, respectively.
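How a single nucleotide difference can produce a stop codon is easy to show in code. The sketch below uses a toy coding sequence and hypothetical function names; it does not reproduce SEQ ID NOs: 7 or 13, and the positions 4089 and 2800 cited above are not modeled.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def creates_stop(codon, offset, new_base):
    """True if replacing the base at `offset` (0-2) turns a sense codon
    into a stop codon."""
    codon = codon.upper()
    mutant = codon[:offset] + new_base.upper() + codon[offset + 1:]
    return codon not in STOP_CODONS and mutant in STOP_CODONS


def first_stop_codon(cds):
    """0-based codon index of the first in-frame stop codon, or None."""
    cds = cds.upper()
    for i in range(0, len(cds) - 2, 3):
        if cds[i:i + 3] in STOP_CODONS:
            return i // 3
    return None


# Toy open reading frame: ATG CAG GGC TAA (hypothetical).
wild_type = "ATGCAGGGCTAA"
mutant = "ATGTAGGGCTAA"  # C-to-T change in the second codon: CAG becomes TAG
print(first_stop_codon(wild_type), first_stop_codon(mutant))
```

In the toy example the substitution moves the first stop codon from the fourth codon to the second, which is the kind of change that yields a truncated polypeptide.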
- SNPs can be discovered and detected by any of a number of techniques known in the art. For example, SNPs can be detected by direct sequencing of DNA segments, e.g., amplified by PCR, from several individuals (see, e.g., Ching et al. (2002) “SNP frequency, haplotype structure and linkage disequilibrium in elite maize inbred lines” BMC Genetics 3: 19). As another example, SNPs can be discovered by computer analysis of available sequences (e.g., ESTs, STSs) derived from multiple genotypes (see, e.g., Marth et al.
- Indels (insertions or deletions of one or more nucleotides) can also be discovered by sequencing and/or computer analysis, e.g., simultaneously with SNP discovery.
- SNPs can be genotyped by sequencing.
- SNPs can also be genotyped by various other methods (including high throughput methods) known in the art, for example, using DNA chips, allele-specific hybridization, allele-specific PCR, and primer extension techniques.
- the CAD alleles are first sequenced and then oligonucleotides specific to the mutant sequence can be designed and synthesized based on the nucleic acid sequence.
- oligonucleotides specific to the truncation can be designed and synthesized based on the nucleic acid sequence. Synthesized mutants may be based on the nucleotide sequence of any sorghum CAD allele.
- one or more sets of oligonucleotides each capable of recognizing the presence or absence of a specific and defined genomic position. For organisms with more chromosomes more oligonucleotides are desirable.
- the lower limit is one oligonucleotide pair and the upper limit is set by the desired resolution capacity of the method and the test kit.
- Hybridization of the oligonucleotides to DNA from the sorghum plant is preferably recorded in situ by any conventional labelling system, applying for instance terminal transferase and conventional recordable labels.
- hybridized sample DNA may be released from the solid support and subsequently hybridized with labelled polynucleotide sequences corresponding to each of the original oligonucleotide sequences attached to the solid support.
- Hybridization is optionally reversible and the solid support can be returned to its original state for reuse.
- a labelled dideoxynucleotide can be incorporated at the end of the oligonucleotide provided that the oligonucleotide is hybridized to genomic DNA as template.
- the nucleotide sequence at the genomic position adjacent to the region matching the oligonucleotide is known and therefore the particular nucleotide which will be incorporated (A, C, G, T or U) is known.
- Co-dominant scoring is achieved using paired (i.e., two) or parallel (i.e., three) flanking oligonucleotide sequences.
- the results obtained are recorded as full, empty, failure or null alleles and can be used to distinguish between heterozygous and/or homozygous genotypes.
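One way to turn paired hybridization results into genotype calls is sketched below. The mapping from signals to calls, the use of `None` for a failed reaction, and the function name are assumptions made for illustration; the text does not specify how full, empty, failure, or null results are encoded.

```python
def score_genotype(wild_type_signal, truncated_signal):
    """Co-dominant call from two allele-specific oligonucleotides.

    Each argument is True (hybridization detected), False (no hybridization),
    or None (the reaction failed and cannot be scored).
    """
    if wild_type_signal is None or truncated_signal is None:
        return "failure"
    if wild_type_signal and truncated_signal:
        return "heterozygous"
    if wild_type_signal:
        return "homozygous wild-type"
    if truncated_signal:
        return "homozygous truncated"
    return "null"  # neither oligonucleotide hybridized


for signals in [(True, True), (True, False), (False, True), (False, False), (None, True)]:
    print(signals, score_genotype(*signals))
```

Because both alleles are probed separately, heterozygous plants can be distinguished from either homozygous class, which is what makes the scoring co-dominant.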
- Optional post-hybridization treatments, including washing and digestion, are provided in order to remove sample DNA not fully hybridized to the solid support-attached oligonucleotide sequences, for example before and after labelling.
- the presence or absence of hybridization is recorded using a method allowing the recording of the hybridization state.
- primer pairs and probes described herein are of value in breeding programs because when incorporating the truncated CAD alleles into a different genetic background, such as an elite cultivar, a modified backcrossing scheme can be used, where the inheritance of the truncated CAD alleles is tracked with the primer pairs or probes. This eliminates the need for self-pollination to reveal the phenotype associated with homozygosity for a truncated CAD allele, and thus saves time and effort.
- Sorghum plants are bred in most cases by self pollination techniques. With the incorporation of male sterility (either genetic or cytoplasmic) cross pollination breeding techniques can also be utilized. Sorghum has a perfect flower with both male and female parts in the same flower located in the panicle. The flowers are usually in pairs on the panicle branches. Natural pollination occurs in sorghum when anthers (male flowers) open and pollen falls onto receptive stigma (female flowers). Because of the close proximity of male (anthers) and female (stigma) in the panicle, self pollination can be high. Cross pollination may occur when wind or convection currents move pollen from the anthers of one plant to receptive stigma on another plant. Cross pollination is greatly enhanced with incorporation of male sterility which renders male flowers nonviable without affecting the female flowers. Successful pollination in the case of male sterile flowers requires cross pollination.
- sorghum hybrids require the development of homozygous inbred lines, the crossing of these lines, and the evaluation of the crosses.
- Pedigree breeding methods, and to a lesser extent population breeding methods, are used to develop inbred lines from breeding populations. Breeding programs combine desirable traits from two or more inbred lines into breeding pools from which new inbred lines are developed by selfing and selection of desired phenotypes. The new inbreds are crossed with other inbred lines and the hybrids from these crosses are evaluated to determine which have commercial potential.
- Pedigree breeding starts with the crossing of two genotypes, each of which may have one or more desirable characteristics that is lacking in the other or which complement the other. If the two original parents do not provide all of the desired characteristics, other sources can be included in the breeding population.
- superior plants are selfed and selected in successive generations.
- heterozygous condition gives way to homogeneous lines as a result of self-pollination and selection.
- five or more generations of selfing and selection are practiced: F1 to F2; F2 to F3; F3 to F4; F4 to F5, etc.
- Backcrossing can be used to improve an inbred line.
- Backcrossing transfers a specific desirable trait from one inbred or source to an inbred that lacks that trait. This can be accomplished for example by first crossing a superior inbred (A) (recurrent parent) to a donor inbred (non-recurrent parent), which carries the appropriate gene(s) for the trait in question. The progeny of this cross is then mated back to the superior recurrent parent (A) followed by selection in the resultant progeny for the desired trait to be transferred from the non-recurrent parent. After five or more backcross generations with selection for the desired trait, the progeny will be heterozygous for loci controlling the characteristic being transferred, but will be like the superior parent for most or almost all other genes. The last backcross generation would be selfed to give pure breeding progeny for the gene(s) being transferred.
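The statement that backcross progeny become like the recurrent parent for almost all other genes follows from a standard expectation: each backcross halves the remaining donor contribution. The sketch below computes that expectation under the simplifying assumptions of no selection and no linkage to the transferred gene; the function name is hypothetical.

```python
def recurrent_parent_fraction(n_backcrosses):
    """Expected fraction of the genome derived from the recurrent parent
    after the initial cross plus `n_backcrosses` backcross generations."""
    if n_backcrosses < 0:
        raise ValueError("number of backcrosses cannot be negative")
    return 1.0 - 0.5 ** (n_backcrosses + 1)


for n in range(6):
    label = "F1" if n == 0 else f"BC{n}"
    print(label, f"{100 * recurrent_parent_fraction(n):.2f}%")
```

Under these assumptions five backcrosses give an expected recurrent-parent share of roughly 98 percent; actual recovery around the selected gene is lower because of linkage drag.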
- a hybrid sorghum variety is the cross of two inbred lines, each of which may have one or more desirable characteristics lacked by the other or which complement the other.
- the hybrid progeny of the first generation is designated F1.
- the hybrid is more vigorous than its inbred parents. This hybrid vigor, or heterosis, can be manifested in many ways, including increased vegetative growth and increased yield.
- the development of a hybrid sorghum variety involves five steps: (1) the formation of “restorer” and “non-restorer” germplasm pools; (2) the selection of superior plants from various “restorer” and “non-restorer” germplasm pools; (3) the selfing of the superior plants for several generations to produce a series of inbred lines, which although different from each other, each breed true and are highly uniform; (4) the conversion of inbred lines classified as non-restorers to cytoplasmic male sterile (CMS) forms; and (5) crossing the selected cytoplasmic male sterile (CMS) inbred lines with selected fertile inbred lines (restorer lines) to produce the hybrid progeny (F1).
- Inbred male sterile lines are developed by converting inbred lines to CMS. This is achieved by transferring the chromosomes of the line to be sterilized into sterile cytoplasm by a series of backcrosses, using a male sterile line as a female parent and the line to be sterilized as the recurrent and pollen parent in all crosses. After conversion to male sterility the line is designated the (A) line. Lines with fertility restoring genes cannot be converted into male sterile A-lines. The original line is designated the (B) line.
- a single cross hybrid is produced when two inbred lines are crossed to produce the F1 progeny. Much of the hybrid vigor exhibited by F1 hybrids is lost in the next generation (F2). Consequently, seed from hybrid varieties is not typically used for planting stock.
- Hybrid sorghum can be produced using wind to move the pollen. Alternating strips of the cytoplasmic male sterile inbred (female) and the male fertile inbred (male) are planted in the same field. Wind moves the pollen shed by the male inbred to receptive stigma on the female. Provided that there is sufficient isolation from sources of foreign sorghum pollen, the stigma of the male sterile inbred (female) will be fertilized only with pollen from the male fertile inbred (male). The resulting seed, borne on the male sterile (female) plants, is therefore hybrid and will form hybrid plants that have full fertility restored. In some embodiments, if the hybrid sorghum is used as forage or for biomass production, then it may be unnecessary to restore fertility.
- inbred parental lines, elite breeding lines, or hybrid sorghum are bred by the methods described herein to comprise one or more alleles for which the CAD coding sequence is truncated relative to a wild-type CAD coding sequence and one or more alleles for which the COMT coding sequence is truncated relative to a wild-type COMT coding sequence.
- the sorghum plants developed are high biomass varieties for biofuel production.
- Recurrent selection is a method used in a plant breeding program to improve a population of plants.
- the method entails individual plants cross pollinating with each other to form progeny.
- the progeny are grown and the superior progeny selected by any number of selection methods, which include individual plant, half-sib progeny, full-sib progeny and selfed progeny.
- the selected progeny are self pollinated or cross pollinated with each other to form progeny for another population. This population is planted and again superior plants are selected to self pollinate or cross pollinate with each other.
- Recurrent selection is a cyclical process and therefore can be repeated as many times as desired. The objective of recurrent selection is to improve the traits of a population.
- the improved population can then be used as a source of breeding material to obtain new varieties for commercial or breeding use, including the production of a synthetic cultivar.
- a synthetic cultivar is the resultant progeny formed by the intercrossing of several selected varieties.
- the number of parental plant varieties, populations, wild accessions, ecotypes, etc., that are used to generate a synthetic can vary from as few as 10 to as many as 500. Typically, about 100 to 300 varieties, populations, etc., are used as parents for the synthetic variety.
- Seed from the parental seed production plot of a synthetic variety can be sold to the farmer. Alternatively, seed from the parental seed production plot can subsequently undergo one or two generations of multiplication, depending on the amount of seed produced in the parental plot and the demand for seed.
- Mass selection is a useful technique when used in conjunction with molecular marker enhanced selection.
- seeds from individuals are selected based on phenotype or genotype. These selected seeds are then bulked and used to grow the next generation.
- Bulk selection requires growing a population of plants in a bulk plot, allowing the plants to self-pollinate, harvesting the seed in bulk and then using a sample of the seed harvested in bulk to plant the next generation. Also, instead of self pollination, directed pollination could be used as part of the breeding program.
- Mutation breeding is another method of introducing new traits into sorghum. Mutations that occur spontaneously or are artificially induced can be useful sources of variability for a plant breeder. The goal of artificial mutagenesis is to increase the rate of mutation for a desired characteristic. Mutation rates can be increased by many different means, including temperature; long-term seed storage; tissue culture conditions; radiation, such as X-rays, gamma rays (e.g., cobalt 60 or cesium 137), neutrons (product of nuclear fission by uranium 235 in an atomic reactor), beta radiation (emitted from radioisotopes such as phosphorus 32 or carbon 14), or ultraviolet radiation (such as from 2500 to 2900 nm); or chemical mutagens, such as base analogues (5-bromo-uracil), related compounds (8-ethoxy caffeine), antibiotics (streptonigrin), alkylating agents (sulfur mustards, nitrogen mustards, epoxides, ethylenamines, sulfates, sulfonates, sulfones, lactones), azide, hydroxylamine, nitrous acid, or acridines.
- mutations created in other sorghum plants may be used to produce a backcross conversion of sorghum that comprises such mutation.
- mutations created in other lines may be used to produce a backcross conversion of elite lines that comprise such mutations.
- the plant genotyping techniques described herein may be used in marker-assisted plant breeding methods in sorghum.
- techniques such as Isozyme Electrophoresis, Arbitrarily Primed Polymerase Chain Reaction (AP-PCR), DNA Amplification Fingerprinting (DAF), and Sequence Characterized Amplified Regions (SCARs) can be used in marker-assisted breeding.
- QTL mapping is the use of markers, which are known to be closely linked to alleles that have measurable effects on a quantitative trait. Selection in the breeding process is based upon the accumulation of markers linked to the positive effecting alleles and/or the elimination of the markers linked to the negative effecting alleles from the plant's genome.
- markers can also be used during the breeding process for the selection of qualitative traits. For example, markers closely linked to alleles or markers containing sequences within the actual alleles of interest can be used to select plants that contain the alleles of interest during a backcrossing breeding program. The markers can also be used to select for the genome of the recurrent parent and against the genome of the donor parent. Using this procedure can minimize the amount of genome from the donor parent that remains in the selected plants. It can also be used to reduce the number of crosses back to the recurrent parent needed in a backcrossing program. The use of molecular markers in the selection process is often called genetic marker enhanced selection. Molecular markers may also be used to identify and exclude certain sources of germplasm as parental varieties or ancestors of a plant by providing a means of tracking genetic profiles through crosses.
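Selecting for the allele of interest while also selecting for the recurrent-parent genome can be sketched as a two-step filter. The record layout, the marker coding (`R` for a background marker homozygous for the recurrent-parent allele, `H` for one still carrying a donor allele), the plant identifiers, and the function names are all hypothetical and serve only to illustrate the idea.

```python
def recurrent_fraction(background):
    """Fraction of background markers scored 'R' (recurrent-parent type)."""
    return background.count("R") / len(background)


def select_progeny(progeny, n_keep=1):
    """Keep carriers of the target allele, then rank them by how much of the
    recurrent-parent background they have recovered."""
    carriers = [p for p in progeny if p["has_target_allele"]]
    carriers.sort(key=lambda p: recurrent_fraction(p["background"]), reverse=True)
    return [p["id"] for p in carriers[:n_keep]]


# Hypothetical backcross progeny scored at eight background markers.
progeny = [
    {"id": "BC1-01", "has_target_allele": True, "background": "RRHRHRRH"},
    {"id": "BC1-02", "has_target_allele": False, "background": "RRRRRRRR"},
    {"id": "BC1-03", "has_target_allele": True, "background": "RRRRHRRR"},
    {"id": "BC1-04", "has_target_allele": True, "background": "HHRHRHHR"},
]
print(select_progeny(progeny, n_keep=2))
```

Plant BC1-02 has the best background but is discarded because it lacks the target allele; among carriers, the plants with the least donor genome are advanced, which is how markers can reduce the number of backcrosses needed.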
- In marker-assisted selection, only a limited proportion of the total genetic variance is captured by the markers.
- An alternative to tracing a limited number of QTL with markers is to trace all the QTL. This can be done by dividing the entire genome up into chromosome segments, for example defined by adjacent markers, and then tracing all the chromosome segments. This method was termed genomic selection by Meuwissen et al. 2001 “Prediction of total genetic value using genome-wide dense marker maps” Genetics 157:1819-1829. With the availability of high-density marker maps and cost effective genotyping, genomic selection methods can provide faster genetic gain than can be achieved by current selection methods based on phenotypes and pedigree.
- In genomic selection, selection is typically based on the sum of estimates of effects of all marker intervals across the genome, fitted either as fixed (fixed GS) or random (random GS) effects. Responses to selection are tracked by indices over generations.
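The "sum of estimates of effects" can be written out directly. In the sketch below the marker effects are assumed to have been estimated already; the effect values, the 0/1/2 genotype coding, the line names, and the function names are hypothetical, and estimating the effects themselves (as fixed or random) is outside the scope of the example.

```python
def genomic_value(genotypes, effects):
    """Sum of marker effects for one plant; each genotype is the number of
    copies (0, 1, or 2) of the allele to which the effect refers."""
    if len(genotypes) != len(effects):
        raise ValueError("one effect estimate is needed per marker")
    return sum(g * e for g, e in zip(genotypes, effects))


def rank_candidates(candidates, effects):
    """Candidate names ordered from highest to lowest genomic value."""
    return sorted(
        candidates,
        key=lambda name: genomic_value(candidates[name], effects),
        reverse=True,
    )


# Hypothetical effect estimates for three marker intervals.
effects = [0.5, -0.25, 1.0]
candidates = {
    "line_A": [2, 0, 1],
    "line_B": [0, 2, 0],
    "line_C": [1, 1, 2],
}
print(rank_candidates(candidates, effects))
```

Real applications use thousands of markers, but the selection criterion remains this same weighted sum.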
- the efficiency of genomic selection over standard marker assisted selection depends on stringency of the threshold used for QTL detection.
- One skilled in the art can optimize factors that affect genomic selection for a particular species such as Sorghum species.
- Double haploids can also be used for the development of plants with a homozygous phenotype in the breeding program.
- a sorghum cultivar as a parent can be used to produce double haploid plants.
- Double haploids are produced by the doubling of a set of chromosomes (1N) from a heterozygous plant to produce a completely homozygous individual.
- Wan et al. “Efficient Production of Doubled Haploid Plants Through Colchicine Treatment of Anther-Derived Maize Callus”, Theoretical and Applied Genetics, 77:889-892, 1989 and U.S. Pat. No. 7,135,615.
- This can be advantageous because the process omits the generations of selfing needed to obtain a homozygous plant from a heterozygous source.
- Haploid induction systems have been developed for various plants to produce haploid tissues, plants and seeds.
- the haploid induction system can produce haploid plants from any genotype by crossing a selected line (as female) with an inducer line.
- inducer lines for maize include Stock 6 (Coe, 1959, Am. Nat. 93:381-382; Sarkar and Coe, 1966, Genetics 54:453-464), KEMS (Deimling, Roeber, and Geiger, 1997, Vortr.
- one embodiment is a process for making a substantially homozygous sorghum progeny plant by producing or obtaining a seed from the cross of two sorghum plants and applying double haploid methods to the F1 seed or F1 plant or to a subsequent filial generation. Based on studies in maize, such methods can decrease the number of generations required to produce a variety with similar genetics or characteristics to sorghum. See Bernardo, R. and Kahler, A. L., Theor. Appl. Genet. 102:986-992, 2001. Descriptions of other breeding methods that are commonly used for different traits and crops can be found in one of several reference books (e.g., Allard, 1960; Simmonds, 1979; Sneep et al., 1979; Fehr, 1987).
- A plant breeding technique called backcrossing can be utilized wherein essentially all of the desired morphological and physiological characteristics of a variety are recovered in addition to a single gene that is transferred into the variety via the backcrossing technique.
- Backcrossing methods can be used to improve or introduce a characteristic into the variety.
- the term “backcrossing” as used herein refers to the repeated crossing of a hybrid progeny back to the recurrent parent, i.e., backcrossing 1, 2, 3, 4, 5, 6, 7, 8 or more times to the recurrent parent.
- the parental sorghum plant that contributes the gene for the desired characteristic is termed the nonrecurrent or donor parent. This terminology refers to the fact that the nonrecurrent parent is used one time in the backcross protocol and therefore does not recur.
- the parental sorghum plant to which the gene or genes from the nonrecurrent parent are transferred is known as the recurrent parent as it is used for several rounds in the backcrossing protocol (Poehlman & Sleper, 1994; Fehr, Principles of Cultivar Development pp. 261-286 (1987)).
- the selection of a suitable recurrent parent is an important step for a successful backcrossing procedure.
- the goal of a backcross protocol is to alter or substitute a single trait or characteristic in the original variety.
- a single gene of the recurrent variety is modified or substituted with the desired gene from the nonrecurrent parent, while retaining essentially all of the rest of the desired genetic, and therefore the desired physiological and morphological, constitution of the original variety.
- the choice of the particular nonrecurrent parent will depend on the purpose of the backcross; one of the major purposes is to add some agronomically important trait to the plant.
- the exact backcrossing protocol will depend on the characteristic or trait being altered to determine an appropriate testing protocol. Although backcrossing methods are simplified when the characteristic being transferred is a dominant allele, a recessive allele may also be transferred. In this instance it may be necessary to introduce a test of the progeny to determine if the desired characteristic has been successfully transferred.
- Single gene traits have been identified that are sometimes not selected for in the development of a new variety but that can be improved by backcrossing techniques.
- Single gene traits may or may not be transgenic; examples of these traits include but are not limited to, male sterility, herbicide resistance, resistance for bacterial, fungal, or viral disease, insect resistance, male fertility, enhanced nutritional quality, industrial usage, yield stability and yield enhancement. These genes are generally inherited through the nucleus.
- Several of these single gene traits are described in U.S. Pat. Nos. 5,959,185; 5,973,234 and 5,977,445; the disclosures of which are specifically hereby incorporated by reference in their entirety.
- Pedigree breeding starts with the crossing of two genotypes, each having one or more desirable characteristics that are lacking in, or that complement, the other. If the two original parents do not provide all the desired characteristics, other sources can be included in the breeding population.
- superior plants are selfed and selected in successive filial generations.
- the heterozygous condition gives way to homogeneous varieties as a result of self-pollination and selection.
- five or more successive filial generations of selfing and selection are practiced: F1 to F2; F2 to F3; F3 to F4; F4 to F5, etc. After a sufficient amount of inbreeding, successive filial generations will serve to increase seed of the developed variety.
- the developed variety comprises homozygous alleles at about 95% or more of its loci.
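The link between generations of selfing and the roughly 95% homozygosity figure can be checked with the standard expectation that selfing halves heterozygosity each generation. The sketch below considers only loci that are heterozygous in the F1 and ignores selection, so it is a conservative illustration rather than a description of any particular variety; the function names are hypothetical.

```python
def expected_homozygosity(generation):
    """Expected homozygous fraction in generation F_n, counted over the loci
    that were heterozygous in the F1 (F1 itself is generation 1)."""
    if generation < 1:
        raise ValueError("generation numbering starts at 1 (the F1)")
    return 1.0 - 0.5 ** (generation - 1)


def first_generation_reaching(target):
    """Earliest filial generation whose expected homozygosity meets `target`."""
    if not 0.0 <= target < 1.0:
        raise ValueError("target must be at least 0 and below 1")
    generation = 1
    while expected_homozygosity(generation) < target:
        generation += 1
    return generation


for n in range(1, 8):
    print(f"F{n}", f"{100 * expected_homozygosity(n):.2f}%")
print(first_generation_reaching(0.95))
```

Because loci at which the two parents already carry the same allele are homozygous from the start, the proportion over all loci is at least as high as the value computed here.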
- backcrossing can also be used in combination with pedigree breeding.
- backcrossing can be used to transfer one or more specifically desirable traits from one variety, the donor parent, to a developed variety called the recurrent parent, which has overall good agronomic characteristics yet lacks that desirable trait or traits.
- the same procedure can be used to move the progeny toward the genotype of the recurrent parent but at the same time retain many components of the non-recurrent parent by stopping the backcrossing at an early stage and proceeding with selfing and selection.
- a sorghum variety may be crossed with another variety to produce a first generation progeny plant.
- the first generation progeny plant may then be backcrossed to one of its parent varieties to create a BC1 or BC2.
- Progeny are selfed and selected so that the newly developed variety has many of the attributes of the recurrent parent and yet several of the desired attributes of the non-recurrent parent. This approach leverages the value and strengths of the recurrent parent for use in new sorghum varieties.
- Transgenic and non-transgenic plants described herein have various uses in the agricultural and energy production industries.
- transgenic plants described herein can be used to make animal feed and food products. Such plants, however, are often particularly useful as a feedstock for energy production.
- Transgenic plants described herein often produce biomass with decreased or altered lignin content, relative to control plants that lack the exogenous nucleic acid.
- Non-transgenic plants described herein such as those produced or selected by the methods described herein often produce biomass with decreased or altered lignin content, relative to control plants that lack one or more of the nucleic acids described herein.
- such plants provide equivalent or even increased yields of grain and/or biomass per hectare relative to control plants when grown under conditions of reduced inputs such as fertilizer and/or water.
- transgenic and non-transgenic plants can be used to provide yield quality improvements at a lower input cost and/or under environmentally stressful conditions such as drought.
- plants described herein have a composition that permits more efficient processing into free sugars, and subsequently ethanol, for energy production.
- such plants provide higher yields of ethanol, butanol, dimethyl ether, other biofuel molecules, and/or sugar-derived co-products per kilogram of plant material, relative to control plants.
- processing efficiencies are believed to be derived from the lignin composition of the plant material.
- Seeds from plants described herein can be conditioned and bagged in packaging material by means known in the art to form an article of manufacture.
- Packaging materials such as paper and cloth are well known in the art.
- a package of seed can have a label, e.g., a tag or label secured to the packaging material, a label printed on the packaging material, or a label inserted within the package, that describes the nature of the seeds therein.
- Kits for genotyping plants for identification, selection, or breeding can comprise a means of detection of the presence of a truncated CAD in a sample of sorghum DNA.
- a kit comprises one or more SNPs, such as SEQ ID NOs: 34-37, or a protein encoded by a polynucleotide as described herein.
- a kit comprises one or more polynucleotide SNPs specific to a truncated CAD 131 to 320 amino acids in length.
- a kit comprises one or more polynucleotide SNPs specific to a C-terminus truncated sorghum COMT, such as those described by Bout and Vermerris, which is incorporated by reference herein in its entirety (Bout and Vermerris, 2003, A candidate-gene approach to clone the sorghum Brown midrib gene encoding COMT, Mol. Gen. Genomics 269:205-214).
- the kits described herein may be useful for genetic identity determination, phylogenetic studies, parenthood determinations, genotyping, haplotyping, pedigree analysis, forensic identification and/or plant breeding particularly with co-dominant scoring.
- a kit may further comprise reagents for DNA amplification-detection technology such as PCR or TaqMan™.
- a kit may further comprise reagents for probe hybridization-detection technology such as Southern Blots, Northern Blots, in-situ Hybridization, or microarrays.
- a kit may comprise reagents for antibody binding-detection technology such as Western Blots, ELISAs, SELDI mass spectrometry or test strips.
- a kit may comprise reagents for lignin content analysis technology.
- a kit may comprise instructions for one or more of the methods described above.
- Each isolated nucleic acid described herein that encodes a truncated CAD can be cloned into a Ti plasmid vector containing a phosphinothricin acetyltransferase gene which confers Finale™ resistance to transformed plants.
- Constructs can be made using any of the nucleic acids described herein, each operably linked to a promoter or regulatory element. Wild-type Arabidopsis thaliana ecotype Wassilewskija (Ws) plants can be transformed separately with each construct. The transformations can be performed essentially as described in Bechtold et al., C. R. Acad. Sci. Paris, 316:1194-1199 (1993).
- The presence of each vector containing a nucleic acid described herein in the respective transgenic Arabidopsis line transformed with the vector can be confirmed by Finale™ resistance, PCR amplification from green leaf tissue extract, and/or sequencing of PCR products.
- wild-type Arabidopsis ecotype Ws plants can be transformed with an empty vector.
- DNA samples were extracted from sorghum GRIN germplasm accession nos.: PI 535790, PI 535806, PI 599692, PI 599697, PI 599705, PI 599720, PI 599731, PI 599740, PI 599750, PI 602730, PI 602740, PI 602898, PI 602902, PI 602906, PI 602910, PI 602914, PI 606705, PI 606706, and Ceres accession nos.: BICOLOR-81733675, GRAINERIII-81733676 (Conventional Sorghum Sudangrass Hybrid), 98093-81733674 (Conventional type Hybrid Forage Sorghum), SS1-81733673 (Sudan × Sudan), 22043-81733671 (sorghum sudangrass Hybrid), and 24213-81733672 (Hybrid forage sorghum (Long season
- CAD nucleotide sequences of sorghum accessions PI602730-81733686 and PI535790-81733677 were analyzed and each contained a different point mutation altering a single nucleotide (C→T), each of which resulted in a premature stop codon (SEQ ID NOs: 7 and 13).
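- To illustrate how a single C-to-T substitution can introduce a premature stop codon, the sketch below scans a coding sequence for its first in-frame stop codon before and after such a substitution. The 18-nucleotide sequence is a made-up toy example, not any of the SEQ ID NOs of this document, and the helper names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code stop codons (DNA alphabet)


def first_stop_codon_index(cds: str):
    """Return the 0-based codon index of the first in-frame stop codon, or None."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for start in range(0, usable, 3):
        if cds[start:start + 3] in STOP_CODONS:
            return start // 3
    return None


def substitute(seq: str, position: int, base: str) -> str:
    """Return seq with the nucleotide at 0-based position replaced by base."""
    return seq[:position] + base + seq[position + 1:]


# Toy coding sequence (hypothetical): ATG GCA CAG GGT TGG TAA
wild_type = "ATGGCACAGGGTTGGTAA"
# C-to-T change at the first base of the third codon: CAG (Gln) becomes TAG (stop)
mutant = substitute(wild_type, 6, "T")

if __name__ == "__main__":
    print("wild type encodes", first_stop_codon_index(wild_type), "residues")
    print("mutant encodes", first_stop_codon_index(mutant), "residues")
```

Because codons are indexed from zero, the index of the first stop codon equals the number of residues in the encoded polypeptide.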
- Oligonucleotides were developed having specificity to the SNPs in the nucleic acid sequences of wild type and mutant CAD alleles (SEQ ID NOs: 34-37). The oligonucleotides were tested on DNA extracted from sorghum accessions. PI602730-81733686 and PI602910-85802580 were homozygous for a CAD allele featuring a SNP resulting in a premature stop codon encoding a truncated polypeptide of 320 amino acids.
- PI535790-81733677, PI535806-81733678, PI602740-81733687, PI602902-81733689, and PI602906-81733690 were homozygous for a CAD allele featuring a SNP resulting in a premature stop codon encoding a truncated polypeptide of 131 amino acids.
- Accessions 22043 and 24213 were heterozygous for the CAD allele encoding the 131 amino acid truncated CAD polypeptide. Results of oligonucleotide assisted genotyping are shown in Table 1.
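- Co-dominant scoring with one oligonucleotide specific for the wild-type allele and one specific for the mutant allele can be sketched as follows. This is a minimal illustration only: it assumes each assay has already been reduced to a present/absent signal, and the function name, labels, and sample values are hypothetical rather than taken from Table 1.

```python
def call_genotype(wild_type_signal: bool, mutant_signal: bool) -> str:
    """Call a co-dominant genotype from two allele-specific oligonucleotide signals."""
    if wild_type_signal and mutant_signal:
        return "heterozygous"
    if mutant_signal:
        return "homozygous truncated"
    if wild_type_signal:
        return "homozygous wild type"
    return "no call"  # neither oligonucleotide gave a signal; the assay should be repeated


# Hypothetical samples: (wild-type oligonucleotide signal, mutant oligonucleotide signal)
samples = {
    "sample_1": (False, True),
    "sample_2": (True, True),
    "sample_3": (True, False),
}

if __name__ == "__main__":
    for name, (wt, mut) in samples.items():
        print(name, call_genotype(wt, mut))
```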
- the oligonucleotides described herein can be used in marker assisted breeding to produce inbred sorghum lines that are homozygous for a CAD allele encoding a truncated CAD polypeptide, which can be crossed to make hybrid sorghum that are homozygous for the CAD allele encoding a truncated CAD polypeptide.
- PI602730-81733686 can be crossed with a male sterile (A-line) that does not contain a CAD allele encoding a truncated CAD polypeptide but which has agronomically desirable traits.
- the resulting progeny in F2 generations can be screened using the oligonucleotides for plants that are heterozygous or homozygous for the CAD allele encoding truncated CAD polypeptides and are male sterile.
- Such progeny can be backcrossed to the A-line and through generations of selection a new A-line can be developed which is homozygous for the CAD allele encoding a truncated CAD polypeptide.
- the same process can be applied to B and R lines, so that the three lines can be used to produce hybrid seed that is homozygous for the CAD allele encoding a truncated CAD polypeptide.
- Reciprocal BLAST (Rivera et al., Proc. Natl. Acad. Sci. USA, 95:6239-6244 (1998)) can be used to identify potential functional homolog sequences as well as allelic variants from databases consisting of all available public and proprietary peptide sequences, including NR from NCBI and peptide translations from Ceres clones.
- a specific reference polypeptide can be searched against all peptides from its source species using BLAST in order to identify polypeptides having BLAST sequence identity of 80% or greater to the reference polypeptide and an alignment length of 85% or greater along the shorter sequence in the alignment.
- the reference polypeptide and any of the aforementioned identified polypeptides can be designated as a cluster.
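- The cluster criteria above can be expressed as a small filter. The sketch assumes that "an alignment length of 85% or greater along the shorter sequence" means the alignment length divided by the length of the shorter of the two sequences; that reading, and all names below, are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Alignment:
    subject_id: str
    percent_identity: float  # BLAST sequence identity, in percent
    alignment_length: int    # length of the aligned region, in residues
    query_length: int        # length of the reference polypeptide
    subject_length: int      # length of the candidate polypeptide


def in_cluster(aln: Alignment, min_identity: float = 80.0, min_coverage: float = 85.0) -> bool:
    """Return True if the candidate meets both the identity and the coverage thresholds."""
    shorter = min(aln.query_length, aln.subject_length)
    coverage = 100.0 * aln.alignment_length / shorter
    return aln.percent_identity >= min_identity and coverage >= min_coverage


if __name__ == "__main__":
    # Hypothetical candidates from the reference's own source species
    candidates = [
        Alignment("cand_1", 92.0, 300, 320, 340),  # 93.75% coverage: kept
        Alignment("cand_2", 95.0, 200, 320, 340),  # 62.5% coverage: rejected
        Alignment("cand_3", 70.0, 320, 320, 340),  # identity too low: rejected
    ]
    cluster = [c.subject_id for c in candidates if in_cluster(c)]
    print(cluster)
```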
- the BLASTP version 2.0 program from Washington University at Saint Louis, Mo., USA can be used to determine BLAST sequence identity and E-value.
- the BLASTP version 2.0 program includes the following parameters: 1) an E-value cutoff of 1.0e-5; 2) a word size of 5; and 3) the -postsw option.
- the BLAST sequence identity can be calculated based on the alignment of the first BLAST HSP (High-scoring Segment Pairs) of the identified potential functional homolog or allelic variant sequence with a specific reference polypeptide. The number of identically matched residues in the BLAST HSP alignment can be divided by the HSP length, and then multiplied by 100 to get the BLAST sequence identity.
- the HSP length typically includes gaps in the alignment, but in some cases gaps can be excluded.
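- The identity calculation described above can be sketched as follows, with a switch for whether gap columns count toward the HSP length. The function name and the two toy aligned strings are illustrative assumptions; "-" denotes a gap.

```python
def blast_sequence_identity(query_aln: str, subject_aln: str, include_gaps: bool = True) -> float:
    """Percent identity over one HSP: identical residues / HSP length * 100.

    query_aln and subject_aln are the two aligned strings of the HSP, with "-"
    marking gaps. With include_gaps=False, gap columns are left out of the length.
    """
    if len(query_aln) != len(subject_aln):
        raise ValueError("aligned sequences must have the same length")
    columns = list(zip(query_aln, subject_aln))
    identical = sum(1 for q, s in columns if q == s and q != "-")
    if include_gaps:
        hsp_length = len(columns)
    else:
        hsp_length = sum(1 for q, s in columns if q != "-" and s != "-")
    if hsp_length == 0:
        raise ValueError("HSP has no scorable columns")
    return 100.0 * identical / hsp_length


if __name__ == "__main__":
    q = "MKT-LLA"  # toy query row of an HSP
    s = "MKTALVA"  # toy subject row of the same HSP
    print(round(blast_sequence_identity(q, s), 2))                      # 5 of 7 columns
    print(round(blast_sequence_identity(q, s, include_gaps=False), 2))  # 5 of 6 columns
```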
- the main Reciprocal BLAST process consists of two rounds of BLAST searches: a forward search and a reverse search.
- a reference polypeptide sequence, “polypeptide A,” from source species SA can be BLASTed against all protein sequences from a species of interest.
- Top hits can be determined using an E-value cutoff of 10⁻⁵ and a sequence identity cutoff of 35%. Among the top hits, the sequence having the lowest E-value can be designated as the best hit, and considered a potential functional homolog or ortholog. Any other top hit that has a sequence identity of 80% or greater to the best hit or to the original reference polypeptide can be considered a potential functional homolog or ortholog as well. This process can be repeated for all species of interest.
- Allelic variants typically have higher sequence identity to a reference sequence, i.e., greater than 90%, and originate from the same species as the reference sequence. Allelic variants can be compared to available genome reference maps and inter-species comparative maps to determine the likelihood that the allelic variants identified correlate to the same locus.
- the top hits identified in the forward search from all species can be BLASTed against all protein sequences from the source species SA.
- a top hit from the forward search that returned a polypeptide from the aforementioned cluster as its best hit can also be considered as a potential functional homolog.
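- The forward and reverse searches can be sketched over precomputed hit tables, without running BLAST itself. The sketch applies the forward cutoffs stated above (E-value of 1e-5 or lower, identity of 35% or higher) and then keeps a forward top hit only if its best reverse hit, taken here as the lowest E-value, belongs to the cluster. The data structures, identifiers, and the choice not to filter reverse hits are assumptions made for illustration.

```python
from typing import Dict, List, NamedTuple, Set


class Hit(NamedTuple):
    subject: str     # identifier of the sequence that was hit
    evalue: float    # BLAST E-value
    identity: float  # BLAST sequence identity, in percent


def top_hits(hits: List[Hit], max_evalue: float = 1e-5, min_identity: float = 35.0) -> List[Hit]:
    """Forward-search filter: keep hits passing the E-value and identity cutoffs."""
    return [h for h in hits if h.evalue <= max_evalue and h.identity >= min_identity]


def reciprocal_homologs(forward: List[Hit],
                        reverse: Dict[str, List[Hit]],
                        cluster: Set[str]) -> List[str]:
    """Return forward top hits whose best reverse hit is a member of the cluster."""
    confirmed = []
    for hit in top_hits(forward):
        back_hits = reverse.get(hit.subject, [])
        if not back_hits:
            continue
        best_back = min(back_hits, key=lambda h: h.evalue)  # lowest E-value = best hit
        if best_back.subject in cluster:
            confirmed.append(hit.subject)
    return confirmed


if __name__ == "__main__":
    # Hypothetical data: "A" is the reference polypeptide, "A2" a cluster member
    cluster = {"A", "A2"}
    forward = [Hit("X1", 1e-50, 78.0), Hit("X2", 1e-8, 40.0), Hit("X3", 1e-2, 60.0)]
    reverse = {
        "X1": [Hit("A", 1e-48, 78.0), Hit("B", 1e-10, 45.0)],
        "X2": [Hit("B", 1e-9, 41.0)],
    }
    print(reciprocal_homologs(forward, reverse, cluster))
```

In this toy data only X1 is retained: X2 returns a non-cluster polypeptide as its best reverse hit, and X3 fails the forward E-value cutoff.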
Abstract
Description
- This application claims priority under 35 U.S.C. §119 to U.S. Provisional Application Ser. No. 61/104,067, filed Oct. 9, 2008, which is incorporated herein by reference in its entirety.
- The material in the accompanying sequence listing is hereby incorporated by reference into this application. The accompanying file, named sequence.txt was created on Oct. 9, 2008, and is 106 KB. The file can be accessed using Microsoft Word on a computer that uses Windows OS.
- This document relates to methods, materials, and kits involved in identifying cinnamyl-alcohol dehydrogenase (CAD) alleles in sorghum germplasm and breeding methods to incorporate CAD alleles encoding truncated CAD polypeptides into desired sorghum germplasm lines or elite sorghum breeding lines. Methods for generating truncated CAD coding sequences through mutation of sorghum or preparation of synthetic sequences are also described herein as well as methods for generating transgenic plants expressing truncated CAD coding sequences. This document also relates to sorghum plants having a novel combination of CAD alleles and/or caffeic acid O-methyltransferase (COMT) alleles encoding truncated polypeptides as well as materials and methods for making such plants.
- Numerous strategies are being employed to enhance biomass conversion characteristics in dedicated energy crops such as sorghum. Plant transformation, use of naturally occurring variation, and plant breeding can be used to achieve desirable cell wall composition and structure, which is determined largely by the content and composition of lignin, cellulose, and hemicellulose, and the way they are cross-linked. CAD is associated with lignin biosynthesis. In sorghum, there is a need for identifying germplasm having altered lignin or lignin content and developing markers associated with such traits for use in breeding. The truncated CAD sequences described herein and markers associated with such truncations will expedite the selection of superior new varieties of sorghum with enhanced biofuel conversion properties and/or forage properties. For example, the introduction of sweet sorghum and/or truncated CAD traits into a high biomass staygreen sorghum germplasm may improve yields and conversion properties dramatically.
- This document provides materials and methods involved in identifying alleles encoding truncated CAD polypeptides in sorghum germplasm. This document also provides breeding methods to incorporate alleles encoding truncated CAD polypeptides into desired sorghum germplasm lines or elite sorghum breeding lines. For example, this document provides isolated nucleic acids, transgenic plant cells and plants and plant tissues produced from transgenic plant cells, as well as plants of agronomically elite varieties. This document provides methods for producing plants comprising CAD encoding nucleic acids, for incorporating a desired trait into a sorghum cultivar, for characterizing and breeding sorghum plants, and for modulating the composition of a plant. Also, this document provides kits to genotype a sorghum biological sample. The material, methods and kits provided herein can be used to achieve desirable cell wall composition and structure, and advance the selection of advantageous varieties of sorghum for production of biomass with improved digestibility, which may benefit both humans and animals.
- Isolated nucleic acids encoding truncated CAD polypeptides are provided herein. In some embodiments, an isolated nucleic acid comprises a sequence encoding a CAD polypeptide. The CAD polypeptide comprises at least 98% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminates at a position corresponding to residue 131 or 320 of SEQ ID NO: 6. In some embodiments, an isolated nucleic acid comprises a sequence encoding a sorghum CAD polypeptide. The sorghum CAD polypeptide comprises at least 80% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminates at a position corresponding to residue 131 or 320 of SEQ ID NO: 6. In some embodiments, the nucleic acid encoding a CAD polypeptide having at least 98% or at least 80% sequence identity to amino acids 1-130 or 1-319, and terminating at a position corresponding to residue 131 or 320 of SEQ ID NO: 6 further comprises a thymine corresponding to position 2794 of SEQ ID NO:2, position 2800 of SEQ ID NO: 4, 7, 10, or 13, position 4083 of SEQ ID NO: 2, position 4089 of SEQ ID NOs: 4 or 7, position 4090 of SEQ ID NO: 10, position 497 of SEQ ID NO: 1, position 394 of SEQ ID NOs: 3, 5, 8, 11, or 14, position 1064 of SEQ ID NO:1, position 962 of SEQ ID NO:11, or position 961 of SEQ ID NOs: 3, 5, or 8. In some embodiments, the nucleic acid encoding a polypeptide having at least 98% or at least 80% sequence identity to amino acids 1-130 or 1-319, and terminating at a position corresponding to residue 131 or 320 of SEQ ID NO: 6, further comprises at least 80% sequence identity to a nucleotide sequence selected from the group consisting of SEQ ID NO: 1, 2, 3, 4, 5, 7, 8, 10, 11, 13, and 14.
- Transgenic plant cells comprising nucleic acids encoding CAD polypeptides are also provided herein. For example, this document provides a transgenic plant cell comprising at least one exogenous nucleic acid. The exogenous nucleic acid comprises a regulatory region operably linked to a nucleic acid. The nucleic acid comprises a sequence encoding a CAD polypeptide or a sorghum CAD polypeptide having at least 98% or at least 80% sequence identity to amino acids 1-130 or 1-319 and terminating at a position corresponding to residue 131 or 320 of SEQ ID NO: 6. In some embodiments, a plant produced from the transgenic plant cell has a decrease in the level of CAD activity as compared to the corresponding level in a control plant that does not comprise the nucleic acid. In some embodiments, the plant produced from the transgenic plant cell exhibits a brown midrib phenotype as compared to a control plant that does not comprise the CAD encoding nucleic acid. In some embodiments, the plant produced from the transgenic plant cell has a decrease in the level of lignin as compared to the corresponding level in a control plant that does not comprise the CAD encoding nucleic acid.
- Plants and tissues comprising transgenic plant cells are also provided herein. For example, this document provides a plant comprising a transgenic plant cell. The transgenic plant cell comprises at least one exogenous nucleic acid. The exogenous nucleic acid comprises a regulatory region operably linked to a nucleic acid. The nucleic acid comprises a sequence encoding a CAD polypeptide or a sorghum CAD polypeptide having at least 98% or at least 80% sequence identity to amino acids 1-130 or 1-319 and terminating at a position corresponding to residue 131 or 320 of SEQ ID NO: 6. In some embodiments, a plant produced from the transgenic plant cell has a decrease in the level of CAD activity as compared to the corresponding level in a control plant that does not comprise the nucleic acid. This document also provides biomass or seed comprising tissue from plants which comprise the transgenic plant cells.
- Methods for producing plants comprising CAD encoding nucleic acids are provided herein. For example, in one aspect, a method comprises growing a transgenic plant cell comprising an exogenous nucleic acid. The nucleic acid comprises a sequence encoding a CAD polypeptide having at least 98% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminating at a position corresponding to residue 131 or 320 of SEQ ID NO: 6. In another aspect, a method comprises growing a transgenic plant cell comprising an exogenous nucleic acid encoding a sorghum CAD polypeptide. The sorghum CAD polypeptide comprises at least 80% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminates at a position corresponding to residue 131 or 320 of SEQ ID NO: 6.
- Methods for characterizing a sorghum plant are provided herein. For example, in one aspect, a method comprises detecting a nucleic acid encoding a CAD polypeptide in the sorghum plant. The CAD polypeptide has at least 80% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminates at a position corresponding to residue 131 or 320 of SEQ ID NO: 6. The nucleic acid can have a thymine corresponding to position 2794 of SEQ ID NO:2, position 2800 of SEQ ID NO: 4, 7, 10, or 13, position 4083 of SEQ ID NO: 2, position 4089 of SEQ ID NOs: 4 or 7, position 4090 of SEQ ID NO: 10, position 497 of SEQ ID NO: 1, position 394 of SEQ ID NOs: 3, 5, 8, 11, or 14, position 1064 of SEQ ID NO:1, position 962 of SEQ ID NO:11, or position 961 of SEQ ID NOs: 3, 5, or 8.
- This document provides methods of determining the presence of a polynucleotide in a sorghum plant. For example, in one aspect, a method comprises contacting at least one probe or primer pair with nucleic acid from the sorghum plant. The probe or primer pair is specific for a polynucleotide that encodes a CAD polypeptide. The CAD polypeptide has at least 80% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminates at a position corresponding to residue 131 or 320 of SEQ ID NO: 6. The method also comprises determining whether or not the polynucleotide is present in the sorghum plant. The probe can be an oligonucleotide, e.g., an oligonucleotide comprising a nucleotide sequence selected from the group consisting of SEQ ID NOs: 34 and 36.
- Kits for genotyping a sorghum biological sample are provided herein. For example, this document provides a kit comprising a primer pair that specifically amplifies, or a probe that specifically hybridizes to, a polynucleotide that encodes a CAD polypeptide. The CAD polypeptide comprises at least 80% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminates at a position corresponding to residue 131 or 320 of SEQ ID NO: 6. In some embodiments, a kit comprises at least one primer of the primer pair or probe having specificity for a thymine corresponding to position 2794 of SEQ ID NO:2, position 2800 of SEQ ID NO: 4, 7, 10, or 13, position 4083 of SEQ ID NO: 2, position 4089 of SEQ ID NOs: 4 or 7, position 4090 of SEQ ID NO: 10, position 497 of SEQ ID NO: 1, position 394 of SEQ ID NOs: 3, 5, 8, 11, or 14, position 1064 of SEQ ID NO:1, position 962 of SEQ ID NO:11, or position 961 of SEQ ID NOs: 3, 5, or 8. In some embodiments, a kit comprises at least one primer or probe comprising a nucleotide sequence selected from the group consisting of SEQ ID NO: 34 and 36.
- Methods of breeding sorghum plants comprising CAD encoding nucleic acids are provided herein. In one aspect, the method comprises crossing two or more sorghum plants to produce progeny plants. At least one sorghum plant comprises at least one CAD allele encoding a CAD polypeptide having at least 80% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6, and terminating at a position corresponding to residue 131 or 320 of SEQ ID NO: 6. The progeny plants can have at least one allele at a COMT locus that encodes a truncated COMT polypeptide. The method can also comprise identifying one or more of the progeny plants that comprise the at least one CAD allele. The at least one progeny plant can be homozygous for the CAD allele. The method can comprise identifying the CAD allele by a thymine corresponding to position 2794 of SEQ ID NO:2, position 2800 of SEQ ID NO: 4, 7, 10, or 13, position 4083 of SEQ ID NO: 2, position 4089 of SEQ ID NOs: 4 or 7, position 4090 of SEQ ID NO: 10, position 497 of SEQ ID NO: 1, position 394 of SEQ ID NOs: 3, 5, 8, 11, or 14, position 1064 of SEQ ID NO:1, position 962 of SEQ ID NO:11, or position 961 of SEQ ID NOs: 3, 5, or 8. In another aspect, the method involves identification with at least one oligonucleotide specific for the CAD allele, e.g., an oligonucleotide comprising a nucleotide sequence set forth in SEQ ID NOs: 34 or 36. The method can also comprise using one or more of the identified progeny plants in a next generation of plant breeding.
- A method of introducing a desired trait into a sorghum cultivar by marker assisted backcrossing is provided herein. For example, the method can comprise identifying a first sorghum plant having at least one CAD allele that encodes a CAD polypeptide. The CAD polypeptide comprises at least 80% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminates at a position corresponding to residue 131 or 320 of SEQ ID NO: 6. The method can also comprise crossing the first sorghum plant with a second, genetically distinct sorghum plant having a desired trait, to produce progeny plants. The desired trait is not a phenotype conferred by the CAD allele. The method can also comprise selecting one or more progeny plants that have the desired trait and have a marker associated with the CAD allele, to produce selected progeny plants. The associated marker can comprise a thymine corresponding to position 2794 of SEQ ID NO:2, position 2800 of SEQ ID NO: 4, 7, 10, or 13, position 4083 of SEQ ID NO: 2, position 4089 of SEQ ID NOs: 4 or 7, position 4090 of SEQ ID NO: 10, position 497 of SEQ ID NO: 1, position 394 of SEQ ID NOs: 3, 5, 8, 11, or 14, position 1064 of SEQ ID NO:1, position 962 of SEQ ID NO:11, or position 961 of SEQ ID NOs: 3, 5, or 8. The selected progeny plants can be backcrossed with the first or second plants to produce backcross progeny plants, and selected for backcross progeny plants that have the desired trait and the marker. The backcross progeny plants can have more than one marker associated with the CAD allele, or can be homozygous for the CAD allele. Selection can also be carried out for a marker associated with the desired trait. Backcrossing and selection can be repeated at least three times to produce BC4 or higher backcross progeny plants that have the desired trait and the at least one CAD allele. Such progeny plants can also have the at least one allele at the COMT locus that encodes a truncated COMT polypeptide. In another aspect, a method of introducing a desired trait into a sorghum cultivar comprises identifying the CAD allele with an oligonucleotide specific for the CAD allele. For example, the oligonucleotide can comprise a nucleotide sequence selected from the group consisting of SEQ ID NOs: 34 and 36.
- Methods of modulating plant composition are provided herein. For example, in one aspect, a method comprises introducing into a plant cell an exogenous nucleic acid encoding a sorghum CAD polypeptide. The sorghum CAD polypeptide has at least 80% sequence identity to amino acids 1-130 or 1-319 of SEQ ID NO: 6 and terminates at a position corresponding to residue 131 or 320 of SEQ ID NO: 6. The composition of a plant produced from the plant cell is modulated as compared to the composition of a control plant that does not comprise the nucleic acid, e.g., decreased lignin content, increased glucan content, increased cellulose content, or increased hemicellulose content.
- Plants of an agronomically elite sorghum variety are provided herein. For example, this document provides plants that are homozygous at a CAD locus for an allele encoding a truncated CAD polypeptide. In another embodiment, the plants are homozygous at a COMT locus for an allele that encodes a truncated COMT polypeptide. The plants can be male sterile or female sterile.
- Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention pertains. Although methods and materials similar or equivalent to those described herein can be used to practice the invention, suitable methods and materials are described below. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety. In case of conflict, the present specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and not intended to be limiting.
- The details of one or more embodiments of the invention are set forth in the accompanying drawings and the description below. Other features, objects, and advantages of the invention will be apparent from the description and drawings, and from the claims.
- FIG. 1(A-O) is an alignment of sorghum CAD genomic nucleotide sequences for alleles corresponding to full length CAD (SEQ ID NO:2 from Ceres germplasm ID No.: PI599692-81733680; and SEQ ID NO:4 from Ceres germplasm ID No.: 22043-81733671), a truncated CAD of 320 amino acids (SEQ ID NO:7 from Ceres germplasm ID No.: PI602730-81733686), a truncated CAD of 131 amino acids (SEQ ID NO:13 from Ceres germplasm ID No.: PI535790-81733677), and a CAD having a frameshift insertion mutation at position 4016 (SEQ ID NO:10 from Ceres germplasm ID No.: BICOLOR-81733675). In all the alignment figures shown herein, a dash in an aligned sequence represents a gap, i.e., a lack of a nucleotide at that position. Identical nucleotides among aligned sequences are identified by boxes. FIG. 1 and the other alignment figure provided herein were generated using the program MUSCLE version 3.52.
- FIG. 2(A-F) is an alignment of sorghum CAD cDNA sequences for alleles corresponding to full length CAD (SEQ ID NO:1 from GI No. 119852230; SEQ ID NO:3 from Ceres germplasm ID No.: PI599692-81733680; SEQ ID NO:5 from Ceres germplasm ID No.: 22043-81733671), a truncated CAD of 320 amino acids (SEQ ID NO:8 from Ceres germplasm ID No.: PI602730-81733686), a truncated CAD of 131 amino acids (SEQ ID NO:14 from Ceres germplasm ID No.: PI535790-81733677), and a CAD having a frameshift insertion mutation at position 890 (SEQ ID NO:11 from Ceres germplasm ID No.: BICOLOR-81733675).
- The brown midrib (BMR) trait results in reduced lignification, reduced cell-wall concentration, increased digestibility and increased voluntary intake of feed by ruminants (Casler et al., 2003). In sorghum, BMR phenotypes are typical of some mutants of the CAD and COMT genes. There are at least 28 BMR mutants in sorghum, some being spontaneous mutations and others induced by mutagenesis. In addition to the brown vascular tissue pigmentation of the leaf midribs and stems, these BMR mutants often exhibit decreased lignin content in stems and leaves in comparison to wild types or cultivars lacking a BMR phenotype, as CAD and COMT contribute to the lignin biosynthesis pathway. BMR plants have lignin that is less polymerized and contains fewer phenolic monomers that can affect digestion. Suzuki et al. analyzed stem samples from BMR sorghum phenotypes and found increased levels of 5-hydroxy-guaiacyl residues in the cell walls, in comparison to wild types or cultivars lacking a BMR phenotype (Suzuki et al., 1997). Porter et al. describe phenotypes for several sorghum BMR mutations (Porter et al., 1978), for example, the content of acid detergent fiber, lignin, cellulose, hemicellulose, percent cell wall constituent and in vitro cell wall constituent disappearance in stems and leaves for BMR-6 and BMR-17 mutations in comparison to normal plants.
- An “allele” is any of one or more alternative forms of a gene. In a diploid cell or organism, the two alleles of a given gene occupy corresponding loci on a pair of homologous chromosomes.
- “Amino acid” refers to one of the twenty biologically occurring amino acids and to synthetic amino acids, including D/L optical isomers.
- “Biomass” refers to harvestable above ground vegetative matter of plants, typically a mixture of leaves, stems, and reproductive structures. Vegetative matter may be comprised of only leaves or only stems in some instances, and is considered to be biomass. Seeds are not considered vegetative matter and, therefore, compositions that contain primarily only seeds are not considered to be biomass, although it will be appreciated that biomass may contain seeds as part of the mixture. Biomass can be quantified as dry matter yield, which is the mass of biomass produced (usually reported in T/acre) if the contribution of water is subtracted from the fresh matter weight. Dry matter yield (DMY) is calculated using the fresh matter weight (FMW) and a measurement of weight percent moisture (M) in the following equation: DMY = ((100 − M)/100) × FMW. Biomass can also be quantified as fresh matter yield, which is the mass of biomass produced (usually reported in T/acre) on an as-received basis, which includes the weight of moisture.
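- The dry matter yield equation given above can be written as a short function. The function and parameter names are illustrative, the result is in the same units as the fresh matter weight (for example T/acre), and the input values in the usage lines are made up.

```python
def dry_matter_yield(fresh_matter_weight: float, moisture_percent: float) -> float:
    """DMY = ((100 - M) / 100) * FMW, with M the weight percent moisture."""
    if not 0.0 <= moisture_percent <= 100.0:
        raise ValueError("moisture_percent must be between 0 and 100")
    if fresh_matter_weight < 0.0:
        raise ValueError("fresh_matter_weight must be non-negative")
    return (100.0 - moisture_percent) / 100.0 * fresh_matter_weight


if __name__ == "__main__":
    # Hypothetical harvest: 20 T/acre fresh matter at 75% moisture
    print(dry_matter_yield(20.0, 75.0))  # 5.0
```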
- “Cell type-preferential promoter” or “tissue-preferential promoter” refers to a promoter that drives expression preferentially in a target cell type or tissue, respectively, but may also lead to some transcription in other cell types or tissues as well.
- “Control plant” refers to a plant that does not contain the exogenous nucleic acid present in a transgenic plant of interest, but otherwise has the same or similar genetic background as such a transgenic plant. A suitable control plant can be a non-transgenic wild type plant, a non-transgenic segregant from a transformation experiment, or a transgenic plant that contains an exogenous nucleic acid other than the exogenous nucleic acid of interest.
- “Domains” are groups of substantially contiguous amino acids in a polypeptide that can be used to characterize protein families and/or parts of proteins. Such domains have a “fingerprint” or “signature” that can comprise conserved primary sequence, secondary structure, and/or three-dimensional conformation. Generally, domains are correlated with specific in vitro and/or in vivo activities. A domain can have a length of from 10 amino acids to 400 amino acids, e.g., 10 to 50 amino acids, or 25 to 100 amino acids, or 35 to 65 amino acids, or 35 to 55 amino acids, or 45 to 60 amino acids, or 200 to 300 amino acids, or 300 to 400 amino acids.
- “Down-regulation” refers to regulation that decreases production of expression products (mRNA, polypeptide, or both) relative to basal or native states.
- “Exogenous” with respect to a nucleic acid indicates that the nucleic acid is part of a recombinant nucleic acid construct, or is not in its natural environment. For example, an exogenous nucleic acid can be a sequence from one species introduced into another species, i.e., a heterologous nucleic acid. Typically, such an exogenous nucleic acid is introduced into the other species via a recombinant nucleic acid construct. An exogenous nucleic acid can also be a sequence that is native to an organism and that has been reintroduced into cells of that organism. An exogenous nucleic acid that includes a native sequence can often be distinguished from the naturally occurring sequence by the presence of non-natural sequences linked to the exogenous nucleic acid, e.g., non-native regulatory sequences flanking a native sequence in a recombinant nucleic acid construct. In addition, stably transformed exogenous nucleic acids typically are integrated at positions other than the position where the native sequence is found. It will be appreciated that an exogenous nucleic acid may have been introduced into a progenitor and not into the cell under consideration. For example, a transgenic plant containing an exogenous nucleic acid can be the progeny of a cross between a stably transformed plant and a non-transgenic plant. Such progeny are considered to contain the exogenous nucleic acid.
- “Expression” refers to the process of converting genetic information of a polynucleotide into RNA through transcription, which is catalyzed by an enzyme, RNA polymerase, and into protein, through translation of mRNA on ribosomes.
- “Heterologous polypeptide” as used herein refers to a polypeptide that is not a naturally occurring polypeptide in a plant cell, e.g., a transgenic Panicum virgatum plant transformed with and expressing the coding sequence for a nitrogen transporter polypeptide from a Zea mays plant.
- “Isolated nucleic acid” as used herein includes a naturally-occurring nucleic acid, provided one or both of the sequences immediately flanking that nucleic acid in its naturally-occurring genome is removed or absent. Thus, an isolated nucleic acid includes, without limitation, a nucleic acid that exists as a purified molecule or a nucleic acid molecule that is incorporated into a vector or a virus. A nucleic acid existing among hundreds to millions of other nucleic acids within, for example, cDNA libraries, genomic libraries, or gel slices containing a genomic DNA restriction digest, is not to be considered an isolated nucleic acid.
- “Locus” refers to a position on a chromosome, for example, the region of a chromosome at which a particular gene is located. In a diploid organism, the allele at a particular gene locus on one chromosome may be an allele that is different from the allele at that locus on the homologous chromosome, in which case the organism is considered heterozygous for that locus. If the alleles at a particular locus are the same, the organism is considered homozygous for that locus.
- “Modulation” of the level of chemical composition, phenotype, or enzyme activity refers to the change in the level that is observed as a result of expression of, or transcription from, an exogenous nucleic acid in a plant cell. The change in level is measured relative to the corresponding level in control plants.
- “Nucleic acid” and “polynucleotide” are used interchangeably herein, and refer to both RNA and DNA, including cDNA, genomic DNA, synthetic DNA, and DNA or RNA containing nucleic acid analogs. Polynucleotides can have various three-dimensional structures. A nucleic acid can be double-stranded or single-stranded (i.e., a sense strand or an antisense strand). Non-limiting examples of polynucleotides include genes, gene fragments, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, siRNA, micro-RNA, ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, nucleic acid probes and nucleic acid primers. A polynucleotide may contain unconventional or modified nucleotides.
- “Operably linked” refers to the positioning of a regulatory region and a sequence to be transcribed in a nucleic acid so that the regulatory region is effective for regulating transcription or translation of the sequence. For example, to operably link a coding sequence and a regulatory region, the translation initiation site of the translational reading frame of the coding sequence is typically positioned between one and about fifty nucleotides downstream of the regulatory region. A regulatory region can, however, be positioned as much as about 5,000 nucleotides upstream of the translation initiation site, or about 2,000 nucleotides upstream of the transcription start site.
- “Polypeptide” as used herein refers to a compound of two or more subunit amino acids, amino acid analogs, or other peptidomimetics, regardless of post-translational modification, e.g., phosphorylation or glycosylation. The subunits may be linked by peptide bonds or other bonds such as, for example, ester or ether bonds. Full-length polypeptides, truncated polypeptides, point mutants, insertion mutants, splice variants, chimeric proteins, and fragments thereof are encompassed by this definition.
- “Progeny” includes descendants of a particular plant or plant line. Progeny of an instant plant include seeds formed on F1, F2, F3, F4, F5, F6 and subsequent generation plants, or seeds formed on BC1, BC2, BC3, and subsequent generation plants, or seeds formed on F1BC1, F1BC2, F1BC3, and subsequent generation plants. The designation F1 refers to the progeny of a cross between two parents that are genetically distinct. The designations F2, F3, F4, F5 and F6 refer to subsequent generations of self- or sib-pollinated progeny of an F1 plant.
- A “probe” is a molecule capable of distinguishing among polymorphisms in the genome of an organism. For example, a nucleic acid to which is attached a conventional detectable label or reporter molecule, e.g., a radioactive isotope, ligand, chemiluminescent agent, fluorescent agent, or enzyme can be a probe. Such a probe can be complementary to a strand of a target nucleic acid, such as to a strand of genomic DNA from sorghum having a truncated CAD, whether from a sorghum plant or from a sample that includes DNA from a sorghum plant. Probes include not only deoxyribonucleic or ribonucleic acids but also polyamides and other probe materials that bind specifically to a target DNA sequence and can be used to detect the presence of that target DNA sequence. Hybridization of probes with target DNA can be detected by several methods including polymerase chain reaction (PCR) based assays, electrophoresis-based assays, or the molecular beacon or dynamic allele-specific hybridization (DASH) assays.
- “Primers” are nucleic acids, typically oligonucleotides, that can anneal to a complementary or substantially complementary target DNA strand to form a hybrid between the primer and the target DNA strand, then can be extended along the target DNA strand by a polymerase. Primer pairs of the present invention can be used for amplification of a specific nucleic acid, e.g., by PCR or other conventional nucleic acid amplification methods.
- “Regulatory region” refers to a nucleic acid having nucleotide sequences that influence transcription or translation initiation and rate, and stability and/or mobility of a transcription or translation product. Regulatory regions include, without limitation, promoter sequences, enhancer sequences, response elements, protein recognition sites, inducible elements, protein binding sequences, 5′ and 3′ untranslated regions (UTRs), transcriptional start sites, termination sequences, polyadenylation sequences, introns, and combinations thereof. A regulatory region typically comprises at least a core (basal) promoter. A regulatory region also may include at least one control element, such as an enhancer sequence, an upstream element or an upstream activation region (UAR). For example, a suitable enhancer is a cis-regulatory element (−212 to −154) from the upstream region of the octopine synthase (ocs) gene. Fromm et al., The Plant Cell, 1:977-984 (1989).
- “Up-regulation” refers to regulation that increases the level of an expression product (mRNA, polypeptide, or both) relative to basal or native states.
- “Vector” refers to a replicon, such as a plasmid, phage, or cosmid, into which another DNA segment may be inserted so as to bring about the replication of the inserted segment. Generally, a vector is capable of replication when associated with the proper control elements. The term “vector” includes cloning and expression vectors, as well as viral vectors and integrating vectors. An “expression vector” is a vector that includes a regulatory region.
- Polypeptides described herein include C-terminus truncated CAD polypeptides. Such polypeptides can be lignin-modulating polypeptides. Lignin-modulating polypeptides can be effective to modulate lignin levels when expressed in a plant or plant cell. Such polypeptides typically contain at least one domain indicative of lignin-modulating polypeptides, as described in more detail herein. In some embodiments, lignin-modulating polypeptides have greater than 90% identity to SEQ ID NOs: 6, 9, 12, 15, 18, 21, 24, 27, 30, or 33, as described in more detail herein.
- In some embodiments, lignin-modulating polypeptides such as a C-terminus truncated sorghum CAD polypeptide can be about 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, or 350 amino acids in length. In some embodiments, lignin-modulating polypeptides such as C-terminus truncated CADs can be 131 or 320 amino acids in length. In some embodiments the truncated CADs are from sorghum.
- A lignin-modulating polypeptide can contain an Alcohol dehydrogenase GroES-like domain (ADH N), a methyltransferase small domain (MTS), and/or a Zinc-binding dehydrogenase domain (ADH zinc N), which is predicted to be characteristic of a CAD enzyme. In some embodiments, a C-terminus truncated CAD described herein comprises all or a substantial portion of an ADH N domain. In some embodiments, the C-terminus truncated CAD described herein comprises an ADH N domain and a portion of an ADH zinc N domain. SEQ ID NO: 9 sets forth the amino acid sequence of a truncated CAD clone, identified herein as PI602730-81733686, that is predicted to encode a polypeptide containing a portion of an ADH zinc N domain and ADH N and MTS domains. SEQ ID NO: 15 sets forth the amino acid sequence of a sorghum clone, identified herein as PI535790-81733677, that is predicted to encode a polypeptide containing a portion of an ADH N domain.
- In some embodiments, the truncated CAD described herein is a naturally occurring polypeptide. In other embodiments, the truncated CAD described herein is synthetic. For example, an allelic variant of a sorghum CAD can be identified by BLASTing or designing primers that recognize conserved regions of the gene and amplifying said gene and then synthesizing a nucleic acid that encodes truncated CAD. In other embodiments, site directed mutagenesis may be used to generate desired truncations. A truncated polypeptide may retain certain domains of the naturally occurring polypeptide while lacking others. Thus, length variants that are up to about 2, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 125, 150, 175, 200, 225 or 300 amino acids shorter or longer than a naturally occurring CAD typically exhibit the lignin-modulating activity of a truncated polypeptide. In some embodiments, a truncated CAD comprises about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, or 95 amino acids of an ADH N domain. In some embodiments, a truncated CAD comprises about 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 110, or 120 amino acids of an ADH zinc N domain. In some embodiments, a truncated polypeptide is a dominant negative polypeptide. SEQ ID NOs: 9 and 15 set forth the amino acid sequences of lignin-modulating polypeptides that are truncated at the C-terminal end relative to a full length sorghum CAD polypeptide. Expression in a plant of such a truncated polypeptide confers a difference in the level of lignin in a tissue of the plant as compared to the corresponding level in tissue of a control plant that does not comprise the truncation.
- In some embodiments, one or more functional homologs of a reference lignin-modulating polypeptide defined by one or more of the Pfam descriptions indicated above are suitable for use as lignin-modulating polypeptides or truncations thereof. A functional homolog is a polypeptide that has sequence similarity to a reference truncated CAD polypeptide, and that exhibits a brown midrib phenotype. A functional homolog and the reference polypeptide may be naturally occurring polypeptides, and the sequence similarity may be due to convergent or divergent evolutionary events. As such, functional homologs are sometimes designated in the literature as homologs, or orthologs, or paralogs. Variants of a naturally occurring functional homolog, such as polypeptides encoded by mutants of a wild type coding sequence, may themselves be functional homologs. Functional homologs can also be created via site-directed mutagenesis of the coding sequence for a lignin-modulating polypeptide, or by combining domains from the coding sequences for different naturally-occurring lignin-modulating polypeptides (“domain swapping”). The term “functional homolog” is sometimes applied to the nucleic acid that encodes a functionally homologous polypeptide. In some embodiments, a nucleic acid encoding a truncated CAD may be synthesized.
- Functional homologs and potential allelic variants can be identified by analysis of nucleotide and polypeptide sequence alignments. For example, performing a query on a database of nucleotide or polypeptide sequences can identify homologs of lignin-modulating polypeptides. Sequence analysis can involve BLAST, Reciprocal BLAST, or PSI-BLAST analysis of nonredundant databases using a lignin-modulating polypeptide amino acid sequence as the reference sequence. Amino acid sequence is, in some instances, deduced from the nucleotide sequence. Those polypeptides in the database that have greater than 90% sequence identity are candidates for allelic variants of a lignin-modulating polypeptide which can be used to make truncations as described herein.
- Amino acid sequence similarity allows for conservative amino acid substitutions, such as substitution of one hydrophobic residue for another or substitution of one polar residue for another. If desired, manual inspection of such candidates can be carried out in order to narrow the number of candidates to be further evaluated. Manual inspection can be performed by selecting those candidates that appear to have domains present in lignin-modulating polypeptides, e.g., conserved functional domains.
- Conserved regions can be identified by locating a region within the primary amino acid sequence of a lignin-modulating polypeptide that is a repeated sequence, forms some secondary structure (e.g., alpha helices and beta sheets), establishes positively or negatively charged domains, or represents a protein motif or domain. See, e.g., the Pfam web site describing consensus sequences for a variety of protein motifs and domains on the World Wide Web at sanger.ac.uk/Software/Pfam/ and pfam.janelia.org/. The information included at the Pfam database is described in Sonnhammer et al., Nucl. Acids Res., 26:320-322 (1998); Sonnhammer et al., Proteins, 28:405-420 (1997); and Bateman et al., Nucl. Acids Res., 27:260-262 (1999). Conserved regions also can be determined by aligning sequences of the same or related polypeptides from closely related species. Closely related species preferably are from the same family. In some embodiments, alignment of sequences from two different species is adequate. Typically, polypeptides that exhibit at least about 40% amino acid sequence identity are useful to identify conserved regions. Conserved regions of related polypeptides exhibit at least 45% amino acid sequence identity (e.g., at least 50%, at least 60%, at least 70%, at least 80%, or at least 90% amino acid sequence identity). In some embodiments, a conserved region exhibits at least 92%, 94%, 96%, 98%, or 99% amino acid sequence identity. For example, a truncated CAD may have a conserved ADH domain as compared to CAD amino acid sequences from other species.
- Examples of amino acid sequences of allelic variants of the polypeptide set forth in SEQ ID NO: 6 are provided in the Sequence Listing. Such allelic variants include PI602730-81733686 (SEQ ID NO: 9) and PI535790-81733677 (SEQ ID NO: 15). In some cases, an allelic variant of SEQ ID NO: 6 has an amino acid sequence with at least 80% sequence identity, e.g., 50%, 52%, 56%, 59%, 61%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity, to the amino acid sequence set forth in SEQ ID NO: 6, 9, 12, or 15. In some embodiments, an allelic variant of SEQ ID NO: 6 or 12 is truncated by about 5, 10, 25, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, or 300 amino acids in length. In some embodiments, the allelic variants are from sorghum.
- The identification of conserved regions in a truncated lignin-modulating polypeptide facilitates production of variants of truncated lignin-modulating polypeptides. Variants of truncated lignin-modulating polypeptides typically have 10 or fewer conservative amino acid substitutions within the primary amino acid sequence, e.g., 7 or fewer conservative amino acid substitutions, 5 or fewer conservative amino acid substitutions, or between 1 and 5 conservative substitutions. A useful variant polypeptide can be constructed based on one of the alignments of nucleic acids set forth in FIG. 1 or FIG. 2 and/or alleles identified in the Sequence Listing. Such a polypeptide includes the conserved regions, arranged in the order from amino-terminal end to carboxy-terminal end. Such a polypeptide may also include zero, one, or more than one amino acid in positions marked by dashes. When no amino acids are present at positions marked by dashes, the length of such a polypeptide is the sum of the amino acid residues in all conserved regions. When amino acids are present at all positions marked by dashes, such a polypeptide has a length that is the sum of the amino acid residues in all conserved regions and all dashes.
- Truncations of CAD homologs or sorghum allelic variants of CAD are also described herein. For example, CAD homologs or sorghum allelic variants of CAD can be truncated artificially, or naturally occurring truncations can be identified, such that the length of the resulting polypeptide corresponds to the length of the polypeptide of SEQ ID NOs: 9 or 15. Polypeptide sequences of CAD homologs or sorghum allelic variants of CAD can be aligned with the truncated CAD sequences of SEQ ID NOs: 9 and/or 15 using, for example, a Clustal program such as ClustalW 1.83. Alternatively, the nucleotide sequences encoding CAD homologs or sorghum allelic variants of CAD can be aligned with the truncated nucleotide sequences of SEQ ID NOs: 7 and/or 13 (genomic DNA), or 8 and/or 14 (cDNA) using a Clustal program. The alignments of polypeptides or nucleotides can then be used to determine the corresponding position at which a truncated sequence can terminate. For example, in FIG. 1, sequences aligned with SEQ ID NO: 13 that terminate with the nucleotide in the alignment that aligns with position 2802 of SEQ ID NO: 13 are corresponding truncations. In FIG. 1, for example, sequences aligned with SEQ ID NO: 7 that terminate with the nucleotide in the alignment that aligns with position 4091 of SEQ ID NO: 7 are corresponding truncations. In FIG. 2, for example, sequences aligned with SEQ ID NO: 14 that terminate with the nucleotide in the alignment that aligns with position 396 of SEQ ID NO: 14 are corresponding truncations. In FIG. 2, for example, sequences aligned with SEQ ID NO: 8 that terminate with the nucleotide in the alignment that aligns with position 964 of SEQ ID NO: 8 are corresponding truncations.
- Various methods for measuring the level of CAD or the activity of CAD are known in the art. In the lignin biosynthesis pathway, CAD is known to be involved in several reduction reactions, including, but not limited to, the reduction of p-Coumaraldehyde to p-Coumaryl alcohol, Caffeyl aldehyde to Caffeyl alcohol, Coniferaldehyde to Coniferyl alcohol, and Sinapaldehyde to Sinapyl alcohol. For example, in vitro, substrates can be labeled, using carbon or other means, and CAD from a plant sample or a plant extract comprising CAD can be added to the substrate to be reduced. The amount of label in the product can be used to compare the level of CAD activity among samples.
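- By way of illustration only, the determination of a corresponding truncation from an alignment, as described above for FIG. 1 and FIG. 2, can be sketched in Python. The sketch assumes that the two aligned rows have already been produced by an alignment program and that gaps are written as dashes. The function names and the short sequences in the usage note are hypothetical and are not taken from the Sequence Listing.

```python
def column_for_reference_position(aligned_ref: str, position: int, gap: str = "-") -> int:
    """Return the 0-based alignment column that holds the given 1-based
    residue position of the reference row (gap columns are not counted)."""
    if position < 1:
        raise ValueError("position is 1-based and must be at least 1")
    seen = 0
    for column, symbol in enumerate(aligned_ref):
        if symbol != gap:
            seen += 1
            if seen == position:
                return column
    raise ValueError("position lies beyond the end of the reference")


def corresponding_truncation(aligned_ref: str, aligned_other: str,
                             position: int, gap: str = "-") -> str:
    """Truncate the other sequence at the column that aligns with `position`
    of the reference, and return it with gap characters removed."""
    if len(aligned_ref) != len(aligned_other):
        raise ValueError("aligned rows must have the same length")
    column = column_for_reference_position(aligned_ref, position, gap)
    return aligned_other[: column + 1].replace(gap, "")
```

For example, with the hypothetical rows `ATG-CCGTAA` (reference) and `ATGACC-TAA` (other sequence), position 5 of the reference falls in the sixth alignment column, so the corresponding truncation of the other sequence is `ATGACC`.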
- The composition of each plant sample, including, but not limited to, lignin, glucose, arabinose, fructose, galactose, xylose, cellulose, hemicellulose, 5-hydroxy-guaiacyl, neutral detergent fiber, acid detergent fiber, or acid detergent lignin can be measured by independent analytical chemistry techniques known in the art, typically wet chemical techniques. For example, following pre-treatment by acid, enzymes, or other means, plant samples can be analyzed for glucose using a YSI 2700D Dual-Channel Biochemistry Analyzer (YSI Life Sciences, Yellow Springs, Ohio). Glucan, xylan, arabinan, and lignin contents of a plant or plant part can be determined by ASTM methods E1758-01 (Determination of Biomass Sugars by High Performance Liquid Chromatography) and/or E1721-01 (Determination of Acid Insoluble Residue (Lignin) in Biomass).
- In some embodiments, a lignin-modulating polypeptide has an amino acid sequence with at least 40% sequence identity, e.g., 50%, 52%, 56%, 59%, 61%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity, to one of the amino acid sequences set forth in SEQ ID NOs: 6, 9, 12, 15, 18, 21, 24, 27, 30, or 33. Polypeptides having such a percent sequence identity often have a domain indicative of a lignin-modulating polypeptide as discussed above. Amino acid sequences of lignin-modulating polypeptides having at least 80% sequence identity to one of the amino acid sequences set forth in SEQ ID NOs: 6, 9, 12, 15, 18, 21, 24, 27, 30, or 33 can be identified by BLAST as described herein.
- “Percent sequence identity” refers to the degree of sequence identity between a reference sequence, e.g., SEQ ID NO:9, and a candidate sequence. A candidate sequence typically has a length that is from 80 percent to 200 percent of the length of the reference sequence, e.g., 82, 85, 87, 89, 90, 93, 95, 97, 99, 100, 105, 110, 115, 120, 130, 140, 150, 160, 170, 180, 190, or 200 percent of the length of the reference sequence. A percent identity for a candidate nucleic acid or polypeptide relative to a reference nucleic acid or polypeptide can be determined as follows. A reference sequence (e.g., a nucleic acid sequence or an amino acid sequence) is aligned to one or more candidate sequences using the computer program ClustalW (version 1.83, default parameters), which allows alignments of nucleic acid or polypeptide sequences to be carried out across their entire length (global alignment). Chenna et al., Nucleic Acids Res., 31(13):3497-500 (2003).
- ClustalW calculates the best match between a reference and one or more candidate sequences, and aligns them so that identities, similarities and differences can be determined. Gaps of one or more residues can be inserted into a reference sequence, a candidate sequence, or both, to maximize sequence alignments. For fast pairwise alignment of nucleic acid sequences, the following default parameters are used: word size: 2; window size: 4; scoring method: percentage; number of top diagonals: 4; and gap penalty: 5. For multiple alignment of nucleic acid sequences, the following parameters are used: gap opening penalty: 10.0; gap extension penalty: 5.0; and weight transitions: yes. For fast pairwise alignment of protein sequences, the following parameters are used: word size: 1; window size: 5; scoring method: percentage; number of top diagonals: 5; gap penalty: 3. For multiple alignment of protein sequences, the following parameters are used: weight matrix: blosum; gap opening penalty: 10.0; gap extension penalty: 0.05; hydrophilic gaps: on; hydrophilic residues: Gly, Pro, Ser, Asn, Asp, Gln, Glu, Arg, and Lys; residue-specific gap penalties: on. The ClustalW output is a sequence alignment that reflects the relationship between sequences. ClustalW can be run, for example, at the Baylor College of Medicine Search Launcher site (searchlauncher.bcm.tmc.edu/multi-align/multi-align.html) and at the European Bioinformatics Institute site on the World Wide Web (ebi.ac.uk/clustalw).
- To determine percent identity of a candidate nucleic acid or amino acid sequence to a reference sequence, the sequences are aligned using ClustalW, the number of identical matches in the alignment is divided by the length of the reference sequence, and the result is multiplied by 100. In some embodiments, the percent identity is based on the alignment over the length of the shorter sequence. It is noted that the percent identity value can be rounded to the nearest tenth. For example, 78.11, 78.12, 78.13, and 78.14 are rounded down to 78.1, while 78.15, 78.16, 78.17, 78.18, and 78.19 are rounded up to 78.2.
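- As a non-limiting sketch, the percent identity calculation described above can be expressed in Python. The sketch assumes that the reference and candidate rows come from the same alignment (for example, ClustalW output) with gaps written as dashes. The function names are hypothetical. Decimal arithmetic is used so that values such as 78.15 round up to 78.2 as stated above.

```python
from decimal import Decimal, ROUND_HALF_UP


def round_to_tenth(value: Decimal) -> Decimal:
    """Round to the nearest tenth, with 78.15 rounding up to 78.2."""
    return value.quantize(Decimal("0.1"), rounding=ROUND_HALF_UP)


def percent_identity(aligned_ref: str, aligned_cand: str, gap: str = "-") -> Decimal:
    """Identical matches in the alignment, divided by the length of the
    reference sequence (gaps excluded), multiplied by 100."""
    if len(aligned_ref) != len(aligned_cand):
        raise ValueError("aligned rows must have the same length")
    matches = sum(
        1
        for ref_symbol, cand_symbol in zip(aligned_ref, aligned_cand)
        if ref_symbol != gap and ref_symbol.upper() == cand_symbol.upper()
    )
    ref_length = sum(1 for symbol in aligned_ref if symbol != gap)
    if ref_length == 0:
        raise ValueError("reference row contains no residues")
    return round_to_tenth(Decimal(matches) * 100 / Decimal(ref_length))
```

For the hypothetical rows `MKT-AYIA` and `MKTLAY-A`, six of the seven reference residues are matched, giving 85.7 percent identity.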
- In some cases, a lignin-modulating polypeptide has an amino acid sequence with at least 40% sequence identity, e.g., 50%, 52%, 56%, 59%, 61%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity, to the amino acid sequence set forth in SEQ ID NO: 6, 9, 12, 15, 18, 21, 24, 27, 30, or 33. Amino acid sequences of polypeptides having greater than 80% sequence identity to the polypeptide set forth in SEQ ID NO:6 are provided in the Sequence Listing. Truncations of a lignin-modulating polypeptide may have a length that is from 10 percent to 90 percent of the length of the reference sequence, e.g., 10, 20, 30, 40, 50, 60, 70, 80, 90, or 95 percent of the length of the reference sequence.
- It should be appreciated that a lignin-modulating polypeptide can include additional amino acids that are not directly involved in lignin modulation, and thus such a polypeptide can be longer than would otherwise be the case. For example, a lignin-modulating polypeptide can include a purification tag, a chloroplast transit peptide, a mitochondrial transit peptide, an amyloplast peptide, or a leader sequence added to the amino or carboxy terminus. In some embodiments, a lignin-modulating polypeptide includes an amino acid sequence that functions as a reporter, e.g., a green fluorescent protein or yellow fluorescent protein.
- In some embodiments, the methods and compositions described herein comprise truncated COMT amino acid and nucleic acid sequences that modulate the lignin content of plants. Examples of such truncated COMT sequences include SEQ ID NOs: 21 or 27.
- Nucleic acids described herein include nucleic acids that are effective to modulate lignin levels when transcribed in a plant or plant cell. Such nucleic acids include, without limitation, those that encode a lignin-modulating polypeptide and those that can be used to inhibit expression of a lignin-modulating polypeptide via a nucleic acid based method.
- Nucleic acids encoding lignin-modulating polypeptides are described herein. Such nucleic acids include those that are less than 80% (e.g., from 10% to less than 45, 50, 55, 60, 65, 70, 75, or 80%) of the length of the full-length nucleic acid set forth in SEQ ID NOs: 1, 2, 4, 10, 16, 22, 28, 31, 3, 5, 17, 23, 29, or 32. Examples of nucleic acids encoding lignin-modulating polypeptides include SEQ ID NOs: 7, 10, 13, 19, 25, 8, 11, 14, 20, and 26, as described in more detail below.
- A lignin-modulating nucleic acid can comprise the nucleotide sequence set forth in SEQ ID NO: 7, 8, 10, 11, 13, 14, 19, 20, 25, or 26. Alternatively, a lignin-modulating nucleic acid can be a variant of the nucleic acid having the nucleotide sequence set forth in SEQ ID NO: 1, 2, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 3, 5, 8, 11, 14, 17, 20, 23, 26, 29, or 32. For example, a lignin-modulating nucleic acid can have a nucleotide sequence with at least 80% sequence identity, e.g., 81%, 85%, 90%, 95%, 97%, 98%, or 99% sequence identity, to the nucleotide sequence set forth in SEQ ID NO: 1, 2, 4, 7, 10, 13, 16, 19, 22, 25, 28, 31, 3, 5, 8, 11, 14, 17, 20, 23, 26, 29, or 32.
- Isolated nucleic acid molecules can be produced by standard techniques. For example, polymerase chain reaction (PCR) techniques can be used to obtain an isolated nucleic acid containing a nucleotide sequence described herein. PCR can be used to amplify specific sequences from DNA as well as RNA, including sequences from total genomic DNA or total cellular RNA. Various PCR methods are described, for example, in PCR Primer: A Laboratory Manual, Dieffenbach and Dveksler, eds., Cold Spring Harbor Laboratory Press, 1995. Generally, sequence information from the ends of the region of interest or beyond is employed to design oligonucleotide primers that are identical or similar in sequence to opposite strands of the template to be amplified. Various PCR strategies also are available by which site-specific nucleotide sequence modifications can be introduced into a template nucleic acid. Isolated nucleic acids also can be chemically synthesized, either as a single nucleic acid molecule (e.g., using automated DNA synthesis in the 3′ to 5′ direction using phosphoramidite technology) or as a series of oligonucleotides. For example, one or more pairs of long oligonucleotides (e.g., >100 nucleotides) can be synthesized that contain the desired sequence, with each pair containing a short segment of complementarity (e.g., about 15 nucleotides) such that a duplex is formed when the oligonucleotide pair is annealed. DNA polymerase is used to extend the oligonucleotides, resulting in a single, double-stranded nucleic acid molecule per oligonucleotide pair, which then can be ligated into a vector. Isolated nucleic acids of the invention also can be obtained by mutagenesis of, e.g., a naturally occurring DNA.
- B. Use of Nucleic Acids to Modulate Expression of Polypeptides
- i. Expression of a Lignin-Modulating Polypeptide
- A nucleic acid encoding one of the lignin-modulating polypeptides described herein can be used to express the polypeptide in a plant species of interest, typically by transforming a plant cell with a nucleic acid having the coding sequence for the polypeptide operably linked in sense orientation to one or more regulatory regions. It will be appreciated that because of the degeneracy of the genetic code, a number of nucleic acids can encode a particular lignin-modulating polypeptide; i.e., for many amino acids, there is more than one nucleotide triplet that serves as the codon for the amino acid. Thus, codons in the coding sequence for a given lignin-modulating polypeptide can be modified such that optimal expression in a particular plant species is obtained, using appropriate codon bias tables for that species.
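- The degeneracy noted above can be illustrated with a short Python sketch built on the standard genetic code. The sketch counts how many distinct coding sequences encode a given peptide and rewrites a peptide using caller-supplied codons. The function names are hypothetical, and the `preferred` mapping is a placeholder supplied by the user. It does not represent a codon bias table for any particular plant species.

```python
from collections import defaultdict
from itertools import product
from math import prod

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... ('*' = stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

SYNONYMOUS_CODONS = defaultdict(list)
for _codon, _aa in CODON_TABLE.items():
    SYNONYMOUS_CODONS[_aa].append(_codon)


def count_encodings(peptide: str) -> int:
    """Number of distinct coding sequences that encode the peptide."""
    peptide = peptide.upper()
    for aa in peptide:
        if aa not in SYNONYMOUS_CODONS:
            raise ValueError(f"unknown amino acid: {aa}")
    return prod(len(SYNONYMOUS_CODONS[aa]) for aa in peptide)


def recode(peptide: str, preferred: dict) -> str:
    """Build a coding sequence, using the caller's preferred codon for each
    amino acid where one is given and the first synonymous codon otherwise."""
    codons = []
    for aa in peptide.upper():
        if aa not in SYNONYMOUS_CODONS:
            raise ValueError(f"unknown amino acid: {aa}")
        codon = preferred.get(aa, SYNONYMOUS_CODONS[aa][0]).upper()
        if CODON_TABLE.get(codon) != aa:
            raise ValueError(f"{codon} does not encode {aa}")
        codons.append(codon)
    return "".join(codons)


def translate(coding_sequence: str) -> str:
    """Translate complete codons of a DNA coding sequence."""
    seq = coding_sequence.upper()
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - len(seq) % 3, 3))
```

For example, the dipeptide Met-Leu can be encoded by six different nucleotide sequences, because methionine has one codon and leucine has six. Recoding changes the nucleotide sequence but not the encoded polypeptide, which `translate` can be used to confirm.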
- In some cases, expression of a lignin-modulating polypeptide inhibits one or more functions of an endogenous polypeptide. For example, a nucleic acid that encodes a dominant negative polypeptide can be used to inhibit protein function. A dominant negative polypeptide typically is truncated relative to an endogenous wild type polypeptide, and its presence in a cell inhibits one or more functions of the wild type polypeptide in that cell, i.e., the dominant negative polypeptide is genetically dominant and confers a loss of function. The mechanism by which a dominant negative polypeptide confers such a phenotype can vary but often involves a protein-protein interaction or a protein-DNA interaction. For example, a dominant negative polypeptide can be an enzyme that is truncated relative to a native wild type enzyme, such that the truncated polypeptide retains domains involved in binding a first protein but lacks domains involved in binding a second protein. The truncated polypeptide is thus unable to properly modulate the activity of the second protein. See, e.g., US 2007/0056058.
- ii. Inhibition of Expression of a CAD or COMT Polypeptide
- Polynucleotides and recombinant constructs described herein can be used to inhibit expression of a CAD or COMT polypeptide in a plant species of interest. See, e.g., Matzke and Birchler, Nature Reviews Genetics 6:24-35 (2005); Akashi et al., Nature Reviews Mol. Cell. Biology 6:413-422 (2005); Mittal, Nature Reviews Genetics 5:355-365 (2004); Dorsett and Tuschl, Nature Reviews Drug Discovery 3: 318-329 (2004); and Nature Reviews RNA interference collection, October 2005 at nature.com/reviews/focus/mai. A number of nucleic acid based methods, including antisense RNA, ribozyme directed RNA cleavage, post-transcriptional gene silencing (PTGS), e.g., RNA interference (RNAi), and transcriptional gene silencing (TGS) are known to inhibit gene expression in plants. Suitable polynucleotides include full-length nucleic acids encoding lignin-modulating polypeptides or fragments of such full-length nucleic acids. In some embodiments, a complement of the full-length nucleic acid or a fragment thereof can be used. Typically, a fragment is at least 10 nucleotides, e.g., at least 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 30, 35, 40, 50, 80, 100, 200, 500 nucleotides or more. Generally, higher homology can be used to compensate for the use of a shorter sequence.
- Antisense technology is one well-known method. In this method, a nucleic acid of a gene to be repressed is cloned and operably linked to a regulatory region and a transcription termination sequence so that the antisense strand of RNA is transcribed. The recombinant construct is then transformed into plants, as described herein, and the antisense strand of RNA is produced. The nucleic acid need not be the entire sequence of the gene to be repressed, but typically will be substantially complementary to at least a portion of the sense strand of the gene to be repressed.
- In another method, a nucleic acid can be transcribed into a ribozyme, or catalytic RNA, that affects expression of an mRNA. See, U.S. Pat. No. 6,423,885. Ribozymes can be designed to specifically pair with virtually any target RNA and cleave the phosphodiester backbone at a specific location, thereby functionally inactivating the target RNA. Heterologous nucleic acids can encode ribozymes designed to cleave particular mRNA transcripts, thus preventing expression of a polypeptide. Hammerhead ribozymes are useful for destroying particular mRNAs, although various ribozymes that cleave mRNA at site-specific recognition sequences can be used. Hammerhead ribozymes cleave mRNAs at locations dictated by flanking regions that form complementary base pairs with the target mRNA. The sole requirement is that the target RNA contains a 5′-UG-3′ nucleotide sequence. The construction and production of hammerhead ribozymes is known in the art. See, for example, U.S. Pat. No. 5,254,678 and WO 02/46449 and references cited therein. Hammerhead ribozyme sequences can be embedded in a stable RNA such as a transfer RNA (tRNA) to increase cleavage efficiency in vivo. Perriman et al., Proc. Natl. Acad. Sci. USA, 92(13):6175-6179 (1995); de Feyter and Gaudron, Methods in Molecular Biology, Vol. 74, Chapter 43, “Expressing Ribozymes in Plants”, Edited by Turner, P. C., Humana Press Inc., Totowa, N.J. RNA endoribonucleases which have been described, such as the one that occurs naturally in Tetrahymena thermophila, can be useful. See, for example, U.S. Pat. Nos. 4,987,071 and 6,423,885.
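- As an illustration of the target-site requirement described above, the following minimal Python sketch scans a target mRNA for 5′-UG-3′ dinucleotides and, for each site, derives two ribozyme arms complementary to the flanking target sequence. The function names, the arm length, and the example sequence are hypothetical and chosen only for illustration; the catalytic core of the ribozyme is not modeled.

```python
# Illustrative sketch only: names, arm length, and sequences are hypothetical.
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence written 5'->3'."""
    return rna.translate(RNA_COMPLEMENT)[::-1]


def find_ug_sites(mrna: str, arm_len: int = 8) -> list:
    """List 5'-UG-3' sites that have arm_len flanking nucleotides on both sides.

    Each entry holds the 0-based index of the U and two arms that are
    complementary to the target sequence upstream and downstream of the UG.
    """
    rna = mrna.upper().replace("T", "U")  # accept DNA-style input as well
    sites = []
    start = rna.find("UG")
    while start != -1:
        upstream = rna[max(0, start - arm_len):start]
        downstream = rna[start + 2:start + 2 + arm_len]
        # Keep only sites with full-length flanks available for base pairing.
        if len(upstream) == arm_len and len(downstream) == arm_len:
            sites.append({
                "position": start,
                "arm_pairing_upstream": reverse_complement(upstream),
                "arm_pairing_downstream": reverse_complement(downstream),
            })
        start = rna.find("UG", start + 1)
    return sites


if __name__ == "__main__":
    for site in find_ug_sites("AAAACCCCUGGGGGAAAA", arm_len=4):
        print(site)
```

- Candidate sites found this way would still need to be evaluated experimentally, since accessibility of the site within the folded target RNA is not considered in this sketch.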
- PTGS, e.g., RNAi, can also be used to inhibit the expression of a gene. For example, a construct can be prepared that includes a sequence that is transcribed into an RNA that can anneal to itself, e.g., a double stranded RNA having a stem-loop structure. In some embodiments, one strand of the stem portion of a double stranded RNA comprises a sequence that is similar or identical to the sense coding sequence or a fragment thereof of a lignin-modulating polypeptide, and that is from about 10 nucleotides to about 2,500 nucleotides in length. The length of the sequence that is similar or identical to the sense coding sequence can be from 10 nucleotides to 500 nucleotides, from 15 nucleotides to 300 nucleotides, from 20 nucleotides to 100 nucleotides, or from 25 nucleotides to 100 nucleotides. The other strand of the stem portion of a double stranded RNA comprises a sequence that is similar or identical to the antisense strand or a fragment thereof of the coding sequence of the lignin-modulating polypeptide, and can have a length that is shorter, the same as, or longer than the corresponding length of the sense sequence. In some cases, one strand of the stem portion of a double stranded RNA comprises a sequence that is similar or identical to the 3′ or 5′ untranslated region, or a fragment thereof, of an mRNA encoding a lignin-modulating polypeptide, and the other strand of the stem portion of the double stranded RNA comprises a sequence that is similar or identical to the sequence that is complementary to the 3′ or 5′ untranslated region, respectively, or a fragment thereof, of the mRNA encoding the lignin-modulating polypeptide. 
In other embodiments, one strand of the stem portion of a double stranded RNA comprises a sequence that is similar or identical to the sequence of an intron, or a fragment thereof, in the pre-mRNA encoding a lignin-modulating polypeptide, and the other strand of the stem portion comprises a sequence that is similar or identical to the sequence that is complementary to the sequence of the intron, or a fragment thereof, in the pre-mRNA.
- The loop portion of a double stranded RNA can be from 3 nucleotides to 5,000 nucleotides, e.g., from 3 nucleotides to 25 nucleotides, from 15 nucleotides to 1,000 nucleotides, from 20 nucleotides to 500 nucleotides, or from 25 nucleotides to 200 nucleotides. The loop portion of the RNA can include an intron or a fragment thereof. A double stranded RNA can have zero, one, two, three, four, five, six, seven, eight, nine, ten, or more stem-loop structures.
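- The stem-loop arrangement described above can be sketched as follows. This is a minimal illustration, assuming a DNA insert in which the sense fragment is followed by a loop and then by the reverse complement of the same fragment, so that the transcript can anneal to itself. The function name and example sequences are hypothetical, the length checks simply mirror the broadest ranges recited above, and the sketch covers only the case in which both stem strands have the same length.

```python
# Illustrative sketch only: function name and sequences are hypothetical.
DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return dna.translate(DNA_COMPLEMENT)[::-1]


def build_hairpin_insert(sense_fragment: str, loop: str) -> str:
    """Assemble sense stem + loop + antisense stem as one DNA insert."""
    sense = sense_fragment.upper()
    loop = loop.upper()
    # Ranges taken from the text: stem about 10 to 2,500 nt, loop 3 to 5,000 nt.
    if not 10 <= len(sense) <= 2500:
        raise ValueError("stem strand should be about 10 to 2,500 nucleotides")
    if not 3 <= len(loop) <= 5000:
        raise ValueError("loop should be 3 to 5,000 nucleotides")
    return sense + loop + reverse_complement(sense)


if __name__ == "__main__":
    print(build_hairpin_insert("ATGGCCAAGT", "TTCG"))
```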
- A construct including a sequence that is operably linked to a regulatory region and a transcription termination sequence, and that is transcribed into an RNA that can form a double stranded RNA, is transformed into plants as described herein. Methods for using RNAi to inhibit the expression of a gene are known to those of skill in the art. See, e.g., U.S. Pat. Nos. 5,034,323; 6,326,527; 6,452,067; 6,573,099; 6,753,139; and 6,777,588. See also WO 97/01952; WO 98/53083; WO 99/32619; WO 98/36083; and U.S. Patent Publications 20030175965, 20030175783, 20040214330, and 20030180945.
- Constructs containing regulatory regions operably linked to nucleic acid molecules in sense orientation can also be used to inhibit the expression of a gene. The transcription product can be similar or identical to the sense coding sequence, or a fragment thereof, of a truncated lignin-modulating polypeptide. The transcription product also can be unpolyadenylated, lack a 5′ cap structure, or contain an unspliceable intron. Methods of inhibiting gene expression using a full-length cDNA as well as a partial cDNA sequence are known in the art. See, e.g., U.S. Pat. No. 5,231,020.
- In some embodiments, a construct containing a nucleic acid having at least one strand that is a template for both sense and antisense sequences that are complementary to each other is used to inhibit the expression of a gene. The sense and antisense sequences can be part of a larger nucleic acid molecule or can be part of separate nucleic acid molecules having sequences that are not complementary. The sense or antisense sequence can be a sequence that is identical or complementary to the sequence of an mRNA, the 3′ or 5′ untranslated region of an mRNA, or an intron in a pre-mRNA encoding a lignin-modulating polypeptide, or a fragment of such sequences. In some embodiments, the sense or antisense sequence is identical or complementary to a sequence of the regulatory region that drives transcription of the gene encoding a lignin-modulating polypeptide. In each case, the sense sequence is the sequence that is complementary to the antisense sequence.
- The sense and antisense sequences can be a length greater than about 10 nucleotides (e.g., 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, or more nucleotides). For example, an antisense sequence can be 21 or 22 nucleotides in length. Typically, the sense and antisense sequences range in length from about 15 nucleotides to about 30 nucleotides, e.g., from about 18 nucleotides to about 28 nucleotides, or from about 21 nucleotides to about 25 nucleotides.
- In some embodiments, an antisense sequence is a sequence complementary to an mRNA sequence, or a fragment thereof, encoding a lignin-modulating polypeptide described herein. The sense sequence complementary to the antisense sequence can be a sequence present within the mRNA of the lignin-modulating polypeptide. Typically, sense and antisense sequences are designed to correspond to a 15-30 nucleotide sequence of a target mRNA such that the level of that target mRNA is reduced.
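- For illustration, candidate sense and antisense sequences of a chosen length within the 15-30 nucleotide range can be enumerated from a target mRNA as in the following Python sketch. The function names and the default length of 21 nucleotides are assumptions made for this example, and no ranking or specificity filtering of the candidates is attempted.

```python
# Illustrative sketch only: names and the default length are hypothetical.
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence written 5'->3'."""
    return rna.translate(RNA_COMPLEMENT)[::-1]


def sense_antisense_pairs(mrna: str, length: int = 21) -> list:
    """Return (offset, sense, antisense) for every window of the given length."""
    if not 15 <= length <= 30:
        raise ValueError("length should be from 15 to 30 nucleotides")
    rna = mrna.upper().replace("T", "U")  # accept cDNA-style input as well
    pairs = []
    for offset in range(len(rna) - length + 1):
        sense = rna[offset:offset + length]
        pairs.append((offset, sense, reverse_complement(sense)))
    return pairs


if __name__ == "__main__":
    for offset, sense, antisense in sense_antisense_pairs(
            "AUGGCCAAGUCCGGAUUCACGGAUC"):
        print(offset, sense, antisense)
```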
- In some embodiments, a construct containing a nucleic acid having at least one strand that is a template for more than one sense sequence (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10 or more sense sequences) can be used to inhibit the expression of a gene. Likewise, a construct containing a nucleic acid having at least one strand that is a template for more than one antisense sequence (e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10 or more antisense sequences) can be used to inhibit the expression of a gene. For example, a construct can contain a nucleic acid having at least one strand that is a template for two sense sequences and two antisense sequences. The multiple sense sequences can be identical or different, and the multiple antisense sequences can be identical or different. For example, a construct can have a nucleic acid having one strand that is a template for two identical sense sequences and two identical antisense sequences that are complementary to the two identical sense sequences. Alternatively, an isolated nucleic acid can have one strand that is a template for (1) two identical sense sequences 20 nucleotides in length, (2) one antisense sequence that is complementary to the two identical sense sequences 20 nucleotides in length, (3) a sense sequence 30 nucleotides in length, and (4) three identical antisense sequences that are complementary to the sense sequence 30 nucleotides in length. The constructs provided herein can be designed to have any arrangement of sense and antisense sequences. For example, two identical sense sequences can be followed by two identical antisense sequences or can be positioned between two identical antisense sequences.
- A nucleic acid having at least one strand that is a template for one or more sense and/or antisense sequences can be operably linked to a regulatory region to drive transcription of an RNA molecule containing the sense and/or antisense sequence(s). In addition, such a nucleic acid can be operably linked to a transcription terminator sequence, such as the terminator of the nopaline synthase (nos) gene. In some cases, two regulatory regions can direct transcription of two transcripts: one from the top strand, and one from the bottom strand. See, for example, Yan et al., Plant Physiol., 141:1508-1518 (2006). The two regulatory regions can be the same or different. The two transcripts can form double-stranded RNA molecules that induce degradation of the target RNA. In some cases, a nucleic acid can be positioned within a T-DNA or plant-derived transfer DNA (P-DNA) such that the left and right T-DNA border sequences, or the left and right border-like sequences of the P-DNA, flank or are on either side of the nucleic acid. See, US 2006/0265788. The nucleic acid sequence between the two regulatory regions can be from about 15 to about 300 nucleotides in length. In some embodiments, the nucleic acid sequence between the two regulatory regions is from about 15 to about 200 nucleotides in length, from about 15 to about 100 nucleotides in length, from about 15 to about 50 nucleotides in length, from about 18 to about 50 nucleotides in length, from about 18 to about 40 nucleotides in length, from about 18 to about 30 nucleotides in length, or from about 18 to about 25 nucleotides in length.
- C. Constructs/Vectors
- Recombinant constructs provided herein can be used to transform plants or plant cells in order to modulate lignin levels. A recombinant nucleic acid construct can comprise a nucleic acid encoding a lignin-modulating polypeptide as described herein, operably linked to a regulatory region suitable for expressing the lignin-modulating polypeptide in the plant or cell. Thus, a nucleic acid can comprise a coding sequence that encodes any of the lignin-modulating polypeptides as set forth in SEQ ID NOs: 9, 15, 21, or 27, or a variant thereof. Examples of nucleic acids encoding lignin-modulating polypeptides are set forth in SEQ ID NO:7, 8, 13, 14, 19, 20, 25, or 26. The lignin-modulating polypeptide encoded by a recombinant nucleic acid can be a native lignin-modulating polypeptide, or can be heterologous to the cell. In some cases, the recombinant construct contains a nucleic acid that inhibits expression of a lignin-modulating polypeptide, operably linked to a regulatory region. Examples of suitable regulatory regions are described in the section entitled “Regulatory Regions.”
- Vectors containing recombinant nucleic acid constructs such as those described herein also are provided. Suitable vector backbones include, for example, those routinely used in the art such as plasmids, viruses, artificial chromosomes, BACs, YACs, or PACs. Suitable expression vectors include, without limitation, plasmids and viral vectors derived from, for example, bacteriophage, baculoviruses, and retroviruses. Numerous vectors and expression systems are commercially available from such corporations as Novagen (Madison, Wis.), Clontech (Palo Alto, Calif.), Stratagene (La Jolla, Calif.), and Invitrogen/Life Technologies (Carlsbad, Calif.).
- The vectors provided herein also can include, for example, origins of replication, scaffold attachment regions (SARs), and/or markers. A marker gene can confer a selectable phenotype on a plant cell. For example, a marker can confer biocide resistance, such as resistance to an antibiotic (e.g., kanamycin, G418, bleomycin, or hygromycin), or an herbicide (e.g., glyphosate, chlorsulfuron or phosphinothricin). In addition, an expression vector can include a tag sequence designed to facilitate manipulation or detection (e.g., purification or localization) of the expressed polypeptide. Tag sequences, such as luciferase, β-glucuronidase (GUS), green fluorescent protein (GFP), glutathione S-transferase (GST), polyhistidine, c-myc, hemagglutinin, or Flag™ tag (Kodak, New Haven, Conn.) sequences typically are expressed as a fusion with the encoded polypeptide. Such tags can be inserted anywhere within the polypeptide, including at either the carboxyl or amino terminus.
- D. Regulatory Regions
- The choice of regulatory regions to be included in a recombinant construct depends upon several factors, including, but not limited to, efficiency, selectability, inducibility, desired expression level, and cell- or tissue-preferential expression. It is a routine matter for one of skill in the art to modulate the expression of a coding sequence by appropriately selecting and positioning regulatory regions relative to the coding sequence. Transcription of a nucleic acid can be modulated in a similar manner. Some suitable regulatory regions initiate transcription only, or predominantly, in certain cell types. Methods for identifying and characterizing regulatory regions in plant genomic DNA are known, including, for example, those described in the following references: Jordano et al., Plant Cell, 1:855-866 (1989); Bustos et al., Plant Cell, 1:839-854 (1989); Green et al., EMBO J., 7:4035-4044 (1988); Meier et al., Plant Cell, 3:309-316 (1991); and Zhang et al., Plant Physiology, 110:1069-1079 (1996).
- Examples of various classes of regulatory regions are described below. Some of the regulatory regions indicated below as well as additional regulatory regions are described in more detail in U.S. Patent Application Ser. Nos. 60/505,689; 60/518,075; 60/544,771; 60/558,869; 60/583,691; 60/619,181; 60/637,140; 60/757,544; 60/776,307; 10/957,569; 11/058,689; 11/172,703; 11/208,308; 11/274,890; 60/583,609; 60/612,891; 11/097,589; 11/233,726; 11/408,791; 11/414,142; 10/950,321; 11/360,017; PCT/US05/011105; PCT/US05/23639; PCT/US05/034308; PCT/US05/034343; PCT/US06/038236; PCT/US06/040572; and PCT/US07/62762.
- For example, the sequences of regulatory regions p326, YP0144, YP0190, p13879, YP0050, p32449, 21876, YP0158, YP0214, YP0380, PT0848, PT0633, YP0128, YP0275, PT0660, PT0683, PT0758, PT0613, PT0672, PT0688, PT0837, YP0092, PT0676, PT0708, YP0396, YP0007, YP0111, YP0103, YP0028, YP0121, YP0008, YP0039, YP0115, YP0119, YP0120, YP0374, YP0101, YP0102, YP0110, YP0117, YP0137, YP0285, YP0212, YP0097, YP0107, YP0088, YP0143, YP0156, PT0650, PT0695, PT0723, PT0838, PT0879, PT0740, PT0535, PT0668, PT0886, PT0585, YP0381, YP0337, PT0710, YP0356, YP0385, YP0384, YP0286, YP0377, PD1367, PT0863, PT0829, PT0665, PT0678, YP0086, YP0188, YP0263, PT0743 and YP0096 are set forth in the sequence listing of PCT/US06/040572; the sequence of regulatory region PT0625 is set forth in the sequence listing of PCT/US05/034343; the sequences of regulatory regions PT0623, YP0388, YP0087, YP0093, YP0108, YP0022 and YP0080 are set forth in the sequence listing of U.S. patent application Ser. No. 11/172,703; the sequence of regulatory region PR0924 is set forth in the sequence listing of PCT/US07/62762; and the sequences of regulatory regions p530c10, pOsFIE2-2, pOsMEA, pOsYp102, and pOsYp285 are set forth in the sequence listing of PCT/US06/038236.
- It will be appreciated that a regulatory region may meet criteria for one classification based on its activity in one plant species, and yet meet criteria for a different classification based on its activity in another plant species.
- i. Broadly Expressing Promoters
- A promoter can be said to be “broadly expressing” when it promotes transcription in many, but not necessarily all, plant tissues. For example, a broadly expressing promoter can promote transcription of an operably linked sequence in one or more of the shoot, shoot tip (apex), and leaves, but weakly or not at all in tissues such as roots or stems. As another example, a broadly expressing promoter can promote transcription of an operably linked sequence in one or more of the stem, shoot, shoot tip (apex), and leaves, but can promote transcription weakly or not at all in tissues such as reproductive tissues of flowers and developing seeds. Non-limiting examples of broadly expressing promoters that can be included in the nucleic acid constructs provided herein include the p326, YP0144, YP0190, p13879, YP0050, p32449, 21876, YP0158, YP0214, YP0380, PT0848, and PT0633 promoters. Additional examples include the cauliflower mosaic virus (CaMV) 35S promoter, the mannopine synthase (MAS) promoter, the 1′ or 2′ promoters derived from T-DNA of Agrobacterium tumefaciens, the figwort mosaic virus 34S promoter, actin promoters such as the rice actin promoter, and ubiquitin promoters such as the maize ubiquitin-1 promoter. In some cases, the CaMV 35S promoter is excluded from the category of broadly expressing promoters.
- ii. Root Promoters
- Root-active promoters confer transcription in root tissue, e.g., root endodermis, root epidermis, or root vascular tissues. In some embodiments, root-active promoters are root-preferential promoters, i.e., confer transcription only or predominantly in root tissue. Root-preferential promoters include the YP0128, YP0275, PT0625, PT0660, PT0683, and PT0758 promoters. Other root-preferential promoters include the PT0613, PT0672, PT0688, and PT0837 promoters, which drive transcription primarily in root tissue and to a lesser extent in ovules and/or seeds. Other examples of root-preferential promoters include the root-specific subdomains of the CaMV 35S promoter (Lam et al., Proc. Natl. Acad. Sci. USA, 86:7890-7894 (1989)), root cell specific promoters reported by Conkling et al., Plant Physiol., 93:1203-1211 (1990), and the tobacco RD2 promoter.
- iii. Maturing Endosperm Promoters
- In some embodiments, promoters that drive transcription in maturing endosperm can be useful. Transcription from a maturing endosperm promoter typically begins after fertilization and occurs primarily in endosperm tissue during seed development and is typically highest during the cellularization phase. Most suitable are promoters that are active predominantly in maturing endosperm, although promoters that are also active in other tissues can sometimes be used. Non-limiting examples of maturing endosperm promoters that can be included in the nucleic acid constructs provided herein include the napin promoter, the Arcelin-5 promoter, the phaseolin promoter (Bustos et al., Plant Cell, 1(9):839-853 (1989)), the soybean trypsin inhibitor promoter (Riggs et al., Plant Cell, 1(6):609-621 (1989)), the ACP promoter (Baerson et al., Plant Mol. Biol., 22(2):255-267 (1993)), the stearoyl-ACP desaturase promoter (Slocombe et al., Plant Physiol., 104(4):167-176 (1994)), the soybean α′ subunit of β-conglycinin promoter (Chen et al., Proc. Natl. Acad. Sci. USA, 83:8560-8564 (1986)), the oleosin promoter (Hong et al., Plant Mol. Biol., 34(3):549-555 (1997)), and zein promoters, such as the 15 kD zein promoter, the 16 kD zein promoter, 19 kD zein promoter, 22 kD zein promoter and 27 kD zein promoter. Also suitable are the Osgt-1 promoter from the rice glutelin-1 gene (Zheng et al., Mol. Cell. Biol., 13:5829-5842 (1993)), the beta-amylase promoter, and the barley hordein promoter. Other maturing endosperm promoters include the YP0092, PT0676, and PT0708 promoters.
- iv. Photosynthetic Tissue Promoters
- Promoters active in photosynthetic tissue confer transcription in green tissues such as leaves and stems. Most suitable are promoters that drive expression only or predominantly in such tissues. Examples of such promoters include the ribulose-1,5-bisphosphate carboxylase (RbcS) promoters such as the RbcS promoter from eastern larch (Larix laricina), the pine cab6 promoter (Yamamoto et al., Plant Cell Physiol., 35:773-778 (1994)), the Cab-1 promoter from wheat (Fejes et al., Plant Mol. Biol., 15:921-932 (1990)), the CAB-1 promoter from spinach (Lubberstedt et al., Plant Physiol., 104:997-1006 (1994)), the cab1R promoter from rice (Luan et al., Plant Cell, 4:971-981 (1992)), the pyruvate orthophosphate dikinase (PPDK) promoter from corn (Matsuoka et al., Proc. Natl. Acad. Sci. USA, 90:9586-9590 (1993)), the tobacco Lhcb1*2 promoter (Cerdan et al., Plant Mol. Biol., 33:245-255 (1997)), the Arabidopsis thaliana SUC2 sucrose-H+ symporter promoter (Truernit et al., Planta, 196:564-570 (1995)), and thylakoid membrane protein promoters from spinach (psaD, psaF, psaE, PC, FNR, atpC, atpD, cab, rbcS). Other photosynthetic tissue promoters include PT0535, PT0668, PT0886, YP0144, YP0380 and PT0585.
- v. Vascular Tissue Promoters
- Examples of promoters that have high or preferential activity in vascular bundles include YP0087, YP0093, YP0108, YP0022, and YP0080. Other vascular tissue-preferential promoters include the glycine-rich cell wall protein GRP 1.8 promoter (Keller and Baumgartner, Plant Cell, 3(10):1051-1061 (1991)), the Commelina yellow mottle virus (CoYMV) promoter (Medberry et al., Plant Cell, 4(2):185-192 (1992)), and the rice tungro bacilliform virus (RTBV) promoter (Dai et al., Proc. Natl. Acad. Sci. USA, 101(2):687-692 (2004)).
- vi. Inducible Promoters
- Inducible promoters confer transcription in response to external stimuli such as chemical agents or environmental stimuli. For example, inducible promoters can confer transcription in response to hormones such as gibberellic acid or ethylene, or in response to light or drought. Examples of drought-inducible promoters include YP0380, PT0848, YP0381, YP0337, PT0633, YP0374, PT0710, YP0356, YP0385, YP0396, YP0388, YP0384, PT0688, YP0286, YP0377, PD1367, and PD0901. Examples of nitrogen-inducible promoters include PT0863, PT0829, PT0665, and PT0886. Examples of shade-inducible promoters include PR0924 and PT0678. An example of a promoter induced by salt is rd29A (Kasuga et al. (1999) Nature Biotech 17: 287-291).
- vii. Basal Promoters
- A basal promoter is the minimal sequence necessary for assembly of a transcription complex required for transcription initiation. Basal promoters frequently include a “TATA box” element that may be located between about 15 and about 35 nucleotides upstream from the site of transcription initiation. Basal promoters also may include a “CCAAT box” element (typically the sequence CCAAT) and/or a GGGCG sequence, which can be located between about 40 and about 200 nucleotides, typically about 60 to about 120 nucleotides, upstream from the transcription start site.
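- The positional criterion for a TATA box described above can be illustrated with the following Python sketch, which reports TATA-like elements whose first nucleotide lies between 15 and 35 nucleotides upstream of a given transcription initiation site. The simplified pattern `TATA[AT]A`, the choice to measure distance from the first nucleotide of the element, and all names are assumptions made for this example rather than definitions taken from this disclosure.

```python
import re

# Simplified TATA-like pattern; an assumption made for illustration only.
# The lookahead form lets overlapping candidate elements be reported.
TATA_LOOKAHEAD = re.compile(r"(?=TATA[AT]A)")


def tata_boxes_near_tss(promoter: str, tss_index: int,
                        min_upstream: int = 15, max_upstream: int = 35) -> list:
    """Return (start, distance) for TATA-like elements in the upstream window.

    tss_index is the 0-based index of the transcription initiation site in
    the supplied sequence; distance is measured from the first nucleotide
    of the element to that site.
    """
    seq = promoter.upper()
    hits = []
    for match in TATA_LOOKAHEAD.finditer(seq):
        distance = tss_index - match.start()
        if min_upstream <= distance <= max_upstream:
            hits.append((match.start(), distance))
    return hits


if __name__ == "__main__":
    # Hypothetical sequence: 20 nt, a TATA-like element, then 24 nt to the TSS.
    example = "G" * 20 + "TATAAA" + "C" * 24
    print(tata_boxes_near_tss(example, tss_index=50))
```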
- viii. Stem Promoters
- A stem promoter may be specific to one or more stem tissues or specific to stem and other plant parts. Stem promoters may have high or preferential activity in, for example, epidermis and cortex, vascular cambium, procambium, or xylem. Examples of stem promoters include YP0018, which is disclosed in US20060015970, and CryIA(b) and CryIA(c) (Braga et al. 2003, Journal of New Seeds 5:209-221).
- ix. Other Promoters
- Other classes of promoters include, but are not limited to, shoot-preferential, callus-preferential, trichome cell-preferential, guard cell-preferential such as PT0678, tuber-preferential, parenchyma cell-preferential, and senescence-preferential promoters. Promoters designated YP0086, YP0188, YP0263, PT0758, PT0743, PT0829, YP0119, and YP0096, as described in the above-referenced patent applications, may also be useful.
- x. Other Regulatory Regions
- A 5′ untranslated region (UTR) can be included in nucleic acid constructs described herein. A 5′ UTR is transcribed, but is not translated, and lies between the start site of the transcript and the translation initiation codon and may include the +1 nucleotide. A 3′ UTR can be positioned between the translation termination codon and the end of the transcript. UTRs can have particular functions such as increasing mRNA stability or attenuating translation. Examples of 3′ UTRs include, but are not limited to, polyadenylation signals and transcription termination sequences, e.g., a nopaline synthase termination sequence.
- It will be understood that more than one regulatory region may be present in a recombinant polynucleotide, e.g., introns, enhancers, upstream activation regions, transcription terminators, and inducible elements. Thus, for example, more than one regulatory region can be operably linked to the sequence of a polynucleotide encoding a truncated lignin-modulating polypeptide.
- Regulatory regions, such as promoters for endogenous genes, can be obtained by chemical synthesis or by subcloning from a genomic DNA that includes such a regulatory region. A nucleic acid comprising such a regulatory region can also include flanking sequences that contain restriction enzyme sites that facilitate subsequent manipulation.
- The invention also features transgenic plant cells and plants comprising at least one recombinant nucleic acid construct described herein. A plant or plant cell can be transformed by having a construct integrated into its genome, i.e., can be stably transformed. Stably transformed cells typically retain the introduced nucleic acid with each cell division. A plant or plant cell can also be transiently transformed such that the construct is not integrated into its genome. Transiently transformed cells typically lose all or some portion of the introduced nucleic acid construct with each cell division such that the introduced nucleic acid cannot be detected in daughter cells after a sufficient number of cell divisions. Both transiently transformed and stably transformed transgenic plants and plant cells can be useful in the methods described herein.
- Transgenic plant cells used in methods described herein can constitute part or all of a whole plant. Such plants can be grown in a manner suitable for the species under consideration, either in a growth chamber, a greenhouse, or in a field. Transgenic plants can be bred as desired for a particular purpose, e.g., to introduce a recombinant nucleic acid into other lines, to transfer a recombinant nucleic acid to other species, or for further selection of other desirable traits. Alternatively, transgenic plants can be propagated vegetatively for those species amenable to such techniques. As used herein, a transgenic plant also refers to progeny of an initial transgenic plant provided the progeny inherits the transgene. Seeds produced by a transgenic plant can be grown and then selfed (or outcrossed and selfed) to obtain seeds homozygous for the nucleic acid construct.
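- As a rough illustration of why selfing yields seeds homozygous for the construct, the following Python sketch computes expected genotype fractions after repeated selfing. It assumes, purely for illustration, a single insertion locus, a hemizygous starting plant, ordinary Mendelian segregation, and no selection between generations; the function name is hypothetical.

```python
from fractions import Fraction


def selfing_genotype_fractions(generations: int) -> dict:
    """Expected genotype fractions after selfing a hemizygous plant.

    Assumes one insertion locus, Mendelian segregation, and no selection.
    Each round of selfing halves the hemizygous fraction; the remainder is
    split equally between homozygous-transgene and null segregant classes.
    """
    if generations < 0:
        raise ValueError("generations must be non-negative")
    hemizygous = Fraction(1, 2) ** generations
    homozygous = (1 - hemizygous) / 2
    return {
        "homozygous": homozygous,
        "hemizygous": hemizygous,
        "null": homozygous,
    }


if __name__ == "__main__":
    for n in range(4):
        print(n, selfing_genotype_fractions(n))
```

- Under these assumptions, one quarter of the seeds from the first selfing are expected to be homozygous for the construct; in practice, progeny are typically screened to identify such plants.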
- Transgenic plants can be grown in suspension culture, or tissue or organ culture. For the purposes of this invention, solid and/or liquid tissue culture techniques can be used. When using solid medium, transgenic plant cells can be placed directly onto the medium or can be placed onto a filter that is then placed in contact with the medium. When using liquid medium, transgenic plant cells can be placed onto a flotation device, e.g., a porous membrane that contacts the liquid medium. A solid medium can be, for example, Murashige and Skoog (MS) medium containing agar and a suitable concentration of an auxin, e.g., 2,4-dichlorophenoxyacetic acid (2,4-D), and a suitable concentration of a cytokinin, e.g., kinetin.
- When transiently transformed plant cells are used, a reporter sequence encoding a reporter polypeptide having a reporter activity can be included in the transformation procedure and an assay for reporter activity or expression can be performed at a suitable time after transformation. A suitable time for conducting the assay typically is about 1-21 days after transformation, e.g., about 1-14 days, about 1-7 days, or about 1-3 days. The use of transient assays is particularly convenient for rapid analysis in different species, or to confirm expression of a heterologous lignin-modulating polypeptide whose expression has not previously been confirmed in particular recipient cells.
- Techniques for introducing nucleic acids into monocotyledonous and dicotyledonous plants are known in the art, and include, without limitation, Agrobacterium-mediated transformation, viral vector-mediated transformation, electroporation and particle gun transformation, e.g., U.S. Pat. Nos. 5,538,880; 5,204,253; 6,329,571 and 6,013,863. If a cell or cultured tissue is used as the recipient tissue for transformation, plants can be regenerated from transformed cultures if desired, by techniques known to those skilled in the art.
- A population of transgenic plants can be screened and/or selected for those members of the population that have a trait or phenotype conferred by expression of the transgene. For example, a population of progeny of a single transformation event can be screened for those plants having a desired level of expression of a lignin-modulating polypeptide or nucleic acid. Physical and biochemical methods can be used to identify expression levels. These include Southern analysis or PCR amplification for detection of a polynucleotide; Northern blots, S1 RNase protection, primer-extension, or RT-PCR amplification for detecting RNA transcripts; enzymatic assays for detecting enzyme or ribozyme activity of polypeptides and polynucleotides; and protein gel electrophoresis, Western blots, immunoprecipitation, and enzyme-linked immunoassays to detect polypeptides. Other techniques such as in situ hybridization, enzyme staining, and immunostaining also can be used to detect the presence or expression of polypeptides and/or polynucleotides. Methods for performing all of the referenced techniques are known. As an alternative, a population of plants comprising independent transformation events can be screened for those plants having a desired trait, such as a modulated level of lignin. Selection and/or screening can be carried out over one or more generations, and/or in more than one geographic location. In some cases, transgenic plants can be grown and selected under conditions which induce a desired phenotype or are otherwise necessary to produce a desired phenotype in a transgenic plant. In addition, selection and/or screening can be applied during a particular developmental stage in which the phenotype is expected to be exhibited by the plant. Selection and/or screening can be carried out to choose those transgenic plants having a statistically significant difference in lignin level relative to a control plant that lacks the transgene. 
Selected or screened transgenic plants have an altered phenotype as compared to a corresponding control plant, as described in the “Transgenic Plant Phenotypes” section herein.
- The polynucleotides and vectors described herein can be used to transform a number of monocotyledonous and dicotyledonous plants and plant cell systems, including species from one of the following families: Acanthaceae, Alliaceae, Alstroemeriaceae, Amaryllidaceae, Apocynaceae, Arecaceae, Asteraceae, Berberidaceae, Bixaceae, Brassicaceae, Bromeliaceae, Cannabaceae, Caryophyllaceae, Cephalotaxaceae, Chenopodiaceae, Colchicaceae, Cucurbitaceae, Dioscoreaceae, Ephedraceae, Erythroxylaceae, Euphorbiaceae, Fabaceae, Lamiaceae, Linaceae, Lycopodiaceae, Malvaceae, Melanthiaceae, Musaceae, Myrtaceae, Nyssaceae, Papaveraceae, Pinaceae, Plantaginaceae, Poaceae, Rosaceae, Rubiaceae, Salicaceae, Sapindaceae, Solanaceae, Taxaceae, Theaceae, or Vitaceae.
- Suitable species may include members of the genera Abelmoschus, Abies, Acer, Agrostis, Allium, Alstroemeria, Ananas, Andrographis, Andropogon, Artemisia, Arundo, Atropa, Berberis, Beta, Bixa, Brassica, Calendula, Camellia, Camptotheca, Cannabis, Capsicum, Carthamus, Catharanthus, Cephalotaxus, Chrysanthemum, Cinchona, Citrullus, Coffea, Colchicum, Coleus, Cucumis, Cucurbita, Cynodon, Datura, Dianthus, Digitalis, Dioscorea, Elaeis, Ephedra, Erianthus, Erythroxylum, Eucalyptus, Festuca, Fragaria, Galanthus, Glycine, Gossypium, Helianthus, Hevea, Hordeum, Hyoscyamus, Jatropha, Lactuca, Linum, Lolium, Lupinus, Lycopersicon, Lycopodium, Manihot, Medicago, Mentha, Miscanthus, Musa, Nicotiana, Oryza, Panicum, Papaver, Parthenium, Pennisetum, Petunia, Phalaris, Phleum, Pinus, Poa, Poinsettia, Populus, Rauwolfia, Ricinus, Rosa, Saccharum, Salix, Sanguinaria, Scopolia, Secale, Solanum, Sorghum, Spartina, Spinacia, Tanacetum, Taxus, Theobroma, Triticosecale, Triticum, Uniola, Veratrum, Vinca, Vitis, and Zea.
- Suitable species include Panicum spp., Sorghum spp., Miscanthus spp., Saccharum spp., Erianthus spp., Populus spp., Andropogon gerardii (big bluestem), Pennisetum purpureum (elephant grass), Phalaris arundinacea (reed canarygrass), Cynodon dactylon (bermudagrass), Festuca arundinacea (tall fescue), Spartina pectinata (prairie cord-grass), Medicago sativa (alfalfa), Arundo donax (giant reed), Secale cereale (rye), Salix spp. (willow), Eucalyptus spp. (eucalyptus), Triticosecale (triticale; wheat × rye), and bamboo.
- Suitable species also include Helianthus annuus (sunflower), Carthamus tinctorius (safflower), Jatropha curcas (jatropha), Ricinus communis (castor), Elaeis guineensis (palm), Linum usitatissimum (flax), and Brassica juncea.
- Suitable species also include Beta vulgaris (sugarbeet) and Manihot esculenta (cassava).
- Suitable species also include Lycopersicon esculentum (tomato), Lactuca sativa (lettuce), Musa paradisiaca (banana), Solanum tuberosum (potato), Brassica oleracea (broccoli, cauliflower, Brussels sprouts), Camellia sinensis (tea), Fragaria ananassa (strawberry), Theobroma cacao (cocoa), Coffea arabica (coffee), Vitis vinifera (grape), Ananas comosus (pineapple), Capsicum annuum (hot & sweet pepper), Allium cepa (onion), Cucumis melo (melon), Cucumis sativus (cucumber), Cucurbita maxima (squash), Cucurbita moschata (squash), Spinacia oleracea (spinach), Citrullus lanatus (watermelon), Abelmoschus esculentus (okra), and Solanum melongena (eggplant).
- Suitable species also include Rosa spp. (rose), Dianthus caryophyllus (carnation), Petunia spp. (petunia) and Poinsettia pulcherrima (poinsettia).
- Suitable species also include Nicotiana tabacum (tobacco), Lupinus albus (lupin), Uniola paniculata (sea oats), Agrostis spp. (bentgrass), Populus tremuloides (aspen), Pinus spp. (pine), Abies spp. (fir), Acer spp. (maple), Hordeum vulgare (barley), Poa pratensis (bluegrass), Lolium spp. (ryegrass), and Phleum pratense (timothy).
- Thus, the methods and compositions can be used over a broad range of plant species, including species from the dicot genera Brassica, Carthamus, Glycine, Gossypium, Helianthus, Jatropha, Parthenium, Populus, and Ricinus; and the monocot genera Elaeis, Festuca, Hordeum, Lolium, Oryza, Panicum, Pennisetum, Phleum, Poa, Saccharum, Secale, Sorghum, Triticosecale, Triticum, and Zea. In some embodiments, a plant is a member of the species Panicum virgatum (switchgrass), Sorghum bicolor (sorghum, sudangrass), Miscanthus giganteus (miscanthus), Saccharum sp. (energycane), Populus balsamifera (poplar), Zea mays (corn), Glycine max (soybean), Brassica napus (canola), Triticum aestivum (wheat), Gossypium hirsutum (cotton), Oryza sativa (rice), Helianthus annuus (sunflower), Medicago sativa (alfalfa), Beta vulgaris (sugarbeet), or Pennisetum glaucum (pearl millet).
- In certain embodiments, the polynucleotides and vectors described herein can be used to transform a number of monocotyledonous and dicotyledonous plants and plant cell systems, wherein such plants are hybrids of different species or varieties of a specific species (e.g., Saccharum sp.×Miscanthus sp.).
- In some embodiments, the truncated sorghum CAD sequences of the methods and compositions described herein are from wild, weedy, or cultivated sorghum species such as, but not limited to, Sorghum almum, Sorghum amplum, Sorghum angustum, Sorghum arundinaceum, Sorghum bicolor (such as bicolor, guinea, caudatum, kafir, and durra), Sorghum brachypodum, Sorghum bulbosum, Sorghum burmahicum, Sorghum controversum, Sorghum drummondii, Sorghum ecarinatum, Sorghum exstans, Sorghum grande, Sorghum halepense, Sorghum interjectum, Sorghum intrans, Sorghum laxiflorum, Sorghum leiocladum, Sorghum macrospermum, Sorghum matarankense, Sorghum miliaceum, Sorghum nigrum, Sorghum nitidum, Sorghum plumosum, Sorghum propinquum, Sorghum purpureosericeum, Sorghum stipoideum, Sorghum sudanense, Sorghum timorense, Sorghum trichocladum, Sorghum versicolor, Sorghum virgatum, Sorghum vulgare, or hybrids such as Sorghum×almum, or Sorghum×drummondii.
- In some embodiments, a plant in which expression of at least one lignin-modulating polypeptide is modulated can have decreased levels of lignin. For example, a lignin-modulating polypeptide described herein can be expressed in a transgenic plant, resulting in decreased levels of lignin. Decreased levels of lignin may mean decreased levels of total lignin, and/or ratios of syringyl lignin, guaiacyl lignin, and p-hydroxyphenyl lignin monomers. The lignin level can be decreased by at least 2 percent, e.g., 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 45, 50, 55, 60, or more than 60 percent, as compared to the lignin level in a corresponding control plant that does not express the transgene. In some embodiments, a plant in which expression of a lignin-modulating polypeptide is modulated can have decreased levels of lignin in harvestable biomass. Decreases in lignin in such plants can provide improved biomass to biofuel conversion. In some embodiments, a plant in which expression of a lignin-modulating polypeptide is modulated can have increased or decreased levels of lignin in one or more plant tissues, e.g., leaf tissues, or stem tissues. In some embodiments, a truncated CAD described herein is transformed into and expressed in sorghum that is already positive for one or more alleles encoding truncated polypeptides of CAD and/or COMT. In such embodiments, lignin content may be further decreased from the content found in the parent plants. Lignin content of a sample can be analyzed using methods standard in the art.
- Typically, a difference in the amount of lignin in a transgenic plant or cell relative to a control plant or cell is considered statistically significant at p≦0.05 with an appropriate parametric or non-parametric statistic, e.g., Chi-square test, Student's t-test, Mann-Whitney test, or F-test. In some embodiments, a difference in the amount of lignin is statistically significant at p<0.01, p<0.005, or p<0.001. A statistically significant difference in, for example, the amount of lignin in a transgenic plant compared to the amount in cells of a control plant indicates that the recombinant nucleic acid present in the transgenic plant results in altered lignin levels.
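- As an illustration of the kind of non-parametric comparison named above, the following Python sketch computes an exact two-sided Mann-Whitney test by enumerating every way of splitting the pooled measurements into two groups. The lignin values, variable names, and function names are hypothetical and chosen only for illustration; exhaustive enumeration is practical only for small samples.

```python
from itertools import combinations


def u_statistic(group_a, group_b):
    """Mann-Whitney U for group_a: pairs where a > b count 1, ties count 0.5."""
    u = 0.0
    for a in group_a:
        for b in group_b:
            if a > b:
                u += 1.0
            elif a == b:
                u += 0.5
    return u


def mann_whitney_exact(group_a, group_b):
    """Return (U, exact two-sided p-value) by enumerating all group splits."""
    pooled = list(group_a) + list(group_b)
    n_a, n_b = len(group_a), len(group_b)
    center = n_a * n_b / 2.0          # expected U under the null hypothesis
    observed = u_statistic(group_a, group_b)
    extreme = 0
    total = 0
    for idx in combinations(range(len(pooled)), n_a):
        chosen = set(idx)
        a = [pooled[i] for i in chosen]
        b = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        # Count splits at least as far from the null expectation as observed.
        if abs(u_statistic(a, b) - center) >= abs(observed - center) - 1e-12:
            extreme += 1
    return observed, extreme / total


# Hypothetical lignin content (percent of dry weight), five plants per group.
control = [21.4, 22.1, 20.9, 21.8, 22.5]
transgenic = [18.2, 17.9, 19.1, 18.6, 17.5]

u, p = mann_whitney_exact(transgenic, control)
print(f"U = {u}, p = {p:.4f}")
print("significant at p<=0.05:", p <= 0.05)
```

With these made-up values every transgenic measurement lies below every control measurement, so only 2 of the 252 possible splits are as extreme as the observed one.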
- The phenotype of a transgenic plant is evaluated relative to a control plant. A plant is said “not to express” a polypeptide when the plant exhibits less than 10%, e.g., less than 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, 0.5%, 0.1%, 0.01%, or 0.001%, of the amount of polypeptide or mRNA encoding the polypeptide exhibited by the plant of interest. Expression can be evaluated using methods including, for example, RT-PCR, Northern blots, S1 RNase protection, primer extension, Western blots, protein gel electrophoresis, immunoprecipitation, enzyme-linked immunoassays, chip assays, and mass spectrometry. It should be noted that if a polypeptide is expressed under the control of a tissue-preferential or broadly expressing promoter, expression can be evaluated in the entire plant or in a selected tissue. Similarly, if a polypeptide is expressed at a particular time, e.g., at a particular time in development or upon induction, expression can be evaluated selectively at a desired time period.
- In some embodiments, the transgenic or non-transgenic plants identified or produced by the methods described herein have modulated lignin content in comparison to plants that do not comprise endogenous or exogenous genes encoding at least one truncated CAD allele. In such embodiments, the lignin content can be decreased by about 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 50, 60, 70, or 80 percent. In some embodiments, the transgenic or non-transgenic plants identified or produced by the methods described herein have modified yield of fermentable sugars in comparison to plants that do not comprise endogenous or exogenous genes encoding at least one truncated CAD allele. Such sorghum plants having one or more truncated CAD alleles as described herein have an increase in the yield of fermentable sugars, such as but not limited to, glucose, arabinose, fructose, galactose, or xylose, wherein the yield is increased by about 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, or 90 percent. In some embodiments, the transgenic plants described herein or the non-transgenic plants identified or produced by the methods described herein have altered lignin in comparison to plants that do not comprise endogenous or exogenous genes encoding at least one truncated CAD allele. In some embodiments, the altered lignin has a decrease in guaiacyl and syringyl residues. In some embodiments, the developmental gradient of lignin is altered. In some embodiments, the cell wall composition is altered. In some embodiments, lignin subunit composition is altered.
- In some embodiments, the transgenic plants described herein or the non-transgenic plants identified or produced by the methods described herein comprise one or more truncated CAD sequences and one or more truncated COMT sequences.
- The ability to characterize an individual by its genome is based on differences in nucleotide sequences among individuals. Typically, genetic markers are polymorphic regions of a genome and the complementary oligonucleotides which bind to these regions. The major causes of polymorphisms, and thus the major sources of genetic markers, are insertions (additions), deletions, nucleotide substitutions (point mutations), recombination events, and transposable elements within the genome of individuals in a plant population. As one example, point mutations can result from errors in DNA replication or damage to the DNA. As another example, insertions and deletions can result from inaccurate recombination events. As yet another example, variation can arise from the insertion or excision of a transposable element (a DNA sequence that has the ability to move or to jump to new locations within the genome, autonomously or non-autonomously).
- Described herein are methods and kits for determining the genotype of a sorghum plant comprising detecting in the genome of the plant at least a first polymorphism at a CAD locus. The methods, in certain embodiments, comprise detecting a plurality of polymorphisms in the genome of the plant. The method may further comprise storing the results of the step of detecting the plurality of polymorphisms on a computer readable medium. The invention further provides a computer readable medium produced by such a method. In one embodiment, described herein is a method for identifying sorghum plant lines with a truncated CAD comprising supplying a nucleic acid sample from a sorghum plant, providing amplification primers for amplifying a region of a sorghum plant corresponding to a truncated CAD gene present in said nucleic acid sample, applying said amplification primers to said nucleic acid sample such that amplification of said region of said CAD gene occurs, and identifying sorghum plants having a truncated CAD based on the presence of one or more mutations that confer a truncation in said amplified nucleic acid sample.
- Polymorphisms may be detected by means known in the art. For example, molecular markers specific to CAD truncations can be used. Examples of molecular markers include oligonucleotides, single nucleotide polymorphisms (SNPs), multinucleotide polymorphisms, an insertion or a deletion of at least one nucleotide (indel), a simple sequence repeat (SSR), a restriction fragment length polymorphism (RFLP), an EST sequence or a unique nucleotide sequence of 20-40 bases used as a probe (oligonucleotides), a random amplified polymorphic DNA (RAPD) marker, or an amplified fragment length polymorphism (AFLP). As will be evident to one of skill, the number and type of markers required can differ. Markers can be used in conjunction with labeling or PCR to detect and score polymorphisms. Discovery, detection, and genotyping of various genetic markers have been well described in the literature. See, e.g., Henry, ed. (2001) Plant Genotyping: The DNA Fingerprinting of Plants Wallingford: CABI Publishing; Phillips and Vasil, eds. (2001) DNA-based Markers in Plants Dordrecht: Kluwer Academic Publishers; Pejic et al. (1998) “Comparative analysis of genetic similarity among maize inbred lines detected by RFLPs, RAPDs, SSRs and AFLPs” Theor. Appl. Genet. 97: 1248-1255; Bhattramakki et al. (2002) “Insertion-deletion polymorphisms in 3′ regions of maize genes occur frequently and can be used as highly informative genetic markers” Plant Mol. Biol. 48: 539-47; Nickerson et al. (1997) “PolyPhred: automating the detection and genotyping of single nucleotide substitutions using fluorescence-based resequencing” Nucleic Acids Res. 25: 2745-2751; Underhill et al. (1997) “Detection of numerous Y chromosome biallelic polymorphisms by denaturing high-performance liquid chromatography” Genome Res. 7: 996-1005; Rafalski et al. (2002) “The genetic diversity of components of rye hybrids” Cell Mol Biol Lett 7: 471-5; Ching and Rafalski (2002) “Rapid genetic mapping of ESTs using SNP pyrosequencing and indel analysis” Cell Mol Biol Lett. 7: 803-10; and Powell et al. (1996) “The comparison of RFLP, RAPD, AFLP and SSR (microsatellite) markers for germplasm analysis” Mol. Breeding. 2: 225-238.
- In some embodiments, where nucleic acids are used to identify a truncated CAD, the nucleic acids can be shorter in length than the truncated CAD sequence, and comprise the truncating stop codon or a sequence complementary to the truncating stop codon. In some embodiments, the nucleic acids used to identify a truncated CAD terminate with the truncating stop codon or a sequence complementary to the truncating stop codon. In some embodiments, the nucleic acids used to identify a truncated CAD are about 4, 5, 6, 7, 8, 9, 10, 12, 14, 16, 18, 20, 25, 30, 35, 40, 45, 50, 55, or 60 nucleotides in length. Such polynucleotides may be used as primers or probes.
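- The idea of an oligonucleotide that terminates with the truncating stop codon, or with its complement, can be sketched in Python as follows. The toy sequence, the coordinates, and the function names are hypothetical and are not taken from any SEQ ID NO in this document.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
STOP_CODONS = {"TAA", "TAG", "TGA"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def probe_ending_at_stop(sequence, stop_start, length):
    """Return the `length`-nucleotide oligo whose last three bases are the
    stop codon that begins at 0-based index `stop_start`."""
    seq = sequence.upper()
    codon = seq[stop_start:stop_start + 3]
    if codon not in STOP_CODONS:
        raise ValueError(f"{codon!r} at index {stop_start} is not a stop codon")
    end = stop_start + 3
    if length < 3 or length > end:
        raise ValueError("length must cover the stop codon and fit the sequence")
    return seq[end - length:end]


# Hypothetical toy coding sequence with a premature TGA at index 12.
toy_mutant = "ATGGCTCCGGATTGAGCTTAA"

probe = probe_ending_at_stop(toy_mutant, stop_start=12, length=10)
print(probe)                      # TCCGGATTGA
print(reverse_complement(probe))  # TCAATCCGGA
```

The forward oligo ends with the stop codon itself, and its reverse complement begins with the sequence complementary to the stop codon, which would be the form used against the opposite strand.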
- In some embodiments, oligonucleotides specific to wild-type (wt) and mutant CAD alleles can be used to detect and score the genotype of a sorghum plant. For example, the CAD alleles of SEQ ID NOs: 7 and 13 can be detected and scored using SEQ ID NOs: 34 and/or 36. Such SNP sequences can be amplified in PCR reactions to detect and score genotypes of CAD alleles. In some embodiments, the polymorphism detected is a difference in a CAD nucleotide sequence which results in a stop codon. For example, SEQ ID NOs: 7 and 13 have single nucleotide differences that result in stop codons at positions 4089 and 2800, respectively. SNPs can be discovered and detected by any of a number of techniques known in the art. For example, SNPs can be detected by direct sequencing of DNA segments, e.g., amplified by PCR, from several individuals (see, e.g., Ching et al. (2002) “SNP frequency, haplotype structure and linkage disequilibrium in elite maize inbred lines” BMC Genetics 3: 19). As another example, SNPs can be discovered by computer analysis of available sequences (e.g., ESTs, STSs) derived from multiple genotypes (see, e.g., Marth et al. (1999) “A general approach to single-nucleotide polymorphism discovery” Nature Genetics 23: 452-456 and Buetow et al. (1999) “Reliable identification of large numbers of candidate SNPs from public EST data” Nature Genetics 21: 323-325). Indels, insertions or deletions of one or more nucleotides, can also be discovered by sequencing and/or computer analysis, e.g., simultaneously with SNP discovery. Similarly, SNPs can be genotyped by sequencing. SNPs can also be genotyped by various other methods (including high throughput methods) known in the art, for example, using DNA chips, allele-specific hybridization, allele-specific PCR, and primer extension techniques. See, e.g., Lindblad-Toh et al. (2000) “Large-scale discovery and genotyping of single-nucleotide polymorphisms in the mouse” Nature Genetics 24: 381-386; Bhattramakki and Rafalski (2001) “Discovery and application of single nucleotide polymorphism markers in plants” in Plant Genotyping: The DNA Fingerprinting of Plants, CABI Publishing; Syvanen (2001) “Accessing genetic variation: genotyping single nucleotide polymorphisms” Nat. Rev. Genet. 2: 930-942; Kuklin et al. (1998) “Detection of single-nucleotide polymorphisms with the WAVE™ DNA fragment analysis system” Genetic Testing 1: 201-206; Gut (2001) “Automation in genotyping single nucleotide polymorphisms” Hum. Mutat. 17: 475-492; Lemieux (2001) “Plant genotyping based on analysis of single nucleotide polymorphisms using microarrays” in Plant Genotyping: The DNA Fingerprinting of Plants, CABI Publishing; Edwards and Mogg (2001) “Plant genotyping by analysis of single nucleotide polymorphisms” in Plant Genotyping: The DNA Fingerprinting of Plants, CABI Publishing; Ahmadian et al. (2000) “Single-nucleotide polymorphism analysis by pyrosequencing” Anal. Biochem. 280: 103-110; Useche et al. (2001) “High-throughput identification, database storage and analysis of SNPs in EST sequences” Genome Inform Ser Workshop Genome Inform 12: 194-203; Pastinen et al. (2000) “A system for specific, high-throughput genotyping by allele-specific primer extension on microarrays” Genome Res. 10: 1031-1042; Hacia (1999) “Determination of ancestral alleles for human single-nucleotide polymorphisms using high-density oligonucleotide arrays” Nature Genet. 22: 164-167; and Chen et al. (2000) “Microsphere-based assay for single-nucleotide polymorphism analysis using single base chain extension” Genome Res. 10: 549-557. Multinucleotide polymorphisms can be discovered and detected by analogous methods.
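- A single nucleotide difference that creates a stop codon can be illustrated with a short Python sketch that compares a reference coding sequence with a variant and reports the first in-frame stop codon in each. The two toy sequences and their coordinates are hypothetical; they do not correspond to SEQ ID NOs: 7 or 13 or to the positions cited above.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_stop_codon(cds):
    """Return the 1-based position of the first in-frame stop codon, or None."""
    cds = cds.upper()
    for i in range(0, len(cds) - 2, 3):
        if cds[i:i + 3] in STOP_CODONS:
            return i + 1
    return None


def snp_positions(reference, variant):
    """List (1-based position, reference base, variant base) for each mismatch."""
    if len(reference) != len(variant):
        raise ValueError("sequences must be the same length to compare SNPs")
    pairs = zip(reference.upper(), variant.upper())
    return [(i + 1, r, v) for i, (r, v) in enumerate(pairs) if r != v]


# Hypothetical toy alleles: a G-to-A change turns codon TGG into TGA.
toy_wild_type = "ATGGCTCCGGATTGGGCTTAA"
toy_truncated = "ATGGCTCCGGATTGAGCTTAA"

print(snp_positions(toy_wild_type, toy_truncated))  # [(15, 'G', 'A')]
print(first_stop_codon(toy_wild_type))              # 19
print(first_stop_codon(toy_truncated))              # 13
```

In this toy case the variant allele would be scored as truncated because its first stop codon occurs earlier than the stop codon of the reference allele.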
- In some embodiments, where the CAD truncation is generated by mutagenesis, the CAD alleles are first sequenced and then oligonucleotides specific to the mutant sequence can be designed and synthesized based on the nucleic acid sequence. In some embodiments, where the CAD mutation is synthesized and introduced into a plant, oligonucleotides specific to the truncation can be designed and synthesized based on the nucleic acid sequence. Synthesized mutants may be based on the nucleotide sequence of any sorghum CAD allele.
- In some embodiments of the methods and kits described herein, one or more sets of oligonucleotides, each capable of recognizing the presence or absence of a specific and defined genomic position, are used. For organisms with more chromosomes, more oligonucleotides are desirable. The lower limit is one oligonucleotide pair and the upper limit is set by the desired resolution capacity of the method and the test kit. Hybridization of the oligonucleotides to DNA from the sorghum plant is preferably recorded in situ by any conventional labelling system, applying for instance terminal transferase and conventional recordable labels. As an alternative to in situ labelling, the hybridized sample DNA may be released from the solid support and subsequently hybridized with labelled polynucleotide sequences corresponding to each of the original oligonucleotide sequences attached to the solid support. Hybridization is optionally reversible and the solid support can be returned to its original state for reuse. A labelled dideoxynucleotide can be incorporated at the end of the oligonucleotide provided that the oligonucleotide is hybridized to genomic DNA as template. The nucleotide sequence at the genomic position adjacent to the region matching the oligonucleotide is known and therefore the particular nucleotide which will be incorporated (A, C, G, T or U) is known. Co-dominant scoring is achieved using paired (i.e., two) or parallel (i.e., three) flanking oligonucleotide sequences. The results obtained are recorded as full, empty, failure or null alleles and can be used to distinguish between heterozygous and/or homozygous genotypes. Optional post-hybridization treatments, including washing and digestion, are provided in order to remove sample DNA not fully hybridized to the solid support-attached oligonucleotide sequences, for example before and after labelling. The presence or absence of hybridization is recorded using a method allowing the recording of the hybridization state.
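- One way to picture co-dominant scoring with a pair of allele-specific oligonucleotides is the Python sketch below. The mapping from signals to genotype calls, the plant identifiers, and the function name are assumptions made for illustration; the text above does not prescribe a particular scoring scheme.

```python
def call_genotype(wild_type_signal, mutant_signal):
    """Turn presence/absence of two allele-specific signals into a call.

    Assumed scheme: both signals present means heterozygous, exactly one
    present means homozygous for that allele, and neither present is
    treated as a failed or null assay.
    """
    if wild_type_signal and mutant_signal:
        return "heterozygous"
    if wild_type_signal:
        return "homozygous wild-type"
    if mutant_signal:
        return "homozygous truncated"
    return "no call"


# Hypothetical hybridization results: (wild-type probe, truncated-allele probe).
results = {
    "plant_01": (True, False),
    "plant_02": (True, True),
    "plant_03": (False, True),
    "plant_04": (False, False),
}

calls = {plant: call_genotype(wt, mut) for plant, (wt, mut) in results.items()}
for plant, call in calls.items():
    print(plant, call)
```

Because both alleles are assayed in the same plant, heterozygotes can be told apart from either homozygote without a progeny test.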
- One or more of the methods of breeding described herein can be used with the sequences described herein. In particular, the primer pairs and probes described herein are of value in breeding programs because when incorporating the truncated CAD alleles into a different genetic background, such as an elite cultivar, a modified backcrossing scheme can be used, where the inheritance of the truncated CAD alleles is tracked with the primer pairs or probes. This eliminates the need for self-pollination to reveal the phenotype associated with homozygosity for a truncated CAD allele, and thus saves time and effort.
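- As a rough planning aid for marker-tracked backcrossing, the following Python sketch estimates how many progeny would need to be genotyped to find at least one carrier of the truncated CAD allele. It assumes simple Mendelian inheritance, which is not stated in the text: a parent heterozygous for the allele, crossed to a non-carrier, passes the allele to each progeny plant independently with probability 1/2. The function name and the confidence targets are illustrative.

```python
import math


def progeny_needed(confidence=0.95, carrier_probability=0.5):
    """Smallest number of progeny giving at least `confidence` probability
    that one or more of them carries the tracked allele."""
    if not 0 < confidence < 1 or not 0 < carrier_probability < 1:
        raise ValueError("probabilities must lie strictly between 0 and 1")
    # P(no carrier among n plants) = (1 - carrier_probability) ** n
    return math.ceil(math.log(1 - confidence) / math.log(1 - carrier_probability))


print(progeny_needed())       # 5
print(progeny_needed(0.99))   # 7
```

Under these assumptions, genotyping a handful of seedlings per generation is enough to carry the allele forward without waiting for a phenotype to appear.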
- Sorghum plants are bred in most cases by self pollination techniques. With the incorporation of male sterility (either genetic or cytoplasmic) cross pollination breeding techniques can also be utilized. Sorghum has a perfect flower with both male and female parts in the same flower located in the panicle. The flowers are usually in pairs on the panicle branches. Natural pollination occurs in sorghum when anthers (male flowers) open and pollen falls onto receptive stigma (female flowers). Because of the close proximity of male (anthers) and female (stigma) in the panicle, self pollination can be high. Cross pollination may occur when wind or convection currents move pollen from the anthers of one plant to receptive stigma on another plant. Cross pollination is greatly enhanced with incorporation of male sterility which renders male flowers nonviable without affecting the female flowers. Successful pollination in the case of male sterile flowers requires cross pollination.
- The development of sorghum hybrids requires the development of homozygous inbred lines, the crossing of these lines, and the evaluation of the crosses. Pedigree breeding methods, and to a lesser extent population breeding methods, are used to develop inbred lines from breeding populations. Breeding programs combine desirable traits from two or more inbred lines into breeding pools from which new inbred lines are developed by selfing and selection of desired phenotypes. The new inbreds are crossed with other inbred lines and the hybrids from these crosses are evaluated to determine which have commercial potential.
- Pedigree breeding starts with the crossing of two genotypes, each of which may have one or more desirable characteristics that are lacking in the other or that complement the other. If the two original parents do not provide all of the desired characteristics, other sources can be included in the breeding population. In the pedigree method, superior plants are selfed and selected in successive generations. In the succeeding generations the heterozygous condition gives way to homogeneous lines as a result of self-pollination and selection. Typically, in the pedigree method of breeding, five or more generations of selfing and selection are practiced: F1 to F2, F2 to F3, F3 to F4, F4 to F5, etc.
- Backcrossing can be used to improve an inbred line. Backcrossing transfers a specific desirable trait from one inbred or source to an inbred that lacks that trait. This can be accomplished for example by first crossing a superior inbred (A) (recurrent parent) to a donor inbred (non-recurrent parent), which carries the appropriate gene(s) for the trait in question. The progeny of this cross is then mated back to the superior recurrent parent (A) followed by selection in the resultant progeny for the desired trait to be transferred from the non-recurrent parent. After five or more backcross generations with selection for the desired trait, the progeny will be heterozygous for loci controlling the characteristic being transferred, but will be like the superior parent for most or almost all other genes. The last backcross generation would be selfed to give pure breeding progeny for the gene(s) being transferred.
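- The loss of heterozygosity over successive selfing generations can be approximated with a short Python sketch. It uses the textbook expectation that selfing halves heterozygosity each generation, and it assumes no selection and an F1 that is heterozygous at every locus at which the parents differ; the function name is illustrative.

```python
def expected_heterozygosity(generation):
    """Expected fraction of segregating loci still heterozygous in F<generation>.

    F1 is taken as fully heterozygous (1.0); each selfing generation
    halves the expected heterozygosity.
    """
    if generation < 1:
        raise ValueError("generation must be 1 (F1) or greater")
    return 0.5 ** (generation - 1)


for n in range(1, 7):
    print(f"F{n}: {expected_heterozygosity(n):.4f}")
```

Under these assumptions about 3 percent of the originally segregating loci are still expected to be heterozygous by the F6 generation, which is why five or more generations of selfing yield largely uniform lines.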
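- The statement that backcross progeny become like the recurrent parent for most other genes can be made concrete with a small Python sketch. It uses the standard expectation for loci unlinked to the selected gene, 1 - (1/2)^(n+1) after n backcrosses; linkage drag around the transferred gene is ignored, and the function name is illustrative.

```python
def recurrent_parent_fraction(backcrosses):
    """Expected fraction of the genome derived from the recurrent parent
    after the given number of backcrosses (0 means the initial F1)."""
    if backcrosses < 0:
        raise ValueError("backcrosses must be zero or greater")
    return 1.0 - 0.5 ** (backcrosses + 1)


for n in range(0, 6):
    label = "F1" if n == 0 else f"BC{n}"
    print(f"{label}: {recurrent_parent_fraction(n):.4f}")
```

By this expectation the fifth backcross generation is about 98.4 percent recurrent parent, which is consistent with the five or more backcross generations mentioned above.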
- A hybrid sorghum variety is the cross of two inbred lines, each of which may have one or more desirable characteristics lacked by the other or which complement the other. The hybrid progeny of the first generation is designated F1. In the development of hybrids only the F1 hybrid plants are sought. The hybrid is more vigorous than its inbred parents. This hybrid vigor, or heterosis, can be manifested in many ways, including increased vegetative growth and increased yield.
- The development of a hybrid sorghum variety involves five steps: (1) the formation of “restorer” and “non-restorer” germplasm pools; (2) the selection of superior plants from various “restorer” and “non-restorer” germplasm pools; (3) the selfing of the superior plants for several generations to produce a series of inbred lines, which although different from each other, each breed true and are highly uniform; (4) the conversion of inbred lines classified as non-restorers to cytoplasmic male sterile (CMS) forms, and (5) crossing the selected cytoplasmic male sterile (CMS) inbred lines with selected fertile inbred lines (restorer lines) to produce the hybrid progeny (F1).
- Because sorghum is normally a self pollinated plant and because both male and female flowers are in the same panicle, large numbers of hybrid seed can only be produced by using cytoplasmic male sterile (CMS) inbreds. Inbred male sterile lines are developed by converting inbred lines to CMS. This is achieved by transferring the chromosomes of the line to be sterilized into sterile cytoplasm by a series of backcrosses, using a male sterile line as a female parent and the line to be sterilized as the recurrent and pollen parent in all crosses. After conversion to male sterility the line is designated the (A) line. Lines with fertility restoring genes cannot be converted into male sterile A-lines. The original line is designated the (B) line.
- Flowers of the CMS inbred are fertilized with pollen from a male fertile inbred carrying genes which restore male fertility in the hybrid (F1) plants. An important consequence of the homozygosity and homogeneity of the inbred lines is that the hybrid between any two inbreds will always be the same. Once the inbreds that give the best hybrid have been identified, the hybrid seed can be reproduced indefinitely as long as the homogeneity of the inbred parent is maintained.
- A single cross hybrid is produced when two inbred lines are crossed to produce the F1 progeny. Much of the hybrid vigor exhibited by F1 hybrids is lost in the next generation (F2). Consequently, seed from hybrid varieties is not typically used for planting stock.
- Hybrid sorghum can be produced using wind to move the pollen. Alternating strips of the cytoplasmic male sterile inbred (female) and the male fertile inbred (male) are planted in the same field. Wind moves the pollen shed by the male inbred to receptive stigma on the female. Provided that there is sufficient isolation from sources of foreign sorghum pollen, the stigma of the male sterile inbred (female) will be fertilized only with pollen from the male fertile inbred (male). The resulting seed, borne on the male sterile (female) plants, is therefore hybrid and will form hybrid plants that have full fertility restored. In some embodiments, if the hybrid sorghum is used as forage or for biomass production, then it may be unnecessary to restore fertility.
- In some embodiments, inbred parental lines, elite breeding lines, or hybrid sorghum are bred by the methods described herein to comprise one or more alleles for which the CAD coding sequence is truncated relative to a wild-type CAD coding sequence and one or more alleles for which the COMT coding sequence is truncated relative to a wild-type COMT coding sequence. In some embodiments, the sorghum plants developed are high biomass varieties for biofuel production.
- In some embodiments, other breeding methods may be used in conjunction or as part of the methods described herein.
- Recurrent selection is a method used in a plant breeding program to improve a population of plants. The method entails individual plants cross pollinating with each other to form progeny. The progeny are grown and the superior progeny selected by any number of selection methods, which include individual plant, half-sib progeny, full-sib progeny and selfed progeny. The selected progeny are self pollinated or cross pollinated with each other to form progeny for another population. This population is planted and again superior plants are selected to self pollinate or cross pollinate with each other. Recurrent selection is a cyclical process and therefore can be repeated as many times as desired. The objective of recurrent selection is to improve the traits of a population. The improved population can then be used as a source of breeding material to obtain new varieties for commercial or breeding use, including the production of a synthetic cultivar. A synthetic cultivar is the resultant progeny formed by the intercrossing of several selected varieties. The number of parental plant varieties, populations, wild accessions, ecotypes, etc., that are used to generate a synthetic can vary from as few as 10 to as many as 500. Typically, about 100 to 300 varieties, populations, etc., are used as parents for the synthetic variety. Seed from the parental seed production plot of a synthetic variety can be sold to the farmer. Alternatively, seed from the parental seed production plot can subsequently undergo one or two generations of multiplication, depending on the amount of seed produced in the parental plot and the demand for seed.
- Mass selection is a useful technique when used in conjunction with molecular marker enhanced selection. In mass selection seeds from individuals are selected based on phenotype or genotype. These selected seeds are then bulked and used to grow the next generation. Bulk selection requires growing a population of plants in a bulk plot, allowing the plants to self-pollinate, harvesting the seed in bulk and then using a sample of the seed harvested in bulk to plant the next generation. Also, instead of self pollination, directed pollination could be used as part of the breeding program.
- Mutation breeding is another method of introducing new traits into sorghum. Mutations that occur spontaneously or are artificially induced can be useful sources of variability for a plant breeder. The goal of artificial mutagenesis is to increase the rate of mutation for a desired characteristic. Mutation rates can be increased by many different means, including temperature; long-term seed storage; tissue culture conditions; radiation, such as X-rays, gamma rays (e.g., cobalt 60 or cesium 137), neutrons (a product of nuclear fission of uranium 235 in an atomic reactor), beta radiation (emitted from radioisotopes such as phosphorus 32 or carbon 14), or ultraviolet radiation (such as from 250 to 290 nm, i.e., 2500 to 2900 Å); or chemical mutagens, such as base analogues (5-bromo-uracil), related compounds (8-ethoxy caffeine), antibiotics (streptonigrin), alkylating agents (sulfur mustards, nitrogen mustards, epoxides, ethylenamines, sulfates, sulfonates, sulfones, lactones), azide, hydroxylamine, nitrous acid, or acridines. Once a desired trait is observed through mutagenesis the trait may then be incorporated into existing germplasm by traditional breeding techniques. Details of mutation breeding can be found in Fehr, 1993. Principles of Cultivar Development, Macmillan Publishing Company. In addition, mutations created in other sorghum plants may be used to produce a backcross conversion of sorghum that comprises such mutation. In addition, mutations created in other lines may be used to produce a backcross conversion of elite lines that comprise such mutations.
- C. Breeding with Molecular Markers
- The plant genotyping techniques described herein may be used in marker-assisted plant breeding methods in sorghum. In addition, techniques such as Isozyme Electrophoresis, Arbitrarily Primed Polymerase Chain Reaction (AP-PCR), DNA Amplification Fingerprinting (DAF), and Sequence Characterized Amplified Regions (SCARs) can be used in marker-assisted breeding.
- One use of the plant genotyping techniques described herein is Quantitative Trait Loci (QTL) mapping. QTL mapping is the use of markers, which are known to be closely linked to alleles that have measurable effects on a quantitative trait. Selection in the breeding process is based upon the accumulation of markers linked to the positive effecting alleles and/or the elimination of the markers linked to the negative effecting alleles from the plant's genome.
- Molecular markers can also be used during the breeding process for the selection of qualitative traits. For example, markers closely linked to alleles or markers containing sequences within the actual alleles of interest can be used to select plants that contain the alleles of interest during a backcrossing breeding program. The markers can also be used to select for the genome of the recurrent parent and against the genome of the donor parent. Using this procedure can minimize the amount of genome from the donor parent that remains in the selected plants. It can also be used to reduce the number of crosses back to the recurrent parent needed in a backcrossing program. The use of molecular markers in the selection process is often called genetic marker enhanced selection. Molecular markers may also be used to identify and exclude certain sources of germplasm as parental varieties or ancestors of a plant by providing a means of tracking genetic profiles through crosses.
- D. Genomic selection
- One potential problem with marker assisted selection is that only a limited proportion of the total genetic variance is captured by the markers. An alternative to tracing a limited number of QTL with markers is to trace all the QTL. This can be done by dividing the entire genome up into chromosome segments, for example defined by adjacent markers, and then tracing all the chromosome segments. This method was termed genomic selection by Meuwissen et al. 2001 “Prediction of total genetic value using genome-wide dense marker maps” Genetics 157:1819-1829. With the availability of high-density marker maps and cost effective genotyping, genomic selection methods can provide faster genetic gain than can be achieved by current selection methods based on phenotypes and pedigree. Some of the factors driving the accuracy of genomic selection include marker density and marker type (i.e., microsatellite and SNP markers). With genomic selection, selection is typically on the sum of estimates of effects of all marker intervals across the genome, fitted either as fixed (fixed GS) or random (random GS) effects. Responses to selection are tracked by indices over generations. The efficiency of genomic selection over standard marker assisted selection depends on stringency of the threshold used for QTL detection. One skilled in the art can optimize factors that affect genomic selection for a particular species such as Sorghum species.
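- The scoring step described above, in which selection is based on the sum of estimated effects across all marker intervals, can be sketched as follows. This is a minimal illustration only: the marker effect values, the plant names, and the 0/1/2 allele-count coding are assumptions made for the example and are not taken from this disclosure. Estimating the effects themselves (as fixed or random effects) is outside the scope of the sketch.

```python
# Hypothetical per-marker effect estimates (illustrative values only).
marker_effects = [0.8, -0.3, 0.5, 0.1]

# Hypothetical genotypes, coded as the number of copies (0, 1, or 2)
# of a reference allele carried at each marker.
genotypes = {
    "plant_A": [2, 0, 1, 2],
    "plant_B": [0, 2, 0, 1],
    "plant_C": [1, 1, 2, 0],
}


def genomic_value(codes, effects):
    """Sum the marker effects, weighted by allele count, over all markers."""
    if len(codes) != len(effects):
        raise ValueError("one allele count is required per marker")
    return sum(count * effect for count, effect in zip(codes, effects))


def rank_candidates(genos, effects):
    """Return plant names sorted from highest to lowest genomic value."""
    return sorted(
        genos,
        key=lambda name: genomic_value(genos[name], effects),
        reverse=True,
    )


ranking = rank_candidates(genotypes, marker_effects)
```

- In this sketch the top-ranked candidates would be the ones advanced to the next cycle; a real program would use many more markers and effects estimated from a training population.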
- The production of double haploids can also be used for the development of plants with a homozygous phenotype in the breeding program. For example, a sorghum cultivar as a parent can be used to produce double haploid plants. Double haploids are produced by the doubling of a set of chromosomes (1 N) from a heterozygous plant to produce a completely homozygous individual. For example, see Wan et al., “Efficient Production of Doubled Haploid Plants Through Colchicine Treatment of Anther-Derived Maize Callus”, Theoretical and Applied Genetics, 77:889-892, 1989 and U.S. Pat. No. 7,135,615. This can be advantageous because the process omits the generations of selfing needed to obtain a homozygous plant from a heterozygous source.
- Haploid induction systems have been developed for various plants to produce haploid tissues, plants and seeds. The haploid induction system can produce haploid plants from any genotype by crossing a selected line (as female) with an inducer line. Such inducer lines for maize include Stock 6 (Coe, 1959, Am. Nat. 93:381-382; Sarkar and Coe, 1966, Genetics 54:453-464), KEMS (Deimling, Roeber, and Geiger, 1997, Vortr. Pflanzenzuchtg 38:203-224), or KMS and ZMS (Chalyk, Bylich & Chebotar, 1994, MNL 68:47; Chalyk & Chebotar, 2000, Plant Breeding 119:363-364), and indeterminate gametophyte (ig) mutation (Kermicle 1969 Science 166:1422-1424).
- Methods for obtaining haploid plants are also disclosed in Kobayashi, M. et al., J. Heredity 71(1):9-14, 1980, Pollacsek, M., Agronomie (Paris) 12(3):247-251, 1992; Cho-Un-Haing et al., J. Plant Biol., 1996, 39(3):185-188; Verdoodt, L., et al., February 1998, 96(2):294-300; Genetic Manipulation in Plant Breeding, Proceedings International Symposium Organized by EUCARPIA, Sep. 8-13, 1985, Berlin, Germany; Chalyk et al., 1994, Maize Genet Coop. Newsletter 68:47; Chalyk, S.
- Thus, one embodiment is a process for making a substantially homozygous sorghum progeny plant by producing or obtaining a seed from the cross of two sorghum plants and applying double haploid methods to the F1 seed or F1 plant or to a subsequent filial generation. Based on studies in maize, such methods can decrease the number of generations required to produce a variety with similar genetics or characteristics to sorghum. See Bernardo, R. and Kahler, A. L., Theor. Appl. Genet. 102:986-992, 2001. Descriptions of other breeding methods that are commonly used for different traits and crops can be found in one of several reference books (e.g., Allard, 1960; Simmonds, 1979; Sneep et al., 1979; Fehr, 1987).
- A plant breeding technique called backcrossing can be utilized wherein essentially all of the desired morphological and physiological characteristics of a variety are recovered in addition to a single gene that is transferred into the variety via the backcrossing technique. Backcrossing methods can be used to improve or introduce a characteristic into the variety. The term “backcrossing” as used herein refers to the repeated crossing of a hybrid progeny back to the recurrent parent.
- The selection of a suitable recurrent parent is an important step for a successful backcrossing procedure. The goal of a backcross protocol is to alter or substitute a single trait or characteristic in the original variety. To accomplish this, a single gene of the recurrent variety is modified or substituted with the desired gene from the nonrecurrent parent, while retaining essentially all of the rest of the desired genetic, and therefore the desired physiological and morphological, constitution of the original variety. The choice of the particular nonrecurrent parent will depend on the purpose of the backcross; one of the major purposes is to add some agronomically important trait to the plant. The exact backcrossing protocol will depend on the characteristic or trait being altered to determine an appropriate testing protocol. Although backcrossing methods are simplified when the characteristic being transferred is a dominant allele, a recessive allele may also be transferred. In this instance it may be necessary to introduce a test of the progeny to determine if the desired characteristic has been successfully transferred.
- Many single gene traits have been identified that are sometimes not selected for in the development of a new variety but that can be improved by backcrossing techniques. Single gene traits may or may not be transgenic; examples of these traits include but are not limited to, male sterility, herbicide resistance, resistance for bacterial, fungal, or viral disease, insect resistance, male fertility, enhanced nutritional quality, industrial usage, yield stability and yield enhancement. These genes are generally inherited through the nucleus. Several of these single gene traits are described in U.S. Pat. Nos. 5,959,185; 5,973,234 and 5,977,445; the disclosures of which are specifically hereby incorporated by reference in their entirety.
- Pedigree breeding starts with the crossing of two genotypes, each having one or more desirable characteristics that the other lacks or that complement the other. If the two original parents do not provide all the desired characteristics, other sources can be included in the breeding population. In the pedigree method, superior plants are selfed and selected in successive filial generations. In the succeeding filial generations the heterozygous condition gives way to homogeneous varieties as a result of self-pollination and selection. Typically in the pedigree method of breeding, five or more successive filial generations of selfing and selection are practiced: F1 to F2; F2 to F3; F3 to F4; F4 to F5, etc. After a sufficient amount of inbreeding, successive filial generations will serve to increase seed of the developed variety. In some embodiments, the developed variety comprises homozygous alleles at about 95% or more of its loci.
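- The relationship between generations of selfing and homozygosity can be illustrated with the standard Mendelian expectation that heterozygosity at a locus is halved by each generation of self-pollination. The sketch below is an idealized calculation, assuming loci that are heterozygous in the F1 and ignoring selection and linkage; the function names are illustrative and do not come from this disclosure.

```python
def expected_homozygosity(selfings):
    """Expected fraction of initially heterozygous loci that are homozygous
    after the given number of generations of selfing (0 = the F1 itself)."""
    if selfings < 0:
        raise ValueError("number of selfing generations cannot be negative")
    return 1.0 - 0.5 ** selfings


def selfings_needed(target):
    """Smallest number of selfing generations whose expected homozygosity
    reaches the target fraction (0 <= target < 1)."""
    if not 0.0 <= target < 1.0:
        raise ValueError("target must be in the range [0, 1)")
    generations = 0
    while expected_homozygosity(generations) < target:
        generations += 1
    return generations


# Four selfings from the F1 give the F5; five selfings give the F6.
f5_fraction = expected_homozygosity(4)
generations_for_95 = selfings_needed(0.95)
```

- Under these assumptions the F5 is expected to be homozygous at 93.75% of such loci, and five generations of selfing (the F6) are needed to pass 95%, which is in line with the five or more generations noted above.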
- In addition to being used to create a backcross conversion, backcrossing can also be used in combination with pedigree breeding. As discussed previously, backcrossing can be used to transfer one or more specifically desirable traits from one variety, the donor parent, to a developed variety called the recurrent parent, which has overall good agronomic characteristics yet lacks that desirable trait or traits. However, the same procedure can be used to move the progeny toward the genotype of the recurrent parent but at the same time retain many components of the non-recurrent parent by stopping the backcrossing at an early stage and proceeding with selfing and selection. For example, a sorghum variety may be crossed with another variety to produce a first generation progeny plant. The first generation progeny plant may then be backcrossed to one of its parent varieties to create a BC1 or BC2. Progeny are selfed and selected so that the newly developed variety has many of the attributes of the recurrent parent and yet several of the desired attributes of the non-recurrent parent. This approach leverages the value and strengths of the recurrent parent for use in new sorghum varieties.
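- As a rough guide to how far a BC1 or BC2 has moved toward the recurrent parent, the expected share of the recurrent parent genome can be computed from the standard expectation that each backcross halves the remaining donor contribution. The sketch below is an idealized average that ignores selection and linkage drag; the function name is illustrative.

```python
def recurrent_parent_fraction(backcrosses):
    """Expected fraction of the genome derived from the recurrent parent
    after the given number of backcrosses (0 = the initial F1)."""
    if backcrosses < 0:
        raise ValueError("number of backcrosses cannot be negative")
    return 1.0 - 0.5 ** (backcrosses + 1)


# Expected recurrent parent share for the F1, BC1, BC2, ... BC5.
expected_by_generation = {n: recurrent_parent_fraction(n) for n in range(6)}
```

- On this expectation a BC1 carries 75% and a BC2 carries 87.5% of the recurrent parent genome on average, so stopping at an early stage leaves a substantial donor contribution for subsequent selfing and selection.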
- Transgenic and non-transgenic plants described herein have various uses in the agricultural and energy production industries. For example, transgenic plants described herein can be used to make animal feed and food products. Such plants, however, are often particularly useful as a feedstock for energy production.
- Transgenic plants described herein often produce biomass with decreased or altered lignin content, relative to control plants that lack the exogenous nucleic acid. Non-transgenic plants described herein, such as those produced or selected by the methods described herein often produce biomass with decreased or altered lignin content, relative to control plants that lack one or more of the nucleic acids described herein. In some embodiments, such plants provide equivalent or even increased yields of grain and/or biomass per hectare relative to control plants when grown under conditions of reduced inputs such as fertilizer and/or water. Thus, such transgenic and non-transgenic plants can be used to provide yield quality improvements at a lower input cost and/or under environmentally stressful conditions such as drought. In some embodiments, plants described herein have a composition that permits more efficient processing into free sugars, and subsequently ethanol, for energy production. In some embodiments, such plants provide higher yields of ethanol, butanol, dimethyl ether, other biofuel molecules, and/or sugar-derived co-products per kilogram of plant material, relative to control plants. Such processing efficiencies are believed to be derived from the lignin composition of the plant material. By providing improved yields at an equivalent or even decreased cost of production, the transgenic plants described herein improve profitability for farmers and processors as well as decrease costs to consumers.
- Seeds from plants described herein can be conditioned and bagged in packaging material by means known in the art to form an article of manufacture. Packaging material such as paper and cloth are well known in the art. A package of seed can have a label, e.g., a tag or label secured to the packaging material, a label printed on the packaging material, or a label inserted within the package, that describes the nature of the seeds therein.
- Kits for genotyping plants for identification, selection, or breeding can comprise a means of detection of the presence of a truncated CAD in a sample of sorghum DNA. In some embodiments, a kit comprises one or more SNPs, such as SEQ ID NOs: 34-37, or a protein encoded by a polynucleotide as described herein. In some embodiments, a kit comprises one or more polynucleotide SNPs specific to a truncated CAD 131 to 320 amino acids in length. In some embodiments, a kit comprises one or more polynucleotide SNPs specific to a C-terminus truncated sorghum COMT, such as those described by Bout and Vermerris, which is incorporated by reference herein in its entirety (Bout and Vermerris, 2003, A candidate-gene approach to clone the sorghum Brown midrib gene encoding COMT, Mol. Gen. Genomics 269:205-214). The kits described herein may be useful for genetic identity determination, phylogenetic studies, parenthood determinations, genotyping, haplotyping, pedigree analysis, forensic identification and/or plant breeding, particularly with co-dominant scoring.
- In an embodiment, a kit may further comprise reagents for DNA amplification-detection technology such as PCR or TaqMan™. In another embodiment, a kit may further comprise reagents for probe hybridization-detection technology such as Southern Blots, Northern Blots, in-situ Hybridization, or microarrays. In another embodiment, a kit may comprise reagents for antibody binding-detection technology such as Western Blots, ELISAs, SELDI mass spectrometry or test strips. In another embodiment, a kit may comprise reagents for lignin content analysis technology. In some embodiments, a kit may comprise instructions for one or more of the methods described above.
- The invention will be further described in the following examples, which do not limit the scope of the invention described in the claims.
- Each isolated nucleic acid described herein that encodes a truncated CAD can be cloned into a Ti plasmid vector containing a phosphinothricin acetyltransferase gene which confers Finale™ resistance to transformed plants. Constructs can be made using any of the nucleic acids described herein, each operably linked to a promoter or regulatory element. Wild-type Arabidopsis thaliana ecotype Wassilewskija (Ws) plants can be transformed separately with each construct. The transformations can be performed essentially as described in Bechtold et al., C. R. Acad. Sci. Paris, 316:1194-1199 (1993).
- The presence of each vector containing a nucleic acid described herein in the respective transgenic Arabidopsis line transformed with the vector can be confirmed by Finale™ resistance, PCR amplification from green leaf tissue extract, and/or sequencing of PCR products. As controls, wild-type Arabidopsis ecotype Ws plants can be transformed with an empty vector.
- DNA samples were extracted from sorghum GRIN germplasm accession nos.: PI 535790, PI 535806, PI 599692, PI 599697, PI 599705, PI 599720, PI 599731, PI 599740, PI 599750, PI 602730, PI 602740, PI 602898, PI 602902, PI 602906, PI 602910, PI 602914, PI 606705, PI 606706, and Ceres accession nos.:BICOLOR-81733675, GRAINERIII-81733676 (Conventional Sorghum Sudangrass Hybrid), 98093-81733674 (Conventional type Hybrid Forage Sorghum), SS1-81733673 (Sudan×Sudan), 22043-81733671 (sorghum sudangrass Hybrid), and 24213-81733672 (Hybrid forage sorghum (Long season)). The CAD alleles were amplified from each accession using oligonucleotide primer sets for PCR (SEQ ID NOs: 38-61). PCR amplification products were sequenced and analyzed.
- CAD nucleotide sequences of sorghum accessions PI602730-81733686 and PI535790-81733677 were analyzed and each contained a different point mutation altering a single nucleotide (C→T), each of which resulted in a premature stop codon (SEQ ID NOs: 7 and 13).
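- How a single C-to-T change can introduce a premature stop codon may be illustrated with a short translation sketch. The 18-nucleotide sequence below is a made-up toy example and is not any of the CAD sequences or SEQ ID NOs of this disclosure; it simply shows a glutamine codon (CAG) becoming a stop codon (TAG). The standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(coding_sequence):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for start in range(0, len(coding_sequence) - 2, 3):
        amino_acid = CODON_TABLE[coding_sequence[start:start + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def substitute(sequence, position, new_base):
    """Return a copy of the sequence with one base replaced (0-based index)."""
    return sequence[:position] + new_base + sequence[position + 1:]


# Hypothetical toy coding sequence: ATG GCT CAG GGT AAA TGA.
wild_type = "ATGGCTCAGGGTAAATGA"
# C-to-T change at the first base of the third codon: CAG becomes TAG.
mutant = substitute(wild_type, 6, "T")

wild_type_protein = translate(wild_type)
mutant_protein = translate(mutant)
```

- In this toy case the wild-type sequence encodes five residues while the mutant encodes only two, which mirrors, on a much smaller scale, how the alleles described above encode truncated polypeptides.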
- Oligonucleotides were developed having specificity to the SNPs in the nucleic acid sequences of wild type and mutant CAD alleles (SEQ ID NOs: 34-37). The oligonucleotides were tested on DNA extracted from sorghum accessions. PI602730-81733686 and PI602910-85802580 were homozygous for a CAD allele featuring a SNP resulting in a premature stop codon encoding a truncated polypeptide of 320 amino acids. PI535790-81733677, PI535806-81733678, PI602740-81733687, PI602902-81733689, and PI602906-81733690 were homozygous for a CAD allele featuring a SNP resulting in a premature stop codon encoding a truncated polypeptide of 131 amino acids. Accessions 22043 and 24213 were heterozygous for the CAD allele encoding the 131 amino acid truncated CAD polypeptide. Results of oligonucleotide assisted genotyping are shown in Table 1.
TABLE 1. SNP Genotyping of Sorghum Accessions.
Accession | Plant ID | CAD Truncation 1 (BMR-6 131 aa) with T, C/T | CAD Truncation 2 (BMR-17 320 aa) with T, C/T | gDNA SEQ ID NO: | cDNA SEQ ID NO: |
---|---|---|---|---|---|
PI 535790 | N105 | T | C | 13 | 14 |
PI 535806 | N121 | T | C | Same as PI 602906 | Same as PI 602906 |
PI 599692 | MP26 | C | C | 2 | 3 |
PI 599697 | MP31 | C | C | Same as PI 599705 | Same as PI 599705 |
PI 599705 | MP39 | C | C | 25 | 26 |
PI 599720 | MP54 | T | C | Same as PI 602906 | Same as PI 602906 |
PI 599731 | MP65 | C | C | Same as PI 599705 | Same as PI 599705 |
PI 599740 | MP74 | C | C | Same as PI 599692 | Same as PI 599692 |
PI 599750 | MP84 | C | C | Same as PI 599705 | Same as PI 599705 |
PI 602730 | BMP449 | C | T | 7 | 8 |
PI 602740 | BMP454 | T | C | Same as PI 535790 | Same as PI 535790 |
PI 602898 | AMP11 | C | C | Same as PI 599705 | Same as PI 599705 |
PI 602902 | AMP13 | T | C | Same as PI 602906 | Same as PI 602906 |
PI 602906 | AMP15 | T | C | 22 | 23 |
PI 602910 | AMP17 | C | T | Same as PI 602730 | Same as PI 602730 |
PI 602914 | AMP19 | C | C | 28 | 29 |
PI 606705 | Tift 98bmrA1 | C | C | Same as PI 599705 | Same as PI 599705 |
PI 606706 | Tift 98bmrB1 | C | C | Same as PI 599705 | Same as PI 599705 |
BICOLOR | | C | C | 10 | 11 |
GRAINERIII | | C | C | Same as PI 599705 | Same as PI 599705 |
98093 | | C | C | Same as PI 599705 | Same as PI 599705 |
SS1 | | C | C | 31 | 32 |
22043 | | C/T | C | 4 | 5 |
24213 | | C/T | C | Same as PI 599705 | Same as PI 599705 |
- The oligonucleotides described herein can be used in marker assisted breeding to produce inbred sorghum lines that are homozygous for a CAD allele encoding a truncated CAD polypeptide, which can be crossed to make hybrid sorghum that is homozygous for the CAD allele encoding a truncated CAD polypeptide. For example, PI602730-81733686 can be crossed with a male sterile line (A-line) that does not contain a CAD allele encoding a truncated CAD polypeptide but which has agronomically desirable traits. The resulting progeny in F2 generations can be screened using the oligonucleotides for plants that are heterozygous or homozygous for the CAD allele encoding truncated CAD polypeptides and are male sterile. Such progeny can be backcrossed to the A-line and through generations of selection a new A-line can be developed which is homozygous for the CAD allele encoding a truncated CAD polypeptide.
The same process can be applied to B and R lines, so that the three lines can be used to produce hybrid seed that is homozygous for the CAD allele encoding a truncated CAD polypeptide.
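- The screening step can be sketched as a simple interpretation of the two SNP calls reported in Table 1, where a T at either site corresponds to the truncated allele and C/T denotes a heterozygous call. The function names and the output wording are illustrative assumptions; the example calls are the Table 1 entries for PI 535790, PI 602730, PI 599692 and 22043.

```python
def zygosity(call):
    """Interpret one SNP call written as 'C', 'T', or 'C/T'."""
    alleles = {allele.strip() for allele in call.split("/")}
    if alleles == {"C"}:
        return "wild type"
    if alleles == {"T"}:
        return "homozygous truncated"
    if alleles == {"C", "T"}:
        return "heterozygous"
    raise ValueError(f"unexpected SNP call: {call!r}")


def classify(truncation1_call, truncation2_call):
    """Summarize the CAD genotype from the calls at the two SNP sites."""
    return {
        "131 aa truncation": zygosity(truncation1_call),
        "320 aa truncation": zygosity(truncation2_call),
    }


def carries_truncated_allele(truncation1_call, truncation2_call):
    """True when at least one copy of either truncated allele is present."""
    result = classify(truncation1_call, truncation2_call)
    return any(status != "wild type" for status in result.values())


# SNP calls as listed in Table 1.
table1_calls = {
    "PI 535790": ("T", "C"),
    "PI 602730": ("C", "T"),
    "PI 599692": ("C", "C"),
    "22043": ("C/T", "C"),
}
selected = [
    accession
    for accession, calls in table1_calls.items()
    if carries_truncated_allele(*calls)
]
```

- A breeder applying such a filter to F2 or backcross progeny would retain the plants flagged as heterozygous or homozygous for a truncated allele and discard the rest.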
- A process known as Reciprocal BLAST (Rivera et al., Proc. Natl. Acad. Sci. USA, 95:6239-6244 (1998)) can be used to identify potential functional homolog sequences as well as allelic variants from databases consisting of all available public and proprietary peptide sequences, including NR from NCBI and peptide translations from Ceres clones.
- Before starting a Reciprocal BLAST process, a specific reference polypeptide can be searched against all peptides from its source species using BLAST in order to identify polypeptides having BLAST sequence identity of 80% or greater to the reference polypeptide and an alignment length of 85% or greater along the shorter sequence in the alignment. The reference polypeptide and any of the aforementioned identified polypeptides can be designated as a cluster.
- The BLASTP version 2.0 program from Washington University at Saint Louis, Mo., USA can be used to determine BLAST sequence identity and E-value. The BLASTP version 2.0 program includes the following parameters: 1) an E-value cutoff of 1.0e-5; 2) a word size of 5; and 3) the −postsw option. The BLAST sequence identity can be calculated based on the alignment of the first BLAST HSP (High-scoring Segment Pairs) of the identified potential functional homolog or allelic variant sequence with a specific reference polypeptide. The number of identically matched residues in the BLAST HSP alignment can be divided by the HSP length, and then multiplied by 100 to get the BLAST sequence identity. The HSP length typically includes gaps in the alignment, but in some cases gaps can be excluded.
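- The BLAST sequence identity calculation described above (identical residues divided by HSP length, multiplied by 100) can be sketched as follows. The aligned strings are a made-up example, and the function name and the use of "-" as the gap character are assumptions made for illustration.

```python
def blast_sequence_identity(aligned_query, aligned_subject, include_gaps=True):
    """Percent identity over one HSP, given the two aligned strings.

    The HSP length counts every alignment column by default; with
    include_gaps=False, columns containing a gap ('-') are left out.
    """
    if len(aligned_query) != len(aligned_subject):
        raise ValueError("aligned sequences must have the same length")
    columns = list(zip(aligned_query, aligned_subject))
    identical = sum(1 for q, s in columns if q == s and q != "-")
    if include_gaps:
        hsp_length = len(columns)
    else:
        hsp_length = sum(1 for q, s in columns if q != "-" and s != "-")
    if hsp_length == 0:
        raise ValueError("alignment has no scorable columns")
    return 100.0 * identical / hsp_length


# Hypothetical 9-column alignment with one gap and one mismatch.
query = "MKT-AYIAK"
subject = "MKTQAYLAK"
identity_with_gaps = blast_sequence_identity(query, subject)
identity_without_gaps = blast_sequence_identity(query, subject, include_gaps=False)
```

- Here seven of nine columns match, giving about 77.8% identity when gaps are counted in the HSP length and 87.5% when the gapped column is excluded.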
- The main Reciprocal BLAST process consists of two rounds of BLAST searches: a forward search and a reverse search. In the forward search step, a reference polypeptide sequence, “polypeptide A,” from source species SA can be BLASTed against all protein sequences from a species of interest. Top hits can be determined using an E-value cutoff of 1.0e-5 and a sequence identity cutoff of 35%. Among the top hits, the sequence having the lowest E-value can be designated as the best hit, and considered a potential functional homolog or ortholog. Any other top hit that has a sequence identity of 80% or greater to the best hit or to the original reference polypeptide can be considered a potential functional homolog or ortholog as well. This process can be repeated for all species of interest. Allelic variants typically have higher sequence identity to a reference sequence, i.e., greater than 90%, and originate from the same species as the reference sequence. Allelic variants can be compared to available genome reference maps and inter-species comparative maps to determine the likelihood that the allelic variants identified correlate to the same locus.
- In the reverse search round, the top hits identified in the forward search from all species can be BLASTed against all protein sequences from the source species SA. A top hit from the forward search that returned a polypeptide from the aforementioned cluster as its best hit can also be considered as a potential functional homolog.
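- The forward and reverse rounds can be sketched as a filter over precomputed BLAST results. This is a simplified illustration and not the procedure of the cited reference: the `Hit` record, the sequence identifiers, and the dictionary of reverse best hits are hypothetical, and running BLAST itself is left out. The E-value and identity cutoffs follow the values stated above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    subject: str     # identifier of the sequence that was hit
    evalue: float    # BLAST E-value of the hit
    identity: float  # BLAST sequence identity, in percent


def top_hits(hits, max_evalue=1e-5, min_identity=35.0):
    """Forward search: keep hits passing both cutoffs, lowest E-value first."""
    passing = [h for h in hits if h.evalue <= max_evalue and h.identity >= min_identity]
    return sorted(passing, key=lambda h: h.evalue)


def reciprocal_candidates(forward_hits, reverse_best_hit, cluster):
    """Reverse search: keep forward top hits whose own best hit in the
    source species is a member of the reference cluster."""
    return [
        hit.subject
        for hit in top_hits(forward_hits)
        if reverse_best_hit.get(hit.subject) in cluster
    ]


# Hypothetical data: the reference polypeptide "A" and a close paralog "A2".
cluster = {"A", "A2"}
forward = [
    Hit("X1", 1e-50, 82.0),
    Hit("X2", 1e-20, 40.0),
    Hit("X3", 1e-3, 60.0),   # fails the E-value cutoff
    Hit("X4", 1e-30, 30.0),  # fails the identity cutoff
]
# Best hit of each forward top hit when searched back against species SA.
reverse = {"X1": "A", "X2": "B"}

candidates = reciprocal_candidates(forward, reverse, cluster)
```

- In this example only X1 is retained, because X2 passes the forward cutoffs but its best hit in the source species falls outside the cluster.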
- It is to be understood that while the invention has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the invention, which is defined by the scope of the appended claims. Other aspects, advantages, and modifications are within the scope of the following claims.
Claims (48)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/575,991 US8298794B2 (en) | 2008-10-09 | 2009-10-08 | Cinnamyl-alcohol dehydrogenases |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10406708P | 2008-10-09 | 2008-10-09 | |
US12/575,991 US8298794B2 (en) | 2008-10-09 | 2009-10-08 | Cinnamyl-alcohol dehydrogenases |
Publications (2)
Publication Number | Publication Date |
---|---|
US20100175144A1 (en) | 2010-07-08 |
US8298794B2 (en) | 2012-10-30 |
Family
ID=42312593
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US12/575,991 Active 2030-02-06 US8298794B2 (en) | 2008-10-09 | 2009-10-08 | Cinnamyl-alcohol dehydrogenases |
Country Status (1)
Country | Link |
---|---|
US (1) | US8298794B2 (en) |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102344915A (en) * | 2011-09-16 | 2012-02-08 | 中国科学院研究生院 | Protein with cinnamyl alcohol dehydrogenase activity and coding gene as well as application thereof |
EP2701487A2 (en) * | 2011-04-29 | 2014-03-05 | Bangladesh Jute Research Institute | Polynucleotides encoding enzymes from the jute lignin biosynthetic pathway |
KR20160148062A (en) * | 2011-06-23 | 2016-12-23 | 방글라데시 주트 리서치 인스티튜트 | Nucleic acid molecules encoding enzymes that confer disease resistance in jute |
WO2018198049A1 (en) * | 2017-04-25 | 2018-11-01 | Cellectis | Alfalfa with reduced lignin composition |
CN110679479A (en) * | 2019-11-15 | 2020-01-14 | 河北省农林科学院旱作农业研究所 | Hybrid breeding method of BMR genotype sudan grass |
CN110679478A (en) * | 2019-11-15 | 2020-01-14 | 河北省农林科学院旱作农业研究所 | Breeding method of forage grass type BMR sorghum sterile line |
CN116716423A (en) * | 2022-09-08 | 2023-09-08 | 四川省草原科学研究院 | EST-SSR marker primer pair developed based on phalaris arundinacea transcriptome sequence, and phalaris arundinacea variety identification method and application |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10045540B2 (en) | 2014-04-01 | 2018-08-14 | Fayetteville State University | Pest control composition |
Citations (35)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4987071A (en) * | 1986-12-03 | 1991-01-22 | University Patents, Inc. | RNA ribozyme polymerases, dephosphorylases, restriction endoribonucleases and methods |
US5034323A (en) * | 1989-03-30 | 1991-07-23 | Dna Plant Technology Corporation | Genetic engineering of novel plant phenotypes |
US5204253A (en) * | 1990-05-29 | 1993-04-20 | E. I. Du Pont De Nemours And Company | Method and apparatus for introducing biological substances into living cells |
US5231020A (en) * | 1989-03-30 | 1993-07-27 | Dna Plant Technology Corporation | Genetic engineering of novel plant phenotypes |
US5254678A (en) * | 1987-12-15 | 1993-10-19 | Gene Shears Pty. Limited | Ribozymes |
US5538880A (en) * | 1990-01-22 | 1996-07-23 | Dekalb Genetics Corporation | Method for preparing fertile transgenic corn plants |
US5959185A (en) * | 1998-02-13 | 1999-09-28 | Pioneer Hi-Bred International, Inc. | Soybean variety 95B41 |
US5973234A (en) * | 1998-02-13 | 1999-10-26 | Pioneer Hi-Bred International, Inc. | Soybean variety 95B33 |
US5977445A (en) * | 1998-06-30 | 1999-11-02 | Pioneer Hi-Bred International, Inc. | Soybean variety 91B64 |
US6013863A (en) * | 1990-01-22 | 2000-01-11 | Dekalb Genetics Corporation | Fertile transgenic corn plants |
US6326527B1 (en) * | 1993-08-25 | 2001-12-04 | Dekalb Genetics Corporation | Method for altering the nutritional content of plant seed |
US6329571B1 (en) * | 1996-10-22 | 2001-12-11 | Japan Tobacco, Inc. | Method for transforming indica rice |
US6423885B1 (en) * | 1999-08-13 | 2002-07-23 | Commonwealth Scientific And Industrial Research Organization (Csiro) | Methods for obtaining modified phenotypes in plant cells |
US6452067B1 (en) * | 1997-09-19 | 2002-09-17 | Dna Plant Technology Corporation | Methods to assay for post-transcriptional suppression of gene expression |
US6573099B2 (en) * | 1998-03-20 | 2003-06-03 | Benitec Australia, Ltd. | Genetic constructs for delaying or repressing the expression of a target gene |
US20030175965A1 (en) * | 1997-05-21 | 2003-09-18 | Lowe Alexandra Louise | Gene silencing |
US20030175783A1 (en) * | 2002-03-14 | 2003-09-18 | Peter Waterhouse | Methods and means for monitoring and modulating gene silencing |
US20030180945A1 (en) * | 2002-03-14 | 2003-09-25 | Ming-Bo Wang | Modified gene-silencing RNA and uses thereof |
US6753139B1 (en) * | 1999-10-27 | 2004-06-22 | Plant Bioscience Limited | Gene silencing |
US6777588B2 (en) * | 2000-10-31 | 2004-08-17 | Peter Waterhouse | Methods and means for producing barley yellow dwarf virus resistant cereal plants |
US20040214330A1 (en) * | 1999-04-07 | 2004-10-28 | Waterhouse Peter Michael | Methods and means for obtaining modified phenotypes |
US20060015970A1 (en) * | 2003-12-12 | 2006-01-19 | Cers, Inc. | Nucleotide sequences and polypeptides encoded thereby useful for modifying plant characteristics |
US20060021083A1 (en) * | 2004-04-01 | 2006-01-26 | Zhihong Cook | Promoter, promoter control elements, and combinations, and uses thereof |
US20060041952A1 (en) * | 2004-08-20 | 2006-02-23 | Cook Zhihong C | P450 polynucleotides, polypeptides, and uses thereof |
US7135615B2 (en) * | 2001-06-05 | 2006-11-14 | The Curators Of The University Of Missouri | Chromosome doubling method |
US20060260004A1 (en) * | 2004-04-01 | 2006-11-16 | Yiwen Fang | Par-related protein promoters |
US20060265788A1 (en) * | 2004-09-08 | 2006-11-23 | J.R. Simplot Company | Plant-specific genetic elements and transfer cassettes for plant transformation |
US20070006335A1 (en) * | 2004-02-13 | 2007-01-04 | Zhihong Cook | Promoter, promoter control elements, and combinations, and uses thereof |
US7173121B2 (en) * | 2003-10-14 | 2007-02-06 | Ceres, Inc | Promoter, promoter control elements, and combinations, and uses thereof |
US7214789B2 (en) * | 2004-06-30 | 2007-05-08 | Ceres, Inc. | Promoter, promoter control elements, and combinations, and uses thereof |
US7312376B2 (en) * | 2005-04-20 | 2007-12-25 | Ceres, Inc. | Regulatory regions from Papaveraceae |
US7378571B2 (en) * | 2004-09-23 | 2008-05-27 | Ceres, Inc. | Promoter, promoter control elements, and combinations, and uses thereof |
US7402667B2 (en) * | 2003-10-14 | 2008-07-22 | Ceres, Inc. | Promoter, promoter control elements, and combinations, and uses thereof |
US7429692B2 (en) * | 2004-10-14 | 2008-09-30 | Ceres, Inc. | Sucrose synthase 3 promoter from rice and uses thereof |
US7598367B2 (en) * | 2005-06-30 | 2009-10-06 | Ceres, Inc. | Early light-induced protein promoters |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0837624A1 (en) | 1995-06-30 | 1998-04-29 | DNA Plant Technology Corporation | Delayed ripening tomato plants |
GB9703146D0 (en) | 1997-02-14 | 1997-04-02 | Innes John Centre Innov Ltd | Methods and means for gene silencing in transgenic plants |
US6506559B1 (en) | 1997-12-23 | 2003-01-14 | Carnegie Institute Of Washington | Genetic inhibition by double-stranded RNA |
EP1353935A4 (en) | 2000-12-07 | 2005-02-23 | Penn State Res Found | Selection of catalytic nucleic acids targeted to infectious agents |
US7244879B2 (en) | 2005-10-12 | 2007-07-17 | Ceres, Inc. | Nucleotide sequences and polypeptides encoded thereby useful for modifying plant characteristics in response to cold |
CA2581468A1 (en) | 2004-09-22 | 2006-04-06 | Ceres, Inc. | Promoter, promoter control elements, and combinations, and uses thereof |
WO2007055826A1 (en) | 2005-11-04 | 2007-05-18 | Ceres, Inc. | Modulation of fertility in monocots |
WO2007120989A2 (en) | 2006-02-24 | 2007-10-25 | Ceres, Inc. | Shade regulatory regions |
- 2009-10-08: US application US12/575,991, patent US8298794B2, status Active
Patent Citations (35)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4987071A (en) * | 1986-12-03 | 1991-01-22 | University Patents, Inc. | RNA ribozyme polymerases, dephosphorylases, restriction endoribonucleases and methods |
US5254678A (en) * | 1987-12-15 | 1993-10-19 | Gene Shears Pty. Limited | Ribozymes |
US5034323A (en) * | 1989-03-30 | 1991-07-23 | Dna Plant Technology Corporation | Genetic engineering of novel plant phenotypes |
US5231020A (en) * | 1989-03-30 | 1993-07-27 | Dna Plant Technology Corporation | Genetic engineering of novel plant phenotypes |
US6013863A (en) * | 1990-01-22 | 2000-01-11 | Dekalb Genetics Corporation | Fertile transgenic corn plants |
US5538880A (en) * | 1990-01-22 | 1996-07-23 | Dekalb Genetics Corporation | Method for preparing fertile transgenic corn plants |
US5204253A (en) * | 1990-05-29 | 1993-04-20 | E. I. Du Pont De Nemours And Company | Method and apparatus for introducing biological substances into living cells |
US6326527B1 (en) * | 1993-08-25 | 2001-12-04 | Dekalb Genetics Corporation | Method for altering the nutritional content of plant seed |
US6329571B1 (en) * | 1996-10-22 | 2001-12-11 | Japan Tobacco, Inc. | Method for transforming indica rice |
US20030175965A1 (en) * | 1997-05-21 | 2003-09-18 | Lowe Alexandra Louise | Gene silencing |
US6452067B1 (en) * | 1997-09-19 | 2002-09-17 | Dna Plant Technology Corporation | Methods to assay for post-transcriptional suppression of gene expression |
US5959185A (en) * | 1998-02-13 | 1999-09-28 | Pioneer Hi-Bred International, Inc. | Soybean variety 95B41 |
US5973234A (en) * | 1998-02-13 | 1999-10-26 | Pioneer Hi-Bred International, Inc. | Soybean variety 95B33 |
US6573099B2 (en) * | 1998-03-20 | 2003-06-03 | Benitec Australia, Ltd. | Genetic constructs for delaying or repressing the expression of a target gene |
US5977445A (en) * | 1998-06-30 | 1999-11-02 | Pioneer Hi-Bred International, Inc. | Soybean variety 91B64 |
US20040214330A1 (en) * | 1999-04-07 | 2004-10-28 | Waterhouse Peter Michael | Methods and means for obtaining modified phenotypes |
US6423885B1 (en) * | 1999-08-13 | 2002-07-23 | Commonwealth Scientific And Industrial Research Organization (Csiro) | Methods for obtaining modified phenotypes in plant cells |
US6753139B1 (en) * | 1999-10-27 | 2004-06-22 | Plant Bioscience Limited | Gene silencing |
US6777588B2 (en) * | 2000-10-31 | 2004-08-17 | Peter Waterhouse | Methods and means for producing barley yellow dwarf virus resistant cereal plants |
US7135615B2 (en) * | 2001-06-05 | 2006-11-14 | The Curators Of The University Of Missouri | Chromosome doubling method |
US20030180945A1 (en) * | 2002-03-14 | 2003-09-25 | Ming-Bo Wang | Modified gene-silencing RNA and uses thereof |
US20030175783A1 (en) * | 2002-03-14 | 2003-09-18 | Peter Waterhouse | Methods and means for monitoring and modulating gene silencing |
US7402667B2 (en) * | 2003-10-14 | 2008-07-22 | Ceres, Inc. | Promoter, promoter control elements, and combinations, and uses thereof |
US7173121B2 (en) * | 2003-10-14 | 2007-02-06 | Ceres, Inc. | Promoter, promoter control elements, and combinations, and uses thereof |
US20060015970A1 (en) * | 2003-12-12 | 2006-01-19 | Ceres, Inc. | Nucleotide sequences and polypeptides encoded thereby useful for modifying plant characteristics |
US20070006335A1 (en) * | 2004-02-13 | 2007-01-04 | Zhihong Cook | Promoter, promoter control elements, and combinations, and uses thereof |
US20060021083A1 (en) * | 2004-04-01 | 2006-01-26 | Zhihong Cook | Promoter, promoter control elements, and combinations, and uses thereof |
US20060260004A1 (en) * | 2004-04-01 | 2006-11-16 | Yiwen Fang | Par-related protein promoters |
US7214789B2 (en) * | 2004-06-30 | 2007-05-08 | Ceres, Inc. | Promoter, promoter control elements, and combinations, and uses thereof |
US20060041952A1 (en) * | 2004-08-20 | 2006-02-23 | Cook Zhihong C | P450 polynucleotides, polypeptides, and uses thereof |
US20060265788A1 (en) * | 2004-09-08 | 2006-11-23 | J.R. Simplot Company | Plant-specific genetic elements and transfer cassettes for plant transformation |
US7378571B2 (en) * | 2004-09-23 | 2008-05-27 | Ceres, Inc. | Promoter, promoter control elements, and combinations, and uses thereof |
US7429692B2 (en) * | 2004-10-14 | 2008-09-30 | Ceres, Inc. | Sucrose synthase 3 promoter from rice and uses thereof |
US7312376B2 (en) * | 2005-04-20 | 2007-12-25 | Ceres, Inc. | Regulatory regions from Papaveraceae |
US7598367B2 (en) * | 2005-06-30 | 2009-10-06 | Ceres, Inc. | Early light-induced protein promoters |
Non-Patent Citations (4)
Title |
---|
GRIN database, accession no. PI:535790, submitted 15 September 1989, accessed 6/7/2012 *
GRIN database, accession no. PI:602730, received 11 February 1998, accessed 6/7/2012 *
Sorghum Newsletter, 2003 *
USDA Plant Inventory, 1998 *
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9683241B2 (en) | 2011-04-29 | 2017-06-20 | Bangladesh Jute Research Institute | Polynucleotides encoding enzymes from the jute lignin biosynthetic pathway |
EP2701487A2 (en) * | 2011-04-29 | 2014-03-05 | Bangladesh Jute Research Institute | Polynucleotides encoding enzymes from the jute lignin biosynthetic pathway |
JP2014513536A (en) * | 2011-04-29 | 2014-06-05 | Bangladesh Jute Research Institute | Polynucleotides encoding enzymes of the jute lignin biosynthetic pathway |
CN103987722A (en) * | 2011-04-29 | 2014-08-13 | Bangladesh Jute Research Institute | Polynucleotides encoding enzymes from the jute lignin biosynthetic pathway |
EP2701487A4 (en) * | 2011-04-29 | 2015-04-29 | Bangladesh Jute Res Inst | Polynucleotides encoding enzymes from the jute lignin biosynthetic pathway |
KR20160148062A (en) * | 2011-06-23 | 2016-12-23 | Bangladesh Jute Research Institute | Nucleic acid molecules encoding enzymes that confer disease resistance in jute |
KR101866904B1 (en) * | 2011-06-23 | 2018-06-14 | Bangladesh Jute Research Institute | Nucleic acid molecules encoding enzymes that confer disease resistance in jute |
CN102344915A (en) * | 2011-09-16 | 2012-02-08 | Graduate University of Chinese Academy of Sciences | Protein with cinnamyl alcohol dehydrogenase activity and coding gene as well as application thereof |
WO2018198049A1 (en) * | 2017-04-25 | 2018-11-01 | Cellectis | Alfalfa with reduced lignin composition |
US11479782B2 (en) | 2017-04-25 | 2022-10-25 | Cellectis | Alfalfa with reduced lignin composition |
CN110679479A (en) * | 2019-11-15 | 2020-01-14 | Dryland Farming Institute, Hebei Academy of Agriculture and Forestry Sciences | Hybrid breeding method of BMR genotype sudan grass |
CN110679478A (en) * | 2019-11-15 | 2020-01-14 | Dryland Farming Institute, Hebei Academy of Agriculture and Forestry Sciences | Breeding method of forage grass type BMR sorghum sterile line |
CN116716423A (en) * | 2022-09-08 | 2023-09-08 | Sichuan Academy of Grassland Sciences | EST-SSR marker primer pair developed based on Phalaris arundinacea transcriptome sequence, and Phalaris arundinacea variety identification method and application |
Also Published As
Publication number | Publication date |
---|---|
US8298794B2 (en) | 2012-10-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20220056466A1 (en) | | Transgenic plants having increased biomass |
US9441233B2 (en) | | Transgenic plants having increased biomass |
US8298794B2 (en) | | Cinnamyl-alcohol dehydrogenases |
US11339403B2 (en) | | Transgenic plants having increased tolerance to aluminum |
US11629352B2 (en) | | Methods of increasing crop yield under abiotic stress |
WO2010033564A1 (en) | | Transgenic plants having increased biomass |
WO2012058223A1 (en) | | Transgenic plants having altered biomass composition |
Legal Events
Code | Title | Description |
---|---|---|
AS | Assignment | Owner name: CERES, INC., CALIFORNIA. Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNOR: SWALLER, TIMOTHY; REEL/FRAME: 024363/0919. Effective date: 20091214 |
STCF | Information on status: patent grant | Free format text: PATENTED CASE |
CC | Certificate of correction | |
FEPP | Fee payment procedure | Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
FPAY | Fee payment | Year of fee payment: 4 |
FEPP | Fee payment procedure | Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.) |
MAFP | Maintenance fee payment | Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY. Year of fee payment: 8 |