-
在自然环境中,植物总会遭受各种生物或非生物的胁迫,在长期的进化过程中,自然也会形成一定的防御机制。转录因子在植物响应逆境胁迫过程中具有重要功能,它们通过与多个基因启动子区域顺式作用元件的特异性结合参与植物的生长、发育和压力信号传递等过程[1-2]。乙烯响应因子(ERF,Ethylene Response Factor)是植物对生物和非生物胁迫响应反应中的主要调节因子[3-5]。转录因子识别特定的基序,并作为特定基因的激活或抑制因子发挥作用[6]。ERF转录因子含有57~70个氨基酸组成的ERF域[7-9],可以特异地与存在于植物细胞内的病原菌响应基因启动子中的GCC-box(GCC-box:AGCCGCC)结合,从而激活或者抑制ET,JA偶联的防御途径[8,10]。在许多病程相关基因启动子区都有GCC-box,如β−1,3−葡聚糖酶、几丁质酶和渗透蛋白,转录因子通过与GCC-box的结合直接或间接调节PR基因的表达,从而介导这些关键基因在植物响应生物胁迫中发挥作用[11]。木薯(Manihot esculenta)不仅是主要的粮食和经济作物,也是一种重要的再生能源作物[12-13]。广西、广东、海南、云南和福建是我国木薯的主要种植省份。由于种植省份的特殊地理位置,使木薯经常受到台风、海水倒灌使土地海盐量升高等的影响,极大地影响了木薯的产量。如时涛,李超萍等[14-15]的研究表明,低温则使得木薯低产甚至绝收,湿热则容易让木薯受到病害的威胁。目前,已发现ERF转录因子在木薯应对生物或非生物胁迫时能够发挥一定的作用[16]。因此,笔者利用生物信息学手段,挖掘木薯ERF转录因子的下游靶基因,旨在为木薯响应生物及非生物胁迫分子机理的解析提供基因资源和理论基础。
HTML
-
以启动子区含有GCC-box这一顺式作用元件为条件,在木薯的全基因组数据中筛选符合条件的基因,共筛选到启动子区含有GCC-box这一顺式作用元件的基因204个,其基因详细信息见表1。
基因座位 Gene name 长度/bp Gene length 注释 Annotation Manes.01G064200 958 Osmotin-like protein OSM34 Manes.01G069100 3 335 Subtilase 4.12 Manes.02G024800 6 457 Transcription initiation factor TFIIE subunit beta Manes.02G189600 2 172 Putative WRKY transcription factor 33 Manes.03G039700 4 584 12-oxophytodienoate reductase 3 Manes.01G073700 3 652 Glycine cleavage T-protein family protein Manes.06G002000 6 512 Alpha/beta-hydrolase domain-containing protein Manes.09G063700 2 653 Phenylalanine ammonia-lyase 2 Manes.14G141600 9 106 Uncharacterized protein Manes.14G151300 386 Uncharacterized protein Manes.15G181900 4 511 Formate dehydrogenase Manes.18G073800 5 027 Histone-lysine N-methyltransferase ASHR1 Manes.01G079500 2 597 Uncharacterized protein Manes.01G091000 1 473 Hypothetical protein MANES_01G091000 Manes.01G105500 2 550 Alpha-1,4-galacturonosyltransferase Manes.01G121800 3 143 Uncharacterized protein Manes.01G132600 3 866 Uncharacterized protein Manes.01G133300 1 251 Uncharacterized protein Manes.01G155000 3 623 Protein kinase domain-containing protein Manes.01G156100 1 895 Pentatricopeptide repeat-containing protein Manes.01G164900 3 815 Laccase-4 Manes.01G171700 2 788 Scarecrow-like protein 34 Manes.01G191200 1 685 Protein TWIN LOV 1 Manes.01G194000 1 712 Myb-like transcription factor-like protein Manes.01G239000 10 955 Protein late elongated hypocotyl Manes.01G257700 623 RING/U-box domain-containing protein Manes.02G017600 1 568 Heavy-metal-associated domain-containing protein Manes.02G018800 1 163 Fiber Fb34 Manes.02G023700 101 Hypothetical protein MANES_02G023700 Manes.02G024900 11 644 Methyl- -binding domain-containing 13 isoform X1 Manes.02G036700 4 426 Adenine phosphoribosyl transferase 4 Manes.02G102800 1 121 GDP-mannose 4,6 dehydratase 2 Manes.02G108400 2 642 S-phase kinase-associated protein 1 Manes.02G121400 359 Josephin Manes.02G153300 3 946 Uncharacterized protein Manes.02G153900 2 548 Uncharacterized protein Manes.02G154000 1 220 Mitochondrial transcription termination factor family protein Manes.02G184300 760 Uncharacterized protein Manes.02G188500 635 Uncharacterized protein Manes.02G198700 128 Hypothetical protein MANES_02G198700 Manes.02G205000 146 Hypothetical protein MANES_02G205000 Manes.02G226700 2 598 U11/U12 small nuclear ribonucleoprotein Manes.03G012100 754 Uncharacterized protein Manes.03G032700 3 580 Methyltransferase Manes.03G036900 2 926 Protein phosphatase 2C 16 Manes.03G050700 11 341 AGC (cAMP-dependent, cGMP-dependent and protein kinase C)
kinase family proteinManes.03G051800 1 655 Mitochondrial substrate carrier family protein Manes.03G091000 1 460 Gibberellin 2-beta-dioxygenase 2 Manes.03G164100 3 813 Malate dehydrogenase Manes.03G183700 581 Aspartyl protease-like protein Manes.03G210000 2 101 Uncharacterized protein Manes.04G022300 197 Hypothetical protein MANES_04G022300 Manes.04G025100 5 961 Hydrolase, alpha/beta fold family protein Manes.04G035100 5 201 Uncharacterized protein Manes.05G033700 7 936 Signal peptide peptidase-like 4 Manes.05G057700 4 849 Alpha-1,3-glucosyltransferase Manes.05G062200 590 Plant invertase/pectin methylesterase inhibitor Domain-containing protein Manes.05G071300 636 Hypothetical protein MANES_05G071300 Manes.05G097100 10 203 Alpha-amylase-like 3 Manes.05G103400 2 351 Hypothetical protein MANES_05G103400, partial Manes.05G111500 2 499 Receptor-like protein kinase FERONIA Manes.05G119700 47 987 Uncharacterized protein Manes.05G159600 6 158 Uncharacterized protein Manes.05G174000 1 566 Sterol carrier protein 2 Manes.05G177200 2 505 VIER F-box protein 1 Manes.06G012700 2 821 Alternative oxidase 2 Manes.06G067300 3 592 Regulator of Vps4 activity in the MVB pathway protein Manes.06G078800 1 966 DNA-3-methyladenine glycosylase I Manes.06G129700 3 233 Glutamate dehydrogenase 2 Manes.06G140900 3 349 Protein kinase family protein Manes.06G164200 1 528 Chaperonin-like RbcX protein Manes.06G165700 839 Uncharacterized protein Manes.06G175500 2 430 Pentatricopeptide repeat-containing protein Manes.07G028100 6 262 Uncharacterized protein Manes.07G056000 9 058 Inositol-tetrakisphosphate 1-kinase 2 Manes.07G065200 1 549 F-box/kelch-repeat protein Manes.07G072000 7 499 Hypothetical protein MANES_07G072000 Manes.07G075200 2 214 Hypothetical protein MANES_07G075200 Manes.07G107500 1 805 Hypothetical protein MANES_07G107500 Manes.07G109000 1 895 Alpha/beta-hydrolase domain-containing protein Manes.07G129000 3 499 GDSL esterase/lipase CPRD49 Manes.07G129900 3 630 Haloacid dehalogenase-like hydrolase domain-containing protein Manes.07G131300 1 928 Putative protein phosphatase 2C 30 Manes.08G030000 1 009 Xanthoxin dehydrogenase Manes.08G039600 5 526 Translation initiation factor eIF-3 subunit 8 Manes.08G108800 2 055 Pectin lyase-like protein Manes.08G136100 662 Uncharacterized protein Manes.08G137700 947 Ethylene-responsive transcription factor 1B Manes.08G147800 1 983 NAC domain containing protein 47 Manes.08G148900 3 450 Uncharacterized protein Manes.08G162000 338 Hypothetical protein MANES_08G162000 Manes.09G008400 5 230 Ankyrin repeat and BTB/POZ domain-containing protein Manes.09G009700 622 Hypothetical protein MANES_09G009700 Manes.09G107500 1 883 PHD finger-like domain-containing protein 5A Manes.09G109400 7 454 Tetratricopeptide repeat (TPR)-containing protein Manes.09G146900 6 045 Heme oxygenase-like, multi-helical protein Manes.09G149000 1 173 Ethylene-responsive transcription factor 1B Manes.09G165800 1 122 Rossmann-fold NAD(P)-binding domain-containing protein Manes.09G185400 4 108 Pyruvate dehydrogenase E1 component subunit beta Manes.10G002300 10 867 F-box domain-containing protein Manes.10G004700 6 026 Potassium transporter 2 Manes.10G007300 5 373 Biotin/lipoyl attachment domain-containing protein Manes.10G041100 3 379 Floral homeotic protein APETALA 2 Manes.10G072100 1 751 60S ribosomal protein L8-3 Manes.10G094000 2 043 TTF-type zinc finger protein with HAT dimerisation domain Manes.10G097300 10 362 Dynamin-related protein 3A Manes.10G111800 7 592 Zinc finger CCCH domain-containing protein 5 Manes.10G119500 4 800 Ethylene-responsive transcription factor RAP2-7 Manes.10G123900 368 Uncharacterized protein Manes.11G005600 1 016 NADPH:quinone oxidoreductase Manes.11G009200 1 991 Xyloglucan fucosyltransferase Manes.11G014100 1 440 GDSL esterase/lipase 1 Manes.11G019800 5 849 E3 ubiquitin ligase-like protein Manes.11G022100 5 549 Phospholipid:diacylglycerol acyltransferase 1 Manes.11G040800 1 832 Protein kinase family protein Manes.11G058500 5 067 Prefoldin 6 Manes.11G067200 1 153 Lamin-like protein Manes.11G070400 2 939 Wall-associated receptor kinase-like 22 Manes.11G071000 1 085 T-complex protein 1 subunit epsilon Manes.11G105000 3 083 Elongation factor EF-2 Manes.11G141100 2 601 GTP binding protein Manes.12G007000 416 Glutaredoxin-C8 Manes.12G009300 2 317 Uncharacterized protein Manes.12G024200 4 084 Ribose-phosphate pyrophosphokinase 3 Manes.12G043700 3 925 26S proteasome regulatory subunit 4-A Manes.12G059000 1 090 ILI1 binding bHLH 1 protein Manes.12G066300 1 289 Nuclear transcription factor Y subunit B-3 Manes.12G095300 953 Dof zinc finger protein DOF3.4 Manes.12G096300 1 524 Dof zinc finger protein DOF1.1 Manes.12G110100 2 614 Acyl-CoA N-acyltransferases-like protein Manes.12G111300 3 013 Phototropic-responsive NPH3-like protein Manes.12G114300 1 921 Cytochrome P450, family 81, subfamily D, polypeptide 5 Manes.12G138000 215 Hypothetical protein MANES_12G138000 Manes.13G007200 416 Glutaredoxin-C7 Manes.13G007300 1 265 Pollen Ole e 1 allergen and extensin family protein Manes.13G029700 4 707 Protein kinase family protein Manes.13G117100 2 242 Cytochrome P450, family 81, subfamily D, polypeptide 8 Manes.13G150000 7 593 STRUBBELIG-receptor family 3 Manes.14G001200 3 318 Hypothetical protein MANES_14G001200 Manes.14G006900 4 475 Mitochondrial transcription termination factor family protein Manes.14G024500 5 322 SKP1-like protein 21 Manes.14G036100 4 095 LUC7 N_terminus domain-containing protein Manes.14G071200 621 Hypothetical protein MANES_14G071200 Manes.14G112300 3 289 Uncharacterized protein Manes.14G125900 877 Transcription factor bHLH52 Manes.14G134200 1 024 Protein LURP-one-related 12 Manes.15G018300 2 852 Hypothetical protein MANES_15G018300 Manes.15G023900 1 418 Aspartyl protease-like protein Manes.15G050500 1 913 9-cis-epoxycarotenoid dioxygenase NCED6 Manes.15G053800 5 533 High mobility group-box and ARID domain-binding domain-containing protein Manes.15G070500 479 myb domain protein 101 Manes.15G090000 997 Homeobox-leucine zipper protein ATHB-52 Manes.15G103900 1 499 Peroxidase 59 Manes.15G109900 269 Hypothetical protein MANES_15G109900 Manes.15G134200 5 566 UDP-3-O-[3-hydroxymyristoyl] N-acetylglucosamine deacetylase Manes.15G150600 884 Uncharacterized protein Manes.15G152100 5 554 SGF29 tudor-like domain-containing protein Manes.15G166600 3 570 Putative cysteine-rich receptor-like protein kinase 43 Manes.15G179800 7 127 Alpha-mannosidase II Manes.16G007600 650 Uncharacterized protein Manes.16G020700 1 432 F-box protein Manes.16G054300 4 718 Galacturonosyltransferase 6 Manes.16G056200 1 763 TTF-type zinc finger protein with HAT dimerization domain Manes.16G058300 2 179 TTF-type zinc finger protein with HAT dimerization domain Manes.16G066800 558 Uncharacterized protein Manes.16G077300 2 074 Uncharacterized protein Manes.16G083800 1 764 Mitochondrial substrate carrier family protein Manes.16G084600 2 554 Thermospermine synthase ACAULIS5 Manes.16G087300 782 Ethylene-responsive transcription factor ERF016 Manes.16G109800 4 580 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein Manes.16G116900 554 Hypothetical protein MANES_16G116900 Manes.17G000400 23 515 Nuclear RNA polymerase C1 Manes.17G024800 248 Hypothetical protein MANES_17G024800 Manes.17G024900 398 Hypothetical protein MANES_17G024900, partial Manes.17G030400 2 314 Phototropic-responsive NPH3 family protein Manes.17G048000 518 Hypothetical protein MANES_17G048000 Manes.17G048400 746 Ethylene-responsive transcription factor 11 Manes.17G048500 6 168 MAC/Perforin domain-containing protein Manes.17G056700 5 022 Uncharacterized protein Manes.17G073100 4 620 Endoplasmic oxidoreductin-2 Manes.17G112300 996 Uncharacterized protein Manes.17G112400 20 885 Hypothetical protein MANES_17G112400 Manes.18G001100 6 966 Pyrophosphate--fructose-6-phosphate 1-phosphotransferase Manes.18G011500 2 613 Adenine nucleotide alpha hydrolases-domain containing protein kinase Manes.18G011600 321 Hypothetical protein MANES_18G011600 Manes.18G011700 11 468 Vacuolar sorting protein 35 Manes.18G047000 2 741 Transcription factor lim1 Manes.18G052700 2 457 DnaJ/Hsp40 cysteine-rich domain-containing protein Manes.18G058800 10 440 Kinesin family member 4/7/21/27 Manes.18G098700 2 548 Putative WRKY transcription factor 33 Manes.S014400 2 771 Disease resistance-like protein/LRR domain-containing protein Manes.S014600 2 975 Disease resistance-like protein/LRR domain-containing protein Manes.S014800 14 231 Disease resistance-like protein/LRR domain-containing protein Manes.S020700 1 380 Myb domain protein 93 Manes.S027500 1 737 F-box/kelch-repeat protein Manes.S030800 4 132 RAB GTPase homolog A5A Manes.S054400 1 964 Glycoside hydrolase family 28 protein / polygalacturonase (pectinase) family protein Manes.S054500 4 127 Polygalacturonase-like protein Manes.S054600 1 461 Polygalacturonase-like protein Manes.S054700 1 461 Polygalacturonase-like protein Manes.S055000 1 461 Polygalacturonase-like protein Manes.S055100 1 461 polygalacturonase-like protein Manes.S096700 3 378 Uncharacterized protein Manes.S109400 929 Polygalacturonase-like protein Table 1. Annotations of ERF transcription factor target gene in cassava
-
对204个木薯ERF转录因子靶基因的染色体位置分布进行分析,结果见图1。从图1可知,有15个基因定位于未知的染色体上,剩余189个基因在木薯的18条染色体上均有分布。其中,木薯第2条染色体上有18个基因,是分布最多的染色体;第4条染色体上只有3个基因,是分布最少的染色体。位于染色体上端的基因最多,占43%,第4条和第18条染色体上,基因均分布在染色体的上端,中部和下端没有分布;位于染色体下端的基因占35%;位于染色体中部的基因仅占22%。
-
木薯ERF转录因子靶基因的注释中,Manes.02G189600与拟南芥中的AtWRKY33基因同源,而AtWRKY33是植物病原菌互作途径中重要转录因子;Manes.03G039700是OPR3,JA信号通路中的同工酶;Manes.09G063700是苯丙烷类代谢途径的关键酶,也是合成木质素的关键酶;Manes.15G181900是甲酸氢解酶;而其他大部分基因编码的蛋白是未知蛋白。
选取上述4个基因,再随机挑选12个基因,通过分析这16个基因的启动子顺式作用元件,发现其具有典型的启动子元件TATA-box和CAAT-box,说明具有启动子活性(图2)。从图2进一步分析发现,有大量的光响应元件,例如ACE,G-box,Box 4,MRE等;还存在防御和胁迫响应元件(TC-rich repeats)、低温响应元件(LTR)、干旱响应元件(MBS)、脱落酸响应元件(ABRE)、茉莉酸甲酯响应元件(TGACG-motif,CGTCA-motif)等。多种顺式作用元件的存在说明这些基因可能受到复杂的调控,在木薯遭遇胁迫时发挥一定的作用。
-
在NCBI网站SRA数据库中,下载来源于用Xam弱致病菌株ORST4和强致病菌株ORST4+TALE1侵染苗龄8周的木薯(品种为MCOL1522)组培苗实验的12个关于病原菌侵染木薯的转录组数据[17],详见表2。
运行ID Run ID 样品名称 Sampling SRR1050891 ORST4, collected at 0 d SRR1050892 ORST4, collected at 5 d SRR1050893 ORST4, collected at 7 d SRR1050894 ORST4+TALE1Xam, collected at 0 d SRR1050895 ORST4+TALE1Xam, collected at 5 d SRR1050896 ORST4+TALE1Xam, collected at 7 d SRR1050897 ORST4, collected at 0 d SRR1050898 ORST4, collected at 5 d SRR1050899 ORST4, collected at 7 d SRR1050900 ORST4+TALE1Xam, collected at 0 d SRR1050901 ORST4+TALE1Xam, collected at 5 d SRR1050902 ORST4+TALE1Xam, collected at 7 d Table 2. Transcriptome data sources of cassava infected by pathogenic bacteria
计算木薯ERF转录因子靶基因在这些转录组中的FPKM值,表达量低于1的基因剔除,筛选出76个基因用Heatmap构建基因表达热图(图3)。从图3可知,有26个基因能够被强致病菌株诱导表达,被弱致病菌株抑制表达,其中,Manes.03G039700,Manes.06G002000,Manes.14G141600,Manes.15G181900,Manes.03G091000,Manes.06G012700,Manes.08G039600,Manes.08G148900,Manes.15G050500,Manes.16G020700,Manes.16G116900,Manes.17G048400这12个基因的表达差异最为显著,尤其是Manes.03G091000,在病原菌侵染前后基因的表达差异超过了75倍。
-
选取Manes.02G189600,Manes.03G039700,Manes.06G002000,Manes.09G063700,Manes.14G141600,Manes.15G181900等6个基因,利用qRT-PCR作进一步验证。
用ACC处理木薯幼苗,从图4可知,Manes.02G189600,Manes.03G039700和Manes.15G181900基因在处理0.5 h后表达量均持续上调。其中,Manes.03G039700基因表达量上调更显著;Manes.02G189600基因在处理0.5 h后表达量最高,之后上调缓慢;Manes.15G181900基因在处理6 h后表达量最高,上调较显著。Manes.06G002000基因在处理后表达量下调,1 h表达量最低,之后表达量(相比1 h)上调,6 h的表达量与0 h几乎一致。Manes.09G063700 和Manes.14G141600基因在处理1 h后表达量最低,而2 h的表达量回升达到最高,之后Manes.09G063700基因表达量与0 h几乎一致,Manes.14G141600基因上调缓慢。本实验结果表明,这6个基因可以瞬时响应ACC处理,可能在木薯的乙烯通路中发挥作用。
Figure 4. Expression pattern of ERF transcription factor target genes in cassava under ACC treatment
从图5可知,在病原菌侵染木薯幼苗4 h后(相比对照),Manes.02G189600,Manes.03G039700,Manes.06G002000,Manes.09G063700,Manes.14G141600,Manes.15G181900基因的表达量均显著下调;Manes.02G189600,Manes.03G039700和Manes.06G002000在病原菌侵染8 h(相比对照)的表达量上调;24 h后(相比对照),Manes.02G189600,Manes.03G039700,Manes.06G002000,Manes.09G063700,Manes.14G141600,Manes.15G181900这6个基因的表达量继续下调。本实验结果表明,这些基因可以瞬时响应病原菌对木薯的侵染,可能参与了木薯的抗病途径。