欢迎访问《植物研究》杂志官方网站,今天是 分享到:

植物研究 ›› 2020, Vol. 40 ›› Issue (3): 458-467.doi: 10.7525/j.issn.1673-5102.2020.03.018

• 研究报告 • 上一篇    

药用资源植物山莨菪的转录组信息分析

张雨1,2, 夏铭泽1,2, 张发起1   

  1. 1. 中国科学院高原生物适应与进化重点实验室, 中国科学院西北高原生物研究所, 西宁 810001;
    2. 中国科学院大学, 北京 100039
  • 收稿日期:2019-09-05 发布日期:2020-05-29
  • 通讯作者: 张发起,E-mail:fqzhang@nwipb.cas.cn E-mail:fqzhang@nwipb.cas.cn
  • 作者简介:张雨(1994-),女,硕士研究生,主要从事植物分子生物学研究。
  • 基金资助:
    青海省应用基础研究计划(2019-ZJ-7042);国家自然科学基金(31110103911)

Transcriptome Analysis for Medicinal Plant Anisodus tanguticus

ZHANG Yu1,2, XIA Ming-Ze1,2, ZHANG Fa-Qi1   

  1. 1. Key Laboratory of Adaptation and Evolution of Plateau Biota, Northwest Institute of Plateau Biology, Chinese Academy of Sciences, Xi'ning 810001;
    2. College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100039
  • Received:2019-09-05 Published:2020-05-29
  • Supported by:
    Application basic research plan of Qinghai Province(2019-ZJ-7042);National Natural Science Foundation of China(31110103911)

摘要: 为了增强对资源植物山莨菪的深入了解,本研究采用高通量测序技术对山莨菪进行转录组测序分析,经过处理得到71 463个Unigenes。通过与多个数据库进行比对,对基因进行分类和分析注释,最终成功获得注释的基因有47 624条。将Unigenes比对到KOG蛋白质库中,有13 110个基因被注释,共有26个子类;比对到NR库中后有39 621个Unigenes被注释;转录本与Swissprot、TrEMBL的比对结果得到GO功能注释信息,注释得到的29 309个Unigenes可被分为分子功能、生物学过程和细胞组分3个大类,62个子类;以KEGG数据库为参考,3 679条基因被注释,参与的代谢通路可归为4个大类,分别是代谢相关的通路、遗传信息处理、细胞过程、环境信息处理,其中与代谢相关的通路最多,约占所有代谢通路的一半。对山莨菪的药用活性成分的代谢通路及相关Unigenes数量和类型的统计结果表明,与生物碱相关的代谢通路最多,萜类和苯丙素类所对应的Unigenes数量最多。另外,结果还检测到31 382个SNP位点,6种SSR重复类型,其中单碱基重复类型所占的比例最高,每百万碱基中出现的单碱基重复的SSR个数有56.52个,占45.30%。该结果丰富了山莨菪的转录组信息数据,为该物种分子生物学方面的研究奠定了基础,有助于进一步开展对山莨菪的合理保护及开发利用工作。

关键词: 山莨菪, 转录组, 高通量测序, 基因注释

Abstract: The transcriptome sequencing analysis of Anisodus tanguticus was carried out by high throughput sequencing technology to understand resource plant A.tanguticus deeply. The 71 463 Unigenes were obtained after processing. By comparing with several databases,we classified and analyzed the genes, and finally succeeded in annotating 47 624 genes. The 13 110 genes with 26 subclasses were annotated after comparing with KOG protein library.Compared with NR library, 39 621unigenes were annotated.The transcripts were compared with Swissprot and TrEMBL to obtain GO functional annotations. The 29 309 unigenes obtained from the annotations could be divided into three categories:molecular function, biological process, and cellular components, with 62 subcategories.Referring to the KEGG database, 3 679 genes were annotated. The metabolic pathways involved can be classified into four categories:metabolic related pathways, genetic information processing,cellular processes, and environmental information processing. Among them,metabolic related pathways are the most, accounting for about half of all metabolic pathways.Statistical results of metabolic pathways and related unigenes of the active ingredients of A.tanguticus showed that the number of metabolic pathways related to alkaloids was the most, and the number of unigenes corresponding to terpenoids and phenylpropanoids was the most.In addition, 31 382 SNP loci and 6 SSR repeat types were detected. Among them, single base repeat types accounted for the highest proportion, with 56.52 SSR repeats per million bases, accounting for 45.30%.These results enrich the transcriptome information of A.tanguticus and lay a foundation for the study of the molecular biology, which contribute to the further development and utilization of A.tanguticus.

Key words: Anisodus tanguticus, transcriptome, high throughput sequencing technology, gene annotation

中图分类号: