PJB-2019-754
DE NOVO SEQUENCING AND COMPARATIVE ANALYSIS OF CAMELLIA OLEIFERA (THEACEAE) TRANSCRIPTOMES
Baoming Wang
Abstract
Camellia oleifera is the important edible oil tree, but limited transcriptomic data are currently available. The absence information has been a barrier to the development of functional research. In this study, we used Illumina paired-end RNA sequencing technology to obtain C. oleifera transcriptomes of leaf, flower, and matured seed. A total of 66.52 million clean reads were generated and were assembled into 82964 genes (185190 transcripts) with an average length of 671 bp for ungenes and 966 bp for transcripts. Of which, 46.51 % had significant similarity with proteins in the NCBI nonredundant protein database (Nr), Nt (NCBI nucleotide sequences) and SwissProt database, et al., while 53.49% has no hits in the Nr protein database. 82964 assembled unigenes were assigned to gene ontology categories, of which 29.31% (24313 ungenes) and 43.12% (35773 genes) were mapped onto 31 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway database (KEGG), and 26 pathways of Karyotic Ortholog Groups (KOG). Furthermore, we have discovered functional genes relating to the growth and development.
To Cite this article:
Download