Skip to main content

Conservation and diversification of the miR166 family in soybean and potential roles of newly identified miR166s

Abstract

Background

microRNA166 (miR166) is a highly conserved family of miRNAs implicated in a wide range of cellular and physiological processes in plants. miR166 family generally comprises multiple miR166 members in plants, which might exhibit functional redundancy and specificity. The soybean miR166 family consists of 21 members according to the miRBase database. However, the evolutionary conservation and functional diversification of miR166 family members in soybean remain poorly understood.

Results

We identified five novel miR166s in soybean by data mining approach, thus enlarging the size of miR166 family from 21 to 26 members. Phylogenetic analyses of the 26 miR166s and their precursors indicated that soybean miR166 family exhibited both evolutionary conservation and diversification, and ten pairs of miR166 precursors with high sequence identity were individually grouped into a discrete clade in the phylogenetic tree. The analysis of genomic organization and evolution ofMIR166gene family revealed that eight segmental duplications and four tandem duplications might occur during evolution of the miR166 family in soybean. The cis-elements in promoters ofMIR166family genes and their putative targets pointed to their possible contributions to the functional conservation and diversification. The targets of soybean miR166s were predicted, and the cleavage ofATHB14-LIKEtranscript was experimentally validated by RACE PCR. Further, the expression patterns of the five newly identifiedMIR166s and 12 target genes were examined during seed development and in response to abiotic stresses, which provided important clues for dissecting their functions and isoform specificity.

Conclusion

This study enlarged the size of soybean miR166 family from 21 to 26 members, and the 26 soybean miR166s exhibited evolutionary conservation and diversification. These findings have laid a foundation for elucidating functional conservation and diversification of miR166 family members, especially during seed development or under abiotic stresses.

Background

MicroRNAs (miRNAs) are a class of endogenous, single-stranded, small regulatory RNA molecules, which broadly exist in diverse eukaryotes. In plants, miRNAs range from 20 to 24 nt in length and regulate the expression of target genes mainly at post-transcriptional levels [1]。Plant miRNAs are derived from their congateMIRNAgenes that are mostly located in intergenic regions, sometimes in intragenic regions, of the plant genome [24]。Generally, aMIRgene is transcribed by Polymerase II into a capped and polyadenylated primary miRNA (pri-miRNA) [5,6]。Subsequently, DCL protein carries out the cleavage of pri-miRNA into the stem-loop precursor (pre-miRNA), which is further processed by DCL1 to generate mature miRNA and miRNA*. Mature miRNA is then loaded into an Argonaute protein to form the miRNA-induced silencing complex (miRISC). Finally, miRISCs can bind to the transcripts of target genes through perfect or near-perfect pairing with its mature miRNA, and guild mRNA cleavage or translational inhibition in plants [7,8]。Thus, plant miRNAs are thought to mediate almost all plant cellular and metabolic processes via regulating posttranscriptional gene silencing [9,10]。

miR166s are highly conserved in plants, and 262 miR166 species have been identified in 45 plant species according to the miRBase database (Release 21, June 2014). It is generally accepted that conserved miRNAs might play crucial roles in regulating fundamentally important biological processes [11]。So, many efforts have been made to elucidate functional roles and regulatory mechanisms of miR166 family in different plant species. To date, miR166s have been found to be involved in the modulation of various developmental processes via negatively mediating their targets, including SAM development, organ polarity, seed development, vascular patterning of shoot, root development, and nutrition ion uptake [1217]。Additionally, some evidences indicated that miR166 family might play crucial roles in response to abiotic and biotic stresses. For example, miR166 up-regulation upon salinity stress and concomitant depression of its targets were observed in andigena potato, suggesting an important network node for modulating gene expression responsive for growth adjustments [18]。Similarly, miR166 was induced byPhytophthora sojaeinfection in soybean, indicating that it may be implicated in soybean basal defense [19]。

Plants often harbor a number of miR166s derived from multigene family with the individual loci. For example, there are 21 miR166s in soybean, 17 inPopulus trichocarpa, 9 inArabidopsis thaliana, 13 in maize, rice, andPhyscomitrella patensaccording to miRBase database (Release 21, June 2014). The miR166s in multigene family were found to be highly conserved to targetHD-ZIP IIIfamily genes such asREVOLUTA(REV),PHABULOSA(PHB),PHAVOLUTA(PHV),CORONA(CNA) andATHB8in a broad range of plant species, indicating that they might exhibit high degree of functional redundancy [2023]。Nevertheless, emerging evidences indicate that functional specialization exists in miR166 family. Firstly, miR166s in different species can be predicted and/or demonstrated to targetnon-HD-ZIPIIIgenes. For example,RICE Dof DAILY FLUCTUATIONS 1(RDD1) can be targeted by miR166 in rice, thereby regulating nutrient ion uptake and accumulation in rice [16]; while miR166m rather than other miR166 members inPhyscomitrella patenswas predicted to target 3non-HD-ZIPIIItranscripts such as TC21828, FC366912 and TC13986 [24]。These different set of targets suggest functional diversity of miR166 family in plant species. Secondly, spatio-temporal expression patterns ofMIR166genes also reflect their functional specialization [25]。A good example is that only five members ofMIR166gene family (MIR165a,MIR165b,MIR166a,MIR166bandMIR166g) inArabidopsiswere found to be expressed in embryo, which might provide a positional signal from the basal–peripheral region of early embryos, whereas the other fourMIR166s can not be detected in any stage of embryogenesis [13]。Similarly, threeMIR166genes (MIR165a,MIR166aandMIR166b) were observed to be expressed specifically in the root endodermis and post-embryonic meristem inArabidopsis[26]。Additionally, phylogenetic analysis revealed diversification ofMIR166gene family amongArabidopsis, rice andPhyscomitrella patens, which laid a foundation for further exploration of the evolutionary and functional divergence of plant miRNAs [24]。Evidently, the existence of multiple copies ofMIR166genes contributes to the functional diversity and specificity ofMIR166gene family members in plants. Thus, addressing the evolutionary conservation and diversification ofMIR166gene family becomes an important step towards understanding their fine-tuned regulatory roles in plant developmental processes and/or resistance to stresses.

Soybean harbors the largest miR166 family with 21 members in plant. Although the co-evolution and characteristics ofMIRNAgenes were globally investigated in soybean [2729], the evolutionary conservation and functional diversification ofMIR166family members in soybean remain poorly understood. In this study, five novel miR166s in soybean were identified by data mining approach. Subsequently, the genomic organization and evolution of all the 26MIR166genes were investigated; and as a result, eight segmental duplications and four tandem duplications were found to have possibly occurred during evolution ofMIR166gene family in soybean. Furthermore, promoter analysis and target prediction results pointed to functional diversification of soybeanMIR166gene family. Finally, the expression patterns of the five newly identifiedMIR166s were examined during seed development and in response to abiotic stresses. These findings laid a foundation for elucidating functional diversification of soybeanMIR166gene family, especially during seed development or in response to abiotic stresses.

Results

Identification of novel miR166s in soybean

To identify novel miR166s in soybean, pre-miR165/166 sequences from soybean,Arabidopsisand rice were used as queries to conduct BLAST search against the soybean genomic database (http://www.phytozome.net/). Subsequently, the matched genomic sequences were analyzed following a series of screening criteria for encoding miRNA sequence, and five novel pre-miR166s were identified in soybean. Based on soybean miR166 nomenclature (gma-miR166a-u) in miRBase (Release 21, 2014), the five newly identified miR166s were orderly named as miR166v-z.

The characteristics of the five newly identified miR166s were summarized in Tables1and2. All the five newly identified mature miR166 sequences were identical to their paralogs in soybean. Also, they were located on the 3’arm of the secondary stem-loop hairpin structure of the pre-miRNAs (Fig.1). The nucleotide lengths of the five newly identified pre-miRNAs ranged from 98 to 244 nt, which fall right within the normal range of plant miRNA precursors (55 to 930 nt, mean = ~146 nt) [30]。Negative minimal folding free energy (MFE) is an important criterion for distinguishing the miRNA from other types of RNA [31]。In the study, all the potential miRNA precursors exhibited low MFEs ranging from -46.0 to -90.7 kcal/mol (Table2). Furthermore, to normalize the potential effect of sequence length on MFEs, the minimal folding free energy index (MFEI) was calculated to differentiate miRNAs from other RNAs. As shown in Table2, the pre-miRNAs had a high MFEI (0.71–1.35) with an average of about 0.96, suggesting that they are very likely to be miRNA precursors. Thus, the size of soybean miR166 family was enlarged from 21 to 26 members (Additional file1: Table S1).

Table 1 Characteristics of the five newly identified miR166s in soybean
Table 2 Characteristics of the five newly identified pre-miR166s in soybean
Fig. 1
figure1

Stem-loop structures of the five newly identified pre-miR166s in soybean. The panela-esequentially correspond to pre-miR166v-z. The mature miRNA portion is highlighted ingreen bar

进化ary relationship of Pre-miR166s in soybean

To examine the evolutionary relationship among the 26 miR166s in soybean, phylogenetic analysis was conducted using the sequences of their precursors and mature miRNAs, respectively. The phylogram of miR166 precursors revealed that 26 soybean pre-miR166s were grouped into three classes representing similarities and divergence. As shown in Fig.2a, 21 pre-miR166s formed the largest class, and the rest of pre-miR166s were clustered into two separate branches away from the majority of pre-miR166s. The five newly identified pre-miR166s (pre-miR166v-z) were unevenly distributed in the phylogenetic tree. As indicated in Fig.2a, pre-miR166w was grouped together with pre-miR166b, n ,u, m, while pre-miR166v, pre-miR166x and pre-miR166y formed a branch with miR166e in class I. In contrast, pre-miR166z formed a separate branch with pre-miR166k in class III, suggesting that the newly identified pre-miR166s might have evolutionary diversification. It is interesting to note that ten pairs of miR166 precursors with high sequence identity (93.8% or higher), were individually grouped into discrete clades in the phylogenetic tree (Fig.2aand Additional file2: Figure S1), including three pairs containing the newly identified pre-miR166s (pre-miR166v and pre-miR166e, pre-miR166x and pre-miR166y, pre-miR166z and pre-miR166k) and seven previously known pre-miR166 pairs (pre-miR166b and pre-miR166n, pre-miR166a and pre-miR166c, pre-miR166s and pre-miR166t, pre-miR166d and pre-miR166f, pre-miR166g and pre-miR166i, pre-miR166h and pre-miR166j, pre-miR166o and pre-miR166q). These observations suggest that these pairs of pre-miR166s are evolutionarily closer to each other compared with the rest members in miR166 family.

Fig. 2
figure2

Phylogenetic analysis and alignment of miR166 family in soybean.aPhylogenetic analysis of miR166 precursors.bPhylogenetic analysis and alignment of mature miR166s. The five newly identified miR166s in soybean are highlighted with astar, and the identity between paralogous pair is listed

Meanwhile, the topological tree of mature miR166s strongly supported the conserved property of miR166 family. As shown in Fig.2b, 26岁成熟miR166s被分成两个演化支,and all the mature miR166s except miR166l showed identical or nearly identical sequence. To further understand the conservation and divergence of soybean miR166s, 63 unique mature miR166 sequences were isolated from miRBase (Release 21) and designated as UmiR166-1 to 63, respectively (Additional file3: Table S2). The phylogenetic analysis indicated that the 63 UmiR166s were clustered into four classes representing conservation and divergence (Additional file4:图S2)。六UmiR166s soyb 26日发表ean miR166s were grouped in the Class IV (such as miR166a-g, i, n, o, v, w, x, y, UmiR166-28; miR166p-t, UmiR166-29; miR166m, UmiR166-41; miR166z, k, h, UmiR166-49; miR166j, UmiR166-53; miR166u, UmiR166-62), while the representative of gma-miR166l (UmiR166-13) formed a branch with 26 UmiR166s in the Class I. The observation indicated that soybean miR166s are highly conserved among plant species. We further performed Kalmogorov-Smirnov analysis to assess the percentage identity of soybean miR166s and their precursors, respectively. As shown in Figure S3 (Additional file5), ~0.25 fraction of mature miR166s had more than 80% sequence identity, whereas ~0.25 fraction of pre-miR166s only showed ~20% sequence identity, indicating that mature miR166s are more conserved than their precursors. It was also observed that each of the pre-miR166 paralogous pairs generated identical mature sequence except the pair of miR166h and miR166j, suggesting that they might target the same set of genes in soybean.

Chromosomal distributions and duplications ofMIR166s in soybean

To determine the distribution ofMIR166s on different chromosomes in soybean, a chromosome map was constructed. As shown in Fig.3, the 26 pre-miR166s are unevenly located on 14 different chromosomes in soybean. Chromosome 6 has the largest number with four pre-miR166s, followed by chromosome 4 and 8 with three pre-miR166s, while two miR166s were distributed on chromosome 5, 7, 9, 10, 16, and one miR166 on chromosome 1, 2, 3, 15, 19 and 20. Further examination revealed that 12 pre-miR166s were found in the intergenic regions of the genome. Intriguingly, 14 intragenic miR166s were mapped to the region of intron, exon or/and UTR in host genes (Additional file6:表S3)。例如,pre-miR166p pre-miR166l were situated in the UTR region of theUNAG KINASEgene and anti-sense strand of intron region of theCHAPERONINgene, respectively. These results implied that interactions might exist between intragenic miR166s and host genes.

Fig. 3
figure3

Chromosomal localization and duplication ofMIR166family genes in soybean. Eachcolored boxrepresents a chromosome. The approximate distribution of each soybeanMIR166gene is marked on thecirclewith a shortblack line. Tandem duplication clusters are indicated withstar. Colored linesindicate the linkage group with segmental duplicatedMIR166genes, and segmental duplication regions were determined using the Plant Genome Duplication Database

It has been demonstrated previously that approximately 20% of plant miRNAs are clustered, and generally contain conserved miRNAs of the same family [32]。估计数字集群在大豆miR166 family, a maximal distance of 3 kb between two consecutive miR166s was used as a stringent criterion. As a result, four miR166 clusters on the same DNA strand were identified in soybean (Fig.3), and each cluster contained two copies of miR166 family members (pre-mir166v and pre-miR166o, pre-miR166x and pre-miR166b, pre-miR166y and pre-miR166n, pre-miR166e and pre-mir166q). The distance between two pre-miR166s within each cluster ranged from 90 to 158 nt (Additional file1: Table S1), suggesting that they might be polycistronic precursors. To confirm their polycistronic nature, data mining was conducted against soybean Expressed Sequence Tags (EST) database since ESTs could provide evidence forMIR基因表达在不同的组织或流程。As shown in Table S3 (Additional file6), the corresponding EST simultaneously contained the sequences of two pre-miR166s that belong to the same cluster. These observations indicated that the correspondingMIRgenes can indeed be transcribed into polycistronic precursors carrying two tandem copies of miR166s. Thus, we tentatively named these genes asMIR166e/q,MIR166v/o,MIR166x/bandMIR166y/n, and three of the five newly identified pre-miR166s belonged to polycistronic precursors. It has been reported that homologousMIRgene clusters are associated with dosage effect [33]。Thus, existence of homologous miR166s from a polycistronicMIRgene implied that they might exert dosage effect on the regulation of target gene expression.

Based on coordinates of pre-miR166s or their neighbor genes (Additional file1: Table S1), we further investigated whether traceable genome duplications contributed to the expansion ofMIR166 gene family in soybean. Eight sets ofMIR166swere mapped on eight distinct duplicate blocks (Additional file7: Table S4), including three newly identified miR166-containing blocks (MIR166v/oandMIR166e/qon the block 446,MIR166x/bandMIR166y/non the block 571,MIR166zandMIR166kon the block 1447) and five previously known miR166-containing blocks (MIR166gandMIR166ion the block 210,MIR166sandMIR166ton the block 816,MIR166aandMIR166con the block 867,MIR166handMIR166jon the block 958,MIR166dandMIR166fon the block 1157). The analysis suggested that these pairs ofMIR166s on the same block were possibly derived from segmental duplication events during evolution. To investigate the selective evolutionary pressure onMIR166gene divergence after duplication, the non-synonymous/synonymous substitution ratio (Ka/Ks) was retrieved for the eight duplicated pairs ofMIR166genes. As shown in Additional file7: Table S4 the Ka/Ks value of all the duplicated gene pairs are less than 1, suggesting that these genes might have undergone a purifying selection with limited functional divergence after duplication [34]。

Analyzing cis-regulatory motifs inMIR166promoters

To provide clues for understanding the regulatory mechanism ofMIR166expression, bioinformatics analysis was conducted to identify putative cis-regulatory elements in promoter region, which was defined based on the estimated transcription start site (TSS). As shown in Additional file8: Table S5 most putative promoters were located within 800 bp upstream region of pre-miRNA foldbacks, consistent with previous report that the distance between TSS and precursor is less than 1 kb for more than 85% miRNAs inArabidopsis[35]。As expected, the core promoter element TATA-box was observed in all theMIR166promoter regions, while 18 out of 22MIR166genes contained CAAT-box, a common cis-acting element in promoter and enhancer regions. Furthermore, specific promoter elements were analyzed for understandingMIR166functional roles. As indicated in Additional file9: Table S6 the specific promoter elements were classified into eight classes, including light response, hormone response, seed development, leaf development, biosynthesis and metabolism, cell cycle and circadian, biotic or abiotic defense, and others. The cis-regulatory elements were non-uniformly distributed in differentMIR166promoters. Light-responsive elements were most enriched in the promoters ofMIR166family genes (100% of promoters were found to contain light responsive elements), followed by two endosperm-specific regulatory motifs (Skn1, 84.6% and GCN4, 65.4%) and four stress-responsive elements such as TC-rich (76.9%), HSE (61.5%), ARE (61.5%) and MBS (61.5%), suggesting that these cis-elements are fundamental to the expression ofMIR166family genes. In contrast, some motifs exist only in three or fewer members ofMIR166gene family (such as MSA-like 3.8%, RY-element 11.5%, HD-ZIP III binding site 11.5% and MBSI 11.5%), implying that these motifs might be involved in controlling the specificity ofMIR166expression. It has been well demonstrated that miR166 family can control the expression ofHD-ZIPIIIgenes via directly targeting their transcripts for degradation [14,17,21]。Interestingly,MIR166g,MIR166randMIR166zcontained HD-ZIPIII binding site in their promoter regions (Fig.4), implying a possible HD-ZIPIII-mediated feedback regulatory loop forMIR166expression in soybean.

Fig. 4
figure4

Promoter analysis ofMIR166genes in soybean. Phylogenetic relationship ofMIR166promoters is shown in left panel. The 6 classes of cis-elements (I-VI) are responsive to hormones, abiotic stresses, seed development, HD-ZIPIII binding, biosynthesis and metabolism, cell cycle and circadian, respectively. IfMIR166harbors the cis-element in promoter region, thebox标有数字1和强调在吗bluecolor, otherwise theboxis labeled with the number 0

To investigate the evolutionary relationship between the promoters ofMIR166family genes, phylogenetic analysis was also conducted. As shown in Fig.4, nine pairs ofMIR166promoters can individually form a discrete clade in phylogenetic tree (MIR166dandMIR166f,MIR166mandMIR166u,MIR166e/qandMIR166o/v,MIR166pandMIR166t,MIR166b/xandMIR166n/y,MIR166kandMIR166z,MIR166iandMIR166j,MIR166aandMIR166c,MIR166handMIR166r) and have similar motif composition in their promoter regions.

Expression analysis ofMIR166s in soybean

To investigate the expression patterns ofMIR166genes, we mined the publicly available transcript profiling data of soybean tissues at the Phytozome database (http://www.phytozome.net) and found that 9MIR166genes showed tissue-specific expression patterns. Based on their expression patterns, the 9MIR166genes can be categorized into three groups (Fig.5). Group 1 comprisesMIR166a,MIR166j,MIR166handMIR166z, which were mainly expressed in root, root hair, nodule and/or shoot apical meristem. Group 2 containsMIR166gandMIR166iwith high expression in flowers, shoot apical meristem, leaf and/or nodules, while group 3 consists ofMIR166n/y,MIR166e/qandMIR166pwith high transcript abundance either in seed, pod, flower or shoot apical meristem. The maximum fragments per kilobase of transcript per million mapped reads (FPKM) forMIR166n/ywas higher (58.403) compared to the reads for the rest ofMIR166gene family (0.152 to 6.579), suggesting thatMIR166n/ymight perform a major function amongMIR166family genes in soybean.

Fig. 5
figure5

Expression analysis of soybeanMIR166genes in various tissues. The transcript profiling data of soybean tissues were extracted from the publicly-available Phytozome database (http://www.phytozome.net)的热图的一代。上面的颜色范围the heat map indicates gene expression levels, low transcript abundance indicated bygreencolor and high transcript abundance indicated byredcolor. Maximum FPKM value for eachMIR166is shown

Additionally, blast search against ESTs in NCBI database revealed that 12 pre-miR166s can perfectly match with one or more ESTs including nine gene-mapped pre-miR166s and three gene-unmapped pre-miR166s (Additional file6:表S3)。Among them, three ESTs generated fromMIR166e/q,MIR166b/x,MIR166o/vwere derived from immature seeds, and one EST for miR166j from germinating shoot. Additionally, five ESTs generating pre-miR166a, i, n, r, y were originated from the tissues under abiotic stresses. These observations not only confirmed that at least 12 pre-miR166s (including four of the newly identified pre-miR166s), out of the 26 soybean miR166s, are indeed transcribed, but also implied that these miR166s might probably be involved in seed development, germination and/or in response to abiotic stresses.

Targets of miR166s in soybean

To obtain a further understanding of biological functions of miR166 family in soybean, the transcripts ofGlycine maxfrom JGI genomic project were utilized as a reference system to predict the targets of soybean miR166s by employing the psRNATarget tool. Using three representative mature miR166s as query sequences, 19 target genes were identified. As shown in Fig.6a, 21 miR166s (miR166a-g,i-j,m-t,v-y) were predicted to target ten members of theHD-ZIPIIIfamily genes (4ATHB-14-LIKE, 3HB-15, 3REVOLUTA-LIKE), whereas miR166h,k,u,z,l might target the transcripts other thanHD-ZIP IIIfamily genes, such asPHYTOENE SYNTHASE(PSY),AMMONIUM TRANSPORTER 2-LIKE(AMT2-LIKE) and/or two-component response regulator-likeAPRR2. This results support the previous report that miR166 family displays the functional diversity and specificity in plants [24]。Among the predicted targets, cleavage ofATHB14-LIKEtranscript (Glyma05G166400) was chosen to be validated using RLM-RACE approach. As shown in Fig.6b-c, the product ofATHB14-LIKEwas precisely cleaved at the 10th or 12th position of complementarity from the 5’ end of miR166s, indicating that miR166s can be indeed involved in the regulation of biological processes via targetingHD-ZIP IIIfamily in soybean.

Fig. 6
figure6

Predicted targets of miR166 family and validation of miR166 target gene using RLM-RACE PCR.aThe alignments of fragments of target mRNAs that have complementarity to miR166s are shown. miR166 sequences are displayed to highlight complementarity to target mRNAs. The targets are classified into seven groups (I, no annotation; II,ATHB-15; III,ATHB14-LIKE; IV,REVOLUTA-LIKE; V,PHYTOENE SYNTHASE; VI,AMMONIUM TRANSPORTER 2-LIKE; VII, two-component response regulator-likeAPRR2).bRLM-RACE PCR forATHB14-LIKE. Lane 1 and M show cleaved product of desired size (734 bp) and DNA ladder, respectively.cMapping ofATHB14-LIKEmRNA cleavage sites by RLM-RACE. Thearrowsindicate the cleavage sites and the numbers show the frequency of clones sequenced

To understand the evolutionary relationship between miR166s and their target genes, phylogenetic analysis were performed using 63 unique target sequences (UTS) and 52 target genes from diverse plant species (Additional file10: Table S7). As shown in Fig.7, 28 out of 63 UTSs were clustered withHD-ZIPfamily in the phylogenetic tree. It is reasonable sinceHD-ZIP IIIfamily is highly conserved targets of miR166s among diverse plant species. Further observation pointed that UTS52 and UTS44 were very close toHD-ZIP IIIfamily genes in soybean includinggma-HB14-LIKE,gma-REV-LIKEandgma-HB15, indicating highly conserved binding site of mi166s in soybeanHD-ZIP IIIfamily. In contrast, two non-HD-ZIP IIItarget genes in soybean (gma-PSYandgma-ATM2) had seven closely related UTSs, which were also close tosbi-Calcium channel alpha-1 subunit(sbi-Ca-alpha 1),ptr-HD-ZIP III 6,phv-Plastoglobulin-1(phv-PLG),Ptr-ATP synthase B chain AtpF. Similarly,gma-APPR2was grouped with 4 UTSs (UTS09,17,20,29) and some target genes includingCsi-Resistance protein-like protein(csi-RPL),Bna-PRK,Ath-Photosystem I reaction center subunit XI(Ath-PSAL). The observations suggested thatnon-HD-ZIP IIItarget genes are diverse among different plant species.

Fig. 7
figure7

Phylogenetic analysis of unique target sequences (UTS) present in plant species and binding sites of miR166 in target genes. Target genes of miR166s in soybean are highlighted with astar. The details of UTSs were listed in Additional file10: Table S7

To provide clues for their functions during soybean development, the expression patterns of 12 annotated target genes of miR166s were extracted from genome-wide transcript profiling data of soybean tissues from the Phytozome database. As shown in Fig.8, the members of theHD-ZIPIIIfamily were highly expressed in stem, shoot apical meristem and/or pods, whereas low transcript accumulation was observed in roots and leaves. Additionally, the transcript accumulation ofAPRR2,PSYandAMT2-LIKEwas highest in flower, root and stem, respectively.

Fig. 8
figure8

Expression patterns of miR166 target genes in different soybean tissues. The transcript profiling data of 12 annotated target genes in soybean tissues were extracted from the publicly-available Phytozome database (http://www.phytozome.net)

Expression pattern of the five newly identifiedMIR166genes during seed development or in response to abiotic stresses

Several studies have documented that miR166 plays important roles during seed development and in response to abiotic stresses [13,14,3638]。To provide clues for their functional roles in soybean, we took the five newly identified pre-miR166s as examples to examine their expression patterns during seed development and in response to cold, drought and salinity stresses.

Expression patterns of the five newly identifiedMIR166s were investigated using seeds collected at 15, 30, 45, 65 days after flowering (DAF). As shown in Fig.9a, all the five newly identifiedMIR166genes showed gradual increase in transcript accumulation as seeds developed, and reached the peak at 65 DAF. Their fold changes were up to 76.31, 60.34, 14.84, 30.28 and 22.91, respectively, compared with the ones at 15 DAF, suggesting that the five newly identifiedMIR166s might play an important role at late stages of seed development.

Fig. 9
figure9

Expression analysis of the five newly identifiedMIR166genes in response to seed development and abiotic stresses.aExpression patterns of the 5MIR166genes during seed development (15, 30, 45, 65 days after flowering).bExpression patterns of the 5MIR166genes in seedlings exposed to cold stress for 0, 3, 6, 12, 24, 48 and 72 h.cExpression patterns of the 5MIR166genes in seedlings exposed to drought stress for 0, 2, 4, 6, 8 and 10 d.dExpression patterns of the 5MIR166genes in seedlings exposed to salinity stress for 0, 3, 6, 12, 24 and 72 h. Error bars indicate SE of two biological and three technical replicates. Values were normalized against theSUBI3gene

Ten-day-old soybean seedlings were exposed to cold stress at 4 °C for 0, 3, 6, 12, 24, 48 and 72 h, and expression of the five newly identifiedMIR166s were monitored. As indicated in Fig.9b, the expression ofMIR166vandMIR166zwere dramatically increased by 7.79- and 1.88-fold, respectively, at 3 h after cold treatment, implying thatMIR166vandMIR166zmight respond to cold stress at early stage.MIR166wshowed gradual increase in transcript accumulation as the stress prolonged, whileMIR166xandMIR166yexhibited slight decrease at late stage of cold stress.

When soybean seedlings were exposed to drought stress, the expression ofMIR166vandMIR166ywas promptly decreased by 6.14 and 2.60 folds, respectively, at 2 days after treatment (Fig.9c), indicating that they might be relatively sensitive to drought stress. Also, a distinct decrease in transcript accumulation after 10 days of drought treatment was observed forMIR166w,MIR166x,MIR166yandMIR166zwith 6.37, 7.75, 6.06 and 1.80 fold changes, respectively as compared to control.

When young seedlings were subjected to salt stress, the expressions of all the 5MIR166genes were promptly increased at 3 h after salt treatment (Fig.9d). Especially, the expression ofMIR166vandMIR166zreached their maximum level at 3 h salt stress with 1.99 and 5.73 fold changes, respectively as compared to non-salinity control, suggesting that they might be responsive to salt stress at early stage.MIR166wandMIR166yshowed gradual increase in transcript accumulation as the stress prolonged, and reached their maximum level at 72 h salt stress with 4.59 and 4.82 fold changes, respectively as compared to control.

年代期间miR166目标基因的表达模式eed development or in response to abiotic stresses

To explore the regulatory function of miR166s in soybean, the expression patterns of 12 target genes with annotation in Phytozome were also analyzed by qRT-PCR during seed development and in response to abiotic stresses. As shown in Fig.10a, the expressions of all theHD-ZIP IIIgenes except Glyma05G166400 were gradually increased as seeds developed, and reached the peak at 65 DAF. Their fold changes ranged from 4.39 to 59.16, compared with the ones at 15 DAF. On the contrary,PSYwas gradually decreased in transcript accumulation as seeds developed, whileAMT2-LIKEwas dramatically decreased by 2.34 folds at 30 DAF.

Fig. 10
figure10

Expression analysis of the 12 target genes in response to seed development and abiotic stresses. Expression patterns of target genes during seed development (a), under cold stress (b), drought stress (c) and salinity stress (d). Stress treatments and seed development stage were used as same as the ones in Fig.9.PSY,PHYTOENE SYNTHASE;AMT2-LIKE,AMMONIUM TRANSPORTER 2-LIKE. Error bars indicate SE of two biological and three technical replicates. Values were normalized against theSUBI3gene

As indicated in Fig.10b, the expressions of all theHD-ZIP IIIgenes andAPRR2were obviously decreased by cold stress, and reached their minimal levels at 24 h stress with 2.87–7.88 fold changes as compared to control. In contrast, the transcript accumulations ofPSYandAMT2-LIKEwere increased by 1.97 and 2.27 folds at 3 and 48 h after treatment, respectively.

FourATHB14-LIKE,REV-LIKE(Glyma11G145800) andPSYdisplayed gradual decrease in transcript accumulation as drought stress prolonged (Fig.10c), and their maximal fold changes ranged from 2.02 to 38.17. AlthoughATM2-LIKEdisplayed similar expression level to the control at early stage (2–4 days after drought treatment), the transcript accumulation was dramatically increased from 6 days after treatment until 10 days with 9.49 fold changes, suggesting thatATM2-LIKEmight respond to drought stress at middle and late stages. The decreased transcript accumulation was also observed for the rest of target genes on one or more treatment points as compared to control.

As shown in Fig.10d, all theHD-ZIP IIIgenes andAPRR2were promptly increased to their peaks in transcript accumulation at 3 h after salt stress with 1.82–4.17 fold changes, respectively as compared to control, indicating that they might be responsive to salt stress at early stage. The expressions ofPSYandAMT2-LIKEwere gradually increased, and reached their peaks at 72 h after salt treatment with 1.89–6.0 fold changes, respectively as compared to control.

Discussion

miR166 acts as a highly conserved miRNA to be implicated in a wide range of cellular and physiological processes in plants [12,1418,38]。Based on the data from miRBase, most of plant species harbor multiple miR166s. It has been accepted that the existence of multiple copies ofMIRgenes within plant species provides one possibility for functional redundancy and specificity [39]。Thus, the diversity and evolutionary conservation ofMIR166family genes are expected to be elucidated for further understanding their fine-tuned regulatory roles in plant developmental processes and/or resistance to stresses.

Generally, miRNAs are annotated using comparative criteria for both their expression and biogenesis, including size and sequence of mature miRNA, phylogenetic conservation, secondary structure of miRNA precursor and increased precursor accumulation with decreased Dicer function in vivo [40]。In this study, the five newly identified miRNAs exhibited the same size and sequence as their paralogs in soybean (Table1and Fig.2b). The lengths of the potential miRNA precursors range from 96 to 244 nt (Table2and Fig.1), which is acceptable since the length of plant miRNA precursors has been found to be in the range of 55–930 nt (mean = ~146 nt) [30]。傅rthermore, their precursors also satisfied the criteria with aspect to both phylogenetic conservation and secondary structure. It has been proposed that low MFE and high MFEI are reliable criteria to distinguish miRNAs from all coding and other non-coding RNAs, as these parameters, to certain extent, reflect the stability of the prefect or near-perfect secondary hairpin structure of miRNA precursor [31]。In the study, MFE values of the five newly identified pre-miR166s ranged from -46.0 to -90.7 kcal/mol, which were much lower than folding free energies of tRNA (-27.5 kcal/mol) and rRNA (-33 kcal/mol). At the same time, their individual MFEI value was also higher than that of tRNA, rRNA and mRNA. Taken together, the five newly identified miRNAs in soybean are very likely to be true miRNAs. Thus, the study enlarged the size of soybean miR166 family from 21 to 26 members.

It has been proposed that conservedMIRNAfamilies have been both conserved and diversified duringMIRNAevolution [39]。In this study, our data indicate that soybeanMIR166s showed evolutionary conservation. As shown in Fig.2a, ten pre-miR166 paralogous pairs with high sequence identity (93.8% or higher) were individually grouped into a discrete clade in the phylogenetic tree. Further analysis revealed that eight out of ten paralogous pairs were located at corresponding loci on the same duplication block (Fig.3and Additional file7: Table S4), indicating that they might have arisen from segmental duplications without major sequence change. Additionally, smaller than 1 of Ka/Ks values suggest that theseMIR166genes likely had undergone through a purifying selection with limited functional divergence after duplication. It is also worth noting that five paralogous pairs were clustered into discrete clades in phylogenetic tree based on their promoter sequences (Fig.4), supporting the view that gene duplication could happen simultaneously at the coding region and the promoter [41]。Evidently, the origins of these soybeanMIR166s were in line with soybean evolutionary history that soybean had undergone whole-genome duplication event ~56.5 million years ago [42]。Apart from the evolutionary conservation, soybeanMIR166s also exhibited evolutionary diversification. In the study, five pre-miR166s (pre-miR166r, pre-miR166k and pre-miR166z; pre-miR166h and pre-miR166j) were separated far from the majority of pre-miR166s (Fig.2a), and the identity between the five pre-miR166s and the rest of the pre-miR166s was low (40–66%) (Additional file2: Figure S1). These observations suggested that they had diverged from the majority ofMIR166s during soybean evolution.

Mature miR166s are ultimately responsible for functional diversification since they act as functional unit to regulate the expression of their targets. In the study, all the miR166s except miR166l were highly conserved in mature sequence, and identical mature sequence exists in each of the pre-miR166 paralogous pairs with exception of pre-miR166h and pre-miR166j (Fig.2). Consequently, most of soybean miR166s shared the same putative targets (HD-ZIP IIIfamily) in soybean. However, miR166z, k, h, u, l are likely to have functional diversity since they are predicted to target genes other than those of theHD-ZIP IIIfamily, i.e.,APRR2,PHYTOENE SYNTHASEandAMMONIUM TRANSPORTER 2-LIKE(Fig.6a). Thus, our findings demonstrate that mature miR166s exhibit both functional conservation and diversification.

It has been well-known that miR166 plays important roles during seed development. For example, miR166 can repress the seed maturation program during vegetative development via controlling the expression ofHD-ZIP IIIfamily genes such asPHB[14]; while it might also be involved in somatic embryo formation via acting as a positional signal from the basal–peripheral region of early embryos [13]。In this study, all the five newly identifiedMIR166s showed gradual increase during seed development, and reached their expression peaks at 65 DAF, a late stage of soybean seed development (Fig.9a). These expression patterns are consistent with previous report that miR166 inBrassica napuswas strongly accumulated at late stage of seed development [43]。Also, it was proposed that the accumulation of mature miR166 reached its major peak at late cotyledonary embryo stage in larch, suggesting that it might be associated with cotyledon formation [36]。The expression patterns of the five newly identifiedMIR166s suggested that they might be one of the major contributors to the network controlling seed development, especially seed maturation program.

It has been documented that plant miR166s might be involved in response to diverse abiotic stresses, and main evidences were derived from the altered accumulation pattern of mature miR166 under diverse abiotic stresses. For instance, the accumulation of mature miR166 was decreased by drought stress inSorghum bicolor[38], whereas drought stress led to increase of miRN166 accumulation in root and leaf of wheat [37]。Another kind of examples are that miR166 in potato was significantly up-regulated by salinity stress [18], and depression of miR166 by small tandem target mimic can lead to improvement of salt tolerance inArabidopsis[44]。In the study, the presence of stress-responsive elements in the promoter regions ofMIR166genes pointed to their possible roles under various stresses (Fig.4). Furthermore, the altered expression patterns of the five newly identifiedMIR166genes suggest their prominent roles in response to cold, drought and salinity (Fig.9b-d). For example,MIR166vandMIR166wharbored the LTR cis-acting element in promoter regions, which can be involved in low-temperature responsiveness (Fig.4). Indeed,MIR166vwas promptly increased by cold stress with 7.79 fold changes, whileMIR166wshowed gradual increase in transcript accumulation as cold stress prolonged (Fig.9b), implying thatMIR166vandMIR166wcan possibly function as a modulator of cold-induced signaling pathways. These results are consistent with previous reports thatMIR166eandMIR166rwere increased by chilling stress in vegetable soybean [45], and that the expression levels ofMIR166uwere obviously increased under cold condition in nitrogen-fixing nodules of soybean [46]。It has been proposed that spatial-temporal or specific expression ofMIRNAisoforms is a crucial determinant of isoform specificity [13,47]。In the study, the differential expression patterns of 5MIR166genes under same stress indicated their functional specificity (Fig.9b-d). As shown in Fig.7c,MIR166vwas dramatically decreased at 2 days after drought treatment, whereasMIR166wwas increased at 4, 6, 8 days after drought treatment. Thus, the stress-specific properties ofMIR166genes provided important clues for elucidating their functional specificity. Although five novelMIR166s in soybean were differentially regulated during seed development or in response to stresses, their expression patterns were not negatively correlated with the ones of miR166 target genes (Figs.9and10). For example, the expressions of theHD-ZIP IIIgenes were gradually increased as seeds developed. This is consistent with the previous report thatHD-ZIP IIIfamily inArabidopsiscan be positively involved in regulation of seed mature program [14]。However, the five new identifiedMIR166s also showed gradual increase in transcript accumulations. These results suggested that miR166-mediated silencing of target genes was under sophisticated regulation.MIR166gene family has large number of members in plant species, and at least 26 miR166 members exist in soybean. So, we speculated thatMIR166gene family possibly contributes to the accumulation pattern of target genes in a family member-specific manner.

It has been widely accepted that members of a certain evolutionary branch have potential to share similar functions [48]。In the study, pre-miR166x and pre-miR166y were the most closely related paralogs in the phylogenetic tree (Fig.2a), and they also showed 99.0% sequence identity (Additional file2: Figure S1). As expected, similar expression patterns were observed forMIR166xandMIR166yin response to cold, drought and salinity stresses (Fig.9b-d). Additionally, the most closely related pre-miR166 paralogs (such as miR166h and miR166j; miR166g and miR166i) also shared similar tissue-specific expression pattern in soybean (Fig.5). Thus, it was hypothesized that these most closely related pre-miR166 paralogs in soybean perform similar cellular functions during developmental processes or in response to various stresses.

Conclusions

This study enlarged the size of soybean miR166 family from 21 to 26 members. A comprehensive analysis indicated that the 26 soybean miR166s exhibited evolutionary conservation and diversification. Furthermore, the expression patterns of five newly identifiedMIR166s during seed development and in response to abiotic stresses provided important clues for elucidating the functional specificity and redundancy of miR166 family in soybean. Future study will aim at investigating the effect of eachMIR166gene on seed development and stress tolerance, identifying target genes and dissecting the regulatory mechanisms for eachMIR166, and exploring combination of multipleMIR166genes during seed development and in response to specific stress.

Methods

Plant materials and treatments

Soybean (Glycine maxcv. Williams 82) plants were grown at an experimental station in Jilin University (Changchun, Jilin Province, China), in 2015. Seeds were collected at 15, 30, 45, 65 days after flowering (DAF), and frozen in liquid nitrogen and stored at -80 °C. Mature seeds were also collected for the following experiments.

Six seeds were sown on each pot filled with 65 g vermiculite. All the seedlings were grown at 25 °C with a photoperiod (14 h light and 10 h dark) in a chamber, and regularly watered with Hoagland liquid medium. Ten-day-old seedlings were subjected to the following treatments, and six pots of plants were used for each treatment or control: (1) For cold stress, seedlings were transferred to 4 °C and samples were collected at 0, 3, 6, 12, 24, 48 and 72 h after cold treatment; (2) For drought stress, water supply was withheld and samples were collected at 0, 2, 4, 6, 8 and 10 days of water stress; (3) For salinity stress, 200 mM NaCl solution was applied to seedlings and samples were collected at 0, 3, 6, 12, 24 and 72 h after salt treatment. The above-ground parts were collected and frozen in liquid nitrogen, and stored at -80 °C.

Identification of novel miR166s in soybean

Known miR165/166 precursor sequences from soybean,Arabidopsisand rice were downloaded from the publicly available miRBase (www.mirbase.org), and then used as reference sequences to search for their homologs against soybean genome in Phytozome (www.phytozome.net). Subsequently, these homologs were used in a BLAST search against NCBI nucleotide collection database and protein database to eliminate protein-encoding sequences and non-coding RNAs such as tRNA, rRNA, snRNA or snoRNA.

The candidates were then assessed for secondary structures, using RNA fold program (http://nhjy.hzau.edu.cn/kech/swxxx/jakj/dianzi/Bioinf4/miRNA/miRNA1.htm), and the parameters were set as default. Finally, the putative miRNAs were identified based on the following criteria [49]: 1) novel mature miR166 should be identical or nearly identical to one of known miR166s in plants; 2) the position of mature miRNA on the hairpin; 3) the maximum number of unpaired residues should be five between miRNA and miRNA*; 4) the maximum number of G_U pairs in miRNA should be 5; 5) the maximum size for a bulge in miRNA sequence should be 3 nt; 6) the negative minimal folding free energy (MFE) should be low; and 7) the minimal folding free energy index (MFEI) should be high. MFE denotes the negative folding free energies, and the minimal folding free energy Index (MFEI) was calculated by employing the following equation: MFEI = [(MFE/length of the RNA sequence) *100]/(G + C) %.

Phylogenetic analysis of 26 miR166s and their precursors in soybean

大豆miR166s序列及其前体were collected from miRBase, and then aligned by ClustalW along with the ones of newly identified miR166s, respectively. Subsequently, phylogenetic trees were constructed by maximum likelihood method using software MEGA 5.2 [50], and the bootstrap value was calculated with 1000 replicates. Percentage identity of aligned sequences was calculated using Kalmogorov-Smirnov statistical test in GeneDoc.

Chromosomal localizations and gene duplications

To determine the locations of the 26 pre-miR166s on soybean chromosomes, the coordinate of pre-miR166s and chromosome lengths were obtained from NCBI database. For syntenic mapping, we firstly used the coordinate of each pre-miR166 to map its genomic locus or flanking genes, which were subsequently subjected to retrieval of duplicated genomic regions and Ka/Ks values for each duplicatedMIR166s from batch download option of Plant Genome Duplication Database (http://chibba.agtec.uga.edu/duplication/). Tandem duplications were defined as two paralogs separated by less than 3 kb in the same chromosome, while segmental duplications referred to those homologous genes distributed on duplicated chromosomal blocks from the same genome lineage. The chromosomal location image of soybeanMIR166genes was generated by the Circos software [51]。

Promoter analysis ofMIR166genes in soybean

The transcription start sites ofMIR166genes were predicted by TSSP software in Softberry (http://linux1.softberry.com/berry.phtml?topic=tssp&group=programs&subgroup=promoter). For promoter analysis, a 1,500 bp interval upstream of the transcription start site ofMIR166genes was analyzed in the PlantCARE database (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/).

Prediction of miR166 targets in soybean

The targets of soybean miR166s were predicted by using the psRNATarget program (http://plantgrn.noble.org/psRNATarget/) against the transcripts ofGlycine maxfrom JGI genomic project. The psRNATarget parameter settings were set as default with a maximum exception value of 3.

Unique miR166s (UmiR166s) and unique target sites of miR166s (UTSs) were identified according to the procedure described by Barik [52]。Briefly, the sequences of 262 mature miR166s from 45 plant species were aligned to isolate the UmiR166s according to their sequence similarity and uniqueness (Additional file2: Table S2). Subsequently, the UmiR166 sequences were applied for the prediction of target genes from plant species using psRNATarget tool. The unique miR166 binding sites in target transcripts were designated as UTS and numbered (Additional file10: Table S7). Phylogenetic trees were constructed by maximum likelihood method using software MEGA 5.2 [50]。

RNA Ligase-Mediated RACE (RLM-RACE) was performed with the FirstChoice® RLM-RACE Kit (Ambion) according to the manufacturer’s instructions, with slight modification. Briefly, 10 μg of total RNA was directly ligated to the 5’ adaptor followed by reverse transcription with an oligo (dT) primer. PCR was performed with 5’ adaptor primers and 3’ gene-specific primers (Additional file11: Table S8) using cDNA as the template. The RACE products were gel-purified, cloned, and sequenced.

Tissue-specific expression ofMIR166s and their target genes in soybean

Based on the coordinates of pre-miR166s on soybean genome in Phytozome, 11 pre-miR166s were mapped to the characterized genomic loci, and the expressions of the corresponding genes at these genomic loci were used to estimate the transcript level ofMIR166s in different tissues. The fragments per kilobase of transcript per million mapped reads (FPKM) values for eachMIR166and its targets were extracted from Phytozome database by tracking soybean gene-level expression (http://www.phytozome.net). The heatmap forMIR 166使用的热图基因生成R。2功能ion from the gplots CRAN library (http://CRAN.R-project.org/package=gplots).

Expression analysis of the newly identified pre-miR166s and target genes in soybean

Total RNA was isolated from seeds at different developmental stages and seedlings under different stress treatments using RNAprep Pure Plant Kit (Tiangen Inc, China). To examine the expression of pre-miRNAs, DNase-treated RNA of each sample was reverse transcribed to first-strand cDNA using pRimeScript RT reagent kit with gDNA Eraser (Takara) according to manufacturer’s instruction. qRT-PCR was subsequently conducted with an ABI 7500 PCR system and SYBR Premix Ex Taq (Takara), usingSUBI3as internal control. Three replicates were performed for each sample, and data were analyzed by the software ABI 7500 v.20. Primer sequences are listed in Additional file11: Table S8.

Abbreviations

miR166:

microRNA166

pre-miRNA:

Stem-loop precursor of miRNA

miRISC:

miRNA-induced silencing complex

RDD1:

RICE Dof DAILY FLUCTUATIONS 1

MFE:

Minimal folding free energy

MFEI:

Minimal folding free energy index

EST:

Expressed Sequence Tags

Ka/Ks:

Non-synonymous/synonymous substitution ratio

FPKM:

Fragments per kilobase of transcript per million mapped reads

DAF:

Days after flowering

RLM-RACE:

RNA Ligase-Mediated RACE

References

  1. 1.

    Chen XM. Small RNAs and their roles in plant development. Annu Rev Cell Dev Bi. 2009;25:21–44.

    ArticleGoogle Scholar

  2. 2.

    Cui X, Xu SM, Mu DS, Yang ZM. Genomic analysis of rice microRNA promoters and clusters. Gene. 2009;431(1-2):61–6.

    CASArticlePubMedGoogle Scholar

  3. 3.

    Bartel DP. MicroRNAs: genomics, biogenesis, mechanism, and function. Cell. 2004;116(2):281–97.

    CASArticlePubMedGoogle Scholar

  4. 4.

    Megraw M, Baev V, Rusinov V, Jensen ST, Kalantidis K, Hatzigeorgiou AG. MicroRNA promoter element discovery in Arabidopsis. RNA. 2006;12(9):1612–9.

    CASArticlePubMedPubMed CentralGoogle Scholar

  5. 5.

    Lee Y, Kim M, Han JJ, Yeom KH, Lee S, Baek SH, Kim VN. MicroRNA genes are transcribed by RNA polymerase II. Embo J. 2004;23(20):4051–60.

    CASArticlePubMedPubMed CentralGoogle Scholar

  6. 6.

    Voinnet O. Origin, biogenesis, and activity of plant microRNAs. Cell. 2009;136(4):669–87.

    CASArticlePubMedGoogle Scholar

  7. 7.

    Guo HS, Xie Q, Fei JF, Chua NH. MicroRNA directs mRNA cleavage of the transcription factor NAC1 to downregulate auxin signals for arabidopsis lateral root development. Plant Cell. 2005;17(5):1376–86.

    CASArticlePubMedPubMed CentralGoogle Scholar

  8. 8.

    Bartel DP. MicroRNAs: target recognition and regulatory functions. Cell. 2009;136(2):215–33.

    CASArticlePubMedPubMed CentralGoogle Scholar

  9. 9.

    约恩s-Rhoades MW, Bartel DP, Bartel B. MicroRNAs and their regulatory roles in plants. Annu Rev Plant Biol. 2006;57:19–53.

    CASArticlePubMedGoogle Scholar

  10. 10.

    Reinhart BJ, Weinstein EG, Rhoades MW, Bartel B, Bartel DP. MicroRNAs in plants. Genes Dev. 2002;16(13):1616–26.

    CASArticlePubMedPubMed CentralGoogle Scholar

  11. 11.

    Xie ZX, Khanna K, Ruan SL. Expression of microRNAs and its regulation in plants. Semin Cell Dev Biol. 2010;21(8):790–7.

    CASArticlePubMedPubMed CentralGoogle Scholar

  12. 12.

    Singh A, Singh S, Panigrahi KC, Reski R, Sarkar AK. Balanced activity of microRNA166/165 and its target transcripts from the class III homeodomain-leucine zipper family regulates root growth in Arabidopsis thaliana. Plant Cell Rep. 2014;33(6):945–53.

    CASArticlePubMedGoogle Scholar

  13. 13.

    Miyashima S, Honda M, Hashimoto K, Tatematsu K, Hashimoto T, Sato-Nara K, Okada K, Nakajima K. A comprehensive expression analysis of the arabidopsis MICRORNA165/6 gene family during embryogenesis reveals a conserved role in Meristem specification and a non-cell-autonomous function. Plant Cell Physiol. 2013;54(3):375–84.

    CASArticlePubMedGoogle Scholar

  14. 14.

    Tang XR, Bian SM, Tang MJ, Lu Q, Li SB, Liu XG, Tian G, Nguyen V, Tsang EWT, Wang AM et al. MicroRNA-mediated repression of the seed maturation program during vegetative development in arabidopsis. PLoS Genet. 2012;8(11):e1003091.

    CASArticlePubMedPubMed CentralGoogle Scholar

  15. 15.

    Boualem A, Laporte P, Jovanovic M, Laffont C, Plet J, Combier JP, Niebel A, Crespi M, Frugier F. MicroRNA166 controls root and nodule development in Medicago truncatula. Plant J. 2008;54(5):876–87.

    CASArticlePubMedGoogle Scholar

  16. 16.

    Iwamoto M, Tagiri A. MicroRNA-targeted transcription factor gene RDD1 promotes nutrient ion uptake and accumulation in rice. Plant J. 2016;85(4):466–77.

    CASArticlePubMedGoogle Scholar

  17. 17.

    Kim J, Jung JH, Reyes JL, Kim YS, Kim SY, Chung KS, Kim JA, Lee M, Lee Y, Kim VN, et al. microRNA-directed cleavage of ATHB15 mRNA regulates vascular development in Arabidopsis inflorescence stems. Plant J. 2005;42(1):84–94.

    CASArticlePubMedPubMed CentralGoogle Scholar

  18. 18.

    Kitazumi A, Kawahara Y, Onda TS, De Koeyer D, de los Reyes BG. Implications of miR166 and miR159 induction to the basal response mechanisms of an andigena potato (Solanum tuberosum subsp. andigena) to salinity stress, predicted from network models in Arabidopsis. Genome. 2015;58(1):13–24.

    CASArticlePubMedGoogle Scholar

  19. 19.

    Wong J, Gao L, Yang Y, Zhai JX, Arikit S, Yu Y, Duan SY, Chan V, Xiong Q, Yan J, et al. Roles of small RNAs in soybean defense against Phytophthora sojae infection. Plant J. 2014;79(6):928–40.

    CASArticlePubMedPubMed CentralGoogle Scholar

  20. 20.

    Ko JH, Prassinos C, Han KH. Developmental and seasonal expression of PtaHB1, a Populus gene encoding a class III HD-Zip protein, is closely associated with secondary growth and inversely correlated with the level of microRNA (miR166). New Phytol. 2006;169(3):469–78.

    CASArticlePubMedGoogle Scholar

  21. 21.

    Zhong R, Ye ZH. Regulation of HD-ZIP III Genes by MicroRNA 165. Plant Signal Behav. 2007;2(5):351–3.

    ArticlePubMedPubMed CentralGoogle Scholar

  22. 22.

    Yip HK, Floyd SK, Sakakibara K, Bowman JL. Class III HD-Zip activity coordinates leaf development in Physcomitrella patens. Dev Biol. 2016;419(1):184–97.

    CASArticlePubMedGoogle Scholar

  23. 23.

    Zhang CH, Zhang BB, Ma RJ, Yu ML, Guo SL, Guo L. Isolation and expression analysis of four HD-ZIP III family genes targeted by microRNA166 in peach. Genetics and molecular research : GMR. 2015;14(4):14151–61.

    CASArticlePubMedGoogle Scholar

  24. 24.

    Barik S, SarkarDas S, Singh A, Gautam V, Kumar P, Majee M, Sarkar AK. Phylogenetic analysis reveals conservation and diversification of micro RNA166 genes among diverse plant species. Genomics. 2014;103(1):114–21.

    CASArticlePubMedGoogle Scholar

  25. 25.

    Zhou Y, Honda M, Zhu H, Zhang Z, Guo X, Li T, Li Z, Peng X, Nakajima K, Duan L, et al. Spatiotemporal sequestration of miR165/166 by Arabidopsis Argonaute10 promotes shoot apical meristem maintenance. Cell Rep. 2015;10(11):1819–27.

    CASArticlePubMedGoogle Scholar

  26. 26.

    Miyashima S, Koi S, Hashimoto T, Nakajima K. Non-cell-autonomous microRNA165 acts in a dose-dependent manner to regulate multiple differentiation status in the Arabidopsis root. Development. 2011;138(11):2303–13.

    CASArticlePubMedGoogle Scholar

  27. 27.

    Turner M, Yu O, Subramanian S. Genome organization and characteristics of soybean microRNAs. BMC Genomics. 2012;13:169.

    CASArticlePubMedPubMed CentralGoogle Scholar

  28. 28.

    Liu TF, Fang C, Ma YM, Shen YT, Li CC, Li Q, Wang M, Liu SL, Zhang JX, Zhou ZK, et al. Global investigation of the co-evolution of MIRNA genes and microRNA targets during soybean domestication. Plant J. 2016;85(3):396–409.

    CASArticlePubMedGoogle Scholar

  29. 29.

    赵MX,迈耶斯公元前,Cai厘米,徐W,马JX。进化ary patterns and coevolutionary consequences of MIRNA genes and MicroRNA targets triggered by multiple mechanisms of genomic duplications in soybean. Plant Cell. 2015;27(3):546–62.

    CASArticlePubMedPubMed CentralGoogle Scholar

  30. 30.

    Thakur V, Wanchana S, Xu M, Bruskiewich R, Quick WP, Mosig A, Zhu XG. Characterization of statistical features for plant microRNA prediction. BMC Genomics. 2011;12:108.

    CASArticlePubMedPubMed CentralGoogle Scholar

  31. 31.

    Zhang BH, Pan XP, Cox SB, Cobb GP, Anderson TA. Evidence that miRNAs are different from other RNAs. Cell Mol Life Sci. 2006;63(2):246–54.

    CASArticlePubMedGoogle Scholar

  32. 32.

    Merchan F, Boualem A, Crespi M, Frugier F. Plant polycistronic precursors containing non-homologous microRNAs target transcripts encoding functionally related proteins. Genome Biol. 2009; 10(12).

  33. 33.

    Budak H, Akpinar BA. Plant miRNAs: biogenesis, organization and origins. Funct Integr Genomics. 2015;15(5):523–31.

    CASArticlePubMedGoogle Scholar

  34. 34.

    杨X, Tuskan GA,程恩华MZ。Divergence of the Dof gene families in poplar, Arabidopsis, and rice suggests multiple modes of gene evolution after duplication. Plant Physiol. 2006;142(3):820–30.

    CASArticlePubMedPubMed CentralGoogle Scholar

  35. 35.

    Zhao X, Li L. Comparative analysis of microRNA promoters in Arabidopsis and rice. Genomics Proteomics Bioinformatics. 2013;11(1):56–60.

    CASArticlePubMedPubMed CentralGoogle Scholar

  36. 36.

    Zhang JH, Zhang SG, Han SY, Wu T, Li XM, Li WF, Qi LW. Genome-wide identification of microRNAs in larch and stage-specific modulation of 11 conserved microRNAs and their targets during somatic embryogenesis. Planta. 2012;236(2):647–57.

    CASArticlePubMedGoogle Scholar

  37. 37.

    Akdogan G, Tufekci ED, Uranbey S, Unver T. miRNA-based drought regulation in wheat. Funct Integr Genomics. 2016;16(3):221–33.

    CASArticlePubMedGoogle Scholar

  38. 38.

    Hamza NB, Sharma N, Tripathi A, Sanan-Mishra N. MicroRNA expression profiles in response to drought stress in Sorghum bicolor. Gene Expr Patterns. 2016;20(2):88–98.

    CASArticlePubMedGoogle Scholar

  39. 39.

    约恩s-Rhoades MW. Conservation and divergence in plant microRNAs. Plant Mol Biol. 2012;80(1):3–16.

    CASArticlePubMedGoogle Scholar

  40. 40.

    Meyers BC, Axtell MJ, Bartel B, Bartel DP, Baulcombe D, Bowman JL, Cao X, Carrington JC, Chen X, Green PJ, et al. Criteria for annotation of plant MicroRNAs. Plant Cell. 2008;20(12):3186–90.

    CASArticlePubMedPubMed CentralGoogle Scholar

  41. 41.

    Tang GL. Plant microRNAs: an insight into their gene structures and evolution. Semin Cell Dev Biol. 2010;21(8):782–9.

    CASArticlePubMedGoogle Scholar

  42. 42.

    Lavin M, Herendeen PS, Wojciechowski MF. Evolutionary rates analysis of Leguminosae implicates a rapid diversification of lineages during the tertiary. Syst Biol. 2005;54(4):575–94.

    ArticlePubMedGoogle Scholar

  43. 43.

    Huang DQ, Koh C, Feurtado JA, Tsang EWT, Cutler AJ. MicroRNAs and their putative targets in Brassica napus seed maturation. BMC Genomics. 2013;14:140.

    CASArticlePubMedPubMed CentralGoogle Scholar

  44. 44.

    Jia XY, Ding N, Fan WX, Yan J, Gu YY, Tang XQ, Li RZ, Tang GL. Functional plasticity of miR165/166 in plant development revealed by small tandem target mimic. Plant Sci. 2015;233:11–21.

    CASArticlePubMedGoogle Scholar

  45. 45.

    Xu SC, Liu N, Mao WH, Hu QZ, Wang GF, Gong YM. Identification of chilling-responsive microRNAs and their targets in vegetable soybean (Glycine max L.). Sci Rep. 2016;6:26619.

    CASArticlePubMedPubMed CentralGoogle Scholar

  46. 46.

    Zhang SL, Wang YN, Li KX, Zou YM, Chen L, Li X. Identification of cold-responsive miRNAs and their target genes in nitrogen-fixing nodules of soybean. Int J Mol Sci. 2014;15(8):13596–614.

    CASArticlePubMedPubMed CentralGoogle Scholar

  47. 47.

    Sorin C, Declerck M, Christ A, Blein T, Ma LN, Lelandais-Briere C, Njo MF, Beeckman T, Crespi M, Hartmann C. A miR169 isoform regulates specific NF-YA targets and root architecture in Arabidopsis. New Phytol. 2014;202(4):1197–211.

    CASArticlePubMedGoogle Scholar

  48. 48.

    Paul AL, Denison FC, Schultz ER, Zupanska AK, Ferl RJ. 14-3-3 Phosphoprotein interaction networks - does isoform diversity present functional interaction specification? Front Plant Sci. 2012;3:190.

    ArticlePubMedPubMed CentralGoogle Scholar

  49. 49.

    Li X, Hou Y, Zhang L, Zhang W, Quan C, Cui Y, Bian S. Computational identification of conserved microRNAs and their targets from expression sequence tags of blueberry (Vaccinium corybosum). Plant Signal Behav. 2014;9(9), e29462.

    ArticlePubMed CentralGoogle Scholar

  50. 50.

    田村K,彼得森D,彼得森N,Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28(10):2731–9.

    CASArticlePubMedPubMed CentralGoogle Scholar

  51. 51.

    Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D, Jones SJ, Marra MA. Circos: an information aesthetic for comparative genomics. Genome Res. 2009;19(9):1639–45.

    CASArticlePubMedPubMed CentralGoogle Scholar

  52. 52.

    Barik S, Kumar A, Sarkar Das S, Yadav S, Gautam V, Singh A, Singh S, Sarkar AK. Coevolution pattern and functional conservation or divergence of miR167s and their targets across diverse plant species. Sci Rep. 2015;5:14611.

    CASArticlePubMedPubMed CentralGoogle Scholar

Download references

Acknowledgments

Not applicable.

傅nding

This work was supported by the National Natural Science Foundation of China (No. 31370340), the National Undergraduates Innovating Experimentation Project (No. 2015821240). There is no role of the funding bodies in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Availability of data and materials

The publicly available transcript profiling data of soybean tissues were obtained from the Phytozome database (http://www.phytozome.net); miR166 precursor sequences from soybean,Arabidopsisand rice were downloaded from the publicly available miRBase (www.mirbase.org). All the data sets are available within the manuscript and the attached additional files.

Authors’ contributions

Conceived and designed the experiments: SB, XL. Performed the experiments: XL, XX, JL, YH, XW, YF, RL. Analyzed the data: XL, LZ, SB. Wrote the paper: SB, YC. All authors have read and approved the final version of this manuscript.

Competing interests

The authors declare they have no competing interests.

Consent for publication

Not applicable.

Ethics approval and consent to participate

Not applicable.

Author information

Affiliations

Authors

Corresponding author

Correspondence toShaomin Bian.

Additional files

Additional file 1: Table S1.

Characteristics of miR166 family in soybean. (XLSX 12 kb)

Additional file 2: Figure S1.

Identity between pre-miR166s in soybean. (TIF 5906 kb)

Additional file 3: Table S2.

Summary of 63 unique miR166s (UmiR166s) in plant species. (XLSX 12 kb)

Additional file 4: Figure S2.

Phylogenetic analysis of all the unique miR166s (UmiR166s) in plant species. The tree is divided into four groups, and the seven UmiR166s presentative of 26 soybean miR166s are highlighted with a star. (TIF 649 kb)

额外的文件5:图S3。

The percentage identity of the aligned miR166 sequences calculated using Kalmogorov-Smirnov statistical test in GeneDoc. Percentage identity of all mature miR166 sequences(A)and their precursor sequences(B)in plant species. (TIF 355 kb)

Additional file 6: Table S3.

Corresponding gene or EST of pre-miR166s in soybean. (DOCX 21 kb)

Additional file 7: Table S4.

Segmental duplication events during evolution of MIR166 family genes in soybean. (XLSX 13 kb)

Additional file 8: Table S5.

Transcription start site ofMIR166genes in soybean. (DOCX 19 kb)

Additional file 9: Table S6.

Promoter analysis of MIR166 family genes in soybean. (XLSX 19 kb)

Additional file 10: Table S7.

Summary of UTSs and targets of miR166s. (XLSX 19 kb)

Additional file 11: Table S8.

Primers used for Real-time PCR. (XLSX 13 kb)

Rights and permissions

Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Li, X., Xie, X., Li, J.et al.Conservation and diversification of the miR166 family in soybean and potential roles of newly identified miR166s.BMC植物杂志17,32 (2017). https://doi.org/10.1186/s12870-017-0983-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:https://doi.org/10.1186/s12870-017-0983-9

Keywords

  • miR166 family
  • Soybean
  • 进化ary conservation and diversification
  • Promoter analysis
  • Gene expression pattern