Abstract
Identification of causal microRNAs (miRNAs) in colorectal cancer (CRC) is elusive, due to our lack of understanding of how specific miRNAs affect biological pathways and outcomes. An miRNA can regulate many mRNAs and an mRNA can be associated with many miRNAs; appreciation of these complex networks in which miRNAs operate is necessary to transition from identifying dysregulated miRNAs to identifying individual miRNAs or groups of miRNAs that are suitable for therapeutic purposes. The aim of the paper is to compile results from a population-based study (n = 1,954 cases with matched carcinoma/normal tissue) of miRNAs in CRC. The information gained allows for cohesive and comprehensive insight into miRNAs and CRC in terms of function and impact. Comparison of miRNA expression with mRNA expression from nine signaling pathways in carcinogenic processes allowed us to identify miRNA targets within a biological context. MiRNAs that directly influence mRNA expression may be effective biomarkers or therapeutic targets.
keywords
Introduction
MicroRNAs (miRNAs) are small, noncoding RNAs that post-transcriptionally regulate mRNA expression by binding to the 3’UTR of protein-coding genes. As such, miRNAs are thought to be key regulators in carcinogenesis. MiRNAs were first reported as being associated with colorectal cancer (CRC) in 20031. Since then, research focusing on miRNAs has expanded rapidly, in the number of studies, the miRNAs examined, and the types of studies conducted. Studies that focus on few miRNAs and those that have undertaken large-scale discovery have identified hundreds of dysregulated miRNAs in CRC. However, findings are not unanimous as to which miRNAs are dysregulated in CRC nor is there a universal point at which miRNAs are considered dysregulated. Statistically significant associations have unknown biological significance when a fold change (FC) difference between carcinoma and normal mucosa is minimal (for instance a FC of 1.1). Correspondingly, the level of expression change between carcinoma and normal mucosa that is needed to have a biological significance is unknown, however setting some threshold for meaningful differences, such as a 50% or twofold change, when considering important miRNAs for determining functionality, could help avoid considering small changes that stem from noise in the data. Determining important miRNAs can be difficult, since results can vary by type of study, which range from population-based epidemiologic approaches to those using cell lines to evaluate the influence of specific miRNAs. Small studies of targeted miRNAs can identifiy “statistically significant” miRNAs that would not hold the same level of significance after adjustment for multiple comparisons of all miRNAs conducted in large discovery studies. We have previously addressed some of the issues surrounding studying miRNAs in terms of measurement of miRNA expression, standardizing miRNA expression, and using existing databases to determine target genes2,3.
The utilization of individually paired-carcinoma and normal mucosa is important when determining differential expression as well as the impact of miRNA expression on disease and prognosis4. As we have previously reported, repeat sampling of study data shows considerable variation in results when looking only at absolute miRNA expression in carcinomas or if using non-paired normal mucosa expression data, while much greater consistency in findings occurrs when differential expression is calculated from individually paired carcinomas and normal mucosa samples4,5. We have shown that diet, alcohol, and smoking are associated with miRNA expression, as are genetic variants6-10. MiR-17-5p, miR-106b-5p, miR-19b-3p, and miR-20b-5p were downregulated in smokers; miR-106b-5p, miR-145-5p, and miR-17-5p were downregualted in subjects who consumed wine; and miR-145-5p was positively associated with whole grain intake. Individually paired tissue samples control for confounding factors, either genetic or environmental, that may influence miRNA expression. Using paired data also can help overcome variance in differential miRNA expression derived from tissue processing, since an individual’s carcinoma and normal mucosa tissues are processed in the same manner. However, given the additional cost and availability of samples, individually paired analysis is often not undertaken, contributing to greater difficulty in determining important miRNAs.
The biggest obstacle in advancing miRNA into translational research is determining how miRNAs impact biological pathways and disease processes. It is well known that miRNAs regulate many genes and many genes are regulated by multiple miRNAs. These co-regulatory networks of genes and miRNAs complicate our understanding of functionality and bring into question the importance of pursuing a single miRNA as a therapeutic target without understanding the broader complex in which it belongs. Our goal in this paper is to summarize key findings from our large epidemiological study of miRNAs in CRC. This study was conducted both as a means of discovering new potentially important miRNAs as well as replicating previously reported miRNAs that were associated with CRC; we believe this summary can help focus further research. Additionally, consolidating our study findings may highlight which miRNAs will serve as the best therapeutic targets, either individually, or in groups. We believe that information gained from our large study of 1,954 individuals diagnosed with CRC, who have paired carcinoma and normal mucosa miRNA expression data on over 2,000 miRNAs (Agilent Human miRNA Microarray V19.0) allows for cohesive and comprehensive insight into miRNAs as they relate to CRC. Of the 1,954 individuals included in this study, 1,855 of these individuals had information on survival and 217 of the 1,954 individuals had paired carcinoma and normal gene expression data obtained from RNAseq data.
Differentially expressed miRNAs in CRC
In order for an miRNA to be functionally important in CRC, it is assumed that they have to be dysregulated in CRC tumors compared to normal mucosa. Of the 2,006 miRNAs analyzed in our study, 63.7% were expressed in carcinoma tissue and 63.26% were expressed in normal mucosa3. Of those miRNAs expressed in over 80% of the population (598 miRNAs), 86% of these miRNAs were statistically significantly (adjusted P value < 0.05) differentially expressed between carcinoma and normal mucosa. However, of those significantly and differentially expressed, 45 miRNAs (8.7%) had a FC of > 1.50 or < 0.68 3. The level of FC that is biologically important is not clear and it should be recognized that utilization of FC as a tool to identify important miRNAs also has limitations, one being that if there are low levels of miRNA expression a large FC can be detected. Furthermore, calculation of FC is difficult when the miRNA is not expressed in either the carcinoma or normal mucosa. Identification of miRNAs that may have the greatest functional significance when most miRNAs are “dysregulated” can be challenging. In our study, we utilized a random forest technique that allowed us to identify a subset of these miRNAs that best distinguished differences in miRNA expression between carcinoma and normal mucosa11. Using this statistical method we identified 16 miRNAs important in colon cancer and 17 miRNAs important in rectal cancer (Table 1); four miRNAs were uniquely associated with colon cancer (miR-663a, miR-4538, miR-215, and miR-192-5p) and five miRNAs were uniquely associated with rectal cancer (miR-4323, miR-150-5p, miR-4749-3p, miR-424-3p, and miR-6073)11. Three of the miRNAs identified in colon cancer had a FC of > 0.67 (miR-4538 FC 0.73, miR-378a-3p FC 0.75, and miR-378i FC 0.76) and two of the miRNAs identified in rectal cancer had a FC of > 0.67 (miR-378i FC 0.76 and miR-378a-3p FC 0.74); these miRNAs may not have been considered important if a strict FC cutpoint had been applied. These miRNAs (shown in Table 1 in numerical order) are one set of miRNAs that appear to predict differential expression when considered together. However, given the number of dysregulated miRNAs in CRC, it is highly probable that other miRNAs also have functional significance.
While at the population level miRNAs are up-regulated, down-regulated, or not dysregulated when comparing carcinoma tissue to normal mucosa; it should be kept in mind that not all individuals in the population have the up-regulated or down-regulated miRNA in their carcinoma tissue. However, our data suggest that miRNAs appear to be more stable than mRNAs when considering the percentage of the population with a dysregulated miRNA compared to a dysregulated mRNA12. Additionally, those miRNAs with a higher FC appear to have a greater percentage of the population with a dysregulated miRNA. The set of miRNAs identified by our random forest analysis were, for the most part, highly dysregulated, making them targets that have an application at the population level. Knowing that dysregulated miRNAs with smaller FCs may be dysregulated in a smaller subset of the population also suggests the importance of focusing on higher FC in miRNA expression when determining which miRNAs to target for further functionality studies.
Infrequently expressed miRNAs
In our data, 38.79% of miRNAs were expressed in less than 20% of colorectal carcinomas (498 miRNAs) and 36.11% of miRNAs (457 miRNAs) were infrequently expressed in normal mucosa3. A reasonable question is, are these infrequently expressed miRNAs of biological importance or are they merely noise in the data? Given our large sample size we were able to examine in more detail infrequently expressed miRNA. Our approach to this question was to focus on those miRNAs that were infrequently expressed, but when expressed had higher levels of expression; what we hoped was beyond “noise” in the data.
Our data suggest that infrequently expressed miRNAs may be important in defining tumor phenotype13 and survival after a diagnosis with colorectal cancer14. Table 2 summarizes those miRNAs that may have a functional signficiance because they influence survival, mainly when up-regulated in the tumor. In most instances, having these miRNAs up-regulated in tumors resulted in worse survival. While we do not understand how these miRNAs function, their association with survival implies that they may have a functional significance in the carcinogenic process.
Determining functionality
MiRNAs are key regulators of gene expression, hence they are important to the carcinogenic process. An initial step in determining biological impact is to identify which genes are targeted by various miRNAs. Methods such as Western blot and reporter assays measure protein expression levels, an important consideration given that miRNAs are thought to work post-transcriptionally. However, most databases such as miRTarBase, have incorporated miRNA target gene data that are based on less strong evidence of associations such as microarray, RNASeq, and Northern blot15. While these methods don’t validate miRNA targets using protein expression and instead measure gene expression, several studies have shown that, despite miRNAs having their impact post-transcriptionally, they usually also alter gene expression16,17. Given that databases are restricted to existing literature and that the literature does not uniformly represent all miRNAs2, we utilized RNASeq gene expression data to gain insight into miRNAs associated with gene expression. By examining how change in miRNA expression was associated with change in mRNA expression we gained insight into potentially important miRNAs and their targeted genes. Figure 1 provides an example of how these differences are correlated (beta coefficient -0.30).
Seed-region matches between the miRNA and the 3’UTR of the mRNA suggest a greater propensity for binding and therefore an increased likelihood that the miRNA directly influences mRNA expression; previously we investigated miRNA and mRNA FASTA sequences for matches of 6, 7, and 8 contiguous nucleotides to elucidate direct interactions, in which mRNA expression is reduced18. Thus, looking at seed region matches between miRNA and mRNA is one method to identify potentially important target genes for an miRNA. However, it has been suggested that other factors, such as binding at energy-based sites, may be important when identifying target genes. Ding and colleagues19 used energy-based sites as a second set of criteria when identifying candidate target sites; seed-region matching remained the first line of candidate site identification. It has been suggested that as much as 16% of miRNA:mRNA interactions may not involve contacts within seed regions and are considered seedless interactions20. In our studies, we used seed-region matching to provide additional information on our miRNA-mRNA associations, which, taken together with a negative beta coefficient between the two, provides additional support for their direct binding. A negative beta coefficient combined with the identification of a seed region match suggests that, as the miRNA binds to the mRNA, it decreases the mRNA expression. Because we only looked at seed matching, we did not consider seedless interactions. We believe that looking at seed-region matches between miRNA and mRNA provides support that miRNAs are targeting specific genes that may be important in CRC.
While direct binding between an miRNA and an mRNA shows support for that gene being a target of the miRNA, indirect associations between miRNAs and mRNAs also have potential biological importance. Indirect associations are seen in our studies when the differential expression of an miRNA is statistically significantly associated with the differential expression of an mRNA, but as one increases the other increases, or they are inversely associated but without a seed-region match. Indirect effects most likely occurr in feed-forward loops21-23. In feed-forward loops, regulators such as miRNAs can have either the same effect (repression of expression) or opposite effects on each other22. In feed-forward loops, a transcription factor (TF) such as TP53 can regulate the miRNA and the target gene (TG), which in turn is regulated by the miRNA. The miRNA may regulate the TG directly, through seed region binding, leading to mRNA degradation or translational repression, or indirectly, through repression of the TF that is influencing transcription of the same TG. Studies suggest that regulatory pathways involving miRNAs are prevalent mechanisms of altering gene expression22.
Tumor suppressor genes (TSG) and oncogenes (OG) play important roles in the carcinogenic process by controlling cell growth and inhibiting tumor formation. MiRNAs are thought to play similar roles in the carcinogenic process and it has been suggested that they work with OGs and TSGs24. Determining the association between miRNAs and TSGs and OGs is another avenue in which to pursue insight into miRNA functionality. In our data, we observed that miRNAs most likely have both direct and indirect effects on TSGs and OGs25, suggesting that they work as intermediary regulators between OGs and TSGs, and help balance up and down regulation of genes that in turn influence cell proliferation and apoptosis. Looking at the associated miRNAs, TSGs and OGs suggests that miRNA dysregulation in key signaling pathways is important to CRC25. Increased inflammation, angiogenesis, and decreased immune response are hallmarks of many of the major pathways in which dysregulated TSGs and OGs operate with miRNAs. A comprehensive evaluation of all genes within these pathways, in conjunction with all miRNAs expressed in CRC, provided further insight into functionality of miRNAs and possible target sites for intervention.
Key pathways assessed with miRNAs were: apoptosis26, cell cycle27, JAK-STAT signaling28, MAPK signaling29, NFκB signaling30, PI3K/AKT31, TGFβ signaling12 , p53 signaling32, and Wnt signaling pathways33. Focusing on the those miRNAs (triangle shape) identified through random forrest analysis with an miRNA: mRNA seed region match (mRNAs designated by square shape), one can see how the same miRNA is associated with multiple genes in multiple pathways, and the same is true for mRNAs (Figure 2); Supplementary Table S2 shows mRNAs and miRNAs in more detail. Likewise, mRNAs are associated with multiple miRNAs in different pathways.
Further evaluation of the miRNA:mRNA associations within these nine pathways shows that 88 of the 814 miRNAs (10.8%) expressed in either carcinoma or normal mucosa in over 20% of the population were associated with one or more of the nine pathways. As is shown in Figure 3 (Supplementary Table S2 shows mRNAs and miRNAs within pathways in more detail), the majority of associations were indirect, implying that many effects of the miRNAs come from their involvement in feedback loops, rather than directly binding to an mRNA. We determined that 40 miRNAs had a direct effect on gene expression because of a high likelihood of binding to the mRNA. Those miRNAs involved in regulating gene expression in various pathways are: miR-106b-5p (JAK-STAT-signaling), miR-1243 (Wnt-signaling), miR-1271-5p (NFκB-signaling), miR-145-5p (cell cycle control, apoptosis, and PI3K-AKT-signaling), miR-150-5p (cell cycle control, p53-signaling, apoptosis), miR-17-5p (JAK-STAT-signaling, NFκB-signaling, MAPK-signaling, PI3K-AKT-signaling), miR-193b-3p (MAPK-signaling), miR-195-5p (JAK-STAT-signaling, cell cycle control, apoptosis), miR-196b-5p (p53-signaling, apoptosis, NFκB-signaling, PI3K-AKT-signaling), miR-199a-5p (NFκB-signaling), miR-19b-3p (JAK-STAT-signaling, MAPK-signaling, PI3K-AKT-signaling), miR-203a (JAK-STAT-signaling, apoptosis, NFκB-signaling, MAPK-signaling, PI3K-AKT-signaling, Wnt-signaling), miR-204-3p (Wnt-signaling), miR-20a-5p (JAK-STAT-signaling, MAPK-signaling, PI3K-AKT-signaling), miR-20b-5p (JAK-STAT-signaling, apoptosis, NFκB-signaling, MAPK-signaling, PI3K-AKT-signaling), miR-2117 (MAPK-signaling), miR-214-3p (NFκB-signaling, PI3K-AKT-signaling), miR-215 (NFκB-signaling), miR-21-5p (JAK-STAT-signaling, PI3K-AKT-signaling), miR-221-3p (JAK-STAT-signaling, MAPK-signaling, PI3K-AKT-signaling), miR-23a-3p (JAK-STAT-signaling, PI3K-AKT-signaling), miR-27a-3p (JAK-STAT-signaling, PI3K-AKT-signaling), miR-29b-3p (apoptosis, MAPK-signaling, PI3K-AKT-signaling), miR-324-5p (NFκB-signaling), miR-3591-3p (Wnt-signaling), miR-365a-3p (NFκB-signaling), miR-375 (cell cycle control, TGFβ-signaling, NFκB-signaling, PI3K-AKT-signaling), miR-424-3p (cell cycle control), miR-429 (JAK-STAT-signaling, NFκB-signaling, MAPK-signaling), miR-4749-3p (TGFβ-signaling), miR-501-3p (apoptosis, MAPK-signaling, PI3K-AKT-signaling), miR-590-5p (NFκB-signaling), miR-6071 (MAPK-signaling, PI3K-AKT-signaling), miR-650 (JAK-STAT-signaling, cell cycle control, p53-signaling, apoptosis, PI3K-AKT-signaling), miR-6515-5p (cell cycle control, PI3K-AKT-signaling), miR-663b (NFκB-signaling, PI3K-AKT-signaling), miR-92a-3p (JAK-STAT-signaling, apoptosis), miR-934 (NFκB-signaling), miR-93-5p (JAK-STAT-signaling). These miRNAs also were involved indirectly in several pathways (Supplementary Table S2).
Many of the miRNAs have previously been reported associated with genes or biological responses that could impact CRC. For instance, miR-590-5p, miR-106b, and miR-93 have been associated with PTEN in the PI3K/AKT pathway34,35; miR-145 has been associated with cell-cycle related factors such as CDK4 and cyclin E236; miR-150 has been associated with immune response37 and reducing inflammatory cytokine production38; miR-17-5p has been shown to inhibit proliferation and trigger apoptosis39; miR-199 has previously been associated with IKKB40 and with ITGA341; miR-203 as being associated with BIRC5 which encodes survivin42; miR-20 has previously been associated with cyclin D143; miR-221 has been reported as being associated with TNFα40; miR-650 has been associated with BCL2 and AKT2 and has been shown to promote cell proliferation and invasion44-46. However, our findings add to the functional information we have for these miRNAs in that we observed their associations with genes not previously reported and with specific disease pathways. Additionally, our findings show associations for other miRNAs that are directly related to signaling pathways and to genes that are dysregulated in CRC.
To further understand how miRNAs work in complex networks, we have assessed the interactions between TFs and miRNAs, both of which play important roles in regulating gene expression. We found that both miRNAs and TFs influence mRNA expression, and effect of one regulator influences the impact the other has on mRNA expression47,48. In normal colonic mucosa we identified significant feed back loop (FBL) interactions involving miR-1258, miR-145-5p, miR-150-5p, 193b-3p, miR-330-3p, and miR-4469. In differential expression we identified FBL interactions involving miR-23a-3p and miR-4469. MiR-330-3p and miR-4469 were associated with one mRNA each in a few pathways, and no significant findings were identified with miR-1258. MiR-150-5p had numerous indirect associations in seven of the pathways (apoptosis, cell cycle, JAK-STAT-signaling, MAPK-signaling, NFκB-signaling, PI3K-AKT-signaling, and Wnt-signaling), and had direct associations in apoptosis, cell cycle, and p53-signaling. MiR-145-5p had numerous indirect associations in all pathways except apoptosis, and was involved in direct associations in apoptosis, cell cycle, and PI3K-AKT-signaling. MiR-23a-3p had a direct association with IL6R in JAK-STAT-signaling and PI3K-AKT-signaling, and indirect associations with seven mRNAs in the cell cycle, MAPK-signaling, PI3K-AKT-signaling, TGFβ-signaling, p53-signaling, and Wnt-signaling pathways. While some mRNAs were consistent across these studies, such as MYC, which has roles in numerous pathways and is a TF, many of the mRNAs associated with these miRNAs varied between our pathway analysis and FBL analysis, highlighting the variance that occurs in these types of investigations.
The miRNAs and target genes we identified were limited to the nine pathways we determined as being important in CRC, given the number of TSGs, OGs, and TFs in these pathways that were associated with miRNAs. We undoubtedly missed associations utilizing only our gene expression data, however given the nature of the study, it was impossible to obtain information on proteins. It is probable that important associations were not identified given the post-transcriptional role of miRNAs. However, it is also likely that the associations identified are real, and therefore merit validation in other similarily designed studies is needed.
Ongoing challenges and conclusions
While advancements are being made in understanding how miRNAs function in the carcinogenic process, there are many challenges when transitioning miRNAs from the discovery stage to their entering the realm of possible therapeutic agents. One of the biggest challenges stems from the fact that miRNAs target multiple genes and genes are targeted by multiple miRNAs. It is unclear if altering a single miRNA is sufficient to achieve a desired response, or if multiple miRNAs have to be considered as a complex that work within a given system to attain the outcome. Likewise, when investigating a specific miRNA that is associated with multiple genes, it is unclear whether regulation of a particular mRNA produces a certain outcome, or if this effect is the result of the regulation of a subgroup of the associated mRNAs or all of the mRNAs. Most functional studies have looked at single miRNAs when determining their functionality. Also of consideration is the fact that using a target miRNA may have unwanted effects, given that an miRNA can target many genes. It is unclear if a simple approach within the complex human system is sufficient to meaningfully pinpoint miRNAs for theraupeutic purposes. While we know that miRNAs are a part of a complex network, pinpointing their functionality within that framework remains a challenge.
Acknowledgements
This study was supported in part by National Cancer Institute (NCI, Grant No.CA163683). Partial support for this manuscript came form the Huntsman Cancer Institute. The contents of this manuscript are solely the responsibility of the authors and do not necessarily represent the official view of the NCI.
Conflict of interest statement
No potential conflicts of interest are disclosed.
- Received December 10, 2018.
- Accepted February 19, 2019.
- Copyright: © 2019, Cancer Biology & Medicine
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY) 4.0, which permits unrestricted use, distribution and reproduction in any medium, provided the original author and source are credited.