6533b85bfe1ef96bd12bb4fe

RESEARCH PRODUCT

An improvement of ComiR algorithm for microRNA target prediction by exploiting coding region sequences of mRNAs

Michele TumminelloPanayiotis V. BenosClaudia CoronnelloGiorgio BertolazziGiorgio Bertolazzi

subject

AGO1ImmunoprecipitationComputer sciencelcsh:Computer applications to medicine. Medical informaticsBiochemistryOpen Reading Frames03 medical and health sciences0302 clinical medicineStructural BiologymicroRNAMelanogasterAnimalsHumansCoding regionGene silencing3'UTRRNA MessengerBinding sitelcsh:QH301-705.5Molecular Biology030304 developmental biology0303 health sciencesMessenger RNAbiologyThree prime untranslated regionResearchApplied MathematicsmicroRNA target predictionbiology.organism_classificationComputer Science Applications3’UTRMicroRNAsDrosophila melanogasterlcsh:Biology (General)Coding regionlcsh:R858-859.7DNA microarrayDrosophila melanogasterAlgorithmAlgorithms030217 neurology & neurosurgery

description

AbstractMicroRNA are small non-coding RNAs that post-transcriptionally regulate the expression levels of messenger RNAs. MicroRNA regulation activity depends on the recognition of binding sites located on mRNA molecules. ComiR is a web tool realized to predict the targets of a set of microRNAs, starting from their expression profile. ComiR was trained with the information regarding binding sites in the 3’utr region, by using a reliable dataset containing the targets of endogenously expressed microRNA in D. melanogaster S2 cells. This dataset was obtained by comparing the results from two different experimental approaches, i.e., inhibition, and immunoprecipitation of the AGO1 protein--a component of the microRNA induced silencing complex.In this work, we tested whether including coding region binding sites in ComiR algorithm improves the performance of the tool in predicting microRNA targets. We focused the analysis on the D. melanogaster species and updated the ComiR underlying database with the currently available releases of mRNA and microRNA sequences. As a result, we find that ComiR algorithm trained with the information related to the coding regions is more efficient in predicting the microRNA targets, with respect to the algorithm trained with 3’utr information. On the other hand, we show that 3’utr based predictions can be seen as complementary to the coding region based predictions, which suggests that both predictions, from 3’utr and coding regions, should be considered in comprehensive analysis.Furthermore, we observed that the lists of targets obtained by analyzing data from one experimental approach only, that is, inhibition or immunoprecipitation of AGO1, are not reliable enough to test the performance of our microRNA target prediction algorithm. Further analysis will be conducted to investigate the effectiveness of the tool with data from other species, provided that validated datasets, as obtained from the comparison of RISC proteins inhibition and immunoprecipitation experiments, will be available for the same samples. Finally, we propose to upgrade the existing ComiR web-tool by including the coding region based trained model, available together with the 3’utr based one.

https://doi.org/10.1186/s12859-020-3519-5