6533b7d3fe1ef96bd12615b4

RESEARCH PRODUCT

Non-Redundant tRNA Reference Sequences for Deep Sequencing Analysis of tRNA Abundance and Epitranscriptomic RNA Modifications

Alexandr MotorinFlorian PichotYuri MotorinMark HelmVirginie Marchand

subject

0301 basic medicinelcsh:QH426-470ved/biology.organism_classification_rank.speciesComputational biologyBiology01 natural sciencesArticleDeep sequencingdeep sequencing03 medical and health sciencesRNA modificationsRNA Transferepitranscriptome[SDV.BBM.GTP]Life Sciences [q-bio]/Biochemistry Molecular Biology/Genomics [q-bio.GN]Escherichia coliGeneticsModel organismtRNAGeneComputingMilieux_MISCELLANEOUSGenetics (clinical)Sequence Analysis RNA010405 organic chemistryved/biologyreference sequenceHigh-Throughput Nucleotide SequencingRNA[SDV.BBM.BM]Life Sciences [q-bio]/Biochemistry Molecular Biology/Molecular biologyquantification0104 chemical scienceslcsh:GeneticsRNA Bacterial030104 developmental biologyTransfer RNADatabases Nucleic AcidtRNA poolBacillus subtilisReference genome

description

Analysis of RNA by deep-sequencing approaches has found widespread application in modern biology. In addition to measurements of RNA abundance under various physiological conditions, such techniques are now widely used for mapping and quantification of RNA modifications. Transfer RNA (tRNA) molecules are among the frequent targets of such investigation, since they contain multiple modified residues. However, the major challenge in tRNA examination is related to a large number of duplicated and point-mutated genes encoding those RNA molecules. Moreover, the existence of multiple isoacceptors/isodecoders complicates both the analysis and read mapping. Existing databases for tRNA sequencing provide near exhaustive listings of tRNA genes, but the use of such highly redundant reference sequences in RNA-seq analyses leads to a large number of ambiguously mapped sequencing reads. Here we describe a relatively simple computational strategy for semi-automatic collapsing of highly redundant tRNA datasets into a non-redundant collection of reference tRNA sequences. The relevance of the approach was validated by analysis of experimentally obtained tRNA-sequencing datasets for different prokaryotic and eukaryotic model organisms. The data demonstrate that non-redundant tRNA reference sequences allow improving unambiguous mapping of deep sequencing data.

https://doi.org/10.3390/genes12010081