Title: A computational platform for high-throughput analysis of RNA sequences and modifications by mass spectrometry.
Authors: Wein, Samuel; Andrews, Byron; Sachsenberg, Timo; Santos-Rosa, Helena; Kohlbacher, Oliver; Kouzarides, Tony; Garcia, Benjamin A; Weisser, Hendrik
Published In Nat Commun, (2020 02 17)
Abstract: The field of epitranscriptomics continues to reveal how post-transcriptional modification of RNA affects a wide variety of biological phenomena. A pivotal challenge in this area is the identification of modified RNA residues within their sequence contexts. Mass spectrometry (MS) offers a comprehensive solution by using analogous approaches to shotgun proteomics. However, software support for the analysis of RNA MS data is inadequate at present and does not allow high-throughput processing. Existing software solutions lack the raw performance and statistical grounding to efficiently handle the numerous modifications found on RNA. We present a free and open-source database search engine for RNA MS data, called NucleicAcidSearchEngine (NASE), that addresses these shortcomings. We demonstrate the capability of NASE to reliably identify a wide range of modified RNA sequences in four original datasets of varying complexity. In human tRNA, we characterize over 20 different modification types simultaneously and find many cases of incomplete modification.
PubMed ID: 32066737
MeSH Terms: Base Sequence/genetics; Databases, Factual/statistics & numerical data; Datasets as Topic; Epigenomics/methods*; High-Throughput Screening Assays/methods*; Humans; Oligonucleotides/chemistry; Oligonucleotides/genetics; Oligonucleotides/metabolism; RNA Processing, Post-Transcriptional/genetics*; RNA, Transfer/chemistry; RNA, Transfer/genetics; RNA, Transfer/metabolism; Reproducibility of Results; Search Engine*; Tandem Mass Spectrometry/methods*