Comparative analysis of protein-coding and long non-coding transcripts based on RNA sequence features

Oxana A. Volkova, Yury V. Kondrakhin, Timur A. Kashapov, Ruslan N. Sharipov

Результат исследования: Научные публикации в периодических изданияхстатьярецензирование

2 Цитирования (Scopus)


RNA plays an important role in the intracellular cell life and in the organism in general. Besides the well-established protein coding RNAs (messenger RNAs, mRNAs), long non-coding RNAs (lncRNAs) have gained the attention of recent researchers. Although lncRNAs have been classified as non-coding, some authors reported the presence of corresponding sequences in ribosome profiling data (Ribo-seq). Ribo-seq technology is a powerful experimental tool utilized to characterize RNA translation in cell with focus on initiation (harringtonine, lactimidomycin) and elongation (cycloheximide). By exploiting translation starts obtained from the Ribo-seq experiment, we developed a novel position weight matrix model for the prediction of translation starts. This model allowed us to achieve 96% accuracy of discrimination between human mRNAs and lncRNAs. When the same model was used for the prediction of putative ORFs in RNAs, we discovered that the majority of lncRNAs contained only small ORFs (≤300nt) in contrast to mRNAs.

Язык оригиналаанглийский
Номер статьи1840013
Число страниц14
ЖурналJournal of Bioinformatics and Computational Biology
Номер выпуска2
СостояниеОпубликовано - 1 апр. 2018


Подробные сведения о темах исследования «Comparative analysis of protein-coding and long non-coding transcripts based on RNA sequence features». Вместе они формируют уникальный семантический отпечаток (fingerprint).