TY - GEN
T1 - Comparison of Stranded and Non-stranded RNA-Seq in Predicting Small RNAs in a Non-model Bacterium
AU - Sedlar, Karel
AU - Zimmer, Ralf
N1 - Publisher Copyright:
© 2022, Springer Nature Switzerland AG.
PY - 2022
Y1 - 2022
N2 - Thanks to their diversity, non-model bacteria represent an inexhaustible resource for microbial biotechnology. Their utilization is only limited by our lack of knowledge regarding the regulation of processes they are capable to perform. The problem lies in non-coding regulators, for example small RNAs, that are not so widely studied as coding genes. One possibility to overcome this hurdle is to use standard RNA-Seq data, gathered primarily to study gene expression, for the prediction of non-coding elements. Although computational tools to perform this task already exist, they require the utilization of stranded RNA-Seq data that must not be available for non-model organisms. Here, we showed that trans-encoded small RNAs can be predicted from non-stranded data with comparable sensitivity to stranded data. We used two RNA-Seq datasets of non-type strain Clostridium beijerinckii NRRL B-598, which is a promising hydrogen and butanol producer, and obtained comparable results for stranded and non-stranded datasets. Nevertheless, the non-stranded approach suffered from lower precision. Thus, the results must be interpreted with caution. In general, more benchmarking for tools performing direct prediction of small RNAs from standard RNA-Seq data is needed so these techniques could be adopted for automatic detection.
AB - Thanks to their diversity, non-model bacteria represent an inexhaustible resource for microbial biotechnology. Their utilization is only limited by our lack of knowledge regarding the regulation of processes they are capable to perform. The problem lies in non-coding regulators, for example small RNAs, that are not so widely studied as coding genes. One possibility to overcome this hurdle is to use standard RNA-Seq data, gathered primarily to study gene expression, for the prediction of non-coding elements. Although computational tools to perform this task already exist, they require the utilization of stranded RNA-Seq data that must not be available for non-model organisms. Here, we showed that trans-encoded small RNAs can be predicted from non-stranded data with comparable sensitivity to stranded data. We used two RNA-Seq datasets of non-type strain Clostridium beijerinckii NRRL B-598, which is a promising hydrogen and butanol producer, and obtained comparable results for stranded and non-stranded datasets. Nevertheless, the non-stranded approach suffered from lower precision. Thus, the results must be interpreted with caution. In general, more benchmarking for tools performing direct prediction of small RNAs from standard RNA-Seq data is needed so these techniques could be adopted for automatic detection.
KW - Clostridium beijerinckii NRRL B-598
KW - Genome annotation
KW - RNA-Seq
KW - Small non-coding RNA
UR - https://www.scopus.com/pages/publications/85133163818
U2 - 10.1007/978-3-031-07802-6_4
DO - 10.1007/978-3-031-07802-6_4
M3 - Conference contribution
AN - SCOPUS:85133163818
SN - 9783031078019
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 45
EP - 56
BT - Bioinformatics and Biomedical Engineering - 9th International Work-Conference, IWBBIO 2022, Proceedings
A2 - Rojas, Ignacio
A2 - Valenzuela, Olga
A2 - Rojas, Fernando
A2 - Herrera, Luis Javier
A2 - Ortuño, Francisco
PB - Springer
T2 - 9th International Work-Conference on Bioinformatics and Biomedical Engineering, IWBBIO 2022
Y2 - 27 June 2022 through 30 June 2022
ER -