RNAsolo: a repository of cleaned PDB-derived RNA 3D structures
[ 1 ] Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | [ 2 ] Instytut Informatyki, Wydział Informatyki i Telekomunikacji, Politechnika Poznańska | [ S ] student | [ P ] pracownik
2022
artykuł naukowy
angielski
- experimentally determined RNA structures
- RNA 3D structure and sequence
- benchmark datasets
EN Motivation: The development of algorithms dedicated to RNA three-dimensional (3D) structures contributes to the demand for training, testing and benchmarking data. A reliable source of such data derived from computational pre- diction is the RNA-Puzzles repository. In contrast, the largest resource with experimentally determined structures is the Protein Data Bank. However, files in this archive often contain other molecular data in addition to the RNA struc- ture itself, which—to be used by RNA processing algorithms—should be removed. Results: RNAsolo is a self-updating database dedicated to RNA bioinformatics. It systematically collects experimen- tally determined RNA 3D structures stored in the PDB, cleans them from non-RNA chains, and groups them into equivalence classes. It allows users to download various subsets of data—clustered by resolution, source, data for- mat, etc.—for further processing and analysis with a single click.
08.06.2022
3668 - 3670
CC BY-NC (uznanie autorstwa - użycie niekomercyjne)
czasopismo hybrydowe
ostateczna wersja opublikowana
w momencie opublikowania
200
5,8